Xianzheng Ma (马宪政)
Greeting! I am currently a DPhil Student (10.2023- ) at Department of Engineering Science, University of Oxford, supervised by Prof. Victor Prisacariu and Prof. Iro Laina.
I am a part of both the Visual Geometry Group as well as Active Vision Group.
Previously, I obtained my Bachelor's and Master's degrees from Wuhan University in 2018 and 2021.
My research interests include LLMs, 3D computer vision and robotics, especially using LLMs's world knowledge to enpower the 3D world understanding and interaction.
For any suggestions or collaborations, please reach out to me at xianzheng@robots.ox.ac.uk.
Xianzheng Ma Yash Bhalgat Brandon Smart Shuai Chen Xinghui Li Jian Ding Jindong Gu Dave Zhenyu Chen Songyou Peng Jia-Wang Bian Philip H Torr Marc Pollefeys Matthias Nießner Ian D Reid Angel X. Chang Iro Laina Victor Adrian Prisacariu
TPAMI under review
paper | arxiv | project page | code | bibtex
We survey the papers of 3D understanding, generation, and embodied agent tasks empowered by LLMs and other foundataion models (CLIP, SAM)
Ziyu Guo Renrui Zhang Xiangyang Zhu Yiwen Tang Xianzheng Ma Jiaming Han Kexin Chen Peng Gao Xianzhi Li Hongsheng Li Pheng-Ann Heng
ICLR under review
paper | arxiv | project page | code | bibtex
We propose a 3D multi-modal model for general 3D learning, Point-Bind, and the first 3D large language model, Point-LLM
Min Shi Zihao Huang Xianzheng Ma Xiaowei Hu Zhiguo Cao
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 Highlight
paper | arxiv | project page | code | bibtex
We propose a two-stage framework--CapeFormer for category-agnostic pose estimation.
Yulu Gan Xianzheng Ma Yan Bai Yihang Lou Renrui Zhang Nian Shi Lin Luo
AAAI Conference on Artificial Intelligence (AAAI) 2023 Outstanding Student Paper
paper | arxiv | project page | code | bibtex
We offer an alternative and new solution for continual test-time adapation by learning visual domain prompt.
Xianzheng Ma Zhixiang Wang Yacheng Zhan Yinqiang Zheng Zheng Wang Dengxin Dai Chia-Wen Lin
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 Oral
paper | arxiv | project page | code | bibtex
We alleviate the domain gap caused by mixed fog influence and style variation without labels.
Xianzheng Ma Hossein Rahmani Zhipeng Fan Bin Yang Jun Chen Jun Liu
AAAI Conference on Artificial Intelligence (AAAI) 2022
We offer an reinforcement learning based method to ultilize the temporal information in videos to train a robust pose estimator.
Xian Zhong* Xianzheng Ma* Shidong Tu Kui Jiang Wenxin Huang Zheng Wang (* means equal contribution)
International Joint Conference on Artificial Intelligence (IJCAI) 2022
We propose a real-world rainy driving dataset for semantic segmentation and devise an unsupervised joint optimization framework based on contrastive learning.