Fei Xue

I am a research scientist at Nvidia Dynamic Vision and Learning Group, working on 4D foundation model. Prior to that, I received my PhD degree from the University of Cambridge under the supervision of Prof. Roberto Cipolla and Dr. Ignas Budvytis in 2025. I obtained the Bachelor and Master degrees from Peking University under the supervision of Prof. Hongbin Zha . As a student, I have been fortunate to be an intern at UiSee, SenseTime, and NVIDIA.

My research interests lie in foundation 3D/4D model, large-scale 3D reconstruction and localization for autonomous driving, robotics, and AR/VR. I am also very interested in VLM and VLA and their applications for spatial understanding and reasoning.

Email / Github / Twitter / Google Scholar / LinkedIn

News

2025.03 MATCHA is accepted to CVPR 2025 as a Highlight paper.
2024.07 Add SFD2 and IMP to image-matching-webui, an awesome online demo of feature extraction and matching methods. Check it out for comparisons with others.
2024.04 Release of PRAM and VRS-NeRF project page!
2023.05 Two papers are accepted by CVPR 2023!
2022.05 One paper is accepted by CVPR 2022!
2021.10 One paper is accepted by TPAMI 2022!
2020.05 Two papers are accepted by CVPR 2020!
2020.05 Two papers are accepted by ICCV 2019!
2019.05 One paper is accepted by CVPR 2019 as oral!
2018.06 Two papers are accepted by ACCV 2018!

Academic Activities

Outstanding reviewer of: CVPR, ICCV.
Conference reviewer of: CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR.
Journal reviewer of: TPAMI, PR.

Research ( Google Scholar )

	MATCHA:Towards Matching Anything Fei Xue, Sven Elflein, Laura Leal-Taixé Qunjie Zhou CVPR 2025 (Highlight) Paper / Code / Project A comprehensive study of the performance of previous features (e.g., Superpoint, DISK, Dust3R, Mast3R) on different matching tasks; One feature descriptor for all matching tasks.
	PRAM: Place Recognition Anywhere Model for Efficient Visual Localization Fei Xue, Ignas Budvytis, Roberto Cipolla Arxiv 2024 Paper / Code / Project We propose Place Recognition Anywhere Model (PRAM) for efficient large-scale localization which automatically defines landmarks in any scenes and recognizes these landmarks for both coarse and fine localization. Previous works of semantic-aware features SFD2 and geometric-aware matcher IMP are used.
	VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field Fei Xue, Ignas Budvytis, Daniel Olmeda Reino, Roberto Cipolla ECCVW 2024 Code A NeRF-based localization pipeline with sparse rendering for high efficiency. Previous works semantic-aware features SFD2 and geometric-aware matcher IMP are used.
	IMP: Iterative Matching and Pose Estimation with Adaptive Pooling Fei Xue, Ignas Budvytis, Roberto Cipolla CVPR 2023 Paper / Code / Video / Slides / Poster We mebed geometric constraint into graph-based matcher (e.g. SuperGlue) to make it work more accurate and robust in challenging conditions (e.g. large viewpoint changes, repetitve textures). Attention scores are used to remove useless keypoints progressively to achieve higher effciency.
	SFD2: Semantic-guided Feature Detection and Description Fei Xue, Ignas Budvytis, Roberto Cipolla CVPR 2023 Paper / Code / Video / Slides / Poster Semantics are very useful for local feature detection and description especially for long-term tasks. However, explicit usage of semantics requires segmentation networks and has severe semantic uncertainties. In this paper, we implicitly embed semantics into detection and description to detect robust keypoints and extract semantically-augmented descriptors.
	Efficient Large-scale Localization by Global Instance Recognition Fei Xue, Ignas Budvytis, Daniel Olmeda Reino, Roberto Cipolla CVPR 2022 Paper / Code / Video / Slides / Poster Our first trial on large-scale localization by recognition. We define global instances on building facades which are discriminative for coarse localization and robust to appearance changes. At test time, we recognize these global instances and use them for city-scale localization.
	Deep Visual Odometry with Adaptive Memory Fei Xue, Xin Wang, Junqiu Wang, Hongbin Zha TPAMI 2022 Paper / Code / An end-to-end VO system with tracking, remembering and refining components. It works impressively well in autonomous driving and robotics scrnarios.
	Learning Multi-view Camera Relocalization with Graph Neural Networks Fei Xue, Xin Wu, Shaojun Cai, Junqiu Wang CVPR 2020 Paper / Code An end-to-end localization frameowrk which formulates multi-view inputs as a graph and leverages GNN for multi-view information fusion. It works very well in scenarios where a single-view input leads to errors due to similar structures etc.
	Self-Supervised Deep Visual Odometry with Online Adaptation Shunkai Li, Xin Wang, Yingdian Cao Fei Xue, Zike Yan, Hongbin Zha CVPR 2020 (oral) Paper / Code An end-to-end VO framework with online adaptation at test time to enhance its ability of working in more general environments.
	Local Supports Global: Deep Camera Relocalization with Sequencen Enhancement Fei Xue, Xin Wang, Zike Yan, Qiuyuan Wang, Junqiu Wang, Hongbin Zha ICCV 2019 Paper / Code
	Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Shunkai Li, Fei Xue, Zike Yan, Xin Wang, Zike Yan, Hongbin Zha ICCV 2019 Paper / Code
	Beyond Tracking: Selecting Memeory and Refining Poses for Deep Visual Ododmetry Fei Xue, Xin Wang, Shunkai Li, Qiuyuan Wang, Junqiu Wang, Hongbin Zha CVPR 2019 (oral) Paper / Code
	Guided Feature Selection for Deep Visual Odometry Fei Xue, Qiuyuan Wang, Xin Wang, Wei Dong, Junqiu Wang, Hongbin Zha ACCV 2019 Paper / Code
	Continuous-time Stereo Visual Odometry Based on Dynamics Model Xin Wang, Fei Xue, Qiuyuan Wang, Xin Wang, Wei Dong, Junqiu Wang, Hongbin Zha ACCV 2019 Paper / Code

Fei Xue @2023 Total Visitors:

Thanks to Jon Barron for the website template.