Xuyang Bai

Xuyang BAI 白旭阳

I am a Machine Learning Engineer at Apple and a contributor to Object Capture and Spatial Scenes. I obtained my PhD (2018-2022) in Computer Science and Engineering at HKUST, advised by Prof. Chiew-Lan Tai in the Vision and Graphics Group. I also worked closely with Prof. Long Quan and Prof. Hongbo Fu. Prior to HKUST, I received my BSc in Electronic Engineering from Beijing Normal University.

Email / CV / Google Scholar / Twitter / Github

News

2026.02: Two papers have been accepted to ICLR 2026.

2022.07: The extended version of VMNet has been accepted by TPAMI.

2022.07: One paper has been accepted to ECCV 2022.

2022.05: PhD-ed :)

2022.03: Code for TransFusion has been released.

2021.11: Our work TransFusion outperforms all non-ensembled methods on the nuScenes detection leaderboard and achieves 1st place on the open track of the nuScenes tracking leaderboard.

Research

I'm interested in 3D computer vision, machine learning, and computer graphics. Specifically, I have worked on building correspondences (for point clouds or images) and on scene understanding (detection or segmentation). Feel free to drop me an e-mail if you are interested in my work.

	Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting Yixing Lao, Xuyang Bai, Xiaoyang Wu, Nuoyuan Yan, Zixin Luo, Tian Fang, Jean-Daniel Nahmias, Yanghai Tsin, Shiwei Li, Hengshuang Zhao ICLR, 2026 paper / project page
	Sharp Monocular View Synthesis in Less Than a Second Lars Mescheder, Wei Dong, Shiwei Li, Xuyang Bai, Marcel Santos, Peiyun Hu, Bruno Lecouat, Mingmin Zhen, Amaël Delaunoy, Tian Fang, Yanghai Tsin, Stephan R. Richter, Vladlen Koltun ICLR, 2026 paper / code / project page
	Vision-Centric BEV Perception: A Survey Yuexin Ma, Tai Wang, Xuyang Bai, Huitong Yang, Yuenan Hou, Yaming Wang, Yu Qiao, Ruigang Yang, Dinesh Manocha, Xinge Zhu TPAMI, 2024 paper / code / bibtex
	Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching Hongkai Chen, Zixin Luo, Yurun Tian, Xuyang Bai, Ziyu Wang, Lei Zhou, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan CVPR Workshop, 2024 paper / bibtex
	One Training for Multiple Deployments: Polar-based Adaptive BEV Perception for Autonomous Driving Huitong Yang, Xuyang Bai, Xinge Zhu, Yuexin Ma ICRA, 2023 paper / bibtex
	Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation of Indoor Scenes Zeyu Hu, Xuyang Bai, Jiaxiang Shang, Runze Zhang, Jiayu Dong, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai TPAMI, 2022 (ICCV 2021 SI invited) paper / code / bibtex
	LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation Zeyu Hu, Xuyang Bai, Runze Zhang, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai ECCV, 2022 paper / code / bibtex
	TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers. Xuyang Bai, Zeyu Hu, Xinge Zhu, Qingqiu Huang, Yilun Chen, Hongbo Fu, Chiew-Lan Tai CVPR, 2022 paper / code / bibtex
	Learning to Match Features with Seeded Graph Matching Network. Hongkai Chen, Zixin Luo, Jiahui Zhang, Lei Zhou, Xuyang Bai, Zeyu Hu, Chiew-Lan Tai, Long Quan ICCV, 2021 paper / code / bibtex
	VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation Zeyu Hu, Xuyang Bai, Jiaxiang Shang, Runze Zhang, Jiayu Dong, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai ICCV, 2021 (Oral) paper / code / bibtex
	PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency. Xuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai CVPR, 2021 paper / code / bibtex
	JSENet: Joint Semantic Segmentation and Edge Detection Network for 3D Point Clouds. Zeyu Hu, Mingmin Zhen, Xuyang Bai, Hongbo Fu, Chiew-Lan Tai ECCV, 2020 paper / code / bibtex
	ASLFeat: Learning Local Features of Accurate Shape and Localization. Zixin Luo, Lei Zhou, Xuyang Bai, Hongkai Chen, Jiahui Zhang, Yao Yao, Shiwei Li, Tian Fang, Long Quan CVPR, 2020 paper / code / bibtex
	D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features. Xuyang Bai, Zixin Luo, Lei Zhou, Hongbo Fu, Long Quan, Chiew-Lan Tai CVPR, 2020 (Oral) paper / code / bibtex