Bo Zhao
bozhaonanjing [AT] gmail [DOT] com
ABOUT ME
Bo Zhao is a Principal Investigator at Beijing Academy of Artificial Intelligence (BAAI). Before, he received Ph.D. from The University of Edinburgh and M.Eng. from Peking University. He was a research intern in Snap Inc. and SenseTime. His research interests include Data-centric AI, Multimodal LLM, Embodied AI and Machine Learning. He received ICML 2022 Outstanding Paper Award. He was the only nominee of The University of Edinburgh for Informatics-Europe Best Dissertation Award 2023. He received NSFC funding on Dataset Condensation. He served as an Area Chair for NeurIPS'24 and BMVC'24.
I am working on DCAI, MLLM, and their applications, e.g., Agents & Embodied AI. Collaborations are welcome. Feel free to contact me.
News:
I will join School of Artificial Intelligence, Shanghai Jiao Tong University as Associate Professor in few days. I am recruiting Ph.D./Master Students and Research Assistants.
I will co-organize Dataset Distillation Workshops and Challenges on at CVPR'24 and at ECCV'24. Call for Papers.
PUBLICATIONS
Full list in Google Scholar
[ECCV 2024] Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking. Jiyao Zhang*, Weiyao Huang*, Bo Peng*, Mingdong Wu, Fei Hu, Zijian Chen, Bo Zhao, Hao Dong. Project Page. PDF.
[ACL 2024] VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval. Junjie Zhou, Shitao Xiao, Zheng Liu, Bo Zhao, Yongping Xiong. PDF. Code.
[RSS 2024] RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Multi-Modal Large Language Model Learning. Jianhao Yuan, Shuyang Sun, Daniel Omeiza, Bo Zhao, Paul Newman, Lars Kunze, Matthew Gadd. Project Page. PDF. Code.
[ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching. Jianhao Yuan, Jie Zhang, Shuyang Sun, Philip Torr, Bo Zhao#. PDF. Code.
[CVPR 2023 Highlight Paper (Top 2.5%)] Accelerating Dataset Distillation via Model Augmentation. Lei Zhang*, Jie Zhang*, Bowen Lei, Subhabrata Mukherjee, Xiang Pan, Bo Zhao, Caiwen Ding, Yao Li, Dongkuan Xu. PDF.
[WACV 2023] Dataset Condensation with Distribution Matching. Bo Zhao; Hakan Bilen. PDF. Code.
[NeurIPS Workshops 2022] Synthesizing Informative Training Samples with GAN. Bo Zhao; Hakan Bilen. PDF. Code.
[ICML 2022 Outstanding Paper Award (Top 1.7‰)] Privacy for Free: How does Dataset Condensation Help Privacy? Tian Dong; Bo Zhao; Lingjuan Lyu. PDF.
[CVPR 2022] CAFE: Learning to Condense Dataset by Aligning Features. Kai Wang*; Bo Zhao*; Xiangyu Peng; Zheng Zhu; Shuo Yang; Shuo Wang; Guan Huang; Hakan Bilen; Xinchao Wang; and Yang You. PDF.
[ICML 2021] Dataset Condensation with Differentiable Siamese Augmentation. Bo Zhao; Hakan Bilen. PDF. Code.
[ICLR 2021 Oral Paper (Top 1.8%)] Dataset Condensation with Gradient Matching. Bo Zhao; Konda Reddy Mopuri; Hakan Bilen. PDF. Code.
The second-highest rated paper in ICLR 2021. Ranking.
[WACV 2021] Continual Representation Learning for Biometric Identification. Bo Zhao*; Shixiang Tang*; Dapeng Chen; Hakan Bilen; Rui Zhao. PDF. Code.
[arXiv 2020] iDLG: Improved Deep Leakage from Gradients. Bo Zhao; Konda Reddy Mopuri; Hakan Bilen. arXiv. Code.
[WACV 2019] Zero-shot Learning via Recurrent Knowledge Transfer. Bo Zhao; Xinwei Sun; Xiaopeng Hong; Yuan Yao; Yizhou Wang. PDF. Code.
[ICML 2018] MSplit LBI: Realizing Feature Selection and Dense Estimation Simultaneously in Few-shot and Zero-shot Learning. Bo Zhao*; Xinwei Sun*; Yanwei Fu; Yuan Yao; Yizhou Wang. PDF. Code.
[ACM TOG 2018 & SIGGRAPH 2019] EasyFont: A Style Learning based System to Easily Build Your Large-scale Handwriting Fonts. Zhouhui Lian; Bo Zhao; Xudong Chen; Jianguo Xiao. PDF.
[SIGGRAPH ASIA 2016] Automatic Generation of Large-scale Handwriting Fonts via Style Learning. Zhouhui Lian; Bo Zhao; Jianguo Xiao. PDF.
PROJECTS
Bunny: A family of lightweight multimodal models. Project Page. Technical Report.
SegVol: Universal and Interactive Volumetric Medical Image Segmentation. Project Page. PDF.
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models. Project Page. PDF.
COMPETITIONS
[CVPR 2024] The 2nd place in MeViS: Motion expressions guided Video Segmentation. Bin Cao, Yisi Zhang, Xuanxu Lin, Xingjian He, Bo Zhao, Jing Liu. Certificate.