Research

(* equal contribution  ·  † project lead)   Full list on Google Scholar.

SpatialTree
SpatialTree: How Spatial Abilities Branch Out in MLLMs
Yuxi Xiao*, Longfei Li*, Shen Yan, Xinhang Liu, Sida Peng, Yunchao Wei, Xiaowei Zhou, Bingyi Kang
CVPR 2026
Depth Anything 3
Depth Anything 3: Recovering the Visual Space from Any Views
Haotong Lin*, Sili Chen*, Jun Hao Liew*, Donny Y. Chen*, Zhenyu Li, Guang Shi, Jiashi Feng, Bingyi Kang*†
ICLR 2026
Trace Anything
Trace Anything: Representing Any Video in 4D via Trajectory Fields
Xinhang Liu, Yuxi Xiao, Donny Y. Chen, Jiashi Feng, Yu-Wing Tai, Chi-Keung Tang, Bingyi Kang
ICLR 2026
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots
Minghuan Liu*†, Zhengbang Zhu*, Xiaoshen Han*, Peng Hu*, Haotong Lin, Xinyao Li, Jingxiao Chen, Jiafeng Xu, Yichu Yang, Yunfeng Lin, Xinghang Li, Yong Yu, Weinan Zhang, Tao Kong, Bingyi Kang
ICLR 2026
SpatialTrackerV2: 3D Point Tracking Made Easy
Yuxi Xiao, Jianyuan Wang, Nan Xue, Nikita Karaev, Yuri Makarov, Bingyi Kang, Xing Zhu, Hujun Bao, Yujun Shen, Xiaowei Zhou
ICCV 2025
How Far is Video Generation from World Model? — A Physical Law Perspective
Bingyi Kang*, Yang Yue*, Rui Lu, Zhijie Lin, Yang Zhao, Kaixin Wang, Gao Huang, Jiashi Feng
*Equal Contribution in alphabetical order
ICML 2025
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Sili Chen, Hengkai Guo, Shengnan Zhu, Feihu Zhang, Zilong Huang, Jiashi Feng, Bingyi Kang
CVPR 2025
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation
Haotong Lin, Sida Peng, Jingxiao Chen, Songyou Peng, Jiaming Sun, Minghuan Liu, Hujun Bao, Jiashi Feng, Xiaowei Zhou, Bingyi Kang
CVPR 2025
VideoWorld
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
Zhongwei Ren, Yunchao Wei, Xun Guo, Yao Zhao, Bingyi Kang, Jiashi Feng, Xiaojie Jin
CVPR 2025
RoboVLMs
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models
Xinghang Li, Peiyan Li, Minghuan Liu, Dong Wang, Jirong Liu, Bingyi Kang, Xiao Ma, Tao Kong, Hanbo Zhang, Huaping Liu
Natural Machine Intelligence, 2025
Superclass
Classification Done Right for Vision-Language Pre-Training
Zilong Huang, Qinghao Ye, Bingyi Kang, Jiashi Feng, Haoqi Fan
NeurIPS 2024
Image Understanding Tokenizer
Image Understanding Makes for A Good Tokenizer for Image Generation
Luting Wang, Yang Zhao, Zijian Zhang, Jiashi Feng, Si Liu, Bingyi Kang
NeurIPS 2024
Depth Anything V2
Lihe Yang, Bingyi Kang†, Zilong Huang, Zhen Zhao, Xiaogang Xu, Jiashi Feng, Hengshuang Zhao
NeurIPS 2024
MADiff
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu, Stefano Ermon, Weinan Zhang
NeurIPS 2024
Token-Based World Models
Improving Token-Based World Models with Parallel Observation Prediction
Lior Cohen, Kaixin Wang, Bingyi Kang, Shie Mannor
ICML 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang, Bingyi Kang†, Zilong Huang, Xiaogang Xu, Jiashi Feng, Hengshuang Zhao
CVPR 2024
SEEM
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline RL
Yang Yue*, Rui Lu*, Bingyi Kang*, Shiji Song, Gao Huang
NeurIPS 2023
EDP
Efficient Diffusion Policies for Offline Reinforcement Learning
Bingyi Kang*, Xiao Ma*, Chao Du, Tianyu Pang, Shuicheng Yan
NeurIPS 2023
MISA
Mutual Information Regularized Offline Reinforcement Learning
Xiao Ma*, Bingyi Kang*, Zhongwen Xu, Min Lin, Shuicheng Yan
NeurIPS 2023
FreeMask
FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
Lihe Yang, Xiaogang Xu, Bingyi Kang, Yinghuan Shi, Hengshuang Zhao
NeurIPS 2023
BalFeat
Exploring Balanced Feature Spaces for Representation Learning
Bingyi Kang, Yu Li, Saining Xie, Zehuan Yuan, Jiashi Feng
ICLR 2021
Decoupling
Decoupling Representation and Classifier for Long-Tailed Recognition
Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis
ICLR 2020
Few-shot detection
Few-Shot Object Detection via Feature Reweighting
Bingyi Kang*, Zhuang Liu*, Xin Wang, Fisher Yu, Jiashi Feng, Trevor Darrell
ICCV 2019
PoFD
Policy Optimization with Demonstrations
Bingyi Kang*, Zequn Jie, Jiashi Feng
ICML 2018

Open Projects

BuboGPT
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
Yang Zhao*, Zhijie Lin*, Daquan Zhou, Zilong Huang, Jiashi Feng, Bingyi Kang
Open Project, 2023