Ming Hu1,2,3 * · Zhengdi Yu *4 · Feilong Tang1,2,3 · Kaiwen Chen5 · Yulong Li3 · Imran Razzak3 · Junjun He2 · Tolga Birdal4 · Kaijing Zhou †5 · Zongyuan Ge †1
1Monash University · 2Shanghai AI Laboratory · 3MBZUAI · 4Imperial College London · 5Eye Hospital, Wenzhou Medical Univeristy
We introduce OphNet-3D, the first large-scale RGB-D dataset for dynamic 3D hand-instrument reconstruction in ophthalmic microsurgery, supported by an efficient multi-stage annotation pipeline, and propose novel architectures (H-Net and OH-Net) that significantly outperform existing methods in accurate hand and instrument reconstruction tasks.
- [2025-6-26] 🎉🎉🎉 OphNet-3D is accepted by NeurIPS 2025 as a Spotlight Paper!.
- [2025/5/26] Paper is now available. ⭐
- Release dataset
- Release baseline experimental results and checkpoints
Expected in mid-November
@article{hu2025towards,
title={Towards dynamic 3d reconstruction of hand-instrument interaction in ophthalmic surgery},
author={Hu, Ming and Yu, Zhengdi and Tang, Feilong and Chen, Kaiwen and Li, Yulong and Razzak, Imran and He, Junjun and Birdal, Tolga and Zhou, Kaijing and Ge, Zongyuan},
journal={arXiv preprint arXiv:2505.17677},
year={2025}
}For any questions, please contact ming.hu@monash.edu or z.yu23@imperial.ac.uk .

