Home |

What we do

Our research focuses on multimodal intelligence and perception systems

Research on integrating vision, language, and other modalities for intelligent perception systems.

Developing Vision-Language-Action (VLA) model for autonomous robot.

Advanced research in semantic segmentation, object detection, and scene understanding.

Applying AI and deep learning to medical imaging, diagnosis, and healthcare applications. Delving into genomics and protein design with AI.

Publishing cutting-edge research at top-tier conferences including CVPR, ICCV, ECCV, and NeurIPS.

Working with international collaborators and industry partners on innovative AI solutions.