Kaiyang Ji

jiky2024 AT shanghaitech DOT edu DOT cn

main_pic.jpg

2-505, VDI, SIST

393 Middle Huaxia Road, 201210

Shanghai, China

About Me

I am Kaiyang Ji, a first-year master student at Visual & Data Intelligence Center (VDI) in ShanghaiTech University, advised by Prof. Jingya Wang and Prof. Ye Shi. Previously, I graduated from ShanghaiTech University with a major in computer science, advised by Prof. Jingya Wang and Prof. Jingyi Yu.

Research Interest

My research interest broadly lies in computer vision, machine learning, and robotics. Particularly, my current research focuses on Human-Centered 3D Vision, Generative Models and Embodied AI.

I am looking for collaborators and friends. Feel free to contact me if you are interested in these fantasic topics!

Email / Google Scholar / Github

news

Jul 29, 2025 We will organize ICCV 2025 Workshop Challenge “Human-Robot-Scene Interaction and Collaboration”! Call for candidates! :raised_hands:
Jul 24, 2025 Human-X has been accepted by ICCV 2025 as Highlight! :tada::tada::tada:
Jun 26, 2025 Our paper Human-X has been accepted by ICCV 2025! :sparkles:
Sep 01, 2024 I have joined VDI in 24Fall as a CS Master student!
Feb 27, 2024 Our paper S2Fusion has been accepted by CVPR 2024! :sparkles:

selected publications

  1. arXiv 2505
    dreampolicy.png
    One Policy but Many Worlds: A Scalable Unified Policy for Versatile Humanoid Locomotion
    Yahao Fan*, Tianxiang Gui*, Kaiyang Ji*, and 6 more authors
    arXiv preprint arXiv:2505.18780, 2025
  2. ICCV 2025 Hightlight
    human-x.png
    Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis
    Kaiyang Ji, Ye Shi, Zichen Jin, and 5 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025
  3. arXiv 2503
    vlm-rmd.png
    Human-Object Interaction via Automatically Designed VLM-Guided Motion Policy
    Zekai Deng, Ye Shi, Kaiyang Ji, and 3 more authors
    arXiv preprint arXiv:2503.18349, 2025
  4. CVPR 2024
    s2fusion.png
    A unified diffusion framework for scene-aware human motion estimation from sparse signals
    Jiangnan Tang, Jingya Wang, Kaiyang Ji, and 3 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024