CV Page
π― Research Focus
My core research agenda centers on human-centric artificial intelligence, with specialized expertise in:
- π΄οΈ Human Pose Estimation:
3D ReconstructionOcclusion HandlingMultiModel Methods - π Motion Synthesis & Generation:
InteractiveText-Driven ControlLong-Sequence Generation - π Video Motion Capture System:
Multi-ViewHigh AccuracyBiomechanical Analysis
βοΈ Technical Arsenal
- Meta Human:
Stable DiffusionDiTVAEPPOControllable - 3D Vision:
MeshMotion Synthesis3D Pose Estimation - Motion Capture:
Multi-ViewMarker-lessBiomechanics Analysisββ - Multimodal:
ImageTextMotion - Optimization:
KV CacheLoRANoise StrategyCFG
π Key Projects
π Self-Forcing Autoregressive Diffusion for Real-Time Text-Driven Motion Generation
AutoRegress Diffusion DiT Reinforcement Learning (PPO) Real-time Control
- β SOTA Performance: Achieved 300+ FPS with KV Cache acceleration
- β Enabled continuous long-sequence generation via novel self-forcing paradigm
- β Balanced quality/speed via noise scheduling & annealing strategies
π MetaPose: Multimodal Enhancement and Transformation Alignment 3D Pose Restruction
Vision-Text-Spatial Fusion Cross-Modal Alignment Occlusion Robustness
- β IEEE TMM (Q1 Under Review)
- β SOTA Accuracy: Outperformed video-based methods using single-frame input
- β Novel Framework: A unified distribution space for anatomical/textual/visual features
π Multi-View Video Capture for Biomechanics Analysis
ViT Temporal Processing Triangulation
- β EAAI (Q1, Major Revision)
- β Journal of Biomechanics (Top Journal of Orthopedics, Major Revision)
- β High Accuracy: achieve the requirements of clinical biomechanical analysis.It is more accurate than the online markerless motion capture systems available on the market.

π€ Provably Safe Vision-Based Teleoperation for Dual-Arm Surgical Robots
Surgical Robot Gesture Control Teleoperation
- β IEEE ROBIO 2025
Demonstration
β‘ Connection-Aware Graph Convolution Networks
GCN Anatomical/Kinematic Modeling Multi-Level Aggregation
- β HCIS Journal (JCR Q1, Under Review)
- β 40% Faster than video-based approaches while maintaining SOTA accuracy
π©Ί Reinforce Learning Enhanced CTA for Noninvasive Prediction
PPO CTA Medical Image Analysis
- β Liver International (JCR Q1)
- β High accuracy rate of identifying the ROI area of extremely severe lesions

π Recognitions
- National Encouragement Scholarship 2020-2022 (Top 10%)
- 4 SCI Q1 Papers (First-Author, 3 Under Review Now)
- GPA 3.5+ @ Project 985 universitiy
- GPA Top 10% @ Project 211 university
π Publications
- Accept:
Liver International (Q1)ROBIO 2025π₯° - Major Revision:
EAAI (Q1),Journal of Biomechanics (Q2)οΌ - Under Review:
HCIS (Q1),TMM (Q1),AAAI 2026 (CCFA) - Prepare to Submit:
CVPR 2026 (CCFA)
