Skip to main content

Research Repository

Advanced Search

Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation

Wang, Yin; Li, Mu; Liu, Jiapeng; Leng, Zhiying; Li, Frederick W. B.; Zhang, Ziyao; Liang, Xiaohui

Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation Thumbnail


Authors

Yin Wang

Mu Li

Jiapeng Liu

Zhiying Leng

Ziyao Zhang

Xiaohui Liang



Abstract

We address the challenging problem of fine-grained text-driven human motion generation. Existing works generate imprecise motions that fail to accurately capture relationships specified in text due to: (1) lack of effective text parsing for detailed semantic cues regarding body parts, (2) not fully modeling linguistic structures between words to comprehend text comprehensively. To tackle these limitations, we propose a novel fine-grained framework Fg-T2M++ that consists of: (1) an LLMs semantic parsing module to extract body part descriptions and semantics from text, (2) a hyperbolic text representation module to encode relational information between text units by embedding the syntactic dependency graph into hyperbolic space, and (3) a multi-modal fusion module to hierarchically fuse text and motion features. Extensive experiments on HumanML3D and KIT-ML datasets demonstrate that Fg-T2M++ outperforms SOTA methods, validating its ability to accurately generate motions adhering to comprehensive text semantics.

Citation

Wang, Y., Li, M., Liu, J., Leng, Z., Li, F. W. B., Zhang, Z., & Liang, X. (online). Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation. International Journal of Computer Vision, https://doi.org/10.1007/s11263-025-02392-9

Journal Article Type Article
Acceptance Date Feb 8, 2025
Online Publication Date Feb 27, 2025
Deposit Date Mar 27, 2025
Publicly Available Date Mar 27, 2025
Journal International Journal of Computer Vision
Print ISSN 0920-5691
Electronic ISSN 1573-1405
Publisher Springer
Peer Reviewed Peer Reviewed
DOI https://doi.org/10.1007/s11263-025-02392-9
Public URL https://durham-repository.worktribe.com/output/3742944

Files





You might also like



Downloadable Citations