Human Motion and Animation Open access

MoGeFlow: Flowing Through Motion Codebook Geometry for Text-to-Motion Generation

Pengcheng Fang, Tengjiao Sun, Xiaoyu Zhan, Xiaohao Cai and 1 more

arXiv (Cornell University) | Jun 10, 2026

Abstract

Vector-quantized motion tokenizers provide a compact discrete interface for text-to-motion generation, but most motion-code priors treat code indices as unordered categorical labels. This view overlooks a key property of motion codes: they are decoder-bound prototypes of physical movement, and their learned codebooks can carry meaningful local kinematic geometry. We verify this property through codebook diagnostics. Distances between learned PartVQ group-specific codes align with local motion-prototype distances, shuffled controls remove this alignment, and replacing codes with progressively farther neighbors induces monotonically larger decoded motion changes. These results show that motion codebooks exhibit measurable, non-random, and decoder-causal geometry. Based on this observation, we propose \textbf{MoGeFlow}, a text-to-motion model that generates through motion codebook geometry. MoGeFlow represents each motion-code frame as a structured set of PartVQ group-specific code embeddings, learns a text-conditioned continuous flow over these frame states, and projects terminal states back to valid motion codes for frozen decoding. This preserves the compactness and validity of discrete tokenization while replacing categorical code prediction with geometry-aware codebook-space generation. Experiments set new state of the art in R-Precision on HumanML3D and KIT-ML, achieve the best HumanML3D MultiModal Distance and KIT-ML FID among generated methods, and obtain the best MotionMillion R@1, R@2, R@3, and FID under the benchmark protocol.

Direct answer

What can I do from this paper page?

Use this page to scan "MoGeFlow: Flowing Through Motion Codebook Geometry for Text-to-Motion Generation" quickly: start with the summary and abstract, then check the authors, source, topics, and related papers. From here, open Scollr to follow Human Motion and Animation research, save the paper, or map adjacent work.

Authors

Researchers on this paper

Pengcheng Fang

first

Tengjiao Sun

middle

Xiaoyu Zhan

middle | ORCID 0000-0002-2222-0608

Xiaohao Cai

middle | ORCID 0000-0003-0924-2834

Dongjie Fu

last

Research areas

Follow related topics

Latest Human Motion and Animation research Generative Adversarial Networks and Image Synthesis Human Pose and Action Recognition

Citation

BibTeX

@article{Fang2026MoGeFlow,
  title = {MoGeFlow: Flowing Through Motion Codebook Geometry for Text-to-Motion Generation},
  author = {Pengcheng Fang and Tengjiao Sun and Xiaoyu Zhan and Xiaohao Cai and Dongjie Fu},
  journal = {arXiv (Cornell University)},
  year = {2026},
  doi = {10.48550/arxiv.2606.11656},
  url = {https://doi.org/10.48550/arxiv.2606.11656}
}

FAQ

Using this paper in a discovery workflow

How do I find related work for this paper?

Use the related papers and topic links on this page as starting points. In Scollr, you can also open the paper and build a literature map around its references, citing papers, and related work.

How can I keep up with new Human Motion and Animation research papers?

Follow Human Motion and Animation research in Scollr. New papers from the topic flow into a personalized feed, and you can save useful studies to revisit later.

Can I cite this paper from this page?

This page includes a static BibTeX block for MoGeFlow: Flowing Through Motion Codebook Geometry for Text-to-Motion Generation. Always verify the DOI, source, and publication details against the publisher record before submitting a manuscript.

Follow this research in Scollr

Follow the topics and authors behind this paper, save useful studies, and build a literature map when you are ready to go deeper.

Get the app