Human Motion and Animation Open access

KV-Control: Parameter-Efficient K/V Injection for Trajectory-Controlled Text-to-Motion

Tengjiao Sun, Pengcheng Fang, Xiaoyu Zhan, Yanwen Guo and 3 more

arXiv (Cornell University) | Jun 4, 2026

Abstract

Abstract

Text-conditioned 3D human motion models now synthesize plausible motions from prompts, but practical animation and embodied-agent workflows rarely stop at text: a character may need to follow a sketched root path, hit an end-effector target, or satisfy a multi-joint trajectory while still preserving the gait, style, and intent described by language. This exposes a control trade-off. A trajectory controller should be precise without overwriting the pretrained text-conditioned motion prior, yet existing solutions either duplicate large portions of the generator to regain per-layer control access or move much of the cost to test-time optimization. We introduce KV-Control, a compact attention-side control interface for frozen masked text-to-motion transformers. The key idea is to make geometric constraints available as memory inside self-attention rather than injecting them through a global pose token or enforcing them only at the output side. To support this interface, we co-design a part-tokenized motion substrate and controller: \textbf{PartVQ} learns anatomy-aligned part codebooks, T-Concat exposes each frame--part token as an attention-addressable site, and KV-Control injects control-conditioned key/value memories at every self-attention layer while preserving the pretrained query stream, text cross-attention, FFN, and all backbone weights. The resulting adapter adds only trainable injection parameters atop a shared trajectory encoder, yet tracks root and multi-joint constraints with sub-centimeter accuracy under the inherited refinement protocol while retaining text-conditioned motion quality. KV-Control reframes trajectory conditioning as lightweight memory retrieval, providing a small, precise, and transparent control interface for text-to-motion generation.

Direct answer

What can I do from this paper page?

Use this page to scan "KV-Control: Parameter-Efficient K/V Injection for Trajectory-Controlled Text-to-Motion" quickly: start with the summary and abstract, then check the authors, source, topics, and related papers. From here, open Scollr to follow Human Motion and Animation research, save the paper, or map adjacent work.

Authors

Researchers on this paper

Tengjiao Sun

first

Pengcheng Fang

middle

Xiaoyu Zhan

middle | ORCID 0000-0002-2222-0608

Yanwen Guo

middle

Dongjie Fu

middle

Xiaohao Cai

middle | ORCID 0000-0003-0924-2834

Hansung Kim

last

Research areas

Follow related topics

Citation

BibTeX

@article{Sun2026Control,
  title = {KV-Control: Parameter-Efficient K/V Injection for Trajectory-Controlled Text-to-Motion},
  author = {Tengjiao Sun and Pengcheng Fang and Xiaoyu Zhan and Yanwen Guo and Dongjie Fu and Xiaohao Cai and Hansung Kim},
  journal = {arXiv (Cornell University)},
  year = {2026},
  url = {https://arxiv.org/abs/2606.05624}
}

FAQ

Using this paper in a discovery workflow

How do I find related work for this paper?

Use the related papers and topic links on this page as starting points. In Scollr, you can also open the paper and build a literature map around its references, citing papers, and related work.

How can I keep up with new Human Motion and Animation research papers?

Follow Human Motion and Animation research in Scollr. New papers from the topic flow into a personalized feed, and you can save useful studies to revisit later.

Can I cite this paper from this page?

This page includes a static BibTeX block for KV-Control: Parameter-Efficient K/V Injection for Trajectory-Controlled Text-to-Motion. Always verify the DOI, source, and publication details against the publisher record before submitting a manuscript.

Follow this research in Scollr

Follow the topics and authors behind this paper, save useful studies, and build a literature map when you are ready to go deeper.

Get the app