Abstract
Abstract
Controlled character animation requires transferring motion from a driving sequence to a reference character. Prior works heavily rely on intermediate representations, including pose skeletons to represent motion or masked background to represent environment, which inevitably leads to information loss. To address this, we present SCAIL-2, a framework that bypasses those intermediates and achieves \textbf{end-to-end} character animation. By directly concatenating driving videos to the sequence, the model can obtain all the required visual information from the input video. To address the lack of end-to-end data, we unify sub-tasks of character animation with decoupled conditions and then curate a pipeline to synthesize MotionPair-60K, an end-to-end motion transfer dataset containing heterogeneous tasks of character animation. To achieve the unification, we utilize in-context mask conditioning and mode-specific RoPE as soft guidance beyond textual instructions and raw visual information. To address synthetic discrepancy in detailed regions, we propose Bias-Aware DPO to construct preference items to mitigate the errors. Extensive experiments demonstrate that our method substantially outperforms existing state-of-the-art approaches in various character animation tasks. A large subset of synthetic data as well as model weights will be released at our project page: https://teal024.github.io/SCAIL-2/.
Direct answer
What can I do from this paper page?
Use this page to scan "SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning" quickly: start with the summary and abstract, then check the authors, source, topics, and related papers. From here, open Scollr to follow Human Motion and Animation research, save the paper, or map adjacent work.
Research areas
Follow related topics
Citation
BibTeX
@article{Yan2026SCAIL,
title = {SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning},
author = {Wenhao Yan and Fengjia Guo and Zhuoyi Yang and Jie Tang},
journal = {arXiv (Cornell University)},
year = {2026},
doi = {10.48550/arxiv.2606.10804},
url = {https://doi.org/10.48550/arxiv.2606.10804}
}
FAQ
Using this paper in a discovery workflow
How do I find related work for this paper?
Use the related papers and topic links on this page as starting points. In Scollr, you can also open the paper and build a literature map around its references, citing papers, and related work.
How can I keep up with new Human Motion and Animation research papers?
Follow Human Motion and Animation research in Scollr. New papers from the topic flow into a personalized feed, and you can save useful studies to revisit later.
Can I cite this paper from this page?
This page includes a static BibTeX block for SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning. Always verify the DOI, source, and publication details against the publisher record before submitting a manuscript.
Follow this research in Scollr
Follow the topics and authors behind this paper, save useful studies, and build a literature map when you are ready to go deeper.
Get the app