AI

Motion Transfer

Applying one subject's movement to another — teaching Legolas to moonwalk by copying someone who already can.

Motion transfer extracts motion data from one video (the source) and applies it to animate a different subject (the target). The source video provides the movement — how a body moves through space, how limbs articulate through a sequence of actions — which is extracted as a series of pose keypoints or skeletal data. These motion sequences are then applied to drive a target subject: a different person, a brand character, an AI avatar, or a 3D model, producing video of the target performing the same motion as the source without the target having physically performed the motion. This is a generalization of the motion capture techniques used in film VFX — where performer motion drives CGI characters — made accessible through neural network-based approaches that don't require specialized motion capture hardware.

The technical implementation typically involves two stages: pose estimation (using a neural network to extract the skeletal pose sequence from the source video, frame by frame) and pose-conditioned generation (using a ControlNet-style model to generate the target subject performing the extracted pose sequence, maintaining the target's appearance while producing the source's motion). Quality varies by how different the source and target body types are, how complex the motion is, and how much the motion pushes the target model's generalization capability. Smooth, large-motion sequences (walking, basic gestures, simple dances) transfer more reliably than fine-motor or extreme-motion sequences.

For B2B teams working with AI avatars and branded character animation, motion transfer is a compelling tool for creating natural-feeling presentations and explainer content. Rather than describing character motions through text prompts (which produces generic, less naturalistic movement), a content creator can perform the desired gestures and expressions themselves and transfer those motions to the AI avatar or brand character — the avatar inherits the natural, specific, intentional motion of the human performance. This is particularly useful for presenter-style content where natural hand gestures, body language, and pacing are important elements of the communication — the AI avatar can feel as dynamic and naturally expressive as a human recording, driven by an actual human performance rather than procedurally generated motion.

motion transferpose transfercharacter animationAI animationmotion captureAI video

Related terms