
Large language models (LLMs) have the potential to impact a wide range of creative domains, as exemplified by popular text-to-image generators like DALL·E and Midjourney. However, the application of LLMs to motion-based visual design has not yet been explored and presents novel challenges, such as how users might effectively describe motion in natural language. Further, many existing generative design tools lack support for iterative refinement of designs beyond prompt engineering. In this paper, we present Keyframer, a design tool that leverages the code generation capabilities of LLMs to support design exploration for animations. Informed by interviews with professional motion designers, animators, and engineers, we designed Keyframer to support both the ideation and refinement stages of animation design by enabling users to explore design variants throughout their process. We evaluated Keyframer with 13 users with a range of animation and programming experience, examining their prompting strategies and how they considered incorporating design variants into their process. We share a series of design principles for applying LLMs to motion design prototyping tools and their potential implications for visual design tools more broadly.

Related readings and updates.

We propose a general-purpose approach for improving the ability of Large Language Models (LLMs) to intelligently and adaptively gather information from a user or other external source using the framework of sequential Bayesian experimental design (BED). This enables LLMs to act as effective multi-turn conversational agents and interactively interface with external environments. Our approach, which we call BED-LLM (Bayesian Experimental Design…



Generating realistic lip motion from audio to simulate speech production is critical for driving natural character animation. Previous research has shown that traditional metrics used to optimize and assess models for generating lip motion from speech are not a good indicator of subjective opinion of animation quality. Devising metrics that align with subjective opinion first requires understanding what impacts human…
