Voice-driven 3D avatar animation engine for the browser.
Lip sync · Facial expressions · Body motion · All from audio alone.
V1 phoneme engine — 111-dim output mapped to 52 ARKit blendshapes, ONNX inference, real-time visualization.
Try it →V2 student model — 52 ARKit blendshapes direct prediction, crisp mouth, real-time visualization.
Try it →Side-by-side comparison. Same voice, two animation engines, two avatars. See the difference live.
Try it →