Locara

image-to-video

HF group: Computer Vision · Status: ❌ not built

What it is

Still image → animated video, with no text instruction. Useful for animating photos and artwork. Distinct from image-text-to-video, which also takes a text prompt.

Open-weight models

| Model | Params | Released | License | Quality | Notes |
| --- | --- | --- | --- | --- | --- |
| Stable Video Diffusion (SVD-XT) | ~2 B | 2023 | Stability community | 25 frames at 576x1024 | Foundational; non-commercial license caveats. |
| CogVideoX-5B-I2V | 5 B | 2024 | Apache-2.0 | Strong, controllable | Same model as image-text-to-video; can run without text prompt. |
| DynamiCrafter | ~1 B | 2024 | Apache-2.0 | Subtle natural motion | Hair, water, etc. |
| I2VGen-XL | ~2 B | 2024 | Apache-2.0 | Photo → motion | Alibaba DAMO. |

Infrastructure required

Inference

Input

  • ❌ Image input pipeline.

Output

  • ❌ Video file save (10-100 MB).

Storage

  • ❌ Weights cache.
  • Output: fs.user-folder.

Interaction (IPC + SDK)

  • video.animate({ image }) IPC with long-running task progress.
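
A minimal sketch of how the call above could look from the SDK side, since this IPC is not built yet. Only the call shape video.animate({ image }) comes from this page; the progress callback, the field names (step, totalSteps, outputPath), the assumption that image is a file path, and the mock backend are all hypothetical and exist only so the example runs without a diffusion runtime.

```typescript
// Hypothetical types: only video.animate({ image }) is taken from the page.
interface AnimateRequest {
  image: string; // assumed: path of the user-selected still image
}

interface AnimateProgress {
  step: number;       // denoising steps completed so far (assumed unit)
  totalSteps: number; // total denoising steps planned
}

interface AnimateResult {
  outputPath: string; // assumed: where the video file was saved
}

type ProgressListener = (p: AnimateProgress) => void;

interface VideoApi {
  animate(req: AnimateRequest, onProgress?: ProgressListener): Promise<AnimateResult>;
}

// Convert a progress event to a whole-number percentage for a progress bar.
function percent(p: AnimateProgress): number {
  return Math.round((p.step / p.totalSteps) * 100);
}

// Stand-in backend: emits one progress event per step, then resolves.
function createMockVideoApi(totalSteps: number): VideoApi {
  return {
    async animate(req, onProgress) {
      for (let step = 1; step <= totalSteps; step++) {
        // A real backend would run one denoising step here.
        await Promise.resolve();
        onProgress?.({ step, totalSteps });
      }
      // Swap the image extension for .mp4 (append if there is none).
      return { outputPath: req.image.replace(/\.[^./\\]+$/, "") + ".mp4" };
    },
  };
}

// Collect every progress event and the final result of one call.
async function demo(): Promise<{ events: AnimateProgress[]; result: AnimateResult }> {
  const video = createMockVideoApi(4);
  const events: AnimateProgress[] = [];
  const result = await video.animate({ image: "photos/cat.png" }, (p) => events.push(p));
  return { events, result };
}
```

A callback keeps the request and its progress stream tied to one call; an event channel keyed by a task id would be an equally plausible design if tasks need to survive a window reload.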

Capabilities (manifest)

  • capabilities.fs.user-selected for input image.
  • capabilities.fs.user-folder for output.
  • capabilities.models[] for the model.
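
As an illustration of the three entries above, the sketch below models a manifest and checks that it declares everything this task needs. The key names follow the list; the value types (booleans for the fs entries, string ids for models) and the helper missingCapabilities are assumptions, not a documented schema, and the model id is only a placeholder.

```typescript
// Hypothetical manifest shape: only the three capability keys come from the page.
interface Manifest {
  capabilities: {
    fs: {
      "user-selected": boolean; // assumed flag: read access to a picked input image
      "user-folder": boolean;   // assumed flag: write access for the output video
    };
    models: string[];           // assumed: list of model identifiers
  };
}

// Return the capability keys that image-to-video needs but the manifest lacks.
function missingCapabilities(m: Manifest): string[] {
  const missing: string[] = [];
  if (!m.capabilities.fs["user-selected"]) missing.push("capabilities.fs.user-selected");
  if (!m.capabilities.fs["user-folder"]) missing.push("capabilities.fs.user-folder");
  if (m.capabilities.models.length === 0) missing.push("capabilities.models[]");
  return missing;
}

// Example manifest; the model id is illustrative only.
const exampleManifest: Manifest = {
  capabilities: {
    fs: { "user-selected": true, "user-folder": true },
    models: ["stabilityai/stable-video-diffusion-img2vid-xt"],
  },
};
```

Calling missingCapabilities(exampleManifest) returns an empty array; setting "user-folder" to false would return a single entry naming capabilities.fs.user-folder.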

Gaps

Diffusion runtime, image input pipeline, video output IPC, long-running task progress IPC.

See also