Locara

video-classification

HF group: Computer Vision · Status: ❌ not built

What it is

Video → action / activity label.

Open-weight models

Model | Params | Released | License | Quality | Notes
VideoMAE-V2 | 90 M – 1 B | 2023 | CC-BY-NC | Strong on Kinetics | Self-supervised.
TimeSformer | ~120 M | 2021 | Apache-2.0 | Foundational | Older.
Apple Vision Action Classifier | n/a | macOS | Apple | Limited classes | Native.

Infrastructure required

Inference

  • ❌ Video encoder runtime.

Input

  • ❌ Video file loading + frame sampling.
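
Frame sampling is not built yet, so the following is only a minimal sketch of one common approach: uniform sampling, which takes the centre frame of each equal-length segment of the clip. The function name `sampleFrameIndices` and the clip length of 16 are illustrative assumptions, not part of any existing Locara API, and decoding the video file itself is out of scope here.

```typescript
// Hypothetical helper (not an existing Locara API): choose `numFrames`
// evenly spaced frame indices from a clip with `totalFrames` decoded frames.
function sampleFrameIndices(totalFrames: number, numFrames: number): number[] {
  if (!Number.isInteger(totalFrames) || totalFrames <= 0) {
    throw new RangeError("totalFrames must be a positive integer");
  }
  if (!Number.isInteger(numFrames) || numFrames <= 0) {
    throw new RangeError("numFrames must be a positive integer");
  }

  const indices: number[] = [];
  const segment = totalFrames / numFrames; // segment length in frames (may be fractional)
  for (let i = 0; i < numFrames; i++) {
    // Centre of the i-th segment, clamped to the last valid frame index.
    const idx = Math.min(totalFrames - 1, Math.floor(segment * (i + 0.5)));
    indices.push(idx);
  }
  return indices;
}

// Illustrative call: 16 frames from a 160-frame clip.
const sampled = sampleFrameIndices(160, 16);
console.log(sampled.length, sampled[0], sampled[sampled.length - 1]);
```

When the clip has fewer frames than requested, this sketch repeats indices rather than failing, so the encoder always receives a fixed-length input.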

Output

  • Label + confidence.
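
One way the label and confidence could be derived, assuming the encoder returns one raw logit per class: apply a softmax and report the top class with its probability. The `Classification` shape and the `topLabel` name are hypothetical, introduced only for this sketch.

```typescript
// Hypothetical output shape for this sketch (field names are assumptions).
interface Classification {
  label: string;
  confidence: number; // softmax probability in [0, 1]
}

// Convert raw class logits into the top label and its softmax probability.
function topLabel(logits: number[], labels: string[]): Classification {
  if (logits.length === 0 || logits.length !== labels.length) {
    throw new RangeError("logits and labels must be non-empty and equal in length");
  }

  // Subtract the maximum logit before exponentiating for numerical stability.
  const max = Math.max(...logits);
  const exps = logits.map((x) => Math.exp(x - max));
  const sum = exps.reduce((a, b) => a + b, 0);

  // Find the index of the largest value (first one wins on ties).
  let best = 0;
  for (let i = 1; i < exps.length; i++) {
    if (exps[i] > exps[best]) best = i;
  }
  return { label: labels[best], confidence: exps[best] / sum };
}

// Illustrative call with made-up logits and class names.
console.log(topLabel([1, 3, 2], ["walking", "running", "jumping"]));
```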

Storage

  • ❌ Weights cache.

Interaction (IPC + SDK)

  • video.classify({ path }) IPC.
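
As a rough sketch of the message shapes this call might use: only the `path` field comes from the signature above. The result fields mirror the Output section, and every type and function name below is an assumption, since the IPC is not built.

```typescript
// Request shape: only `path` is taken from the documented call.
interface VideoClassifyRequest {
  path: string;
}

// Hypothetical result shape, mirroring "Label + confidence" from Output.
interface VideoClassifyResult {
  label: string;
  confidence: number;
}

// Hypothetical runtime guard for untrusted IPC payloads: accepts only
// objects that carry a non-empty string `path`.
function isVideoClassifyRequest(value: unknown): value is VideoClassifyRequest {
  if (typeof value !== "object" || value === null) return false;
  const path = (value as { path?: unknown }).path;
  return typeof path === "string" && path.length > 0;
}

// Illustrative check with a made-up path.
console.log(isVideoClassifyRequest({ path: "/tmp/clip.mp4" }));
```

Validating the payload at the IPC boundary keeps malformed requests away from the file loader, which matters because the path is user-selected.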

Capabilities (manifest)

  • capabilities.fs.user-selected.
  • capabilities.models[].

Gaps

The gaps are the same shared rails needed elsewhere: a video input pipeline (file loading and frame sampling) and encoder-only inference for non-text models.

See also