`image-classification`

HF group: Computer Vision · Status: ❌ not built

What it is

Maps an image to a label (or a probability distribution over labels) drawn from a fixed set that is baked into the model. Distinct from `zero-shot-image-classification`, which takes its candidate labels at runtime.
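To make the "label distribution" idea concrete, here is a minimal sketch of the post-processing step, assuming the model emits one raw logit per label in the fixed set. The helper names `softmax` and `top_k` and the three-label set are illustrative assumptions, not part of any existing crate.

```rust
/// Numerically stable softmax: turns raw logits into a probability distribution.
fn softmax(logits: &[f32]) -> Vec<f32> {
    // Subtract the maximum before exponentiating to avoid overflow.
    let max = logits.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = logits.iter().map(|&x| (x - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.iter().map(|&e| e / sum).collect()
}

/// Pairs each probability with its label and returns the `k` most likely labels.
/// `labels` is the fixed label set (hypothetical here); it must match `logits` in length.
fn top_k<'a>(labels: &[&'a str], logits: &[f32], k: usize) -> Vec<(&'a str, f32)> {
    assert_eq!(labels.len(), logits.len(), "expected one logit per label");
    let probs = softmax(logits);
    let mut scored: Vec<(&'a str, f32)> = labels.iter().copied().zip(probs).collect();
    // Sort by probability, highest first.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(k);
    scored
}

fn main() {
    // Illustrative label set and logits, not real model output.
    let labels = ["cat", "dog", "bird"];
    let logits = [2.0_f32, 0.5, -1.0];
    for (label, p) in top_k(&labels, &logits, 2) {
        println!("{label}: {p:.3}");
    }
}
```

Returning the top `k` pairs rather than a single label covers both readings of the task: `k = 1` gives the plain label, and a larger `k` exposes the distribution.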

| Model | Params | Released | License | Quality | Notes |
| --- | --- | --- | --- | --- | --- |
| ConvNeXt-V2 (Large) | ~200 M | 2023 | MIT | Top of pure-vision classifiers | Fast inference. |
| ViT-Large | ~300 M | 2020 | Apache-2.0 | Foundational | Many fine-tunes. |
| DINOv2 (linear probe) | 86 M – 1.1 B | 2023 | Apache-2.0 | Best features for tiny classifier head | See `image-feature-extraction`. |
| Apple Vision (`VNClassifyImageRequest`) | n/a | macOS | Apple | Built-in scene/object tags | Native API. |

❌ Encoder-only inference path for vision models.
✅ Apple Vision API would be the cheapest hook (zero RAM cost, no model download): same Swift-sidecar pattern as the existing locara-vision-ocr crate.
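
As a rough sketch of what the Rust side of such a hook might look like, the function below parses tags from a sidecar's standard output. Everything about the wire format is an assumption: the one-pair-per-line `identifier<TAB>confidence` layout, the `Tag` struct, and `parse_tags` are hypothetical and are not taken from locara-vision-ocr. The field names only mirror the `identifier` and `confidence` properties of Vision's `VNClassificationObservation`.

```rust
/// One classification tag: a label identifier and a confidence in [0, 1].
/// Hypothetical type, named after the VNClassificationObservation properties.
#[derive(Debug, Clone, PartialEq)]
struct Tag {
    identifier: String,
    confidence: f32,
}

/// Parses assumed sidecar output, one `identifier<TAB>confidence` pair per line.
/// Malformed lines are skipped, tags below `min_confidence` are dropped,
/// and the result is sorted most confident first.
fn parse_tags(stdout: &str, min_confidence: f32) -> Vec<Tag> {
    let mut tags: Vec<Tag> = stdout
        .lines()
        .filter_map(|line| {
            let (identifier, confidence) = line.split_once('\t')?;
            let confidence: f32 = confidence.trim().parse().ok()?;
            Some(Tag {
                identifier: identifier.trim().to_string(),
                confidence,
            })
        })
        .filter(|tag| tag.confidence >= min_confidence)
        .collect();
    tags.sort_by(|a, b| b.confidence.total_cmp(&a.confidence));
    tags
}

fn main() {
    // Made-up sample output, used only to exercise the parser.
    let sample = "outdoor\t0.91\nnot a valid line\nsky\t0.12\ndog\t0.78\n";
    for tag in parse_tags(sample, 0.5) {
        println!("{} ({:.2})", tag.identifier, tag.confidence);
    }
}
```

Keeping the parser a pure function over a string means it can be unit-tested without spawning the sidecar or running on macOS.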