Locara

summarization

HF group: NLP · Status: 🟡 partial (works through chat LLM)

What it is

Long text → short text. Abstractive summarization.

Open-weight models

ModelParamsReleasedLicenseQualityNotes
BART-Large-CNN400 M2020MITFoundationalOld but reliable.
Pegasus-XSum570 M2020Apache-2.0Strong news summarizationOlder.
Any chat LLM1.5-30 B2024-26variousBestLLM with a “summarize” prompt outperforms specialized models.

Infrastructure required

Inference

  • 🟡 Works via chat LLM today.
  • ❌ Specialist encoder-decoder (BART/Pegasus) — same gap as translation.

Input

  • Plain text.

Output

  • Plain text (typically short).

Storage

  • ✅ Weights cache.

Interaction (IPC + SDK)

  • 🟡 Today: llm.chat with a “summarize” prompt.
  • App pattern: long-doc summarization typically chunks input first.

Capabilities (manifest)

  • capabilities.models[] for chat LLM (or specialist).

Gaps

Nothing critical. Specialist summarization models matter if deterministic output structure is required, in which case they share the encoder-decoder runtime gap with translation.

See also