Sample documents
Recognition (Markdown)
See the SIE call
// pick a recognition model in the dropdown
Click a sample on the left.
Extraction
See the SIE calls
// structured (Donut)// NER (GLiNER)
Typed fields will appear here.
OCR is rarely a single-model problem. This demo runs three model classes through one SIE server: a VLM-OCR recognizes the document into Markdown, a fine-tuned Donut emits a JSON tree directly, and a zero-shot NER (GLiNER) pulls typed fields out of the recognition output. Pick a sample on the left, swap any of the three models in the dropdowns, watch SIE hot-swap them with one identifier change.
client.extract(model_id, item)
Three different model architectures (a vision-language model, a
fine-tuned encoder-decoder, a span-based NER), one inference engine,
one HTTP API, one SDK call. Without SIE, this demo would be three
separate inference services with three SDKs, three auth flows, three
rate limits. With SIE, swap a string in client.extract(...)
and the underlying architecture changes.
gliner_multi to
gliner_large. Same labels, same input text,
different confidence scores. Model quality is a single dropdown
away.
// pick a recognition model in the dropdown
Click a sample on the left.
// structured (Donut)// NER (GLiNER)
Typed fields will appear here.