Problem: we keep using frontier LLMs as glue for jobs that are already solved.
Solution: run OCR + NER locally in C# with ONNX Runtime. Deterministic extraction on ingest. Store the entities. Use an LLM later only if you actually need synthesis.
OCR with Tesseract, then BERT NER via ONNX in .NET. No Python, no cloud, no tokens.
This is my 'for beginners' article. I'm DEEP in OCR but realised I never explained the quickest way to do this *locally*.
https://www.mostlylucid.net/blog/simple-ocr-ner-extraction
#CSharp #DotNet #ONNX #OnnxRuntime #OCR #NER #LocalAI #RAG #DocumentAI

