Internal document text extraction and classification service replacing LLaMA 3.1 with faster, cost-effective ML models for entity recognition and document classification.
MLNER is an internal document text extraction and classification service developed to replace large LLM-based pipelines (such as LLaMA 3.1) with a faster, cost-effective, and scalable model. The service extracts key entities from documents such as IDs, financial statements, and utility bills with high accuracy and classifies document types in real time. It was deployed to reduce GPU load while maintaining top-tier extraction performance across various document layouts.