2026-03-17 18:49:49
The entire stack is #opensource: dots.ocr model from #HuggingFace, vLLM for inference, #FastAPI proxy with parallel rendering streaming. Total model size ~12GB, runs comfortably on any…
The entire stack is #opensource: dots.ocr model from #HuggingFace, vLLM for inference, #FastAPI proxy with parallel rendering streaming. Total model size ~12GB, runs comfortably on any…