🚀 Running #OCR at scale with a #Vision #LLM for $0.49/hour
Just deployed dots.ocr (3B parameter Vision LLM by RedNote) on a single
Any #ffmpeg experts here? I want to do one simple thing: make certain colors transparent (then alphaextract to feed into tesseract #OCR). That's it. The colorkey filter is perfect, but it seems to be an absolute impossibility if you want more than one color transparent.
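One possible way around this, sketched in plain Python rather than as an ffmpeg filtergraph: build the alpha mask yourself by marking a pixel transparent when it is close to any of several key colors. The function name `alpha_mask`, the `similarity` parameter, and the normalized RGB distance are assumptions made for illustration; they are not claimed to match what colorkey does internally.

```python
import math

def alpha_mask(pixels, key_colors, similarity=0.1):
    """Return one alpha value per pixel.

    pixels      -- iterable of (r, g, b) tuples, 0-255 each
    key_colors  -- iterable of (r, g, b) tuples to make transparent
    similarity  -- hypothetical threshold in [0, 1] on the normalized RGB distance
    """
    # Largest possible Euclidean distance between two 8-bit RGB colors.
    max_dist = math.sqrt(3 * 255 ** 2)
    alphas = []
    for r, g, b in pixels:
        # A pixel is keyed out if it is close enough to ANY of the key colors.
        keyed = any(
            math.sqrt((r - kr) ** 2 + (g - kg) ** 2 + (b - kb) ** 2) / max_dist
            <= similarity
            for kr, kg, kb in key_colors
        )
        alphas.append(0 if keyed else 255)
    return alphas

# Usage: key out red and blue at once; black stays opaque.
mask = alpha_mask(
    [(255, 0, 0), (0, 0, 255), (0, 0, 0), (250, 5, 5)],
    key_colors=[(255, 0, 0), (0, 0, 255)],
    similarity=0.05,
)
```

The resulting list plays the role of the grayscale mask that alphaextract would otherwise produce, so it could be written out as an image and passed to tesseract.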
🔍 Hybrid unlocks: borderless table extraction (0.49 to 0.93 TEDS), #OCR in 80 languages, #LaTeX formula extraction from scientific papers, AI chart & image descriptions
🛡️ Built-in #AI safety: fil…