
2025-06-23 20:10:41
OCR Complaint: Smith College (Defending Education)
https://defendinged.org/complaints/ocr-complaint-smith-college/
http://www.memeorandum.com/250623/p111#a250623p111
OCR Complaint: Smith College (Defending Education)
https://defendinged.org/complaints/ocr-complaint-smith-college/
http://www.memeorandum.com/250623/p111#a250623p111
Zero-shot OCR Accuracy of Low-Resourced Languages: A Comparative Analysis on Sinhala and Tamil
Nevidu Jayatilleke, Nisansa de Silva
https://arxiv.org/abs/2507.18264 https://
Improving OCR using internal document redundancy
Diego Belzarena, Seginus Mowlavi, Aitor Artola, Camilo Mari\~no, Marina Gardella, Ignacio Ram\'irez, Antoine Tadros, Roy He, Natalia Bottaioli, Boshra Rajaei, Gregory Randall, Jean-Michel Morel
https://arxiv.org/abs/2508.14557
#MistralAI Document #AI: Advanced #OCR solution for complex document processing 📄
📺
Intelligent Automation for FDI Facilitation: Optimizing Tariff Exemption Processes with OCR And Large Language Models
Muhammad Sukri Bin Ramli
https://arxiv.org/abs/2506.12093
Bookmarked: calfa-co/hye-tesseract: Open OCR model for Armenian #Armenisch_OCR_Tesseract
Paperwork does OCR on everything I scan. I've just scanned a document with my signature on it. It OCR-ed the signature (which is literally a scrawl on "Michał Górny") as "NBA".
{tesseract} allows you to read text from images https://docs.ropensci.org/tesseract/ it can also be combined with {magick} https://
"Smart glasses offering a combination of sensory substitution based 'raw' vision and AI-based scene description and OCR appears to be technically and economically the most feasible and sustainable way toward meeting expectations, needs and interests of many blind people." https://www.artificialvision…
Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices
Parshva Dhilankumar Patel
https://arxiv.org/abs/2507.07029 https://…
FormGym: Doing Paperwork with Agents
Matthew Toles, Rattandeep Singh, Isaac Song Zhou Yu
https://arxiv.org/abs/2506.14079 https://arx…
02:31—Good Night #Fediverse! 😴💤
I just made myself a new banner for myself, using (mostly) open source software. The only exception was that I used a random website to add an outline to my Rudolf Rocker image, I was too lazy to get out of bed and use GIMP on my PC for that step. For everything else, I used this tool from GitHub:
Finally found a great ad-free and tracking-free #OpenSource document scanner for iOS, with OCR and multi-page PDF output:
https://openscanner.app/
Comparing OCR Pipelines for Folkloristic Text Digitization
Octavian M. Machidon, Alina L. Machidon
https://arxiv.org/abs/2507.19092 https://arxiv.org/pdf/2…
Uncertainty-Aware Complex Scientific Table Data Extraction
Kehinde Ajayi, Yi He, Jian Wu
https://arxiv.org/abs/2507.02009 https://arx…
Bei dem ganzen KI-Gedöns würde ich ja denken, die perfekten und freien, sparsamen Modelle für Autokorrektur und OCR-Erkennung sollte da sein. So als quasi Kernkompetenz von LLMs. Aber hören und lesen tu ich hauptsächlich in Richtung "Chat"-Nutzung.
Kennt wer gute OCR-Tools, die auch Altgriechisch können?
Trump Admin Issues Finding That Harvard Permitted Antisemitism in Violation of Civil Rights Law (The Harvard Crimson)
https://www.thecrimson.com/article/2025/6/30/harvard-hhs-ocr-antisemitism-finding/
http://www.memeorandum.com/250630/p79#a250630p79
ChatGPT: "Smart glasses combining AI scene understanding, OCR, and sensory substitution are decisive market disruptors" https://chatgpt.com/share/68349bdd-b670-8004-8788-893ca3f98ae5
"They render visual implants:
- Less competitive i…
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
Senqiao Yang, Junyi Li, Xin Lai, Bei Yu, Hengshuang Zhao, Jiaya Jia
https://arxiv.org/abs/2507.13348
Email as the Interface to Generative AI Models: Seamless Administrative Automation
Andres Navarro, Carlos de Quinto, Jos\'e Alberto Hern\'andez
https://arxiv.org/abs/2506.23850
Real-Time AI-Driven Pipeline for Automated Medical Study Content Generation in Low-Resource Settings: A Kenyan Case Study
Emmanuel Korir, Eugene Wechuli
https://arxiv.org/abs/2507.05212
To Claude: Explain how smart glasses with AI-based scene description, OCR for reading print, and visual-to-auditory sensory substitution for "raw vision" will erode the market opportunities for all implantable visual prostheses (retinal implants and brain implants for restoring vision to the blind). https://
Evaluating VisualRAG: Quantifying Cross-Modal Performance in Enterprise Document Understanding
Varun Mannam, Fang Wang, Xin Chen
https://arxiv.org/abs/2506.21604