Tootfinder

Opt-in global Mastodon full text search. Join the index!

@mia@hcommons.social
2025-09-19 14:22:23

Some nice examples in the 'use cases' section of AI for Humanists aiforhumanists.com/guides/usec - from OCR to annotation to identifying voices and styles

@arXiv_csDL_bot@mastoxiv.page
2025-09-17 07:56:49

Layout-Aware OCR for Black Digital Archives with Unsupervised Evaluation
Fitsum Sileshi Beyene, Christopher L. Dancy
arxiv.org/abs/2509.13236

@mgorny@social.treehouse.systems
2025-08-14 19:06:21

Paperwork does OCR on everything I scan. I've just scanned a document with my signature on it. It OCR-ed the signature (which is literally a scrawl on "Michał Górny") as "NBA".

@avstockhausen@fedihum.org
2025-07-09 15:35:02

Bookmarked: calfa-co/hye-tesseract: Open OCR model for Armenian #Armenisch_OCR_Tesseract

@arXiv_csCV_bot@mastoxiv.page
2025-07-18 10:22:32

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
Senqiao Yang, Junyi Li, Xin Lai, Bei Yu, Hengshuang Zhao, Jiaya Jia
arxiv.org/abs/2507.13348

@arXiv_csCL_bot@mastoxiv.page
2025-09-15 09:43:41

Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge Reasoning
Haiyang Yu, Yuchuan Wu, Fan Shi, Lei Liao, Jinghui Lu, Xiaodong Ge, Han Wang, Minghan Zhuo, Xuecheng Wu, Xiang Fei, Hao Feng, Guozhi Tang, An-Lan Wang, Hanshen Zhu, Yangfan He, Quanhuan Liang, Liyuan Meng, Chao Feng, Can Huang, Jingqun Tang, Bin Li

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 10:17:11

Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices
Parshva Dhilankumar Patel
arxiv.org/abs/2507.07029

@arXiv_csCL_bot@mastoxiv.page
2025-09-05 09:41:51

E-ARMOR: Edge case Assessment and Review of Multilingual Optical Character Recognition
Aryan Gupta, Anupam Purwar
arxiv.org/abs/2509.03615

@arXiv_csCY_bot@mastoxiv.page
2025-09-04 08:22:51

Integrating Generative AI into Cybersecurity Education: A Study of OCR and Multimodal LLM-assisted Instruction
Karan Patel, Yu-Zheng Lin, Gaurangi Raul, Bono Po-Jen Shih, Matthew W. Redondo, Banafsheh Saber Latibari, Jesus Pacheco, Soheil Salehi, Pratik Satam
arxiv.org/abs/2509.02998

@toxi@mastodon.thi.ng
2025-08-04 15:27:23

Finally found a great ad-free and tracking-free #OpenSource document scanner for iOS, with OCR and multi-page PDF output:
openscanner.app/

@arXiv_csIR_bot@mastoxiv.page
2025-07-04 07:35:01

Uncertainty-Aware Complex Scientific Table Data Extraction
Kehinde Ajayi, Yi He, Jian Wu
arxiv.org/abs/2507.02009 arx…

@grumpybozo@toad.social
2025-09-03 14:49:04

33k one-page TIFFs is an OCR challenge, but it's not insurmountable. fed.brid.gy/r/https://bsky.app

@vform@openbiblio.social
2025-07-05 12:36:50

Bei dem ganzen KI-Gedöns würde ich ja denken, die perfekten und freien, sparsamen Modelle für Autokorrektur und OCR-Erkennung sollte da sein. So als quasi Kernkompetenz von LLMs. Aber hören und lesen tu ich hauptsächlich in Richtung "Chat"-Nutzung.

@michabbb@social.vivaldi.net
2025-07-22 20:15:55

#MistralAI Document #AI: Advanced #OCR solution for complex document processing 📄
📺

@mela@zusammenkunft.net
2025-08-27 01:00:26

Gibt's eine brauchbare Scanner-App für Android, ohne Abo? Braucht kein OCR, nur gute mehrseitige Scans2PDF.

@arXiv_csCV_bot@mastoxiv.page
2025-09-15 09:58:31

VARCO-VISION-2.0 Technical Report
Young-rok Cha, Jeongho Ju, SunYoung Park, Jong-Hyeon Lee, Younghyun Yu, Youngjune Kim
arxiv.org/abs/2509.10105

@nelson@tech.lgbt
2025-08-30 01:35:07

One of my most useful tools these days are things that take screenshots. Greenshot, a Windows tool with excellent usability. And Powertools Text Extractor which lets me OCR bits of text on the screen. Usability is important here: press one button and stuff is copied to clipboard.

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 09:55:02

Zero-shot OCR Accuracy of Low-Resourced Languages: A Comparative Analysis on Sinhala and Tamil
Nevidu Jayatilleke, Nisansa de Silva
arxiv.org/abs/2507.18264

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 09:52:02

Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Shashank Vempati, Nishit Anand, Gaurav Talebailkar, Arpan Garai, Chetan Arora
arxiv.org/abs/2508.21693

@arXiv_csDL_bot@mastoxiv.page
2025-07-28 07:59:01

Comparing OCR Pipelines for Folkloristic Text Digitization
Octavian M. Machidon, Alina L. Machidon
arxiv.org/abs/2507.19092 arxiv.org/pdf/2…

@arXiv_csHC_bot@mastoxiv.page
2025-07-01 11:05:23

Email as the Interface to Generative AI Models: Seamless Administrative Automation
Andres Navarro, Carlos de Quinto, Jos\'e Alberto Hern\'andez
arxiv.org/abs/2506.23850

@arXiv_csCY_bot@mastoxiv.page
2025-07-08 11:48:31

Real-Time AI-Driven Pipeline for Automated Medical Study Content Generation in Low-Resource Settings: A Kenyan Case Study
Emmanuel Korir, Eugene Wechuli
arxiv.org/abs/2507.05212

@arXiv_csCV_bot@mastoxiv.page
2025-08-21 10:04:30

Improving OCR using internal document redundancy
Diego Belzarena, Seginus Mowlavi, Aitor Artola, Camilo Mari\~no, Marina Gardella, Ignacio Ram\'irez, Antoine Tadros, Roy He, Natalia Bottaioli, Boshra Rajaei, Gregory Randall, Jean-Michel Morel
arxiv.org/abs/2508.14557

@michabbb@social.vivaldi.net
2025-07-22 20:15:56

🔄 Significantly improves #RAG pipeline performance by creating context-rich, high-quality text from documents that enhances #AI application accuracy
💼 Addresses critical business challenges where traditional #OCR

@arXiv_csIR_bot@mastoxiv.page
2025-06-30 09:15:30

Evaluating VisualRAG: Quantifying Cross-Modal Performance in Enterprise Document Understanding
Varun Mannam, Fang Wang, Xin Chen
arxiv.org/abs/2506.21604

@arXiv_csIR_bot@mastoxiv.page
2025-08-27 08:26:03

Extracting Information from Scientific Literature via Visual Table Question Answering Models
Dongyoun Kim, Hyung-do Choi, Youngsun Jang, John Kim
arxiv.org/abs/2508.18661