Tootfinder

Opt-in global Mastodon full text search. Join the index!

@axbom@axbom.me
2025-10-29 12:27:59
@… Ja det där underlättar en del. Fick mig att minnas “kreditkortet” med genomskinliga rutor för att lättare avskilja delar av OCR-numret när man skrev av det. Och så hittade jag den här manicken! 😄

https://www.smartasaker.se/sv/ocr-lasare
@heiseonline@social.heise.de
2025-10-21 16:02:00

DeepSeek-OCR: Wie Bilder Chatbots helfen, lange Gespräche zu führen
Chinesische KI-Forscher wollen Chatbots mit Bildern bei langen Kontexten schnell und günstig halten. Optische Kontextkompression soll KI-Assistenten verbessern.

@Techmeme@techhub.social
2025-10-20 18:15:46

DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute (Jonathan Kemper/The Decoder)
the-decoder.com/deepseeks-ocr-

@mela@zusammenkunft.net
2025-08-27 01:00:26

Gibt's eine brauchbare Scanner-App für Android, ohne Abo? Braucht kein OCR, nur gute mehrseitige Scans2PDF.

@arXiv_csCV_bot@mastoxiv.page
2025-08-21 10:04:30

Improving OCR using internal document redundancy
Diego Belzarena, Seginus Mowlavi, Aitor Artola, Camilo Mari\~no, Marina Gardella, Ignacio Ram\'irez, Antoine Tadros, Roy He, Natalia Bottaioli, Boshra Rajaei, Gregory Randall, Jean-Michel Morel
arxiv.org/abs/2508.14557

@awinkler@openbiblio.social
2025-09-24 15:05:51
Content warning:

@… at "Digital Neo-Latin studies: ideas and perspectives" on efficient #OCR Post-Correction.
#neolatin

@arXiv_csIR_bot@mastoxiv.page
2025-08-27 08:26:03

Extracting Information from Scientific Literature via Visual Table Question Answering Models
Dongyoun Kim, Hyung-do Choi, Youngsun Jang, John Kim
arxiv.org/abs/2508.18661

@mgorny@social.treehouse.systems
2025-08-14 19:06:21

Paperwork does OCR on everything I scan. I've just scanned a document with my signature on it. It OCR-ed the signature (which is literally a scrawl on "Michał Górny") as "NBA".

@arXiv_csDL_bot@mastoxiv.page
2025-09-17 07:56:49

Layout-Aware OCR for Black Digital Archives with Unsupervised Evaluation
Fitsum Sileshi Beyene, Christopher L. Dancy
arxiv.org/abs/2509.13236

@arXiv_csCL_bot@mastoxiv.page
2025-09-05 09:41:51

E-ARMOR: Edge case Assessment and Review of Multilingual Optical Character Recognition
Aryan Gupta, Anupam Purwar
arxiv.org/abs/2509.03615

@mia@hcommons.social
2025-09-19 14:22:23

Some nice examples in the 'use cases' section of AI for Humanists aiforhumanists.com/guides/usec - from OCR to annotation to identifying voices and styles

@Techmeme@techhub.social
2025-10-15 03:20:53

Reducto, which uses OCR with vision language models to convert complex documents into inputs for LLMs, raised a $75M Series B led by a16z at a $600M valuation (Stephanie Palazzolo/The Information)
theinformation.com/articles/st

@datascience@genomic.social
2025-10-07 10:00:01

{tesseract} allows you to read text from images docs.ropensci.org/tesseract/ it can also be combined with {magick} ropen…

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:16:42

Logics-Parsing Technical Report
Xiangyang Chen, Shuzhao Li, Xiuwen Zhu, Yongfan Chen, Fan Yang, Cheng Fang, Lin Qu, Xiaoxiao Xu, Hu Wei, Minggang Wu
arxiv.org/abs/2509.19760

@iam_jfnklstrm@social.linux.pizza
2025-10-13 07:13:36

Hjärnblödning på skatteverket. Betalade in skatt, men råkade slå fel ocr, ringde och de skulle fixa. Nu fick jag beslut om utmätning från fogden trots att skatten betaldes för flera månader sedan.

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:54:10

SiDiaC: Sinhala Diachronic Corpus
Nevidu Jayatilleke, Nisansa de Silva
arxiv.org/abs/2509.17912 arxiv.org/pdf/2509.17912

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:22:21

Evaluating LLMs for Historical Document OCR: A Methodological Framework for Digital Humanities
Maria Levchenko
arxiv.org/abs/2510.06743 arx…

@arXiv_csCY_bot@mastoxiv.page
2025-09-04 08:22:51

Integrating Generative AI into Cybersecurity Education: A Study of OCR and Multimodal LLM-assisted Instruction
Karan Patel, Yu-Zheng Lin, Gaurangi Raul, Bono Po-Jen Shih, Matthew W. Redondo, Banafsheh Saber Latibari, Jesus Pacheco, Soheil Salehi, Pratik Satam
arxiv.org/abs/2509.02998

@toxi@mastodon.thi.ng
2025-08-04 15:27:23

Finally found a great ad-free and tracking-free #OpenSource document scanner for iOS, with OCR and multi-page PDF output:
openscanner.app/

@grumpybozo@toad.social
2025-09-03 14:49:04

33k one-page TIFFs is an OCR challenge, but it's not insurmountable. fed.brid.gy/r/https://bsky.app

@arXiv_csCL_bot@mastoxiv.page
2025-09-15 09:43:41

Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge Reasoning
Haiyang Yu, Yuchuan Wu, Fan Shi, Lei Liao, Jinghui Lu, Xiaodong Ge, Han Wang, Minghan Zhuo, Xuecheng Wu, Xiang Fei, Hao Feng, Guozhi Tang, An-Lan Wang, Hanshen Zhu, Yangfan He, Quanhuan Liang, Liyuan Meng, Chao Feng, Can Huang, Jingqun Tang, Bin Li

@nelson@tech.lgbt
2025-08-30 01:35:07

One of my most useful tools these days are things that take screenshots. Greenshot, a Windows tool with excellent usability. And Powertools Text Extractor which lets me OCR bits of text on the screen. Usability is important here: press one button and stuff is copied to clipboard.

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 09:52:02

Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Shashank Vempati, Nishit Anand, Gaurav Talebailkar, Arpan Garai, Chetan Arora
arxiv.org/abs/2508.21693

@arXiv_csCV_bot@mastoxiv.page
2025-09-15 09:58:31

VARCO-VISION-2.0 Technical Report
Young-rok Cha, Jeongho Ju, SunYoung Park, Jong-Hyeon Lee, Younghyun Yu, Youngjune Kim
arxiv.org/abs/2509.10105

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:03:49

Detecting Legend Items on Historical Maps Using GPT-4o with In-Context Learning
Sofia Kirsanova, Yao-Yi Chiang, Weiwei Duan
arxiv.org/abs/2510.08385