Tootfinder

Opt-in global Mastodon full text search. Join the index!

@kurtsh@mastodon.social
2025-09-23 04:25:54

xAI workers said they've encountered NSFW content, including AI-generated child sexual abuse material from Grok.
☑️ Musk's AI Tutors Describe 'Disgusting' Content Moderation Job - Business Insider
businessinsider.com/elon-musk-

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:57:21

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing
Yuhang Dai, Ziyu Zhang, Shuai Wang, Longhao Li, Zhao Guo, Tianlun Zuo, Shuiyuan Wang, Hongfei Xue, Chengyou Wang, Qing Wang, Xin Xu, Hui Bu, Jie Li, Jian Kang, Binbin Zhang, Lei Xie
arxiv.org/abs/2509.18004

@ErikJonker@mastodon.social
2025-11-22 10:29:01

Nice annotation of the US-Russia "peace plan" with regard to Ukraine.
samf.substack.com/p/the-witkof

@arXiv_csCV_bot@mastoxiv.page
2025-09-23 13:05:51

Depth Edge Alignment Loss: DEALing with Depth in Weakly Supervised Semantic Segmentation
Patrick Schmidt, Vasileios Belagiannis, Lazaros Nalpantidis
arxiv.org/abs/2509.17702

@arXiv_eessIV_bot@mastoxiv.page
2025-09-23 08:26:30

A Chain-of-thought Reasoning Breast Ultrasound Dataset Covering All Histopathology Categories
Haojun Yu, Youcheng Li, Zihan Niu, Nan Zhang, Xuantong Gong, Huan Li, Zhiying Zou, Haifeng Qi, Zhenxiao Cao, Zijie Lan, Xingjian Yuan, Jiating He, Haokai Zhang, Shengtao Zhang, Zicheng Wang, Dong Wang, Ziwei Zhao, Congying Chen, Yong Wang, Wangyan Qin, Qingli Zhu

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:52:10

Make Every Letter Count: Building Dialect Variation Dictionaries from Monolingual Corpora
Robert Litschko, Verena Blaschke, Diana Burkhardt, Barbara Plank, Diego Frassinelli
arxiv.org/abs/2509.17855

@datascience@genomic.social
2025-10-17 10:00:01

{annotater}: Annotate package load calls, so we can have an idea of the overall purpose of the libraries we’re loading: #rstats

@arXiv_csIR_bot@mastoxiv.page
2025-10-09 07:39:40

Can We Hide Machines in the Crowd? Quantifying Equivalence in LLM-in-the-loop Annotation Tasks
Jiaman He, Zikang Leng, Dana McKay, Damiano Spina, Johanne R. Trippas
arxiv.org/abs/2510.06658

@nfdi4culture@nfdi.social
2025-11-04 12:48:39

📣 Wir möchten sehr herzlich zum Workshop "Digitale Annotation von Grafiken mit Antelope" einladen! In dem Kurs werden die Funktionalitäten von Antelope anhand von Daten aus GESAH vorgestellt und es wird erläutert, wie diese für die eigene Forschung genutzt werden können.
📆 Der Kurs findet am Mittwoch, den 12.11., von 10–12 Uhr statt.
📥Anmeldungen sind noch bis morgen, 5.11. möglich!

Sarkophag und antike Friese (Piranesi, Antichità Romane, Bd. III, Taf. LII),
"Graphical illustration", CC 1.0, Bildquelle: https://sah.tib.eu/individual/526404905bee4b328227cf522cbac6aa
@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:35:21

A large-scale, unsupervised pipeline for automatic corpus annotation using LLMs: variation and change in the English consider construction
Cameron Morin, Matti Marttinen Larsson
arxiv.org/abs/2510.12306

@arXiv_csRO_bot@mastoxiv.page
2025-09-30 13:16:11

Annotation-Free One-Shot Imitation Learning for Multi-Step Manipulation Tasks
Vijja Wichitwechkarn, Emlyn Williams, Charles Fox, Ruchi Choudhary
arxiv.org/abs/2509.24972

@sauer_lauwarm@mastodon.social
2025-12-14 10:21:47

*nochmalskicher*
instagram.com/reel/DSKoD3wiAF6

ISTB University of Vienna on Instagram: "We are deeply honoured and delighted that the South Asian, Tibetan, and Buddhist Studies Library has been selected as one of the distinguished institutions to receive the eighty-volume commemorative edition of the Tipitaka, published in Thailand in 2016 to mark the seventieth anniversary of His Majesty King Bhumibol’s accession to the throne. The Thai monarchy has long upheld a well-established tradition of commissioning, presenting, and receiving editions of the Pali Canon. In 1893, King Rama V commissioned the first printed edition of the Tipitaka in Thailand, which was subsequently presented as a gift to institutions in more than twenty-five countries. The 40-volume “King Bhumibol Edition” allows monks and Buddhist laity worldwide to chant the Tipiṭaka in a consistent, rule-based manner. It is accompanied by the 40-volume “Queen Sirikit Edition” which reproduces King Rama V’s use of Syām-Pāli annotation with additional notes. The ISTB library provides an ideal home for this new edition of the Tipitaka. With a collection of more than 70,000 volumes in over ninety Asian languages, it serves as a vital centre for research and teaching in South Asian, Tibetan, and Buddhist Studies at the University of Vienna."
20 likes, 0 comments - istb_univienna on December 12, 2025: "We are deeply honoured and delighted that the South Asian, Tibetan, and Buddhist Studies Library has been selected as one of the distinguished institutions to receive the eighty-volume commemorative edition of the Tipitaka, published in Thailand in 2016 to mark the seventieth anniversary of His Majesty King Bhumibol’s accession to the throne. The Thai monarchy has long upheld a well-established tradition of commissioning, presentin…

@Techmeme@techhub.social
2025-12-05 00:01:54

Micro1, which helps AI labs find experts for data annotation, says it has crossed $100M in annualized revenue and fielded investment offers at a $2.5B valuation (Anna Tong/Forbes)
forbes.com/sites/annatong/2025

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:36:11

Uncertainty-Guided Expert-AI Collaboration for Efficient Soil Horizon Annotation
Teodor Chiaburu, Vipin Singh, Frank Hau{\ss}er, Felix Bie{\ss}mann
arxiv.org/abs/2509.24873

@arXiv_csHC_bot@mastoxiv.page
2025-10-15 08:31:32

A Longitudinal Study on Different Annotator Feedback Loops in Complex RAG Tasks
Sara Rosenthal, Maeda Hanafi, Yannis Katsis, Lucian Popa, Marina Danilevsky
arxiv.org/abs/2510.11897

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:40:51

BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Tomas Ruiz, Siyao Peng, Barbara Plank, Carsten Schwemmer
arxiv.org/abs/2510.12516

@arXiv_csIR_bot@mastoxiv.page
2025-10-09 07:34:50

LLM-Powered Nuanced Video Attribute Annotation for Enhanced Recommendations
Boyuan Long, Yueqi Wang, Hiloni Mehta, Mick Zomnir, Omkar Pathak, Changping Meng, Ruolin Jia, Yajun Peng, Dapeng Hong, Xia Wu, Mingyan Gao, Onkar Dalal, Ningren Han
arxiv.org/abs/2510.06657

@arXiv_csSE_bot@mastoxiv.page
2025-10-01 10:33:57

Explainable Fault Localization for Programming Assignments via LLM-Guided Annotation
Fang Liu, Tianze Wang, Li Zhang, Zheyu Yang, Jing Jiang, Zian Sun
arxiv.org/abs/2509.25676

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 13:44:58

SNAP: Towards Segmenting Anything in Any Point Cloud
Aniket Gupta, Hanhui Wang, Charles Saunders, Aruni RoyChowdhury, Hanumant Singh, Huaizu Jiang
arxiv.org/abs/2510.11565

@arXiv_csCR_bot@mastoxiv.page
2025-10-08 10:05:39

PhishSSL: Self-Supervised Contrastive Learning for Phishing Website Detection
Wenhao Li, Selvakumar Manickam, Yung-Wey Chong, Shankar Karuppayah, Priyadarsi Nanda, Binyong Li
arxiv.org/abs/2510.05900

@arXiv_statME_bot@mastoxiv.page
2025-09-25 09:44:42

Transfer Learning in Regression with Influential Points
Bingbing Wang, Jiaqi Wang, Yu Tang
arxiv.org/abs/2509.20272 arxiv.org/pdf/2509.2027…

@awinkler@openbiblio.social
2025-09-24 14:57:46
Content warning:

research questions by Barbara McGillivray from @… at the end of her presentation in Aarhus. She has recently won funding for the project 'Computational Corpus Annotation for Quantitative Analysis of Latin Lexical Semantics' (COALA), cf.

@arXiv_qbioQM_bot@mastoxiv.page
2025-10-10 08:15:19

Decoding the dark proteome: Deep learning-enabled discovery of druggable enzymes in Wuchereria bancrofti
Shawnak Shivakumar, Jefferson Hernandez
arxiv.org/abs/2510.07337

@arXiv_csIR_bot@mastoxiv.page
2025-10-10 08:13:58

Generation and annotation of item usage scenarios in e-commerce using large language models
Madoka Hagiri, Kazushi Okamoto, Koki Karube, Kei Harada, Atsushi Shibata
arxiv.org/abs/2510.07885

@arXiv_csAI_bot@mastoxiv.page
2025-10-01 11:28:27

LMILAtt: A Deep Learning Model for Depression Detection from Social Media Users Enhanced by Multi-Instance Learning Based on Attention Mechanism
Yukun Yang
arxiv.org/abs/2509.26145

@arXiv_csCY_bot@mastoxiv.page
2025-10-03 07:37:31

Discovering Self-Regulated Learning Patterns in Chatbot-Powered Education Environment
Yilin Lyu, Ren Ding
arxiv.org/abs/2510.01275 arxiv.or…

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:09:02

nnFilterMatch: A Unified Semi-Supervised Learning Framework with Uncertainty-Aware Pseudo-Label Filtering for Efficient Medical Segmentation
Yi Yang
arxiv.org/abs/2509.19746

@arXiv_qbioGN_bot@mastoxiv.page
2025-12-10 08:02:21

needLR: Long-read structural variant annotation with population-scale frequency estimation
Jonas A. Gustafson, Jiadong Lin, Evan E. Eichler, Danny E. Miller
arxiv.org/abs/2512.08175

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:24:07

An Annotation Scheme for Factuality and its Application to Parliamentary Proceedings
Gili Goldin, Shira Wigderson, Ella Rabinovich, Shuly Wintner
arxiv.org/abs/2509.26406

@arXiv_eessIV_bot@mastoxiv.page
2025-10-10 08:11:39

Curriculum Learning with Synthetic Data for Enhanced Pulmonary Nodule Detection in Chest Radiographs
Pranav Sambhu, Om Guin, Madhav Sambhu, Jinho Cha
arxiv.org/abs/2510.07681

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 14:53:07

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving
Yongxuan Lyu, Guangfeng Jiang, Hongsi Liu, Jun Liu

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 10:28:47

Clinical Uncertainty Impacts Machine Learning Evaluations
Simone Lionetti, Fabian Gr\"oger, Philippe Gottfrois, Alvaro Gonzalez-Jimenez, Ludovic Amruthalingam, Alexander A. Navarini, Marc Pouly
arxiv.org/abs/2509.22242

@arXiv_csSE_bot@mastoxiv.page
2025-10-02 09:31:10

Which Programming Language and Model Work Best With LLM-as-a-Judge For Code Retrieval?
Lucas Roberts, Denisa Roberts
arxiv.org/abs/2510.00324

@arXiv_csLG_bot@mastoxiv.page
2025-09-25 10:48:32

A HyperGraphMamba-Based Multichannel Adaptive Model for ncRNA Classification
Xin An, Ruijie Li, Qiao Ning, Hui Li, Qian Ma, Shikai Guo
arxiv.org/abs/2509.20240

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:50:47

Autoproof: Automated Segmentation Proofreading for Connectomics
Gary B Huang, William M Katz, Stuart Berg, Louis Scheffer
arxiv.org/abs/2509.26585

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 21:36:53

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[1/9]:
- SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised...
Shuzheng Si, Zefan Cai, Shuang Zeng, Guoqiang Feng, Jiaxing Lin, Baobao Chang

@arXiv_csRO_bot@mastoxiv.page
2025-09-25 10:21:12

LLM Trainer: Automated Robotic Data Generating via Demonstration Augmentation using LLMs
Abraham George, Amir Barati Farimani
arxiv.org/abs/2509.20070

@arXiv_eessAS_bot@mastoxiv.page
2025-09-24 09:51:34

Training Flow Matching Models with Reliable Labels via Self-Purification
Hyeongju Kim, Yechan Yu, June Young Yi, Juheon Lee
arxiv.org/abs/2509.19091

@arXiv_csIR_bot@mastoxiv.page
2025-10-09 07:42:40

Crossing Domains without Labels: Distant Supervision for Term Extraction
Elena Senger, Yuri Campbell, Rob van der Goot, Barbara Plank
arxiv.org/abs/2510.06838

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:35:00

Active Model Selection for Large Language Models
Yavuz Durmazkeser, Patrik Okanovic, Andreas Kirsch, Torsten Hoefler, Nezihe Merve G\"urel
arxiv.org/abs/2510.09418

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:34:52

Table Detection with Active Learning
Somraj Gautam, Nachiketa Purohit, Gaurav Harit
arxiv.org/abs/2509.20003 arxiv.org/pdf/2509.20003

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:22:29

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
Tajamul Ashraf, Umair Nawaz, Abdelrahman M. Shaker, Rao Anwer, Philip Torr, Fahad Shahbaz Khan, Salman Khan
arxiv.org/abs/2510.08567

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:19:41

PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
Shangjian Yin, Shining Liang, Wenbiao Ding, Yuli Qian, Zhouxing Shi, Hongzhi Li, Yutao Xie
arxiv.org/abs/2510.06670

@arXiv_qbioGN_bot@mastoxiv.page
2025-10-01 08:29:37

scUnified: An AI-Ready Standardized Resource for Single-Cell RNA Sequencing Analysis
Ping Xu, Zaitian Wang, Zhirui Wang, Pengjiang Li, Ran Zhang, Gaoyang Li, Hanyu Xie, Jiajia Wang, Yuanchun Zhou, Pengfei Wang
arxiv.org/abs/2509.25884

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:48:34

Human-Annotated NER Dataset for the Kyrgyz Language
Timur Turatali, Anton Alekseev, Gulira Jumalieva, Gulnara Kabaeva, Sergey Nikolenko
arxiv.org/abs/2509.19109

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 12:43:02

Unsupervised Active Learning via Natural Feature Progressive Framework
Yuxi Liu, Catherine Lalman, Yimin Yang
arxiv.org/abs/2510.04939 arxi…

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 10:55:01

PAL-Net: A Point-Wise CNN with Patch-Attention for 3D Facial Landmark Localization
Ali Shadman Yazdi, Annalisa Cappella, Benedetta Baldini, Riccardo Solazzo, Gianluca Tartaglia, Chiarella Sforza, Giuseppe Baselli
arxiv.org/abs/2510.00910

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 09:45:11

Discrete Facial Encoding: : A Framework for Data-driven Facial Display Discovery
Minh Tran, Maksim Siniukov, Zhangyu Jin, Mohammad Soleymani
arxiv.org/abs/2510.01662

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:03:41

Metaphor identification using large language models: A comparison of RAG, prompt engineering, and fine-tuning
Matteo Fuoli, Weihang Huang, Jeannette Littlemore, Sarah Turner, Ellen Wilding
arxiv.org/abs/2509.24866

@arXiv_csCL_bot@mastoxiv.page
2025-09-29 11:14:17

The InviTE Corpus: Annotating Invectives in Tudor English Texts for Computational Modeling
Sophie Spliethoff, Sanne Hoeken, Silke Schwandt, Sina Zarrie{\ss}, \"Ozge Ala\c{c}am
arxiv.org/abs/2509.22345

@arXiv_csCL_bot@mastoxiv.page
2025-09-29 11:23:37

Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs
Yehonatan Pesiakhovsky, Zorik Gekhman, Yosi Mass, Liat Ein-Dor, Roi Reichart
arxiv.org/abs/2509.22582

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:36:52

SwissGPC v1.0 -- The Swiss German Podcasts Corpus
Samuel Stucki, Mark Cieliebak, Jan Deriu
arxiv.org/abs/2509.19866 arxiv.org/pdf/2509.1986…