xAI workers said they've encountered NSFW content, including AI-generated child sexual abuse material from Grok.
☑️ Musk's AI Tutors Describe 'Disgusting' Content Moderation Job - Business Insider
https://www.businessinsider.com/elon-musk-
WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing
Yuhang Dai, Ziyu Zhang, Shuai Wang, Longhao Li, Zhao Guo, Tianlun Zuo, Shuiyuan Wang, Hongfei Xue, Chengyou Wang, Qing Wang, Xin Xu, Hui Bu, Jie Li, Jian Kang, Binbin Zhang, Lei Xie
https://arxiv.org/abs/2509.18004
Depth Edge Alignment Loss: DEALing with Depth in Weakly Supervised Semantic Segmentation
Patrick Schmidt, Vasileios Belagiannis, Lazaros Nalpantidis
https://arxiv.org/abs/2509.17702
A Chain-of-thought Reasoning Breast Ultrasound Dataset Covering All Histopathology Categories
Haojun Yu, Youcheng Li, Zihan Niu, Nan Zhang, Xuantong Gong, Huan Li, Zhiying Zou, Haifeng Qi, Zhenxiao Cao, Zijie Lan, Xingjian Yuan, Jiating He, Haokai Zhang, Shengtao Zhang, Zicheng Wang, Dong Wang, Ziwei Zhao, Congying Chen, Yong Wang, Wangyan Qin, Qingli Zhu
https…
Make Every Letter Count: Building Dialect Variation Dictionaries from Monolingual Corpora
Robert Litschko, Verena Blaschke, Diana Burkhardt, Barbara Plank, Diego Frassinelli
https://arxiv.org/abs/2509.17855
{annotater}: Annotate package load calls, so we can have an idea of the overall purpose of the libraries we’re loading: #rstats
Can We Hide Machines in the Crowd? Quantifying Equivalence in LLM-in-the-loop Annotation Tasks
Jiaman He, Zikang Leng, Dana McKay, Damiano Spina, Johanne R. Trippas
https://arxiv.org/abs/2510.06658
📣 Wir möchten sehr herzlich zum Workshop "Digitale Annotation von Grafiken mit Antelope" einladen! In dem Kurs werden die Funktionalitäten von Antelope anhand von Daten aus GESAH vorgestellt und es wird erläutert, wie diese für die eigene Forschung genutzt werden können.
📆 Der Kurs findet am Mittwoch, den 12.11., von 10–12 Uhr statt.
📥Anmeldungen sind noch bis morgen, 5.11. möglich!
A large-scale, unsupervised pipeline for automatic corpus annotation using LLMs: variation and change in the English consider construction
Cameron Morin, Matti Marttinen Larsson
https://arxiv.org/abs/2510.12306
Annotation-Free One-Shot Imitation Learning for Multi-Step Manipulation Tasks
Vijja Wichitwechkarn, Emlyn Williams, Charles Fox, Ruchi Choudhary
https://arxiv.org/abs/2509.24972
Micro1, which helps AI labs find experts for data annotation, says it has crossed $100M in annualized revenue and fielded investment offers at a $2.5B valuation (Anna Tong/Forbes)
http://www.forbes.com/sites/annatong/2025
Uncertainty-Guided Expert-AI Collaboration for Efficient Soil Horizon Annotation
Teodor Chiaburu, Vipin Singh, Frank Hau{\ss}er, Felix Bie{\ss}mann
https://arxiv.org/abs/2509.24873
A Longitudinal Study on Different Annotator Feedback Loops in Complex RAG Tasks
Sara Rosenthal, Maeda Hanafi, Yannis Katsis, Lucian Popa, Marina Danilevsky
https://arxiv.org/abs/2510.11897
BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Tomas Ruiz, Siyao Peng, Barbara Plank, Carsten Schwemmer
https://arxiv.org/abs/2510.12516
LLM-Powered Nuanced Video Attribute Annotation for Enhanced Recommendations
Boyuan Long, Yueqi Wang, Hiloni Mehta, Mick Zomnir, Omkar Pathak, Changping Meng, Ruolin Jia, Yajun Peng, Dapeng Hong, Xia Wu, Mingyan Gao, Onkar Dalal, Ningren Han
https://arxiv.org/abs/2510.06657
Explainable Fault Localization for Programming Assignments via LLM-Guided Annotation
Fang Liu, Tianze Wang, Li Zhang, Zheyu Yang, Jing Jiang, Zian Sun
https://arxiv.org/abs/2509.25676
SNAP: Towards Segmenting Anything in Any Point Cloud
Aniket Gupta, Hanhui Wang, Charles Saunders, Aruni RoyChowdhury, Hanumant Singh, Huaizu Jiang
https://arxiv.org/abs/2510.11565
PhishSSL: Self-Supervised Contrastive Learning for Phishing Website Detection
Wenhao Li, Selvakumar Manickam, Yung-Wey Chong, Shankar Karuppayah, Priyadarsi Nanda, Binyong Li
https://arxiv.org/abs/2510.05900
Decoding the dark proteome: Deep learning-enabled discovery of druggable enzymes in Wuchereria bancrofti
Shawnak Shivakumar, Jefferson Hernandez
https://arxiv.org/abs/2510.07337
Generation and annotation of item usage scenarios in e-commerce using large language models
Madoka Hagiri, Kazushi Okamoto, Koki Karube, Kei Harada, Atsushi Shibata
https://arxiv.org/abs/2510.07885
LMILAtt: A Deep Learning Model for Depression Detection from Social Media Users Enhanced by Multi-Instance Learning Based on Attention Mechanism
Yukun Yang
https://arxiv.org/abs/2509.26145
needLR: Long-read structural variant annotation with population-scale frequency estimation
Jonas A. Gustafson, Jiadong Lin, Evan E. Eichler, Danny E. Miller
https://arxiv.org/abs/2512.08175
An Annotation Scheme for Factuality and its Application to Parliamentary Proceedings
Gili Goldin, Shira Wigderson, Ella Rabinovich, Shuly Wintner
https://arxiv.org/abs/2509.26406
Curriculum Learning with Synthetic Data for Enhanced Pulmonary Nodule Detection in Chest Radiographs
Pranav Sambhu, Om Guin, Madhav Sambhu, Jinho Cha
https://arxiv.org/abs/2510.07681
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving
Yongxuan Lyu, Guangfeng Jiang, Hongsi Liu, Jun Liu
Clinical Uncertainty Impacts Machine Learning Evaluations
Simone Lionetti, Fabian Gr\"oger, Philippe Gottfrois, Alvaro Gonzalez-Jimenez, Ludovic Amruthalingam, Alexander A. Navarini, Marc Pouly
https://arxiv.org/abs/2509.22242
A HyperGraphMamba-Based Multichannel Adaptive Model for ncRNA Classification
Xin An, Ruijie Li, Qiao Ning, Hui Li, Qian Ma, Shikai Guo
https://arxiv.org/abs/2509.20240 https://
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/9]:
- SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised...
Shuzheng Si, Zefan Cai, Shuang Zeng, Guoqiang Feng, Jiaxing Lin, Baobao Chang
Crossing Domains without Labels: Distant Supervision for Term Extraction
Elena Senger, Yuri Campbell, Rob van der Goot, Barbara Plank
https://arxiv.org/abs/2510.06838 https://…
Active Model Selection for Large Language Models
Yavuz Durmazkeser, Patrik Okanovic, Andreas Kirsch, Torsten Hoefler, Nezihe Merve G\"urel
https://arxiv.org/abs/2510.09418 …
MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
Tajamul Ashraf, Umair Nawaz, Abdelrahman M. Shaker, Rao Anwer, Philip Torr, Fahad Shahbaz Khan, Salman Khan
https://arxiv.org/abs/2510.08567
PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
Shangjian Yin, Shining Liang, Wenbiao Ding, Yuli Qian, Zhouxing Shi, Hongzhi Li, Yutao Xie
https://arxiv.org/abs/2510.06670
scUnified: An AI-Ready Standardized Resource for Single-Cell RNA Sequencing Analysis
Ping Xu, Zaitian Wang, Zhirui Wang, Pengjiang Li, Ran Zhang, Gaoyang Li, Hanyu Xie, Jiajia Wang, Yuanchun Zhou, Pengfei Wang
https://arxiv.org/abs/2509.25884
Human-Annotated NER Dataset for the Kyrgyz Language
Timur Turatali, Anton Alekseev, Gulira Jumalieva, Gulnara Kabaeva, Sergey Nikolenko
https://arxiv.org/abs/2509.19109 https://…
PAL-Net: A Point-Wise CNN with Patch-Attention for 3D Facial Landmark Localization
Ali Shadman Yazdi, Annalisa Cappella, Benedetta Baldini, Riccardo Solazzo, Gianluca Tartaglia, Chiarella Sforza, Giuseppe Baselli
https://arxiv.org/abs/2510.00910
Discrete Facial Encoding: : A Framework for Data-driven Facial Display Discovery
Minh Tran, Maksim Siniukov, Zhangyu Jin, Mohammad Soleymani
https://arxiv.org/abs/2510.01662 htt…
Metaphor identification using large language models: A comparison of RAG, prompt engineering, and fine-tuning
Matteo Fuoli, Weihang Huang, Jeannette Littlemore, Sarah Turner, Ellen Wilding
https://arxiv.org/abs/2509.24866
The InviTE Corpus: Annotating Invectives in Tudor English Texts for Computational Modeling
Sophie Spliethoff, Sanne Hoeken, Silke Schwandt, Sina Zarrie{\ss}, \"Ozge Ala\c{c}am
https://arxiv.org/abs/2509.22345
Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs
Yehonatan Pesiakhovsky, Zorik Gekhman, Yosi Mass, Liat Ein-Dor, Roi Reichart
https://arxiv.org/abs/2509.22582