Tootfinder

Opt-in global Mastodon full-text search.

No exact results. Similar results found.
@arXiv_csCL_bot@mastoxiv.page
2025-09-09 11:47:22

Understanding the Influence of Synthetic Data for Text Embedders
Jacob Mitchell Springer, Vaibhav Adlakha, Siva Reddy, Aditi Raghunathan, Marius Mosbach
arxiv.org/abs/2509.06184

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 09:40:40

SynGen-Vision: Synthetic Data Generation for training industrial vision models
Alpana Dubey, Suma Mani Kuriakose, Nitish Bhardwaj
arxiv.org/abs/2509.04894

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 10:25:39

Towards Label-Free Biological Reasoning Synthetic Dataset Creation via Uncertainty Filtering
Josefa Lia Stoisser, Lawrence Phillips, Aditya Misra, Tom A. Lamb, Philip Torr, Marc Boubnovski Martell, Julien Fauqueur, Kaspar Märtens
arxiv.org/abs/2510.05871

@arXiv_csCR_bot@mastoxiv.page
2025-10-09 09:31:21

Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG)
Junki Mori, Kazuya Kakizaki, Taiki Miyagawa, Jun Sakuma
arxiv.org/abs/2510.06719

@arXiv_eessIV_bot@mastoxiv.page
2025-09-09 07:40:21

A Synthetic-to-Real Dehazing Method based on Domain Unification
Zhiqiang Yuan, Jinchao Zhang, Jie Zhou
arxiv.org/abs/2509.05374 arxiv.org/p…

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:36:09

High-Fidelity Synthetic ECG Generation via Mel-Spectrogram Informed Diffusion Training
Zhuoyi Huang, Nutan Sahoo, Anamika Kumari, Girish Kumar, Kexuan Cai, Shixing Cao, Yue Kang, Tian Xia, Somya Chatterjee, Nicholas Hausman, Aidan Jay, Eric S. Rosenthal, Soundar Srinivasan, Sadid Hasan, Alex Fedorov, Sulaiman Vesal

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:01:41

SDQM: Synthetic Data Quality Metric for Object Detection Dataset Evaluation
Ayush Zenith, Arnold Zumbrun, Neel Raut, Jing Lin
arxiv.org/abs/2510.06596

@arXiv_csCL_bot@mastoxiv.page
2025-09-08 10:11:50

Knowledge Collapse in LLMs: When Fluency Survives but Facts Fail under Recursive Synthetic Training
Figarri Keisha, Zekun Wu, Ze Wang, Adriano Koshiyama, Philip Treleaven
arxiv.org/abs/2509.04796

@arXiv_csCR_bot@mastoxiv.page
2025-09-09 11:30:42

Tell-Tale Watermarks for Explanatory Reasoning in Synthetic Media Forensics
Ching-Chun Chang, Isao Echizen
arxiv.org/abs/2509.05753 arxiv.o…

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:18:11

Aligning Large Language Models via Fully Self-Synthetic Data
Shangjian Yin, Zhepei Wei, Xinyu Zhu, Wei-Lin Chen, Yu Meng
arxiv.org/abs/2510.06652