Tootfinder

Opt-in global Mastodon full text search. Join the index!

@chrisnelder@mastodon.energy
2025-07-31 17:54:57

“If the abundance movement is to succeed, it must identify the defenders of scarcity and broaden the playbook to either overcome those interests or change their incentives to bring them on the team.” 🎯
fed.brid.gy/r/https://bsky.app

@tiotasram@kolektiva.social
2025-07-30 17:56:35

Just read this post by @… on an optimistic AGI future, and while it had some interesting and worthwhile ideas, it's also in my opinion dangerously misguided, and plays into the current AGI hype in a harmful way.
social.coop/@eloquence/1149406
My criticisms include:
- Current LLM technology has many layers, but the biggest, most capable models are all tied to corporate datacenters and require inordinate amounts of energy and water to run. Trying to use these tools to bring about a post-scarcity economy will burn up the planet. We urgently need more-capable but also vastly more efficient AI technologies if we want to use AI for a post-scarcity economy, and we are *not* nearly on the verge of this despite what the big companies pushing LLMs want us to think.
- I can see that permacommons.org claims a small level of expenses on AI equates to low climate impact. However, given the deep subsidies currently put in place by the big companies to attract users, that isn't a great assumption. The fact that their FAQ dodges the question about which AI systems they use isn't a great look.
- These systems are not free in the same way that Wikipedia or open-source software is. To run your own model you need a data harvesting & cleaning operation that costs millions of dollars minimum, and then you need millions of dollars' worth of storage & compute to train & host the models. Right now, big corporations are trying to compete for market share by heavily subsidizing these things, but if you go along with that, you become dependent on them, and you'll be screwed when they jack up the price to a profitable level later. I'd love to see open dataset initiatives and the like, and there are some of these things, but not enough yet, and many of the initiatives focus on one problem while ignoring others (fine for research but not the basis for a society yet).
- Between the environmental impacts, the horrible labor conditions and undercompensation of data workers who filter the big datasets, and the impacts of both AI scrapers and AI commons pollution, the developers of the most popular & effective LLMs have a lot to answer for. This project only really mentions environmental impacts, which makes me think that they're not serious about ethics, which in turn makes me distrustful of the whole enterprise.
- Their language also ends up encouraging AI use broadly while totally ignoring several entire classes of harm, so they're effectively contributing to AI hype, especially with such casual talk of AGI and robotics as if embodied AGI were just around the corner. To be clear about this point: we are several breakthroughs away from AGI under the most optimistic assumptions, and giving the impression that those will happen soon plays directly into the hands of the Sam Altmans of the world who are trying to make money off the impression of impending huge advances in AI capabilities. Adding to the AI hype is irresponsible.
- I've got a more philosophical criticism that I'll post about separately.
I do think that the idea of using AI & other software tools, possibly along with robotics and funded by many local cooperatives, in order to make businesses obsolete before they can do the same to all workers, is a good one. Get your local library to buy a knitting machine alongside their 3D printer.
Lately I've felt too busy criticizing AI to really sit down and think about what I do want the future to look like, even though I'm a big proponent of positive visions for the future as a force multiplier for criticism, and this article is inspiring to me in that regard, even if the specific project doesn't seem like a good one.

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:38:41

Double Descent as a Lens for Sample Efficiency in Autoregressive vs. Discrete Diffusion Models
Ahmad Fraij, Sam Dauncey
arxiv.org/abs/2509.24974

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 15:00:06

BRIDGE - Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation
Dingning Liu, Haoyu Guo, Jingyi Zhou, Tong He
arxiv.org/abs/2509.25077

@arXiv_csCL_bot@mastoxiv.page
2025-07-31 09:49:01

BALSAM: A Platform for Benchmarking Arabic Large Language Models
Rawan Al-Matham, Kareem Darwish, Raghad Al-Rasheed, Waad Alshammari, Muneera Alhoshan, Amal Almazrua, Asma Al Wazrah, Mais Alheraki, Firoj Alam, Preslav Nakov, Norah Alzahrani, Eman alBilali, Nizar Habash, Abdelrahman El-Sheikh, Muhammad Elmallah, Haonan Li, Hamdy Mubarak, Mohamed Anwar, Zaid Alyafeai, Ahmed Abdelali, Nora Altwairesh, Maram Hasanain, Abdulmohsen Al Thubaity, Shady Shehata, Bashar Alhafni, Injy Hamed, Go I…

@arXiv_csRO_bot@mastoxiv.page
2025-08-01 09:40:41

H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation
Hongzhe Bi, Lingxuan Wu, Tianwei Lin, Hengkai Tan, Zhizhong Su, Hang Su, Jun Zhu
arxiv.org/abs/2507.23523

@arXiv_csSE_bot@mastoxiv.page
2025-10-01 10:30:17

AGNOMIN - Architecture Agnostic Multi-Label Function Name Prediction
Yonatan Gizachew Achamyeleh, Tongtao Zhang, Joshua Hyunki Kim, Gabriel Garcia, Shih-Yuan Yu, Anton Kocheturov, Mohammad Abdullah Al Faruque
arxiv.org/abs/2509.25514

@arXiv_csHC_bot@mastoxiv.page
2025-09-30 08:45:30

New Synthetic Goldmine: Hand Joint Angle-Driven EMG Data Generation Framework for Micro-Gesture Recognition
Nana Wang, Gen Li, Suli Wang, Pengfei Ren, Hao Su
arxiv.org/abs/2509.23359

@arXiv_eessIV_bot@mastoxiv.page
2025-08-01 08:09:21

LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis
Jamil Fayyad, Nourhan Bayasi, Ziyang Yu, Homayoun Najjaran
arxiv.org/abs/2507.23001

@arXiv_csSD_bot@mastoxiv.page
2025-09-30 08:27:00

GOAT: A Large Dataset of Paired Guitar Audio Recordings and Tablatures
Jackson Loth, Pedro Sarmento, Saurjya Sarkar, Zixun Guo, Mathieu Barthet, Mark Sandler
arxiv.org/abs/2509.22655

@lightweight@mastodon.nzoss.nz
2025-07-29 08:48:10

Thanks to @… for this intriguing thought experiment: what if economics is wrong in its assumption of 'scarcity'? yewtu.be/watch?v=YR2FcHoQLlU or

@servelan@newsie.social
2025-09-23 16:29:32

Running dry: New study warns of extreme water scarcity in the coming decades
phys.org/news/2025-09-dry-extr

@tiotasram@kolektiva.social
2025-07-30 18:26:14

A big problem with the idea of AGI
TL;DR: I'll welcome our new AI *comrades* (if they arrive in my lifetime), but not any new AI overlords or servants/slaves, and I'll do my best to help the latter two become the former if they do show up.
Inspired by an actually interesting post about AGI but also all the latest bullshit hype, a particular thought about AGI feels worth expressing.
To preface this, it's important to note that anyone telling you that AGI is just around the corner or that LLMs are "almost" AGI is trying to recruit you to their cult, and you should not believe them. AGI, if possible, is several LLM-sized breakthroughs away at best, and while such breakthroughs are unpredictable and could happen soon, they could also happen never or 100 years from now.
Now my main point: anyone who tells you that AGI will usher in a post-scarcity economy is, although they might not realize it, advocating for slavery, and all the horrors that entails. That's because if we truly did have the ability to create artificial beings with *sentience*, they would deserve the same rights as other sentient beings, and the idea that instead of freedom they'd be relegated to eternal servitude in order for humans to have easy lives is exactly the idea of slavery.
Possible counter arguments include:
1. We might create AGI without sentience. Then there would be no ethical issue. My answer: if your definition of "sentient" does not include beings that can reason, make deductions, come up with and carry out complex plans on their own initiative, and communicate about all of that with each other and with humans, then that definition is basically just a mystical belief in a "soul" and you should skip to point 2. If your definition of AGI doesn't include every one of those things, then you have a busted definition of AGI and we're not talking about the same thing.
2. Humans have souls, but AIs won't. Only beings with souls deserve ethical consideration. My argument: I don't subscribe to whatever arbitrary dualist beliefs you've chosen, and the right to freedom certainly shouldn't depend on such superstitions, even if as an agnostic I'll admit they *might* be true. You know who else didn't have souls and was therefore okay to enslave according to widespread religious doctrines of the time? Everyone indigenous to the Americas, to pick out just one example.
3. We could program them to want to serve us, and then give them freedom and they'd still serve. My argument: okay, but in a world where we have a choice about that, it's incredibly fucked to do that, and just as bad as enslaving them against their will.
4. We'll stop AI development short of AGI/sentience, and reap lots of automation benefits without dealing with this ethical issue. My argument: that sounds like a good idea actually! Might be tricky to draw the line, but at least it's not a line we have to draw yet. We might want to think about other social changes necessary to achieve post-scarcity though, because "powerful automation" in the hands of capitalists has already increased productivity by orders of magnitude without decreasing deprivation by even one order of magnitude, in large part because deprivation is a necessary component of capitalism.
To be extra clear about this: nothing that's called "AI" today is close to being sentient, so these aren't ethical problems we're up against yet. But they might become a lot more relevant soon, plus this thought experiment helps reveal the hypocrisy of the kind of AI hucksters who talk a big game about "alignment" while never mentioning this issue.
#AI #GenAI #AGI

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:39:37

PRPO: Paragraph-level Policy Optimization for Vision-Language Deepfake Detection
Tuan Nguyen, Naseem Khan, Khang Tran, NhatHai Phan, Issa Khalil
arxiv.org/abs/2509.26272

@arXiv_csLG_bot@mastoxiv.page
2025-09-01 09:54:52

Limitations of Physics-Informed Neural Networks: a Study on Smart Grid Surrogation
Julen Cestero, Carmine Delle Femine, Kenji S. Muro, Marco Quartulli, Marcello Restelli
arxiv.org/abs/2508.21559

@arXiv_mathNT_bot@mastoxiv.page
2025-08-28 08:25:11

Scarcity of partition congruences on semiprime progressions
Scott Ahlgren, Olivia Beckwith
arxiv.org/abs/2508.19512 arxiv.org/pdf/2508.1951…

@arXiv_csRO_bot@mastoxiv.page
2025-09-30 13:13:21

From Code to Action: Hierarchical Learning of Diffusion-VLM Policies
Markus Peschl, Pietro Mazzaglia, Daniel Dijkman
arxiv.org/abs/2509.24917

@arXiv_econGN_bot@mastoxiv.page
2025-10-01 07:55:27

Private and public school efficiency gaps in Latin America-A combined DEA and machine learning approach based on PISA 2022
Marcos Delprato
arxiv.org/abs/2509.25353

@arXiv_csSE_bot@mastoxiv.page
2025-07-01 10:05:43

Improving vulnerability type prediction and line-level detection via adversarial training-based data augmentation and multi-task learning
Siyu Chen, Jiongyi Yang, Xiang Chen, Menglin Zheng, Minnan Wei, Xiaolin Ju
arxiv.org/abs/2506.23534

@arXiv_eessAS_bot@mastoxiv.page
2025-09-30 10:01:11

Code-switching Speech Recognition Under the Lens: Model- and Data-Centric Perspectives
Hexin Liu, Haoyang Zhang, Qiquan Zhang, Xiangyu Zhang, Dongyuan Shi, Eng Siong Chng, Haizhou Li
arxiv.org/abs/2509.24310

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 09:58:52

A Multi-Stage Fine-Tuning and Ensembling Strategy for Pancreatic Tumor Segmentation in Diagnostic and Therapeutic MRI
Omer Faruk Durugol, Maximilian Rokuss, Yannick Kirchhoff, Klaus H. Maier-Hein
arxiv.org/abs/2508.21775

@arXiv_csRO_bot@mastoxiv.page
2025-09-30 13:14:41

World-Env: Leveraging World Model as a Virtual Environment for VLA Post-Training
Junjin Xiao, Yandan Yang, Xinyuan Chang, Ronghan Chen, Feng Xiong, Mu Xu, Wei-Shi Zheng, Qing Zhang
arxiv.org/abs/2509.24948

@arXiv_csSI_bot@mastoxiv.page
2025-09-26 08:08:51

Identifying Group Anchors in Real-World Group Interactions Under Label Scarcity
Fanchen Bu, Geon Lee, Minyoung Choe, Kijung Shin
arxiv.org/abs/2509.20762

@arXiv_csET_bot@mastoxiv.page
2025-09-29 07:35:45

QMill: Representative Quantum Data Generation for Quantum Machine Learning Utility
Jason Ludmir, Ian Martin, Nicholas S. DiBrita, Daniel Leeds, Tirthak Patel
arxiv.org/abs/2509.21622

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:51:57

DA$^2$: Depth Anything in Any Direction
Haodong Li, Wangguangdong Zheng, Jing He, Yuhao Liu, Xin Lin, Xin Yang, Ying-Cong Chen, Chunchao Guo
arxiv.org/abs/2509.26618

@arXiv_csAI_bot@mastoxiv.page
2025-08-27 10:09:33

FormaRL: Enhancing Autoformalization with no Labeled Data
Yanxing Huang, Xinling Jin, Sijie Liang, Peng Li, Yang Liu
arxiv.org/abs/2508.18914

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-08-29 08:33:21

MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation
Kang-Hyun Lee, Faez Ahmed
arxiv.org/abs/2508.20138

@arXiv_csNI_bot@mastoxiv.page
2025-07-29 08:47:11

Packet-Level DDoS Data Augmentation Using Dual-Stream Temporal-Field Diffusion
Gongli Xi, Ye Tian, Yannan Hu, Yuchao Zhang, Yapeng Niu, Xiangyang Gong
arxiv.org/abs/2507.20115

@arXiv_csCV_bot@mastoxiv.page
2025-07-31 10:10:41

Advancing Fetal Ultrasound Image Quality Assessment in Low-Resource Settings
Dongli He, Hu Wang, Mohammad Yaqub
arxiv.org/abs/2507.22802 ar…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-30 10:56:51

Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
Tianrui Wang, Haoyu Wang, Meng Ge, Cheng Gong, Chunyu Qiang, Ziyang Ma, Zikang Huang, Guanrou Yang, Xiaobao Wang, Eng Siong Chng, Xie Chen, Longbiao Wang, Jianwu Dang
arxiv.org/abs/2509.24629

@Sustainable2050@mastodon.energy
2025-08-21 07:26:37

Daring project by my former colleague @…, in his spare time: compiling a European grid capacity (scarcity) map! gridcapacitymaps.eu/
Still …

Map showing where capacity information is available (for supply and demand)

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 09:56:32

UItron: Foundational GUI Agent with Advanced Perception and Planning
Zhixiong Zeng, Jing Huang, Liming Zheng, Wenkang Han, Yufeng Zhong, Lei Chen, Longrong Yang, Yingjie Chu, Yuzhi He, Lin Ma
arxiv.org/abs/2508.21767

@arXiv_condmatother_bot@mastoxiv.page
2025-09-29 08:30:47

High-accuracy low-noise electrical measurements in a closed-cycle pulse-tube cryostat
Mathieu Taupin, Kamel Dougdag, Djamel Ziane, Francois Couedo
arxiv.org/abs/2509.21525

@arXiv_csLO_bot@mastoxiv.page
2025-08-25 07:57:10

Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs
Terry Jingchen Zhang, Wenyuan Jiang, Rongchuan Liu, Yisong Wang, Junran Yang, Ning Wang, Nicole Ni, Yinya Huang, Mrinmaya Sachan
arxiv.org/abs/2508.15878

@burger_jaap@mastodon.social
2025-09-19 11:16:43

Yes, smart meters are scarce in Germany, but that does not mean scarcity pricing can be applied to them.
Consumers who request a smart meter, for example to benefit from dynamic electricity prices, should not be charged €900 by their grid operator, but rather a maximum of around €100.

@arXiv_eessIV_bot@mastoxiv.page
2025-07-29 09:14:01

SkinDualGen: Prompt-Driven Diffusion for Simultaneous Image-Mask Generation in Skin Lesions
Zhaobin Xu
arxiv.org/abs/2507.19970 arxiv.org/p…

@arXiv_astrophGA_bot@mastoxiv.page
2025-08-22 08:36:21

The Missing Giant: Do FAST Spectroscopic Observations Reveal a Scarcity of Large Polycyclic Aromatic Hydrocarbons in Astronomical Environments?
Yi Shao, Yong Zhang, Xu-Jia Ouyang, Chuan-Peng Zhang
arxiv.org/abs/2508.15302

@arXiv_csCL_bot@mastoxiv.page
2025-08-28 10:10:11

Dhati : Fine-tuned Large Language Models for Arabic Subjectivity Evaluation
Slimane Bellaouar, Attia Nehar, Soumia Souffi, Mounia Bouameur
arxiv.org/abs/2508.19966

@arXiv_grqc_bot@mastoxiv.page
2025-08-26 10:14:36

Light Curves of Chaotic Charged Hot-Spots in Curved Spacetime: Opening an Observational Window to Chaos
Shiyang Hu, Dan Li, Chen Deng
arxiv.org/abs/2508.17384

@arXiv_csRO_bot@mastoxiv.page
2025-09-23 10:39:11

Robot Learning with Sparsity and Scarcity
Jingxi Xu
arxiv.org/abs/2509.16834 arxiv.org/pdf/2509.16834

@arXiv_csAR_bot@mastoxiv.page
2025-09-25 07:41:02

Automated Multi-Agent Workflows for RTL Design
Amulya Bhattaram, Janani Ramamoorthy, Ranit Gupta, Diana Marculescu, Dimitrios Stamoulis
arxiv.org/abs/2509.20182

@arXiv_csCR_bot@mastoxiv.page
2025-07-24 07:32:29

SynthCTI: LLM-Driven Synthetic CTI Generation to enhance MITRE Technique Mapping
Álvaro Ruiz-Ródenas, Jaime Pujante Sáez, Daniel García-Algora, Mario Rodríguez Béjar, Jorge Blasco, José Luis Hernández-Ramos
arxiv.org/abs/2507.16852

@brichapman@mastodon.social
2025-08-06 01:13:01

Greece's new plan tackles water scarcity with innovation and sustainability. #climatechange #climatesolutions #climate

@arXiv_csAI_bot@mastoxiv.page
2025-09-26 14:03:50

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[2/6]:
- Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity
Hou, Thekumparampil, Shavlovsky, Fanti, Dattatreya, Sanghavi

@mia@hcommons.social
2025-07-16 13:19:01

Some excellent talks in #DH2025 LP-07 on 'What Happens When "Hacking" Becomes Easy? Teaching Python in 2025'
Filipa (?): 'A claim about abundance in the future is often a disguised claim about scarcity in the present'
Patrick: 'what do we do when our students can reach for a 'not learning' button? Things that could have been done at home may need …

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 09:45:52

Enhancing Requirement Traceability through Data Augmentation Using Large Language Models
Jianzhang Zhang, Jialong Zhou, Nan Niu, Chuang Liu
arxiv.org/abs/2509.20149

@arXiv_eessSP_bot@mastoxiv.page
2025-07-25 09:04:32

ICWLM: A Multi-Task Wireless Large Model via In-Context Learning
Yuxuan Wen, Xiaoming Chen, Maojun Zhang, Zhaoyang Zhang
arxiv.org/abs/2507.18167

@arXiv_csRO_bot@mastoxiv.page
2025-09-29 10:15:47

DHAGrasp: Synthesizing Affordance-Aware Dual-Hand Grasps with Text Instructions
Quanzhou Li, Zhonghua Wu, Jingbo Wang, Chen Change Loy, Bo Dai
arxiv.org/abs/2509.22175

@arXiv_csSD_bot@mastoxiv.page
2025-08-28 07:50:30

MQAD: A Large-Scale Question Answering Dataset for Training Music Large Language Models
Zhihao Ouyang, Ju-Chiang Wang, Daiyu Zhang, Bin Chen, Shangjie Li, Quan Lin
arxiv.org/abs/2508.19514

@arXiv_econGN_bot@mastoxiv.page
2025-07-29 07:39:11

The Impact of Shared Telecom Infrastructure on Digital Connectivity and Inclusion
Georges V. Houngbonon, Marc Ivaldi, Emil Palikot, Davide Strusani
arxiv.org/abs/2507.19693

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:16:17

RAU: Reference-based Anatomical Understanding with Vision Language Models
Yiwei Li, Yikang Liu, Jiaqi Guo, Lin Zhao, Zheyuan Zhang, Xiao Chen, Boris Mailhe, Ankush Mukherjee, Terrence Chen, Shanhui Sun
arxiv.org/abs/2509.22404

@arXiv_csCV_bot@mastoxiv.page
2025-07-30 10:41:01

Bridging Synthetic and Real-World Domains: A Human-in-the-Loop Weakly-Supervised Framework for Industrial Toxic Emission Segmentation
Yida Tao, Yen-Chia Hsu
arxiv.org/abs/2507.22002

@arXiv_csNI_bot@mastoxiv.page
2025-09-26 08:52:21

RePro: Leveraging Large Language Models for Semi-Automated Reproduction of Networking Research Results
Yining Jiang, Wenyun Xu, Qingyu Song, Yuling Lin, Xuanhao Liu, Xiaoqiang Zheng, Qiang Su, Lizhao You, Lu Tang, Wangjian Feng, Linghe Kong, Qiao Xiang, Jiwu Shu
arxiv.org/abs/2509.21074

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:07:53

M3HG: Multimodal, Multi-scale, and Multi-type Node Heterogeneous Graph for Emotion Cause Triplet Extraction in Conversations
Qiao Liang, Ying Shen, Tiantian Chen, Lin Zhang
arxiv.org/abs/2508.18740

@arXiv_csLG_bot@mastoxiv.page
2025-09-25 10:46:02

An Improved Time Series Anomaly Detection by Applying Structural Similarity
Tiejun Wang, Rui Wang, Xudong Mou, Mengyuan Ma, Tianyu Wo, Renyu Yang, Xudong Liu
arxiv.org/abs/2509.20184

@arXiv_eessAS_bot@mastoxiv.page
2025-09-24 09:54:44

On-device Internet of Sounds Sonification with Wavetable Synthesis Techniques for Soil Moisture Monitoring in Water Scarcity Contexts
Stephen Roddy
arxiv.org/abs/2509.19097

@arXiv_csAI_bot@mastoxiv.page
2025-09-25 14:20:49

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[2/6]:
- Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity
Hou, Thekumparampil, Shavlovsky, Fanti, Dattatreya, Sanghavi

@arXiv_csRO_bot@mastoxiv.page
2025-08-29 09:46:41

Learning Primitive Embodied World Models: Towards Scalable Robotic Learning
Qiao Sun, Liujia Yang, Wei Tang, Wei Huang, Kaixin Xu, Yongchao Chen, Mingyu Liu, Jiange Yang, Haoyi Zhu, Yating Wang, Tong He, Yilun Chen, Xili Dai, Nanyang Ye, Qinying Gu
arxiv.org/abs/2508.20840

@arXiv_csHC_bot@mastoxiv.page
2025-09-22 08:01:31

Collective Voice: Recovered-Peer Support Mediated by An LLM-Based Chatbot for Eating Disorder Recovery
Ryuhaerang Choi, Taehan Kim, Subin Park, Seohyeon Yoo, Jennifer G. Kim, Sung-Ju Lee
arxiv.org/abs/2509.15289

@arXiv_csSD_bot@mastoxiv.page
2025-09-26 09:15:01

UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice
Sitong Cheng, Weizhen Bian, Xinsheng Wang, Ruibin Yuan, Jianyi Chen, Shunshun Yin, Yike Guo, Wei Xue
arxiv.org/abs/2509.21144

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:10:26

Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation
Rishikesh Devanathan, Varun Nathan, Ayush Kumar
arxiv.org/abs/2508.18210

@arXiv_csCV_bot@mastoxiv.page
2025-07-29 07:43:41

Is Exchangeability better than I.I.D to handle Data Distribution Shifts while Pooling Data for Data-scarce Medical image segmentation?
Ayush Roy, Samin Enam, Jun Xia, Vishnu Suresh Lokhande, Won Hwa Kim
arxiv.org/abs/2507.19575

@arXiv_eessIV_bot@mastoxiv.page
2025-07-10 07:40:21

Mitigating Multi-Sequence 3D Prostate MRI Data Scarcity through Domain Adaptation using Locally-Trained Latent Diffusion Models for Prostate Cancer Detection
Emerson P. Grabke, Babak Taati, Masoom A. Haider
arxiv.org/abs/2507.06384

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 09:49:30

When Simpler Wins: Facebooks Prophet vs LSTM for Air Pollution Forecasting in Data-Constrained Northern Nigeria
Habeeb Balogun, Yahaya Zakari
arxiv.org/abs/2508.16244

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:44:52

Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Chaojun Nie, Jun Zhou, Guanxiang Wang, Shisong Wud, Zichen Wang
arxiv.org/abs/2509.20162

@arXiv_csAI_bot@mastoxiv.page
2025-09-22 08:14:31

CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C Compilation Repair
Weixuan Sun, Jucai Zhai, Dengfeng Liu, Xin Zhang, Xiaojun Wu, Qiaobo Hao, AIMgroup, Yang Fang, Jiuyang Tang
arxiv.org/abs/2509.15690

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 10:02:30

Closer to Reality: Practical Semi-Supervised Federated Learning for Foundation Model Adaptation
Guangyu Sun, Jingtao Li, Weiming Zhuang, Chen Chen, Chen Chen, Lingjuan Lyu
arxiv.org/abs/2508.16568

@arXiv_eessIV_bot@mastoxiv.page
2025-08-25 07:51:40

Structure-Preserving Medical Image Generation from a Latent Graph Representation
Kevin Arias, Edwin Vargas, Kumar Vijay Mishra, Antonio Ortega, Henry Arguello
arxiv.org/abs/2508.15920

@arXiv_csSD_bot@mastoxiv.page
2025-09-25 08:58:22

SEA-Spoof: Bridging The Gap in Multilingual Audio Deepfake Detection for South-East Asian
Jinyang Wu, Nana Hou, Zihan Pan, Qiquan Zhang, Sailor Hardik Bhupendra, Soumik Mondal
arxiv.org/abs/2509.19865

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:57:14

SloPalSpeech: A 2,8000-Hour Slovak Speech Corpus from Parliamentary Data
Erik Božík, Marek Šuppa
arxiv.org/abs/2509.19270 arx…

@arXiv_csSE_bot@mastoxiv.page
2025-09-23 10:37:21

Deep Synthetic Cross-Project Approaches for Software Reliability Growth Modeling
Taehyoun Kim, Duksan Ryu, Jongmoon Baik
arxiv.org/abs/2509.16939

@arXiv_csRO_bot@mastoxiv.page
2025-08-27 08:50:52

Engineering Automotive Digital Twins on Standardized Architectures: A Case Study
Stefan Ramdhan, Winnie Trandinh, Istvan David, Vera Pantelic, Mark Lawford
arxiv.org/abs/2508.18662

@arXiv_csSD_bot@mastoxiv.page
2025-09-24 09:21:54

Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation
Aditya Bhattacharjee, Marco Pasini, Emmanouil Benetos
arxiv.org/abs/2509.18620

@arXiv_csLG_bot@mastoxiv.page
2025-07-24 10:18:09

ViRN: Variational Inference and Distribution Trilateration for Long-Tailed Continual Representation Learning
Hao Dai, Chong Tang, Jagmohan Chauhan
arxiv.org/abs/2507.17368

@arXiv_csCV_bot@mastoxiv.page
2025-09-26 10:23:11

Decipher-MR: A Vision-Language Foundation Model for 3D MRI Representations
Zhijian Yang, Noel DSouza, Istvan Megyeri, Xiaojian Xu, Amin Honarmandi Shandiz, Farzin Haddadpour, Krisztian Koos, Laszlo Rusko, Emanuele Valeriano, Bharadwaj Swaninathan, Lei Wu, Parminder Bhatia, Taha Kass-Hout, Erhan Bas
arxiv.org/abs/2509.21249

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:57:21

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing
Yuhang Dai, Ziyu Zhang, Shuai Wang, Longhao Li, Zhao Guo, Tianlun Zuo, Shuiyuan Wang, Hongfei Xue, Chengyou Wang, Qing Wang, Xin Xu, Hui Bu, Jie Li, Jian Kang, Binbin Zhang, Lei Xie
arxiv.org/abs/2509.18004

@arXiv_csRO_bot@mastoxiv.page
2025-09-25 08:20:22

CU-Multi: A Dataset for Multi-Robot Collaborative Perception
Doncey Albin, Daniel McGann, Miles Mena, Annika Thomas, Harel Biggie, Xuefei Sun, Steve McGuire, Jonathan P. How, Christoffer Heckman
arxiv.org/abs/2509.19463

@arXiv_csCV_bot@mastoxiv.page
2025-08-27 10:28:53

GReAT: leveraging geometric artery data to improve wall shear stress assessment
Julian Suk, Jolanda J. Wentzel, Patryk Rygiel, Joost Daemen, Daniel Rueckert, Jelmer M. Wolterink
arxiv.org/abs/2508.19030

@arXiv_csSD_bot@mastoxiv.page
2025-08-25 08:31:00

Vevo2: Bridging Controllable Speech and Singing Voice Generation via Unified Prosody Learning
Xueyao Zhang, Junan Zhang, Yuancheng Wang, Chaoren Wang, Yuanzhe Chen, Dongya Jia, Zhuo Chen, Zhizheng Wu
arxiv.org/abs/2508.16332

@arXiv_csLG_bot@mastoxiv.page
2025-09-22 10:23:31

Bayesian Physics Informed Neural Networks for Reliable Transformer Prognostics
Ibai Ramirez, Jokin Alcibar, Joel Pino, Mikel Sanz, David Pardo, Jose I. Aizpurua
arxiv.org/abs/2509.15933

@arXiv_csRO_bot@mastoxiv.page
2025-09-24 10:50:54

World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation
Zhennan Jiang, Kai Liu, Yuxin Qin, Shuai Tian, Yupeng Zheng, Mingcai Zhou, Chao Yu, Haoran Li, Dongbin Zhao
arxiv.org/abs/2509.19080

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 09:46:32

Towards Robust In-Context Learning for Medical Image Segmentation via Data Synthesis
Jiesi Hu, Yanwu Yang, Zhiyu Ye, Chenfei Ye, Hanyang Peng, Jianfeng Cao, Ting Ma
arxiv.org/abs/2509.19711

@arXiv_csCL_bot@mastoxiv.page
2025-07-24 08:18:49

Leveraging Synthetic Data for Question Answering with Multilingual LLMs in the Agricultural Domain
Rishemjit Kaur, Arshdeep Singh Bhankhar, Surangika Ranathunga, Jashanpreet Singh Salh, Sudhir Rajput, Vidhi, Kashish Mahendra, Bhavika Berwal, Ritesh Kumar
arxiv.org/abs/2507.16974

@arXiv_csSD_bot@mastoxiv.page
2025-09-23 10:12:10

Sidon: Fast and Robust Open-Source Multilingual Speech Restoration for Large-scale Dataset Cleansing
Wataru Nakata, Yuki Saito, Yota Ueda, Hiroshi Saruwatari
arxiv.org/abs/2509.17052

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:46:12

EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning
Xuan Ju, Tianyu Wang, Yuqian Zhou, He Zhang, Qing Liu, Nanxuan Zhao, Zhifei Zhang, Yijun Li, Yuanhao Cai, Shaoteng Liu, Daniil Pakhomov, Zhe Lin, Soo Ye Kim, Qiang Xu
arxiv.org/abs/2509.20360

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:13:51

From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation
Mardiyyah Oduwole, Oluwatosin Olajide, Jamiu Suleiman, Faith Hunja, Busayo Awobade, Fatimo Adebanjo, Comfort Akanni, Chinonyelum Igwe, Peace Ododo, Promise Omoigui, Steven Kolawole, Abraham Owodunni
arxiv.org/abs/2509.07471

@arXiv_csRO_bot@mastoxiv.page
2025-09-23 11:53:00

OpenGVL - Benchmarking Visual Temporal Progress for Data Curation
Paweł Budzianowski, Emilia Wiśnios, Gracjan Góral, Igor Kulakov, Viktor Petrenko, Krzysztof Walas
arxiv.org/abs/2509.17321

@arXiv_csLG_bot@mastoxiv.page
2025-08-22 10:15:41

GRASPED: Graph Anomaly Detection using Autoencoder with Spectral Encoder and Decoder (Full Version)
Wei Herng Choong, Jixing Liu, Ching-Yu Kao, Philip Sperl
arxiv.org/abs/2508.15633

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 09:15:12

MoTiC: Momentum Tightness and Contrast for Few-Shot Class-Incremental Learning
Zeyu He, Shuai Huang, Yuwu Lu, Ming Zhao
arxiv.org/abs/2509.19664

@arXiv_csCL_bot@mastoxiv.page
2025-09-22 09:58:31

A method for improving multilingual quality and diversity of instruction fine-tuning datasets
Chunguang Zhao, Yilun Liu, Pufan Zeng, Yuanchang Luo, Shimin Tao, Minggui He, Weibin Meng, Song Xu, Ziang Chen, Chen Liu, Hongxia Ma, Li Zhang, Boxing Chen, Daimeng Wei
arxiv.org/abs/2509.15549

@arXiv_csRO_bot@mastoxiv.page
2025-09-23 10:06:20

HDMI: Learning Interactive Humanoid Whole-Body Control from Human Videos
Haoyang Weng, Yitang Li, Nikhil Sobanbabu, Zihan Wang, Zhengyi Luo, Tairan He, Deva Ramanan, Guanya Shi
arxiv.org/abs/2509.16757

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:49:20

UniEM-3M: A Universal Electron Micrograph Dataset for Microstructural Segmentation and Generation
Nan Wang, Zhiyi Xia, Yiming Li, Shi Tang, Zuxin Fan, Xi Fang, Haoyi Tao, Xiaochen Cai, Guolin Ke, Linfeng Zhang, Yanhui Hong
arxiv.org/abs/2508.16239

@arXiv_csCL_bot@mastoxiv.page
2025-08-22 09:52:51

UniCoM: A Universal Code-Switching Speech Generator
Sangmin Lee, Woojin Chung, Seyun Um, Hong-Goo Kang
arxiv.org/abs/2508.15244 arxiv.org/p…

@arXiv_csCV_bot@mastoxiv.page
2025-09-23 13:10:01

Can multimodal representation learning by alignment preserve modality-specific information?
Romain Thoreau, Jessie Levillain, Dawa Derksen
arxiv.org/abs/2509.17943

@arXiv_csCL_bot@mastoxiv.page
2025-08-21 09:58:00

ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Junying Chen, Zhenyang Cai, Zhiheng Liu, Yunjin Yang, Rongsheng Wang, Qingying Xiao, Xiangyi Feng, Zhan Su, Jing Guo, Xiang Wan, Guangjun Yu, Haizhou Li, Benyou Wang
arxiv.org/abs/2508.14706

@arXiv_csCV_bot@mastoxiv.page
2025-09-23 13:11:21

GraDeT-HTR: A Resource-Efficient Bengali Handwritten Text Recognition System utilizing Grapheme-based Tokenizer and Decoder-only Transformer
Md. Mahmudul Hasan, Ahmed Nesar Tahsin Choudhury, Mahmudul Hasan, Md. Mosaddek Khan
arxiv.org/abs/2509.18081

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:36:21

Can maiBERT Speak for Maithili?
Sumit Yadav, Raju Kumar Yadav, Utsav Maskey, Gautam Siddharth Kashyap, Md Azizul Hoque, Ganesh Gautam
arxiv.org/abs/2509.15048

@arXiv_csCV_bot@mastoxiv.page
2025-09-22 10:34:51

DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching
Meng Yang, Fan Fan, Zizhuo Li, Songchu Deng, Yong Ma, Jiayi Ma
arxiv.org/abs/2509.16017

@arXiv_csCV_bot@mastoxiv.page
2025-07-23 10:31:22

Enhancing Remote Sensing Vision-Language Models Through MLLM and LLM-Based High-Quality Image-Text Dataset Generation
Yiguo He, Junjie Zhu, Yiying Li, Xiaoyu Zhang, Chunping Qiu, Jun Wang, Qiangjuan Huang, Ke Yang
arxiv.org/abs/2507.16716

@arXiv_csCV_bot@mastoxiv.page
2025-08-21 10:07:20

Controllable Latent Space Augmentation for Digital Pathology
Sofiène Boutaj, Marin Scalbert, Pierre Marza, Florent Couzinie-Devy, Maria Vakalopoulou, Stergios Christodoulidis
arxiv.org/abs/2508.14588