
2025-08-06 10:15:50
Cropping outperforms dropout as an augmentation strategy for training self-supervised text embeddings
Rita González-Márquez, Philipp Berens, Dmitry Kobak
https://arxiv.org/abs/2508.03453
StepWrite: Adaptive Planning for Speech-Driven Text Generation
Hamza El Alaoui, Atieh Taheri, Yi-Hao Peng, Jeffrey P. Bigham
https://arxiv.org/abs/2508.04011 https://
AnomalyLMM: Bridging Generative Knowledge and Discriminative Retrieval for Text-Based Person Anomaly Search
Hao Ju, Hu Zhang, Zhedong Zheng
https://arxiv.org/abs/2509.04376 http…
EmbedGrad: Gradient-Based Prompt Optimization in Embedding Space for Large Language Models
Xiaoming Hou, Jiquan Zhang, Zibin Lin, DaCheng Tao, Shengli Zhang
https://arxiv.org/abs/2508.03533
Challenges for AI in Multimodal STEM Assessments: a Human-AI Comparison
Aymeric de Chillaz, Anna Sotnikova, Patrick Jermann, Antoine Bosselut
https://arxiv.org/abs/2507.03013
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Hyungjin Kim, Seokho Ahn, Young-Duk Seo
https://arxiv.org/abs/2508.03481 h…
The changing role of cited papers over time: An analysis of highly cited papers based on a large full-text dataset
Gege Lin, Nees Jan van Eck, Haiyan Hou, Zhigang Hu
https://arxiv.org/abs/2509.04190
The Impact of Critique on LLM-Based Model Generation from Natural Language: The Case of Activity Diagrams
Parham Khamsepour, Mark Cole, Ish Ashraf, Sandeep Puri, Mehrdad Sabetzadeh, Shiva Nejati
https://arxiv.org/abs/2509.03463
Dear Apple,
It makes sense for you to reorder my frequently used emoji based on how frequently I use them, but please wait until I’ve closed the emoji keyboard before you do.
I wanted to text my wife three ♥️ but after the first it switched to 🤔 and then for the third it went to 🤷. It’s a good thing I noticed because
♥️♥️♥️ is very different than ♥️🤔🤷
Precision determination of $\alpha_\text{s}$ from Dijet Cross Sections in the Multi-TeV Range
João Pires
https://arxiv.org/abs/2507.01670 https://…
Some Remarks on the $l_1$-Robust Solution of LexRank Problem
Anna Timonina-Farkas
https://arxiv.org/abs/2509.04131 https://arxiv.org/pdf/2509.04131
Long; central Massachusetts colonial history
Today on a whim I visited a site in Massachusetts marked as "Huguenot Fort Ruins" on OpenStreetMap. I drove out with my 4-year-old through increasingly rural central Massachusetts forests & fields to end up on a narrow street near the top of a hill beside a small field. The neighboring houses had huge lawns, some with tractors.
Appropriately for this day and this moment in history, the history of the site turns out to be a microcosm of America. Across the field beyond a cross-shaped stone memorial stood an info board with a few diagrams and some text. The text of the main sign (including typos/misspellings) read:
"""
Town Is Formed
Early in the 1680's, interest began to generate to develop a town in the area west of Natick in the south central part of the Commonwealth that would be suitable for a settlement. A Mr. Hugh Campbell, a Scotch merchant of Boston petitioned the court for land for a colony. At about the same time, Joseph Dudley and William Stoughton also were desirous of obtaining land for a settlement. A claim was made for all lands west of the Blackstone River to the southern land of Massachusetts to a point northerly of the Springfield Road then running southwesterly until it joined the southern line of Massachusetts.
Associated with Dudley and Stoughton was Robert Thompson of London, England, Dr. Daniel Cox and John Blackwell, both of London and Thomas Freak of Hannington, Wiltshire, as proprietors. A stipulation in the acquisition of this land being that within four years thirty families and an orthodox minister settle in the area. An extension of this stipulation was granted at the end of the four years when no group large enough seemed to be willing to take up the opportunity.
In 1686, Robert Thompson met Gabriel Bernor and learned that he was seeking an area where his countrymen, who had fled their native France because of the Edict of Nantes, were desirous of a place to live. Their main concern was to settle in a place that would allow them freedom of worship. New Oxford, as it was the so-named, at that time included the larger part of Charlton, one-fourth of Auburn, one-fifth of Dudley and several square miles of the northeast portion of Southbridge as well as the easterly ares now known as Webster.
Joseph Dudley's assessment that the area was capable of a good settlement probably was based on the idea of the meadows already established along with the plains, ponds, brooks and rivers. Meadows were a necessity as they provided hay for animal feed and other uses by the settlers. The French River tributary books and streams provided a good source for fishing and hunting. There were open areas on the plains as customarily in November of each year, the Indians burnt over areas to keep them free of underwood and brush. It appeared then that this area was ready for settling.
The first seventy-five years of the settling of the Town of Oxford originally known as Manchaug, embraced three different cultures. The Indians were known to be here about 1656 when the Missionary, John Eliott and his partner Daniel Gookin visited in the praying towns. Thirty years later, in 1686, the Huguenots walked here from Boston under the guidance of their leader Isaac Bertrand DuTuffeau. The Huguenot's that arrived were not peasants, but were acknowledged to be the best Agriculturist, Wine Growers, Merchant's, and Manufacter's in France. There were 30 families consisting of 52 people. At the time of their first departure (10 years), due to Indian insurrection, there were 80 people in the group, and near their Meetinghouse/Church was a Cemetery that held 20 bodies. In 1699, 8 to 10 familie's made a second attempt to re-settle, failing after only four years, with the village being completely abandoned in 1704.
The English colonist made their way here in 1713 and established what has become a permanent settlement.
"""
All that was left of the fort was a crumbling stone wall that would have been the base of a higher wooden wall according to a picture of a model (I didn't think to get a shot of that myself). Only trees and brush remain where the multi-story main wooden building was.
This story has so many echoes in the present:
- The rich colonialists from Boston & London agree to settle the land, buying/taking land "rights" from the colonial British court that claimed jurisdiction without actually having control of the land. Whether the sponsors ever actually visited the land themselves I don't know. They surely profited somehow, whether from selling on the land rights later or collecting taxes/rent or whatever, but they needed poor laborers to actually do the work of developing the land (& driving out the original inhabitants, who had no say in the machinations of the Boston court).
- The land deal was on condition that the capital-holders who stood to profit would find settlers to actually do the work of colonizing. The British crown wanted more territory to be controlled in practice not just in theory, but they weren't going to be the ones to do the hard work.
- The capital-holders actually failed to find enough poor suckers to do their dirty work for 4 years, until the Huguenots, fleeing religious persecution in France, were desperate enough to accept their terms.
- Of course, the land was only so ripe for settlement because of careful tending over centuries by the natives who were eventually driven off, and whose land management practices are abandoned today. Given the mention of praying towns (& dates), this was after King Philip's War, which resulted in at least some forced resettlement of native tribes around the area, but the descendants of those "Indians" mentioned in this sign are still around. For example, this is the site of one local band of Nipmuck, whose namesake lake is about 5 miles south of the fort site: #LandBack.
Efficient Item ID Generation for Large-Scale LLM-based Recommendation
Anushya Subbiah, Vikram Aggarwal, James Pine, Steffen Rendle, Krishna Sayana, Kun Su
https://arxiv.org/abs/2509.03746
Graph Representation-based Model Poisoning on Federated LLMs in CyberEdge Networks
Hanlin Cai, Haofan Dong, Houtianfu Wang, Kai Li, Ozgur B. Akan
https://arxiv.org/abs/2507.01694 …
The AudioMOS Challenge 2025
Wen-Chin Huang, Hui Wang, Cheng Liu, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Erica Cooper, Yong Qin, Tomoki Toda
https://arxiv.org/abs/2509.01336 …
BALM-TSF: Balanced Multimodal Alignment for LLM-Based Time Series Forecasting
Shiqiao Zhou, Holger Schöner, Huanbo Lyu, Edouard Fouché, Shuo Wang
https://arxiv.org/abs/2509.00622
Handwriting Imagery EEG Classification based on Convolutional Neural Networks
Hao Yang, Guang Ouyang
https://arxiv.org/abs/2509.03111 https://arxiv.org/pdf…
TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models
Yuxin Gong (for the Alzheimer's Disease Neuroimaging Initiative), Se-in Jang (for the Alzheimer's Disease Neuroimaging Initiative), Wei Shao (for the Alzheimer's Disease Neuroimaging Initiative), Yi Su (for the Alzheimer's Disease Neuroimaging Initiative), Kuang Gong (for the Alzheimer's Disease Neuroimaging Initiative)
A Gentle Introduction to Algebraic Operads
Felicia Ferraioli
https://arxiv.org/abs/2508.01886 https://arxiv.org/pdf/2508.01886
GHTM: A Graph based Hybrid Topic Modeling Approach in Low-Resource Bengali Language
Farhana Haque, Md. Abdur Rahman, Sumon Ahmed
https://arxiv.org/abs/2508.00605 https://…
SRWToolkit: An Open Source Wizard of Oz Toolkit to Create Social Robotic Avatars
Atikkhan Faridkhan Nilgar, Kristof Van Laerhoven, Ayub Kinoti
https://arxiv.org/abs/2509.04356 h…
A robust and versatile deep learning model for prediction of the arterial input function in dynamic small animal $\left[^{18}\text{F}\right]$FDG PET imaging
Christian Salomonsen, Luigi Tommaso Luppino, Fredrik Aspheim, Kristoffer Wickstrøm, Elisabeth Wetzer, Michael Kampffmeyer, Rodrigo Berzaghi, Rune Sundset, Robert Jenssen, Samuel Kuttner
https:…
MPO: Multidimensional Preference Optimization for Language Model-based Text-to-Speech
Kangxiang Xia, Xinfa Zhu, Jixun Yao, Lei Xie
https://arxiv.org/abs/2509.00685 https://
Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation
Jun Luo, Zijing Zhao, Yang Liu
https://arxiv.org/abs/2508.03300 https://
Seeing Through Green: Text-Based Classification and the Firm's Returns from Green Patents
Lapo Santarlasci, Armando Rungi, Antonio Zinilli
https://arxiv.org/abs/2507.02287
Did you know I studied electrical engineering alongside work? On Thursday, I finished my bachelor's thesis with the final presentation and it's time to finally present my project: ZEReader, a microcontroller-based E-Reader.
Inspired by the Open Book Project by Joey Castillo, I designed my own platform from scratch. My focus was on building a reader usable in everyday life that is capable of handling books in the EPUB format. The project is still in a very early phase, but it shows…
Threads plans to let users hide text or images that spoil a piece of entertainment, blurring the text or image that has been marked as a spoiler (Alex Weprin/The Hollywood Reporter)
https://www.hollywoodreporter.com/business/digita…
DICOM De-Identification via Hybrid AI and Rule-Based Framework for Scalable, Uncertainty-Aware Redaction
Kyle Naddeo, Nikolas Koutsoubis, Rahul Krish, Ghulam Rasool, Nidhal Bouaynaya, Tony OSullivan, Raj Krish
https://arxiv.org/abs/2507.23736
Accurate and Consistent Graph Model Generation from Text with Large Language Models
Boqi Chen, Ou Wei, Bingzhou Zheng, Gunter Mussbacher
https://arxiv.org/abs/2508.00255 https:/…
On the inextensibility assumption in the stability of elastic rings: overhaul of a traditional paradigm
Federico Guarracino, Ida Mascolo
https://arxiv.org/abs/2509.02738 https:/…
The teachings of Falun Gong stem entirely from its enigmatic leader, Li Hongzhi, whom followers view as a “God-like figure.”
Since settling in the United States in the late 1990s, Li has remained reclusive, but his beliefs shape the core tenets of Falun Gong and its many tendrils: in addition to traditional Buddhist practices like qigong, a type of movement-based meditation, Falun Gong also teaches that homosexuality creates “bad karma” and is “comparable to organize…
Preliminary design and simulation for CEPC fast luminosity monitor detector based on 4H-SiC
Yanpeng Li, Meng Li, Xingrui Wang, Weimin Song, Xiyuan Zhang, Congcong Wang, Suyu Xiao, Haoyu Shi, Dou Wang, Philip Bambade, Xin Shi
https://arxiv.org/abs/2507.23368
ERank: Fusing Supervised Fine-Tuning and Reinforcement Learning for Effective and Efficient Text Reranking
Yuzheng Cai, Yanzhao Zhang, Dingkun Long, Mingxin Li, Pengjun Xie, Weiguo Zheng
https://arxiv.org/abs/2509.00520
VLMQ: Efficient Post-Training Quantization for Large Vision-Language Models via Hessian Augmentation
Yufei Xue, Yushi Huang, Jiawei Shao, Jun Zhang
https://arxiv.org/abs/2508.03351
PicoAudio2: Temporal Controllable Text-to-Audio Generation with Natural Language Description
Zihao Zheng, Zeyu Xie, Xuenan Xu, Wen Wu, Chao Zhang, Mengyue Wu
https://arxiv.org/abs/2509.00683
Secure Password Generator Based on Secure Pseudo-Random Number Generator
Abel C. H. Chen
https://arxiv.org/abs/2509.02578 https://arxiv.org/pdf/2509.02578
SiLVERScore: Semantically-Aware Embeddings for Sign Language Generation Evaluation
Saki Imai, Mert İnan, Anthony Sicilia, Malihe Alikhani
https://arxiv.org/abs/2509.03791 http…
First Observation of Solar Neutrino Interactions on $^{13}$C
SNO Collaboration: M. Abreu, A. Allega, M. R. Anderson, S. Andringa, D. M. Asner, D. J. Auty, A. Bacon, T. Baltazar, F. Barão, N. Barros, R. Bayes, E. W. Beier, A. Bialek, S. D. Biller, E. Caden, M. Chen, S. Cheng, B. Cleveland, D. Cookman, J. Corning, S. DeGraw, R. Dehghani, J. Deloye, M. M. Depatie, F. Di Lodovico, C. Dima, J. Dittmer, K. H. Dixon, M. S. Esmaeilian, E. Falk, N. Fatemighomi, R. Ford, A. Gaur, O. I. Go…
VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification.
This paper presents VLAI, a transformer-based model that predicts software vulnerability severity levels directly from text descriptions. Built on RoBERTa, VLAI is fine-tuned on over 600,000 real-world vulnerabilities and achieves over 82% accuracy in predicting severity categories, enabling faster and more consistent triage ahead of manual CVSS scoring. The model and dataset are open-source and integrated…
Benchmarking Filtered Approximate Nearest Neighbor Search Algorithms on Transformer-based Embedding Vectors
Patrick Iff, Paul Bruegger, Marcin Chrapek, Maciej Besta, Torsten Hoefler
https://arxiv.org/abs/2507.21989
Cubic vertex-transitive graphs of girth seven
Maruša Lekše, Micael Toledo
https://arxiv.org/abs/2508.19880 https://arxiv.org/pdf/2508.19880
TEn-CATS: Text-Enriched Audio-Visual Video Parsing with Multi-Scale Category-Aware Temporal Graph
Yaru Chen, Faegheh Sardari, Peiliang Zhang, Ruohao Guo, Yang Xiang, Zhenbo Li, Wenwu Wang
https://arxiv.org/abs/2509.04086
A Correspondence-Driven Approach for Bilevel Decision-making with Nonconvex Lower-Level Problems
Xiaotian Jiang, Jiaxiang Li, Mingyi Hong, Shuzhong Zhang
https://arxiv.org/abs/2509.01148
Anti-aliasing Algorithm Based on Three-dimensional Display Image
Ziyang Liu, Xingchen Xiao, Yueyang Xu
https://arxiv.org/abs/2507.00527 https://
DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval
Ruohong Yang, Peng Hu, Yunfan Li, Xi Peng
https://arxiv.org/abs/2509.04193 https://arxiv.or…
Enhancing Robustness of Autoregressive Language Models against Orthographic Attacks via Pixel-based Approach
Han Yang, Jian Lan, Yihong Liu, Hinrich Schütze, Thomas Seidl
https://arxiv.org/abs/2508.21206
Beyond QWERTY: A pressure-based text input approach for XR that enables a touch-typing like experience
Fabian Rücker, Torben Storch
https://arxiv.org/abs/2507.20741 https…
Towards Trustworthy Sentiment Analysis in Software Engineering: Dataset Characteristics and Tool Selection
Martin Obaidi, Marc Herrmann, Jil Klünder, Kurt Schneider
https://arxiv.org/abs/2507.02137
JoyTTS: LLM-based Spoken Chatbot With Voice Cloning
Fangru Zhou, Jun Zhao, Guoxin Wang
https://arxiv.org/abs/2507.02380 https://arxiv…
An Effective Strategy for Modeling Score Ordinality and Non-uniform Intervals in Automated Speaking Assessment
Tien-Hong Lo, Szu-Yu Chen, Yao-Ting Sung, Berlin Chen
https://arxiv.org/abs/2509.03372
PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red
Zihao Liu, Xinhang Sui, Yueran Song, Siwen Wang
https://arxiv.org/abs/2506.23689
CMRAG: Co-modality-based document retrieval and visual question answering
Wang Chen, Guanqiang Qi, Weikang Li, Yang Li
https://arxiv.org/abs/2509.02123 https://
Spotify launches a DM feature to let users share audio and send text-based messages, available to mobile users over 16 years old in "select markets" this week (Jess Weatherbed/The Verge)
https://www.theverge.com/news/765771/spotify-messages-dms-audio…
TeRA: Rethinking Text-driven Realistic 3D Avatar Generation
Yanwen Wang, Yiyu Zhuang, Jiawei Zhang, Li Wang, Yifei Zeng, Xun Cao, Xinxin Zuo, Hao Zhu
https://arxiv.org/abs/2509.02466
The Double-edged Sword of LLM-based Data Reconstruction: Understanding and Mitigating Contextual Vulnerability in Word-level Differential Privacy Text Sanitization
Stephen Meisenbacher, Alexandra Klymenko, Andreea-Elena Bodea, Florian Matthes
https://arxiv.org/abs/2508.18976
LLM-based Triplet Extraction for Automated Ontology Generation in Software Engineering Standards
Songhui Yue
https://arxiv.org/abs/2509.00140 https://arxiv…
Decoding the Poetic Language of Emotion in Korean Modern Poetry: Insights from a Human-Labeled Dataset and AI Modeling
Iro Lim, Haein Ji, Byungjun Kim
https://arxiv.org/abs/2509.03932
A Study on Zero-Shot Non-Intrusive Speech Intelligibility for Hearing Aids Using Large Language Models
Ryandhimas E. Zezario, Dyah A. M. G. Wisnu, Hsin-Min Wang, Yu Tsao
https://arxiv.org/abs/2509.03021
MMBERT: Scaled Mixture-of-Experts Multimodal BERT for Robust Chinese Hate Speech Detection under Cloaking Perturbations
Qiyao Xue, Yuchen Dou, Ryan Shi, Xiang Lorraine Li, Wei Gao
https://arxiv.org/abs/2508.00760
Agent0: Leveraging LLM Agents to Discover Multi-value Features from Text for Enhanced Recommendations
Blaž Škrlj, Benoît Guilleminot, Andraž Tori
https://arxiv.org/abs/2507.18993
RephraseTTS: Dynamic Length Text based Speech Insertion with Speaker Style Transfer
Neeraj Matiyali, Siddharth Srivastava, Gaurav Sharma
https://arxiv.org/abs/2508.17031 https:/…
Assessing GPTZero's Accuracy in Identifying AI vs. Human-Written Essays
Selin Dik, Osman Erdem, Mehmet Dik
https://arxiv.org/abs/2506.23517 https://
Dissecting Atomic Facts: Visual Analytics for Improving Fact Annotations in Language Model Evaluation
Manuel Schmidt, Daniel A. Keim, Frederik L. Dennig
https://arxiv.org/abs/2509.01460
Explicit and Implicit Data Augmentation for Social Event Detection
Congbo Ma, Yuxia Wang, Jia Wu, Jian Yang, Jing Du, Zitai Qiu, Qing Li, Hu Wang, Preslav Nakov
https://arxiv.org/abs/2509.04202
AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation
Le Wang, Jun Wang, Feng Deng, Chen Zhang, Kun Gai, Di Zhang
https://arxiv.org/abs/2508.00733
Capsule Network-Based Semantic Intent Modeling for Human-Computer Interaction
Shixiao Wang, Yifan Zhuang, Runsheng Zhang, Zhijun Song
https://arxiv.org/abs/2507.00540
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
Yibin Wang, Zhimin Li, Yuhang Zang, Yujie Zhou, Jiazi Bu, Chunyu Wang, Qinglin Lu, Cheng Jin, Jiaqi Wang
https://arxiv.org/abs/2508.20751
SpeechAccentLLM: A Unified Framework for Foreign Accent Conversion and Text to Speech
Cheng Zhuangfei, Zhang Guangyan, Tu Zehai, Song Yangyang, Mao Shuiyang, Jiao Xiaoqi, Li Jingyu, Guo Yiwen, Wu Jiasong
https://arxiv.org/abs/2507.01348
PAL: Designing Conversational Agents as Scalable, Cooperative Patient Simulators for Palliative-Care Training
Neil K. R. Sehgal, Hita Kambhamettu, Allen Chang, Andrew Zhu, Lyle Ungar, Sharath Chandra Guntuku
https://arxiv.org/abs/2507.02122
AudioBERTScore: Objective Evaluation of Environmental Sound Synthesis Based on Similarity of Audio embedding Sequences
Minoru Kishi, Ryosuke Sakai, Shinnosuke Takamichi, Yusuke Kanamori, Yuki Okamoto
https://arxiv.org/abs/2507.00475
T-TExTS (Teaching Text Expansion for Teacher Scaffolding): Enhancing Text Selection in High School Literature through Knowledge Graph-Based Recommendation
Nirmal Gelal, Chloe Snow, Ambyr Rios, Hande Küçük McGinty
https://arxiv.org/abs/2506.12075
FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models
Yuxuan Wang, Tianwei Cao, Huayu Zhang, Zhongjiang He, Kongming Liang, Zhanyu Ma
https://arxiv.org/abs/2507.02714
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- CPCL: Cross-Modal Prototypical Contrastive Learning for Weakly Supervised Text-based Person Retri...
Xinpeng Zhao, Yanwei Zheng, Chuanlin Lan, Xiaowei Zhang, Bowen Huang, Jibin Yang, Dongxiao Yu
Examining the Social Communication and Community Engagement of Autistic Adults through an Asynchronous Focus Group
Blade Frisch, Betts Peters, Keith Vertanen
https://arxiv.org/abs/2507.00202
Addressing Tokenization Inconsistency in Steganography and Watermarking Based on Large Language Models
Ruiyi Yan, Yugo Murawaki
https://arxiv.org/abs/2508.20718 https://
Leveraging Generative Models for Real-Time Query-Driven Text Summarization in Large-Scale Web Search
Zeyu Xiong, Yixuan Nan, Li Gao, Hengzhu Tang, Shuaiqiang Wang, Junfeng Wang, Dawei Yin
https://arxiv.org/abs/2508.20559
MUST-RAG: MUSical Text Question Answering with Retrieval Augmented Generation
Daeyong Kwon, SeungHeon Doh, Juhan Nam
https://arxiv.org/abs/2507.23334 https://
Security Tensors as a Cross-Modal Bridge: Extending Text-Aligned Safety to Vision in LVLM
Shen Li, Liuyi Yao, Wujia Niu, Lan Zhang, Yaliang Li
https://arxiv.org/abs/2507.20994 h…
AllSummedUp: un framework open-source pour comparer les metriques d'evaluation de resume
Tanguy Herserant, Vincent Guigue
https://arxiv.org/abs/2508.21389 https://
Sealing The Backdoor: Unlearning Adversarial Text Triggers In Diffusion Models Using Knowledge Distillation
Ashwath Vaithinathan Aravindan, Abha Jha, Matthew Salaway, Atharva Sandeep Bhide, Duygu Nur Yaldiz
https://arxiv.org/abs/2508.18235
An Enhanced Model-based Approach for Short Text Clustering
Enhao Cheng, Shoujia Zhang, Jianhua Yin, Xuemeng Song, Tian Gan, Liqiang Nie
https://arxiv.org/abs/2507.13793
Enhancing Remote Sensing Vision-Language Models Through MLLM and LLM-Based High-Quality Image-Text Dataset Generation
Yiguo He, Junjie Zhu, Yiying Li, Xiaoyu Zhang, Chunping Qiu, Jun Wang, Qiangjuan Huang, Ke Yang
https://arxiv.org/abs/2507.16716
Reasoning-Intensive Regression
Diane Tchuindjo, Omar Khattab
https://arxiv.org/abs/2508.21762 https://arxiv.org/pdf/2508.21762
Arabic Hate Speech Identification and Masking in Social Media using Deep Learning Models and Pre-trained Models Fine-tuning
Salam Thabet Doghmash, Motaz Saad
https://arxiv.org/abs/2507.23661
GUARD: Glocal Uncertainty-Aware Robust Decoding for Effective and Efficient Open-Ended Text Generation
Yuanhao Ding, Esteban Garces Arias, Meimingwei Li, Julian Rodemann, Matthias Aßenmacher, Danlu Chen, Gaojuan Fan, Christian Heumann, Chongsheng Zhang
https://arxiv.org/abs/2508.20757
An Ensemble Classification Approach in A Multi-Layered Large Language Model Framework for Disease Prediction
Ali Hamdi, Malak Mohamed, Rokaia Emad, Khaled Shaban
https://arxiv.org/abs/2509.02446
Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models
Erez Meoded
https://arxiv.org/abs/2508.11499 https://arxiv.org/pdf/25…
Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding
Feng Xiao, Jicong Fan
https://arxiv.org/abs/2507.12295 https://
Detection of Adverse Drug Events in Dutch clinical free text documents using Transformer Models: benchmark study
Rachel M. Murphy (Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, The Netherlands), Nishant Mishra (Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, The Netherlands), Nicolette F. de Keizer (Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, T…
Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models
Tobias Schreieder, Tim Schopf, Michael Färber
https://arxiv.org/abs/2508.15396
ProxAnn: Use-Oriented Evaluations of Topic Models and Document Clustering
Alexander Hoyle, Lorena Calvo-Bartolomé, Jordan Boyd-Graber, Philip Resnik
https://arxiv.org/abs/2507.00828
TrInk: Ink Generation with Transformer Network
Zezhong Jin, Shubhang Desai, Xu Chen, Biyi Fang, Zhuoyi Huang, Zhe Li, Chong-Xin Gan, Xiao Tu, Man-Wai Mak, Yan Lu, Shujie Liu
https://arxiv.org/abs/2508.21098
Restoring Rhythm: Punctuation Restoration Using Transformer Models for Bangla, a Low-Resource Language
Md Obyedullahil Mamun, Md Adyelullahil Mamun, Arif Ahmad, Md. Imran Hossain Emu
https://arxiv.org/abs/2507.18448
AutoPCR: Automated Phenotype Concept Recognition by Prompting
Yicheng Tao, Yuanhao Huang, Jie Liu
https://arxiv.org/abs/2507.19315 https://arxiv.org/pdf/25…
Confidence Estimation for Text-to-SQL in Large Language Models
Sepideh Entezari Maleki, Mohammadreza Pourreza, Davood Rafiei
https://arxiv.org/abs/2508.14056 https://
Granite Embedding R2 Models
Parul Awasthy, Aashka Trivedi, Yulong Li, Meet Doshi, Riyaz Bhat, Vignesh P, Vishwajeet Kumar, Yushu Yang, Bhavani Iyer, Abraham Daniels, Rudra Murthy, Ken Barker, Martin Franz, Madison Lee, Todd Ward, Salim Roukos, David Cox, Luis Lastras, Jaydeep Sen, Radu Florian
https://arxiv.org/abs/2508.21085
Fair Play in the Newsroom: Actor-Based Filtering Gender Discrimination in Text Corpora
Stefanie Urchs, Veronika Thurner, Matthias Aßenmacher, Christian Heumann, Stephanie Thiemichen
https://arxiv.org/abs/2508.13169