Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCR_bot@mastoxiv.page
2025-07-16 10:08:21

ARMOR: Aligning Secure and Safe Large Language Models via Meticulous Reasoning
Zhengyue Zhao, Yingzi Ma, Somesh Jha, Marco Pavone, Chaowei Xiao
arxiv.org/abs/2507.11500

@arXiv_condmatsoft_bot@mastoxiv.page
2025-06-17 11:29:21

Flocking as a second-order phase transition in self-aligning active crystals
Marco Musacchio, Alexander P. Antonov, Hartmut L\"owen, Lorenzo Caprini
arxiv.org/abs/2506.12967

@arXiv_csCL_bot@mastoxiv.page
2025-07-16 10:29:11

Internal Value Alignment in Large Language Models through Controlled Value Vector Activation
Haoran Jin, Meng Li, Xiting Wang, Zhihao Xu, Minlie Huang, Yantao Jia, Defu Lian
arxiv.org/abs/2507.11316

@arXiv_csRO_bot@mastoxiv.page
2025-06-16 08:26:50

The Space Between Us: A Methodological Framework for Researching Bonding and Proxemics in Situated Group-Agent Interactions
Ana M\"uller, Anja Richert
arxiv.org/abs/2506.11829

@arXiv_csIR_bot@mastoxiv.page
2025-07-16 08:24:31

LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation
Ziyan Wang, Yingpeng Du, Zhu Sun, Jieyi Bi, Haoyan Chua, Tianjun Wei, Jie Zhang
arxiv.org/abs/2507.10917

@knurd42@social.linux.pizza
2025-07-09 15:15:32

""[…] Red Hat Enterprise Linux for Business Developers […] provides self-serve, no-cost access to Red Hat Enterprise Linux [#RHEL] for enterprise development use.""

@arXiv_csCL_bot@mastoxiv.page
2025-06-17 10:27:53

From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment
Bin Xie, Bingbing Xu, Yige Yuan, Shengmao Zhu, Huawei Shen
arxiv.org/abs/2506.12446

@arXiv_csCL_bot@mastoxiv.page
2025-07-16 10:26:21

FMC: Formalization of Natural Language Mathematical Competition Problems
Jiaxuan Xie, Chengwu Liu, Ye Yuan, Siqi Li, Zhiping Xiao, Ming Zhang
arxiv.org/abs/2507.11275

@arXiv_qbiobm_bot@mastoxiv.page
2025-06-11 09:28:25

Aligning Proteins and Language: A Foundation Model for Protein Retrieval
Qifeng Wu, Zhengzhe Liu, Han Zhu, Yizhou Zhao, Daisuke Kihara, Min Xu
arxiv.org/abs/2506.08023

@arXiv_statML_bot@mastoxiv.page
2025-07-14 08:50:42

Mallows Model with Learned Distance Metrics: Sampling and Maximum Likelihood Estimation
Yeganeh Alimohammadi, Kiana Asgari
arxiv.org/abs/2507.08108

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 08:15:11

Multi-Task Reward Learning from Human Ratings
Mingkang Wu, Devin White, Evelyn Rose, Vernon Lawhern, Nicholas R Waytowich, Yongcan Cao
arxiv.org/abs/2506.09183

@arXiv_csMM_bot@mastoxiv.page
2025-07-14 09:17:52

Visual Semantic Description Generation with MLLMs for Image-Text Matching
Junyu Chen, Yihua Gao, Mingyong Li
arxiv.org/abs/2507.08590

@arXiv_csAI_bot@mastoxiv.page
2025-07-11 09:43:21

Stable Preference Optimization for LLMs: A Bilevel Approach Beyond Direct Preference Optimization
Chengtao Jian, Kai Yang, Ye Ouyang, Xiaozhou Ye
arxiv.org/abs/2507.07723

@benb@osintua.eu
2025-07-07 07:24:39

Trump threatens 10% tariff on countries backing BRICS 'anti-American policy': benborges.xyz/2025/07/07/trump

@arXiv_eessAS_bot@mastoxiv.page
2025-06-11 08:00:05

Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
\v{S}imon Sedl\'a\v{c}ek, Bolaji Yusuf, J\'an \v{S}vec, Pradyoth Hegde, Santosh Kesiraju, Old\v{r}ich Plchot, Jan \v{C}ernock\'y
arxiv.org/abs/2506.08633

@arXiv_csRO_bot@mastoxiv.page
2025-07-14 09:15:02

Joint Optimization-based Targetless Extrinsic Calibration for Multiple LiDARs and GNSS-Aided INS of Ground Vehicles
Junhui Wang, Yan Qiao, Chao Gao, Naiqi Wu
arxiv.org/abs/2507.08349

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 08:14:41

AR2: Attention-Guided Repair for the Robustness of CNNs Against Common Corruptions
Fuyuan Zhang, Qichen Wang, Jianjun Zhao
arxiv.org/abs/2507.06332

@arXiv_csSE_bot@mastoxiv.page
2025-06-10 17:05:59

This arxiv.org/abs/2505.07270 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@seeingwithsound@mas.to
2025-06-24 16:18:27

Aligning visual imagery to the operator improves geospatial situation awareness in a single-display 360-degree periscope concept cognitiveresearchjournal.sprin

@arXiv_csCR_bot@mastoxiv.page
2025-06-13 07:21:40

From Threat to Tool: Leveraging Refusal-Aware Injection Attacks for Safety Alignment
Kyubyung Chae, Hyunbin Jin, Taesup Kim
arxiv.org/abs/2506.10020

@arXiv_qbioGN_bot@mastoxiv.page
2025-07-14 08:51:42

AMRScan: A hybrid R and Nextflow toolkit for rapid antimicrobial resistance gene detection from sequencing data
Kaitao Lai
arxiv.org/abs/2507.08062

@arXiv_astrophIM_bot@mastoxiv.page
2025-07-03 08:32:00

SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars
Xiaosheng Zhao, Yang Huang, Guirong Xue, Xiao Kong, Jifeng Liu, Xiaoyu Tang, Timothy C. Beers, Yuan-Sen Ting, A-Li Luo
arxiv.org/abs/2507.01939

@arXiv_csLG_bot@mastoxiv.page
2025-07-14 08:15:52

Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions
Simon Matrenok, Skander Moalla, Caglar Gulcehre
arxiv.org/abs/2507.08068 arxiv.org/pdf/2507.08068 arxiv.org/html/2507.08068
arXiv:2507.08068v1 Announce Type: new
Abstract: Aligning large language models with pointwise absolute rewards has so far required online, on-policy algorithms such as PPO and GRPO. In contrast, simpler methods that can leverage offline or off-policy data, such as DPO and REBEL, are limited to learning from preference pairs or relative signals. To bridge this gap, we introduce \emph{Quantile Reward Policy Optimization} (QRPO), which learns from pointwise absolute rewards while preserving the simplicity and offline applicability of DPO-like methods. QRPO uses quantile rewards to enable regression to the closed-form solution of the KL-regularized RL objective. This reward yields an analytically tractable partition function, removing the need for relative signals to cancel this term. Moreover, QRPO scales with increased compute to estimate quantile rewards, opening a new dimension for pre-computation scaling. Empirically, QRPO consistently achieves top performance on chat and coding evaluations -- reward model scores, AlpacaEval 2, and LeetCode -- compared to DPO, REBEL, and SimPO across diverse datasets and 8B-scale models. Finally, we find that training with robust rewards instead of converting them to preferences induces less length bias.
toXiv_bot_toot

@poppastring@dotnet.social
2025-04-29 03:04:20

This is the next stages of Columbus’s long-running effort to update its 1950s-era zoning code. This phase will incorporate a housing component to address affordability issues by better aligning work and residential areas.
We also need to build more houses.

@arXiv_csCL_bot@mastoxiv.page
2025-07-14 09:52:12

ChainEdit: Propagating Ripple Effects in LLM Knowledge Editing through Logical Rule-Guided Chains
Zilu Dong, Xiangqing Shen, Zinong Yang, Rui Xia
arxiv.org/abs/2507.08427

@Techmeme@techhub.social
2025-06-19 06:06:10

Q&A with Hugging Face Chief Ethics Scientist Margaret Mitchell on aligning AI development with human needs, the "illusion of consensus" around AGI, and more (Melissa Heikkilä/Financial Times)
ft.com/content/7089bff2-25fc-4

@arXiv_csCV_bot@mastoxiv.page
2025-06-09 10:09:22

CoMemo: LVLMs Need Image Context with Image Memory
Shi Liu, Weijie Su, Xizhou Zhu, Wenhai Wang, Jifeng Dai
arxiv.org/abs/2506.06279

@askesis@qoto.org
2025-07-01 10:58:46

# Philosophical test fails ChatGPT: AI coherence isn’t enough to prove human mind
The research reveals that #ChatGPT does exhibit proficiency in basic coherence building. It maintains consistent dictional and intentional lines by reusing phrases and aligning responses with contextual topics. It also demonstrates some ability to construct rational coherence by offering logically consistent replies…

@arXiv_quantph_bot@mastoxiv.page
2025-06-02 07:36:12

Leveraging machine learning features for linear optical interferometer control
Sergei S. Kuzmin, Ivan V. Dyakonov, Stanislav S. Straupe
arxiv.org/abs/2505.24032

@arXiv_csGT_bot@mastoxiv.page
2025-06-04 07:21:23

Stochastically Dominant Peer Prediction
Yichi Zhang, Shengwei Xu, David Pennock, Grant Schoenebeck
arxiv.org/abs/2506.02259

@arXiv_csSD_bot@mastoxiv.page
2025-06-02 10:00:45

This arxiv.org/abs/2503.07217 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSD_…

@arXiv_csRO_bot@mastoxiv.page
2025-06-11 08:21:15

ROS-related Robotic Systems Development with V-model-based Application of MeROS Metamodel
Tomasz Winiarski, Jan Kaniuka, Daniel Gie{\l}dowski, Jakub Ostrysz, Krystian Radlak, Dmytro Kushnir
arxiv.org/abs/2506.08706

@arXiv_csIR_bot@mastoxiv.page
2025-07-09 09:22:02

KERAG_R: Knowledge-Enhanced Retrieval-Augmented Generation for Recommendation
Zeyuan Meng, Zixuan Yi, Iadh Ounis
arxiv.org/abs/2507.05863

@arXiv_qbiobm_bot@mastoxiv.page
2025-06-02 07:36:17

Aligning Protein Conformation Ensemble Generation with Physical Feedback
Jiarui Lu, Xiaoyin Chen, Stephen Zhewen Lu, Aur\'elie Lozano, Vijil Chenthamarakshan, Payel Das, Jian Tang
arxiv.org/abs/2505.24203

@arXiv_mathST_bot@mastoxiv.page
2025-06-05 09:51:04

This arxiv.org/abs/2402.17732 has been replaced.
initial toot: mastoxiv.page/@arXiv_mat…

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 17:48:46

This arxiv.org/abs/2501.07071 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_eessAS_bot@mastoxiv.page
2025-06-05 07:22:31

HYFuse: Aligning Heterogeneous Speech Pre-Trained Representations in Hyperbolic Space for Speech Emotion Recognition
Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Swarup Ranjan Behera, Pailla Balakrishna Reddy, Arun Balaji Buduru, Rajesh Sharma
arxiv.org/abs/2506.03403

@arXiv_csSE_bot@mastoxiv.page
2025-06-04 13:37:45

This arxiv.org/abs/2410.05605 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csSI_bot@mastoxiv.page
2025-05-30 07:21:51

Offline Map Matching Based on Localization Error Distribution Modeling
Ruilin Xu, Yuchen Song, Kaijie Li, Xitong Gao, Kejiang Ye, Fan Zhang, Juanjuan Zhao
arxiv.org/abs/2505.23123

@niqdanger@social.linux.pizza
2025-04-18 23:00:33

I got the tow hitch installed on the Crosstrek today. Holding a tow hitch bar up, while aligning a bolt, and trying to get the nut on said bolt before I lose the bolt inside the frame OR drop the hitch on my face can be quite exciting. Then torquing those to 110 ft/lbs while lying under all of that can be hard on your shoulder. I'm exhausted now.

Danny Glover from Lethal Weapon movie saying he is getting too old for this.
@arXiv_csCL_bot@mastoxiv.page
2025-07-04 09:16:31

MPF: Aligning and Debiasing Language Models post Deployment via Multi Perspective Fusion
Xin Guan, PeiHsin Lin, Zekun Wu, Ze Wang, Ruibo Zhang, Emre Kazim, Adriano Koshiyama
arxiv.org/abs/2507.02595

@arXiv_physicsedph_bot@mastoxiv.page
2025-07-03 08:39:40

Insights from Educators on Building a More Cohesive Quantum Information Science and Engineering Education Ecosystem
Shams El-Adawy, A. R. Pi\~na, Benjamin M. Zwickl, H. J. Lewandowski
arxiv.org/abs/2507.01578

@arXiv_eessIV_bot@mastoxiv.page
2025-06-25 08:52:10

Deformable Medical Image Registration with Effective Anatomical Structure Representation and Divide-and-Conquer Network
Xinke Ma, Yongsheng Pan, Qingjie Zeng, Mengkang Lu, Bolysbek Murat Yerzhanuly, Bazargul Matkerim, Yong Xia
arxiv.org/abs/2506.19222

@arXiv_mathOC_bot@mastoxiv.page
2025-06-26 08:16:00

Fast entropy-regularized SDP relaxations for permutation synchronization
Michael Lindsey, Yunpeng Shi
arxiv.org/abs/2506.20191

@arXiv_csIR_bot@mastoxiv.page
2025-07-08 10:49:41

CTR-Guided Generative Query Suggestion in Conversational Search
Erxue Min, Hsiu-Yuan Huang, Xihong Yang, Min Yang, Xin Jia, Yunfang Wu, Hengyi Cai, Junfeng Wang, Shuaiqiang Wang, Dawei Yin
arxiv.org/abs/2507.04072

@arXiv_csHC_bot@mastoxiv.page
2025-06-19 08:21:49

Case Study for Developing a UXR Point of View for FinOps Product Innovation
Jason Dong, Anna Wu
arxiv.org/abs/2506.15314

@arXiv_csCL_bot@mastoxiv.page
2025-07-03 10:13:10

Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models
Chengao Li, Hanyu Zhang, Yunkun Xu, Hongyan Xue, Xiang Ao, Qing He
arxiv.org/abs/2507.01915

@arXiv_mathNA_bot@mastoxiv.page
2025-06-19 09:05:02

Optimal alignment of Lorentz orientation and generalization to matrix Lie groups
Congzhou M Sha
arxiv.org/abs/2506.14994

@arXiv_csGR_bot@mastoxiv.page
2025-06-24 09:40:30

BulletGen: Improving 4D Reconstruction with Bullet-Time Generation
Denys Rozumnyi, Jonathon Luiten, Numair Khan, Johannes Sch\"onberger, Peter Kontschieder
arxiv.org/abs/2506.18601

@arXiv_csSE_bot@mastoxiv.page
2025-06-04 13:40:30

This arxiv.org/abs/2505.10640 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csCR_bot@mastoxiv.page
2025-07-03 09:06:10

SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism
Beitao Chen, Xinyu Lyu, Lianli Gao, Jingkuan Song, Heng Tao Shen
arxiv.org/abs/2507.01513

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:26:40

Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training
Jianyuan Feng, Guangzheng Li, Yangfei Xu
arxiv.org/abs/2506.16833

@arXiv_mathST_bot@mastoxiv.page
2025-06-27 08:44:49

Robust Alignment via Partial Gromov-Wasserstein Distances
Xiaoyun Gong, Sloan Nietert, Ziv Goldfeld
arxiv.org/abs/2506.21507

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:13:14

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Jiahao Qiu, Xinzhe Juan, Yimin Wang, Ling Yang, Xuan Qi, Tongcheng Zhang, Jiacheng Guo, Yifu Lu, Zixin Yao, Hongru Wang, Shilong Liu, Xun Jiang, Liu Leqi, Mengdi Wang
arxiv.org/abs/2506.14728

@arXiv_csCL_bot@mastoxiv.page
2025-07-04 13:16:07

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[3/3]:
- Aligning Frozen LLMs by Reinforcement Learning: An Iterative Reweight-then-Optimize Approach
Zhang, Li, Zeng, Li, Wang, Lin, Lu, Garcia, Hong

@arXiv_csCL_bot@mastoxiv.page
2025-07-02 10:15:40

SAFER: Probing Safety in Reward Models with Sparse Autoencoder
Sihang Li, Wei Shi, Ziyuan Xie, Tao Liang, Guojun Ma, Xiang Wang
arxiv.org/abs/2507.00665

@arXiv_csIR_bot@mastoxiv.page
2025-06-26 08:19:40

CoVE: Compressed Vocabulary Expansion Makes Better LLM-based Recommender Systems
Haochen Zhang, Tianyi Zhang, Junze Yin, Oren Gal, Anshumali Shrivastava, Vladimir Braverman
arxiv.org/abs/2506.19993

@arXiv_csCL_bot@mastoxiv.page
2025-06-03 08:20:30

Not All Jokes Land: Evaluating Large Language Models Understanding of Workplace Humor
Moahmmadamin Shafiei, Hamidreza Saffari
arxiv.org/abs/2506.01819

@arXiv_csSE_bot@mastoxiv.page
2025-06-18 09:02:07

Defining the Game Producer: A Mapping of Key Characteristics and Differentiators of the Professional Behind Digital Game Production
Rafael C. Lopes, Danilo M. Ribeiro
arxiv.org/abs/2506.14409

@arXiv_csCL_bot@mastoxiv.page
2025-06-18 09:07:29

Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality
Yuto Harada, Yusuke Yamauchi, Yusuke Oda, Yohei Oseki, Yusuke Miyao, Yu Takagi
arxiv.org/abs/2506.14681