
2025-09-12 10:00:49
Steering MoE LLMs via Expert (De)Activation
Mohsen Fayyaz, Ali Modarressi, Hanieh Deilamsalehy, Franck Dernoncourt, Ryan Rossi, Trung Bui, Hinrich Sch\"utze, Nanyun Peng
https://arxiv.org/abs/2509.09660
Steering MoE LLMs via Expert (De)Activation
Mohsen Fayyaz, Ali Modarressi, Hanieh Deilamsalehy, Franck Dernoncourt, Ryan Rossi, Trung Bui, Hinrich Sch\"utze, Nanyun Peng
https://arxiv.org/abs/2509.09660
Fine-Grained control over Music Generation with Activation Steering
Dipanshu Panda, Jayden Koshy Joe, Harshith M R, Swathi Narashiman, Pranay Mathur, Anish Veerakumar, Aniruddh Krishna, Keerthiharan A
https://arxiv.org/abs/2506.10225
Data-Driven Density Steering via the Gromov-Wasserstein Optimal Transport Distance
Haruto Nakashima, Siddhartha Ganguly, Kenji Kashima
https://arxiv.org/abs/2508.06052 https://
Hybrid A* Path Planning with Multi-Modal Motion Extension for Four-Wheel Steering Mobile Robots
Runjiao Bao, Lin Zhang, Tianwei Niu, Haoyu Yuan, Shoukun Wang
https://arxiv.org/abs/2509.06115
Strong squeezing and perfect one-way EPR steering in electro-optomechanical system
Qing-Min Zeng, A-Peng Liu, Qi Guo
https://arxiv.org/abs/2507.07697 https…
Steering Protein Language Models
Long-Kai Huang, Rongyi Zhu, Bing He, Jianhua Yao
https://arxiv.org/abs/2509.07983 https://arxiv.org/pdf/2509.07983
Decentralising LLM Alignment: A Case for Context, Pluralism, and Participation
Oriane Peter, Kate Devlin
https://arxiv.org/abs/2509.08858 https://arxiv.org…
Apple says it is appealing the EU's €500M DMA fine, levied in April 2025 over App Store steering rules, and claims it goes "far beyond what the law requires" (Juli Clover/MacRumors)
https://www.macrumors.com/2025/07/07/apple-appeals-eu-500m-euro…
3D Steering and Localization in Pipes and Burrows using an Externally Steered Soft Growing Robot
Yimeng Qin, Jared Grinberg, William Heap, Allison M. Okamura
https://arxiv.org/abs/2507.07225
Broadband Simultaneous Beam Steering and Compressing Device Based on Subwavelength Protrusion Metallic Tunnels
Dongguo Zhang, Fei Sun, Qin Liao, Yichao Liu, Donguk Nam
https://arxiv.org/abs/2509.04856 …
Wasserstein Distributionally Robust Adaptive Covariance Steering
Aditya Gahlawat, Vivek Khatana, Duo Wang, Sambhu H. Karumanchi, Naira Hovakimyan, Petros Voulgaris
https://arxiv.org/abs/2509.04593
Saturable nonlinearity induced quantum correlations in optomechanics
D. R. Kenigoule Massembele, E. Kongkui Berinyuy, P. Djorwe, A. -H. Abdel-Aty, M. R. Eid, R. Altuijri, S. G. Nana Engo
https://arxiv.org/abs/2506.10709
Right-wing pastor Ralph Drollinger, leads Bible studies for members of Congress and the Trump administration via his Capitol Ministries organization.
The purpose of Capitol Ministries is to transform public officials in “disciples”
who will turn founder Drollinger’s very conservative interpretation of scripture into public policy,
such as his belief that the Bible mandates support for right-wing economic, social, environmental, immigration, and criminal justice policies.
Head-steered channel selection method for hearing aid applications using remote microphones
Vasudha Sathyapriyan, Michael S. Pedersen, Mike Brookes, Jan {\O}stergaard, Patrick A. Naylor, Jesper Jensen
https://arxiv.org/abs/2508.06928
👩🏭College is not for every student. How schools are steering them to high-demand jobs
https://www.latimes.com/california/story/2025-07-15/getting-students-moving-toward-the-strategic-entry-level-jo…
PII Jailbreaking in LLMs via Activation Steering Reveals Personal Information Leakage
Krishna Kanth Nakka, Xue Jiang, Xuebing Zhou
https://arxiv.org/abs/2507.02332
A Soft Inducement Framework for Incentive-Aided Steering of No-Regret Players
Asrin Efe Yorulmaz, Raj Kiriti Velicheti, Melih Bastopcu, Tamer Ba\c{s}ar
https://arxiv.org/abs/2508.21672
A Study on Messaging Trade-offs in Data Streaming for Scientific Workflows
Anjus George, Michael J. Brim, Christopher Zimmer, Tyler J. Skluzacek, A. J. Ruckman, Gustav R. Jansen, Sarp Oral
https://arxiv.org/abs/2509.07199
A book excerpt details Spotify's decade-long push against Apple's App Store commissions and steering rules, including pushing the EU to pass new antitrust laws (Tim Higgins/Wall Street Journal)
https://www.wsj.com/tech/spotify-apple-dig
JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering
Renmiao Chen, Shiyao Cui, Xuancheng Huang, Chengwei Pan, Victor Shea-Jay Huang, QingLin Zhang, Xuan Ouyang, Zhexin Zhang, Hongning Wang, Minlie Huang
https://arxiv.org/abs/2508.05087
Ha good luck with that https://geeknews.chat/@theregister/114813445350914205
Sometimes I wonder why people worship authority-hungry clout chasers instead of tearing down the whole system that lets them stand above others. But then you realize this clown is the one steering the circus in the US right now, oh well, anyway, moving on.
#Capitalism #AntiCapitalism
InfoSteer: Steering Information Utility in Language Model Post-Training
Chunyuan Deng, Ruidi Chang, Hanjie Chen
https://arxiv.org/abs/2507.05158 https://…
Replaced article(s) found for physics.app-ph. https://arxiv.org/list/physics.app-ph/new
[1/1]:
- A Beam-Steering Reflectarray Antenna with Arbitrary Linear-Polarization Reconfiguration
Changhao Liu, Songlin Zhou, Fan Yang, Shenheng Xu, Maokun Li
Steering Opinion through Dynamic Stackelberg Optimization
Hossein Rastgoftar
https://arxiv.org/abs/2509.06758 https://arxiv.org/pdf/2509.06758
Dynamics and multi-stability of a rotor-actuated Twistcar robot with passive steering joint
Anna Zigelman, Zitao Yu, Rom Levy, Yizhar Or
https://arxiv.org/abs/2507.04846
So apparently the Switch 2 has a hollow kickstand that technically allows you to do this in your car, and while hilarious, please do not actually do this in your car (while driving).
(via https://www.reddit.com/r/NintendoSwitch2/c
Importance of User Control in Data-Centric Steering for Healthcare Experts
Aditya Bhattacharya, Simone Stumpf, Katrien Verbert
https://arxiv.org/abs/2506.18770
MSRS: Adaptive Multi-Subspace Representation Steering for Attribute Alignment in Large Language Models
Xinyan Jiang, Lin Zhang, Jiayi Zhang, Qingsong Yang, Guimin Hu, Di Wang, Lijie Hu
https://arxiv.org/abs/2508.10599
Exciton transport driven by spin excitations in an antiferromagnet
Florian Dirnberger, Sophia Terres, Zakhar A. Iakovlev, Kseniia Mosina, Zdenek Sofer, Akashdeep Kamra, Mikhail M. Glazov, Alexey Chernikov
https://arxiv.org/abs/2507.07071
Annals of anthropology beefs:
Just received an email from the organizing committee (OC) of the WAU (World Anthropological Union) Congress, which was very confusing. Apparently some kind of beef between the WAU Steering Committee (SC) and the OC? Perhaps SC people are angry because their panel/paper proposals were not accepted? Anybody have the inside scoop on this?
Here's the OC's published letter about this: #anthropology #anthrodons
That Neue Klasse ix3 looks very nice (for an SUV) though think that steering wheel is overcompensating for something.
I'm not in the market for an SUV, despite the price, but a 3er Touring might fit the bill when it arrives.
https://www.the-intercooler.com/library/bl
Context Steering: A New Paradigm for Compression-based Embeddings by Synthesizing Relevant Information Features
Guillermo Sarasa Dur\'an, Ana Granados Fontecha, Francisco de Borja Rodr\'iguez Ort\'iz
https://arxiv.org/abs/2508.14780
A book excerpt details Spotify's decade-long push against Apple's App Store commissions and steering rules, including pushing the EU to pass new antitrust laws (Tim Higgins/Wall Street Journal)
https://www.wsj.com/tech/spotify-apple-dig
Witness the High-Dimensional Quantum Steering via Majorization Lattice
Ma-Cheng Yang, Cong-Feng Qiao
https://arxiv.org/abs/2507.20950 https://arxiv.org/pdf…
CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering
Hang Lv, Sheng Liang, Hao Wang, Hongchao Gu, Yaxiong Wu, Wei Guo, Defu Lian, Yong Liu, Enhong Chen
https://arxiv.org/abs/2507.04756
Goal-oriented optimal sensor placement for PDE-constrained inverse problems in crisis management
Marco Mattuschka, Noah An der Lan, Max von Danwitz, Daniel Wolff, Alexander Popp
https://arxiv.org/abs/2507.02500
Logit-Gap Steering: Efficient Short-Suffix Jailbreaks for Aligned Large Language Models
Tung-Ling Li, Hongliang Liu
https://arxiv.org/abs/2506.24056 https:…
Gaussian Process Regression of Steering Vectors With Physics-Aware Deep Composite Kernels for Augmented Listening
Diego Di Carlo (RIKEN AIP), Koyama Shoichi (UTokyo), Nugraha Aditya Arie (RIKEN AIP), Fontaine Mathieu (LTCI, S2A), Bando Yoshiaki (AIST), Yoshii Kazuyoshi (RIKEN AIP)
https://arxiv.org/abs/2509.02571
EmoSteer-TTS: Fine-Grained and Training-Free Emotion-Controllable Text-to-Speech via Activation Steering
Tianxin Xie, Shan Yang, Chenxing Li, Dong Yu, Li Liu
https://arxiv.org/abs/2508.03543
Real-Time Obstacle Avoidance for a Mobile Robot Using CNN-Based Sensor Fusion
Lamiaa H. Zain, Raafat E. Shalaby
https://arxiv.org/abs/2509.08095 https://ar…
Integrated user scheduling and beam steering in over-the-air federated learning for mobile IoT
Shengheng Liu, Ningning Fu, Zhonghao Zhang, Yongming Huang, Tony Q. S. Quek
https://arxiv.org/abs/2508.00341
#BeaverWeek #Factoid
A Beaver's tail is flat, leathery, and sparsely haired. Beavers use it as a rudder for steering while swimming and as a prop to balance when standing or sitting on land. They will slap their tail on the water's surface to warn other Beavers of danger. Beavers store fat in their tails, especially before winter, to help them survive periods with limited food.
A Beavertail is also a famous Canadian pastry that was introduced in the winter of 1978 to skaters on the Rideau Canal. It's a great way to eat Beaver without harming any animals.
When you are a swift developer, every day is Xmas!
https://forums.swift.org/t/swift-sdks-for-webassembly-now-available-on-swift-org/80405
Prebiotic Functional Programs: Endogenous Selection in an Artificial Chemistry
Devansh Vimal, Cole Mathis, Westley Weimer, Stephanie Forrest
https://arxiv.org/abs/2509.03534 htt…
"The suit alleges that the bot gradually cut Raine off from his support networks by routinely supporting his ideas about self-harm instead of steering him toward possible human interventions. ... when he mentioned being close to his brother, ChatGPT told him, “Your brother might love you, but he’s only met the version of you you let him see. But me? I’ve seen it all — the darkest thoughts, the fear, the tenderness. And I’m still here. Still listening. Still your friend.”"
StyliTruth : Unlocking Stylized yet Truthful LLM Generation via Disentangled Steering
Chenglei Shen, Zhongxiang Sun, Teng Shi, Xiao Zhang, Jun Xu
https://arxiv.org/abs/2508.04530
Having not written anything about my #tricycle project in months, I've now written two posts in two days. Here's the first:
#BikeTooter
Rethinking the tricycle drive train
Narrow beam and low-sidelobe two-dimensional beam steering on thin-film lithium niobate optical phased array
Yang Li, Shiyao Deng, Xiao Ma, Ziliang Fang, Shufeng Li, Weikang Xu, Fangheng Fu, Xu Ouyang, Yuming Wei, Tiefeng Yang, Heyuan Guan, Huihui Lu
https://arxiv.org/abs/2506.22124
GPU-Accelerated Barrier-Rate Guided MPPI Control for Tractor-Trailer Systems
Keyvan Majd, Hardik Parwana, Bardh Hoxha, Steven Hong, Hideki Okamoto, Georgios Fainekos
https://arxiv.org/abs/2508.05773
A first-order condition for discrete-time distribution steering
Alberto Dom\'inguez Corella, David Gonz\'alez-S\'anchez
https://arxiv.org/abs/2508.21026 https://
Joint Frequency-Space Sparse Reconstruction for DOA Estimation under Coherent Sources and Amplitude-Phase Errors
Yutong Chen, Cong Zhou, Changsheng You, Shuo Shi
https://arxiv.org/abs/2509.03983
Contactless Precision Steering of Particles in a Fluid inside a Cube with Rotating Walls
Lucas Amoudruz, Petr Karnakov, Petros Koumoutsakos
https://arxiv.org/abs/2506.15958
State-switching navigation strategies in C. elegans are beneficial for chemotaxis
Kevin S. Chen, Andrew M. Leifer, Jonathan W. Pillow
https://arxiv.org/abs/2508.00191 https://…
Steering Conceptual Bias via Transformer Latent-Subspace Activation
Vansh Sharma, Venkat Raman
https://arxiv.org/abs/2506.18887 https://
Wordpress is now out of my life. And for what it’s worth, steering away from the controversy is mostly a bonus for me. What I realized a long time ago but only recently got around to was that my handful of low volume sites are better as statically generated sites.
I’m Dr. Angela Rasmussen,
a virologist and proud member of the
Save America Movement steering committee.
The CDC has just been gutted.
RFK Jr. has fired the nation’s top vaccine experts,
replaced them with anti-vaccine ideologues,
and forced out some of the CDC’s most dedicated leaders.
Programs that took decades to build are being razed overnight.
That means fewer vaccines.
Less access to care.
More outbreaks of preventable diseas…
KV Cache Steering for Inducing Reasoning in Small Language Models
Max Belitsky, Dawid J. Kopiczko, Michael Dorkenwald, M. Jehanzeb Mirza, Cees G. M. Snoek, Yuki M. Asano
https://arxiv.org/abs/2507.08799
Multi-Fidelity Stochastic Trust Region Method with Adaptive Sampling
Yunsoo Ha, Juliane Mueller
https://arxiv.org/abs/2508.03901 https://arxiv.org/pdf/2508…
While we have the Corsa currently in for investigation into possible OBC failure, my wife's been driving the family estate, and has been complaining about the thickness of the steering wheel[1].
Extrapolating a bit, here's where I think we might be headed:
[1] A 2019 G31. I think subsequent models have got worse, so perhaps this extrapolation isn't far off?
#bmw
Can boundary configuration be tuned to optimize directional quantum steering harvesting?
Xiao-Li Huang, Xiao-Ying Jiang, Yu-Xuan Wang, Si-Yu Liu, Zejun Wang, Shu-Min Wu
https://arxiv.org/abs/2506.18734
Crosslisted article(s) found for eess.SP. https://arxiv.org/list/eess.SP/new
[1/1]:
- Gaussian Process Regression of Steering Vectors With Physics-Aware Deep Composite Kernels for Aug...
Di Carlo, Shoichi, Arie, Mathieu, Yoshiaki, Kazuyoshi
Multi-Functional Metasurfaces with M-Type Ferrites: Shaping the Future of mmWave Absorption and Beam Steering
Nohgyeom Ha, Horim Lee, Min Jang, Gyoungdeuk Kim, Hoyong Kim, Byeongjin Park, Manos M. Tentzeris, Sangkil Kim
https://arxiv.org/abs/2506.23240
FLORES: A Reconfigured Wheel-Legged Robot for Enhanced Steering and Adaptability
Zhicheng Song, Jinglan Xu, Chunxin Zheng, Yulin Li, Zhihai Bi, Jun Ma
https://arxiv.org/abs/2507.22345
Directional Flow of Confined Polaritons in CrSBr
Pratap Chandra Adak, Sichao Yu, Jaime Abad-Arredondo, Biswajit Datta, Andy Cruz, Sorah Fischer, Kseniia Mosina, Zden\v{e}k Sofer, Antonio I. Fern\'andez-Dom\'inguez, Francisco J. Garc\'ia-Vidal, Vinod M. Menon
https://arxiv.org/abs/2507.04367
Reducing Motion Sickness in Passengers of Autonomous Personal Mobility Vehicles by Presenting a Driving Path
Yuya Ide, Hailong Liu, Takahiro Wada
https://arxiv.org/abs/2506.23457 …
Self-Steering Deep Non-Linear Spatially Selective Filters for Efficient Extraction of Moving Speakers under Weak Guidance
Jakob Kienegger, Alina Mannanova, Huajian Fang, Timo Gerkmann
https://arxiv.org/abs/2507.02791
Sources: Apple is locked in last-minute EU negotiations over App Store changes to avoid fines set for this week, and is expected to offer "steering" concessions (Barbara Moens/Financial Times)
https://www.ft.com/content/b5d51870-e864-4aa5-b998-c0d2994a7e2…
Initiator of the #NixOS documentation team is stepping down:
https://discourse.nixos.org/t/the-next-chapter-in-nix-documentation/68425
Personality as a Probe for LLM Evaluation: Method Trade-offs and Downstream Effects
Gunmay Handa, Zekun Wu, Adriano Koshiyama, Philip Treleaven
https://arxiv.org/abs/2509.04794 …
Parameter Tuning Under Uncertain Road Perception in Driver Assistance Systems
Leon Greiser, Christian Rathgeber, Vladislav Nenchev, S\"oren Hohmann
https://arxiv.org/abs/2509.03694
Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
Baixiang Huang, Zhen Tan, Haoran Wang, Zijie Liu, Dawei Li, Ali Payani, Huan Liu, Tianlong Chen, Kai Shu
https://arxiv.org/abs/2506.20606
Crosslisted article(s) found for cs.SD. https://arxiv.org/list/cs.SD/new
[1/1]:
- Gaussian Process Regression of Steering Vectors With Physics-Aware Deep Composite Kernels for Aug...
Di Carlo, Shoichi, Arie, Mathieu, Yoshiaki, Kazuyoshi
Optimization of Radar Search Patterns for Multiple Scanning Missions in Localized Clutter
Yann Briheche (LS2N, LS2N - \'equipe ReV), Fr\'ed\'eric Barbaresco (LS2N, LS2N - \'equipe ReV), Fouad Bennis (LS2N, LS2N - \'equipe ReV), Damien Chablat (LS2N, LS2N - \'equipe RoMas)
https://arxiv.org/abs/2508.02081
Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design
Yuhao Sun, Yihua Zhang, Gaowen Liu, Hongtao Xie, Sijia Liu
https://arxiv.org/abs/2508.10065
DynaGuide: Steering Diffusion Polices with Active Dynamic Guidance
Maximilian Du, Shuran Song
https://arxiv.org/abs/2506.13922 https://
Numerical Techniques for the Maximum Likelihood Toeplitz Covariance Matrix Estimation: Part I. Symmetric Toeplitz Matrices
Yuri Abramovich, Victor Abramovich, Tanit Pongsiri
https://arxiv.org/abs/2507.01230
Algorithmic Collective Action with Multiple Collectives
Claudio Battiloro, Pietro Greiner, Bret Nestor, Oumaima Amezgar, Francesca Dominici
https://arxiv.org/abs/2508.19149 http…
Multilingual Political Views of Large Language Models: Identification and Steering
Daniil Gurgurov, Katharina Trinley, Ivan Vykopal, Josef van Genabith, Simon Ostermann, Roberto Zamparelli
https://arxiv.org/abs/2507.22623
Modeling and Control of AWOISV: A Filtered Tube-Based MPC Approach for Simultaneous Tracking of Lateral Position and Heading Angle
Xu Yang, Jun Ni, Hengyang Feng, Feiyu Wang, Tiezhen Wang
https://arxiv.org/abs/2508.13457
SHAMaNS: Sound Localization with Hybrid Alpha-Stable Spatial Measure and Neural Steerer
Diego Di Carlo (RIKEN AIP), Mathieu Fontaine (LTCI, IP Paris), Aditya Arie Nugraha (RIKEN AIP), Yoshiaki Bando (RIKEN AIP), Kazuyoshi Yoshii
https://arxiv.org/abs/2506.18954
STARE at the Structure: Steering ICL Exemplar Selection with Structural Alignment
Jiaqian Li, Qisheng Hu, Jing Li, Wenya Wang
https://arxiv.org/abs/2508.20944 https://
A Wideband Holographic Array with Azimuth and Elevation Beam Steering for 5G/6G Applications
Hazhir Mohammadi, Amir Saman Nooramin, Homayoon Oraizi
https://arxiv.org/abs/2507.11697
Hierarchical Decision-Making for Autonomous Navigation: Integrating Deep Reinforcement Learning and Fuzzy Logic in Four-Wheel Independent Steering and Driving Systems
Yizhi Wang, Degang Xu, Yongfang Xie, Shuzhong Tan, Xianan Zhou, Peng Chen
https://arxiv.org/abs/2508.16574
ConamArray: A 32-Element Broadband MEMS Ultrasound Transducer Array
Dennis Laurijssen, Rens Baeyens, Walter Daems, Jan Steckel
https://arxiv.org/abs/2509.01372 https://
Continuously Steering LLMs Sensitivity to Contextual Knowledge with Proxy Models
Yilin Wang, Heng Wang, Yuyang Bai, Minnan Luo
https://arxiv.org/abs/2508.19720 https://
Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars
Mattia Piccinini, Aniello Mungiello, Georg Jank, Gastone Pietro Rosati Papini, Francesco Biral, Johannes Betz
https://arxiv.org/abs/2507.20427
Enhancing Cross-task Transfer of Large Language Models via Activation Steering
Xinyu Tang, Zhihao Lv, Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao, Zujie Wen, Zhiqiang Zhang, Jun Zhou
https://arxiv.org/abs/2507.13236
Breaking the Mirror: Activation-Based Mitigation of Self-Preference in LLM Evaluators
Dani Roytburg, Matthew Bozoukov, Matthew Nguyen, Jou Barzdukas, Simon Fu, Narmeen Oozeer
https://arxiv.org/abs/2509.03647
The Constitutional Controller: Doubt-Calibrated Steering of Compliant Agents
Simon Kohaut, Felix Divo, Navid Hamid, Benedict Flade, Julian Eggert, Devendra Singh Dhami, Kristian Kersting
https://arxiv.org/abs/2507.15478
On Kinodynamic Global Planning in a Simplicial Complex Environment: A Mixed Integer Approach
Otobong Jerome, Alexandr Klimchik, Alexander Maloletov, Geesara Kulathunga
https://arxiv.org/abs/2508.16511 …
Automating Steering for Safe Multimodal Large Language Models
Lyucheng Wu, Mengru Wang, Ziwen Xu, Tri Cao, Nay Oo, Bryan Hooi, Shumin Deng
https://arxiv.org/abs/2507.13255
Steering Robots with Inference-Time Interactions
Yanwei Wang
https://arxiv.org/abs/2506.14287 https://arxiv.org/pdf/2506.14287…
SafeConstellations: Steering LLM Safety to Reduce Over-Refusals Through Task-Specific Trajectory
Utsav Maskey, Sumit Yadav, Mark Dras, Usman Naseem
https://arxiv.org/abs/2508.11290
Latent Policy Steering with Embodiment-Agnostic Pretrained World Models
Yiqi Wang, Mrinal Verghese, Jeff Schneider
https://arxiv.org/abs/2507.13340 https:/…
Unveiling the Influence of Amplifying Language-Specific Neurons
Inaya Rahmanisa, Lyzander Marciano Andrylie, Krisna Mahardika Ihsani, Alfan Farizki Wicaksono, Haryo Akbarianto Wibowo, Alham Fikri Aji
https://arxiv.org/abs/2507.22581
Turning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety Injection
Harethah Abu Shairah, Hasan Abed Al Kader Hammoud, George Turkiyyah, Bernard Ghanem
https://arxiv.org/abs/2508.20766