Neologism Learning for Controllability and Self-VerbalizationJohn Hewitt, Oyvind Tafjord, Robert Geirhos, Been Kimhttps://arxiv.org/abs/2510.08506 https://
Neologism Learning for Controllability and Self-VerbalizationHumans invent new words when there is a rising demand for a new useful concept (e.g., doomscrolling). We explore and validate a similar idea in our communication with LLMs: introducing new words to better understand and control the models, expanding on the recently introduced neologism learning. This method introduces a new word by adding a new word embedding and training with examples that exhibit the concept with no other changes in model parameters. We show that adding a new word allows for …