Tootfinder

Opt-in global Mastodon full text search. Join the index!

@sperbsen@discuss.systems
2025-10-18 05:41:13

So. Much. This.
hci.social/@chrisamaphone/1153

@publicvoit@graz.social
2025-10-16 21:20:48

A small number of samples can poison LLMs of any size:
anthropic.com/research/small-s
"In a joint study with the UK AI Security Institute and the Alan Turing Institute, we found that as few as 250 malicious documents can produce a "

@arXiv_csAI_bot@mastoxiv.page
2025-10-15 10:14:11

Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems
Jiaxin Gao, Chen Chen, Yanwen Jia, Xueluan Gong, Kwok-Yan Lam, Qian Wang
arxiv.org/abs/2510.12462

@ErikJonker@mastodon.social
2025-10-13 14:15:31

And more to read in your freetime, if you are interested in AI, from Bluesky, Phillip Isola.
"Over the past year, my lab has been working on fleshing out theory applications of the Platonic Representation Hypothesis.
Today I want to share two new works on this topic:"
Eliciting higher alignment:
arxiv.org/…

Racist language
allegedly used by leaders of
Young Republican groups in leaked chats
drew widespread condemnation from both sides of the political aisle Tuesday
and prompted the national youth organization to demand the resignations of those involved.
axios.co…

@arXiv_csDC_bot@mastoxiv.page
2025-10-14 10:02:18

An Explorative Study on Distributed Computing Techniques in Training and Inference of Large Language Models
Sheikh Azizul Hakim, Saem Hasan
arxiv.org/abs/2510.11211

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:08:58

KnowRL: Teaching Language Models to Know What They Know
Sahil Kale, Devendra Singh Dhami
arxiv.org/abs/2510.11407 arxiv.org/pdf/2510.11407

@frankel@mastodon.top
2025-10-11 16:30:04

Running on the battle-tested #Erlang virtual machine that powers planet-scale systems such as WhatsApp and Ericsson, #Gleam is ready for workloads of any size.
gleam…

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:36:20

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs
Zixin Zhang, Kanghao Chen, Xingwang Lin, Lutao Jiang, Xu Zheng, Yuanhuiyi Lyu, Litao Guo, Yinchuan Li, Ying-Cong Chen
arxiv.org/abs/2510.09507

@Techmeme@techhub.social
2025-10-09 17:40:43

A study finds that as few as 250 malicious documents can produce a "backdoor" vulnerability in an LLM, regardless of model size or training data volume (Anthropic)
anthropic.com/research/small-s

@arXiv_csRO_bot@mastoxiv.page
2025-10-09 09:19:41

UniFField: A Generalizable Unified Neural Feature Field for Visual, Semantic, and Spatial Uncertainties in Any Scene
Christian Maurer, Snehal Jauhri, Sophie Lueth, Georgia Chalvatzaki
arxiv.org/abs/2510.06754

@arXiv_csLG_bot@mastoxiv.page
2025-10-07 13:06:22

Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Jihoon Lee, Hoyeon Moon, Kevin Zhai, Arun Kumar Chithanar, Anit Kumar Sahu, Soummya Kar, Chul Lee, Souradip Chakraborty, Amrit Singh Bedi
arxiv.org/abs/2510.05040

@yaya@jorts.horse
2025-11-06 07:02:13

I gotta step up my Irish learning so we can do some irish anarchism todon.eu/@CrimethInc/115501256

@arXiv_condmatstatmech_bot@mastoxiv.page
2025-10-15 07:54:31

Algorithmic Temperature Induced by Adopted Regular Universal Turing Machine
Kentaro Imafuku
arxiv.org/abs/2510.11737 arxiv.org/pdf/2510.117…

@arXiv_csCL_bot@mastoxiv.page
2025-10-13 10:35:00

Active Model Selection for Large Language Models
Yavuz Durmazkeser, Patrik Okanovic, Andreas Kirsch, Torsten Hoefler, Nezihe Merve G\"urel
arxiv.org/abs/2510.09418

@arXiv_csHC_bot@mastoxiv.page
2025-10-09 07:58:11

A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants
Hans G. W. van Dam
arxiv.org/abs/2510.06223 arxiv.or…

@philip@mastodon.mallegolhansen.com
2025-11-06 16:45:06

@… I suspect the problem is that culture informs language, and in American English we aren't used to people taking the train many places.
I wonder what the Brits would say.
In Danish the word for "drove" would be appropriate. You could say "I drove to SFO" and it could equally mean by car, bus, or train without any further clarifi…

@arXiv_csIT_bot@mastoxiv.page
2025-10-10 07:36:58

Is star complexity a proxy for information based complexity of graphs?
Russell K. Standish
arxiv.org/abs/2510.07722 arxiv.org/pdf/2510.0772…

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:26:11

TTRV: Test-Time Reinforcement Learning for Vision Language Models
Akshit Singh, Shyam Marjit, Wei Lin, Paul Gavrikov, Serena Yeung-Levy, Hilde Kuehne, Rogerio Feris, Sivan Doveh, James Glass, M. Jehanzeb Mirza
arxiv.org/abs/2510.06783

@arXiv_csDS_bot@mastoxiv.page
2025-10-13 08:21:40

Improved Extended Regular Expression Matching
Philip Bille, Inge Li G{\o}rtz, Rikke Schjeldrup Jessen
arxiv.org/abs/2510.09311 arxiv.org/pd…

@mariyadelano@hachyderm.io
2025-11-13 18:17:50

It’s strange to watch the world ignore that it’s not just the medium that matters, but the message does too.
And by “the message” I mean both the ideas and the precise language or visuals used to communicate them.
Subtle differences in linguistic execution of the same idea in the same format can have radically opposite effects. The same applies to an image captured from a different angle or in a different style or in a different composition.
And yet it feels like most organizations and most people within them are determined to march on ignoring any consideration of subtlety and craft.
#writing #design #art #marketing

@markhburton@mstdn.social
2025-11-25 16:25:51

"AI in the guise of Machine Learning, Deep Learning, GenerativeAI (GenAI), or Large Language Models (LLMs)... can be very useful in certain application areas such as recognising or generating patterns in large data sets. However, their key drawback is that any correctness arguments will be inherently probabilistic as they are usually based on unknown data distributions and are therefore susceptible to errors (sometimes termed “hallucinations”). "

@hex@kolektiva.social
2025-11-28 17:19:57

RE: #fascism, especially American fascism, there's something you can do. If you're in the US, don't buy anything if you can until Dec 2nd. If you do have to buy something, buy second hand, or buy local. If you're outside, don't buy anything from the US or any US company at all.
Spread the word. Keep it on people's mind. Write your own post. Talk to people you know in person. Print out flyers and post them around town.
The system understands the language of money. If you want a response, you have to speak the language the system understands.
Trump is extremely vulnerable. Don't wait for the regime to recovery. Hit it hard right now, with an economic blockade. Who knows, it might just crumble.
#USPol

@karlauerbach@sfba.social
2025-09-27 07:12:49

I've been thinking about that stupid "Unified Executive" rubbish that the maga-klan likes to spout.
They argue that Art II Sect 1 sez "The executive Power shall be vested in a President of the United States of America." meaning that the entire executive, all of it, every bit, is carried within the human frame of the President.
OK.
That language does not admit of any possibility of delegation of any part of that executive authority to another. So if …

@jamesthebard@social.linux.pizza
2025-12-01 17:14:25
Content warning: Advent of Code - Day 1

Not a bad start, part 2 kinda kicked my ass because reading is really, really hard. I made the assumption that there wouldn't be any spins greater than 99 which was a horrible, horrible assumption to make.
However, not too difficult overall, happy enough with the solve.
#adventOfCode

The simple solve for Day 1, will probably do what I usually do and pick a second language to try and go with as the AoC continues.  The code is written in Python using VCS as the IDE.
@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:49:19

Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies
Chunsan Hong, Seonho An, Min-Soo Kim, Jong Chul Ye
arxiv.org/abs/2510.05725

@arXiv_csHC_bot@mastoxiv.page
2025-10-07 10:27:12

Observing Without Doing: Pseudo-Apprenticeship Patterns in Student LLM Use
Jade Hak, Nathaniel Lam Johnson, Matin Amoozadeh, Amin Alipour, Souti Chattopadhyay
arxiv.org/abs/2510.04986

@arXiv_csRO_bot@mastoxiv.page
2025-09-29 10:46:27

See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation
Chih Yao Hu, Yang-Sen Lin, Yuna Lee, Chih-Hai Su, Jie-Ying Lee, Shr-Ruei Tsai, Chin-Yang Lin, Kuan-Wen Chen, Tsung-Wei Ke, Yu-Lun Liu
arxiv.org/abs/2509.22653

@arXiv_csSE_bot@mastoxiv.page
2025-09-30 09:42:31

Satellite: Detecting and Analyzing Smart Contract Vulnerabilities caused by Subcontract Misuse
Zeqin Liao, Yuhong Nan, Zixu Gao, Henglong Liang, Sicheng Hao, Jiajing Wu, Zibin Zheng
arxiv.org/abs/2509.23679

@arXiv_csMA_bot@mastoxiv.page
2025-10-02 08:44:21

Stochastic Self-Organization in Multi-Agent Systems
Nurbek Tastan, Samuel Horvath, Karthik Nandakumar
arxiv.org/abs/2510.00685 arxiv.org/pd…

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:20:17

Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation
Ruoyu Chen, Xiaoqing Guo, Kangwei Liu, Siyuan Liang, Shiming Liu, Qunli Zhang, Hua Zhang, Xiaochun Cao
arxiv.org/abs/2509.22496

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:25:01

FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
Haotian Wu, Shufan Jiang, Chios Chen, Yiyang Feng, Hehai Lin, Heqing Zou, Yao Shu, Yanran Li, Chengwei Qin
arxiv.org/abs/2510.06800

@arXiv_csLG_bot@mastoxiv.page
2025-09-26 10:30:41

Go With The Flow: Churn-Tolerant Decentralized Training of Large Language Models
Nikolay Blagoev, Bart Cox, J\'er\'emie Decouchant, Lydia Y. Chen
arxiv.org/abs/2509.21221

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:52:50

Unsupervised Learning and Representation of Mandarin Tonal Categories by a Generative CNN
Kai Schenck, Ga\v{s}per Begu\v{s}
arxiv.org/abs/2509.17859

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 10:57:11

POVQA: Preference-Optimized Video Question Answering with Rationales for Data Efficiency
Ashim Dahal, Ankit Ghimire, Saydul Akbar Murad, Nick Rahimi
arxiv.org/abs/2510.01009

@arXiv_csCL_bot@mastoxiv.page
2025-10-02 10:41:31

Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs
Ziliang Wang, Kang An, Xuhui Zheng, Faqiang Qian, Weikun Zhang, Cijun Ouyang, Jialu Cai, Yuhang Wang, Yichao Wu
arxiv.org/abs/2510.00861

@arXiv_csCV_bot@mastoxiv.page
2025-09-22 10:37:11

Robust Vision-Language Models via Tensor Decomposition: A Defense Against Adversarial Attacks
Het Patel, Muzammil Allie, Qian Zhang, Jia Chen, Evangelos E. Papalexakis
arxiv.org/abs/2509.16163 …

@arXiv_csCL_bot@mastoxiv.page
2025-10-06 09:01:39

Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)
Moonkyung Ryu, Chih-Wei Hsu, Yinlam Chow, Mohammad Ghavamzadeh, Craig Boutilier
arxiv.org/abs/2510.02331

@arXiv_csCV_bot@mastoxiv.page
2025-09-23 13:06:41

WISE: Weak-Supervision-Guided Step-by-Step Explanations for Multimodal LLMs in Image Classification
Yiwen Jiang, Deval Mehta, Siyuan Yan, Yaling Shen, Zimu Wang, Zongyuan Ge
arxiv.org/abs/2509.17740

@arXiv_csCL_bot@mastoxiv.page
2025-09-22 10:21:01

Beyond the Score: Uncertainty-Calibrated LLMs for Automated Essay Assessment
Ahmed Karim (Judy), Qiao Wang (Judy), Zheng Yuan
arxiv.org/abs/2509.15926