Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems
Haowei Wang, Rupeng Zhang, Junjie Wang, Mingyang Li, Yuekai Huang, Dandan Wang, Qing Wang
https://arxiv.org/abs/2506.06151
DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation
Jingyu Xiao, Ming Wang, Man Ho Lam, Yuxuan Wan, Junliang Liu, Yintong Huo, Michael R. Lyu
https://arxiv.org/abs/2506.06251
ENMA: Tokenwise Autoregression for Generative Neural PDE Operators
Armand Kassa\"i Koupa\"i, Lise Le Boudec, Louis Serrano, Patrick Gallinari
https://arxiv.org/abs/2506.06158
Cloudflare open sourced an OAuth library mostly written by Claude, showing how AI handles mechanical implementation while humans guide with context and judgment (Max Mitchell)
https://www.maxemitchell.com/writings/i-read-all-of-cloudflares…
The monster isn't in the machine. It's in the mirror
https://brilliantcrank.com/every-generation-discovers-the-same-monster/?ref=brilliantcrank-newsletter
via @…
SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code
Xinghang Li, Jingzhe Ding, Chao Peng, Bing Zhao, Xiang Gao, Hongwan Gao, Xinchen Gu
https://arxiv.org/abs/2506.05692
Leveraging Generative AI for Enhancing Automated Assessment in Programming Education Contests
Stefan Dascalescu, Adrian Marius Dumitran, Mihai Alexandru Vasiluta
https://arxiv.org/abs/2506.05990
Deployability-Centric Infrastructure-as-Code Generation: An LLM-based Iterative Framework
Tianyi Zhang, Shidong Pan, Zejun Zhang, Zhenchang Xing, Xiaoyu Sun
https://arxiv.org/abs/2506.05623