I genuinely think whoever designed Man City's third kit should go to prison for the rest of their life #fedifc https://mastodon.social/@footiebuzz/115453337798882675
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Renjie Luo, Zichen Liu, Xiangyan Liu, Chao Du, Min Lin, Wenhu Chen, Wei Lu, Tianyu Pang
https://arxiv.org/abs/2509.22638
Student Engagement with GenAI's Tutoring Feedback: A Mixed Methods Study
Sven Jacobs, Jan Haas, Natalie Kiesler
https://arxiv.org/abs/2509.22974
Learning from Delayed Feedback in Games via Extra Prediction
Yuma Fujimoto, Kenshi Abe, Kaito Ariu
https://arxiv.org/abs/2509.22426 https://arxiv.org/pdf/2…
WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning
Zimu Lu, Houxing Ren, Yunqiao Yang, Ke Wang, Zhuofan Zong, Junting Pan, Mingjie Zhan, Hongsheng Li
https://arxiv.org/abs/2509.22644
Automated Formative Feedback for Short-form Writing: An LLM-Driven Approach and Adoption Analysis
Tiago Fernandes Tavares, Luciano Pereira Soares
https://arxiv.org/abs/2509.22734