Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csCL_bot@mastoxiv.page
2025-08-20 10:00:40

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Xiao Liang, Zhongzhi Li, Yeyun Gong, Yelong Shen, Ying Nian Wu, Zhijiang Guo, Weizhu Chen
arxiv.org/abs/2508.14029

@arXiv_csLG_bot@mastoxiv.page
2025-08-20 10:12:20

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration
Zhicheng Yang, Zhijiang Guo, Yinya Huang, Yongxin Wang, Dongchun Xie, Yiwei Wang, Xiaodan Liang, Jing Tang
arxiv.org/abs/2508.13755

@arXiv_csAI_bot@mastoxiv.page
2025-08-19 10:53:40

Reinforcement Learning with Rubric Anchors
Zenan Huang, Yihong Zhuang, Guoshan Lu, Zeyu Qin, Haokai Xu, Tianyu Zhao, Ru Peng, Jiaqi Hu, Zhanming Shen, Xiaomeng Hu, Xijun Gu, Peiyi Tu, Jiaxin Liu, Wenyu Chen, Yuzhuo Fu, Zhiting Fan, Yanmei Gu, Yuanyuan Wang, Zhengkai Yang, Jianguo Li, Junbo Zhao
arxiv.org/abs/2508.12790

@rasterweb@mastodon.social
2025-06-20 16:59:35

I tried out a sign handle but didn't like it so I started to design my own...
Mine is parametric and has a separate clamping piece that attaches with two 8-32 (or 4mm) bolts.
On a 256x256 print bed you can print one about 270mm (10.6") tall.
( Here's the one I originally tried:

Photo of a sign handle.
Render of a sign handle.
@arXiv_csRO_bot@mastoxiv.page
2025-07-18 09:23:22

LaViPlan : Language-Guided Visual Path Planning with RLVR
Hayeon Oh
arxiv.org/abs/2507.12911 arxiv.org/pdf/2507.12911…

@BBC6MusicBot@mastodonapp.uk
2025-08-19 23:09:18

🇺🇦 #NowPlaying on #BBC6Music's #6MusicsIndieForever
The Ting Tings:
🎵 Great DJ
#TheTingTings
rolivaroliva.bandcamp.com/trac
open.spotify.com/track/3kmSzP8

@bilbo_le_hobbit@mamot.fr
2025-06-17 23:44:12

Hello les gens ! Question #OpenStreetMap. J'ai déjŠ contribué modestement Š #OSM par le passé, mais lŠ , il y a un défi autrement plus complexe que j'aurais Š relever : Je souhaiterais ajouter une nouvelle voie publique ainsi qu'un nouvel équipement dans mon village, Š Guern. Il s'agit de l'…

@arXiv_csAI_bot@mastoxiv.page
2025-08-14 07:38:52

MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement
Weitao Jia, Jinghui Lu, Haiyang Yu, Siqi Wang, Guozhi Tang, An-Lan Wang, Weijie Yin, Dingkang Yang, Yuxiang Nie, Bin Shan, Hao Feng, Irene Li, Kun Yang, Han Wang, Jingqun Tang, Teng Fu, Changhong Jin, Chao Feng, Xiaohui Lv, Can Huang
arxiv.org/abs/2508.09670…

@rasterweb@mastodon.social
2025-06-12 18:58:55

You could 3D print a little sign holder handle...
➡️ printables.com/model/1248678-e

@arXiv_csAI_bot@mastoxiv.page
2025-08-19 11:16:00

G$^2$RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance
Yongxin Guo, Wenbo Deng, Zhenglin Cheng, Xiaoying Tang
arxiv.org/abs/2508.13023