
2025-06-06 13:56:00
This story of an ex-Googler is well worth reading. After some personal observations, @… asks interesting questions:
What is crypto mining if not a textbook Captain Planet villain scheme—to kill and raze and destroy for nothing but imaginary tokens proving that you did lots of killing and razing and …
OpenAI says the tokenized OpenAI shares Robinhood has started offering are not equity: "We did not partner with Robinhood...and do not endorse it" (MacKenzie Sigalos/CNBC)
https://www.cnbc.com/2025/07/02/openai-robinhood-tokens.html
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
Purbesh Mitra, Sennur Ulukus
https://arxiv.org/abs/2507.02851 https://a…
This https://arxiv.org/abs/2410.11295 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
Nim on Integer Partitions and Hyperrectangles
Eric Gottlieb, Matja\v{z} Krnc, Peter Mur\v{s}i\v{c}
https://arxiv.org/abs/2506.04991 https://
FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens
Yiming Zhong, Yumeng Liu, Chuyang Xiao, Zemin Yang, Youzhuo Wang, Yufei Zhu, Ye Shi, Yujing Sun, Xinge Zhu, Yuexin Ma
https://arxiv.org/abs/2506.01583
Spring Secret Starter: Managing #Secrets in Your #SpringBoot App
https://lucas-fern…
This https://arxiv.org/abs/2506.02867 has been replaced.
link: https://scholar.google.com/scholar?q=a
TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents
Hyundong Jin, Sicheol Sung, Shinwoo Park, SeungYeop Baik, Yo-Sub Han
https://arxiv.org/abs/2506.00089
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Shenzhi Wang, Le Yu, Chang Gao, Chujie Zheng, Shixuan Liu, Rui Lu, Kai Dang, Xionghui Chen, Jianxin Yang, Zhenru Zhang, Yuqiong Liu, An Yang, Andrew Zhao, Yang Yue, Shiji Song, Bowen Yu, Gao Huang, Junyang Lin
https://arx…
Token Communication in the Era of Large Models: An Information Bottleneck-Based Approach
Hao Wei, Wanli Ni, Wen Wang, Wenjun Xu, Dusit Niyato, Ping Zhang
https://arxiv.org/abs/2507.01728
Fast and Simplex: 2-Simplicial Attention in Triton
Aurko Roy, Timothy Chou, Sai Surya Duvvuri, Sijia Chen, Jiecao Yu, Xiaodong Wang, Manzil Zaheer, Rohan Anil
https://arxiv.org/abs/2507.02754
EARN: Efficient Inference Acceleration for LLM-based Generative Recommendation by Register Tokens
Chaoqun Yang, Xinyu Lin, Wenjie Wang, Yongqi Li, Teng Sun, Xianjing Han, Tat-Seng Chua
https://arxiv.org/abs/2507.00715
Controllable Text-to-Speech Synthesis with Masked-Autoencoded Style-Rich Representation
Yongqi Wang, Chunlei Zhang, Hangting Chen, Zhou Zhao, Dong Yu
https://arxiv.org/abs/2506.02997
Founders Fund-backed Ondo Finance and Pantera Capital launch a $250M fund to invest in real-world asset tokenization projects via equity stakes and tokens (Lucinda Shen/Axios)
https://www.axios.com/pro/fintech-deals/2025/07/03/ondo-…
Singularity Protocol for Cross Chain AMM without Intermediate Tokens or Bridges
Sumit Vohra
https://arxiv.org/abs/2505.24337 https://…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Info...
Li, Shi, Gao, Liu, Wang, Chen, Liu, Zhao, Wang, Metaxas
Speculative Decoding via Hybrid Drafting and Rollback-Aware Branch Parallelism
Yuhao Shen, Junyi Shen, Quan Kong, Tianyu Liu, Yao Lu, Cong Wang
https://arxiv.org/abs/2506.01979
DiffSoundStream: Efficient Speech Tokenization via Diffusion Decoding
Yang Yang, Yunpeng Li, George Sung, Shao-Fu Shih, Craig Dooley, Alessio Centazzo, Ramanan Rajeswaran
https://arxiv.org/abs/2506.22362
"Some AI prompts could cause 50 times more CO₂ emissions than others, researchers find"
#AI #ArtificialIntelligence #Technology
Nets-within-Nets through the Lens of Data Nets
Francesco Di Cosmo, Soumodev Mal, Tephilla Prince
https://arxiv.org/abs/2506.22344 https://
A Midsummer Meme's Dream: Investigating Market Manipulations in the Meme Coin Ecosystem
Alberto Maria Mongardini, Alessandro Mei
https://arxiv.org/abs/2507.01963
I have released LLama2.c64 - an LLM running on a C64 with 2MB REU. It runs the Llama2 LLM architecture, using the tokenizer and weights from the Tinystories 260K model.
It's a storytelling model that tries its best to spin your prompt into a story, as if told by a kindergarten child. It will generate one output token about every 8 minutes.
…
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation
Yakun Song, Jiawei Chen, Xiaobin Zhuang, Chenpeng Du, Ziyang Ma, Jian Wu, Jian Cong, Dongya Jia, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen
https://arxiv.org/abs/2506.00385
📉 Token That’s Literally USELESS Is Crypto’s Latest Meme Cult
https://www.coindesk.com/markets/2025/06/18/token-that-s-literally-useless-is-crypto-s-latest-meme-cult
This https://arxiv.org/abs/2502.13943 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
Vision-Aided ISAC in Low-Altitude Economy Networks via De-Diffused Visual Priors
Yulan Gao, Ziqiang Ye, Zhonghao Lyu, Ming Xiao, Yue Xiao, Ping Yang, Agata Manolova
https://arxiv.org/abs/2507.01574
This https://arxiv.org/abs/2505.19669 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
Big drama over Discord warning a no-code platform called BotGhost that it will be kicked off unless it finds a new way to operate, due to a security vulnerability that has now been fixed.
https://update.botghost.com/#tldr-what-s-happening-with-botghost-and-discord…
Quantize-Sample-and-Verify: LLM Acceleration via Adaptive Edge-Cloud Speculative Decoding
Guangyi Zhang, Yunlong Cai, Guanding Yu, Petar Popovski, Osvaldo Simeone
https://arxiv.org/abs/2507.00605
This https://arxiv.org/abs/2502.09891 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
This https://arxiv.org/abs/2505.22784 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_eco…
When Blockchain Meets Crawlers: Real-time Market Analytics in Solana NFT Markets
Chengxin Shen, Zhongwen Li, Xiaoqi Li, Zongwei Li
https://arxiv.org/abs/2506.02892
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
Chong Zeng, Yue Dong, Pieter Peers, Hongzhi Wu, Xin Tong
https://arxiv.org/abs/2505.21925
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective
Yifan Zhong, Fengshuo Bai, Shaofei Cai, Xuchuan Huang, Zhang Chen, Xiaowei Zhang, Yuanfei Wang, Shaoyang Guo, Tianrui Guan, Ka Nam Lui, Zhiquan Qi, Yitao Liang, Yuanpei Chen, Yaodong Yang
https://arxiv.org/abs/2507.01925
HyperCLOVA X THINK Technical Report
NAVER Cloud HyperCLOVA X Team
https://arxiv.org/abs/2506.22403 https://arxiv.org/pdf/2506.22403…
Addressing tokens dynamic generation, propagation, storage and renewal to secure the GlideinWMS pilot based jobs and system
Bruno Moreira Coimbra, Marco Mambelli
https://arxiv.org/abs/2506.07379
Intellectual Property Rights and Entrepreneurship in the NFT Ecosystem: Legal Frameworks, Business Models, and Innovation Opportunities
Pranav Darshan, Rohan J S, Raghuveer Rajesh, Ruchitha M, Sanika Kamath, Manas M N
https://arxiv.org/abs/2507.00172
Trump has promoted his private businesses in unprecedented ways for a sitting president, presenting conflicts of interest.
His ventures into crypto, in particular, have drawn scrutiny because he has simultaneously moved to create a more friendly regulatory environment for the industry.
The $57 million from his stake in the crypto firm World Liberty Financial came from the sale of digital tokens.
That provides a glimpse into Trump’s earnings from cryptocurrency ventures t…
This https://arxiv.org/abs/2406.05298 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
$IF-GUIDE$: Influence Function-Guided Detoxification of LLMs
Zachary Coalson, Juhan Bae, Nicholas Carlini, Sanghyun Hong
https://arxiv.org/abs/2506.01790 h…
UAE-based Aqua 1 Foundation buys $100M worth of tokens from Trump's World Liberty Financial, becoming its largest individual investor ahead of Justin Sun (Muyao Shen/Bloomberg)
https://www.bloomberg.com/news/articles/20
This https://arxiv.org/abs/2503.01148 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qfi…
Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives
Ander Artola Velasco, Stratis Tsirtsis, Nastaran Okati, Manuel Gomez-Rodriguez
https://arxiv.org/abs/2505.21627
FUSE: Universal Speech Enhancement using Multi-Stage Fusion of Sparse Compression and Token Generation Models for the URGENT 2025 Challenge
Nabarun Goswami, Tatsuya Harada
https://arxiv.org/abs/2506.00809
This https://arxiv.org/abs/2505.19189 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
This https://arxiv.org/abs/2505.14759 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
When Attention is Beneficial for Learning Wireless Resource Allocation Efficiently?
Jia Guo, Chenyang Yang
https://arxiv.org/abs/2507.02427 https://…
Evasive Random Walks and the Clairvoyant Demon
Aaron Abrams, Henry Landau, Zeph Landau, James Pommersheim, Eric Zaslow
https://arxiv.org/abs/2506.21929 htt…
Breaking the Boundaries of Long-Context LLM Inference: Adaptive KV Management on a Single Commodity GPU
He Sun, Li Li, Mingjun Xiao, Chengzhong Xu
https://arxiv.org/abs/2506.20187
Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training
Pierre-Carl Langlais, Carlos Rosas Hinostroza, Mattia Nee, Catherine Arnett, Pavel Chizhov, Eliot Krzystof Jones, Ir\`ene Girard, David Mach, Anastasia Stasenko, Ivan P. Yamshchikov
https://arxiv.org/abs/2506.01732
Centre driven Controlled Evolution of Wireless Virtual Networks based on Broadcast Tokens
Vignesh Babu, Atishay Jain, Kannan Karthik
https://arxiv.org/abs/2506.16615
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
Zihan Wang, Rui Pan, Jiarui Yao, Robert Csordas, Linjie Li, Lu Yin, Jiajun Wu, Tong Zhang, Manling Li, Shiwei Liu
https://arxiv.org/abs/2506.18945
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation
Xingyang Li, Muyang Li, Tianle Cai, Haocheng Xi, Shuo Yang, Yujun Lin, Lvmin Zhang, Songlin Yang, Jinbo Hu, Kelly Peng, Maneesh Agrawala, Ion Stoica, Kurt Keutzer, Song Han
https://arxiv.org/abs/2506.19852…
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling
Anindita Ghosh, Bing Zhou, Rishabh Dabral, Jian Wang, Vladislav Golyanik, Christian Theobalt, Philipp Slusallek, Chuan Guo
https://arxiv.org/abs/2506.18680
CPN-Py: A Python-Based Tool for Modeling and Analyzing Colored Petri Nets
Alessandro Berti, Wil M. P. van der Aalst
https://arxiv.org/abs/2506.12238 https:…
StreamFlow: Streaming Flow Matching with Block-wise Guided Attention Mask for Speech Token Decoding
Dake Guo, Jixun Yao, Linhan Ma, Wang He, Lei Xie
https://arxiv.org/abs/2506.23986
Developers criticize Google for its decision to hide raw reasoning tokens, essential for debugging complex AI workflows, of its flagship model Gemini 2.5 Pro (Ben Dickson/VentureBeat)
https://venturebeat.com/ai/googles-gemin…
This https://arxiv.org/abs/2505.19462 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Your Token Becomes Worthless: Unveiling Rug Pull Schemes in Crypto Token via Code-and-Transaction Fusion Analysis
Hao Wu, Haijun Wang, Shangwang Li, Yin Wu, Ming Fan, Wuxia Jin, Yitao Zhao, Ting Liu
https://arxiv.org/abs/2506.18398
A Variational Framework for Improving Naturalness in Generative Spoken Language Models
Li-Wei Chen, Takuya Higuchi, Zakaria Aldeneh, Ahmed Hussen Abdelaziz, Alexander Rudnicky
https://arxiv.org/abs/2506.14767
This https://arxiv.org/abs/2505.08944 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDC_…
Low-Complexity Semantic Packet Aggregation for Token Communication via Lookahead Search
Seunghun Lee, Jihong Park, Jinho Choi, Hyuncheol Park
https://arxiv.org/abs/2506.19451
Act-With-Think: Chunk Auto-Regressive Modeling for Generative Recommendation
Yifan Wang, Weinan Gan, Longtao Xiao, Jieming Zhu, Heng Chang, Haozhao Wang, Rui Zhang, Zhenhua Dong, Ruiming Tang, Ruixuan Li
https://arxiv.org/abs/2506.23643
Split the Yield, Share the Risk: Pricing, Hedging and Fixed rates in DeFi
Viraj Nadkarni, Pramod Viswanath
https://arxiv.org/abs/2505.22784 https://…
Discrete Audio Tokens: More Than a Survey!
Pooneh Mousavi, Gallil Maimon, Adel Moumen, Darius Petermann, Jiatong Shi, Haibin Wu, Haici Yang, Anastasia Kuznetsova, Artem Ploujnikov, Ricard Marxer, Bhuvana Ramabhadran, Benjamin Elizalde, Loren Lugosch, Jinyu Li, Cem Subakan, Phil Woodland, Minje Kim, Hung-yi Lee, Shinji Watanabe, Yossi Adi, Mirco Ravanelli
Can Sound Replace Vision in LLaVA With Token Substitution?
Ali Vosoughi, Jing Bi, Pinxin Liu, Yunlong Tang, Chenliang Xu
https://arxiv.org/abs/2506.10416 h…
BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning
Hongyi Zhou, Weiran Liao, Xi Huang, Yucheng Tang, Fabian Otto, Xiaogang Jia, Xinkai Jiang, Simon Hilber, Ge Li, Qian Wang, \"Omer Erdin\c{c} Ya\u{g}murlu, Nils Blank, Moritz Reuss, Rudolf Lioutikov
https://arxiv.org/abs/2506.06072
A Note on Reconfiguration Graphs of Cliques
Quan N. Lam, Huu An Phan, Duc A. Hoang
https://arxiv.org/abs/2506.07821 https://arxiv.org…
Information-Theoretic Detection of Unusual Source Code Changes
Adriano Torres, Sebastian Baltes, Christoph Treude, Markus Wagner
https://arxiv.org/abs/2506.06508
China's MiniMax open sources MiniMax-M1, a model to handle complicated productivity tasks that supports 1M input tokens and it says surpasses DeepSeek's R1-0528 (Bloomberg)
https://www.bloomberg.com/news/articles/20
CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer
Daiki Takeuchi, Binh Thien Nguyen, Masahiro Yasuda, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada
https://arxiv.org/abs/2506.00800
A Simple Contrastive Framework Of Item Tokenization For Generative Recommendation
Penglong Zhai, Yifang Yuan, Fanyi Di, Jie Li, Yue Liu, Chen Li, Jie Huang, Sicong Wang, Yao Xu, Xin Li
https://arxiv.org/abs/2506.16683
Utility-Driven Speculative Decoding for Mixture-of-Experts
Anish Saxena, Po-An Tsai, Hritvik Taneja, Aamer Jaleel, Moinuddin Qureshi
https://arxiv.org/abs/2506.20675
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[9/11]:
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space...
Yuri Kuratov, Mikhail Arkhipov, Aydar Bulatov, Mikhail Burtsev
HLTCOE at LiveRAG: GPT-Researcher using ColBERT retrieval
Kevin Duh, Eugene Yang, Orion Weller, Andrew Yates, Dawn Lawrie
https://arxiv.org/abs/2506.22356 …
IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling
Kuan-Po Huang, Shu-wen Yang, Huy Phan, Bo-Ru Lu, Byeonggeun Kim, Sashank Macha, Qingming Tang, Shalini Ghosh, Hung-yi Lee, Chieh-Chi Kao, Chao Wang
https://arxiv.org/abs/2506.00736
Harvard releases Institutional Books 1.0, a dataset for AI researchers with 242B tokens, from 394M scanned pages and 983K public domain books in 254 languages (Matt O'Brien/Associated Press)
https://apnews.com/article/ai-chatbot-
This https://arxiv.org/abs/2502.18200 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
HASRD: Hierarchical Acoustic and Semantic Representation Disentanglement
Amir Hussein, Sameer Khurana, Gordon Wichern, Francois G. Germain, Jonathan Le Roux
https://arxiv.org/abs/2506.00843
Corrector Sampling in Language Models
Itai Gat, Neta Shaul, Uriel Singer, Yaron Lipman
https://arxiv.org/abs/2506.06215 https://arxiv…
This https://arxiv.org/abs/2409.15104 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDC_…
Detecting Hard-Coded Credentials in Software Repositories via LLMs
Chidera Biringa, Gokhan Kul
https://arxiv.org/abs/2506.13090 https://
OpenAI debuts o3-pro for ChatGPT Pro and Team users and in the API, costing $20/1M input and $80/1M output tokens; Enterprise and Edu will get access next week (Kyle Wiggers/TechCrunch)
https://techcrunch.com/2025/06/10/open
This https://arxiv.org/abs/2505.21700 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
AMPLIFY: Actionless Motion Priors for Robot Learning from Videos
Jeremy A. Collins, Lor\'and Cheng, Kunal Aneja, Albert Wilcox, Benjamin Joffe, Animesh Garg
https://arxiv.org/abs/2506.14198
CodecSlime: Temporal Redundancy Compression of Neural Speech Codec via Dynamic Frame Rate
Hankun Wang, Yiwei Guo, Chongtian Shao, Bohan Li, Xie Chen, Kai Yu
https://arxiv.org/abs/2506.21074
OpenAI announces an 80% price drop for its o3 model and a "flex" mode for synchronous processing that charges $5 for input and $20 for output per million tokens (Carl Franzen/VentureBeat)
https://venturebeat.com/ai/openai-anno
Smooth Operators: LLMs Translating Imperfect Hints into Disfluency-Rich Transcripts
Duygu Altinok
https://arxiv.org/abs/2506.18510 https://
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[3/4]:
- Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency
Chenlong Wang, Yuanning Feng, Dongping Chen, Zhaoyang Chu, Ranjay Krishna, Tianyi Zhou
Detecting Hard-Coded Credentials in Software Repositories via LLMs
Chidera Biringa, Gokhan Kul
https://arxiv.org/abs/2506.13090 https://
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
Mathurin Videau, Badr Youbi Idrissi, Alessandro Leite, Marc Schoenauer, Olivier Teytaud, David Lopez-Paz
https://arxiv.org/abs/2506.14761
Generating Long Semantic IDs in Parallel for Recommendation
Yupeng Hou, Jiacheng Li, Ashley Shin, Jinsung Jeon, Abhishek Santhanam, Wei Shao, Kaveh Hassani, Ning Yao, Julian McAuley
https://arxiv.org/abs/2506.05781
This https://arxiv.org/abs/2505.17282 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention
Yeonju Ro, Zhenyu Zhang, Souvik Kundu, Zhangyang Wang, Aditya Akella
https://arxiv.org/abs/2506.09316
This https://arxiv.org/abs/2506.06266 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2506.01790 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…