“The theories of textuality, discourse, and visuality at the heart of cultural theory remain largely unresponsive to the sonic, failing to confront the powerful, asignifying materiality that characterizes so much experimental work with sound.”
High-Throughput Mapping of Magnetic Properties via the on-the-fly XMCD spectroscopy in a Combinatorial Fe-Co-Ni Film
Y. Yamasaki, N. Sasabe, Y. Ishii, Y. Sekiguchi, A. Sumiyoshiya, Y. Tanimoto, Y. Kotani, T. Nakamura, H. Nomura
https://arxiv.org/abs/2506.20958
Data Efficacy for Language Model Training
Yalun Dai, Yangyu Huang, Xin Zhang, Wenshan Wu, Chong Li, Wenhui Lu, Shijie Cao, Li Dong, Scarlett Li
https://arxiv.org/abs/2506.21545 https://arxiv.org/pdf/2506.21545 https://arxiv.org/html/2506.21545
arXiv:2506.21545v1 Announce Type: new
Abstract: Data is fundamental to the training of language models (LM). Recent research has been dedicated to data efficiency, which aims to maximize performance by selecting a minimal or optimal subset of training data. Techniques such as data filtering, sampling, and selection play a crucial role in this area. To complement it, we define Data Efficacy, which focuses on maximizing performance by optimizing the organization of training data and remains relatively underexplored. This work introduces a general paradigm, DELT, for considering data efficacy in LM training, which highlights the significance of training data organization. DELT comprises three components: Data Scoring, Data Selection, and Data Ordering. Among these components, we design Learnability-Quality Scoring (LQS), as a new instance of Data Scoring, which considers both the learnability and quality of each data sample from the gradient consistency perspective. We also devise Folding Ordering (FO), as a novel instance of Data Ordering, which addresses issues such as model forgetting and data distribution bias. Comprehensive experiments validate the data efficacy in LM training, which demonstrates the following: Firstly, various instances of the proposed DELT enhance LM performance to varying degrees without increasing the data scale and model size. Secondly, among these instances, the combination of our proposed LQS for data scoring and Folding for data ordering achieves the most significant improvement. Lastly, data efficacy can be achieved together with data efficiency by applying data selection. Therefore, we believe that data efficacy is a promising foundational area in LM training.
toXiv_bot_toot
Optimization of Flying Ad Hoc Network Topology and Collaborative Path Planning for Multiple UAVs
Ming He, Peizhao Wang, Haihua Chen, Bin Sun, Hongpeng Wang
https://arxiv.org/abs/2506.17945
We’re on a terrible path that we aren’t even aware exists: collapsing the marine food chain.
The End Permian event did that. It was ugly. Lots of fungal spores in the fossil record, not a lot of bones. Barely anything survived.
We’re having issues with phytoplankton in the #GreatLakes too. Invasive mussels have done huge damage to how the food chain (Esp. in
Steelers DB, West Virginia alum Beanie Bishop disrespects Pitt logo following recent practice
https://www.cbssports.com/nfl/news/steeler
Does someone know how to change the port for an SSH based cache?
The `user@host:port` pattern doesn't work on my machine.
https://discourse.nixos.org/t/how-to-change-port-for-ssh-based-cache/63908
🇺🇦 #NowPlaying on #BBC6Music's #LaurenLaverne
Falle Nioke:
🎵 Falle Le Le Le
#FalleNioke
https://fallenioke.bandcamp.com/track/falle-le-le-le
https://open.spotify.com/track/3ImVZoYdSRJdXHpTXaZweG
Fe contribution to the magnetic anisotropy of $L{1_0}$-ordered FePt thin films studied by angle-dependent x-ray magnetic circular dichroism
Goro Shibata, Keisuke Ikeda, Takeshi Seki, Shoya Sakamoto, Yosuke Nonaka, Zhendong Chi, Yuxuan Wan, Masahiro Suzuki, Tsuneharu Koide, Hiroki Wadati, Koki Takanashi, Atsushi Fujimori
https://…