Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csRO_bot@mastoxiv.page
2025-06-10 16:54:09

This arxiv.org/abs/2410.03483 has been replaced.
initial toot: mastoxiv.page/@arXiv_csRO_…

@arXiv_csLG_bot@mastoxiv.page
2025-07-11 10:23:11

Dynamic Chunking for End-to-End Hierarchical Sequence Modeling
Sukjun Hwang, Brandon Wang, Albert Gu
arxiv.org/abs/2507.07955 arxiv.org/pdf/2507.07955 arxiv.org/html/2507.07955
arXiv:2507.07955v1 Announce Type: new
Abstract: Despite incredible progress in language models (LMs) in recent years, largely resulting from moving away from specialized models designed for specific tasks to general models based on powerful architectures (e.g. the Transformer) that learn everything from raw data, pre-processing steps such as tokenization remain a barrier to true end-to-end foundation models. We introduce a collection of new techniques that enable a dynamic chunking mechanism which automatically learns content -- and context -- dependent segmentation strategies learned jointly with the rest of the model. Incorporating this into an explicit hierarchical network (H-Net) allows replacing the (implicitly hierarchical) tokenization-LM-detokenization pipeline with a single model learned fully end-to-end. When compute- and data- matched, an H-Net with one stage of hierarchy operating at the byte level outperforms a strong Transformer language model operating over BPE tokens. Iterating the hierarchy to multiple stages further increases its performance by modeling multiple levels of abstraction, demonstrating significantly better scaling with data and matching a token-based Transformer of twice its size. H-Nets pretrained on English show significantly increased character-level robustness, and qualitatively learn meaningful data-dependent chunking strategies without any heuristics or explicit supervision. Finally, the H-Net's improvement over tokenized pipelines is further increased in languages and modalities with weaker tokenization heuristics, such as Chinese and code, or DNA sequences (nearly 4x improvement in data efficiency over baselines), showing the potential of true end-to-end models that learn and scale better from unprocessed data.
toXiv_bot_toot

@arXiv_csNE_bot@mastoxiv.page
2025-06-10 08:02:42

Structured State Space Model Dynamics and Parametrization for Spiking Neural Networks
Maxime Fabre, Lyubov Dudchenko, Emre Neftci
arxiv.org/abs/2506.06374

@arXiv_astrophHE_bot@mastoxiv.page
2025-06-10 17:56:10

This arxiv.org/abs/2502.16626 has been replaced.
initial toot: mastoxiv.page/@arXiv_…

@arXiv_csIT_bot@mastoxiv.page
2025-06-10 16:45:59

This arxiv.org/abs/2412.07911 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIT_…

@arXiv_csSD_bot@mastoxiv.page
2025-06-09 07:58:32

WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction
Jakaria Islam Emon, Kazi Tamanna Alam, Md. Abu Salek
arxiv.org/abs/2506.05899

@arXiv_csRO_bot@mastoxiv.page
2025-06-10 16:45:29

This arxiv.org/abs/2409.10283 has been replaced.
initial toot: mastoxiv.page/@arXiv_csRO_…

@arXiv_csCL_bot@mastoxiv.page
2025-07-29 08:31:31

HITSZ's End-To-End Speech Translation Systems Combining Sequence-to-Sequence Auto Speech Recognition Model and Indic Large Language Model for IWSLT 2025 in Indic Track
Xuchen Wei, Yangxin Wu, Yaoyin Zhang, Henglyu Liu, Kehai Chen, Xuefeng Bai, Min Zhang
arxiv.org/abs/2507.19616

@arXiv_mathOC_bot@mastoxiv.page
2025-07-09 09:11:52

Nonstationary Distribution Estimation via Wasserstein Probability Flows
Edward J. Anderson, Dominic S. T. Keehan
arxiv.org/abs/2507.05893

@arXiv_mathLO_bot@mastoxiv.page
2025-08-05 07:56:10

Fibonacci Numbers and Model-Complete Axiomatization of Presburger Arithmetic Expanded with a Beatty Sequence
Mohsen Khani, Ali N. Valizadeh, Afshin Zarei
arxiv.org/abs/2508.02303

@arXiv_csDS_bot@mastoxiv.page
2025-07-08 10:10:40

Greedy Dynamic Matching
Nick Arnosti, Felipe Simon
arxiv.org/abs/2507.04551 arxiv.org/pdf/2507.04551

@arXiv_csIR_bot@mastoxiv.page
2025-06-09 07:42:12

Generating Long Semantic IDs in Parallel for Recommendation
Yupeng Hou, Jiacheng Li, Ashley Shin, Jinsung Jeon, Abhishek Santhanam, Wei Shao, Kaveh Hassani, Ning Yao, Julian McAuley
arxiv.org/abs/2506.05781

@arXiv_csSE_bot@mastoxiv.page
2025-08-04 09:33:30

MCeT: Behavioral Model Correctness Evaluation using Large Language Models
Khaled Ahmed, Jialing Song, Boqi Chen, Ou Wei, Bingzhou Zheng
arxiv.org/abs/2508.00630

@arXiv_csLO_bot@mastoxiv.page
2025-07-08 07:43:40

Omega-regular Verification and Control for Distributional Specifications in MDPs
S. Akshay (Dept of CSE, Indian Institute of Technology Bombay), Ouldouz Neysari (Singapore Management University, University of Tehran), {\DJ}or{\dj}e \v{Z}ikeli\'c (Singapore Management University)
arxiv.org/abs/2507.04286

@arXiv_condmatstrel_bot@mastoxiv.page
2025-07-09 08:50:52

Elementary Steps of Energy Conversion in Strongly Correlated Systems: Beyond Single Quasiparticles and Rigid Bands
V. Moshnyaga, Ch. Jooss P. E. Bl\"ochl, V. Bruchmann-Bamberg, A. Dehning, L. Allen-Rump, C. Hausmann, M. Kr\"uger, A. Rathnakaran, S. Rajpurohit, D. Steil, C. Flathmann, J. Hoffmann, M. Seibt, C. Volkert

@UP8@mastodon.social
2025-07-02 14:50:26

⛓️‍💥 Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations
#ai #llm

@arXiv_physicsbioph_bot@mastoxiv.page
2025-08-07 08:36:03

Probing the statistics of sequence-dependent DNA conformations in solution using SAXS
Heidar J. Koning, Anuradha Pullakhandam, Andrew E. Whitten, Charles S. Bond, Michel Peyrard
arxiv.org/abs/2508.04358

@arXiv_eessSY_bot@mastoxiv.page
2025-07-08 09:33:00

First Contact: Data-driven Friction-Stir Process Control
James Koch, Ethan King, WoongJo Choi, Megan Ebers, David Garcia, Ken Ross, Keerti Kappagantula
arxiv.org/abs/2507.03177

@arXiv_csCG_bot@mastoxiv.page
2025-07-08 07:33:19

Input-Sensitive Reconfiguration of Sliding Cubes
Hugo Akitaya, Matias Korman, Frederick Stock
arxiv.org/abs/2507.04170

@arXiv_csCE_bot@mastoxiv.page
2025-07-28 08:18:01

TrinityDNA: A Bio-Inspired Foundational Model for Efficient Long-Sequence DNA Modeling
Qirong Yang, Yucheng Guo, Zicheng Liu, Yujie Yang, Qijin Yin, Siyuan Li, Shaomin Ji, Linlin Chao, Xiaoming Zhang, Stan Z. Li
arxiv.org/abs/2507.19229

@arXiv_quantph_bot@mastoxiv.page
2025-06-03 08:11:21

hqQUBO: A Hybrid-querying Quantum Optimization Model Validated with 16-qubits on an Ion Trap Quantum Computer for Life Science Applications
Rong Chen, Quan-Xin Mei, Wen-Ding Zhao, Lin Yao, Hao-Xiang Yang, Shun-Yao Zhang, Jiao Chen, Hong-Lin Li
arxiv.org/abs/2506.01559

@arXiv_grqc_bot@mastoxiv.page
2025-06-05 09:54:45

This arxiv.org/abs/2409.03833 has been replaced.
initial toot: mastoxiv.page/@arXiv_grqc_…

@arXiv_csLG_bot@mastoxiv.page
2025-06-05 10:59:04

This arxiv.org/abs/2505.24293 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-03 17:30:50

This arxiv.org/abs/2501.18626 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_econEM_bot@mastoxiv.page
2025-07-01 09:03:23

Testing parametric additive time-varying GARCH models
Niklas Ahlgren, Alexander Back, Timo Ter\"asvirta
arxiv.org/abs/2506.23821

@arXiv_astrophIM_bot@mastoxiv.page
2025-06-04 07:46:17

Performance of the image persistence model for Euclid infrared detectors
B. Kubik, R. Barbier, G. Smadja, S. Ferriol, Y. Conseil, Y. Copin, W. Gillard, S. Dusini, K. Jahnke, E. Prieto, N. Auricchio, E. Balbi, A. Balestra, P. Battaglia, V. Capobianco, R. Chary, L. Corcione, F. Cogato, G. Delucchi, E. Franceschi, L. Gabarra, F. Gianotti, F. Grupp, E. Lentini, S. Ligori, E. Medinaceli, G. Morgante, K. Paterson, E. Romelli, L. Sauniere, M. Schirmer, C. Sirignano G. Testera, M. Trifoglio, A. Troja, L. Valenziano, M. Frailis, M. Scodeggio, J. -C. Barriere, M. Berthe, C. Bodendorf, A. Caillat, M. Carle, R. Casas, H. Cho, A. Costille, F. Ducret, B. Garilli, W. Holmes, F. Hormuth, A. Hornstrup, M. Jhabvala, R. Kohley, D. Le Mignant, P. B. Lilje, I. Lloro, C. Padilla, G. Polenta, J. -C. Salvignol, G. Seidel, B. Serra, A. Secroun, L. Stanco, R. Toledo-Moreo, S. Anselmi, E. Borsato, L. Caillat, C. Colodro-Conde, V. Conforti, J. E. Davies, A. Renzi, F. Dal Corso, S. Davini, A. Derosa, J. J. Diaz, S. Di Domizio, D. Di Ferdinando, R. Farinelli, A. G. Ferrari, F. Fornari, F. Giacomini, O. Krause, F. Laudisio, J. Macias-Perez, J. Marpaud, N. Mauri, R. da Silva, M. Niclas, F. Passalacqua, I. Risso, P. Lagier, A. N. Sorensen, P. Stassi, J. Steinwagner, M. Tenti, C. Thizy, S. Tosi, R. Travaglini, O. Tubio, C. Valieri, S. Ventura, C. Vescovi, J. Zoubian
#toXiv_bot_toot

@arXiv_csAR_bot@mastoxiv.page
2025-06-03 07:16:51

AI Accelerators for Large Language Model In-ference: Architecture Analysis and Scaling Strategies
Amit Sharma
arxiv.org/abs/2506.00008

@arXiv_eessAS_bot@mastoxiv.page
2025-08-05 08:39:50

Guiding an Automatic Speech Recognition Decoder Using Large Language Models
Eyal Cohen (Technion - Israel Institute of Technology), Bhiksha Raj (Carnegie Mellon University), Joseph Keshet (Technion - Israel Institute of Technology)
arxiv.org/abs/2508.02228

@arXiv_csDS_bot@mastoxiv.page
2025-08-07 08:22:33

Counting Distinct Square Substrings in Sublinear Time
Panagiotis Charalampopoulos, Manal Mohamed, Jakub Radoszewski, Wojciech Rytter, Tomasz Wale\'n, Wiktor Zuba
arxiv.org/abs/2508.03930

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:19:50

Monitoring Robustness and Individual Fairness
Ashutosh Gupta, Thomas A. Henzinger, Konstantin Kueffner, Kaushik Mallik, David Pape
arxiv.org/abs/2506.00496

@lysander07@sigmoid.social
2025-05-17 07:38:59

In our #ISE2025 lecture last Wednesday, we learned how in n-gram language models via Markov assumption and maximum likelihood estimation we can predict the probability of the occurrence of a word given a specific context (i.e. n words previous in the sequence of words).
#NLP

Slide from the Information Service Engineering 2025 lecture, 03 Natural Language Processing 02, 2.9, Language MOdels:
Title: N-Gram Language Model
The probability of a sequence of words can be computed via contitional probability and the Bayes Rule (including the chain rule for n words). Approximation is performed via Markov assumption (dependency only on the n last words), and the Maximum Likelihood estimation (approximating the probabilities of a sequence of words by counting and normalising …
@arXiv_csNE_bot@mastoxiv.page
2025-07-08 10:43:00

Bridging Expressivity and Scalability with Adaptive Unitary SSMs
Arjun Karuvally, Franz Nowak, Anderson T. Keller, Carmen Amo Alonso, Terrence J. Sejnowski, Hava T. Siegelmann
arxiv.org/abs/2507.05238

@arXiv_qbioQM_bot@mastoxiv.page
2025-05-28 07:37:24

Sequence-Only Prediction of Binding Affinity Changes: A Robust and Interpretable Model for Antibody Engineering
Chen Liu, Mingchen Li, Yang Tan, Wenrui Gou, Guisheng Fan, Bingxin Zhou
arxiv.org/abs/2505.20301

@arXiv_mathNA_bot@mastoxiv.page
2025-08-05 08:03:00

Error estimates of linear decoupled structure-preserving incremental viscosity splitting methods for the Cahn--Hilliard--Navier--Stokes system
Baolin Kuang, Hongfei Fu, Xiaoli Li
arxiv.org/abs/2508.01141

@arXiv_astrophSR_bot@mastoxiv.page
2025-08-04 09:06:50

Effect of Matter Accretion on Lithium Enhancement of Giants
Xuefeng Li, Jianrong Shi, Yan Li, Hongliang Yan, Jinghua Zhang, Fei Guo
arxiv.org/abs/2508.00405

@arXiv_mathST_bot@mastoxiv.page
2025-07-23 08:39:22

Gaussian Sequence Model: Sample Complexities of Testing, Estimation and LFHT
Zeyu Jia, Yury Polyanskiy
arxiv.org/abs/2507.16734

@arXiv_mathPR_bot@mastoxiv.page
2025-07-02 09:09:59

Dispersion models on a circle: universal properties and asymptotic results
Jean-Fran\c{c}ois Marckert, Zo\'e Varin
arxiv.org/abs/2507.00737

@arXiv_csGT_bot@mastoxiv.page
2025-06-03 07:25:23

Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts
Benjamin Schiffer, Mark Sellke
arxiv.org/abs/2506.01685

@arXiv_qbiobm_bot@mastoxiv.page
2025-07-28 08:39:41

Latent-X: An Atom-level Frontier Model for De Novo Protein Binder Design
Latent Labs Team, Alex Bridgland, Jonathan Crabb\'e, Henry Kenlay, Daniella Pretorius, Sebastian M. Schmon, Agrin Hilmkil, Rebecca Bartke-Croughan, Robin Rombach, Michael Flashman, Tomas Matteson, Simon Mathis, Alexander W. R. Nelson, David Yuan, Annette Obika, Simon A. A. Kohl

@arXiv_eessIV_bot@mastoxiv.page
2025-07-22 10:23:40

DeSamba: Decoupled Spectral Adaptive Framework for 3D Multi-Sequence MRI Lesion Classification
Dezhen Wang, Sheng Miao, Rongxin Chai, Jiufa Cui
arxiv.org/abs/2507.15487

@arXiv_econTH_bot@mastoxiv.page
2025-08-05 07:42:00

Persuasion in the Long Run: When history matters
Hyeonggyun Ko
arxiv.org/abs/2508.01662 arxiv.org/pdf/2508.01662

@arXiv_qbioPE_bot@mastoxiv.page
2025-06-23 10:06:40

An nth-cousin mating model and the n-anacci numbers
Elisa Heinrich Mora, Noah A. Rosenberg
arxiv.org/abs/2506.16577 a…

@arXiv_csCV_bot@mastoxiv.page
2025-06-27 10:20:59

StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning
Chuxin Wang, Yixin Zha, Wenfei Yang, Tianzhu Zhang
arxiv.org/abs/2506.21541

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 08:21:29

SPACE: Your Genomic Profile Predictor is a Powerful DNA Foundation Model
Zhao Yang, Jiwei Zhu, Bing Su
arxiv.org/abs/2506.01833

@arXiv_mathph_bot@mastoxiv.page
2025-06-17 11:12:17

The Kuramoto model on the Sierpinski Gasket
Georgi S. Medvedev, Matthew S. Mizuhara
arxiv.org/abs/2506.12940 arxiv.or…

@arXiv_quantph_bot@mastoxiv.page
2025-08-04 09:48:30

Reducing Quantum Circuit Synthesis to #SAT
Dekel Zak, Jingyi Mei, Jean-Marie Lagniez, Alfons Laarman
arxiv.org/abs/2508.00416

@arXiv_qbioNC_bot@mastoxiv.page
2025-07-22 09:04:40

Dissociating model architectures from inference computations
Noor Sajid, Johan Medrano
arxiv.org/abs/2507.15776 arxiv…

@arXiv_mathOC_bot@mastoxiv.page
2025-06-02 10:21:53

This arxiv.org/abs/2503.14328 has been replaced.
initial toot: mastoxiv.page/@arXiv_mat…

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:11:42

Hybrid Tokenization Strategy for DNA Language Model using Byte Pair Encoding and K-MER Methods
Ganesh Sapkota, Md Hasibur Rahman
arxiv.org/abs/2507.18570

@arXiv_csIR_bot@mastoxiv.page
2025-07-29 08:44:51

Integrating LLM-Derived Multi-Semantic Intent into Graph Model for Session-based Recommendation
Shuo Zhang, Xiao Li, Jiayi Wu, Fan Yang, Xiang Li, Ming Gao
arxiv.org/abs/2507.20147

@arXiv_statME_bot@mastoxiv.page
2025-06-17 12:18:09

A Minimum Distance Estimator Approach for Misspecified Ergodic Processes
Jaroslav I. Borodavka, Sebastian Krumscheid, Grigorios A. Pavliotis
arxiv.org/abs/2506.12432

@arXiv_csSD_bot@mastoxiv.page
2025-07-02 08:27:39

Beat and Downbeat Tracking in Performance MIDI Using an End-to-End Transformer Architecture
Sebastian Murgul, Michael Heizmann
arxiv.org/abs/2507.00466

@arXiv_qbioGN_bot@mastoxiv.page
2025-06-25 08:15:39

eccDNAMamba: A Pre-Trained Model for Ultra-Long eccDNA Sequence Analysis
Zhenke Liu, Jien Li, Ziqi Zhang
arxiv.org/abs/2506.18940

@arXiv_csNE_bot@mastoxiv.page
2025-07-31 07:50:21

Pendulum Model of Spiking Neurons
Joy Bose
arxiv.org/abs/2507.22146 arxiv.org/pdf/2507.22146

@arXiv_csAI_bot@mastoxiv.page
2025-07-01 11:11:33

Hybrid Approach for Electricity Price Forecasting using AlexNet and LSTM
Bosubabu Sambana, Kotamsetty Geethika Devi, Bandi Rajeswara Reddy, Galeti Mohammad Hussain, Gownivalla Siddartha
arxiv.org/abs/2506.23504

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 08:21:58

Transformers as Multi-task Learners: Decoupling Features in Hidden Markov Models
Yifan Hao, Chenlu Ye, Chi Han, Tong Zhang
arxiv.org/abs/2506.01919

@arXiv_csGR_bot@mastoxiv.page
2025-06-23 08:18:10

FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
Black Forest Labs, Stephen Batifol, Andreas Blattmann, Frederic Boesel, Saksham Consul, Cyril Diagne, Tim Dockhorn, Jack English, Zion English, Patrick Esser, Sumith Kulal, Kyle Lacey, Yam Levi, Cheng Li, Dominik Lorenz, Jonas M\"uller, Dustin Podell, Robin Rombach, Harry Saini, Axel Sauer, Luke Smith

@arXiv_astrophSR_bot@mastoxiv.page
2025-06-03 07:58:59

Critical Metallicity of Cool Supergiant Formation. II. Physical Origin
Po-Sheng Ou, Ke-Jung Chen
arxiv.org/abs/2506.01753

@arXiv_heplat_bot@mastoxiv.page
2025-06-24 08:43:20

Topological crystals and soliton lattices in a Gross-Neveu model with Hilbert-space fragmentation
Sergio Cerezo-Roquebr\'un, Simon Hands, Alejandro Bermudez
arxiv.org/abs/2506.18675

@arXiv_physicsoptics_bot@mastoxiv.page
2025-07-29 10:27:11

Inverse scattering transform via affine map: applications to high-speed nonlinear optical communications
Ilia Kuk, Ildar R. Gabitov
arxiv.org/abs/2507.20470

@arXiv_physicsmedph_bot@mastoxiv.page
2025-05-29 10:29:07

This arxiv.org/abs/2501.00256 has been replaced.
initial toot: mastoxiv.page/@arX…

@arXiv_qbiobm_bot@mastoxiv.page
2025-06-23 08:34:20

Aptamer-protein interaction prediction model based on transformer
Zhichao Yan, Yue Kang, Buyong Ma
arxiv.org/abs/2506.16084

@arXiv_csSI_bot@mastoxiv.page
2025-06-12 07:54:01

Alice and the Caterpillar: A more descriptive null model for assessing data mining results
Giulia Preti, Gianmarco De Francisci Morales, Matteo Riondato
arxiv.org/abs/2506.09764

@arXiv_condmatsoft_bot@mastoxiv.page
2025-06-24 10:52:50

Role of bubble positioning in force induced melting of DNA
Bidisha Mukherjee, Amit Raj Singh, Garima Mishra
arxiv.org/abs/2506.18821

@arXiv_mathOC_bot@mastoxiv.page
2025-07-25 08:51:12

General Proximal Quasi-Newton Methods based on model functions for nonsmooth nonconvex problems
Xiaoxi Jia, Peter Ochs
arxiv.org/abs/2507.18363

@arXiv_csRO_bot@mastoxiv.page
2025-06-27 09:10:09

STEP Planner: Constructing cross-hierarchical subgoal tree as an embodied long-horizon task planner
Zhou Tianxing, Wang Zhirui, Ao Haojia, Chen Guangyan, Xing Boyang, Cheng Jingwen, Yang Yi, Yue Yufeng
arxiv.org/abs/2506.21030

@arXiv_csCV_bot@mastoxiv.page
2025-07-25 10:21:42

Captain Cinema: Towards Short Movie Generation
Junfei Xiao, Ceyuan Yang, Lvmin Zhang, Shengqu Cai, Yang Zhao, Yuwei Guo, Gordon Wetzstein, Maneesh Agrawala, Alan Yuille, Lu Jiang
arxiv.org/abs/2507.18634

@arXiv_csIR_bot@mastoxiv.page
2025-06-02 09:59:34

This arxiv.org/abs/2504.10545 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_grqc_bot@mastoxiv.page
2025-07-22 11:15:30

Dark energy era with a resolution of Hubble tension in generalized entropic cosmology
Priyanka Adhikary, Sudipta Das, Sergei D. Odintsov, Tanmoy Paul
arxiv.org/abs/2507.15273

@arXiv_econTH_bot@mastoxiv.page
2025-07-31 09:09:41

Misspecified Bayesianism
Pooya Molavi
arxiv.org/abs/2507.22775 arxiv.org/pdf/2507.22775

@arXiv_qbioGN_bot@mastoxiv.page
2025-06-03 07:55:44

Uncertainty-Aware Genomic Classification of Alzheimer's Disease: A Transformer-Based Ensemble Approach with Monte Carlo Dropout
Taeho Jo, Eun Hye Lee, Alzheimer's Disease Sequencing Project
arxiv.org/abs/2506.00662

@arXiv_csAR_bot@mastoxiv.page
2025-07-15 09:17:41

Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving
Wonung Kim, Yubin Lee, Yoonsung Kim, Jinwoo Hwang, Seongryong Oh, Jiyong Jung, Aziz Huseynov, Woong Gyu Park, Chang Hyun Park, Divya Mahajan, Jongse Park
arxiv.org/abs/2507.10178

@arXiv_quantph_bot@mastoxiv.page
2025-06-26 10:04:20

Speeding up thermalization and quantum state preparation through engineered quantum collisions
Sofia Sgroi, Salvatore Lorenzo, Luca Innocenti, Paolo A. Erdman, G. Massimo Palma, Mauro Paternostro
arxiv.org/abs/2506.20625

@arXiv_eessSY_bot@mastoxiv.page
2025-07-18 08:26:52

A Stackelberg Game of Demand Response from the Aggregator's Perspective
Seangleng Khe, Parin Chaipunya, Athikom Bangviwat
arxiv.org/abs/2507.12708

@arXiv_csRO_bot@mastoxiv.page
2025-06-30 09:24:30

A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments
Akshay Jaitly, Jack Cline, Siavash Farzan
arxiv.org/abs/2506.21982

@arXiv_csCL_bot@mastoxiv.page
2025-07-15 09:13:01

Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation
Jialong Mai, Xiaofen Xing, Yawei Li, Zhipeng Li, Jingyuan Xing, Xiangmin Xu
arxiv.org/abs/2507.09076

@arXiv_physicsgenph_bot@mastoxiv.page
2025-07-22 08:42:50

The BRS Cohomology of the Wess Zumino Chiral Scalar supersymmetric model with exotic pairs and exotic triplets (E2)
John A. Dixon
arxiv.org/abs/2507.14174

@arXiv_mathST_bot@mastoxiv.page
2025-07-21 08:24:50

Bounds of Shannon entropy and Extropy and their application in exploring the extreme value behavior of a large set of data
Konstantinos Zografos
arxiv.org/abs/2507.13656

@arXiv_qbiobm_bot@mastoxiv.page
2025-07-14 08:02:41

AmpLyze: A Deep Learning Model for Predicting the Hemolytic Concentration
Peng Qiu, Hanqi Feng, Barnabas Poczos
arxiv.org/abs/2507.08162

@arXiv_eessAS_bot@mastoxiv.page
2025-06-17 11:23:45

Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling
Wenmiao Gao, Yang Xiao
arxiv.org/abs/2506.13455

@arXiv_physicsmedph_bot@mastoxiv.page
2025-06-23 09:07:00

Unsupervised deep learning model for fast energy layer pre-selection of delivery-efficient proton arc therapy plan optimization of nasopharyngeal carcinoma
Bohan Yang, Gang Liu, Rirao Dao, Yujia Qian, Ke Shi, Anke Tang, Yong Luo, Jingnan Liu
arxiv.org/abs/2506.15803

@arXiv_qbioPE_bot@mastoxiv.page
2025-06-23 09:54:00

Covariance Decomposition for Distance Based Species Tree Estimation
Georgios Aliatimis, Ruriko Yoshida, Burak Boyak, James Grant
arxiv.org/abs/2506.16425

@arXiv_astrophSR_bot@mastoxiv.page
2025-07-24 08:59:10

A framework for modeling the evolution of young stellar objects
Theo Richardson, Adam Ginsburg, Erik Rosolowsky, Joshua Peltonen, R\'emy Indebetouw
arxiv.org/abs/2507.16944

@arXiv_csRO_bot@mastoxiv.page
2025-06-18 08:35:03

Sequence Modeling for Time-Optimal Quadrotor Trajectory Optimization with Sampling-based Robustness Analysis
Katherine Mao, Hongzhan Yu, Ruipeng Zhang, Igor Spasojevic, M Ani Hsieh, Sicun Gao, Vijay Kumar
arxiv.org/abs/2506.13915

@arXiv_statME_bot@mastoxiv.page
2025-07-14 08:38:12

Nonparametric predictive inference for discrete data via Metropolis-adjusted Dirichlet sequences
Davide Agnoletto, Tommaso Rigon, David B. Dunson
arxiv.org/abs/2507.08629

@arXiv_qbioQM_bot@mastoxiv.page
2025-06-13 09:41:50

Predicting function of evolutionarily implausible DNA sequences
Shiyu Jiang, Xuyin Liu, Zitong Jerry Wang
arxiv.org/abs/2506.10271

@arXiv_qbioGN_bot@mastoxiv.page
2025-07-17 08:24:50

RNAMunin: A Deep Machine Learning Model for Non-coding RNA Discovery
Lauren Lui, Torben Nielsen
arxiv.org/abs/2507.11950

@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:27:40

Mixture of Raytraced Experts
Andrea Perin, Giacomo Lagomarsini, Claudio Gallicchio, Giuseppe Nuti
arxiv.org/abs/2507.12419

@arXiv_qbioQM_bot@mastoxiv.page
2025-06-12 09:53:51

Simulation-trained conditional normalizing flows for likelihood approximation: a case study in stress regulation kinetics in yeast
Pedro Pessoa, Juan Andres Martinez, Vincent Vandenbroucke, Frank Delvigne, Steve Press\'e
arxiv.org/abs/2506.09374

@arXiv_mathOC_bot@mastoxiv.page
2025-06-16 09:58:29

Dictionary Learning Based Regularization in Quantitative MRI: A Nested Alternating Optimization Framework
Guozhi Dong, Michael Hinterm\"uller, Clemens Sirotenko
arxiv.org/abs/2506.11977

@arXiv_csLG_bot@mastoxiv.page
2025-06-12 10:01:31

EnerBridge-DPO: Energy-Guided Protein Inverse Folding with Markov Bridges and Direct Preference Optimization
Dingyi Rong, Haotian Lu, Wenzhuo Zheng, Fan Zhang, Shuangjia Zheng, Ning Liu
arxiv.org/abs/2506.09496

@arXiv_eessAS_bot@mastoxiv.page
2025-07-15 09:17:51

Enhancing Stereo Sound Event Detection with BiMamba and Pretrained PSELDnet
Wenmiao Gao, Han Yin
arxiv.org/abs/2507.09570

@arXiv_qbioGN_bot@mastoxiv.page
2025-06-16 09:30:09

Multimodal Modeling of CRISPR-Cas12 Activity Using Foundation Models and Chromatin Accessibility Data
Azim Dehghani Amirabad, Yanfei Zhang, Artem Moskalev, Sowmya Rajesh, Tommaso Mansi, Shuwei Li, Mangal Prakash, Rui Liao
arxiv.org/abs/2506.11182