Snowpark: Performant, Secure, User-Friendly Data Engineering and AI/ML Next To Your Data
Brandon Baker, Elliott Brossard, Chenwei Xie, Zihao Ye, Deen Liu, Yijun Xie, Arthur Zwiegincew, Nitya Kumar Sharma, Gaurav Jain, Eugene Retunsky, Mike Halcrow, Derek Denny-Brown, Istvan Cseri, Tyler Akidau, Yuxiong He
https://arxiv.org/abs/2508.05904…
Enhancing Online Learning by Integrating Biosensors and Multimodal Learning Analytics for Detecting and Predicting Student Behavior: A Review
Alvaro Becerra, Ruth Cobos, Charles Lang
https://arxiv.org/abs/2509.07742
Ontology-Aligned Embeddings for Data-Driven Labour Market Analytics
Heinke Hihn, Dennis A. V. Dittrich, Carl Jeske, Cayo Costa Sobral, Helio Pais, Timm Lochmann
https://arxiv.org/abs/2509.04942
TemporalFlowViz: Parameter-Aware Visual Analytics for Interpreting Scramjet Combustion Evolution
Yifei Jia, Shiyu Cheng, Yu Dong, Guan Li, Dong Tian, Ruixiao Peng, Xuyi Lu, Yu Wang, Wei Yao, Guihua Shan
https://arxiv.org/abs/2509.04834
Rethinking Analytical Processing in the GPU Era
Bobbi Yogatama, Yifei Yang, Kevin Kristensen, Devesh Sarda, Abigale Kim, Adrian Cockcroft, Yu Teng, Joshua Patterson, Gregory Kimball, Wes McKinney, Weiwei Gong, Xiangyao Yu
https://arxiv.org/abs/2508.04701
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements
Issa Sugiura, Takashi Ishida, Taro Makino, Chieko Tazuke, Takanori Nakagawa, Kosuke Nakago, David Ha
https://arxiv.org/abs/2506.08762
MOSAIC-F: A Framework for Enhancing Students' Oral Presentation Skills through Personalized Feedback
Alvaro Becerra, Daniel Andres, Pablo Villegas, Roberto Daza, Ruth Cobos
https://arxiv.org/abs/2506.08634
Non-Exponential Decay in Finite Photonic Waveguide Arrays
Florian H. Huber, Benedikt Braumandl, Johannes Kn\"orzer, Jonas Himmel, Carlotta Versmold, Robert H. Jonsson, Alexander Szameit, Jasmin Meinecke
https://arxiv.org/abs/2509.06443
Bilinear Quadratic Output Systems and Balanced Truncation
Heike Fa{\ss}bender (Institute for Numerical Analysis, TU Braunschweig), Serkan Gugercin (Department of Mathematics and Division of Computational Modeling and Data Analytics, Academy of Data Science, Virginia Tech), Till Peters (Institute for Numerical Analysis, TU Braunschweig)
https://
VaxPulse: Active Global Vaccine Infodemic Risk Assessment
Gerardo Luis Dimaguila (Epidemiology Informatics, Centre for Health Analytics, Melbourne Children's Campus, Australia, Department of Paediatrics, The University of Melbourne, Australia), Muhammad Javed (Epidemiology Informatics, Centre for Health Analytics, Melbourne Children's Campus, Australia, Global Vaccine Data Network, University of Auckland, New Zealand), Jeremiah Munakabayo (Epidemiology Informatics, Centre for H…
Curated Collaborative AI Edge with Network Data Analytics for B5G/6G Radio Access Networks
Sardar Jaffar Ali, Syed M. Raza, Duc-Tai Le, Rajesh Challa, Min Young Chung, Ness Shroff, Hyunseung Choo
https://arxiv.org/abs/2507.01994
FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data
Ziting Wang, Shize Zhang, Haitao Yuan, Jinwei Zhu, Shifu Li, Wei Dong, Gao Cong
https://arxiv.org/abs/2509.02473
Designing Gaze Analytics for ELA Instruction: A User-Centered Dashboard with Conversational AI Support
Eduardo Davalos, Yike Zhang, Shruti Jain, Namrata Srivastava, Trieu Truong, Nafees-ul Haque, Tristan Van, Jorge Salas, Sara McFadden, Sun-Joo Cho, Gautam Biswas, Amanda Goodwin
https://arxiv.org/abs/2509.03741
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge
Daghash K. Alqahtani, Maria A. Rodriguez, Muhammad Aamir Cheema, Hamid Rezatofighi, Adel N. Toosi
https://arxiv.org/abs/2507.06011
At Berlin Buzzwords 2025, Ved Prakash discussed how Siphon transformed their data pipeline using Apache Iceberg to successfully stream quality data into both Snowflake and Clickhouse simultaneously. In this short talk, you’ll learn about their battle-tested architecture, the performance improvements they’ve achieved, and their strategies for maintaining data consistency across two analytics engines.
Watch the full session:
@… small correction. You can still track people, just not share it with everyone and their dog.
If you have data in your system you're free to use it for analytics. As long as it's anonymized, so, properly aggregated.
No consent needed.
Track Component Failure Detection Using Data Analytics over existing STDS Track Circuit data
Francisco L\'opez, Eduardo Di Santi, Cl\'ement Lefebvre, Nenad Mijatovic, Michele Pugnaloni, Victor Mart\'in, Kenza Saiah
https://arxiv.org/abs/2508.11693
Towards Propagation-aware Representation Learning for Supervised Social Media Graph Analytics
Wei Jiang, Tong Chen, Wei Yuan, Xiangyu Zhao, Quoc Viet Hung Nguyen, Hongzhi Yin
https://arxiv.org/abs/2509.01124
Crosslisted article(s) found for stat.ML. https://arxiv.org/list/stat.ML/new
[1/1]:
- Track Component Failure Detection Using Data Analytics over existing STDS Track Circuit data
L\'opez, Di Santi, Lefebvre, Mijatovic, Pugnaloni, Mart\'in, Saiah
Predictive Analytics for Collaborators Answers, Code Quality, and Dropout on Stack Overflow
Elijah Zolduoarrati, Sherlock A. Licorish, Nigel Stanger
https://arxiv.org/abs/2506.18329
Towards Operational Data Analytics Chatbots -- Virtual Knowledge Graph is All You Need
Junaid Ahmed Khan, Hiari Pizzini Cavagna, Andrea Proia, Andrea Bartolini
https://arxiv.org/abs/2506.22267
How AI and automation are transforming agriculture, enabling autonomous tractors and fruit-picking robots, and improving crop management via data and analytics (William Boston/Wall Street Journal)
https://www.wsj.com/tech/autonomous-farmin
HathiTrust was founded in 2008 as a not-for-profit collaborative of academic and research libraries now preserving 18 million digitized items in the HathiTrust Digital Library. We offer reading access to the fullest extent allowable by U.S. and international copyright law, text and data mining tools for the entire corpus, and other emerging services based on the combined collection.
Crosslisted article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[2/10]:
- Track Component Failure Detection Using Data Analytics over existing STDS Track Circuit data
L\'opez, Di Santi, Lefebvre, Mijatovic, Pugnaloni, Mart\'in, Saiah
New Study Shows Google Tracking Persists Even With Privacy Tools
A new SafetyDetectives study reveals the surprising extent of Google tracking across the web in the US, UK, Switzerland, and Sweden. Discover how Google Analytics, AdSense, and YouTube embeds collect your data, even when using DuckDuckGo.
🕵️ https:…
Why Can't I See My Clusters? A Precision-Recall Approach to Dimensionality Reduction Validation
Diede P. M. van der Hoorn, Alessio Arleo, Fernando V. Paulovich
https://arxiv.org/abs/2509.04222
Unlearning Comparator: A Visual Analytics System for Comparative Evaluation of Machine Unlearning Methods
Jaeung Lee, Suhyeon Yu, Yurim Jang, Simon S. Woo, Jaemin Jo
https://arxiv.org/abs/2508.12730
d-DQIVAR: Data-centric Visual Analytics and Reasoning for Data Quality Improvement
Hyein Hong, Sangbong Yoo, SeokHwan Choi, Jisue Kim, Seongbum Seo, Haneol Cho, Chansoo Kim, Yun Jang
https://arxiv.org/abs/2507.11960
OASIS: Object-based Analytics Storage for Intelligent SQL Query Offloading in Scientific Tabular Workloads
Soon Hwang, Junhyeok Park, Junghyun Ryu, Seonghoon Ahn, Jeoungahn Park, Jeongjin Lee, Soonyeal Yang, Jungki Noh, Woosuk Chung, Hoshik Kim, Youngjae Kim
https://arxiv.org/abs/2509.01966
The Promise of Large Language Models in Digital Health: Evidence from Sentiment Analysis in Online Health Communities
Xiancheng Li, Georgios D. Karampatakis, Helen E. Wood, Chris J. Griffiths, Borislava Mihaylova, Neil S. Coulson, Alessio Pasinato, Pietro Panzarasa, Marco Viviani, Anna De Simoni
https://arxiv.org/abs/2508.14032
Lost in Translation? Converting RegExes for Log Parsing into Dynatrace Pattern Language
Julian Fragner, Christian Macho, Bernhard Dieber, Martin Pinzger
https://arxiv.org/abs/2506.19539
Revisiting Graph Analytics Benchmark
Lingkai Meng, Yu Shao, Long Yuan, Longbin Lai, Peng Cheng, Xue Li, Wenyuan Yu, Wenjie Zhang, Xuemin Lin, Jingren Zhou
https://arxiv.org/abs/2506.21811
"You’re one step away from becoming an analytics leader for the digital age.
Northeastern University’s Master of Professional Studies in Analytics program provides in-demand skills, resumé-building experiences, and flexible learning options so you can help organizations make better data-driven decisions."
dear #LinkedIn , I am 80 yrs old. And you are
OrbitChain: Orchestrating In-orbit Real-time Analytics of Earth Observation Data
Zhouyu Li, Zhijing Yang, Huayue Gu, Xiaojian Wang, Yuchen Liu, Ruozhou Yu
https://arxiv.org/abs/2508.13374
Direct tensor processing with coherent light
Yufeng Zhang, Xiaobing Liu, Chenguang Yang, Jinlong Xiang, Hao Yan, Tianjiao Fu, Kaizhi Wang, Yikai Su, Zhipei Sun, Xuhan Guo
https://arxiv.org/abs/2506.14277
Visual Analytics Using Tensor Unified Linear Comparative Analysis
Naoki Okami, Kazuki Miyake, Naohisa Sakamoto, Jorji Nonaka, Takanori Fujiwara
https://arxiv.org/abs/2507.19988 …
GPU Acceleration of SQL Analytics on Compressed Data
Zezhou Huang, Krystian Sakowski, Hans Lehnert, Wei Cui, Carlo Curino, Matteo Interlandi, Marius Dumitru, Rathijit Sen
https://arxiv.org/abs/2506.10092
ARCADE: A RAN Diagnosis Methodology in a Hybrid AI Environment for 6G Networks
Daniel Ricardo Cunha Oliveira, Rodrigo Moreira, Fl\'avio de Oliveira Silva
https://arxiv.org/abs/2507.17861
VIVA: Virtual Healthcare Interactions Using Visual Analytics, With Controllability Through Configuration
J\"urgen Bernard, Mara Solen, Helen Novak Lauscher, Kurtis Stewart, Kendall Ho, Tamara Munzner
https://arxiv.org/abs/2508.09386
Towards Practical Benchmarking of Data Cleaning Techniques: On Generating Authentic Errors via Large Language Models
Xinyuan Liu, Jiahui Chen, Bocheng Hu, Yu Sun, Xinyang Chen, Shaoxu Song
https://arxiv.org/abs/2507.10934
PALM: PAnoramic Learning Map Integrating Learning Analytics and Curriculum Map for Scalable Insights Across Courses
Mahiro Ozaki, Li Chen, Shotaro Naganuma, Valdemar \v{S}v\'abensk\'y, Fumiya Okubo, Atsushi Shimada
https://arxiv.org/abs/2507.18393
Optimizing Edge Gaming Slices through an Enhanced User Plane Function and Analytics in Beyond-5G Networks
Bruno Marques da Silva, Larissa Ferreira Rodrigues Moreira, Fl\'avio de Oliveira Silva, Rodrigo Moreira
https://arxiv.org/abs/2507.17843
PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking
Yan Zhou, Chunwei Liu, Bhuvan Urgaonkar, Zhengle Wang, Magnus Mueller, Chao Zhang, Songyue Zhang, Pascal Pfeil, Dominik Horn, Zhengchun Liu, Davide Pagano, Tim Kraska, Samuel Madden, Ju Fan
https://arxiv.org/abs/2506.16379
Scalable GPU Performance Variability Analysis framework
Ankur Lahiry, Ayush Pokharel, Seth Ockerman, Amal Gueroudji, Line Pouchard, Tanzima Z. Islam
https://arxiv.org/abs/2506.20674
DBMS-LLM Integration Strategies in Industrial and Business Applications: Current Status and Future Challenges
Zhengtong Yan, Gongsheng Yuan, Qingsong Guo, Jiaheng Lu
https://arxiv.org/abs/2507.19254
Crosslisted article(s) found for cs.NI. https://arxiv.org/list/cs.NI/new
[1/1]:
- OrbitChain: Orchestrating In-orbit Real-time Analytics of Earth Observation Data
Zhouyu Li, Zhijing Yang, Huayue Gu, Xiaojian Wang, Yuchen Liu, Ruozhou Yu
Shelby: Decentralized Storage Designed to Serve
Guy Goren, Andrew Hariri, Timothy D. R. Hartley, Ravi Kappiyoor, Alexander Spiegelman, David Zmick
https://arxiv.org/abs/2506.19233
Urbanite: A Dataflow-Based Framework for Human-AI Interactive Alignment in Urban Visual Analytics
Gustavo Moreira, Leonardo Ferreira, Carolina Veiga, Maryam Hosseini, Fabio Miranda
https://arxiv.org/abs/2508.07390
SIREN: Software Identification and Recognition in HPC Systems
Thomas Jakobsche, Fredrik Roberts\'en, Jessica R. Jones, Utz-Uwe Haus, Florina M. Ciorba
https://arxiv.org/abs/2508.18950
Navigating High-Dimensional Backstage: A Guide for Exploring Literature for the Reliable Use of Dimensionality Reduction
Hyeon Jeon, Hyunwook Lee, Yun-Hsin Kuo, Taehyun Yang, Daniel Archambault, Sungahn Ko, Takanori Fujiwara, Kwan-Liu Ma, Jinwook Seo
https://arxiv.org/abs/2506.14820
TrialCompass: Visual Analytics for Enhancing the Eligibility Criteria Design of Clinical Trials
Rui Sheng, Xingbo Wang, Jiachen Wang, Xiaofu Jin, Zhonghua Sheng, Zhenxing Xu, Suraj Rajendran, Huamin Qu, Fei Wang
https://arxiv.org/abs/2507.12298
Replaced article(s) found for cs.DB. https://arxiv.org/list/cs.DB/new
[1/1]:
- SiriusBI: A Comprehensive LLM-Powered Solution for Data Analytics in Business Intelligence
Jiang, Xie, , Shen, Zhang, Lei, Zheng, Li, Li, Huang, Wu, Zhang, Yang, Cui, Chen
BDIViz: An Interactive Visualization System for Biomedical Schema Matching with LLM-Powered Validation
Eden Wu, Dishita G Turakhia, Guande Wu, Christos Koutras, Sarah Keegan, Wenke Liu, Beata Szeitz, David Fenyo, Cl\'audio T. Silva, Juliana Freire
https://arxiv.org/abs/2507.16117
Beyond Self-Regulated Learning Processes: Unveiling Hidden Tactics in Generative AI-Assisted Writing
Kaixun Yang, Yizhou Fan, Luzhen Tang, Mladen Rakovi\'c, Xinyu Li, Dragan Ga\v{s}evi\'c, Guanliang Chen
https://arxiv.org/abs/2508.10310