
2025-09-19 20:28:57
Sources: Oracle is in discussions with Meta to provide computing power for training and deploying AI models, in a deal worth about $20B (Bloomberg)
https://www.bloomberg.com/news/articles/2025-09-19/oracle-in-talks-with-meta-…
Sources: Oracle is in discussions with Meta to provide computing power for training and deploying AI models, in a deal worth about $20B (Bloomberg)
https://www.bloomberg.com/news/articles/2025-09-19/oracle-in-talks-with-meta-…
So I had this terrible idea a few years ago to write some infrastructure automation that provisions a new compute instance, sets up secrets storage, configures IAM roles, authorizes the new instance to be able to provision new instance and roles via infra-as-code automation, and then the new instance tears down the instance and roles that created it, before then creating its own new compute instance, etc
Like a self-propagating glider in Conway's Game of Life, except with cloud inf…
Accelerating Edge Inference for Distributed MoE Models with Latency-Optimized Expert Placement
Tian Wu, Liming Wang, Zijian Wen, Xiaoxi Zhang, Jingpu Duan, Xianwei Zhang, Jinhang Zuo
https://arxiv.org/abs/2508.12851
Helsinki-based DataCrunch, which aims to become Europe's first AI cloud hyperscaler, raised a €55M Series A, bringing its total funding to €76.5M (Tamara Djurickovic/Tech.eu)
https://tech.eu/2025/09/08/datacrunch-raises-eur5…
Learning-Enabled Adaptive Power Capping Scheme for Cloud Data Centers
Yimeng Sun, Zhaohao Ding, Payman Dehghanian, Fei Teng
https://arxiv.org/abs/2508.06994 https://
I continue to be amazed that anyone uses Amazon AWS, or Microsoft Azure, or Google Cloud Services. Only someone who's very happy to completely lock their business into a 3rd party proprietary stack they don't control, and loves paying 20x (2000%!) more than reasonable commodity infrastructure rates would do so. I'm amazed that these businesses are even viable. Just goes to show how much flab most corporations have. They're ripe for disruption and displacement.
Multi-IaC-Eval: Benchmarking Cloud Infrastructure as Code Across Multiple Formats
Sam Davidson, Li Sun, Bhavana Bhasker, Laurent Callot, Anoop Deoras
https://arxiv.org/abs/2509.05303
Google plans to spend $9B in Oklahoma over two years on cloud and AI infrastructure, including a new data center campus and an expansion of its Pryor facility (Jaspreet Singh/Reuters)
https://www.reuters.com/business/google-pledge…
Energy-Aware Data Center Management: A Sustainable Approach to Reducing Carbon Footprint
Rabab Khan Rongon, Krishna Das
https://arxiv.org/abs/2509.10462 https://
A Survey on Task Scheduling in Carbon-Aware Container Orchestration
Jialin Yang, Zainab Saad, Jiajun Wu, Xiaoguang Niu, Henry Leung, Steve Drew
https://arxiv.org/abs/2508.05949 …
Comparative Studies: Cloud-Enabled Adaptive Learning System for Scalable Education in Sub-Saharan
Israel Fianyi, Soonja Yeom, Ju-Hyun Shin
https://arxiv.org/abs/2506.23851
3. Multiple Access Methods 🌐
• Cloud API: https://platform.moonshot.ai (OpenAI/Anthropic compatible) • Self-hosting: Deploy on your own infrastructure • Inference engines: vLLM, SGLang, KTransformers, TensorRT-LLM supported
On The Road - To Xi’An/ Silhouette 👤
在路上 - 去西安/ 轮廓 👤
📷 Pentax MX
🎞️Lucky SHD 400
#filmphotography #Photography #blackandwhite
A Dynamic Approach to Load Balancing in Cloud Infrastructure: Enhancing Energy Efficiency and Resource Utilization
Shadman Sakib, Ajay Katangur, Rahul Dubey
https://arxiv.org/abs/2508.05821
Cross-Service Token: Finding Attacks in 5G Core Networks
Anqi Chen, Riccardo Preatoni, Alessandro Brighente, Mauro Conti, Cristina Nita-Rotaru
https://arxiv.org/abs/2509.08992 h…
Towards On-Device Personalization: Cloud-device Collaborative Data Augmentation for Efficient On-device Language Model
Zhaofeng Zhong, Wei Yuan, Liang Qu, Tong Chen, Hao Wang, Xiangyu Zhao, Hongzhi Yin
https://arxiv.org/abs/2508.21313
M$^2$-MFP: A Multi-Scale and Multi-Level Memory Failure Prediction Framework for Reliable Cloud Infrastructure
Hongyi Xie, Min Zhou, Qiao Yu, Jialiang Yu, Zhenli Sheng, Hong Xie, Defu Lian
https://arxiv.org/abs/2507.07144
Looks like EU funding of #FLOSS will continue: "Open digital infrastructure, from search and identity to cloud and software governance, is now a strategic European priority."
E2B, formerly FoundryLabs, which is developing an open-source, sandboxed cloud infrastructure for AI agents, raised a $21M Series A led by Insight Partners (Mike Wheatley/SiliconANGLE)
https://siliconangle.com/2025/07/28/e2
Application Placement with Constraint Relaxation
Damiano Azzolini, Marco Duca, Stefano Forti, Francesco Gallo, Antonio Ielo
https://arxiv.org/abs/2507.13895
A Fragmentation-Aware Adaptive Bilevel Search Framework for Service Mapping in Computing Power Networks
Jingzhao Xie, Zhenglian Li, Gang Sun, Long Luo, Hongfang Yu, Dusit Niyato
https://arxiv.org/abs/2507.07535
"The Seven Capital Sins of Open Science"
1. Worshiping the 'age factor'
2. Ignoring the value of data reuse and complexity
3. Disrespecting other disciplines
4. Publishing data without a supplementary paper
5. Creating and maintaining a nightmare for machines
6. Refusing to support investment in general infrastructure
7. Creating data without a FAIR and explicit data stewardship plan.
RoCE BALBOA: Service-enhanced Data Center RDMA for SmartNICs
Maximilian Jakob Heer, Benjamin Ramhorst, Yu Zhu, Luhao Liu, Zhiyi Hu, Jonas Dann, Gustavo Alonso
https://arxiv.org/abs/2507.20412
Composable OS Kernel Architectures for Autonomous Intelligence
Rajpreet Singh, Vidhi Kothari
https://arxiv.org/abs/2508.00604 https://arxiv.org/pdf/2508.00…
From Cloud-Native to Trust-Native: A Protocol for Verifiable Multi-Agent Systems
Muyang Li
https://arxiv.org/abs/2507.22077 https://arxiv.org/pdf/2507.2207…
Over-the-Top Resource Broker System for Split Computing: An Approach to Distribute Cloud Computing Infrastructure
Ingo Friese, Jochen Klaffer, Mandy Galkow-Schneider, Sergiy Melnyk, Qiuheng Zhou, Hans Dieter Schotten
https://arxiv.org/abs/2508.07744
Towards System-Level Quantum-Accelerator Integration
Ralf Ramsauer, Wolfgang Mauerer
https://arxiv.org/abs/2507.19212 https://arxiv.org/pdf/2507.19212
Rigid Body Localization and Tracking for 6G V2X: Algorithms, Applications, and Road to Adoption
Niclas F\"uhrling, Hyeon Seok Rou, Giuseppe Thadeu Freitas de Abreu, David Gonz\'alez G., Gonzalo Seco-Granados, Osvaldo Gonsa
https://arxiv.org/abs/2509.01208
Google plans to invest an additional $9B in Virginia through 2026 to boost its cloud and AI infrastructure, including a new data center in Chesterfield County (Emily Forgash/Bloomberg)
https://www.bloomberg.com/news/articles/20
Distilling On-device Language Models for Robot Planning with Minimal Human Intervention
Zachary Ravichandran, Ignacio Hounie, Fernando Cladera, Alejandro Ribeiro, George J. Pappas, Vijay Kumar
https://arxiv.org/abs/2506.17486
Bridging Cloud Convenience and Protocol Transparency: A Hybrid Architecture for Ethereum Node Operations on Amazon Managed Blockchain
S M Mostaq Hossain, Amani Altarawneh, Maanak Gupta
https://arxiv.org/abs/2507.18774
Microsoft and the war in Gaza.
Thanks to the control it exerts over Palestinian telecommunications infrastructure, Israel has long intercepted phone calls in the occupied territories. But the indiscriminate new systemin the Azure cloud of Microsoft allows Israeli intelligence officers to play back the content of cellular calls made by Palestinians, capturing the conversations of a much larger pool of ordinary civilians.
https://www.theguardian.com/world/2025/aug/06/microsoft-israeli-military-palestinian-phone-calls-cloud?CMP=Share_iOSApp_Other
OpenLambdaVerse: A Dataset and Analysis of Open-Source Serverless Applications
Angel C. Chavez-Moreno, Cristina L. Abad
https://arxiv.org/abs/2508.01492 https://
Offloading tracing for real-time systems using a scalable cloud infrastructure
David Jannis Schmidt, Grigory Fridman, Florian von Zabiensky
https://arxiv.org/abs/2507.19953 http…
A deep dive into Oracle's data center strategy and its emergence as an AI computing powerhouse, amid questions about the sustainability of the GPU business (SemiAnalysis)
https://semianalysis.com/2025/06/30/how-oracle-is-winning-the-ai-compute-…
Locked In, Leaked Out: Measuring Isolation via Kernel Locks
Anjali, Michael M. Swift
https://arxiv.org/abs/2507.21248 https://arxiv.org/pdf/2507.21248
NYC-based Carbyne, which offers cloud-based tools for emergency services, raised $100M from AT&T Ventures, Axon Enterprises, Cox Enterprises, and others (Meir Orbach/CTech)
https://www.calcalistech.com/ctechnews/article/skhy4rdvgg
Towards a Decentralized IoT Onboarding for Smart Homes Using Consortium Blockchain
Narges Dadkhah, Khan Reaz, Gerhard Wunder
https://arxiv.org/abs/2508.21480 https://
Taming Cold Starts: Proactive Serverless Scheduling with Model Predictive Control
Chanh Nguyen, Monowar Bhuyan, Erik Elmroth
https://arxiv.org/abs/2508.07640 https://
SuperSONIC: Cloud-Native Infrastructure for ML Inferencing
Dmitry Kondratyev, Benedikt Riedel, Yuan-Tang Chou, Miles Cochran-Branson, Noah Paladino, David Schultz, Mia Liu, Javier Duarte, Philip Harris, Shih-Chieh Hsu
https://arxiv.org/abs/2506.20657
Rethinking Denial-of-Service: A Conditional Taxonomy Unifying Availability and Sustainability Threats
Mark Dorsett, Scott Man, Tim Koussas
https://arxiv.org/abs/2508.19283 https…
Sources: Meta has poached Frank Chu, an Apple exec who led AI teams focused on cloud infrastructure, training and search, despite Meta's plans to slow hiring (Mark Gurman/Bloomberg)
https://www.bloomberg.com/news/articles/20
Capacity Planning and Scheduling for Jobs with Uncertainty in Resource Usage and Duration
Sunandita Patra, Mehtab Pathan, Mahmoud Mahfouz, Parisa Zehtabi, Wided Ouaja, Daniele Magazzeni, Manuela Veloso
https://arxiv.org/abs/2507.01225
SecureV2X: An Efficient and Privacy-Preserving System for Vehicle-to-Everything (V2X) Applications
Joshua Lee, Ali Arastehfard, Weiran Liu, Xuegang Ban, Yuan Hong
https://arxiv.org/abs/2508.19115
Silent Failures in Stateless Systems: Rethinking Anomaly Detection for Serverless Computing
Chanh Nguyen, Erik Elmroth, Monowar Bhuyan
https://arxiv.org/abs/2507.04969
KubeIntellect: A Modular LLM-Orchestrated Agent Framework for End-to-End Kubernetes Management
Mohsen Seyedkazemi Ardebili, Andrea Bartolini
https://arxiv.org/abs/2509.02449 htt…
GlideinBenchmark: collecting resource information to optimize provisioning
Marco Mambelli, Shrijan Swaminathan
https://arxiv.org/abs/2507.21472 https://arx…
C-Koordinator: Interference-aware Management for Large-scale and Co-located Microservice Clusters
Shengye Song, Minxian Xu, Zuowei Zhang, Chengxi Gao, Fansong Zeng, Yu Ding, Kejiang Ye, Chengzhong Xu
https://arxiv.org/abs/2507.18005