Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csDC_bot@mastoxiv.page
2025-05-30 07:17:07

MemAscend: System Memory Optimization for SSD-Offloaded LLM Fine-Tuning
Yong-Cheng Liaw, Shuo-Han Chen
arxiv.org/abs/2505.23254

@arXiv_csDL_bot@mastoxiv.page
2025-05-26 07:17:10

Towards Industrial Convergence: Understanding the evolution of scientific norms and practices in the field of AI
Antoine Houssard
arxiv.org/abs/2505.17945

@berlinbuzzwords@floss.social
2025-05-14 14:00:33

LLMs are now part of our daily work, making coding easier. Join Ivan Dolgov at this year's Berlin Buzzwords to learn how JetBrains built an in-house LLM for AI code completion in its products, covering design choices, data preparation, training, and model evaluation.
Learn more:

Session title: How to train a fast LLM for coding tasks
Ivan Dolgov
Join us from June 15-17 in Berlin or participate online: berlinbuzzwords.de

@jeang3nie@social.linux.pizza
2025-05-19 20:37:00

This morning I null routed another dozen IP addresses for scraping my personal git server with repeated HTTP requests. As per usual, a quick inspection reveals that at least some of them are scraping for LLM training data. As always, I have not consented to this use of my unmaintained code, experiments, college coursework, and miscellaneous crap that I, for whatever reason, decided to self-host rather than push to Codeberg.
I mean, if you really want to feed your LLM on a diet that inclu…
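For readers unfamiliar with the term: null routing an address means installing a blackhole route so the kernel silently drops all traffic to it. A minimal sketch of how such blocking might be scripted on Linux with iproute2 follows; the IP list and script are illustrative assumptions, not the poster's actual setup, and it must run as root.

    #!/usr/bin/env python3
    # Minimal sketch: blackhole-route a list of scraper IPs on Linux (iproute2).
    # The addresses below are documentation-range placeholders, not real scrapers.
    import subprocess

    SCRAPER_IPS = ["192.0.2.10", "198.51.100.23"]  # hypothetical offenders

    for ip in SCRAPER_IPS:
        # "ip route add blackhole <addr>" makes the kernel drop traffic to <addr>.
        result = subprocess.run(["ip", "route", "add", "blackhole", ip],
                                capture_output=True, text=True)
        if result.returncode == 0:
            print(f"null routed {ip}")
        else:
            # e.g. the route already exists, or the script lacks root privileges
            print(f"skipped {ip}: {result.stderr.strip()}")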

@arXiv_qbioGN_bot@mastoxiv.page
2025-05-21 07:36:42

OmniGenBench: A Modular Platform for Reproducible Genomic Foundation Models Benchmarking
Heng Yang, Jack Cole, Yuan Li, Renzhi Chen, Geyong Min, Ke Li
arxiv.org/abs/2505.14402