Tootfinder

Opt-in global Mastodon full text search. Join the index!

@qurlyjoe@mstdn.social
2025-08-26 22:26:43

From #boingboing.net
#Byte magazine ran in print from 1975 to 1998, a golden age that began with the first commercially successful personal computer and ended with the Digital Millennium Copyright Act (or the introduction of the iMac, perhaps). At byte.tsundoku.io, you can explore the visual histo…

@zachleat@zachleat.com
2025-08-29 16:52:04

Is my host fast yet? is a web host TTFB (Time to First Byte) leaderboard: ismyhostfastyet.com/?client=mo

@azonenberg@ioc.exchange
2025-08-30 09:45:33

TIL that the Xilinx PCIe endpoint IP, at least on some Versals (I assume it's likely the same on other parts but I've never looked) has a cursed non-compliant AXI4-Stream interface that uses TKEEP as a *dword* level valid strobe, rather than byte level as the spec requires.
Lovely.
The more I see of Xilinx IP blocks the more my decision to avoid them seems like the right one.
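
To make the distinction in the post above concrete: in AXI4-Stream, TKEEP carries one bit per byte lane of TDATA. The following is a minimal Python sketch, assuming (this is an assumption, not taken from Xilinx documentation) that each dword-level bit covers four consecutive byte lanes, with bit i covering bytes 4i to 4i+3. The function name is hypothetical; it shows how such a strobe could be expanded into the byte-level form.

```python
def dword_keep_to_byte_keep(dword_keep: int, num_dwords: int) -> int:
    """Expand a per-dword valid mask into a per-byte TKEEP mask.

    Assumption: bit i of dword_keep marks bytes 4*i .. 4*i+3 as valid.
    """
    byte_keep = 0
    for i in range(num_dwords):
        if (dword_keep >> i) & 1:
            # One dword bit becomes four adjacent byte-lane bits.
            byte_keep |= 0xF << (4 * i)
    return byte_keep


# 128-bit bus: 4 dwords, 16 byte lanes.
print(hex(dword_keep_to_byte_keep(0b0011, 4)))  # lower two dwords valid
```

A byte-level consumer could apply this kind of expansion at the boundary, but it cannot recover partial-dword validity that the dword-level strobe never carried.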

@thomasfuchs@hachyderm.io
2025-07-21 14:44:40

I have an "old #astronomy software" side project and I'm looking for the early Mac software "StarMap" or "Star Map" from Bruce Webster.
So far my searches came up empty, but I'm not a specialist in old Mac software.
Does anyone have a copy of this app?
It's described in detail in a July 1985 BYTE article: #retrocomputing

@cyrevolt@mastodon.social
2025-08-27 22:32:07

A little byte swap here, a change to CONFIG_LOGO_LINUX_VGA16 there, badabing badaboom - the framebuffer console has the right colors!
Linux assumes framebuffers to be little endian.
In case they are not, as I recall reading, the client is supposed to take care of conversion.
But what if the client is the (Linux internal) console driver?
I couldn't find a config option, so I hacked a byte swap into the color map helper (there are many similarly named functions btw):
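
The author's actual change is not included in this listing. As a rough illustration of the kind of byte swap described, and not the kernel patch itself, here is a minimal Python sketch. It assumes 32-bit color map entries; the function names and the sample palette value are hypothetical.

```python
def swap32(value: int) -> int:
    """Reverse the byte order of a 32-bit unsigned value."""
    return int.from_bytes((value & 0xFFFFFFFF).to_bytes(4, "big"), "little")


def swap_palette(palette):
    """Byte-swap every entry of a (hypothetical) 32-bit color map."""
    return [swap32(entry) for entry in palette]


# A color stored as 0x11223344 in one byte order reads as
# 0x44332211 in the other.
print(hex(swap32(0x11223344)))
```

Applying the swap twice returns the original value, which is why a single conversion at the right layer is enough.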

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:11:42

Hybrid Tokenization Strategy for DNA Language Model using Byte Pair Encoding and K-MER Methods
Ganesh Sapkota, Md Hasibur Rahman
arxiv.org/abs/2507.18570

@arXiv_csCR_bot@mastoxiv.page
2025-08-28 09:47:11

The Art of Hide and Seek: Making Pickle-Based Model Supply Chain Poisoning Stealthy Again
Tong Liu, Guozhu Meng, Peng Zhou, Zizhuang Deng, Shuaiyin Yao, Kai Chen
arxiv.org/abs/2508.19774

@cyrevolt@mastodon.social
2025-07-27 14:01:55

What I really like about Rust is how much it feels like the code is practically writing itself thanks to all the excellent crates out there.
Once again, I am making great use of the zerocopy crate, specifically Ref:
docs.rs/zerocopy/latest/zeroco

@kurtsh@mastodon.social
2025-06-14 20:58:12

"The [Tesla] owners are demanding that their lease contracts be terminated and reimbursed for the accrued legal costs."
☑️ Tesla Drivers Sue Elon Musk for Turning Their Cars Into "Extreme" Right-Wing Symbols
futurism.com/the-byte/tesla-dr

@chrysn@chaos.social
2025-07-25 08:32:21

While NUL and NULL both refer to the numeric value 0, NUL is used in the context of ASCII (a single byte), whereas NULL is used in the context of pointers.
My headcanon is now that NULL is short for "NUL, Long".

@arXiv_csCL_bot@mastoxiv.page
2025-06-23 08:31:50

Entropy-Driven Pre-Tokenization for Byte-Pair Encoding
Yifan Hu, Frank Liang, Dachuan Zhao, Jonathan Geuter, Varshini Reddy, Craig W. Schmidt, Chris Tanner
arxiv.org/abs/2506.15889

@arXiv_csAR_bot@mastoxiv.page
2025-06-19 08:01:51

From Block to Byte: Transforming PCIe SSDs with CXL Memory Protocol and Instruction Annotation
Miryeong Kwon, Donghyun Gouk, Junhyeok Jang, Jinwoo Baek, Hyunwoo You, Sangyoon Ji, Hongjoo Jung, Junseok Moon, Seungkwan Kang, Seungjun Lee, Myoungsoo Jung
arxiv.org/abs/2506.15613

@frankel@mastodon.top
2025-07-21 16:11:00

Inside the Box: Everything I Did With an #Arduino Starter Kit
lopespm.com/hardware/2025/07/1

@arXiv_csLG_bot@mastoxiv.page
2025-07-24 10:19:49

BGM-HAN: A Hierarchical Attention Network for Accurate and Fair Decision Assessment on Semi-Structured Profiles
Junhua Liu, Roy Ka-Wei Lee, Kwan Hui Lim
arxiv.org/abs/2507.17472

@stiefkind@mastodon.social
2025-06-07 05:36:46

Is somebody aware of something like an “all time index” over all BYTE magazine issues? I'm specifically searching for BYTE articles on DTP (desktop publishing). The magazines themselves are available via @… (mostly complete) but I don't want to manually walk through every single issue's table of contents.

@arXiv_csDB_bot@mastoxiv.page
2025-06-24 08:22:39

Floating-Point Data Transformation for Lossless Compression
Samirasadat Jamalidinan, Kazem Cheshmi
arxiv.org/abs/2506.18062

@kubikpixel@chaos.social
2025-07-13 12:50:11

How can you tell that a supposed IT trade article is just clumsy marketing? Should I go through it bit by bit for you, or just leave a Kyber Byte here? 🙄😅🤷‍♂️
/s
»[…] What stands out about the newest mainframe model in IBM's Z series, though, is this: it encrypts all data in real time. "All" in this case really does mean every single bit.«
#ibm

@fortune@social.linux.pizza
2025-07-10 22:00:02

BYTE editors are people who separate the wheat from the chaff, and then
carefully print the chaff.

@fanf@mendeddrum.org
2025-07-11 08:42:04

from my link log —
A Mind Is Born: 256 byte Commodore 64 demo.
linusakesson.net/scene/a-mind-
saved 2021-03-19

@maxheadroom@hub.uckermark.social
2025-06-14 09:36:55

@… somehow booking subscriptions at Heise isn't working in Safari on macOS right now. It always triggers a download of a 0-byte file ... It also happens in Safari on iOS when I merely try to log in.

@kcase@mastodon.social
2025-06-06 14:41:32

It seems to me that MCP is a modern, cross-platform corollary to the Mac ecosystem’s AppleScript dictionaries. It's a great standard for discovering API endpoints and calling them in a standard way. (And as a bonus, it doesn't involve keeping track of four-byte codes.)
Because it's associated with the AI buzz, lots of developers are integrating it. But it's not limited to and doesn't have to be used with AI; there's a great opportunity to make it easy for humans…

@alejandrobdn@social.linux.pizza
2025-08-05 16:27:59

AWS deleted my 10-year account and all data without warning
"On July 23, 2025, AWS deleted my 10-year-old account and every byte of data I had stored with them. No warning. No grace period. No recovery options. Just complete digital annihilation".
seuros.com/blog/aws…

@deprogrammaticaipsum@mas.to
2025-06-02 17:49:30

"The RDBMS field is so young, we can actually see it grow through the pages of computer magazines of the 1980s. BYTE Magazine had its first issue dedicated to databases in November 1981, and then another one in October 1984. Dr. Dobb’s Journal did not feature an article about databases until 1984 and did not have many more throughout the decade; actually most of them were authored by Gene Head, and talk about dBASE."

@arXiv_csRO_bot@mastoxiv.page
2025-06-09 08:29:02

BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning
Hongyi Zhou, Weiran Liao, Xi Huang, Yucheng Tang, Fabian Otto, Xiaogang Jia, Xinkai Jiang, Simon Hilber, Ge Li, Qian Wang, Ömer Erdinç Yağmurlu, Nils Blank, Moritz Reuss, Rudolf Lioutikov
arxiv.org/abs/2506.06072

@arXiv_qfinCP_bot@mastoxiv.page
2025-08-05 08:39:10

ByteGen: A Tokenizer-Free Generative Model for Orderbook Events in Byte Space
Yang Li, Zhi Chen
arxiv.org/abs/2508.02247 arxiv.org/pdf/2508…

@arXiv_csCL_bot@mastoxiv.page
2025-07-22 12:23:40

Supernova: Achieving More with Less in Transformer Architectures
Andrei-Valentin Tanase, Elena Pelican
arxiv.org/abs/2507.15773

@fanf@mendeddrum.org
2025-06-10 20:42:03

from my link log —
Histogramming bytes with positional popcount, GF2P8AFFINEQB edition.
bitmath.blogspot.com/2024/11/h
saved 2024-11-10

@arXiv_csAR_bot@mastoxiv.page
2025-08-18 07:32:50

OpenCXD: An Open Real-Device-Guided Hybrid Evaluation Framework for CXL-SSDs
Hyunsun Chung, Junhyeok Park, Taewan Noh, Seonghoon Ahn, Kihwan Kim, Ming Zhao, Youngjae Kim
arxiv.org/abs/2508.11477

@stiefkind@mastodon.social
2025-08-08 05:07:58

Dear Fediverse! Being German, I'm quite familiar with German computer magazines. But not with the UK and US world. What were the “professional level” computer magazines in ca. 1980-2010 which wrote about enterprise IT, Unix, supercomputers, hardware? I'm NOT (yet) interested in software engineering or programming languages and also not very much in Linux.
I already know BYTE magazine, Unix Review and several 8bit related magazines. What else?

@arXiv_qfinTR_bot@mastoxiv.page
2025-08-08 12:54:05

Replaced article(s) found for q-fin.TR. arxiv.org/list/q-fin.TR/new
[1/1]:
- ByteGen: A Tokenizer-Free Generative Model for Orderbook Events in Byte Space
Yang Li, Zhi Chen

@arXiv_csCL_bot@mastoxiv.page
2025-08-21 09:32:50

Tokens with Meaning: A Hybrid Tokenization Approach for NLP
M. Ali Bayram, Ali Arda Fincan, Ahmet Semih Gümüş, Sercan Karakaş, Banu Diri, Savaş Yıldırım, Demircan Çelik
arxiv.org/abs/2508.14292

@fanf@mendeddrum.org
2025-06-09 20:42:03

from my link log —
Alan Kay did not invent object-oriented programming.
hillelwayne.com/post/alan-kay/
saved 2025-05-11

@arXiv_csCL_bot@mastoxiv.page
2025-06-18 09:16:00

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
Mathurin Videau, Badr Youbi Idrissi, Alessandro Leite, Marc Schoenauer, Olivier Teytaud, David Lopez-Paz
arxiv.org/abs/2506.14761

@arXiv_csLG_bot@mastoxiv.page
2025-07-11 10:23:11

Dynamic Chunking for End-to-End Hierarchical Sequence Modeling
Sukjun Hwang, Brandon Wang, Albert Gu
arxiv.org/abs/2507.07955 arxiv.org/pdf/2507.07955 arxiv.org/html/2507.07955
arXiv:2507.07955v1 Announce Type: new
Abstract: Despite incredible progress in language models (LMs) in recent years, largely resulting from moving away from specialized models designed for specific tasks to general models based on powerful architectures (e.g. the Transformer) that learn everything from raw data, pre-processing steps such as tokenization remain a barrier to true end-to-end foundation models. We introduce a collection of new techniques that enable a dynamic chunking mechanism which automatically learns content- and context-dependent segmentation strategies learned jointly with the rest of the model. Incorporating this into an explicit hierarchical network (H-Net) allows replacing the (implicitly hierarchical) tokenization-LM-detokenization pipeline with a single model learned fully end-to-end. When compute- and data-matched, an H-Net with one stage of hierarchy operating at the byte level outperforms a strong Transformer language model operating over BPE tokens. Iterating the hierarchy to multiple stages further increases its performance by modeling multiple levels of abstraction, demonstrating significantly better scaling with data and matching a token-based Transformer of twice its size. H-Nets pretrained on English show significantly increased character-level robustness, and qualitatively learn meaningful data-dependent chunking strategies without any heuristics or explicit supervision. Finally, the H-Net's improvement over tokenized pipelines is further increased in languages and modalities with weaker tokenization heuristics, such as Chinese and code, or DNA sequences (nearly 4x improvement in data efficiency over baselines), showing the potential of true end-to-end models that learn and scale better from unprocessed data.

@arXiv_csAR_bot@mastoxiv.page
2025-06-12 07:17:10

Exploiting Control-flow Enforcement Technology for Sound and Precise Static Binary Disassembly
Brian Zhao, Yiwei Yang, Yusheng Zheng, Andi Quinn
arxiv.org/abs/2506.09426

@arXiv_qfinCP_bot@mastoxiv.page
2025-08-08 12:53:18

Replaced article(s) found for q-fin.CP. arxiv.org/list/q-fin.CP/new
[1/1]:
- ByteGen: A Tokenizer-Free Generative Model for Orderbook Events in Byte Space
Yang Li, Zhi Chen

@arXiv_csCL_bot@mastoxiv.page
2025-08-08 10:04:32

H-Net : Hierarchical Dynamic Chunking for Tokenizer-Free Language Modelling in Morphologically-Rich Languages
Mehrdad Zakershahrak, Samira Ghodratnama
arxiv.org/abs/2508.05628

@arXiv_csAR_bot@mastoxiv.page
2025-08-06 07:32:20

Towards Memory Specialization: A Case for Long-Term and Short-Term RAM
Peijing Li, Muhammad Shahir Abdurraman, Rachel Cleaveland, Sergey Legtchenko, Philip Levis, Ioan Stefanovici, Thierry Tambe, David Tennenhouse, Caroline Trippel
arxiv.org/abs/2508.02992