Tootfinder

Opt-in global Mastodon full text search. Join the index!

@pbloem@sigmoid.social
2025-09-03 19:14:40

The patient man's loss curve.

A loss curve that goes straight consistently for a long time and then suddenly plummets.
@pbloem@sigmoid.social
2025-07-27 15:10:34

Here's an odd effect (stumbled on by accident). The blue loss curve is from a well-tuned BERT baseline (from the "cramming"paper).
The only thing I changed for the orange is to put a residual connection around each transformer block and to multiply the output of the block by a scalar parameter initialized to 0.
I'm surprised that has such a substantial impact. Not just on the performance, but on the shape of the loss curve.

Two loss curves from a transformer training experiment. One, in blue, has a slight bump and a quick drop. The other, in orange, drops more directly, but ends up about 0.2 nats above the blue one.
@arXiv_hepth_bot@mastoxiv.page
2025-09-01 08:41:03

Probing the Black Hole Interior with Holographic Entanglement Entropy and the Role of AdS/BCFT Correspondence
Fabiano F. Santos
arxiv.org/abs/2508.21224

@arXiv_eessSY_bot@mastoxiv.page
2025-08-21 08:02:30

A Digital Twin-Based Simulation Framework for Safe Curve Speed Estimation Using Unity
Araf Rahman (Clemson University), M. Sabbir Salek (Clemson University), Mashrur Chowdhury (Clemson University), Wayne A. Sarasua (Clemson University)
arxiv.org/abs/2508.14046

@arXiv_astrophHE_bot@mastoxiv.page
2025-07-22 09:50:00

Diversity in Hydrogen-rich Envelope Mass of Type II Supernovae. (III). The mass-loss and evolutionary pathways of the red supergiant progenitors
Qiliang Fang, Takashi J. Moriya, Keiichi Maeda, Andris Dorozsmai, Javier Silva-Farf\'an
arxiv.org/abs/2507.14665

@arXiv_physicsinsdet_bot@mastoxiv.page
2025-08-12 10:25:33

AC Magnetometry Loop Tracer Compatible with Magnetic Calorimetry for Power Loss Analysis
Thomas Veile, Michael Harmel, Mathias Zambach, Philip Holm, Frederik L. Durhuus, Cathrine Frandsen
arxiv.org/abs/2508.07929