Tootfinder

Opt-in global Mastodon full text search. Join the index!

@EarthOrgUK@mastodon.energy
2025-10-14 09:51:03

On Website Technicals (2025-06) - Tech updates: Junited - Rigby to Buttersafe - GPTBot badness, captions, diversion delay, under-volt, X11 fossil. #Junited2025 - earth.org.uk/note-on-site-tech

@chris@mstdn.chrisalemany.ca
2025-09-13 15:11:58

Reading about Baldur von Schirach. Sounds familiar.
“In February 1928 he became a university group leader of the National Socialist German Students' League.”
“He worked to broaden the Nazi Party's appeal to the bourgeoisie. Schirach was supported by Hitler in internal elections, who also wanted the Nazi Party to have a broad social base.”
“Schirach was skilled at bureaucratic power struggles. He founded the School Children's Leagues (Schülerbünde) to create competition to the Hitler Youth. He made an ally of Joseph Goebbels.”
“Schirach was named national youth leader of the party in 1931.”
“With Heinrich Hoffmann, Schirach produced several propaganda books of Hoffmann's photographs, including "Hitler As No One Knows Him", "Youth Around Hitler", and "Hitler in His Mountains". Schirach wrote the captions. The books sold hundreds of thousands of copies, earning Schirach and Hoffmann substantial royalties.”
“On 16 June 1932, he was made Reichsführer of the Party's Hitler Youth organization, and resigned from the Student League. Under Schirach, the Hitler Youth stewarded NSDAP events, and 21 members died in 1932. Schirach described these deaths as "blood sacrifice" for propaganda purposes. One example was Herbert Norkus, a fifteen-year-old boy who was stabbed to death by Communists. In a 31 May 1932 speech, Schirach recounted Norkus's death and called for a "National Socialist dictatorship". Schirach gave a memorial speech on the third anniversary of Norkus's death in January 1935.”
#hitleryouth #fascism #theAmericanFascist

@whitequark@mastodon.social
2025-10-12 17:23:13

none of these words are in the bible

Make Meet calls with Google Meet
Important: Legacy calls upgrade to Meet calls, which have expanded features like live captions, in-call chat, stackable effects, cloud encryption, screen sharing and more.

As users move over to Meet calling, some legacy calling features are being upgraded. A few features, like Family Mode, Moments and Knock Knock, are no longer available.

To use the new calling experience, update your Meet app to the latest version.

When all parties in the call use the latest…
@yaya@jorts.horse
2025-10-10 10:16:55

:bighonk: mastodon.social/@closedcaption

@arXiv_csSD_bot@mastoxiv.page
2025-08-07 08:29:24

MiDashengLM: Efficient Audio Understanding with General Audio Captions
Heinrich Dinkel, Gang Li, Jizhong Liu, Jian Luan, Yadong Niu, Xingwei Sun, Tianzi Wang, Qiyang Xiao, Junbo Zhang, Jiahao Zhou
arxiv.org/abs/2508.03983

@arXiv_csHC_bot@mastoxiv.page
2025-08-28 09:49:21

CapTune: Adapting Non-Speech Captions With Anchored Generative Models
Jeremy Zhengqi Huang, Calu\~a de Lacerda Pataca, Liang-Yuan Wu, Dhruv Jain
arxiv.org/abs/2508.19971

@UP8@mastodon.social
2025-08-05 14:10:56

🤯 Interpretable EEG-to-Image Generation with Semantic Prompts
#eeg #ai

@Migurski@mastodon.social
2025-10-08 20:02:39

Auto-captioning at this tech conference is having a tough time with background room chitchat

Screen with an empty podium and a bunch of weird English gobbledygook in the captions
@davidaugust@mastodon.online
2025-08-06 17:55:39

#USpol

screenshot of a post by Thomas Massie @RepThomasMassie:   A meme featuring two panels with captions.   In the top panel, there is a scene from a movie showing a driver looking shocked inside a vehicle; caption reads: "Democrats leaving Texas to protect their district."  In the bottom panel, there is an image of speaker of the house johnson driving the other way, looking out from a vehicle; caption reads: "Republicans leaving D.C. to protect the Epstein files."  Aug 6, 2025 1:54pm UTC
@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:34:31

Addressing the ID-Matching Challenge in Long Video Captioning
Zhantao Yang, Huangji Wang, Ruili Feng, Han Zhang, Yuting Hu, Shangwen Zhu, Junyan Li, Yu Liu, Fan Cheng
arxiv.org/abs/2510.06973

@samvarma@fosstodon.org
2025-10-02 19:36:23

It is really calling watch a guy with a heavy German accent on a YouTube video and see that the automatically generated captions are basically perfect, and I can't dictate a single sentence in perfect English into my $1400 flagship device without making a correction
*galling
#iOS26

@grumpybozo@toad.social
2025-08-30 23:07:19

Meta meta meta...
WTF is with every video having word-flash captions? The one in this toot is an example of one of multiple constant-flux caption style. THAT'S NOT HOW PEOPLE READ!
I can barely watch such videos. journa.host/@lolgop/1151191352

@EarthOrgUK@mastodon.energy
2025-09-30 09:51:02

On Website Technicals (2020-02) - Tech updates: GSC Review annoyance, CSS dark mode, video captions, lazy loading, srcset issues. - earth.org.uk/note-on-site-tech

@aardrian@toot.cafe
2025-07-22 20:43:16

Reason #2608 I do not trust “AI” to generate captions or transcripts:
“Complete silence is always hallucinated as 'ترجمة نانسي قنقر' in Arabic which translates as 'Translation by Nancy Qunqar'”
More examples in replies.
#a11y #accessibility

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:49:01

MATRIX: Mask Track Alignment for Interaction-aware Video Generation
Siyoon Jin, Seongchan Kim, Dahyun Chung, Jaeho Lee, Hyunwook Choi, Jisu Nam, Jiyoung Kim, Seungryong Kim
arxiv.org/abs/2510.07310

@arXiv_csLG_bot@mastoxiv.page
2025-10-01 11:57:57

Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces
John Gkountouras, Ivan Titov
arxiv.org/abs/2509.26594 a…

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 09:40:52

BLUEX Revisited: Enhancing Benchmark Coverage with Automatic Captioning
Jo\~ao Guilherme Alves Santos, Giovana Kerche Bon\'as, Thales Sales Almeida
arxiv.org/abs/2508.21294

@matthiasott@mastodon.social
2025-09-18 13:46:52

Had an amazing time speaking about Web Design Engineering at @… Freiburg last week! 🎉 It was an honour to be invited and to meet so many wonderful people and good friends there – a truly smashing experience! Thank you, everyone! 🤗💚🎈
📸 Photos by @…

Matthias on stage at Smashing Conf Freiburg, talking to the audience, with a monitor behind me displaying live captions.
Me on stage, viewed from afar with a truckload of modern CSS properties and functions on the screen behind me.
Vitaly Friedman and I talking on a red sofa during the Q&A after the talk.
@seeingwithsound@mas.to
2025-09-21 15:26:21

(YouTube, Chinese w/o captions but graphical English subtitles) Blind patient treated with ZM-02 optogenetic gene therapy #RP

@arXiv_eessAS_bot@mastoxiv.page
2025-08-29 08:41:41

Sound event detection with audio-text models and heterogeneous temporal annotations
Manu Harju, Annamaria Mesaros
arxiv.org/abs/2508.20703

@arXiv_csCV_bot@mastoxiv.page
2025-07-29 12:16:11

Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions
Licai Sun, Xingxun Jiang, Haoyu Chen, Yante Li, Zheng Lian, Biu Liu, Yuan Zong, Wenming Zheng, Jukka M. Lepp\"anen, Guoying Zhao
arxiv.org/abs/2507.21015

@EarthOrgUK@mastodon.energy
2025-07-22 19:51:03

On Website Technicals (2025-06) - Tech updates: Junited - Rigby to Buttersafe - GPTBot badness, captions, diversion delay, under-volt, X11 fossil. #Junited2025 - earth.org.uk/note-on-site-tech

@arXiv_csCV_bot@mastoxiv.page
2025-07-28 10:14:11

LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences
Yusuke Hirota, Boyi Li, Ryo Hachiuma, Yueh-Hua Wu, Boris Ivanovic, Yuta Nakashima, Marco Pavone, Yejin Choi, Yu-Chiang Frank Wang, Chao-Han Huck Yang
arxiv.org/abs/2507.19362

@arXiv_csMM_bot@mastoxiv.page
2025-09-22 08:36:11

Jamendo-QA: A Large-Scale Music Question Answering Dataset
Junyoung Koh, Soo Yong Kim, Yongwon Choi, Gyu Hyeong Choi
arxiv.org/abs/2509.15662

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 10:03:49

One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework
Lorenzo Bianchi, Giacomo Pacini, Fabio Carrara, Nicola Messina, Giuseppe Amato, Fabrizio Falchi
arxiv.org/abs/2510.02898

@arXiv_csSD_bot@mastoxiv.page
2025-09-29 09:37:08

Text2Move: Text-to-moving sound generation via trajectory prediction and temporal alignment
Yunyi Liu, Shaofan Yang, Kai Li, Xu Li
arxiv.org/abs/2509.21919

@arXiv_csCL_bot@mastoxiv.page
2025-09-17 09:16:00

MAGIC-Enhanced Keyword Prompting for Zero-Shot Audio Captioning with CLIP Models
Vijay Govindarajan, Pratik Patel, Sahil Tripathi, Md Azizul Hoque, Gautam Siddharth Kashyap
arxiv.org/abs/2509.12591

@arXiv_eessAS_bot@mastoxiv.page
2025-09-19 09:28:31

Aligning Audio Captions with Human Preferences
Kartik Hegde, Rehana Mahfuz, Yinyi Guo, Erik Visser
arxiv.org/abs/2509.14659 arxiv.org/pdf/2…

@arXiv_csCV_bot@mastoxiv.page
2025-09-24 11:09:54

Long Story Short: Disentangling Compositionality and Long-Caption Understanding in VLMs
Israfel Salazar, Desmond Elliott, Yova Kementchedjhieva
arxiv.org/abs/2509.19207

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 10:55:51

JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation
Siheng Wan, Zhengtao Yao, Zhengdao Li, Junhao Dong, Yanshu Li, Yikai Li, Linshan Li, Haoyan Xu, Yijiang Li, Zhikang Dong, Huacan Wang, Jifeng Shen
arxiv.org/abs/2510.00974

@ubuntourist@mastodon.social
2025-09-18 21:33:32

From the Ministry of Truth:
#resist #authoritarianism #fascism #news

EDITORIAL CARTOON:

Official seals for the Department of Defense, the Department of Health
and Human Services and the Department of Justice.

CAPTIONS:

* Department of War

* Department of War on Science

* Department of War on Democrats

Signed: Bramhall'25 (NYDN)
@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:16:57

LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer
Song Fei, Tian Ye, Lujia Wang, Lei Zhu
arxiv.org/abs/2509.22414

@arXiv_eessAS_bot@mastoxiv.page
2025-07-24 08:00:59

Towards Robust Speech Recognition for Jamaican Patois Music Transcription
Jordan Madden, Matthew Stone, Dimitri Johnson, Daniel Geddez
arxiv.org/abs/2507.16834

@arXiv_csCV_bot@mastoxiv.page
2025-07-25 10:21:02

SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
Si-Woo Kim, MinJu Jeon, Ye-Chan Kim, Soeun Lee, Taewhan Kim, Dong-Jin Kim
arxiv.org/abs/2507.18616

@arXiv_csCV_bot@mastoxiv.page
2025-07-23 10:31:22

Enhancing Remote Sensing Vision-Language Models Through MLLM and LLM-Based High-Quality Image-Text Dataset Generation
Yiguo He, Junjie Zhu, Yiying Li, Xiaoyu Zhang, Chunping Qiu, Jun Wang, Qiangjuan Huang, Ke Yang
arxiv.org/abs/2507.16716