Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 09:40:20

SpiderNets: Estimating Fear Ratings of Spider-Related Images with Vision Models
Dominik Pegler, David Steyrl, Mengfan Zhang, Alexander Karner, Jozsef Arato, Frank Scharnowski, Filip Melinscak
arxiv.org/abs/2509.04889

@arXiv_csCL_bot@mastoxiv.page
2025-09-09 12:06:12

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML
Haoyu Dong, Pengkun Zhang, Mingzhe Lu, Yanzhen Shen, Guolin Ke
arxiv.org/abs/2509.06806

@arXiv_csMM_bot@mastoxiv.page
2025-08-08 08:48:12

JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering
Renmiao Chen, Shiyao Cui, Xuancheng Huang, Chengwei Pan, Victor Shea-Jay Huang, QingLin Zhang, Xuan Ouyang, Zhexin Zhang, Hongning Wang, Minlie Huang
arxiv.org/abs/2508.05087

@rigo@mamot.fr
2025-08-06 09:53:48

France has just reached a major milestone for the supply of medicines. Pharmaceutical companies that want to discontinue production of essential medicines must ensure an alternative supply or hand over all rights to a public institution. Germany should definitely follow suit!

@arXiv_hepph_bot@mastoxiv.page
2025-10-06 09:43:19

Power corrections to the production of a prompt photon in association with a jet in the $N$-jettiness slicing scheme at NLO QCD
Prem Agarwal, Kirill Melnikov, Ivan Pedron
arxiv.org/abs/2510.03097

@arXiv_csSD_bot@mastoxiv.page
2025-08-05 09:42:20

From Contrast to Commonality: Audio Commonality Captioning for Enhanced Audio-Text Cross-modal Understanding in Multimodal LLMs
Yuhang Jia, Xu Zhang, Yong Qin
arxiv.org/abs/2508.01659

@arXiv_csCV_bot@mastoxiv.page
2025-09-05 10:25:31

Aesthetic Image Captioning with Saliency Enhanced MLLMs
Yilin Tao, Jiashui Huang, Huaze Xu, Ling Shao
arxiv.org/abs/2509.04378 arxiv.org/pd…

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 10:29:27

InfiMed-Foundation: Pioneering Advanced Multimodal Medical Models with Compute-Efficient Pre-Training and Multi-Stage Fine-Tuning
Guanghao Zhu, Zhitian Hou, Zeyu Liu, Zhijie Sang, Congkai Xie, Hongxia Yang
arxiv.org/abs/2509.22261

@arXiv_csCV_bot@mastoxiv.page
2025-08-04 10:10:21

Can Large Pretrained Depth Estimation Models Help With Image Dehazing?
Hongfei Zhang, Kun Zhou, Ruizheng Wu, Jiangbo Lu
arxiv.org/abs/2508.00698

@arXiv_csCV_bot@mastoxiv.page
2025-07-30 10:42:51

MetaCLIP 2: A Worldwide Scaling Recipe
Yung-Sung Chuang, Yang Li, Dong Wang, Ching-Feng Yeh, Kehan Lyu, Ramya Raghavendra, James Glass, Lifei Huang, Jason Weston, Luke Zettlemoyer, Xinlei Chen, Zhuang Liu, Saining Xie, Wen-tau Yih, Shang-Wen Li, Hu Xu
arxiv.org/abs/2507.22062