Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCL_bot@mastoxiv.page
2024-04-08 06:48:21

Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo
Barkavi Sundararajan, Somayajulu Sripada, Ehud Reiter
arxiv.org/abs/2404.04103

@jgkoomey@mastodon.energy
2024-05-07 15:53:18

This judge is a fascist in the pocket of Trump. She should be removed from this case and impeached. mastodon.social/@JoshuaHolland

@kennysmith@mstdn.social
2024-03-07 23:51:55

That is a heck of a quote. Simple. Factual. Poignant. Powerful. True.
From: @…
press.coop/@nytimes/1120559214

@arXiv_csAI_bot@mastoxiv.page
2024-03-07 08:23:58

This arxiv.org/abs/2309.03685 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@Techmeme@techhub.social
2024-03-04 14:10:22

Anthropic announces Claude 3 Opus, Sonnet, and Haiku, aiming to improve factual accuracy; Opus and Sonnet are available now, and Haiku in the coming weeks (Rachel Metz/Bloomberg)
bloomberg.com/news/articles/20

@arXiv_csCL_bot@mastoxiv.page
2024-03-07 06:50:58

FaaF: Facts as a Function for the evaluation of RAG systems
Vasileios Katranidis, Gabor Barany
arxiv.org/abs/2403.03888

@TedUnderwood@sigmoid.social
2024-03-29 13:45:57

How can we evaluate the factual accuracy of long answers from LLMs? Researchers from DeepMind / Stanford demonstrate a strategy that uses LLMs search to assess factuality: it's more accurate than human evaluation and 20x cheaper. h/t Marc Lanctot on Threads arxiv.org/abs/2403.18802

Illustration dramatizing a strategy that breaks a long answer into sentences, checks each sentence for relevance, and then uses search to check the relevant sentences for factual accuracy.
@arXiv_csCL_bot@mastoxiv.page
2024-03-08 08:29:41

This arxiv.org/abs/2403.01432 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csHC_bot@mastoxiv.page
2024-04-05 08:31:37

This arxiv.org/abs/2311.16842 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csCL_bot@mastoxiv.page
2024-04-08 06:47:54

PRobELM: Plausibility Ranking Evaluation for Language Models
Zhangdie Yuan, Chenxi Whitehouse, Eric Chamoun, Rami Aly, Andreas Vlachos
arxiv.org/abs/2404.03818

@Mediagazer@mstdn.social
2024-02-19 20:35:24

AI-generated biographies, often riddled with gross grammatical and factual errors, are appearing for sale online soon after the death of well-known people (Elizabeth A. Harris/New York Times)
nytimes.com/2024/02/18/books/a

@kcarruthers@mastodon.social
2024-03-30 02:30:25

The Oz government’s digital ID bill has become a lightning rod for conspiracy theories and fearmongering, despite it being “world’s best practice” and “completely sane,” privacy and security experts say.
Several Coalition and crossbench politicians claimed the bill was an attempt to introduce a Chinese Communist Party-style social credit score and a measure to control the public, despite no evidence or factual basis, after it passed the Senate.

Weird image dunno what it means probably just illustrative.
@ferrous@neurodifferent.me
2024-02-25 12:03:04

Discovered this because the author wrote a factual piece for Labour List - "Best Labour leaders we never had: Anthony Greenwood"
labourlist.org/2023/08/best-la

@arXiv_csHC_bot@mastoxiv.page
2024-04-05 08:31:37

This arxiv.org/abs/2311.16842 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csCL_bot@mastoxiv.page
2024-03-04 07:28:00

Self-Consistent Decoding for More Factual Open Responses
Christopher Malon, Xiaodan Zhu
arxiv.org/abs/2403.00696 arxi…

@codinghorror@infosec.exchange
2024-02-12 03:29:12

I don't know if you've read this yet (not the book, the review) but it is an absolute masterpiece of how to deliver a completely devastating, but cordial and factual, takedown. Honestly, the last takedown I saw this powerful was Susan Fowler's blog post on Uber.

@arXiv_csGT_bot@mastoxiv.page
2024-02-23 06:49:58

On Truthful Item-Acquiring Mechanisms for Reward Maximization
Liang Shan, Shuo Zhang, Jie Zhang, Zihe Wang
arxiv.org/abs/2402.14540

@roelgrif@mstdn.social
2024-03-17 09:48:07

Een mooi overzichtsartikel met de feiten op een rijtje als het gaat om de oorsprong van het coronavirus. #labLeak
commondreams.org/opinion/covid

@alsutton@snapp.social
2024-04-19 08:39:32

It really is a step backwards for most folk when #AI credentials mean more than factual accuracy.
A simple lookup table would have correctly shown that #Debian #Linux 10.x is called buster (trixie is the name of t…

@Techmeme@techhub.social
2024-03-23 15:20:39

A look at the US DOJ's lawsuit against Apple, comparisons to Microsoft suit in the 90's, basic factual errors, why it is clearly a political case, and more (Steven Sinofsky/Hardcore Software)
hardcoresoftware.learningbyshi

@arXiv_csCL_bot@mastoxiv.page
2024-05-06 08:26:20

This arxiv.org/abs/2307.12371 has been replaced.
link: scholar.google.com/scholar?q=a

@ronkjeffries@mastodon.social
2024-02-10 18:46:12

A good read from a smart guy.
Worth reading just to get factual scoop about QUOTE
"information wants to be free"
medium.com/@johnmarkoff/introd

@arXiv_csIR_bot@mastoxiv.page
2024-04-29 08:30:45

This arxiv.org/abs/2301.10389 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_statML_bot@mastoxiv.page
2024-03-28 08:46:13

This arxiv.org/abs/2306.15328 has been replaced.
initial toot: mastoxiv.page/@arXiv_sta…

@arXiv_csCL_bot@mastoxiv.page
2024-03-06 08:31:48

This arxiv.org/abs/2403.01548 has been replaced.
link: scholar.google.com/scholar?q=a

@phillipdewet@mastodonapp.uk
2024-03-15 07:58:42

And here is the trigger event, I reckon. The opinion writer we believe is basically a sock puppet for the media proprietor is back – and making some plainly false factual claims.
The easiest to prove: that in investigating their identity, News24 did not provide them with an opportunity to comment.
That is the work of a moment to take before the Press Ombud, at which point Independent Media has to choose between apologising or storming out again.
web.archive.org/web/2024031507

@smurthys@hachyderm.io
2024-02-14 17:15:02

Google so badly wants to disown "Bard", it is ascribing it to competitors.
Me: What are the names of your top competitor offerings?
Gemini: [1]
Me: Are you sure about # 3?
Gemini: [2, yup it's all on me 🤦‍♂️]
gemini.google.com/share/c24cc7

White text on black background, some parts underlined in red:

3. Bard (Microsoft): Powered by the GPT-4 technology, Bard integrates seamlessly with the Bing search engine and excels at generating different creative text formats like poems, code, scripts, musical pieces, email, letters, etc. Similar to ChatGPT, it might face challenges with factual accuracy and grounding in reality.
White text on black background with some parts underlined in red:

I apologize, I made a mistake in my previous statement. As of today, February 14, 2024, there is no large language model named Bard developed by Microsoft. You might be confusing it with Bing's search chatbot powered by GPT-4 technology, which utilizes a different name. I am still under development and learning, and I appreciate you pointing out this error.
@arXiv_csSE_bot@mastoxiv.page
2024-02-21 08:33:00

This arxiv.org/abs/2312.08976 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@Mediagazer@mstdn.social
2024-04-17 16:50:35

About 50 NPR staffers sign a letter to NPR leadership calling for a public rebuke of the "factual inaccuracies and elisions" in Uri Berliner's essay (Ben Mullin/@benmullin)
twitter.com/benmullin/status/1

@arXiv_csCL_bot@mastoxiv.page
2024-05-03 07:16:50

FLAME: Factuality-Aware Alignment for Large Language Models
Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen
arxiv.org/abs/2405.01525

@arXiv_csCL_bot@mastoxiv.page
2024-04-05 08:31:00

This arxiv.org/abs/2403.18802 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csAI_bot@mastoxiv.page
2024-02-12 08:29:33

This arxiv.org/abs/2310.14894 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csDL_bot@mastoxiv.page
2024-02-23 06:48:32

Dataset Artefacts are the Hidden Drivers of the Declining Disruptiveness in Science
Vincent Holst, Andres Algaba, Floriano Tori, Sylvia Wenmackers, Vincent Ginis
arxiv.org/abs/2402.14583

@phillipdewet@mastodonapp.uk
2024-03-15 07:58:42

And here is the trigger event, I reckon. The opinion writer we believe is basically a sock puppet for the media proprietor is back – and making some plainly false factual claims.
The easiest to prove: that in investigating their identity, News24 did not provide them with an opportunity to comment.
That is the work of a moment to take before the Press Ombud, at which point Independent Media has to choose between apologising or storming out again.
web.archive.org/web/2024031507

@Techmeme@techhub.social
2024-03-11 22:15:58

Filing: OpenAI rebuts Elon Musk's lawsuit, saying OpenAI didn't violate its founding agreement because "there is no founding agreement, or any agreement at all" (Rachel Metz/Bloomberg)
bloomberg.com/news/articles/20

@arXiv_csIT_bot@mastoxiv.page
2024-02-13 13:29:52

Semantic Data for Humanities and Social Sciences (SDHSS): an Ecosystem of CIDOC CRM Extensions for Research Data Production and Reuse
Francesco BerettaLARHRA, LARHRA PHN
arxiv.org/abs/2402.07531

@arXiv_csCL_bot@mastoxiv.page
2024-03-22 06:55:27

WikiFactDiff: A Large, Realistic, and Temporally Adaptable Dataset for Atomic Factual Knowledge Update in Causal Language Models
Hichem Ammar Khodja, Fr\'ed\'eric B\'echet, Quentin Brabant, Alexis Nasr, Gw\'enol\'e Lecorv\'e
arxiv.org/abs/2403.14364

@arXiv_csCL_bot@mastoxiv.page
2024-03-04 08:30:49

This arxiv.org/abs/2402.18045 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csAI_bot@mastoxiv.page
2024-02-22 06:46:46

Analyizing the Conjunction Fallacy as a Fact
Tomas Veloz, Olha Sobetska
arxiv.org/abs/2402.13615 arxiv.org/pdf/2402.1…

@arXiv_csCL_bot@mastoxiv.page
2024-03-04 08:30:49

This arxiv.org/abs/2402.18045 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csCL_bot@mastoxiv.page
2024-05-02 08:26:47

This arxiv.org/abs/2403.11169 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csAI_bot@mastoxiv.page
2024-04-16 06:46:48

HyperMono: A Monotonicity-aware Approach to Hyper-Relational Knowledge Representation
Zhiwei Hu, V\'ictor Guti\'errez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan
arxiv.org/abs/2404.09848

@arXiv_csCL_bot@mastoxiv.page
2024-04-19 08:29:43

This arxiv.org/abs/2404.11184 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csCL_bot@mastoxiv.page
2024-04-19 08:29:08

This arxiv.org/abs/2402.13919 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csCL_bot@mastoxiv.page
2024-04-15 08:30:26

This arxiv.org/abs/2402.17097 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csCL_bot@mastoxiv.page
2024-03-01 06:53:33

Memory-Augmented Generative Adversarial Transformers
Stephan Raaijmakers, Roos Bakker, Anita Cremers, Roy de Kleijn, Tom Kouwenhoven, Tessa Verhoef
arxiv.org/abs/2402.19218

@arXiv_csCL_bot@mastoxiv.page
2024-03-22 06:55:29

FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao, Xuemei Dong, Wenyi Xu, Yunjun Gao, Bin Wei, Ying Zhang
arxiv.org/abs/2403.14374

@arXiv_csCL_bot@mastoxiv.page
2024-03-13 06:48:22

SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
Jiuding Yang, Hui Liu, Weidong Guo, Zhuwei Rao, Yu Xu, Di Niu
arxiv.org/abs/2403.07557

@arXiv_csCL_bot@mastoxiv.page
2024-03-13 06:48:22

SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
Jiuding Yang, Hui Liu, Weidong Guo, Zhuwei Rao, Yu Xu, Di Niu
arxiv.org/abs/2403.07557

@arXiv_csCL_bot@mastoxiv.page
2024-03-11 07:18:43

Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge
Xin Zhao, Naoki Yoshinaga, Daisuke Oba
arxiv.org/abs/2403.05189

@arXiv_csCL_bot@mastoxiv.page
2024-03-11 07:18:43

Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge
Xin Zhao, Naoki Yoshinaga, Daisuke Oba
arxiv.org/abs/2403.05189

@arXiv_csCL_bot@mastoxiv.page
2024-05-01 06:48:51

FactCheck Editor: Multilingual Text Editor with End-to-End fact-checking
Vinay Setty
arxiv.org/abs/2404.19482 arxiv.org/pdf/2404.19482
arXiv:2404.19482v1 Announce Type: new
Abstract: We introduce 'FactCheck Editor', an advanced text editor designed to automate fact-checking and correct factual inaccuracies. Given the widespread issue of misinformation, often a result of unintentional mistakes by content creators, our tool aims to address this challenge. It supports over 90 languages and utilizes transformer models to assist humans in the labor-intensive process of fact verification. This demonstration showcases a complete workflow that detects text claims in need of verification, generates relevant search engine queries, and retrieves appropriate documents from the web. It employs Natural Language Inference (NLI) to predict the veracity of claims and uses LLMs to summarize the evidence and suggest textual revisions to correct any errors in the text. Additionally, the effectiveness of models used in claim detection and veracity assessment is evaluated across multiple languages.

@arXiv_csCL_bot@mastoxiv.page
2024-02-23 06:56:30

UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language Models
Zhaoheng Huang, Zhicheng Dou, Yutao Zhu, Ji-rong Wen
arxiv.org/abs/2402.14690

@arXiv_csCL_bot@mastoxiv.page
2024-03-22 06:55:30

Editing Knowledge Representation of Language Lodel via Rephrased Prefix Prompts
Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen
arxiv.org/abs/2403.14381

@arXiv_csCL_bot@mastoxiv.page
2024-02-16 08:30:13

This arxiv.org/abs/2401.17809 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csCL_bot@mastoxiv.page
2024-04-18 08:35:48

This arxiv.org/abs/2404.05904 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCL_…

@arXiv_csCL_bot@mastoxiv.page
2024-02-13 14:32:39

This arxiv.org/abs/2310.01045 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csCL_bot@mastoxiv.page
2024-04-10 06:48:55

Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles
Andrea Zugarini, Kamyar Zeinalipour, Surya Sai Kadali, Marco Maggini, Marco Gori, Leonardo Rigutini
arxiv.org/abs/2404.06186