2024-04-08 06:48:21
Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo
Barkavi Sundararajan, Somayajulu Sripada, Ehud Reiter
https://arxiv.org/abs/2404.04103
Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo
Barkavi Sundararajan, Somayajulu Sripada, Ehud Reiter
https://arxiv.org/abs/2404.04103
This judge is a fascist in the pocket of Trump. She should be removed from this case and impeached. https://mastodon.social/@JoshuaHolland/112400339850273355
That is a heck of a quote. Simple. Factual. Poignant. Powerful. True.
From: @…
https://press.coop/@nytimes/112055921492425764
This https://arxiv.org/abs/2309.03685 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
Anthropic announces Claude 3 Opus, Sonnet, and Haiku, aiming to improve factual accuracy; Opus and Sonnet are available now, and Haiku in the coming weeks (Rachel Metz/Bloomberg)
https://www.bloomberg.com/news/articles/20
FaaF: Facts as a Function for the evaluation of RAG systems
Vasileios Katranidis, Gabor Barany
https://arxiv.org/abs/2403.03888 https://
This https://arxiv.org/abs/2403.01432 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2311.16842 has been replaced.
link: https://scholar.google.com/scholar?q=a
PRobELM: Plausibility Ranking Evaluation for Language Models
Zhangdie Yuan, Chenxi Whitehouse, Eric Chamoun, Rami Aly, Andreas Vlachos
https://arxiv.org/abs/2404.03818
AI-generated biographies, often riddled with gross grammatical and factual errors, are appearing for sale online soon after the death of well-known people (Elizabeth A. Harris/New York Times)
https://www.nytimes.com/2024/02/18/books/ai-books-biographies.html
The Oz government’s digital ID bill has become a lightning rod for conspiracy theories and fearmongering, despite it being “world’s best practice” and “completely sane,” privacy and security experts say.
Several Coalition and crossbench politicians claimed the bill was an attempt to introduce a Chinese Communist Party-style social credit score and a measure to control the public, despite no evidence or factual basis, after it passed the Senate.
Discovered this because the author wrote a factual piece for Labour List - "Best Labour leaders we never had: Anthony Greenwood"
https://labourlist.org/2023/08/best-labour-leaders-we-never-had-anthony-greenwood/
This https://arxiv.org/abs/2311.16842 has been replaced.
link: https://scholar.google.com/scholar?q=a
Self-Consistent Decoding for More Factual Open Responses
Christopher Malon, Xiaodan Zhu
https://arxiv.org/abs/2403.00696 https://arxi…
I don't know if you've read this yet (not the book, the review) but it is an absolute masterpiece of how to deliver a completely devastating, but cordial and factual, takedown. Honestly, the last takedown I saw this powerful was Susan Fowler's blog post on Uber.
On Truthful Item-Acquiring Mechanisms for Reward Maximization
Liang Shan, Shuo Zhang, Jie Zhang, Zihe Wang
https://arxiv.org/abs/2402.14540 https://…
Een mooi overzichtsartikel met de feiten op een rijtje als het gaat om de oorsprong van het coronavirus. #labLeak
https://www.commondreams.org/opinion/covid-19…
A look at the US DOJ's lawsuit against Apple, comparisons to Microsoft suit in the 90's, basic factual errors, why it is clearly a political case, and more (Steven Sinofsky/Hardcore Software)
https://hardcoresoftware.learningbyshipping…
This https://arxiv.org/abs/2307.12371 has been replaced.
link: https://scholar.google.com/scholar?q=a
A good read from a smart guy.
Worth reading just to get factual scoop about QUOTE
"information wants to be free"
https://medium.com/@johnmarkoff/introduction-28fd6f7df6e0l
This https://arxiv.org/abs/2301.10389 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
This https://arxiv.org/abs/2306.15328 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2403.01548 has been replaced.
link: https://scholar.google.com/scholar?q=a
And here is the trigger event, I reckon. The opinion writer we believe is basically a sock puppet for the media proprietor is back – and making some plainly false factual claims.
The easiest to prove: that in investigating their identity, News24 did not provide them with an opportunity to comment.
That is the work of a moment to take before the Press Ombud, at which point Independent Media has to choose between apologising or storming out again.
https://web.archive.org/web/20240315072705/https://www.iol.co.za/sundayindependent/analysis/debunking-the-lies-a-response-to-news24s-special-projects-propaganda-779c9b95-930a-4b70-b845-f710f63c897a
Google so badly wants to disown "Bard", it is ascribing it to competitors.
Me: What are the names of your top competitor offerings?
Gemini: [1]
Me: Are you sure about # 3?
Gemini: [2, yup it's all on me 🤦♂️]
https://gemini.google.com/share/c24cc7
This https://arxiv.org/abs/2312.08976 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
About 50 NPR staffers sign a letter to NPR leadership calling for a public rebuke of the "factual inaccuracies and elisions" in Uri Berliner's essay (Ben Mullin/@benmullin)
https://twitter.com/benmullin/status/1780635242170028359
FLAME: Factuality-Aware Alignment for Large Language Models
Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen
https://arxiv.org/abs/2405.01525
This https://arxiv.org/abs/2403.18802 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2310.14894 has been replaced.
link: https://scholar.google.com/scholar?q=a
Dataset Artefacts are the Hidden Drivers of the Declining Disruptiveness in Science
Vincent Holst, Andres Algaba, Floriano Tori, Sylvia Wenmackers, Vincent Ginis
https://arxiv.org/abs/2402.14583
And here is the trigger event, I reckon. The opinion writer we believe is basically a sock puppet for the media proprietor is back – and making some plainly false factual claims.
The easiest to prove: that in investigating their identity, News24 did not provide them with an opportunity to comment.
That is the work of a moment to take before the Press Ombud, at which point Independent Media has to choose between apologising or storming out again.
https://web.archive.org/web/20240315072705/https://www.iol.co.za/sundayindependent/analysis/debunking-the-lies-a-response-to-news24s-special-projects-propaganda-779c9b95-930a-4b70-b845-f710f63c897a
Filing: OpenAI rebuts Elon Musk's lawsuit, saying OpenAI didn't violate its founding agreement because "there is no founding agreement, or any agreement at all" (Rachel Metz/Bloomberg)
https://www.bloomberg.com/news/articles/20
Semantic Data for Humanities and Social Sciences (SDHSS): an Ecosystem of CIDOC CRM Extensions for Research Data Production and Reuse
Francesco BerettaLARHRA, LARHRA PHN
https://arxiv.org/abs/2402.07531
WikiFactDiff: A Large, Realistic, and Temporally Adaptable Dataset for Atomic Factual Knowledge Update in Causal Language Models
Hichem Ammar Khodja, Fr\'ed\'eric B\'echet, Quentin Brabant, Alexis Nasr, Gw\'enol\'e Lecorv\'e
https://arxiv.org/abs/2403.14364
This https://arxiv.org/abs/2402.18045 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
Analyizing the Conjunction Fallacy as a Fact
Tomas Veloz, Olha Sobetska
https://arxiv.org/abs/2402.13615 https://arxiv.org/pdf/2402.1…
This https://arxiv.org/abs/2402.18045 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2403.11169 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
HyperMono: A Monotonicity-aware Approach to Hyper-Relational Knowledge Representation
Zhiwei Hu, V\'ictor Guti\'errez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan
https://arxiv.org/abs/2404.09848
This https://arxiv.org/abs/2404.11184 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2402.13919 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2402.17097 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
Memory-Augmented Generative Adversarial Transformers
Stephan Raaijmakers, Roos Bakker, Anita Cremers, Roy de Kleijn, Tom Kouwenhoven, Tessa Verhoef
https://arxiv.org/abs/2402.19218
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao, Xuemei Dong, Wenyi Xu, Yunjun Gao, Bin Wei, Ying Zhang
https://arxiv.org/abs/2403.14374
SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
Jiuding Yang, Hui Liu, Weidong Guo, Zhuwei Rao, Yu Xu, Di Niu
https://arxiv.org/abs/2403.07557
SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
Jiuding Yang, Hui Liu, Weidong Guo, Zhuwei Rao, Yu Xu, Di Niu
https://arxiv.org/abs/2403.07557
Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge
Xin Zhao, Naoki Yoshinaga, Daisuke Oba
https://arxiv.org/abs/2403.05189 …
Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge
Xin Zhao, Naoki Yoshinaga, Daisuke Oba
https://arxiv.org/abs/2403.05189 …
FactCheck Editor: Multilingual Text Editor with End-to-End fact-checking
Vinay Setty
https://arxiv.org/abs/2404.19482 https://arxiv.org/pdf/2404.19482
arXiv:2404.19482v1 Announce Type: new
Abstract: We introduce 'FactCheck Editor', an advanced text editor designed to automate fact-checking and correct factual inaccuracies. Given the widespread issue of misinformation, often a result of unintentional mistakes by content creators, our tool aims to address this challenge. It supports over 90 languages and utilizes transformer models to assist humans in the labor-intensive process of fact verification. This demonstration showcases a complete workflow that detects text claims in need of verification, generates relevant search engine queries, and retrieves appropriate documents from the web. It employs Natural Language Inference (NLI) to predict the veracity of claims and uses LLMs to summarize the evidence and suggest textual revisions to correct any errors in the text. Additionally, the effectiveness of models used in claim detection and veracity assessment is evaluated across multiple languages.
UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language Models
Zhaoheng Huang, Zhicheng Dou, Yutao Zhu, Ji-rong Wen
https://arxiv.org/abs/2402.14690
Editing Knowledge Representation of Language Lodel via Rephrased Prefix Prompts
Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen
https://arxiv.org/abs/2403.14381
This https://arxiv.org/abs/2401.17809 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2404.05904 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2310.01045 has been replaced.
link: https://scholar.google.com/scholar?q=a
Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles
Andrea Zugarini, Kamyar Zeinalipour, Surya Sai Kadali, Marco Maggini, Marco Gori, Leonardo Rigutini
https://arxiv.org/abs/2404.06186