2024-05-06 12:05:39
Someone said “Everything you know about Cinco de Mayo is wrong” but all I know is that it’s the 5th of May but damn even that is incorrect I guess!
Someone said “Everything you know about Cinco de Mayo is wrong” but all I know is that it’s the 5th of May but damn even that is incorrect I guess!
@… thanks for the boost, friend. Unfortunately, I don't think it will be of much use. The person speaks in broken English and is spreading incorrect or just straight useless information. This REALLY sucks that folks like you and I do so much on there to keep people knowledgeable and somebody just squanders an opportunity like that to do such a good thing!😩
@… As someone who is currently appealing an incorrect bill because they’re charging me as out-of-network for a provider that is clearly listed as in-network in their own documents… yeah.
This https://arxiv.org/abs/2404.02124 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
#Rant. There are mansplainers and there are mansplainers who mansplain to me confidently, in writing, the *definitions* of the topics of my expertise. Their definitions are incorrect or shaky and they don't cite their sources or provide a basis. #Mansplaining Ultra Pro Max.
Why do men wa…
Another day, another person barely caught the train on #Poznań Główny station, because of the useless platform numbers and confusing markings.
The "new" train station features two adjacent doors, leading to the same staircase. Above one of them, the label says "platform 3". Above the other, it says "platform 4". How this can be anything but confusing? The most logical (and incorrect) conclusion is that one track belongs to platform 3, while the other to platform 4.
#rail
This https://arxiv.org/abs/2309.04909 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
Determining the difference between local acceleration and local gravity: applications of the equivalence principle to relativistic trajectories
Steven A. Balbus
https://arxiv.org/abs/2403.03965
Optimal Denial-of-Service Attacks Against Status Updating
Saad Kriouile, Mohamad Assaad, Deniz G\"und\"uz, Touraj Soleymani
https://arxiv.org/abs/2403.04489
Modification to the Jeans criterion by external tides: Anisotropic fragmentation and formation of filaments
Guang-Xing Li
https://arxiv.org/abs/2403.02612 https://arxiv.org/pdf/2403.02612
arXiv:2403.02612v1 Announce Type: new
Abstract: The Jeans criterion sets the foundation of our understanding of gravitational collapse. Jog studied the fragmentation of gas under external tides and derived a dispersion relation $$
l' = l_{\rm Jeans} \frac{1} {(1 \lambda_0' / 4 \pi G \rho_0)^{1/2}} \;. $$ She further concludes that the Jeans mass is $m_{\rm incorrect}'=m_{\rm Jeans} ( 1/(1 \lambda_0' / 4 \pi G \rho_0)^{3/2})$. We clarify that due to the inhomogeneous nature of tides, this characteristic mass is incorrect. Under weak tides, the mass is $m \approx \rho\, l_1 l_2 l_3$, where the modifications to Jeans lengths along all three dimensions need to be considered; when the tide is strong enough, collapse can only occur once 1 or 2 dimensions. In the latter case, tides can stretch the gas, leading to the formation of filaments.
🔊 #NowPlaying on KEXP's #DriveTime
Ibibio Sound Machine:
🎵 Political Incorrect
#IbibioSoundMachine #newRelease 🆕 album
#Bandcamp
SSDRec: Self-Augmented Sequence Denoising for Sequential Recommendation
Chi Zhang, Qilong Han, Rui Chen, Xiangyu Zhao, Peng Tang, Hongtao Song
https://arxiv.org/abs/2403.04278
This https://arxiv.org/abs/2405.01461 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
CodeFort: Robust Training for Code Generation Models
Yuhao Zhang, Shiqi Wang, Haifeng Qian, Zijian Wang, Mingyue Shang, Linbo Liu, Sanjay Krishna Gouda, Baishakhi Ray, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras
https://arxiv.org/abs/2405.01567
We are at a stage in human history where huge trust is placed in statistical machines, that are unpredictable and easily manipulated, for important business – and personal – decisions.
Blown away when people respond to screenshots of failed responses with screenshots of successful responses, as if the tool has "learned" the correct response.
As if it can't just as easily provide an incorrect response again.
It's unsettling to me that people don't…
So, it turns out, ASP.NET Core applications without Open Telemetry still send a traceparent header, with the tracing flag off. It also turns out that an ASP.NET Core application that does use OTEL will not send traces if they receive a traceparent header with the tracing flag off. And it also turns out, that the documentation on how to change this behaviour, by using the OTEL_TRACES_SAMPLER environment variable, is incorrect. This variable is not used yet. Fun.
Can Small Language Models be Good Reasoners for Sequential Recommendation?
Yuling Wang, Changxin Tian, Binbin Hu, Yanhua Yu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, Liang Pang, Xiao Wang
https://arxiv.org/abs/2403.04260
Non-profit noyb files a GDPR complaint against OpenAI in Austria on behalf of an unnamed public figure, who found ChatGPT produced his incorrect birth date (Natasha Lomas/TechCrunch)
https://techcrunch.com/2024/04/28/chatgpt-gdpr-complaint-noyb/
@…
Lisa, at your discretion:
@… has been live-tweeting #Trump trial, in a very newbie-friendly, easy to understand way, especially if …
How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses
Jionghao Lin, Zifei Han, Danielle R. Thomas, Ashish Gurung, Shivang Gupta, Vincent Aleven, Kenneth R. Koedinger
https://arxiv.org/abs/2405.00970
Is it incorrect to want to play Sheryl Crow's song 'Soak up the sun' on April 8th
An efficient quantifier elimination procedure for Presburger arithmetic
Christoph Haase, Shankara Narayanan Krishna, Khushraj Madnani, Om Swostik Mishra, Georg Zetzsche
https://arxiv.org/abs/2405.01183
WitheredLeaf: Finding Entity-Inconsistency Bugs with LLMs
Hongbo Chen, Yifan Zhang, Xing Han, Huanyao Rong, Yuheng Zhang, Tianhao Mao, Hang Zhang, XiaoFeng Wang, Luyi Xing, Xun Chen
https://arxiv.org/abs/2405.01668
The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?
Alex Gu, Wen-Ding Li, Naman Jain, Theo X. Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama
https://arxiv.org/abs/2402.19475
This https://arxiv.org/abs/2402.02612 has been replaced.
link: https://scholar.google.com/scholar?q=a
So I noticed that some bots(?) have been adding foreign language info to my Wikidata entry. Neat!
Except that some of these seem to be incorrect. Most of the non-English ones list "researcher" (which, sure I guess?), but both the Mongolia and Dutch are translated by Google Translate as "scientist."
Anyone know what's up with this? Mistranslation? Error? Is Google Translate wrong? Is this showing up in anyone else's Wikidata pages?
Mongolian: эрд…
How refreshing to see someone actually get the story of Germany's nuclear phase-out right, instead of relying on the same old incorrect tribal tropes!
Why Germany ditched nuclear before coal – and why it won’t go back https://theconversation.com/why-german…
A new form of scam? Our website gets a manageable amount of spam submissions to our contact form. What’s new (to me) is that some of them claim to be individuals requesting refunds for incorrect charges by us. Which is impossible since we have no e-commerce. A spate of these came in about a month ago. Just an interesting pattern. Also, reCAPCHA isn’t blocking them.
Interesting, some empirical research on how GPT4 /ChatGPT performs in summarizing. Still the low rate of errors can be unacceptable in some contexts. As noted "Life-critical medical decisions should remain based on full, critical, and thoughtful evaluation of the full text of research articles in context with clinical guidelines.".
http…
Curiosity-driven Red-teaming for Large Language Models
Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James Glass, Akash Srivastava, Pulkit Agrawal
https://arxiv.org/abs/2402.19464
This https://arxiv.org/abs/2403.01548 has been replaced.
link: https://scholar.google.com/scholar?q=a
Anarchy in the APSP: Algorithm and Hardness for Incorrect Implementation of Floyd-Warshall
Jaehyun Koo
https://arxiv.org/abs/2404.08173 https://
"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust
Sunnie S. Y. Kim, Q. Vera Liao, Mihaela Vorvoreanu, Stephanie Ballard, Jennifer Wortman Vaughan
https://arxiv.org/abs/2405.00623
Happy π day!
To celebrate, let's look back at this 127-year-old bill that was passed in the Indiana House of Representatives which attempted to legislate a wildly incorrect solution to the squaring a circle problem and thereby legalize an incorrect value of π.
https://en.wikipedia.org/wiki/Indiana_
PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions
Xun Jiao, Fred Lin, Harish D. Dixit, Joel Coburn, Abhinav Pandey, Han Wang, Jianyu Huang, Venkat Ramesh, Wang Xu, Daniel Moore, Sriram Sankar
https://arxiv.org/abs/2405.01741
This https://arxiv.org/abs/2301.11751 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Constrained Decoding for Code Language Models via Efficient Left and Right Quotienting of Context-Sensitive Grammars
Daniel Melcer, Nathan Fulton, Sanjay Krishna Gouda, Haifeng Qian
https://arxiv.org/abs/2402.17988
This https://arxiv.org/abs/2404.01276 has been replaced.
link: https://scholar.google.com/scholar?q=a
No, really. LLMs are absolutely not fancy autocomplete. Really.
> In this case, the bug was in the step where the model chooses these numbers. Akin to being lost in translation, the model chose slightly wrong numbers, which produced word sequences that made no sense. More technically, inference kernels produced incorrect results when used in certain GPU configurations.
LCPS data van gisteren en vandaag is er nu wel, maar deze is incorrect/incompleet, dus voorlopig geen update.
Wat wel lijkt te kloppen: gisteren en vandaag 19 nieuwe opnames, dat is meer dan de vergelijkbare dagen vorige week.
#qp2t
This https://arxiv.org/abs/2402.00081 has been replaced.
link: https://scholar.google.com/scholar?q=a
It's quite annoying to see a new amazing tool be released and the vast majority using it in the most incorrect way possible, then going "tool is dumb, don't work, broken"....
Well, if you'd start using it in a way that actually makes fucking sense then it wouldn't be "dumb" because it can do the things it's meant to do, fairly well...
This https://arxiv.org/abs/2402.02612 has been replaced.
link: https://scholar.google.com/scholar?q=a
AT&T believes the February 22 outage was "caused by the application and execution of an incorrect process" as it was expanding its network, "not a cyber attack" (ABC News)
https://abcnews.go.com/US/att-outage-impacting…
Former Raiders CB Damon Arnette disputes methamphetamine arrest, claims he has prescription https://www.yardbarker.com/nfl/articles/former_raiders_cb_damon_arnette_disputes_methamphetami…
⛳ @… #Golf
Whoops!
Via PGA TOUR Communications @PGATOURComms
·
5m
Jordan Spieth has been disqualified from The Genesis Invitational for signing for an incorrect scorecard.
Spieth signed for a 3 and made a 4 on No. 4.
Hey hey all my #Krita artists, does anyone else who uses a Wacom notice the app being too sensitive when using the freehand selection / lasso tool?
It often seems to release the selection before I actually lift my pen off the tablet resulting in an incorrect selection. This is annoying for obvious reasons. Any way to fix this?
You know how some foods are labelled “no lactose” or “sugar free” — we need something similar for software services not using LLMs to provide (confabulated and often incorrect) answers to questions.
Revealed: the secret algorithm that controls the lives of Serco’s immigration detainees, The Guardian
'Imagine there’s a secret rating that dictates where you sleep and whether you are forced to wear handcuffs to a doctor’s appointment. And imagine that rating is based on incorrect information or unfair assumptions about the type of person you are.'
This https://arxiv.org/abs/2402.00081 has been replaced.
link: https://scholar.google.com/scholar?q=a
Time check after DST
If your timezone is incorrect now, please let me know. Thanks.
It was, 9PM EST, (6PM PT / 2AM GMT / 3AM CET / 1PM AEDT / 2PM NZST)
@…
GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion
Xueyi Liu, Li Yi
https://arxiv.org/abs/2402.14810 https:/…
This https://arxiv.org/abs/1902.09608 has been replaced.
link: https://scholar.google.com/scholar?q=a
Optimizing Portfolio Management and Risk Assessment in Digital Assets Using Deep Learning for Predictive Analysis
Qishuo Cheng, Le Yang, Jiajian Zheng, Miao Tian, Duan Xin
https://arxiv.org/abs/2402.15994 https://arxiv.org/pdf/2402.15994
arXiv:2402.15994v1 Announce Type: new
Abstract: Portfolio management issues have been extensively studied in the field of artificial intelligence in recent years, but existing deep learning-based quantitative trading methods have some areas where they could be improved. First of all, the prediction mode of stocks is singular; often, only one trading expert is trained by a model, and the trading decision is solely based on the prediction results of the model. Secondly, the data source used by the model is relatively simple, and only considers the data of the stock itself, ignoring the impact of the whole market risk on the stock. In this paper, the DQN algorithm is introduced into asset management portfolios in a novel and straightforward way, and the performance greatly exceeds the benchmark, which fully proves the effectiveness of the DRL algorithm in portfolio management. This also inspires us to consider the complexity of financial problems, and the use of algorithms should be fully combined with the problems to adapt. Finally, in this paper, the strategy is implemented by selecting the assets and actions with the largest Q value. Since different assets are trained separately as environments, there may be a phenomenon of Q value drift among different assets (different assets have different Q value distribution areas), which may easily lead to incorrect asset selection. Consider adding constraints so that the Q values of different assets share a Q value distribution to improve results.
I've published a detailed article about #ChatGPT hallucination minimization using predefined #SPARQL-based query templates scoped to Linked Open Data (LOD) Cloud Knowledge Graphs such as #UniProtKB…
Validating a lutetium frequency reference
Kyle J. Arnold, Scott Bustabad, Qin Qichen, Zhao Zhang, Qi Zhao, Murray D. Barrett
https://arxiv.org/abs/2404.16414 https://arxiv.org/pdf/2404.16414
arXiv:2404.16414v1 Announce Type: new
Abstract: We review our progress in developing a frequency reference with singly ionized lutetium and give estimates of the levels of inaccuracy we expect to achieve in the near future with both the $^1S_0\leftrightarrow{}^3D_1$ and $^1S_0\leftrightarrow{}^3D_2$ transitions. Based on established experimental results, we show that inaccuracies at the low $10^{-19}$ level are readily achievable for the $^1S_0\leftrightarrow{}^3D_1$ transition, and the frequency ratio between the two transitions is limited almost entirely by the BBR shift. We argue that the frequency ratio measured within the one apparatus provides a well-defined metric to compare and establish the performance of remotely located systems. For the measurement of an in situ frequency ratio, relativistic shifts drop out and both transitions experience the same electromagnetic environment. Consequently, the uncertainty budget for the ratio is practically identical to the uncertainty budgets for the individual transitions. If the ratios for two or more systems disagree we can be certain at least one of the clock assessments is incorrect. If they agree, subsequent comparisons on one transition would only differ by relativistic effects. Since motional effects are easily assessed and typically small for a heavy ion, only the differential gravitational red-shift will significantly contribute and this can be confirmed by comparison on the second transition.
Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking
Hong Jin Kang, Fabrice Harel-Canada, Muhammad Ali Gulzar, Violet Peng, Miryung Kim
https://arxiv.org/abs/2404.18881
Kinetic theory of vacuum pair production in uniform electric fields revisited
I. A. Aleksandrov, A. Kudlis, A. I. Klochai
https://arxiv.org/abs/2403.17204 …
This https://arxiv.org/abs/2307.13696 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_…
The number of times someone calls me by the obvious incorrect spelling of my last name is significant enough to mention; almost as many as call me by my father's [different] first name, despite most not even knowing him.
This https://arxiv.org/abs/2312.07159 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIT_…
Time-dependent Stellar Flare Models of Deep Atmospheric Heating
Adam F. Kowalski (University of Colorado, National Solar Observatory, Laboratory for Atmospheric and Space Physics), Joel C. Allred (NASA Goddard Space Flight Center), Mats Carlsson (Institute of Theoretical Astrophysics, University of Oslo, Rosseland Centre for Solar Physics, University of Oslo)
The Reddit user is skeptical about BitSight, considering it ineffective without providing details, and invites others to correct their assumption if incorrect. https://reddit.com/r/cybersecurity/comments/1bzaedv/
#Wordle 968 4/6*
⬜⬜🟨⬜⬜ <1% of 259,020 (209)
🟨🟨⬜🟨🟨 <1% of 904 (1)
🟩🟩🟩🟩⬜ 1% of 180 (1)
🟩🟩🟩🟩🟩
WordleBot
Skill 50/99
Luck 63/99
When I watched the letters turning green I was really thinking I got it, and then boom! no I did not. It made me really laugh as I looked to see what other word it could be!
Well huh, so dumb, that incorrect letter was one I had eliminated on my 1st guess!
This https://arxiv.org/abs/2402.17814 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_grqc_…
This https://arxiv.org/abs/2301.04863 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Finally tracked down why the broken status was broken - #Akkoma's media preview proxy defaults to on and ebay was having absolutely none of it when it tried to get information from there. Maybe it's a broken ebay URL but turning the proxy off made everything work and I can live with that rather than people being able to nerf my timeline by typing in incorrect URLs...
Max-Cut with $\epsilon$-Accurate Predictions
Vincent Cohen-Addad, Tommaso d'Orsi, Anupam Gupta, Euiwoong Lee, Debmalya Panigrahi
https://arxiv.org/abs/2402.18263
A Multimodal Handover Failure Detection Dataset and Baselines
Santosh Thoduka, Nico Hochgeschwender, Juergen Gall, Paul G. Pl\"oger
https://arxiv.org/abs/2402.18319
This https://arxiv.org/abs/2301.11751 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
When to Trust LLMs: Aligning Confidence with Response Quality
Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, Jinyang Gao, Huawei Shen, Bolin Ding
https://arxiv.org/abs/2404.17287
This https://arxiv.org/abs/2404.01276 has been replaced.
link: https://scholar.google.com/scholar?q=a
Validating a lutetium frequency reference
Kyle J. Arnold, Scott Bustabad, Qin Qichen, Zhao Zhang, Qi Zhao, Murray D. Barrett
https://arxiv.org/abs/2404.16414 https://arxiv.org/pdf/2404.16414
arXiv:2404.16414v1 Announce Type: new
Abstract: We review our progress in developing a frequency reference with singly ionized lutetium and give estimates of the levels of inaccuracy we expect to achieve in the near future with both the $^1S_0\leftrightarrow{}^3D_1$ and $^1S_0\leftrightarrow{}^3D_2$ transitions. Based on established experimental results, we show that inaccuracies at the low $10^{-19}$ level are readily achievable for the $^1S_0\leftrightarrow{}^3D_1$ transition, and the frequency ratio between the two transitions is limited almost entirely by the BBR shift. We argue that the frequency ratio measured within the one apparatus provides a well-defined metric to compare and establish the performance of remotely located systems. For the measurement of an in situ frequency ratio, relativistic shifts drop out and both transitions experience the same electromagnetic environment. Consequently, the uncertainty budget for the ratio is practically identical to the uncertainty budgets for the individual transitions. If the ratios for two or more systems disagree we can be certain at least one of the clock assessments is incorrect. If they agree, subsequent comparisons on one transition would only differ by relativistic effects. Since motional effects are easily assessed and typically small for a heavy ion, only the differential gravitational red-shift will significantly contribute and this can be confirmed by comparison on the second transition.
Visual Hallucinations of Multi-modal Large Language Models
Wen Huang, Hongbin Liu, Minxin Guo, Neil Zhenqiang Gong
https://arxiv.org/abs/2402.14683 https:/…
Kinetic theory of vacuum pair production in uniform electric fields revisited
I. A. Aleksandrov, A. Kudlis, A. I. Klochai
https://arxiv.org/abs/2403.17204 …
This https://arxiv.org/abs/2303.00202 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
This https://arxiv.org/abs/2401.11314 has been replaced.
link: https://scholar.google.com/scholar?q=a
Enhanced Bayesian Personalized Ranking for Robust Hard Negative Sampling in Recommender Systems
Kexin Shi, Jing Zhang, Linjiajie Fang, Wenjia Wang, Bingyi Jing
https://arxiv.org/abs/2403.19276
This https://arxiv.org/abs/2403.18346 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2312.04902 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
This https://arxiv.org/abs/2306.09541 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csHC_…
Comment on "Source of black bounces in Rastall gravity''
Manuel E. Rodrigues, Marcos V. de S. Silva
https://arxiv.org/abs/2402.17814 https://<…
This https://arxiv.org/abs/2403.04260 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
This https://arxiv.org/abs/2403.04745 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csRO_…
Comment on "Source of black bounces in Rastall gravity''
Manuel E. Rodrigues, Marcos V. de S. Silva
https://arxiv.org/abs/2402.17814 https://<…
This https://arxiv.org/abs/2404.10357 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2404.07572 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
This https://arxiv.org/abs/2403.10822 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
This https://arxiv.org/abs/2404.10357 has been replaced.
link: https://scholar.google.com/scholar?q=a
On (Mis)perceptions of testing effectiveness: an empirical study
Sira Vegas, Patricia Riofrio, Esperanza Marcos, Natalia Juristo
https://arxiv.org/abs/2402.07222
This https://arxiv.org/abs/2311.04205 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2402.10773 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
This https://arxiv.org/abs/2402.10773 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
This https://arxiv.org/abs/2404.02124 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
AIM: Automated Input Set Minimization for Metamorphic Security Testing
Nazanin Bayati Chaleshtari, Yoann Marquer, Fabrizio Pastore, Lionel C. Briand
https://arxiv.org/abs/2402.10773
AIM: Automated Input Set Minimization for Metamorphic Security Testing
Nazanin Bayati Chaleshtari, Yoann Marquer, Fabrizio Pastore, Lionel C. Briand
https://arxiv.org/abs/2402.10773