Tootfinder

Opt-in global Mastodon full text search. Join the index!

@netzschleuder@social.skewed.de
2025-06-15 21:00:04

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@netzschleuder@social.skewed.de
2025-06-15 04:00:05

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@netzschleuder@social.skewed.de
2025-06-08 02:00:05

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@netzschleuder@social.skewed.de
2025-07-04 13:00:04

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@netzschleuder@social.skewed.de
2025-07-30 22:00:04

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@netzschleuder@social.skewed.de
2025-07-30 23:00:04

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@netzschleuder@social.skewed.de
2025-06-18 22:00:05

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@netzschleuder@social.skewed.de
2025-06-18 11:00:04

wiki_science: Wikipedia Map of Science (2020)
A network of scientific fields, extracted from the English Wikipedia in early 2020. Nodes are wikipedia pages representing natural, formal, social and applied sciences, and two nodes are linked if the cosine similarity of the page content is above a threshold. See <s…

wiki_science: Wikipedia Map of Science (2020). 687 nodes, 6523 edges. https://networks.skewed.de/net/wiki_science
@tiotasram@kolektiva.social
2025-07-19 07:51:05

AI, AGI, and learning efficiency
My 4-month-old kid is not DDoSing Wikipedia right now, nor will they ever do so before learning to speak, read, or write. Their entire "training corpus" will not top even 100 million "tokens" before they can speak & understand language, and do so with real intentionally.
Just to emphasize that point: 100 words-per-minute times 60 minutes-per-hour times 12 hours-per-day times 365 days-per-year times 4 years is a mere 105,120,000 words. That's a ludicrously *high* estimate of words-per-minute and hours-per-day, and 4 years old (the age of my other kid) is well after basic speech capabilities are developed in many children, etc. More likely the available "training data" is at least 1 or 2 orders of magnitude less than this.
The point here is that large language models, trained as they are on multiple *billions* of tokens, are not developing their behavioral capabilities in a way that's remotely similar to humans, even if you believe those capabilities are similar (they are by certain very biased ways of measurement; they very much aren't by others). This idea that humans must be naturally good at acquiring language is an old one (see e.g. #AI #LLM #AGI