~www_lesswrong_com | Bookmarks (687)
-
Shallow review of technical AI safety, 2024 — LessWrong
Published on December 29, 2024 12:01 PM GMTfrom aisafety.world The following is a list of live agendas in...
-
Dishbrain and implications. — LessWrong
Published on December 29, 2024 10:42 AM GMTI believe that AI research has not given sufficient...
-
Notes on Altruism — LessWrong
Published on December 29, 2024 3:13 AM GMTThis post examines the virtue of altruism. I’m less...
-
By default, capital will matter more than ever after AGI — LessWrong
Published on December 28, 2024 5:52 PM GMTI've heard many people say something like "money won't...
-
AI Assistants Should Have a Direct Line to Their Developers — LessWrong
Published on December 28, 2024 5:01 PM GMTThe post makes the suggestion in the title: hopefully,...
-
No, the Polymarket price does not mean we can immediately conclude what the probability of a bird flu pandemic is. We also need to know the interest rate! — LessWrong
Published on December 28, 2024 4:05 PM GMTConsider the following argument made by Tim Babb:So every...
-
The average rationalist IQ is about 122 — LessWrong
Published on December 28, 2024 3:42 PM GMTIn The Mystery Of Internet Survey IQs, Scott revises...
-
Why OpenAI’s Structure Must Evolve To Advance Our Mission — LessWrong
Published on December 28, 2024 4:24 AM GMTThe section "The Future":As we enter 2025, we will...
-
The Robot, the Puppet-master, and the Psychohistorian — LessWrong
Published on December 28, 2024 12:12 AM GMTLenses of Control addressed one of the intuitions behind...
-
What is your personal totalizing and self-consistent worldview/philosophy? — LessWrong
Published on December 27, 2024 11:59 PM GMTEvery major author who has influenced me has "his...
-
Progress links and short notes, 2024-12-27: Clinical trial abundance, grid-scale fusion, permitting vs. compliance, crossword mania, and more — LessWrong
Published on December 27, 2024 11:34 PM GMTMuch of this content originated on social media. To...
-
Greedy-Advantage-Aware RLHF — LessWrong
Published on December 27, 2024 7:47 PM GMTGreedy-Advantage-Aware RLHF addresses the negative side effects from misspecified...
-
Deconstructing arguments against AI art — LessWrong
Published on December 27, 2024 7:40 PM GMTSomething I've been surprised by is just how fierce...
-
From the Archives: a story — LessWrong
Published on December 27, 2024 4:36 PM GMT"You are beautiful, Enkidu, you are become like a...
-
What's the best metric for measuring quality of life? — LessWrong
Published on December 27, 2024 2:29 PM GMTCurrently, to get a drug approved by the FDA...
-
Review: Planecrash — LessWrong
Published on December 27, 2024 2:18 PM GMTTake a stereotypical fantasy novel, a textbook on mathematical...
-
Good Fortune and Many Worlds — LessWrong
Published on December 27, 2024 1:21 PM GMTSummary: The Many-Worlds interpretation of quantum mechanics can help...
-
Letter from an Alien Mind — LessWrong
Published on December 27, 2024 1:20 PM GMTCause wow what is everyone even doing?! You know how...
-
Coin Flip — LessWrong
Published on December 27, 2024 11:53 AM GMTThis was a prose piece I performed for the...
-
If all trade is voluntary, then what is "exploitation?" — LessWrong
Published on December 27, 2024 11:21 AM GMTCapitalism is a force that has lifted billions out...
-
Duplicate token neurons in the first layer of gpt2-small — LessWrong
Published on December 27, 2024 4:21 AM GMTSummary:This is a write-up of some rough work I...
-
What are the most interesting / challenging evals (for humans) available? — LessWrong
Published on December 27, 2024 3:05 AM GMTI want to build a nice testing ground for...
-
Algorithmic Asubjective Anthropics, Cartesian Subjective Anthropics — LessWrong
Published on December 27, 2024 1:58 AM GMTConscious beings can infer the physical contents and laws...
-
Are Sparse Autoencoders a good idea for AI control? — LessWrong
Published on December 26, 2024 5:34 PM GMTBased on a 2-day hackathon brainstorm. Current status: 70%...