~www_lesswrong_com | Bookmarks (682)
-
Re Hanson's Grabby Aliens: Humanity is not a natural anthropic sample space — LessWrong
Published on December 9, 2024 6:07 PM GMTI, Lorec, am disoriented by neither the Fermi Paradox...
-
Zen and The Art of Semiconductor Manufacturing — LessWrong
Published on December 9, 2024 5:19 PM GMTI. BEGINNINGIn the beginning was the Sand.And in the...
-
A toy evaluation of inference code tampering — LessWrong
Published on December 9, 2024 5:43 PM GMTWork done with James Faina, Evan Hubinger and Ethan...
-
Childhood and Education Roundup #7 — LessWrong
Published on December 9, 2024 1:10 PM GMTSince it’s been so long, I’m splitting this roundup...
-
Refuting Searle’s wall, Putnam’s rock, and Johnson’s popcorn — LessWrong
Published on December 9, 2024 8:24 AM GMTIn a recent essay, Euan McLean suggested that a cluster...
-
The first AGI may be a good engineer but bad strategist — LessWrong
Published on December 9, 2024 6:34 AM GMTAGI may have an advantage in engineering, but humans...
-
Keeping self-replicating nanobots in check — LessWrong
Published on December 9, 2024 5:25 AM GMTThis is a random unimportant idea to prevent a...
-
Cognitive Processes — LessWrong
Published on December 9, 2024 5:10 AM GMTThere is a cognitive process going on that can...
-
Subskills of "Listening to Wisdom" — LessWrong
Published on December 9, 2024 3:01 AM GMTA fool learns from their own mistakesThe wise learn...
-
Cognitive Work and AI Safety: A Thermodynamic Perspective — LessWrong
Published on December 8, 2024 9:42 PM GMTIntroduces the idea of cognitive work as a parallel...
-
Intricacies of Feature Geometry in Large Language Models — LessWrong
Published on December 7, 2024 6:10 PM GMTNote: This is a more fleshed-out version of this...
-
The Way According To Zvi — LessWrong
Published on December 7, 2024 5:35 PM GMTZvi Mowshowitz is an influential figure in the Rationalist...
-
Deep Learning is cheap Solomonoff induction? — LessWrong
Published on December 7, 2024 11:00 AM GMTBackground Lucius: I recently held a small talk presenting an...
-
minifest — LessWrong
Published on December 7, 2024 3:50 AM GMTA cozy one-day festival celebrating prediction markets, blogging, economics,...
-
Mask and Respirator Intelligibility Comparison — LessWrong
Published on December 7, 2024 3:20 AM GMT One of the downsides of wearing a mask...
-
Purging Corrupted Capabilities across Language Models — LessWrong
Published on December 6, 2024 10:56 PM GMTby Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash, Amirali Abdullah This...
-
Gradient Routing: Masking Gradients to Localize Computation in Neural Networks — LessWrong
Published on December 6, 2024 10:19 PM GMTWe present gradient routing, a way of controlling where...
-
Understanding Shapley Values with Venn Diagrams — LessWrong
Published on December 6, 2024 9:56 PM GMTDiscuss
-
Model Integrity — LessWrong
Published on December 6, 2024 9:28 PM GMTHi! My collaborators at the Meaning Alignment Institute put...
-
Can AI improve the current state of molecular simulation? — LessWrong
Published on December 6, 2024 8:22 PM GMTHey LW! I recently filmed a two-hour long scientific...
-
Experiments are in the territory, results are in the map — LessWrong
Published on December 6, 2024 3:44 PM GMTI recently read Thomas Kuhn's book The Structure of...
-
A car journey with conservative evangelicals - Understanding some British political-religious beliefs — LessWrong
Published on December 6, 2024 11:22 AM GMTI’m heading home from a family wedding this weekend....
-
Frontier Models are Capable of In-context Scheming — LessWrong
Published on December 5, 2024 10:11 PM GMTThis is a brief summary of what we believe...
-
Expevolu, a laissez-faire approach to country creation — LessWrong
Published on December 5, 2024 7:29 PM GMTI write this post to present expevolu[1], a system...