~www_lesswrong_com | Bookmarks (682)
-
Introducing the Evidence Color Wheel — LessWrong
Published on December 14, 2024 4:08 PM GMTVersion 5.1.0December 12, 2024IntroductionWeighing evidence is hard. Intuition is...
-
An Illustrated Summary of "Robust Agents Learn Causal World Model" — LessWrong
Published on December 14, 2024 3:02 PM GMTThis post was written during Alex Altair's agent foundations...
-
Best-of-N Jailbreaking — LessWrong
Published on December 14, 2024 4:58 AM GMTThis is a linkpost for a new research paper...
-
D&D.Sci Dungeonbuilding: the Dungeon Tournament — LessWrong
Published on December 14, 2024 4:30 AM GMTThis is an entry in the 'Dungeons & Data...
-
Creating Interpretable Latent Spaces with Gradient Routing — LessWrong
Published on December 14, 2024 4:00 AM GMTOver the past few months, I helped develop Gradient...
-
Escape Plan: Brain Preservation ("Cryonics" sort of), Digitization, Metaverse, Off-Planet Hardware, Backups — LessWrong
Published on December 11, 2024 6:05 PM GMTMy thoughts on this are evolving, so I apologize...
-
-
A shortcoming of concrete demonstrations as AGI risk advocacy — LessWrong
Published on December 11, 2024 4:48 PM GMTGiven any particular concrete demonstration of an AI algorithm...
-
Why Isn't Tesla Level 3? — LessWrong
Published on December 11, 2024 2:50 PM GMT Many people who've used Tesla's "Full Self Driving"...
-
Investing in Robust Safety Mechanisms is critical for reducing Systemic Risks — LessWrong
Published on December 11, 2024 1:37 PM GMTThis short paper was written quickly, within a single...
-
Post-Quantum Investing: Dump Crypto for Index Funds and Real Estate? — LessWrong
Published on December 11, 2024 11:59 AM GMTI'll keep this short.Google’s Willow quantum chip significantly outpaces...
-
Low-effort review of "AI For Humanity" — LessWrong
Published on December 11, 2024 9:54 AM GMTStumbled across a book in the new section of...
-
SAEBench: A Comprehensive Benchmark for Sparse Autoencoders — LessWrong
Published on December 11, 2024 6:30 AM GMTAdam Karvonen*, Can Rager*, Johnny Lin*, Curt Tigges*, Joseph...
-
Zombies! Substance Dualist Zombies? — LessWrong
Published on December 11, 2024 6:10 AM GMTIntroductionIn the classical Zombies! Zombies? post, Eliezer has thoroughly...
-
Why empiricists should believe in AI risk — LessWrong
Published on December 11, 2024 3:51 AM GMTEmpiricists are people who believe empirical information (from experiments...
-
Computational functionalism probably can't explain phenomenal consciousness — LessWrong
Published on December 10, 2024 5:11 PM GMTI’ve updated quite hard against computational functionalism (CF) recently (as...
-
Agender/Unemotional Rationality for Human Competition with ASI — LessWrong
Published on December 10, 2024 5:08 PM GMTWe know that females have two X chromosomes. The...
-
o1 Turns Pro — LessWrong
Published on December 10, 2024 5:00 PM GMTSo, how about OpenAI’s o1 and o1 Pro? Sam...
-
Most Minds are Irrational — LessWrong
Published on December 10, 2024 9:36 AM GMTEpistemic status: This is a step towards formalizing some...
-
Base Process — LessWrong
Published on December 10, 2024 6:35 AM GMTIt seems unlikely that a logical information level, such...
-
EC2 Scripts — LessWrong
Published on December 10, 2024 3:00 AM GMT I do a lot of work on ec2...
-
My Mental Model of AI Creativity – Creativity Kiki — LessWrong
Published on December 9, 2024 10:24 PM GMTI went to some lectures on the future of...
-
o1: A Technical Primer — LessWrong
Published on December 9, 2024 7:09 PM GMTTL;DR: In September 2024, OpenAI released o1, its first...
-
Correct my H5N1 research ($reward) — LessWrong
Published on December 9, 2024 7:07 PM GMTThe following is an incorrect and incomplete post about...