~www_lesswrong_com | Bookmarks (687)
-
A Poem Is All You Need: Jailbreaking ChatGPT, Meta & More — LessWrong
Published on October 29, 2024 12:41 PM GMTThis project report was created in September 2024 as...
-
Searching for phenomenal consciousness in LLMs: Perceptual reality monitoring and introspective confidence — LessWrong
Published on October 29, 2024 12:16 PM GMTTo update our credence on whether or not LLMs...
-
AI #87: Staying in Character — LessWrong
Published on October 29, 2024 7:10 AM GMTThe big news of the week was the release...
-
A path to human autonomy — LessWrong
Published on October 29, 2024 3:02 AM GMT"Each one of us, and also us as the...
-
D&D.Sci Coliseum: Arena of Data Evaluation and Ruleset — LessWrong
Published on October 29, 2024 1:21 AM GMTThis is a follow-up to last week's D&D.Sci scenario:...
-
Gwern: Why So Few Matt Levines? — LessWrong
Published on October 29, 2024 1:07 AM GMTMatt Levine is the most well-known newslettrist (“Money Stuff”)...
-
Hiring a writer to co-author with me (Spencer Greenberg for ClearerThinking.org) — LessWrong
Published on October 27, 2024 5:34 PM GMTDiscuss
-
Dario Amodei's "Machines of Loving Grace" sound incredibly dangerous, for Humans — LessWrong
Published on October 27, 2024 5:05 AM GMTWhat Dario lays out as a "best-case scenario" in...
-
Interview with Bill O’Rourke - Russian Corruption, Putin, Applied Ethics, and More — LessWrong
Published on October 27, 2024 5:11 PM GMTThis is cross-posted from my blog and I interviewed...
-
On Shifgrethor — LessWrong
Published on October 27, 2024 3:30 PM GMTA small number of terms are elevated from the...
-
The hostile telepaths problem — LessWrong
Published on October 27, 2024 3:26 PM GMTEpistemic status: model-building based on observation, with a few...
-
What are some good ways to form opinions on controversial subjects in the current and upcoming era? — LessWrong
Published on October 27, 2024 2:33 PM GMTTake a random political issue with two sides A...
-
Video lectures on the learning-theoretic agenda — LessWrong
Published on October 27, 2024 12:01 PM GMTThis is a YouTube playlist of recorded lectures on...
-
Electrostatic Airships? — LessWrong
Published on October 27, 2024 4:32 AM GMTAirships are pretty dang cool. Airplanes need a continuous...
-
A suite of Vision Sparse Autoencoders — LessWrong
Published on October 27, 2024 4:05 AM GMTCLIP-Scope?Inspired by Gemma-Scope We trained 8 Sparse Autoencoders each...
-
Ways to think about alignment — LessWrong
Published on October 27, 2024 1:40 AM GMTI’m listing some “ways to think about alignment”. I’m...
-
Is there a CFAR handbook audio option? — LessWrong
Published on October 26, 2024 5:08 PM GMTI've gotten spoiled by AI readings, and curious if...
-
A superficially plausible promising alternate Earth without lockstep — LessWrong
Published on October 26, 2024 4:04 PM GMT[ Context re dath ilan:- [Keltham reflects on the...
-
Why is there Nothing rather than Something? — LessWrong
Published on October 26, 2024 12:37 PM GMT"Close the darn window! You know it gives me...
-
The Summoned Heroine's Prediction Markets Keep Providing Financial Services To The Demon King! — LessWrong
Published on October 26, 2024 12:34 PM GMTThe Summoned Heroine and the Demon KingThe Summoned Heroine...
-
AI Safety Camp 10 — LessWrong
Published on October 26, 2024 11:08 AM GMTWe are pleased to announce that the 10th version...
-
Arithmetic Models: Better Than You Think — LessWrong
Published on October 26, 2024 9:42 AM GMTLessWrong user dynomight explains how arithmetic is an underrated...
-
Is the Power Grid Sustainable? — LessWrong
Published on October 26, 2024 2:30 AM GMT When I was growing up most families in...
-
A Case for Conscious Significance rather than Free Will. — LessWrong
Published on October 25, 2024 11:20 PM GMTThe following is born out of a frustration with...