~www_lesswrong_com | Bookmarks (706)
-
What are the differences between a singularity, an intelligence explosion, and a hard takeoff? — LessWrong
Published on April 3, 2025 10:37 AM GMTThis is an article in the featured articles series...
-
Emergent Misalignment and Emergent Alignment — LessWrong
Published on April 3, 2025 8:04 AM GMTIn the recent paper titled Emergent Misalignment: Narrow finetuning...
-
More Fun With GPT-4o Image Generation — LessWrong
Published on April 3, 2025 2:10 AM GMTGreetings from Costa Rica! The image fun continues. We...
-
On Pseudo-Principality: Reclaiming "Whataboutism" as a Test for Counterfeit Principles — LessWrong
Published on April 3, 2025 1:37 AM GMTCrossposted from Substack "These are my principles. If you don't...
-
FLAKE-Bench: Outsourcing Awkwardness in the Age of AI — LessWrong
Published on April 1, 2025 5:08 PM GMTIntroductionA key part of modern social dynamics is flaking...
-
Introducing Deepgeek — LessWrong
Published on April 1, 2025 4:41 PM GMTTL;DR: We present Deepgeek, a new AI language model...
-
Comments on Karma systems — LessWrong
Published on April 1, 2025 12:53 PM GMTA Karma system is a resource sharing mechanism based...
-
LessWrong has been acquired by EA — LessWrong
Published on April 1, 2025 1:09 PM GMTHey Everyone,It is with a sense of... considerable cognitive...
-
Insect Suffering Is The Biggest Issue: What To Do About It — LessWrong
Published on April 1, 2025 12:51 PM GMT1 IntroductionCrosspost from my blog. A beetle lay crushed in...
-
Organisation-Level Lock-In Risk Interventions — LessWrong
Published on April 1, 2025 12:42 PM GMTEpistemic status: my own thoughts and opinions after thinking...
-
Keltham's Lectures in Project Lawful — LessWrong
Published on April 1, 2025 10:39 AM GMT(Not an April fools' joke)If anyone wants to return...
-
Was the historical Jesus talking about evolution? (You might be surprised) — LessWrong
Published on April 1, 2025 10:32 AM GMTOut of all the research rabbit holes I've ever...
-
Follow me on TikTok — LessWrong
Published on April 1, 2025 8:22 AM GMTFor more than five years, I've posted an average...
-
New Cause Area Proposal — LessWrong
Published on April 1, 2025 7:12 AM GMTEpistemic status - statistically verified. I'm writing this post to...
-
Why do many people who care about AI Safety not clearly endorse PauseAI? — LessWrong
Published on March 30, 2025 6:06 PM GMTtl;dr:From my current understanding, one of the following two...
-
Extracting proper nouns a model "knows" using entity-detection neurons. — LessWrong
Published on March 30, 2025 4:58 PM GMTIntroductionResearch on Sparse Autoencoders (SAEs) has identified "known entity"...
-
The g-Zombie Formal Argument — LessWrong
Published on March 30, 2025 1:16 PM GMTNote: I created an entry highlighting the formal attack...
-
Memory Persistence within Conversation Threads with Multimodal LLMS — LessWrong
Published on March 30, 2025 7:16 AM GMTIn neuroscience, we learned about foveated vision — our...
-
How I talk to those above me — LessWrong
Published on March 30, 2025 6:54 AM GMTNow and then, at work, we’ll have a CEO...
-
I, G(Zombie) — LessWrong
Published on March 30, 2025 1:24 AM GMTThere is no such thing as philosophy-free science; there...
-
Exercising Rationality — LessWrong
Published on March 29, 2025 7:08 PM GMTOr: Why thinking about blue tentacle arms is not...
-
Climbing the Hill of Experiments — LessWrong
Published on March 29, 2025 8:37 PM GMTA better anything can be achieved with simple tests...
-
Does the AI control agenda broadly rely on no FOOM being possible? — LessWrong
Published on March 29, 2025 7:38 PM GMTFor the purposes of FOOM, I'm defining it as...
-
Yeshua's Basilisk — LessWrong
Published on March 29, 2025 6:11 PM GMTSuppose you’re an AI researcher trying to make AIs...