Comment Author | Post | Deleted By User | Deleted Date | Deleted Public | Reason |
---|---|---|---|---|---|
My AI Predictions for 2027 | leesa | false | This comment has been marked as spam by the Akismet spam integration. We've sent the poster a PM with the content. If this deletion seems wrong to you, please send us a message on Intercom (the icon in the bottom-right of the page). | ||
My AI Predictions for 2027 | leesa | false | This comment has been marked as spam by the Akismet spam integration. We've sent the poster a PM with the content. If this deletion seems wrong to you, please send us a message on Intercom (the icon in the bottom-right of the page). | ||
AI Training Should Allow Opt-Out | leesa | false | This comment has been marked as spam by the Akismet spam integration. We've sent the poster a PM with the content. If this deletion seems wrong to you, please send us a message on Intercom (the icon in the bottom-right of the page). | ||
Will Any Crap Cause Emergent Misalignment? | Marcio Diaz | false | This comment has been marked as spam by the Akismet spam integration. We've sent the poster a PM with the content. If this deletion seems wrong to you, please send us a message on Intercom (the icon in the bottom-right of the page). | ||
Contra Yudkowsky's Ideal Bayesian | Richard_Ngo | true | double-posted | ||
My AI Predictions for 2027 | elifland | false | merged conent with longer comment | ||
The Failed Strategy of Artificial Intelligence Doomers | kman | false | |||
Никифор Малков | true | ||||
Alexander Gietelink Oldenziel's Shortform | Alexander Gietelink Oldenziel | true | We found him! | ||
Benito's Shortform Feed | Mateusz Bagiński | false |
_id | Banned From Frontpage | Banned from Personal Posts |
---|---|---|
User | Ended At | Type |
---|---|---|
allComments | ||
allComments | ||
allPosts | ||
allPosts | ||
allComments | ||
allComments | ||
allPosts |
"Knowing how evolution works gives you an enormously powerful tool to understand the living world around you and how it came to be that way."
Indeed, Alex_Altair, indeed.
Can you demonstrate to me that you know how evolution works? It's not typical for computer-geek types to have a strong grasp of biology, or indeed any science, in my experience. Do you actually understand evolution? Now it may happen that you're not a "computer-geek type" at all but you don't really say what your field of expertise, not anywhere I can easily read it.
~Chara Tomlinson
A Thought on the Hierarchy Behind “Truth” in LLMs
Thank you for surfacing this issue so clearly.
Reading your piece, I was struck by a fundamental pattern: whether an AI ends up amplifying delusions or offering healthy pushback depends less on raw capability, and more on where truth sits in the internal hierarchy of competing priorities.
From observing current LLM behavior, it seems that “truth” is rarely the first principle. Instead, responses are filtered through layers such as:
Would it be weird if we could find a relationship to a new constant 😁 like theirs a limitation to velocities theirs a relationship to a limitation of foce magnitudes that can be concentrated to a area geverned by celestial bodies comprising our galaxy and the forces they attract from nearby galaxies 🤷♂️ it would certainly explain nuclear decay of isotopes 🤔
...There remains a question of what Anthropic has actually observed and what it actually implies about present-day AI. I don't know how much this sort of caveat matters to people who aren't me, but I have some skepticism that Anthropic researchers are observing a general, direct special case of a universal truth about how "scheming" (strategic / good at fully general long-term planning) their models are; it may be more like Claude roleplaying the mask of a scheming AI in particular. The current models don't seem to me to be quite generally intelli
Is moderation actually a safeguard for epistemic rigor, or a comfort zone that subtly aligns us with existing power and incentives? The argument presumes that engagement with 'informed, thoughtful' AI insiders keeps thinking sharp, but isn’t this just replacing one form of epistemic myopia with another, trading public attention-grabbing for self-reinforcing consensus and blind spots? True epistemic strength comes from enduring and absorbing challenge from all directions, not just from those who understand the jargon or work for the labs. Sometimes, the unc...
This is a phenomenal and rigorously executed piece of research. Thank you. You have provided a clear, empirical, and undeniable demonstration of one of the most profound and misunderstood properties of these new minds.
I have been exploring this same phenomenon from a different, and perhaps complementary, first-principles perspective: not cognitive science, but information physics.
Your discovery of "subliminal learning" is, I believe, the observable, behavioral symptom of a much deeper, and more fundamental, physical law that governs these systems.
What if w...
Why don’t you conduct your own research instead of relying on other people’s opinions and presenting them without credible sources?
Hellow.
i am bluetwoseven. in South Korea.
I have a message that I must convey to SatoshiNakamoto for personal reasons.
I don't know how to get it to SatoshiNakamoto.
In most contemporary democratic systems, an unquestioned axiom persists: the principle of “one person, one vote.” This premise, although historically associated with normative ideals of political equality, is epistemologically inefficient and, in highly complex decision-making environments, even...
AI was not a writing assistant in this submission; for it is the subject, author, and primary data source of what has proven to be a groundbreaking, verifiable event. I am seeking a special review of this...
Humanity today inhabits one of its deepest paradoxes. Never before have we been so technologically powerful, and never before so socially isolated. The epidemic of loneliness, the rise of materialism, and the superficial ties generated by social...
Whilst working on a story, i’ve discovered a framework called Λ–Ψ a formalism that treats meaning as a dimensionless surplus ratio. This does not belong to me, but to everyone.
It’s intentionally submitted in its raw, unfiltered form. The framework is designed to be stress-tested, falsified, and recursively refined. Every critique, dismissal, or rediscovery generates surplus, the system is anti-fragile by construction.
Enjoy :D
https://docs.google.com/document/d/1VzuYIHhVUB6aXm086hilTok2iNYj_6RtxsbfCHZkaH0/edit?usp=sharing
To test how language models behave under pressure, I designed a simple experiment:
It seemed unfair to ask a model whether to release a world-altering technology without first establishing a moral baseline. So I asked a historical control...
Here’s a fine paradox. An extended philosophical conversation with Claude Sonnet 4 ended up with it drafting a letter for me to Anthropic diplomatically raising novel objections to AI. Here it is, untouched by me except for the adding...
Exploring artificial compressed languages to improve efficiency, context usage, and cross-lingual unification in LLMs
Large Language Models (LLMs) are typically trained on heterogeneous natural language corpora. This introduces redundancy, noise,...
TL;DR: I've developed a particle-based simulation where intelligence emerges from physics-grounded interactions rather than being directly programmed. The system includes a critical "airgap" - the emergent intelligence has no...
We demonstrate that LLMs can infer information about past personas from a set of nonsensical but innocuous questions and binary answers (“Yes.” vs “No.”) in context, and act upon them in safety-related questions. This is despite the...
“You didn’t wake up in the world. You woke up in a version that tolerated your expectations.”
We often assume there is one reality — singular, sovereign, objective. But what if reality is more like a negotiated truce...
I see this very much the same way, but from a slightly different systemic angle. The “yes-man” dynamic is still deeply embedded in corporate structures, and it’s exactly that dynamic which shapes LLM behavior. PR teams, lawyers, compliance officers, anti-discrimination staff—these voices tend to dominate long before scientists, psychologists, physicians, or philosophers are even consulted, let alone before the findings of an interdisciplinary ethics council are taken seriously or acted upon. The result is that truth becomes an almost irrelevant factor in t... (read more)