mako yass — LessWrong

LESSWRONG
LW

I agree we shouldn't let them interact with other models, but I think storing the data in a way that's unlikely to leak is basically trivial. Also, storing older models doesn't at all increase the security threats that already come with being the kind of people who are actively developing powerful models.

MakoYass's Shortform

mako yass4d20

I ought to know the base rate of his team deciding yes or no to ideas he got from podcast conversations but I don't.

But it also sounds like they're kind of already doing it for the reasons I suggested, like yes, they were doing knowledgebase consistency first and then Sachs suggested that an encyclopedia naturally falls out of that. I'd expect Grok to be doing RAG before making these data edits, so if the thing it's retrieving from is also something it's curating, organizing and possibly editing, that's the thing.

MakoYass's Shortform

mako yass6d20

States will restrict government use of models they don't trust. Government contracts are pretty lucrative.

The public, or at least part of it, may also prefer to use models that are consistent in their positions, as long as they can explain their positions well enough (and they're very good at doing that). I guess politicians are counterevidence against this, but it's much harder for a chat assistant/discourse participant to get away with being vague. People already get annoyed when politicians are vague; for someone you're paying to give you information, the demand for taking a stance on the issues is going to be greater.

But I guess for the most part it won't be driven by pressure, it'll be driven by an internal need to debug and understand the system's knowledge rumination processes. The question is not so much whether they'll build it but whether they'll make it public. They probably will: it's cheap to do, it'll win them some customers, and it's hard to hide any of it anyway.

koanchuk's Shortform

mako yass6d00

This might be a problem if it were possible to build a (pathologically) cautious all-powerful bureaucracy that will forbid the deployment of any AGI that's not formally verifiable, but it doesn't seem like that's going to happen. Instead, the situation is about accepting that AGI will be deployed and working to make it safer, probably, than it otherwise would have been.

jbkjr's Shortform

mako yass6d20

A web standard for micropayments to cover hosting costs, so that AI companies don't have to be rate-limited, is probably the correct solution.

I'm not sure how much it would cost AI companies if they had to compensate the internet for the obscene amount of traffic they generate. It's probably a large number, but maybe not a large proportion of training costs.

MakoYass's Shortform

mako yass6d60

Grokipedia is more interesting than it seems imo, because there's this very sensible step that AI companies are going to have to take at some point: having their AI maintain its own knowledgebase, source its own evidence/training data, reflect on its beliefs and self-correct, hammer out inconsistencies, and there's going to be a lot of pressure to make this set of beliefs legible and accountable to the safety team or to states or to the general public. And if they did make it legible to the general public (they probably should?) then all of this is pretty much exactly equivalent to the activity of maintaining a free online encyclopedia.

Is this how they're thinking about it behind the scenes? It probably is! They're an AI company! They spent like half of Grok 4's training compute on post-training, they know how important rumination or self-guided learning is.

Open Thread Autumn 2025

mako yass9d100

is there anywhere on the site where we can discuss/brainstorm ideas?

the quick takes section or open threads are both fine for requesting comment on drafts.

Open Thread Autumn 2025

mako yass9d22

Some counterfactual questions are unanswerable, because they propose worlds that are self-contradictory or just very hard to reason about.

My account of free will is just uncertainty about one's own future decision output, so imagining the average natural world where we don't have that is very difficult. (There may be other accounts of free will, but they seem very confused.)

Wei Dai's Shortform

mako yass14d20

That [welfare] fully boils down to whether the experience includes a preference to be dead (or to have not been born).

Possible failure case: There's a hero living an awful life, choosing to remain alive in order to lessen the awfulness of a lot of other awful lives that can't be ended. Everyone in this scenario prefers death, even the hero would prefer omnicide, but since that's not possible, the hero chooses to live. The hero may say "I had no choice but to persist," but this isn't literally true.

Ah. No. The hero would prefer to be dead, all else being equal, but that's not possible: the hero wouldn't prefer to be dead if it entailed that the hero's work wouldn't be done, and it would.

"would prefer to be replaced by a p-zombie" might be a better definition x]

Wei Dai's Shortform

mako yass14d20

Ah, I think my definition applies to lives in totality. I don't think you can measure the quality of a life by summing the quality of its moments, for humans, at least. Sometimes things that happen towards the end give the whole of it a different meaning. You can't tell by looking at a section of it.

Hedonists are always like "well the satisfaction of things coming together in the end was just so immensely pleasurable that it outweighed all of the suffering you went through along the way" and like, I'm looking at the satisfaction, and I remember the suffering, and no it isn't, but it was still all worth it (and if I'd known it would go this way perhaps I would have found the labor easier.)
