Templarrr — LessWrong

LESSWRONG
is fundraising!
LW

You don’t actually get to do that. Bayes Rule does not allow one to not update on evidence. Tons of things that happened between 2009 and today should have changed Legg’s estimates, in various directions, including the Transformer paper, and also including ‘nothing important happened today.’

Not necessary. If we're doing batched updates (and noone updates on every single minute detail) and if the events happening in real world align with the timeline "50% of AGI by 2028" then you just update 50% -> 50% every time. Which is pretty much my interpretation what Shane Legg meant in the first place - "what I see happening in the world is exactly what would've happened in the world that has 50% chance of getting AGI by 2028".

ChatGPT 5.1 Codex Max

Templarrr1mo40

looking closer to linear progress

There is no "linear" progress on the chart, 2 reference lines are "exponential" and "superexponential". The Y axis is logarithmic.

AI #132 Part 1: Improved AI Detection

Templarrr4mo1-2

least by my eyes even when they have relatively good taste they all reliably have terrible taste and even the samples people say are good are not good

We can get a lot here if we remember that a lot of "good writing" is centered around "not repeating itself" in different forms (words/phrases/structures etc) and current models are absolutely terrible in that. IF we can add temporary negative weights to the terms that were already used in answer that would decrease to zero with time, we can incentivise the LLMs to utilize wider variety of language.

AI #108: Straight Line on a Graph

Templarrr9mo10

engineer, honestly

First I thought this was hilarious, as in "we really just want an engineer FFS", but then I checked.

Engineer, honesTY. As in "engineer to research and improve models honesty".

Fun With GPT-4o Image Generation

Templarrr9mo00

Fun safety hiccup - the image generator is very persistent in not allowing to draw a hand that touches the blade of the sword, regardless how safe the context is. The hand can hover over it, be close to, touch the guard, but not the blade. I barely made it able to touch a blade by invoking the Mordhau and medieval fencing manuals, and even then it was just one hand on the blade, while it should've been both.

No trouble making it work with a wooden toy sword though, but that defeated the entire point of the picture.

Monthly Roundup #28: March 2025

Templarrr9mo21

Would this even be legal in Germany? No wonder Europe is falling behind.

Case study "how to make your post much worse in a single sentence".

There's literally nothing they describe that requires to do active face recognition (the only part that could be a problem in Europe).
Most of the office spaces use personalized electronic key cards.
Office systems KNOW who just entered.

Solving non-existing problem by harder-then-necessary and illegal-in-some-places way can be fun, but isn't as much of a dunk on others as author believes it to be. Without the last part it was fun experiment of a fellow tech person, with it ...

AI #100: Meet the New Boss

Templarrr11mo10

Which means, in turn, that you must (for that to make any sense) be using the AI in its non-aligned state to align itself and solve all those other problems

Strongly disagree on this.

The text doesn't imply this at all. "While doing it" doesn't mean you will be using AI, it just means that during the development your team uncovers a lot of corner cases and knowledge and skills needed that weren't available to them before they started, which is how most of the engineering projects are done.

You may have general plan, but it is expected that you will come up with the details as your knowledge of the area extends.

Monthly Roundup #26: January 2025

Templarrr1y10

Pointless busywork is bad.

100%. The problem usually is hidden in people mixing "I don't (understand/agree with) the point of something" with "something is pointless".

AI #93: Happy Tuesday

Templarrr1y10

what the median essay, story, or response to the assignment will look like so they can avoid and transcend it all

Obligatory joke about how terrible our education is, that half of the scores are below median!

AI #89: Trump Card

Templarrr1y10

they’re 99% sure are AI-generated, but the current rules mean they can’t penalise them.
The issue is proving it.

That is very much not the issue. The issue is that academy spent last few hundred years to make sure papers are written in the most inhuman way possible. No human being ever talks like whitepapers are written. The "we can't distinguish if this was written by a machine or human that is really good at pretending being one" can't be a problem if it was heavily encouraged for centuries. Also fun reverse-Turing test situation.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

Posts

Wikitag Contributions

Comments

Posts

Wikitag Contributions

Comments