As of two years ago, the evidence for this was sparse. Looked like parity overall, though the pool of "supers" has improved over the last decade as more people got sampled.

There are other reasons to be down on XPT in particular.

Reply

1

Least-problematic Resource for learning RL?

Answer by technicalitiesFeb 19, 202420

I like Hasselt and Meyn (extremely friendly, possibly too friendly for you)

Reply

Dalcy's Shortform

technicalities5mo10

Maybe he dropped the "c" because it changes the "a" phoneme from æ to ɑː and gives a cleaner division in sounds: "brac-ket" pronounced together collides with "bracket" where "braa-ket" does not.

Reply

Shallow review of live agendas in alignment & safety

technicalities8mo10

It's under "IDA". It's not the name people use much anymore (see scalable oversight and recursive reward modelling and critiques) but I'll expand the acronym.

Reply

Shallow review of live agendas in alignment & safety

technicalities8mo32

The story I heard is that Lightspeed are using SFF's software and SFF jumped the gun in posting them and Lightspeed are still catching up. Definitely email.

Reply

Shallow review of live agendas in alignment & safety