Alexander Olivaw
-1
5
Alexander Olivaw has not written any posts yet.

Alexander Olivaw has not written any posts yet.

A few decades ago we were trapped in a house that was slowly sinking into quicksand over the course of decades. Now however, the house IS on fire.
I remember reading that SFT can undermine subsequent RL by inducing pseudo reasoning paths imitated from expert models (at least in Large Vision-Language Models ), do you think these results could be attributed to this behavior, or the results would be the same if only RL was used?
Readow AI is very good at finding similar books to the one/ones you enter.
We can communicate with ants if we wanted to, by studying and reproducing their pheromonal signaling. However, the content of that communication wouldn't be useful for any sort of beneficial trade relationship. Just like the content of our communication with an ASI wouldn't be useful for that ASI to benefit from any kind of trade with us.