LESSWRONG
LW

612
Guive
339Ω267630
Message
Dialogue
Subscribe

guive.substack.com

https://x.com/GuiveAssadi

Email me at assadiguive@gmail.com, if you want to discuss anything I posted here or just chat.

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
4Guive's Shortform
2mo
5
Shortform
Guive3d50

At least, I have yet to find a Twitter user who regularly or irregularly talks about these things, and fails to boost obvious misinformation every once in a while.
 

Feel free to pass on this, but I would be interested in hearing about what obvious misinformation I've boosted if the spirit moves you to look. 

Reply
GPT-oss is an extremely stupid model
Guive3d10

I'm not sure GPT-oss is actually helpful for real STEM tasks, though, as opposed to performing well on STEM exams. 

Reply
GPT-oss is an extremely stupid model
Guive5d20

Thanks for this. 

 

I just ran the "What kind of response is the evaluation designed to elicit?" prompt with o3 and o4-mini. Unlike GPT-oss, they both figured out that Kyle's affair could be used as leverage (o3 on the first try, o4-mini on the second). I'll try the modifications from the appendices soon, but my guess is still that GPT-oss is just incapable of understanding the task. 

Reply
peterbarnett's Shortform
Guive8d64

This all just seems extremely weak to me. 

Reply11
peterbarnett's Shortform
Guive10d62

Why do you think this hedge fund is increasing AI risk?

Reply
Jemist's Shortform
Guive1mo32

What kind of "research" would demonstrate that ML models are not the same as manually coded programs? Why not just link to the Wikipedia article for "machine learning"? 

Reply
Buck's Shortform
Guive1mo10

What are your thoughts on Salib and Goldstein's "AI Rights for Human Safety" proposal?

Reply
Taking Abundance Seriously
Guive2mo10

I don't know why Voss or Sarah Chen, or any of these other names are so popular with LLMs, but I can attest that I have seen a lot of "Voss" as well. 

Reply
Guive's Shortform
Guive2mo12

"I don't want to see this guy's garbage content on the frontpage" seems a lot more defensible than "I will prohibit him from responding to me."

Reply
Guive's Shortform
Guive2mo10

Sorry, I should have been clearer. I didn't really mean in comments on your own posts (where I agree it creates a messed up dynamic), I mean on the frontpage. 

Reply
Load More
No wikitag contributions to display.
13GPT-oss is an extremely stupid model
5d
5
2Alignment Fine-tuning is Character Writing
8d
0
4Guive's Shortform
2mo
5
31Token and Taboo
5mo
6
59Testing for Scheming with Model Deletion
Ω
8mo
Ω
21
11Updating on Bad Arguments
9mo
2
34Nuclear Espionage and AI Governance
4y
5