LESSWRONG

Guive

guive.substack.com

https://x.com/GuiveAssadi

Email me at assadiguive@gmail.com if you want to discuss anything I posted here or just chat.

Posts


Comments

Rauno's Shortform
Guive9h10

I think the extent to which it's possible to publish without giving away commercially sensitive information depends a lot on exactly what kind of "safety work" it is. For example, if you figured out a way to stop models from reward hacking on unit tests, it's probably to your advantage not to share that with competitors.

Critic Contributions Are Logically Irrelevant
Guive9h32

I'm not sure that's even true of leading questions. You can ask a leading question for the benefit of other readers who will see the question, understand the objection the question is implicitly raising, and then reflect on whether it's reasonable. 

Lessons from the Iraq War for AI policy
Guive7d*30

Vietnam was different because it was an intervention on behalf of South Vietnam, which was an American client state, even if the Gulf of Tonkin thing was totally fake. There was no "South Iraq" that wanted American soldiers.

Raemon's Shortform
Guive7d98

Also, I bet most people who temporarily lose their grip on reality from contact with LLMs return to a completely normal state pretty quickly. I think most such cases are LLMs helping to induce temporary hypomania rather than a permanent psychotic condition.

So You Think You've Awoken ChatGPT
Guive7d1518

This feels a bit like two completely different posts stitched together: one about how LLMs can trigger or exacerbate certain types of mental illness and another about why you shouldn't use LLMs for editing, or maybe should only use them sparingly. The primary sources about LLM-related mental illness are interesting, but I don't think they provide much support at all for the second claim.

Shortform
Guive14d30

It took me a minute to read this as an exclamatory O, rather than as "[There are] zero things I would write, were I better at writing."

TurnTrout's shortform feed
Guive15d32

Can you be more concrete about what "catching the ears of senators" means? That phrase seems like it could refer to a lot of very different things of highly disparate levels of impressiveness. 

Support for bedrock liberal principles seems to be in pretty bad shape these days
Guive19d10

Singapore is democratic.

evhub's Shortform
Guive20d10

It is not a paraphrase; the denotation of these sentences is not precisely the same. However, it is also not entirely surprising that these two phrases would evoke similar behavior from the model. 

the silk pajamas effect
Guive1mo20

Interesting post. Just so you know, there are a few stray XML tags that aren't rendering properly. 

Token and Taboo (31 karma, 3mo, 6 comments)
Testing for Scheming with Model Deletion (59 karma, 6mo, 21 comments)
Updating on Bad Arguments (11 karma, 7mo, 2 comments)
Nuclear Espionage and AI Governance (34 karma, 4y, 5 comments)