LESSWRONG
LW

1079
cozyfae
19050
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No posts to display.
No wikitag contributions to display.
MakoYass's Shortform
cozyfae21h10

there's going to be a lot of pressure to make this set of beliefs legible and accountable to the safety team or to states or to the general public.

Where does this pressure come from?

Reply
On "ChatGPT Psychosis" and LLM Sycophancy
cozyfae3mo91

How does the sycophancy compare between o-series models and 4o? AFAIK only o-series have deliberative alignment applied on them.

Reply
Recent AI model progress feels mostly like bullshit
cozyfae7mo8-5

These machines will soon become the beating hearts of the society in which we live.

An alternative future: due to the high rates of failure, we don't end up deploying these machines widely in production setting, just like how autonomous driving had breakthroughs long ago but didn't end up getting widely deployed today.

Reply
We Should Prepare for a Larger Representation of Academia in AI Safety
cozyfae7mo50

How has these predictions shaken out? How does the growth rate of AI Safety researchers compare between academia & industry?

Reply
How to Make Superbabies
cozyfae8mo10

There may be additional societal and political problems afterwards. But none of those problems actually matter unless the technology works.

What do you think of the argument that "There may be additional technical problems afterwards. But none of those problems actually matter unless we have answers for societal and political problems."?

Reply