LESSWRONG
LW

4293
boazbarak
939Ω63121180
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
5boazbarak's Shortform
18d
15
No wikitag contributions to display.
boazbarak's Shortform
boazbarak6h61

Our paper on scheming with Appolo is now on the arxiv. Wrote up a twitter thread with some of my takes on it: https://threadreaderapp.com/thread/1970486320414802296.html 

Reply
Safety researchers should take a public stance
boazbarak3d252

I  disagree with a number of statements made in the post and do not support an AI development ban or pause. But  I support Leo speaking his mind about this and I think it’s important for OpenAI and other labs to have a culture where employees feel free to speak about such issues.

Reply21
boazbarak's Shortform
boazbarak6d10

Got my copy of 'If anyone builds it everyone dies" and read up to chapter 4 this morning but now have to get to work.. I might write a proper review after I finish it if I get the time. So far, my favorite parable was the professor saying that they trained an AI to be a great chess player and do anything to win, but "there was no wantingness in there, only copper and sand." 

I agree that we are training systems to achieve objectives, and the question of whether they "want" to achieve them or don't is a bit meaningless since they will defintiely act as if they do.

I find the comparisons of the training process to evolution less compelling, but to quote Fermat, a full discussion of where I see the differences between AI training and evolution would require more than a quick take...

Reply
Call for suggestions - AI safety course
boazbarak9d20

Course is now online with all lectures posted on youtube (see course webpage for links) and blog post lecture notes posted here https://www.lesswrong.com/w/cs-2881r 

Reply
boazbarak's Shortform
boazbarak9d60

A blog post on the first lecture of CS 2881r, as well as another blog on the student experiment by @Valerio Pepe , are now posted . See the CS2881r tag for all of these 

https://www.lesswrong.com/w/cs-2881r

The video for the second lecture is also now posted on YouTube 

Reply
boazbarak's Shortform
boazbarak13d20

Posted HW0 now and will post future ones also!

Reply1
boazbarak's Shortform
boazbarak13d10

Video is on youtube too 

Reply
boazbarak's Shortform
boazbarak13d10

Thanks - dropped it for now

Reply
boazbarak's Shortform
boazbarak13d*110

Posted our homework zero on the CS 2881 website, along with video and slides of last lecture and pre-reading for tomorrow's lecture. (Homework zero was required to submit to apply to the course.)

https://boazbk.github.io/mltheoryseminar/ 

Reply
boazbarak's Shortform
boazbarak17d20

Actually I did want to link to the debate between Ajeya and Narayanan. As part of working on the website I wanted to try out various AI tools since I haven't been coding outside the OpenAI codebase for a while, and I might have gone overboard :)

Reply
Load More
5boazbarak's Shortform
18d
15
16AI Safety course intro blog
2mo
0
53Call for suggestions - AI safety course
3mo
23
34Machines of Faithful Obedience
3mo
19
92Six Thoughts on AI Safety
8mo
55
51Reflections on "Making the Atomic Bomb"
2y
7
33The shape of AGI: Cartoons and back of envelope
2y
19
44Metaphors for AI, and why I don’t like them
2y
18
38Why I am not a longtermist (May 2022)
2y
19
42The (local) unit of intelligence is FLOPs
2y
7
Load More