Mislav Jurić — LessWrong

Why AI alignment matters today

Introduction: In 2020, around the time I graduated with a master’s degree in computer science, I had a conversation with Steve Omohundro in which we discussed his Basic AI Drives paper (among other things). At that time, there existed concrete demonstrations of where AI alignment could go wrong, but they...

Oct 22, 2025 · 6

Does agency necessarily imply self-preservation instinct?

I recently read some blog posts (this, this and this) about tool AGIs and why agent AGIs are more likely. The argument that convinced me to err on the side of agent AGIs was that they are more likely to have an economic advantage over tool AGIs, as explained...

May 1, 2023 · 5

Implementing a Transformer from scratch in PyTorch - a write-up on my experience

Introduction: As discussed in posts such as this one, a good way to test your skills as a machine learning research engineer is to implement a Transformer from scratch in PyTorch. This is exactly what I did. Below I share my experience of doing so. The code I...

Apr 25, 2023 · 20