Edy Nastase — LessWrong

Automating Mechanistic Interpretability via Program Synthesis

I have been researching for a while, and it seems to me that there isn't that much progress on "automating" MI using Program Synthesis. The only source I could find is a paper from Max Tegmark's lab. However, this paper has been about for quiet a while, and not that...

Apr 17, 20251

Why are neuro-symbolic systems not considered when it comes to AI Safety?

I am really not sure of why neuro-symbolic systems are considered as alternatives to the current black-box ones? A concrete example I have found (and currently studying) is HOUDINI (https://arxiv.org/pdf/1804.00218). Essentially, it implements neural networks using higher order combinators (map, fold etc.) that were found via enumeration/genetic programming searches. When...

Apr 11, 20253

What are the main arguments against AGI?

Recently, I have been trying to reason why I belive what I belive (regarding AGI). However, it appears to me that there is not enough discussion around the arguments against AGI (more specifically AGI skeptisim). This might be of benefit, especially given that Would this be because the arguments are...

Dec 24, 20241