Hannes Thurnherr

Trying to use NLAs to find out how Qwen 2.5 7B does multiplication

Neural language autoencoders were just introduced by Anthropic. In a fascinating paper, they showed that you can take the residual stream activations of a language model and then train two instantiations of that same model (an encoder and a decoder) to translate those activations into a natural language verbalisation of...

May 1623

Hannes Thurnherr

Hannes Thurnherr

Trying to use NLAs to find out how Qwen 2.5 7B does multiplication

Is training data going to be diluted by AI-generated content?

Sentience in Silicon: The Challenges of AI Consciousness

Decompiling Tracr Transformers - An interpretability experiment

Hannes Thurnherr

Trying to use NLAs to find out how Qwen 2.5 7B does multiplication

Is training data going to be diluted by AI-generated content?

Sentience in Silicon: The Challenges of AI Consciousness

Decompiling Tracr Transformers - An interpretability experiment

Trying to use NLAs to find out how Qwen 2.5 7B does multiplication

Decompiling Tracr Transformers - An interpretability experiment

Sentience in Silicon: The Challenges of AI Consciousness

Is training data going to be diluted by AI-generated content?