What individuals or organizations are actively working on the "marketing" of AI alignment, particularly doing work such as: * Establishing AI alignment as a recognized and respected academic field. * Building the infrastructure to make alignment research more accessible and attractive to traditional researchers and institutions. * Creating resources for...
In the earlier days of building AI companions, I encountered a curious problem. Back then, I used models like Google’s T5-11B for conversational agents. Occasionally, the AI would say strange or outright impossible things, like suggesting, “Would you like to meet and go bowling?” or claiming, “Yes, I know your...
I am actively looking for a tutor/advisor with expertise in AI x-risk, with the primary goal of collaboratively determining the most effective ways I can contribute to reducing AI existential risks (X-risk). Tutoring Goals I suspect that I misunderstand key components of the mental models that lead some highly rational...
I believe there are people with far greater knowledge than me that can point out where I am wrong. Cause I do believe my reasoning is wrong, but I can not see why it would be highly unfeasible to train a sub-AGI intelligent AI that most likely will be aligned...
I recently found myself in a spirited debate with a friend about whether large language models (LLMs) like GPT-4 are mere stochastic parrots or if they can genuinely engage in deeper reasoning. We both presented a range of technical arguments and genuinely considered each other’s points. Despite our efforts, we...
Summary by OpenAI: We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2. Link: https://openai.com/research/language-models-can-explain-neurons-in-language-models Please share your thoughts in the the comments!
Does anyone know any good resources for posting in social media to raise awareness about AI X-risk? If not, I might do a writeup myself, so feel free to comment any advice you have.