LESSWRONG
Reinforcement Learning using Layered Morphology (RLLM)

Dec 03, 2023 by MiguelDev
Intergenerational Knowledge Transfer (IKT) (MiguelDev, 1y; karma 6, comments 0)
RLLMv10 experiment (MiguelDev, 1y; karma 5, comments 0)
A T-o-M test: 'popcorn' or 'chocolate' (MiguelDev, 1y; karma 20, comments 13)
Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory? (MiguelDev, 1y; karma 7, comments 2)
Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B) (MiguelDev, 1y; karma 4, comments 0)
GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks (MiguelDev, 1y; karma 16, comments 4)
Research Log, RLLMv2: Phi-1.5, GPT2XL and Falcon-RW-1B as paperclip maximizers (MiguelDev, 1y; karma 6, comments 0)
Reinforcement Learning using Layered Morphology (RLLM) (MiguelDev, 1y; karma 7, comments 0)
An examination of GPT-2's boring yet effective glitch (MiguelDev, 1y; karma 5, comments 3)