This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Reinforcement Learning using Layered Morphology (RLLM)
LW
Login
Reinforcement Learning using Layered Morphology (RLLM)
6
Intergenerational Knowledge Transfer (IKT)
MiguelDev
2mo
0
5
RLLMv10 experiment
MiguelDev
2mo
0
20
A T-o-M test: 'popcorn' or 'chocolate'
MiguelDev
2mo
13
7
Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory?
MiguelDev
3mo
2
4
Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)
MiguelDev
3mo
0
16
GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks
MiguelDev
3mo
4
6
Research Log, RLLMv2: Phi-1.5, GPT2XL and Falcon-RW-1B as paperclip maximizers
MiguelDev
4mo
0
7
Reinforcement Learning using Layered Morphology (RLLM)
MiguelDev
6mo
0
5
An examination of GPT-2's boring yet effective glitch
MiguelDev
1mo
3