Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B) — LessWrong