Using Reinforcement Learning to try to control the heating of a building (district heating) — LessWrong