You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.
This is on my list of things to look at when I have a spare minute
If you're going to run a RL algorithm and interpret the results as a policy recommendation, then I'm afraid there's unfortunately no choice but to listen to Eliezer Yudkowsky and specify the loss function really really carefully