@johnowhitaker
OK I had to record a quick video and share a dialog showing my first few tests: https://t.co/800pSOmNmD Dialog: https://t.co/Vev8lHDndn In the video, I show how easy it can be to train a model on a custom task with your own reward function. LMK what I should try next :)