Merge branch 'implement-reinforcement-learning' into 'master'

Implement reinforcement learning

See merge request !14
4 jobs for rerun-master-pipeline in 3 minutes and 55 seconds (queued for 3 seconds)
Status Name Job ID Coverage
  Windows
passed masterJobWindows #357710
Windows10

00:01:41

passed masterJobWindows #352749
Windows10

00:02:01

 
  Linux
passed BranchJobLinux #352750

00:02:13

passed PythonWrapperIntegrationTest #352751

00:02:12