Tags

Tags mark specific points in history as important.

working_jsp_mcts

a5c293e7 · refactoring · Feb 05, 2023

b61332b1 · working: training a policy improvement by mcts agent from scratch · Jan 29, 2023

Working: training an agent with policy improvement by MCTS from scratch, with no pretraining with stb3. https://wandb.ai/marcoke/neural_mcts/runs/x2z51l4r/overview?workspace=user-marcoke

working_policy_improvement

6df646b0 · neural value init · Jan 24, 2023

With this code, MCTS improves a mediocre learned policy. However, it fails to improve a policy that is already good.