Monthly Archives: October 2015

Testing framework for multi-armed bandits

A Python testing framework for multi-armed bandits, with implementations of several policies. Instead of an argmax function, the policies use choice from Python's random module to get the fair selection required when several arms have the … Continue reading
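The excerpt is cut off, so the following is only a minimal sketch of the idea it describes, not the framework's actual code. It assumes the ties in question are among arms that share the highest estimated value. The function name random_argmax and the rng parameter are hypothetical and chosen for illustration.

```python
import random


def random_argmax(values, rng=random):
    """Return the index of a maximal entry, picking uniformly among ties.

    Hypothetical helper: a plain argmax always returns the first maximal
    index, which would systematically favour low-numbered arms.
    """
    best = max(values)
    # Collect every arm whose value equals the maximum.
    candidates = [i for i, v in enumerate(values) if v == best]
    # random.choice picks one candidate with equal probability.
    return rng.choice(candidates)


if __name__ == "__main__":
    estimates = [0.2, 0.5, 0.5]  # arms 1 and 2 are tied
    print(random_argmax(estimates))  # prints 1 or 2
```

With a plain argmax, arm 1 would be selected on every tie and arm 2 would never be explored in that situation; drawing from the tied candidates avoids that bias.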

Posted in General | Comments disabled