Încărcări
Tsitsiklis, John N. - Roy, Benjamin - Feature-Based Methods For Large Scale Dynamic Programming (1996) (10.1007 - bf00114724) - Libgen - Li 0% au considerat acest document utilLearning To Act Using Real-Time Dynamic Programming 0% au considerat acest document utilOptimally Solving Markov Decision Processes Alagoz Ayvaci Linderoth 0% au considerat acest document utilNIPS 1999 Policy Gradient Methods For Reinforcement Learning With Function Approximation Paper 0% au considerat acest document utilFeature-Based Aggregation and Deep Reinforcement Learning 0% au considerat acest document utilRésolution D'un Programme Lin ́eaire Par L'algorithme Du Simplexe 0% au considerat acest document utilRMDP - DivideConquer Methods - Metha - 2015 0% au considerat acest document utilAn Empirical Study of Policy Convergence in Markov Decision Process Value Iteration Zobel 2005 0% au considerat acest document utilAn Adaptive State Aggregation Algorithm For Markov Decision Processes 0% au considerat acest document util