Skip to content

manasv09/CS747-FILA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CS747-FILA

Author: Manas Vashistha


  • Assignment for the course CS747 Foundations of Intelligent and Learning Agents under Prof Shivaram Kalyanakrishnan.

Assignment 1

Estimating average regrets for Multiarmed Bandit Instances using $\epsilon$-greedy, UCB, KL-UCB and Thompson Sampling.

Assignment 2

  • Computing the optimal value functions for MDPs using Value iteration, Linear Programming and Policy improvement.
  • Formulating a maze as an mdp to find the shortest path from a starting state to an end state.

Assignment 3

Windy Gridworld task.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published