IT354

Course Name:

Reinforcement Learning (IT354)

Programme:

B.Tech (AI)

Category:

Programme Specific Electives (PSE)

Credits (L-T-P):

(3-0-2) 4

Content:

Introduction to Reinforcement Learning, Markov Processes Markov Reward Processes (MRPs) Markov Decision Processes (MDPs), MDP Policies, Policy Evaluation, Policy Improvement, Policy Iteration, Value operators, Model-free learning - Q-learning, SARSA, Scaling up: RL with function approximation, RL with function approximation, Imitation learning in large spaces, Policy search, Exploration/Exploitation, Meta-Learning, Batch Reinforcement Learning, Bandit problems and online learning, Solution methods: dynamic programming, Monte Carlo learning, Temporal difference learning, Eligibility traces, Value function approximation, Models and planning, Case studies: successful examples of RL systems, Frontiers of RL research

References:

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition.

Reinforcement Learning: State-of-the-Art, Marco Wiering and Martijn van Otterlo, Eds

Artificial Intelligence: A Modern Approach, Stuart J. Russell and Peter Norvig.

Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville.

Department:

Information Technology

Contact us

Geetha V.

Head of the Department,
Department of Information Technology,

National Institute of Technology Karnataka,

Surathkal, P. O. Srinivasnagar, Mangalore - 575 025

Phone (Dept office):: +91-824-2473450

Phone (HoD Direct line): +91-824-2474040
Email: hodit [at] nitk [dot] edu [dot] in

Web Admin: Sowmya Kamath S

Connect with us

We're on Social Networks. Follow us & stay in touch.

Main Menu

IT354

Course Name:

Programme:

Category:

Credits (L-T-P):

Content:

References:

Department:

Quick Links

Contact us

Connect with us

Main Menu

IT354

Course Name:

Programme:

Category:

Credits (L-T-P):

Content:

References:

Department:

Quick Links

Contact us

Connect with us

You are here: