VTU Online Courses

Bandit Algorithm (Online Machine Learning)

@VTU COE

0.0

(0) 5 Students

Download Brochure

What you will learn

Online Machine Learning

Online Learnability

In many scenarios, one faces uncertain environments where a-priori the best action to play is unknown. How to obtain best possible reward/utility in such scenarios. One natural way is to first explore the environment and to identify the `bestÃ¢â‚¬â„¢ actions and exploit them. However, this give raise to an exploration vs exploitation dilemma, where on hand hand we need to do sufficient explorations to identify the best action so that we are confident about its optimality, and on the other hand, best actions need to exploited more number of times to obtain higher reward. In this course we will study many bandit algorithms that balance exploration and exploitation well in various random environment to accumulate good rewards over the duration of play. Bandit algorithms find applications in online advertising, recommendation systems, auctions, routing, e-commerce or in any filed online scenarios where information can be gather in an increment fashion.

Lecture 1: Introduction to Online Learning -I

Preview 38:19

Lecture 2: Introduction to Online Learning -I

Preview 25:28

Lecture 3: Basics of Statistical Learning

Preview 31:36

Lecture 4: Empirical risk minimization

Preview 39:16

Lecture 5: Consistency Halving algorithm

Preview 40:06

Lecture 6: Online Learnability

Preview 33:04

Lecture 7: Standard Optimal Algorithm

Preview 36:42

Lecture 8: Classification in unrealizability case

Preview 35:17

Lecture 9: Covers Impossibility Result

Preview 34:58

Lecture 10: Weighted Majority

Preview 39:57

Lecture 11: Proof Weighted Majority

Preview 38:36

Lecture 12: Full Information vs Bandit Setting

Preview 28:05

Lecture 13: Adversarial Bandit Setting

Preview 40:26

Lecture 14: Exponential Weights for Exploration and Exploitation Algorithm

Preview 42:46

Lecture 15: Regret Bound of Exp3

Preview 28:36

Lecture 16: Regret Bound of Exp3(Contd.)

Preview 42:34

Lecture 17: Exp3.P and Exp3.IX

Preview 42:11

Lecture 18: Online Convex Optimisation

Preview 39:06

Lecture 19: Follow the Leader (FTL) Algorithm

Preview 29:36

Lecture 20: Follow the Regularized Leader

Preview 32:33

Lecture 21: Online Gradient Descent

Preview 40:24

Lecture 22: Strongly Convex Function

Preview 33:36

Lecture 23: FoReL with Strongly Convex Regulariser

Preview 39:03

Lecture 24: FoReL with Strongly Convex Regulariser (Contd.)

Preview 27:27

Lecture 25: Euclidean and Entropy Regularizer

Preview 39:03

Lecture 26: Introduction to Stochastic Bandits

Preview 41:48

Lecture 27: Concentration Inequalities

Preview 35:19

Lecture 28: Subgaussian Random Variable

Preview 41:23

Lecture 29: Regret Definition and Regret Decomposition

Preview 35:53

Lecture 30: Explore and Commit (ETC) Algorithm

Preview 43:49

Lecture 31: Regret Analysis and ETC

Preview 32:21

Lecture 32: Optimism in the Face of Uncertainty

Preview 46:45

Lecture 33: Upper Confidence Bound Algorithm

Preview 31:58

Lecture 34 : Regret Analysis of UCB

Preview 34:57

Lecture 35 : Problem Dependent and Independent Bounds of UCB

Preview 30:41

Lecture 36 : KL-UCB Algorithm

Preview 23:01

Lecture 37 : Thompson Sampling - Brief Discussion

Preview 31:20

Lecture 38 : Proof Idea of Lower Bounds - 1

Preview 34:10

Lecture 39 : Proof Idea of Lower Bounds - 2

Preview 36:49

Lecture 40 : Proof of Lower Bound-1

Preview 30:28

Lecture 41 : Proof of Lower Bound-2

Preview 32:58

Lecture 42 : Stochastic Contextual Bandits

Preview 45:44

Lecture 43 : Introduction to Stochastic Linear Bandits

Preview 39:05

Lecture 44 : Stochastic Linear Bandits

Preview 30:34

Lecture 45 : Regret Analysis of SLB-I

Preview 48:43

Lecture 46 : Regret Analysis of SLB - II

Preview 31:19

Lecture 47 : Regret Analysis of SLB-III

Preview 61:20

Lecture 48 : Construction of Confidence Ellipsoid - I

Preview 39:04

Lecture 49 : Construction of Confidence Ellipsoids - II

Preview 40:58

Lecture 50 : Adversarial Contextual Bandits - I

Preview 26:56

Lecture 51 : Adversarial Contextual Bandits II

Preview 44:33

Lecture 52 : Exp4 Algorithm

Preview 31:22

Lecture 53 : Regret of Exp4

Preview 22:58

Lecture 54 : Adversarial Linear Bandits

Preview 32:17

Lecture 55 : Exp3 for Adversarial Linear Bandits

Preview 41:51

Lecture 56 : Introduction to Pure Exploration and its lower bounds

Preview 40:10

Lecture 57 : Uniform Exploration

Preview 23:04

Lecture 58 : KL-LUCB

Preview 32:06

Lecture 59 : Lil' UCB

Preview 26:42

Lecture 60 : Lower Bound for Pure Exploration Problem

Preview 53:31

Live Session 19-10-2020

Preview 44:48

No Discussion Found

0.0

0 Reviews

Meet Your Instructor

@VTU COE

Instructor

3.3 Rating

28744 Students

909 Courses

About Instructor

VTU is one of the largest Technological Universities in India with 24 years of Tradition of excellence in Engineering & Technical Education, Research and Innovations. It came into existence in the year 1998 to cater the needs of Indian industries for trained technical manpower with practical experience and sound theoretical knowledge.

Free

Course Duration

36 h 59 m 30 s
Course Level

Intermediate
Student Enrolled

5
Language

English

This Course Includes

36 h 59 m 30 s Video Lectures
2 Quizzes
0 Assignments
0 Downloadable Resources
Full Lifetime Access
Certificate of Completion

What you will learn

Week 1

Week 2

Week 3

Week 4

Week 5

Week 6

Week 7

Week 8

Week 9

Week 10

Week 11

Week 12

Live Session

No Discussion Found

0.0

Meet Your Instructor

@VTU COE

About Instructor

Free

This Course Includes