Documente Academic
Documente Profesional
Documente Cultură
print-pdf#/
PROBABILISTIC REASONING
AND
DECISION MAKING
ECE 493 T25
Instructor: Mark Crowley
Date: May 6, 2019
Room: E7 4043
localhost:8000/slidesT493/intro.html?print-pdf#/ 1/51
1
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
ABOUT ME
Name: Mark Crowley (crohw-lee)
Pronouns: He/Him
Degree: PhD University of British Columbia 2011 in Computer Science
What do you call me?:
Whatever you feel comfortable with
Prof. Crowley, Prof. Mark, Prof, Mark, sir? -- all ne
localhost:8000/slidesT493/intro.html?print-pdf#/ 2/51
2
5/10/2019
PhD UBC 2011 in Computer Science localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 5/51
4
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
COURSE LOGISTICS
Lecture time : Mondays/Fridays 2:30pm-3:50pm in E7 4043
Tutorials : Wednesdays 5:30pm-6:30pm in E7 4043
Teaching Assistant(s):
Sriram Ganapathi Subramanian s2ganapa@uwaterloo.ca
...someone else (maybe more than one)
Of ce Hours:
Crowley (E5 4114) :Thursdays 4-5pm or by appointment
Sriram (TBD) : Tuesdays 4-5pm
localhost:8000/slidesT493/intro.html?print-pdf#/ 6/51
5
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
In this course we focus from the ground up on the
concepts and skills needed to build systems that can
reason, learn and make decisions in the presence of
uncertainty or using probabilities.
localhost:8000/slidesT493/intro.html?print-pdf#/ 9/51
6
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 19/51
7.2
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 21/51
9
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 22/51
10
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
MORE MOTIVATION
Interpretability
Small Data
Decision Making
Exciting AI Advances : Alpha Go
localhost:8000/slidesT493/intro.html?print-pdf#/ 23/51
11 . 1
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
INTERPRETABILITY
You may want to build probabilistic models with meaning and bounds.
You may want to model causality.
You may want to learn structure of conditional and causal relationships
from data
If you want to do any of these things, you need to know how to model
those relationships. There is a rich set of tools for doing this which can't be
replaced with DNNs.
localhost:8000/slidesT493/intro.html?print-pdf#/ 24/51
11 . 2
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
SMALL DATA
A lot of modern machine learning methods require massive amounts of
training data to work well
What if that's not available for your problem? What do you do?
localhost:8000/slidesT493/intro.html?print-pdf#/ 25/51
11 . 3
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
DECISION MAKING
humans need good models to make decisions
well machines also need good models to make good decisions
so if you can't just trust a Neural Network in your domain then you
need to ability to build a probabilistic reasoning system
we are going to learn how to do that
You also want systems that learn how to make decisions
robotics
autonomous driving
advertisement targeting
game playing
localhost:8000/slidesT493/intro.html?print-pdf#/ 26/51
12
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
ALPHA GO
AI has a history of competitions of Human vs Machine : Backgammon,
Chess (IBM Deep Blue), Poker
in March, 2016 Google's AlphaGo Documentary (with English subtitles) (2017)
what happened?
why was it a big deal
how can you build something like that?
localhost:8000/slidesT493/intro.html?print-pdf#/ 28/51
13
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 29/51
14
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 31/51
16
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 32/51
17
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 33/51
5/10/2019
Basics of Neural Networks (training, back-propagation, gradient
localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 34/51
18 . 1
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
Expectation :
Bellman equations :
Policy gradient :
localhost:8000/slidesT493/intro.html?print-pdf#/ 40/51
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
LEARNING OBJECTIVES
By the end of this course I hope you will be able to :
1. Explain and apply the basic methods of Bayesian modelling including
inference to a given problem of moderate complexity.
2. Explain and apply the basic methods of Bayesian Optimization to
speci c problems.
3. Explain, design and implement Reinforcement Learning algorithms for
given problem descriptions
localhost:8000/slidesT493/intro.html?print-pdf#/ 42/51
20
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
TEACHING STYLE
I will try to enable your learning.
I will not transmit truth at you where your job is to absorb it.
You need to work at it, you need to nd your way to understand the
concepts, and how they are useful to you.
This course provides the space for you to spend time on these topics:
but you should go beyond the material I present
there are lots of materials online about these topics, devour them
localhost:8000/slidesT493/intro.html?print-pdf#/ 43/51
21
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
COURSE REFERENCES
Course Notes : on the web, updated throughout term
[Ermon2019] - First half of notes are based on Stanford CS 228
(https://ermongroup.github.io/cs228-notes/) which goes even more
into details on PGMs.
Other course references (speci c sections will be listed as needed)
[Cam Davidson 2018] - Bayesian Methods for Hackers - Probabilistic
Programming textbook as set of python notebooks.
http://www.cse.chalmers.se/~chrdimi/downloads/book.pdf
[Dimitrakakis2019] - Decision Making Under Uncertainty and
localhost:8000/slidesT493/intro.html?print-pdf#/ 44/51
5/10/2019
Reinforcement Learning localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 45/51
22
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
EVALUATION
localhost:8000/slidesT493/intro.html?print-pdf#/ 46/51
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
GRADE BREAKDOWN
PROBABILISTIC MODELLING
Assignment 1: 7.5% - Probability Theory, Bayesian estimation - writing a
document answering questions, some calculations - alone - probably a
python notebook or matlab
Assignment 2: 7.5%- Graphical Models and Probabilistic Programming -
alone - probably a python notebook or matlab
Midterm: 30% (June 21?)
REINFORCEMENT LEARNING
localhost:8000/slidesT493/intro.html?print-pdf#/ 48/51
5/10/2019
Assignment 3: 15% - larger implementations on standard RL problems,
localhost:8000/slidesT493/intro.html?print-pdf#/
localhost:8000/slidesT493/intro.html?print-pdf#/ 49/51
24
5/10/2019 localhost:8000/slidesT493/intro.html?print-pdf#/
WRAP-UP
Tutorial : Wednesday 5:30pm (same room) - Sriram will do a Python and
Jupyter notebook introduction
Next Class : ...friday... probability review
localhost:8000/slidesT493/intro.html?print-pdf#/ 51/51
26