CSCI 4155/6505 Machine Learning (with Robotics) 2011

◇◇◇

... Instructor: Dr. Thomas Trappenberg ... Office: Room 4216 (in Mona Campbell Building on Coburg RD) .... email: tt@cs.dal.ca .... Office hour: TBA

Course Description

This course discusses learning theories and demonstrates these strategies with robots. The topics include su- pervised learning, in particular maximum likelihood estimation in stochastic models and statistical learning theory including support vector machines, unsupervised learning which inclused generative models, expecta- tion maximization, and Bolzmann machines, and reinforcement learning including Markov decision processes and temporal difference learning. The course includes introductins to the MATLAB programming environ- ment, a refresher of basic probability theory, and the use of a robotics environment.

 

Required Texts and Software

There is no required text for the course. Course notes will be posted on the course web site. But I recomment the following additional reading:

Christopher Bishop, Pattern Recognition and Machine Learning, Springer 2006.

Ethem Alapydin, Machine Learning, MIT Press, second edition, 2011

Lecture notes by Andrew Ng: http://www.stanford.edu/class/cs229/materials.html and http://www.stanford.edu/c

Russell and Norvig, Artificial Intelligence: A modern approach, Prentice Hall

Richard Sutton and Andrew Barto, Reinforcement Learning: An Introduction, MIT press, 1998.

S. Thrun, W. Burgard, and D. Fox, Probabilistic Robotics, MIT Press 2005.

We will be using the Matlab programming environment that is provided on the workstations in the teaching lab. Note that a student license is available for around $100 in case you want to buy one for your laptop or home use.

Evaluation

CSCI 4155: 40% Assignments, 40% Tests, 20% Project

CSCI 6505: 20% Assignments, 40% Tests, 40% Project

Grades will be assigned using the letter grade scale as defined in section 17.1 of Dalhousie Academic Calendar.

 

Lecture notes: (will be regularly updated)

 

Oct 22: Latest script up to chapter on graphical models

Nov 8: Chapter 9 on unsupervised learning

Nov 22: Last two chapters on MDP and RL

 

Schedule (can change)

MDP
Date Topic Test Assignment
Sep 8 Intro, sensing&acting    
Sep 13 Basic Prob   A1, due Sep 15 before class
Sep 15 Basic Prob/Matlab   A2, due Sep 21, midnight
Sep 20 Basic Matlab/ Basic NXT    
Sep 22 Basic NXT   A3, due Sept 29 before class
Sep 27 Control / Basic NXT    
Sep 29 Control/Navigation   A4, due Oct 6 before class
Oct 4 Regression / Gradient Descent    
Oct 6 MLE T1  
Oct 11 Classification   A5
Oct 13 Adaptive Control    
Oct 18 Generative   A6, due Oct 27 bedore class
Oct 20 SVM    
Oct 25 Graphical   Projects: CSCI4155 & CSCI 6505
Oct 27 TBA (Paul or Patrick) T2  
Nov 1 Grapgical Models 2, HHM,    
Nov 3 localization, SLAM    
Nov 8 Unsupervised    
Nov 10 STUDY DAY    
Nov 15 Unsupervised    
Nov 22 MDP T3  
Nov 24 TD learning    
Nov 29 Model-based    
Dec 1 Final Project    
Dec 6 Final T4  


Resources:

Chris Bishop: ICML 2011 Keynote talk with title "Embracing Uncertainty: Applied Machine Learning Comes of Age".

TED talk by Sebastian Thrun

A first demostration program to control the tribot with Matlab: WallAvoidance.m

Some example programs to discuss control. A simple linear contoller: linearController.m and a proportianal feedabck conoller: feedbackController.m

A demonstration program of the A* search algorithm, written by Bob Sturm. A simplified matlab function based on this demonstration is AstarPathPlanner.m.

kmeansdemo.m

EMdemo.m

RBM demo from Geoff Hinton

RBMletter example (zip file)

A brief guide to writing a scientific paper

Academic Integrity & Plegarism:

(Based on the sample statement provided at http://academicintegrity.dal.ca. Written by Dr. Alex Brodsky.)

Please familiarize yourself with the university policy on Intellectual Honesty. Every suspected case will be reported.

At Dalhousie University, we respect the values of academic integrity: honesty, trust, fairness, responsibility and respect. As a student, adherence to the values of academic integrity and related policies is a requirement of being part of the academic community at Dalhousie University.

 

What does academic integrity mean?

Academic integrity means being honest in the fulfillment of your academic responsibilities thus establishing mutual trust. Fairness is essential to the interactions of the academic community and is achieved through respect for the opinions and ideas of others. Violations of intellectual honesty are offensive to the entire academic community, not just to the individual faculty member and students in whose class an offence occurs. (see Intellectual Honesty section of University Calendar)

 

How can you achieve academic integrity?

• Make sure you understand Dalhousies policies on academic integrity.

• Give appropriate credit to the sources used in your assignment such as written or oral work, com- puter codes/programs, artistic or architectural works, scientific projects, performances, web page designs, graphical representations, diagrams, videos, and images. Use RefWorks to keep track of your research and edit and format bibliographies in the citation style required by the instructor (http://www.library.dal.ca/How/RefWorks)

• Do not download the work of another from the Internet and submit it as your own.

• Do not submit work that has been completed through collaboration or previously submitted for another assignment without permission from your instructor. • Do not write an examination or test for someone else.

• Do not falsify data or lab results.

These examples should be considered only as a guide and not an exhaustive list.

 

What will happen if an allegation of an academic offence is made against you?

I am required to report a suspected offence. The full process is outlined in the Discipline flow chart, which can be found at: http://academicintegrity.dal.ca/Files/AcademicDisciplineProcess.pdf and in- cludes the following:

1. Each Faculty has an Academic Integrity Officer (AIO) who receives allegations from instructors.

2. The AIO decides whether to proceed with the allegation and you will be notified of the process.

3. If the case proceeds, you will receive an INC (incomplete) grade until the matter is resolved.

4. If you are found guilty of an academic offence, a penalty will be assigned ranging from a warning to a suspension or expulsion from the University and can include a notation on your transcript, failure of the assignment or failure of the course. All penalties are academic in nature.

 

Where can you turn for help?

• If you are ever unsure about ANYTHING, contact myself.

• The Academic Integrity website (http://academicintegrity.dal.ca) has links to policies, defini tions, online tutorials, tips on citing and paraphrasing.

• The Writing Center provides assistance with proofreading, writing styles, citations.

• Dalhousie Libraries have workshops, online tutorials, citation guides, Assignment Calculator, Ref- Works, etc.

• The Dalhousie Student Advocacy Service assists students with academic appeals and student discipline procedures.

• The Senate Office provides links to a list of Academic Integrity Officers, discipline flow chart, and Senate Discipline Committee.