This class is targeted at anyone who's curious about how mathematics is used by companies like Google, FaceBook, Netflix, etc, for web search, product recommendations, computational advertising, sponsored search auctions, multimedia search, text mining, social network analysis, etc. An alternative course title that would describe 90% of the materials could well have been "The Mathematics of Data Mining, Machine Learning, and Pattern Recognition". The remaining 10% would be on miscellaneous topics like ad auctions.
We will be interested in a handful of mathematical tools for analyzing the sort of complex, high-dimensional, massive datasets that have become ubiquitous in today's world. But this is not a course about statistics. Our tools might well be derived from partial differential equations, harmonic analysis, linear algebra, graph theory, differential geometry, algebraic topology, etc, and they may or may not have obvious interpretations in terms of traditional statistics.
A prerequisite for taking this course is Math 110: Linear Algebra. Knowledge of basic probability theory and multivariate calculus would be helpful. Programming knowledge is useful though not necessary.
Group/Individual | Project Title | Due |
---|---|---|
Fei Yu | Information Retrieval From Hierarchical Data | 11/04 |
Anderthan Hsieh & Chanwoo Myung | Find Your Mate to Date: Learning Algorithm for Matching College Students | 11/04 |
Jonathan Ong & Kabir Seth | SONGrank: Recommender System Based on User Preferences Incorporating Negative Feedback | 11/06 |
Nicholas Boyd & Nicholas Preston | Using Unconventional Data Sources for Music Recommendations | 11/06 |
Nishant Bhat & Max Dama | Black Swan Crime Network Analyst | 11/06 |
Lin Lu & Yi Yuan | Statistical Analysis of the Rise of Divorce Rates | 11/09 |
David Nachum | Nearfind | 11/09 |
Matt Senate & Adam Goldberg | Stylometric Measurement of Cohesion in Multi-Authored Works | 11/09 |
Brian Tang | Simplynk | 11/09 |
Max Moacanin & Tung Phan | Mathematical Predictions of NBA Season Outcomes | 11/09 |
Date | Time | Group/Individual |
---|---|---|
10/05 Mon | 3:00–3:30 | Fei Yu |
10/05 Mon | 3:30–4:00 | Anderthan Hsieh & Chanwoo Myung |
10/07 Wed | 3:00–3:30 | Jonathan Ong & Kabir Seth |
10/07 Wed | 3:30–4:00 | Nicholas Boyd & Nicholas Preston |
10/07 Wed | 4:00–4:30 | Nishant Bhat & Max Dama |
10/07 Wed | 4:30–5:00 | Lin Lu & Yi Yuan |
10/09 Fri | 2:00–2:30 | David Nachum |
10/09 Fri | 2:30–3:00 | Matt Senate & Adam Goldberg |
10/09 Fri | 3:00–3:30 | Brian Tang |
10/09 Fri | 3:30–4:00 | Max Moacanin & Tung Phan |
Location: Evans Hall, Room 2
Times: 3:00–4:00 PM on Mon/Wed/Fri
Instructor: Lek-Heng
Lim
Evans Hall, Room 873
lekheng(at)math.berkeley.edu
(510) 642-8576
Office hours:
12:00–1:00 PM, Mon and Wed, 2:00 AM–3:00 PM, Wed
Here's a list of mathematical topics that we shall examine (some only at a very basic level). Of course everything will be motivated by and made relevant to some cool applications.
To be determined. Course grade is likely to be based more on projects/term papers and less on homeworks/exams.
There is no single official textbook for this course but the following are some useful references. It would not be unwise to first discuss with me before deciding which ones to buy (it depends on your interests).