Skip to content

Nikita-Belyakov/intro_ml_2021

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 

Repository files navigation

2021 Introduction to Machine Learning and Data Analysis

This is a repo for a 2021 course at the Department of Cosmic Research, MSU. The course covers the very basic concepts of ML, it is obligatory for 4th year students, and for MS students it is an elective course.

General info

  • The course consists of 12 lectures and 12 seminars and 5-6 homeworks.
  • All lectures and seminars will be held online via Zoom and probably recorded (Zoom screen capture).
  • PDFs with lecture slides will be published here: https://disk.yandex.ru/d/_tj5jTusnb0CEQ
  • All homeworks are to be submitted online.

!!There will be no exams, final mark is based on homeworks only!!

Communication

Lectures and seminars

The timetable may be change.

  • Lectures: 13:00 - 14:30 (UTC+3)
  • Seminars: 14:40 - 16:00 (UTC+3)

Presentations and videos can be downloaded from here: https://disk.yandex.ru/d/_tj5jTusnb0CEQ

# Date Lecture Seminar Homework
1 2021-09-09 Introduction to the topic How to manage this course HW#1
2 2021-09-16 Metric Algorithms, kNN Classification Quality Metrics -
3 2021-09-23 Decision Trees How to tune hyperparameters -
4 2021-09-30 Linear Models Invited speaker: Vasily Lavrov. Intro to Data Visualization HW#2
5 2021-10-07 SVM PCA and SVD -
6 2021-10-14 Regularized Linear Models Useful web-pages for your DS career HW#3
7 2021-10-21 Time Series Hands-on practice: design your alerting system -
8 2021-10-28 Text Classification Invited Speakers: how to apply and get hired HW#4
9 2021-11-11 Bayes approach, EM TBA -
10 2021-11-18 Clustering Clustering hands-on practice: is your password weak? HW#5
11 2021-11-25 Ranking TBA -
12 2021-12-02 Gradient Boosting TBA HW#6
13 2021-12-09 No Lecture Here TBA -

Homeworks and deadlines

Each homework has its deadline. Submission after deadline will reduce points.

# Name Date Published Deadline Link
1 HW#1 Intro to numpy and pandas 2021-09-10 2021-09-29 23:00:00 +03:00 [link]
2 HW#2 Basic feature selection and hyperparameter tuning 2021-09-30 2021-10-20 23:00:00 +03:00 TBA
3 HW#3 Kaggle Comptetition: Classifiaction 2021-10-14 2021-10-24 23:00:00 +03:00 / 2021-11-03 23:00:00 +03:00 TBA
4 HW#4 TextClassification 2021-10-28 2021-11-17 23:00:00 +03:00 TBA
5 HW#5 Kaggle Competition: Regression 2021-11-18 2021-11-28 23:00:00 +03:00 / 2021-12-01 23:00:00 +03:00 TBA

Late Submission Discounts:

  • Deadlines: HW#1 - 2 weeks, HW#2 and HW#4 - 3 weeks
  • Submission after the deadline: each score is multiplied by 0.5
  • Late submissions are not allowed in HW#3 and HW#5
  • Submission rules for HW#3: 10 days to beat the baseline, 10 days to get your best score. No late submissions
  • Submission rules for HW#5: 10 days to beat the baseline, 10 days to get your best score. No late submissions

HW#3 and HW#5 points

  • Everyone starts with 0.
  • Beating Baseline before Medium Deadline +5
  • 1st place +17
  • 2nd place +15
  • 3rd place +13
  • Places 4-6 +10
  • Places 7-9 +9
  • Places 10-12 +8
  • Places 13-15 +7
  • Places 16-18 +6
  • And so on

Acknowledgements

This course was inspired by

  • The course by Konstantin Vorontsov from the Coursera [link]
  • MIT OpenCourseWare Machine Learning course: Rohit Singh, Tommi Jaakkola, and Ali Mohammad. 6.867 Machine Learning. Fall 2006. Massachusetts Institute of Technology: MIT OpenCourseWare, https://ocw.mit.edu. License: Creative Commons BY-NC-SA.
  • Data Mining In Action: [link]
  • Author's personal experience in Data Science obtained from Yandex School of Data Science [link], working at Yandex [link], Lensa [link] and various ML-projects.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%