Sie sind hier

Machine Learning and Data Mining

Welcome to the page of Machine Learning and Data Mining course of winter terms 2017/2018!

Schedule

Lecture and Tutorial - Machine Learning and Data Mining (6 ECTS; for Master and Bachelor students in Web Science, Computer Science, Computational Visualistics and Business Informatics)

Inter-student communication: Please use the corresponding newsgroup infko-mldm here.

Veranstaltungsnummer: 0432028

Dozent(in) Prof. Dr. Steffen Staab
Time slots

We. 08.30-10.00,
Room G410     till 01.11.2017
Room M001 from 08.11.2017

Dozent(in) Dr. Mahdi Bohlouli, Raphael Menges
Time slots

We. 14.15-15.45, Room A213
Th. 16.15-17.45, Room E113

Lectures are hold on Wednesdays beginning October 18 and start on 8:30AM if not stated otherwise below. Tutorials are hold on Wednesdays and Thursdays beginning October 25.

Preliminaries

This course requires mathematics as taught for CS majors. A compact view of what is needed is available in the DeepLearningBook in Chapters 2, 3, and 4.

Course Material

Date Lecture Topics Slides
18.10.

Motivation and Introduction

machinelearning-0-overview-purged.pdf
25.10. Programming with Python machinelearning-1-intro-to-python.zip
machinelearning-1-intro-to-python.pdf
01.11. No Lecture, Public Holiday  
08.11. Classifcation, K-Nearest-Neighbors machinelearning-2-classification.pptx
machinelearning-2-classification.pdf
15.11. 8:00AM sharp-9.30AM K-Nearest-Neighbors, Bayesian Classification
22.11. 8.30AM-10.00AM First lecture of that day: Decision Trees ML-3-decision-trees.pptx
ML-3-decision-trees.pdf
22.11. 6:00PM to 7:30PM in E011 Second lecture of that day: Random Forest  
29.11 No Lecture  
6.12. Support Vector Machines  
13.12. 8:00AM sharp to 9:25AM Neural Networks / Backpropagation  
10.1. Clustering - K-Means  
17.1. Neural Network Autoencoder  
17.1. 4.15pm Prof. Dr. Marcin Grzegorzek "Medical Data Science -- Extracting Health-related Knowledge from Big Data" Abstract and CV
24.1. Reinforcement learning  
31.1. 8.30am Latent Semantics, Probabilistic Latent Semantics, Topic Models  
31.1. 4:15PM, room to be announced Dr. Thomas Gottron talks about machine learning at credit rating agency Schufa  
7.2. Questions & Answers  

 
 

Date Tutorial Topics Material
25.10. & 26.10. Tutorial and assignment structure, groups and SVN introduction salary.csv
tutorial01.pdf
08.11. & 09.11. Solutions of "Machine Learning Fundamentals", review of next assignment Blackboard
15.11. & 16.11. Solutions of "Simple Classification", review of next assignment Blackboard
22.11. & 23.11.    
29.11. & 30.11.    
06.12. & 07.12.    
13.12. & 14.12.    
10.01. & 11.01.    
17.01. & 18.01.    
24.01. & 25.01.    
31.01. & 01.02.    
07.02. & 08.02.    

Assignments

Please form groups of three people to work on the assignments here, until 26th of October!  They are graded before the next tutorial and it is mandatory to reach 60% of the points in total over all assignments to be allowed to participate in the exam. E.g., if there are 10 assignments each 10 points, you need in total at minimum 60 points in sum over all assigments to participate in the exam.
 

Release Date Assignment Submission Deadline at 9:00AM Sheets
23.10. Machine Learning Fundamentals 06.11. assignment01.pdf
comments01.txt
06.11. Simple Classification 13.11. assignment02.pdf
assignment02.csv
14.11. Decision Tree 20.11. assignment03.pdf
20.11. Naive Bayes Classificator 27.11. assignment04.pdf
dataset.zip
27.11.   04.12.  
04.12.   11.12.  
11.12.   18.12.  
18.12.   08.01.  
08.01.   15.01.  
15.01.   22.01.  
22.01.   29.01.  
29.01.   05.02.  

Exam

  • 1st exam will be on Wednesday, 21st Februrary 2018, at 8:15AM in room D028. The registration to this exam will be opened on 15th January 2018. Permitted students according to assignments can register to the exam. The exam can be found with the number 432028 in Klips.
  • 2nd exam will be on Wednesday, 11th April 2018, at 14:15AM in room tba.

Core Literature & Systems

  • Charu C. Aggarwal. Data Mining: The Textbook. Springer, 2015.
  • Ian Goodfellow, Yoshua Bengio and Aaron Courville. Deep Learning. MIT Press, 2016. http://www.deeplearningbook.org/

Further Literature, Systems & Stuff

Main Conferences

  • NIPS - Neural Information Processing
  • ICML - Int. Conf. on Machine Learning
  • IEEE ICDM - Int. Conf. on Data Mining (different from "ICDM" without "IEEE"!)
  • ACM KDD - Knowledge Discovery
  • ECML/PKDD

Talk by Prof. Dr. Marcin Grzegorzek

Abstract: On the one hand, the demographic change and the shortage of medical staff (especially in rural areas) critically challenge healthcare systems in industrialised countries. On the other hand, the digitalisation of our society progresses with a tremendous speed, so that more and more health-related data are available in a digital form. For instance, people wear intelligent glasses or/and smart watches, provide digital data with standardised medical devices (e.g., blood pressure and blood sugar meters following the standard ISO/IEEE 11073) or/and deliver personal behavioural data by their smartphones. Pattern recognition algorithms that automatically analyse and interpret that huge amount of heterogeneous data towards prevention (early risk detection), diagnosis, assistance in therapy/aftercare/rehabilitation as well as nursing will experience an extremely high scientific, societal and economic priority in the near future. In this talk, apart from a general overview and introduction to the topic, Marcin Grzegorzek will present his scientific vision addressing the research direction motivated above. It includes the development of original pattern recognition algorithms for holistic health assessment. In his research, Marcin considers mainly the steps of prevention/early risk detection as well as therapy assistance in the context of neurodegenerative diseases. After a general introduction of his scientific vision, Marcin will shortly present two of the related projects he currently leads: (1) Cognitive Village: Adaptively Learning, Technical Support System for Elderly (funded by the German Federal Ministry of Education and Research); (2) My-AHA: My Active and Healthy Ageing (EC Horizon 2020). Apart from the development of adaptive machine learning software, aspects of hardware, user acceptance as well as ELSI (Ethical, Legal and Social Implications) are also considered in these projects. Marcin will close his talk by a summary and some insights into possible future scientific directions in the area of medical data science.

Bio:

Marcin Grzegorzek is Head of the Research Group for Pattern Recognition at the University of Siegen, Professor at the University of Economics in Katowice and Chairman of the Board at Data Understanding Lab Ltd. He studied Computer Science at the Silesian University of Technology, did his PhD at the University of Erlangen-Nuremberg, worked scientifically as Postdoc at the Queen Mary University of London as well as at the University of Koblenz-Landau, and did his habilitation at the AGH University of Science and Technology in Kraków. He published around 100 papers in pattern recognition, image processing, machine learning, and multimedia analysis. For the time being, he runs six externally funded research projects. For instance, Marcin coordinates the project Cognitive Village aiming at developing a user-friendly support system for elderly that applies machine learning algorithms for sensor-based health assessment.

Beteiligte: 

Prof. Dr. Steffen Staab

staab@uni-koblenz.de

Dr. Mahdi Bohlouli

bohlouli@uni-koblenz.de

Raphael Menges

raphaelmenges@uni-koblenz.de