
Data Science

Data Science (cf. the Wikipedia definition of data science) describes an approach to problems that draws on a set of skills not rooted in any single classic community. Instead, these skills cut across disciplines such as physics, biology, the social sciences and economics. Data science relies on elaborate computer science paradigms and requires a background in statistics. It serves the new and the classical economy as well as the medical field.

Data scientists: IT's new rock stars

(Preliminary) Lecturing Schedule

Lecture: 4:00 pm - 5:30 pm, E 313

Tutorial: 5:45 pm - 7:15 pm, E 313

 

Date | Title | Lecturer
19.04. | Intro to DS | Wagner
 | Tutorial: Intro & Logistics | Wagner
26.04. | Descriptive Statistics & Probabilities (Exercise 1, ALLBUS data & description) | Wagner
03.05. | Hypothesis Testing, p-values (Exercise 2) | Wagner
 | Tutorial: Probabilities & Descriptive Statistics | Wagner
10.05. | Power of Test, Effect Sizes (Exercise 3) | Wagner
 | Tutorial: Hypothesis Testing | Wagner
17.05. | Confidence Intervals (CI), Nonparametric Statistics, Likelihood (Exercise 4) | Wagner
 | Tutorial | Wagner
24.05. | MLE, MAP and Bayesian Inference (Exercise 5) | Kling
 | Tutorial | Kling
31.05. | Relationships & Regression | Wagner
 | Exercise | Kling
07.06. | (Public holiday) | -
14.06. | Regression | Wagner
 | Exercise | Kling
21.06. | Causality | Wagner
 | Exercise | Kling
28.06. | Graphical Models 1 | Kling
 | Exercise | Kling
05.07. | Graphical Models 2 | Kling
 | Exercise | Kling
12.07. | Advanced Inference Methods | Kling
 | Exercise | Kling
19.07. | Visualisation and Data Storytelling | Wagner
 | Exercise | Kling
26.07. | (No lecture) | -
02.08. | Exam | Wagner/Kling

Exercises

The exercises will be done in groups of X students. To be admitted to the exam, each group must submit solutions for all but one exercise. For submissions, each group will get its own SVN repository.

Programming will be in IPython with IPython notebooks :)
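To give a rough idea of what a notebook cell for the exercises might look like, here is a minimal sketch using only the Python standard library. The sample values and the `describe` helper are made up for illustration; they are not course material or part of any exercise sheet.

```python
import statistics

# Hypothetical sample, invented for illustration (not course data).
ages = [23, 25, 25, 31, 38, 42, 47]


def describe(values):
    """Return basic descriptive statistics for a list of numbers."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation (n - 1)
    }


summary = describe(ages)
for name, value in summary.items():
    print(f"{name}: {value}")
```

In a notebook, a cell like this can be re-run after changing the data, which makes it convenient for exploring a dataset step by step.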

Literature

  1. Vasant Dhar. Data Science and Prediction. In: Communications of the ACM, December 2013, Vol. 56, No. 12, pp. 64-73
  2. Anand Rajaraman, Jeffrey Ullman, Jure Leskovec, Mining of Massive Datasets, Cambridge University Press (free download)
  3. Jeffrey Stanton, Introduction to Data Science (free download)
  4. John Hopcroft. Foundations of Data Science.
  5. http://www.wolframscience.com/thebook.html
  6. Alon Halevy, Peter Norvig, Fernando Pereira. The Unreasonable Effectiveness of Data. In: IEEE Intelligent Systems, March/April 2009.

 

People involved:

JProf. Dr. Claudia Wagner

clwagner@uni-koblenz.de

Dr. Christoph Kling

datascience@c-kling.de