Introduction to Web Science


  • Examination 
    • Date: 18.02.2019
    • Location: D-028
    • Time:  from 16:00 hrs
  • Welcome to new Web Science students. Slides in PPTX, PDF
  • Lectures on Dec 17, 18 are cancelled
  • Demo Exam on Dec 17, 2018

Video Lecture Recordings


Assignment | Date | Submission | Tutorial
Tutorial Zero (notebook+slides) | | |
Assignment 1 (PDF) | 29-Oct-2018 | 05-Nov-2018 | 09-Nov-2018
Assignment 2 (PDF) | 05-Nov-2018 | 12-Nov-2018 | 16-Nov-2018
Assignment 3 (PDF) | 12-Nov-2018 | 19-Nov-2018 | 23-Nov-2018
Assignment 4 (PDF) | 19-Nov-2018 | 26-Nov-2018 | 30-Nov-2018
Assignment 5 (PDF) | 26-Nov-2018 | 03-Dec-2018 | 07-Dec-2018
Assignment 6 (PDF) | 03-Dec-2018 | 10-Dec-2018 | 14-Dec-2018
Assignment 7 (PDF) | 10-Dec-2018 | 17-Dec-2018 |
Assignment 8 | 17-Dec-2018 | 07-Jan-2019 |
Assignment 9 | 07-Jan-2019 | 14-Jan-2019 |
Assignment 10 | 14-Jan-2019 | 21-Jan-2019 |
Assignment 11 | 21-Jan-2019 | 28-Jan-2019 |



Week Date Topic Subtopics
0. 22-Oct-2018
Introductory Session
  • Understanding Flipped Classroom: PPTX, PDF
  • Survey of Web Science: PPTX, PDF
1. 29-Oct-2018 Ethernet
  • Communication over shared medium
  • Ethernet Header
  • Minimum Packet Length vs. Cable Length
  • Collision Detection
30-Oct-2018 Internet Protocol
  • Motivation for IP
  • Classful IPv4
  • IP forwarding algorithm
  • IP header
2. 05-Nov-2018 TCP

Extensions and Corrections: PPTX, PDF

  • End to End Principle and Connection-Oriented Communication
  • Three-Way Handshake
  • Port Numbers
  • Sliding Window Protocol
06-Nov-2018 DNS
  • Tree Structure
  • Terminology
  • Zones
  • Review facts about DNS
  • Delegating DNS Query
  • DNS Address Resolution
3. 12-Nov-2018 Internet vs WWW
  • Summary of Internet architecture
  • Motivation and requirement for WWW
  • Design Principles of WWW
  • Uniform Resource Identifier
13-Nov-2018 HTTP
  • Making HTTP Requests
  • A simple Web Client
  • HTTP Header
  • Content Negotiation
4. 19-Nov-2018 Web Content
  • Problem setting for web content formats
  • Working with XML
  • HTML for web document structures
  • Layout elements in HTML
  • Motivation for separation of content and layout
  • Cascading Style Sheets
  • Media Content
  • Meta Data
20-Nov-2018 Dynamic Web Content
  • Basic Server-side Web Programming
  • Forms and HTTP Post request
  • Handling a post request in Java Servlet
  • Client-side JavaScript
  • Ajax and XML HTTP Request
5. 26-Nov-2018 How big is the web
  • Problems with the question of the size of the web
  • 3 ways to study the Web
  • A simplistic Descriptive Model
  • An unrealistic simplistic descriptive model
27-Nov-2018 Descriptive Models
  • Counting words and Documents
  • Typical Length of the Document
  • How to formulate a research hypothesis
  • Number of words needed to understand Simple English Wikipedia
  • Linguistic way of checking simplicity of Text
6. 03-Dec-2018 Advanced Statistical Models
  • Zipf's Law for Text
  • Visually straight lines on Log-log plot
  • Fitting a curve on a log-log plot
  • Zipf's law, Power Law and Pareto Law
04-Dec-2018 Modeling Similarity
  • Similarity Measures and their applications
  • Jaccard Similarity for Sets
  • Cosine similarity for texts
  • Cosine similarity for vector space
  • Probabilistic Similarity Measure - Kullback-Leibler Divergence
  • Comparing results of Similarity
7. 10-Dec-2018 Generative Modelling of Web
  • Introduction to Generative Modelling
  • Sampling from a Probability Distribution
  • Evaluating a generative model
  • Pitfalls when increasing the number of model parameters
11-Dec-2018 Graph Theoretic Web Modelling
  • Reviewing terms from Graph Theory
  • The standard web graph model
  • Descriptive statistics of the web graph
  • Topology of Web Graph
  • Modelling Graphs with Linear Algebra
8. 17-Dec-2018 Demo Exam
18-Dec-2018 Solution Discussion
9. 07-Jan-2019 Meme Spreading     
08-Jan-2019 Meme Spreading II    
10. 14-Jan-2019 Herding Behaviour I  
15-Jan-2019 Herding Behaviour II    
11. 14-Jan-2019 Online Advertising
  • Introduction to online advertising
  • Metrics of Online Advertising
  • Factors that Impact Advertising
  • Understanding problems with Click Fraud
21-Jan-2019 User Modelling I    
12. 22-Jan-2019 User Modelling II    
28-Jan-2019 Copyright
29-Jan-2019 Net Neutrality
13. 04-Feb-2019 Misinformation I    
05-Feb-2019 Misinformation II    

Study Materials

Further Material

Important Information


To obtain ECTS credits, you must both gain admission to the exam and pass it. The exam is passed with a score of at least 50%.

The final examination will be held on Monday, February 18, 2019, from 16:00 hrs onwards in room D-028.


You are expected to hand in every assignment on time, each week on Wednesday at 11:00 am. Every member of the group must contribute to the assignments. Submit each assignment as a PDF that includes all valid source code and, where necessary, graphs. An assignment is treated as incomplete if any part of the solution is missing. To be admitted to the final exam, students must achieve at least 60% of the total assignment score. Plagiarism is strictly forbidden and results in disqualification from the final exam for both sides (those who copy and those who are copied from). Students are also expected to explain some of the assignments in the tutorial sessions.


The SVN repository to submit your assignments is:
Replace GROUPNAME with your group name in lower case letters. ( Example: )
Please remember to have your assignments in different folders under "solutions".

Last Year's Students

Students who previously failed the "Introduction to Web Science" exam can sit the examination directly this semester (in fact, they are obliged to take it). Nevertheless, participating in the course and the tutorials is advisable.
Students who gained admission to last year's exam through their assignments but did not register for that exam must earn admission anew.

Team Formation

For assignments, you must form a team of 2-3 students. To create your team, you can click here

Organizational Information

Flipped Classroom Session
Monday 14:00 - 16:00 hrs, room G-310
Tuesday 12:00 - 14:00 hrs, room K-101


Tutorial Session
Friday 12:00 - 14:00 hrs, room F-414

Prof. Dr. Steffen Staab

Jun Sun

Korok Sengupta