Introduction to Web Science
[go to overview]Winter Term 2017 / 2018
News
-
No tutorial on Feb 2. Assignment 11 will be explained on Feb 9.
Assignments
Assignment | Due (@ 11:00 am) | Solution | Solution (Discussion) |
---|---|---|---|
Assignment 1, PDF | 25.10.2017 | Slides | 27.10.2017 |
Assignment 2, PDF | 01.11.2017 | Solution | 03.11.2017 |
Assignment 3, PDF | 08.11.2017 | Solution | 10.11.2017 |
Assignment 4, PDF | 17.11.2017 | Solution | 24.11.2017 |
Assignment 5, PDF | 22.11.2017 | Solution | 24.11.2017 |
Assignment 6, PDF | 29.11.2017 | Solution | 01.12.2017 |
Assignment 7, PDF | 06.12.2017 | Solution | 08.12.2017 |
Extra Tutorial | 15.12.2017 | ||
Assignment 8, PDF | 20.12.2017 | Solution | 22.12.2017 |
Assignment 9, PDF | 10.01.2018 | Solution | 12.01,2018 |
Assignment 10, PDF | 17.01.2018 | Solution | 19.01.2018 |
Assignment 11, PDF | 07.02.2018 | Solution | 09.02.2018 |
Schedule
Week | Dates | Topics | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Week 1 | 23/10/2017 | Ethernet | Communication over shared medium | Ethernet Header | Minimum Package Length vs. Cable Length | Collision Detection | ||||
24/10/2017 | Internet Protocol | Motivation for IP | Classful IPv4 | IP forwarding algorithm | IP header | |||||
Week 2 | 30/10/2017 | TCP | End to End Principle and Connection Oriented Communication | Threeway Handshake | Port Numbers | Sliding Window Protocol | ||||
31/10/2017 | ||||||||||
Week 3 | 6/11/2017 | DNS | Tree Structure | Terminology | Zones | Review facts about DNS | Delegating DNS Query | DNS Address Resolution | ||
7/11/2017 | Internet vs WWW | Summary of Internet architecture | Motivation and requirment for WWW | Design Principles of WWW | Uniform Resource Identifier | |||||
HTTP | Making HTTP Requests | A simple Web Client | HTTP Header | Content Negotiation | ||||||
Week 4 | 13/11/2017 | Web Content | Problem setting for web content formats | Working with XML | HTML for web document structures | Layout elements in HTML | Motivation for separation of content and layout | Cascading Style Sheets | Media Content | Meta Data |
14/11/2017 | Dynamic Web Content | Basic Server side Web Programming | Forms and HTTP Post request | Handling a post request in Java Servlet | Client side Java Script | Ajax and XML HTTP Request | ||||
Week 5 | 20/11/2017 | How big is the web | Problems with the question of the size of the web | 3 ways to study the Web | A simplistic Descriptive Model | An unrealistic simplistic descriptive model | ||||
21/11/2017 | Descriptive Models | Counting words and Documents | Typical Length of the Document | How to formulate a research hypothesis | Number of words needed to understand Simple English Wikipedia | Linguistis way of checking simplicity of Text | ||||
Week 6 | 27/11/2017 | Advanced Statistical Models | Zipf's Law for Text | Visually straight lines on Log log plot | Fitting a curve on a log log plot | Zipf's law, Power Law and Pareto Law | ||||
28/11/2017 | Modellling Similarity | Similarity Measures and their applications | Jaccard Similarity for Sets | Cosine similarity for texts | Cosine similarity for vector space | Probabilistic Similarity Measure - Kulback Leibler Divergence | Comparing results of Similarity | |||
Week 7 | 4/12/2017 | Generative Modelling of Web | Introduction to Generative Modelling | Sampling form Probability Distrubution | Evaluating a generative model | Pittfalls when increasing number of model parameters | ||||
5/12/2017 | Graph Theoritic Web Modelling | Reviewing terms from Graph Theory | The standard web graph model | Descriptive statistics of the web graph | Topology of Web Graph | Modelling Graphs with Linear Algebra | ||||
Week 8 | 11/12/2017 | Meme Spreading | PPTX | |||||||
12/12/2017 | Meme Spreading II | |||||||||
Week 9 | 18/12/2017 | Herding Behaviour I | ||||||||
19/12/2017 | Herding Behaviour II | |||||||||
Week 10 | 08/01/2018 | Online Advertising | Introduction to online advertising | Metrics of Oline Advertisment | Factors impact on Advertising | Understanding problems with Click Fraud | ||||
09/01/2018 | User Modelling | |||||||||
Week 11 | 15/01/2018 |
User Modelling (cont.) |
||||||||
Copyright | ||||||||||
16/01/2018 | Copyright (cont.) |
|||||||||
16/01/2018 | Net Neutrality | |||||||||
Week 12 | 22/01/2018 | DEMO EXAM | ||||||||
23/01/2018 | EXAM Solution Discussion | |||||||||
Week 13 |
29/01/2018 | Net Neutrality (cont.) |
||||||||
30/01/2018 | Misinformation (A) | Part A - PDF | Part A - PPTX | |||||||
Week 14 |
05/02/2018 | Misinformation(B) | ||||||||
06/02/2018 |
Study Materials
- Course materials for this introductory course to Web Science can be found at the Wikiversity page.
- Tutorial Zero Slides : here!
- ipython notebook guide
- latex notations in ipython
- Slides for the lecture on Dynamic Web Content with Python on 14.11.2017.
- Slides for Crawler Development with Python on 17.11.2017
- Notes for log-log plots.
- Slides for Meme Spreading
- Slides for Herding Behaviour
- Online Advertisement videos:
- Introduction to Online Advertisement
- Metrics for Online Advertisement
- Factors impact on Advertisement Campaigns
- Understanding Problems with Click Fraud.
- Optional: Interesting read: "Tracing the Trackers", Yu et. al.
In: Proc. of WWW 2016 ("of the top 200 news sites, as ranked by Alexa, contain at least one tracker, and at least 50% of them contain at least 11").
- Slides for lecture on Socio Economic Aspects of the Web on 19.12.2017
Further Material
- Material last year: https://west.uni-koblenz.de/de/studying/courses/ws1617/introduction-to-web-science
- Hal Berghel. Net Neutrality Reloaded. In: IEEE Computer, 2017, pp. 68-72. (behind IEEE paywall, accessible within university or using VPN)
Important Information
Examination
In order to obtain ECTS-Credits you need to both gain admission to the exam and you need to pass the exam. The exam is passed if you obtain a score of at least 50% in it.
The Final Examination will be held on February 19 (Monday), 2018 from 10:00 am onwards at E-011
The 2nd Examination: April 27, 2018 (Friday), from 12:00 pm onwards at M-001.
Assignments
You are expected to hand in all the assignments in due time, every week on Wednesday at 11:00 am. The assignments must be done by everyone in the group. The assignments should be handed in a pdf format with all valid source code and graphs if necessary. Assignments will be treated incomplete if any part of the solution is missing. Students need to accomplish at least 60% of the assignments in order to attend the final exam. Plagiarism is strictly forbidden and will result in disqualification from the final exam, of both sides of the plagiarism (copying and being copied). Students are also expected to explain some of the assignments.
SVN
The SVN repository to submit your assignments is : https://svn.uni-koblenz.de/westteaching/webscience-1718/GROUPNAME
Replace GROUPNAME with your group name in lower case letters. Please remember to have your assignments in different folders under "solutions" .
Last Year Students
Students who failed to secure passing grades last year can directly appear for the examinations this year. However, it is advisable to participate in the course and the tutorials.
Team Formation
For assignments, you have to make a group of 2-3 to be in a team. To create your teams, you can click here.
Organizational Information
Flipped Classroom Session | ||
Monday | 14:00 - 16:00 hrs | E - 011 |
Tuesday | 12:00 - 14:00 hrs | K - 101 |
Tutorial Session | ||
Friday | 12:00 - 14:00 hrs | F - 414 |