[ English ]
インターネット計測とデータ解析 (GIGA)


科目名インターネット計測とデータ解析 (GIGA) [ シラバス ]

It becomes possible to access a huge amount of diverse data through
the Internet. It allows us to obtain new knowledge and create new
services, leading to an innovation called "Big Data" or "Collective
In order to understand such data and use it as a tool, one needs to
have a good understanding of the technical background in
statistics, machine learning, and computer network systems.
In this class, you will learn about the overview of large-scale data
analysis on the Internet, and basic skills to obtain new knowledge
from massive information for the forthcoming information society.

担当者 長 健二朗
授業期間2013年秋学期 水曜日4時限
授業レベル 学部
参考文献 参考文献リスト

授業ビデオとマテリアルFlash Player
授業ビデオの再生にはFlash Playerが必要です。
第01回2013/09/25 Introduction
- reading material (URL)
- lecture slides (pdf) (URL)
- Big Data and Collective Intelligence
- Internet measurement
- Large-scale data analysis
- exercise: introduction of Ruby scripting language

第02回2013/10/02 Data and variability
- lecture slides (pdf) (URL)
- excercise data (marathon.txt) (URL)
- Summary statistics
- Sampling
- How to make good graphs
- exercise: graph plotting by Gnuplot

第03回2013/10/09 Data recording and log analysis
- lecture slides (pdf) (URL)
- exercise data ( (20MB)) (URL)
- exercise script (parse_accesslog.rb) (URL)
- Network management tools
- Data format
- Log analysis methods
- exercise: log data and regular expression

第04回2013/10/23 Distribution and confidence intervals
- lecture slides (pdf) (URL)
- data for assignment 1 (honolulu-marathon-2012.txt) (URL)
- Normal distribution
- Confidence intervals and statistical tests
- Distribution generation
- exercise: confidence intervals
- assignment 1

第05回2013/10/30 Diversity and complexity
- lecture slides (pdf) (URL)
- Long tail
- Web access and content distribution
- Power-law and complex systems
- exercise: power-law analysis

第06回2013/11/06 Correlation
- lecture slides (pdf) (URL)
- exercise data (correlation-data-1.txt) (URL)
- exercise data (correlation-data-2.txt) (URL)
- Online recommendation systems
- Distance
- Correlation coefficient
- exercise: correlation analysis

第07回2013/11/13 Multivariate analysis
- lecture slides (pdf) (URL)
- Data sensing
- Linear regression
- Principal Component Analysis
- exercise: linear regression

第08回2013/11/27 Time-series analysis
- lecture slides (pdf) (URL)
- exercise data (autocorr_5min_data.txt) (URL)
- exercise data (ifbps-2011.txt) (URL)
- data for assignment 2 ( (164MB) (URL)
- data for assignment2 ( (365MB) (URL)
- Internet and time
- Network Time Protocol
- Time series analysis
- exercise: time-series analysis
- assignment 2

第09回2013/12/04 Topology and graph
- lecture slides (pdf) (URL)
- exercise data (topology.txt) (URL)
- exercise script (dijkstra.rb) (URL)
- Routing protocols
- Graph theory
- exercise: shortest-path algorithm

第10回2013/12/11 Anomaly detection and machine learning
- lecture slides (pdf) (URL)
- exercise script (naivebayes.rb) (URL)
- exercise data (km-data.txt) (URL)
- Anomaly detection
- Machine Learning
- SPAM filtering and Bayes theorem
- exercise: naive Bayesian filter

第11回2013/12/18 Data Mining
- lecture slides (pdf) (URL)
- exercise script (k-means.rb) (URL)
- Pattern extraction
- Classification
- Clustering
- exercise: clustering

第12回2013/12/25 Search and Ranking
- lecture slides (pdf) (URL)
- exercise script (pagerank.rb) (URL)
- exercise data (sample-links.txt) (URL)
- data for final report ( (324MB) (URL)
- sample data for final report ( (URL)
- data for final report ( (URL)
- Search systems
- PageRank
- exercise: PageRank algorithm

第13回2014/01/08 Scalable measurement and analysis
- lecture slides (pdf) (URL)
- Distributed parallel processing
- Cloud computing technology
- MapReduce
- exercise: MapReduce algorithm

Powered by SOI Copyright(c) 2002-2019, Keio University Shonan Fujisawa Campus. All rights reserved.