[ English ]
インターネット計測とデータ解析 (GIGA)


科目名インターネット計測とデータ解析 (GIGA) [ シラバス ]

It becomes possible to access a huge amount of diverse data through
the Internet. It allows us to obtain new knowledge and create new
services, leading to an innovation called "Big Data" or "Collective
In order to understand such data and use it as a tool, one needs to
have a good understanding of the technical background in
statistics, machine learning, and computer network systems.
In this class, you will learn about the overview of large-scale data
analysis on the Internet, and basic skills to obtain new knowledge
from massive information for the forthcoming information society.

担当者 長 健二朗
授業期間2012年秋学期 水曜日4時限
授業レベル 学部
参考文献 参考文献リスト

授業ビデオとマテリアルFlash Player
授業ビデオの再生にはFlash Playerが必要です。
第01回2012/09/26 Introduction
- lecture slides (pdf) (URL)
- reading material (pdf) (URL)
	Big Data and Collective Intelligence,

	Internet measurement,

	Large-scale data analysis,

	exercise: introduction of Ruby scripting language

第02回2012/10/03 Data and variability
- lecture slides (pdf) (URL)
- exercise data (marathon.txt) (URL)
	Summary statistics,


	How to make good graphs,

	exercise: graph plotting by Gnuplot

第03回2012/10/17 Data recording and log analysis
- lecture slides (pdf) (URL)
- exercise data (sample_access_log.bz2 (14MB)) (URL)
- exercise data ( (28MB)) (URL)
- exercise data (test-100lines) (URL)
	Network management tools,

	Data format,

	Log analysis methods,

	exercise: log data and regular expression

第04回2012/10/24 Distribution and confidence intervals
- lecture slides (pdf) (URL)
- data for assignment 1 (URL)
	Normal distribution,

	Confidence intervals and statistical tests,

	Distribution generation,

	exercise: confidence intervals,

	assignment 1

第05回2012/10/31 Diversity and complexity
- lecture slides (pdf) (URL)
	Long tail,

	Web access and content distribution,

	Power-law and complex systems,

	exercise: power-law analysis

第06回2012/11/07 Correlation
- exercise data 2 (URL)
- lecture slides (pdf) (URL)
- exercise data 1 (URL)
	Online recommendation systems,


	Correlation coefficient,

	exercise: correlation analysis

第07回2012/11/14 Multivariate analysis
- lecture slides (pdf) (URL)
	Data sensing,

	Linear regression,

	Principal Component Analysis,

	exercise: linear regression

第08回2012/11/20 Time-series analysis
- lecture slides (pdf) (URL)
- exercise data (autocorr_5min_data.txt) (URL)
- exercise data (ifbps-2011.txt) (URL)
- data for assignment 2 (URL)
	Internet and time,

	Network Time Protocol,

	Time series analysis,

	exercise: time-series analysis,

	assignment 2

第09回2012/11/28 Topology and graph
- lecture slides (pdf) (URL)
- exercise data (topology.txt) (URL)
- exercise script (dijkstra.rb) (URL)
	Routing protocols,

	Graph theory,

	exercise: shortest-path algorithm

第10回2012/12/05 Anomaly detection and machine learning
- lecture slides (pdf) (URL)
- exercise script (naivebayes.rb) (URL)
	Anomaly detection,

	Machine Learning,

	SPAM filtering and Bayes theorem,

	exercise: naive Bayesian filter

第11回2012/12/12 Data Mining
- lecture slides (pdf) (URL)
- exercise script (k-means.rb) (URL)
- exercise data (km-data.txt) (URL)
	Pattern extraction,



	exercise: clustering

第12回2012/12/19 Search and Ranking
- lecture slides (pdf) (URL)
- exercise script (pagerank.rb) (URL)
- exercise data (sample-links.txt) (URL)
- final report data ( (323MB) (URL)
- final report data ( (14MB) (URL)
- final report data ( (28MB) (URL)
	Search systems,


	exercise: PageRank algorithm

第13回2012/12/26 Scalable measurement and analysis
- lecture slides (pdf) (URL)
	Distributed parallel processing,

	Cloud computing technology,


	exercise: MapReduce algorithm

第14回2013/01/09 Privacy Issues
- lecture slides (pdf) (URL)
	Internet data analysis and privacy issues,

	Summary of the class

Powered by SOI Copyright(c) 2002-2019, Keio University Shonan Fujisawa Campus. All rights reserved.