Internet Measurement and Data Analysis (GIGA)

Course title: Internet Measurement and Data Analysis (GIGA)
http://web.sfc.keio.ac.jp/~kjc/classes/sfc2014f-measurement/

The Internet now makes it possible to access a huge amount of diverse
data. This allows us to obtain new knowledge and create new services,
leading to innovations known as "Big Data" or "Collective
Intelligence".
To understand such data and use it as a tool, one needs a good
understanding of the technical background in statistics, machine
learning, and computer network systems.
In this class, you will get an overview of large-scale data analysis
on the Internet and learn basic skills for extracting new knowledge
from massive information in the forthcoming information society.


Instructor: Kenjiro Cho
Term: Fall semester 2014, Mondays, 4th period
Level: Undergraduate
References: reference list


Lecture videos and materials
Lecture 01 (2014/09/22): Introduction
- lecture slides (pdf) (URL)
- exercise script (count.rb) (URL)
- optional reading material (URL)
- Big Data and Collective Intelligence
- Internet measurement
- Large-scale data analysis
- exercise: introduction of Ruby scripting language


Lecture 02 (2014/09/29): Data and variability
- lecture slides (pdf) (URL)
- exercise data (marathon.txt) (URL)
- exercise script (mean.rb) (URL)
- exercise script (median.rb) (URL)
- exercise script (stddev2.rb) (URL)
- Summary statistics
- Sampling
- How to make good graphs
- exercise: graph plotting by Gnuplot
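
The summary statistics listed for this lecture can be sketched in Ruby as follows. This is an illustrative sketch only, not the course's mean.rb, median.rb, or stddev2.rb; the function names and the sample values are assumptions made up for the example, and the standard deviation shown is the population form (dividing by n).

```ruby
# Arithmetic mean of an array of numbers.
def mean(values)
  values.sum.to_f / values.size
end

# Median: the middle value, or the average of the two middle values.
def median(values)
  sorted = values.sort
  mid = sorted.size / 2
  sorted.size.odd? ? sorted[mid].to_f : (sorted[mid - 1] + sorted[mid]) / 2.0
end

# Population standard deviation (divides by n, not n - 1).
def stddev(values)
  m = mean(values)
  Math.sqrt(values.sum { |v| (v - m)**2 } / values.size)
end

data = [2, 4, 4, 4, 5, 5, 7, 9] # made-up sample values
puts "mean=#{mean(data)} median=#{median(data)} stddev=#{stddev(data)}"
```

For a sample rather than a whole population, dividing by n - 1 instead of n is the usual choice.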


Lecture 03 (2014/10/20): Data recording and log analysis
- lecture slides (pdf) (URL)
- exercise data (sample_access_log.zip) (21MB) (URL)
- exercise script (parse_accesslog.rb) (URL)
- exercise script (access.plt) (URL)
- Network management tools
- Data format
- Log analysis methods
- exercise: log data and regular expression
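
As a rough illustration of log analysis with regular expressions, the sketch below parses one line of a web access log. It assumes the Apache Common Log Format; the actual format of sample_access_log and the contents of parse_accesslog.rb are not shown on this page, so the pattern, the `parse_line` helper, and the sample line are all hypothetical.

```ruby
# Pattern for one line in Apache Common Log Format (assumed format):
# host ident user [time] "request" status bytes
LOG_RE = /\A(\S+) (\S+) (\S+) \[([^\]]+)\] "([^"]*)" (\d{3}) (\d+|-)/

# Returns a hash of fields, or nil if the line does not match.
def parse_line(line)
  m = LOG_RE.match(line)
  return nil unless m
  { host: m[1], time: m[4], request: m[5], status: m[6].to_i,
    bytes: m[7] == "-" ? 0 : m[7].to_i }
end

# Made-up log line using a documentation-only IP address.
line = '192.0.2.1 - - [20/Oct/2014:14:45:00 +0900] "GET /index.html HTTP/1.1" 200 1234'
entry = parse_line(line)
puts "#{entry[:host]} #{entry[:status]} #{entry[:bytes]}"
```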


Lecture 04 (2014/10/27): Distribution and confidence intervals
- lecture slides (pdf) (URL)
- exercise script (box-muller.rb) (URL)
- exercise script (box-muller-hist.rb) (URL)
- exercise script (box-muller-hist.plt) (URL)
- exercise script (conf-interval.rb) (URL)
- exercise script (conf-interval.plt) (URL)
- Normal distribution
- Confidence intervals and statistical tests
- Distribution generation
- exercise: confidence intervals
- assignment 1
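
The two ideas in this lecture, generating normally distributed values and computing a confidence interval, can be sketched together. This is a minimal sketch and not the course's box-muller.rb or conf-interval.rb: it uses the Box-Muller transform and a normal-approximation 95% interval (mean ± 1.96 standard errors). The function names, the seed, and the parameters 60 and 10 are assumptions chosen for illustration.

```ruby
# One standard normal value via the Box-Muller transform.
def box_muller(rng)
  u1 = 1.0 - rng.rand # in (0, 1], avoids log(0)
  u2 = rng.rand
  Math.sqrt(-2.0 * Math.log(u1)) * Math.cos(2.0 * Math::PI * u2)
end

# 95% confidence interval for the mean, using the normal approximation.
def confidence_interval95(values)
  n = values.size
  mean = values.sum / n.to_f
  var = values.sum { |v| (v - mean)**2 } / (n - 1) # sample variance
  half = 1.96 * Math.sqrt(var / n)
  [mean - half, mean + half]
end

rng = Random.new(1) # fixed seed so the run is repeatable
samples = Array.new(1000) { 60.0 + 10.0 * box_muller(rng) }
lo, hi = confidence_interval95(samples)
puts format("95%% CI for the mean: [%.2f, %.2f]", lo, hi)
```

For small samples, a t-distribution quantile would normally replace the constant 1.96.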


Lecture 05 (2014/11/10): Diversity and complexity
- lecture slides (pdf) (URL)
- exercise script (count_contents.rb) (URL)
- exercise script (make_ccdf.rb) (URL)
- exercise script (ccdf.plt) (URL)
- Long tail
- Web access and content distribution
- Power-law and complex systems
- exercise: power-law analysis
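
Power-law analysis commonly plots the complementary cumulative distribution function (CCDF) on log-log axes. The sketch below shows one way to compute an empirical CCDF, defined here as the fraction of samples with a value greater than or equal to x. It is not the course's make_ccdf.rb; the `ccdf` function and the sample counts are made up for illustration.

```ruby
# Empirical CCDF: for each distinct value x, the fraction of samples >= x.
# Returns an array of [x, fraction] pairs sorted by x.
def ccdf(values)
  counts = Hash.new(0)
  values.each { |v| counts[v] += 1 }
  n = values.size.to_f
  remaining = values.size # number of samples >= the current x
  counts.sort.map do |x, count|
    point = [x, remaining / n]
    remaining -= count
    point
  end
end

# Made-up access counts per content item.
ccdf([1, 1, 2, 5]).each { |x, frac| puts "#{x} #{frac}" }
```

The two-column output can be fed to Gnuplot with logarithmic scales on both axes; a roughly straight line suggests power-law behavior.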


Lecture 06 (2014/11/17): Correlation
- lecture slides (pdf) (URL)
- exercise script (correlation.rb) (URL)
- exercise data (correlation-data-1.txt) (URL)
- exercise data (correlation-data-2.txt) (URL)
- exercise script (similarity.rb) (URL)
- exercise data (scores.txt) (URL)
- Online recommendation systems
- Distance
- Correlation coefficient
- exercise: correlation analysis
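
The correlation coefficient named above is usually Pearson's product-moment coefficient. A minimal sketch follows; it is not the course's correlation.rb, and the `pearson` function and data values are assumptions for illustration.

```ruby
# Pearson correlation coefficient between two equal-length arrays.
def pearson(xs, ys)
  n = xs.size.to_f
  mx = xs.sum / n
  my = ys.sum / n
  cov = xs.zip(ys).sum { |x, y| (x - mx) * (y - my) }
  sx = Math.sqrt(xs.sum { |x| (x - mx)**2 })
  sy = Math.sqrt(ys.sum { |y| (y - my)**2 })
  cov / (sx * sy)
end

xs = [1, 2, 3, 4, 5] # made-up data
ys = [2, 4, 5, 4, 5]
puts pearson(xs, ys)
```

The result lies between -1 and 1; the function is undefined (division by zero) when either input has no variation.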


Lecture 07 (2014/12/01): Multivariate analysis
- lecture slides (pdf) (URL)
- exercise script (leastsquare.rb) (URL)
- exercise script (regress-1.plt) (URL)
- exercise script (regress-2.plt) (URL)
- Data sensing and GeoLocation
- Linear regression
- Principal Component Analysis
- exercise: linear regression
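
Simple linear regression fits a line y = a + b x by least squares, with b = Sxy / Sxx and a = mean(y) - b mean(x). The sketch below illustrates that formula; it is not the course's leastsquare.rb, and the `least_squares` function and the data points are hypothetical.

```ruby
# Least-squares fit of y = intercept + slope * x.
# Returns [intercept, slope].
def least_squares(xs, ys)
  n = xs.size.to_f
  mx = xs.sum / n
  my = ys.sum / n
  sxy = xs.zip(ys).sum { |x, y| (x - mx) * (y - my) }
  sxx = xs.sum { |x| (x - mx)**2 }
  slope = sxy / sxx
  [my - slope * mx, slope]
end

# Made-up points lying exactly on y = 1 + 2x.
intercept, slope = least_squares([0, 1, 2, 3], [1, 3, 5, 7])
puts "y = #{intercept} + #{slope} * x"
```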


Lecture 08 (2014/12/08): Time-series analysis
- lecture slides (pdf) (URL)
- exercise data (autocorr_5min_data.txt) (URL)
- exercise data (ifbps-201205.txt) (URL)
- exercise script (autocorr.rb) (URL)
- exercise script (autocorr.plt) (URL)
- exercise script (hourly_out.rb) (URL)
- exercise script (hourly_out.plt) (URL)
- exercise script (weekview_out.rb) (URL)
- exercise script (week_out.plt) (URL)
- exercise script (correlation_out.rb) (URL)
- Internet and time
- Network Time Protocol
- Time series analysis
- exercise: time-series analysis
- assignment 2
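
A basic tool in time-series analysis is the autocorrelation at lag k: the sum of products of deviations k steps apart, divided by the total sum of squared deviations. The sketch below is one common form of that estimate and is not the course's autocorr.rb; the `autocorr` function and the synthetic sine series with period 12 are assumptions for illustration.

```ruby
# Sample autocorrelation of a series at the given lag.
def autocorr(series, lag)
  n = series.size
  mean = series.sum / n.to_f
  denom = series.sum { |x| (x - mean)**2 }
  numer = (0...(n - lag)).sum do |t|
    (series[t] - mean) * (series[t + lag] - mean)
  end
  numer / denom
end

# Synthetic periodic series (period of 12 samples).
series = Array.new(48) { |t| Math.sin(2 * Math::PI * t / 12) }
[0, 6, 12].each { |lag| puts "lag #{lag}: #{autocorr(series, lag)}" }
```

For periodic traffic data, peaks in the autocorrelation at lags matching a day or a week indicate daily or weekly cycles.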


Lecture 09 (2014/12/15): Topology and graph
- lecture slides (pdf) (URL)
- exercise data (topology.txt) (URL)
- exercise script (dijkstra.rb) (URL)
- Routing protocols
- Graph theory
- exercise: shortest-path algorithm
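
The exercise script is named dijkstra.rb, so the shortest-path algorithm here is presumably Dijkstra's. The sketch below is a simple version that scans for the closest unvisited node instead of using a priority queue. It is not the course's script: the `dijkstra` function, the hash-of-hashes graph representation, and the example topology are assumptions, and the format of topology.txt is not known from this page.

```ruby
# Dijkstra's algorithm on a graph given as { node => { neighbor => cost } }.
# Every node must appear as a key. Returns a hash of shortest distances
# from source for all reachable nodes.
def dijkstra(graph, source)
  dist = Hash.new(Float::INFINITY)
  dist[source] = 0
  unvisited = graph.keys
  until unvisited.empty?
    u = unvisited.min_by { |node| dist[node] }
    break if dist[u] == Float::INFINITY # remaining nodes are unreachable
    unvisited.delete(u)
    graph[u].each do |v, cost|
      dist[v] = dist[u] + cost if dist[u] + cost < dist[v]
    end
  end
  dist
end

# Made-up undirected topology with link costs.
graph = {
  "a" => { "b" => 1, "c" => 4 },
  "b" => { "a" => 1, "c" => 2, "d" => 5 },
  "c" => { "a" => 4, "b" => 2, "d" => 1 },
  "d" => { "b" => 5, "c" => 1 }
}
dijkstra(graph, "a").each { |node, d| puts "#{node}: #{d}" }
```

The linear scan makes this O(V^2), which is fine for small topologies; a binary heap is the usual improvement for large graphs.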


Lecture 10 (2014/12/22): Anomaly detection and machine learning
- lecture slides (pdf) (URL)
- exercise script (naivebayes.rb) (URL)
- Anomaly detection
- Machine Learning
- Spam filtering and Bayes' theorem
- exercise: naive Bayesian filter
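
A naive Bayesian filter scores each class by its prior probability multiplied by per-word likelihoods, treating words as independent. The sketch below works in log space with add-one (Laplace) smoothing. It is an illustrative sketch, not the course's naivebayes.rb; the `NaiveBayes` class, its methods, and the training sentences are all hypothetical.

```ruby
class NaiveBayes
  def initialize
    @word_counts = Hash.new { |h, k| h[k] = Hash.new(0) } # label -> word -> count
    @doc_counts = Hash.new(0)                             # label -> documents seen
    @vocab = {}                                           # set of all known words
  end

  def train(label, text)
    @doc_counts[label] += 1
    text.downcase.split.each do |word|
      @word_counts[label][word] += 1
      @vocab[word] = true
    end
  end

  # Returns the label with the highest log posterior score.
  def classify(text)
    total_docs = @doc_counts.values.sum.to_f
    scores = @doc_counts.keys.map do |label|
      total_words = @word_counts[label].values.sum
      score = Math.log(@doc_counts[label] / total_docs) # log prior
      text.downcase.split.each do |word|
        # Add-one smoothing keeps unseen words from zeroing the score.
        score += Math.log((@word_counts[label][word] + 1.0) /
                          (total_words + @vocab.size))
      end
      [label, score]
    end
    scores.max_by { |_, s| s }.first
  end
end

nb = NaiveBayes.new
nb.train(:spam, "cheap pills buy now") # made-up training messages
nb.train(:spam, "buy cheap watches")
nb.train(:ham, "meeting schedule for monday")
nb.train(:ham, "lecture notes for monday class")
puts nb.classify("buy cheap pills")
```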


Lecture 11 (2015/01/14): Data Mining
- lecture slides (pdf) (URL)
- exercise script (k-means.rb) (URL)
- exercise data (km-data.txt) (URL)
- exercise script (km-results.plt) (URL)
- Pattern extraction
- Classification
- Clustering
- exercise: clustering
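
The exercise script is named k-means.rb, so the clustering method is presumably k-means: assign each point to its nearest centroid, then move each centroid to the mean of its members, and repeat. The sketch below handles 2-D points and, as a simplifying assumption, uses the first k points as initial centroids. It is not the course's script; the `kmeans` function and the points are made up, and the format of km-data.txt is not known from this page.

```ruby
# k-means clustering of 2-D points given as [x, y] arrays.
# Returns [centroids, assignment], where assignment[j] is the cluster
# index of points[j].
def kmeans(points, k, iterations: 20)
  centroids = points.first(k).map(&:dup) # naive initialization (assumption)
  assignment = []
  iterations.times do
    # Assignment step: nearest centroid by squared Euclidean distance.
    assignment = points.map do |p|
      (0...k).min_by do |i|
        (p[0] - centroids[i][0])**2 + (p[1] - centroids[i][1])**2
      end
    end
    # Update step: move each centroid to the mean of its members.
    k.times do |i|
      members = points.each_index.select { |j| assignment[j] == i }
                      .map { |j| points[j] }
      next if members.empty? # keep the old centroid for an empty cluster
      centroids[i] = [members.sum { |m| m[0] } / members.size.to_f,
                      members.sum { |m| m[1] } / members.size.to_f]
    end
  end
  [centroids, assignment]
end

# Made-up points forming two visible groups.
points = [[1.0, 1.0], [9.0, 9.0], [1.0, 2.0], [2.0, 1.0], [8.0, 9.0], [9.0, 8.0]]
centroids, assignment = kmeans(points, 2)
puts assignment.inspect
puts centroids.inspect
```

Results depend on the initial centroids, so practical implementations usually pick them at random and run several times.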


Lecture 12 (2015/01/19): Search and Ranking
- lecture slides (pdf) (URL)
- exercise script (pagerank.rb) (URL)
- exercise data (sample-links.txt) (URL)
- exercise data (links-100k.txt) (URL)
- Search systems
- PageRank
- exercise: PageRank algorithm
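
PageRank can be computed by power iteration: each page repeatedly passes a damped share of its rank to the pages it links to. The sketch below is a minimal version, not the course's pagerank.rb. The `pagerank` function, the damping factor of 0.85, the fixed iteration count, the even redistribution of rank from pages without outgoing links, and the tiny link graph are all assumptions for illustration; the format of sample-links.txt is not known from this page.

```ruby
# PageRank by power iteration.
# links maps each page to the array of pages it links to.
def pagerank(links, damping: 0.85, iterations: 50)
  nodes = (links.keys + links.values.flatten).uniq
  n = nodes.size.to_f
  rank = nodes.to_h { |node| [node, 1.0 / n] }
  iterations.times do
    # Rank held by pages with no outgoing links is spread over all pages.
    dangling = nodes.sum do |node|
      links.fetch(node, []).empty? ? rank[node] : 0.0
    end
    nxt = nodes.to_h do |node|
      [node, (1.0 - damping) / n + damping * dangling / n]
    end
    links.each do |src, dests|
      dests.each { |dst| nxt[dst] += damping * rank[src] / dests.size }
    end
    rank = nxt
  end
  rank
end

# Made-up link graph.
links = { "a" => ["b", "c"], "b" => ["c"], "c" => ["a"] }
pagerank(links).sort_by { |_, r| -r }.each { |page, r| puts "#{page} #{r.round(4)}" }
```

A production version would stop when the ranks change by less than a tolerance rather than after a fixed number of iterations.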



Powered by SOI Copyright(c) 2002-2019, Keio University Shonan Fujisawa Campus. All rights reserved.