KGC


[ Japanese ]
INTERNET MEASUREMENT AND DATA ANALYSIS (GIGA/GI)

NEWS


TitleINTERNET MEASUREMENT AND DATA ANALYSIS (GIGA/GI) [ Syllabus ]
http://web.sfc.keio.ac.jp/~kjc/classes/sfc2016s-measurement/


Faculty Kenjiro Cho
Term2016 Fall
Level Undergraduate


Inquiry - Inquiry about this course


Lecture Video & MaterialsFlash Player
Flash Player is required to play the videos.
Click the lecture title to see lecture materials and video
#012016/04/11 Introduction
- exercise script (count.rb)
- exercise script (count-rubyish.rb)
- optional reading material (URL)
- lecture slides (pdf)
- Big Data and Collective Intelligence
- Internet measurement
- Large-scale data analysis
- exercise: introduction of Ruby scripting language


#022016/04/18 Data and variability
- lecture slides (pdf)
- exercise data (marathon.txt)
- exercise script (mean.rb)
- exercise script (stddev.rb)
- exercise script (stddev2.rb)
- exercise script (median.rb)
- exercise script (marathon.plt)
- exercise script (marathon-cdf.rb)
- exercise script (marathon-cdf.plt)
- Summary statistics
-  Sampling
-  How to make good graphs
- exercise: graph plotting by Gnuplot


#032016/04/25 Data recording and log analysis
- lecture slides (pdf)
- exercise data (sample_access_log.zip) (21MB)
- exercise script (parse_accesslog.rb)
- exercise script (access.plt)
- Network management tools
- Data format
- Log analysis methods
- exercise: log data and regular expression


#042016/05/02 Distribution and confidence intervals
- lecture slides (pdf)
- exercise script (box-muller.rb)
- exercise script (box-muller-hist.rb)
- exercise script (box-muller-hist.plt)
- exercise script (conf-interval.rb)
- exercise script (conf-interval.plt)
- Normal distribution
- Confidence intervals and statistical tests
- Distribution generation
- exercise: confidence intervals
- assignment 1


#052016/05/09 Diversity and complexity
- lecture slides (pdf)
- exercise data (us-surnames.csv)
- exercise script (make_ccdf.rb)
- exercise script (ccdf.plt)
- exercise script (count_contents.rb)
- Long tail
- Web access and content distribution
- Power-law and complex systems
- exercise: power-law analysis


#062016/05/16 Correlation
- lecture slides (pdf)
- exercise script (correlation.rb)
- exercise data (correlation-data-1.txt)
- exercise data (correlation-data-2.txt)
- exercise script (similarity.rb)
- exercise data (scores.txt)
- Online recomendation systems
- Distance
- Correlation coefficient
- exercise: correlation analysis


#072016/05/23 Multivariate analysis
- lecture slides (pdf)
- exercise script (leastsquare.rb)
- exercise script (pca.rb)
- exercise data (pca-data.txt)
- Data sensing and GeoLocation
- Linear regression
- Principal Component Analysis
- exercise: linear regression


#082016/06/06 Time-series analysis
- exercise data (autocorr_5min_data.txt)
- exercise data (ifbps-201205.txt)
- exercise script (autocorr.rb)
- exercise script (autocorr.plt)
- exercise script (hourly_out.rb)
- exercise script (hourly_out.plt)
- exercise script (week_out.rb)
- exercise script (week_out.plt)
- exercise script (correlation_out.rb)
- lecture slides (pdf)
- Internet and time
- Network Time Protocol
- Time series analysis
- exercise: time-series analysis
- assignment 2


#092016/06/13 Topology and graph
- lecture slides (pdf)
- exercise script (dijkstra.rb)
- exercise data (topology.txt)
- exercise data (jr.txt)
- Routing protocols
- Graph theory
- exercise: shortest-path algorithm


#102016/06/20 Anomaly detection and machine learning
- lecture slides (pdf)
- exercise script (naivebayes.rb)
- Anomaly detection
- Machine Learning
- SPAM filtering and Bayes theorem
- exercise: naive Bayesian filter


#112016/06/27 Data Mining
- lecture slides (pdf)
- exercise script (k-means.rb)
- exercise script (km-results.plt)
- exercise data (km-data.txt)
- Pattern extraction
- Classification
- Clustering
- exercise: clustering


#122016/07/04 Search and Ranking
- lecture slides (pdf)
- exercise script (pagerank.rb)
- exercise data (sample-links.txt)
- exercise data (links-100k.txt) (URL)
- Search systems
- PageRank
- exercise: PageRank algorithm


#132016/07/11 Scalable measurement and analysis
- lecture slides (pdf)
- exercise script (wc-map.rb)
- exercise script (wc-reduce.rb)
- exercise data (wc-data.txt)
- Distributed parallel processing
- Cloud computing technology
- MapReduce
- exercise: MapReduce algorithm



Powered by SOI Copyright(c) 2002-2019, Keio University Shonan Fujisawa Campus. All rights reserved.
Copyright of this site