E E C S  MIT Electrical Engineering and Computer Science

Fall 2002 Catalogue Supplement

6.897 Algorithms for Massive Data Sets (H)

TR4, Room 34-302
Professor Piotr Indyk, Room NE43-373, 2-3402
Prereq.: 6.046 or permission of instructor
3-0-9

This subject qualifies as a Theoretical Computer Science concentration subject.

Algorithmic techniques for handling massive data sets.

Computational models: main memory, streaming, external memory.

Numerical data: sorting, searching.

Text data: inverted indices, suffix trees.

Vector data: dimensionality reduction and sketching, nearest neighbor search, SVD, clustering.

Graph data: connected components, spectral partitioning, link analysis (Pagerank, hubs and authorities), models for the web. Compression.


Related page: EECS Fall 2002 Catalogue Supplement
This page:
http://www-eecs.mit.edu/AY02-03/fall-cat/6897.html
Editor: Lisa A. Bella   |   Created: Aug 15, 2002   |   Modified: Aug 15, 2002
Site table of contents  |  Site map  |  Search  |  Your comments and inquiries are welcome.