![]() |
MIT Electrical Engineering and Computer Science
Fall 2002 Catalogue Supplement |
TR4, Room 34-302
Professor Piotr Indyk, Room NE43-373, 2-3402
Prereq.: 6.046 or permission of instructor
3-0-9
This subject qualifies as a Theoretical Computer Science concentration subject.
Algorithmic techniques for handling massive data sets.
Computational models: main memory, streaming, external memory.
Numerical data: sorting, searching.
Text data: inverted indices, suffix trees.
Vector data: dimensionality reduction and sketching, nearest neighbor search, SVD, clustering.
Graph data: connected components, spectral partitioning, link analysis (Pagerank, hubs and authorities), models for the web. Compression.