Big Data

  • This fall, the faculty and students in the Electrical Engineering and Computer Science (EECS) Department at MIT are coming together for a new program that has created a buzz since its announcement last spring. The Advanced Undergraduate Research Program (now officially called the SuperUROP) for EECS department juniors and seniors has already enticed over 200 students with more than 100 research projects proposed by the department's faculty.
  • Members of the MIT Database Group, including Sam Madden, an associate professor in the Department of Electrical Engineering and Computer Science at MIT and co-leader of the 'bigdata@CSAIL' initiative, and EECS graduate student Alvin Cheung, together with researchers from Cornell University, are presenting work on Pyxis this week at the 38th International Conference on Very Large Databases. Pyxis is a new system that automatically streamlines websites' database access patterns, making the sites up to three times as fast while still allowing the types of languages already favored by Web developers.
  • CSAIL/EECS researchers, including EECS graduate students Adam Marcus and Eugene Wu and EECS professors Sam Madden, Rob Miller and David Karger, have developed a way for users of crowdsourced database operations to avoid the computational details of the process while significantly improving cost effectiveness. The new system, called Qurk, automatically crowdsources tasks that are difficult or impossible to perform computationally.
  • Professor Srini Devadas has been selected as an Edwin Sibley Webster Professor of Electrical Engineering and Computer Science, joining Prof. Alan Willsky as the second Edwin Sibley Webster chaired professor at MIT. Professor Devadas follows nearly sixty years of prominent faculty members who have held this professorship, including Ernst Guillemin in 1960, Lan Jen Chu in 1963, Peter Elias in 1974, and Ronald Rivest in 1992.
  • Bonnie Berger, professor of applied math and computer science at MIT and head of the Computation and Biology Group, has developed, with former and current students, an algorithm that allows researchers to access huge amounts of data in genome databases, even as the rate of genome sequencing threatens to outpace researchers' ability to analyze the added data.
  • Prof. Hari Balakrishnan and graduate student Keith Winstein have developed Mosh (for "mobile shell"), a remote log-in program that offers an alternative to SSH suited to the mobile Internet. They reported their work at the Usenix Annual Technical Conference in Boston this month.
  • The goal of this project is to develop powerful algorithmic sampling techniques that allow one to estimate parameters of the data by viewing only a minuscule portion of it. Such parameters may be combinatorial, such as whether a large network has the "six degrees of separation" property; algebraic, such as whether the data is well-approximated by a linear function; or even distributional, such as whether the data comes from a distribution over a large number of distinct elements.
  • Big Data needs Big Processors, and Big Processors need Big Caches. Increasingly, however, power and thermal considerations dictate that many small processors and many small caches supplant the paradigm of few big processors and caches. The Execution Migration Machine (EM²) project aims to find the best way of using these resources.
  • Modern database management systems (DBMS) have been designed to efficiently store, manage and perform computations on massive amounts of data. In contrast, many existing visualization systems do not scale seamlessly from small data sets to enormous ones. We have designed a three-tiered visualization system called ScalaR to deal with this issue.
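
The sampling idea in the project description above (estimating a parameter of the data while viewing only a tiny portion of it) can be illustrated with a minimal Python sketch. This is not the project's algorithm: it is a generic uniform-sampling estimator with a standard Hoeffding-bound sample size, and the names `sample_size` and `estimate_fraction` are hypothetical, chosen only for this illustration.

```python
import math
import random


def sample_size(epsilon, delta):
    """Number of uniform samples so that, by Hoeffding's inequality, the
    estimated fraction is within epsilon of the true fraction with
    probability at least 1 - delta. Note it does not depend on data size."""
    return math.ceil(math.log(2.0 / delta) / (2.0 * epsilon ** 2))


def estimate_fraction(data, predicate, epsilon=0.05, delta=0.01, rng=None):
    """Estimate the fraction of items in `data` that satisfy `predicate`
    by inspecting only a small random sample (drawn with replacement).

    `data` only needs to support len() and indexing.
    Returns (estimate, number_of_items_inspected).
    """
    rng = rng or random.Random()
    n = sample_size(epsilon, delta)
    hits = 0
    for _ in range(n):
        item = data[rng.randrange(len(data))]
        if predicate(item):
            hits += 1
    return hits / n, n


if __name__ == "__main__":
    # Illustrative data: ten million integers, a quarter of them divisible by 4.
    data = range(10_000_000)
    estimate, inspected = estimate_fraction(data, lambda x: x % 4 == 0)
    print(f"inspected {inspected} of {len(data)} items, estimate = {estimate:.3f}")
```

The point of the sketch is that the number of items inspected depends only on the desired accuracy and confidence, not on the size of the data set, which is what makes this style of estimation attractive for very large inputs.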

