Views Navigation

Event Views Navigation

MIT Policy Hackathon 2024

Convened by IDSS and TPP students, the Policy Hackathon addresses societal challenges via data and policy analysis. Participants work in teams to develop creative policy solutions to real problems sponsored by partners in government, non-profit, and industry.

Find out more »

Evaluating a black-box algorithm: stability, risk, and model comparisons

Rina Foygel Barber (University of Chicago)
E18-304

Abstract: When we run a complex algorithm on real data, it is standard to use a holdout set, or a cross-validation strategy, to evaluate its behavior and performance. When we do so, are we learning information about the algorithm itself, or only about the particular fitted model(s) that this particular data set produced? In this talk, we will establish fundamental hardness results on the problem of empirically evaluating properties of a black-box algorithm, such as its stability and its average…

Find out more »

SES Dissertation Defense

Bernardo García Bulle Bueno (IDSS)
E18-304

Creating Links: Building an Educational Platform to Ask Questions in Education ABSTRACT In this thesis, I document the findings and process through which with my colleague Salome Aguilar Llanes we built an educational platform (JANN) to do research while having a positive impact on a community. Through JANN we have coordinated more than 100k hours of tutoring sessions and built (to our knowledge) one of the largest databases of educational recordings in the world. Broadly the contributions here are twofold:…

Find out more »

IDSS Community Social

Host: Prof. Fotini Christia (IDSS)
E17-399

All IDSS and extended IDSS community members welcome, including students, postdocs, faculty, and staff. Snacks provided!

Find out more »

Statistical Inference with Limited Memory

Ofer Shayevitz (Tel Aviv University)
E18-304

Abstract:  In statistical inference problems, we are typically given a limited number of samples from some underlying distribution, and we wish to estimate some property of that distribution, under a given measure of risk. We are usually interested in characterizing and achieving the best possible risk as a function of the number of available samples. Thus, it is often implicitly assumed that samples are co-located, and that communication bandwidth as well as computational power are not a bottleneck, essentially making the number…

Find out more »

Winners with Confidence: Discrete Argmin Inference with an Application to Model Selection

Jing Lei (Carnegie Mellon University)
E18-304

Abstract:  We study the problem of finding the index of the minimum value of a vector from noisy observations. This problem is relevant in population/policy comparison, discrete maximum likelihood, and model selection. By integrating concepts and tools from cross-validation and differential privacy, we develop a test statistic that is asymptotically normal even in high-dimensional settings, and allows for arbitrarily many ties in the population mean vector. The key technical ingredient is a central limit theorem for globally dependent data characterized…

Find out more »


MIT Institute for Data, Systems, and Society
Massachusetts Institute of Technology
77 Massachusetts Avenue
Cambridge, MA 02139-4307
617-253-1764