Skip to main content
Home
  • Research
  • Events
    • Recorded Talks
  • Team
  • Education & Outreach
  • Ethics in AI
  • News
Menu

Events

ML+ X Seminar

Oct 8, 2021

Understanding collections of related datasets using dependent MMD coresets

Sinead Williamson

3 - 4 pm

Zoom
United States

Sinead Williamson

Abstract: Understanding how two datasets differ can help us determine whether one dataset under-represents certain sub-populations, and provides insights into how well models will generalize across datasets. Representative points selected by a maximum mean discrepancy (MMD) coreset can provide interpretable summaries of a single dataset, but are not easily compared across datasets. In this talk, I will introduce dependent MMD coresets, a data summarization method for collections of datasets that facilitates comparison of distributions. I will show that dependent MMD coresets are useful for understanding multiple related datasets and understanding model generalization between such datasets.

 

 

Institute for Foundations of Machine Learning

Supported by the National Science Foundation

  • Research
    • Projects
  • Events
    • Recorded Talks
  • Team
  • News
    • Education & Outreach
    • Ethics in AI

Let’s Talk

Contact us with research opportunities, speaker requests or media inquiries.

Connect with us

X

  • Research
  • Events
    • Recorded Talks
  • Team
  • Education & Outreach
  • Ethics in AI
  • News