Skip to main content
Home
  • Research
  • Events
    • Recorded Talks
  • Team
  • Education & Outreach
  • Ethics in AI
  • News
Menu

Events

ML+ X Seminar

Oct 8, 2021

Understanding collections of related datasets using dependent MMD coresets

Sinead Williamson

3 - 4 pm

Zoom
United States

Sinead Williamson

Abstract: Understanding how two datasets differ can help us determine whether one dataset under-represents certain sub-populations, and provides insights into how well models will generalize across datasets. Representative points selected by a maximum mean discrepancy (MMD) coreset can provide interpretable summaries of a single dataset, but are not easily compared across datasets. In this talk, I will introduce dependent MMD coresets, a data summarization method for collections of datasets that facilitates comparison of distributions. I will show that dependent MMD coresets are useful for understanding multiple related datasets and understanding model generalization between such datasets.

 

 

Supported by the National Science Foundation

  • Research
    • Projects
    • Publications
  • Events
    • Recorded Talks
    • RSS
  • Team
  • News
    • Education & Outreach
    • Ethics in AI

Stay connected

Name

Let’s Talk

Contact us with research opportunities, speaker requests or media inquiries.

Connect with us

  • Research
    • Research Projects
    • Publications
  • Events
    • Recorded Talks
  • Team
  • Education & Outreach
  • Ethics in AI
  • News