Certification of Model Robustness in Active Class Selection

Active class selection provides machine learning practitioners with the freedom to actively choose the class proportions of their training data. While this freedom can improve the model performance and decrease the data acquisition cost, it also puts the practical value of the trained model into question: is this model really appropriate for the class proportions that are handled during deployment? What if the deployment class proportions are uncertain or change over time? We address these questions by certifying supervised models that are trained through active class selection. Specifically, our certificate declares a set of class proportions for which the certified model induces a training-to-deployment gap that is small with a high probability. This declaration is theoretically justified by PAC bounds. We apply our proposed certification method in astro-particle physics, where a simulation generates telescope recordings from actively chosen particle classes.

  • Published in:
    ECML PKDD 2021: Machine Learning and Knowledge Discovery in Databases. Research Track European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD)
  • Type:
    Inproceedings
  • Authors:
    M. Bunse, K. Morik
  • Year:
    2021

Citation information

M. Bunse, K. Morik: Certification of Model Robustness in Active Class Selection, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), ECML PKDD 2021: Machine Learning and Knowledge Discovery in Databases. Research Track, 2021, https://doi.org/10.1007/978-3-030-86520-7_17, Bunse.Morik.2021a,