    Repositories list

    • Jupyter Notebook
      0600 Updated Aug 12, 2024
    • Jupyter Notebook
      Apache License 2.0
      25310 Updated Jan 25, 2024
    • CleanML

      Public
      A Benchmark for Joint Data Cleaning and Machine Learning
      Python
      15000 Updated Aug 1, 2023
    • jenga

      Public
      Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.
      Jupyter Notebook
      GNU General Public License v3.0
      63501 Updated Jun 21, 2023
    • 0000 Updated Jun 8, 2023
    • deml-lab

      Public
      Lab tasks for the course on "Data Engineering for Machine Learning"
      Jupyter Notebook
      510015 Updated May 1, 2023
    • caboose

      Public
      Jupyter Notebook
      GNU General Public License v3.0
      2000 Updated Apr 24, 2023
    • Code and experiments for our SIGMOD paper on "Learning to Validate the Predictions of Black Box Classifiers on Unseen Data"
      Python
      GNU General Public License v2.0
      1203 Updated Mar 24, 2023
    • arguseyes

      Public
      Python
      GNU General Public License v3.0
      2801 Updated Feb 21, 2023
    • Rust
      0000 Updated Feb 12, 2023
    • Python
      1000 Updated Nov 24, 2022
    • 0000 Updated Oct 13, 2022
    • Python
      Apache License 2.0
      0209 Updated Sep 30, 2022
    • hedgecut

      Public
      Rust
      GNU General Public License v3.0
      1900 Updated Sep 1, 2022
    • snapcase

      Public
      Rust
      1101 Updated Apr 20, 2022
    • Session-based recommender system: Serenade
      Rust
      Apache License 2.0
      5000 Updated Apr 7, 2022
    • 15100 Updated Mar 17, 2022
    • Setup of the Jupyter Notebook environment, including PySpark, Pandas, and DuckDB, for the course Big Data 2022
      Dockerfile
      0100 Updated Feb 3, 2022
    • Micromodels -- A framework for accurate, explainable, data-efficient, and reusable NLP models.
      Python
      3000 Updated Jan 19, 2022
    • Rust
      MIT License
      0100 Updated Sep 21, 2021
    • Java
      GNU General Public License v3.0
      21600 Updated Feb 19, 2021
    • Public resources for teaching, etc.
      GNU General Public License v3.0
      0100 Updated Jan 15, 2020
    • Code and experiments for our CIDR paper on "A Selection of Machine Learning Models that Can Forget User Data Very Fast"
      Rust
      Apache License 2.0
      0400 Updated Jan 15, 2020