Track Awesome Seml Updates Weekly
A curated list of articles covering software engineering best practices for building machine learning applications.
Feb 07 - Feb 13, 2022
Tooling
- Aim - An open-source experiment tracking tool.
Dec 06 - Dec 12, 2021
Governance
Oct 18 - Oct 24, 2021
Tooling
- REVISE: REvealing VIsual biaSEs (⭐91) - Automatically detect bias in visual data sets.
Oct 04 - Oct 10, 2021
Governance
May 03 - May 09, 2021
Tooling
- Alibi Detect (⭐1.5k) - Python library focused on outlier, adversarial and drift detection.
- PyTorch Lightning (⭐20k) - The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
- Robustness Metrics (⭐418) - Lightweight modules to evaluate the robustness of classification models.
- Seldon Core (⭐3.4k) - An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models on Kubernetes.
- TensorFlow Data Validation (TFDV) (⭐674) - Library for exploring and validating machine learning data. Similar to Great Expectations, but for TensorFlow data.
Apr 26 - May 02, 2021
Governance
Mar 22 - Mar 28, 2021
Model Training
Mar 15 - Mar 21, 2021
Deployment and Operation
Jan 04 - Jan 10, 2021
Governance
Nov 23 - Nov 29, 2020
Deployment and Operation
Governance
Oct 26 - Nov 01, 2020
Broad Overviews
Data Management
Oct 19 - Oct 25, 2020
Tooling
- Archai (⭐373) - Neural architecture search.
- Fairlearn - A toolkit to assess and improve the fairness of machine learning models.
- Great Expectations (⭐7.4k) - Data validation and testing with integration in pipelines.
- LiFT (⭐159) - LinkedIn Fairness Toolkit.
- Model Card Toolkit (⭐314) - Streamlines and automates the generation of model cards; for model documentation.
Sep 21 - Sep 27, 2020
Governance
Aug 03 - Aug 09, 2020
Broad Overviews
Jun 29 - Jul 05, 2020
Model Training
Tooling
- Airflow - Programmatically author, schedule and monitor workflows.
- Data Version Control (DVC) - A data and ML experiment management tool.
- Facets Overview / Facets Dive - Robust visualizations to aid in understanding machine learning datasets.
- Git Large File Storage (LFS) - Replaces large files such as datasets with text pointers inside Git.
- HParams (⭐126) - A thoughtful approach to configuration management for machine learning projects.
- Kubeflow - A platform for data scientists who want to build and experiment with ML pipelines.
- Label Studio (⭐11k) - A multi-type data labeling and annotation tool with standardized output format.
- MLflow - Manage the ML lifecycle, including experimentation, deployment, and a central model registry.
- Neptune.ai - Experiment tracking tool bringing organization and collaboration to data science projects.
- Neuraxle (⭐543) - Sklearn-like framework for hyperparameter tuning and AutoML in deep learning projects.
- OpenML - An inclusive movement to build an open, organized, online ecosystem for machine learning.
- Spark Machine Learning - Spark’s ML library consisting of common learning algorithms and utilities.
- TensorBoard - TensorFlow's Visualization Toolkit.
- TensorFlow Extended (TFX) - An end-to-end platform for deploying production ML pipelines.
- Weights & Biases - Experiment tracking, model optimization, and dataset versioning.
May 18 - May 24, 2020
Governance
Apr 06 - Apr 12, 2020
Broad Overviews
Model Training
Deployment and Operation
Mar 30 - Apr 05, 2020
Deployment and Operation
Mar 02 - Mar 08, 2020
Broad Overviews
Data Management
Model Training
Feb 24 - Mar 01, 2020
Data Management
Model Training
Feb 10 - Feb 16, 2020
Data Management
Deployment and Operation
Social Aspects
Feb 03 - Feb 09, 2020
Data Management
Model Training
Deployment and Operation
Social Aspects