An example project that runs ArgusEyes in an automated CI workflow via GitHub Actions.
ArgusEyes is a system that lets data scientists declaratively specify the pipeline issues they are concerned about. ArgusEyes then instruments, executes, and screens the pipeline for the configured issues as part of a continuous integration process. It detects complex issues by tracking record-level provenance and understanding the semantics of the operations in ML pipelines. ArgusEyes was presented as an abstract at CIDR'22.
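As a rough illustration of what "declaratively specifying an issue" looks like, a screening configuration names the pipeline to run and the issue to screen for. The key names below are assumptions for illustration only, not the actual ArgusEyes schema; see the `.yaml` files in this repository for real configurations:

```yaml
# Illustrative sketch only -- these keys are hypothetical, not the real
# ArgusEyes schema; refer to the .yaml files in this repository instead.
pipeline: my-pipeline.py     # ML pipeline to instrument and execute
detector: label_errors       # issue to screen the pipeline for
```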
We provide three example scenarios (note that you have to install ArgusEyes locally first to execute them). You can run ArgusEyes to execute a pipeline and screen it for a particular issue. Afterwards, you can use an interactive notebook to determine the root cause of the issue and fix it.
- Source code of the ML pipeline: `mlinspect-computervision-sneakers.py`
- Screening configuration: `mlinspect-computervision-sneakers-labelerrors.yaml`
- GitHub workflow run detecting the label errors
- Manual screening: `./eyes-local mlinspect-computervision-sneakers-labelerrors.yaml`
- Notebook for retrospective debugging: `retrospective_labelerrors.ipynb`
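To make the label-error scenario concrete: one common way to surface label errors (a simplification for illustration, not necessarily the algorithm ArgusEyes uses) is to flag records whose model-predicted probability for their recorded label is suspiciously low. A minimal sketch:

```python
def flag_label_errors(records, threshold=0.2):
    """Flag records as potential label errors when the model assigns a
    low probability to the label stored in the dataset.

    Each record is a dict with keys:
      'id'    -- record identifier
      'label' -- the label stored in the dataset
      'probs' -- model-predicted probability per class
    """
    suspicious = []
    for record in records:
        # Probability the model assigns to the dataset's own label
        prob_of_given_label = record['probs'].get(record['label'], 0.0)
        if prob_of_given_label < threshold:
            suspicious.append(record['id'])
    return suspicious

records = [
    {'id': 1, 'label': 'sneaker', 'probs': {'sneaker': 0.95, 'boot': 0.05}},
    {'id': 2, 'label': 'boot',    'probs': {'sneaker': 0.90, 'boot': 0.10}},
]
print(flag_label_errors(records))  # record 2 looks mislabeled
```

The retrospective notebook then lets you inspect such flagged records individually and correct them.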
- Source code of the ML pipeline: `mlflow-regression-nyctaxifare.py`
- Screening configuration: `mlflow-regression-nyctaxifare-dataleakage.yaml`
- GitHub workflow run detecting the leakage
- Manual screening: `./eyes-local mlflow-regression-nyctaxifare-dataleakage.yaml`
- Notebook for retrospective debugging: `retrospective_dataleakage.ipynb`
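Conceptually, the record-level provenance that ArgusEyes tracks makes train/test leakage detectable as overlapping record identities between the train and test sets. A simplified, hand-rolled sketch of that check (not the actual ArgusEyes implementation):

```python
def find_leaked_records(train_ids, test_ids):
    """Return identifiers of records that appear in both the train and
    the test set -- a form of data leakage that inflates evaluation scores."""
    return sorted(set(train_ids) & set(test_ids))

train_ids = [101, 102, 103, 104]
test_ids = [104, 105, 106]
print(find_leaked_records(train_ids, test_ids))  # → [104]
```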
- Source code of the ML pipeline: `openml-classification-incomelevel.py`
- Screening configuration: `openml-classification-incomelevel-fairness.yaml`
- GitHub workflow run detecting the fairness violation
- Manual screening: `./eyes-local openml-classification-incomelevel-fairness.yaml`
- Notebook for retrospective debugging: `retrospective_fairnessviolation.ipynb`
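For the fairness scenario, a common group-fairness check is demographic parity: compare the rates of positive predictions across sensitive groups and flag a violation when the gap exceeds a tolerance. The metric below is one illustrative choice; the actual check that gets screened is defined in the scenario's `.yaml` configuration:

```python
from collections import defaultdict

def demographic_parity_gap(predictions):
    """Compute the largest gap in positive-prediction rates between groups.

    `predictions` is a list of (group, predicted_label) pairs, where
    predicted_label is 1 for the positive outcome and 0 otherwise.
    """
    totals = defaultdict(int)
    positives = defaultdict(int)
    for group, label in predictions:
        totals[group] += 1
        positives[group] += label
    rates = {g: positives[g] / totals[g] for g in totals}
    return max(rates.values()) - min(rates.values())

# Group 'a' receives the positive outcome at 2/3, group 'b' at 1/3
preds = [('a', 1), ('a', 1), ('a', 0), ('b', 1), ('b', 0), ('b', 0)]
print(f"gap = {demographic_parity_gap(preds):.2f}")  # gap = 0.33
```

A CI run would fail when this gap exceeds the configured tolerance, and the retrospective notebook helps trace which pipeline operations produced the disparity.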