# Deepchecks — Continuous Validation for ML Models and Data

> An open-source Python library that runs comprehensive test suites on ML models and datasets to detect data drift, integrity issues, and model performance degradation.

## Install

```bash
pip install deepchecks
```

## Quick Use

```python
from deepchecks.tabular import Dataset
from deepchecks.tabular.suites import full_suite
import pandas as pd

# Wrap the training data in a Dataset, marking the label column
train = pd.read_csv('train.csv')
ds = Dataset(train, label='target')

# Run the full suite; checks that require a model or a test set are skipped
suite = full_suite()
result = suite.run(ds)
result.save_as_html('report.html')
print('Report saved')
```

## Introduction

Deepchecks is a Python library for testing and validating ML models and their data throughout the development lifecycle. It provides pre-built test suites that detect common issues such as data drift, label leakage, feature importance shifts, and model degradation before they reach production.

## What Deepchecks Does

- Runs automated test suites covering data integrity, train-test validation, and model evaluation
- Detects data drift between training and production distributions
- Identifies label leakage, duplicate samples, and feature-target correlation issues
- Generates interactive HTML reports with visualizations
- Supports tabular, NLP, and computer vision data types

## Architecture Overview

Deepchecks organizes checks into suites. Each check is a self-contained validation unit that accepts a Dataset or model object and returns a `CheckResult` with a pass/fail status, a value, and an optional visualization. Suites aggregate results into a `SuiteResult` that can be exported as HTML or JSON for CI integration.

## Self-Hosting & Configuration

- Install via pip: `pip install deepchecks` (add `[vision]` or `[nlp]` for other modalities)
- Wrap your data in a `Dataset` object specifying the label and feature columns
- Run pre-built suites or compose custom suites from individual checks
- Set pass/fail conditions on checks for CI gating
- Export results as HTML reports or JSON for programmatic access

## Key Features

- 50+ built-in checks covering data integrity, distribution, and model performance
- Pre-configured suites for common workflows (train-test validation, full suite, production monitoring)
- Condition-based pass/fail thresholds for automated CI pipelines
- Interactive HTML reports with drill-down visualizations
- Supports tabular (pandas/sklearn), NLP (Hugging Face), and CV (PyTorch) workflows

## Comparison with Similar Tools

- **Great Expectations** — focuses on data quality rules, not model-level checks
- **Evidently** — monitoring dashboards and reports; overlaps on drift detection
- **whylogs** — lightweight data profiling for monitoring, less model-aware
- **Pandera** — schema-level DataFrame validation, no ML model testing
- **MLflow** — experiment tracking platform, no built-in data/model validation suite

## FAQ

**Q: Can Deepchecks run in CI/CD pipelines?**
A: Yes. Set conditions on checks and fail the pipeline if thresholds are breached. Results export as JUnit XML.

**Q: Does it support deep learning models?**
A: Yes. The vision and NLP modules validate PyTorch models and Hugging Face pipelines, respectively.

**Q: How do I detect data drift in production?**
A: Compare a reference dataset (training) against a current batch using the data drift suite, which applies statistical tests per feature, as in the sketch below.
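A minimal sketch of that reference-vs-current comparison, wired for CI gating as described above. It uses the tabular `train_test_validation` suite; the file names, the `target` label column, and the report path are placeholders, and the default drift conditions bundled with the suite can be tuned or replaced with custom thresholds.

```python
# Sketch: gate a CI job on drift between training data and a new batch.
# Assumes 'train.csv' (reference) and 'current.csv' (new batch) exist and
# share a 'target' label column; both file names are illustrative.
import pandas as pd

from deepchecks.tabular import Dataset
from deepchecks.tabular.suites import train_test_validation

reference = Dataset(pd.read_csv('train.csv'), label='target')
current = Dataset(pd.read_csv('current.csv'), label='target')

# The train-test validation suite includes per-feature drift checks with
# built-in pass/fail conditions.
suite = train_test_validation()
result = suite.run(train_dataset=reference, test_dataset=current)

result.save_as_html('drift_report.html')  # human-readable report artifact
report_json = result.to_json()            # machine-readable, for CI logs

# Fail the pipeline if any check condition was not satisfied.
if not result.passed():
    raise SystemExit('Deepchecks drift validation failed')
```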
**Q: Is Deepchecks compatible with MLflow or Weights & Biases?**
A: Yes. You can log Deepchecks reports as artifacts in any experiment tracker.

## Sources

- https://github.com/deepchecks/deepchecks
- https://docs.deepchecks.com/