Skip to content

Latest commit

 

History

History
163 lines (142 loc) · 16 KB

README.md

File metadata and controls

163 lines (142 loc) · 16 KB

Awesome Kedro Awesome

kedro logo

An opinionated Python framework for creating reproducible, maintainable and modular data science code.

Contents

Projects

  • Response Recommendation System for BarefootLaw by Kasun Amarasinghe, Carlos Caro, Nupoor Gandhi and Raphaelle Roffo, an extensive Data Science for Social Good (DSSG) at Imperial College London project that recommends responses to law related queries
  • Augury by Craig Franklin, machine-learning functionality for predicting AFL match results in the Tipresias app
  • CausalLift by Yusuke Minami, a Python package for Uplift Modeling in real-world business
  • PipelineX by Yusuke Minami, a Python package to develop pipelines for rapid Machine/Deep Learning experimentation using Kedro and MLflow. Example projects using PyTorch, Pandas, and OpenCV are available.
  • kedro-mlflow-example by Tom Goldenberg, a project that demonstrates how to integrate MLflow with a Kedro codebase
  • kedro-wdbc-tf by Abhinav Prakash, this project uses a Kedro template to create Deep Learning workflow. The model training was done with TensorFlow against the wdbc (Breast Cancer) dataset.
  • twitter-sentiment-analysis by Avi Agarwal, a project that demonstrates how to use Kedro to train and evaluate an NLP-based machine learning model.
  • Anomaly Detection Pipeline with Kedro by Kenneth Leung, a project that demonstrates how to use Kedro for fraud detection on credit card transaction data using an Isolation Forest machine learning model.
  • find-kedro - Automatically construct pipelines using pytest style pattern matching.
  • kedro-accelerator - Speeds up pipelines by parallelizing I/O in the background.
  • kedro-airflow - Makes it easy to deploy Kedro projects to Airflow.
  • kedro-airflow-k8s - Enables running a Kedro pipeline with Airflow on a Kubernetes cluster.
  • kedro-argo - Converts Kedro pipelines to Argo pipelines.
  • kedro-auto-catalog - A configurable replacement for kedro catalog create that allows you to create default dataset types other than MemoryDataset.
  • kedro-azureml - Enables running a Kedro pipeline with Azure ML Pipelines service.
  • kedro-dataframe-dropin - Lets you swap out pandas datasets for modin or RAPIDs equivalents for specialised use to speed up your workflows (e.g on GPUs).
  • kedro-datasets - A collection of Kedro data connectors.
  • kedro-docker - Makes it easy to package Kedro projects with Docker.
  • kedro-dolt - Allows you to expand the data versioning abilities of data scientists and engineers
  • kedro-great - The easiest way to integrate Kedro and Great Expectations.
  • kedro-grpc-server - Creates a gRPC server for your kedro pipelines.
  • kedro-kubeflow - Lets you run and schedule pipelines on Kubernetes clusters using Kubeflow Pipelines.
  • kedro-mlflow - Allows usage of MLFlow in Kedro projects.
  • kedro-neptune - Integration of Kedro with Neptune.ai.
  • kedro-pandas-profiling - "Profiles" data in the catalog.
  • kedro-partitioned - Extends the functionality on processing partitioned data.
  • kedro-sagemaker - Enables running a Kedro pipeline with Amazon SageMaker service.
  • kedro-static-viz - Generates a static Kedro-Viz site (HTML, CSS, JS)
  • kedro-viz - Helps visualise Kedro data and analytics pipelines.
  • kedro-vertexai - Enables running a Kedro pipeline with Vertex AI Pipelines service.
  • kedro-wings - Automatically creates catalog entries to simplify Kedro pipeline writing.- more-kedro - (Hook) library for on the fly typing and validation of parameter dictionaries and default value backed data loading.
  • steel-toes - Prevent changing downstream catalog data on your teammates while developing in parallel.

For more:

Blog posts

For more:

Videos

Intros

Howtos

For more:

### News

Support