Skip to content

inigohidalgo/kedro-ibis-dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kedro IbisDataSet

This was a proof of concept designed to kickstart discussions around integrating Ibis and Kedro. This functionality was merged into Kedro, refer to the official implementation by @deepyaman.


Connector to load data from SQL tables into Python Kedro pipelines using Ibis tables. This allows for connecting to databases and constructing queries dynamically using Kedro's node-pipeline framework to construct Ibis expressions.

To save data into SQL tables, the connector can save either pandas dataframes or ibis expressions.

To initiate a DataSet, currently we must pass a connection string ({backend}://{database-connection-string}). See this section of the Ibis documentation for more details.

How to use

To see an example usage in a kedro pipeline see this project.

1. pip install the package

pip install kedro-ibis-dataset

2. Use as any other Kedro DataSet

credentials.yml

database_creds:
    con: backend://connection-url

catalog.yml

ibis_dataset_table:
  type: kedro_ibis_dataset.IbisDataSet
  table_name: table_name
  credentials: database_creds
  save_args:
    overwrite: true/false

For more information on how to use Ibis to connect to SQL databases see the Ibis documentation.