This was a proof of concept designed to kickstart discussions around integrating Ibis and Kedro. This functionality was merged into Kedro, refer to the official implementation by @deepyaman.
Connector to load data from SQL tables into Python Kedro pipelines using Ibis tables. This allows for connecting to databases and constructing queries dynamically using Kedro's node-pipeline framework to construct Ibis expressions.
To save data into SQL tables, the connector can save either pandas
dataframes or ibis
expressions.
To initiate a DataSet, currently we must pass a connection string ({backend}://{database-connection-string}
). See this section of the Ibis documentation for more details.
To see an example usage in a kedro pipeline see this project.
pip install kedro-ibis-dataset
credentials.yml
database_creds:
con: backend://connection-url
catalog.yml
ibis_dataset_table:
type: kedro_ibis_dataset.IbisDataSet
table_name: table_name
credentials: database_creds
save_args:
overwrite: true/false
For more information on how to use Ibis to connect to SQL databases see the Ibis documentation.