Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add an Optional get_info Method to Datasets for Metadata Retrieval #926

Open
SajidAlamQB opened this issue Nov 7, 2024 · 0 comments
Open

Comments

@SajidAlamQB
Copy link
Contributor

Description

Related to: kedro-org/kedro-viz#1893 (comment)

When trying to retrieve metadata (like file size) from datasets in Kedro Viz we are running into issues, specifically within our DatasetStatsHook. Currently, to obtain this metadata, we need to access private attributes (e.g., _filepath, _fs, _protocol), which is causing inconsistency and compatibility problems with datasets that do not expose these attributes.

Context

By having a standardised, public way to retrieve dataset metadata we can improve best practices. This benefits users who want to collect dataset metadata or build plugins, as it provides a consistent and maintainable approach.

Possible Implementation

Perhaps adding an optional, public method (e.g., get_info) to datasets that can provide metadata like file size, etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

No branches or pull requests

1 participant