Consider caching columns while scanning #805

Open
keith-turner opened this issue Nov 17, 2016 · 0 comments

In #804 I made some conservative changes to reuse Column objects. Those changes only apply when the user passes in a set of wanted columns, which was easy because we knew what columns to expect. Nothing was done in #804 for the case where a user scans over a row range without providing a set of columns. In that case, a cache of columns could improve performance in some situations. The cache would need to be smart and not hinder performance when the cache hit rate is low. When the hit rate is low, maybe it could just stop looking in the cache and always create new Column objects.
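
To make the idea concrete, here is a rough sketch of a cache that turns itself off when the hit rate is low. It is not based on Fluo's code: `AdaptiveColumnCache`, `Col`, and the `sampleSize`, `minHitRate`, and `maxEntries` parameters are all hypothetical names, and `Col` is only a stand-in for the real Column class. A real version would probably key on the underlying bytes rather than build a String key per lookup.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, not Fluo's implementation.
final class AdaptiveColumnCache {

  // Stand-in for a column object; NOT the real Fluo Column class.
  static final class Col {
    final String family;
    final String qualifier;

    Col(String family, String qualifier) {
      this.family = family;
      this.qualifier = qualifier;
    }
  }

  private final Map<String, Col> cache = new HashMap<>();
  private final int sampleSize;    // lookups per evaluation window (assumed knob)
  private final double minHitRate; // below this the cache disables itself (assumed knob)
  private final int maxEntries;    // bound on memory use (assumed knob)
  private int lookups = 0;
  private int hits = 0;
  private boolean enabled = true;

  AdaptiveColumnCache(int sampleSize, double minHitRate, int maxEntries) {
    this.sampleSize = sampleSize;
    this.minHitRate = minHitRate;
    this.maxEntries = maxEntries;
  }

  Col get(String family, String qualifier) {
    if (!enabled) {
      // Cache gave up: always create a new object, no lookup cost.
      return new Col(family, qualifier);
    }

    // Simplified key; assumes family never contains the separator character.
    String key = family + '\u0000' + qualifier;
    lookups++;
    Col col = cache.get(key);
    if (col != null) {
      hits++;
    } else {
      col = new Col(family, qualifier);
      if (cache.size() < maxEntries) {
        cache.put(key, col);
      }
    }

    // At the end of each window, decide whether the cache is paying off.
    if (lookups >= sampleSize) {
      if ((double) hits / lookups < minHitRate) {
        enabled = false;
        cache.clear();
      }
      lookups = 0;
      hits = 0;
    }
    return col;
  }

  boolean isEnabled() {
    return enabled;
  }
}
```

Once disabled, this sketch never turns the cache back on. That keeps the low-hit-rate path cheap, but a scan whose column distribution changes midway would not benefit again.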

Could also look at caching at the family and qualifier level, instead of or in addition to the column level.
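
A minimal sketch of what family/qualifier level caching might look like, assuming the components can be treated as immutable values. `ComponentInterner` is a made-up name and Strings stand in for whatever byte type the real code uses. The point is that columns sharing a family reuse one family object even when every qualifier is different.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: intern families and qualifiers separately.
final class ComponentInterner {
  private final Map<String, String> families = new HashMap<>();
  private final Map<String, String> qualifiers = new HashMap<>();

  String family(String f) {
    return intern(families, f);
  }

  String qualifier(String q) {
    return intern(qualifiers, q);
  }

  // Returns the first instance seen for an equal value.
  private static String intern(Map<String, String> pool, String value) {
    String existing = pool.putIfAbsent(value, value);
    return existing != null ? existing : value;
  }
}
```

This sketch has no size bound or hit-rate check, so it would need the same kind of safeguards as a column level cache.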
