Feature request: Convert standard dataset into a federated dataset #206
Comments
Thanks for filing this! I also think that this will be very useful. A couple of clarifying questions:
I think if an iterator of numpy arrays is supported, that would be the most general. The tf.data.Dataset can be converted using
Yes. The goal would be to be able to use this dataset with the rest of the FedJAX framework.
Hi, has any work been done on this issue? Is there still a need for it? More generally, what is the state of this repo? Is it still active? Is there work that needs contributions? I am more than happy to help.
Hi there. Not much work has been done toward checking in a general implementation for this, but it would be nice to have. We still actively use and maintain this repo and would be more than happy to have you contribute!
Have you checked out InMemoryFederatedData? It should be sufficient for creating synthetic datasets in most cases.
Synthetic federated datasets can be constructed from standard centralized ones by artificially splitting them among clients. This is usually done using a Dirichlet distribution (e.g. Hsu et al. 2019).
Such synthetic datasets are very useful because they let us explicitly control both the total number of users and the heterogeneity across them.
It would be great to have primitives that can automatically convert a standard NumPy dataset into a FedJAX dataset.
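To make the request concrete, here is a minimal sketch of such a split using only NumPy. It is a common per-class variant of Dirichlet partitioning and not necessarily the exact procedure of Hsu et al. 2019. The function name `dirichlet_split`, its parameters, the `client_{i}` ids, and the `"x"`/`"y"` keys are all hypothetical choices for illustration, not part of FedJAX.

```python
import numpy as np


def dirichlet_split(features, labels, num_clients, alpha, seed=0):
    """Splits a centralized dataset among clients with Dirichlet label skew.

    For each class, per-client proportions are drawn from a symmetric
    Dirichlet(alpha) and that class's examples are divided accordingly.
    Smaller alpha concentrates each class on fewer clients (more heterogeneity).
    Returns a dict: client id -> {"x": features, "y": labels} (hypothetical layout).
    """
    rng = np.random.default_rng(seed)
    client_indices = [[] for _ in range(num_clients)]
    for cls in np.unique(labels):
        idx = np.flatnonzero(labels == cls)
        rng.shuffle(idx)
        proportions = rng.dirichlet(alpha * np.ones(num_clients))
        # Cumulative proportions give the cut points within this class.
        cuts = (np.cumsum(proportions)[:-1] * len(idx)).astype(int)
        for client, part in enumerate(np.split(idx, cuts)):
            client_indices[client].extend(part.tolist())
    client_data = {}
    for i, indices in enumerate(client_indices):
        indices = np.asarray(indices, dtype=int)  # may be empty for some clients
        client_data[f"client_{i}"] = {"x": features[indices], "y": labels[indices]}
    return client_data


# Toy usage: 100 examples, 4 classes, split across 5 clients.
x = np.arange(200).reshape(100, 2)
y = np.repeat(np.arange(4), 25)
clients = dirichlet_split(x, y, num_clients=5, alpha=0.5, seed=1)
```

Every example lands on exactly one client, and `num_clients` and `alpha` control the number of users and the degree of heterogeneity. The resulting mapping could presumably be handed to something like the InMemoryFederatedData mentioned in the comments, assuming it accepts a mapping from client id to a dict of arrays.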