what is the specific dataset format? #1

Open
yingyuan830 opened this issue Jan 29, 2022 · 3 comments
Comments

@yingyuan830

Hi, I'd like to test the tool on my own data, but I'm confused about the MouseAtlas data format and not sure how to convert my data to match it. Could you please give some hints? Thanks!

@sperfu

sperfu commented Jan 30, 2022

Hi there,

The input data is an .npz file, which you can read with the load_npz function from the scipy.sparse package. It is basically a compressed sparse matrix, stored as a feature-by-cell matrix. You can convert your input data to the same format.

Thanks
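For readers following along, here is a minimal sketch of writing and reading such a file with scipy.sparse. The toy values, the file name my_data.npz, and the choice of the CSR format are assumptions for illustration; the thread does not state which sparse format the tool's own files use.

```python
import os
import tempfile

import numpy as np
from scipy import sparse

# Hypothetical toy data: 4 features (rows) x 3 cells (columns).
dense = np.array(
    [
        [0, 2, 0],
        [1, 0, 0],
        [0, 0, 3],
        [0, 4, 0],
    ],
    dtype=np.float32,
)

# Convert to a compressed sparse row (CSR) matrix and save it as .npz.
matrix = sparse.csr_matrix(dense)
path = os.path.join(tempfile.mkdtemp(), "my_data.npz")
sparse.save_npz(path, matrix)

# Read it back with load_npz, as suggested in the reply above.
loaded = sparse.load_npz(path)
print(loaded.shape, loaded.format)
```

The loaded object is a scipy sparse matrix; calling loaded.toarray() gives back the dense array if the data is small enough to fit in memory.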

@yingyuan830
Author

Thanks. But I am still confused about the specific meaning of the fields stored in the file:
indices = atlas_data['indices']
indptr = atlas_data['indptr']
format_data = atlas_data['format']
shape = atlas_data['shape']
data = atlas_data['data']
Could you please give more hints so that I can convert my input data to this format?
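For reference, these five keys match the arrays that scipy.sparse.save_npz writes for a CSR or CSC matrix. The sketch below builds a small CSR example (toy values and the file name are assumptions), reads the raw arrays with numpy, and rebuilds the matrix by hand. In CSR, the nonzero values of row i are data[indptr[i]:indptr[i+1]] and their column positions are indices[indptr[i]:indptr[i+1]]; whether the MouseAtlas file is CSR or CSC should be checked from its 'format' entry.

```python
import os
import tempfile

import numpy as np
from scipy import sparse

# Hypothetical 2 x 3 example saved in CSR format.
example = sparse.csr_matrix(np.array([[0, 2, 0], [1, 0, 3]], dtype=np.float32))
path = os.path.join(tempfile.mkdtemp(), "example.npz")
sparse.save_npz(path, example)

# Read the raw arrays, mirroring the keys listed in the question.
atlas_data = np.load(path)
indices = atlas_data["indices"]  # column index of each stored value (CSR)
indptr = atlas_data["indptr"]    # row i spans data[indptr[i]:indptr[i + 1]]
shape = atlas_data["shape"]      # (n_rows, n_cols)
data = atlas_data["data"]        # the nonzero values themselves

# 'format' names the sparse layout; it may be stored as bytes.
fmt = atlas_data["format"].item()
if isinstance(fmt, bytes):
    fmt = fmt.decode("ascii")

# Rebuild the matrix from its parts (valid here because fmt is "csr").
rebuilt = sparse.csr_matrix((data, indices, indptr), shape=tuple(shape))
print(fmt, rebuilt.toarray())
```

In practice sparse.load_npz does this reconstruction for you, so you only need to produce a scipy sparse matrix and call sparse.save_npz on it.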

@ycao94
Collaborator

ycao94 commented Feb 14, 2022

Hi, for the time being we directly use a cell-by-peak matrix and a vector of length n_cell to represent the data. You can modify the dataset file to implement your own data class. We plan to support customizable data based on AnnData soon.
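As a rough illustration of what such a custom data class might look like, here is a minimal sketch. The class name CellPeakDataset, the argument names, and the interpretation of the length-n_cell vector as one value per cell (for example a label) are all assumptions; this is not the repository's actual dataset class. Since the earlier reply describes the file as feature by cell, the example transposes it to get cells as rows.

```python
import numpy as np
from scipy import sparse


class CellPeakDataset:
    """Hypothetical container: sparse cell-by-peak matrix plus one value per cell."""

    def __init__(self, matrix, per_cell):
        matrix = sparse.csr_matrix(matrix)  # CSR makes row (cell) access cheap
        per_cell = np.asarray(per_cell)
        if per_cell.shape != (matrix.shape[0],):
            raise ValueError("per_cell must have length n_cell")
        self.matrix = matrix
        self.per_cell = per_cell

    def __len__(self):
        # Number of cells.
        return self.matrix.shape[0]

    def __getitem__(self, i):
        # Dense peak vector for cell i, together with that cell's value.
        return self.matrix[i].toarray().ravel(), self.per_cell[i]


# Toy feature-by-cell matrix (4 peaks x 3 cells), transposed to cell-by-peak.
feature_by_cell = sparse.csr_matrix(
    np.array([[0, 1, 0], [2, 0, 0], [0, 0, 3], [0, 4, 0]], dtype=np.float32)
)
dataset = CellPeakDataset(feature_by_cell.T, per_cell=[0, 1, 0])
peaks, value = dataset[1]
print(len(dataset), peaks, value)
```

Keeping the matrix sparse and only densifying one cell at a time avoids loading the full dense matrix into memory.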
