NorbuKetaka batch2 #241

Closed
eroux opened this issue Feb 15, 2023 · 12 comments

@eroux
Contributor

eroux commented Feb 15, 2023

This is a follow-up of #208.

I've uploaded the files of the new (and likely final) batch on

s3://ocr.bdrc.io/NorbuKetaka2/

let's have batch-0001 for the batch id, and the following info.json

{
   "timestamp": "2023-02-01T00:00:00Z"
}
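
A minimal sketch of producing that info.json body programmatically, in case it helps when preparing the batch. The helper name `build_batch_info` is hypothetical and not part of any existing tool; only the `timestamp` key and its value come from the comment above.

```python
import json
from datetime import datetime, timezone


def build_batch_info(timestamp: datetime) -> str:
    """Serialize an info.json body holding a UTC timestamp (hypothetical helper)."""
    # Normalize to UTC and format with a trailing "Z", as in the example above.
    stamp = timestamp.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return json.dumps({"timestamp": stamp}, indent=3)


info_json = build_batch_info(datetime(2023, 2, 1, tzinfo=timezone.utc))
print(info_json)
```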
@10zintopjor
Contributor

10zintopjor commented Mar 3, 2023

@eroux Do you mean I should move those csv files into the relevant s3 folder and create the opf?

@eroux
Contributor Author

eroux commented Mar 3, 2023

Yes

@10zintopjor
Contributor

Hey, can you check out the sample opf? Should the software_id in the meta be norbuketaka2, or the same as before?

@eroux
Contributor Author

eroux commented Mar 6, 2023

thanks! software_id should be as before, I'll look at the sample in a moment

@eroux
Contributor Author

eroux commented Mar 6, 2023

oh sorry I realize I forgot to change the batch_id in my initial comment (my bad), it should be batch-0002

@eroux
Contributor Author

eroux commented Mar 6, 2023

let's have last_modified set to 2023-02-01T00:00:00Z, but other than that it looks good, thanks!

@10zintopjor
Contributor

OK, then I have to reimport the files to s3.

@eroux
Contributor Author

eroux commented Mar 6, 2023

oh, never mind then, having batch-0001 is not a big deal, we can live with that

@10zintopjor
Contributor

Hey, I have updated the catalog over here. But for the file Works/cf/W1PD133164/norbuketaka2/batch-0002/W1PD133164-I4PD2795.csv, the work id W1PD133164 does not have the image group id I4PD2795.
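
A check like the one described above could be sketched as follows. This is only an illustration: the key layout is inferred from the single path quoted in this comment, the names `OcrCsvKey`, `parse_ocr_csv_key` and `check_key` are hypothetical, and the `catalog` mapping is made-up sample data standing in for the real work-to-image-group catalog.

```python
from typing import Dict, List, NamedTuple, Set, Tuple


class OcrCsvKey(NamedTuple):
    work_id: str         # work id taken from the folder segment
    software_id: str
    batch_id: str
    file_work_id: str    # work id taken from the file name
    image_group_id: str


def parse_ocr_csv_key(key: str) -> OcrCsvKey:
    """Split a key of the assumed form
    Works/<bucket>/<work id>/<software_id>/<batch id>/<work id>-<image group id>.csv
    """
    parts = key.split("/")
    if len(parts) != 6 or parts[0] != "Works" or not parts[5].endswith(".csv"):
        raise ValueError(f"unexpected key layout: {key!r}")
    _, _, work_id, software_id, batch_id, filename = parts
    file_work_id, sep, image_group_id = filename[: -len(".csv")].partition("-")
    if not sep or not image_group_id:
        raise ValueError(f"unexpected file name: {filename!r}")
    return OcrCsvKey(work_id, software_id, batch_id, file_work_id, image_group_id)


def check_key(key: str, catalog: Dict[str, Set[str]]) -> Tuple[bool, List[str]]:
    """Return (ok, works in the catalog that list the file's image group)."""
    parsed = parse_ocr_csv_key(key)
    known = catalog.get(parsed.work_id, set())
    ok = parsed.file_work_id == parsed.work_id and parsed.image_group_id in known
    candidates = sorted(w for w, groups in catalog.items()
                        if parsed.image_group_id in groups)
    return ok, candidates


# Made-up sample catalog for illustration only; the real image groups of
# W1PD133164 are not given in this thread, so its set is left empty.
catalog = {"W1PD133164": set(), "W1PD133161": {"I4PD2795"}}
key = "Works/cf/W1PD133164/norbuketaka2/batch-0002/W1PD133164-I4PD2795.csv"
ok, candidates = check_key(key, catalog)
print(ok, candidates)  # False ['W1PD133161']
```

Listing the works that do contain the image group makes it easier to spot a likely typo in the work id.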

@eroux
Contributor Author

eroux commented Mar 9, 2023

thanks a lot!

It appears that in that case W1PD133164 should instead be W1PD133161, so the file should be merged with W1PD133161-I4PD2795.csv. Are there other cases like this?

Also, the point of software_id is to indicate the s3 folder, and it should be norbuketaka, not norbuketaka2, so please move the files on s3 (the opf files look good)
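
As a rough sketch of what the move amounts to for each key, assuming software_id is the fourth path segment as in the one path quoted earlier in this thread. The function name `retarget_software_id` is hypothetical. S3 has no rename operation, so the actual move would be a copy to the new key followed by a delete of the old one; that part is left out here.

```python
def retarget_software_id(key: str, old: str, new: str) -> str:
    """Return the key with its software_id folder segment swapped (hypothetical helper)."""
    parts = key.split("/")
    # Assumed layout: Works/<bucket>/<work id>/<software_id>/<batch id>/<file>
    if len(parts) < 4 or parts[3] != old:
        raise ValueError(f"software_id segment is not {old!r} in {key!r}")
    parts[3] = new
    return "/".join(parts)


old_key = "Works/cf/W1PD133164/norbuketaka2/batch-0002/W1PD133164-I4PD2795.csv"
new_key = retarget_software_id(old_key, "norbuketaka2", "norbuketaka")
print(new_key)
```

Replacing only the one segment, rather than doing a plain string replace over the whole key, avoids touching a file name that happens to contain the same text.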

I'll go ahead and import the opf files, I'll tell you if I run into any trouble

@10zintopjor
Contributor

No, that file is the only issue in batch-0002.

@eroux
Contributor Author

eroux commented Mar 9, 2023

great!

eroux closed this as completed Dec 23, 2023