You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
Was downloading XML and CSV files for all the papers published in the year 2021 for the query "Transcription factors", the limit was set to 100k papers, and hits were 99k, ideally, it should start the download with a warning but the error is TypeError: 'NoneType' object is not subscriptable
To Reproduce
Steps to reproduce the behaviour:
In your windows command prompt type pygetpapers -q "Transcription factors" -x -c -o TF_database_2021 -k 100000 --startdate 2021-01-01 --enddate 2021-12-31
press 'Enter'
Scroll down to the end
See an error like TypeError: 'NoneType' object is not subscriptable
Expected behaviour
Ideally, it should start the download of all the available XML and CSV files related to the query
Screenshots
Desktop (please complete the following information):
OS: Windows 11
Browser : Firefox
Version : Firefox 95.0
Additional context
it usually works for a small corpus of like 1000 to 100 papers, for example, pygetpapers ran smoothly the above query for the year 2022 and set the limit to 1000 papers, but the actual hits were only 458. it downloaded a corpus of 458 papers with CSV and XML files.
But for a huge corpus usually >1k, it shows the above error message.
The text was updated successfully, but these errors were encountered:
Thanks both,
I suggest that 100K is too large a chunk. Maybe 10K
* it may put strain on the server and get blocked
* when errors occur it may be difficult to locate the documents responsible
- as we have here
* make sure you can actually analyze the downloaded material. If you can't
process 10K, downloading 100K won't gain anything.
Describe the bug
Was downloading XML and CSV files for all the papers published in the year 2021 for the query "Transcription factors", the limit was set to 100k papers, and hits were 99k, ideally, it should start the download with a warning but the error is
TypeError: 'NoneType' object is not subscriptable
To Reproduce
Steps to reproduce the behaviour:
pygetpapers -q "Transcription factors" -x -c -o TF_database_2021 -k 100000 --startdate 2021-01-01 --enddate 2021-12-31
TypeError: 'NoneType' object is not subscriptable
Expected behaviour
Ideally, it should start the download of all the available XML and CSV files related to the query
Screenshots
Desktop (please complete the following information):
Additional context
it usually works for a small corpus of like 1000 to 100 papers, for example, pygetpapers ran smoothly the above query for the year 2022 and set the limit to 1000 papers, but the actual hits were only 458. it downloaded a corpus of 458 papers with CSV and XML files.
But for a huge corpus usually >1k, it shows the above error message.
The text was updated successfully, but these errors were encountered: