Add dataset: french_fiction_16_18th_century #86

Open
1 task done
davanstrien opened this issue Sep 27, 2022 · 0 comments
Labels
dataset Dataset to be added good first issue Good for newcomers

Comments

@davanstrien
Collaborator

A URL for this dataset

https://zenodo.org/record/5770866

Dataset description

A corpus containing all digitized French novels from the beginning of print (the first entry is from 1473) to the 18th century.

French novels of the period have been identified using the Y2 shelfmark (cote) of the French National Library catalog, which served to classify past and present collections of novels in France from 1730 to 1996. Combined use of digitized sources from Gallica, Google Books, Archive.org and other digital libraries made it possible to attain high representativeness: 78% of the novels from 1450-1600 and 68% of the novels from 1600-1700 have been retrieved.

Dataset modality

Text

Dataset licence

Creative Commons Attribution 4.0 International

Other licence

No response

How can you access this data

As a download from a repository/website

Size of dataset

500MB-2GB

Confirm the dataset has an open licence

  • To the best of my knowledge, this dataset is accessible via an open licence

Contact details for data custodian

No response

@davanstrien added the candidate-dataset (Proposed dataset to be added) and dataset (Dataset to be added) labels and removed the candidate-dataset label on Sep 27, 2022
@davanstrien added the good first issue (Good for newcomers) label on Oct 17, 2022