ipfs-search-extractor

Extract data from ipfs-search's database, for phun and profit.

Requirements

Python 3
pipenv

Usage

pipenv shell
python extractor.py
Behold result.
Tweak parameters (in script).

Example

2018

(ipfs-search-extractor) $ python extractor.py | bzip2 -c > exports/ipfs-search-2018.json.bz2
131645 documents written in 57.59049701690674
First item: 2018-01-16T18:46:00Z
Last item: 2018-12-31T23:58:57Z

Example output

[
  "QmbAvZoiPvAaLY6vFyQSxAaMhzSa5vp2CDi4LzRejpw9DZ",
  "xkcd: Brontosaurus",
  "2018-01-16T18:46:00Z"
],
[
  "QmcZ2a1tQpDUoDFGHhXs6Ga795LAbX2t4FEuTBYWxLYuUP",
  "Botany Readings",
  "2018-01-16T18:46:15Z"
],
...
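
The third element of each record is a UTC timestamp. Below is a minimal sketch for turning it into a timezone-aware datetime, for instance to filter records by date. It assumes every timestamp uses the second-resolution format shown in the example above. The helper name `parse_first_seen` is hypothetical and not part of extractor.py.

```python
from datetime import datetime, timezone


def parse_first_seen(value: str) -> datetime:
    """Parse a timestamp such as '2018-01-16T18:46:00Z' into an aware UTC datetime.

    Assumption: second resolution and a literal trailing 'Z', as in the example output.
    """
    naive = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
    return naive.replace(tzinfo=timezone.utc)


# The two timestamps from the example output above.
first = parse_first_seen("2018-01-16T18:46:00Z")
second = parse_first_seen("2018-01-16T18:46:15Z")
print((second - first).total_seconds())  # 15.0
```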

Field description

For efficiency reasons, we are omitting field names. We're using JSON mainly to avoid encoding issues.

[
  "<CID>",
  "<title>",
  "<first-seen>"
]
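
Since the records are positional, a consumer has to put the field names back. Below is a minimal sketch of a reader. It assumes an export is a stream of comma-separated three-element JSON arrays, exactly as in the example output, with no enclosing outer array. The names `Record`, `iter_records` and `load_export` are hypothetical and not part of extractor.py.

```python
import bz2
import json
from typing import Iterator, NamedTuple


class Record(NamedTuple):
    cid: str
    title: str
    first_seen: str


def iter_records(text: str) -> Iterator[Record]:
    """Yield one Record per JSON array found in a comma-separated stream."""
    decoder = json.JSONDecoder()
    pos = 0
    while True:
        # Skip whitespace and the commas that separate records.
        while pos < len(text) and text[pos] in " \t\r\n,":
            pos += 1
        if pos >= len(text):
            return
        # raw_decode parses one JSON value and returns the index where it ended.
        value, pos = decoder.raw_decode(text, pos)
        yield Record(*value)


def load_export(path: str) -> Iterator[Record]:
    """Read a bzip2-compressed export (assumed UTF-8) fully into memory and parse it."""
    with bz2.open(path, "rt", encoding="utf-8") as handle:
        return iter_records(handle.read())


sample = """[
  "QmbAvZoiPvAaLY6vFyQSxAaMhzSa5vp2CDi4LzRejpw9DZ",
  "xkcd: Brontosaurus",
  "2018-01-16T18:46:00Z"
],
[
  "QmcZ2a1tQpDUoDFGHhXs6Ga795LAbX2t4FEuTBYWxLYuUP",
  "Botany Readings",
  "2018-01-16T18:46:15Z"
],
"""
for record in iter_records(sample):
    print(record.cid, record.title, record.first_seen)
```

Reading the whole file into memory keeps the sketch short. For the largest exports, a streaming reader would be the safer choice.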

Example exports (links on IPFS)

2018 (5.97 MiB)
2019 (10.24 MiB, 131645 documents)
2020 (20.12 MiB, 450436 documents)
2021, until 10-8 (456.52 MiB, 10485760 documents)

Note that the great majority of files on ipfs-search.com seem not to have an extracted title!
