ipfs-search-extractor

Extract data from ipfs-search' database, for phun and profit.

Requirements

Python 3
pipenv

Usage

pipenv shell
python extractor.py
Behold result.
Tweak parameters (in script).

Example

2018

(ipfs-search-extractor) $ python extractor.py | bzip2 -c > exports/ipfs-search-2018.json.bz2
131645 documents written in 57.59049701690674
First item: 2018-01-16T18:46:00Z
Last item: 2018-12-31T23:58:57Z

Example output

[
  "QmbAvZoiPvAaLY6vFyQSxAaMhzSa5vp2CDi4LzRejpw9DZ",
  "xkcd: Brontosaurus",
  "2018-01-16T18:46:00Z"
],
[
  "QmcZ2a1tQpDUoDFGHhXs6Ga795LAbX2t4FEuTBYWxLYuUP",
  "Botany Readings",
  "2018-01-16T18:46:15Z"
],
...

Field description

For efficiency reasons, we are omitting field names. We're using JSON mainly to avoid encoding issues.

[
  "<CID>",
  "<title>",
  "<first-seen>"
]

Example exports (links on IPFS)

2018 (5.97 MiB)
2019 (10.24 MiB, 131645 documents)
2020 (20.12 MiB, 450436 documents)
2021, until 10-8 (456.52 MiB, 10485760 documents) Note that the greater majority of files on ipfs-search.com seem not to have an extracted title!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

ipfs-search-extractor

Requirements

Usage

Example

2018

Example output

Field description

Example exports (links on IPFS)

Files

README.md

Latest commit

History

README.md

File metadata and controls

ipfs-search-extractor

Requirements

Usage

Example

2018

Example output

Field description

Example exports (links on IPFS)