Skip to content

xiosensei/stash_theporndb_scraper

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

79 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a python script intended to run from a command line to scrape information for Stash scenes from ThePornDB (metadataapi.net).  Requires Python 3 and running the "development" version (not the "latest"/stable branch) of Stash.

Currently, the script supports setting titles, performers, tags, studios, details, and date.  All fields are optional and can be disabled via config options.  Current title, studio, details, and date are overwritten with new data. Current performers and tags are maintained with new results added. If a new performer/studio/tag is found that's not currently in Stash, the script can optionally add the performer/studio/tag using data from ThePornDB.  It can also optionally (via config options) scrape FreeOnes for performer data, and try to pull an image from Babepedia.

Usage:  Download the script and install the requirements (pip install -t requirements.txt).  Modify the configuration.py to include the URL of your Stash endpoint.  Be sure to specify http or https.  Also include your username and password, if using.  Set any configuration parameters you'd like, which are explained in the script.  

Then, run the script (python scrapeScenes.py).  If run without parameters, the script scrapes all Stash scenes using the options from configuration.py.  Add the -h flag to see other supported command line options.

Successfully scraped scenes are tagged with a custom tag, and excluded from future scans.  

Where ThePornDB returns multiple results for a scene, the script supports automatic disambiguation, manual disambiguation, or skipping the scene. Scenes that are skipped due to ambiguous results can be tagged.  An additional execution of the script can then be run only for previously skipped scenes with the ambiguous results tag. 

About

A python script to scrape Stash data from thePornDB

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%