Skip to content

Commit

Permalink
New URL scene and performer scraper for CDGirls network (stashapp#1823)
Browse files Browse the repository at this point in the history
* Create CDGirls.yml

New Scene and Performer scraper for CD Girls and their network subsites:
- First Timers
- Just Barely 18
- Luxury Lesbians
- Fucked By A Machine
- Sybian Orgasms
- VR Body Shots

* Update SCRAPERS-LIST.md

Added cdgirls.com to scrapers list.
  • Loading branch information
MortonBridges authored May 9, 2024
1 parent 302fe1a commit afc09db
Show file tree
Hide file tree
Showing 2 changed files with 71 additions and 0 deletions.
1 change: 1 addition & 0 deletions SCRAPERS-LIST.md
Original file line number Diff line number Diff line change
Expand Up @@ -400,6 +400,7 @@ catfight-chicks.com|Shopmaker.yml|:heavy_check_mark:|:x:|:x:|:x:|-|-
caughtfapping.com|Adultime.yml|:heavy_check_mark:|:x:|:x:|:x:|Python|-
caughtmycoach.com|Nubiles.yml|:heavy_check_mark:|:x:|:x:|:x:|-|-
cazzofilm.com|PornsiteManager.yml|:heavy_check_mark:|:x:|:heavy_check_mark:|:x:|-|Gay
cdgirls.com|CDGirls.yml|:heavy_check_mark:|:x:|:x:|:heavy_check_mark:|-|-
cdnbus.lol|JavBus.yml|:heavy_check_mark:|:x:|:x:|:heavy_check_mark:|-|-
cfnmeu.com|Williamhiggins.yml|:heavy_check_mark:|:x:|:x:|:x:|-|Gay
chaosmen.com|ChaosMen.yml|:heavy_check_mark:|:x:|:x:|:x:|Python|Gay
Expand Down
70 changes: 70 additions & 0 deletions scrapers/CDGirls.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,70 @@
name: CDGirls
sceneByURL:
- action: scrapeXPath
url:
- cdgirls.com
scraper: sceneScraper

performerByURL:
- action: scrapeXPath
url:
- cdgirls.com
scraper: performeScraper

xPathScrapers:
sceneScraper:
scene:
Title: //meta[@property='og:title']/@content
Date:
selector: //span[contains(.,'Added')]/following-sibling::span
postProcess:
- parseDate: 2-1-2006
Details:
selector: //div[@class='panel-content']/p
concat: ' '
Performers:
Name: //div[@class='model_name']
Image: //div[@class='model_img']/img/@src
Image:
selector: //script[contains(.,'image:')]
postProcess:
- replace:
- regex: .+?image:.+?'([^']+).+
with: $1
Tags:
Name:
selector: //div[@class='video-footer']/strong[contains(.,'Tags')]/following-sibling::p/a
Studio:
Name:
# Some scenes are part of multiple subsites. This grabs the first that's listed.
selector: //div[@class='video-footer']/strong[contains(.,'CDGirls WebSite')]/following-sibling::p/a
postProcess:
- map:
Amateur: First Timers
Barely18: Just Barely 18
Lesbian: Luxury Lesbians
Sexmachines: Fucked By A Machine
Sybian: Sybian Orgasms
Vrbs: VR Body Shots

performeScraper:
performer:
Name: //aside//h3
Height:
selector: //li[contains(.,'Height')]//strong
postProcess:
- replace:
- regex: '^'
with: '0.'
- feetToCm: true
Weight:
selector: //li[contains(.,'Weight')]//strong
postProcess:
- lbToKg: true
EyeColor: //li[contains(.,'Eye Color')]//strong
HairColor: //li[contains(.,'Hair Color')]//strong
Details: //ul[@class='list-unstyled']/li[1]
Image: //div[@class='model_img_lg']/img/@src
Disambiguation:
fixed: CD Girls
# Last Updated May 08, 2024

0 comments on commit afc09db

Please sign in to comment.