Skip to content

leminhnguyen/crawl_subtitle

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

The code for crawling the best subtitle based on votes

Installation

pip install -r requirements.txt

Crawling Steps

1. Crawl the genre and popular films from rottentomatoes

python src/01_crawl_style_and_film_names.py

2. Crawl the good vietsub links for each film from subscene

python src/02_crawl_best_vietsub_url.py

3. Choose the best subtitle (by score) and download with wget

python src/03_download_best_vietsub.py

Author

About

The code for crawling the best subtitles based on votes on https://subscene.com/

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages