Skip to content

Latest commit

 

History

History
15 lines (11 loc) · 647 Bytes

README.md

File metadata and controls

15 lines (11 loc) · 647 Bytes

trustpilot-scraper

Python scraping tool for trustpilot.com reviews.

Dependencies

  • Python 3.6
  • lxml 3.7.2
  • requests 2.12.4

Usage

  1. Set up up and configure scrape.ipynb to scrape the page you want the reviews from into CSV file
  2. Use the clean.ipynb module to convert and clean this CSV file to a format used by most text classification algorithms

Sentiment analysis

After you've run scrape and clean, you can use this example from sci-kit learn to do text classification.