Skip to content

darthbear/scrapy-mongodb-queue

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

scrapy-mongodb-queue

Use scrapy with mongodb as a queue.

The queue can be defined as a FIFO or LIFO. The order will rely on natural ordering (order of the elements on the disk, so the queue won't be strictly ordered as a FIFO or LIFO but should be close to it).

To use it, edit settings.py and add the following line:

  • SCHEDULER = "scrapy_mongodb_queue.scheduler.Scheduler"

Other options:

  • MONGODB_QUEUE_SERVER: mongodb server (default: localhost)
  • MONGODB_QUEUE_PORT: mongodb port (default: 27017)
  • MONGODB_QUEUE_DB: mongodb db (default: scrapy)
  • MONGODB_QUEUE_PERSIST: should the queue be persisted after the crawl or if the crawl is interrupted (default: True)
  • MONGODB_QUEUE_NAME: name of the collection. By default it will be set to the name of the spider postfixed by "_queue"
  • MONGODB_QUEUE_TYPE: can be FIFO or LIFO

About

Use scrapy with mongodb to store the request queues (FIFO or LIFO)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published