A harvester

A harvester used to harvest tweet with geo-location.

The harvester has two parts. friend_harvester and tweet_harvester. a database is used to store search records to avoid tweet duplication.

The friend_harvester only collect friends informaiton starting from a seed user.(AFLNews for exmaple). It sotres new user ID in the database and mark 'friends_harvested' to true after collecting one's friends( also add time stamp in 'last_time_friends_harvested')

The tweet_harvester find a un-visted user in database and collection this user's timeline. After collecting this user's tweet, the tweet_harvester mark the 'tweet_harvested' to true and add time stamp in 'last_time_tweet_harvested'

Getting started

Install TwitterAPI

pip3 install TwitterAPI MongoDB

A Twitter Application Account for developer:apply one

Login with a twitter account and then apply for a application, collect your consumer key and tokens etc.

HOW TO USE IT

In the file support.py (deleted.)

change 'search_tweets' to whichever tokens are you going to search.
change the 'consumer_key','consumer_secret','access_token_key' and 'access_token_secret' to your own token

Exception log

Any exception occured will be record into log file. Usually the only exception is the duplication issue when inserting an existing tweet into database.

Deployment

Add additional notes about how to deploy this on a live system

Built With

Twitter API - API wrapper for harvest
Zelong Cong - Initial work

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
mongodb_harvester		mongodb_harvester
.gitignore		.gitignore
README.md		README.md
Tweet_by_place.py		Tweet_by_place.py
Tweet_by_timeline.py		Tweet_by_timeline.py
connect_db.py		connect_db.py
connect_mongo.py		connect_mongo.py
friends_harvester.py		friends_harvester.py
mongodb_log		mongodb_log
pip3 install update.sh		pip3 install update.sh
query_num_of_tweet.txt		query_num_of_tweet.txt
run_1.sh		run_1.sh
run_harvester.sh		run_harvester.sh
tweet_harvester.py		tweet_harvester.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A harvester

Getting started

HOW TO USE IT

In the file support.py (deleted.)

Exception log

Deployment

Built With

About

Releases

Packages

Languages

zelongc/harvester-mongodb-couchdb

Folders and files

Latest commit

History

Repository files navigation

A harvester

Getting started

HOW TO USE IT

In the file support.py (deleted.)

Exception log

Deployment

Built With

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages