This is a project that was done to score the sentiment on topics of common forms of descrimination.
- Online, real-time twitter analysis of sentiment with tweets related to sectarianism.
- Version .1
- Learn Markdown
- Install requirements.txt with pip
- setup
config.py
- make sure there is labeled, trainable data at
labels/train.csv
(Note: currently needs to be re-gathered and re-labeled) - run
sent.py
- Browse to 5000 port of the machine it is set up on
- /city_statistics/<city_name>/ to view top k neg/pos people in a city
- /state_statistics/<state_two_char_iso_code>/ to view top k neg/pos people in state
- /statistics/ general information charts
- / for Map of tweets
- I actually don't have the final version of the repo because the correct up-to-date csv files are not in this right now, and the existing ones are seriously majorly flawed (no for real, go look at them).
Guess it never got committed and the machine I originally wrote this on is lost.
- Submit a PR to merge into master
Pretty much everything. This was a very introducting project on using Machine Learning on streamed data.
Want to improve it?
- Better / More keywords
- More training data
- Sep classifier for both category and sentiment
- Better classifiers.
- Use a better way to store results from data stream.