Given a Twitterer's geotagged Twitter history, can we:
- Determine a home location?
- Determine if they are geographically at-risk during a hazardous weather event?
Data is retrieved from GNIP for the East Coast during Hurricane Sandy. This process is further outlined in the Data_Aquisition
Directory. All of these tweets may be viewed here: epic.cs.colorado.edu/Twitter-Movement-Derivation/dataset
-
Data is then cleaned and processed with Spark to identify users with (1) tweet in ZoneA. And then clustered with DBScan. The details of this are outlined in the
GeoProcessing
Directory. -
TimeProcessing
then looks at every user's temporal spread over a given week and validates the spatial clustering patterns. -
The
TileProcessing
directory filters all of the tweets by time (before, during, and after), and creates tilesets for visualizations. -
Shelter-In-Place looks for users who stayed.
-
Evacuation looks for users who left at some point before landfall.
Total Tweets (from all Jobs): 3,658,714 This number should be the sum of all the individual jobs, will have to confirm with the Dropbox file
Total Tweets with Geo Tag: 3,632,625... wtf?