The `twtjson` table stores the raw tweets in JSON format.

column | type | note |
---|---|---|
tid | bigint | ID of the tweet |
tweet | jsonb | Tweet JSON object |
created_at | timestamp | Creation timestamp of the record, likely its insertion time |
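
As a rough illustration of how a raw tweet might be read back from `twtjson`, here is a minimal sketch. It assumes an in-memory SQLite database as a stand-in for the real one (so the `jsonb` column is stored as plain text), and the sample tweet and the helper `load_tweet` are hypothetical, made up for this example.

```python
import json
import sqlite3

# In-memory SQLite stands in for the real database (an assumption made so
# the sketch runs anywhere); the jsonb column is stored as TEXT here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE twtjson (tid INTEGER, tweet TEXT, created_at TEXT)")

# Made-up sample tweet, not real data.
sample = {"id": 1001, "text": "hello #world", "user": {"id": 42}}
conn.execute(
    "INSERT INTO twtjson (tid, tweet, created_at) VALUES (?, ?, ?)",
    (sample["id"], json.dumps(sample), "2020-01-01 00:00:00"),
)


def load_tweet(conn, tid):
    """Return the stored tweet JSON for a tweet id as a dict, or None."""
    row = conn.execute(
        "SELECT tweet FROM twtjson WHERE tid = ?", (tid,)
    ).fetchone()
    return json.loads(row[0]) if row else None


tweet = load_tweet(conn, 1001)
print(tweet["text"])
```

With a real `jsonb` column the same lookup could be done in the database itself; the Python-side `json.loads` is used here only to keep the sketch portable.
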

The `entity` table contains the entities and their types.

column | type | note |
---|---|---|
entity_id | integer | ID of the entity, used for joining with other tables |
entity_type | string | Type of the entity: hashtag, mentions, etc. |
entity_text | string | Entity itself |
created_at | timestamp | When the entity was first added |

The `entitytwt` table connects the raw tweets to the entities.
It also stores the bot score associated with each entity.

column | type | note |
---|---|---|
tid | bigint | ID of the tweet in the `twtjson` table |
entity_id | integer | ID of the entity in the `entity` table |
user_id | bigint | User ID |
bot_score | real | Bot score associated with the entity |
tweet_date | timestamp | Creation time of the tweet |
created_at | timestamp | Insertion time of the entity |
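
The three tables are meant to be joined through `tid` and `entity_id`. The following sketch shows one way such a join might look. It again uses an in-memory SQLite database as a stand-in for the real one, with column types simplified, and the rows and the helper `tweets_for_entity` are hypothetical, made up for illustration.

```python
import sqlite3

# In-memory SQLite stand-in for the real database (assumption); column
# names follow the tables above, types are simplified.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE twtjson (tid INTEGER, tweet TEXT, created_at TEXT);
CREATE TABLE entity (entity_id INTEGER, entity_type TEXT,
                     entity_text TEXT, created_at TEXT);
CREATE TABLE entitytwt (tid INTEGER, entity_id INTEGER, user_id INTEGER,
                        bot_score REAL, tweet_date TEXT, created_at TEXT);
""")

# Made-up rows, not real data.
conn.executemany("INSERT INTO twtjson VALUES (?, ?, ?)", [
    (1001, '{"text": "hello #world"}', "2020-01-02"),
    (1002, '{"text": "again #world"}', "2020-01-02"),
])
conn.execute("INSERT INTO entity VALUES (1, 'hashtag', 'world', '2020-01-02')")
conn.executemany("INSERT INTO entitytwt VALUES (?, ?, ?, ?, ?, ?)", [
    (1001, 1, 42, 0.25, "2020-01-01", "2020-01-02"),
    (1002, 1, 43, 0.75, "2020-01-01", "2020-01-02"),
])


def tweets_for_entity(conn, entity_type, entity_text):
    """Return (tid, bot_score, tweet JSON text) rows for one entity."""
    return conn.execute(
        """
        SELECT et.tid, et.bot_score, t.tweet
        FROM entity AS e
        JOIN entitytwt AS et ON et.entity_id = e.entity_id
        JOIN twtjson AS t ON t.tid = et.tid
        WHERE e.entity_type = ? AND e.entity_text = ?
        ORDER BY et.tid
        """,
        (entity_type, entity_text),
    ).fetchall()


rows = tweets_for_entity(conn, "hashtag", "world")
# Average bot score over all tweets that mention the entity.
avg_bot_score = sum(r[1] for r in rows) / len(rows)
print(len(rows), avg_bot_score)
```

Starting from `entity` and joining through `entitytwt` keeps the raw JSON in `twtjson` out of the query until the matching tweet ids are known.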