Skip to content

create multiples of word clouds to compare documents visually

License

Notifications You must be signed in to change notification settings

casutton/WordStorm

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Word Storm

The objective of this project is to generate word storms, multiples of word clouds to afford visual comparison of groups of documents. Just as a storm is a group of clouds, a word storm is a group of word clouds. Each cloud in the storm represents a subset of the corpus. For example, a storm might contain one cloud per document, or alternatively one cloud to represent all the documents written in each year, or one cloud to represent each track of an academic conference, etc.

Word Storm

In order to make the clouds easy to compare, words that appear in different clouds are placed in similar locations and more informative words are emphasized. In this way, similar documents are be represented by visually similar clouds.

Create a Word Storm

This project allows you to create word storms from your texts, giving you options to personalize their appearance, so you can create beautiful figures to analyze your documents.

The algorithms to generate the storm are build on top of WordCram (web, gitHub), a flexible word cloud generator that creates beautiful clouds using the popular Wordle's layout.

Usage

  1. Set the path to the text files using 'io.DataPath.java'. The project takes the data from 'datapath/input/folder' and returns the storm in 'datapath/output/folder/parameters'
  2. Decide the cloud's size in 'AppletConf'.
  3. Decide the storm configuration using 'StormConf' (number of files, number of words, colors, angles...)
  4. Create a Storm!

Examples

See examples of usage in 'src/wslauncher':

  1. 'StormLauncher.java' to create a coordinated word storm.
  2. 'IndepLauncher.java' to create independent clouds.

About

create multiples of word clouds to compare documents visually

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 98.5%
  • HTML 1.4%
  • Shell 0.1%