google_trigrams.jpg
a set of relational visualizations represent the relative frequencies of trigrams as they appear on the web, based on a massive (100GB) n-gram dataset from Google's corpus archive. n-grams are pieces of sentences. a trigram (n=3), for example, might be "I like food" or "frog is tasty."

the first visualization compares the 120 trigrams of the terms 'He' with 'She', while the other is based on 75 trigrams of 'I' & 'You'. the frequencies of the 2nd word in the trigrams were sorted in decreasing order. words are sized according to the square root of their use frequencies. the color-coded lines act like paths (similar to a tree structure), enumerating all of the occurring trigrams.

[link: chrisharrison.net]

Do you like this blog? Subscribe to its RSS feed, the email newsletter or Twitter to keep up to date!
MORE

google_trigrams2.jpg

google_trigrams3.jpg

google_trigrams4.jpg

ADD A COMMENT
Name (required)
Email Address (required)
URL
Remember personal info?
Your Comment (You may use HTML tags for style)