Sunday, August 12, 2007

"WordCloud - A Squarified Treemap of Word Frequency" - Something like this would be cool in a Feed Reader...

CodeProject - WordCloud - A Squarified Treemap of Word Frequency

"WordCloud is a visual depiction of how many times a word is used, or its frequency if you will, within a given set of words. It does this by: reading in plain text, filtering out "stop words", counting how many times a word is used, and displaying results in a Squarified Treemap. (In the images above, the larger a node and more saturated the color, the more frequent its use.)

..."

When I saw this, my first thought was, "Oh I want something like this in my Feed Reader."

Think high level analysis of the new, unread posts, with a user definable threshold (i.e. don't include words with less than ## occurrences). Then clicking on a word/square brings up the list of posts with that word. And a background thread updating the Map as you read through the posts...

Then of course, I'd also want Concept and Natural Language Processing (NLP) as well as just Word mapping.
(So pretty much an Electronic Data Discovery[EDD]/Electronically Stored Information[ESI] search, analysis and review tool for my web feeds... ;)

Hum...

 

Related Past Post XRef:
Feed Stream Analysis - Web Feed/Post Analysis to Group Like/Related Posts
WordNet
"Statistical parsing of English sentences"
"A Model for Weblog Research"
AddressOf.com - MS Research TreeMap.Net

1 comment:

Cro-Code said...

Hello.
You might be interested in Textanz text analysis tool. Besides word frequency it provides frequencies of phrases and wordforms , concordance, readability parameters, charts, export and more.
Detailed description and screenshots are available at product page .
Sincerely ,
Alexander Potyomkin
Cro-Code