Welcome!
Ever want to visualize text but didn't know where to start? Then this is the resource for you!
Chapters:
-
Setup and Tools
Get your local machine setup for text analysis.
-
Tokenization
Learn about splitting up a document into a set of tokens.
-
Tokens in Documents
Learn to analyze tokens in the context of the larger document they appear in.
-
Document Metrics
Here we start to dive into counts and other metrics that can be used to visualize a document.
-
Comparing Documents
With the metrics we have found, we can start comparing different documents.
-
Tokens over Time
We can visualize progression through a document as the same as progression through time, with interesting results.
-
Clustering Documents
Here we look at finding similar documents through clustering.