Step 1: Exercise One

Last updated 4 years ago

In this two-part exercise, you will dive straight into Voyant to get a sense of how the tool works and what the text analysis process can look like. In part one, you will learn how to copy and paste text into Voyant and see how different Voyant tools work together. In part two, you will learn how to upload a text file to Voyant and a little about preparing a text for text analysis.

Part One

1.) Copy Martin Luther King Jr.'s "I Have a Dream" speech from this site. (If the site is down, search for a full version of the speech.) Make sure to copy only the speech and not other text on the webpage. (Including any other text will affect the results.)

2.) Go to Voyant.

3.) Paste the text into the "Add Text" box and click "Reveal":

The results should look something like this:

4.) To get a sense of how Voyant works, click on the word "freedom" in the Cirrus [A], or word cloud, then scroll through the Reader [B] and notice how the word "freedom" is highlighted throughout.

In Trends [C], notice how the line graph shows only "freedom." Now click on one of the line graph points and see how Contexts [D] changes to show the context in which "freedom" appears throughout the speech. (It should look like the image below.)

Note: If you click the question mark in the upper right corner of each tool, e.g., Cirrus, you will get an explanation of that specific tool.

Text analysis is more often associated with a large corpus (for example, all the works of a single author) or an enormous one (for example, all fiction published from 1800 to 1900). With a smaller corpus, and a single speech is a particularly small one, a text analysis tool like Voyant can facilitate close reading and is especially good for examining structure and word usage.
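
For readers curious about what Cirrus, Trends, and Contexts are showing in Part One, here is a minimal Python sketch. It is not Voyant's implementation, only an approximation of the ideas: counting words, counting one word across equal slices of the text, and listing the words around each occurrence. The function names, the tokenizing rule, and the sample sentence are all assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (a simplistic rule)."""
    return re.findall(r"[a-z']+", text.lower())


def word_counts(tokens):
    """Word frequencies, roughly the information behind a word cloud."""
    return Counter(tokens)


def trend(tokens, term, segments=10):
    """Count `term` in up to `segments` roughly equal slices of the text."""
    size = max(1, -(-len(tokens) // segments))  # ceiling division
    return [tokens[i:i + size].count(term) for i in range(0, len(tokens), size)]


def contexts(tokens, term, window=5):
    """Return (left, term, right) rows for every occurrence of `term`."""
    rows = []
    for i, tok in enumerate(tokens):
        if tok == term:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            rows.append((left, tok, right))
    return rows


if __name__ == "__main__":
    # Hypothetical sample sentence; paste the full speech here instead.
    sample = "Let freedom ring. Let freedom ring from every hill and every valley."
    toks = tokenize(sample)
    print(word_counts(toks).most_common(3))
    print(trend(toks, "freedom", segments=3))
    for row in contexts(toks, "freedom", window=2):
        print(row)
```

Running this on the pasted speech and comparing the numbers with what Voyant displays is one way to see that the tool counts and arranges words, while the interpretation is left to you.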

Part Two

1.) Download the following text file, CompleteWorks-Shakespear.txt (about 5MB). (It is The Complete Works of Shakespeare from Project Gutenberg.) It will likely download to your downloads folder or desktop.

2.) Go to Voyant again to launch a new instance and upload the text file by clicking "Upload," navigating to the downloaded file, and selecting it. It will automatically "reveal" and will look something like the following:

Take a minute to look the Voyant instance over. Notice words like "shall," "hath," and any special characters in the word cloud. Notice the Project Gutenberg "boilerplate text" in the Reader. In Trends, notice that the horizontal axis is labeled with numbers, and notice that as you scroll down in Contexts, the name in the Document column comes from the text file name and never changes.

3.) Now take a look at this Voyant instance (also seen in the image below), which also contains The Complete Works of Shakespeare. This time the text was prepared before it was uploaded.

Notice that words like "shall," "hath," and special characters are no longer in the word cloud. This is because stopwords were applied to remove them. Notice that the boilerplate text is gone. This is because it was deleted from the text file before the file was uploaded. In Trends, the play names can now be seen below the horizontal axis, and they can also be seen in the Document column in Contexts. This is because the file, which contained all of Shakespeare's plays, was cut up into individual text files, one for each play. The files were also renamed to match the play titles. The sonnets, which were also in the text, were removed so that the instance would focus solely on the plays.

As you explore Voyant, keep in mind that Voyant, like all text analysis tools, does not do the analysis for you. It provides ways into texts that enable users to come to conclusions based on their own knowledge and analytical skills.
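
The preparation described above (deleting the boilerplate, applying stopwords, and cutting one file into one file per play) can also be scripted. The following is a minimal Python sketch, not the method used to build the prepared instance. It assumes the boilerplate is delimited by "*** START OF ..." and "*** END OF ..." marker lines, and that each play title sits on a line of its own exactly once; the raw file may not meet the second assumption (for example, if titles also appear in a table of contents), so check the results by hand. All function names and the toy text are hypothetical.

```python
import re

# Assumed marker patterns; check them against the file you downloaded.
START_RE = re.compile(
    r"\*\*\*\s*START OF (?:THE|THIS) PROJECT GUTENBERG EBOOK.*?\*\*\*",
    re.IGNORECASE | re.DOTALL,
)
END_RE = re.compile(
    r"\*\*\*\s*END OF (?:THE|THIS) PROJECT GUTENBERG EBOOK", re.IGNORECASE
)


def strip_boilerplate(text):
    """Keep only the text between the start and end markers, if present."""
    start = START_RE.search(text)
    begin = start.end() if start else 0
    end = END_RE.search(text, begin)
    stop = end.start() if end else len(text)
    return text[begin:stop].strip()


def remove_stopwords(tokens, stopwords):
    """Drop tokens that appear in the stopword list (case-insensitive)."""
    stop = {w.lower() for w in stopwords}
    return [t for t in tokens if t.lower() not in stop]


def split_by_titles(text, titles):
    """Split text into {title: body} at lines that exactly match a title.

    Lines before the first title are discarded. If a title occurs more
    than once, the last occurrence wins.
    """
    title_set = set(titles)
    sections = {}
    current = None
    for line in text.splitlines():
        stripped = line.strip()
        if stripped in title_set:
            current = stripped
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    return {t: "\n".join(lines).strip() for t, lines in sections.items()}


def filename_for(title):
    """Build a file name from a title, e.g. 'The Tempest' -> 'The_Tempest.txt'."""
    return re.sub(r"[^A-Za-z0-9]+", "_", title).strip("_") + ".txt"


if __name__ == "__main__":
    # Toy stand-in for the downloaded file, not real Gutenberg content.
    raw = (
        "Header text\n"
        "*** START OF THE PROJECT GUTENBERG EBOOK SAMPLE ***\n"
        "PLAY ONE\nfirst play text\n"
        "PLAY TWO\nsecond play text\n"
        "*** END OF THE PROJECT GUTENBERG EBOOK SAMPLE ***\n"
        "License text\n"
    )
    body = strip_boilerplate(raw)
    for title, play in split_by_titles(body, ["PLAY ONE", "PLAY TWO"]).items():
        print(filename_for(title), "->", play)
```

Each section could then be saved under the name returned by `filename_for`, since Voyant takes the name in the Document column from the file name. Voyant also lets you apply a stopword list inside the tool, so `remove_stopwords` is only there to show what that step amounts to.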

Image caption (Part One, step 4): "Freedom" as it appears in Trends and Contexts.