Event box

MiddLab Data Workshops: Introduction to Text Mining in R Online

Text mining is the process of transforming unstructured texts of all kinds (literary, scholarly, journalistic, scientific, etc.) into a form where the language of the documents can be analyzed. Using tidy data principles can help make these tasks easier, more efficient, and more interoperable with other tools. Luckily, R has packages that make this process work very well inside the R environment.

In this lesson, participants will learn:

  • Some basic text mining/analysis concepts
  • How to transform texts (e.g. a novel) into a structured dataset ready to use in R
  • How use tidy data packages (such as dplyr and tidyr) to manipulate text data
  • How to perform basic sentiment analysis and word count tasks in R

Participants should have basic familiarity with R. If you are completely new to R, please be sure to attend the Introduction to R workshop on June 14, 2022. It would also be beneficial for attendees to be familiar with the material covered in our Data wrangling in R with dplyr and tidyr and Creating high quality graphics in R with ggplot2 workshops, if they are able.

Date:
Thursday, July 7, 2022
Time:
1:00pm - 4:00pm
Time Zone:
Eastern Time - US & Canada (change)
Campus:
Middlebury Library - Online
Audience:
  Faculty     Staff     Students  
Categories:
  Data     Digital scholarship     Hands on  
Online:
This is an online event. Event URL will be sent via registration email.
Registration has closed.

Presenter(s)

Profile photo of Leanne Galletly
Leanne Galletly
Profile photo of Wendy Shook
Wendy Shook

I am available to meet with you by Zoom appointment to discuss your research. 

Profile photo of Ryan Clement
Ryan Clement