Skip to content

Loading Events

« All Events

  • This event has passed.

CTSI BERD Seminar: An Introduction to Text Mining for Scientific Inquiry

May 7, 1:30 pm to 4:00 pm

An Introduction to Text Mining for Scientific Inquiry by Tim Brick, PhD

An Introduction to Text Mining for Scientific Inquiry by Tim Brick, PhD on May 7, 1:30-4 p.m.

Register here. 

The language that humans use to interact, communicate, and record their thoughts and experiences carries a great deal of information of interest to the health and behavioral sciences.  For example, individuals may talk about their symptoms, discuss their emotions, record notes from clinical meetings, or post on social media about their experiences.  Unfortunately, this important data often goes unused in scientific analysis because its unstructured nature can make it difficult to model with conventional tools.

Text Mining tools can extract useful information and patterns from unstructured text data in a way that can be easily understood and statistically modeled. This workshop is intended to provide an introduction to common tools for text mining ranging from simple word counts to the transformer models that underlie ChatGPT, and highlight the ways they can be used to answer questions in the behavioral and health sciences. Topics will include finding common topics from a person’s daily reflections; extracting sentiment and emotions from social media comments; automatically generating summaries of meeting notes or scientific articles;  and creating and modeling text in new ways with deep word embeddings. Code examples using R/RMarkdown format will be provided.


May 7
1:30 pm to 4:00 pm



Add events

Anyone affiliated with Penn State Health and Penn State College of Medicine can request access to add events to this calendar. View instructions for getting access here.

Infonet calendar

Faculty, staff and students with Penn State Health network access can view selected internal events on the Infonet calendar here.