Workshops box

Introduction to the analysis of natural language in Python: Session 2/4 in Learning Python for text-mining and the analysis of natural language

Users with access to the gROW learning environment (those with a U/E-number) can find more information about workshops and courses offered by the Radboud Digital Competence Centre here. The gROW environment is the official location for these materials, and gROW is preferred to libcal for sign-ups and for exploring our materials. Follow this link to go directly to the gROW page for the current workshop series. Users who are unable to access gROW are welcome to register using libcal. A overview of the currently scheduled digital methods trainings can be found on libcal after filtering by the category "DCC" (linked here for convenience).

Series overview:

Text-mining refers to techniques which can involve the collection, processing and parsing of text derived from a range of sources (e.g., corpora, digital libraries, web forums). Text-mining is often performed with the goal of analyzing text to gain insights not readily available without the use of digital methods. It is useful to or shares methods with fields which seek to understand words and their context using computer friendly representations (e.g., word embeddings, or word vectors) such as natural language processing, computational models of language, psycholinguistcs, or digital humanities. In this workshop series, students and researchers will learn text-mining and natural language processing (NLP) techniques using the Python programming language for a range of use-cases. Four workshop sessions are currently available. Participants may choose to attend whichever of these self-contained sessions they find useful, but it is recommended that participants who are new to this topic attend at least Session 2. Readers who are interested in learning about additional tools and methods for text-mining and computer-based analysis of natural language are encouraged to consult the recently published text-mining guide, written by the text-mining support group at Information & Library Services (textminingsupport@ru.nl). Questions about these workshops or their contents can be sent via email to daniel.sharoh@ru.nl.

****************

Session 2: Introduction to the analysis of natural language in Python

Do you want to learn how to use Python to analyze natural language? The Digital Competence Centre organizes a workshop session on this topic which is suitable for participants with a broad range of skill-levels.

This workshop provides an introduction to preprocessing, manipulating and analyzing linguistic and textual properties of natural language text in Python, with limited time dedicated to basic concepts in the Python language. Participants in this workshop will learn how to use the Natural Language Toolkit (NLTK) to segment or tokenize a given text in English. They will also learn techniques to analyze text based on this information. This could mean, for example, identifying the individual sentences and paragraphs in a text, labeling all words in a text with a part-of-speech tag, and analyzing word-frequency or co-occurrence statistics for all identified nouns. As a case-study, participants will perform a sentiment analysis (similar to the concept of valence) to quantify properties of the text that can be related to aspects of its emotional content. After this workshop, participants will be prepared to further develop their programming skills individually, and they will be able to write and automate simple text processing steps that can be integrated into a larger workflow.

Please follow the links below to sign-up for the other sessions:

Link to Session 1

Link to Session 3

Link to Session 4

Related LibGuide: Text mining by Nina Lanke