Practical Skills for Working with Linguistic Data

M.A. course, University of Cologne, 2023

Academic Year: 2023-2024

This course equips students with the skills and techniques needed to handle linguistic data throughout the data lifecycle. Through hands-on training in R, students learn preprocessing, working with data, and postprocessing.

Course Overview

The curriculum covers preprocessing methods for cleaning and preparing primarily text-based linguistic data for subsequent annotation and analysis. Students also explore techniques for data annotation, metadata management, and the representation of linguistic information. In the postprocessing phase, students learn how to analyze and visualize linguistic data.

No prior programming experience is required. The course is accessible to all students interested in managing and analyzing linguistic data.

Dates, Content, and Mini-tasks

Date | Content | Mini-tasks | Mini-task Deadline
11.10.2023 | Course Introduction | No | -
18.10.2023 | Importance of Practical Skills in Linguistics & R basics (1) | Yes | 24.10.2023
25.10.2023 | R basics (2) and R markdown | Yes | 31.10.2023
01.11.2023 | No lecture | - | -
08.11.2023 | Data Management Fundamentals and File Operations in R | Yes | 14.11.2023
15.11.2023 | Data Cleaning and Preprocessing | Yes | 21.11.2023
22.11.2023 | Text Processing Basics (Regular Expressions) | Yes | 28.11.2023
29.11.2023 | Introduction to Textual Data Types & Data Acquisition from Web | Yes | 05.12.2023
06.12.2023 | Sketch Engine and online queries | Yes | 12.12.2023
13.12.2023 | Annotation: Manual and Automatic | Yes | 19.12.2023
20.12.2023 | Exploratory Data Analysis | Yes | 09.01.2024
- | Holiday breaks | - | -
10.01.2024 | Corpus Analysis in R | Yes | 16.01.2024
17.01.2024 | Data Visualization | Yes | 23.01.2024
24.01.2024 | Advanced Topics | No | -
31.01.2024 | Useful Tools & Course Wrap-up | No | -

Student Feedback Highlights

  • 4.8/5.0 average rating in official faculty evaluation
  • Top 25% ranking among all linguistics courses offered that semester