Corpus Linguistics: Focus on Academic Language In this course, you will learn to use, analyze, and visualize corpus data, to transcribe spoken dialogue, to use online resources and software applications, and to create your own corpus. In addition to such corpus linguistic skills, you will be able to create corpora specifically from your own writing in order to analyse and improve your English.

In session 1, "What is Corpus Linguistics?", you will learn some of the basic concepts of corpus linguistics, you will learn what a corpus is and how to analyze it by using AntConc, i.e. a piece of software used to analyze corpus data.

In session 2, "Collocations and keywords",  we are going to delve deeper into corpus linguistics and see how to extract collocates for certain expressions in our data. Collocates are a sequence of words or terms that co-occur more often than would be expected by chance. An example is the expression “good morning” because good and morning co-occur more often together (in combination) that would be expected by random chance. The same holds true for “Merry Christmas”: merry and Christmas are also collocates.
In addition, we are going to see how to get AntConc to provide us with a list of keywords that are helpful to compare the content (and also the register) of one corpus to another. A keywords is a word which occurs in a text more often than we would expect to occur by chance alone. Key words are calculated by comparing the word frequencies in a text against their expected frequencies derived another corpus, which acts as a reference for the corpus you are interested in (cf. http://en.wikipedia.org/wiki/Keyword_%28linguistics%29).

In session 3, "Compiling a Corpus", we are going to learn how to compile a corpus and you will also be putting this knowledge into practice by building your own personalized corpus.

In session 4, "Academic English & Corpora 1", we are going to apply our knowledge of corpus linguistics to a more in-depth analysis of register, because when writing Academic English, you need to know what characteristic features this register has.

In session 5, "Academic English & Corpora 2", we will continue to look closely at Academic English and apply what we have learned so far to investigate how certain words and constructions are used and if they are appropriate for Academic Discourse.

In session 6, "Corpora in the Classroom", we look at how corpora can be used in ESL (English as a Second Language) and EFL (English as a Foreign Language) class rooms. In addition, we will use the corpora we have compiled earlier to see how term papers differ from research articles and how this can be used to improve your own term papers and your English by using and analyzing corpus data.

Session 7, "Analyzing Linguistic Data", is of a more general nature and will introduce to some very basic concepts of quantitative work in general.
In this session, we will have a look at how you can and should visualize your data. More specifically, this session focuses on how to set up tables and graphs in Microsoft Excel and what you need to consider when doing so.
Also, we will have a look at how you can analyze your data statistically. To do so, this session will introduce you to basic terms of statistics and show you how to perform one of the simplest and most widely used tests, the chi-squared test.