Digitalized language proficiency testing & practice

Projects

C-Test[editor]

The C-test format

The C-test format, which is based on the principle of reduced redundancy, enables teachers and researchers to measure global second language (L2) proficiency. It was first developed in the early 1980s for English and German on the basis of the Cloze test, and since then has become one of the most widely used placement test formats in higher education due to its high validity and favorable cost-benefit ratio. With more than 500 publications for many languages, the C-test is also one of the most researched test instruments in language testing not just for L2 but also L1 and heritage languages.

According to the so-called 'Rule of 2-2-2', a C-test consists of 4 to 6 short texts (with either 20 or 25 gapped words) in which, starting from the second sentence, the second half of every second word is gapped (for words consisting of an odd number of letters, half-plus-1 is gapped); the last sentence is also left intact. Examinees are required to fill in the gapped part of the words (i.e., reconstruct the original words) based on general language knowledge and context knowledge (including grammar, morphology, semantics, and syntax).

While traditionally a paper-based test format, C-tests are increasingly being administered digitally, with gapping, administration, scoring, and feedback all being done automatically.

The collapse of the Soviet Union in 1991 opened opportunities for the former states of the USSR to independently manage the use of their own local languages. It al_____, however, bro_____ with i_____ a ho_____ of chall_____ regarding lingu_____ policy a_____ ideology. Dec_____ of buil_____ a uni_____ Soviet iden_____ prioritised ov_____ ethnic iden_____ gave ri_____ to a num_____ of diff_____ questions rega_____ language u_____ and adminis_____, many o_____ which t_____ former Sov_____ states a_____ still strug_____ to ans_____ today. These include questions about de-russification, the status and use of regional languages, and the re-establishment (using language) of an identity separate from modern-day Russia in a population with a Russian-speaking majority.

Example of an English C-test testlet

C-Test[editor]

The C-Test[editor] is an online tool, developed by Magdalena Wolska as the Galatea platform in 2014 and developed into the current Sabrina Dittrich (neé Galasso) for English, French, Italian, and Spanish as part of a Master thesis in Computational Linguistics in 2016. It has since been extended for other languages throughout 2021-present by various team members in the D.L.P.T.P project. This tool allows users to automatically generate C-test testlets from user-provided texts with optional language-specific considerations.

At the moment, there are multiple teams working on C-tests for different languages, consisting of members from the University of Tübingen and other universities. The focus is twofold: developing linguistic and pedagogical concepts, as well as implementing these concepts with computational linguistics.

The currently available languages in the C-Test[editor] are: Arabic, Chinese, English, French, German, Italian, Russian, Spanish.

Please contact Daniel Stuhlinger if you are interested in accessing the C-Test[editor].

Department-specific placement tests

More information on the application of C-tests as placement tests at the University of Tübingen can be found on the respective department homepage:

Department of Romance Languages · Department of Slavic Languages · Foreign Language Center (FSZ)

Language learning exercises

Currently in early stages of development is a platform intended to allow users to generate exercises to help second language learners (or in the case of Russian, heritage speakers) hone their skills. These exercises target various linguistic features and are generated in a different format depending on the needs of the language and learning unit. At the moment the scoring system grades only correct/incorrect answers, however, it is planned to incorporate more complex exercise types such as translation tasks and provide learners with more detailed automated feedback.

Please see the 'Supported languages' section for more language-specific information.

Supported languages

ARABIC

Teaching and learning Arabic as a foreign language can be supported with the use of effective digital assessment tools. An Arabic extension to the C-Test[editor] has been added in order to facilitate the automatic generation of C-tests for assessing learners' Arabic while taking into account a range of language-specific features. This project comprises a Master thesis in Computational Linguistics in collaboration with LEAD, the Department of Oriental and Islamic Studies (Abteilung für Orient- und Islamwissenschaft) and the Center for Islamic Theology (ZITh) at the University of Tübingen as well as Humboldt University of Berlin and University of Würzburg. Additionally, an Arabic group has been formed for the development of C-tests which consists of academics from Tübingen and other universities, allowing for considering a variety of user-specific needs.

C-Test[editor]

The Arabic option of the C-Test[editor] includes a variety of language-specific features which address some linguistic challenges. The user is able to choose whether or not they would like to:

include or exclude diacritics
exclude the definite article (ال) from gapping (it is always connected to the word as a prefix, which may lead to invalid gapping as the article is sometimes equal in length to half of the length of the word to which they are connected, which results in incorrectly gapping the actual lexical item

Since Arabic is a highly inflectional language, a section is included in the C-Test[editor] which displays the stem of each gapped word.

Additional linguistic features are also included in the C-Test[editor], such as some linguistic complexity and readability metrics, as well as an overview of the number of verbs, adjectives, relative pronouns, etc. which can aid in measuring the difficulty of the text.

Language exercises

Work is currently underway to create exercises for second language learners of Arabic. These will include more general tasks such as fill-in-the-blanks, multiple choice, arranging words in the correct order, etc., as well as language-specific tasks such as filling in diacritics. The current correct/incorrect model of feedback will be expanded to include more elaborate feedback formats with hints to provide individualised assistance to learners.

CHINESE

Chinese is one of the most frequently learned languages in the world, so an effective Chinese proficiency testing tool such as the C-test is beneficial to both learners and teachers in helping identify learners' language proficiency levels. For learners, this means that they can self-assess their language proficiency and receive a recommendation for a course suitable to their individual proficiency level; for teachers this means the possibility to form homogeneous language groups. However, currently little research in this field has been done on non-European languages such as Mandarin Chinese. The C-Test[editor] has therefore been extended to also include Chinese in order to further research in this challenging area. A Chinese group has been formed for C-test development which consists of academics from the University of Tübingen and other universities. This group aims to meet the technical language-specific prerequisites for an automated placement test and discuss conceptual issues, including user-specific questions.

C-Test[editor]

The C-Test[editor] has been extended to include the automatic generation of C-tests in Chinese. The Chinese option provides users with language-specific settings, including:

choosing whether or not to generate the test in simplified or traditional characters
displaying testlets with or without Pinyin and optionally testing tone by number or mark
selecting how many characters a gap should comprise

The gap unit in the editor is a single character rather than individual strokes. In other words, the gapped second half of a word consists of a complete character. The character-based deletion approach could help avoid testing only test takers’ literacy skills. Users are also provided with some linguistic complexity and readability statistics for each text, which are useful for the evaluation of the difficulty level of the text. Users can also see various statistics pertaining to the text, such as the number of lexical or grammatical items, repetitions, etc.

Additionally, the use of EBCL (European Benchmarks for the Chinese Language) vocabulary frequency list as a basic measurement of lexical difficulty is planned to be implemented in the future. By cross-comparing the words in the text with the EBCL list, users can easily see if their texts contain advanced lexical items that are beyond test takers’ learning scopes, and adjust the texts accordingly.

Language exercises

There are currently plans to develop language learning exercises with detailed and automated feedback for learners of Chinese who would like to keep track on their learning progress.

ENGLISH

English has been implemented in the C-Test[editor] from the beginning, so most of the work lies in creating new features. C-tests for English have been tested for many years in both a paper format and electronically at the Foreign Language Center (FSZ) at the University of Tübingen, yielding a large collection of results for comparison. Currently, the C-Test[editor] offers a wide selection of text statistics for English, with multiple complexity and readability analysis features also currently in development.

FRENCH

French has been implemented in the C-Test[editor] from the beginning, so most of the work lies in creating new features. C-tests for French have been tested for many years in both a paper format and electronically at the Foreign Language Center (FSZ) at the University of Tübingen, yielding a large collection of results for comparison. Currently, the C-Test[editor] offers a wide selection of text statistics for French, such as an overview of the number of nouns, verbs, repeated gaps, etc. in the text.

C-tests in French have also been implemented as tests for general language proficiency assessment at the Department of Romance Languages at the University of Tübingen via the Ilias platform. An automated analysis on the results of the placement tests was performed, which allowed for easily visualising testlet-level and broader test-level statistics.

GERMAN

The need for teaching German as second (i.e., German taught in Germany as a foreign language) or foreign language in a university context is growing steadily, both within German-speaking countries and abroad. This requires a means of adequately assessing learners' language skills by means of a placement test, as well as designing lessons and individual learning units according to individual language levels. These lessons, where applicable or necessary, can be offered digitally with automated feedback.

C-Test[editor]

C-tests have long been proven to be very reliable and meaningful as a placement test format at the university level. In order to facilitate digitalisation of language learning and teaching, the C-Test[editor] has now been extended to include German as part of the C-test project at the University of Tübingen, developing linguistic and pedagogical concepts in cooperation with other universities. The C-Test[editor] for German currently enables users to automatically gap the second half of every second word, according to the C-test principle, and to export the gapped text. In development are also options for language-specific characteristics of German, such as the productive occurrence of multi-part compound nouns. Since a clear reconstruction is not to be expected here, an option will be provided to gap the entire compound as a single word or gap the components separately. The C-Test[editor] interface also provides additional information about the generated testlet, such as the type or frequency of the gapped words. Furthermore, the interface provides some basic statistics on the linguistic complexity and readability, such as the Type-Token Ratio of the text. This facilitates the selection of suitable texts for the creation of testlets.

Automated placement testing is also planned for German as a second language. The concepts and format required for this are currently in development.

Language exercises

Finally, an additional platform linked to the C-Test[editor] is being developed in parallel. This platform is planned to host digitalised language exercises for learners of German which will give learners individual feedback (correct/incorrect answers, as well as more detailed hints to guide learners to the correct answers) and provide teachers with the opportunity to choose between different exercise formats, proficiency levels, and language-specific options.

ITALIAN

Italian has been implemented in the C-Test[editor] from the beginning, so most of the work lies in creating new features. C-tests for Italian have been tested for many years in both a paper format and electronically at the Foreign Language Center (FSZ) at the University of Tübingen, yielding a large collection of results for comparison. Currently, the C-Test[editor] offers a wide selection of text statistics for Italian, such as an overview of the number of nouns, verbs, repeated gaps, etc. in the text.

C-tests in Italian have also been implemented as tests for general language proficiency assessment at the Department of Romance Languages at the University of Tübingen via the Ilias platform. An automated analysis on the results of the placement tests was performed, which allowed for easily visualising testlet-level and broader test-level statistics.

RUSSIAN

In teaching Russian as a foreign language, there exists the need for a tool which can automatically generate a valid digital placement test for learners. As the C-test format for Russian is still relatively rare, the conceptual approach for C-tests in Russian was developed as part of the C-test project in collaboration with LEAD and the Department of Slavic Studies. Starting in 2021, digital C-tests offered via the Ilias platform are in use as placement tests for Russian courses at the Department of Slavic Studies, with an automated analysis able to be generated for the results in order to allow for the easy visualisation of testlet-level and broader test-level statistics.

Additionally, there is a currently on-going Russian validation project in which C-tests are being offered to learners and teachers from universities all over Germany in order to collect data on Russian C-tests and further research in this field. This project includes not only C-tests for foreign language learners, but additionally a tailored Cloze test targeted at heritage speakers.

C-Test[editor]

The Russian extension of the C-Test[editor] has been developed in collaboration with LEAD and the Department of Slavic Studies at the University of Tübingen as part of a Master thesis in Computational Linguistics.

Along with the C-test itself, the C-Test[editor] also provides users with some linguistic complexity and readability measures such as various type-token ratios in order to help determine the difficulty level of a text and its appropriateness for foreign language learners and heritage speakers at various proficiency levels. The tool also offers a variety of text statistics, such as gap length, types of gapped words, etc., in order to factor in different teaching and learning needs (e.g., assisting language teachers in quickly and efficiently determining different levels of competence in Russian).

Language exercises

Currently in development for Russian as part of a Master thesis in Computational Linguistics are automatically generated language exercises aimed at heritage speakers.

SPANISH

Spanish has been implemented in the C-Test[editor] from the beginning, so most of the work lies in creating new features. C-tests for Spanish have been tested for many years in both a paper format and electronically at the Foreign Language Center (FSZ) at the University of Tübingen, yielding a large collection of results for comparison. Currently, the C-Test[editor] offers a wide selection of text statistics for Spanish, such as an overview of the number of nouns, verbs, repeated gaps, etc. in the text.

C-tests in Spanish have also been implemented as tests for general language proficiency assessment at the Department of Romance Languages at the University of Tübingen via the Ilias platform. An automated analysis on the results of the placement tests was performed, which allowed for easily visualising testlet-level and broader test-level statistics.

Contact us

Project coordinator

Dr. Claudia Duttlinger

LEAD - Graduate School & Research Network
Language & Education, University of Tübingen

claudia.duttlinger(at)uni-tuebingen.de

Current team members

Hebah Ahmed

Arabic C-tests and exercises

hebah.ahmed(at)semsprach.uni-tuebingen.de

Mourhaf Kazzaz

Student assistant
Arabic C-tests and exercises

m-mourhaf.kazzaz(at)student.uni-tuebingen.de

Ting-Yu Liu

Student assistant
Chinese C-tests and exercises

ting-yu.liu(at)student.uni-tuebingen.de

Lorena Raichle

Student assistant
German C-tests & C-test administration for Department of Romance Languages

lorena.raichle(at)student.uni-tuebingen.de

Nelly Sagirov

Student assistant
English, Russian & German C-tests and exercises

nelly.sagirov(at)student.uni-tuebingen.de

Daniel Stuhlinger

Student assistant
User support

daniel.stuhlinger(at)student.uni-tuebingen.de

Former team members

Matthias Drews

Student assistant
User support, project development, & publicity

matthias.drews(at)student.uni-tuebingen.de

Affiliated People

Prof. Dr. Detmar Meurers

Seminar für Sprachwissenschaft
University of Tübingen

dm(at)sfs.uni-tuebingen.de

Sabrina Dittrich (neé Galasso)

Original author of C-Test[editor] interface and its implementation
of English, French, Italian, and Spanish.

dittrich.sabrina(at)t-online.de

Dr. Magdalena Wolska

Original author of Galatea, the predecessor of the C-Test[editor]

Digitalised Language Proficiency Testing & Practice

Projects

C-Test[editor]

The C-test format

C-Test[editor]

Department-specific placement tests

Language learning exercises

Supported languages

ARABIC

C-Test[editor]

Language exercises

CHINESE

C-Test[editor]

Language exercises

ENGLISH

FRENCH

GERMAN

C-Test[editor]

Language exercises

ITALIAN

RUSSIAN

C-Test[editor]

Language exercises

SPANISH

Tutorials

Placement testing on Ilias

C-Test[editor] manual

Future perspectives

Contact us

Project coordinator

Current team members

Former team members

Affiliated People