OIIC - Online Interactions in Intercomprehension 1.0 (annotated version)
Please use the following text to cite this item or export to a predefined format:
Cervini, Cristiana and Paone, Emanuela, 2026, OIIC - Online Interactions in Intercomprehension 1.0 (annotated version), ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", http://hdl.handle.net/20.500.11752/ILC-2172
Authors
Item identifier
Date issued
2026-04-16
Size
26.058 tokens
Language(s)
Description
The OIIC Annotated Version is a corpus of transcribed and annotated online interactions collected in academic contexts where intercomprehension among Romance languages is used as a pedagogical approach. The data consist of transcriptions of videorecorded interactions involving university students and tutors engaged in collaborative tasks, classroom discussions, and project-related activities in multilingual settings.
This version of the corpus includes multilayer annotation of intercomprehension strategies performed in ELAN. The annotation system captures several dimensions of interaction, including interactional strategies (e.g. clarification requests, confirmation of understanding, agreement and disagreement), metadiscursive strategies (e.g. reformulations, metalinguistic comments), lexical phenomena (e.g. code-switching, calques, specialized vocabulary, translation), non-verbal and prosodic strategies (e.g. slowed speech, emphasis, gestures), participatory dominance phenomena (e.g. cooperative and competitive overlaps or interruptions), and misunderstanding sequences (resolved or unresolved).
The annotated corpus is distributed in multiple complementary formats: ELAN files (.eaf) with time-aligned multilayer annotation, annotated transcripts in text format with embedded annotation labels, and tabular files (.csv) derived from the ELAN annotations to support quantitative analysis. The ELAN files constitute the primary annotated data, while the text and tabular versions are provided to facilitate qualitative reading, corpus exploration, and statistical analysis.
The corpus is accompanied by metadata files describing sessions and participants, as well as documentation of the annotation system, including annotation guidelines and a table of annotation labels and their correspondence with ELAN labels.
This annotated version of the corpus is particularly suitable for research on intercomprehension, multilingual interaction, mediation strategies, classroom discourse, translanguaging, and interactional practices in educational contexts. It can be used for both qualitative and quantitative analyses.
Acknowledgement
University of Bologna
Project code:other
Project name:“L’intercomprensione all’Università: dalla ricerca alla didattica per l’internazionalizzazione”
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- Readme OIIC_annotated_version.txt
- Size
- 18.83 KB
- Format
- text/plain
- Description
- Text
- MD5
- f83ae43f8119681d30e39d2be8a48512

The file preview has not been generated yet. Please try again later or contact the system administrator dspace-clarin-it-ilc-help@ilc.cnr.it
- Name
- OIIC_ANNOTATED_VERSION.zip
- Size
- 1.4 MB
- Format
- application/zip
- Description
- Zip
- MD5
- f404319ee36096fad09dab2c00267091

The file preview has not been generated yet. Please try again later or contact the system administrator dspace-clarin-it-ilc-help@ilc.cnr.it

