Please use the following text to cite this item or export to a predefined format:
Mauri, Caterina; Ballarè, Silvia and Zucchini, Eleonora, 2024, KIParla - KIPasti transcripts, ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", http://hdl.handle.net/20.500.11752/OPEN-1049
dc.contributor.authorMauri, Caterina
dc.contributor.authorBallarè, Silvia
dc.contributor.authorZucchini, Eleonora
dc.date.accessioned2025-10-07T05:23:59Z
dc.date.available2025-10-07T05:23:59Z
dc.date.issued2024-04-30
dc.descriptionThe KIPasti corpus is part of the larger KIParla collection (www.kiparla.it), which can be freely queried through the NoSketch Engine interface. The ParlaBO corpus was compiled within the framework of “DiverSIta – Diversity in spoken Italian” project, funded by the Italian Ministry of University and Research (MUR) (PRIN 2022 PNRR Call). It consists of over 40 hours of spoken data collected in thirteen different Italian regions (Abruzzo, Basilicata, Calabria, Campania, Emilia-Romagna, Lazio, Lombardy, Marche, Apulia, Sardinia, Tuscany, Umbria, Veneto) during mealtime conversations, generally within family settings. The interactions, recorded between 2020 and 2024, involved 145 speakers with different origins, ages, education levels, and occupations. Italian is predominantly used in all interactions, but in most of them (78%), various passages in dialect are also present. The transcriptions have been anonymized. Overall, the module is made up of 63 conversations. This repository contains: - metadata for both speakers (occupation, gender, age, origin, L1, educational achievement) and conversations (collection point, year, languages used), in the metadata subfolder - descriptions of the set of transcription conventions used for this module - for each conversation you will find: .eaf file in eaf/ folder (time-aligned Jefferson-style transcriptions) .txt file in linear-jefferson/ folder (linearized Jefferson-style transcription) .txt file in linear-orthographic/ folder (linearized transcription retaining only orthographic words) .tsv file in tsv/ folder (tokenised version of the transcription). More information can be found in the README.md file. Due to GDPR restrictions, pseudo-anonymized audio files (MP3) are available under a restricted-access license. To request access, please contact the corpus coordinators through the KIParla website and follow the provided procedure. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
dc.identifier.urihttp://hdl.handle.net/20.500.11752/OPEN-1049
dc.language.isoita
dc.publisherAlma Mater Studiorum – Università di Bologna
dc.relation.isreferencedbyhttp://ceur-ws.org/Vol-2481/
dc.relation.isreferencedbyhttps://doi.org/10.60760/unibo/kipasti
dc.rightsCreative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.labelPUB
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.urihttps://kiparla.it/kipasti/
dc.subjectkitchen-table conversations
dc.subjectspontaneous speech
dc.subjecthuman-human spoken dialogues
dc.subjectspoken Italian
dc.titleKIParla - KIPasti transcripts
dc.typecorpus
local.brandingOPEN
local.contact.personCaterina Mauri caterina.mauri@unibo.it Alma Mater Studiorum – Università di Bologna
local.contact.personSilvia Ballarè silvia.ballare@unibo.it Alma Mater Studiorum – Università di Bologna
local.contact.personLudovica Pannitto ellepanitto@gmail.com Alma Mater Studiorum – Università di Bologna
local.demo.urihttps://kiparla.it/search/
local.files.count4
local.files.size10411557
local.has.filesyes
local.language.nameItalian
local.size.info430407 tokens
local.size.info42 hours
local.size.info63 texts
local.sponsornationalFunds PRIN 2022 PNRR n. P2022RFR8T Unione Europea – NextGenerationEU a valere sul Piano Nazionale di Ripresa e Resilienza (PNRR) – Missione 4 Istruzione e ricerca DiverSIta – Diversity in spoken Italian
metashare.ResourceInfo#ContentInfo.mediaTypetext
 Files in this item
Name
KIPasti_transcripts.zip
Size
9.9 MB
Format
application/zip
Description
Zip
MD5
d13de1af31625f3e639bace05e087285
Preview
  File Preview
Name
README.md
Size
10.68 KB
Format
application/octet-stream
Description
Unknown
MD5
5d78572a24c4e108202b6581124c8b98
Preview
  File Preview
Name
LICENSE
Size
20.36 KB
Format
application/octet-stream
Description
Unknown
MD5
5d4469701edbc9ee68ddc28a92aa7167
Preview
  File Preview
Name
CITATION.cff
Size
3.57 KB
Format
application/octet-stream
Description
Unknown
MD5
ae9faba4035e4ae21109adbdb7b9a4a5
Preview
  File Preview