Please use the following text to cite this item or export to a predefined format:
Sara Goggi, Sara Goggi remo Bindi, Lisa Biagini e Sergio Rossi, 1997, Corpus Parole (3 milions words), ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", http://hdl.handle.net/20.500.11752/ILC-1001
dc.contributor.authorSara Goggi, Sara Goggi remo Bindi, Lisa Biagini e Sergio Rossi
dc.date.accessioned2023-07-24T12:45:54Z
dc.date.available2023-07-24T12:45:54Z
dc.date.issued1997-10-26
dc.descriptionThe PAROLE project (Preparatory Action for Linguistic Resources Organization for Language Engineering) has produced a set of harmonized corpora and lexicons for a large number of European languages. Each corpus, made up of 20 million words, was built up as reference corpus for Human Language Technology applications, to provide full information about a large variety of text types in the language considered, to represent the use of contemporary language and to become the first nucleus of an electronic text library. The texts have been stored using a common format following the standards recommended in the CES (Corpus Encoding Standard), according to flexibility and multifunctionality criteria. The texts belong to a wide range of media and genres, selected in proportions aimed at reflecting their prominence within the society, classified according to medium, genre, topic and time of production.
dc.identifier.urihttp://hdl.handle.net/20.500.11752/ILC-1001
dc.language.isoita
dc.publisherIstituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR)
dc.relation.isreferencedbyhttps://zenodo.org/record/8167985
dc.rightsCreative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rights.labelPUB
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectCorpus
dc.subjectCorpus linguistics
dc.subjectDatabases
dc.titleCorpus Parole (3 milions words)
dc.typecorpus
local.brandingILC
local.contact.personSara Goggi sara.goggi@ilc.cnr.it Istituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR)
local.demo.urihttp://dbtvm1.ilc.cnr.it/Corpus/Parole.htm
local.files.count1
local.files.size0
local.has.filesyes
local.language.nameItalian
local.size.info63000000 bytes
local.sponsoreuFunds Grant agreement ID: LE24017 EU FP4-TELEMATICS 2C - Specific programme of research and technological development and demonstration in the area of telematic applications of common interest, 1994-1998
metashare.ResourceInfo#ContentInfo.mediaTypetext
 Files in this item
Name
Corpus Parole SGML.rar
Size
60.76 MB
Format
application/octet-stream
Description
Unknown
MD5
0ff7c8ddcf2be0a90679a27daf4c4e93
Preview
  File Preview