Opera del Vocabolario Italiano

Istituto del Consiglio Nazionale delle Ricerche

Consult the Corpus

The OVI databases are free and open access. To access the databases, click on the name of one of the various corpora located under the cat (GATTO) icon next to each description; then click on the name of the corpus at the center of the page.
The ItalNet version of the OVI corpus of Early Italian has no longer been updated since 2005, and therefore is no longer reliable. 
The OVI is responsible only for the databases listed on this page.
From February 2017, the Corpus TLIO and the Corpus OVI dell'Italiano antico are in a phase of deep renewal: displays the updating Criteria here

From January 2019, the Corpus OVI dell'Italiano antico is not lemmatized, but is possible to search the Corpus with GATTO's function of "lemmi muti".

To report any errors or simply make observations or express doubts, please send an e-mail message to the responsibles for lemmatization, Elena Artale e Diego Dotto.

CORPUS OVI DELL'ITALIANO ANTICO  

 

The corpus contains the entire collection of early Italian texts that OVI makes accessible to the public. It can be searched by word forms and with the GATTO's function of lemmi muti).

Access the Corpus

Corpus TLIO per il vocabolario

 

This is the corpus on which the Tesoro della Lingua Italiana delle Origini is written. It can be consulted by forms and by headwords.

Access the Corpus

Corpus TLIO RIDOTTO

 

This is the subcorpus of the Corpus TLIO that can be used for the drafting of the high attestation entries of the Tesoro della Lingua Italiana delle Origini. It can be consulted by forms and by headwords.

Access the Corpus

Corpus LatVolg 

Corpus della latinità volgarizzata

T​he corpus contains the Latin works into medieval vernacular translations ("volgarizzamenti"). It was conceived for lexical, translation studies and onomasiological analyses.



  Access the Corpus  

CORPUS tragallo

Corpus dei testi galloromanzi tradotti in volgare

The corpus contains Old French and Old Provençal works, as sources or models of Italian medieval texts. It was conceived for lexical, translation studies and onomasiological analyses



Access the Corpus

Corpus clpio

Concordanze della Lingua Poetica italiana delle Origini

This is the corpus edited by d'Arco Silvio Avalle in the 70s which contains 12th century manuscripts editions.





  Access the Corpus

Corpus LIRIO

Lirica Italiana delle Origini


Exaustive corpus of poetry from the Origins to 1400, edited by Fondazione Ezio Franceschini and University of Siena. It can be searched by word forms and with the GATTO's function of  lemmi muti ).
The integration in the OVI corpus is in progress.

  Access the Corpus  

Corpus remedia

 Classici Latini Volgarizzati


The corpus contains medical-scientific texts in several Romance languages.






  Access the Corpus  

Corpus divo

Dizionario dei Volgarizzamenti


This corpus contains the “volgarizzamenti” (i.e. the translations of Latin original texts into vernacular versions) of classical and late antiquity texts.




  Access the Corpus  

Corpus CLaVo

 Classici Latini Volgarizzati


The corpus is the collection of the Latin original texts translated into vernacular versions and included in the Corpus DiVo (see previous).





  Access the Corpus  

Corpus italart

L'italiano delle arti tra Medioevo e Rinascimento

Edited by Alessandro Aresti from University of Cagliari, this corpus contains texts related to the history of the Middle Ages and the Renaissance Italian art, edited within the  homonymous project (MSCA 2017 Individual Fellowship, 2018-2020)

  Access the Corpus  

Corpus ChVA

Chartae Vulgares Antiquiores


The oldest Italo-Romance texts (reproduced, edited and commented) edited by Vittorio Formentin, Nello Bertoletti e Antonio Ciaralli. This corpus, thanks to the collaboration with OVI, makes the edited texts, within the homonymous project, searchable. 
The corpus is fully integrated into the OVI corpus.


  Access the Corpus  

CORPUS RDP

 Rime Disperse di Petrarca


Edited by Roberto Leporatti, this corpus contains lyrics ascribed to Francesco Petrarca and published as part of the RDP project.








Access the Corpus

archivio datini

    


The corpus is the collection of the letters published by the Archives of Francesco di Marco Datini, developed by OVI on behalf of the State Archives of Prato. It is lemmatized by topics.
Its evolution is foreseen among the results of the RESTORE project.




  Access the Corpus  

Corpus VEV

Vocabolario storico-Etimologico del Veneziano

Edited by Luca D'Onghia and Lorenzo Tomasin, the corpus of venetian texts is included both in Corpus TLIO and Corpus OVI, published within the homonymous project. ​ It can be searched by word forms and with the GATTO's function of  lemmi muti ).  




Access the Corpus

corpus  ARTESIA

Archivio Testuale del Siciliano Antico


Edited by Mario Pagano, this corpus was implemented in GATTO at the University of Catania, with whose permission it is made available here.
The progressive integration, related to the 14th century section, into the OVI corpus is underway.

  Access the Corpus  

Corpus ATLiSOr

Archivio Testuale della Lingua Sarda delle Origini

Corpus of medical-scientific texts in different languages, edited by Giovanni Lupinu from the University of Sassari in collaboration with OVI; the corpus contains sardinian language texts from the origins to the end of 14th century.


  Access the Corpus  

Corpus cAO (diplo)

Corpus dell'Antico Occitano (edizioni diplomatiche)

Edited by Stefano Asperti, Francesco Carapezza, Maria Careri, Costanzo Di Girolamo, Paolo Di Luca, Giosuè Lachin, Walter Meliga, Paolo Squillacioti, it is the corpus of the diplomatic editions of troubadour songbooks and Occitan prose produced as part of the PRIN 2015 project CAO.

  Access the Corpus  

Corpus CAO (INTER)

Corpus dell'antico occitano (interpretative)

Edited by Stefano Asperti, Francesco Carapezza, Maria Careri, Costanzo Di Girolamo, Paolo Di Luca, Giosuè Lachin, Walter Meliga, Paolo Squillacioti, it is the corpus of the interpretative editions of troubadour songbooks and Occitan prose produced within the PRIN 2015 project CAO.

  Access the Corpus

Corpus ARSNOVA

I testi poetici intonati dai polifonisti dell'Ars Nova

Edited by Vittoria Brancato, Valeria Carrieri, Davide Checchi, Paolo Squillacioti, as part of the ERC AG European Ars Nova project, directed by Maria Sofia Lannutti. It is the corpus of the poetic texts in Italian, French and Latin set to music by the Ars Nova polyphonists. It can be consulted by forms, by headwords (or lemmas) and by hyperlemmas.

  Access the Corpus

Corpus FOLCHETTO

Corpus lemmatizzato delle poesie di Folchetto di Marsiglia

Edited ​​by Paolo Squillacioti, this corpus is a first experimental version of a lemmatized corpus of Trobadour poetry containing the poems of Folchetto di Marsiglia.






  Access the Corpus