NEWS ARCHIVE

Updated versions of the OVI Corpus, the TLIO Corpus, and the OVI Corpus of Early Italian have been released: they are aligned with the new structure developed in the CoVo Project (The corpus of the Italian vocabulary of the origins: philological update and interoperability - PRIN 2015), aimed at the philological and textual renewal of the two corpora.

TLIO Corpus. It is a lemmatized corpus, and includes the texts of reference for the vocabulary.
The editions of two texts already present in outdated editions in the TLIO Corpus have been updated (see list here) and 65 new texts hitherto absent have been inserted (see list here ). The less reliable editions of 8 texts previously present in the corpus in double (or, in one case, triple) edition have also been eliminated (see list here)
The new version of the TLIO Corpus that today is published online includes 2,324 texts (with an increase of 56 units compared to the previous version), for a total of 22,567,996 occurrences, 460,881 distinct graphic forms, 120,331 entries, and 4,007,895 lemmatized occurrences.

OVI Corpus. It is a non-lemmatized corpus (but searchable with the "lemmi muti" GATTOWeb function), which includes the TLIO Corpus and extends it to include all the published texts dating before the end of the XIV Century: it is the corpus that aims to allow the interrogation of the entire textual heritage of early Italian.
There have been inserted texts so far absent, which do not fall within the inclusion criteria of the TLIO Corpus (see list here ).
The new version of the OVI Corpus that today is published online includes 2,446 texts (with an increase of 60 units compared to the previous version), for a total of 23,874,376 occurrences, 479,915 distinct graphic forms.