It is a lemmatized corpus, and includes the texts of reference for the TLIO. 58 new texts hitherto absent have been inserted (see list here).
The new
version of the TLIO Corpus that today is published online includes 3,353 texts
(with an increase of 58 units compared to the September 9, 2024 version), for a
total of 24,160,197 occurrences (with an increment of 111,476 occurrences), 500,446 distinct
graphic forms, 127,579 lemmas, and 4,856,664 lemmatized
occurrences (with an increment of 45,783 occurrences).
It is a non-lemmatized corpus (but searchable with the “lemmi muti” GattoWeb function), which includes the TLIO Corpus and extends it to include all the published texts dating before the end of the XIV Century: it is the corpus that aims to allow the interrogation of the entire textual heritage of early Italian.
The new
version of the OVI Corpus that today is published online includes 3,725 texts
(with an increase of 74 units compared to the
September 9, 2024 version; see list here), for a
total of 30,825,286 occurrences (with an increment of 143,288 occurrences), and 562,342 distinct graphic forms.