It is a lemmatized corpus that includes the reference texts for the TLIO. Thirty-five previously absent texts have been added (see list here).
The new version of the TLIO Corpus published online today includes 3,245 texts (35 more than the September 18, 2023 version), for a total of 24,012,722 occurrences (an increase of 198,173), 496,445 distinct graphic forms, 126,575 lemmas, and 4,708,027 lemmatized occurrences (an increase of 85,700).
It is a non-lemmatized corpus (though searchable with the GATTOWeb “lemmi muti” function) that includes the TLIO Corpus and extends it to all published texts dating from before the end of the 14th century. It aims to make the entire textual heritage of early Italian searchable.
The new version of the OVI Corpus published online today includes 3,512 texts (35 more than the October 18, 2023 version, the same texts added to the TLIO Corpus; see list here), for a total of 30,443,280 occurrences (an increase of 198,173) and 555,757 distinct graphic forms.