Opera del Vocabolario Italiano

Istituto del Consiglio Nazionale delle Ricerche

Search

GATTO is designed to allow lexicographical searches within one or more texts gathered in a corpus, or within dynamically defined corpus subsets.
The majority of searches are carried out in order to find where occurrences of a specified word form can be found within indicated texts. Other searches allow the extraction of different types of information, such as a list of word forms or lemmas, incipits, index locorum and usage statistics.
The Search environment is used to search for information in a previously created and, if necessary, partially or entirely lemmatised, corpus. You can search by word form, lemma, grammatical category or hyperlemma. In this environment, corpus subsets can be dynamically defined and other kinds of information extracted, such as word form and lemma indexes for single texts, single corpus subsets or combinations of more than one corpus subset. You can also display parts of a text, compare the frequency with which different word forms appear in various texts, and view graphical output to show the integral distribution of words in a text and the chronological distribution of located contexts.
Search results can be printed or saved to disk for future use.
Texts within a corpus may be searched in blocks, one by one, or in groups defined according to bibliographical information. You can carry out searches either within the corpus or within one or more of these selections, termed corpus subsets. The composition of each subset can be saved to disk and recalled for future sessions. Up to 6 subsets can be defined at any one time.
Search boxes are graphical elements into which you enter the items (e.g. word forms or lemmas) for which you wish to search. You can use wildcards to extend the search to further words, and there are also a range of options which can change the way in which the search is carried out.
When searching by word form, you can cut and paste a paragraph of text into the relevant window in GATTO. The text will be analysed and the corpus searched for occurrences of the words within it.    

A range of options can be used to size and shorten contexts to be extracted from the corpus and printed or saved to disk.
The final result of the search is a series of contexts. Each context is made up of:

located occurrence,
context,
location,
structural reference.

Each context contains bibliographical details for the text from which it is taken.
If the occurrence is lemmatised, then information about the lemma and, if there is one, hyperlemma, can be obtained.
The size of each context can be individually redefined to a set number of constituent words or periods by directly selecting the words that begin and end the context. It is also possible to use KWIC (KeyWord In Context). 

The contexts obtained at the end of the process can be presented as follows:

in chronological order according to the text from which they are taken, and, within the texts, alphabetically by located word form;
alphabetically by located word form, and, within the word form, in chronological order according to the text from which they are taken;
in chronological order according to the text from which they are taken, and, within the text, in the order in which they appear within it.

By default, texts are ordered chronologically, but this can be altered so that texts are ordered by different crieria, e.g. alphabetically by author or title.
The way in which results are ordered differs slightly according to what kind of search was used to obtain them. 
Generally speaking, word forms and/or lemmas will be ordered alphabetically by the first letter of the word, as would be expected. However, it is also possible to order elements by the last letter of the word. This is known as ordering by inverse lemma or word form. All types of ordering may be in either ascending or descending order.
The material recovered in this way can be extracted, either entirely or in part, and printed or saved to disk in RTF (Rich Text Format). 

The following types of search can be carried out using GATTO: