{"id":23356,"date":"2025-12-17T09:44:32","date_gmt":"2025-12-17T08:44:32","guid":{"rendered":"https:\/\/www.ilc.cnr.it\/?post_type=progetti&#038;p=23356"},"modified":"2025-12-17T10:01:06","modified_gmt":"2025-12-17T09:01:06","slug":"gdliplus","status":"publish","type":"progetti","link":"https:\/\/www.ilc.cnr.it\/en\/progetti\/gdliplus\/","title":{"rendered":"GDLIplus"},"content":{"rendered":"\n<p>Published in 21 volumes between 1961 and 2002, the \u00abGrande dizionario della lingua italiana\u00bb (<em>GDLI<\/em>) is the most important historical dictionary of the Italian language. Like all historical dictionaries, the <em>GDLI<\/em> bases the lexicographical description of words on the rich collection of quotes, which cover the entire history of the Italian language. <\/p>\n\n\n\n<p>Thanks to the digitization work of the <em>GDLI<\/em> carried out by <a href=\"https:\/\/www.ilc.cnr.it\/en\/\" data-type=\"page\" data-id=\"7629\"><strong>Cnr-Istituto di Linguistica Computazionale \u201cAntonio Zampolli\u201d\u00a0<\/strong>(<strong>CNR-ILC<\/strong>)<\/a> in collaboration with the <strong><a href=\"https:\/\/accademiadellacrusca.it\/en\" data-type=\"link\" data-id=\"https:\/\/accademiadellacrusca.it\/en\" target=\"_blank\" rel=\"noreferrer noopener\">Accademia della Crusca<\/a><\/strong>, we can estimate that the corpus of quotes (<em>Corpus GDLIplus<\/em>) includes over two and a half million entries, taken from more than 14,000 sources (and over 6,000 authors), with a total of about 50 million occurrences. <\/p>\n\n\n\n<p>Italian has long remained a \u201cwritten\u201d language: the history of Italian is, in fact, at least until <em>I Promessi Sposi<\/em>, the history of literary Italian. It is therefore easy to understand how the <em>Corpus GDLIplus<\/em> can be considered a formidable resource for the history of the Italian language, useful not only to scholars but also to teachers and students, and even to everyday Internet users. The <strong>GDLIplus project <\/strong>aims to create this resource.<\/p>\n\n\n\n<p>To achieve this goal, two main activities are required:<\/p>\n\n\n\n<ol>\n<li>The corpus must be annotated: each word must be associated with linguistic information (lemma and morpho-syntactic category). Despite recent advances, methods and techniques for automatic language processing are not immediately applicable to historical texts and require specializations at various levels. <\/li>\n\n\n\n<li>The lexicographical origin of the texts in the corpus poses specific management challenges. The most significant issue concerns cases where the same textual passage is cited multiple times under different entries. Implementing the <em>Corpus GDLIplus<\/em> requires developing a strategy for managing repeated examples, and, even before that, establishing a method for their automatic identification.<\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Published in 21 volumes between 1961 and 2002, the \u00abGrande dizionario della lingua italiana\u00bb (GDLI) is the most important historical&hellip;<\/p>\n","protected":false},"author":3,"featured_media":0,"template":"","tag-sottositi":[],"acf":{"type_of_project":"4","acronym":"GDLIplus","title":"A New Resource for the History of Italian: The Corpus of the Quotes in the \u00abGrande dizionario della lingua italiana\u00bb","funding_body":"Regione Toscana | Accademia della Crusca","funding_programme":"Programma regionale FSE+ 2021-2027","grant_agreement":"","start_date":"20251001","end_date":"20270930","role":"Coordinator","project_coordinator":"Elisa Guadagnini (CNR-ILC)","project_chair":[{"person":"Elisa Guadagnini"}],"ilc_research_un":null,"programme_coord":"","contact_person":"","staff":[{"person":"Marco Biffi, Responsabile scientifico per l\u2019Accademia della Crusca"},{"person":"Eva Sassolini (CNR-ILC)"},{"person":"Simonetta Montemagni (CNR-ILC)"},{"person":"Manuel Favaro (CNR-ILC)"},{"person":"Noemi Terreni (CNR-ILC)"}],"documentation":null,"websites":null,"nid":"","lang":"","tnid":""},"fimg_url":false,"jetpack_sharing_enabled":true,"publishpress_future_workflow_manual_trigger":{"enabledWorkflows":[]},"_links":{"self":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/progetti\/23356"}],"collection":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/progetti"}],"about":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/types\/progetti"}],"author":[{"embeddable":true,"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/users\/3"}],"wp:attachment":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/media?parent=23356"}],"wp:term":[{"taxonomy":"tag-sottositi","embeddable":true,"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/tag-sottositi?post=23356"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}