{"id":6997,"date":"2022-08-12T02:04:49","date_gmt":"2022-08-12T00:04:49","guid":{"rendered":"https:\/\/www.ilc.cnr.it\/progetti\/paisa\/"},"modified":"2026-02-06T14:13:02","modified_gmt":"2026-02-06T13:13:02","slug":"paisa","status":"publish","type":"progetti","link":"https:\/\/www.ilc.cnr.it\/en\/progetti\/paisa\/","title":{"rendered":"PAIS\u00c0"},"content":{"rendered":"<div style=\"text-align: justify;\">The overall objective of the project PAIS\u00c0 is to overcome the technological barriers currently preventing web users from having interactive access to and use of large quantities of data of contemporary Italian to improve their language skills. The project is particularly targeted to second generation emigrants from Italy who keep Italian as a native language, but in severely limited usage, and third generation emigrants who have Italian as a second language (L2).<\/div>\n<div style=\"text-align: justify;\">To achieve this goal a large and richly annotated corpus of Italian web texts is created. The novelty of the project is using, for the corpus, a freely distributable sample of texts (Creative Commons license), automatically harvested from the web. Subsequently different annotation layers (morphosyntactic information, dependency relations, etc.) are added to the corpus by applying NLP (natural language processing) tools, which get adjusted and improved in the course of the project by integrating manual annotation data.<\/div>\n<div style=\"text-align: justify;\">Raw and annotated versions of the corpus are freely made available for download. In addition, direct access to the data will be provided via a multifaceted query interface for learners and users of Italian, thus fostering free online access to concrete contexts of use of contemporary Italian.<\/div>\n","protected":false},"excerpt":{"rendered":"<p>The overall objective of the project PAIS\u00c0 is to overcome the technological barriers currently preventing web users from having interactive&hellip;<\/p>\n","protected":false},"author":1,"featured_media":6995,"template":"","tag-sottositi":[],"acf":{"type_of_project":"3","acronym":"PAIS\u00c0","title":"Piattaforma per l\u2019Apprendimento dell\u2019Italiano Su corpora Annotati","funding_body":"Ministero dell'Istruzione, dell'Universit\u00e0 e della Ricerca","funding_programme":"Fondo per gli Investimenti della Ricerca di Base","grant_agreement":"FIRB-2006-RBNE072H7L","start_date":"20090601","end_date":"20120531","role":"Coordinator","project_coordinator":"Universit\u00e0 di Bologna (2009-2011) | ILC (2011-2012)","project_chair":[{"person":"Vito Pirrelli"}],"ilc_research_un":[{"person":"Vito Pirrelli"}],"programme_coord":"","contact_person":"","staff":[{"person":"Claudia Marzi"},{"person":"Marcello Ferro"}],"documentation":null,"websites":[{"website":"http:\/\/www.corpusitaliano.it\/it\/contents\/paisa.html"}],"nid":"251","lang":"it","tnid":"248"},"fimg_url":"https:\/\/www.ilc.cnr.it\/wp-content\/uploads\/2022\/08\/logo-small.png","jetpack_sharing_enabled":true,"publishpress_future_workflow_manual_trigger":{"enabledWorkflows":[]},"_links":{"self":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/progetti\/6997"}],"collection":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/progetti"}],"about":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/types\/progetti"}],"author":[{"embeddable":true,"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/users\/1"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/media\/6995"}],"wp:attachment":[{"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/media?parent=6997"}],"wp:term":[{"taxonomy":"tag-sottositi","embeddable":true,"href":"https:\/\/www.ilc.cnr.it\/en\/wp-json\/wp\/v2\/tag-sottositi?post=6997"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}