Abstract
Senso Comune is an open knowledge base for the Italian language, currently under construction and available through a Web-based collaborative platform. The resource integrates dictionary data coming from both users and legacy resources with an ontological backbone, which provides foundations for a formal characterization of lexical semantic structures (frames). A nucleus of basic Italian lemmas, which have been semantically analyzed and classified, is available for both online access and download. A restricted community of contributors is currently working on increasing the lexical coverage of the resource.
Notes
- 1.
- 2.
Besides the authors of this paper, the group, led by Tullio De Mauro, includes Nicola Guarino, Maurizio Lenzerini and Laure Vieu.
- 3.
IBM Italia Foundation’s symposium La dimensione semantica dell’Information Technology (The semantic dimension of Information Technology), Rome, November 27, 2006.
- 4.
- 5.
See for example: http://multiwordnet.itc.it/english/home
- 6.
- 7.
Lexicons often omit any reference to ontological categories that are not lexicalised in a language, although such references do sometimes occur, as with EuroWordNet’s ILIs or FrameNet’s non-lexicalised frames.
- 8.
In this respect, our approach is essentially different from OntoNotes [32], where multi-lingual corpora have been annotated with shallow semantic features based on the Omega ontology. Omega contains “no formal concept definitions and only relatively few interconnections” [33], whereas Senso Comune is explicitly grounded on an ontological model (see Sect. 2.2).
- 9.
Hyponymy, antonymy, troponymy, causality, similarity, etc.
- 10.
- 11.
- 12.
For an updated list of wordnet projects see: http://www.globalwordnet.org/
- 13.
- 14.
- 15.
- 16.
There is yet another WordNet for Italian developed by a company, but it is not freely accessible.
- 17.
Grande dizionario italiano dell’uso (Gradit), Torino, UTET
- 18.
Two teams of five linguists each, based in Rome and Bologna, under the supervision of Isabella Chiari and Malvina Nissim, were dedicated to this task.
- 19.
- 20.
- 21.
For the sake of readability, we do not go through the basic senses of the verb ‘to walk’, and we assume that s1 is adequately selected by the user.
- 22.
Note that we may wish to distinguish descriptions of functions from actual functions, namely those which are performed at a certain time by a given object. In the above example we simplify this distinction by focusing only on the latter case.
- 23.
Uncertainty will be included only in future releases of the TMEO system.
- 24.
- 25.
See for example: http://msw2.deri.ie/
References
Assem Mv, Gangemi A, Schreiber G (2006) Conversion of WordNet to a standard RDF/OWL representation. In: Proceedings of the international conference on language resources and evaluation (LREC), Genoa, Italy
Baader F, Calvanese D, McGuinness DL, Nardi D, Patel-Schneider PF (eds) (2003) The description logic handbook: theory, implementation and applications. Cambridge University Press, New York
Basile V, Bos J, Evang K, Venhuizen N (2012) Developing a large semantically annotated corpus. In: Proceedings of LREC 2012, Istanbul, Turkey
Bonial C, Corvey W, Palmer M, Petukhova V, Bunt H (2011) A hierarchical unification of LIRICS and VerbNet semantic roles. In: 2011 fifth IEEE international conference on semantic computing (ICSC), Palo Alto. IEEE, pp 483–489
Bortolini U, Tagliavini C, Zampolli A (1972) Lessico di frequenza della lingua italiana contemporanea. Garzanti, Milano
Buitelaar P, Cimiano P, Haase P, Sintek M (2009) Towards linguistically grounded ontologies. In: Aroyo L, Traverso P, Ciravegna F, Cimiano P, Heath T, Hyvönen E, Mizoguchi R, Oren E, Sabou M, Simperl E (eds) The semantic web: research and applications. Lecture notes in computer science, vol 5554, Springer, Berlin/Heidelberg, pp 111–125
Burchardt A, Erk K, Frank A, Kowalski A, Pado S, Pinkal M (2006) The SALSA corpus: a German corpus resource for lexical semantics. In: Proceedings of LREC 2006, Genoa
Chiarcos C, Nordhoff S, Hellmann S (eds) (2012) Integrating WordNet and Wiktionary with lemon. Springer, Frankfurt
Chiari I, Oltramari A, Vetere G (2010) Di cosa parliamo quando parliamo fondamentale? In: Atti del Convegno della Società di linguistica italiana (SLI 2010), Viterbo, pp 221–236
De Mauro T (1965) Introduzione alla semantica. Laterza, Bari
De Mauro T (1980) Guida all’uso delle Parole. Editori riuniti, Roma
De Mauro T, Mancini F, Vedovelli M, Voghera M (1993) Lessico di frequenza dell’italiano parlato (LIP). Etaslibri, Milano
Fellbaum C (1998) WordNet - An Electronic Lexical Database. MIT Press, Cambridge, Massachusetts
Fillmore CJ (1968) The case for case. In: Bach E, Harms T (eds) Universals in linguistic theory. Rinehart and Winston, New York
Gangemi A (2010) What’s in a schema? A formal metamodel for ECG and FrameNet. Studies in Natural Language Processing. Cambridge University Press, Cambridge
Gildea D, Jurafsky D (2002) Automatic labeling of semantic roles. Comput Linguist 28(3):245–288
Giuglea AM, Moschitti A (2006) Semantic role labeling via FrameNet, VerbNet and PropBank. In: Proceedings of the COLING-ACL, Association for Computational Linguistics, Stroudsburg, PA, USA, ACL-44, pp 929–936. DOI 10.3115/1220175.1220292
Kipper K, Korhonen A, Ryant N, Palmer M (2008) A large-scale classification of English verbs. Lang Resour Eval 42(1):21–40
Lenci A, Bel N, Busa F, Calzolari N, Gola E, Monachini M, Ogonowski A, Peters I, Peters W, Ruimy N, Villegas M, Zampolli A (2000) SIMPLE: a general framework for the development of multilingual lexicons. Int J Lexicogr 13(4):249–263
Lenci A, Calzolari N, Zampolli A (2002) From text to content: computational lexicons and the semantic web. In: Eighteenth national conference on artificial intelligence; AAAI workshop, “Semantic web meets language resources”, Edmonton, Alberta, Canada
Lenci A, Johnson M, Lapesa G (2010) Building an Italian FrameNet through semi-automatic corpus analysis. In: Proceedings of LREC 2010, Valletta, pp 12–19
Levin B (1993) English verb classes and alternations: a preliminary investigation. University of Chicago Press, Chicago
Màrquez L, Carreras X, Litkowski KC, Stevenson S (2008) Semantic role labeling: an introduction to the special issue. Comput Linguist 34(2):145–159
Masolo C, Gangemi A, Guarino N, Oltramari A, Schneider L (2002) WonderWeb deliverable D17: the WonderWeb library of foundational ontologies. Technical report
Minsky M (1997) A framework for representing knowledge. In: Winston P (ed) Mind design. MIT Press, Cambridge, pp 111–142
Nuzzolese AG, Gangemi A, Presutti V (2011) Gathering lexical linked data and knowledge patterns from FrameNet. In: Proceedings of international conference on knowledge capture (K-CAP2011), Banff
Oltramari A, Mehler A, Kühnberger KU, Lobin H, Lüngen H, Storrer A, Witt A (2012) An introduction to hybrid semantics: the role of cognition in semantic resources, vol 370. Springer, Berlin/Heidelberg, pp 97–109
Pantel P, Bhagat R, Coppola B, Chklovski T, Hovy EH (2007) ISP: learning inferential selectional preferences. In: Proceedings of HLT-NAACL, Rochester, pp 564–571
Patwardhan S, Pedersen T (2006) Using WordNet-based context vectors to estimate the semantic relatedness of concepts. In: Proceedings of the EACL 2006 workshop on making sense of sense: bringing computational linguistics and psycholinguistics together, Trento, Italy, pp 1–8
Pianta E, Bentivogli L, Girardi C (2002) MultiWordNet: developing an aligned multilingual database. In: Proceedings of the first international conference on global WordNet. Central Institute of Indian Languages, Mysore
Poesio M (2005) Domain modelling and NLP: formal ontologies? Lexica? Or a bit of both? Appl Ontol 1(1):27–33
Pradhan S, Hovy E, Marcus M, Palmer M, Ramshaw L, Weischedel R (2007) OntoNotes: a unified relational semantic representation. In: Proceedings of the first IEEE international conference on semantic computing (ICSC-07), Irvine, CA
Prévot L, Huang CR, Calzolari N, Gangemi A, Lenci A, Oltramari A (eds) (2010) Ontology and the lexicon. Cambridge University Press, Cambridge/New York
Pustejovsky J (1995) The generative lexicon. MIT Press, Cambridge
Pustejovsky J, Jezek E (2008) Semantic coercion in language: beyond distributional analysis. Ital J Linguist 20(1):175–208
Reale G (1990) A history of ancient philosophy: Plato and Aristotle. SUNY Press, Albany
Resnik P (1996) Selectional constraints: an information-theoretic model and its computational realization. Cognition 61:127–159
Roventini A, Alonge A, Calzolari N, Magnini B, Bertagna F (2000) ItalWordNet: a large semantic database for Italian. In: Proceedings of LREC 2000, Athens, pp 783–790
Ruppenhofer J, Ellsworth M, Petruck M, Johnson C (2005) FrameNet: Theory and Practice. Available at: http://framenet.icsi.berkeley.edu/
Searle J (1995) The construction of social reality. Free Press, New York
Settles B, Small K, Tomanek K (eds) (2010) Proceedings of the NAACL HLT 2010 workshop on active learning for natural language processing. Association for Computational Linguistics, Los Angeles
Shi L, Mihalcea R (2005) Putting pieces together: combining FrameNet, VerbNet and WordNet for robust semantic parsing. In: Proceedings of the 6th international conference on computational linguistics and intelligent text processing, CICLing 2005. Springer, Berlin/Heidelberg/New York, pp 100–111
Simons P (ed) (1987) Parts: a study in ontology. Clarendon Press, Oxford
Stevenson M, Greenwood M (2006) Learning information extraction patterns using WordNet. In: Third international global WordNet conference (GWNC-2006), Jeju Island, Korea, pp 52–60
Suárez-Figueroa MC, Gómez-Pérez A, Motta E, Gangemi A (eds) (2012) Ontology engineering in a networked world. Springer, Berlin
Tonelli S (2010) Semi-automatic techniques for extending the FrameNet lexical database to new languages. Ph.D. thesis, Department of Language Sciences, Università Ca’ Foscari Venezia, Venezia
Tonelli S, Pighin D, Giuliano C, Pianta E (2009) Semi-automatic development of FrameNet for Italian. In: Proceedings of the FrameNet workshop and masterclass, Milano
Vetere G, Oltramari A, Chiari I, Jezek E, Vieu L, Zanzotto FM (2012) Senso Comune, an open knowledge base for Italian. TAL – Traitement Automatique des Langues, Special Issue on “Free language resources”. 52(3):217–243
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Oltramari, A. et al. (2013). Senso Comune: A Collaborative Knowledge Resource for Italian. In: Gurevych, I., Kim, J. (eds) The People’s Web Meets NLP. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35085-6_2
DOI: https://doi.org/10.1007/978-3-642-35085-6_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35084-9
Online ISBN: 978-3-642-35085-6
eBook Packages: Computer Science (R0)