Abstract
In this paper we present four experiments on the analysis Italian social media texts using a linguistically-based semantic approach. The experiments are respectively: two on newspaper articles about two political crises, one on a twitter corpus centered on political themes, and one on a case study of strategic plan programs of candidates to the presidency of our university. The analyses carried out by the same system, focus on semantic features of texts highlighting three main traits: “factivity” or factuality, “subjectivity” and polarity. The system uses semantic knowledge derived from deep linguistic analysis at propositional level to classify texts at a fine-grained level. As will be shown in the paper, linguistically-based semantic information allows for neat distinction of writing styles when comparing newspapers writing styles, for irony detection in tweets, and in different degrees, for making readability judgements.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
Final overall results are available online, http://www.slideshare.net/vivianapatti9/evalita-sentipolc14.
References
Bagga, A., Pustejovsky, J., Zadrozny, W. (eds.): Syntactic and Semantic Complexity in Natural Language Processing Systems: NAACL-ANLP 2000 Workshop. Association for Computational Linguistics, New Brunswick (2000)
Basile, V., Bolioli, A., Nissim, M., Patti, V., Rosso, P.: Overview of the Evalita 2014 SENTIment POLarity Classification Task. In: Proceedings of EVALITA 2014, Pisa (2014)
Bos, J., Delmonte, R. (eds.): Semantics in Text Processing (STEP), Research in Computational Semantics, vol. 1. College Publications, London (2008)
Esuli, A., Sebastiani, F.: SentiWordNet: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation LREC 2006 (2006)
Delmonte, R.: Computational Linguistic Text Processing - Logical Form, Logical Form, Semantic Interpretation, Discourse Relations and Question Answering. Nova Science Publishers, New York (2007)
Delmonte, R., Tonelli, S., Tripodi, R.: Semantic processing for text entailment with VENSES. In: TAC 2009 Proceedings Papers (2010). http://www.nist.gov/tac/publications/2009/papers.html
Delmonte, R.: Computational Linguistic Text Processing - Lexicon, Grammar, Parsing and Anaphora Resolution. Nova Science Publishers, New York (2009)
Delmonte, R.: Predicate argument structures for information extractionfrom dependency representations: null elements are missing, 2013. In: Lai, C., Giuliani, A., Semeraro, G. (eds.) DART 2012: Revised and Invited Papers. Studies in Computational Intelligence, vol. 515, pp. 25–49. Springer, Heidelberg (2012)
Delmonte, R., Gifu, D., Tripodi, R.: Opinion and factivity analysis of Italian political discourse. In: Basili, R., Sebastiani, F., Semeraro, G. (eds.) Proceedings of the 4th Italian Information Retrieval Workshop, IIR 2013, Pisa. CEUR Workshop Proceedings (CEUR-WS.org), vol. 964, pp. 88–99 (2013). http://ceur-ws.org
Delmonte, R., Pallotta, V.: Opinion mining and sentiment analysis need text understanding. Advances in Distributed Agent-based Retrieval Tools. Advances in Intelligent and Soft Computing, pp. 81–95. Springer, Heidelberg (2011)
Delmonte R., Pianta, E.: IMMORTALE - Analizzatore Morfologico, Tagger e Lemmatizzatore per l’Italiano. In: Atti V Convegno AI*IA “Cibernetica e Machine Learning”, pp. 19–22. Napoli (1996)
Delmonte, R., Pianta, E.: Immortal: how to detect misspelled from unknown words. In: BULAG, PCUF, pp. 193–218. Besanon (1998)
Delmonte, R.: Italian lemmatization by rules with getaruns. In: Magnini, B., Cutugno, F., Falcone, M., Pianta, E. (eds.) EVALITA 2011. LNCS, vol. 7689, pp. 239–248. Springer, Heidelberg (2012)
DuBay, W.H.: The Principles of Readability. Impact Information, Costa Mesa (2004)
Heilman, M., Collins-Thompson, K., Eskenazi, M.: An analysis of statistical models and features for reading difficulty prediction. In: Proceedings of the 3rd Workshop on Innovative Use of NLP for Building Educational Applications. Columbus (2008)
Heilman, M., Collins-Thompson, K., Callan, J., Eskenazi, M.: Combining lexical and grammatical features to improve readability measures for first and second language texts. In: HLT-NAACL 2007, pp. 460–467. Association for Computational Linguistics, Rochester (2007)
Kim, S.-M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), pp. 1367–1373 (2004)
Kao, J., Dan, J.: A computational analysis of style, affect, and imagery in contemporary poetry. In: NAACL Workshop on Computational Linguistics for Literature (2012)
Feng, L., Jansche, M., Huenerfauth, M., Elhadad, N.: A comparison of features for automatic readability assessment. In: Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing (2010)
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL), pp. 271–278 (2004)
Pennebaker, J.W., Booth, R.J., Francis, M.E.: Linguistic Inquiry and Word Count (LIWC). http://www.liwc.net/
Petersen, S.E., Ostendorf, M.: A machine learning approach to reading level assessment. Comput. Speech Lang. 23, 86–106 (2009)
Polanyi, L., Zaenen, A.: Contextual valence shifters. In: Wiebe, J. (ed.) Computing Attitude and Affect in Text: Theory and Applications, pp. 1–10. Springer, Dordrecht (2006)
Pollack, M., Pereira, F.: Incremental interpretation. Artif. Intell. 50, 37–82 (1991)
Pollard, S., Biermann, A.W.: A measure of semantic complexity for natural language systems. In: Bagga et al. Proceedings of the 2000 NAACL-ANLP Workshop on Syntactic and semantic complexity in natural language processing systems, vol. 1, pp. 42–46 (2000)
Reyes, A., Rosso, P., Buscaldi, D.: From humor recognition to irony detection: the figurative language of social media. Data Knowl. Eng. 74, 1–12 (2012)
Reyes, A., Rosso, P.: On the difficulty of automatically detecting irony: beyond a simple case of negation. Knowl. Inf. Syst. 40(3), 595–614 (2013)
Roark, B., Mitchell, M., Hollingshead, K.: Syntactic complexity measures for detecting mild cognitive impairment. In: Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing, BioNLP 2007, pp. 1–8. Association for Computational Linguistics, Stroudsburg (2011)
Saur, R., Pustejovsky, J.: Are you sure that this happened? Assessing the factuality degree of events in text. Comput. Linguist. 38(2), 261–299 (2012)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)
Turney, P.D., Littman, M.L.: Inference of semantic orientation from association. ACM Trans. Inf. Syst. (TOIS) 21(4), 315–346 (2003)
Tonelli, S., Manh, K.T., Pianta, E.: Making readability indices readable. In: Proceedings NAACL-HLT 2012 Workshop on Predicting and Improving Text Readability for target reader populations, pp. 40–48. Montral (2012)
Wiebe, J., Wilson, T., Cardie, C.: Annotating expressions of opinions and emotions in language. Lang. Resour. Eval. 39(2), 165–210 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Delmonte, R., Tripodi, R. (2016). Semantics for Social Media. In: Quesada, J., Martín Mateos, FJ., Lopez-Soto, T. (eds) Future and Emergent Trends in Language Technology. FETLT 2015. Lecture Notes in Computer Science(), vol 9577. Springer, Cham. https://doi.org/10.1007/978-3-319-33500-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-33500-1_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-33499-8
Online ISBN: 978-3-319-33500-1
eBook Packages: Computer ScienceComputer Science (R0)