Analysis
An Overview
Algorithms
Wide variety of classification approaches used:
simple keyword counting methods, with or without
scoring
rule-based methods
content analytical methods (statistical)
SVM, naive Bayes, and other statistical classifiers
used alone, sequentially, or as a community
Comparison of results does not provide a
definite answer as to which of these methods is
the best for sentiment or subjectivity tagging
choice of features and training domain have more
impact on accuracy than choice of classification
algorithm
comparison of performance of systems evaluated on
different domains or different feature sets is not
conclusive
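To make the simplest approach in the list concrete, here is a minimal sketch of keyword counting "with or without scoring", plus one hand-written negation rule. The lexicon, its weights, the negator list, and the function names are all hypothetical, chosen only for illustration; they are not taken from any published resource or from the systems surveyed here.

```python
import re

# Hypothetical toy lexicon: the words and weights are illustrative assumptions.
LEXICON = {
    "good": 1.0, "great": 2.0, "excellent": 2.0,
    "bad": -1.0, "poor": -1.0, "terrible": -2.0,
}
# Hypothetical negators for a single rule: flip the polarity of the next keyword.
NEGATORS = {"not", "never", "no"}


def score_text(text, lexicon=LEXICON, use_scores=True):
    """Sum keyword polarities; with use_scores=False every keyword counts as +1 or -1."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = 0.0
    negate = False
    for tok in tokens:
        if tok in NEGATORS:
            negate = True
            continue
        if tok in lexicon:
            weight = lexicon[tok]
            if not use_scores:
                # Plain counting: keep only the sign of the keyword.
                weight = 1.0 if weight > 0 else -1.0
            total += -weight if negate else weight
        # The negation rule only reaches the token immediately after the negator.
        negate = False
    return total


def classify(text, use_scores=True):
    """Map the summed score to a coarse sentiment label."""
    s = score_text(text, use_scores=use_scores)
    if s > 0:
        return "positive"
    if s < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    for sample in ["The acting was great", "not good", "good good terrible"]:
        print(sample, "->", classify(sample), "/", classify(sample, use_scores=False))
```

In this sketch, "good good terrible" comes out neutral with scoring and positive with plain counting. This is a small illustration of how feature choices alone can change the outcome.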
Summary
Sentiment and subjectivity
analysis has evolved into a
strong research stream in NLP
State-of-the-art systems can
reach up to 90% accuracy on
certain domains
But a generally applicable method
is still needed
Research Directions
Development of semi-supervised machine-
learning approaches that will maximize
the usefulness of the available
resources and ensure domain adaptation
with limited in-domain data
Creation of reliable and extensive
resources such as lists of words and
expressions, syntactic patterns,
combinatorial rules, and annotated
corpora
Creation of uniform ways to denote and
represent sentiment and subjectivity
annotation