Documente Academic
Documente Profesional
Documente Cultură
ABHILASH H C 4BD10CS003
CONTENTS
Introduction Sentiment Analysis RapidMiner Demonstration Screen Shots Advantages and Disadvantages Conclusion
INTRODUCTION
Two main types of textual information: Facts and Opinions Most current text information processing methods work with factual information (e.g., web search, text mining) Sentiment analysis or opinion mining, computational study of opinions (sentiments, emotions) expressed in text
Identify the orientation of opinion in a piece of text (blogs, user comments, review websites, community websites, ), in others words determine if a sentence or a document expresses positive, negative, neutral sentiment towards some object?
[ Factual ]
[ Sentimental ]
USES :
Consumer information
Product reviews
Consumer attitudes Trends Politicians want to know voters views Voters want to know politicians' stances and who else supports them Find like-minded individuals or communities
Marketing
Politics
Social
HOW IT IS DONE ?
First eliminate objective sentences, then use remaining sentences to classify document polarity (reduce noise)
RAPIDMINER
Around since 2001 Open source - Community Editions Client/Server model with Server as SaaS(Service as a Software) Most popular for data analytics GUI based - no need to write code Java based - Runs on All Platforms
All usual Windows versions are supported as well as Macintosh, Linux or UNIX systems. Download is available from http://www.rapid-i.com.
WELCOME PERSPECTIVE
DESIGN PERSPECTIVE
DEMONSTRATION STEP 1
STEP 2
STEP 3
STEP 4
STEP 5
STEP 6
STEP 7
STEP 8
ADVANTAGES
Free version has adequate resources to avoid big name options if a small business It is a quality tool, given its ranking among the other commercial products GUI is very user friendly. GUI is used to create data mining operators in XML files XML Standardization is great for utilizing various data sources Ease of use and available tutorials Works on any operating system
DISADVANTAGE
Some options are not available in free product, but you can upgrade Possibly less customer service available for free version There can be some restriction on customized use Beginner may face some difficulty in understanding
CONCLUSION
RapidMiner is an open source learning environment for data mining and machine learning. This environment can be used to extract meaning from a dataset. There are hundreds of machine learning operators to choose from, helpful pre and post processing operators, descriptive graphic visualizations, and many other features. Users with limited knowledge in computer science and programming may find RapidMiner's learning curve to be substantial.
THANK YOU