Documente Academic
Documente Profesional
Documente Cultură
OBJECTIVE
EXISTING SYSTEM
PROPOSED SYSTEM
TECHNOLOGY USED HERE
SYSTEM ARCHITECTURE
WORKFLOW
MODULE DESCRIPTION
ADVANTAGES
DISADVANTAGES
CONCLUSION
FUTURE SCOPE
REFERENCES
OBJECTIVE
As most of the people require review about a product.
• Data Analysis
• Text mining
SENTIMENT ANALYSIS
Sentiment analysis is the most common text classification
tool .
It is used to analyze the incoming message.
And tells whether the underlying sentiment is
positive , negative or neutral.
The level involved in sentiment analysis,
1. Document level
2. Aspect level
3. Sentence level
NATURAL LANGUAGE PROCESSING
It describes the interaction between human & computers.
Example:
-Spell check
-Auto complete
-Spam filters
-Related keyword in search engine
-Voice text messaging
Steps involved in NLP,
1. Sentence segmentation
2. Word tokenization
4. Text lemmatization
6. Dependency parsing
DATA ANALYSIS
Data analysis is the process of applying statistical practices
to,
- Organize
- Represent
- Describe
- Evaluate and
- Interpret the data
SUPERVISED & UNSUPERVISED
LEARNING
SUPERVISED LEARNING
It analyzes the training data and produces an inferred
function. Categories into,
-Regression
-Classification
UNSUPERVISED LEARNING
Is trying to find the hidden structure in labeled data.
It can be categories into,
-Clustering
SPAM REVIEW DETECTION
Spam is defined as the any type of message or
communication originating from either a person or an
organization which is unsolicited and undesired.
Types of spams are,
1.Email spam
2.Advertising articles
3.External link spamming
4.Citations spams
5.Product review spams
TYPES OF SPAM REVIEWS ARE,
1.Untruthful opinions
It is also known as fake reviews.
2.Reviews on brand only
Not comment on the product for the
products but only brands , the manufacturers and sellers.
3.Non reviews
Advertisements.
Other irrelevant reviews containing no opinion.
TEXT MINING
Text mining is also known as TEXT DATA MINING.
It is the process of deriving the high-quality information
from text.
The purpose is too unstructured information , extract
meaningful numeric indices from the text.
TEXT MINING PROCESS
Text pre-processing
Text transformation
Feature selection
Data mining
Evaluate
Applications
1.web mining
2.medical
ALGORITHM & LIBRARIES USED IN
FAKE PRODUCT REVIEWS
TENSORFLOW
SUPPORT VECTOR MACHINE
• Support vector machine is a supervised machine learning
algorithm.
P(c|x)=P(x|c)P(c)
P(x)
RANDOM FOREST CLASSIFIER
Random forest is a flexible , easy to use machine learning
algorithm.
Server
Admin
Positive reviews
Negative reviews
Based on their spam review detection to find the
negative reviews.
GRAPHICAL REPRESENTATION
Use Random forest classifier method ,analyze the
review and generate the graph.
ADVANTAGES
User gets genuine reviews about the product.
User can post their review about the product.
User can send money on valuable products.
DISADVANTAGES
If the social media optimization team uses different ip
address to send the review ,system will fail to track
the fake review.
CONCLUSION
• Business organizations, specialists and academics are
putting forward their efforts and ideas to find the best
system for opinion spam analysis.