Documente Academic
Documente Profesional
Documente Cultură
Contents
“How Old Do You Think I Am?”: A Study of Language and Age in Twitter ..................... 3
Abstract ........................................................................................................................... 3
Abstract ........................................................................................................................... 4
Abstract ........................................................................................................................... 5
Abstract ........................................................................................................................... 6
GENDER PREDICTION 3
“How Old Do You Think I Am?”: A Study of Language and Age in Twitter
Abstract
In this paper we focus on the connection between age and language use, exploring age
prediction of Twitter users based on their tweets. We discuss the construction of a fine-grained
annotation effort to assign ages and life stages to Twitter users. Using this dataset, we explore
age prediction in three different ways: classifying users into age categories, by life stages, and
predicting their exact age. We find that an automatic system achieves better performance than
humans on these tasks and that both humans and the automatic systems have difficulties
predicting the age of older people. Moreover, we present a detailed analysis of variables that
change with age. We find strong patterns of change, and that most changes occur at young ages.
GENDER PREDICTION 4
Abstract
applications. However, online customers often do not provide this kind of information due to
privacy issues and other security related reasons. In this paper, we proposed a method for
predicting the gender of customers based on their catalog viewing data on e-commerce systems,
such as the date and time of access, list of categories and products viewed, etc. We employ a
machine learning approach and investigate a number of features derived from catalog viewing
information to predict the gender of viewers. Experiments were conducted on datasets provided
by the PAKDD’15 Data Mining Competition and achieved the good result. The results 81.2% on
balanced accuracy and 81.4% on macro F1 score showed that basic features such as viewing
time, products/categories features used together with more advanced features such as
customers.
GENDER PREDICTION 5
Abstract
Accurate prediction of shopping channel preferences has become an important issue for
retailers seeking to maximize customer loyalty. In data mining, novel approaches such as neural
networks (NN) have been proposed to predict the probability of class memberships in addition to
statistical methods from marketing modelling. However, Data Mining suggests new approaches
to data preprocessing in order to maximise predictive accuracy, such as rebalancing skewed class
distributions of the target variable. Conflicting best practices exist in data mining and market
modelling, without diffusion between the two disciplines. To reflect this discrepancy, we
evaluate the predictive accuracy of balanced versus imbalanced classification of consumer online
shopping behaviour using logistic regression and NN. Experimental predictions are computed
using socio-demographic, product and Internet shopping specific variables to classify consumers
into “online shoppers”, “browsers” and “non-Internet shoppers” using UK sample survey data.
Our findings suggest that rebalancing data increases accuracy for both methods. In addition, NN
provide superior classification accuracy and limited interpretation of explanations for class
membership
GENDER PREDICTION 6
Abstract
In this paper Regression Modeling Technique is proposed for the retention of customer
and maintains customer loyalty. Prediction attempts to predict the pattern of events on the basis
of the input data Here the aim of the paper is to launch desktops and laptops of various
configurations on the basis of age, Gender, Price and monthly income. This paper aims at how
the concepts of data mining and regression analysis can be applied to achieve the response of the
customer by analyzing the relationship among the various customer related attributes.