Documente Academic
Documente Profesional
Documente Cultură
meeting most needs of the business users. The GUI is not the most user-friendly and it
may take a little more time to learn than some tools but it is a competitively priced
product that is value for money.
Salford systems: provides a host of predictive analytics and data mining tools for
businesses. The company specialises in classification and regression tree algorithms. Its
MARS algorithm was originally developed by world-renowned Stanford statistician and
physicist, Jerome Friedman. The software is easy to use and learn.
KXEN: is one of the few companies that is driving automated analytics. Their products,
largely based on algorithms developed by the Russian mathematician Vladimir Vapnik,
are easy to use, fast and can work with large amounts of data. Some users may not like
the fact that KXEN works like a black box and in most cases, it is difficult to understand
and explain the results.
Angoss: Like Salford systems, Angoss has developed its products around classification
and regression decision tree algorithms. The advantage of this is that the tools are easy
to learn and use, and the results easy to understand and explain. The GUI is very user
friendly and a lot of features have been added over the years to make this a powerful
tool.
MATLAB: is a statistical computing software developed by MathWorks, MATLAB allows
matrix manipulations, plotting of functions and data, implementation of algorithms and
creation of user interfaces. There are many add-on toolboxes that extend MATLAB to
specific areas of functionality, such as statistics, finance, image processing,
bioinformatics, etc. Matlab is not a free software. However, there are clones like Octave
and Scilab which are free and have similar functionality.
R: R is a programming language and software environment for statistical computing and
graphics. The R language is an open source tool and is widely used by the academia.
For business users, the programming language does represent a hurdle. However, there
are many GUIs available that can sit on R and enhance its user-friendliness.
Weka: Weka (Waikato Environment for Knowledge Analysis) is a popular suite of
machine learning software, developed at the University of Waikato, New Zealand. Weka,
along with R, is amongst the most popular open source software used by the business
community. The software is written in the Java language and contains a GUI for
interacting with data files and producing visual results and graphs.
Traditionally focuses on using a consistent set of metrics to measure past performance and
guide business planning. Business Intelligence consists of querying, reporting, OLAP
(online analytical processing), and can answer questions including what happened, how
many, and how often.
BI is a Very broad term that means using the data available to your organization to make
factually based business decisions. This can take on a number of forms and methods but
generally includes doing things such as developing Key Performance Indicators (KPIs),
Trending Analysis, Predictive Modeling, dashboards, etc.
-Data mining
Somewhat more defined, is the process of looking for trends, patterns, or other useful
information within sets of data. This is typically done as part of "exploratory analytics".
This is becoming a much bigger part of the BI process with the rapid adoption of big data
solutions like Hadoop and NoSQL and the new concept of a Data Scientist within an
organization.
-Data modelling
-Artificial intelligence
-Machine learning algorithm
PART B Questions
1. Application of Analytics
-marketing
-Finance
-Pharma
2.10 Techniques
3. Data & Infrastructure
4. Summarization ?
5. Attribution ?
- Regression models
-Market mix models
6.what is concordance pairs ?
- used in predictive analytics
-if scroe >85 % , model is v good
7. What is stationary series?
-mean , variance & auto correlation
-Not a naturally occurring series
-take difference from Raw series will make stationary series
8.What are the components of time series ?
-Trend
-Seasonal
-Cucle
-irrgular
9.How to find forecast is good ?
-Mean
-Moving average
10.Why do you need 2-score before clustering ?