Documente Academic
Documente Profesional
Documente Cultură
and Neural
networks
-Impact on Big data
Big Data
* 3Vs
* Generating Buzz - Scientific data exponential growth
* 2012 - the year of Big Data
* 2013 - the year of Big Data analytics
Machine learning
* Branch of AI
* Focus on the study and construction of systems - predictions on
unseen data
* Applications in search engines, stock market analysis, speech
recognition ,information retrieval etc.
*ML History
1952- Arthur Samuel- first game-playing program - train checkers
Machine learning - gained momentum in early 90s
Learning algorithms come into commercial systems (Bayesian
networks )
Logistics Regression
Neural Network
Support Vector Machine
Clustering
Other popular algorithms:Random forest, Lasso, K-means, SVD etc
learning Algorithm
* Commercial Tools
Mahout, SaS, Matlab
Scikit Python
Cost Function
Minimization
Gradient
Descend
https://www.coursera.org/course/ml
What if m is
3,000,000
Solution 1
Scale up your learning
algorithm for Big Data
In memory computing
summation is not
possible
Traditional
Traditional Learning
Learning
algorithm
algorithm Not
Not working
working
for
for big
big data
data
https://www.coursera.org/course/ml
How do is scale up ?
Rsofia,Shortgun-r
Java-Lingpipe
Shortgun-Pyhon
https://www.coursera.org/course/ml
Computer
4
Data
Split 000
By Map
300
Reduce
000
training
data
Computer
3
Computer 2
Computer
1
Combine
results
Stochastic
Gradient
Descend
R-JAQL Bridge
Haloop
Haloop inherits map reduce from Hadoop. It adds various modification
in order to support iterative map reduce task.
HaLoop has API for easily writing iterative data analysis program
There is a Loop control module in master of Haloop which starts new
map reduce job and and control exit
In case of failure in iterative task the task scheduler and task trackers
facilitate recovery and allow the iterative data analysis to continue.
Apache Mahout
R-Hadoop
Haloop Details
Mahout Details
r-Hadoop
Implementations of
Machine learning
in Big data
Manufacturing and
Government
Quality control
Missile targeting
Loan underwriting
Specimen identification
Six Sigma
Credit scoring
Protein sequencing
Real-estate appraisal
industry
prediction
detection
* Neural Networks
* NVIDIA
Built largest artificial neural network - purpose to simulate and learn behavior
of human brain.
Nearly 6.5 times larger than the one developed by google in 2012
* NUANCE
Leader in Natural Language Processing and speech recognition
* NETFLIX
Uses neural networks on big chunk of user data generated through websites predict better recommendations for its users.
* Sample implementation
- Location Graph
Combines first-party data the big data with platform based on
machine learning
http
://blog.jiwire.com/how-big-data-enables-jiwire-to-deliver-30-or-more-lift-in-campaign-perfor
RMR
- statistics analysis and data visualization
mance/
Graph analytics
Commercial product
raphLab
arcData
Other Projects
http://gigaom.com/2013/05/14/were-witnessing-the-rise-of-the-graph-in-big-data/
LIONsolver -
-was able to differentiate Parkinsons patients from healthy individuals
-show the trend in symptoms of the disease over time
p://successfulworkplace.com/2013/07/31/big-data-crowdsourcing-and-machine-learning-tackle-parkinsons/
University of Cambridge researcher Anastasios Noulas - choosing the best retail location.
magic:
http://gigaom.com/2012/11/03/5-trends-that-are-changing-how-we-do-big-data/
http://www.kaggle.com/
Exponential
Compression
Machine
Learning
Big Data
Real time
Search Of
Big Data
Large Data
Set
Training set
Q-App
http://www.eetimes.com/document.asp?doc_id=1319059
*Thank You