1. Abstract
2. Key Words
Hadoop, Big Data, Machine Learning
3. Introduction
2014 Alethe Labs All rights reserved. Alethe Labs and the Alethe Labs logo are trademarks or registered trademarks of Alethe Labs. All other
trademarks are the property of their respective companies. Information is subject to change without notice.
White Paper
Machine Learning in the Enterprise Hadoop
In addition to MapReduce, the following
components are useful for developing
machine learning on Big Data systems
powered by Hadoop.
5. Machine Learning
Machine learning, a branch of artificial
intelligence, concerns the construction
and study of systems that can learn
from data. Machine learning gives
computers the power to learn from data
without being explicitly programmed.
Computers enabled with machine
learning improve their performance by
learning from training data and
previous outcomes.
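
As a minimal sketch of what "learning from training data" can look like,
the example below trains a small perceptron on the logical OR function.
It is an illustration only and is not taken from this paper: the names
OR_SAMPLES, predict, train_perceptron and accuracy, as well as the
learning rate and epoch count, are assumptions chosen for the example.

```python
# Illustrative sketch only: a tiny perceptron, not code from the white paper.

# Training data for logical OR: ((input_1, input_2), expected_label).
OR_SAMPLES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]


def predict(model, point):
    """Return 1 if the weighted sum of the inputs is positive, else 0."""
    weights, bias = model
    total = weights[0] * point[0] + weights[1] * point[1] + bias
    return 1 if total > 0 else 0


def train_perceptron(samples, epochs=10, learning_rate=0.1):
    """Adjust the weights after every wrong prediction (perceptron rule)."""
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for point, label in samples:
            # error is +1, 0 or -1; a correct prediction changes nothing.
            error = label - predict((weights, bias), point)
            weights[0] += learning_rate * error * point[0]
            weights[1] += learning_rate * error * point[1]
            bias += learning_rate * error
    return weights, bias


def accuracy(model, samples):
    """Fraction of samples the model labels correctly."""
    correct = sum(1 for point, label in samples if predict(model, point) == label)
    return correct / len(samples)


untrained = ([0.0, 0.0], 0.0)            # no learning has happened yet
trained = train_perceptron(OR_SAMPLES)   # rule is learned, not hand-coded

print(accuracy(untrained, OR_SAMPLES))   # 0.25
print(accuracy(trained, OR_SAMPLES))     # 1.0
```

The OR rule is never written into the program; the model reaches it by
correcting its own mistakes on the training samples, which is the sense
in which performance improves with data.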
Collaboration between humans and
machines produces machine learning
results faster and makes them more
accurate. Human intuition is another
key factor in helping the machine give
better results each time a new data set
is used or a new query is asked.
6. Apache Mahout
Apache Mahout is a library of machine
learning algorithms, implemented on
top of Apache Hadoop and using the
Map/Reduce paradigm.
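
To give a rough idea of the Map/Reduce paradigm itself, the sketch below
runs the three phases (map, shuffle, reduce) in a single Python process
to compute an average rating per item. It does not use Hadoop or the
Mahout API; the record layout (user, item, rating) and the function
names map_phase, shuffle, reduce_phase and run_job are hypothetical and
chosen only for illustration.

```python
# Illustrative sketch only: single-process imitation of map/shuffle/reduce.
from collections import defaultdict


def map_phase(record):
    """Emit one (item, (rating, 1)) pair for a (user, item, rating) record."""
    _user, item, rating = record
    yield item, (rating, 1)


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(item, values):
    """Combine the partial (sum, count) pairs for one item into an average."""
    total = sum(rating for rating, _count in values)
    count = sum(count for _rating, count in values)
    return item, total / count


def run_job(records):
    """Run map, shuffle and reduce over the input records."""
    pairs = (pair for record in records for pair in map_phase(record))
    grouped = shuffle(pairs)
    return dict(reduce_phase(item, values) for item, values in grouped.items())


# Hypothetical input data, made up for this example.
ratings = [("u1", "A", 4.0), ("u2", "A", 2.0), ("u1", "B", 5.0)]
averages = run_job(ratings)
print(averages)  # {'A': 3.0, 'B': 5.0}
```

Because each record is mapped independently and each key is reduced
independently, a framework such as Hadoop can spread both phases across
many machines; this sketch only shows the data flow.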
Once Big Data is available on the
Hadoop Distributed File System (HDFS),
Mahout provides the tools needed to
automatically extract meaningful
information from those Big Data sets.
Currently Mahout has three use cases
8. Conclusion
Apache Hadoop provides a highly
scalable and cost-effective model for Big
Data. Applying machine-learning
techniques, such as those in the Mahout
library, makes Hadoop more powerful,
delivering better results as the data
grows.
9. References
http://hadoop.apache.org
https://mahout.apache.org
http://en.wikipedia.org/wiki/Machine_learning