Sunteți pe pagina 1din 22

Top 20 Big Data

Tools
201
It is a library framework
that allows us to

# proceed distributed
processing of large
data sets across

1 various cluster of
computers. It can be
scaled up to handle
thousands of server
machines.
By the definition, it is a
fast, open source,

#2 general purpose cluster


computing framework.
API’ can be developed
in JAVA, Scala, R and
python languages. This
framework supports to
process large sets of
data across various
clusters of computers
#3 It is an open source real
time big data
computation system
and also free to use. It
can process
unbounded streams of
data in a distributed
real time.
Table is the powerful tool
ever, it helps to simplify
the raw data into an
#4 easily understandable
data sets. Tableau work
nature can be easily
understandable by
professionals who are in
any level of an
organization
Effective management
of large set of data
#5 can be done by
apache cassandra,
without compromising
the performance it
can provide you
scalability and high
ability.
Cassandra is fault
tolerant, decentralized,
Scalable, High performer.
It is also an another
open source, distributed
Big data tool that can
#6 stream process the data
with no hassles.

Provide accurate results


for out of order and
delayed data

Can easily recover


from failures
Faster, easier and highly

#7 secure modern big data


platform. It allows user
to get data from any
environment within a
single and scalable
platform.
#8 Developed by LexisNexis
Risk Solution. It delivers
data processing on a
single platform with a
single programming
language support.
#9 It is an autonomous big
data platform. Wll be
self managed, self-
optimized, it allows
businesses to focus on
better outcomes.
It is an easy to use big
data tool, that focuses on

# 10 statistical reports.

Explores data in seconds.


it helps to cleanse the
data and create charts
in seconds.
We can create
histograms, heatmaps,
and bar charts at
any time
It is the only big data
tool that stores data in
JSON Documents, It
#1 provides distributed
scaling with ultra fault
tolerant. It allows data
1 accessing through
couch replication tool.
This big data tool can be

# 12 used to extract, prepare


and blend the data. It
provides both visualization
and analytics for a
business.
Openrefine is also another
big data tool , it can help
us to work with a large
# 13 amount of messy data.

It helps to explore
large data sets with
easy manner.

Can Link and extend


data set across various
web services.
It is also an another

# 14 open source big data


tool.
Which is used for data
prep, machine learning,
and data model
deployments.
It is a Data quality
analysis tool, inside the
data cleaner there is

# 15 a strong data
profiling technique.

Interactive and
explorative data profiling
D ATA C l e a n e r
feature.Detects fuzzy
records.Validates data
and reports them.
# 16 It is a big data
community , were
businesses,
organizations and
researchers can analyze
their data seamlessly.
It is an open source
# 17 software big data
tool. Can help to
analyze large data
set on hadoop.
Querying and
managing large data
sets at real fast.
It is a community,
capable of handling
trillions of events a
day. Created in 2011

# 18 and open sourced


by linkedin.Initially
this was started as
a messaging
platform then within
a short period it has
been diverged in to
even streaming
platforms,
It is a NoSQL Database
uses graph data

# 19 model comprised of
different vertices to
represent relationships
between nodes .
Graph
Databases
It is a search based
lucene library, distributed,
full-text search engine
with an HTTP web

# 20 interface.

It is compatible on every
platform. Real time,
within a second of
adding the document it
can searchable inside
the search engine.
www.bibrainia.com

S-ar putea să vă placă și