Category: spark

Apache TinkerPop

The Gremlin Graph Traversal Machine and Language. Apache TinkerPop™ is a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP). Gremlin is the graph traversal language of Apache TinkerPop....

Features of RDD & its Operations

Let's look at some of the more appealing features of Apache Spark and RDDs. Apache Spark performs in-memory computation and evaluates RDDs lazily, i.e. they do not compute their results right away. Instead,...

Resilient Distributed Dataset (RDD)

Before we discuss the Resilient Distributed Dataset, let's see how to launch Spark. A Spark shell executable is usually present in the Spark version folder, which in turn is present under the “opt”...

Apache Spark Architecture

From the image shown above, one can easily see how much ground Spark covers. The left-hand side of the image depicts the different sources that provide the input data...

What is Apache Spark?

Apache Spark is a powerful open-source processing engine with a cluster computing framework. Spark is designed for lightning-fast processing of large datasets. This includes batch processing...