Technology Blogs by SAP
former_member89721
Product and Topic Expert

SAP HANA Vora provides an in-memory processing engine that can scale up to thousands of nodes, both on premises and in the cloud. Vora fits into the Hadoop ecosystem and extends the Spark execution framework.


Concepts and Requirements:


SAP HANA Vora 1.2 consists of the following two main components:

  • SAP HANA Vora Engine:
    SAP HANA Vora instances hold data in memory and boost the performance.
  • SAP HANA Vora Spark Extension Library:
    • Provides access to SAP HANA Vora through Spark.
    • Makes available additional functionality, such as a hierarchy implementation.

Both components are included in the Vora packages listed below. Choose the package that matches your Hadoop distribution:

  • SAP HANA Vora for Ambari: VORA_AM<version>.TGZ
  • SAP HANA Vora for Cloudera: VORA_CL<version>.TGZ
  • SAP HANA Vora for MapR: VORA_MR<version>.TGZ
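
As a small illustration of the naming scheme above, the following sketch picks the package files that match a given distribution from a list of file names. Only the VORA_AM, VORA_CL, and VORA_MR prefixes and the .TGZ suffix come from the list above; the function names and the lowercase distribution keys are hypothetical helpers, not part of any SAP tooling.

```python
from fnmatch import fnmatchcase

# Package prefixes per distribution, as listed above.
PACKAGE_PREFIX = {
    "ambari": "VORA_AM",
    "cloudera": "VORA_CL",
    "mapr": "VORA_MR",
}


def package_glob(distribution):
    """Return a glob pattern such as 'VORA_AM*.TGZ' for the distribution."""
    prefix = PACKAGE_PREFIX[distribution.lower()]
    return prefix + "*.TGZ"


def find_packages(distribution, filenames):
    """Return the file names that look like Vora packages for the distribution."""
    pattern = package_glob(distribution)
    # fnmatchcase compares case-sensitively on every platform.
    return [name for name in filenames if fnmatchcase(name, pattern)]
```

The version part of the file name is matched with a wildcard because its exact format is not stated here.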


To download the packages: https://launchpad.support.sap.com/#/softwarecenter/search/vora%25201.2


Vora 1.2 supports the following operating systems:

  • SUSE Linux Enterprise Server (SLES) 11 SP3
  • Red Hat Enterprise Linux (RHEL) 6.7 and 7.2

You should also follow the Installation and Administration guide for the compatibility pack installations: http://help.sap.com/hana_vora

The following table shows the supported combinations of operating system, cluster provisioning tool, and Hadoop distribution:

Remember that the minimal setup for Vora 1.2 is:

  • 4 cores
  • 8 GB of RAM
  • 20 GB of free disk space for HDFS data
  • Note: You can’t install Vora 1.2 on a single node
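
The minimal setup above can be written down as a quick pre-installation check. This is only a sketch: the thresholds come from the list above, but the NodeSpec class and the function names are hypothetical, and applying the thresholds to each node individually is an assumption, since the list does not say whether they are per node or per cluster.

```python
from dataclasses import dataclass

# Minimum values taken from the list above.
MIN_CORES = 4
MIN_RAM_GB = 8
MIN_HDFS_FREE_GB = 20
MIN_NODES = 2  # Vora 1.2 cannot be installed on a single node


@dataclass
class NodeSpec:
    name: str
    cores: int
    ram_gb: float
    hdfs_free_gb: float


def check_node(node):
    """Return a list of problems for one node; an empty list means it passes."""
    # Assumption: the thresholds are applied per node.
    problems = []
    if node.cores < MIN_CORES:
        problems.append(f"{node.name}: {node.cores} cores, need at least {MIN_CORES}")
    if node.ram_gb < MIN_RAM_GB:
        problems.append(f"{node.name}: {node.ram_gb} GB RAM, need at least {MIN_RAM_GB}")
    if node.hdfs_free_gb < MIN_HDFS_FREE_GB:
        problems.append(
            f"{node.name}: {node.hdfs_free_gb} GB free for HDFS, need at least {MIN_HDFS_FREE_GB}"
        )
    return problems


def check_cluster(nodes):
    """Check the node count first, then every node individually."""
    problems = []
    if len(nodes) < MIN_NODES:
        problems.append(f"cluster has {len(nodes)} node(s), need at least {MIN_NODES}")
    for node in nodes:
        problems.extend(check_node(node))
    return problems
```

Calling check_cluster with one NodeSpec per host returns an empty list when the node count and every node meet the minimums.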

To run Vora 1.2, the following Vora services must be installed and configured. I will walk you through their installation and configuration on the cluster.

  • SAP HANA Vora Base: Vora libraries and binaries. Installs on all hosts.
  • SAP HANA Vora Catalog: Vora's distributed metadata store. Installs on one node, usually a DLOG node.
  • SAP HANA Vora Discovery Service: Manages service registrations and installs on all nodes. In server mode, it installs on 3 nodes (maximum 7), one of which is selected as the bootstrapping host. In client mode, it installs on all remaining nodes. Note: You can’t install the DS server and the DS client on the same node.
  • SAP HANA Vora Distributed Log: Provides persistence for the Vora Catalog. Usually installed on the master node (5 nodes recommended).
  • SAP HANA Vora Thriftserver: A gateway compatible with the Hive JDBC connector. Usually installed on the jumpbox, where the DS, DLOG, and Catalog servers are not installed.
  • SAP HANA Vora Tools: Web UI for the Vora 1.2 modeler. Installed on the same node as the Vora Thriftserver.
  • SAP HANA Vora V2Server: The Vora engine. Installs on all worker nodes (DataNodes).
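
The placement rules in this list can be checked mechanically before you start clicking through the installer. The sketch below is a hypothetical helper, not part of Vora or Ambari: the service labels, the function name, and the example layout are made up for illustration, and reading "3 nodes (maximum 7)" as "between 3 and 7 Discovery servers" is an assumption.

```python
DS_SERVER = "discovery-server"
DS_CLIENT = "discovery-client"


def check_layout(layout):
    """layout maps a node name to a set of service labels; returns a list of problems."""
    problems = []
    for node, services in layout.items():
        # Vora Base installs on all hosts.
        if "base" not in services:
            problems.append(f"{node}: Vora Base is missing")
        has_server = DS_SERVER in services
        has_client = DS_CLIENT in services
        # The DS server and client must not share a node.
        if has_server and has_client:
            problems.append(f"{node}: Discovery server and client on the same node")
        # Every node runs the Discovery Service in one of the two modes.
        if not has_server and not has_client:
            problems.append(f"{node}: no Discovery Service")
        # Vora Tools goes on the same node as the Thriftserver.
        if ("tools" in services) != ("thriftserver" in services):
            problems.append(f"{node}: Tools and Thriftserver should share a node")
    servers = [n for n, s in layout.items() if DS_SERVER in s]
    if not 3 <= len(servers) <= 7:  # assumption: 3 is the minimum, 7 the maximum
        problems.append(f"{len(servers)} Discovery servers, expected 3 to 7")
    catalogs = [n for n, s in layout.items() if "catalog" in s]
    if len(catalogs) != 1:
        problems.append(f"Catalog is on {len(catalogs)} nodes, expected exactly 1")
    return problems


# Hypothetical 4-node layout, for illustration only; it is not the exact
# assignment used later in this document.
example = {
    "master": {"base", DS_SERVER, "dlog", "catalog"},
    "server": {"base", DS_SERVER, "dlog"},
    "worker1": {"base", DS_SERVER, "dlog", "v2server"},
    "worker2": {"base", DS_CLIENT, "v2server", "thriftserver", "tools"},
}
```

check_layout(example) returns a list of rule violations, so you can adjust the dictionary to your own host names and rerun it until the list is empty.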

 

Either install and configure all the services at the same time, or follow this order so that the dependencies are handled:

The following diagram shows the architecture of the 4-node cluster that we will set up in this document, and how the Vora 1.2 services are assigned to its nodes:

One master node, one server node, and two worker nodes.


*** We assume that your Hadoop cluster is already set up with the following components: HDFS 2.6.x or 2.7.1, ZooKeeper 3.4.6, Spark 1.5.2, and the YARN cluster manager 2.7.1.
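
If you want to compare your cluster against these assumptions before installing, a check along the following lines may help. It is a sketch only: the version strings come from the sentence above, while the dictionary, the function names, and the way you collect the installed versions are hypothetical.

```python
from fnmatch import fnmatchcase

# Versions assumed by this walkthrough; "2.6.x" is written as the glob "2.6.*".
ASSUMED_VERSIONS = {
    "hdfs": ("2.6.*", "2.7.1"),
    "zookeeper": ("3.4.6",),
    "spark": ("1.5.2",),
    "yarn": ("2.7.1",),
}


def matches_assumption(component, version):
    """True if the version matches one of the assumed versions for the component."""
    patterns = ASSUMED_VERSIONS[component.lower()]
    return any(fnmatchcase(version, pattern) for pattern in patterns)


def mismatches(installed):
    """installed maps component name to version; returns the components that differ."""
    return sorted(
        name for name, version in installed.items()
        if not matches_assumption(name, version)
    )
```

A mismatch does not necessarily mean that Vora will not work; it only means that your setup differs from the one used in this walkthrough.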


Installing Vora 1.2 Services:


Step 1) Adding Vora Base: Add Vora Base on all nodes. It has to be installed as a client on each of them, as shown below.

— No extra configuration is needed.

— You can click the Proceed button, as shown below, even if you get the error, since you’re not using MapReduce jobs:

— Click Complete.

— Notice that Vora Base is now added to your services:

Step 2) Next, add the Vora Discovery Service as three Discovery servers and one Discovery client.

Adding the Vora Discovery client:

— The Vora Discovery servers need extra configuration:

— In vora_discovery_bootstrap, add the DNS name of the master node.

— In vora_discovery_servers, add the DNS names of your Discovery servers.

— Proceed and deploy the service.

— Notice that the Vora Discovery Service is now installed:

Step 3) Next, add the Vora Distributed Log (DLOG) service:


— Install the DLOG servers on the same machines where you installed the Discovery servers.

— No extra configuration is needed.

— Click Next -> click Proceed anyway -> click Complete.

— Notice that Vora DLOG is now added to the services:

Step 4) The next step is to install the Vora Catalog:

— Install Catalog on your master node:



— Click Next -> click Proceed anyway -> click Complete.

— Notice that the Vora Catalog is now added to the services:

Step 5) Time to install V2Server as shown below:



— Extra configuration: add the Vora V2Server Worker service to the worker1 and worker2 nodes, and remove it from your server node.

— Click Next -> click Proceed anyway -> click Complete.

— Notice that the Vora V2Server is now added to the services:

Step 6) Time to install Vora Thriftserver and Vora Tools:

— The Thriftserver needs additional configuration, as shown below:

— Add vora_thriftserver_java_home = /usr/lib/jvm/java (this value depends on where Java is installed on your system).

— Add vora_thriftserver_spark_home = /usr/hdp/2.3.4.0-3485/spark (this is your Spark home directory).

— Click Next -> click Proceed anyway -> click Complete.

— Notice that the Vora Thriftserver and Vora Tools are now added to the services:
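
Because both Thriftserver values depend on your system, it can be worth checking them on the Thriftserver node before you deploy. The sketch below assumes the usual layout in which a Java home contains bin/java and a Spark home contains bin/spark-submit; the function name is hypothetical and the check is not part of Vora.

```python
from pathlib import Path


def missing_executables(java_home, spark_home):
    """Return the expected executables that do not exist under the two homes."""
    expected = [
        Path(java_home) / "bin" / "java",           # standard Java home layout
        Path(spark_home) / "bin" / "spark-submit",  # standard Spark home layout
    ]
    return [str(path) for path in expected if not path.is_file()]
```

For the values used in this step you would call missing_executables("/usr/lib/jvm/java", "/usr/hdp/2.3.4.0-3485/spark"). An empty list means both directories look plausible, while any returned path points to a value that needs correcting.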

Now click the HDFS, MapReduce2, and YARN services, which are marked in red, and restart all affected components, as shown below:

Congratulations! You now have the Vora 1.2 services installed on your cluster.

Step 7) To validate your Vora installation:

— SSH to your worker1 node and run:

— source /etc/vora/vora-env.sh

— $VORA_SPARK_HOME/bin/start-spark-shell.sh


You should now see that both SQL contexts (the Vora SQL context and the Spark SQL context) are available.
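
If the shell does not start, a first thing to check is whether the environment script actually defines VORA_SPARK_HOME. The sketch below sources the script in a child shell and reads the variable back. The script path and the start-spark-shell.sh location are the ones used in this step; the function names are hypothetical, and the sketch assumes that the script itself prints nothing to standard output.

```python
import os
import subprocess


def read_vora_spark_home(env_script="/etc/vora/vora-env.sh", shell="bash"):
    """Source env_script in a child shell and return VORA_SPARK_HOME ('' if unset)."""
    # "." is the portable spelling of "source"; "$1" is the script path passed below.
    command = '. "$1" && printf %s "$VORA_SPARK_HOME"'
    result = subprocess.run(
        [shell, "-c", command, shell, env_script],
        capture_output=True,
        text=True,
        check=True,  # raises CalledProcessError if sourcing the script fails
    )
    return result.stdout


def spark_shell_launcher(vora_spark_home):
    """Build the launcher path used in this step."""
    return os.path.join(vora_spark_home, "bin", "start-spark-shell.sh")
```

On the worker node, spark_shell_launcher(read_vora_spark_home()) gives the full path of the script that this step runs, and an empty VORA_SPARK_HOME suggests that the Vora services were not deployed on that node.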
