Technology Blogs by SAP
Learn how to extend and personalize SAP applications. Follow the SAP technology blog for insights into SAP BTP, ABAP, SAP Analytics Cloud, SAP HANA, and more.
cancel
Showing results for 
Search instead for 
Did you mean: 
AndreasForster
Product and Topic Expert
Product and Topic Expert

This component adds a Pareto Plot to SAP Predictive Analytics.


It also outputs the aggregated data that underlies the plot, so that the transformed data can be used elsewhere.


Disclaimer

Please note that this component is not an official release by SAP and that it is provided as-is without any guarantee or support. Please test the component to ensure it works for your purposes.


Prerequisites

  • Libraries dplyr and gplots have to be installed.

Limitations

Please let me know should you encounter any limitations.

Usage

These parameters can be set by the user:

ParameterDescription
Categorical Variable

The label column by which the numerical variable will be summarised.

Numerical VariableThe column, which will be summarised by the lables found in the categorical variable.

Output columns:

ColumnDescription
Label

The values found in the categorical variable.

ValueThe numerical variable summarised by the row's label. The data is sorted descendingly on this column.
ValueCumulatedThe cumulated value.
PercentThe row's percentual contribution to the total sum of the numerical variable.
PercentCumulatedThe cumulated percentage.

How to Implement

The component can be downloaded as .spar file from GitHub. Then deploy it as described here. You just need to import it through the option "Import/Model Component", which you will find by clicking on the plus-sign at the bottom of the list of the available algorithms.


Example

You can try the pareto plot on the attached dataset, which lists how many passengers embarked on an airplane at San Francisco airport. Select "PassengerCount" as numerical variable and choose a categorical column.


You can see the resulting pareto plot. Here the plot is broken down by geographical region.


You can also see the aggregated data that was used to build the plot.

You could use that data for instance in the "Prepare" tab to easily produce a more interactive pareto plot.