IBM Analytics Engine powered by Apache Spark provides a managed service for consuming Apache Spark, with additional features such as auto-scaling, resource quotas, and queuing. You can run Spark applications interactively by using Jupyter Notebooks and scripts, in both Python and R. You can also run applications as jobs from a notebook, from a deployment space, or directly through the Spark service instance. IBM Analytics Engine powered by Apache Spark creates on-demand Spark clusters and runs workloads through offerings such as Spark applications, Spark kernels, and Spark labs.
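A Spark application submitted as a job is typically a self-contained script that creates its own Spark session. The following is a minimal sketch in Python; the file name and input path are illustrative, not part of the service.

```python
# wordcount.py -- a minimal Spark application that could be submitted as a job.
# The input path is illustrative; replace it with data available to your cluster.
from pyspark.sql import SparkSession

if __name__ == "__main__":
    # The service provisions the cluster; the application only needs a session.
    spark = SparkSession.builder.appName("WordCount").getOrCreate()

    # Count word occurrences in a text file.
    lines = spark.read.text("/myapp/data/input.txt")
    counts = (
        lines.rdd.flatMap(lambda row: row.value.split())
        .map(lambda word: (word, 1))
        .reduceByKey(lambda a, b: a + b)
    )
    for word, count in counts.collect():
        print(word, count)

    # Stop the session so the job completes and the cluster can be reclaimed.
    spark.stop()
```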
The IBM Analytics Engine powered by Apache Spark service is not available by default. An administrator must install this service on the IBM Cloud Pak for Data platform. To determine whether the service is installed, open the Services catalog and check whether the service is enabled.
Each time you submit a job, a dedicated Spark cluster is created for it. You can specify the size of the Spark driver, the size of each executor, and the number of executors for the job, which gives you predictable and consistent performance.
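As a sketch of what such a submission can look like, the payload below sets the driver size, executor size, and executor count through standard Apache Spark configuration properties. The endpoint URL, payload structure, and token handling are illustrative assumptions, not the literal API contract; consult the service's jobs API reference for the exact fields.

```python
# Hypothetical sketch of submitting a Spark job with explicit resource sizing.
# The URL and payload shape are assumptions for illustration; the spark.* keys
# are standard Apache Spark configuration properties.
import requests

SPARK_JOBS_ENDPOINT = "https://<cpd-host>/spark/jobs"  # illustrative placeholder
ACCESS_TOKEN = "<bearer-token>"  # obtain from your platform's auth mechanism

payload = {
    "application_details": {
        "application": "/myapp/wordcount.py",   # the script shown earlier
        "conf": {
            "spark.driver.memory": "4g",        # size of the Spark driver
            "spark.driver.cores": "1",
            "spark.executor.memory": "4g",      # size of each executor
            "spark.executor.cores": "1",
            "spark.executor.instances": "2",    # number of executors
        },
    }
}

response = requests.post(
    SPARK_JOBS_ENDPOINT,
    json=payload,
    headers={"Authorization": f"Bearer {ACCESS_TOKEN}"},
    timeout=60,
)
response.raise_for_status()
print(response.json())  # typically includes a job ID you can use to track status
```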
When a job completes, the cluster is automatically cleaned up so that the resources are available for other jobs. The service also includes interfaces that enable you to analyze the performance of your Spark applications and debug problems.