Notebooks for Apache Spark


New

Notebooks for Apache Spark

Launch a Jupyter Notebook in few clicks and run your Apache Spark jobs from it

Jupyter Notebooks for Apache Spark

Do you use Apache Spark but look for a more interactive way to explore your big data?

The service is free during the alpha. At the end of this lab, all notebooks instances will be deleted.

How it works?

Notebooks for Apache Spark completes the OVHcloud Data Processing product by leveraging our existing serverless Apache Spark service and bringing the capability to run on-demand Python Apache Spark jobs from your Jupyter Notebooks.

Data scientist will find a familiar data science experience through Jupyter Notebooks without the hassle of setting up the Apache Spark infrastructure.

YouTube conditions the playback of its videos on the deposit of tracers in order to offer you targeted advertising based on your browsing.

In order to watch the video, you need to accept the Sharing cookies on third-party platforms privacy category in our Privacy Center. You have the option of withdrawing your consent at any time.

For more information,visit the YouTube cookies policy and the OVHcloud cookies policy .

How to start a Jupyter notebook for Apache Spark in few clicks

How to opt-in this lab?

Nothing simpler! We have put together several guides to help setup and use your first notebook.

First, follow our getting started guide to learn how to create Notebooks for Apache Spark.

Then, try our data cleaning tutorial to hone your skills.

Features & benefits

Accelerate time to market

  • As Data Scientists or Developers, benefit from Jupyter live code editor very simply
  • No hassle of setting up Apache Spark infrastructure
  • Launch your Jupyter notebook in minutes, and directly launch your Apache Spark jobs on demand
  • Accelerate your data project time to deliver

Ease of use

  • Easy to use Control Panel as well as a comprehensive API

About Notebooks for Apache Spark

  • Alpha launched in April 2023
  • DC availability: GRA
  • Apache Spark 3.4.0

How to share your feedback?

Feel free to engage with us and the community on Discord.

You will find our #data-processing channel under the Data Analytics - Public Cloud section.

Limitations

This product being in an alpha testing phase comes with a number of limitations:

  • Kernels are limited to the latest supported version of Spark (currently 3.4.0).
  • Notebooks can only be created in "Public access".
  • Environment are not backed up when stopping a notebook. Remember to save your files before stopping the Jupyter Lab.
  • Alpha
  • Beta
  • General Availability