Launch a Jupyter Notebook server using Python as well as an Apache Spark cluster on Cardinal, Ascend or Pitzer. Check the Changelog for the recent updates.
An interactive app designed for OSC OnDemand that launches a Jupyter Notebook server and an Apache Spark cluster within an Owens batch job.
Prerequisites
This Batch Connect app requires the following software be installed on the compute nodes that the batch job is intended to run on (NOT the OnDemand node):
- Lmod 6.0.1+ or any other
module purgeandmodule loadbased CLI used to load appropriate environments within the batch job before launching the Jupyter Notebook server. - Jupyter Notebook 4.2.3+ (earlier versions are untested but may work for you)
- OpenSSL 1.0.1+ (used to hash the Jupyter Notebook server password)
- Apache Spark 2.1.0+
Install
Use Git to clone this app and checkout the desired branch/version you want to use:
scl enable git19 -- git clone
cd
scl enable git19 -- git checkout
You will not need to do anything beyond this as all necessary assets are installed. You will also not need to restart this app as it isn't a Passenger app.
To update the app you would:
cd
scl enable git19 -- git fetch
scl enable git19 -- git checkout
Again, you do not need to restart the app as it isn't a Passenger app.
Contributing
- Fork it ( https://github.com/OSC/bc_osc_jupyter_spark/fork )
- Create your feature branch (
git checkout -b my-new-feature) - Commit your changes (
git commit -am 'Add some feature') - Push to the branch (
git push origin my-new-feature) - Create a new Pull Request
License
- Documentation, website content, and logo is licensed under CC-BY-4.0
- Code is licensed under MIT (see LICENSE.txt)
- The Jupyter logo is a trademark of NumFOCUS foundation.
- Apache Spark, Spark and the Spark logo are trademarks of the Apache Software Foundation (ASF).