📦 Available software
A huge range of software applications, utilities, and libraries are installed and configured for you. Whether you need Rstudio or Spyder, Julia running in VSCode, popular R or Python packages, of fully configured Jupyter Notebooks, we have you covered.
The list of installed software is so large we make no effort to enumerate everything here, but you can always get an up-to-date list by opening a terminal on the Grid and running
Start with the expectation that all the software you need is already installed and ready to use. If that expectation is ever broken please email research@hbs.edu.
The most recent version of our software environment is activated when you log in to the HBS Grid via NoMachine and nothing further is required unless you wish to use an environment other than the default.
When logging in via SSH from a terminal (i.e., without NoMachine) you must explicitly activate the environment you wish to use after logging in. The default environment can be activated by running
Refer to the terminal environment documentation for details.
Installing Packages or Modules
The HBSGrid uses the conda package manager to install most R packages and Python modules that researchers will need in a central, read-only location. As we cannot anticipate everyone's requirements, you may have the need to install packages not centrally installed.
We recommend the following: - Install packages/modules in home folders for personal, testing, or development work, OR - Install packages/modules in project spaces for production or team use
One can use packages in all three locations, and can have the different versions of the
same package installed in different locations. Note that it is the search path of your R
or Python environment that will determine the priority order. We usually recommend
(highest to lowest):
- home folder ($HOME
or ~
) installations
- project folder installations
- central (cluster) installations
Installing R Packages
For installation in home folders: Use the standard command for install.packages()
command from within R/RStudio will download and install the specified packages in your
home folder by default. If the correct install folder is not already there, R/RStudio will
create it for you:
See the R documentation for additional options (install from another location besides CRAN, overwrite current installation, etc.)
For installation in project spaces: If you are working on a team and this is something for everyone to use, install the packages using the 'lib=' option:
In order for this to be seen when working on your project in this location, one can pre-
or post-pend this location with a .libPaths()
command
as part of a startup script
that runs when one opens a project or via new R session.
Note: as R and RStudio are managed via our central conda installation, one can also use
conda
and conda environments to manage our packages, which we don't recommend.
Installing Python Modules
Python modules can be installed via two methods: using pip install (recommended) or via creating conda environments. The latter is more heavyweight, as one is installing all the python infrasture and core, in addition to the modules you need. See our conda environments documentation if this is what is needed.
For installation in home folders: Include the --user
option to perform this installation:
To update/upgrade a module already installed, include also the --upgrade
option:
The installed modules will be located at $HOME/.local/lib/pythonxx.yy
, with xx.yy
being the python major.minor version.
For installation in project spaces: If you are working on a team and/or for production work, install the modules with pip install, though use the --prefix option instead:
Similar to R, one will need to include this location in the PYTHONPATH using sys.path
as a part of one's first few lines of code or as part of a Python startup script.
Installing Compatible Software on the HBSGrid Cluster
Ideally, the application you need will have a version compatible with the cluster's OS
(operating system; currently Red Hat Enterprise Linux 7, also known as RHEL 7.) Examples
include the PyCharm Python IDE and fzf
command-line fuzzy-finder. If you have a
compatible application, using the terminal, download and extract the program; run any
installer provided (if there is no installer just move the program wherever you like); and
run the software directly from the terminal command
line. If the software is available
in multiple versions, pick one that mentions "RHEL", "RPM", "Linux x86-64 / AMD64", or
just "Linux".
As an illustration, we can install PyCharm as follows:
- Connect to the HBSGrid via NoMachine, start Terminal from the menu, and download Pycharm:
tar
:
3. Submit an interactive job to run Pycharm on a compute node:
If all goes well you will see the Pycharm splash screen and you can start a new Python
project using this popular IDE.
If the application you need does not have a pre-built version compatible with the HBSGrid system or you are having issues installing your compatible software, reach out to research@hbs.edu.
Note: Please do not use sudo, apt-get, dnf, or rpm programs, as these require elevated privileges, which are reserved for the cluster administrators and not mere mortals such as ourselves.