Lecture 07

Cluster and cloud computing resources

Cluster and cloud computing resources

Google Colab

Google Colab provides free cloud-based Jupyter notebooks. Each session is limited to 12 hours of computation, which should be enough for this class. Colab is useful for quick work, but can be tedious since packages need to be reinstalled each time.

Website: colab.research.google.com

Google Colab interface
Figure 1. Google Colab interface with GPU runtime enabled.

Turn GPU mode, default CPU

To enable GPU acceleration, go to:
RuntimeChange Runtime TypeHardware Accelerator

Colab comes preloaded with some packages, including PyTorch.

Installing packages

You can install additional packages in a notebook cell using:

!pip install package_name

GitHub

Colab can be directly connected to GitHub or Google Drive.
When you import a notebook from GitHub, save it to Drive, otherwise your changes will not be saved.

How to get data

Mount your Google Drive to the notebook to access files.
Once mounted, the notebook can open data as usual. Colab does not automatically have access to local data.

Colab is a good way to get free GPU access. There are also many paid vendors for GPU access.

Mounting Google Drive in Colab
Figure 2. Mounting Google Drive in Colab to access datasets.

Once mounted, all your Drive files appear under /content/drive/MyDrive/.
You can then open datasets directly from these folders in your Colab notebook.

Opening dataset files from Google Drive in Colab
Figure 3. Example code showing how to open dataset files once Google Drive is mounted.

CHTC

CHTC is UW’s own cloud computing server.
If you are working with faculty, you can get a free account and access.
Research is conducted on the use of these servers by UW.