Skip to content

Research Clusters

Andrea Antonacci edited this page Sep 2, 2021 · 3 revisions

Computing services and research clusters allow us to outsource computationally expensive jobs to external computer banks. If you are unsure of whether a job should be submitted to a cluster, first ask another RA and then proceed to your supervisor if necessary. Generally speaking, though, you may want to use such services if it would take too long to run the analysis on your own PC.

Blade

Blade Computing is a service by Tilburg University's Library and IT Services (LIS) and find its origins at TiSEM's Finance department. A cost-effective, on-line computing environment was needed, which could be used by PhD's and the research-group alike.

The environment consists of powerful servers on which researchers can interactively work on data and run statistical computations. This Windows based environment, which looks and feels like a regular Windows desktop.

It's accessible through a web-portal. When logged on to the blade 24 CPU’s and 384 GB of memory are available. Furthermore the blade is equipped with SSD’s to speed up your I/O.

Pros:

  • User-friendly
  • Access to Tilburg University's network drives, such as the 'Storage for Researchers'-service.
  • Most of Tilburg University's expensive statistical software is available e.g. StataMP.

Cons:

  • It’s not a cluster; the servers’ resources are the limit.
  • Resources on the server are shared with others.

How to access Blade

You can request access to Blade via the IT Service Desk and filling in a form. Access for external researchers is possible. The guest researcher needs to have a Tilburg University username, a so-called PNIL-account (staff-not-on-payroll). These accounts can be requested at the HRM department.

Once your request is approved, you can access Blade:

If this doesn't work, proceed with the following:

  • On a Windows computer use the following url in web browser and sign in with your TiU credentials (campus\TiU username): https://rdweb.campus.uvt.nl/

  • On a macOS computer use Microsoft Remote Desktop Client for Mac (install from App Store), use this webfeed URL https://rdweb.campus.uvt.nl/RDWeb/feed/webfeed.aspx and sign in with your TiU credentials (campus\TiU username)

  • On a Linux computer use the url https://rdweb.campus.uvt.nl/ to download the rdp file and import the file in Remmina client (Ubuntu software). Edit settings to sign in with your TiU credentials.

You can download a one-page PDF quickstart guide here.

IMPORTANT! Scheduled maintenance

Every Friday after the 2nd Tuesday of the month the TiU Blades will be put offline for maintenance. You will receive an email reminder each month.

Do NOT run your analyses when maintenance is performed, because unsaved work will be lost.

Moreover, each month during the maintenance period, all personal data stored locally on Blade on scratch drives (D:) will be erased! Therefore, you should backup your files before.

Where to save your files

M / S / T drive

  • Use M / S / T disc for storing data that must be retained.

D drive (Scratch)

  • Use only the D-disk (Scratch) for temporary data storage. During standard (monthly) maintenance, the D drive is deleted.

C drive

  • The C drive is only intended for the Operating System and Applications.

E drive

  • The E-disk has been used since 12-10-2018 to store the local user profiles. (This because of possible filling up of the C-drive and thus undermining the entire server).
  • The E-disk contains the user profiles (with, for example, python packages). This E-disk is automatically deleted on restart.

SURFsara's LISA cluster

If Blade Computing is not sufficient then the LISA Cluster may offer a solution. Hundreds of servers can be accessed via a so-called queue system. The execution of a calculation task can be assigned to multiple servers. As soon as the number of required servers is available, the assignment is executed. Some of the servers are linked to each other via a so-called high-speed link (infiniband). This part of the LISA Cluster is a good choice for executing an assignment that requires a great deal of communication between servers. The support desk of SurfSARA can support in use. The support desk has templates and sample applications and offers workshops for beginners.

Pros:

  • Hundreds of servers are available (cluster).
  • Access to a huge amount of software.
  • Users do not have to share the resources, but have exclusive access.

Cons:

  • Steep learning curve. Knowledge is required to be able to build an efficient job.
  • No direct access to the joint network of Tilburg University.
  • There may be a queue and thus waiting time.
  • Less interactive because of the queue principle.

How to access LISA cluster

You can request access to LISA via the IT Service Desk and filling in a form.

Once your request is approved, you can connect to LISA:

Windows

  • There is more than one way to make a connection, we take PuTTY. If that program is not present on your system, you should install it now: go to the PuTTY download page and download putty.exe. Start PuTTY and fill in under 'Host Name (or IP address)'

  • lisa.surfsara.nl Check the 'Connection type' radio button labeled 'SSH', click the 'Open' button.

Linux

Open a terminal window (for Ubuntu users: you find that here: 'Accessories - Terminal'). In that terminal window, type:

  • ssh <your_username>@lisa.surfsara.nl

If the ssh command cannot be found, install the ssh-client. For Ubuntu users:

  • sudo apt-get install openssh-client

MacOS

Open a terminal window (You find that here: 'Applications - Utilities - Terminal'). In that terminal window, type:

  • ssh <your_username>@lisa.surfsara.nl

SURFsara's Cartesius Cluster

Cartesius is a supercomputer in the true sense of the word. All servers in the cluster are linked with high-speed connections to form a very large computer. Cartesius is best suited for calculations in which the individual components (functions and procedures) have a great interdependence. A so-called Graphical Processing Unit (GPU) is also offered on this environment. In some cases, for example with Floating Point calculations, running a program on a Graphics adapter is much more efficient than on a computer processor.

Pros:

  • The most rapid solution.
  • Graphical Processing Unit (GPU) available.
  • A huge amount of software.
  • Users do not have to share resources, but have exclusive access.

Cons:

  • Steep learning curve. Knowledge is required to be able to build an efficient job.
  • No direct access to the joint network of Tilburg University.
  • There may be a queue and thus waiting time.
  • Less interactive because of the queue principle.

You can find more information about the Cartesius cluster and how to connect to it on the IT Service Desk.

Clone this wiki locally