Skip to content

PRACE User Support

Introduction

PRACE users coming to the TIER-1 systems offered through the DECI calls are, in general, treated as standard users, so most of the general documentation applies to them as well. This section shows the main differences for quicker orientation, but often uses references to the original documentation. PRACE users who do not undergo the full procedure (including signing the IT4I AuP on top of the PRACE AuP) will not have a password and thus an access to some services intended for regular users. However, even with the limited access, they should be able to use the TIER-1 system as intended. If the same level of access is required, see the Obtaining Login Credentials section.

All general PRACE User Documentation should be read before continuing reading the local documentation here.

Help and Support

If you need any information, request support, or want to install additional software, use PRACE Helpdesk.

Information about the local services are provided in the introduction of general user documentation Salomon and introduction of general user documentation Barbora. Keep in mind, that standard PRACE accounts don't have a password to access the web interface of the local (IT4Innovations) request tracker and thus a new ticket should be created by sending an email to support[at]it4i.cz.

Obtaining Login Credentials

In general, PRACE users already have a PRACE account set up through their HOMESITE (institution from their country) as a result of a rewarded PRACE project proposal. This includes signed PRACE AuP, generated and registered certificates, etc.

If there is a special need, a PRACE user can get a standard (local) account at IT4Innovations. To get an account on a cluster, the user needs to obtain the login credentials. The procedure is the same as for general users of the cluster, see the corresponding section of the general documentation here.

Accessing the Cluster

Access With GSI-SSH

For all PRACE users, the method for interactive access (login) and data transfer based on grid services from Globus Toolkit (GSI SSH and GridFTP) is supported.

The user will need a valid certificate and to be present in the PRACE LDAP (contact your HOME SITE or the Primary Investigator of your project for LDAP account creation).

For more information, see PRACE FAQ

Before you start using any of the services, do not forget to create a proxy certificate from your certificate:

$ grid-proxy-init

To check whether your proxy certificate is still valid (12 hours by default), use:

$ grid-proxy-info

To access the cluster, several login nodes running the GSI SSH service are available. The service is available from public Internet as well as from the internal PRACE network (accessible only from other PRACE partners).

Access From PRACE Network:

It is recommended to use the single DNS name name-cluster-prace.it4i.cz which is distributed between the four login nodes. If needed, the user can log in directly to one of the login nodes. The addresses are:

Salomon cluster:

Login address Port Protocol Login node
salomon-prace.it4i.cz 2222 gsissh login1, login2, login3 or login4
login1-prace.salomon.it4i.cz 2222 gsissh login1
login2-prace.salomon.it4i.cz 2222 gsissh login2
login3-prace.salomon.it4i.cz 2222 gsissh login3
login4-prace.salomon.it4i.cz 2222 gsissh login4
$ gsissh -p 2222 salomon-prace.it4i.cz

When logging from other PRACE system, the prace_service script can be used:

$ gsissh `prace_service -i -s salomon`

Access From Public Internet:

It is recommended to use the single DNS name name-cluster.it4i.cz which is distributed between the four login nodes. If needed, the user can login directly to one of the login nodes. The addresses are:

Salomon cluster:

Login address Port Protocol Login node
salomon.it4i.cz 2222 gsissh login1, login2, login3 or login4
login1.salomon.it4i.cz 2222 gsissh login1
login2-prace.salomon.it4i.cz 2222 gsissh login2
login3-prace.salomon.it4i.cz 2222 gsissh login3
login4-prace.salomon.it4i.cz 2222 gsissh login4
$ gsissh -p 2222 salomon.it4i.cz

When logging from other PRACE system, the prace_service script can be used:

$ gsissh `prace_service -e -s salomon`

Although the preferred and recommended file transfer mechanism is using GridFTP, the GSI SSH implementation also supports SCP, so for small files transfer, gsiscp can be used:

$ gsiscp -P 2222 _LOCAL_PATH_TO_YOUR_FILE_ salomon.it4i.cz:_SALOMON_PATH_TO_YOUR_FILE_
$ gsiscp -P 2222 salomon.it4i.cz:_SALOMON_PATH_TO_YOUR_FILE_ _LOCAL_PATH_TO_YOUR_FILE_
$ gsiscp -P 2222 _LOCAL_PATH_TO_YOUR_FILE_ salomon-prace.it4i.cz:_SALOMON_PATH_TO_YOUR_FILE_
$ gsiscp -P 2222 salomon-prace.it4i.cz:_SALOMON_PATH_TO_YOUR_FILE_ _LOCAL_PATH_TO_YOUR_FILE_

Access to X11 Applications (VNC)

If the user needs to run X11 based graphical application and does not have a X11 server, the applications can be run using VNC service. If the user is using a regular SSH based access, see this section in general documentation.

If the user uses a GSI SSH based access, then the procedure is similar to the SSH based access, only the port forwarding must be done using GSI SSH:

$ gsissh -p 2222 salomon.it4i.cz -L 5961:localhost:5961

Access With SSH

After they successfully obtain the login credentials for the local IT4Innovations account, the PRACE users can access the cluster as regular users using SSH. For more information, see this section in general documentation.

File Transfers

PRACE users can use the same transfer mechanisms as regular users (if they have undergone the full registration procedure). For more information, see the Accessing the Clusters section.

Apart from the standard mechanisms, for PRACE users to transfer data to/from the Salomon cluster, a GridFTP server running the Globus Toolkit GridFTP service is available. The service is available from public Internet as well as from the internal PRACE network (accessible only from other PRACE partners).

There is one control server and three backend servers for striping and/or backup in case one of them would fail.

Access From PRACE Network

Salomon cluster:

Login address Port Node role
gridftp-prace.salomon.it4i.cz 2812 Front end /control server
lgw1-prace.salomon.it4i.cz 2813 Backend / data mover server
lgw2-prace.salomon.it4i.cz 2813 Backend / data mover server
lgw3-prace.salomon.it4i.cz 2813 Backend / data mover server

Copy files to Salomon by running the following commands on your local machine:

$ globus-url-copy file://_LOCAL_PATH_TO_YOUR_FILE_ gsiftp://gridftp-prace.salomon.it4i.cz:2812/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_

Or by using prace_service script:

$ globus-url-copy file://_LOCAL_PATH_TO_YOUR_FILE_ gsiftp://`prace_service -i -f salomon`/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_

Copy files from Salomon:

$ globus-url-copy gsiftp://gridftp-prace.salomon.it4i.cz:2812/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_ file://_LOCAL_PATH_TO_YOUR_FILE_

Or by using the prace_service script:

$ globus-url-copy gsiftp://`prace_service -i -f salomon`/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_ file://_LOCAL_PATH_TO_YOUR_FILE_

Access From Public Internet

Salomon cluster:

Login address Port Node role
gridftp.salomon.it4i.cz 2812 Front end /control server
lgw1.salomon.it4i.cz 2813 Backend / data mover server
lgw2.salomon.it4i.cz 2813 Backend / data mover server
lgw3.salomon.it4i.cz 2813 Backend / data mover server

Copy files to Salomon by running the following commands on your local machine:

$ globus-url-copy file://_LOCAL_PATH_TO_YOUR_FILE_ gsiftp://gridftp.salomon.it4i.cz:2812/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_

Or by using the prace_service script:

$ globus-url-copy file://_LOCAL_PATH_TO_YOUR_FILE_ gsiftp://`prace_service -e -f salomon`/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_

Copy files from Salomon:

$ globus-url-copy gsiftp://gridftp.salomon.it4i.cz:2812/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_ file://_LOCAL_PATH_TO_YOUR_FILE_

Or by using the prace_service script:

$ globus-url-copy gsiftp://`prace_service -e -f salomon`/home/prace/_YOUR_ACCOUNT_ON_SALOMON_/_PATH_TO_YOUR_FILE_ file://_LOCAL_PATH_TO_YOUR_FILE_

Generally, both shared file systems are available through GridFTP:

File system mount point Filesystem Comment
/home Lustre Default HOME directories of users in format /home/prace/login/
/scratch Lustre Shared SCRATCH mounted on the whole cluster

More information about the shared file systems on Salomon is available here.

Hint

The prace directory is used for PRACE users on the SCRATCH file system.

Salomon cluster /scratch:

Data type Default path
large project files /scratch/work/user/prace/login/
large scratch/temporary data /scratch/temp/

Usage of the Cluster

There are some limitations for PRACE users when using the cluster. By default, PRACE users are not allowed to access special queues in the PBS Pro to have high priority or exclusive access to some special equipment like accelerated nodes and high memory (fat) nodes. There may also be restrictions on obtaining a working license for the commercial software installed on the cluster, mostly because of the license agreement or because of insufficient amount of licenses.

For production runs, always use scratch file systems. The available file systems on Salomon is described here.

Software, Modules and PRACE Common Production Environment

All system-wide installed software on the cluster is made available to the users via the modules. For more information about the environment and modules usage, see the Environment and Modules section.

PRACE users can use the "prace" module for PRACE Common Production Environment.

$ ml prace

Resource Allocation and Job Execution

For general information about the resource allocation, job queuing, and job execution, see Resources Allocation Policy.

For PRACE users, the default production run queue is "qprod", the same queue as for the national users of IT4I. Previously the "qprace" was the default queue for PRACE users, but since it gradually became identical with the "qprod" queue, it has been retired. For legacy reasons, the "qprace" queue is enabled on systems where it was the default one, but is not available on current and future systems. PRACE users can also use two other queues "qexp" and "qfree".

Salomon:

queue Active project Project resources Nodes priority authorization walltime
qexp Express queue no none required 32 nodes, max 8 per user 150 no 1 / 1 h
qprod Production queue yes >0 1006 nodes, max 86 per job 0 no 24 / 48 h
qfree Free resource queue yes none required 752 nodes, max 86 per job -1024 no 12 / 12 h
qprace Legacy production queue yes >0 1006 nodes, max 86 per job 0 no 24 / 48 h

Accounting & Quota

The resources that are currently subject to accounting are the core hours. The core hours are accounted on the wall clock basis. The accounting runs whenever the computational cores are allocated or blocked via the PBS Pro workload manager (the qsub command), regardless of whether the cores are actually used for any calculation. See the example in the general documentation.

PRACE users should check their project accounting using the PRACE Accounting Tool (DART).

Users who have undergone the full local registration procedure (including signing the IT4Innovations Acceptable Use Policy) and who have received a local password may check at any time, how many core-hours they and their projects have consumed using the command "it4ifree". Note that you need to know your user password to use the command and that the displayed core hours are "system core hours" which differ from PRACE "standardized core hours".

Note

The it4ifree command is a part of it4i.portal.clients package, located here.

$ it4ifree

Projects I am participating in
==============================
PID         Days left      Total    Used WCHs    Used NCHs    WCHs by me    NCHs by me     Free
----------  -----------  -------  -----------  -----------  ------------  ------------  -------
OPEN-XX-XX  323                0      5169947      5169947         50001         50001  1292555


Projects I am Primarily Investigating
=====================================
PID        Login         Used WCHs    Used NCHs
---------- ----------  -----------  -----------
OPEN-XX-XX user1            376670       376670
           user2           4793277      4793277

Legend
======
WCH   =    Wall-clock Core Hour
NCH   =    Normalized Core Hour

By default, a file system quota is applied. To check the current status of the quota (separate for HOME and SCRATCH), use:

$ quota
$ lfs quota -u USER_LOGIN /scratch

If the quota is insufficient, contact the support and request an increase.