About
What is SCRP?
SCRP is a high-performance computing cluster managed by the Department of Economics of The Chinese University of Hong Kong, designed to serve both research and teaching roles. Although minuscule in size compared to most HPC clusters, it uses many of the same technologies, such as a fast InfiniBand interconnect, parallel storage and multi-node workload management.
As of 2024-9-25, the total resources available are:
Resource | Quantity |
---|---|
CPU cores | AMD Zen 4: 880; AMD Zen 3: 512; AMD Zen 2: 576 |
RAM | 17TB |
GPU | NVIDIA H100 NVL: 7; NVIDIA A100/A800: 14; NVIDIA RTX 3090: 8; NVIDIA RTX 3060: 20 |
Distributed Flash Storage | 175TB; 24GB/s; >500K IOPS |
Archival Storage | 91TB |
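As a quick sanity check on the table above, the per-generation figures can be summed to get cluster-wide totals. The snippet below is only an illustrative sketch; the dictionary names are made up for this example and the numbers are copied directly from the table.

```python
# Figures copied from the resource table above (as of 2024-9-25).
cpu_cores = {"AMD Zen 4": 880, "AMD Zen 3": 512, "AMD Zen 2": 576}
gpus = {
    "NVIDIA H100 NVL": 7,
    "NVIDIA A100/A800": 14,
    "NVIDIA RTX 3090": 8,
    "NVIDIA RTX 3060": 20,
}

# Sum across CPU generations and GPU models.
total_cores = sum(cpu_cores.values())
total_gpus = sum(gpus.values())

print(f"Total CPU cores: {total_cores}")  # Total CPU cores: 1968
print(f"Total GPUs: {total_gpus}")        # Total GPUs: 49
```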
Design
SCRP’s design is similar to that of most HPC clusters and consists of four types of computer servers:
- Login nodes handle user login and light computation.
- Compute nodes handle heavy computation.
- Storage nodes handle file storage.
- Management node handles background tasks.
As a user, you will spend most of your time interacting with the login nodes—tasks such as file transfer and coding should all be done on them. As all nodes share a common file system, your files will be available everywhere even though you are only uploading them to a login node. See Account and Access for details.
You can do light computations on the login nodes, but since they are shared by all users they are not suitable for heavy use. For the latter you will want to run your code on a compute node. SCRP uses the Slurm Workload Manager to allocate computational resources to users. The guides to available software have more details on how to use Slurm.
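To make the idea of requesting resources through Slurm concrete, here is a minimal sketch that assembles a batch script as text. The helper name `make_sbatch_script`, the default resource values and the example command are all hypothetical; the `#SBATCH` options used (`--job-name`, `--cpus-per-task`, `--mem`, `--time`, `--gres`) are standard Slurm options, but SCRP may require or recommend other settings, so treat the software guides as the authority.

```python
def make_sbatch_script(command, job_name="demo", cpus=4, mem_gb=16,
                       gpus=0, time_limit="01:00:00"):
    """Return the text of a Slurm batch script (illustrative defaults only)."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --cpus-per-task={cpus}",   # CPU cores for the task
        f"#SBATCH --mem={mem_gb}G",          # memory per node
        f"#SBATCH --time={time_limit}",      # wall-clock limit (HH:MM:SS)
    ]
    if gpus > 0:
        # Request GPUs as a generic resource only when needed.
        lines.append(f"#SBATCH --gres=gpu:{gpus}")
    lines.append(command)
    return "\n".join(lines) + "\n"


# Hypothetical example: a Python job asking for 8 cores and 1 GPU.
script = make_sbatch_script("python train.py", cpus=8, gpus=1)
print(script)
```

Saving the printed text to a file such as `job.sh` and running `sbatch job.sh` on a login node would hand the job to Slurm, which then runs it on a compute node with the requested resources.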
Hardware Specifications
SCRP consists of 26 nodes in total.
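The node names below use a compact bracket notation: `scrp-node-[15-17]` stands for nodes 15, 16 and 17, while `scrp-node-[14,18]` stands for nodes 14 and 18 only. The following is a small sketch of how such names expand; `expand_hostlist` is a hypothetical helper written for this page and handles only a single bracket group of numbers and ranges.

```python
import re


def expand_hostlist(expr):
    """Expand 'prefix-[1,3-5]' into a list of individual host names."""
    match = re.fullmatch(r"(.*)\[([0-9,\-]+)\](.*)", expr)
    if match is None:
        # No bracket group: the expression is already a single host name.
        return [expr]
    prefix, body, suffix = match.groups()
    hosts = []
    for part in body.split(","):
        if "-" in part:
            # A range such as '15-17' is inclusive at both ends.
            start, end = part.split("-")
            hosts.extend(f"{prefix}{i}{suffix}"
                         for i in range(int(start), int(end) + 1))
        else:
            hosts.append(f"{prefix}{part}{suffix}")
    return hosts


print(expand_hostlist("scrp-node-[15-17]"))
print(expand_hostlist("scrp-node-[14,18]"))
print(expand_hostlist("scrp-data-[1-2]"))
```

On a system with Slurm installed, `scontrol show hostnames` performs the same kind of expansion.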
scrp-login-1
- AMD EPYC 7542 (32 cores, 64 threads)
- 256GB DDR4-3200 ECC RAM
- 500GB SATA SSD x 2 + 10TB SATA HDD x 12
- Mellanox ConnectX-6 200Gb/s NIC
scrp-login-2
scrp-node-[1,2,3]
scrp-node-[8,9]
scrp-node-4
scrp-node-5
scrp-node-[6,7]
scrp-node-8
scrp-node-10
scrp-node-11
scrp-node-12
scrp-node-13
scrp-node-[14,18]
scrp-node-[15-17]
scrp-node-19
scrp-node-20
scrp-node-21
- AMD EPYC 7763 x 2 (128 cores, 128 threads)
- 1TB DDR4-3200 ECC RAM
- NVIDIA H100 NVL 94GB PCIe x 7 with NVLink
- 2TB NVMe SSD x 4
- Mellanox ConnectX-6 200Gb/s NIC
scrp-control is the cluster’s management node.
- AMD EPYC 7302 (16 cores, 32 threads)
- 128GB DDR4-2666 ECC RAM
- Storage Disks:
  - 4TB Intel P4510 NVMe SSD x 2
  - 8TB Intel P4510 NVMe SSD x 2
- Mellanox ConnectX-6 Single 200Gb/s NIC
scrp-data-[1-2] are the cluster’s distributed storage nodes.
- AMD EPYC 7542 (32 cores, 64 threads)
- 256GB DDR4-3200 ECC RAM
- Storage Disks:
  - 1.6TB Intel P4610 NVMe SSD x 2
  - 8TB Intel P4510 NVMe SSD x 6
- Mellanox ConnectX-6 200Gb/s NIC x 2
Staff
Director of Economic Computing Services
Dr. Vinci Chow (vincichow@cuhk.edu.hk)
Assistant Computer Officers
Gary Yeung (garyyeung@cuhk.edu.hk)
Alex Fong (alex-fong@cuhk.edu.hk)