-
Notifications
You must be signed in to change notification settings - Fork 1
/
03-computational-resources.Rmd
86 lines (54 loc) · 5.92 KB
/
03-computational-resources.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
# Computational resources
## Wynton <img src="./images/wynton.png" width="20" />
Wynton is a large, shared high performance computer cluster that is available to all UCSF members. If your work is too large to run on your own laptop or would benefit from a highly parallelized workload, Wynton should be used.
- [Wynton Home](https://wynton.ucsf.edu/hpc/index.html)
- How to [get an account](https://wynton.ucsf.edu/hpc/about/join.html)
- [Log in without password](https://wynton.ucsf.edu/hpc/howto/log-in-without-pwd.html) (optional but useful)
- Learn about [Wynton](https://wynton.ucsf.edu/hpc/index.html) and [Wynton tutorial](https://github.com/ucsf-wynton/tutorials/wiki)
- [Join](https://wynton.ucsf.edu/hpc/support/index.html) Wynton slack (optional)
- Join bioinf_genepitools EPPIcenter Slack channel
### Wynton Best Practices
If you plan to use wynton, you should know about the following.
#### Login vs Development vs Data Transfer Nodes
The login (`log`) node is where users can submit jobs or transfer to other nodes, such as [development nodes](https://wynton.ucsf.edu/hpc/get-started/development-prototyping.html). While submitting jobs to the job scheduler is okay, running software on these nodes is not advised.
The development (`dev`) node is where users *may* run quick tests. This is also the node where you may access [prepackaged software](https://wynton.ucsf.edu/hpc/software/software-modules.html) already available on wynton, or install your own software locally. You can access this node by signing into the login node, and then enter:
```bash
ssh dev1
```
to log onto the dev node. There are [multiple dev nodes](https://wynton.ucsf.edu/hpc/get-started/development-prototyping.html) so if `dev1` is not available, there may be an alternative.
The data transfer (`dt`) node should be used if transferring files to and from wynton.
#### Scratch Drive
There is local (`/scratch`) and global (`/wynton/scratch`) scratch space.
##### Local
Local scratch should be used to hold intermediate files produced by your scripts. This is a much faster drive space that is local to the node you are using to run your script.
##### Global
Global scratch is accessible by all nodes and is where you should keep your large files or store results to be shared with others. To use this space, it is recommended to use the following command to create a folder with your username so that your work can be isolated from others:
```bash
mkdir "/wynton/scratch/$(whoami)"
```
As an example, if your username is `jonsnow`, you would now have a folder for your work under `/wynton/scratch/jonsnow`.
## Aragorn <img src="./images/aragorn.png" width="20" />
Aragorn is a moderately powerful desktop computer housed in our lab (built in 2014) that can be used for small workloads. Generally, if you can run it on your relatively new laptop but don't want to tie up using your computer, Aragorn is a good alternative. However, it is limited in its ability, and should only really be used for short term usage. Longer term jobs (up to 2 weeks) should be run on Wynton.
- How to access Aragorn/theMines Who to ask for account R Studio?
- At least one member should be responsible (ie. an admin) for Aragorn, in charge of adding users.
- For now ask our Bioinformatician, Brian Palmer
**Specs:**
| Component | spec |
| ------------ | ------------------- |
| Processor | Intel i7-4770 |
| RAM | 16gb DDR3 |
| Storage | 250gb SSD + 1tb HDD |
| Power Supply | 430W |
## Arwen <img src="./images/arwen.png" width="20" />
## The Mines ⛰️
The Mines are our network attached storage (NAS), essentially a small server with a very large amount of storage (~60 tb). Used for long term storage of raw data. Data is backed up nightly to an offsite location.
## Nextflow <img src="./images/nextflow.png" width="20" />
Nextflow is a workflow language that can generate job submissions for many HPC scheduler platforms. Recommended for production level pipelines. An example is Sun Grid Engine, the scheduler currently used by Wynton HPC (Jan 2023). More information about Nextflow on their [website](https://www.nextflow.io/). Example pipelines can be found under the EPPIcenter Organization on [GitHub](https://github.com/EPPIcenter) or at [nf-co.re](https://nf-co.re/).
## Installing software
If you are using a shared resource, you will find that there are issues with insufficient privileges to install at the site level. You should instead install binaries locally. One package manager that is recommended is [conda](https://docs.conda.io/en/latest/) or [miniconda3](https://docs.conda.io/en/latest/miniconda.html). Conda environment/ best practices (link somewhere)
## Github <img src="./images/github.png" width="20" />
A code repository where members of the EPPIcenter store pipelines, tools and analysis. GitHub enables collaboration and sharing projects under the [EPPIcenter organization](https://github.com/EPPIcenter). Members are recommended to create a GitHub account and request access from one of the organization admins.
## Docker Hub <img src="./images/docker.png" width="20" />
A repository for docker images, to store and share custom images. A user that needs access to the image can easily [pull](https://docs.docker.com/engine/reference/commandline/pull/) the image from the repository. Another use case is configuring nextflow to use docker, which will automatically pull a specified image. To learn more, please refer to the [docker docs](https://docs.docker.com/get-started/overview/).
## Stack Overflow for Teams <img src="./images/stackoverflow.png" width="20" />
A knowledge sharing [platform](https://stackoverflowteams.com/c/eppicenter/questions) for EPPIcenter questions and answers. Users are encouraged to post questions, tutorials, and answers. Additionally, when a user is stuck on a particular problem, they can search through the EPPIcenter Stack Overflow to seek a solution to their particular problem.