tags
summer2023, collaboratory

[toc]

Collaboratory schedule - DataLab June 2023

Edit: or on

When: June 20th-30th, 2023, ~9am-5pm each day

Where: Shields Library 360 (DataLab), UC Davis main campus.

Contact: ctbrown@ucdavis.edu, hehouts@ucdavis.edu

Parking suggestion: lots 5/5A are not that far, and you can walk through a small redwood grove! MU parking structure is also nearby (A and C parking). TAPS enforces parking; day parking permits can be purchased through the ParkMobile app.

Food: DataLab staff have started a list of quick lunch options on and around campus. Note: on campus venues may be closed due to summer break.

All information will be posted to our GitHub repository, ngs-docs/2023-june-datalab-collaboratory, and will be available indefinitely.

Introduction and Expectations

This workshop is focused on enabling attendees to improve and expand their existing workflows. All activities are optional but we hope to keep it interesting enough that everyone will attend and participate in the all-hands sessions. But you are also welcome to hide out in a corner and work on your own problems and ask for help periodically!

In particular, we hope to fill in a lot of gaps for people in their mental models of computing, and provide many ideas for how to improve the efficiency with which you work and compute!

This is designed to be a super-friendly workshop where you can ask all those questions about computing that you never felt comfortable asking before.

We're looking forward to seeing you all!

Facilitators and Helpers!!!

Lead facilitators

Pamela
- data science, R, team science, etc!
Hannah
- VScode, github desktop(git GUI), R
Wes
- Statistics and R programming, etc.
Sophie
- pop gen, workflows
Dani
- HPC

Helpers

Mo
- ChatGPT and Git Colab, python
Nistara
- R, some git (command line), emacs
Makan
- Pop-up leader for AI/ML
Colton
- workflow, machine learning, multiomics, image processing, benchmarking
Nick
- statistics, R, Python, Julia, etc.

Daily schedule

Each all-hands session below will follow the same basic format:

intro to topic (15 min)
Q&A, discussion and comparison (30 min - 1 hour)
break out into facilitated co-working groups
reconvene at end to coalesce and retrospect; take notes for pop-ups.

We expect to have "pop-up" sessions on additional topics or techniques as needed/desired.

Days will start at ~9:15, with lunch from noon-2pm; we will end before 5pm every day!

Schedule of topics by day

Tue, June 20th - welcome; git

Setup:

make sure you're on wifi (eduroam) and slack (DataLab, #2023-june-collaboratory)!

9:30am: Morning: welcome & introductions

Sticky note exercise/questions: write 3 sticky notes and put them in groups on the back whiteboard!

Name + scientific domain
Name + computational tools/approach/??
Name + goal for workshop (automation, scalability, validation, ???), or "what you want to work on most".

Lunch: pizza!

2pm: Afternoon session: pinning your project down with version control (git and github)

Need help with git, R, python, or any other data science topics? Check the directory of Datalab workshops!

Wed, June 21st - slurm; conda

9:30am Morning session: (ab)using the HPC for fun and profit (slurm, srun, and sbatch)

additional topic: setting your default editor on Linux

2pm: Afternoon session: software installations that (usually) just work (conda)

Th, June 22nd - scripting; organization

Morning session: automating the heck out everything (shell scripts, R, Python)

Afternoon session: dude, where's my file? (organizing your files)