Data Analysis with R Workshop

Welcome

Congratulations on registering for the Data Analysis with R workshop! You have 3 days of hands-on R and data analysis learning ahead of you.

Attendance

Each stage of the workshop builds on previous knowledge. Thus, you are expected to attend the entire workshop. Please set aside any outside appointments or experiments!

Cancellation

If you’re registered but will not attend, please contact your coordinator promptly! Space is limited and there may be a waiting list.

Preparation

Software

You are expected to bring your own computer!

Before the workshop, please install:

R (v3.5 or later), and
RStudio Desktop (v1.1 or later)

Both packages are cross-platform and free.

At the beginning of the workshop we’ll initialize an RStudio project that I have already prepared for you. Thus you should have the software installed and a functional internet connection. Please confirm with you coordinator if you need special access privileges where the workshop is being held.

Appropriate Data Sources

I want everyone to leave the workshop with the skills and confidence to begin tackling their own data analysis needs using R!

Unless explicitly stated, this means that you are expected to bring in a dataset that you want to work on!

In previous workshops, participants have brought in a wide variety of data sources, so I’m looking forward to a challenging and lively workshop.

To keep it simple, a flat text file or Excel file will be the easiest thing to start with.

Think about your data analysis objective. What is your research question? What do you want to find out from your data? I’m not going to tell you what to do, but I’ll do my best to tell you how. Thus, the better you can articulate the what, the easier it will be for me to help you with the how.

Inappropriate Data Sources

Special data types, such as

Images,
Sequencing,
Flow cytometry, or
Mass spectrometry reads,

are beyond the scope of this workshop.

Is it possible to work with these kinds of data in R, but you’ll probably have to install additional packages to read in and analyze them (e.g. from BioConductor, which is not covered in this workshop). You’re likely to encounter idiosyncratic data structures which can be pretty difficult for beginners to wrap their heads around. This makes for an overly-complex and discouraging first encounter with R, which is why I prefer to begin with flat text files.

If this is what you’re working on, then it’s preferable to bring in extracted values instead, e.g. output from ImageJ or MaxQuant, that we can begin processing immediately in R. Then you can expand your workflow to encompass more of your data analysis pipeline afterwards.

Schedule

Please check with your coordinator for the location.

The tentative schedule for the workshop is as follows (exact topics and time may vary!):

Day 1

Time	Topic
9:00 - 11:00	Session I - Introduction, Tidyverse intro, case study
11:00 - 11:15	Break
11:15 - 12:30	Session II - Case study and exercise
12:30 - 13:30	Lunch
13:30 - 15:00	Session III - Functions and exercises
15:00 - 15:15	Break
15:15 - 17:00	Session IV - Objects and exercises

Day 2

Time	Topic
9:00 - 11:00	Session I - Using relational and logical operators
11:00 - 11:15	Break
11:15 - 12:30	Session II - Indexing
12:30 - 13:30	Lunch
13:30 - 15:00	Session III - Exercises or own work
15:00 - 15:15	Break
15:15 - 17:00	Session IV - Exercises or own work

Day 3

Time	Topic
9:00 - 11:00	Session I - Reviewing the Tidyverse
11:00 - 11:15	Break
11:15 - 12:30	Session II - Reviewing the Tidyverse and exercises
12:30 - 13:30	Lunch
13:30 - 15:00	Session III - Exercises or own work
15:00 - 15:15	Break
15:15 - 17:00	Session IV - Exercises or own work

Contact

If you have questions, please contact the workshop trainer.