wtf-rstats

https://rstudio-conf-2022.github.io/wtf-rstats/


Project maintained by rstudio-conf-2022 Hosted on GitHub Pages — Theme by mattgraham

What They Forgot to Teach You About R

rstudio::conf 2022

by Shannon McClintock Pileggi + Jenny Bryan + E. David Aja

rstd.io/wtf-2022-rsc <– The One True URL that links to everything!


🗓️ July 25 and 26, 2022

⏰ 09:00 - 17:00

🏨 Chesapeake 4-6

✍️ rstd.io/conf


Prework

For this workshop, participants will be working from their personal laptops.

Please be ready with current versions of both R and RStudio. This means R >= 4.2.0 and RStudio >= 2022.02.3+492, unless you have a good reason otherwise. See the discussion in Install or upgrade R and RStudio: https://happygitwithr.com/install-r-rstudio.html#install-r-rstudio.

In addition, it is essential that you do the requested pre-workshop Git/GitHub set up in advance: https://happygitwithr.com/workshops.html#pre-workshop-set-up.

Lastly, please make sure your system is ready to build packages. You can confirm this by checking if devtools::has_devel() returns Your system is ready to build packages!. If this returns Could not find tools necessary to compile a package please follow instructions in What They Forgot to Teach You About R Ch 9: Set up an R dev environment.

If you have questions in advance of the workshop, please ask on the RStudio Community thread: https://community.rstudio.com/t/what-they-forgot-to-teach-you-about-r-workshop-rstudio-conf-2022/138999.


Schedule

Day 1

Time Activity Instructor
09:00 - 10:30 1_1: Project-oriented workflow 1 of 2 Shannon Pileggi
10:30 - 11:00 Coffee break  
11:00 - 12:30 1_2: Project-oriented workflow 2 of 2 Shannon Pileggi
12:30 - 13:30 Lunch break  
13:30 - 15:00 1_3: Debugging Shannon Pileggi
15:00 - 15:30 Coffee break  
15:30 - 17:00 1_4: Git/GitHub 1 of 2 Jenny Bryan

Day 2

Time Activity Instructor
09:00 - 10:30 2_1: Personal R Administration 1 of 2 David Aja
10:30 - 11:00 Coffee break  
11:00 - 12:30 2_2: Personal R Administration 2 of 2 David Aja
12:30 - 13:30 Lunch break  
13:30 - 15:00 2_3: Git/GitHub 2 of 2 Jenny Bryan
15:00 - 15:30 Coffee break  
15:30 - 17:00 2_4: Iterating well with purrr Shannon Pileggi

People

Role Name Site GitHub Twitter
Instructor Shannon Pileggi pipinghotdata.com shannonpileggi @PipingHotData
Instructor Jenny Bryan jennybryan.org jennybc @jennybryan
Instructor David Aja edavidaja.com edavidaja @peeltothepithy
TA Amanda Gadrow   ajmcoqui @ajmcoqui
TA Ryan Johnson   ryjohnson09 @ryjohnson09
TA Andrie de Vries   andrie @RevoAndrie
TA Daniel Sjoberg danieldsjoberg.com ddsjoberg @statistishdan
TA Crystal Lewis   Cghlewis @Cghlewis

Overview

This is a two-day hands on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle. You’ll learn holistic workflows that address the most common sources of friction in data analysis. We’ll work on project-oriented workflows, version control for data science (Git/GitHub!), maintaining your R installation, and how to plan for collaboration, communication, and iteration (incl. RMarkdown).

At the conclusion of the workshop, you should have: knowledge of your R installation that enhances your ability to maintain it, workflows that facilitate collaboration with yourself or others (including version control and strategies for file systems), tools to improve debugging processes, and methods to address repetitive tasks with the purrr package.


Is this course for me?

This course will be appropriate for you if you answer yes to these questions:


Learning objectives

Project-oriented workflow (2 sessions)

Git/GitHub (2 sessions)

Personal R Administration (2 sessions)

Debugging (1 session)

Iterating well with purrr (1 session)


Instructors

Shannon Pileggi PhD is a Data Scientist with more than ten years of experience partnering on data analysis with diverse stakeholders in academic, corporate, tech, public health, and clinical research settings. An enthusiastic educator and statistical collaborator, Shannon started her career in academia as a Lecturer with the Institute for Quantitative Theory and Methods at Emory University, and later as an Assistant Professor at California Polytechnic State University, San Luis Obispo. When she transitioned to industry in 2019, Shannon leveraged open source and community resources to adopt modern programming practices and workflows in R. In 2020, Shannon began to give back to the R community through blogging and developing R-Ladies workshops; she currently serves as a member of the R-Ladies Global Team. Shannon spends her days wrangling data, creating reproducible reports, building shiny apps, developing R packages, and mentoring junior Data Scientists at The Prostate Cancer Clinical Trials Consortium, housed under Memorial Sloan Kettering Cancer Center.

Jenny is a software engineer at RStudio, on the tidyverse team. She is a recovering biostatistician who takes special delight in eliminating the small agonies of data analysis. Jenny is the maintainer of many R packages, in the areas of data import, web APIs, Git/GitHub, and package development. She has been working in R/S for over 20 years and is a member of the R Foundation. Prior to joining RStudio, she was a professor of statistics at the University of British Columbia.

David Aja is a Solutions Engineer at RStudio. He helps data scientists and IT communicate about their needs. Before joining RStudio, he worked as a data scientist in the public sector.


This work is licensed under a Creative Commons Attribution 4.0 International License.