Getting help from TAs and other students

To interact with the instructor and TAs, get feedback, and ask questions, please join our discord community. If you don’t have a Discord account, you’ll need to register for one first by going to Discord. Once you’re registered, you’ll want to download the Discord app for your laptop (better than using Discord in the browser). Then, just click ‘join discord’ at the bottom of the widget below.

In addition to interacting with the rest of the class, on our Discord page you’ll also have an opportunity to meet AImee, an Artificial Intelligence ‘bot’ 🤖 we created that uses the powerful ChatGPT to help you understand (and fix!) code as you progress through the course. You’ll learn more about in one of our labs!


Ebooks on general R/bioconductor

R for Data Science, by Hadley Wickham, is really a wonderful (and free!) online book. If you decide to read this, which I highly recommend, then be sure to complement with the R4DS companion solutions manual.

R Graphics Cookbook - an essential reference book for plotting data in R.

RStudio Cheatsheets - great 1-2pg summaries of the essential R/RStudio functions. Print these out and keep them handy when you’re working in RStudio.

Modern Statistics for Modern Biology - A new book that just came out written two of the leaders in biostats, and it looks to be an excellent R-based guide to the topic, with many great coding examples.


Finding help online

Bioconductor’s help site - full of great tutorials, links to videos, and message board for more more complicated questions.

Biostars - More general bioinformatics questions not pertaining to a specific R package are best addressed here.

StackOverflow - great place for programming questions that are not related specifically to bioinformatics.


Getting help without ever leaving RStudio

Help within R - R has a comprehensive help system built right in! Type ?FunctionName right in the R terminal (no quotes and replace FunctionName with the name of the function you need help with).

package vignettes - Each Bioconductor package contains at least one vignette, a document that provides a task-oriented description of package functionality. Vignettes contain executable examples and are intended to be used interactively. To access the vignette for a package, simply type browseVignettes(packageName) (without quotes)

example datasets - type data() to see all available datasets for the packages you have loaded. These can be extremely useful when learning to use new packages.


Take advantage of your local UPenn and Philly community

Below are a few examples of some of the great local resources.

GEMINI workshop series - GEMINI is a monthly series of day-long workshops geared toward helping researchers access and use software tools for the analysis of large and complex datasets in GEnomics, MIcrobiology aNd Immunology. Each workshop in the series focuses on a single piece of software, involves hands-on active learning, and is free and open to the UPenn biomedical research community, including graduate students, postdocs, faculty and staff.

The GEMINI Series was founded and is co-organized by Allie Greenplate, PhD and Dan Beiting, PhD and is supported by Immune Health, the Institute for Immunology, and the Institute for Infectious and Zoonotic Diseases (IIZD).

The R Penn Group (RPG) - The RPG meets to discuss, learn, and collaborate on topics and projects related to statistics, mapping, data wrangling, visualization, and analysis with R programming language through community focused problem solving. RPG will meet at 12pm on the first floor of the Van-Pelt Dietrich Library Center on the second Thursday of the month throughout the Spring and Fall semesters. All inquiries about the group should be directed to the founder and organizer, Jaj Karajgikar (jajk@upenn.edu), the Applied Data Science Librarian at UPenn.

Weekly ‘Code Review’ - Led by Kyle Bittinger, a Computational Biologist and faculty at CHOP. Each week one person presents some programming code they are working on for their research. Attendees get a chance to hear how code was put together to achieve a goal, while the presenter gets feedback, collective troubleshooting, etc. Less experienced programmers will have a chance to learn about design options when programming. Group meets one Wednesdays @ 9am in Johnson 207. Contact Kyle by email at kylebittinger@gmail.com to be added to the mailing list.

Cytomic Data Analysis Workshop - Led by Wade Rogers, the Director of Computational Biology and Research Informatics for the Path BioResource at the Perelman School of Medicine. Given the pervasive nature of flow cytometry in all aspects of biomedical research, the mutiparameter nature of flow experiments, and the increasing abundance of clinical metadata, it only makes sense to leverage the power of R for analyzing the rich datasets produced by flow cytometry. Wade is an expert user of R/bioconductor for flow analysis, and has been very active in tool development in this area, including the development of a ‘fingerprinting’ method for analysing flow data. Contact Wade by email at rogersw@mail.med.upenn.edu for more information on the workshop.

R-Ladies Philly is part of a world-wide organization to promote gender diversity in the R community. R-Ladies Philly is a free, community-driven group that is open to all levels of R users. While the group name does have ‘Ladies’ in the title, they welcome all individuals who share their values. Monthly meetings are skills-focused, with encouragement of interdisciplinary statistical and coding methods.

Girl Develop It - Philadelphia has a local chapter of GDI that hosts speakers, workshops and social events aimed at bringing together and empowering women to learn web and software development in a judgment-free environment.

FemmeHacks is a all-women collegiate hackathon held in Philadelphia at UPenn’s Huntsman Hall, and is hosted by the Penn Women in Computer Science (WiCS). This year’s hackathon will take place in February. Friday, Feb 24th will consist of social mixers, workshops, and teams will be formed at Penn Engineering (220 South 33rd St), with the hackthon taking place the following day on Saturday the 25th at the Pennovation Center (3401 Grays Ferry Ave).


Take an online course to hone and expand your data science skills

We’ve experimented with several purveyors of online content for bioinformatics over the years (Coursera, EdX, etc.), and they are not free and in many cases are not well-designed or maintained. In my opinion, Datacamp offers the best content, in part because the coding happens direclty in your browser…no need to download or install anything! That’s why we use Datacamp for all our homework and extended learning that makes our Learn R! section. During the course, you’ll actually have access to the full content of Datacamp for free, so why not work toward a certificate in data science using R.