G34-gliomas

Cell-of-origin analysis for "Histone H3.3G34-mutant interneuron progenitors co-opt PDGFRA for gliomagenesis" (Chen*, Deshmukh*, Jessa*, Hadjadj*, et al, Cell, 2020)

View the Project on GitHub fungenomics/G34-gliomas

DOI

G34-gliomas

This repository contains the code & data for the bulk analysis included the G34R/V HGG manuscript (Chen, Deshmukh, Jessa, Hadjadj, et al, Cell, 2020), for the analysis that was performed by our lab.

This repository is meant to enhance the STAR Methods section by providing code for the custom analyses in the manuscript and the exact R dependencies, in order to improve reproducibility for the main results. However, it is not a fully executable workflow. The code is permanently archived on Zenodo at the doi 10.5281/zenodo.7086446.

NOTE: if viewing on GitHub, only code is visible, as inputs, data, and outputs are generally not tracked in git.

Contents:

Map from figures to code

This section contains a pointer from each figure in the paper to the section (§) where it’s generated in the code. For each figure panel, I provide a partial path to the RMD/MD/HTML files within this repository/directory, and then the section in the rendered HTML which specifically produces that panel. As described below in the section on reproducibility, the source data for the figure is typically saved alongside the figure itself.

Figure 2

Figure S2

Figure 3

Figure S5

Figure 5

Figure S6

Notes for reproducibility

rr template & helpers

This repository uses the rr template, which contains a set of R markdown templates to help ensure reproducibility. This template also provides a set of helper functions (located in rr_helpers.R and prefixed by rr_ in the function name) to help encourage documentation.

R and R package versions

The R library for this project is managed with the package renv, which:

  1. maintains an isolated project-specific library in the renv folder,
  2. stores packages according to version
  3. records the R, Bioconductor, and package versions in the file renv.lock, which can be used to reproduce the R package environment elsewhere

The R version used is 3.5.1. The analysis also makes use of our in-house package for scRNAseq visualization, cytobox.

R Markdown

Each markdown/HTML file has a “Reproducibility report” at the bottom, indicating when the document was last rendered, the most recent git commit when it was rendered, the seed, and the R session info.

Figures and source data

For most figures, the source data underlying the plot is saved along side the figure in the respective figures directory. If so, a message is displayed in the markdown/HTML files underneath the chunk which produces the plot, giving the path for the figures/source data within this project directory.

e.g. [figure/source data @ G34-gliomas/bulk_transcriptome_epigenome/figures/01/gsx2_pdgfra_correlation…]

Analysis outputs

For most text file & R object outputs, there is a text file saved next to the object with the extension .desc, with a very brief one-line description of what’s contained in the file.

e.g. for the output file bulk_transcriptome_epigenome/output/02/fgsea_df.tsv, there is an associated description file bulk_transcriptome_epigenome/output/02/fgsea_df.desc

GitHub / version control

This directory is tracked with git and has an associated GitHub repository in the Kleinman lab account at https://github.com/fungenomics/G34-gliomas.

The following are tracked / available on GitHub:

The following are not tracked / available on GitHub:

Contact

Selin Jessa (selin.jessa at mail.mcgill.ca)