-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update Course with more content #31
Conversation
No quiz formatting errors! 🎉 |
No spelling errors! 🎉 |
Re-rendered previews from the latest commit:
Updated at 2023-07-20 with changes from f9404ef |
|
cfbd57e
to
6a76cf6
Compare
582f899
to
b906cdb
Compare
e9041bb
to
2956951
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking good!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry I'm late to the party! This PR was still open when I started, but I took a bit to get through it 🙂
|
||
## Motivation | ||
|
||
Identification of neoantigens is a critical step in predicting response to checkpoint blockade therapy and design of personalized cancer vaccines. This is a cross-disciplinary challenge, involving genomics, proteomics, immunology, and computational approaches. We have built a computational framework called pVACtools that, when paired with a well-established genomics pipeline, produces an end-to-end solution for neoantigen characterization. pVACtools supports identification of altered peptides from different mechanisms, including point mutations, in-frame and frameshift insertions and deletions, and gene fusions. Prediction of peptide:MHC binding is accomplished by supporting an ensemble of MHC Class I and II binding algorithms within a framework designed to facilitate the incorporation of additional algorithms. Prioritization of predicted peptides occurs by integrating diverse data, including mutant allele expression, peptide binding affinities, and determination whether a mutation is clonal or subclonal. Interactive visualization via a Web interface allows clinical users to efficiently generate, review, and interpret results, selecting candidate peptides for individual patient vaccine designs. Additional modules support design choices needed for competing vaccine delivery approaches. One such module optimizes peptide ordering to minimize junctional epitopes in DNA vector vaccines. Downstream analysis commands for synthetic long peptide vaccines are available to assess candidates for factors that influence peptide synthesis. All of the aforementioned steps are executed via a modular workflow consisting of tools for neoantigen prediction from somatic alterations (pVACseq and pVACfuse), prioritization, and selection using a graphical Web-based interface (pVACviz), and design of DNA vector–based vaccines (pVACvector) and synthetic long peptide vaccines. pVACtools is available at [https://www.pvactools.org](https://www.pvactools.org). | ||
Identification of neoantigens is a critical step in predicting response to checkpoint blockade therapy and design of personalized cancer vaccines. | ||
This is a cross-disciplinary challenge, involving genomics, proteomics, immunology, and computational approaches. We have built a computational |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a cross-disciplinary challenge, involving genomics, proteomics, immunology, and computational approaches. We have built a computational | |
This is a cross-disciplinary challenge involving genomics, proteomics, immunology, and computational approaches. We have built a computational |
(or if you really like commas, could do challenge, which involves
🙂 )
the data section of our precision medicine bioinformatics course: | ||
[here](https://pmbio.org/module-02-inputs/0002/05/01/Data/). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This works fine. Could also consider making "the data section of our precision medicine bioinformatics course" the link!
|
||
- `annotated.expression.vcf.gz`: A somatic (tumor-normal) VCF and its tbi index file. The VCF has been | ||
annotated with VEP and has coverage and expression information added. It has also been annotated with | ||
custom VEP plugins that provide wild type and mutant version of the full length protein sequences |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
custom VEP plugins that provide wild type and mutant version of the full length protein sequences | |
custom VEP plugins that provide wild type and mutant versions of the full length protein sequences |
(Assuming "wild type" and "mutant" are both versions.)
|
||
For more detailed information on how the variant input file is created, please refer to the | ||
[input file preparation](https://pvactools.readthedocs.io/en/latest/pvacseq/input_file_prep.html) | ||
section of the pVACtools docs |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
section of the pVACtools docs | |
section of the pVACtools docs. |
|
||
For pVACfuse: | ||
|
||
- `agfusion_results`: A AGFusion output directory with annotated fusion calls |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- `agfusion_results`: A AGFusion output directory with annotated fusion calls | |
- `agfusion_results`: An AGFusion output directory with annotated fusion calls |
and expression data. In our example, the VCF has already been annotated with | ||
this data. For more information about how to add coverage and expression data | ||
to your own VCFs, please see [here](https://pvactools.readthedocs.io/en/latest/pvacseq/input_file_prep/readcounts.html) | ||
and [here](https://pvactools.readthedocs.io/en/latest/pvacseq/input_file_prep/expression.html). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't have ready suggestions for replacement words this time, but I think it's better to have descriptive links instead of ones that say here
. (It might be better for SEO, too 🙂 )
|
||
### Transcript Support Level Filter | ||
|
||
The Transcript Support Level (TSL) Filter, removes neoantigen candidates for |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Transcript Support Level (TSL) Filter, removes neoantigen candidates for | |
The Transcript Support Level (TSL) Filter removes neoantigen candidates for |
for each variants. | ||
|
||
For pVACseq it works as follows. Given a set of neoantigen candidates for a | ||
variant we first group the transcripts into set where all transcripts in a set |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
variant we first group the transcripts into set where all transcripts in a set | |
variant we first group the transcripts into sets where all transcripts in a set |
- Pick all neoantigens with a variant transcript that have a protein_coding Biotype | ||
- Of the remaining candidates, pick the ones with a variant transcript having a | ||
TSL less then the `--maximum-transcript-support-level`. | ||
- Of the remaining candidates, pick the entries with no Problematic Positions | ||
- Of the remaining candidates, pick the ones passing the Anchor Criteria (explained in | ||
more detail further below) | ||
- Of the remaining candidates, pick the one with the lowest MT IC50 Score (Median or Best | ||
depending on the `--top-score-metric`), lowest TSL, and longest transcript. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should be consistent on whether these end with .
s
- Pick all entries with a variant transcript that have a protein_coding Biotype | ||
- Of the remaining entries, pick the ones with a variant transcript having a Transcript Support Level <= `--maximum-transcript-support-level` | ||
- Of the remaining entries, pick the entries with no Problematic Positions | ||
- Of the remaining entries, pick the ones passing the Anchor Criteria (see Criteria Details section below) | ||
- Of the remaining entries, pick the one with the lowest MT IC50 score( Median or Best | ||
depending on the `--top-score-metric`), lowest Transcript Support Level, and longest transcript. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Similarly here with the .
s
New Content Checklist
New content/chapter is in an Rmd file with this kind of format and headers.
New content/chapter contains learning objectives.
Bookdown successfully re-renders and any new content files have been added to the _bookdown.yml.
Spell check runs successfully).
Any newly necessary packages that are needed have been added to the Dockerfile and image.
Images are in the correct format for rendering.
Every new image has alt text and is in a Google Slide.
Each slide is described in the notes of the slide so learners relying on a screen reader can access the content. See https://lastcallmedia.com/blog/accessible-comics for more guidance on this.
The color palette choices of the slide are contrasted in a way that is friendly to those with color vision deficiencies.
You can check this using Color Oracle.