In this session we will continue annotating sequences, but now with a focus on transcripts, which can be coding and non-coding. In fact, the same gene can encode both coding and non-coding transcripts (Poliseno, Lanza, and Pandolfi 2024):
TBD including short and long reads, assembly with/wo ref, trinity, stringtie, spatial transcriptomics, expression dbs, etc
We will follow the introduction and protocol at:
https://eead-csic-compbio.github.io/get_homologues/plant_pangenome/protocol.html
For the exercises you will need
Note that the software GET_HOMOLOGUES-EST is a part of the GET_HOMOLOGUES package, which you installed in session 2.
The files needed this for this session are:
Note that Rmd files are to be opened with Rstudio.
Please make a folder named ‘transcripts/’ in the same GitHub repo of session 1, and write a brief report. See more recommendations here.