Skip to content

cengenproject/splicing_browser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

104 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Create files to display the splicing data in the genome browser.

Overview

There are 3 inputs: the GFF, a list of samples to exclude, and the bams. The outputs are bigwig (for exonic reads) and bed (for splice junctions) files that can be loaded into a JBrowse2 browser.

The input sequencing must be first processed with the bulk_align pipeline, to yield the aligned BAMs.

The "master script" is src/generate_browser_data.sh, which:

  • in R/sj_to_bed, takes the SJ.out.tab files generated by STAR, generates bigBed files with the counts
  • if needed, merge the BAMs (not needed in bsn12), remove multimappers ( bsn9 uses stringtie_quantif/src/prep_alignments.sh, need update)
  • in R/bam_to_bigwig, takes the combined bams, generates bigWig files with exonic counts
  • exports everything in a xxx_browser.tar.gz archive, to be transferred to the genome browser (see splicing_website)

Steps to run

Beforehand, run the bulk_align pipeline.

In older versions (with bsn9), we'd use the "augmented"" annotation created in stringtie_quantif, now (with bsn12) we directly use the Wormbase annotation.

If needed, update the data/outliers_to_ignore.txt list based on QC.

Edit the parameters at the beginning of src/generate_browser_data.sh, and run it. If everything goes well, the slurm log ends with e.g.

Send to vps with:
scp /home/aw853/ycga_project/splicing_browser/data/outs/231121_browser.tar.gz cengen-vps:/var/www/public_data/splicing

use this command to upload, and see repo splicing_website for next steps.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published