https://github.com/smangul1/ROP
Revision 5efd8f3c5027e601bd63f65316c8efd41461278c authored by Harry Taegyun Yang on 04 April 2016, 20:46:54 UTC, committed by Harry Taegyun Yang on 04 April 2016, 20:46:54 UTC
1 parent 06ce1e2
Raw File
Tip revision: 5efd8f3c5027e601bd63f65316c8efd41461278c authored by Harry Taegyun Yang on 04 April 2016, 20:46:54 UTC
Delete mini_tests.py
Tip revision: 5efd8f3
README.md
# ROP 1.0 : Read origin protocol
Author: Serghei Mangul (smangul@ucla.edu), Harry Yang (Taegyun) (harry2416@gmail.com)

##Description
ROP is a computational protocol for profiling the composition of unmapped reads, which failed to map to the human references. 
 
 
 
ROP prococol consist of six steps to characterize the unmapped reads:

1. Quality control. Exclude low-quality, low-complexity and rRNA reads (reads mathing rRNA repeat unit) 
2. Identify lost human reads, which are missed due to the heuristics implemented for computational speed in conventional aligners. These include reads with mismatches and short gaps relative to the reference set, but can also include perfectly matched reads.  
3. Identify lost repeat reads, by mapping unmapped reads to the database of repeat sequences (Repbase 20.7)
4. Identify ‘non-co-linear’ RNAs reads from circRNAs, gene fusions, and trans-splicing events, which combine sequence from distant elements.
5. Identify reads from recomobinations of B and T cell receptors (i.e. V(D)J recombinations)
6. Identify mircobial reads using microbial genomes and phylogenetic marker genes to identify microbial reads and assign them to corresponding taxa

If you use this software, please cite :

##Pre-requisites

##Installation


How to run job array on hoffman2:

1) ls *sh | awk '{i+=1;print "if [ $1 == "i" ];then ./"$1" ;fi"}' > myFunc.sh
2) wc -l myFunc.sh
3) qsub -cwd -V -N tophat2 -l h_data=8G,express,time=10:00:00 -t 1-x:1 myFuncFastWrapper.sh
back to top