https://github.com/zolotarov/dehydrin_promoters
Raw File
Tip revision: e8dedccb0485299646b5017f03ab2de809a8fbb3 authored by zolotarov on 15 July 2015, 15:36:47 UTC
Updated README to include a link to the paper
Tip revision: e8dedcc
dehydrins_not_Phytozome.md
# Information on resources was obtained from:
http://comparative-legumes.org/

# Cajanus cajan (pigeon pea) dehydrins

pwm_K_seg.py was used to search "/home/yzolotarov/sequences/Pigeonpea/genomedata/annotation/2.2 gene_total/Pigeonpea_gene.pep"

>C.cajan_18977 [mRNA] locus=CcLG07:17633269:17634216:+ [translate_table: standard]
>C.cajan_35463 [mRNA] locus=Scaffold131636:152856:153339:+ [translate_table: standard]
>C.cajan_11145 [mRNA] locus=CcLG06:1316246:1316945:+ [translate_table: standard]
>C.cajan_35494 [mRNA] locus=Scaffold000354:69430:70470:- [translate_table: standard]
>C.cajan_05184 [mRNA] locus=CcLG02:7464449:7464823:- [translate_table: standard]

The coding sequences were manually copied from the Pigeonpea.cds file

>C.cajan_18977  [mRNA]  locus=CcLG07:17633269:17634216:+
ATGGCAGAGGAGCCCCAGAACAAGTATGAGACCTCTCAGCCCAGTGAGGT
TGAGGTCAACGATCGTGGTGTTTTTGACTTTCTCGGTAAGAAAAAGGAAG
AAGACAAGCCTCAGGAGGAGGTCATCGTGACCGAGTTTGAGAACAAGGTG
ACAGTGTCAGGGGAAGCTGACAAGAAACAAGAGGAGGGAGAGAAGAAACC
CAGCCTCTTAGAGAAGCTTCACCGATCTGACAGCAGCTCTAGCTCTTCAA
GTGAGGAGGAAGGAGAAGATGGAGAGAAAAAGAAGAAAAAGAAGAAGGAA
AAGAAAGGGTTGAAGGAGAAGATTGAGGAGAAAATAGGGGGTGATCATCA
CAAGGAGGAGGACACAAGTGTTCCAGTTGAGAAAGTTGAGGTTGTTGATG
CCCCACACCCTGAGGAAAAGAAGGGATTCCTCGAGAAGATTAAGGAAAAG
CTACCAGGACAGAAGAAAGAGGAAGCAACACCTCCTCCACCTGTTGCTGC
ATCATCAGATCATGGTGAGGGTGCTCATCATCATGAAGGAGAGGCAAAGG
AGAAGAAAGGTATCTTAGAGAAGATAAAGGAGAAGCTTCCAGGCTATCAC
CCCAAGACAGAGGAAGAAAAGGAAAAGGAAAAGGAGAGTGGTGCTCACTG
A

>C.cajan_35463  [mRNA]  locus=Scaffold131636:152856:153339:+
ATGGCTGAAGCACAAATACGCGACCAGCTCGGGAACCCTATCCCACTCAC
CGATCAATTCGGTAACCCCGTCAAGTTAACCGACGAGAACGGTAATCCCG
TTGTTCTCACCGGAGTGGCTACCACACTCACTGGTACTGTCCCAGATCTC
TTGCCAACCCAAGCTAGGAACAATGAGACAGACCTTGCTCGTTCTTCCAG
TACTACTTCAACCTCTAGCTCTAGCTCTAGCTCTAGCTCGTCTGAGGACG
ATGAGGAGCAGCATGCAGAAATAACCACCGCTAGCCACCCAACTGTAACC
ACCATTAGCCACCCTGAGCCTGAGAAGAAGGGCTTACTCGAGAAGATCAA
AGAAAAATTGCCTGGCCATCTCAACCAGTAG

>C.cajan_11145  [mRNA]  locus=CcLG06:1316246:1316945:+
ATGGCTGAGGAGAAGCAGCACAAGGACAACGAGTACGACAACACTGCTGA
GGTGGAGGTCAAAGACCGTGGGGTTTTAGATTTTCTCGGGAAGAAAAAGG
AAGAAGAGGCCATCGTCACTGAGTTCGACAACAAGGTCAAGGTTTCTGAT
GAACCTGAGACCAAGCTGCAGGTGGAGCAAGAACCAGAAGAGAAGAAACA
CACCCTTCTTGAGAAGCTCCACCGATCCAACAGCAGCTCCAGCTCCTCGA
GTGATGAAGAGGAGGAAGGAGAAGGTGGAGAGAAGAAGAAGAAGAAGAAG
AAAAAGGATAAGATAGGAGGTGGTGGTCGCAAGGAGCAAGACACCACTGT
TCCGATTGAGAGAGTGGAGGTTGAAGCAGACAGTGAGGATAAGAAGGGTT
TCCTCGACAAGATTAAGGAGAAGCTGCCAGGTCAGCACAAGAAGGAGGAG
GAGGAAGTGCCTGTGCCTCCGACATCATCAGAGTGTGATGCTCCCCACAC
TGAGGCTCATGAAGGGGAGAAGAAGGGCCTTTTAGATAAGATCAAAGAGA
AGCTTCCTGGTTATCACCCCAAGACAAATGAGGACAAAGAAAAGGAGATC
GGAACTCATTGA

>C.cajan_35494  [mRNA]  locus=Scaffold000354:69430:70470:-
ATGGCAAGTTATCAGACTCATCACGATGATCAGGGTCGTAAGATCGATGA
GTATGGCAACCCAGTGAGAGAAACTGACCAATATGGCAACCCAGTTCATG
GCACTACTGCCACTGTTACCTACGGCGGTAAAGCCGATAAGCAGCATGGA
ACTACCGGTGTCTATGATCCTAGCAGCGGTAGGAGTCATCACACTACCGG
TGATCATGATATAAGCACCGGTAGACAACATGATACCACCGGTAGTTACA
CCAGTGATATCGGTAGACAACATGAAACTACCGGTGACTATAATTTAGGC
ACCGGTAAACAACATCATGGATTATCTACCGGTGGTTACACAGGTAATAC
TAATACAAAACATGGAACTACAGGTGATTATGATCTAGGCACCGGTAGAC
AACATCAGACTACCGATGCCTATGGTGGGGACACCGGTAGGCAGCATGGA
ACTACCGGTGACTACGGTGGTGGCCCTACCTATGGAGTCACTACCGCGGA
CACCGGCACTGGTCCTAGAAGTGGAACCACCGGTGGCACCGGTTATGGAG
GCAACACTGGGGGTGCACATACGGATGCACGGAATCAGCAACACTATGGA
AAGGAGCATCGTCACCATGACGAGTCTCATGGCGATCACGAGAAGAAAGG
GATAGTGGATAAGATCAAAGAGAAGCTTCCCGGAGGACACAGTGACAAGT
AG

>C.cajan_05184  [mRNA]  locus=CcLG02:7464449:7464823:-
ATGTCAGGGATCGTTCACAAGATTGGGGAGACCCTTCATGTGGGAGGGCA
CAAGAAAGAGGATGAGCACAAGGGGGAGCACCATGGTGGAGAATACAAGG
GAGAGCATCATGGTGAACATGAGCACAAGGGAGAGCACCATGGTGAAGAG
CACAAAGAAGGGTTCATAGACAAGATCAAGGACAAGATCCATGGTGAGGG
AGAGGGTGAGAAGAAGAAGAAGAAGAAAGAGAAGAAGAAGGATGAACATG
GCCATGACCATGGCCATGACAGCAGCAGCAGTGACAGTGATTAG

The promoters were extracted from /home/yzolotarov/sequences/Pigeonpea/genomedata/assembly/Pigeonpea.scafSeq.LG.fa using sequence_finder.py

# Chickpea Cicer arietinum dehydrins

pwm_K_seg.py was used to find the peptides in /home/yzolotarov/sequences/Carietinum/genomedata/annotation/02.gene/Cicer_arietinum_GA_v1.0.gene.pep.fa

>Ca_20934  locus=Ca2:6381156:6381458:+
ATGTCAGGAATCATCAACAAAATTGGAGAAACTCTTCACATTGGAGGACA
CAAAAAAGAAGAGGAACACAAAGGTGAAAAACATGATCAACACAAAGGAG
AAAAACATGATGAACACAAAGGTGAAAAAAAAGGAGAACACAAGGAAGGT
ATAGTTGAGAAAATCAAAGACAAGATCCATGGTGGTGAGAGTCATGAAAA
CAAAGGTGAGAAGAAAAAGGATAAGAAAAAGAAAGATAAGAAGAAAAATG
AACATGGTCATGATCATCATCATGAGAGTAGTAGCAGCAGCGACAGTGAT
TAG
>Ca_08212  locus=Ca3:26932779:26933261:-
ATGGCAGGAATCATTAACAAAATTGGTGAGACCCTTCATATAGGAGGGGA
TAAGAAAGAAGGTGAACACAAAGGAGAGAGCCATGTTGAACAACAACATG
GGTATGGTGGAGAGCACAAAGGAGAGCAACATGGTGTGTATGGAGGAGAG
CACAAAGGAGAGCAACATGGTTTGTTTGGTCATGGAGGTGAGCACAAAGG
AGAGCAACATGGTGTGTTTGGAGGAGAGCACAAAGGAGAGCAGCATGGTG
TGTATGGTGGAGAGCACAAAGGAGAGCAACATGGTCTGTTTGGTCATGGA
GGAGAACACAAGCCAGAACAACATCATGGAGAACAAAAGGAAGGATTTTT
AGAGAAGATCAAGGACAAGGTCCATGGTGAAGGTGGAGAGGGTGAAAATA
TCAAAAAGAAGAAGGATAAGAAGAAACGTGGTGAACATGGTGGTGAACAT
GGCCATGACAGCAGCAGCAGTGATAGTGATTAG
>Ca_04759  locus=Ca5:30675210:30675999:-
ATGGCAGAGGAGAATCATAACAAGTACGAGGAGACCAACACCACCTTTGA
TTCTGATCATTCTGATATCAAAGACAGAGGTGTTTTTGATTTTCTAGGTG
GTAAGAAAAAGGATGAAGATCACAAACATCAACAAGAGGATGTGATCGCC
ACTGAGTTTGATCATAAGGTCACTCTGTCTGATCAACTGGCAGAGACCAA
GAAAGACGAAGATGAAGTTCAAGTTCAAGTTCCAGTTGAAGGAGAGAAGA
AACACAACGTCTTCGACAAGCTTCACCGATCTGGAAGTTCCTCAAGCTCT
TCGAGCGAAGAGGAAGGAGAAGATGGAGAGAAGAAGAAAAAGAAAAAGAA
GGAAAAGAAGAAGGTAGTGGAAGAGGAGGGTTCAGTTGAAGTTGAAAAAG
TAGAAGTTGTGGATGTAACAGCAGCACAACCAACAGAGGAAAAGAAGGGT
TTCCTTGAGAAAATAAAGGACAAGTTACCAGGACACAAGAAAACAGAGGA
ACCACCTGTTGTTGTTGTATCAGAGACAACAAGTCATGATGAAAGTGAAG
ATGCAGTGAAGGAGAAGAAAGGTATTTTGGAAAAGATCAAAGAGAAACTT
CCTGGTTATCATCCTAAGACTACTACTGAACTTGATGACAAAGATCATCA
CAAGGATGACACAACAACCTCTCATTGA
>Ca_04763  locus=Ca5:30694365:30695043:-
ATGGCGGGAGTTCAATTAAGAGACGAACATGGAAACCCAATCAAACTCAC
TGATGAATTCGGTAATCCAGTTAAGTTAACTGATGAACACGGCAACCCTA
TCCACCTTACGGGTGTAGCAACCACCACCCCTCCAACTTCCACAGGTTTT
GGTTTTGGAACCTATGGTGCCGGTGCTTACGGTGGTGGTGCAACCACACA
TCCGACAACGGTGTCAGATCTAATCTCCACTGAACCAGCCAGACACCATC
CCACCGATCAGGCTGCAGGAAGACTTCGTCGCTCCTCCAGTTCAAGCTCT
AGTTCTAGCTCTTCAGAGGATGATGGGCAAGGAGGGAGGAGGAGGAAGAA
AAAGGGAGTGAAGGATAAGATTAAGGAGAAGGTTCCAGGTGTGGGCAAAA
AGGAGCATTCACAGACAACTACAACTACTGTTCCAGCTGCTGCTGGTCCC
CACCCAACTGCAACGGCAACTCATCACCCAGCTGAGCCAACTGATCAGAA
GAAGGGTATACTTGATAAGATCAAAGAAAAATTGCCTGGCCACCATTCCC
ACTGA
>Ca_12999  locus=Ca8:14698711:14699301:-
ATGTCATATAATCAAGGTCAATACGTCGACCAAACTCGTAGGACCGATGA
ATATGGAAACCCAATTGTCCAAGTTGATCAATATGGCAACCCAATAAATC
AAAGTGGTGTTGGGATGACCGGTGAAGCCGGTAGAACATTTGAAAATCCC
GGTTTAACCGGTCACCATGAGCCACATAAACATGGAGATAATTCAAAATC
CCACACCACTAGTTATGGGACCCACACAGGTAGTGGTGGTGGCACTGGAG
ATGATTATGGGACCCACAATACACGTGGTGGTGGTGGAATGACCGGTGCA
GTCGGTAAAATATTTGGAACAACCGATGATACCGGTAATCATCATGGAGT
TGATCAAACCACAACAGATTATGGGACCCACAACCCAGGTAGTTATGGGA
CCAACCCAGGTAGTTATGGGACCAACACAGGAGGTTATGGAAACACAAAC
ACTGGAAGTGGTTATGGAACAAAAGTGGGACAAGAATTTGGAAGAGAGGA
GGCTCATCATCATGGAGGAGAGCAAAAACATGGAGAGAAGAAAGGGATTA
TGGATAAGATTAAGGAGAAACTTCCTGGGACAGGACACTAG
back to top