; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg024631 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg024631
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold12:20289753..20291455
RNA-Seq ExpressionSpg024631
SyntenySpg024631
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOY08849.1 Uncharacterized protein TCM_024087 [Theobroma cacao]2.3e-2135.38Show/hide
Query:  FPYNRFINNLAWAKY-VEMLRRDFLFERGFGDDLP----RFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVGIEG
        F  ++FI+  A+ +Y   ++ +  + ERG   ++P    + +   I +  W QFC +P+     +VREFYAN+  H +                   + G
Subjt:  FPYNRFINNLAWAKY-VEMLRRDFLFERGFGDDLP----RFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVGIEG

Query:  AQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV
        AQW+ S  E  +F+ + +K E   W+ F+  RLLS+TH S V++DR +L +AI+   SIDVGK+IS  IL   R K   + FP+ IT LC RAGV
Subjt:  AQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]1.3e-2430.5Show/hide
Query:  KYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVG----IEGAQWRLSK
        KY+  +R ++F  +RG   GD+    +P +L   I    W Q C  P      +V+EFYAN   HE    +T  +  +  S R++     ++   +  SK
Subjt:  KYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVG----IEGAQWRLSK

Query:  TEKR------------TFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV
           +             F+   LK +      F++  LL T+HDSTVSR+R+ + + I++   I+VGK+I+ EI +C  +  GKLFF   IT  CR A V
Subjt:  TEKR------------TFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV

Query:  PEDEDDVPLIDKGIID-TPNLARLQRTQEARQGDLVCGIHQMQEQLQLHSSRMEFVERQLQTFWNYVKRRDVALRVALQSNF
        P   D+ P+  KG++   P+ A  + T              M E+L  H +  + +  +LQT WNY + RDV +   L+ N+
Subjt:  PEDEDDVPLIDKGIID-TPNLARLQRTQEARQGDLVCGIHQMQEQLQLHSSRMEFVERQLQTFWNYVKRRDVALRVALQSNF

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.4e-3132.32Show/hide
Query:  FLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEE-------------------------------ECPLTGAQEPLMLSVREVGIEGAQWRLSKTE
        F+   I    W QFCA PE     +VREFYAN+   EE                               E      Q+ L+  +  V   GA+W +S   
Subjt:  FLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEE-------------------------------ECPLTGAQEPLMLSVREVGIEGAQWRLSKTE

Query:  KRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGI
          T   + L   A  W  F+K RLL TTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G 
Subjt:  KRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGI

Query:  IDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALP
        ID   +AR+ +   T+  +Q               GD++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF++P    P
Subjt:  IDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALP

Query:  VFPDDLLNPWIPPPPVEREEDGEEQGQE
         FP ++L         E ++DG  +  E
Subjt:  VFPDDLLNPWIPPPPVEREEDGEEQGQE

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]2.4e-2336.49Show/hide
Query:  LKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR
        L   A  W  F+K RLL TTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR
Subjt:  LKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDL
        +  TQE +                     GD++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF++P    P FP +L
Subjt:  LQRTQEAR--------------------QGDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDL

Query:  LNPWIPPPPVEREEDGEEQGQE
        L         E ++DG  +  E
Subjt:  LNPWIPPPPVEREEDGEEQGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]3.1e-2634.75Show/hide
Query:  IDYHEEECPLTGAQEPLMLSVRE-VGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILD
        +D H E   +    EP +++V E V   GA+W +S     T   + L   A  W  F+K RLL TTH   VS+DR+LL  ++L   SI+VG++I SEI  
Subjt:  IDYHEEECPLTGAQEPLMLSVRE-VGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILD

Query:  CWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQLQLHSSRMEFVERQ
        C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+ +   T+  +Q               GD++  +  ++++L    S+ E   +Q
Subjt:  CWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQLQLHSSRMEFVERQ

Query:  LQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDLLNPWIPPPPVEREEDGEEQGQE
         Q FW Y K RD AL+ ALQ+NF++P    P FP ++L         E ++DG  +  E
Subjt:  LQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDLLNPWIPPPPVEREEDGEEQGQE

TrEMBL top hitse value%identityAlignment
A0A061F2U9 Uncharacterized protein1.1e-2135.38Show/hide
Query:  FPYNRFINNLAWAKY-VEMLRRDFLFERGFGDDLP----RFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVGIEG
        F  ++FI+  A+ +Y   ++ +  + ERG   ++P    + +   I +  W QFC +P+     +VREFYAN+  H +                   + G
Subjt:  FPYNRFINNLAWAKY-VEMLRRDFLFERGFGDDLP----RFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVGIEG

Query:  AQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV
        AQW+ S  E  +F+ + +K E   W+ F+  RLLS+TH S V++DR +L +AI+   SIDVGK+IS  IL   R K   + FP+ IT LC RAGV
Subjt:  AQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV

A0A2P5BCG4 Uncharacterized protein (Fragment)6.9e-3232.32Show/hide
Query:  FLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEE-------------------------------ECPLTGAQEPLMLSVREVGIEGAQWRLSKTE
        F+   I    W QFCA PE     +VREFYAN+   EE                               E      Q+ L+  +  V   GA+W +S   
Subjt:  FLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEE-------------------------------ECPLTGAQEPLMLSVREVGIEGAQWRLSKTE

Query:  KRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGI
          T   + L   A  W  F+K RLL TTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G 
Subjt:  KRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGI

Query:  IDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALP
        ID   +AR+ +   T+  +Q               GD++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF++P    P
Subjt:  IDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALP

Query:  VFPDDLLNPWIPPPPVEREEDGEEQGQE
         FP ++L         E ++DG  +  E
Subjt:  VFPDDLLNPWIPPPPVEREEDGEEQGQE

A0A2P5CEY2 Uncharacterized protein1.2e-2336.49Show/hide
Query:  LKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR
        L   A  W  F+K RLL TTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR
Subjt:  LKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDL
        +  TQE +                     GD++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF++P    P FP +L
Subjt:  LQRTQEAR--------------------QGDLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDL

Query:  LNPWIPPPPVEREEDGEEQGQE
        L         E ++DG  +  E
Subjt:  LNPWIPPPPVEREEDGEEQGQE

A0A2P5DXM3 Uncharacterized protein1.5e-2634.75Show/hide
Query:  IDYHEEECPLTGAQEPLMLSVRE-VGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILD
        +D H E   +    EP +++V E V   GA+W +S     T   + L   A  W  F+K RLL TTH   VS+DR+LL  ++L   SI+VG++I SEI  
Subjt:  IDYHEEECPLTGAQEPLMLSVRE-VGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILD

Query:  CWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQLQLHSSRMEFVERQ
        C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+ +   T+  +Q               GD++  +  ++++L    S+ E   +Q
Subjt:  CWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GDLVCGIHQMQEQLQLHSSRMEFVERQ

Query:  LQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDLLNPWIPPPPVEREEDGEEQGQE
         Q FW Y K RD AL+ ALQ+NF++P    P FP ++L         E ++DG  +  E
Subjt:  LQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDLLNPWIPPPPVEREEDGEEQGQE

A0A7J6FZ22 Uncharacterized protein6.2e-2530.5Show/hide
Query:  KYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVG----IEGAQWRLSK
        KY+  +R ++F  +RG   GD+    +P +L   I    W Q C  P      +V+EFYAN   HE    +T  +  +  S R++     ++   +  SK
Subjt:  KYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVG----IEGAQWRLSK

Query:  TEKR------------TFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV
           +             F+   LK +      F++  LL T+HDSTVSR+R+ + + I++   I+VGK+I+ EI +C  +  GKLFF   IT  CR A V
Subjt:  TEKR------------TFQAAYLKSEANTWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGV

Query:  PEDEDDVPLIDKGIID-TPNLARLQRTQEARQGDLVCGIHQMQEQLQLHSSRMEFVERQLQTFWNYVKRRDVALRVALQSNF
        P   D+ P+  KG++   P+ A  + T              M E+L  H +  + +  +LQT WNY + RDV +   L+ N+
Subjt:  PEDEDDVPLIDKGIID-TPNLARLQRTQEARQGDLVCGIHQMQEQLQLHSSRMEFVERQLQTFWNYVKRRDVALRVALQSNF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGGTATCTGTTACCCCTGAGGTGCAGAAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGT
TGCTGCCACAATTGAAGAAGGAGACCCGCAAGAACCCGATGTACAGAACCCAGAGGAGGCTGAGCAGAGAGTCGAGGATACAGAAAAAGTTCAAGAGGAGCAAACAGAGG
AAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAAATGCAACAGGCAGAAGATGTTCAGGTAACG
GATAATGAGCCAGTGCAAGAGGCTCAAGTGGAGGTGATCATGCCAGAGGTACCAAAACGTCGCCGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATAC
TCCTTCGCCTCCGACCACTGATTCTGAAAGAGAAAATGCAAGAAGAGAGGAACGGAAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAG
AGGAGATTTTGCTCAAACGAAGGGCGGAAAAGGGCAAAAGTGTGGCTGAAGCATCGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCATCAAT
AACCTTGCTTGGGCAAAGTATGTTGAGATGCTGAGAAGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGG
CTGGGGTCAATTTTGTGCGAAGCCGGAACCTGTTAATTCCAACATTGTTCGGGAATTTTACGCCAATATTGACTATCACGAAGAGGAGTGCCCGTTGACTGGAGCCCAAG
AGCCATTAATGCTTTCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAAT
ACATGGATGGGTTTCATCAAGCTACGCTTACTGTCGACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGA
TGTGGGTAAGATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACAATGCTATGCCGAAGGGCAGGGGTGCCAG
AGGATGAGGATGATGTGCCACTAATCGACAAGGGAATAATTGACACACCAAATCTGGCTAGGCTTCAAAGGACGCAGGAAGCACGCCAAGGAGATTTGGTGTGCGGCATC
CACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAATTTGTTGAAAGGCAATTGCAGACCTTCTGGAACTATGTGAAAAGGAGGGATGTCGCGTTGAGGGT
AGCCTTGCAGTCGAATTTTTCCAAGCCATATCTGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATCCCGCCCCCACCTGTTGAACGAGAAGAAGATGGTG
AAGAGCAGGGTCAGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGGTATCTGTTACCCCTGAGGTGCAGAAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGT
TGCTGCCACAATTGAAGAAGGAGACCCGCAAGAACCCGATGTACAGAACCCAGAGGAGGCTGAGCAGAGAGTCGAGGATACAGAAAAAGTTCAAGAGGAGCAAACAGAGG
AAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAAATGCAACAGGCAGAAGATGTTCAGGTAACG
GATAATGAGCCAGTGCAAGAGGCTCAAGTGGAGGTGATCATGCCAGAGGTACCAAAACGTCGCCGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATAC
TCCTTCGCCTCCGACCACTGATTCTGAAAGAGAAAATGCAAGAAGAGAGGAACGGAAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAG
AGGAGATTTTGCTCAAACGAAGGGCGGAAAAGGGCAAAAGTGTGGCTGAAGCATCGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCATCAAT
AACCTTGCTTGGGCAAAGTATGTTGAGATGCTGAGAAGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGG
CTGGGGTCAATTTTGTGCGAAGCCGGAACCTGTTAATTCCAACATTGTTCGGGAATTTTACGCCAATATTGACTATCACGAAGAGGAGTGCCCGTTGACTGGAGCCCAAG
AGCCATTAATGCTTTCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAAT
ACATGGATGGGTTTCATCAAGCTACGCTTACTGTCGACAACTCACGACTCAACGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGA
TGTGGGTAAGATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACAATGCTATGCCGAAGGGCAGGGGTGCCAG
AGGATGAGGATGATGTGCCACTAATCGACAAGGGAATAATTGACACACCAAATCTGGCTAGGCTTCAAAGGACGCAGGAAGCACGCCAAGGAGATTTGGTGTGCGGCATC
CACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAATTTGTTGAAAGGCAATTGCAGACCTTCTGGAACTATGTGAAAAGGAGGGATGTCGCGTTGAGGGT
AGCCTTGCAGTCGAATTTTTCCAAGCCATATCTGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATCCCGCCCCCACCTGTTGAACGAGAAGAAGATGGTG
AAGAGCAGGGTCAGGAAGATTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEVSVTPEVQKRAEEQEKATEVVAATIEEGDPQEPDVQNPEEAEQRVEDTEKVQEEQTEEVREENTEEVREEISEEVQEKQAEDVQMQQAEDVQVT
DNEPVQEAQVEVIMPEVPKRRRVKRKAGRARVVRTDTPSPPTTDSERENARREERKKKEAEDKAREEEAKKAEEEILLKRRAEKGKSVAEASEEPDEIEESRFPYNRFIN
NLAWAKYVEMLRRDFLFERGFGDDLPRFLRTGIVNLGWGQFCAKPEPVNSNIVREFYANIDYHEEECPLTGAQEPLMLSVREVGIEGAQWRLSKTEKRTFQAAYLKSEAN
TWMGFIKLRLLSTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGDLVCGI
HQMQEQLQLHSSRMEFVERQLQTFWNYVKRRDVALRVALQSNFSKPYLALPVFPDDLLNPWIPPPPVEREEDGEEQGQED