; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005509 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005509
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold7:27473347..27482494
RNA-Seq ExpressionSpg005509
SyntenySpg005509
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]3.5e-2426.44Show/hide
Query:  AASEEPDEIEESQLPYDRFVNNFARAKYAE-LLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKE---------
        AA+ +P           +FV+N A  +Y E +  R+ + E+GF      +   P F+   I   GW++FC  P      +V+EFYAN+  +         
Subjt:  AASEEPDEIEESQLPYDRFVNNFARAKYAE-LLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKE---------

Query:  ----------DGFQNFPHA--AYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL
                  +G    P+    + E++     EQL + ++ + I GAQW LS  G  T     L+  A  W  F+  R+L +TH  T+SR R +L +A+L
Subjt:  ----------DGFQNFPHA--AYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL

Query:  RSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------QRMQEVRQGGLIYGINTVLEQLELSASR
            I+VG++I+ +I  C +K  G L+FP+ I  LC ++ V  +  +  L + G +    + R+        ++ +E  +       +T   +   +A  
Subjt:  RSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------QRMQEVRQGGLIYGINTVLEQLELSASR

Query:  QEFAER---------------------QALTFWNYVRNRDANLKKALQ
        QE+ E+                     Q   FW Y R+RD  LKK+ Q
Subjt:  QEFAER---------------------QALTFWNYVRNRDANLKKALQ

EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]2.3e-2333.94Show/hide
Query:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDSVREVGIEGAQWQLSK
        P F+   I  HGW  FC  P +    +VREFYAN+   ++E  F QN   P  A               Y +     ++EQL   + EV IEGA WQ+S 
Subjt:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDSVREVGIEGAQWQLSK

Query:  TGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGC-WKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFD
         G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +  G L+FP+ I  L  +A VP  + + I+ +
Subjt:  TGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGC-WKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFD

Query:  KGIIGTPNLARLQRMQEV
         G I T +++R+ + + V
Subjt:  KGIIGTPNLARLQRMQEV

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.7e-3130.89Show/hide
Query:  ERLLKRRAEKGKSVVAASEEPDEIEESQLPYDRFVNNFARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFY
        ER    R   G   VA         E++    R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFY
Subjt:  ERLLKRRAEKGKSVVAASEEPDEIEESQLPYDRFVNNFARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFY

Query:  ANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHD
        AN+ D E+      G Q                P   ++E +   + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH 
Subjt:  ANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHD

Query:  STVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------QRMQEVR
         TVS++R+LL  ++L   SI+VG+MI  EI  C  +  G LFFP+ I  LCR A  P    +  L + G I    +AR+               R     
Subjt:  STVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------QRMQEVR

Query:  QGGLIYGINTVLEQLELSASRQEFAE-----------RQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
               I   L+ LE   S+QE  +           +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  QGGLIYGINTVLEQLELSASRQEFAE-----------RQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.9e-2232Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N
        +F +  A  +Y E        ++++F+++     + P F+   I  H W+LFCA PE     +VREFY N+   D       G Q               
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N

Query:  FPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISG
         P   ++E V   +  +L   +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI  
Subjt:  FPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISG

Query:  CWKKNVGKLFFPNTIMMLCRRAGVP
        C  +  G LFFP+ I  +CR    P
Subjt:  CWKKNVGKLFFPNTIMMLCRRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]3.8e-2632.76Show/hide
Query:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM
        +VREFYAN+ D E+      G Q                P   ++E +   +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R+
Subjt:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------Q
        LPTTH   VS++R+LL  ++L   SI+VG+MI  EI  C  +  G LFFP+ I  LCR A   V+E    L + G I    +AR+               
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------Q

Query:  RMQEVRQGGLIYGINTVLEQLELSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
        R            +   L+ LE   S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  RMQEVRQGGLIYGINTVLEQLELSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)8.4e-3230.89Show/hide
Query:  ERLLKRRAEKGKSVVAASEEPDEIEESQLPYDRFVNNFARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFY
        ER    R   G   VA         E++    R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFY
Subjt:  ERLLKRRAEKGKSVVAASEEPDEIEESQLPYDRFVNNFARAKYAELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFY

Query:  ANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHD
        AN+ D E+      G Q                P   ++E +   + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH 
Subjt:  ANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHD

Query:  STVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------QRMQEVR
         TVS++R+LL  ++L   SI+VG+MI  EI  C  +  G LFFP+ I  LCR A  P    +  L + G I    +AR+               R     
Subjt:  STVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------QRMQEVR

Query:  QGGLIYGINTVLEQLELSASRQEFAE-----------RQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
               I   L+ LE   S+QE  +           +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  QGGLIYGINTVLEQLELSASRQEFAE-----------RQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

A0A2P5DAQ2 Uncharacterized protein9.4e-2332Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N
        +F +  A  +Y E        ++++F+++     + P F+   I  H W+LFCA PE     +VREFY N+   D       G Q               
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N

Query:  FPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISG
         P   ++E V   +  +L   +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI  
Subjt:  FPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISG

Query:  CWKKNVGKLFFPNTIMMLCRRAGVP
        C  +  G LFFP+ I  +CR    P
Subjt:  CWKKNVGKLFFPNTIMMLCRRAGVP

A0A2P5DXM3 Uncharacterized protein1.8e-2632.76Show/hide
Query:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM
        +VREFYAN+ D E+      G Q                P   ++E +   +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R+
Subjt:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------Q
        LPTTH   VS++R+LL  ++L   SI+VG+MI  EI  C  +  G LFFP+ I  LCR A   V+E    L + G I    +AR+               
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------------Q

Query:  RMQEVRQGGLIYGINTVLEQLELSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
        R            +   L+ LE   S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  RMQEVRQGGLIYGINTVLEQLELSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

W9QTD9 Uncharacterized protein1.1e-2333.94Show/hide
Query:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDSVREVGIEGAQWQLSK
        P F+   I  HGW  FC  P +    +VREFYAN+   ++E  F QN   P  A               Y +     ++EQL   + EV IEGA WQ+S 
Subjt:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDSVREVGIEGAQWQLSK

Query:  TGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGC-WKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFD
         G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +  G L+FP+ I  L  +A VP  + + I+ +
Subjt:  TGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGC-WKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFD

Query:  KGIIGTPNLARLQRMQEV
         G I T +++R+ + + V
Subjt:  KGIIGTPNLARLQRMQEV

W9RBS1 Uncharacterized protein1.7e-2426.44Show/hide
Query:  AASEEPDEIEESQLPYDRFVNNFARAKYAE-LLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKE---------
        AA+ +P           +FV+N A  +Y E +  R+ + E+GF      +   P F+   I   GW++FC  P      +V+EFYAN+  +         
Subjt:  AASEEPDEIEESQLPYDRFVNNFARAKYAE-LLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKE---------

Query:  ----------DGFQNFPHA--AYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL
                  +G    P+    + E++     EQL + ++ + I GAQW LS  G  T     L+  A  W  F+  R+L +TH  T+SR R +L +A+L
Subjt:  ----------DGFQNFPHA--AYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL

Query:  RSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------QRMQEVRQGGLIYGINTVLEQLELSASR
            I+VG++I+ +I  C +K  G L+FP+ I  LC ++ V  +  +  L + G +    + R+        ++ +E  +       +T   +   +A  
Subjt:  RSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARL--------QRMQEVRQGGLIYGINTVLEQLELSASR

Query:  QEFAER---------------------QALTFWNYVRNRDANLKKALQ
        QE+ E+                     Q   FW Y R+RD  LKK+ Q
Subjt:  QEFAER---------------------QALTFWNYVRNRDANLKKALQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGATCACATATCTGGGAAGGCAAAATTGAAATGCGACCGCATTTCTGGAAAAACAGAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAG
CTGGGAGCAAAACTGCCACATCACAGCTCGTTATCCAAATTGCCAAACTGAATTCTGTGTGAGTTTGGTGCATGAACGATCCGCCTGGGGTGAGAAGAAGAAGACACCAG
AAGAAAAAGAAGCTAAAAGAATAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGCGGAGGATATTATTGTGGAAGAAGATTCGAAAGAACCAGAA
GGACGGAATCAAGAGCAGTCTGAGCCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAATTCAAGAAAAGCAGGTCGAGGATGTGCAAAAAGAACA
GGCAGAGGTTGCGCCTGAAGAAGTTAGTGAACAAGAACAGGAGGCTCGTGTGGAGCTACTGACTCTGAAAGAGAAAAAGGCTGAAGAAGAAAGGTTGCTCAAGCGAAGGG
CGGAAAAGGGCAAAAGTGTTGTTGCAGCATCGGAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCT
GAGCTGCTGAAAAGAGACTTCCTGTTTGAAAGAGGATTTAGCGGTGATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCGAAGCC
TGAGTCTGTAAACGCGCAGGTGGTGCGTGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAGAATTTCCCCCATGCAGCTTATAATGAGATGGTTGTAGCGCCAT
CTAATGAGCAGTTAAGTGATTCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACATTTCAGTCCGCTTATCTGAAGAGGGAA
GCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGACGGTCTCGAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTTAG
CATTGATGTAGGGAAGATGATTGTTATTGAGATTTCTGGTTGTTGGAAAAAGAATGTGGGGAAACTGTTCTTTCCGAACACAATCATGATGCTTTGCAGAAGAGCAGGGG
TTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGGTACGCCTAACTTGGCACGACTCCAGCGTATGCAGGAGGTACGTCAAGGTGGGCTTATCTAC
GGCATCAACACGGTTTTAGAACAACTGGAACTTTCAGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATCGTGATGCCAATCT
GAAGAAAGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAGGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGG
AGGATGATGAAGAGCAGGTTTGTCTTCGCGTTAAAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGTAAGGCTGCTTTAAGTCTGAAA
AACGAGAATATAAACCCCTTAAAAATGTGTTTTGATATGTCTGATAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCG
GAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGCAAAACT
GCCACGTCACAGCTCGTTATCCAAATTGCCAAACTGAATTCTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGATCACATATCTGGGAAGGCAAAATTGAAATGCGACCGCATTTCTGGAAAAACAGAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAG
CTGGGAGCAAAACTGCCACATCACAGCTCGTTATCCAAATTGCCAAACTGAATTCTGTGTGAGTTTGGTGCATGAACGATCCGCCTGGGGTGAGAAGAAGAAGACACCAG
AAGAAAAAGAAGCTAAAAGAATAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGCGGAGGATATTATTGTGGAAGAAGATTCGAAAGAACCAGAA
GGACGGAATCAAGAGCAGTCTGAGCCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAATTCAAGAAAAGCAGGTCGAGGATGTGCAAAAAGAACA
GGCAGAGGTTGCGCCTGAAGAAGTTAGTGAACAAGAACAGGAGGCTCGTGTGGAGCTACTGACTCTGAAAGAGAAAAAGGCTGAAGAAGAAAGGTTGCTCAAGCGAAGGG
CGGAAAAGGGCAAAAGTGTTGTTGCAGCATCGGAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCT
GAGCTGCTGAAAAGAGACTTCCTGTTTGAAAGAGGATTTAGCGGTGATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCGAAGCC
TGAGTCTGTAAACGCGCAGGTGGTGCGTGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAGAATTTCCCCCATGCAGCTTATAATGAGATGGTTGTAGCGCCAT
CTAATGAGCAGTTAAGTGATTCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACATTTCAGTCCGCTTATCTGAAGAGGGAA
GCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGACGGTCTCGAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTTAG
CATTGATGTAGGGAAGATGATTGTTATTGAGATTTCTGGTTGTTGGAAAAAGAATGTGGGGAAACTGTTCTTTCCGAACACAATCATGATGCTTTGCAGAAGAGCAGGGG
TTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGGTACGCCTAACTTGGCACGACTCCAGCGTATGCAGGAGGTACGTCAAGGTGGGCTTATCTAC
GGCATCAACACGGTTTTAGAACAACTGGAACTTTCAGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATCGTGATGCCAATCT
GAAGAAAGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAGGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGG
AGGATGATGAAGAGCAGGTTTGTCTTCGCGTTAAAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGTAAGGCTGCTTTAAGTCTGAAA
AACGAGAATATAAACCCCTTAAAAATGTGTTTTGATATGTCTGATAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCG
GAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGCAAAACT
GCCACGTCACAGCTCGTTATCCAAATTGCCAAACTGAATTCTGTTGA
Protein sequenceShow/hide protein sequence
MRSHIWEGKIEMRPHFWKNRESVAGRLEGAKSVLQQSWEQNCHITARYPNCQTEFCVSLVHERSAWGEKKKTPEEKEAKRIRKQQRTEDQEVAQKAAEDIIVEEDSKEPE
GRNQEQSEPGVADTEEVREENTEEIQEKQVEDVQKEQAEVAPEEVSEQEQEARVELLTLKEKKAEEERLLKRRAEKGKSVVAASEEPDEIEESQLPYDRFVNNFARAKYA
ELLKRDFLFERGFSGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQNFPHAAYNEMVVAPSNEQLSDSVREVGIEGAQWQLSKTGKRTFQSAYLKRE
ANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVIEISGCWKKNVGKLFFPNTIMMLCRRAGVPVDEGDVILFDKGIIGTPNLARLQRMQEVRQGGLIY
GINTVLEQLELSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQVCLRVKRVYCTIKWVIPCLRAYDCKAALSLK
NENINPLKMCFDMSDNRAKLWQVLRIELKVVIICPCRKNYFAAAELGFAECSESVAGRLEGAKSVLQQSWEQNCHVTARYPNCQTEFC