; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020361 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020361
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold1:24923813..24925490
RNA-Seq ExpressionSpg020361
SyntenySpg020361
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]1.0e-2127.07Show/hide
Query:  AASEEPDEIEESQLPYDRFVNNLARAKYAEFL-KRDFLFERGFSGDLQH---------FLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKE------
        AA+ +P           +FV+N A  +Y E +  R+ + E+GF   L H         F+   I   GW+ FC  P      LV+EFYAN+  +      
Subjt:  AASEEPDEIEESQLPYDRFVNNLARAKYAEFL-KRDFLFERGFSGDLQH---------FLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKE------

Query:  -------------EGFLAIVR---AYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLAL
                      G L I      + E+      EQL + ++ + I GAQW LS     T     L+  A  W  F+  RL  +TH  T+SR R +L  
Subjt:  -------------EGFLAIVR---AYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLAL

Query:  AILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARL--------QHSQEARQGGLVYDINTILEQLALS
        A+L    I VG +I D+I  C +K  G L+FP+ I+ L  ++ V+    +  L + G +D   + R+        +  +E  +       +T   + A +
Subjt:  AILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARL--------QHSQEARQGGLVYDINTILEQLALS

Query:  ASRQEFAER---------------------QALTFWNYVKNRDANLKKALQ
        A  QE+ E+                     Q   FW Y ++RD  LKK+ Q
Subjt:  ASRQEFAER---------------------QALTFWNYVKNRDANLKKALQ

EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.1e-2333.64Show/hide
Query:  FLRTCIADHGWEQFCSKPESVNAQLVREFYAN----------------------IDKEEGFLAIVRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTE
        F+   I  HGW QFC  P +    LVREFYAN                      I+   G   +V  Y + A   ++EQL   + EV IEGA W++S   
Subjt:  FLRTCIADHGWEQFCSKPESVNAQLVREFYAN----------------------IDKEEGFLAIVRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTE

Query:  KRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGC-WKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKG
          T     LKR A  W  F+  R  P+TH  TV+++RVLL  +IL  +S+ +  I   EI  C   +K G L+FP+ IT L  +A V  ++ + I+ + G
Subjt:  KRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGC-WKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKG

Query:  IIDTPNLARLQHSQ
         I T +++R+   +
Subjt:  IIDTPNLARLQHSQ

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.9e-2528.77Show/hide
Query:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKEEGF-------------LAIVRAY
        + + +F N+ A+A++  F  R+  FE GF       G     +   +    W +F   P SVNA LV+EFYANI K                 +AI R +
Subjt:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKEEGF-------------LAIVRAY

Query:  NEMAVAPSN---EQLSDAVREVGI------EGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADE
        +   V   +   E+ +D+ +  G+      E  +W   +T + +     L+  A  W  F+K +L PT+H++TVS  R+LL  +++ S  I+VG II  +
Subjt:  NEMAVAPSN---EQLSDAVREVGI------EGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADE

Query:  ISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQHSQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----
        +  C  KK   L FPN IT L ++  V EN  D IL     I    L  L   +  +    V+       + N  +  LAL  A  Q  A+  AL     
Subjt:  ISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQHSQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----

Query:  TFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQE
         F+ YVK+RD  ++   QE         P FP+++L  +      E E D  + P  +
Subjt:  TFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.1e-3229.43Show/hide
Query:  ESQLPYDRFVNNLA-RAKYAEFLKRDFLFERGFSGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANI-DKEEGFLAI------------------
        E++    R+ NN+  R   AE   + F+ +   +     F+   I  H W+QFC+ PE     LVREFYAN+ D EE  + +                  
Subjt:  ESQLPYDRFVNNLA-RAKYAEFLKRDFLFERGFSGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANI-DKEEGFLAI------------------

Query:  ---VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEIS
           V  ++E     + + L   +  V   GA+W +S     T   + L   A  W  F+K RL PTTH  TVS++R+LL  ++L   SI VG +I  EI 
Subjt:  ---VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEIS

Query:  GCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQH------------------SQEARQGGLVYDINTILEQLALSASRQ-----
         C  +K G LFFP+ IT L + A       +  L + G ID   +AR+                    S     G ++  +  + ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQH------------------SQEARQGGLVYDINTILEQLALSASRQ-----

Query:  --EFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ
          +   +Q   FW Y K RD  LKKALQ NF+ P P  PAFP+++L         E + DG  +  +
Subjt:  --EFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.0e-2632.43Show/hide
Query:  LVREFYANI-DKEEGFLAI---------------------VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRL
        LVREFYAN+ D EE  + +                     V  ++E     +  +L   +  V   GA+W +S     T   + L   A  W  F+K RL
Subjt:  LVREFYANI-DKEEGFLAI---------------------VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRL

Query:  PPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDT-----------------PNLA
         PTTH   VS++R+LL  ++L   SI VG +I  EI  C  +K G LFFP+ IT L + A    NE    L + G ID                  P+ +
Subjt:  PPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDT-----------------PNLA

Query:  RLQHSQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ
        R   +  +R  G   D+   L+ L    S+QE   +Q   FW Y K RD  LKKALQ NF+ P P  PAFP+++L         E + DG  +  +
Subjt:  RLQHSQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)1.0e-3229.43Show/hide
Query:  ESQLPYDRFVNNLA-RAKYAEFLKRDFLFERGFSGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANI-DKEEGFLAI------------------
        E++    R+ NN+  R   AE   + F+ +   +     F+   I  H W+QFC+ PE     LVREFYAN+ D EE  + +                  
Subjt:  ESQLPYDRFVNNLA-RAKYAEFLKRDFLFERGFSGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANI-DKEEGFLAI------------------

Query:  ---VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEIS
           V  ++E     + + L   +  V   GA+W +S     T   + L   A  W  F+K RL PTTH  TVS++R+LL  ++L   SI VG +I  EI 
Subjt:  ---VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEIS

Query:  GCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQH------------------SQEARQGGLVYDINTILEQLALSASRQ-----
         C  +K G LFFP+ IT L + A       +  L + G ID   +AR+                    S     G ++  +  + ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQH------------------SQEARQGGLVYDINTILEQLALSASRQ-----

Query:  --EFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ
          +   +Q   FW Y K RD  LKKALQ NF+ P P  PAFP+++L         E + DG  +  +
Subjt:  --EFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ

A0A2P5DXM3 Uncharacterized protein5.0e-2732.43Show/hide
Query:  LVREFYANI-DKEEGFLAI---------------------VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRL
        LVREFYAN+ D EE  + +                     V  ++E     +  +L   +  V   GA+W +S     T   + L   A  W  F+K RL
Subjt:  LVREFYANI-DKEEGFLAI---------------------VRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRL

Query:  PPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDT-----------------PNLA
         PTTH   VS++R+LL  ++L   SI VG +I  EI  C  +K G LFFP+ IT L + A    NE    L + G ID                  P+ +
Subjt:  PPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDT-----------------PNLA

Query:  RLQHSQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ
        R   +  +R  G   D+   L+ L    S+QE   +Q   FW Y K RD  LKKALQ NF+ P P  PAFP+++L         E + DG  +  +
Subjt:  RLQHSQEARQGGLVYDINTILEQLALSASRQEFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQ

A0A6A3BU96 Uncharacterized protein9.4e-2628.77Show/hide
Query:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKEEGF-------------LAIVRAY
        + + +F N+ A+A++  F  R+  FE GF       G     +   +    W +F   P SVNA LV+EFYANI K                 +AI R +
Subjt:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKEEGF-------------LAIVRAY

Query:  NEMAVAPSN---EQLSDAVREVGI------EGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADE
        +   V   +   E+ +D+ +  G+      E  +W   +T + +     L+  A  W  F+K +L PT+H++TVS  R+LL  +++ S  I+VG II  +
Subjt:  NEMAVAPSN---EQLSDAVREVGI------EGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADE

Query:  ISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQHSQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----
        +  C  KK   L FPN IT L ++  V EN  D IL     I    L  L   +  +    V+       + N  +  LAL  A  Q  A+  AL     
Subjt:  ISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQHSQEARQGGLVY-------DINTILEQLAL-SASRQEFAERQAL-----

Query:  TFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQE
         F+ YVK+RD  ++   QE         P FP+++L  +      E E D  + P  +
Subjt:  TFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQE

W9QTD9 Uncharacterized protein5.2e-2433.64Show/hide
Query:  FLRTCIADHGWEQFCSKPESVNAQLVREFYAN----------------------IDKEEGFLAIVRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTE
        F+   I  HGW QFC  P +    LVREFYAN                      I+   G   +V  Y + A   ++EQL   + EV IEGA W++S   
Subjt:  FLRTCIADHGWEQFCSKPESVNAQLVREFYAN----------------------IDKEEGFLAIVRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTE

Query:  KRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGC-WKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKG
          T     LKR A  W  F+  R  P+TH  TV+++RVLL  +IL  +S+ +  I   EI  C   +K G L+FP+ IT L  +A V  ++ + I+ + G
Subjt:  KRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGC-WKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKG

Query:  IIDTPNLARLQHSQ
         I T +++R+   +
Subjt:  IIDTPNLARLQHSQ

W9RBS1 Uncharacterized protein4.8e-2227.07Show/hide
Query:  AASEEPDEIEESQLPYDRFVNNLARAKYAEFL-KRDFLFERGFSGDLQH---------FLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKE------
        AA+ +P           +FV+N A  +Y E +  R+ + E+GF   L H         F+   I   GW+ FC  P      LV+EFYAN+  +      
Subjt:  AASEEPDEIEESQLPYDRFVNNLARAKYAEFL-KRDFLFERGFSGDLQH---------FLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKE------

Query:  -------------EGFLAIVR---AYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLAL
                      G L I      + E+      EQL + ++ + I GAQW LS     T     L+  A  W  F+  RL  +TH  T+SR R +L  
Subjt:  -------------EGFLAIVR---AYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPPTTHDSTVSRERVLLAL

Query:  AILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARL--------QHSQEARQGGLVYDINTILEQLALS
        A+L    I VG +I D+I  C +K  G L+FP+ I+ L  ++ V+    +  L + G +D   + R+        +  +E  +       +T   + A +
Subjt:  AILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARL--------QHSQEARQGGLVYDINTILEQLALS

Query:  ASRQEFAER---------------------QALTFWNYVKNRDANLKKALQ
        A  QE+ E+                     Q   FW Y ++RD  LKK+ Q
Subjt:  ASRQEFAER---------------------QALTFWNYVKNRDANLKKALQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAAGAAAA
AGAAGCTAAAAGAAGAAGAAAACAGCAGAGGGCTGTAGATCAAGAAGTTGTTCAGAAAGCGGCGGAGGATGTTATTGTGGAGGAAGATCCGAAAGAACCAGAAGGACGGA
ATCAAGAGCAGTCTGAGTCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTTCAAGAACAGCAGGCCGAGGATGTGCAAGAAGAACAGGCAGAG
GTTGCACCTGAAGGAGGTAATGAGCCAGAACAAGAGGTTCGAGTGGAGGTGATCATGCCGGAGGTACCCAAACGTCGCCGCATTAAGAGAAAAGCGGGCCGCGTCAAGAA
GGAGGCCGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAAGCTGAAGAAGAAAGATTGCGCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCTGCGGCATCAGAGGAAC
CTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTCGTCAACAATCTTGCCAGAGCAAAATATGCAGAGTTCCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTT
AGTGGTGATCTTCAACATTTTCTGAGGACCTGTATTGCAGACCACGGTTGGGAACAGTTTTGTTCAAAGCCTGAATCTGTGAACGCGCAGTTAGTGCGTGAATTCTATGC
AAATATTGACAAAGAAGAAGGTTTCCTAGCGATTGTTCGAGCATATAATGAGATGGCTGTAGCGCCATCCAATGAGCAGCTGAGTGACGCTGTGAGGGAAGTTGGTATTG
AAGGGGCGCAGTGGCGGCTTTCGAAAACAGAGAAAAGGACGTTCCAGTCAGCCTATTTGAAGAGGGAAGCAAATACTTGGATGGGATTTATCAAACAAAGGCTGCCTCCA
ACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCTTTAGCTATTTTGCGGTCTCTCAGTATTGAGGTGGGAAATATTATTGCTGATGAAATATCTGGATG
TTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTCCAAGCGAGCAGGAGTTTCGGAGAATGAAGGAGATGTGATATTATTTGACAAGGGAA
TCATTGACACGCCTAACTTGGCGCGGCTTCAGCATTCGCAGGAGGCACGCCAGGGTGGGCTTGTCTACGACATTAACACGATTTTAGAACAACTCGCACTGTCGGCCAGT
AGGCAGGAGTTTGCAGAGAGGCAAGCTTTAACTTTTTGGAACTATGTTAAAAATCGTGATGCCAATCTGAAGAAGGCACTGCAAGAGAATTTTTCCGAACCATTTCCAGC
CCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCTTGTCGAGAGAGAAGGAGATGGAGAAGAAGATCCTGGTCAGGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAAGAAAA
AGAAGCTAAAAGAAGAAGAAAACAGCAGAGGGCTGTAGATCAAGAAGTTGTTCAGAAAGCGGCGGAGGATGTTATTGTGGAGGAAGATCCGAAAGAACCAGAAGGACGGA
ATCAAGAGCAGTCTGAGTCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTTCAAGAACAGCAGGCCGAGGATGTGCAAGAAGAACAGGCAGAG
GTTGCACCTGAAGGAGGTAATGAGCCAGAACAAGAGGTTCGAGTGGAGGTGATCATGCCGGAGGTACCCAAACGTCGCCGCATTAAGAGAAAAGCGGGCCGCGTCAAGAA
GGAGGCCGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAAGCTGAAGAAGAAAGATTGCGCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCTGCGGCATCAGAGGAAC
CTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTCGTCAACAATCTTGCCAGAGCAAAATATGCAGAGTTCCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTT
AGTGGTGATCTTCAACATTTTCTGAGGACCTGTATTGCAGACCACGGTTGGGAACAGTTTTGTTCAAAGCCTGAATCTGTGAACGCGCAGTTAGTGCGTGAATTCTATGC
AAATATTGACAAAGAAGAAGGTTTCCTAGCGATTGTTCGAGCATATAATGAGATGGCTGTAGCGCCATCCAATGAGCAGCTGAGTGACGCTGTGAGGGAAGTTGGTATTG
AAGGGGCGCAGTGGCGGCTTTCGAAAACAGAGAAAAGGACGTTCCAGTCAGCCTATTTGAAGAGGGAAGCAAATACTTGGATGGGATTTATCAAACAAAGGCTGCCTCCA
ACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCTTTAGCTATTTTGCGGTCTCTCAGTATTGAGGTGGGAAATATTATTGCTGATGAAATATCTGGATG
TTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTCCAAGCGAGCAGGAGTTTCGGAGAATGAAGGAGATGTGATATTATTTGACAAGGGAA
TCATTGACACGCCTAACTTGGCGCGGCTTCAGCATTCGCAGGAGGCACGCCAGGGTGGGCTTGTCTACGACATTAACACGATTTTAGAACAACTCGCACTGTCGGCCAGT
AGGCAGGAGTTTGCAGAGAGGCAAGCTTTAACTTTTTGGAACTATGTTAAAAATCGTGATGCCAATCTGAAGAAGGCACTGCAAGAGAATTTTTCCGAACCATTTCCAGC
CCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCTTGTCGAGAGAGAAGGAGATGGAGAAGAAGATCCTGGTCAGGAGGATTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERDNEEEEVPVTPEAPKVKAKKKKTPEEKEAKRRRKQQRAVDQEVVQKAAEDVIVEEDPKEPEGRNQEQSESGVADTEEVREENTEEVQEQQAEDVQEEQAE
VAPEGGNEPEQEVRVEVIMPEVPKRRRIKRKAGRVKKEAEDKAREEAEKKAEEERLRKQRADRGKSVAAASEEPDEIEESQLPYDRFVNNLARAKYAEFLKRDFLFERGF
SGDLQHFLRTCIADHGWEQFCSKPESVNAQLVREFYANIDKEEGFLAIVRAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQSAYLKREANTWMGFIKQRLPP
TTHDSTVSRERVLLALAILRSLSIEVGNIIADEISGCWKKKVGKLFFPNTITMLSKRAGVSENEGDVILFDKGIIDTPNLARLQHSQEARQGGLVYDINTILEQLALSAS
RQEFAERQALTFWNYVKNRDANLKKALQENFSEPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPGQED