CuGenDBv2

Lag0039251 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039251
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:39918168..39919786
RNA-Seq Expression: Lag0039251
Synteny: Lag0039251
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]; e value 9.1e-47; identity 34.2%
Query:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------
        MARFWW   K  RGIHWV W+ LCKSK  GG+GF+DLE FNQALLAKQCWRI++ P S +ARI                                     
Subjt:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------

Query:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-
        L+W   +G S+ VY   W+P     K+ SP  L   TRV DL T +GQWN                       L   D ++WHY+++G++S+KSGY+L  
Subjt:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-

Query:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-
         +   +S  PS+  D +   +WK  W + I NK+K FLWR   D LP    L  R +    +C  C R  ES LH  W C +  ++ R+      WG   
Subjt:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-

Query:  ------------------------ELFVVVLWAVWNCHNQQKFKG
                                 LF  + W +WN  N   F+G
Subjt:  ------------------------ELFVVVLWAVWNCHNQQKFKG

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]; e value 4.5e-46; identity 34.2%
Query:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------
        MARFWW   K  RGIHWV W+ LCKSK  GG+GF+DLE FNQALLAKQCWRI++ P S +ARI                                     
Subjt:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------

Query:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-
        L+W   NG S+ VY   W+P     K+ SP  L   T V DL T +GQWN                       L   D ++WHY+++G++S+KSGY+L  
Subjt:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-

Query:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-
         +   +S  PS   D +   +WK  W + I NK+K FLWR   D LP    L  R +    +C  C R  ES LH  W C +  ++ R+      WG   
Subjt:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-

Query:  ------------------------ELFVVVLWAVWNCHNQQKFKG
                                 LF  + W +WN  N   F+G
Subjt:  ------------------------ELFVVVLWAVWNCHNQQKFKG

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]; e value 1.6e-46; identity 35.02%
Query:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILKWNGESVPVYEANWIPYDGGLKVRSPVTLAPE
        + KA+ARFWW      RGIHW  W+ LC++K  GGMGF+D   FNQAL+AKQ WRI+Q P S +AR+LK     V +Y++NW+P     K  SP TL  +
Subjt:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILKWNGESVPVYEANWIPYDGGLKVRSPVTLAPE

Query:  TRVADLMTETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGYQLGQSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFL
          VADL+ E   W +++                        D+ +WHYDK G +S+KSGYQ+         P SS  DS +  W   W   +  K+KIF+
Subjt:  TRVADLMTETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGYQLGQSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFL

Query:  WRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR---------------------SIFDLLRDVRDKVGWGKFELFVVVLWAVWNCHN
        WR   + LPT  NL  R +    +C  CG   E  +H    C+                      +  +L +++ K G  + EL VV+ W +W   N
Subjt:  WRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR---------------------SIFDLLRDVRDKVGWGKFELFVVVLWAVWNCHN

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]; e value 7.0e-63; identity 37.6%
Query:  ARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK------------------------------------
        ARFWW   K D+ IHWV+W SL   KC GGMGF+DLE+FN+ALLAKQCWRI+  P+S L+R+LK                                    
Subjt:  ARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK------------------------------------

Query:  -W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMT-ETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGYQLG-
         W   NG+SV +Y  NW+P    LK+ S   L   +RV+ L+  E G W   +                       ED+++W+Y+K+G++S++SGY++  
Subjt:  -W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMT-ETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGYQLG-

Query:  QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR--------------SIFDLL
         +    Q PSSSS + +  WW G WKM I NK+K+FLWR CLDRLPT  NL  RGV++ N C  CGR+GE S+H+FW C+              S F +L
Subjt:  QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR--------------SIFDLL

Query:  RDVRDKVGWGKFELFVVVLWAVWNCHNQQKF----KGIGSV-EGLVDWVGSYISSFQQA
        R+  + +    FE   VV+W +WN  N + F    K +  +   LV+W   Y   F++A
Subjt:  RDVRDKVGWGKFELFVVVLWAVWNCHNQQKF----KGIGSV-EGLVDWVGSYISSFQQA

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]; e value 4.5e-46; identity 34.99%
Query:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK-------------------------------
        + KAMARFWW  ++  +GIHW  W+ +  SK  GGMGF+DL  FNQAL+AKQ WRI+Q PSS +AR+LK                               
Subjt:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK-------------------------------

Query:  ------W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGY
              W   NG++V VY  NWIP     K  S  ++  +T VA+L+ E  QW E L                       ED+++WHYDK G +S+KSGY
Subjt:  ------W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGY

Query:  QLGQSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHV---------FWHCRSIFDLLRD
        Q+       + PS S+ D  +  W+  WK+ I  KVKIFLWR   D LPT +NL  + V    MC  C  H E+  H           W   ++ + LR 
Subjt:  QLGQSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHV---------FWHCRSIFDLLRD

Query:  V-RDKVGW---------GKFE--LFVVVLWAVWNCHNQQKFKG
        V R  + W          K E      +LWA+W   N+  F+G
Subjt:  V-RDKVGW---------GKFE--LFVVVLWAVWNCHNQQKFKG

TrEMBL top hits (e value, % identity, alignment)
A0A251NPF0 Reverse transcriptase domain-containing protein; e value 4.4e-47; identity 34.2%
Query:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------
        MARFWW   K  RGIHWV W+ LCKSK  GG+GF+DLE FNQALLAKQCWRI++ P S +ARI                                     
Subjt:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------

Query:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-
        L+W   +G S+ VY   W+P     K+ SP  L   TRV DL T +GQWN                       L   D ++WHY+++G++S+KSGY+L  
Subjt:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-

Query:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-
         +   +S  PS+  D +   +WK  W + I NK+K FLWR   D LP    L  R +    +C  C R  ES LH  W C +  ++ R+      WG   
Subjt:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-

Query:  ------------------------ELFVVVLWAVWNCHNQQKFKG
                                 LF  + W +WN  N   F+G
Subjt:  ------------------------ELFVVVLWAVWNCHNQQKFKG

A0A5E4FZN9 PREDICTED: retrotransposon; e value 2.2e-46; identity 34.2%
Query:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------
        MARFWW   K  RGIHWV W+ LCKSK  GG+GF+DLE FNQALLAKQCWRI++ P S +ARI                                     
Subjt:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------

Query:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-
        L+W   NG S+ VY   W+P     K+ SP  L   T V DL T +GQWN                       L   D ++WHY+++G++S+KSGY+L  
Subjt:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-

Query:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-
         +   +S  PS   D +   +WK  W + I NK+K FLWR   D LP    L  R +    +C  C R  ES LH  W C +  ++ R+      WG   
Subjt:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-

Query:  ------------------------ELFVVVLWAVWNCHNQQKFKG
                                 LF  + W +WN  N   F+G
Subjt:  ------------------------ELFVVVLWAVWNCHNQQKFKG

A0A6J1DAR4 uncharacterized protein LOC111018954; e value 3.4e-63; identity 37.6%
Query:  ARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK------------------------------------
        ARFWW   K D+ IHWV+W SL   KC GGMGF+DLE+FN+ALLAKQCWRI+  P+S L+R+LK                                    
Subjt:  ARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK------------------------------------

Query:  -W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMT-ETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGYQLG-
         W   NG+SV +Y  NW+P    LK+ S   L   +RV+ L+  E G W   +                       ED+++W+Y+K+G++S++SGY++  
Subjt:  -W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMT-ETGQWNEQL---------------------GTEDKMVWHYDKSGLFSIKSGYQLG-

Query:  QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR--------------SIFDLL
         +    Q PSSSS + +  WW G WKM I NK+K+FLWR CLDRLPT  NL  RGV++ N C  CGR+GE S+H+FW C+              S F +L
Subjt:  QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR--------------SIFDLL

Query:  RDVRDKVGWGKFELFVVVLWAVWNCHNQQKF----KGIGSV-EGLVDWVGSYISSFQQA
        R+  + +    FE   VV+W +WN  N + F    K +  +   LV+W   Y   F++A
Subjt:  RDVRDKVGWGKFELFVVVLWAVWNCHNQQKF----KGIGSV-EGLVDWVGSYISSFQQA

A0A803PPQ0 Uncharacterized protein; e value 4.1e-45; identity 30.29%
Query:  ARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK------------------------------------
        A FWW   K ++ +HW +W  LCK K  GG+GF+ L  FNQALLAKQ WR++  P S LAR+LK                                    
Subjt:  ARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK------------------------------------

Query:  -W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN--------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLGQSA
         W   NG    V+E  W+P   G  +     L P+T++ DL+   GQW                       L  ED + W Y  +G + +KSGY++G+  
Subjt:  -W---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN--------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLGQSA

Query:  WISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR---------------------SI
         +    SS+ +D I  WWK  W M +  ++K+F WR C + LP   NL +RG+DV   C LCG   E+  H  W C                      S+
Subjt:  WISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCR---------------------SI

Query:  FDLLRDVRDKVGWGKFELFVVVLWAVWNCHNQ--QKFKGIGSVEGLVDWV
        FD++  ++D +   +FE  + ++WA+W   N+   K   +  ++ L+DW+
Subjt:  FDLLRDVRDKVGWGKFELFVVVLWAVWNCHNQ--QKFKGIGSVEGLVDWV

M5W5F3 Reverse transcriptase domain-containing protein (Fragment); e value 4.4e-47; identity 34.2%
Query:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------
        MARFWW   K  RGIHWV W+ LCKSK  GG+GF+DLE FNQALLAKQCWRI++ P S +ARI                                     
Subjt:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARI-------------------------------------

Query:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-
        L+W   +G S+ VY   W+P     K+ SP  L   TRV DL T +GQWN                       L   D ++WHY+++G++S+KSGY+L  
Subjt:  LKW---NGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTETGQWN---------------------EQLGTEDKMVWHYDKSGLFSIKSGYQLG-

Query:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-
         +   +S  PS+  D +   +WK  W + I NK+K FLWR   D LP    L  R +    +C  C R  ES LH  W C +  ++ R+      WG   
Subjt:  -QSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHCRSIFDLLRDVRDKVGWGKF-

Query:  ------------------------ELFVVVLWAVWNCHNQQKFKG
                                 LF  + W +WN  N   F+G
Subjt:  ------------------------ELFVVVLWAVWNCHNQQKFKG

SwissProt top hits (e value, % identity, alignment)
P93295 Uncharacterized mitochondrial protein AtMg00310; e value 2.1e-14; identity 52.86%
Query:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSK-CYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK
        ++ AM  FWW   +  R I WV+W+ LCKSK   GG+GF+DL  FNQALLAKQ +RI+ QP + L+R+L+
Subjt:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSK-CYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK

Arabidopsis top hits (e value, % identity, alignment)
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 3.2e-05; identity 32.89%
Query:  WISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHC
        W +  P S     ++ W K  W  G   K    +W   LDRLPT   L   G+ +   CGLC    E   H+F  C
Subjt:  WISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHC

AT3G09510.1 Ribonuclease H-like superfamily protein; e value 3.3e-10; identity 29.7%
Query:  DKMVWHYDKSGLFSIKSGYQLGQSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWH
        DK++W+Y+ +G ++++SGY L      + +P+ +     +      W + I+ K+K FLWR     L T + L  RG+ +   C  C R  ES  H  + 
Subjt:  DKMVWHYDKSGLFSIKSGYQLGQSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWH

Query:  C
        C
Subjt:  C

AT4G29090.1 Ribonuclease H-like superfamily protein; e value 4.4e-15; identity 47.69%
Query:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK
        +A FWWR ++  +G+HW +W  L   K  GG+GFKD+E FN ALL KQ WR++ +P S +A++ K
Subjt:  MARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 2.1e-04; identity 30%
Query:  LGQSAWISQLPSSSSDDS---------IMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHC
        L ++A  S LPS SS D+          + W K  W    + +  +  W   L+RLPT D L   G+++ +   LC    E+  H+F+ C
Subjt:  LGQSAWISQLPSSSSDDS---------IMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWHC

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 1.5e-15; identity 52.86%
Query:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSK-CYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK
        ++ AM  FWW   +  R I WV+W+ LCKSK   GG+GF+DL  FNQALLAKQ +RI+ QP + L+R+L+
Subjt:  MSKAMARFWWRGEKVDRGIHWVSWKSLCKSK-CYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILK


Sequences
CDS sequence
ATGAGTAAAGCTATGGCCCGATTCTGGTGGAGGGGGGAGAAAGTGGATCGAGGAATCCATTGGGTGAGTTGGAAGTCCCTATGTAAGTCCAAGTGCTATGGTGGAATGGG
CTTCAAGGATCTGGAGATTTTCAACCAAGCCCTTTTGGCAAAACAGTGCTGGAGAATTGTTCAGCAACCATCCTCCTTCCTTGCGCGTATACTGAAGTGGAATGGGGAAA
GTGTCCCGGTGTATGAAGCGAATTGGATTCCTTATGATGGGGGTCTCAAAGTGCGTTCTCCTGTGACATTGGCCCCAGAGACTAGGGTAGCTGATCTGATGACGGAAACT
GGGCAGTGGAATGAACAGCTTGGTACTGAGGATAAGATGGTATGGCATTATGACAAGTCGGGTCTTTTCTCGATTAAGAGCGGGTATCAGTTAGGCCAATCAGCTTGGAT
CTCACAGCTCCCATCTTCCTCTTCTGACGATTCGATTATGGGTTGGTGGAAGGGGTGTTGGAAAATGGGGATTCTGAATAAGGTAAAGATCTTTTTATGGAGACCATGTC
TAGATCGCCTGCCTACAGTTGATAACCTAGTTTATCGGGGTGTGGATGTGTTGAATATGTGTGGTCTTTGTGGCCGACACGGTGAATCAAGCCTACATGTCTTTTGGCAT
TGCAGGTCTATATTCGATCTCCTTAGAGATGTGAGGGATAAGGTGGGATGGGGAAAATTTGAGTTGTTTGTGGTGGTGCTGTGGGCGGTGTGGAACTGCCACAACCAACA
AAAATTCAAAGGGATTGGGTCTGTGGAGGGACTAGTGGATTGGGTGGGGAGTTACATCTCTTCGTTCCAACAGGCTACCTTGTCTAGTGGCCTTCCGAAGGGAAGGATGG
CAGGCGGGTTTCGGGGTGGTTATTCAGGATTCTGCTGGGCAGGTTATGTTGTCGGCATCATTGGTGCATTAGCACGTACGAAGCCCGGAAATGGCCGAAGGGTGGGCAAC
TGTGAAGGGTATAAGACTGGCTTTGGAGATGGGCTGTTCCCACTGGTGCTGGAGACCGACTCCAGTCGAGTGGCTGGTTTTTTTCATAATGAGGCATTGGAGGACTACTC
TGATTTAGGTTCGCTAGTGGTTGATCTGCGGAAGGAGATTCCGAGGTCTTCATCTGTCGGCTGTATGTTTACCCAGAGAGGGGAAACGGAGTAG
mRNA sequence
ATGAGTAAAGCTATGGCCCGATTCTGGTGGAGGGGGGAGAAAGTGGATCGAGGAATCCATTGGGTGAGTTGGAAGTCCCTATGTAAGTCCAAGTGCTATGGTGGAATGGG
CTTCAAGGATCTGGAGATTTTCAACCAAGCCCTTTTGGCAAAACAGTGCTGGAGAATTGTTCAGCAACCATCCTCCTTCCTTGCGCGTATACTGAAGTGGAATGGGGAAA
GTGTCCCGGTGTATGAAGCGAATTGGATTCCTTATGATGGGGGTCTCAAAGTGCGTTCTCCTGTGACATTGGCCCCAGAGACTAGGGTAGCTGATCTGATGACGGAAACT
GGGCAGTGGAATGAACAGCTTGGTACTGAGGATAAGATGGTATGGCATTATGACAAGTCGGGTCTTTTCTCGATTAAGAGCGGGTATCAGTTAGGCCAATCAGCTTGGAT
CTCACAGCTCCCATCTTCCTCTTCTGACGATTCGATTATGGGTTGGTGGAAGGGGTGTTGGAAAATGGGGATTCTGAATAAGGTAAAGATCTTTTTATGGAGACCATGTC
TAGATCGCCTGCCTACAGTTGATAACCTAGTTTATCGGGGTGTGGATGTGTTGAATATGTGTGGTCTTTGTGGCCGACACGGTGAATCAAGCCTACATGTCTTTTGGCAT
TGCAGGTCTATATTCGATCTCCTTAGAGATGTGAGGGATAAGGTGGGATGGGGAAAATTTGAGTTGTTTGTGGTGGTGCTGTGGGCGGTGTGGAACTGCCACAACCAACA
AAAATTCAAAGGGATTGGGTCTGTGGAGGGACTAGTGGATTGGGTGGGGAGTTACATCTCTTCGTTCCAACAGGCTACCTTGTCTAGTGGCCTTCCGAAGGGAAGGATGG
CAGGCGGGTTTCGGGGTGGTTATTCAGGATTCTGCTGGGCAGGTTATGTTGTCGGCATCATTGGTGCATTAGCACGTACGAAGCCCGGAAATGGCCGAAGGGTGGGCAAC
TGTGAAGGGTATAAGACTGGCTTTGGAGATGGGCTGTTCCCACTGGTGCTGGAGACCGACTCCAGTCGAGTGGCTGGTTTTTTTCATAATGAGGCATTGGAGGACTACTC
TGATTTAGGTTCGCTAGTGGTTGATCTGCGGAAGGAGATTCCGAGGTCTTCATCTGTCGGCTGTATGTTTACCCAGAGAGGGGAAACGGAGTAG
Protein sequence
MSKAMARFWWRGEKVDRGIHWVSWKSLCKSKCYGGMGFKDLEIFNQALLAKQCWRIVQQPSSFLARILKWNGESVPVYEANWIPYDGGLKVRSPVTLAPETRVADLMTET
GQWNEQLGTEDKMVWHYDKSGLFSIKSGYQLGQSAWISQLPSSSSDDSIMGWWKGCWKMGILNKVKIFLWRPCLDRLPTVDNLVYRGVDVLNMCGLCGRHGESSLHVFWH
CRSIFDLLRDVRDKVGWGKFELFVVVLWAVWNCHNQQKFKGIGSVEGLVDWVGSYISSFQQATLSSGLPKGRMAGGFRGGYSGFCWAGYVVGIIGALARTKPGNGRRVGN
CEGYKTGFGDGLFPLVLETDSSRVAGFFHNEALEDYSDLGSLVVDLRKEIPRSSSVGCMFTQRGETE