; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005169 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005169
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:11425348..11430319
RNA-Seq ExpressionLag0005169
SyntenyLag0005169
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3462561.1 reverse transcriptase [Gossypium australe]6.3e-3635.22Show/hide
Query:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEE-ND
        MKILSWNVR L NPR  R LRH  +  NPQ+VFF   ++K +    EK+++   + +   V S GS   L L W+  +NI++ SFS  H+D+ I E+   
Subjt:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEE-ND

Query:  RWRFTGFYGNPNPTKKRVEKGRKLLRFEEGSSEHEEAKAIIEKSWVLL---EEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAW
        + R TGFYG+P    +                         EK+W LL     +  YW+QR R +WL   D+NT++FH +A+QR R+N I+ +  ++G  
Subjt:  RWRFTGFYGNPNPTKKRVEKGRKLLRFEEGSSEHEEAKAIIEKSWVLL---EEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAW

Query:  MEGDEEIGKVVIDYFKGMFQ-STNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQIL
           +EE+ ++   YF  +F   + P +  I S    +   +TE  N  L   FT EE+   +  M P+KAP  D L A+F+QK W ++G +    CL  L
Subjt:  MEGDEEIGKVVIDYFKGMFQ-STNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQIL

Query:  N
        N
Subjt:  N

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]1.7e-3629.95Show/hide
Query:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEEN-D
        MKI+SWNV+ L   RTFR  +   + + PQI+  FLS++K   K  E  ++ L F+NCF+V   G    L LLW   +++ + S+S  H+D  I+ EN  
Subjt:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEEN-D

Query:  RWRFTGFYGNPNPTKK------------------------------------------RVEKGRKLLR----FEEG--------SSEHEEAKAIIEKS--
         WR T  YG+P   +K                                          RV + R+ +R     + G        S+   EAK +  K   
Subjt:  RWRFTGFYGNPNPTKK------------------------------------------RVEKGRKLLR----FEEG--------SSEHEEAKAIIEKS--

Query:  --------WV-------------------------------------------LLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQRR-KNKIEGIFSK
                W                                            +L++EEI+WKQR R DWL   DKNT +FHA+AS RR KN+I GI  +
Subjt:  --------WV-------------------------------------------LLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQRR-KNKIEGIFSK

Query:  EGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICL
        +G W E  +E+ ++  ++F  +F +T P +  + +A      K+ E  N  L  PF  EE+ + +  M P+KAP  D L A FFQK+W  V       CL
Subjt:  EGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICL

Query:  QILN
         ILN
Subjt:  QILN

XP_012477795.1 PREDICTED: uncharacterized protein LOC105793429 [Gossypium raimondii]6.3e-3630.42Show/hide
Query:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEE--N
        MKI+ WNVR L +PR  R LR   +  NPQIV  FL ++K   K  E +++   F N   V   G+ S + L W+E + I + S S  H+D+ +  E  +
Subjt:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEE--N

Query:  DRWRFTGFYGNPNPTKKRVE----------------KGRKLLRFEEGSSEHEEAKAIIEKSW--------------------------------------
        + WRFTGFYG+P    K                   K     +FE   +  E  +  I ++W                                      
Subjt:  DRWRFTGFYGNPNPTKKRVE----------------KGRKLLRFEEGSSEHEEAKAIIEKSW--------------------------------------

Query:  --------------------------VLLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQRRK-NKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQST
                                  + +E+EE+YW+QR R +WL   DKNT +FH  AS RR+ N I  + S +G  +  + EI ++   YF+ +F +T
Subjt:  --------------------------VLLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQRRK-NKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQST

Query:  NPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN
                   +  C  I+ N N+ L+K FT EE+   +KGM  +KAP  D    LFFQKYWD+VG D    CL++LN
Subjt:  NPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN

XP_023915763.1 uncharacterized protein LOC112027315 [Quercus suber]1.5e-3730.73Show/hide
Query:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEEN-D
        MK+L+WN R L N R  + L    ++ +P IV  FLS++ S+ +  + ++  + FD CF VP++G    L LLWK  +N+ ++SFS  H+D  I+  +  
Subjt:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEEN-D

Query:  RWRFTGFYGNPNPT--------------------------------------------------------------------KKRVEKGR-KLLRFEEGS
         WR TGFYG P+ +                                                                    KKR+++ + +L R E+ S
Subjt:  RWRFTGFYGNPNPT--------------------------------------------------------------------KKRVEKGR-KLLRFEEGS

Query:  SE---HEEAKAIIEKSWVLLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTNP--ISSII
        +     EE + +  +  VL ++EE  W+QR R  WL   D+NT +FH  A+QR RKN I+ +    G W EGDE +  +++D++  +F S+NP  +  I+
Subjt:  SE---HEEAKAIIEKSWVLLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTNP--ISSII

Query:  KSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN
           + V    +++    DL KPF++EE+ + I+ M P KAP  D +  LFFQ YW  VG D     L  LN
Subjt:  KSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN

XP_037450914.1 uncharacterized protein LOC119321246 [Triticum dicoccoides]7.7e-3430.99Show/hide
Query:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEENDR
        M+ LSWN R L NP   R LR+  +   P ++  F+ ++K   K  E L+ +L F  CF V S G +  + L W + +N+ + +FS  H+D+ + E +  
Subjt:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEENDR

Query:  ---WRFTGFYGNPN--------------------------------------PTK-KRVEKGRKLLRFEE-GSSEHEEAKAIIEKSWVLLEEEEIYWKQR
           WR TGFYG P                                        TK K+++K  +LLR +  G    EE KA + K    L +EEI++KQ 
Subjt:  ---WRFTGFYGNPN--------------------------------------PTK-KRVEKGRKLLRFEE-GSSEHEEAKAIIEKSWVLLEEEEIYWKQR

Query:  LREDWLHWRDKNTTWFHARASQRRK-NKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMF--QSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDK
         R  WL   D+NTT++HA A+QR++ N+I  +   +G++ + + E    V  +++ ++  Q  N +S +++     + +++ EN N  L KPF  EE+  
Subjt:  LREDWLHWRDKNTTWFHARASQRRK-NKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMF--QSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDK

Query:  VIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILNG
         +  M+PSKAP  D   A FFQ++W++V        L  LNG
Subjt:  VIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILNG

TrEMBL top hitse value%identityAlignment
A0A2N9GKW3 Reverse transcriptase domain-containing protein5.8e-3526.33Show/hide
Query:  FLPSLIQRDIGGGWVPAPPDAMKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLN
        F P      I GG   APP AM  L+WN R L NPRT + +    R+ +P +V  FL ++  D    E+L+  L F N FI  S+     L L WK+ + 
Subjt:  FLPSLIQRDIGGGWVPAPPDAMKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLN

Query:  ISINSFSSGHVDITINE-ENDRWRFTGFYGNPNPTKKRVEKG--------------------RKLLRFEEGSSEHEEA----------------------
        + + SFS  H+D  +NE + D WRFTGFYG P  T KR E                       +L+R EE    H  +                      
Subjt:  ISINSFSSGHVDITINE-ENDRWRFTGFYGNPNPTKKRVEKG--------------------RKLLRFEEGSSEHEEA----------------------

Query:  --------------------KAIIEKSWV-----------------------------------------------------------------LLEEEE
                            +A+    W+                                                                 LL +EE
Subjt:  --------------------KAIIEKSWV-----------------------------------------------------------------LLEEEE

Query:  IYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTN--PISSIIKSARMVLCKKITENQNQDLIKPFT
          W+QR R +WL   D+NT +FH RA+QR R+N++  +  ++G W     ++  + ++Y+  +FQ+ N   +  ++++   V    +TE  N  L + +T
Subjt:  IYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTN--PISSIIKSARMVLCKKITENQNQDLIKPFT

Query:  AEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILNGVRTL
        A E+D  +K M P K+P  D L  +F+QKYW ++G D     L  LN  + L
Subjt:  AEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILNGVRTL

A0A2N9I611 Uncharacterized protein4.9e-3425.84Show/hide
Query:  RSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEENDRWRFTGFYG
        R L NPR  R LR   +   P+++  FLS++K + +  E ++  L +D+ F+VPSKG +  L LLW E +++SI S++  H+D  I  +   WRFTGFYG
Subjt:  RSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEENDRWRFTGFYG

Query:  NPNPTKK--------------------------------------------------RVEKG--------------------------------------
        +P   K+                                                  R+++G                                      
Subjt:  NPNPTKK--------------------------------------------------RVEKG--------------------------------------

Query:  ----RKLLRFEEGSSEHEEAKAIIEKSW---------------------------------------------------------------------VLL
             +L RFE+  ++HEE + +I   W                                                                      LL
Subjt:  ----RKLLRFEEGSSEHEEAKAIIEKSW---------------------------------------------------------------------VLL

Query:  EEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKP
        E+EE+YWKQR R  WL   D+NT +FH++A+QR +KN ++G+  KEG W +   ++ ++ ++YF+ +F STN +   + S+   + K +T+  NQ L   
Subjt:  EEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKP

Query:  FTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN
        F  EE+D+ I  M+ SKAP  D   A F+QKYW+ VG   +   L +LN
Subjt:  FTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN

A0A5B6V0I7 Reverse transcriptase3.1e-3635.22Show/hide
Query:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEE-ND
        MKILSWNVR L NPR  R LRH  +  NPQ+VFF   ++K +    EK+++   + +   V S GS   L L W+  +NI++ SFS  H+D+ I E+   
Subjt:  MKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITINEE-ND

Query:  RWRFTGFYGNPNPTKKRVEKGRKLLRFEEGSSEHEEAKAIIEKSWVLL---EEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAW
        + R TGFYG+P    +                         EK+W LL     +  YW+QR R +WL   D+NT++FH +A+QR R+N I+ +  ++G  
Subjt:  RWRFTGFYGNPNPTKKRVEKGRKLLRFEEGSSEHEEAKAIIEKSWVLL---EEEEIYWKQRLREDWLHWRDKNTTWFHARASQR-RKNKIEGIFSKEGAW

Query:  MEGDEEIGKVVIDYFKGMFQ-STNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQIL
           +EE+ ++   YF  +F   + P +  I S    +   +TE  N  L   FT EE+   +  M P+KAP  D L A+F+QK W ++G +    CL  L
Subjt:  MEGDEEIGKVVIDYFKGMFQ-STNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIICLQIL

Query:  N
        N
Subjt:  N

A0A803QB95 Uncharacterized protein1.6e-3727.65Show/hide
Query:  PDAMKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITIN-E
        P AM IL WNV+ L NP T R+L        PQ+V  F+ +SK +    E L   L F  CF+V +KG +  L LLW E +   + SFS  H+D  I  E
Subjt:  PDAMKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVDITIN-E

Query:  ENDRWRFTGFYGNPNPTKKRVE-------------------------------------KGRKLLRFEEGSSEHEEAKAIIEKSW---------------
        E   WRFTGFY +P+P+++                                        + R    FE   ++ E+   I+  +W               
Subjt:  ENDRWRFTGFYGNPNPTKKRVE-------------------------------------KGRKLLRFEEGSSEHEEAKAIIEKSW---------------

Query:  -----------------------------------------------------VLLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQRR-KNKIEGIFS
                                                             +LL++EE +WKQR R  WL   DKNT +FH +AS R+ KN I+G+  
Subjt:  -----------------------------------------------------VLLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQRR-KNKIEGIFS

Query:  KEGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIIC
         +  W+ G++++G+V  DYF  +F S +     ++  + ++  +I+   N+ L++PF  EE+   ++ ++P KAP  D L  LF++K W  +G +   +C
Subjt:  KEGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIIC

Query:  LQILN
        L I+N
Subjt:  LQILN

A0A803QCP7 Uncharacterized protein1.3e-3427.13Show/hide
Query:  GWVPAPPDAMKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVD
        G +   P AM +L WNV+ L NP T  +L  +  S NP++V  F+S+ + +    E L+  L F  CF+V ++G +  + L W   +   + SFS  H+D
Subjt:  GWVPAPPDAMKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVLVLLWKESLNISINSFSSGHVD

Query:  ITIN-EENDRWRFTGFYGNPNPTK--------KRVE----------------------------------------------------------KGRK--
          I  EE   WRFTGFYG+P+PT+        KR+                                                            GRK  
Subjt:  ITIN-EENDRWRFTGFYGNPNPTK--------KRVE----------------------------------------------------------KGRK--

Query:  --------------------------------------------------------LLRFEEGSSEHEEAKAIIE-----KSWVLLEEEEIYWKQRLRED
                                                                   FE   ++ EE   II+     K  ++L +EE +WKQR R  
Subjt:  --------------------------------------------------------LLRFEEGSSEHEEAKAIIE-----KSWVLLEEEEIYWKQRLRED

Query:  WLHWRDKNTTWFHARASQRR-KNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMN
        WL   DKNT  FH +AS R+ KN I+G+  +   W+  +  +GKV  DYFK +F S       +   + ++   I  + N+ L++PFT E++ KV++ + 
Subjt:  WLHWRDKNTTWFHARASQRR-KNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMN

Query:  PSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN
        P KAP  D L +LF++K+W  +G +   +CL ILN
Subjt:  PSKAPRRDDLEALFFQKYWDVVGTDTKIICLQILN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACAAGGAGCGGATGAGGACAACCGGGGAGAAATCGGGCTGGGAGATAAACCCAAGAGGCGAAACCGACAAGTGGGACGGGCCAAGACCGAAGGGGTCGGGCTTTTG
GCCCGACCCCCTGCTCGCTCGGGCCGAGCCCATCCGGCTCCCTTTGGTCCCTACCGCCTCTGGCCGCTCCGGTTCCGCCTGGTTTGACCCGAAACGCCTCCGAACTCCTA
AAAATCCTAGGAGGATGAGTAGGTCACGTCTTCCCCCCTCATCTACAAATTTACTATTGGTGGCACGTGAAGGACGGGGATACCATCTACATCCTCGCCCTATCCCTGAC
TGGGGATTCCCTGTCCCATTAGGGGCGGGGCTCTGTGGCGATCCGATCCCACAAGAATTTTTGCCATCCCTAATTCAAAGGGATATCGGCGGAGGCTGGGTGCCAGCCCC
ACCGGATGCCATGAAAATATTAAGCTGGAACGTTAGGAGTTTGGAGAATCCTCGAACATTCCGAGCACTTCGACACAAATTCAGAAGTGTAAATCCTCAGATAGTTTTTT
TTTTTTTATCAAAATCTAAAAGCGATACCAAAATTGAGGAAAAGCTGAAAAAAGACTTAGTTTTCGACAACTGTTTTATAGTTCCTAGCAAAGGGAGCAACAGTGTGTTA
GTGCTTTTATGGAAGGAGAGTTTGAACATTAGCATCAATTCCTTCTCATCTGGGCACGTTGACATCACTATTAATGAGGAAAATGATAGATGGAGGTTCACAGGCTTTTA
TGGGAATCCCAATCCTACTAAAAAGAGAGTGGAGAAAGGTAGAAAGCTCTTGAGATTCGAAGAAGGTTCGTCGGAGCACGAGGAAGCCAAGGCGATTATCGAGAAGTCTT
GGGTCCTTTTGGAAGAAGAAGAGATTTACTGGAAGCAACGTTTGAGGGAGGATTGGCTCCATTGGAGAGATAAAAATACGACATGGTTCCATGCTCGGGCATCCCAAAGA
AGGAAGAACAAAATTGAAGGCATCTTCAGTAAAGAAGGCGCTTGGATGGAAGGAGACGAGGAGATTGGTAAAGTGGTCATTGATTATTTCAAAGGCATGTTCCAATCCAC
AAATCCTATCAGCAGCATTATTAAGAGTGCGAGAATGGTTCTGTGCAAGAAGATAACGGAAAACCAAAATCAGGACCTTATAAAACCATTCACTGCAGAGGAGTTAGATA
AGGTCATTAAAGGGATGAACCCATCAAAAGCCCCGAGAAGAGATGATTTGGAAGCTTTATTCTTTCAAAAATATTGGGATGTGGTGGGAACCGACACAAAAATTATATGC
CTTCAAATTTTGAATGGAGTGAGGACATTAGATCGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTACAAGGAGCGGATGAGGACAACCGGGGAGAAATCGGGCTGGGAGATAAACCCAAGAGGCGAAACCGACAAGTGGGACGGGCCAAGACCGAAGGGGTCGGGCTTTTG
GCCCGACCCCCTGCTCGCTCGGGCCGAGCCCATCCGGCTCCCTTTGGTCCCTACCGCCTCTGGCCGCTCCGGTTCCGCCTGGTTTGACCCGAAACGCCTCCGAACTCCTA
AAAATCCTAGGAGGATGAGTAGGTCACGTCTTCCCCCCTCATCTACAAATTTACTATTGGTGGCACGTGAAGGACGGGGATACCATCTACATCCTCGCCCTATCCCTGAC
TGGGGATTCCCTGTCCCATTAGGGGCGGGGCTCTGTGGCGATCCGATCCCACAAGAATTTTTGCCATCCCTAATTCAAAGGGATATCGGCGGAGGCTGGGTGCCAGCCCC
ACCGGATGCCATGAAAATATTAAGCTGGAACGTTAGGAGTTTGGAGAATCCTCGAACATTCCGAGCACTTCGACACAAATTCAGAAGTGTAAATCCTCAGATAGTTTTTT
TTTTTTTATCAAAATCTAAAAGCGATACCAAAATTGAGGAAAAGCTGAAAAAAGACTTAGTTTTCGACAACTGTTTTATAGTTCCTAGCAAAGGGAGCAACAGTGTGTTA
GTGCTTTTATGGAAGGAGAGTTTGAACATTAGCATCAATTCCTTCTCATCTGGGCACGTTGACATCACTATTAATGAGGAAAATGATAGATGGAGGTTCACAGGCTTTTA
TGGGAATCCCAATCCTACTAAAAAGAGAGTGGAGAAAGGTAGAAAGCTCTTGAGATTCGAAGAAGGTTCGTCGGAGCACGAGGAAGCCAAGGCGATTATCGAGAAGTCTT
GGGTCCTTTTGGAAGAAGAAGAGATTTACTGGAAGCAACGTTTGAGGGAGGATTGGCTCCATTGGAGAGATAAAAATACGACATGGTTCCATGCTCGGGCATCCCAAAGA
AGGAAGAACAAAATTGAAGGCATCTTCAGTAAAGAAGGCGCTTGGATGGAAGGAGACGAGGAGATTGGTAAAGTGGTCATTGATTATTTCAAAGGCATGTTCCAATCCAC
AAATCCTATCAGCAGCATTATTAAGAGTGCGAGAATGGTTCTGTGCAAGAAGATAACGGAAAACCAAAATCAGGACCTTATAAAACCATTCACTGCAGAGGAGTTAGATA
AGGTCATTAAAGGGATGAACCCATCAAAAGCCCCGAGAAGAGATGATTTGGAAGCTTTATTCTTTCAAAAATATTGGGATGTGGTGGGAACCGACACAAAAATTATATGC
CTTCAAATTTTGAATGGAGTGAGGACATTAGATCGCTAA
Protein sequenceShow/hide protein sequence
MYKERMRTTGEKSGWEINPRGETDKWDGPRPKGSGFWPDPLLARAEPIRLPLVPTASGRSGSAWFDPKRLRTPKNPRRMSRSRLPPSSTNLLLVAREGRGYHLHPRPIPD
WGFPVPLGAGLCGDPIPQEFLPSLIQRDIGGGWVPAPPDAMKILSWNVRSLENPRTFRALRHKFRSVNPQIVFFFLSKSKSDTKIEEKLKKDLVFDNCFIVPSKGSNSVL
VLLWKESLNISINSFSSGHVDITINEENDRWRFTGFYGNPNPTKKRVEKGRKLLRFEEGSSEHEEAKAIIEKSWVLLEEEEIYWKQRLREDWLHWRDKNTTWFHARASQR
RKNKIEGIFSKEGAWMEGDEEIGKVVIDYFKGMFQSTNPISSIIKSARMVLCKKITENQNQDLIKPFTAEELDKVIKGMNPSKAPRRDDLEALFFQKYWDVVGTDTKIIC
LQILNGVRTLDR