; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036157 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036157
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:40609185..40610826
RNA-Seq ExpressionLag0036157
SyntenyLag0036157
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015384883.1 uncharacterized protein LOC107176610 [Citrus sinensis]2.0e-3529.17Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD-KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERG---------------
        GGL LLW +++++ IV++S HHI   V   D K WR TG+YG P+ + +  TW L+RR+       W+  GDFNEIL  +EK RG               
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD-KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERG---------------

Query:  --------PCRG----------------------------QRLMDEVKAYNKEWAKSDHHPILLYIGTQLHQLG--RRKGKRFWFEGFWFRREECKKIIQ
                 CRG                            ++ + E+ AYN +   SDH PI+L +  +   +   +R   R +++  W    ECK+I+ 
Subjt:  --------PCRG----------------------------QRLMDEVKAYNKEWAKSDHHPILLYIGTQLHQLG--RRKGKRFWFEGFWFRREECKKIIQ

Query:  E-------------------------------------STDQESADWK--------------------------------------SHHESEEIVKVVKD
        E                                     ++ ++   WK                                      S   +EEI      
Subjt:  E-------------------------------------STDQESADWK--------------------------------------SHHESEEIVKVVKD

Query:  FKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN
          P KA GPDG PA F+Q +W +V    I  CL ILN   S+   NHT+IA+IPKV +   VSD+RPISLCNV Y+I+AK I N
Subjt:  FKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN

XP_024033492.1 uncharacterized protein LOC112095617 [Citrus clementina]2.0e-3529.17Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD-KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERG---------------
        GGL LLW +++++ IV++S HHI   V   D K WR TG+YG P+ + +  TW L+RR+       W+  GDFNEIL  +EK RG               
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD-KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERG---------------

Query:  --------PCRG----------------------------QRLMDEVKAYNKEWAKSDHHPILLYIGTQLHQLG--RRKGKRFWFEGFWFRREECKKIIQ
                 CRG                            ++ + E+ AYN +   SDH PI+L +  +   +   +R   R +++  W    ECK+I+ 
Subjt:  --------PCRG----------------------------QRLMDEVKAYNKEWAKSDHHPILLYIGTQLHQLG--RRKGKRFWFEGFWFRREECKKIIQ

Query:  E-------------------------------------STDQESADWK--------------------------------------SHHESEEIVKVVKD
        E                                     ++ ++   WK                                      S   +EEI      
Subjt:  E-------------------------------------STDQESADWK--------------------------------------SHHESEEIVKVVKD

Query:  FKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN
          P KA GPDG PA F+Q +W +V    I  CL ILN   S+   NHT+IA+IPKV +   VSD+RPISLCNV Y+I+AK I N
Subjt:  FKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN

XP_024043257.1 uncharacterized protein LOC112099952 [Citrus clementina]2.0e-3531.4Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD-KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERG---------------
        GGL LLW  E D++IV++S HHI   V   + K+WR TG+YG P+   +  TW LLRR+    +  WL  GDFNEIL  SEK  G               
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD-KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERG---------------

Query:  --------PCRGQRLMDEVKAY--------------NKEWAK--------------SDHHPILLYI--GTQLHQLGRRKGKRFWFEGFWFRREECKKIIQ
                 C+G       K Y              +K W +              SDH P++L +   T+L    ++  +RF +E  W   E CK I+Q
Subjt:  --------PCRGQRLMDEVKAY--------------NKEWAK--------------SDHHPILLYI--GTQLHQLGRRKGKRFWFEGFWFRREECKKIIQ

Query:  E---------STDQESADWKSHHES---------------------------------------------EEIVKVVKDFKPLKARGPDGFPAVFYQSNW
        +           D  +A  K+ + S                                             EE+   +    P KA GPDG PA F+Q +W
Subjt:  E---------STDQESADWKSHHES---------------------------------------------EEIVKVVKDFKPLKARGPDGFPAVFYQSNW

Query:  ETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN
         +V    I+ CL ILN   ++   NHT+I LIPK ++   VSD+RPISLCNV Y+I+AK I N
Subjt:  ETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN

XP_024195790.1 uncharacterized protein LOC112198938 [Rosa chinensis]3.4e-4333.04Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIH--TNVVWADKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMD-----
        GGLCLLW +++ V++ ++S  HI    N +   + WRFTG+YGQP  + RH TW+L++ +    +  WLLGGDFNEIL   EKE GP RG+R MD     
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIH--TNVVWADKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMD-----

Query:  -------------------------EVK------AYNKEWA--------------KSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQES
                                 E+K        N  W               +SDH PIL+ +  +  +  +RK K+F FE FW R  +C+ +++  
Subjt:  -------------------------EVK------AYNKEWA--------------KSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQES

Query:  -----------------------------TDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHT
                                     +D+E+    S  + EE+ K +K   P KA GPDGF   FYQ  W+ VG   +    + LN +  ++  N T
Subjt:  -----------------------------TDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHT

Query:  HIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN
         + LIPKV     V   RPISLCNV YKI +K + N
Subjt:  HIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN

XP_042952220.1 uncharacterized protein LOC122289300 [Carya illinoinensis]4.4e-3532.12Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNV--VWADKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMDEVK--
        GGL   WKE ++  I++++  HI   V  V  +  W  TG YG P  S RH +W +L+ +     + WL  GDFNEIL   EK  GP R    M+  +  
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNV--VWADKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMDEVK--

Query:  -------------------------AYNKE----------WA--------------KSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQE
                                 A+ KE          W+               SDH PI +    +L  +  RK K F FE  W   EEC K ++E
Subjt:  -------------------------AYNKE----------WA--------------KSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQE

Query:  S----------------------TDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIP
        +                      TD  +      +  +E+ + +     L + GPDGFPA+FYQ NW  VG Q     L++LN   S+ A N T+IALIP
Subjt:  S----------------------TDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIP

Query:  KVWEATLVSDYRPISLCNVSYKIIAKAIVN
        K    T V+++ PISLCNV+YK+I+K I N
Subjt:  KVWEATLVSDYRPISLCNVSYKIIAKAIVN

TrEMBL top hitse value%identityAlignment
A0A2N9GDA6 Uncharacterized protein2.6e-3329.57Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNV-VWADKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEK------------------
        GGL LLW++EV + + + S+ HI   + +   + W FTG YG  + S R  +W LLRR+H +DD  WL+ GDFNE+L  SEK                  
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNV-VWADKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEK------------------

Query:  -------------------------------ERGPCR--GQRLMDEVKAYNKEWAKSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQES
                                       +RG C    + L    +  +  ++ SDH  ++L +   L +      KRF FE  W + + C+++IQ +
Subjt:  -------------------------------ERGPCR--GQRLMDEVKAYNKEWAKSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQES

Query:  TDQESA---------------------DWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKV
          Q  +                     D       EEI   +    P KA GPDG   +FYQ  W  VG       L+     H +++ N+THI+LIPK 
Subjt:  TDQESA---------------------DWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKV

Query:  WEATLVSDYRPISLCNVSYKIIAKAIVN
            +++ +RPISLCNV +KII+K + N
Subjt:  WEATLVSDYRPISLCNVSYKIIAKAIVN

A0A5B6V0V0 Reverse transcriptase1.2e-3332.38Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD--KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMD-----
        GGLCL WKE++ V + +FS  HIH  +   D   +WRFTG+YG P    ++  W+LLRR+ + D   WL+ GDFNEIL+   K  G  R ++ MD     
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWAD--KQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMD-----

Query:  ------------EVKAYNKEWAKS---------DHHPILLYIGTQLH-QLGRRKGKRFWFE---GFWFR--------------------------REECK
                    E +   KE +K          +   +   I T++H  L   K + +W +     W +                           EE +
Subjt:  ------------EVKAYNKEWAKS---------DHHPILLYIGTQLH-QLGRRKGKRFWFE---GFWFR--------------------------REECK

Query:  KIIQ--ESTDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPIS
        +I +  E  D  S  ++   ++E+I   +K+  P KA G DGFPA+F+Q  W  VG+      L +LN    +  +N   I LIPK+ + T + ++RPIS
Subjt:  KIIQ--ESTDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPIS

Query:  LCNVSYKIIAKAIVN
        LC V YK+IAKAIV+
Subjt:  LCNVSYKIIAKAIVN

A0A5B6WI49 Reverse transcriptase1.8e-3434.34Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWA-DKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMDEVKAYN
        GGL L W+ +  V + +FS  HI   V  A +K+WR TG YG P  + R   W+LLRR+ N  +  WL+  DFNEIL+ SEK+ G  R ++ M+E +   
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWA-DKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMDEVKAYN

Query:  KEWAKSDHHPILLYIGTQL-------------HQLGRRKGK-RFWFEGFWFRREECKKIIQE---------------------------STDQESADWKS
        ++   +D    L Y G  L              +L R  G    W       R   K+I+ +                             ++++ + K+
Subjt:  KEWAKSDHHPILLYIGTQL-------------HQLGRRKGK-RFWFEGFWFRREECKKIIQE---------------------------STDQESADWKS

Query:  HHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN
        +   EEI   +K+ +P KA G DG PA+FYQ  W  +G+     CL+ LN    V   N T+I LIPKV     +S +RPISLCNV YK+IAKAI N
Subjt:  HHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN

A0A6P4B957 uncharacterized protein LOC1074292314.7e-3530.26Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWADKQWRF-TGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEIL-----------------WDSEKE
        GGL L W+ ++ V + ++SV HI + ++   K W + TG YG P+ + RH +W+LLRR+ +     W + GDFNEIL                 W + +E
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWADKQWRF-TGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEIL-----------------WDSEKE

Query:  RGPCRGQRLMDE-------------VKAYNKEWAKSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKII------------------------
         G    Q L+D              V   +   + SDH PILL +     +   + GKRF FE  W +  EC++II                        
Subjt:  RGPCRGQRLMDE-------------VKAYNKEWAKSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKII------------------------

Query:  -------------------------------------QESTDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILN
                                                +D+ + +      ++E+   +    P KA G DG PA+F+Q  W  VG Q    CL++LN
Subjt:  -------------------------------------QESTDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILN

Query:  GDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN
         +  V   NHT IALIPK+ E   V+DYRPISLC V YK+I+K IVN
Subjt:  GDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN

A0A803PPS5 Uncharacterized protein4.9e-3231.72Show/hide
Query:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWADKQ-WRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMDEVK---
        G L LLW  EV   I +FS  HI + +   D Q WRFTG YG PD + R  +W LL RV       W++ GDFNEIL    K  G  +   LM+  +   
Subjt:  GGLCLLWKEEVDVTIVNFSVHHIHTNVVWADKQ-WRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEKERGPCRGQRLMDEVK---

Query:  ---------------------------------AYNKEW--------------AKSDHHPILL--YIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQE
                                           N+EW                SDH P+LL     +Q   L  +   RF FE  W   E+C +II +
Subjt:  ---------------------------------AYNKEW--------------AKSDHHPILL--YIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQE

Query:  STDQESADWKSHHE-------------------------SEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIA
        S   ESA+  +  E                         S +I + ++   PLKA G DG   +FY+  W T+G++    CL ILN    V     T I 
Subjt:  STDQESADWKSHHE-------------------------SEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIA

Query:  LIPKVWEATLVSDYRPISLCNVSYKIIAKAI
        LIPK  ++  +S+++PISLCNV YKI+AK +
Subjt:  LIPKVWEATLVSDYRPISLCNVSYKIIAKAI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.4e-0433.33Show/hide
Query:  EIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVW-EATLVSDYRPISLCNVSYKIIAKAIVN
        EIV ++      K+ GPDGF A FYQ   E +    +     I        ++    I LIPK   + T   ++RPISL N+  KI+ K + N
Subjt:  EIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVW-EATLVSDYRPISLCNVSYKIIAKAIVN

P08548 LINE-1 reverse transcriptase homolog1.1e-0432.63Show/hide
Query:  SEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVW-EATLVSDYRPISLCNVSYKIIAKAIVN
        S EI   +++    K+ GPDGF + FYQ+  E +    +     I         +   +I LIPK   + T   +YRPISL N+  KI+ K + N
Subjt:  SEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVW-EATLVSDYRPISLCNVSYKIIAKAIVN

P11369 LINE-1 retrotransposable element ORF2 protein9.3e-0430.71Show/hide
Query:  REECKKIIQESTDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNG-DHSV-------RAWNHTHIALIPKVW-
        R +  K+ Q+  D       S    +EI  V+      K+ GPDGF A FYQ+  E        D + IL+   H +        ++    I LIPK   
Subjt:  REECKKIIQESTDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNG-DHSV-------RAWNHTHIALIPKVW-

Query:  EATLVSDYRPISLCNVSYKIIAKAIVN
        + T + ++RPISL N+  KI+ K + N
Subjt:  EATLVSDYRPISLCNVSYKIIAKAIVN

P14381 Transposon TX1 uncharacterized 149 kDa protein2.2e-0528.57Show/hide
Query:  EEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAI
        +E+ + ++     K+ G DG    F+Q  W+T+G        E         +     ++L+PK  +  L+ ++RP+SL +  YKI+AKAI
Subjt:  EEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein9.5e-1242.53Show/hide
Query:  EEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKII
        +EI   V      KA GPD F A F+  +W  V D TIA   E     H ++ +N T I LIPKV     +S +RP+S C V YKII
Subjt:  EEIVKVVKDFKPLKARGPDGFPAVFYQSNWETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKII


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGACTGTGAAGAGGCGGGACTGTTTCTGGATGTTTTATGTTGGAATGTTCGTGGGTTGGGGAGCCCATGGACATTTCGAAATCGGGGGATTGTGTCTCTTATGGAA
GGAAGAGGTTGATGTCACAATCGTTAATTTTTCGGTTCATCATATTCATACGAATGTTGTGTGGGCCGACAAACAATGGAGATTTACAGGTGTTTATGGTCAACCAGATC
ATTCCCTTAGACATCGAACGTGGGACCTTCTACGTAGGGTGCACAATCATGATGATTCTGCATGGCTTTTGGGAGGAGATTTTAATGAAATACTATGGGATTCTGAGAAA
GAAAGAGGCCCATGCAGAGGCCAACGTCTTATGGATGAAGTTAAGGCCTATAATAAGGAATGGGCAAAATCTGATCATCATCCTATTTTGCTATACATCGGGACTCAACT
CCACCAACTAGGTCGAAGGAAAGGAAAAAGATTCTGGTTTGAAGGATTTTGGTTTCGTCGGGAGGAGTGTAAAAAGATCATTCAGGAATCTACAGATCAGGAATCTGCAG
ATTGGAAATCTCATCATGAGTCAGAGGAAATTGTTAAGGTAGTAAAAGATTTCAAGCCTTTAAAGGCTCGGGGACCGGATGGGTTTCCAGCCGTGTTTTATCAGAGTAAC
TGGGAGACTGTAGGTGATCAAACAATAGCAGATTGCTTAGAGATTCTGAATGGGGATCATTCGGTTCGAGCATGGAACCATACTCATATTGCTCTAATACCTAAAGTTTG
GGAGGCCACGTTAGTTTCCGATTACCGACCTATTAGTTTATGCAATGTGTCTTACAAAATTATAGCTAAAGCCATTGTGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGACTGTGAAGAGGCGGGACTGTTTCTGGATGTTTTATGTTGGAATGTTCGTGGGTTGGGGAGCCCATGGACATTTCGAAATCGGGGGATTGTGTCTCTTATGGAA
GGAAGAGGTTGATGTCACAATCGTTAATTTTTCGGTTCATCATATTCATACGAATGTTGTGTGGGCCGACAAACAATGGAGATTTACAGGTGTTTATGGTCAACCAGATC
ATTCCCTTAGACATCGAACGTGGGACCTTCTACGTAGGGTGCACAATCATGATGATTCTGCATGGCTTTTGGGAGGAGATTTTAATGAAATACTATGGGATTCTGAGAAA
GAAAGAGGCCCATGCAGAGGCCAACGTCTTATGGATGAAGTTAAGGCCTATAATAAGGAATGGGCAAAATCTGATCATCATCCTATTTTGCTATACATCGGGACTCAACT
CCACCAACTAGGTCGAAGGAAAGGAAAAAGATTCTGGTTTGAAGGATTTTGGTTTCGTCGGGAGGAGTGTAAAAAGATCATTCAGGAATCTACAGATCAGGAATCTGCAG
ATTGGAAATCTCATCATGAGTCAGAGGAAATTGTTAAGGTAGTAAAAGATTTCAAGCCTTTAAAGGCTCGGGGACCGGATGGGTTTCCAGCCGTGTTTTATCAGAGTAAC
TGGGAGACTGTAGGTGATCAAACAATAGCAGATTGCTTAGAGATTCTGAATGGGGATCATTCGGTTCGAGCATGGAACCATACTCATATTGCTCTAATACCTAAAGTTTG
GGAGGCCACGTTAGTTTCCGATTACCGACCTATTAGTTTATGCAATGTGTCTTACAAAATTATAGCTAAAGCCATTGTGAACTGA
Protein sequenceShow/hide protein sequence
MMTVKRRDCFWMFYVGMFVGWGAHGHFEIGGLCLLWKEEVDVTIVNFSVHHIHTNVVWADKQWRFTGVYGQPDHSLRHRTWDLLRRVHNHDDSAWLLGGDFNEILWDSEK
ERGPCRGQRLMDEVKAYNKEWAKSDHHPILLYIGTQLHQLGRRKGKRFWFEGFWFRREECKKIIQESTDQESADWKSHHESEEIVKVVKDFKPLKARGPDGFPAVFYQSN
WETVGDQTIADCLEILNGDHSVRAWNHTHIALIPKVWEATLVSDYRPISLCNVSYKIIAKAIVN