; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015310 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015310
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr12:10132210..10137450
RNA-Seq ExpressionLag0015310
SyntenyLag0015310
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEW19298.1 peroxidase 72 [Tanacetum cinerariifolium]1.7e-5055.5Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGSVMYL
        MDVKT FLNG+L E+IYM QPEGFV+ GQE KVC+L KS YGLK APKQ +EKF++TL++N F +N  D CVY K +   +VIICL+VD+MLI G+ M +
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGSVMYL

Query:  MNYTR---------PDIAYVVSRLSSSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGASIPV-SLHCDSQAAYT
        +N T+          DI      L +ST+GYVF  GG A S KS KQT   RSTME+EF++LE A +EAEWLRS L+ +PLW   + V  +HCDS AA T
Subjt:  MNYTR---------PDIAYVVSRLSSSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGASIPV-SLHCDSQAAYT

KAD6453934.1 hypothetical protein E3N88_08640 [Mikania micrantha]2.4e-5241.44Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------
        MDVKT FLNGDL EEIYM+QPEGFV+ G E KVC+L+KSLYGLKQAPK+ YEKF+ TL  + ++VN+SD+CVYSK     YV+ICL+VD+MLIF      
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------

Query:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA
                                                                                             GSVM+LMNYTRPDIA
Subjt:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA

Query:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA
        Y VSRLS                                                     SST+GYVF+ GGGA S KS KQTCI RSTMESEFI LELA
Subjt:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA

Query:  EQEAEWLRSLLQDVPLWGAS-IPVSLHCDSQAA
         QEAEWLR LL D+P WG + +P+ LHCDS+AA
Subjt:  EQEAEWLRSLLQDVPLWGAS-IPVSLHCDSQAA

KAG7551885.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]2.3e-4740.54Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLI-------
        MDVKT FLNGDL EEIYM+QPEGF+I GQE+KVC+L KSLYGLKQAPKQ +EKF++TL+ N F+ N  DTCV+SK+    YVIICL+VD+MLI       
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLI-------

Query:  ------------------------------------------------------------------------------------FGSVMYLMNYTRPDIA
                                                                                             GSVMYLMN TRPDIA
Subjt:  ------------------------------------------------------------------------------------FGSVMYLMNYTRPDIA

Query:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA
        YVVSRLS                                                     +ST+G+VF   GGA + KS KQTCI RSTMESE I LELA
Subjt:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA

Query:  EQEAEWLRSLLQDVPLWGASIP-VSLHCDSQAA
         QEAEWLR+LL D P+ G   P VS+ CDSQAA
Subjt:  EQEAEWLRSLLQDVPLWGASIP-VSLHCDSQAA

RVW67328.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.3e-5052.11Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGS----
        MDVKTTFLNG+L EEIYM QPEGF+ PGQE KVC+L KSLYGLKQAPKQ +EKF++ +++N F +N  D C+Y K     YVI+CL+VD+MLI GS    
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGS----

Query:  -------VMYLMNYTRPDIAYVV-------------SRLSSSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGAS
               + Y  NY      Y               S+ S ST+GYVF  GG   S KS KQTCI RSTMESEFI ++ A +EA+WLR+ L+D+P W   
Subjt:  -------VMYLMNYTRPDIAYVV-------------SRLSSSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGAS

Query:  IP-VSLHCDSQAA
        +P + +HCDSQ A
Subjt:  IP-VSLHCDSQAA

TYK06518.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]2.9e-5843.67Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------
        MDVKT FLNG+L+EEIYM QPEGF I GQE+KVC+L+KSLYGLKQAPKQ YEKFN+TL+ N F +NSSDTCVYSK+ G D ++ICL+VD+MLIF      
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------

Query:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA
                                                                                             GSVMYLMNYTRPDIA
Subjt:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA

Query:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA
        Y VSRLS                                                     +ST+GYVF+ GGGA S KS KQTCI RSTMESEFI LELA
Subjt:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA

Query:  EQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA
         QEAEW+++LL DVPLWG S+PVS+ CDSQAA
Subjt:  EQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA

TrEMBL top hitse value%identityAlignment
A0A2N9EQT1 Integrase catalytic domain-containing protein1.4e-5041.14Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------
        MDVKT FLNGDL EEIYM QPEGFV+ GQE+KVC+L+KSLYGLKQAPKQ +EKF+ TLV+N F VN SD CVYSK  G   VIICL+VD+MLIF      
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------

Query:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA
                                                                                             GSVM+LMN TRPDIA
Subjt:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA

Query:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA
        Y VSRLS                                                     +ST+GYVF  GGGA S KS KQTC  RSTMESEF+ LE A
Subjt:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA

Query:  EQEAEWLRSLLQDVPLWGASIP-VSLHCDSQAA
          EAEWLR+LL D+PLW   +P +++HCDSQAA
Subjt:  EQEAEWLRSLLQDVPLWGASIP-VSLHCDSQAA

A0A2N9H4B0 Uncharacterized protein1.4e-5041.14Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------
        MDVKT FLNGDL EEIYM QPEGFV+ GQE+KVC+L+KSLYGLKQAPKQ +EKF+ TLV+N F VN SD CVYSK  G   VIICL+VD+MLIF      
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------

Query:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA
                                                                                             GSVM+LMN TRPDIA
Subjt:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA

Query:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA
        Y VSRLS                                                     +ST+GYVF  GGGA S KS KQTC  RSTMESEF+ LE A
Subjt:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA

Query:  EQEAEWLRSLLQDVPLWGASIP-VSLHCDSQAA
          EAEWLR+LL D+PLW   +P +++HCDSQAA
Subjt:  EQEAEWLRSLLQDVPLWGASIP-VSLHCDSQAA

A0A438G546 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-5152.11Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGS----
        MDVKTTFLNG+L EEIYM QPEGF+ PGQE KVC+L KSLYGLKQAPKQ +EKF++ +++N F +N  D C+Y K     YVI+CL+VD+MLI GS    
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGS----

Query:  -------VMYLMNYTRPDIAYVV-------------SRLSSSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGAS
               + Y  NY      Y               S+ S ST+GYVF  GG   S KS KQTCI RSTMESEFI ++ A +EA+WLR+ L+D+P W   
Subjt:  -------VMYLMNYTRPDIAYVV-------------SRLSSSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGAS

Query:  IP-VSLHCDSQAA
        +P + +HCDSQ A
Subjt:  IP-VSLHCDSQAA

A0A5D3C5T2 Ty1-copia retrotransposon protein1.4e-5843.67Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------
        MDVKT FLNG+L+EEIYM QPEGF I GQE+KVC+L+KSLYGLKQAPKQ YEKFN+TL+ N F +NSSDTCVYSK+ G D ++ICL+VD+MLIF      
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------

Query:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA
                                                                                             GSVMYLMNYTRPDIA
Subjt:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA

Query:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA
        Y VSRLS                                                     +ST+GYVF+ GGGA S KS KQTCI RSTMESEFI LELA
Subjt:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA

Query:  EQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA
         QEAEW+++LL DVPLWG S+PVS+ CDSQAA
Subjt:  EQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA

A0A5N6PGV2 Reverse transcriptase Ty1/copia-type domain-containing protein1.2e-5241.44Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------
        MDVKT FLNGDL EEIYM+QPEGFV+ G E KVC+L+KSLYGLKQAPK+ YEKF+ TL  + ++VN+SD+CVYSK     YV+ICL+VD+MLIF      
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIF------

Query:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA
                                                                                             GSVM+LMNYTRPDIA
Subjt:  -------------------------------------------------------------------------------------GSVMYLMNYTRPDIA

Query:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA
        Y VSRLS                                                     SST+GYVF+ GGGA S KS KQTCI RSTMESEFI LELA
Subjt:  YVVSRLS-----------------------------------------------------SSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELA

Query:  EQEAEWLRSLLQDVPLWGAS-IPVSLHCDSQAA
         QEAEWLR LL D+P WG + +P+ LHCDS+AA
Subjt:  EQEAEWLRSLLQDVPLWGAS-IPVSLHCDSQAA

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.1e-1345.19Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFG--VDYVIICLFVDEMLIFGSVM
        MDVKT FLNG LKEEIYM  P+G  I    D VC+L K++YGLKQA +  +E F   L   EF+ +S D C+Y    G   + + + L+VD+++I    M
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFG--VDYVIICLFVDEMLIFGSVM

Query:  YLMN
          MN
Subjt:  YLMN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-2250Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGV-DYVIICLFVDEMLIFGSVMY
        +DVKT FL+GDL+EEIYM QPEGF + G++  VC+L KSLYGLKQAP+Q Y KF+S + +  ++   SD CVY K F   +++I+ L+VD+MLI G    
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGV-DYVIICLFVDEMLIFGSVMY

Query:  LMNYTRPDIA
        L+   + D++
Subjt:  LMNYTRPDIA

P25600 Putative transposon Ty5-1 protein YCL074W1.1e-0737.63Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLI
        MDV T FLN  + E IY+ QP GFV     D V +L   +YGLKQAP    E  N+TL    F  +  +  +Y +      + I ++VD++L+
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-1235.78Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGSVMYL
        +DV   FL G L +++YM QP GF+   + + VC+L+K+LYGLKQAP+  Y +  + L+   F+ + SDT ++    G   V + ++VD++LI G+   L
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGSVMYL

Query:  MNYTRPDIA
        ++ T  +++
Subjt:  MNYTRPDIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-0335.21Show/hide
Query:  STNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA
        STNGY+   G    S  S+KQ  +VRS+ E+E+ ++     E +W+ SLL ++ +     PV ++CD+  A
Subjt:  STNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.3e-1338.46Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGSVMYL
        +DV   FL G L +E+YM QP GFV   + D VC+L+K++YGLKQAP+  Y +  + L+   F+ + SDT ++    G   + + ++VD++LI G+   L
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGSVMYL

Query:  MNYT
        + +T
Subjt:  MNYT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.6e-0435.21Show/hide
Query:  STNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA
        STNGY+   G    S  S+KQ  +VRS+ E+E+ ++     E +W+ SLL ++ +  +  PV ++CD+  A
Subjt:  STNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGASIPVSLHCDSQAA

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.9e-1339.18Show/hide
Query:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQE----DKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLI
        +D+   FLNGDL EEIYM  P G+     +    + VC LKKS+YGLKQA +Q + KF+ TL+   F+ + SD   + K+    ++ + ++VD+++I
Subjt:  MDVKTTFLNGDLKEEIYMIQPEGFVIPGQE----DKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTGAAAACAACATTTCTCAATGGAGATTTAAAAGAGGAGATATATATGATACAACCCGAAGGCTTTGTAATTCCTGGTCAAGAAGACAAGGTATGTCAACTTAA
AAAATCTCTTTATGGTCTTAAACAAGCTCCCAAACAATCGTATGAGAAATTTAATAGTACCTTAGTAAACAATGAGTTTATGGTGAATTCTTCCGACACATGTGTTTACT
CGAAATTGTTTGGTGTTGATTATGTGATTATTTGCTTGTTTGTTGATGAAATGCTTATTTTCGGTAGCGTGATGTACCTGATGAATTATACTAGACCTGATATTGCATAT
GTTGTGAGTAGATTGAGTAGCTCAACTAACGGGTATGTGTTCGTTTTTGGAGGAGGAGCTACATCATTGAAATCTGAAAAGCAGACGTGCATTGTGCGATCCACCATGGA
GTCAGAGTTTATAACTCTTGAGCTAGCAGAGCAAGAGGCGGAGTGGTTGAGAAGTTTACTTCAAGACGTACCACTGTGGGGGGCGTCTATTCCAGTCTCCTTGCACTGTG
ATTCACAAGCAGCATATACGCCTGAGACATGGAGTTGTGAAACAATTGTTGAAAAGTGGAATTATCTCCTTGGAGTATACTTGATGAAGCATGACGTGATTGTGAAGGTG
TGGCCGCCTTCTATGAAAGAGTGTAATGGATCTCTTTCTAGAGATTTCACAATGACCCAAGCGTCTACGTCCACGTGTCAATCCGGCGTCCACACACTCCAGCAGTTGAC
CAGTCCGTTCGTTGACCGAGTTTGGTGCCTTTTTTGCGTGCCTTTGACCGAGGAGTTGTTATGCGTGGACTCCTCCATGGCTGACACGTGGCAGCTGGACCCTGTTTTGT
GGAAGACGTCTGGAGTTGTGAAGTCTGGTGTTCGGGACGTGAAAAAGGCCAAAGAAACAGAGTTGGATCAAGTTGGAAGCTTATTGAATAAGGGCAAATCGAGGACCAGC
GTCGAGACCCTAGACCTTGGGCGTCTCGACGCTGCACACTTTACTTTTCTAATTTGGGCGGCAGCAGACACAGCATCGCGACGCTCTGTCAATAGCGTTGTGACGCTCTC
ATATTTTTCCTTTACAGAATGCTGGATTCACTACAGCGTCGCGACGCTGTGCCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGTGAAAACAACATTTCTCAATGGAGATTTAAAAGAGGAGATATATATGATACAACCCGAAGGCTTTGTAATTCCTGGTCAAGAAGACAAGGTATGTCAACTTAA
AAAATCTCTTTATGGTCTTAAACAAGCTCCCAAACAATCGTATGAGAAATTTAATAGTACCTTAGTAAACAATGAGTTTATGGTGAATTCTTCCGACACATGTGTTTACT
CGAAATTGTTTGGTGTTGATTATGTGATTATTTGCTTGTTTGTTGATGAAATGCTTATTTTCGGTAGCGTGATGTACCTGATGAATTATACTAGACCTGATATTGCATAT
GTTGTGAGTAGATTGAGTAGCTCAACTAACGGGTATGTGTTCGTTTTTGGAGGAGGAGCTACATCATTGAAATCTGAAAAGCAGACGTGCATTGTGCGATCCACCATGGA
GTCAGAGTTTATAACTCTTGAGCTAGCAGAGCAAGAGGCGGAGTGGTTGAGAAGTTTACTTCAAGACGTACCACTGTGGGGGGCGTCTATTCCAGTCTCCTTGCACTGTG
ATTCACAAGCAGCATATACGCCTGAGACATGGAGTTGTGAAACAATTGTTGAAAAGTGGAATTATCTCCTTGGAGTATACTTGATGAAGCATGACGTGATTGTGAAGGTG
TGGCCGCCTTCTATGAAAGAGTGTAATGGATCTCTTTCTAGAGATTTCACAATGACCCAAGCGTCTACGTCCACGTGTCAATCCGGCGTCCACACACTCCAGCAGTTGAC
CAGTCCGTTCGTTGACCGAGTTTGGTGCCTTTTTTGCGTGCCTTTGACCGAGGAGTTGTTATGCGTGGACTCCTCCATGGCTGACACGTGGCAGCTGGACCCTGTTTTGT
GGAAGACGTCTGGAGTTGTGAAGTCTGGTGTTCGGGACGTGAAAAAGGCCAAAGAAACAGAGTTGGATCAAGTTGGAAGCTTATTGAATAAGGGCAAATCGAGGACCAGC
GTCGAGACCCTAGACCTTGGGCGTCTCGACGCTGCACACTTTACTTTTCTAATTTGGGCGGCAGCAGACACAGCATCGCGACGCTCTGTCAATAGCGTTGTGACGCTCTC
ATATTTTTCCTTTACAGAATGCTGGATTCACTACAGCGTCGCGACGCTGTGCCTATAG
Protein sequenceShow/hide protein sequence
MDVKTTFLNGDLKEEIYMIQPEGFVIPGQEDKVCQLKKSLYGLKQAPKQSYEKFNSTLVNNEFMVNSSDTCVYSKLFGVDYVIICLFVDEMLIFGSVMYLMNYTRPDIAY
VVSRLSSSTNGYVFVFGGGATSLKSEKQTCIVRSTMESEFITLELAEQEAEWLRSLLQDVPLWGASIPVSLHCDSQAAYTPETWSCETIVEKWNYLLGVYLMKHDVIVKV
WPPSMKECNGSLSRDFTMTQASTSTCQSGVHTLQQLTSPFVDRVWCLFCVPLTEELLCVDSSMADTWQLDPVLWKTSGVVKSGVRDVKKAKETELDQVGSLLNKGKSRTS
VETLDLGRLDAAHFTFLIWAAADTASRRSVNSVVTLSYFSFTECWIHYSVATLCL