CuGenDBv2

Cmc09g0249681 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0249681
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Transposon Ty1-H Gag-Pol polyprotein
Genome location: CMiso1.1chr09:15615294..15616875
RNA-Seq Expression: Cmc09g0249681
Synteny: Cmc09g0249681
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
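
The genome location field can be split into a sequence name and coordinates programmatically. The following is a minimal Python sketch, assuming the `name:start..end` layout used in this record and 1-based inclusive coordinates (an assumption; the page does not state the convention). `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

# Pattern for "<sequence name>:<start>..<end>", e.g. "CMiso1.1chr09:15615294..15616875".
LOCATION_RE = re.compile(r"^(?P<seq>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Return (sequence name, start, end, span length) for a location string.

    The span length assumes 1-based inclusive coordinates (an assumption).
    """
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return match["seq"], start, end, end - start + 1


seq_name, start, end, span = parse_location("CMiso1.1chr09:15615294..15616875")
print(seq_name, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 1582 bp on CMiso1.1chr09.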


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e-value: 3.9e-173; identity: 68.41%)
Query:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN
        AIV DSKKQWKDA         + Q+ S   +  + +  QSKWIYKIKPGTGG+SKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILS A+HF+
Subjt:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN

Query:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV
        MFIEQMDVTT    GELEEVIY AQPKGY+VKGKEDMV  LHKS+YGLKQSPRQWYIRFDTFI+KQGFH NSYDACVYWK SQKGTYIYLLLYV+DMILV
Subjt:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV

Query:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------
        SKDYAEICELKKQLSNEFEMKDLGELK ILGMDVK D+EKGLLTI  ESYVIKLLEKYN+S   AVSTPL                              
Subjt:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------

Query:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ
                                              +L +L+               + LL   T     ADLDKRRSLSGHIFRLY NVVSWKV +Q
Subjt:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ

Query:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        PV ALSTTESEYIS GEAVKEAVWLKRIVGELL Q+FIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDV+LVKVHTVENLSDML K
Subjt:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

KAA0066139.1 hypothetical protein E6C27_scaffold21G001400 [Cucumis melo var. makuwa] (e-value: 3.5e-198; identity: 86.47%)
Query:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
        AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQ EGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
Subjt:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT

Query:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK
         IGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTY+YLLLYV+DMILVSKDYAEICELKK
Subjt:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK

Query:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV
        QLSNEFEMKDLGELKMILGMDVKGDREK                         VSTPL    +  +L+    ++  + E  DLDKRRSLSGHIFRLYDNV
Subjt:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV

Query:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS
        V+WKVT+Q VAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHE+SKHIDVKFHYI+NVIAQKDVQLVKVHTVENLS
Subjt:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS

Query:  DMLIKFRESIWCLK
        DMLIKFRES+WCLK
Subjt:  DMLIKFRESIWCLK

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa] (e-value: 6.0e-174; identity: 68.61%)
Query:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN
        AIV DSKKQWKDA         + Q+ S   +  + +  QSKWIYKIKPGTGG+SKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILS A+HF+
Subjt:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN

Query:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV
        MFIEQMDVTT    GELEEVIY AQPKGY+VKGKEDMV  LHKS+YGLKQSPRQWYIRFDTFI+KQGFH NSYDACVYWK SQKGTYIYLLLYV+DMILV
Subjt:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV

Query:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------
        SKDYAEICELKKQLSNEFEMKDLGELK ILGMDVK D+EKGLLTI  ESYVIKLLEKYN+SG  AVSTPL                              
Subjt:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------

Query:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ
                                              +L +L+               + LL   T     ADLDKRRSLSGHIFRLY NVVSWKV +Q
Subjt:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ

Query:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        PV ALSTTESEYIS GEAVKEAVWLKRIVGELL Q+FIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDV+LVKVHTVENLSDML K
Subjt:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

TYK15184.1 hypothetical protein E5676_scaffold790G00530 [Cucumis melo var. makuwa] (e-value: 1.6e-198; identity: 86.71%)
Query:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
        AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQ EGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
Subjt:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT

Query:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK
         IGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYV+DMILVSKDYAEICELKK
Subjt:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK

Query:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV
        QLSNEFEMKDLGELKMILGMDVKGDREK                         VSTPL    +  +L+    ++  + E  DLDKRRSLSGHIFRLYDNV
Subjt:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV

Query:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS
        V+WKVT+Q VAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHE+SKHIDVKFHYI+NVIAQKDVQLVKVHTVENLS
Subjt:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS

Query:  DMLIKFRESIWCLK
        DMLIKFRES+WCLK
Subjt:  DMLIKFRESIWCLK

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e-value: 6.6e-157; identity: 68.87%)
Query:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN
        AIV DSKKQWKDA         + Q+ S   +  + +  QSKWIYKIKPGTGG+SKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILS A+HF+
Subjt:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN

Query:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV
        MFIEQMDVTT    GELEEVIY AQPKGY+VKGKEDMV  LHKS+YGLKQSPRQWYIRFDTFI+KQGFH NSYDACVYWK SQKGTYIYLLLYV+DMILV
Subjt:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV

Query:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL--HLILNFLRLNV-----------------
        SKDYAEICELKKQLSNEFEMKDLGELK ILGMDVK D+EKGLLTI  ESYVIKLLEKYN+S   AVSTPL  H  L+  +  V                 
Subjt:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL--HLILNFLRLNV-----------------

Query:  ---------------LLLNKTLIKEC--ADLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCD
                         ++   +  C   D DK   L G      D   +  +  +    L    +EYIS GEAVKEAVWLKRIVGELL Q+FIPIIHCD
Subjt:  ---------------LLLNKTLIKEC--ADLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCD

Query:  SQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        SQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDV+LVKVHTVENLSDML K
Subjt:  SQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7UB25 Putative gag-pol polyprotein (e-value: 1.9e-173; identity: 68.41%)
Query:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN
        AIV DSKKQWKDA         + Q+ S   +  + +  QSKWIYKIKPGTGG+SKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILS A+HF+
Subjt:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN

Query:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV
        MFIEQMDVTT    GELEEVIY AQPKGY+VKGKEDMV  LHKS+YGLKQSPRQWYIRFDTFI+KQGFH NSYDACVYWK SQKGTYIYLLLYV+DMILV
Subjt:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV

Query:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------
        SKDYAEICELKKQLSNEFEMKDLGELK ILGMDVK D+EKGLLTI  ESYVIKLLEKYN+S   AVSTPL                              
Subjt:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------

Query:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ
                                              +L +L+               + LL   T     ADLDKRRSLSGHIFRLY NVVSWKV +Q
Subjt:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ

Query:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        PV ALSTTESEYIS GEAVKEAVWLKRIVGELL Q+FIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDV+LVKVHTVENLSDML K
Subjt:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

A0A5A7VLE3 Uncharacterized protein (e-value: 1.7e-198; identity: 86.47%)
Query:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
        AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQ EGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
Subjt:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT

Query:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK
         IGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTY+YLLLYV+DMILVSKDYAEICELKK
Subjt:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK

Query:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV
        QLSNEFEMKDLGELKMILGMDVKGDREK                         VSTPL    +  +L+    ++  + E  DLDKRRSLSGHIFRLYDNV
Subjt:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV

Query:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS
        V+WKVT+Q VAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHE+SKHIDVKFHYI+NVIAQKDVQLVKVHTVENLS
Subjt:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS

Query:  DMLIKFRESIWCLK
        DMLIKFRES+WCLK
Subjt:  DMLIKFRESIWCLK

A0A5D3CTV2 Putative polyprotein (e-value: 2.9e-174; identity: 68.61%)
Query:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN
        AIV DSKKQWKDA         + Q+ S   +  + +  QSKWIYKIKPGTGG+SKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILS A+HF+
Subjt:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN

Query:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV
        MFIEQMDVTT    GELEEVIY AQPKGY+VKGKEDMV  LHKS+YGLKQSPRQWYIRFDTFI+KQGFH NSYDACVYWK SQKGTYIYLLLYV+DMILV
Subjt:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV

Query:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------
        SKDYAEICELKKQLSNEFEMKDLGELK ILGMDVK D+EKGLLTI  ESYVIKLLEKYN+SG  AVSTPL                              
Subjt:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------

Query:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ
                                              +L +L+               + LL   T     ADLDKRRSLSGHIFRLY NVVSWKV +Q
Subjt:  ------------------------------------HLILNFLR--------------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQ

Query:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        PV ALSTTESEYIS GEAVKEAVWLKRIVGELL Q+FIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDV+LVKVHTVENLSDML K
Subjt:  PVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

A0A5D3CW76 Uncharacterized protein (e-value: 7.6e-199; identity: 86.71%)
Query:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
        AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQ EGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT
Subjt:  AIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT

Query:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK
         IGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYV+DMILVSKDYAEICELKK
Subjt:  TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKK

Query:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV
        QLSNEFEMKDLGELKMILGMDVKGDREK                         VSTPL    +  +L+    ++  + E  DLDKRRSLSGHIFRLYDNV
Subjt:  QLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNV

Query:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS
        V+WKVT+Q VAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHE+SKHIDVKFHYI+NVIAQKDVQLVKVHTVENLS
Subjt:  VSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLS

Query:  DMLIKFRESIWCLK
        DMLIKFRES+WCLK
Subjt:  DMLIKFRESIWCLK

A0A5D3DNU1 Putative gag-pol polyprotein (e-value: 3.2e-157; identity: 68.87%)
Query:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN
        AIV DSKKQWKDA         + Q+ S   +  + +  QSKWIYKIKPGTGG+SKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILS A+HF+
Subjt:  AIVFDSKKQWKDA---------RAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFN

Query:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV
        MFIEQMDVTT    GELEEVIY AQPKGY+VKGKEDMV  LHKS+YGLKQSPRQWYIRFDTFI+KQGFH NSYDACVYWK SQKGTYIYLLLYV+DMILV
Subjt:  MFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILV

Query:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL--HLILNFLRLNV-----------------
        SKDYAEICELKKQLSNEFEMKDLGELK ILGMDVK D+EKGLLTI  ESYVIKLLEKYN+S   AVSTPL  H  L+  +  V                 
Subjt:  SKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL--HLILNFLRLNV-----------------

Query:  ---------------LLLNKTLIKEC--ADLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCD
                         ++   +  C   D DK   L G      D   +  +  +    L    +EYIS GEAVKEAVWLKRIVGELL Q+FIPIIHCD
Subjt:  ---------------LLLNKTLIKEC--ADLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCD

Query:  SQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        SQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDV+LVKVHTVENLSDML K
Subjt:  SQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 2.4e-56; identity: 31.57%)
Query:  DSKKQWKDARAQSCSFY---------KRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIE
        D K  W++A     + +         KR  +     S+W++ +K    G +  RYKARLVA+G+TQK  +D+ E F+PV R SS R ILS  I +N+ + 
Subjt:  DSKKQWKDARAQSCSFY---------KRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIE

Query:  QMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTY---IYLLLYVEDMILVS
        QMDV T    G L+E IY   P+G  +    D V  L+K+IYGLKQ+ R W+  F+  + +  F  +S D C+Y  +  KG     IY+LLYV+D+++ +
Subjt:  QMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTY---IYLLLYVEDMILVS

Query:  KDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLN---------------------
         D   +   K+ L  +F M DL E+K  +G+ ++   +K  + +   +YV K+L K+N+  CNAVSTPL   +N+  LN                     
Subjt:  KDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLN---------------------

Query:  -----------------------------------------VLLLNKTLIKE-----CADLD------KRRSLSGHIFRLYD-NVVSWKVTIQPVAALST
                                                  L+  K L  E       D D       R+S +G++F+++D N++ W    Q   A S+
Subjt:  -----------------------------------------VLLLNKTLIKE-----CADLD------KRRSLSGHIFRLYD-NVVSWKVTIQPVAALST

Query:  TESEYISFGEAVKEAVWLKRIVGELLLQKFIPI-IHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        TE+EY++  EAV+EA+WLK ++  + ++   PI I+ D+Q  I +A NPS H+R+KHID+K+H+ R  +    + L  + T   L+D+  K
Subjt:  TESEYISFGEAVKEAVWLKRIVGELLLQKFIPI-IHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein (e-value: 3.1e-16; identity: 28.5%)
Query:  YKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT---TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIR
        YKAR+V +G TQ     +  I +  + H+ I++ L  A + NMF++ +D+       +LEE IY   P   +       V  L+K++YGLKQSP++W   
Subjt:  YKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVT---TIGELEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIR

Query:  FDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGEL------KMILGMDVKGDREKGLLTILHESYVI
           ++   G  +NSY   +Y     +   + + +YV+D ++ + +   + E   +L + FE+K  G L        ILGMD+  ++  G + +  +S++ 
Subjt:  FDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGEL------KMILGMDVKGDREKGLLTILHESYVI

Query:  KLLEKYN
        ++ +KYN
Subjt:  KLLEKYN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 8.0e-81; identity: 39%)
Query:  RWFQSKWIYKIKPGTGGDSK-PRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKE
        R  + KW++K+K    GD K  RYKARLV KG+ QK+G+DF EIFSPVV+ +SIR ILS A   ++ +EQ+DV T    G+LEE IY  QP+G++V GK+
Subjt:  RWFQSKWIYKIKPGTGGDSK-PRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKE

Query:  DMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVK
         MV  L+KS+YGLKQ+PRQWY++FD+F+  Q + +   D CVY+K   +  +I LLLYV+DM++V KD   I +LK  LS  F+MKDLG  + ILGM + 
Subjt:  DMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVK

Query:  GDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL--HL-------------------------------------------------------------
         +R    L +  E Y+ ++LE++N+     VSTPL  HL                                                             
Subjt:  GDREKGLLTILHESYVIKLLEKYNISGCNAVSTPL--HL-------------------------------------------------------------

Query:  ---ILNFLR-----------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFI
           IL +LR            + +L   T      D+D R+S +G++F      +SW+  +Q   ALSTTE+EYI+  E  KE +WLKR + EL L +  
Subjt:  ---ILNFLR-----------LNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFI

Query:  PIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
         +++CDSQSAI L+KN  +H R+KHIDV++H+IR ++  + ++++K+ T EN +DML K
Subjt:  PIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 7.9e-44; identity: 29.69%)
Query:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCL
        +WI+  K  + G S  RYKARLVAKGY Q+ G+D+ E FSPV++ +SIR++L  A+  +  I Q+DV      G L + +Y +QP G+  K + + V  L
Subjt:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCL

Query:  HKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKG
         K++YGLKQ+PR WY+    +++  GF  +  D  ++  L +  + +Y+L+YV+D+++   D   +      LS  F +KD  EL   LG++ K  R   
Subjt:  HKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKG

Query:  LLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------------------------------HL-----ILNFL-----
         L +    Y++ LL + N+     V+TP+                                                      HL     IL +L     
Subjt:  LLTILHESYVIKLLEKYNISGCNAVSTPL------------------------------------------------------HL-----ILNFL-----

Query:  ------RLNVLLLNKTLIKECA-DLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQ-KFIPIIHCDSQSAI
              + N L L+     + A D D   S +G+I  L  + +SW    Q     S+TE+EY S      E  W+  ++ EL ++    P+I+CD+  A 
Subjt:  ------RLNVLLLNKTLIKECA-DLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQ-KFIPIIHCDSQSAI

Query:  HLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        +L  NP  H R KHI + +H+IRN +    +++V V T + L+D L K
Subjt:  HLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 9.3e-45; identity: 29.91%)
Query:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCL
        +WI+  K  + G S  RYKARLVAKGY Q+ G+D+ E FSPV++ +SIR++L  A+  +  I Q+DV      G L + +Y +QP G+  K + D V  L
Subjt:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDMVFCL

Query:  HKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKG
         K+IYGLKQ+PR WY+   T+++  GF  +  D  ++  L +  + IY+L+YV+D+++   D   +      LS  F +K+  +L   LG++ K  R   
Subjt:  HKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKG

Query:  LLTILHESYVIKLLEKYNISGCNAVSTP-----------------------------------------------------------LHLILNFL-----
         L +    Y + LL + N+     V+TP                                                           L  +L +L     
Subjt:  LLTILHESYVIKLLEKYNISGCNAVSTP-----------------------------------------------------------LHLILNFL-----

Query:  ------RLNVLLLNKTLIKECA-DLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQ-KFIPIIHCDSQSAI
              + N L L+     + A D D   S +G+I  L  + +SW    Q     S+TE+EY S      E  W+  ++ EL +Q    P+I+CD+  A 
Subjt:  ------RLNVLLLNKTLIKECA-DLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQ-KFIPIIHCDSQSAI

Query:  HLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK
        +L  NP  H R KHI + +H+IRN +    +++V V T + L+D L K
Subjt:  HLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIK

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 1.2e-50; identity: 31.85%)
Query:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDM----
        KW+YKIK  + G  + RYKARLVAKGYTQ+EG+DF E FSPV + +S++LIL+ +  +N  + Q+D++     G+L+E IY   P GY  +  + +    
Subjt:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTI---GELEEVIYTAQPKGYKVKGKEDM----

Query:  VFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGD
        V  L KSIYGLKQ+ RQW+++F   ++  GF ++  D   + K++    ++ +L+YV+D+I+ S + A + ELK QL + F+++DLG LK  LG+++   
Subjt:  VFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGD

Query:  REKGLLTILHESYVIKLLEKYNISGCNAVSTPLH--------------------------LILNFLRLNV-LLLNK-TLIKECADL--------------
        R    + I    Y + LL++  + GC   S P+                           + L   RL++   +NK +   E   L              
Subjt:  REKGLLTILHESYVIKLLEKYNISGCNAVSTPLH--------------------------LILNFLRLNV-LLLNK-TLIKECADL--------------

Query:  -----------------------------DKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIP-IIHCDS
                                     D RRS +G+   L  +++SWK   Q V + S+ E+EY +   A  E +WL +   EL L    P ++ CD+
Subjt:  -----------------------------DKRRSLSGHIFRLYDNVVSWKVTIQPVAALSTTESEYISFGEAVKEAVWLKRIVGELLLQKFIP-IIHCDS

Query:  QSAIHLAKNPSHHERSKHIDVKFHYIR
         +AIH+A N   HER+KHI+   H +R
Subjt:  QSAIHLAKNPSHHERSKHIDVKFHYIR

ATMG00810.1 DNA/RNA polymerases superfamily protein (e-value: 3.9e-06; identity: 37.5%)
Query:  IYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILN
        +YLLLYV+D++L       +  L  QLS+ F MKDLG +   LG+ +K     GL  +    Y  ++L    +  C  +STPL L LN
Subjt:  IYLLLYVEDMILVSKDYAEICELKKQLSNEFEMKDLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILN

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e-value: 1.2e-07; identity: 52.73%)
Query:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTA
        KW++K K  + G +  R KARLVAKG+ Q+EG+ F E +SPVVR ++IR IL+ A
Subjt:  KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTA
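
The % identity values listed with each hit can be cross-checked against the middle line of an alignment block. The sketch below is a hedged Python illustration, assuming the usual BLAST-style convention that a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. `percent_identity` is a hypothetical helper; the alignment length is taken from the query line so that trailing spaces lost from the middle line do not skew the result.

```python
def percent_identity(midline, aligned_length=None):
    """Percent identity from a BLAST-style middle line.

    Letters count as identical positions; '+' and spaces do not.
    aligned_length defaults to the length of the middle line itself.
    """
    length = len(midline) if aligned_length is None else aligned_length
    if length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / length, 2)


# Query and middle line of the ATMG00820.1 alignment from this record.
query = "KWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTA"
midline = "KW++K K  + G +  R KARLVAKG+ Q+EG+ F E +SPVVR ++IR IL+ A"

print(percent_identity(midline, aligned_length=len(query)))
```

For this block the middle line carries 29 letters over 55 aligned columns, which is consistent with the 52.73 listed for ATMG00820.1.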


Sequences
CDS sequence
ATGGCAATTGTATTTGATTCAAAGAAACAATGGAAGGATGCTAGGGCACAGAGTTGTTCTTTTTACAAAAGAATCAGACATGGTCGTTGGTTCCAATCAAAGTGG
ATTTATAAAATCAAGCCAGGTACAGGAGGTGACAGTAAGCCTAGGTATAAGGCTAGGTTGGTAGCCAAGGGCTACACTCAAAAGGAAGGAGTTGACTTTCATGAG
ATTTTCTCTCCAGTGGTGAGGCATTCGTCCATTAGATTAATCTTATCTACTGCTATTCACTTTAATATGTTTATTGAACAGATGGATGTCACCACAATTGGAGAA
CTGGAGGAAGTGATTTACACGGCTCAACCTAAGGGCTATAAGGTGAAGGGTAAGGAAGACATGGTTTTTTGTCTTCACAAGTCCATCTATGGACTAAAACAATCT
CCAAGACAATGGTATATCAGGTTCGATACTTTCATTATGAAGCAAGGATTTCACGAGAATTCATATGATGCTTGTGTTTACTGGAAACTATCTCAGAAAGGTACG
TACATCTATCTACTGTTATATGTAGAAGATATGATACTAGTGTCTAAGGATTATGCTGAAATCTGTGAACTCAAGAAACAATTGAGTAATGAGTTTGAAATGAAA
GATTTAGGTGAACTAAAAATGATCTTAGGCATGGATGTAAAAGGGGATAGAGAGAAAGGTTTGTTAACCATTTTGCATGAGAGTTATGTAATTAAACTACTTGAA
AAGTATAATATATCTGGTTGCAACGCAGTTTCAACACCCTTACATCTCATTTTAAACTTTCTCCGTCTCAATGTCCTGTTACTGAACAAGACCTTGATAAAAGAA
TGTGCAGATCTTGATAAAAGAAGGTCTCTATCAGGTCACATTTTTCGCTTGTATGATAATGTTGTCAGTTGGAAAGTTACCATACAACCAGTTGCTGCTTTGTCA
ACTACTGAGTCAGAATATATTTCTTTTGGTGAAGCAGTTAAGGAAGCAGTATGGTTAAAAAGAATTGTTGGTGAGTTGTTACTGCAGAAGTTTATTCCTATCATC
CATTGTGATAGCCAGAGTGCTATTCATCTTGCGAAGAATCCATCTCATCATGAACGATCTAAGCATATCGATGTCAAATTTCATTACATCAGAAACGTTATTGCT
CAGAAAGATGTTCAACTGGTCAAAGTTCATACAGTTGAGAATTTGTCAGATATGTTAATCAAATTTAGAGAATCAATTTGGTGTCTAAAATAG
mRNA sequence
ATGGCAATTGTATTTGATTCAAAGAAACAATGGAAGGATGCTAGGGCACAGAGTTGTTCTTTTTACAAAAGAATCAGACATGGTCGTTGGTTCCAATCAAAGTGG
ATTTATAAAATCAAGCCAGGTACAGGAGGTGACAGTAAGCCTAGGTATAAGGCTAGGTTGGTAGCCAAGGGCTACACTCAAAAGGAAGGAGTTGACTTTCATGAG
ATTTTCTCTCCAGTGGTGAGGCATTCGTCCATTAGATTAATCTTATCTACTGCTATTCACTTTAATATGTTTATTGAACAGATGGATGTCACCACAATTGGAGAA
CTGGAGGAAGTGATTTACACGGCTCAACCTAAGGGCTATAAGGTGAAGGGTAAGGAAGACATGGTTTTTTGTCTTCACAAGTCCATCTATGGACTAAAACAATCT
CCAAGACAATGGTATATCAGGTTCGATACTTTCATTATGAAGCAAGGATTTCACGAGAATTCATATGATGCTTGTGTTTACTGGAAACTATCTCAGAAAGGTACG
TACATCTATCTACTGTTATATGTAGAAGATATGATACTAGTGTCTAAGGATTATGCTGAAATCTGTGAACTCAAGAAACAATTGAGTAATGAGTTTGAAATGAAA
GATTTAGGTGAACTAAAAATGATCTTAGGCATGGATGTAAAAGGGGATAGAGAGAAAGGTTTGTTAACCATTTTGCATGAGAGTTATGTAATTAAACTACTTGAA
AAGTATAATATATCTGGTTGCAACGCAGTTTCAACACCCTTACATCTCATTTTAAACTTTCTCCGTCTCAATGTCCTGTTACTGAACAAGACCTTGATAAAAGAA
TGTGCAGATCTTGATAAAAGAAGGTCTCTATCAGGTCACATTTTTCGCTTGTATGATAATGTTGTCAGTTGGAAAGTTACCATACAACCAGTTGCTGCTTTGTCA
ACTACTGAGTCAGAATATATTTCTTTTGGTGAAGCAGTTAAGGAAGCAGTATGGTTAAAAAGAATTGTTGGTGAGTTGTTACTGCAGAAGTTTATTCCTATCATC
CATTGTGATAGCCAGAGTGCTATTCATCTTGCGAAGAATCCATCTCATCATGAACGATCTAAGCATATCGATGTCAAATTTCATTACATCAGAAACGTTATTGCT
CAGAAAGATGTTCAACTGGTCAAAGTTCATACAGTTGAGAATTTGTCAGATATGTTAATCAAATTTAGAGAATCAATTTGGTGTCTAAAATAG
Protein sequence
MAIVFDSKKQWKDARAQSCSFYKRIRHGRWFQSKWIYKIKPGTGGDSKPRYKARLVAKGYTQKEGVDFHEIFSPVVRHSSIRLILSTAIHFNMFIEQMDVTTIGE
LEEVIYTAQPKGYKVKGKEDMVFCLHKSIYGLKQSPRQWYIRFDTFIMKQGFHENSYDACVYWKLSQKGTYIYLLLYVEDMILVSKDYAEICELKKQLSNEFEMK
DLGELKMILGMDVKGDREKGLLTILHESYVIKLLEKYNISGCNAVSTPLHLILNFLRLNVLLLNKTLIKECADLDKRRSLSGHIFRLYDNVVSWKVTIQPVAALS
TTESEYISFGEAVKEAVWLKRIVGELLLQKFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVQLVKVHTVENLSDMLIKFRESIWCLK
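
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, and the short fragments used here are the first and last codons of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCAATTGTATTTGATTCAAAG"))  # first eight codons of the CDS
print(translate("TGTCTAAAATAG"))              # last four codons, including the stop
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.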