; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0186041 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0186041
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr07:3985092..3986219
RNA-Seq ExpressionCmc07g0186041
SyntenyCmc07g0186041
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.5e-17682.67Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        MLKSI+ILLSIATFY+YEIWQMDVKT F NGNLEESIYM QP+GFI + QEQKV KLQKSIYGLKQASRSWNIRFDTAIK YGFEQNV+E CVYKK+VNS
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
        +VAFL+LYVDDILL+ NDV YL D+KKWL TQFQMKDL +AQY+LGIQIVRNRKN+TLAMSQASYIDK+LSR KMQNSKKG LP+R+GIH SKEQC KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDMRNIPY+SAV SLMYAMLCTRPDICYSVG+V+RYQSN  RDHWT VKNILKYLRRT++YML+Y  KDLILTGYTDSD+Q+DK+ R STSGSVFTL
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRS+KQTCIA+STME EYV ACEAAKEAVWL+KFL DLE +VPNMHLPITLY DNSGAVANS+EPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-16879.2Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        MLKSI+ILLSIATFYDYEIWQMDVKT F NGNLEESI+M+QP+GFI +GQEQKV KL +SIYGLKQASRSWNIRFDTAIK YGF+QNV+E CVYKK+   
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
         VAFLVLYVDDILL+ NDVGYL D+K WLA QFQMKDL +AQYVLGIQI+R+RKN+TLA+SQA+YIDK+L R  MQNSKKGLLP+R+G+H SKEQ  KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDMR IPYASAV SLMYAMLCTRPDICY+VG+V+RYQSN   DHWT VK +LKYLRRT+DYML+Y  KDLILTGYTDSD+QTDK++R STSGSVFTL
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRSIKQ CIA+STME EYV ACEAAKEAVWL+KFL DLE +VPNM+LPITLY DNSGAVANS+EPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.4e-16879.2Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        MLKSI+ILLSIATFYDYEIWQMDVKT F NGNLEESI+M+QP+GFI +GQEQKV KL +SIYGLKQASRSWNIRFDTAIK YGF+QNV+E CVYKK+   
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
         VAFLVLYVDDILL+ NDVGYL D+K WLA QFQMKDL +AQYVLGIQI+R+RKN+TLA+SQA+YIDK+L R  MQNSKKGLLP+R+G+H SKEQ  KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDMR IPYASAV SLMYAMLCTRPDICY+VG+V+RYQSN   DHWT VK +LKYLRRT+DYML+Y  KDLILTGYTDSD+QTDK++R STSGSVFTL
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRSIKQ CIA+STME EYV ACEAAKEAVWL+KFL DLE +VPNM+LPITLY DNSGAVANS+EPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

KAA0066797.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-18597.03Show/hide
Query:  MAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDL
        MAQPKGFIEKGQEQKV KLQKSIYGLK ASRSWNIRFDTAIKFYGFEQNV+E CVYKKVVNSIVAFLVLYVDDILLVKNDVGYLIDIKKWL TQFQMKDL
Subjt:  MAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDL

Query:  RDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTR
        RDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTR
Subjt:  RDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTR

Query:  YQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLK
        YQSNTKRDHWTIVKNILKYLRRTKDYML+YSTKDLILTGYTDSDYQTDKN RNSTSGSVFTLHGGTIVWRSIKQTCIANSTME EYV ACEAAKEAVWLK
Subjt:  YQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLK

Query:  KFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        KFLRDLEKIVPNMHLPITLYGDNSGAV NSREPRSHK
Subjt:  KFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-17081.07Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        M+KSI+ILLSIATFYDYEIWQMDVKTTF N NLEESIYM QP+ FI+KGQEQK+ KLQKSIYGLKQASRS NIRFDTAIK YG EQNV+E CVYK+++NS
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
         VAFLVLYVDDILL+ NDVG+L DIKKWLA QFQMKDL +AQYVLG+QIVRNRKN+TLAMSQ SYIDKMLSR KM NSKKGLLPYRYGIH SKEQC KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDM NIPYASAV SLMY MLCTRP+ICYSVG+V+R QS   RDHWT VKNILKYLRRTKDYML+Y +KDLILTGYTD  +QTDK+ R STSG VFT+
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRSIKQ+CIA+STME EYV  CEAAKEAVWLKKFL DLE +VPNMHLP TLY DNSGAV NSREPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein1.6e-16879.2Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        MLKSI+ILLSIATFYDYEIWQMDVKT F NGNLEESI+M+QP+GFI +GQEQKV KL +SIYGLKQASRSWNIRFDTAIK YGF+QNV+E CVYKK+   
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
         VAFLVLYVDDILL+ NDVGYL D+K WLA QFQMKDL +AQYVLGIQI+R+RKN+TLA+SQA+YIDK+L R  MQNSKKGLLP+R+G+H SKEQ  KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDMR IPYASAV SLMYAMLCTRPDICY+VG+V+RYQSN   DHWT VK +LKYLRRT+DYML+Y  KDLILTGYTDSD+QTDK++R STSGSVFTL
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRSIKQ CIA+STME EYV ACEAAKEAVWL+KFL DLE +VPNM+LPITLY DNSGAVANS+EPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

A0A5A7UYE8 Gag/pol protein1.6e-16879.2Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        MLKSI+ILLSIATFYDYEIWQMDVKT F NGNLEESI+M+QP+GFI +GQEQKV KL +SIYGLKQASRSWNIRFDTAIK YGF+QNV+E CVYKK+   
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
         VAFLVLYVDDILL+ NDVGYL D+K WLA QFQMKDL +AQYVLGIQI+R+RKN+TLA+SQA+YIDK+L R  MQNSKKGLLP+R+G+H SKEQ  KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDMR IPYASAV SLMYAMLCTRPDICY+VG+V+RYQSN   DHWT VK +LKYLRRT+DYML+Y  KDLILTGYTDSD+QTDK++R STSGSVFTL
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRSIKQ CIA+STME EYV ACEAAKEAVWL+KFL DLE +VPNM+LPITLY DNSGAVANS+EPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

A0A5A7VI37 Gag/pol protein4.3e-18597.03Show/hide
Query:  MAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDL
        MAQPKGFIEKGQEQKV KLQKSIYGLK ASRSWNIRFDTAIKFYGFEQNV+E CVYKKVVNSIVAFLVLYVDDILLVKNDVGYLIDIKKWL TQFQMKDL
Subjt:  MAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDL

Query:  RDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTR
        RDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTR
Subjt:  RDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTR

Query:  YQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLK
        YQSNTKRDHWTIVKNILKYLRRTKDYML+YSTKDLILTGYTDSDYQTDKN RNSTSGSVFTLHGGTIVWRSIKQTCIANSTME EYV ACEAAKEAVWLK
Subjt:  YQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLK

Query:  KFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        KFLRDLEKIVPNMHLPITLYGDNSGAV NSREPRSHK
Subjt:  KFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

A0A5D3BX45 Gag/pol protein1.3e-17081.07Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        M+KSI+ILLSIATFYDYEIWQMDVKTTF N NLEESIYM QP+ FI+KGQEQK+ KLQKSIYGLKQASRS NIRFDTAIK YG EQNV+E CVYK+++NS
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
         VAFLVLYVDDILL+ NDVG+L DIKKWLA QFQMKDL +AQYVLG+QIVRNRKN+TLAMSQ SYIDKMLSR KM NSKKGLLPYRYGIH SKEQC KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDM NIPYASAV SLMY MLCTRP+ICYSVG+V+R QS   RDHWT VKNILKYLRRTKDYML+Y +KDLILTGYTD  +QTDK+ R STSG VFT+
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRSIKQ+CIA+STME EYV  CEAAKEAVWLKKFL DLE +VPNMHLP TLY DNSGAV NSREPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

E2GK51 Gag/pol protein (Fragment)7.3e-17782.67Show/hide
Query:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS
        MLKSI+ILLSIATFY+YEIWQMDVKT F NGNLEESIYM QP+GFI + QEQKV KLQKSIYGLKQASRSWNIRFDTAIK YGFEQNV+E CVYKK+VNS
Subjt:  MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
        +VAFL+LYVDDILL+ NDV YL D+KKWL TQFQMKDL +AQY+LGIQIVRNRKN+TLAMSQASYIDK+LSR KMQNSKKG LP+R+GIH SKEQC KTP
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++VEDMRNIPY+SAV SLMYAMLCTRPDICYSVG+V+RYQSN  RDHWT VKNILKYLRRT++YML+Y  KDLILTGYTDSD+Q+DK+ R STSGSVFTL
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        +GG +VWRS+KQTCIA+STME EYV ACEAAKEAVWL+KFL DLE +VPNMHLPITLY DNSGAVANS+EPRSHK
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.9e-5434.29Show/hide
Query:  LKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVY---KKVV
        + S + +LS+   Y+ ++ QMDVKT F NG L+E IYM  P+G         V KL K+IYGLKQA+R W   F+ A+K   F  +  + C+Y   K  +
Subjt:  LKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVY---KKVV

Query:  NSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLK
        N  + +++LYVDD+++   D+  + + K++L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ K+LS+  M+N      P    I++   + L 
Subjt:  NSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLK

Query:  TPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLI----LTGYTDSDYQTDKNTRNSTS
        + +D     N P  S +  LMY MLCTRPD+  +V +++RY S    + W  +K +L+YL+ T D  L++  K+L     + GY DSD+   +  R ST+
Subjt:  TPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLI----LTGYTDSDYQTDKNTRNSTS

Query:  GSVFTLHG-GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK
        G +F +     I W + +Q  +A S+ E EY+   EA +EA+WLK  L  +   + N   PI +Y DN G ++ +  P  HK
Subjt:  GSVFTLHG-GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-8043.47Show/hide
Query:  LKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVY-KKVVNS
        + SI+ +LS+A   D E+ Q+DVKT F +G+LEE IYM QP+GF   G++  V KL KS+YGLKQA R W ++FD+ +K   + +  ++ CVY K+   +
Subjt:  LKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVY-KKVVNS

Query:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP
            L+LYVDD+L+V  D G +  +K  L+  F MKDL  AQ +LG++IVR R +R L +SQ  YI+++L R  M+N+K    P    +  SK+ C  T 
Subjt:  IVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTP

Query:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL
        ++  +M  +PY+SAV SLMYAM+CTRPDI ++VG+V+R+  N  ++HW  VK IL+YLR T    L +   D IL GYTD+D   D + R S++G +FT 
Subjt:  KDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTL

Query:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMH-LPITLYGDNSGAVANSREPRSH
         GG I W+S  Q C+A ST E EY+ A E  KE +WLK+FL++L      +H     +Y D+  A+  S+    H
Subjt:  HGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMH-LPITLYGDNSGAVANSREPRSH

P25600 Putative transposon Ty5-1 protein YCL074W7.6e-3030.57Show/hide
Query:  MDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVAFLVLYVDDILLVKNDVGY
        MDV T F N  ++E IY+ QP GF+ +     V++L   +YGLKQA   WN   +  +K  GF ++  E  +Y +  +    ++ +YVDD+L+       
Subjt:  MDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVAFLVLYVDDILLVKNDVGY

Query:  LIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYA
           +K+ L   + MKDL      LG+ I     N  + +S   YI K  S  ++   K    P    +  SK     T   ++D+   PY S V  L++ 
Subjt:  LIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMYA

Query:  MLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMY-STKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHGGTIVWRSIK-QTCIANST
            RPDI Y V +++R+    +  H    + +L+YL  T+   L Y S   L LT Y D+ +    +  +ST G V  L G  + W S K +  I   +
Subjt:  MLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMY-STKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHGGTIVWRSIK-QTCIANST

Query:  MEVEYVTACEAAKE
         E EY+TA E   E
Subjt:  MEVEYVTACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.6e-4131.45Show/hide
Query:  SIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVA
        SI+I+L +A    + I Q+DV   F  G L + +YM+QP GFI+K +   V KL+K++YGLKQA R+W +     +   GF  +V+++ ++       + 
Subjt:  SIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVA

Query:  FLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDV
        ++++YVDDIL+  ND   L +    L+ +F +KD  +  Y LGI+    R    L +SQ  YI  +L+R  M  +K    P       S     K     
Subjt:  FLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDV

Query:  EDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDY-MLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHG
        E      Y   V SL Y +  TRPDI Y+V  ++++      +H   +K IL+YL  T ++ + +     L L  Y+D+D+  DK+   ST+G +  L  
Subjt:  EDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDY-MLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHG

Query:  GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSH
          I W S KQ  +  S+ E EY +    + E  W+   L +L      +  P  +Y DN GA      P  H
Subjt:  GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-4331.45Show/hide
Query:  SIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVA
        SI+I+L +A    + I Q+DV   F  G L + +YM+QP GF++K +   V +L+K+IYGLKQA R+W +   T +   GF  +++++ ++       + 
Subjt:  SIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVA

Query:  FLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDV
        ++++YVDDIL+  ND   L      L+ +F +K+  D  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P       +     K P   
Subjt:  FLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDV

Query:  EDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDY-MLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHG
        E      Y   V SL Y +  TRPD+ Y+V  +++Y      DHW  +K +L+YL  T D+ + +     L L  Y+D+D+  D +   ST+G +  L  
Subjt:  EDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDY-MLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHG

Query:  GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSH
          I W S KQ  +  S+ E EY +    + E  W+   L +L   +   H P+ +Y DN GA      P  H
Subjt:  GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-4330.35Show/hide
Query:  LKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQE----QKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKV
        L S+K++L+I+  Y++ + Q+D+   F NG+L+E IYM  P G+  +  +      V  L+KSIYGLKQASR W ++F   +  +GF Q+ ++   + K+
Subjt:  LKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQE----QKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKV

Query:  VNSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCL
          ++   +++YVDDI++  N+   + ++K  L + F+++DL   +Y LG++I R+     + + Q  Y   +L    +   K   +P    + FS     
Subjt:  VNSIVAFLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCL

Query:  KTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTK-DLILTGYTDSDYQTDKNTRNSTSGS
         +  D  D +   Y   +  LMY  + TR DI ++V  ++++    +  H   V  IL Y++ T    L YS++ ++ L  ++D+ +Q+ K+TR ST+G 
Subjt:  KTPKDVEDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTK-DLILTGYTDSDYQTDKNTRNSTSGS

Query:  VFTLHGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAV
           L    I W+S KQ  ++ S+ E EY     A  E +WL +F R+L+  +P +  P  L+ DN+ A+
Subjt:  VFTLHGGTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAV

ATMG00240.1 Gag-Pol-related retrotransposon family protein7.4e-0433.33Show/hide
Query:  TRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYS-TKDLILTGYTDSDYQTDKNTRNSTSG
        TRPD+ ++V  ++++ S ++      V  +L Y++ T    L YS T DL L  + DSD+ +  +TR S +G
Subjt:  TRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYS-TKDLILTGYTDSDYQTDKNTRNSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein2.6e-1731.2Show/hide
Query:  FLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDV
        +L+LYVDDILL  +    L  +   L++ F MKDL    Y LGIQI  +     L +SQ  Y +++L+   M + K    P    ++ S     K P D 
Subjt:  FLVLYVDDILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDV

Query:  EDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDY-MLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHG
         D R+I     V +L Y  L TRPDI Y+V +V +         + ++K +L+Y++ T  + + ++    L +  + DSD+    +TR ST+G    L  
Subjt:  EDMRNIPYASAVRSLMYAMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDY-MLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHG

Query:  GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVW
          I W + +Q  ++ S+ E EY      A E  W
Subjt:  GTIVWRSIKQTCIANSTMEVEYVTACEAAKEAVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAAGTCAATTAAAATACTCTTGTCCATCGCCACATTTTATGATTATGAAATTTGGCAGATGGATGTTAAGACAACTTTTTTTAATGGAAATCTTGAGGAGAGTAT
CTATATGGCTCAACCAAAGGGGTTCATAGAAAAGGGTCAAGAACAAAAGGTTTTTAAGCTTCAGAAATCCATTTATGGTTTGAAGCAAGCATCTAGATCCTGGAATATAA
GATTTGATACTGCGATCAAGTTTTATGGCTTTGAACAAAATGTTAACGAATCTTGTGTTTACAAAAAGGTCGTCAATTCCATTGTAGCATTCTTAGTATTATATGTAGAT
GATATTCTACTCGTTAAAAATGACGTAGGTTATCTTATTGATATCAAGAAATGGCTTGCTACGCAATTTCAAATGAAAGATTTGAGAGATGCACAATATGTTCTTGGAAT
CCAAATTGTTCGGAATCGTAAAAATAGAACACTAGCCATGTCTCAAGCATCTTATATTGACAAAATGTTGTCTAGATGTAAAATGCAGAATTCCAAAAAGGGTTTGCTGC
CATACAGATATGGAATTCATTTTTCAAAGGAACAATGTCTTAAGACACCTAAAGATGTTGAGGATATGAGAAATATTCCTTATGCTTCCGCTGTCAGGAGTTTGATGTAT
GCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCACTAGGTATCAGTCCAATACCAAACGTGATCATTGGACAATCGTTAAGAATATTCTAAAATA
TCTTAGAAGAACAAAAGACTACATGCTCATGTATAGTACTAAGGATCTGATCCTTACTGGGTACACTGATTCTGATTACCAAACAGATAAAAATACTAGAAATTCTACAT
CAGGATCGGTATTCACTCTACATGGAGGAACAATCGTATGGAGAAGCATAAAACAAACTTGTATAGCCAACTCCACAATGGAAGTCGAATATGTAACAGCTTGCGAAGCA
GCAAAGGAAGCAGTATGGCTAAAGAAATTCTTAAGAGATCTGGAAAAAATTGTTCCAAATATGCATCTGCCAATCACTTTATACGGTGACAACAGTGGTGCAGTTGCAAA
TTCACGAGAACCTAGAAGCCATAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTAAGTCAATTAAAATACTCTTGTCCATCGCCACATTTTATGATTATGAAATTTGGCAGATGGATGTTAAGACAACTTTTTTTAATGGAAATCTTGAGGAGAGTAT
CTATATGGCTCAACCAAAGGGGTTCATAGAAAAGGGTCAAGAACAAAAGGTTTTTAAGCTTCAGAAATCCATTTATGGTTTGAAGCAAGCATCTAGATCCTGGAATATAA
GATTTGATACTGCGATCAAGTTTTATGGCTTTGAACAAAATGTTAACGAATCTTGTGTTTACAAAAAGGTCGTCAATTCCATTGTAGCATTCTTAGTATTATATGTAGAT
GATATTCTACTCGTTAAAAATGACGTAGGTTATCTTATTGATATCAAGAAATGGCTTGCTACGCAATTTCAAATGAAAGATTTGAGAGATGCACAATATGTTCTTGGAAT
CCAAATTGTTCGGAATCGTAAAAATAGAACACTAGCCATGTCTCAAGCATCTTATATTGACAAAATGTTGTCTAGATGTAAAATGCAGAATTCCAAAAAGGGTTTGCTGC
CATACAGATATGGAATTCATTTTTCAAAGGAACAATGTCTTAAGACACCTAAAGATGTTGAGGATATGAGAAATATTCCTTATGCTTCCGCTGTCAGGAGTTTGATGTAT
GCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCACTAGGTATCAGTCCAATACCAAACGTGATCATTGGACAATCGTTAAGAATATTCTAAAATA
TCTTAGAAGAACAAAAGACTACATGCTCATGTATAGTACTAAGGATCTGATCCTTACTGGGTACACTGATTCTGATTACCAAACAGATAAAAATACTAGAAATTCTACAT
CAGGATCGGTATTCACTCTACATGGAGGAACAATCGTATGGAGAAGCATAAAACAAACTTGTATAGCCAACTCCACAATGGAAGTCGAATATGTAACAGCTTGCGAAGCA
GCAAAGGAAGCAGTATGGCTAAAGAAATTCTTAAGAGATCTGGAAAAAATTGTTCCAAATATGCATCTGCCAATCACTTTATACGGTGACAACAGTGGTGCAGTTGCAAA
TTCACGAGAACCTAGAAGCCATAAATAG
Protein sequenceShow/hide protein sequence
MLKSIKILLSIATFYDYEIWQMDVKTTFFNGNLEESIYMAQPKGFIEKGQEQKVFKLQKSIYGLKQASRSWNIRFDTAIKFYGFEQNVNESCVYKKVVNSIVAFLVLYVD
DILLVKNDVGYLIDIKKWLATQFQMKDLRDAQYVLGIQIVRNRKNRTLAMSQASYIDKMLSRCKMQNSKKGLLPYRYGIHFSKEQCLKTPKDVEDMRNIPYASAVRSLMY
AMLCTRPDICYSVGMVTRYQSNTKRDHWTIVKNILKYLRRTKDYMLMYSTKDLILTGYTDSDYQTDKNTRNSTSGSVFTLHGGTIVWRSIKQTCIANSTMEVEYVTACEA
AKEAVWLKKFLRDLEKIVPNMHLPITLYGDNSGAVANSREPRSHK