CuGenDBv2

CSPI01G22600 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G22600
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr1:18156738..18159836
RNA-Seq Expression: CSPI01G22600
Synteny: CSPI01G22600
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043826.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 2.0e-290; % identity: 71.05)
Query:  MDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKE
        MDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYGRDSLTMS+VLDALKTR+LEIKKE KD ELLM RGRSEKKSWKGKE
Subjt:  MDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKE

Query:  KSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV-------------------------------------------------
        +S   ++ G                    ++     TS ANV DGY+SAE+                                                 
Subjt:  KSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV-------------------------------------------------

Query:  ----------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAI
                        + G GS+QIATHDGM+R+ TNV YVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKL+GTLR+GL+VLEGTTVS SAAI
Subjt:  ----------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAI

Query:  ASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR-
        AS KVTDMSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKH T GILDYVHSDLWGP K  SM GSRYF+SIIDDFSR 
Subjt:  ASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR-

Query:  -------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
                           KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
Subjt:  -------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA

Query:  AQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVI
        AQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKC+FIGYPQGVKGYKLWC+EKG+NKCII  DVTFNETEM YCV 
Subjt:  AQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVI

Query:  EQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADL
        EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDEG   E+SSSNN+LQNYQLTRDRVQRER APIRYGYADL
Subjt:  EQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADL

Query:  VAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        VAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  VAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

KAA0045141.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 1.1e-285; % identity: 67.94)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDFALWRKKIR ILVQ KVAKILDEE LPE I ESEK+DMDEM YSTILLYL D+V RLVD+ATTTG LWKKLESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQAVKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKEKSS
        Y+KEKFFG KMDQSK LEENL+EFQKIIVDLNNI                          VLDALKTR+LEIKKE KD+ELLM RGRSEKKS KGKE+SS
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQAVKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKEKSS

Query:  SKEASGSQYGPGETSSANVIDGYDSAEV----------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENG
         + ++         +SA ++DGYDS E                       + G GS+QIATHDGM+R+ TNVRYVP+L RNLIS+G+LDRSG T KSEN 
Subjt:  SKEASGSQYGPGETSSANVIDGYDSAEV----------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENG

Query:  VMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNG
        VMKVTKGSLVKLRGTLR+GL+VLE T VS SAAIAS KVTDM MLWHKRLAHVSERGLQALSQQGLL GVK+VEL FCEHCIM KSTRV+FGKGKHT  G
Subjt:  VMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNG

Query:  ILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTP
        ILDYVHSDLWG MK  SM G RYF+SIIDDFSR                    KKQVE QTGRK+KYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTP
Subjt:  ILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTP

Query:  QQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQG
        QQNGLAERFN+TIME TRCLLTNASLPLKFWGEAAQT CYLINRS S AL+LKTP EVW GKAPSLDHL  FGC+TY HVKDGKLNKR LKCMFIGYPQG
Subjt:  QQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQG

Query:  VKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHS
        VKGYKLWCLEKG+ KCII  DVTFNETEM YCV EQQKQ+  DH                                                        
Subjt:  VKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHS

Query:  EKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVP
                     LTRDR QRER APIRYGYADLVAYALTCA + IEAE LTFEEAIVSDSKKQWKDAMEAELFSL KNQTWSLVP
Subjt:  EKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVP

KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 72.53)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDFALWRKKIRAILVQ KVAKILDEE LP+ I ESEK+DMDEM Y TILLYL D+V RLVDEATTTG LWKKLESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR
        Y+KEKFFGYKMDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYG DSLTMS+VLDALKTR+LEIKKE KD ELLM RGR
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR

Query:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------
        SEKKSWKGKE+S   ++ G                    ++     TS ANV DGY+SAE+                                       
Subjt:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------

Query:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE
                                  + G GS+QIATHDGM+R+ TNVRYVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKLRGTLR+GL+VLE
Subjt:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE

Query:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF
        GTTVS SAAIAS KVT+MSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKHTT GILDY+HSDLWGP K  SM GSRYF
Subjt:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF

Query:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        +SIIDDFSR                    KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF
        SLPLKFWGEAAQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKCMFIGYPQGVKGYKLWC+EKG+NKCII  DVTF
Subjt:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF

Query:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ
        NETEM YCV EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDE    E+SSSNN+LQNYQLTRDRVQRER 
Subjt:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ

Query:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        APIRYGYADLVAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 72.87)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDF+LWRKKIRAILVQ KVAKILDEE LP+ I ESEK+DMDEM YSTILLYL D+V RLVDEATTTG LWKKLESLYLTKSL NK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR
        Y+KEKFFGYKMDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYGRDSLTMS+VLDALKTR+LEIKKE KD ELLM RGR
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR

Query:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------
        SEKKSWKGKE+S   ++ G                    ++     TS ANV DGY+SAE+                                       
Subjt:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------

Query:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE
                                  + G GS+QIATHDGM+R+ TNVRYVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKLRGTLR+GL+VLE
Subjt:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE

Query:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF
        GTTVS SAAIAS KVTDMSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKHTT GILDYVHSDLWGP K  SM GSRYF
Subjt:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF

Query:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        +SIIDDFSR                    KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF
        SLPLKFWGEAAQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKCMFIGYPQGVKGYKLWC+EKG+NKCII  DVTF
Subjt:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF

Query:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ
        NETEM YCV EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDEG   E+SSSNN+LQNYQLTRDRVQRER 
Subjt:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ

Query:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        APIRYGYADLVAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 73.1)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDFALWRKKIRAILVQ KVAKILDEE LP+ I ESEK+DMDEM YSTILLYL D+V RLVDEATTTG LWKKLESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR
        Y+KEKFFGYKMDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYGRDSLTMS+VLDALKTR+LEIKKE KD ELLM RGR
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR

Query:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------
        SEKKSWKGKE+S   ++ G                    ++     TS ANV DGY+SAE+                                       
Subjt:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------

Query:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE
                                  + G GS+QIATHDGM+R+ TNVRYVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKLRGTLR+GL+VLE
Subjt:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE

Query:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF
        GTTVS SAAIAS KVTDMSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKHTT GILDYVHSDLWGP K  SM GSRYF
Subjt:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF

Query:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        +SIIDDFSR                    KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF
        SLPLKFWGEAAQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKCMFIGYPQGVKGYKLWC+EKG+NKCII  DVTF
Subjt:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF

Query:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ
        NETEM YCV EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDEG   E+SSSNN+LQNYQLTRDRVQRER 
Subjt:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ

Query:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        APIRYGYADLVAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TP18 Putative gag-pol polyprotein (e value: 9.8e-291; % identity: 71.05)
Query:  MDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKE
        MDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYGRDSLTMS+VLDALKTR+LEIKKE KD ELLM RGRSEKKSWKGKE
Subjt:  MDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKE

Query:  KSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV-------------------------------------------------
        +S   ++ G                    ++     TS ANV DGY+SAE+                                                 
Subjt:  KSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV-------------------------------------------------

Query:  ----------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAI
                        + G GS+QIATHDGM+R+ TNV YVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKL+GTLR+GL+VLEGTTVS SAAI
Subjt:  ----------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAI

Query:  ASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR-
        AS KVTDMSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKH T GILDYVHSDLWGP K  SM GSRYF+SIIDDFSR 
Subjt:  ASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR-

Query:  -------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
                           KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
Subjt:  -------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA

Query:  AQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVI
        AQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKC+FIGYPQGVKGYKLWC+EKG+NKCII  DVTFNETEM YCV 
Subjt:  AQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVI

Query:  EQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADL
        EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDEG   E+SSSNN+LQNYQLTRDRVQRER APIRYGYADL
Subjt:  EQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADL

Query:  VAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        VAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  VAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

A0A5A7TUE4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.6e-286; % identity: 67.94)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDFALWRKKIR ILVQ KVAKILDEE LPE I ESEK+DMDEM YSTILLYL D+V RLVD+ATTTG LWKKLESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQAVKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKEKSS
        Y+KEKFFG KMDQSK LEENL+EFQKIIVDLNNI                          VLDALKTR+LEIKKE KD+ELLM RGRSEKKS KGKE+SS
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQAVKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKEKSS

Query:  SKEASGSQYGPGETSSANVIDGYDSAEV----------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENG
         + ++         +SA ++DGYDS E                       + G GS+QIATHDGM+R+ TNVRYVP+L RNLIS+G+LDRSG T KSEN 
Subjt:  SKEASGSQYGPGETSSANVIDGYDSAEV----------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENG

Query:  VMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNG
        VMKVTKGSLVKLRGTLR+GL+VLE T VS SAAIAS KVTDM MLWHKRLAHVSERGLQALSQQGLL GVK+VEL FCEHCIM KSTRV+FGKGKHT  G
Subjt:  VMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNG

Query:  ILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTP
        ILDYVHSDLWG MK  SM G RYF+SIIDDFSR                    KKQVE QTGRK+KYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTP
Subjt:  ILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTP

Query:  QQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQG
        QQNGLAERFN+TIME TRCLLTNASLPLKFWGEAAQT CYLINRS S AL+LKTP EVW GKAPSLDHL  FGC+TY HVKDGKLNKR LKCMFIGYPQG
Subjt:  QQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQG

Query:  VKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHS
        VKGYKLWCLEKG+ KCII  DVTFNETEM YCV EQQKQ+  DH                                                        
Subjt:  VKGYKLWCLEKGINKCIIGIDVTFNETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHS

Query:  EKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVP
                     LTRDR QRER APIRYGYADLVAYALTCA + IEAE LTFEEAIVSDSKKQWKDAMEAELFSL KNQTWSLVP
Subjt:  EKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVP

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class (e value: 0.0e+00; % identity: 72.53)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDFALWRKKIRAILVQ KVAKILDEE LP+ I ESEK+DMDEM Y TILLYL D+V RLVDEATTTG LWKKLESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR
        Y+KEKFFGYKMDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYG DSLTMS+VLDALKTR+LEIKKE KD ELLM RGR
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR

Query:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------
        SEKKSWKGKE+S   ++ G                    ++     TS ANV DGY+SAE+                                       
Subjt:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------

Query:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE
                                  + G GS+QIATHDGM+R+ TNVRYVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKLRGTLR+GL+VLE
Subjt:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE

Query:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF
        GTTVS SAAIAS KVT+MSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKHTT GILDY+HSDLWGP K  SM GSRYF
Subjt:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF

Query:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        +SIIDDFSR                    KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF
        SLPLKFWGEAAQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKCMFIGYPQGVKGYKLWC+EKG+NKCII  DVTF
Subjt:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF

Query:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ
        NETEM YCV EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDE    E+SSSNN+LQNYQLTRDRVQRER 
Subjt:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ

Query:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        APIRYGYADLVAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

A0A5A7UB25 Putative gag-pol polyprotein (e value: 0.0e+00; % identity: 72.87)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDF+LWRKKIRAILVQ KVAKILDEE LP+ I ESEK+DMDEM YSTILLYL D+V RLVDEATTTG LWKKLESLYLTKSL NK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR
        Y+KEKFFGYKMDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYGRDSLTMS+VLDALKTR+LEIKKE KD ELLM RGR
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR

Query:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------
        SEKKSWKGKE+S   ++ G                    ++     TS ANV DGY+SAE+                                       
Subjt:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------

Query:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE
                                  + G GS+QIATHDGM+R+ TNVRYVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKLRGTLR+GL+VLE
Subjt:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE

Query:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF
        GTTVS SAAIAS KVTDMSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKHTT GILDYVHSDLWGP K  SM GSRYF
Subjt:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF

Query:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        +SIIDDFSR                    KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF
        SLPLKFWGEAAQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKCMFIGYPQGVKGYKLWC+EKG+NKCII  DVTF
Subjt:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF

Query:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ
        NETEM YCV EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDEG   E+SSSNN+LQNYQLTRDRVQRER 
Subjt:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ

Query:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        APIRYGYADLVAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

A0A5D3DNU1 Putative gag-pol polyprotein (e value: 0.0e+00; % identity: 73.1)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL
        M STRFEV+KF+  GDFALWRKKIRAILVQ KVAKILDEE LP+ I ESEK+DMDEM YSTILLYL D+V RLVDEATTTG LWKKLESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKL

Query:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR
        Y+KEKFFGYKMDQSK LEENL+EFQKI+VDLNNIGEKMSDENQA             VKAAIKYGRDSLTMS+VLDALKTR+LEIKKE KD ELLM RGR
Subjt:  YLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQA-------------VKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGR

Query:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------
        SEKKSWKGKE+S   ++ G                    ++     TS ANV DGY+SAE+                                       
Subjt:  SEKKSWKGKEKSSSKEASG--------------------SQYGPGETSSANVIDGYDSAEV---------------------------------------

Query:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE
                                  + G GS+QIATHDGM+R+ TNVRYVP+LKRNLIS+G+LDRSG T KSENGVMKVTKGSLVKLRGTLR+GL+VLE
Subjt:  --------------------------LMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLE

Query:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF
        GTTVS SAAIAS KVTDMSMLWHKRLAHVSERGLQALSQQGLL GVK+VELPFCEHCIM KSTRV+FGKGKHTT GILDYVHSDLWGP K  SM GSRYF
Subjt:  GTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYF

Query:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        +SIIDDFSR                    KKQVENQTGRKVKYLRTDNGLEFVNNKFN FCKS+GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  LSIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF
        SLPLKFWGEAAQT CYLINRS STAL+LKTPQEVWTGKAPSL+HLR FGC+ YAHVKDGKLNKR LKCMFIGYPQGVKGYKLWC+EKG+NKCII  DVTF
Subjt:  SLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF

Query:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ
        NETEM YCV EQQKQ+  DHV TEVR+ SE+RPS+ LD   +Q PLVS+ E T Q E DGIQSQQERILIDEG   E+SSSNN+LQNYQLTRDRVQRER 
Subjt:  NETEMSYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQ

Query:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI
        APIRYGYADLVAYALTCAA+SIEAE LTFEEAIVSDSKKQWKDAME ELFSL KNQTWSLVPKPPNQKLI
Subjt:  APIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 8.8e-55; % identity: 25.88)
Query:  FALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKLYLKEKFFGYKMDQSKG
        +A+W+ +IRA+L +Q V K++D   +P  + +S KK  +    STI+ YL D          T   + + L+++Y  KSL ++L L+++    K+     
Subjt:  FALWRKKIRAILVQQKVAKILDEENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKLYLKEKFFGYKMDQSKG

Query:  LEENLNEFQKIIVDLNNIGEKMSDENQ--------------AVKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKD------------------EELLMV
        L  + + F ++I +L   G K+ + ++               + A      ++LT++ V + L  + ++IK +H D                    L   
Subjt:  LEENLNEFQKIIVDLNNIGEKMSDENQ--------------AVKAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKD------------------EELLMV

Query:  RGRSEKKSWKG------------------------------KEKSSSKE-----ASGSQYGPGETSSANVIDG--------------------YDSAEVL
        R    KK +KG                              K K + K+     + G  +   E ++ +V+D                      DS EV+
Subjt:  RGRSEKKSWKG------------------------------KEKSSSKE-----ASGSQYGPGETSSANVIDG--------------------YDSAEVL

Query:  MGI-------GSIQIATHDGMIRV-------FTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGT-LRNGLHVLEGTTVSDSAAIAS
          +       G    AT  G++R+         +V +  E   NL+S+  L  +G + + +   + ++K  L+ ++ + + N + V+     S +A    
Subjt:  MGI-------GSIQIATHDGMIRV-------FTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGT-LRNGLHVLEGTTVSDSAAIAS

Query:  DKVTDMSMLWHKRLAHVSERGL------QALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTN--GILDYVHSDLWGPMKVPSMEGSRYFLSII
         K  +   LWH+R  H+S+  L         S Q LL  + ++    CE C+  K  R+ F + K  T+    L  VHSD+ GP+   +++   YF+  +
Subjt:  DKVTDMSMLWHKRLAHVSERGL------QALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTN--GILDYVHSDLWGPMKVPSMEGSRYFLSII

Query:  DDFSR---------KKQV-----------ENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPL
        D F+          K  V           E     KV YL  DNG E+++N+   FC  KGI+ H TV +TPQ NG++ER  RTI E+ R +++ A L  
Subjt:  DDFSR---------KKQV-----------ENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPL

Query:  KFWGEAAQTVCYLINRSHSTAL--DLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKD--GKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF
         FWGEA  T  YLINR  S AL    KTP E+W  K P L HLR FG + Y H+K+  GK + +  K +F+GY     G+KLW  +    K I+  DV  
Subjt:  KFWGEAAQTVCYLINRSHSTAL--DLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKD--GKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTF

Query:  NETEM-SYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEG--VHSEKSSSNNNLQNYQLTRD
        +ET M +   ++ +   + D  E+E    ++  P+    +   + P  S+     QF  D  +S+ +    D    + +E  + +    N Q  +D
Subjt:  NETEM-SYCVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEG--VHSEKSSSNNNLQNYQLTRD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 9.3e-105; % identity: 32.88)
Query:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILD-EENLPETIAESEKKDMDEMVYSTILLYLPDDVF-RLVDEATTTGVLWKKLESLYLTKSLPN
        M+  ++EVAKF+    F+ W++++R +L+QQ + K+LD +   P+T+   +  D+DE   S I L+L DDV   ++DE T  G+ W +LESLY++K+L N
Subjt:  MTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILD-EENLPETIAESEKKDMDEMVYSTILLYLPDDVF-RLVDEATTTGVLWKKLESLYLTKSLPN

Query:  KLYLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQAV-------------KAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMV-
        KLYLK++ +   M +      +LN F  +I  L N+G K+ +E++A+                I +G+ ++ +  V  AL       KK     + L+  
Subjt:  KLYLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQAV-------------KAAIKYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMV-

Query:  -RGRSEKKS--------WKGKEKSSSKEASGSQY-----------------GPGETS-----------------------------------SANVID--
         RGRS ++S         +GK K+ SK    + Y                 G GETS                                   S  V+D  
Subjt:  -RGRSEKKS--------WKGKEKSSSKEASGSQY-----------------GPGETS-----------------------------------SANVID--

Query:  -------------------------GYDSAEVLMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRN
                                 G  S   + GIG I I T+ G   V  +VR+VP+L+ NLIS   LDR GY     N   ++TKGSLV  +G  R 
Subjt:  -------------------------GYDSAEVLMGIGSIQIATHDGMIRVFTNVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRN

Query:  GLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSM
         L+             A D+++    LWHKR+ H+SE+GLQ L+++ L+   K   +  C++C+  K  RV F         ILD V+SD+ GPM++ SM
Subjt:  GLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSM

Query:  EGSRYFLSIIDDFSRK--------------------KQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTR
         G++YF++ IDD SRK                      VE +TGRK+K LR+DNG E+ + +F  +C S GI    TV  TPQ NG+AER NRTI+E+ R
Subjt:  EGSRYFLSIIDDFSRK--------------------KQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTR

Query:  CLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHV---KDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINK
         +L  A LP  FWGEA QT CYLINRS S  L  + P+ VWT K  S  HL+ FGC  +AHV   +  KL+ + + C+FIGY     GY+LW   K   K
Subjt:  CLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHV---KDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINK

Query:  CIIGIDVTFNETEMSYC--VIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQ
         I   DV F E+E+     + E+ K  I+ +  T         PS   + +S +S   +  E + Q E  G   +Q   L DEGV   +  +    Q+  
Subjt:  CIIGIDVTFNETEMSYC--VIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQ

Query:  LTRDRVQRERQAPIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQK
        L R   +R R    RY   + V         S + E  + +E +    K Q   AM+ E+ SLQKN T+ LV  P  ++
Subjt:  LTRDRVQRERQAPIRYGYADLVAYALTCAANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQK

P93293 Uncharacterized mitochondrial protein AtMg00300 (e value: 6.8e-23; %identity: 45.3)
Query:  GVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTN
        GV+KV KG    L+G   + L++L+G+  +  + +A +   D + LWH RLAH+S+RG++ L ++G L   K   L FCE CI  K+ RV F  G+HTT 
Subjt:  GVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTN

Query:  GILDYVHSDLWGPMKVP
          LDYVHSDLWG   VP
Subjt:  GILDYVHSDLWGPMKVP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.8e-32; %identity: 27.54)
Query:  EVLMGIGSIQIATHDGMIRVFT--------NVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLEGTT--------VSDSA
        +V++  GS    +H G   + T        N+ YVP + +NLIS+       Y   + NGV      +  +++  L  G+ +L+G T        ++ S 
Subjt:  EVLMGIGSIQIATHDGMIRVFT--------NVRYVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLEGTT--------VSDSA

Query:  AIA-----SDKVTDMSMLWHKRLAHVSERGLQA-LSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFLS
         ++     S K T  S  WH RL H +   L + +S   L       +   C  C++ KS +V F +    +   L+Y++SD+W    + S +  RY++ 
Subjt:  AIA-----SDKVTDMSMLWHKRLAHVSERGLQA-LSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFLS

Query:  IIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASL
         +D F+R                    K  +EN+   ++    +DNG EFV      +    GI+   +  +TP+ NGL+ER +R I+E    LL++AS+
Subjt:  IIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASL

Query:  PLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVK---DGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVT
        P  +W  A     YLINR  +  L L++P +   G +P+ D LR FGC+ Y  ++     KL+ +  +C+F+GY      Y   CL    ++  I   V 
Subjt:  PLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVK---DGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVT

Query:  FNE
        F+E
Subjt:  FNE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 6.2e-32; %identity: 28.47)
Query:  EVLMGIGSIQIATHDGMIRVFTNVR--------YVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSL-VKLRGTLRNGLHVLEGTT--------VSDS
        +V++  GS    TH G   + T+ R        YVP + +NLIS+  L      C +    ++    S  VK    L  G+ +L+G T        ++ S
Subjt:  EVLMGIGSIQIATHDGMIRVFTNVR--------YVPELKRNLISIGDLDRSGYTCKSENGVMKVTKGSL-VKLRGTLRNGLHVLEGTT--------VSDS

Query:  AAIA-----SDKVTDMSMLWHKRLAHVSERGLQA-LSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFL
         A++       K T  S  WH RL H S   L + +S   L       +L  C  C + KS +V F     T++  L+Y++SD+W    + S++  RY++
Subjt:  AAIA-----SDKVTDMSMLWHKRLAHVSERGLQA-LSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFL

Query:  SIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNAS
          +D F+R                    K  VEN+   ++  L +DNG EFV      +    GI+   +  +TP+ NGL+ER +R I+E    LL++AS
Subjt:  SIIDDFSR--------------------KKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNAS

Query:  LPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKD---GKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDV
        +P  +W  A     YLINR  +  L L++P +   G+ P+ + L+ FGC+ Y  ++     KL  +  +C F+GY      Y   CL     +      V
Subjt:  LPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKD---GKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDV

Query:  TFNE
         F+E
Subjt:  TFNE

Arabidopsis top hits (e value, %identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 4.8e-24; %identity: 45.3)
Query:  GVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTN
        GV+KV KG    L+G   + L++L+G+  +  + +A +   D + LWH RLAH+S+RG++ L ++G L   K   L FCE CI  K+ RV F  G+HTT 
Subjt:  GVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRFGKGKHTTN

Query:  GILDYVHSDLWGPMKVP
          LDYVHSDLWG   VP
Subjt:  GILDYVHSDLWGPMKVP

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.7e-14; %identity: 42.68)
Query:  NRTIMERTRCLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLK
        NRTI+E+ R +L    LP  F  +AA T  ++IN+  STA++   P EVW    P+  +LR FGC  Y H  +GKL  R  K
Subjt:  NRTIMERTRCLLTNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLK


Sequences
CDS sequence
ATGGTGAACGACGCTCGTTTGCGGCGTTACTCCATTGGTGATTGGCTTCGGGCAGAAGCAACTTTCTTGCTTCAGACTTGTTTCAGAGTTGTATTCCGTCGGTGCCGAAA
GTATAACAAGTGGTATCAGAGCTTCATTCGATCCAAAGAAGAATTTGGGTTTTGTGTGGTAGTTTGTTGTTTACATATTCTTGGTTTAATCAGTTCTGTAAAGATGACTT
CAACACGCTTTGAGGTGGCTAAGTTTGACGACATGGGTGATTTTGCTCTTTGGAGGAAAAAGATTAGAGCTATTTTAGTTCAACAAAAAGTAGCTAAAATCTTAGATGAA
GAGAACCTTCCTGAAACTATTGCAGAAAGTGAGAAAAAGGATATGGATGAAATGGTCTATTCAACGATCCTTCTGTATCTGCCAGATGATGTGTTTAGGCTTGTAGATGA
GGCTACTACTACAGGGGTGTTGTGGAAGAAGCTAGAAAGTCTTTACTTGACAAAATCATTGCCAAATAAATTATATCTAAAGGAGAAATTTTTTGGATATAAGATGGACC
AAAGTAAAGGCTTAGAAGAGAACTTGAATGAATTTCAGAAGATTATAGTTGATCTCAACAACATCGGTGAGAAGATGTCAGATGAGAATCAAGCAGTTAAGGCAGCTATT
AAATATGGTCGGGATTCATTGACCATGAGTGTAGTGTTGGATGCCTTAAAGACTAGAAGTCTCGAAATTAAGAAAGAACACAAGGATGAAGAGTTACTCATGGTCAGAGG
AAGGAGTGAGAAAAAGAGCTGGAAAGGCAAAGAGAAGAGTTCCAGCAAGGAAGCATCGGGTAGCCAATATGGCCCAGGTGAGACTAGTTCAGCAAATGTTATTGATGGGT
ATGATTCGGCAGAGGTCTTGATGGGAATTGGTTCAATCCAAATTGCAACACATGACGGAATGATCAGAGTCTTCACTAATGTTAGATATGTTCCGGAACTCAAACGTAAT
CTAATATCTATTGGTGATTTAGATAGATCAGGTTATACTTGTAAATCTGAAAATGGAGTTATGAAAGTTACCAAGGGTTCTTTGGTTAAACTGAGGGGAACCTTGAGGAA
TGGTTTGCATGTGTTGGAAGGTACTACAGTTTCAGACAGTGCTGCTATTGCATCAGACAAAGTGACAGATATGTCTATGTTATGGCACAAAAGGTTAGCTCATGTGAGTG
AAAGAGGCTTACAAGCACTCTCTCAACAAGGTTTGTTGAGAGGAGTTAAAGATGTTGAACTACCATTTTGTGAGCATTGTATAATGAGAAAGTCTACCAGAGTAAGGTTT
GGGAAAGGGAAGCACACGACCAACGGTATTTTGGATTATGTTCATTCAGATTTGTGGGGTCCTATGAAGGTGCCTTCTATGGAAGGTTCAAGATACTTCTTATCTATCAT
TGACGATTTTTCAAGGAAGAAGCAGGTTGAAAACCAAACAGGTAGGAAGGTCAAGTATTTGAGGACAGATAATGGTTTAGAATTTGTGAATAACAAATTCAACACCTTTT
GCAAATCGAAGGGAATTACGAGACATTTTACTGTTACGTACACTCCACAACAAAATGGTTTGGCTGAGAGGTTTAACAGAACTATTATGGAACGTACTAGGTGTCTCTTG
ACTAATGCTTCATTACCCTTGAAATTTTGGGGGGAAGCTGCTCAAACAGTATGTTATCTCATTAATAGGAGTCATTCTACAGCTTTAGACTTAAAAACTCCACAAGAGGT
ATGGACAGGTAAAGCTCCAAGTTTAGATCATCTCAGAGCGTTCGGATGCTCAACTTATGCTCATGTTAAAGATGGAAAGCTGAACAAGAGGGTACTGAAATGCATGTTTA
TTGGTTATCCTCAGGGAGTCAAAGGTTATAAACTTTGGTGCTTGGAGAAGGGGATAAATAAATGCATTATCGGCATAGATGTAACTTTTAATGAGACAGAAATGTCGTAT
TGTGTCATAGAGCAACAGAAACAGAAGATTGTTGATCATGTTGAGACAGAGGTCAGAGTTGATTCAGAAATACGACCATCAGTTGGCTTAGATGTATCTAGTGATCAGTC
ACCACTAGTTTCACAAACAGAGGCTACACACCAGTTTGAATCTGATGGTATACAGTCTCAACAGGAGAGGATTTTGATTGATGAGGGAGTTCACAGTGAAAAAAGCTCAA
GTAATAATAACCTACAGAACTATCAGCTTACTCGGGACAGGGTTCAAAGGGAAAGACAGGCTCCCATAAGGTATGGTTATGCTGACTTAGTTGCTTATGCTCTTACTTGC
GCAGCTAACAGTATTGAAGCAGAGGCTCTTACTTTTGAAGAGGCAATCGTATCTGATTCTAAGAAACAATGGAAGGATGCTATGGAGGCAGAGTTGTTCTCTTTACAAAA
GAATCAGACATGGTCATTGGTTCCAAAGCCCCCTAACCAGAAGCTCATTTAA
mRNA sequence
ATGGTGAACGACGCTCGTTTGCGGCGTTACTCCATTGGTGATTGGCTTCGGGCAGAAGCAACTTTCTTGCTTCAGACTTGTTTCAGAGTTGTATTCCGTCGGTGCCGAAA
GTATAACAAGTGGTATCAGAGCTTCATTCGATCCAAAGAAGAATTTGGGTTTTGTGTGGTAGTTTGTTGTTTACATATTCTTGGTTTAATCAGTTCTGTAAAGATGACTT
CAACACGCTTTGAGGTGGCTAAGTTTGACGACATGGGTGATTTTGCTCTTTGGAGGAAAAAGATTAGAGCTATTTTAGTTCAACAAAAAGTAGCTAAAATCTTAGATGAA
GAGAACCTTCCTGAAACTATTGCAGAAAGTGAGAAAAAGGATATGGATGAAATGGTCTATTCAACGATCCTTCTGTATCTGCCAGATGATGTGTTTAGGCTTGTAGATGA
GGCTACTACTACAGGGGTGTTGTGGAAGAAGCTAGAAAGTCTTTACTTGACAAAATCATTGCCAAATAAATTATATCTAAAGGAGAAATTTTTTGGATATAAGATGGACC
AAAGTAAAGGCTTAGAAGAGAACTTGAATGAATTTCAGAAGATTATAGTTGATCTCAACAACATCGGTGAGAAGATGTCAGATGAGAATCAAGCAGTTAAGGCAGCTATT
AAATATGGTCGGGATTCATTGACCATGAGTGTAGTGTTGGATGCCTTAAAGACTAGAAGTCTCGAAATTAAGAAAGAACACAAGGATGAAGAGTTACTCATGGTCAGAGG
AAGGAGTGAGAAAAAGAGCTGGAAAGGCAAAGAGAAGAGTTCCAGCAAGGAAGCATCGGGTAGCCAATATGGCCCAGGTGAGACTAGTTCAGCAAATGTTATTGATGGGT
ATGATTCGGCAGAGGTCTTGATGGGAATTGGTTCAATCCAAATTGCAACACATGACGGAATGATCAGAGTCTTCACTAATGTTAGATATGTTCCGGAACTCAAACGTAAT
CTAATATCTATTGGTGATTTAGATAGATCAGGTTATACTTGTAAATCTGAAAATGGAGTTATGAAAGTTACCAAGGGTTCTTTGGTTAAACTGAGGGGAACCTTGAGGAA
TGGTTTGCATGTGTTGGAAGGTACTACAGTTTCAGACAGTGCTGCTATTGCATCAGACAAAGTGACAGATATGTCTATGTTATGGCACAAAAGGTTAGCTCATGTGAGTG
AAAGAGGCTTACAAGCACTCTCTCAACAAGGTTTGTTGAGAGGAGTTAAAGATGTTGAACTACCATTTTGTGAGCATTGTATAATGAGAAAGTCTACCAGAGTAAGGTTT
GGGAAAGGGAAGCACACGACCAACGGTATTTTGGATTATGTTCATTCAGATTTGTGGGGTCCTATGAAGGTGCCTTCTATGGAAGGTTCAAGATACTTCTTATCTATCAT
TGACGATTTTTCAAGGAAGAAGCAGGTTGAAAACCAAACAGGTAGGAAGGTCAAGTATTTGAGGACAGATAATGGTTTAGAATTTGTGAATAACAAATTCAACACCTTTT
GCAAATCGAAGGGAATTACGAGACATTTTACTGTTACGTACACTCCACAACAAAATGGTTTGGCTGAGAGGTTTAACAGAACTATTATGGAACGTACTAGGTGTCTCTTG
ACTAATGCTTCATTACCCTTGAAATTTTGGGGGGAAGCTGCTCAAACAGTATGTTATCTCATTAATAGGAGTCATTCTACAGCTTTAGACTTAAAAACTCCACAAGAGGT
ATGGACAGGTAAAGCTCCAAGTTTAGATCATCTCAGAGCGTTCGGATGCTCAACTTATGCTCATGTTAAAGATGGAAAGCTGAACAAGAGGGTACTGAAATGCATGTTTA
TTGGTTATCCTCAGGGAGTCAAAGGTTATAAACTTTGGTGCTTGGAGAAGGGGATAAATAAATGCATTATCGGCATAGATGTAACTTTTAATGAGACAGAAATGTCGTAT
TGTGTCATAGAGCAACAGAAACAGAAGATTGTTGATCATGTTGAGACAGAGGTCAGAGTTGATTCAGAAATACGACCATCAGTTGGCTTAGATGTATCTAGTGATCAGTC
ACCACTAGTTTCACAAACAGAGGCTACACACCAGTTTGAATCTGATGGTATACAGTCTCAACAGGAGAGGATTTTGATTGATGAGGGAGTTCACAGTGAAAAAAGCTCAA
GTAATAATAACCTACAGAACTATCAGCTTACTCGGGACAGGGTTCAAAGGGAAAGACAGGCTCCCATAAGGTATGGTTATGCTGACTTAGTTGCTTATGCTCTTACTTGC
GCAGCTAACAGTATTGAAGCAGAGGCTCTTACTTTTGAAGAGGCAATCGTATCTGATTCTAAGAAACAATGGAAGGATGCTATGGAGGCAGAGTTGTTCTCTTTACAAAA
GAATCAGACATGGTCATTGGTTCCAAAGCCCCCTAACCAGAAGCTCATTTAA
Protein sequence
MVNDARLRRYSIGDWLRAEATFLLQTCFRVVFRRCRKYNKWYQSFIRSKEEFGFCVVVCCLHILGLISSVKMTSTRFEVAKFDDMGDFALWRKKIRAILVQQKVAKILDE
ENLPETIAESEKKDMDEMVYSTILLYLPDDVFRLVDEATTTGVLWKKLESLYLTKSLPNKLYLKEKFFGYKMDQSKGLEENLNEFQKIIVDLNNIGEKMSDENQAVKAAI
KYGRDSLTMSVVLDALKTRSLEIKKEHKDEELLMVRGRSEKKSWKGKEKSSSKEASGSQYGPGETSSANVIDGYDSAEVLMGIGSIQIATHDGMIRVFTNVRYVPELKRN
LISIGDLDRSGYTCKSENGVMKVTKGSLVKLRGTLRNGLHVLEGTTVSDSAAIASDKVTDMSMLWHKRLAHVSERGLQALSQQGLLRGVKDVELPFCEHCIMRKSTRVRF
GKGKHTTNGILDYVHSDLWGPMKVPSMEGSRYFLSIIDDFSRKKQVENQTGRKVKYLRTDNGLEFVNNKFNTFCKSKGITRHFTVTYTPQQNGLAERFNRTIMERTRCLL
TNASLPLKFWGEAAQTVCYLINRSHSTALDLKTPQEVWTGKAPSLDHLRAFGCSTYAHVKDGKLNKRVLKCMFIGYPQGVKGYKLWCLEKGINKCIIGIDVTFNETEMSY
CVIEQQKQKIVDHVETEVRVDSEIRPSVGLDVSSDQSPLVSQTEATHQFESDGIQSQQERILIDEGVHSEKSSSNNNLQNYQLTRDRVQRERQAPIRYGYADLVAYALTC
AANSIEAEALTFEEAIVSDSKKQWKDAMEAELFSLQKNQTWSLVPKPPNQKLI