CuGenDBv2

Cmc01g0025721 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0025721
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr01:26636217..26637872
RNA-Seq Expression: Cmc01g0025721
Synteny: Cmc01g0025721
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
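The genome location field packs a sequence name and two coordinates into one string. A minimal sketch of how it could be split and measured follows; the function names are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Location string copied from the record above.
LOCATION = "CMiso1.1chr01:26636217..26637872"

def parse_location(text):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location(LOCATION)
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1,656 bp on CMiso1.1chr01.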


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034863.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 8.7e-306; % identity: 99.62)
Query:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
        VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
Subjt:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT

Query:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN
        DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQN VLERRNRTLLDMVWSMMSYAHLPN
Subjt:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN

Query:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
        SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
Subjt:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI

Query:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
        REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
Subjt:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI

Query:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI
        KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQM VKTTI
Subjt:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI

Query:  LNDNLEETIYMQQPEGFIIPC
        LNDNLEETIYMQQPEGFIIPC
Subjt:  LNDNLEETIYMQQPEGFIIPC
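The % identity column can be related to the alignment text itself. The sketch below assumes a BLAST-style middle line, where a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap; the function name is hypothetical and this is not necessarily how the database computed its figure.

```python
def percent_identity(query, midline):
    """Percent of aligned columns whose midline character repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(query)

# Tail of the fifth block of the alignment above: one column has a blank midline.
query_tail = "EIWQMDVKTTI"
midline_tail = "EIWQM VKTTI"
print(round(percent_identity(query_tail, midline_tail), 2))
```

Applied to all six blocks of this hit, which appear to span 521 columns with two blank midline positions, the same calculation would give 519/521, about 99.62, in line with the reported value.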

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 6.1e-291; % identity: 90.91)
Query:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS
        +CSAKLENNLYVLR LTSKALLN EMFKTA+TQNKRL ISP+ENAHLWHLRLGH+NLNRIE+LVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGH +
Subjt:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS

Query:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG
        KEPLELVHSDLCGPMN KARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPG
Subjt:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG

Query:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP
        TPQQN V ERRNRTLLDMV SMMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYP
Subjt:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP

Query:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT
        KGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLT
Subjt:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT

Query:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS
        ETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS 
Subjt:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS

Query:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP
        VAMLKSIRILLSIAAYFDYEIWQMDVKT  LN NLEETIYMQQPEGFIIP
Subjt:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 6.1e-291; % identity: 90.91)
Query:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS
        +CSAKLENNLYVLR LTSKALLN EMFKTA+TQNKRL ISP+ENAHLWHLRLGH+NLNRIE+LVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGH +
Subjt:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS

Query:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG
        KEPLELVHSDLCGPMN KARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPG
Subjt:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG

Query:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP
        TPQQN V ERRNRTLLDMV SMMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYP
Subjt:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP

Query:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT
        KGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLT
Subjt:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT

Query:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS
        ETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS 
Subjt:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS

Query:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP
        VAMLKSIRILLSIAAYFDYEIWQMDVKT  LN NLEETIYMQQPEGFIIP
Subjt:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP

TYJ96675.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 8.7e-306; % identity: 99.62)
Query:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
        VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
Subjt:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT

Query:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN
        DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQN VLERRNRTLLDMVWSMMSYAHLPN
Subjt:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN

Query:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
        SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
Subjt:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI

Query:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
        REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
Subjt:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI

Query:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI
        KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQM VKTTI
Subjt:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI

Query:  LNDNLEETIYMQQPEGFIIPC
        LNDNLEETIYMQQPEGFIIPC
Subjt:  LNDNLEETIYMQQPEGFIIPC

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 6.1e-291; % identity: 90.91)
Query:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS
        +CSAKLENNLYVLR LTSKALLN EMFKTA+TQNKRL ISP+ENAHLWHLRLGH+NLNRIE+LVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGH +
Subjt:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS

Query:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG
        KEPLELVHSDLCGPMN KARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPG
Subjt:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG

Query:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP
        TPQQN V ERRNRTLLDMV SMMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYP
Subjt:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP

Query:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT
        KGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLT
Subjt:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT

Query:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS
        ETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS 
Subjt:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS

Query:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP
        VAMLKSIRILLSIAAYFDYEIWQMDVKT  LN NLEETIYMQQPEGFIIP
Subjt:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e value: 2.9e-291; % identity: 90.91)
Query:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS
        +CSAKLENNLYVLR LTSKALLN EMFKTA+TQNKRL ISP+ENAHLWHLRLGH+NLNRIE+LVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGH +
Subjt:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS

Query:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG
        KEPLELVHSDLCGPMN KARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPG
Subjt:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG

Query:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP
        TPQQN V ERRNRTLLDMV SMMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYP
Subjt:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP

Query:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT
        KGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLT
Subjt:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT

Query:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS
        ETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS 
Subjt:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS

Query:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP
        VAMLKSIRILLSIAAYFDYEIWQMDVKT  LN NLEETIYMQQPEGFIIP
Subjt:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP

A0A5A7SWF4 Gag/pol protein (e value: 4.2e-306; % identity: 99.62)
Query:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
        VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
Subjt:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT

Query:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN
        DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQN VLERRNRTLLDMVWSMMSYAHLPN
Subjt:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN

Query:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
        SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
Subjt:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI

Query:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
        REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
Subjt:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI

Query:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI
        KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQM VKTTI
Subjt:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI

Query:  LNDNLEETIYMQQPEGFIIPC
        LNDNLEETIYMQQPEGFIIPC
Subjt:  LNDNLEETIYMQQPEGFIIPC

A0A5A7V4M1 Gag/pol protein (e value: 2.9e-291; % identity: 90.91)
Query:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS
        +CSAKLENNLYVLR LTSKALLN EMFKTA+TQNKRL ISP+ENAHLWHLRLGH+NLNRIE+LVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGH +
Subjt:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS

Query:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG
        KEPLELVHSDLCGPMN KARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPG
Subjt:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG

Query:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP
        TPQQN V ERRNRTLLDMV SMMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYP
Subjt:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP

Query:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT
        KGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLT
Subjt:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT

Query:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS
        ETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS 
Subjt:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS

Query:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP
        VAMLKSIRILLSIAAYFDYEIWQMDVKT  LN NLEETIYMQQPEGFIIP
Subjt:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP

A0A5D3BDY3 Gag/pol protein (e value: 4.2e-306; % identity: 99.62)
Query:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
        VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT
Subjt:  VTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFT

Query:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN
        DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQN VLERRNRTLLDMVWSMMSYAHLPN
Subjt:  DDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPN

Query:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
        SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
Subjt:  SFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI

Query:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
        REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI
Subjt:  REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWI

Query:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI
        KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQM VKTTI
Subjt:  KAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTI

Query:  LNDNLEETIYMQQPEGFIIPC
        LNDNLEETIYMQQPEGFIIPC
Subjt:  LNDNLEETIYMQQPEGFIIPC

A0A5D3CPJ6 Gag/pol protein (e value: 2.9e-291; % identity: 90.91)
Query:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS
        +CSAKLENNLYVLR LTSKALLN EMFKTA+TQNKRL ISP+ENAHLWHLRLGH+NLNRIE+LVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGH +
Subjt:  MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHIS

Query:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG
        KEPLELVHSDLCGPMN KARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPG
Subjt:  KEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPG

Query:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP
        TPQQN V ERRNRTLLDMV SMMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYP
Subjt:  TPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYP

Query:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT
        KGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLT
Subjt:  KGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLT

Query:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS
        ETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS 
Subjt:  ETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSS

Query:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP
        VAMLKSIRILLSIAAYFDYEIWQMDVKT  LN NLEETIYMQQPEGFIIP
Subjt:  VAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.0e-58; % identity: 26.88)
Query:  ENNLYVLR---LLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELE-----ENSLPVCESCLEGKMTKRPF---TG
        +N L V++   +L +  ++N + +         +N   + N  LWH R GH++  ++ ++ +  + S+       E S  +CE CL GK  + PF     
Subjt:  ENNLYVLR---LLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELE-----ENSLPVCESCLEGKMTKRPF---TG

Query:  KGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQ
        K HI K PL +VHSD+CGP+         YF+ F D ++ Y   YL+++KS+    F+++ A+ E   +  +     D G EY+  + + + ++  I   
Subjt:  KGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQ

Query:  LSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSI---SEIPLKLWIGHKGSLRHFRIWGCQAHV-LETNPKKLEPRS
        L+ P TPQ N V ER  RT+ +   +M+S A L  SF G AV    Y++N +PS+++   S+ P ++W   K  L+H R++G   +V ++    K + +S
Subjt:  LSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSI---SEIPLKLWIGHKGSLRHFRIWGCQAHV-LETNPKKLEPRS

Query:  KLCLFVGY-PKGTR-----DGYFYGLKD---------NKVFVSTNATFLEEDHIREYK----PRSKIVLNELSNESTE----------------------
           +FVGY P G +     +  F   +D         N   V     FL++    E K       KI+  E  NES E                      
Subjt:  KLCLFVGY-PKGTR-----DGYFYGLKD---------NKVFVSTNATFLEEDHIREYK----PRSKIVLNELSNESTE----------------------

Query:  ---LSTRVVEEPSTLARVVHVSSS-----------------IRIRQPKSLGEP------------------------------RRSGRVTNLP-IHYMSL
           + T    E      +  +  S                   + + K  G P                              RRS R+   P I Y   
Subjt:  ---LSTRVVEEPSTLARVVHVSSS-----------------IRIRQPKSLGEP------------------------------RRSGRVTNLP-IHYMSL

Query:  TETL--TVISDGNIGD--PLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE
          +L   V++   I +  P +F +     DK  W +A+N EL +   N  W +  +P+    +  +W++  K    G    +KARLVA+G+TQ   +DYE
Subjt:  TETL--TVISDGNIGD--PLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE

Query:  KIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEG
        + F+ VA + S R +LS+   ++ ++ QMDVKT  LN  L+E IYM+ P+G
Subjt:  KIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.9e-85; % identity: 35.17)
Query:  LLTSKALLNIEMFKT-AVTQNKRLNISPEE-NAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLC
        L+ +K +    +++T A      LN + +E +  LWH R+GHM+   ++ L K  L+S  +  ++  C+ CL GK  +  F          L+LV+SD+C
Subjt:  LLTSKALLNIEMFKT-AVTQNKRLNISPEE-NAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLC

Query:  GPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRN
        GPM  ++ GG +YF+TF DD SR  +VY+++ K +  + F+++ A VE +  + +K  RSD GGEY   +F+ Y     I  + + PGTPQ N V ER N
Subjt:  GPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRN

Query:  RTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQ--AHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFY
        RT+++ V SM+  A LP SF G AVQ   Y++N  PS  ++ EIP ++W   + S  H +++GC+  AHV +    KL+ +S  C+F+GY     + + Y
Subjt:  RTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQ--AHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFY

Query:  GLKD---NKVFVSTNATFLEEDHIREYKPRSKIVLNEL---------------SNEST--ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEP-----R
         L D    KV  S +  F  E  +R     S+ V N +               S EST  E+S +  +    + +   +   +   +  + GE      R
Subjt:  GLKD---NKVFVSTNATFLEEDHIREYKPRSKIVLNEL---------------SNEST--ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEP-----R

Query:  RSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAK
        RS R       Y S TE + +  D    +P + K+ +   +K++ +KAM  E+ES+  N  + LV+ P G +P+ CKW++K K+  D K+  +KARLV K
Subjt:  RSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAK

Query:  GYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGF
        G+ Q +G+D+++IFS V  + SIR +LS+AA  D E+ Q+DVKT  L+ +LEE IYM+QPEGF
Subjt:  GYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGF

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 2.1e-12; % identity: 40.7)
Query:  WIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA
        W +AM  EL+++  N  W LV  P     +GCKW++K K  +DG +   KARLVAKG+ Q EG+ + + +S V    +IR +L++A
Subjt:  WIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.2e-50; % identity: 25.6)
Query:  LLTSKALLNIEMFKTAVTQNKRLNISPEENA--HLWHLRLGHMNLNRIEKLVKNGLLSELE-ENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDL
        LL  K    +  +  A +Q   L  SP   A    WH RLGH   + +  ++ N  LS L   +    C  CL  K  K PF+     S  PLE ++SD+
Subjt:  LLTSKALLNIEMFKTAVTQNKRLNISPEENA--HLWHLRLGHMNLNRIEKLVKNGLLSELE-ENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDL

Query:  CGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERR
               +   + Y++ F D ++RY ++Y ++ KS+  E F  +K  +EN+    I TF SD GGE++ L    Y  +  I    S P TP+ N + ER+
Subjt:  CGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERR

Query:  NRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAH--VLETNPKKLEPRSKLCLFVGYPKGTRDGYF
        +R +++   +++S+A +P ++  YA    VY++N +P+  +  E P +   G   +    R++GC  +  +   N  KL+ +S+ C+F+GY   T+  Y 
Subjt:  NRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAH--VLETNPKKLEPRSKLCLFVGYPKGTRDGYF

Query:  -YGLKDNKVFVSTNATFLEE-----------DHIREYKPRSKIV--------------------------------------------------------
           L+ +++++S +  F E              ++E +  S  V                                                        
Subjt:  -YGLKDNKVFVSTNATFLEE-----------DHIREYKPRSKIV--------------------------------------------------------

Query:  ----------------------LNELSNESTELSTRVVEEPSTLARVVHVSS-----------------------SIRIRQPKSLGEPRRSGRVTNLPIH
                                  S+++T  +    E PS LA+ +   +                       SI I  P  L +   +     L  H
Subjt:  ----------------------LNELSNESTELSTRVVEEPSTLARVVHVSS-----------------------SIRIRQPKSLGEPRRSGRVTNLPIH

Query:  YMSLTETLTVI----------SDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDG-VKPIGCKWIYKRKRGADGKVQTFKARLVAK
         M       +I          S     +P T  +A++D   + W  AM  E+ +   N  WDLV  P   V  +GC+WI+ +K  +DG +  +KARLVAK
Subjt:  YMSLTETLTVI----------SDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDG-VKPIGCKWIYKRKRGADGKVQTFKARLVAK

Query:  GYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFI
        GY Q  G+DY + FS V    SIRI+L +A    + I Q+DV    L   L + +YM QP GFI
Subjt:  GYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-49; % identity: 25.39)
Query:  WHLRLGHMNLNRIEKLVKNGLLSELE-ENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKS
        WH RLGH +L  +  ++ N  L  L   + L  C  C   K  K PF+     S +PLE ++SD+       +   + Y++ F D ++RY ++Y ++ KS
Subjt:  WHLRLGHMNLNRIEKLVKNGLLSELE-ENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKS

Query:  EALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNC
        +  + F  +K+ VEN+    I T  SD GGE++ L+  +YL +  I    S P TP+ N + ER++R +++M  +++S+A +P ++  YA    VY++N 
Subjt:  EALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNC

Query:  VPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAH--VLETNPKKLEPRSKLCLFVGYPKGTRDGYF-YGLKDNKVFVSTNATFLEE---------------
        +P+  +  + P +   G   +    +++GC  +  +   N  KLE +SK C F+GY   T+  Y    +   +++ S +  F E                
Subjt:  VPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAH--VLETNPKKLEPRSKLCLFVGYPKGTRDGYF-YGLKDNKVFVSTNATFLEE---------------

Query:  ------------------------------DHIREYKPR-----SKIVLNELSNE---STELSTRVVEEPSTLAR-------VVHVSSSIRIRQP-----
                                       H+ +  PR     S +   ++S+    S+ +S+    EP+  +          H + +     P     
Subjt:  ------------------------------DHIREYKPR-----SKIVLNELSNE---STELSTRVVEEPSTLAR-------VVHVSSSIRIRQP-----

Query:  ----KSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNI----------------------------------------------------------GDPL
             S   P ++  +   PI    +    T IS+ N                                                            +P 
Subjt:  ----KSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNI----------------------------------------------------------GDPL

Query:  TFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV-DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA
        T  +AM+D   D W +AM  E+ +   N  WDLV   P  V  +GC+WI+ +K  +DG +  +KARLVAKGY Q  G+DY + FS V    SIRI+L +A
Subjt:  TFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV-DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA

Query:  AYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFI
            + I Q+DV    L   L + +YM QP GF+
Subjt:  AYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFI

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 4.8e-28; % identity: 41.48)
Query:  DPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLS
        +P T+ +A E +    W  AM+ E+ +M     W++   P   KPIGCKW+YK K  +DG ++ +KARLVAKGYTQ EG+D+ + FS V  L S++++L+
Subjt:  DPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLS

Query:  IAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGF
        I+A +++ + Q+D+    LN +L+E IYM+ P G+
Subjt:  IAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGF

ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 5.4e-11; % identity: 40.24)
Query:  TQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCG
        T    L  + ++   LWH RL HM+   +E LVK G L   + +SL  CE C+ GK  +  F+   H +K PL+ VHSDL G
Subjt:  TQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 3.6e-07; % identity: 34.15)
Query:  NRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSK
        NRT+++ V SM+    LP +FR  A    V+I+N  PS +I+  +P ++W     +  + R +GC A++   +  KL+PR+K
Subjt:  NRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 1.5e-13; % identity: 40.7)
Query:  WIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA
        W +AM  EL+++  N  W LV  P     +GCKW++K K  +DG +   KARLVAKG+ Q EG+ + + +S V    +IR +L++A
Subjt:  WIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA


Sequences
CDS sequence
ATGTGTTCTGCAAAGTTAGAAAATAATCTTTATGTGTTAAGATTATTAACATCTAAAGCCCTTCTTAATATTGAAATGTTCAAAACTGCAGTGACTCAAAATAAAAGACT
TAATATTTCTCCAGAAGAAAATGCTCATCTTTGGCACCTAAGATTAGGGCACATGAATCTCAATAGAATTGAGAAATTAGTAAAGAATGGGCTTCTAAGCGAGTTAGAAG
AAAATTCCTTACCTGTATGTGAGTCATGCCTTGAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATATTTCCAAAGAACCCCTAGAACTTGTACATTCAGAT
CTATGTGGTCCTATGAATGCTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGA
AGCCCTTGAAAAGTTCAAGGAATACAAGGCTGAAGTTGAAAACAAATTAAGTAAAACTATTAAAACATTTCGGTCGGATCGAGGTGGAGAGTATATGGATTTGAAATTTC
AAAACTATTTGATGGAATGTGAAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAGAATAGTGTATTAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTTGG
TCTATGATGAGTTACGCTCACTTACCTAATTCGTTTCGGGGTTATGCAGTGCAAGCTACAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTATTTCTGAAATACCTTT
GAAATTATGGATTGGTCATAAAGGTAGTTTACGTCATTTCAGAATCTGGGGTTGTCAAGCACACGTGCTTGAGACGAATCCAAAGAAATTGGAACCTCGTTCAAAATTAT
GTTTATTTGTAGGCTACCCCAAAGGAACTAGAGATGGTTATTTCTATGGTCTTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATA
AGGGAGTACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCCAATGAAAGTACTGAACTTTCAACAAGAGTTGTTGAAGAGCCTAGTACATTAGCAAGAGTTGTTCA
TGTCAGTTCATCTATTAGGATACGTCAACCTAAATCGTTGGGCGAACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCATTATATGAGTTTAACGGAAACCTTAA
CTGTCATATCTGATGGCAACATTGGGGATCCATTGACTTTTAAGAAGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAATCTATG
TACTTCAATTTAGTCTGGGATCTTGTAGATCAACCTGATGGGGTAAAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTT
TAAAGCTAGACTAGTGGCAAAGGGTTATACCCAAGTTGAGGGAGTTGACTATGAGAAGATTTTCTCATCTGTTGCCATGTTAAAGTCTATTCGAATACTTTTGTCCATTG
CTGCATATTTTGACTATGAGATTTGGCAAATGGATGTGAAAACTACCATTTTGAATGACAATCTTGAGGAGACCATTTATATGCAACAACCAGAAGGATTCATAATTCCA
TGTTAA
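The CDS and the protein sequence in this record can be checked against each other by translating codons. The following is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; the helper names are hypothetical, and only the first 30 and last 6 nucleotides of the CDS above are used to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, amino acids ordered by codon with bases in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 6 nucleotides of the CDS listed above.
print(translate("ATGTGTTCTGCAAAGTTAGAAAATAATCTT"))  # MCSAKLENNL
print(translate("TGTTAA"))  # C*
```

The first ten residues, MCSAKLENNL, match the start of the protein sequence in this record, and the final codons give a cysteine followed by a stop.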
mRNA sequence
ATGTGTTCTGCAAAGTTAGAAAATAATCTTTATGTGTTAAGATTATTAACATCTAAAGCCCTTCTTAATATTGAAATGTTCAAAACTGCAGTGACTCAAAATAAAAGACT
TAATATTTCTCCAGAAGAAAATGCTCATCTTTGGCACCTAAGATTAGGGCACATGAATCTCAATAGAATTGAGAAATTAGTAAAGAATGGGCTTCTAAGCGAGTTAGAAG
AAAATTCCTTACCTGTATGTGAGTCATGCCTTGAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATATTTCCAAAGAACCCCTAGAACTTGTACATTCAGAT
CTATGTGGTCCTATGAATGCTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGA
AGCCCTTGAAAAGTTCAAGGAATACAAGGCTGAAGTTGAAAACAAATTAAGTAAAACTATTAAAACATTTCGGTCGGATCGAGGTGGAGAGTATATGGATTTGAAATTTC
AAAACTATTTGATGGAATGTGAAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAGAATAGTGTATTAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTTGG
TCTATGATGAGTTACGCTCACTTACCTAATTCGTTTCGGGGTTATGCAGTGCAAGCTACAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTATTTCTGAAATACCTTT
GAAATTATGGATTGGTCATAAAGGTAGTTTACGTCATTTCAGAATCTGGGGTTGTCAAGCACACGTGCTTGAGACGAATCCAAAGAAATTGGAACCTCGTTCAAAATTAT
GTTTATTTGTAGGCTACCCCAAAGGAACTAGAGATGGTTATTTCTATGGTCTTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATA
AGGGAGTACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCCAATGAAAGTACTGAACTTTCAACAAGAGTTGTTGAAGAGCCTAGTACATTAGCAAGAGTTGTTCA
TGTCAGTTCATCTATTAGGATACGTCAACCTAAATCGTTGGGCGAACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCATTATATGAGTTTAACGGAAACCTTAA
CTGTCATATCTGATGGCAACATTGGGGATCCATTGACTTTTAAGAAGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAATCTATG
TACTTCAATTTAGTCTGGGATCTTGTAGATCAACCTGATGGGGTAAAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTT
TAAAGCTAGACTAGTGGCAAAGGGTTATACCCAAGTTGAGGGAGTTGACTATGAGAAGATTTTCTCATCTGTTGCCATGTTAAAGTCTATTCGAATACTTTTGTCCATTG
CTGCATATTTTGACTATGAGATTTGGCAAATGGATGTGAAAACTACCATTTTGAATGACAATCTTGAGGAGACCATTTATATGCAACAACCAGAAGGATTCATAATTCCA
TGTTAA
Protein sequence
MCSAKLENNLYVLRLLTSKALLNIEMFKTAVTQNKRLNISPEENAHLWHLRLGHMNLNRIEKLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSD
LCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNSVLERRNRTLLDMVW
SMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESM
YFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMDVKTTILNDNLEETIYMQQPEGFIIP
C