; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0222731 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0222731
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:11803051..11804205
RNA-Seq ExpressionCmc08g0222731
SyntenyCmc08g0222731
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031931.1 pol protein [Cucumis melo var. makuwa]3.8e-19991.93Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GSRSLPQVI A+RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR S+ PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTFVIVFIDDILIYSK EAEHEEHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIA PLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

KAA0035137.1 retrotransposon protein [Cucumis melo var. makuwa]4.5e-20092.19Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEVAFNPPSM SFKFKGEGSRSLPQVI A++ASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR SV PWGA VLFVKK DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT FHSRYGHYEFIVMSFGLTNAPAVF+DLMN+VFREFLDTFVIVFIDDILIYSK EAEH EHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

KAA0040188.1 pol protein [Cucumis melo var. makuwa]2.0e-20092.45Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GSRSLPQVI A+RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR SV PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VF+EFLDTFVIVFIDDILIYSKMEAEHEEHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

KAA0043555.1 pol protein [Cucumis melo var. makuwa]7.7e-20092.45Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVIL MDWLAANHASIDCS KEVAFNPPSM SFKFKGEGSRSLPQVI  +RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIRSSV PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLR GYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTFVIVFIDDILIY K EAEHEEHLRMVLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYA FSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVTSW RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

KAA0047433.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.5e-20092.45Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GSRSLPQVI A+RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR SV PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTFVIVFIDDILIYSK EAEHEEHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

TrEMBL top hitse value%identityAlignment
A0A5A7SQU8 Reverse transcriptase1.8e-19991.93Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GSRSLPQVI A+RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR S+ PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTFVIVFIDDILIYSK EAEHEEHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIA PLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

A0A5A7T0M7 Reverse transcriptase2.2e-20092.19Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEVAFNPPSM SFKFKGEGSRSLPQVI A++ASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR SV PWGA VLFVKK DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT FHSRYGHYEFIVMSFGLTNAPAVF+DLMN+VFREFLDTFVIVFIDDILIYSK EAEH EHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

A0A5A7TB42 Reverse transcriptase9.8e-20192.45Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GSRSLPQVI A+RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR SV PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VF+EFLDTFVIVFIDDILIYSKMEAEHEEHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

A0A5A7TQ69 Reverse transcriptase3.7e-20092.45Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVIL MDWLAANHASIDCS KEVAFNPPSM SFKFKGEGSRSLPQVI  +RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIRSSV PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLR GYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTFVIVFIDDILIY K EAEHEEHLRMVLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYA FSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVTSW RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

A0A5A7TV57 Ty3-gypsy retrotransposon protein2.2e-20092.45Show/hide
Query:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCS KEV FNPPSM SFKFKG GSRSLPQVI A+RASKLL QGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHRE+EFAIELEPGTVPISRAPYRM P ELKELKVQLQ+L DKGFIR SV PWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREIEFAIELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK
        GATVFSKIDLRSGYHQLRIKDGDVPKT F SRYGHYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTFVIVFIDDILIYSK EAEHEEHLR+VLQTL DNK
Subjt:  GATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNK

Query:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV
        LYAKFSKC FWLKQVSFLG+VVSKAGV VDPAKIEAVT W RPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGA FV
Subjt:  LYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.6e-5138.89Show/hide
Query:  KELKVQLQKLHDKGFIRSSVLPWGALVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKT
        +E++ Q+Q + ++G IR+S  P+ + +  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    V KT
Subjt:  KELKVQLQKLHDKGFIRSSVLPWGALVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKT

Query:  TFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNVVSKAGV
         F +++GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S    EH + L +V + L    L  +  KC F  ++ +FLG+V++  G+
Subjt:  TFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNVVSKAGV

Query:  FVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRK
          +P KIEA+  +P P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K
Subjt:  FVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRK

P20825 Retrovirus-related Pol polyprotein from transposon 2972.9e-5337.59Show/hide
Query:  PISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG
        PI    Y +  T   E++ Q+Q++ ++G IR S  P+ +    V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G
Subjt:  PISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG

Query:  YHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLK
        +HQ+ + +  + KT F ++ GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S    EH   +++V   L D  L  +  KC F  K
Subjt:  YHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLK

Query:  QVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRK
        + +FLG++V+  G+  +P K++A+ S+P P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K
Subjt:  QVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.9e-5435.51Show/hide
Query:  MTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEP---------VVRDYPDVFPEELPGLPP---HREIEFAIELEPGTVPIS
        +T+    G    +  Q  +   AS L   G +S + S + + E + +  S           + + Y ++   +LP  P    +  ++  IE++PG     
Subjt:  MTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEP---------VVRDYPDVFPEELPGLPP---HREIEFAIELEPGTVPIS

Query:  RAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD
          PY +T    +E+   +QKL D  FI  S  P  + V+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ 
Subjt:  RAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD

Query:  GDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNV
         D  KT F +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S+   EH +HL  VL+ L +  L  K  KC F  ++  FLG  
Subjt:  GDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNV

Query:  VSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPL
        +    +     K  A+  +P P TV + + FLG++ YYRRF+ N S+IA P+
Subjt:  VSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.4e-5035.23Show/hide
Query:  DYPDVFPEELPGLPPHREIEFAIELEPGT---VPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKK-----DGSMRLCIDYRELNK
        ++P +F   L G+     +E A++ E  T    PI    Y        E++ Q+ +L   G IR S  P+ + +  V KK     +   R+ +D++ LN 
Subjt:  DYPDVFPEELPGLPPHREIEFAIELEPGT---VPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKK-----DGSMRLCIDYRELNK

Query:  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSK
        VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +K+ D+PKT F +  G YEF+ + FGL NAPA+F  +++ + RE +     V+IDDI+++S+
Subjt:  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSK

Query:  MEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTR
            H ++LR+VL +L    L     K  F   QV FLG +V+  G+  DP K+ A++  P P++V E++ FLG+  YYR+F+++++++A PLT LTR
Subjt:  MEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTR

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.9e-5435.51Show/hide
Query:  MTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEP---------VVRDYPDVFPEELPGLPP---HREIEFAIELEPGTVPIS
        +T+    G    +  Q  +   AS L   G +S + S + + E + +  S           + + Y ++   +LP  P    +  ++  IE++PG     
Subjt:  MTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEP---------VVRDYPDVFPEELPGLPP---HREIEFAIELEPGTVPIS

Query:  RAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD
          PY +T    +E+   +QKL D  FI  S  P  + V+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ 
Subjt:  RAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD

Query:  GDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNV
         D  KT F +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S+   EH +HL  VL+ L +  L  K  KC F  ++  FLG  
Subjt:  GDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNV

Query:  VSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPL
        +    +     K  A+  +P P TV + + FLG++ YYRRF+ N S+IA P+
Subjt:  VSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.0e-2148.96Show/hide
Query:  HLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLG--NVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGA
        HL MVLQ    ++ YA   KCAF   Q+++LG  +++S  GV  DPAK+EA+  WP P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +
Subjt:  HLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLG--NVVSKAGVFVDPAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTAGCCGCTAACCATGCCAGCATAGATTGTTCCGGTAAGGAGGTAGCATTTAACCCTCCCTCGATGACCAGTTT
TAAATTTAAGGGAGAAGGGTCAAGGTCGTTACCTCAGGTAATCTTAGCCATGAGGGCCAGCAAACTGCTCATTCAAGGTACTTGGAGTATCTTGGCGAGTGTGGTGGATA
CTAGAGAGGTTGATGTATCCCTGTCATCGGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGATTGAGTTTGCC
ATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCATACAGAATGACCCCAACAGAGTTGAAAGAACTGAAAGTGCAGTTACAGAAATTGCATGATAAGGGCTT
CATTCGATCGAGTGTGTTACCTTGGGGTGCACTAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGACTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTA
AGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTTCGATCGGGATATCATCAGCTGAGGATTAAG
GATGGTGATGTACCGAAGACGACCTTTCATTCCAGATACGGACACTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAA
CAAAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGATGATATCTTGATATATTCCAAGATGGAGGCCGAGCATGAGGAGCATCTACGTATGGTTCTAC
AAACCCTTTGGGATAATAAATTGTATGCAAAGTTCTCGAAATGTGCGTTTTGGCTGAAGCAGGTGTCCTTTCTAGGCAATGTGGTTTCTAAGGCTGGAGTTTTTGTGGAT
CCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGTAGGTTATTATCGACGGTTTGTGGAGAACTTTTC
CCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTGCTTTTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTAGCCGCTAACCATGCCAGCATAGATTGTTCCGGTAAGGAGGTAGCATTTAACCCTCCCTCGATGACCAGTTT
TAAATTTAAGGGAGAAGGGTCAAGGTCGTTACCTCAGGTAATCTTAGCCATGAGGGCCAGCAAACTGCTCATTCAAGGTACTTGGAGTATCTTGGCGAGTGTGGTGGATA
CTAGAGAGGTTGATGTATCCCTGTCATCGGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGATTGAGTTTGCC
ATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCATACAGAATGACCCCAACAGAGTTGAAAGAACTGAAAGTGCAGTTACAGAAATTGCATGATAAGGGCTT
CATTCGATCGAGTGTGTTACCTTGGGGTGCACTAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGACTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTA
AGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTTCGATCGGGATATCATCAGCTGAGGATTAAG
GATGGTGATGTACCGAAGACGACCTTTCATTCCAGATACGGACACTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAA
CAAAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGATGATATCTTGATATATTCCAAGATGGAGGCCGAGCATGAGGAGCATCTACGTATGGTTCTAC
AAACCCTTTGGGATAATAAATTGTATGCAAAGTTCTCGAAATGTGCGTTTTGGCTGAAGCAGGTGTCCTTTCTAGGCAATGTGGTTTCTAAGGCTGGAGTTTTTGTGGAT
CCAGCTAAGATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGTAGGTTATTATCGACGGTTTGTGGAGAACTTTTC
CCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTGCTTTTGTTTAG
Protein sequenceShow/hide protein sequence
MLDFDVILGMDWLAANHASIDCSGKEVAFNPPSMTSFKFKGEGSRSLPQVILAMRASKLLIQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFA
IELEPGTVPISRAPYRMTPTELKELKVQLQKLHDKGFIRSSVLPWGALVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
DGDVPKTTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKMEAEHEEHLRMVLQTLWDNKLYAKFSKCAFWLKQVSFLGNVVSKAGVFVD
PAKIEAVTSWPRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAAFV