; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0068261 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0068261
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:11910029..11911030
RNA-Seq ExpressionCmc03g0068261
SyntenyCmc03g0068261
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO45752.1 pol protein [Cucumis melo subsp. melo]3.1e-18195.8Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQ LKQ LVTA VLTVPDGSGNFVIYSDASKKGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

KAA0026271.1 pol protein [Cucumis melo var. makuwa]1.8e-18195.5Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHY+FIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQNLKQ LVTA VLTVPDGSG+FVIYSDASKKGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

KAA0042119.1 pol protein [Cucumis melo var. makuwa]2.4e-18195.2Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+GDVPKTAFRSRYGHY+FIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKA VSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQNLKQ LVTA +LTVPDGSG+FVIYSDAS+KGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

KAA0048687.1 pol protein [Cucumis melo var. makuwa]5.3e-18195.5Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHY+FIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQNLKQ LVTA VLTVPDGSG+FVIYSDASKKGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

TYK04643.1 pol protein [Cucumis melo var. makuwa]1.1e-18396.7Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSK EAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGH+VSKA VS+DPAKIEAVTSWT+PSTVSEVRSFLGLAGYYRRFVE FSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTA VLTVPDGSGNFVIYSDASKKGLG VLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ2 Pol protein8.8e-18295.5Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHY+FIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLR+VLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQNLKQ LVTA VLTVPDGSG+FVIYSDASKKGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

A0A5A7TLA3 Pol protein1.1e-18195.2Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+GDVPKTAFRSRYGHY+FIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKA VSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQNLKQ LVTA +LTVPDGSG+FVIYSDAS+KGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

A0A5A7UAA8 Reverse transcriptase2.6e-18195.5Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIKD DVPKTAFRSRYGHY+FIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKARVSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQNLKQ LVTA VLTVPDGSG+FVIYSDASKKGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

A0A5D3BY03 Pol protein5.5e-18496.7Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSK EAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGH+VSKA VS+DPAKIEAVTSWT+PSTVSEVRSFLGLAGYYRRFVE FSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTA VLTVPDGSGNFVIYSDASKKGLG VLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

Q84KB0 Pol protein1.5e-18195.8Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLD FV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        IVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLK VSFLGHVVSKA VSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFSR
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
        I TPLTQLT+KGAPFVWSKACEDSFQ LKQ LVTA VLTVPDGSGNFVIYSDASKKGLGCVLM QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYLYGEKIQIFTDHKSLKYFFT+ ELNMRQRR
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.7e-6838.74Show/hide
Query:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFVI
        R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    V KTAF +++GHY+++ M FGL NAPA F   MN + R  L+   +
Subjt:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFVI

Query:  VFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRI
        V++DDI+++S +  EH   L +V + L    L  +  KCEF  +  +FLGHV++   +  +P KIEA+  +  P+   E+++FLGL GYYR+F+ NF+ I
Subjt:  VFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRI

Query:  TTPLTQLTKKGAPFVWSKACEDS-FQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
          P+T+  KK      +    DS F+ LK  +    +L VPD +  F + +DAS   LG VL   G  ++Y SR L  HE NY T + EL A+V+A K +
Subjt:  TTPLTQLTKKGAPFVWSKACEDS-FQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYL G   +I +DH+ L + +   + N +  R
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

P0CT34 Transposon Tf2-1 polyprotein1.5e-6137.23Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G ++++VM +G++ APA F   +N +  E  ++ V
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        + ++DDILI+SK+E+EH  H++ VLQ L++  L    +KCEF    V F+G+ +S+   +     I+ V  W +P    E+R FLG   Y R+F+   S+
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF
        +T PL  L KK   + W+     + +N+KQ LV+  VL   D S   ++ +DAS   +G VL  +        V Y S ++   + NY   D E+ A++ 
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF

Query:  ALKIWRHYLYG--EKIQIFTDHKSL
        +LK WRHYL    E  +I TDH++L
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSL

P0CT41 Transposon Tf2-12 polyprotein1.5e-6137.23Show/hide
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G ++++VM +G++ APA F   +N +  E  ++ V
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFV

Query:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR
        + ++DDILI+SK+E+EH  H++ VLQ L++  L    +KCEF    V F+G+ +S+   +     I+ V  W +P    E+R FLG   Y R+F+   S+
Subjt:  IVFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSR

Query:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF
        +T PL  L KK   + W+     + +N+KQ LV+  VL   D S   ++ +DAS   +G VL  +        V Y S ++   + NY   D E+ A++ 
Subjt:  ITTPLTQLTKKGAPFVWSKACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVF

Query:  ALKIWRHYLYG--EKIQIFTDHKSL
        +LK WRHYL    E  +I TDH++L
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy6.7e-6237.1Show/hide
Query:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFVI
        RL ID+R+LN+ T+ +RYP+P I  +   L  A  F+ +DL+SGYHQ+ + + D  KT+F    G Y+F  + FGL NA ++F   ++ V RE +     
Subjt:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFVI

Query:  VFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRI
        V++DD++I+S+ E++H  H+  VL+ L D  +     K  F+ + V +LG +VSK     DP K++A+  +  P  V +VRSFLGLA YYR F+++F+ I
Subjt:  VFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRI

Query:  TTPLTQLTK-----------KGAPFVWSKACEDSFQNLKQNLVTASV-LTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLE
          P+T + K           K  P  +++   ++FQ L+  L +  V L  PD    F + +DAS  G+G VL  +G+ +   SR LK  EQNY T++ E
Subjt:  TTPLTQLTK-----------KGAPFVWSKACEDSFQNLKQNLVTASV-LTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLE

Query:  LAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTKNELNMRQRR
        L A+V+AL   +++LYG + I IFTDH+ L +       N + +R
Subjt:  LAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTKNELNMRQRR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.3e-6937.84Show/hide
Query:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFVI
        R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+ + +  + KTAF ++ GHY+++ M FGL NAPA F   MN + R  L+   +
Subjt:  RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFVI

Query:  VFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRI
        V++DDI+I+S +  EH   +++V   L D  L  +  KCEF  K  +FLGH+V+   +  +P K++A+ S+  P+   E+R+FLGL GYYR+F+ N++ I
Subjt:  VFIDDILIYSKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRI

Query:  TTPLTQLTKKGAPFVWSK-ACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW
          P+T   KK       K    ++F+ LK  ++   +L +PD    FV+ +DAS   LG VL   G  +++ SR L  HE NY   + EL A+V+A K +
Subjt:  TTPLTQLTKKGAPFVWSK-ACEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR
        RHYL G +  I +DH+ L++     E   +  R
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTKNELNMRQRR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.5e-2444Show/hide
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLG--HVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRITTPLTQLTKKGAPFVW
        HL MVLQ    ++ YA   KC F    +++LG  H++S   VS DPAK+EA+  W  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L KK +   W
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLG--HVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRITTPLTQLTKKGAPFVW

Query:  SKACEDSFQNLKQNLVTASVLTVPD
        ++    +F+ LK  + T  VL +PD
Subjt:  SKACEDSFQNLKQNLVTASVLTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACAGGGAGCTACAGTGTT
CTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATACGGACACTATCAGTTTATTGTGATGT
CTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACAATTTTGTGATCGTGTTTATTGATGACATCTTGATATAT
TCCAAGACGGAGGCCGAGCATGAGGGACATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTAAAGTATGT
GTCCTTTCTAGGCCATGTGGTTTCTAAGGCTAGAGTTTCTGTAGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCT
TCCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCCGTATAACTACTCCTCTTACTCAGTTGACCAAAAAGGGAGCTCCTTTTGTTTGGAGCAAAGCA
TGTGAGGACAGTTTCCAGAACCTTAAACAGAATCTAGTTACTGCATCGGTTCTTACTGTACCTGATGGTTCTGGCAATTTTGTGATTTATAGTGATGCTTCCAAGAAGGG
TTTGGGTTGTGTATTGATGCACCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGG
TTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAAATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTAAGAATGAATTGAATATGAGA
CAGCGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACAGGGAGCTACAGTGTT
CTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATACGGACACTATCAGTTTATTGTGATGT
CTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACAATTTTGTGATCGTGTTTATTGATGACATCTTGATATAT
TCCAAGACGGAGGCCGAGCATGAGGGACATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTAAAGTATGT
GTCCTTTCTAGGCCATGTGGTTTCTAAGGCTAGAGTTTCTGTAGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCT
TCCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCCGTATAACTACTCCTCTTACTCAGTTGACCAAAAAGGGAGCTCCTTTTGTTTGGAGCAAAGCA
TGTGAGGACAGTTTCCAGAACCTTAAACAGAATCTAGTTACTGCATCGGTTCTTACTGTACCTGATGGTTCTGGCAATTTTGTGATTTATAGTGATGCTTCCAAGAAGGG
TTTGGGTTGTGTATTGATGCACCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGG
TTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAAATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTAAGAATGAATTGAATATGAGA
CAGCGAAGATGA
Protein sequenceShow/hide protein sequence
MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYQFIVMSFGLTNAPAVFMDLMNRVFREFLDNFVIVFIDDILIY
SKTEAEHEGHLRMVLQTLRDNKLYAKFSKCEFWLKYVSFLGHVVSKARVSVDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRITTPLTQLTKKGAPFVWSKA
CEDSFQNLKQNLVTASVLTVPDGSGNFVIYSDASKKGLGCVLMHQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTKNELNMR
QRR