; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0243051 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0243051
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr09:5466568..5468166
RNA-Seq ExpressionCmc09g0243051
SyntenyCmc09g0243051
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042141.1 hypothetical protein E6C27_scaffold67G006300 [Cucumis melo var. makuwa]5.1e-23178.98Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA ME+REL SQLQELVD GFIRPSAS WGAPVLFVKKKDGT RLCI+YRQLNKVTI NKYPLPRIDDLFDQ  GASVFSKIDLRSGYHQLKVKESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEFL++PFGLTNAPAAFMDLMN VFHSYLDQFVIVFIDDIL YSGSKEKHAEHLRI                CDFWLDRVVFLGHVVSVEG
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        ICVDPQKTE VDKWKRP SVTEIRSFLGLAGYYRRF+E                                                              
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQKGKV+AYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFI+HKS KYIFDQKELNLRQRRWLELIKDYDCTIDY PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKSSHSNITLNSIGSSLLRELKMGE A+S+GKLGSL+ HFQVRPILIDRIIKAQLDDARLRKLAE+VR+NQRLNYSLRGDGALMKYDRLCVPSDQTIK
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKKH
        DQI EEAHSSAY MHPGSTKVY+TLKKH
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKKH

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.8e-22072.11Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA  EL+EL  QLQELVDKG+IRPS S WGAPVLFVKKKDGT RLCI+YRQLNKVTIRNKYPLPRIDDLFDQLRGA++FSKIDLRSGYHQLKV+ESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEF +MPFGLTNAPA FMDLMN +FH YLDQFVIVFIDDILVYS  +E H EHLRI                C+FWL++VVFLGHVVS +G
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        + VDPQK EAV  W+RPIS TE+RSFLGLAGYYRRF+E FS+LALPLT LTRKNVKFEW+D CE+SFQELKKRLVTAP+L LP+ G ++ IYCDAS  GL
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQ G V+AYASRQLK+HECNYPTHDLELAAVVLALK+WRHYL+GE+CHIF DHKSLKYIFDQKELNLRQRRWLELIKDYDCTI+Y PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKS      L  I  +LL EL+  +A V+    GSL+  FQVR  L+  I++ Q +D+ L+K  E  +K   + + LR DGA++K  RLCVP+   +K
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ILEEAHSSAY MHPGSTK+Y+TLKK
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.8e-22072.11Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA  EL+EL  QLQELVDKG+IRPS S WGAPVLFVKKKDGT RLCI+YRQLNKVTIRNKYPLPRIDDLFDQLRGA++FSKIDLRSGYHQLKV+ESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEF +MPFGLTNAPA FMDLMN +FH YLDQFVIVFIDDILVYS  +E H EHLRI                C+FWL++VVFLGHVVS +G
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        + VDPQK EAV  W+RPIS TE+RSFLGLAGYYRRF+E FS+LALPLT LTRKNVKFEW+D CE+SFQELKKRLVTAP+L LP+ G ++ IYCDAS  GL
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQ G V+AYASRQLK+HECNYPTHDLELAAVVLALK+WRHYL+GE+CHIF DHKSLKYIFDQKELNLRQRRWLELIKDYDCTI+Y PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKS      L  I  +LL EL+  +A V+    GSL+  FQVR  L+  I++ Q +D+ L+K  E  +K   + + LR DGA++K  RLCVP+   +K
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ILEEAHSSAY MHPGSTK+Y+TLKK
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

TYK00844.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.4e-22072.11Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA  EL+EL  QLQELVDKG+IRPS S WGAPVLFVKKKDGT RLCI+YRQLNKVTIRNKYPLPRIDDLFDQLRGA++FSKIDLRSGYHQLKV+ESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEF +MPFGLTNAPA FMDLMN +FH YLDQFVIVFIDDILVYS  +E H EHLRI                C+FWL++VVFLGHVVS +G
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        + VDPQK EAV  W+RPIS TE+RSFLGLAGYYRRF+E FS+LALPLT LTRKNVKFEW+D CE+SFQELKKRLVTAP+L LP+ G ++ IYCDAS  GL
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQ G V+AYASRQLK+HECNYPTHDLELAAVVLALK+WRHYL+GE+CHIF DHKSLKYIFDQKELNLRQRRWLELIKDYDCTI+Y PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKS      L  I  +LL EL+  +A V+    GSL+  FQVR  L+  I++ Q +D+ L+K  E  +K   + + LR DGA++K  RLCVP+   +K
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ILEEAHSSAY MHPGSTK+Y+TLKK
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

TYK18080.1 hypothetical protein E5676_scaffold306G004160 [Cucumis melo var. makuwa]5.1e-23178.98Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA ME+REL SQLQELVD GFIRPSAS WGAPVLFVKKKDGT RLCI+YRQLNKVTI NKYPLPRIDDLFDQ  GASVFSKIDLRSGYHQLKVKESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEFL++PFGLTNAPAAFMDLMN VFHSYLDQFVIVFIDDIL YSGSKEKHAEHLRI                CDFWLDRVVFLGHVVSVEG
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        ICVDPQKTE VDKWKRP SVTEIRSFLGLAGYYRRF+E                                                              
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQKGKV+AYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFI+HKS KYIFDQKELNLRQRRWLELIKDYDCTIDY PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKSSHSNITLNSIGSSLLRELKMGE A+S+GKLGSL+ HFQVRPILIDRIIKAQLDDARLRKLAE+VR+NQRLNYSLRGDGALMKYDRLCVPSDQTIK
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKKH
        DQI EEAHSSAY MHPGSTKVY+TLKKH
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKKH

TrEMBL top hitse value%identityAlignment
A0A5A7TKQ0 Uncharacterized protein2.4e-23178.98Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA ME+REL SQLQELVD GFIRPSAS WGAPVLFVKKKDGT RLCI+YRQLNKVTI NKYPLPRIDDLFDQ  GASVFSKIDLRSGYHQLKVKESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEFL++PFGLTNAPAAFMDLMN VFHSYLDQFVIVFIDDIL YSGSKEKHAEHLRI                CDFWLDRVVFLGHVVSVEG
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        ICVDPQKTE VDKWKRP SVTEIRSFLGLAGYYRRF+E                                                              
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQKGKV+AYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFI+HKS KYIFDQKELNLRQRRWLELIKDYDCTIDY PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKSSHSNITLNSIGSSLLRELKMGE A+S+GKLGSL+ HFQVRPILIDRIIKAQLDDARLRKLAE+VR+NQRLNYSLRGDGALMKYDRLCVPSDQTIK
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKKH
        DQI EEAHSSAY MHPGSTKVY+TLKKH
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKKH

A0A5A7U2V7 Reverse transcriptase8.7e-22172.11Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA  EL+EL  QLQELVDKG+IRPS S WGAPVLFVKKKDGT RLCI+YRQLNKVTIRNKYPLPRIDDLFDQLRGA++FSKIDLRSGYHQLKV+ESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEF +MPFGLTNAPA FMDLMN +FH YLDQFVIVFIDDILVYS  +E H EHLRI                C+FWL++VVFLGHVVS +G
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        + VDPQK EAV  W+RPIS TE+RSFLGLAGYYRRF+E FS+LALPLT LTRKNVKFEW+D CE+SFQELKKRLVTAP+L LP+ G ++ IYCDAS  GL
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQ G V+AYASRQLK+HECNYPTHDLELAAVVLALK+WRHYL+GE+CHIF DHKSLKYIFDQKELNLRQRRWLELIKDYDCTI+Y PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKS      L  I  +LL EL+  +A V+    GSL+  FQVR  L+  I++ Q +D+ L+K  E  +K   + + LR DGA++K  RLCVP+   +K
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ILEEAHSSAY MHPGSTK+Y+TLKK
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

A0A5D3BHI1 Reverse transcriptase8.7e-22172.11Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA  EL+EL  QLQELVDKG+IRPS S WGAPVLFVKKKDGT RLCI+YRQLNKVTIRNKYPLPRIDDLFDQLRGA++FSKIDLRSGYHQLKV+ESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEF +MPFGLTNAPA FMDLMN +FH YLDQFVIVFIDDILVYS  +E H EHLRI                C+FWL++VVFLGHVVS +G
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        + VDPQK EAV  W+RPIS TE+RSFLGLAGYYRRF+E FS+LALPLT LTRKNVKFEW+D CE+SFQELKKRLVTAP+L LP+ G ++ IYCDAS  GL
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQ G V+AYASRQLK+HECNYPTHDLELAAVVLALK+WRHYL+GE+CHIF DHKSLKYIFDQKELNLRQRRWLELIKDYDCTI+Y PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKS      L  I  +LL EL+  +A V+    GSL+  FQVR  L+  I++ Q +D+ L+K  E  +K   + + LR DGA++K  RLCVP+   +K
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ILEEAHSSAY MHPGSTK+Y+TLKK
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

A0A5D3BS67 Reverse transcriptase6.7e-22172.11Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA  EL+EL  QLQELVDKG+IRPS S WGAPVLFVKKKDGT RLCI+YRQLNKVTIRNKYPLPRIDDLFDQLRGA++FSKIDLRSGYHQLKV+ESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEF +MPFGLTNAPA FMDLMN +FH YLDQFVIVFIDDILVYS  +E H EHLRI                C+FWL++VVFLGHVVS +G
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        + VDPQK EAV  W+RPIS TE+RSFLGLAGYYRRF+E FS+LALPLT LTRKNVKFEW+D CE+SFQELKKRLVTAP+L LP+ G ++ IYCDAS  GL
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQ G V+AYASRQLK+HECNYPTHDLELAAVVLALK+WRHYL+GE+CHIF DHKSLKYIFDQKELNLRQRRWLELIKDYDCTI+Y PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKS      L  I  +LL EL+  +A V+    GSL+  FQVR  L+  I++ Q +D+ L+K  E  +K   + + LR DGA++K  RLCVP+   +K
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ILEEAHSSAY MHPGSTK+Y+TLKK
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

A0A5D3D2Y2 Uncharacterized protein2.4e-23178.98Show/hide
Query:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK
        MA ME+REL SQLQELVD GFIRPSAS WGAPVLFVKKKDGT RLCI+YRQLNKVTI NKYPLPRIDDLFDQ  GASVFSKIDLRSGYHQLKVKESDI K
Subjt:  MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPK

Query:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG
        TAF+TRYGHYEFL++PFGLTNAPAAFMDLMN VFHSYLDQFVIVFIDDIL YSGSKEKHAEHLRI                CDFWLDRVVFLGHVVSVEG
Subjt:  TAFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEG

Query:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
        ICVDPQKTE VDKWKRP SVTEIRSFLGLAGYYRRF+E                                                              
Subjt:  ICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        GCVLMQKGKV+AYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFI+HKS KYIFDQKELNLRQRRWLELIKDYDCTIDY PGKANVVADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        LSRKSSHSNITLNSIGSSLLRELKMGE A+S+GKLGSL+ HFQVRPILIDRIIKAQLDDARLRKLAE+VR+NQRLNYSLRGDGALMKYDRLCVPSDQTIK
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKKH
        DQI EEAHSSAY MHPGSTKVY+TLKKH
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKKH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.7e-8438.46Show/hide
Query:  RELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTS-----RLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKT
        +E+ SQ+Q+++++G IR S S + +P+  V KK   S     R+ I+YR+LN++T+ +++P+P +D++  +L   + F+ IDL  G+HQ+++    + KT
Subjt:  RELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTS-----RLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKT

Query:  AFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEGI
        AF T++GHYE+L MPFGL NAPA F   MN +    L++  +V++DDI+V+S S ++H + L +                C+F      FLGHV++ +GI
Subjt:  AFQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEGI

Query:  CVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTD-ACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
          +P+K EA+ K+  P    EI++FLGL GYYR+F+  F+ +A P+T   +KN+K + T+   + +F++LK  +   P+L +P    +F +  DAS   L
Subjt:  CVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTD-ACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        G VL Q G  ++Y SR L +HE NY T + EL A+V A K +RHYL G    I  DH+ L +++  K+ N +  RW   + ++D  I Y+ GK N VADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P20825 Retrovirus-related Pol polyprotein from transposon 2972.9e-8038.04Show/hide
Query:  ELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTS-----RLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTA
        E+ +Q+QE++++G IR S S + +P   V KK   S     R+ I+YR+LN++TI ++YP+P +D++  +L     F+ IDL  G+HQ+++ E  I KTA
Subjt:  ELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTS-----RLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTA

Query:  FQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEGIC
        F T+ GHYE+L MPFGL NAPA F   MN +    L++  +V++DDI+++S S  +H   +++                C+F      FLGH+V+ +GI 
Subjt:  FQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRI----------------CDFWLDRVVFLGHVVSVEGIC

Query:  VDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACE--RSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL
         +P K +A+  +  P    EIR+FLGL GYYR+F+  ++ +A P+T+  +K  K + T   E   +F++LK  ++  P+L LP    +F +  DAS+  L
Subjt:  VDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACE--RSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGL

Query:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
        G VL Q G  +++ SR L  HE NY   + EL A+V A K +RHYL G +  I  DH+ L+++ + KE   +  RW   + +Y   IDY+ GK N VADA
Subjt:  GCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSR----KSSHSNITLNS
        LSR    ++ HS  T +S
Subjt:  LSR----KSSHSNITLNS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.4e-7333.97Show/hide
Query:  RELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTAFQTR
        +E+   +Q+L+D  FI PS S   +PV+ V KKDGT RLC++YR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF T 
Subjt:  RELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTAFQTR

Query:  YGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHL----------------RICDFWLDRVVFLGHVVSVEGICVDPQ
         G YE+ +MPFGL NAP+ F   M   F     +FV V++DDIL++S S E+H +HL                + C F  +   FLG+ + ++ I     
Subjt:  YGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHL----------------RICDFWLDRVVFLGHVVSVEGICVDPQ

Query:  KTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGLGCVLMQ
        K  A+  +  P +V + + FLG+  YYRRF+   SK+A P+        K +WT+  +++ ++LK  L  +PVL        + +  DAS  G+G VL +
Subjt:  KTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGLGCVLMQ

Query:  ---KGK---VVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
           K K   VV Y S+ L+  + NYP  +LEL  ++ AL  +R+ L+G+   +  DH SL  + ++ E   R +RWL+ +  YD T++YL G  NVVADA
Subjt:  ---KGK---VVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        +SR    +  T+    S  +          S     ++++H  ++ +    +    +   R  +   ++ +  R NYSL  D  +   DRL VP  Q  +
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ++   H   +T+  G   V  TL K
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus6.7e-7737.65Show/hide
Query:  ELMSQLQELVDKGFIRPSASLWGAPVLFVKKK-----DGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTA
        E+  Q+ EL+  G IRPS S + +P+  V KK     +   R+ +++++LN VTI + YP+P I+     L  A  F+ +DL SG+HQ+ +KESDIPKTA
Subjt:  ELMSQLQELVDKGFIRPSASLWGAPVLFVKKK-----DGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTA

Query:  FQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRIC---------------DFWLD-RVVFLGHVVSVEGIC
        F T  G YEFL +PFGL NAPA F  +++ +   ++ +   V+IDDI+V+S   + H ++LR+                  +LD +V FLG++V+ +GI 
Subjt:  FQTRYGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRIC---------------DFWLD-RVVFLGHVVSVEGIC

Query:  VDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTR-----------KNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEI
         DP+K  A+ +   P SV E++ FLG+  YYR+F++ ++K+A PLTNLTR             V     +   +SF +LK  L ++ +L  P     F +
Subjt:  VDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTR-----------KNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEI

Query:  YCDASHQGLGCVLMQ----KGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGE-RCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCT
          DAS+  +G VL Q    + + +AY SR L K E NY T + E+ A++ +L   R YLYG     ++ DH+ L +    +  N + +RW   I++Y+C 
Subjt:  YCDASHQGLGCVLMQ----KGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGE-RCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCT

Query:  IDYLPGKANVVADALSR
        + Y PGK+NVVADALSR
Subjt:  IDYLPGKANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.9e-7333.97Show/hide
Query:  RELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTAFQTR
        +E+   +Q+L+D  FI PS S   +PV+ V KKDGT RLC++YR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF T 
Subjt:  RELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTAFQTR

Query:  YGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHL----------------RICDFWLDRVVFLGHVVSVEGICVDPQ
         G YE+ +MPFGL NAP+ F   M   F     +FV V++DDIL++S S E+H +HL                + C F  +   FLG+ + ++ I     
Subjt:  YGHYEFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHL----------------RICDFWLDRVVFLGHVVSVEGICVDPQ

Query:  KTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGLGCVLMQ
        K  A+  +  P +V + + FLG+  YYRRF+   SK+A P+        K +WT+  +++  +LK  L  +PVL        + +  DAS  G+G VL +
Subjt:  KTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGLGCVLMQ

Query:  ---KGK---VVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA
           K K   VV Y S+ L+  + NYP  +LEL  ++ AL  +R+ L+G+   +  DH SL  + ++ E   R +RWL+ +  YD T++YL G  NVVADA
Subjt:  ---KGK---VVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYLYGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADA

Query:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK
        +SR    +  T+    S  +          S     ++++H  ++ +    +    +   R  +   ++ +  R NYSL  D  +   DRL VP  Q  +
Subjt:  LSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQLDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIK

Query:  DQILEEAHSSAYTMHPGSTKVYKTLKK
        + ++   H   +T+  G   V  TL K
Subjt:  DQILEEAHSSAYTMHPGSTKVYKTLKK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.0e-2144.23Show/hide
Query:  CDFWLDRVVFLG--HVVSVEGICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPV
        C F   ++ +LG  H++S EG+  DP K EA+  W  P + TE+R FLGL GYYRRF++ + K+  PLT L +KN   +WT+    +F+ LK  + T PV
Subjt:  CDFWLDRVVFLG--HVVSVEGICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRFMEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPV

Query:  LTLP
        L LP
Subjt:  LTLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACTGATGGAATTGAGAGAGCTAATGTCGCAGTTGCAGGAATTAGTGGATAAGGGATTTATTCGACCTAGTGCATCTTTGTGGGGTGCACCGGTGTTGTTTGTTAA
AAAGAAGGATGGTACATCAAGATTATGTATTAACTATCGACAGTTAAATAAGGTAACAATACGTAATAAATATCCTTTGCCCCGAATAGATGATTTGTTCGATCAACTTC
GTGGTGCCTCTGTATTCTCTAAGATTGACTTGAGATCAGGTTATCATCAGTTGAAAGTTAAAGAGTCAGATATCCCGAAAACAGCGTTTCAAACAAGATATGGGCATTAT
GAATTCTTAATAATGCCTTTTGGGTTAACTAACGCCCCTGCAGCCTTCATGGACTTGATGAATTGGGTGTTTCATTCATACCTGGATCAGTTTGTTATTGTGTTCATAGA
TGATATATTGGTTTATTCAGGGAGTAAAGAAAAGCACGCTGAACACCTTAGAATATGCGATTTCTGGCTGGATCGGGTAGTATTTTTGGGTCATGTGGTTTCAGTAGAAG
GAATTTGTGTTGATCCTCAAAAAACTGAAGCAGTAGACAAGTGGAAAAGACCCATCTCCGTCACTGAGATTCGAAGTTTTCTTGGTTTAGCGGGGTATTATCGTCGATTT
ATGGAAGGTTTTTCTAAATTGGCACTTCCATTAACCAACTTAACTAGGAAAAATGTAAAGTTTGAATGGACAGATGCTTGTGAACGGAGTTTCCAAGAGTTGAAGAAAAG
ACTTGTAACAGCTCCAGTTCTAACACTTCCAATTCCCGGTGTGGAATTTGAAATTTATTGTGATGCTTCTCATCAAGGTTTAGGTTGTGTCTTGATGCAAAAAGGAAAAG
TAGTGGCATATGCTTCTAGGCAGTTAAAGAAGCATGAATGTAACTACCCTACTCATGACTTAGAACTGGCAGCAGTAGTCCTTGCATTAAAGCTTTGGCGGCATTACCTT
TATGGTGAGAGATGTCATATATTCATTGATCACAAAAGCTTGAAATACATATTTGATCAGAAGGAACTAAATCTAAGACAGAGAAGATGGTTAGAATTAATCAAGGATTA
TGATTGTACCATTGATTATCTTCCTGGTAAGGCTAATGTGGTGGCAGATGCTTTAAGCAGGAAGTCAAGTCATTCTAACATCACTCTAAATTCGATTGGTTCGTCGCTAT
TGAGAGAATTGAAGATGGGTGAAGCCGCTGTGTCAATAGGCAAGTTGGGAAGTTTAATTGTACATTTCCAAGTGCGACCTATTTTGATAGACCGTATTATTAAAGCCCAG
CTAGATGATGCAAGGCTGAGGAAGTTAGCAGAAGATGTAAGAAAAAACCAAAGATTGAATTATAGCTTGAGGGGTGATGGTGCCTTGATGAAATATGATAGGTTGTGTGT
ACCCAGTGACCAAACGATAAAAGATCAAATTTTAGAGGAAGCACATAGTTCTGCATATACAATGCATCCTGGTAGTACGAAAGTGTATAAAACTTTGAAGAAACATGGTG
GCCAGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCACTGATGGAATTGAGAGAGCTAATGTCGCAGTTGCAGGAATTAGTGGATAAGGGATTTATTCGACCTAGTGCATCTTTGTGGGGTGCACCGGTGTTGTTTGTTAA
AAAGAAGGATGGTACATCAAGATTATGTATTAACTATCGACAGTTAAATAAGGTAACAATACGTAATAAATATCCTTTGCCCCGAATAGATGATTTGTTCGATCAACTTC
GTGGTGCCTCTGTATTCTCTAAGATTGACTTGAGATCAGGTTATCATCAGTTGAAAGTTAAAGAGTCAGATATCCCGAAAACAGCGTTTCAAACAAGATATGGGCATTAT
GAATTCTTAATAATGCCTTTTGGGTTAACTAACGCCCCTGCAGCCTTCATGGACTTGATGAATTGGGTGTTTCATTCATACCTGGATCAGTTTGTTATTGTGTTCATAGA
TGATATATTGGTTTATTCAGGGAGTAAAGAAAAGCACGCTGAACACCTTAGAATATGCGATTTCTGGCTGGATCGGGTAGTATTTTTGGGTCATGTGGTTTCAGTAGAAG
GAATTTGTGTTGATCCTCAAAAAACTGAAGCAGTAGACAAGTGGAAAAGACCCATCTCCGTCACTGAGATTCGAAGTTTTCTTGGTTTAGCGGGGTATTATCGTCGATTT
ATGGAAGGTTTTTCTAAATTGGCACTTCCATTAACCAACTTAACTAGGAAAAATGTAAAGTTTGAATGGACAGATGCTTGTGAACGGAGTTTCCAAGAGTTGAAGAAAAG
ACTTGTAACAGCTCCAGTTCTAACACTTCCAATTCCCGGTGTGGAATTTGAAATTTATTGTGATGCTTCTCATCAAGGTTTAGGTTGTGTCTTGATGCAAAAAGGAAAAG
TAGTGGCATATGCTTCTAGGCAGTTAAAGAAGCATGAATGTAACTACCCTACTCATGACTTAGAACTGGCAGCAGTAGTCCTTGCATTAAAGCTTTGGCGGCATTACCTT
TATGGTGAGAGATGTCATATATTCATTGATCACAAAAGCTTGAAATACATATTTGATCAGAAGGAACTAAATCTAAGACAGAGAAGATGGTTAGAATTAATCAAGGATTA
TGATTGTACCATTGATTATCTTCCTGGTAAGGCTAATGTGGTGGCAGATGCTTTAAGCAGGAAGTCAAGTCATTCTAACATCACTCTAAATTCGATTGGTTCGTCGCTAT
TGAGAGAATTGAAGATGGGTGAAGCCGCTGTGTCAATAGGCAAGTTGGGAAGTTTAATTGTACATTTCCAAGTGCGACCTATTTTGATAGACCGTATTATTAAAGCCCAG
CTAGATGATGCAAGGCTGAGGAAGTTAGCAGAAGATGTAAGAAAAAACCAAAGATTGAATTATAGCTTGAGGGGTGATGGTGCCTTGATGAAATATGATAGGTTGTGTGT
ACCCAGTGACCAAACGATAAAAGATCAAATTTTAGAGGAAGCACATAGTTCTGCATATACAATGCATCCTGGTAGTACGAAAGTGTATAAAACTTTGAAGAAACATGGTG
GCCAGGTATGA
Protein sequenceShow/hide protein sequence
MALMELRELMSQLQELVDKGFIRPSASLWGAPVLFVKKKDGTSRLCINYRQLNKVTIRNKYPLPRIDDLFDQLRGASVFSKIDLRSGYHQLKVKESDIPKTAFQTRYGHY
EFLIMPFGLTNAPAAFMDLMNWVFHSYLDQFVIVFIDDILVYSGSKEKHAEHLRICDFWLDRVVFLGHVVSVEGICVDPQKTEAVDKWKRPISVTEIRSFLGLAGYYRRF
MEGFSKLALPLTNLTRKNVKFEWTDACERSFQELKKRLVTAPVLTLPIPGVEFEIYCDASHQGLGCVLMQKGKVVAYASRQLKKHECNYPTHDLELAAVVLALKLWRHYL
YGERCHIFIDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIDYLPGKANVVADALSRKSSHSNITLNSIGSSLLRELKMGEAAVSIGKLGSLIVHFQVRPILIDRIIKAQ
LDDARLRKLAEDVRKNQRLNYSLRGDGALMKYDRLCVPSDQTIKDQILEEAHSSAYTMHPGSTKVYKTLKKHGGQV