; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc12g0320231 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc12g0320231
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTransposon Ty3-G Gag-Pol polyprotein
Genome locationCMiso1.1chr12:6477161..6478015
RNA-Seq ExpressionCmc12g0320231
SyntenyCmc12g0320231
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KYP35881.1 Transposon Ty3-G Gag-Pol polyprotein, partial [Cajanus cajan]1.5e-10764.79Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L GKS++ FLDGFSG++QI IA  DQ KTIF+C FG F+++RMPFGLCNA  TFQRCMLSIF+DF+  CIE+FMDDFTVYG+ FD+ L+SL+  L 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLNFEKCHFM   GI+LG ++SSKGIEVD AK++VI  LPYP C++++RSF   AGFYR+F+K+FSK AL L+NLLQKDV  V DD+C  AFD
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LT++PI+Q P W +PFE++CDAS+YALGA+L Q VD     IY+A RTL++AQANY++TEKE L I+F+LDKFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

RZB41284.1 Transposon Ty3-G Gag-Pol polyprotein [Glycine soja]1.1e-10563.38Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L GKS++ FLDGFSG+ QI IA  DQ KT F+C FG F+++RMPFGLCNA  TFQRCM+SIF+DF+  CIEVFMDDFTVYG+ FD  LNSL  +L 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLNFEKCHFM   GI+LG ++S+KGIEVD AKI+VI  LPYP C++++RSF   AGFYR+FI+DFSK+AL L+NLLQK+V    +D+C  AFD
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK  LT++PI+Q P W  PFE++CDAS+YALGA+L Q +D     IY+A+RTL++AQANY++TEKE L I+F+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

XP_027065608.1 uncharacterized protein LOC113691594 [Coffea arabica]1.8e-10564.79Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQRCM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLN+EKCHFM  HGI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L  LLQKDV+   DDKC  AF+
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V    H IY+A R LN AQ NYS+TEKE L +IF+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

XP_027102722.1 uncharacterized protein LOC113723965 [Coffea arabica]2.1e-10665.14Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQRCM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLN+EKCHFM  HGI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L  LLQKDV+   DDKC  AF+
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V    H IY+A R LN AQ NYS+TEKEFL +IF+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

XP_027118748.1 uncharacterized protein LOC113735992 [Coffea arabica]4.8e-10664.79Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQRCM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLN++KCHFM  HGI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L  LLQKDV+   DDKC  AF+
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V    H IY+A R LN AQ NYS+TEKE LT+IF+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

TrEMBL top hitse value%identityAlignment
A0A151QZW2 Transposon Ty3-G Gag-Pol polyprotein (Fragment)7.2e-10864.79Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L GKS++ FLDGFSG++QI IA  DQ KTIF+C FG F+++RMPFGLCNA  TFQRCMLSIF+DF+  CIE+FMDDFTVYG+ FD+ L+SL+  L 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLNFEKCHFM   GI+LG ++SSKGIEVD AK++VI  LPYP C++++RSF   AGFYR+F+K+FSK AL L+NLLQKDV  V DD+C  AFD
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LT++PI+Q P W +PFE++CDAS+YALGA+L Q VD     IY+A RTL++AQANY++TEKE L I+F+LDKFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

A0A445EY74 Reverse transcriptase5.1e-10663.38Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L GKS++ FLDGFSG+ QI IA  DQ KT F+C FG F+++RMPFGLCNA  TFQRCM+SIF+DF+  CIEVFMDDFTVYG+ FD  LNSL  +L 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLNFEKCHFM   GI+LG ++S+KGIEVD AKI+VI  LPYP C++++RSF   AGFYR+FI+DFSK+AL L+NLLQK+V    +D+C  AFD
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK  LT++PI+Q P W  PFE++CDAS+YALGA+L Q +D     IY+A+RTL++AQANY++TEKE L I+F+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

A0A6P6SHK4 uncharacterized protein LOC1136915948.8e-10664.79Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQRCM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLN+EKCHFM  HGI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L  LLQKDV+   DDKC  AF+
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V    H IY+A R LN AQ NYS+TEKE L +IF+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

A0A6P6VL84 uncharacterized protein LOC1137239651.0e-10665.14Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQRCM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLN+EKCHFM  HGI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L  LLQKDV+   DDKC  AF+
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V    H IY+A R LN AQ NYS+TEKEFL +IF+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

A0A6P6WTG8 uncharacterized protein LOC1137359922.3e-10664.79Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQRCM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL 
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        RCI TNLVLN++KCHFM  HGI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L  LLQKDV+   DDKC  AF+
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V    H IY+A R LN AQ NYS+TEKE LT+IF+L+KFRSY++G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.0e-4737.91Show/hide
Query:  SYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLV
        +YF  +D   GF+QI +      KT FS + G + + RMPFGL NA ATFQRCM  I    + K   V++DD  V+    D  L SL L+ ++    NL 
Subjt:  SYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLV

Query:  LNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSV-VIDDKCMHAFDTLKDKLT
        L  +KC F+      LG +++  GI+ +  KI  IQ  P P   K+I++F    G+YRKFI +F+ IA  +T  L+K++ +   + +   AF  LK  ++
Subjt:  LNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSV-VIDDKCMHAFDTLKDKLT

Query:  SSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
          PIL+ P +   F +  DASD ALGA+L Q      H + +  RTLN  + NYS+ EKE L I+++   FR Y++G
Subjt:  SSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

P10394 Retrovirus-related Pol polyprotein from transposon 4122.2e-3735.21Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        +++QL    YF  LD  SGF+QI +    +  T FS   G + F R+PFGL  A  +FQR M   F+        ++MDD  V G      L +L  +  
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD
        +C   NL L+ EKC F       LG   + KGI  D  K +VIQN P P      R F +   +YR+FIK+F+  +  +T L +K+V     D+C  AF 
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFD

Query:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         LK +L +  +LQ P ++  F I  DAS  A GA+L Q  +     + +A R     ++N S+TE+E   I +++  FR YI G
Subjt:  TLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.2e-3832.77Show/hide
Query:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK
        ++  L    +F  LD  SG++QI +A  D+ KT FS   G + F R+PFGL NA + FQR +  +  + I K   V++DD  ++  +    +  ++ +LK
Subjt:  MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILK

Query:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQ-----------KDVSV
          I  N+ ++ EK  F       LG +VS  G + D  K+  IQ  P P C+  +RSF   A +YR FIKDF+ IA  +T++L+           K + V
Subjt:  RCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQ-----------KDVSV

Query:  VIDDKCMHAFDTLKDKLTSSP-ILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
          ++   +AF  L++ L S   IL+ P +  PF++  DAS   +GA+L Q    +   I    RTL   + NY++ E+E L I+++L K ++++ G
Subjt:  VIDDKCMHAFDTLKDKLTSSP-ILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

P20825 Retrovirus-related Pol polyprotein from transposon 2977.5e-4637.68Show/hide
Query:  YFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVL
        YF  +D   GF+QI +      KT FS + G + + RMPFGL NA ATFQRCM +I    + K   V++DD  ++       LNS+ L+  +    NL L
Subjt:  YFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVL

Query:  NFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDD-KCMHAFDTLKDKLTS
          +KC F+      LG +V+  GI+ +  K+  I + P P   K+IR+F    G+YRKFI +++ IA  +T+ L+K   +     + + AF+ LK  +  
Subjt:  NFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDD-KCMHAFDTLKDKLTS

Query:  SPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
         PILQ P +   F +  DAS+ ALGA+L Q      H I F  RTLN  + NYS+ EKE L I+++   FR Y++G
Subjt:  SPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.3e-5037.41Show/hide
Query:  IEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKR
        +  L    YF  LD  SGF+QI +   D  KT FS   G + F R+PFGL NA A FQR +  I  + I K   V++DD  V+  D+D+   +L L+L  
Subjt:  IEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKR

Query:  CIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQ-----------KDVSVV
            NL +N EK HF+ +    LG +V++ GI+ D  K+  I  +P P  +K+++ F     +YRKFI+D++K+A  LTNL +             V + 
Subjt:  CIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQ-----------KDVSVV

Query:  IDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
        +D+  + +F+ LK  L SS IL  P +  PF +  DAS++A+GA+L Q    +   I +  R+LN  + NY++ EKE L II+SLD  R+Y+ G
Subjt:  IDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.5e-1231.82Show/hide
Query:  LNSLNLILKRCIGTNLVLNFEKCHFMASHGIILG--RLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSV
        +N L ++L+         N +KC F       LG   ++S +G+  D AK+  +   P P    ++R F    G+YR+F+K++ KI   LT LL+K+ S+
Subjt:  LNSLNLILKRCIGTNLVLNFEKCHFMASHGIILG--RLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSV

Query:  VIDDKCMHAFDTLKDKLTSSPILQTPYWNLPF
           +    AF  LK  +T+ P+L  P   LPF
Subjt:  VIDDKCMHAFDTLKDKLTSSPILQTPYWNLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAACAACTAGAAGGGAAATCTTACTTTCTCTTTCTAGATGGATTCTCTGGATTTTATCAAATAATCATTGCATGTGTAGACCAACATAAGACCATCTTC
TCATGTGAATTTGGACCATTTTCTTTTAAAAGAATGCCCTTTGGACTATGTAATGCTCTTGCAACATTTCAAAGATGCATGTTAAGCATATTCACTGATTTCATA
AGAAAATGCATAGAAGTGTTTATGGACGATTTCACAGTTTATGGGAACGATTTTGATTCTTTCTTGAATAGTTTAAATTTGATTTTAAAGAGATGCATTGGTACT
AACTTGGTGCTTAACTTTGAAAAGTGTCATTTCATGGCCTCTCACGGTATAATACTAGGACGCTTAGTATCATCTAAGGGAATAGAAGTTGACAAAGCTAAAATT
AATGTAATTCAAAACTTACCCTACCCCATTTGCTTAAAAGATATTAGATCATTTTTTAGCAGTGCCGGATTTTATAGAAAGTTCATAAAAGACTTTTCTAAGATA
GCTTTGTCTTTGACAAATTTACTTCAAAAAGATGTCTCTGTTGTAATTGATGATAAATGCATGCATGCTTTTGATACTTTGAAAGATAAATTGACTTCTTCTCCT
ATCTTGCAAACACCTTATTGGAACTTACCCTTTGAAATATTGTGTGATGCAAGTGATTACGCATTAGGTGCAATGCTAGGACAAATAGTAGATAACAAATTCCAT
GCTATATATTTTGCATATCGAACTCTAAACTCTGCTCAAGCTAATTACTCCTCAACTGAAAAAGAGTTTTTGACTATAATCTTTTCTCTTGATAAGTTTCGTAGC
TACATAATTGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAACAACTAGAAGGGAAATCTTACTTTCTCTTTCTAGATGGATTCTCTGGATTTTATCAAATAATCATTGCATGTGTAGACCAACATAAGACCATCTTC
TCATGTGAATTTGGACCATTTTCTTTTAAAAGAATGCCCTTTGGACTATGTAATGCTCTTGCAACATTTCAAAGATGCATGTTAAGCATATTCACTGATTTCATA
AGAAAATGCATAGAAGTGTTTATGGACGATTTCACAGTTTATGGGAACGATTTTGATTCTTTCTTGAATAGTTTAAATTTGATTTTAAAGAGATGCATTGGTACT
AACTTGGTGCTTAACTTTGAAAAGTGTCATTTCATGGCCTCTCACGGTATAATACTAGGACGCTTAGTATCATCTAAGGGAATAGAAGTTGACAAAGCTAAAATT
AATGTAATTCAAAACTTACCCTACCCCATTTGCTTAAAAGATATTAGATCATTTTTTAGCAGTGCCGGATTTTATAGAAAGTTCATAAAAGACTTTTCTAAGATA
GCTTTGTCTTTGACAAATTTACTTCAAAAAGATGTCTCTGTTGTAATTGATGATAAATGCATGCATGCTTTTGATACTTTGAAAGATAAATTGACTTCTTCTCCT
ATCTTGCAAACACCTTATTGGAACTTACCCTTTGAAATATTGTGTGATGCAAGTGATTACGCATTAGGTGCAATGCTAGGACAAATAGTAGATAACAAATTCCAT
GCTATATATTTTGCATATCGAACTCTAAACTCTGCTCAAGCTAATTACTCCTCAACTGAAAAAGAGTTTTTGACTATAATCTTTTCTCTTGATAAGTTTCGTAGC
TACATAATTGGATAA
Protein sequenceShow/hide protein sequence
MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGT
NLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFDTLKDKLTSSP
ILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG