; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G3157 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G3157
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTy3/gypsy retrotransposon protein
Genome locationctg1041:956094..964308
RNA-Seq ExpressionCucsat.G3157
SyntenyCucsat.G3157
Gene Ontology termsGO:0006260 - DNA replication (biological process)
GO:0006281 - DNA repair (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0008622 - epsilon DNA polymerase complex (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062448.1 disease resistance protein [Cucumis melo var. makuwa]1.07e-30984.36Show/hide
Query:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF
        REVI+LIDSGATHNFIH GVV EL LP+E + KFGVTIGDGTALEGNGICK VE+KLPELTIVADFL I+LG +DVVLGMQWLSTTGFM +HW +MTM+F
Subjt:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF

Query:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ
        M G++QV+LKGDPSLTK ECSLKTIS+TWE ED GFL+EFQKIEIE  +E +SE+EEEG+E+NLPMIR LL + + +FE+PK LPPKRAVDHRIL  +GQ
Subjt:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ

Query:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI
         PINV PYKYGYIQK EIE+LV++MLQA IIRPSRSPYSSPVLLV+K+DGGWRFCVDY KLNQVT++DKFPIPVIEELLDELHG EVFSKLDLRSGYHQI
Subjt:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI

Query:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY
        RMKEED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYS +ITEHEKHLG+VFNVMKDNQLFANEKKC+IGHSR+NY
Subjt:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY

Query:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE
        LGHWISKKGVEADGE VK+MVNWPQP  VSELRGFLGLTGYY+RFVK YGNI APLTKLLQKNGF+W EDAT AFESLKQAMISVPVLALPDFSLPF IE
Subjt:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE

Query:  TDTSG
        TD SG
Subjt:  TDTSG

KAA0066816.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]7.09e-26070.51Show/hide
Query:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT
        +KG+EVIVLIDSGATHNFIH  +V E  +P+   T+FG+TIGDGT+ +G GIC +VE++L  L +V D L + LGTIDVVLGMQWL TTG M +HW ++T
Subjt:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT

Query:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI
        M+F     +VVLKGDP+L + ECSLKT+ +TWE ED GFLL++Q+ EIE  + +     + GDE  LPMI+ LLH++  +F  P  LPPKR +DHRILT+
Subjt:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI

Query:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY
         GQKPINV PYKYG+ QKEEIEKLV +MLQ GIIRPS SP+SSPVLLV+K+DGGWRFCVDY KLN++T++DKFPIPVIEELLDELHG  +FSKLDL+SGY
Subjt:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY

Query:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR
        HQIRMKEED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQ+F+PFLRR VLVFFDDIL+YS+DITEHEKHLGMVF  ++DNQL+AN KKCV  HS+
Subjt:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR

Query:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF
        ++YLGH ISK GVEAD + VK M+ WP+PKDV+ LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN F W+E+AT AFESLK AM ++PVLALPD+SLPF
Subjt:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF

Query:  NIETDTSGVGLG
         IETD SG GLG
Subjt:  NIETDTSGVGLG

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]8.93e-26171.09Show/hide
Query:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT
        +KG+ VIVLIDSGATHNFIH   V E NL +E+   F VTIGDGT   G GICKRVE++L  + +VADFLAI+LG +DV+LGMQWL TTG M +HW ++T
Subjt:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT

Query:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI
        M F +G  +++LKGDPSL + EC LKTI +TWE ED GFLLEFQ+ EI  + + + ++  +GDE  +PMI+ LL ++  +FE PK+LPPKR +DHRIL +
Subjt:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI

Query:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY
          Q+PINV PYKYGY+QKEEIEKLV +MLQAG+IRPS SPYSSPVLLV+K+DGGWRFCVDY KLNQVT+SDKFPIPVIEELLDELHG  VFSKLD++S Y
Subjt:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY

Query:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR
        HQIRM+EEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ+F+PFLRR VLVFFDDIL+YS DI+EHEKHLGMVF V++DN LFAN+KKCVI HS+
Subjt:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR

Query:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF
        + YLGH IS KGV+AD E +K MV WPQPKDV+ LRGFLGL+GYYRRFVKGYG IAAPLT+LLQKN F W+E AT AFE LK AM ++PVLALP++ LPF
Subjt:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF

Query:  NIETDTSGVGLG
         IETD SG GLG
Subjt:  NIETDTSGVGLG

TYK14624.1 uncharacterized protein E5676_scaffold1275G00160 [Cucumis melo var. makuwa]4.88e-26368.95Show/hide
Query:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT
        ++ +E+++LIDSGATHNFIH+ +  +L L +E+ T+FG TIG+GT  +G GIC+RVEVKL E+TI+ADFLA++LG++D VLGMQWL T G M +HW ++T
Subjt:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT

Query:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI
        M F     Q++LKGDP L K ECSL+T+ +TW+ +D GFLLE+  +E+E      ++++E+GDEA++PMIR LL ++  +F  PK LPPKR +DHRILT+
Subjt:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI

Query:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY
          Q+PINV PYKYG++QK EIE LV +MLQ GIIRPSRSPYSS VLLV+K+DGGWRFCVDY KLNQ TVSDKFPIPVIEELLDEL+G  VFSKLDL+SGY
Subjt:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY

Query:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR
        HQIRMKEED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ+F+PFLRR VLVFFDDIL+YS+D+TEHEKHLGM+F V++DNQL+AN KKCV  HSR
Subjt:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR

Query:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF
        + YLGH ISK GVEAD + +++MVNWP+P DV+ELRGFLGLTGYYRRFVKGY NI  PLTKLLQKN F WNE+A   F  LK AM ++PVLALPD+SLPF
Subjt:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF

Query:  NIETDTSGVGLG
         IETD SG GLG
Subjt:  NIETDTSGVGLG

TYK27437.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.66e-30784.36Show/hide
Query:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF
        REVI+LIDSGATHNFIH GVV EL LP+E + KFGVTIGDGTALEGNGICK VE+KLPELTIVADFL I+LG +DVVLGMQWLSTTGFM +HW +MTM+F
Subjt:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF

Query:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ
        M G++QV+LKGDPSLTK ECSLKTIS+TWE ED GFL+EFQKIEIE  +E +SE+EEEG+E+NLPMIR LL + + +FE+PK LPPKRAVDHRIL  +GQ
Subjt:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ

Query:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI
         PINV PYKYGYIQK EIE+LV++MLQA IIRPSRSPYSSPVLLV+K+DGGWRFCVDY KLNQVT++DKFPIPVIEELLDELHG EVFSKLDLRSGYHQI
Subjt:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI

Query:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY
        RMKEED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYS +ITEHEKHLG+VFNVMKDNQLFANEKKC+IGHSR+NY
Subjt:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY

Query:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE
        LGHWISKKGVEADGE VK+MVNWPQP  VSELRGFLGLTGYY+RFVK YGNI APLTKLLQKNGF+W EDAT AFESLKQAMISVPVLALPDFSLPF IE
Subjt:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE

Query:  TDTSG
        TD SG
Subjt:  TDTSG

TrEMBL top hitse value%identityAlignment
A0A5A7V4D8 Disease resistance protein5.20e-31084.36Show/hide
Query:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF
        REVI+LIDSGATHNFIH GVV EL LP+E + KFGVTIGDGTALEGNGICK VE+KLPELTIVADFL I+LG +DVVLGMQWLSTTGFM +HW +MTM+F
Subjt:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF

Query:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ
        M G++QV+LKGDPSLTK ECSLKTIS+TWE ED GFL+EFQKIEIE  +E +SE+EEEG+E+NLPMIR LL + + +FE+PK LPPKRAVDHRIL  +GQ
Subjt:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ

Query:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI
         PINV PYKYGYIQK EIE+LV++MLQA IIRPSRSPYSSPVLLV+K+DGGWRFCVDY KLNQVT++DKFPIPVIEELLDELHG EVFSKLDLRSGYHQI
Subjt:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI

Query:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY
        RMKEED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYS +ITEHEKHLG+VFNVMKDNQLFANEKKC+IGHSR+NY
Subjt:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY

Query:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE
        LGHWISKKGVEADGE VK+MVNWPQP  VSELRGFLGLTGYY+RFVK YGNI APLTKLLQKNGF+W EDAT AFESLKQAMISVPVLALPDFSLPF IE
Subjt:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE

Query:  TDTSG
        TD SG
Subjt:  TDTSG

A0A5A7VK94 Ty3/gypsy retrotransposon protein3.43e-26070.51Show/hide
Query:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT
        +KG+EVIVLIDSGATHNFIH  +V E  +P+   T+FG+TIGDGT+ +G GIC +VE++L  L +V D L + LGTIDVVLGMQWL TTG M +HW ++T
Subjt:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT

Query:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI
        M+F     +VVLKGDP+L + ECSLKT+ +TWE ED GFLL++Q+ EIE  + +     + GDE  LPMI+ LLH++  +F  P  LPPKR +DHRILT+
Subjt:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI

Query:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY
         GQKPINV PYKYG+ QKEEIEKLV +MLQ GIIRPS SP+SSPVLLV+K+DGGWRFCVDY KLN++T++DKFPIPVIEELLDELHG  +FSKLDL+SGY
Subjt:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY

Query:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR
        HQIRMKEED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQ+F+PFLRR VLVFFDDIL+YS+DITEHEKHLGMVF  ++DNQL+AN KKCV  HS+
Subjt:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR

Query:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF
        ++YLGH ISK GVEAD + VK M+ WP+PKDV+ LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN F W+E+AT AFESLK AM ++PVLALPD+SLPF
Subjt:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF

Query:  NIETDTSGVGLG
         IETD SG GLG
Subjt:  NIETDTSGVGLG

A0A5D3BBH7 Ty3/gypsy retrotransposon protein3.37e-25970.7Show/hide
Query:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT
        +KG+EVIVLIDSGATHNFIH  +V E  +P+   T+FG+TIGDGT+ +G GIC +VE++L  L +V D L + LGTIDVVLGMQWL TTG M +HW ++T
Subjt:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT

Query:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI
        M+F     +VVLKGDP+L + ECSLKT+ +TWE ED GFLL++Q+ EIE  + +       GDE  LPMI+ LLH++  +F+ P  LPPKR++DHRILT+
Subjt:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI

Query:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY
         GQKPINV PYKYG+ QKEEIEKLV +MLQ GIIRPS SP+SSPVLLV+K+DGGWRFCVDY KLN++T++DKFPIPVIEELLDELHG  VFSKLDL+SGY
Subjt:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY

Query:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR
        HQIRM+EED+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQ+F+PFLRR VLVFFDDIL+YS+DITEHEKHLGMVF  ++DNQL+AN KKCV  HS+
Subjt:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR

Query:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF
        ++YLGH ISK GVEAD + VK+M+ WP+PKDV+ LRGFLGLTGYYRRFVKGYG IAAPLTKLLQKN F W+E+AT AFESLK AM ++PVLALPD+SLPF
Subjt:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF

Query:  NIETDTSGVGLG
         IETD SG GLG
Subjt:  NIETDTSGVGLG

A0A5D3CUL0 Reverse transcriptase domain-containing protein2.36e-26368.95Show/hide
Query:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT
        ++ +E+++LIDSGATHNFIH+ +  +L L +E+ T+FG TIG+GT  +G GIC+RVEVKL E+TI+ADFLA++LG++D VLGMQWL T G M +HW ++T
Subjt:  MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMT

Query:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI
        M F     Q++LKGDP L K ECSL+T+ +TW+ +D GFLLE+  +E+E      ++++E+GDEA++PMIR LL ++  +F  PK LPPKR +DHRILT+
Subjt:  MMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTI

Query:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY
          Q+PINV PYKYG++QK EIE LV +MLQ GIIRPSRSPYSS VLLV+K+DGGWRFCVDY KLNQ TVSDKFPIPVIEELLDEL+G  VFSKLDL+SGY
Subjt:  DGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGY

Query:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR
        HQIRMKEED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQ+F+PFLRR VLVFFDDIL+YS+D+TEHEKHLGM+F V++DNQL+AN KKCV  HSR
Subjt:  HQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSR

Query:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF
        + YLGH ISK GVEAD + +++MVNWP+P DV+ELRGFLGLTGYYRRFVKGY NI  PLTKLLQKN F WNE+A   F  LK AM ++PVLALPD+SLPF
Subjt:  VNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPF

Query:  NIETDTSGVGLG
         IETD SG GLG
Subjt:  NIETDTSGVGLG

A0A5D3DUJ0 Ty3/gypsy retrotransposon protein8.04e-30884.36Show/hide
Query:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF
        REVI+LIDSGATHNFIH GVV EL LP+E + KFGVTIGDGTALEGNGICK VE+KLPELTIVADFL I+LG +DVVLGMQWLSTTGFM +HW +MTM+F
Subjt:  REVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMF

Query:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ
        M G++QV+LKGDPSLTK ECSLKTIS+TWE ED GFL+EFQKIEIE  +E +SE+EEEG+E+NLPMIR LL + + +FE+PK LPPKRAVDHRIL  +GQ
Subjt:  MAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQ

Query:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI
         PINV PYKYGYIQK EIE+LV++MLQA IIRPSRSPYSSPVLLV+K+DGGWRFCVDY KLNQVT++DKFPIPVIEELLDELHG EVFSKLDLRSGYHQI
Subjt:  KPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI

Query:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY
        RMKEED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYS +ITEHEKHLG+VFNVMKDNQLFANEKKC+IGHSR+NY
Subjt:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY

Query:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE
        LGHWISKKGVEADGE VK+MVNWPQP  VSELRGFLGLTGYY+RFVK YGNI APLTKLLQKNGF+W EDAT AFESLKQAMISVPVLALPDFSLPF IE
Subjt:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIE

Query:  TDTSG
        TD SG
Subjt:  TDTSG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.1e-7042.44Show/hide
Query:  SPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGG-----WRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI
        S Y Y    ++E+E  +  ML  GIIR S SPY+SP+ +V K+        +R  +DY KLN++TV D+ PIP ++E+L +L     F+ +DL  G+HQI
Subjt:  SPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGG-----WRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQI

Query:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY
         M  E V KTAF T  GHYE+L MPFGL NAPATFQ  MN I RP L +  LV+ DDI+++ST + EH + LG+VF  +    L     KC        +
Subjt:  RMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNY

Query:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNG--FNWNEDATEAFESLKQAMISVPVLALPDFSLPFN
        LGH ++  G++ + E ++A+  +P P    E++ FLGLTGYYR+F+  + +IA P+TK L+KN      N +   AF+ LK  +   P+L +PDF+  F 
Subjt:  LGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNG--FNWNEDATEAFESLKQAMISVPVLALPDFSLPFN

Query:  IETDTSGVGLG
        + TD S V LG
Subjt:  IETDTSGVGLG

P20825 Retrovirus-related Pol polyprotein from transposon 2973.1e-7433.78Show/hide
Query:  KGREVIVLIDSGATHNFIHEGV--VNELNLPVEEKTKFG-VTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTT
        KGR    L+D+G+T N I+E +  +   N   E  T  G +T+ D   L  N I K+ E           ++       D+++G + L     + +++  
Subjt:  KGREVIVLIDSGATHNFIHEGV--VNELNLPVEEKTKFG-VTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTT

Query:  MTMMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFL--LEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGL-FEMPKELPPKRAVDH
         T+     T +++     S       ++   E+    D   +  L+F +  ++  N+E++ +           ++ LL++F+ L ++  ++L     + H
Subjt:  MTMMFMAGTTQVVLKGDPSLTKTECSLKTISETWEMEDHGFL--LEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGL-FEMPKELPPKRAVDH

Query:  RILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGG-----WRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEV
         +L      PI    Y      + E+E  V +ML  G+IR S SPY+SP  +V K+        +R  +DY KLN++T+ D++PIP ++E+L +L   + 
Subjt:  RILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGG-----WRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEV

Query:  FSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFAN
        F+ +DL  G+HQI M EE + KTAF T  GHYE+L MPFGL NAPATFQ  MN I RP L +  LV+ DDI+I+ST +TEH   + +VF  + D  L   
Subjt:  FSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFAN

Query:  EKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNG--FNWNEDATEAFESLKQAMISV
          KC       N+LGH ++  G++ +   VKA+V++P P    E+R FLGLTGYYR+F+  Y +IA P+T  L+K         +  EAFE LK  +I  
Subjt:  EKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNG--FNWNEDATEAFESLKQAMISV

Query:  PVLALPDFSLPFNIETDTSGVGLG
        P+L LPDF   F + TD S + LG
Subjt:  PVLALPDFSLPFNIETDTSGVGLG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.6e-6440.06Show/hide
Query:  ELPPKRA------VDHRILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIE
        +LPP+ A      V H I    G +   + PY      ++EI K+V K+L    I PS+SP SSPV+LV K+DG +R CVDY  LN+ T+SD FP+P I+
Subjt:  ELPPKRA------VDHRILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIE

Query:  ELLDELHGVEVFSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVF
         LL  +   ++F+ LDL SGYHQI M+ +D  KTAF T  G YE+ VMPFGL NAP+TF   M   FR    RFV V+ DDILI+S    EH KHL  V 
Subjt:  ELLDELHGVEVFSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVF

Query:  NVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFE
          +K+  L   +KKC        +LG+ I  + +        A+ ++P PK V + + FLG+  YYRRF+     IA P+ +L   +   W E   +A E
Subjt:  NVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFE

Query:  SLKQAMISVPVLALPDFSLPFNIETDTSGVGLGQFYHRMANQ
         LK A+ + PVL   +    + + TD S  G+G     + N+
Subjt:  SLKQAMISVPVLALPDFSLPFNIETDTSGVGLGQFYHRMANQ

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus7.0e-6637.57Show/hide
Query:  EGDEANLPMIRKLLHRFKGLFEMP-KELPPKRAVDHRILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKR-----DGG
        E  +    ++  LL  F  +FE P   +  + AV   I T + Q PI    Y Y    + E+E+ + ++LQ GIIRPS SPY+SP+ +V K+     +  
Subjt:  EGDEANLPMIRKLLHRFKGLFEMP-KELPPKRAVDHRILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKR-----DGG

Query:  WRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFV
        +R  VD+ +LN VT+ D +PIP I   L  L   + F+ LDL SG+HQI MKE D+ KTAF T  G YEFL +PFGL NAPA FQ +++ I R  + +  
Subjt:  WRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFV

Query:  LVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGN
         V+ DDI+++S D   H K+L +V   +    L  N +K     ++V +LG+ ++  G++AD + V+A+   P P  V EL+ FLG+T YYR+F++ Y  
Subjt:  LVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGN

Query:  IAAPLTKLLQ------------KNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIETDTSGVGLG
        +A PLT L +            K     +E A ++F  LK  + S  +LA P F+ PF++ TD S   +G
Subjt:  IAAPLTKLLQ------------KNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIETDTSGVGLG

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-6339.77Show/hide
Query:  ELPPKRA------VDHRILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIE
        +LPP+ A      V H I    G +   + PY      ++EI K+V K+L    I PS+SP SSPV+LV K+DG +R CVDY  LN+ T+SD FP+P I+
Subjt:  ELPPKRA------VDHRILTIDGQKPINVSPYKYGYIQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIE

Query:  ELLDELHGVEVFSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVF
         LL  +   ++F+ LDL SGYHQI M+ +D  KTAF T  G YE+ VMPFGL NAP+TF   M   FR    RFV V+ DDILI+S    EH KHL  V 
Subjt:  ELLDELHGVEVFSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVF

Query:  NVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFE
          +K+  L   +KKC        +LG+ I  + +        A+ ++P PK V + + FLG+  YYRRF+     IA P+ +L   +   W E   +A +
Subjt:  NVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFE

Query:  SLKQAMISVPVLALPDFSLPFNIETDTSGVGLGQFYHRMANQ
         LK A+ + PVL   +    + + TD S  G+G     + N+
Subjt:  SLKQAMISVPVLALPDFSLPFNIETDTSGVGLGQFYHRMANQ

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein3.1e-0830.13Show/hide
Query:  EVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLG--TIDVVLGMQWLSTTGFMGVHWTTMTMM
        +V+V IDSGAT NFI   +   L LP     +  V +G    ++  G C  + + + E+ I  +FL + L    +DV+LG +WLS  G   V+W      
Subjt:  EVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLG--TIDVVLGMQWLSTTGFMGVHWTTMTMM

Query:  FMAGTTQVVLKGD-PSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSE
        F      + L  +   L +    +K  SE  E ED           IEE    D E
Subjt:  FMAGTTQVVLKGD-PSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSE

AT3G30770.1 Eukaryotic aspartyl protease family protein3.2e-0528.37Show/hide
Query:  EVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKL--GTIDVVLGMQWLSTTGFMGVHWTTMTMM
        +V+V+IDSGAT+NFI + +   L LP     +  V +G    ++  G C  + + + E+ I  +FL + L    +DV+LG           + W      
Subjt:  EVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKL--GTIDVVLGMQWLSTTGFMGVHWTTMTMM

Query:  FMAGTTQVVL-KGDPSLTKTECSLKTISETWEMEDHGFLLE
        F      V L   D  L +    +K  SE +E E     LE
Subjt:  FMAGTTQVVL-KGDPSLTKTECSLKTISETWEMEDHGFLLE

ATMG00850.1 DNA/RNA polymerases superfamily protein1.3e-0655Show/hide
Query:  IQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGW
        +++  ++  + +ML+A II+PS SPYSSPVLLV+K+DGGW
Subjt:  IQKEEIEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein1.3e-3857.36Show/hide
Query:  HLGMVFNVMKDNQLFANEKKCVIGHSRVNYLG--HWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWN
        HLGMV  + + +Q +AN KKC  G  ++ YLG  H IS +GV AD   ++AMV WP+PK+ +ELRGFLGLTGYYRRFVK YG I  PLT+LL+KN   W 
Subjt:  HLGMVFNVMKDNQLFANEKKCVIGHSRVNYLG--HWISKKGVEADGENVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQKNGFNWN

Query:  EDATEAFESLKQAMISVPVLALPDFSLPF
        E A  AF++LK A+ ++PVLALPD  LPF
Subjt:  EDATEAFESLKQAMISVPVLALPDFSLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGAAGAGAAGTAATTGTCCTCATTGATAGTGGCGCAACACACAATTTCATCCATGAAGGAGTGGTTAATGAACTAAACTTACCCGTGGAGGAAAAAACTAAATT
CGGAGTGACTATTGGCGACGGAACTGCACTGGAGGGAAATGGGATTTGCAAGAGAGTCGAGGTTAAGCTGCCGGAGTTAACAATTGTGGCAGATTTCTTAGCGATCAAAC
TGGGTACAATTGATGTGGTTTTGGGAATGCAGTGGCTCAGCACGACTGGATTCATGGGGGTTCATTGGACGACGATGACAATGATGTTCATGGCAGGAACCACGCAGGTG
GTTTTAAAAGGGGATCCGTCCCTTACAAAAACTGAATGTTCCTTAAAGACGATCTCTGAGACGTGGGAAATGGAGGACCATGGATTTTTGTTGGAATTCCAGAAAATTGA
AATTGAAGAAAATAATGAGGAAGACAGTGAAAGGGAGGAAGAGGGGGATGAAGCAAATTTACCTATGATTCGCAAGCTGTTGCATAGGTTCAAGGGGCTGTTCGAGATGC
CGAAAGAATTACCTCCAAAAAGGGCAGTAGATCATCGAATTCTGACTATCGACGGGCAAAAACCCATCAATGTGAGCCCTTACAAGTACGGGTACATACAGAAAGAAGAG
ATAGAGAAACTGGTGACGAAAATGCTTCAAGCGGGAATTATCCGGCCAAGCCGAAGCCCATATTCCAGTCCGGTATTGCTAGTAAGGAAACGCGATGGTGGGTGGAGATT
TTGTGTAGACTACTGTAAATTGAATCAGGTGACTGTCTCCGATAAATTTCCCATTCCCGTAATCGAAGAACTGCTAGATGAGTTACATGGTGTTGAAGTTTTTTCCAAGC
TGGATTTGCGTTCGGGGTATCACCAAATTCGGATGAAGGAGGAGGATGTTGAAAAAACCGCGTTTCGCACTCACGAAGGGCATTATGAATTTCTAGTTATGCCTTTCGGG
TTAACCAATGCGCCGGCCACTTTTCAGTCCCTAATGAATCAAATTTTCAGGCCCTTCCTAAGACGGTTTGTGTTGGTCTTCTTCGACGATATCCTGATCTATAGTACTGA
TATTACCGAACATGAGAAGCACTTGGGAATGGTGTTCAACGTAATGAAAGACAACCAGTTATTTGCCAATGAAAAGAAATGCGTAATAGGGCACTCACGAGTCAATTATT
TGGGACACTGGATCTCTAAAAAAGGTGTAGAAGCGGATGGGGAAAATGTGAAAGCAATGGTAAACTGGCCCCAACCCAAAGATGTTTCTGAATTAAGGGGATTCCTCGGC
CTCACGGGATATTATCGAAGATTTGTCAAAGGATATGGGAACATTGCTGCTCCCCTGACCAAATTGTTGCAAAAAAATGGGTTCAATTGGAACGAAGATGCCACAGAAGC
TTTCGAGTCTCTGAAACAAGCAATGATCTCGGTACCGGTCTTAGCACTCCCCGATTTTTCCTTACCATTTAACATTGAAACGGACACCTCAGGCGTAGGGCTGGGACAGT
TTTATCACAGAATGGCCAACCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGAAGAGAAGTAATTGTCCTCATTGATAGTGGCGCAACACACAATTTCATCCATGAAGGAGTGGTTAATGAACTAAACTTACCCGTGGAGGAAAAAACTAAATT
CGGAGTGACTATTGGCGACGGAACTGCACTGGAGGGAAATGGGATTTGCAAGAGAGTCGAGGTTAAGCTGCCGGAGTTAACAATTGTGGCAGATTTCTTAGCGATCAAAC
TGGGTACAATTGATGTGGTTTTGGGAATGCAGTGGCTCAGCACGACTGGATTCATGGGGGTTCATTGGACGACGATGACAATGATGTTCATGGCAGGAACCACGCAGGTG
GTTTTAAAAGGGGATCCGTCCCTTACAAAAACTGAATGTTCCTTAAAGACGATCTCTGAGACGTGGGAAATGGAGGACCATGGATTTTTGTTGGAATTCCAGAAAATTGA
AATTGAAGAAAATAATGAGGAAGACAGTGAAAGGGAGGAAGAGGGGGATGAAGCAAATTTACCTATGATTCGCAAGCTGTTGCATAGGTTCAAGGGGCTGTTCGAGATGC
CGAAAGAATTACCTCCAAAAAGGGCAGTAGATCATCGAATTCTGACTATCGACGGGCAAAAACCCATCAATGTGAGCCCTTACAAGTACGGGTACATACAGAAAGAAGAG
ATAGAGAAACTGGTGACGAAAATGCTTCAAGCGGGAATTATCCGGCCAAGCCGAAGCCCATATTCCAGTCCGGTATTGCTAGTAAGGAAACGCGATGGTGGGTGGAGATT
TTGTGTAGACTACTGTAAATTGAATCAGGTGACTGTCTCCGATAAATTTCCCATTCCCGTAATCGAAGAACTGCTAGATGAGTTACATGGTGTTGAAGTTTTTTCCAAGC
TGGATTTGCGTTCGGGGTATCACCAAATTCGGATGAAGGAGGAGGATGTTGAAAAAACCGCGTTTCGCACTCACGAAGGGCATTATGAATTTCTAGTTATGCCTTTCGGG
TTAACCAATGCGCCGGCCACTTTTCAGTCCCTAATGAATCAAATTTTCAGGCCCTTCCTAAGACGGTTTGTGTTGGTCTTCTTCGACGATATCCTGATCTATAGTACTGA
TATTACCGAACATGAGAAGCACTTGGGAATGGTGTTCAACGTAATGAAAGACAACCAGTTATTTGCCAATGAAAAGAAATGCGTAATAGGGCACTCACGAGTCAATTATT
TGGGACACTGGATCTCTAAAAAAGGTGTAGAAGCGGATGGGGAAAATGTGAAAGCAATGGTAAACTGGCCCCAACCCAAAGATGTTTCTGAATTAAGGGGATTCCTCGGC
CTCACGGGATATTATCGAAGATTTGTCAAAGGATATGGGAACATTGCTGCTCCCCTGACCAAATTGTTGCAAAAAAATGGGTTCAATTGGAACGAAGATGCCACAGAAGC
TTTCGAGTCTCTGAAACAAGCAATGATCTCGGTACCGGTCTTAGCACTCCCCGATTTTTCCTTACCATTTAACATTGAAACGGACACCTCAGGCGTAGGGCTGGGACAGT
TTTATCACAGAATGGCCAACCAATAG
Protein sequenceShow/hide protein sequence
MKGREVIVLIDSGATHNFIHEGVVNELNLPVEEKTKFGVTIGDGTALEGNGICKRVEVKLPELTIVADFLAIKLGTIDVVLGMQWLSTTGFMGVHWTTMTMMFMAGTTQV
VLKGDPSLTKTECSLKTISETWEMEDHGFLLEFQKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLFEMPKELPPKRAVDHRILTIDGQKPINVSPYKYGYIQKEE
IEKLVTKMLQAGIIRPSRSPYSSPVLLVRKRDGGWRFCVDYCKLNQVTVSDKFPIPVIEELLDELHGVEVFSKLDLRSGYHQIRMKEEDVEKTAFRTHEGHYEFLVMPFG
LTNAPATFQSLMNQIFRPFLRRFVLVFFDDILIYSTDITEHEKHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGENVKAMVNWPQPKDVSELRGFLG
LTGYYRRFVKGYGNIAAPLTKLLQKNGFNWNEDATEAFESLKQAMISVPVLALPDFSLPFNIETDTSGVGLGQFYHRMANQ