; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G14890 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G14890
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr5:15622569..15626029
RNA-Seq ExpressionCSPI05G14890
SyntenyCSPI05G14890
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR023780 - Chromo domain
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039128.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]6.6e-28353.67Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM+FW  G+QI+LKGDPSLI+AEC L TLEKTW  EDQGFLLE+ N E+E ED Y+ + +EK DE ++PM + LL+ Y +IF TPK LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
         IDHRILTL   +PINVRPYKYGH+                           PVLLV KKDGGWRFCVDY KLNQ TVSDKFPIPVIEELLDEL GA   
Subjt:  AIDHRILTLLKHRPINVRPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEFLVM F LTNAPATFQSLMNQ                                       DN L+AN 
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHS+IQYLGH ISK GVEADEDKIR         +V+ LRGFLGLTGYYRRFVKGY  IA PLTKLLQKNAFKW+EE+ TAF  LK+AM T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
         LP+WN+PFTIETDASG  L AV+SQ GHPIA +S+ LS +AQ K IY RELM VVLSVQ+WRH+LLGRKF+I+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELNK---------------------------ELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP   NK  DALSR+E   EL++                           +LKK    GGKF+  NGRLLYKGRLV  +TSSLIP++LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELNK---------------------------ELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DS+LGGHS FL TYKR+SGELYW+ MK  +KKYV+QC++CQ+NKYEATK AG+LQ +PI D+ILEDWTMDFI+GLP  GG++VIMVVVDR +KY+Y I 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        L+HPFSAKQ                                                                             KWD+ IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFHA TK TPF+ VYG SPPPLLSYG KK  NN                        N MKK AD KRRELK +VG EVYLKL+PYRQRSLA KK E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G     QHQ P+LT+ FELQL PE VLG+RWN+ELG NEW IKW+GL ++EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
        Q NQQF +F LEDK+N+EPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]1.1e-29355.44Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MKVHW SLTMTFW KG++I+LKGDPSL K+EC L TLEKTW   DQGFLLEFQNYE+  E   E E E K  EE LPM Q LL+ Y ++F  P  LPP+R
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
        AIDHRILT+   +PINVRPYKYGHV                           PVLLV KKDGGWRFCVDY KLNQVTV+DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEFLVM F LTNAPATFQSLMN+                                       D++LFAN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
        +KCVIAHSQ+QYLGH+IS RGVEADEDKIR         +++ LRGFLGLTGYYRRFVK Y EIAAPLTKLLQKNAF W+EE+T AF+ LKLAM TL +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W+ PFTIETDASGV L AV+SQ+GHPIA FS+ LS RAQ K IY RELM VVLSVQKWRH+LLGRKFTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK+ DALSR +   ELN                            ELK    +GGK+Q  NGRLLYKGR+V  + SSLIP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MKA +K+YVE+CD CQ+NK+EATKPAG+LQ IPI D+ILEDWTMDFI+GLP  GG +VIMVVVDR SKYSY + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        L+HP++AKQ                                                                            +KWDKLIPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNNG-----------------------NIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFHA TK TP+Q V+G +PPPLLSYG K+ PNN                        N MKKMAD  RRELKF++G EVYLKLRPYRQRSLA KKCE  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNNG-----------------------NIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K GK    Q QQPILT++FELQLWPE VLG+RWN+ELGGNEW IKWK LPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF  F LEDK+NLEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.1e-28253.18Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM FW +G ++VLKGDP+LI+AEC L TLEKTW+ EDQGFLL++Q YE+E ED   K T    DEE LPM Q LL  Y ++F++P +LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
        +IDHRILTL   +PINVRPYKYGH                            PVLLV KKDGGWRFCVDY KLN++T++DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEF+VM F LTNAPATFQSLMNQ                                       DNQL+AN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHSQI YLGH+ISK GVEAD+DK+         ++V+ LRGFLGLTGYYRRFVKGY EIAAPLTKLLQKNAFKW E +T AFE+LK AM+T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W++PF IETDASG  L AV+SQN HPIA FS+ LSTRAQ K IY RELM VVLSVQKWRH+LLGR+FTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK  DALSRM+   EL                            ++L+ N +  GK+ + NG L+YKGR+V  K+SS+IP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MK  +KKYVEQC+ICQ+NK EATKPAG+LQ +PI DRILEDWTMDFI+GLP  GG++VIMVVVDR SKY+Y + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        ++HPFSAKQ                                                                             KW + IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFH+ T+ TPFQ VYG  PPPL+SYG KK PN+                        N MKK AD KRRELKF+VG EVYLKLRPYRQRSLA K+ E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G   + Q QQP LT EFELQLWPETVLG+RW+ ELG NEW +KWKGLPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF +F LEDK++LEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

TYK27058.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-28253.18Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM FW +G ++VLKGDP+LI+AEC L TLEKTW+ EDQGFLL++Q YE+E ED   K T    DEE LPM Q LL  Y ++F++P +LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
        +IDHRILTL   +PINVRPYKYGH                            PVLLV KKDGGWRFCVDY KLN++T++DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEF+VM F LTNAPATFQSLMNQ                                       DNQL+AN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHSQI YLGH+ISK GVEAD+DK+         ++V+ LRGFLGLTGYYRRFVKGY EIAAPLTKLLQKNAFKW E +T AFE+LK AM+T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W++PF IETDASG  L AV+SQN HPIA FS+ LSTRAQ K IY RELM VVLSVQKWRH+LLGR+FTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK  DALSRM+   EL                            ++L+ N +  GK+ + NG L+YKGR+V  K+SS+IP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MK  +KKYVEQC+ICQ+NK EATKPAG+LQ +PI DRILEDWTMDFI+GLP  GG++VIMVVVDR SKY+Y + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        ++HPFSAKQ                                                                             KW + IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFH+ T+ TPFQ VYG  PPPL+SYG KK PN+                        N MKK AD KRRELKF+VG EVYLKLRPYRQRSLA K+ E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G   + Q QQP LT EFELQLWPETVLG+RW+ ELG NEW +KWKGLPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF +F LEDK+ LEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

TYK27963.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-28253.18Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM FW +G ++VLKGDP+LI+AEC L TLEKTW+ EDQGFLL++Q YE+E ED   K T    DEE LPM Q LL  Y ++F++P +LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
        +IDHRILTL   +PINVRPYKYGH                            PVLLV KKDGGWRFCVDY KLN++T++DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEF+VM F LTNAPATFQSLMNQ                                       DNQL+AN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHSQI YLGH+ISK GVEAD+DK+         ++V+ LRGFLGLTGYYRRFVKGY EIAAPLTKLLQKNAFKW E +T AFE+LK AM+T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W++PF IETDASG  L AV+SQN HPIA FS+ LSTRAQ K IY RELM VVLSVQKWRH+LLGR+FTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK  DALSRM+   EL                            ++L+ N +  GK+ + NG L+YKGR+V  K+SS+IP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MK  +KKYVEQC+ICQ+NK EATKPAG+LQ +PI DRILEDWTMDFI+GLP  GG++VIMVVVDR SKY+Y + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        ++HPFSAKQ                                                                             KW + IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFH+ T+ TPFQ VYG  PPPL+SYG KK PN+                        N MKK AD KRRELKF+VG EVYLKLRPYRQRSLA K+ E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G   + Q QQP LT EFELQLWPETVLG+RW+ ELG NEW +KWKGLPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF +F LEDK+ LEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

TrEMBL top hitse value%identityAlignment
A0A5A7T8C0 Ty3/gypsy retrotransposon protein3.2e-28353.67Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM+FW  G+QI+LKGDPSLI+AEC L TLEKTW  EDQGFLLE+ N E+E ED Y+ + +EK DE ++PM + LL+ Y +IF TPK LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
         IDHRILTL   +PINVRPYKYGH+                           PVLLV KKDGGWRFCVDY KLNQ TVSDKFPIPVIEELLDEL GA   
Subjt:  AIDHRILTLLKHRPINVRPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEFLVM F LTNAPATFQSLMNQ                                       DN L+AN 
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHS+IQYLGH ISK GVEADEDKIR         +V+ LRGFLGLTGYYRRFVKGY  IA PLTKLLQKNAFKW+EE+ TAF  LK+AM T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
         LP+WN+PFTIETDASG  L AV+SQ GHPIA +S+ LS +AQ K IY RELM VVLSVQ+WRH+LLGRKF+I+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELNK---------------------------ELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP   NK  DALSR+E   EL++                           +LKK    GGKF+  NGRLLYKGRLV  +TSSLIP++LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELNK---------------------------ELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DS+LGGHS FL TYKR+SGELYW+ MK  +KKYV+QC++CQ+NKYEATK AG+LQ +PI D+ILEDWTMDFI+GLP  GG++VIMVVVDR +KY+Y I 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        L+HPFSAKQ                                                                             KWD+ IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFHA TK TPF+ VYG SPPPLLSYG KK  NN                        N MKK AD KRRELK +VG EVYLKL+PYRQRSLA KK E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G     QHQ P+LT+ FELQL PE VLG+RWN+ELG NEW IKW+GL ++EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
        Q NQQF +F LEDK+N+EPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

A0A5D3BBH7 Ty3/gypsy retrotransposon protein5.5e-28353.18Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM FW +G ++VLKGDP+LI+AEC L TLEKTW+ EDQGFLL++Q YE+E ED   K T    DEE LPM Q LL  Y ++F++P +LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
        +IDHRILTL   +PINVRPYKYGH                            PVLLV KKDGGWRFCVDY KLN++T++DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEF+VM F LTNAPATFQSLMNQ                                       DNQL+AN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHSQI YLGH+ISK GVEAD+DK+         ++V+ LRGFLGLTGYYRRFVKGY EIAAPLTKLLQKNAFKW E +T AFE+LK AM+T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W++PF IETDASG  L AV+SQN HPIA FS+ LSTRAQ K IY RELM VVLSVQKWRH+LLGR+FTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK  DALSRM+   EL                            ++L+ N +  GK+ + NG L+YKGR+V  K+SS+IP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MK  +KKYVEQC+ICQ+NK EATKPAG+LQ +PI DRILEDWTMDFI+GLP  GG++VIMVVVDR SKY+Y + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        ++HPFSAKQ                                                                             KW + IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFH+ T+ TPFQ VYG  PPPL+SYG KK PN+                        N MKK AD KRRELKF+VG EVYLKLRPYRQRSLA K+ E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G   + Q QQP LT EFELQLWPETVLG+RW+ ELG NEW +KWKGLPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF +F LEDK++LEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

A0A5D3DU86 Ty3/gypsy retrotransposon protein1.6e-28253.18Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM FW +G ++VLKGDP+LI+AEC L TLEKTW+ EDQGFLL++Q YE+E ED   K T    DEE LPM Q LL  Y ++F++P +LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
        +IDHRILTL   +PINVRPYKYGH                            PVLLV KKDGGWRFCVDY KLN++T++DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEF+VM F LTNAPATFQSLMNQ                                       DNQL+AN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHSQI YLGH+ISK GVEAD+DK+         ++V+ LRGFLGLTGYYRRFVKGY EIAAPLTKLLQKNAFKW E +T AFE+LK AM+T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W++PF IETDASG  L AV+SQN HPIA FS+ LSTRAQ K IY RELM VVLSVQKWRH+LLGR+FTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK  DALSRM+   EL                            ++L+ N +  GK+ + NG L+YKGR+V  K+SS+IP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MK  +KKYVEQC+ICQ+NK EATKPAG+LQ +PI DRILEDWTMDFI+GLP  GG++VIMVVVDR SKY+Y + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        ++HPFSAKQ                                                                             KW + IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFH+ T+ TPFQ VYG  PPPL+SYG KK PN+                        N MKK AD KRRELKF+VG EVYLKLRPYRQRSLA K+ E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G   + Q QQP LT EFELQLWPETVLG+RW+ ELG NEW +KWKGLPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF +F LEDK+ LEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

A0A5D3DWA9 Ty3/gypsy retrotransposon protein1.6e-28253.18Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM FW +G ++VLKGDP+LI+AEC L TLEKTW+ EDQGFLL++Q YE+E ED   K T    DEE LPM Q LL  Y ++F++P +LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
        +IDHRILTL   +PINVRPYKYGH                            PVLLV KKDGGWRFCVDY KLN++T++DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEF+VM F LTNAPATFQSLMNQ                                       DNQL+AN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHSQI YLGH+ISK GVEAD+DK+         ++V+ LRGFLGLTGYYRRFVKGY EIAAPLTKLLQKNAFKW E +T AFE+LK AM+T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W++PF IETDASG  L AV+SQN HPIA FS+ LSTRAQ K IY RELM VVLSVQKWRH+LLGR+FTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK  DALSRM+   EL                            ++L+ N +  GK+ + NG L+YKGR+V  K+SS+IP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MK  +KKYVEQC+ICQ+NK EATKPAG+LQ +PI DRILEDWTMDFI+GLP  GG++VIMVVVDR SKY+Y + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        ++HPFSAKQ                                                                             KW + IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFH+ T+ TPFQ VYG  PPPL+SYG KK PN+                        N MKK AD KRRELKF+VG EVYLKLRPYRQRSLA K+ E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G   + Q QQP LT EFELQLWPETVLG+RW+ ELG NEW +KWKGLPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF +F LEDK+ LEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

A0A5D3E325 Ty3/gypsy retrotransposon protein2.1e-28253.18Show/hide
Query:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR
        MK+HW SLTM FW +G ++VLKGDP+LI+AEC L TLEKTW+ EDQGFLL++Q YE+E ED   K T +  DEE LPM Q LL  Y ++F +P +LPPKR
Subjt:  MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKR

Query:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--
         IDHRILTL   +PINVRPYKYGH                            PVLLV KKDGGWRFCVDY KLN++T++DKFPIPVIEELLDEL+GAT  
Subjt:  AIDHRILTLLKHRPINVRPYKYGH---------------------------VPVLLV-KKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGAT--

Query:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK
                                    GHYEF+VM F LTNAPATFQSLMNQ                                       DNQL+AN+
Subjt:  ----------------------------GHYEFLVMSFALTNAPATFQSLMNQ---------------------------------------DNQLFANK

Query:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL
         KCV AHSQI YLGH+ISK GVEAD+DK+         ++V+ LRGFLGLTGYYRRFVKGY EIAAPLTKLLQKNAFKW E +T AFE+LK AM+T+ +L
Subjt:  NKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLIL

Query:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL
        ALP+W++PF IETDASG  L AV+SQN HPIA FS+ LSTRAQ K IY RELM VVLSVQKWRH+LLGR+FTI+SDQ  LKFLLEQREVQPQF + LTKL
Subjt:  ALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKL

Query:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF
        LGYDFEILYQP LQNK  DALSRM+   EL                            ++L+ N +  GK+ + NG L+YKGR+V  K+SS+IP +LHTF
Subjt:  LGYDFEILYQPELQNKIGDALSRMEQPFELN---------------------------KELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF

Query:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL
        +DSILGGHS FL TYKR+SGEL+WK MK  +KKYVEQC+ICQ+NK EATKPAG+LQ +PI DRILEDWTMDFI+GLP  GG++VIMVVVDR SKY+Y + 
Subjt:  NDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLIL

Query:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN
        ++HPFSAKQ                                                                             KW + IPWAELWYN
Subjt:  LRHPFSAKQA---------------------------------------------------------------------------RKWDKLIPWAELWYN

Query:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-
        TTFH+ T+ TPFQ VYG  PPPL+SYG KK PN+                        N MKK AD KRRELKF+VG EVYLKLRPYRQRSLA K+ E  
Subjt:  TTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN-----------------------GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN-

Query:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF
                                         H S +K K G   + Q QQP LT EFELQLWPETVLG+RW+ ELG NEW +KWKGLPD+EATWESV+
Subjt:  ---------------------------------HFS-VKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVF

Query:  QTNQQFHNFPLEDKLNLEPRG
          NQQF +F LEDK+ LEPRG
Subjt:  QTNQQFHNFPLEDKLNLEPRG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein6.0e-4526.5Show/hide
Query:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH
        E E+ D Y+  K+   + + E+LP     L++ VE+ +    LP +         +A++  I   LK   I          PV+ V KK+G  R  VDY 
Subjt:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH

Query:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------
         LN+    + +P+P+IE+LL ++ G+T                              G +E+LVM + ++ APA FQ  +N                   
Subjt:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------

Query:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL
                            ++  L  N+ KC    SQ++++G+ IS++G    ++ I         +N   LR FLG   Y R+F+    ++  PL  L
Subjt:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL

Query:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH
        L+K+  +KW+   T A EN+K  + +  +L   +++    +ETDAS V + AV+SQ       +P+  +S  +S    N  +  +E++ ++ S++ WRH+
Subjt:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH

Query:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----
        L      F I++D          E +P+          L  ++FEI Y+P   N I DALSR+    E + K+ + N          ++  K QVV    
Subjt:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----

Query:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK
                             L  G L+  K   L+P         I   H E    +  I            WK ++  +++YV+ C  CQ NK    K
Subjt:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK

Query:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI
        P G LQ IP S+R  E  +MDFI  LP   G + + VVVDRFSK + L+      +A+Q AR +D+ +
Subjt:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI

P0CT35 Transposon Tf2-2 polyprotein6.0e-4526.5Show/hide
Query:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH
        E E+ D Y+  K+   + + E+LP     L++ VE+ +    LP +         +A++  I   LK   I          PV+ V KK+G  R  VDY 
Subjt:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH

Query:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------
         LN+    + +P+P+IE+LL ++ G+T                              G +E+LVM + ++ APA FQ  +N                   
Subjt:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------

Query:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL
                            ++  L  N+ KC    SQ++++G+ IS++G    ++ I         +N   LR FLG   Y R+F+    ++  PL  L
Subjt:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL

Query:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH
        L+K+  +KW+   T A EN+K  + +  +L   +++    +ETDAS V + AV+SQ       +P+  +S  +S    N  +  +E++ ++ S++ WRH+
Subjt:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH

Query:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----
        L      F I++D          E +P+          L  ++FEI Y+P   N I DALSR+    E + K+ + N          ++  K QVV    
Subjt:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----

Query:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK
                             L  G L+  K   L+P         I   H E    +  I            WK ++  +++YV+ C  CQ NK    K
Subjt:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK

Query:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI
        P G LQ IP S+R  E  +MDFI  LP   G + + VVVDRFSK + L+      +A+Q AR +D+ +
Subjt:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI

P0CT36 Transposon Tf2-3 polyprotein6.0e-4526.5Show/hide
Query:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH
        E E+ D Y+  K+   + + E+LP     L++ VE+ +    LP +         +A++  I   LK   I          PV+ V KK+G  R  VDY 
Subjt:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH

Query:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------
         LN+    + +P+P+IE+LL ++ G+T                              G +E+LVM + ++ APA FQ  +N                   
Subjt:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------

Query:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL
                            ++  L  N+ KC    SQ++++G+ IS++G    ++ I         +N   LR FLG   Y R+F+    ++  PL  L
Subjt:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL

Query:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH
        L+K+  +KW+   T A EN+K  + +  +L   +++    +ETDAS V + AV+SQ       +P+  +S  +S    N  +  +E++ ++ S++ WRH+
Subjt:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH

Query:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----
        L      F I++D          E +P+          L  ++FEI Y+P   N I DALSR+    E + K+ + N          ++  K QVV    
Subjt:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----

Query:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK
                             L  G L+  K   L+P         I   H E    +  I            WK ++  +++YV+ C  CQ NK    K
Subjt:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK

Query:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI
        P G LQ IP S+R  E  +MDFI  LP   G + + VVVDRFSK + L+      +A+Q AR +D+ +
Subjt:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI

P0CT37 Transposon Tf2-4 polyprotein6.0e-4526.5Show/hide
Query:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH
        E E+ D Y+  K+   + + E+LP     L++ VE+ +    LP +         +A++  I   LK   I          PV+ V KK+G  R  VDY 
Subjt:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH

Query:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------
         LN+    + +P+P+IE+LL ++ G+T                              G +E+LVM + ++ APA FQ  +N                   
Subjt:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------

Query:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL
                            ++  L  N+ KC    SQ++++G+ IS++G    ++ I         +N   LR FLG   Y R+F+    ++  PL  L
Subjt:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL

Query:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH
        L+K+  +KW+   T A EN+K  + +  +L   +++    +ETDAS V + AV+SQ       +P+  +S  +S    N  +  +E++ ++ S++ WRH+
Subjt:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH

Query:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----
        L      F I++D          E +P+          L  ++FEI Y+P   N I DALSR+    E + K+ + N          ++  K QVV    
Subjt:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----

Query:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK
                             L  G L+  K   L+P         I   H E    +  I            WK ++  +++YV+ C  CQ NK    K
Subjt:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK

Query:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI
        P G LQ IP S+R  E  +MDFI  LP   G + + VVVDRFSK + L+      +A+Q AR +D+ +
Subjt:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI

P0CT41 Transposon Tf2-12 polyprotein6.0e-4526.5Show/hide
Query:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH
        E E+ D Y+  K+   + + E+LP     L++ VE+ +    LP +         +A++  I   LK   I          PV+ V KK+G  R  VDY 
Subjt:  EMEVEDNYE--KETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPK---------RAIDHRILTLLKHRPINVRPYKYGHVPVLLV-KKDGGWRFCVDYH

Query:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------
         LN+    + +P+P+IE+LL ++ G+T                              G +E+LVM + ++ APA FQ  +N                   
Subjt:  KLNQVTVSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMSFALTNAPATFQSLMN-------------------

Query:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL
                            ++  L  N+ KC    SQ++++G+ IS++G    ++ I         +N   LR FLG   Y R+F+    ++  PL  L
Subjt:  --------------------QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKL

Query:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH
        L+K+  +KW+   T A EN+K  + +  +L   +++    +ETDAS V + AV+SQ       +P+  +S  +S    N  +  +E++ ++ S++ WRH+
Subjt:  LQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNG-----HPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHH

Query:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----
        L      F I++D          E +P+          L  ++FEI Y+P   N I DALSR+    E + K+ + N          ++  K QVV    
Subjt:  LLG--RKFTIISDQTLKFLLEQREVQPQ---FLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFE-LNKELKKN---------FSEGGKFQVV----

Query:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK
                             L  G L+  K   L+P         I   H E    +  I            WK ++  +++YV+ C  CQ NK    K
Subjt:  -----------------NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRIS-------GELYWKHMKAGVKKYVEQCDICQQNKYEATK

Query:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI
        P G LQ IP S+R  E  +MDFI  LP   G + + VVVDRFSK + L+      +A+Q AR +D+ +
Subjt:  PAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ-ARKWDKLI

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.1e-2550Show/hide
Query:  QDNQLFANKNKCVIAHSQIQYLG--HMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFEN
        + +Q +AN+ KC     QI YLG  H+IS  GV AD  K+         +N + LRGFLGLTGYYRRFVK Y +I  PLT+LL+KN+ KW+E +  AF+ 
Subjt:  QDNQLFANKNKCVIAHSQIQYLG--HMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFEN

Query:  LKLAMNTLLILALPNWNIPF
        LK A+ TL +LALP+  +PF
Subjt:  LKLAMNTLLILALPNWNIPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTACATTGGTCATCTTTAACCATGACTTTTTGGGCAAAAGGCAAGCAAATTGTGCTTAAGGGAGATCCATCTCTGATTAAGGCAGAGTGTTTCTTGACGACATT
AGAGAAAACTTGGGATATAGAAGATCAAGGATTCCTCTTAGAATTCCAGAATTATGAGATGGAAGTAGAGGATAATTATGAAAAAGAAACAGAGGAAAAGAGAGATGAGG
AAGAATTACCCATGACTCAATCCTTGCTCAAGTATTATGTTGAGATATTTGAGACACCAAAGAGCCTACCTCCAAAAAGGGCTATTGATCACCGTATACTAACTCTACTG
AAACATAGACCTATCAACGTTAGACCTTACAAATATGGTCATGTTCCGGTGTTGTTAGTAAAGAAGGATGGGGGATGGAGATTTTGTGTCGATTATCACAAGCTAAACCA
AGTAACTGTGTCTGATAAGTTCCCAATCCCGGTGATTGAAGAGTTATTAGATGAATTGAATGGAGCAACAGGACACTACGAGTTCCTAGTTATGTCGTTCGCCCTCACAA
ATGCCCCCGCCACCTTCCAGTCATTGATGAATCAGGATAACCAATTATTTGCTAACAAGAATAAGTGTGTGATAGCCCATTCTCAAATCCAATATTTGGGACATATGATA
TCCAAGAGGGGAGTAGAGGCCGATGAGGATAAAATTAGGAACGTCTCTAGGTTAAGGGGTTTCTTGGGACTGACGGGCTATTATAGGAGGTTTGTTAAGGGCTATGAGGA
GATAGCTGCACCACTGACCAAGTTATTACAAAAAAATGCCTTCAAATGGAGTGAGGAATCAACAACTGCCTTCGAAAATCTTAAGCTAGCAATGAATACCTTACTGATTT
TGGCTTTACCAAATTGGAACATACCTTTCACTATTGAAACTGATGCGTCTGGTGTGAGATTAGAAGCTGTGATTTCTCAAAATGGCCATCCAATTGCTTTATTCAGCAAA
ATATTATCTACAAGAGCCCAAAACAAATTTATTTATGCAAGGGAATTGATGACTGTAGTTCTTTCCGTACAAAAGTGGAGACATCACTTGCTTGGGAGAAAGTTCACCAT
AATTTCGGATCAAACTCTCAAATTCTTATTAGAACAAAGAGAGGTGCAGCCCCAATTCCTGGAATGTTTGACAAAGCTCCTTGGCTATGATTTTGAGATACTATACCAGC
CAGAACTTCAAAACAAGATTGGTGATGCTCTATCAAGAATGGAACAACCTTTCGAACTGAACAAGGAATTGAAGAAGAACTTTTCGGAAGGAGGTAAGTTTCAGGTGGTG
AATGGGAGGCTACTTTATAAAGGAAGACTGGTAGGATTTAAAACCTCTTCTCTTATACCAAAGATACTACATACTTTTAATGATTCTATTCTTGGGGGTCACTCCGAATT
TTTAGGAACTTATAAAAGGATCAGTGGAGAACTATACTGGAAGCATATGAAGGCTGGTGTAAAGAAATATGTCGAACAATGCGATATCTGCCAACAAAACAAATATGAAG
CGACCAAGCCTGCTGGGCTACTTCAATCAATTCCCATTTCGGATAGAATATTAGAAGATTGGACAATGGACTTTATTAAGGGGTTACCCCCGGTCGGAGGAGTTGATGTG
ATCATGGTTGTAGTTGACCGATTCAGTAAGTATTCTTACTTGATACTGCTGAGACATCCATTTTCGGCTAAGCAAGCTAGAAAATGGGATAAATTGATTCCTTGGGCAGA
ATTGTGGTACAATACCACCTTCCATGCGTACACCAAGATCACCCCCTTCCAGATTGTATATGGCAGTTCCCCTCCTCCCCTGTTATCCTATGGTCATAAGAAGATTCCTA
ACAATGGGAACATAATGAAAAAAATGGCTGATTGGAAACGAAGGGAGCTGAAGTTTCAAGTAGGACATGAAGTCTACTTAAAATTGAGACCCTATCGACAACGTTCATTA
GCTTGGAAGAAGTGTGAAAACCATTTCTCAGTTAAAACTAAAGCCGGGAAAACAACAAGCCGCCAACATCAGCAACCTATCCTAACGAAAGAGTTTGAGCTGCAATTATG
GCCCGAAACAGTGTTGGGAGTTCGCTGGAACCAGGAACTGGGAGGAAATGAATGGTTTATTAAATGGAAAGGGCTACCGGACAATGAAGCAACATGGGAATCGGTCTTTC
AAACGAACCAACAATTCCATAATTTTCCCCTTGAGGACAAGTTGAACTTGGAGCCGAGGGGAGAGGGTGCTGCAGCCTGGGGAATGGACGAGCTTGACCTAAAGAGCTGG
AACCTCTACAGAGATACAGGTAATGAGGCAGTTGTTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTACATTGGTCATCTTTAACCATGACTTTTTGGGCAAAAGGCAAGCAAATTGTGCTTAAGGGAGATCCATCTCTGATTAAGGCAGAGTGTTTCTTGACGACATT
AGAGAAAACTTGGGATATAGAAGATCAAGGATTCCTCTTAGAATTCCAGAATTATGAGATGGAAGTAGAGGATAATTATGAAAAAGAAACAGAGGAAAAGAGAGATGAGG
AAGAATTACCCATGACTCAATCCTTGCTCAAGTATTATGTTGAGATATTTGAGACACCAAAGAGCCTACCTCCAAAAAGGGCTATTGATCACCGTATACTAACTCTACTG
AAACATAGACCTATCAACGTTAGACCTTACAAATATGGTCATGTTCCGGTGTTGTTAGTAAAGAAGGATGGGGGATGGAGATTTTGTGTCGATTATCACAAGCTAAACCA
AGTAACTGTGTCTGATAAGTTCCCAATCCCGGTGATTGAAGAGTTATTAGATGAATTGAATGGAGCAACAGGACACTACGAGTTCCTAGTTATGTCGTTCGCCCTCACAA
ATGCCCCCGCCACCTTCCAGTCATTGATGAATCAGGATAACCAATTATTTGCTAACAAGAATAAGTGTGTGATAGCCCATTCTCAAATCCAATATTTGGGACATATGATA
TCCAAGAGGGGAGTAGAGGCCGATGAGGATAAAATTAGGAACGTCTCTAGGTTAAGGGGTTTCTTGGGACTGACGGGCTATTATAGGAGGTTTGTTAAGGGCTATGAGGA
GATAGCTGCACCACTGACCAAGTTATTACAAAAAAATGCCTTCAAATGGAGTGAGGAATCAACAACTGCCTTCGAAAATCTTAAGCTAGCAATGAATACCTTACTGATTT
TGGCTTTACCAAATTGGAACATACCTTTCACTATTGAAACTGATGCGTCTGGTGTGAGATTAGAAGCTGTGATTTCTCAAAATGGCCATCCAATTGCTTTATTCAGCAAA
ATATTATCTACAAGAGCCCAAAACAAATTTATTTATGCAAGGGAATTGATGACTGTAGTTCTTTCCGTACAAAAGTGGAGACATCACTTGCTTGGGAGAAAGTTCACCAT
AATTTCGGATCAAACTCTCAAATTCTTATTAGAACAAAGAGAGGTGCAGCCCCAATTCCTGGAATGTTTGACAAAGCTCCTTGGCTATGATTTTGAGATACTATACCAGC
CAGAACTTCAAAACAAGATTGGTGATGCTCTATCAAGAATGGAACAACCTTTCGAACTGAACAAGGAATTGAAGAAGAACTTTTCGGAAGGAGGTAAGTTTCAGGTGGTG
AATGGGAGGCTACTTTATAAAGGAAGACTGGTAGGATTTAAAACCTCTTCTCTTATACCAAAGATACTACATACTTTTAATGATTCTATTCTTGGGGGTCACTCCGAATT
TTTAGGAACTTATAAAAGGATCAGTGGAGAACTATACTGGAAGCATATGAAGGCTGGTGTAAAGAAATATGTCGAACAATGCGATATCTGCCAACAAAACAAATATGAAG
CGACCAAGCCTGCTGGGCTACTTCAATCAATTCCCATTTCGGATAGAATATTAGAAGATTGGACAATGGACTTTATTAAGGGGTTACCCCCGGTCGGAGGAGTTGATGTG
ATCATGGTTGTAGTTGACCGATTCAGTAAGTATTCTTACTTGATACTGCTGAGACATCCATTTTCGGCTAAGCAAGCTAGAAAATGGGATAAATTGATTCCTTGGGCAGA
ATTGTGGTACAATACCACCTTCCATGCGTACACCAAGATCACCCCCTTCCAGATTGTATATGGCAGTTCCCCTCCTCCCCTGTTATCCTATGGTCATAAGAAGATTCCTA
ACAATGGGAACATAATGAAAAAAATGGCTGATTGGAAACGAAGGGAGCTGAAGTTTCAAGTAGGACATGAAGTCTACTTAAAATTGAGACCCTATCGACAACGTTCATTA
GCTTGGAAGAAGTGTGAAAACCATTTCTCAGTTAAAACTAAAGCCGGGAAAACAACAAGCCGCCAACATCAGCAACCTATCCTAACGAAAGAGTTTGAGCTGCAATTATG
GCCCGAAACAGTGTTGGGAGTTCGCTGGAACCAGGAACTGGGAGGAAATGAATGGTTTATTAAATGGAAAGGGCTACCGGACAATGAAGCAACATGGGAATCGGTCTTTC
AAACGAACCAACAATTCCATAATTTTCCCCTTGAGGACAAGTTGAACTTGGAGCCGAGGGGAGAGGGTGCTGCAGCCTGGGGAATGGACGAGCTTGACCTAAAGAGCTGG
AACCTCTACAGAGATACAGGTAATGAGGCAGTTGTTTGTTGA
Protein sequenceShow/hide protein sequence
MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKRAIDHRILTLL
KHRPINVRPYKYGHVPVLLVKKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGATGHYEFLVMSFALTNAPATFQSLMNQDNQLFANKNKCVIAHSQIQYLGHMI
SKRGVEADEDKIRNVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSK
ILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQTLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRMEQPFELNKELKKNFSEGGKFQVV
NGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDV
IMVVVDRFSKYSYLILLRHPFSAKQARKWDKLIPWAELWYNTTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNNGNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSL
AWKKCENHFSVKTKAGKTTSRQHQQPILTKEFELQLWPETVLGVRWNQELGGNEWFIKWKGLPDNEATWESVFQTNQQFHNFPLEDKLNLEPRGEGAAAWGMDELDLKSW
NLYRDTGNEAVVC