; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C11G218095 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C11G218095
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCla97Chr11:23795513..23800658
RNA-Seq ExpressionCla97C11G218095
SyntenyCla97C11G218095
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR000953 - Chromo/chromo shadow domain
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR023780 - Chromo domain
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059481.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0045.83Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV
        RAE FFEI+ L         +V  G Q E D         K+E  E              + + G   I  +++   S    K       LP   ESVL 
Subjt:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV

Query:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL
         AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL
Subjt:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL

Query:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        +DA+F+ + DK GLCF                 +EL + I  E   +EE E  +      +E   +E  E   +  R++   +   TMK++GIV GKE I
Subjt:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F     LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF S MN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  GVGAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQV   F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0045.63Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT
        RAE FFEI+ L         +V  G+      +   + R+     ++  R        G K+ G+ L R+  +                     ESVL  
Subjt:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT

Query:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT
        AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL+
Subjt:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT

Query:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        DA+F+ + DK GLCF                 +EL + I  E E  E+  E  EA     +E   +   E   +  R++   +   TMK++GIV GKE I
Subjt:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F+    LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

TYK21035.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0045.9Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV
        RAE FFEI+ L         +V  G Q E D         K+E  E              + + G   I  +++   S    K       LP   ESVL 
Subjt:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV

Query:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL
         AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL
Subjt:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL

Query:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        +DA+F+ + DK GLCF                 +EL + I  E    EE E  +      +E   +E  E   +  R++   +   TMK++GIV GKE I
Subjt:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F     LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

TYK27058.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0045.63Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT
        RAE FFEI+ L         +V  G+      +   + R+     ++  R        G K+ G+ L R+  +                     ESVL  
Subjt:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT

Query:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT
        AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL+
Subjt:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT

Query:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        DA+F+ + DK GLCF                 +EL + I  E E  E+  E  EA     +E   +   E   +  R++   +   TMK++GIV GKE I
Subjt:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F+    LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0045.63Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT
        RAE FFEI+ L         +V  G+      +   + R+     ++  R        G K+ G+ L R+  +                     ESVL  
Subjt:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT

Query:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT
        AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL+
Subjt:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT

Query:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        DA+F+ + DK GLCF                 +EL + I  E E  E+  E  EA     +E   +   E   +  R++   +   TMK++GIV GKE I
Subjt:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F+    LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

TrEMBL top hitse value%identityAlignment
A0A5A7V194 Ty3/gypsy retrotransposon protein0.0e+0045.83Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV
        RAE FFEI+ L         +V  G Q E D         K+E  E              + + G   I  +++   S    K       LP   ESVL 
Subjt:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV

Query:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL
         AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL
Subjt:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL

Query:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        +DA+F+ + DK GLCF                 +EL + I  E   +EE E  +      +E   +E  E   +  R++   +   TMK++GIV GKE I
Subjt:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F     LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF S MN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  GVGAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQV   F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

A0A5D3BBH7 Ty3/gypsy retrotransposon protein0.0e+0045.63Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT
        RAE FFEI+ L         +V  G+      +   + R+     ++  R        G K+ G+ L R+  +                     ESVL  
Subjt:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT

Query:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT
        AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL+
Subjt:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT

Query:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        DA+F+ + DK GLCF                 +EL + I  E E  E+  E  EA     +E   +   E   +  R++   +   TMK++GIV GKE I
Subjt:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F+    LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

A0A5D3DU86 Ty3/gypsy retrotransposon protein0.0e+0045.63Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT
        RAE FFEI+ L         +V  G+      +   + R+     ++  R        G K+ G+ L R+  +                     ESVL  
Subjt:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT

Query:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT
        AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL+
Subjt:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT

Query:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        DA+F+ + DK GLCF                 +EL + I  E E  E+  E  EA     +E   +   E   +  R++   +   TMK++GIV GKE I
Subjt:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F+    LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

A0A5D3DZK6 Ty3/gypsy retrotransposon protein0.0e+0045.63Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT
        RAE FFEI+ L         +V  G+      +   + R+     ++  R        G K+ G+ L R+  +                     ESVL  
Subjt:  RAETFFEIHELGS------RLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGS-LCRLPPK---------------------ESVLVT

Query:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT
        AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL+
Subjt:  AFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRLT

Query:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        DA+F+ + DK GLCF                 +EL + I  E E  E+  E  EA     +E   +   E   +  R++   +   TMK++GIV GKE I
Subjt:  DAKFKEKRDKEGLCF-----------------KELNVIIAGEVEELEDIQEEMEA----KIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F+    LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

A0A5D3E325 Ty3/gypsy retrotransposon protein0.0e+0045.9Show/hide
Query:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF
        MAQ+  EER E  EKE+  L+++++   K+++R++++LR ++ + + E G S                  +      +RSKYKKL MP+F GE P++W++
Subjt:  MAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVEEGES-----------------SQGRNGGFNRSKYKKLNMPVFDGEGPDAWIF

Query:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV
        RAE FFEI+ L         +V  G Q E D         K+E  E              + + G   I  +++   S    K       LP   ESVL 
Subjt:  RAETFFEIHELGS------RLVPLGKQPEAD--------PKLEGDE--------------RENAGVMEIHARRDSHESVAGDKARGSLCRLP-PKESVLV

Query:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL
         AFL GL P+L+AEVMSR    L+ CM+ AQ ++DR++ALKLA  E+   E  R                 ++ +  M  +    KG   +K E P++RL
Subjt:  TAFLNGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDR-----------------KREQSPMTGVKPGAKGGLSKKRETPMRRL

Query:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI
        +DA+F+ + DK GLCF                 +EL + I  E    EE E  +      +E   +E  E   +  R++   +   TMK++GIV GKE I
Subjt:  TDAKFKEKRDKEGLCF-----------------KELNVIIAGE---VEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAI

Query:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE
        VLID GATHNFIH +L  E  + +     +GI +G+G + +G+G+C  V + L+G+ +V D L + LG +DV+LGM WL TTG + +HWPSLTM F    
Subjt:  VLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGE

Query:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI
        ++V LKGDP+L + E SLK L KTWE EDQ +L++ Q  E E E  D + T    +EE LP MIQ LL +  D+F     LPPKR             PI
Subjt:  DQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE-EGEDKEPTIAEPEEERLPVMIQSLLERNGDIFELSRGLPPKR-----------DAPI

Query:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------
        NVRPYKYGH QK+EIEKLV EML TGIIRPS SPFSS VLLVKKKDG WRFCVDYRK+N+ TIA+KFPIPVIEEL   LHGA++FSKLDL          
Subjt:  NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEEL---LHGASMFSKLDL----------

Query:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH
                 R HEGHYEF+VMP GL NAP TF SLMN+VFKPFLR  +LVFFDDILVYS DI+ HE HL +VF  LR+N+L+ANRK+C+F+ S+I YLGH
Subjt:  ---------RKHEGHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGH

Query:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA
         +SK GVEAD +K+K+MLQWP PK+VT L  FLGLTGY               T  +Q +A + D + T A              +L         ETDA
Subjt:  WVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY---------------TDGVQ-HAKECDVHTTRA--------------SLTKLCFTLHHETDA

Query:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN
        S  G+GAVL Q S PIA+F+Q LSTRA+AKSIYERELMA+VL +QKWRHYLLG +FT++SDQKALKFL+EQ+EVQPQFQKWLTKLL YDFEILYQP LQN
Subjt:  SRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQN

Query:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY
        KAADALSRM    E++ L+TT +VD+EVV +EVE D++LQ +I +L+ +P    K+SL  G L+YK R                 +   LG      RTY
Subjt:  KAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKR-----------------YRRPLG------RTY

Query:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV
        KRM+GEL+W+GMK+D+K+YVE C ICQ+NKSEA  PAG+LQPLP+P++I  D TMDF+EGLPK+ G + IMVVVDRLSKY +F+ + HP++AKQVA  F+
Subjt:  KRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFV

Query:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV
         +I                                                                     W+ ++PW + WYNTT+H S  TTPF  V
Subjt:  REIA--------------------------------------------------------------------WYGWLPWVKYWYNTTYHVSINTTPFNIV

Query:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG
        YG    PLISY + K  ND VE  L  RD A+ ALKE+L +AQN                +G+EV+LKLRPYRQ+SLA+KR EKL+PK+YGPYR+ E IG
Subjt:  YGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQN----------------LGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIG

Query:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK
        +VAYRL+LP EA+IHNVFH SQL  K+                      PE +LG+ W+ +L   EWL+KW+  P+SEATWE    +  QFP  HLEDK
Subjt:  QVAYRLELPREATIHNVFHASQLNKKVR---------------------PEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.8e-6625.71Show/hide
Query:  AEPEEERLPVMIQSLLERNGDIFELSRGLPPKRDAPINVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATI
        AE   E+LP  I+  LE   ++ + +  LP        +R Y     +   +   + + L +GIIR SK+  +  V+ V KK+G  R  VDY+ +N+   
Subjt:  AEPEEERLPVMIQSLLERNGDIFELSRGLPPKRDAPINVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATI

Query:  AEKFPIPVIEELL---HGASMFSKLDL---------RKHEGH----------YEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDIS
           +P+P+IE+LL    G+++F+KLDL         RK + H          +E+LVMP G+  AP  F   +N +      + ++ + DDIL++S+  S
Subjt:  AEKFPIPVIEELL---HGASMFSKLDL---------RKHEGH----------YEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDIS

Query:  THETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------------
         H  H+  V   L+   L  N+ +C F QS+++++G+ +S++G     E I  +LQW  PKN  +L  FLG   Y                         
Subjt:  THETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------------

Query:  ------TDGVQHAKECDVHTTRASLTKLCFTLHHETDASRMGVGAVLMQES-----RPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGS--K
              T  +++ K+C V             +  ETDAS + VGAVL Q+       P+ Y++  +S      S+ ++E++A++  ++ WRHYL  +   
Subjt:  ------TDGVQHAKECDVHTTRASLTKLCFTLHHETDASRMGVGAVLMQES-----RPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGS--K

Query:  FTVISDQKAL--KFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQNKAADALSRMPTPTE----------IRVLTTTTLVDLEVVQQEVEMDKDLQNII
        F +++D + L  +   E +    +  +W   L +++FEI Y+P   N  ADALSR+   TE          I  +   ++ D    Q   E   D + + 
Subjt:  FTVISDQKAL--KFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQNKAADALSRMPTPTE----------IRVLTTTTLVDLEVVQQEVEMDKDLQNII

Query:  AKLKEDPDSVPKFSLERGRLLYKK---------RYRRPLGRTYKR---------------MTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQP
            ED        L+ G L+  K         +  R + + Y                 +     W+G++K ++EYV++C  CQ NKS    P G LQP
Subjt:  AKLKEDPDSVPKFSLERGRLLYKK---------RYRRPLGRTYKR---------------MTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQP

Query:  LPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVF-VREIAWYG-----------------W--------------LP
        +P  E+ W  ++MDF+  LP+S G +++ VVVDR SK    +  +    A+Q A++F  R IA++G                 W              LP
Subjt:  LPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVF-VREIAWYG-----------------W--------------LP

Query:  W------------------------------------VKYWYNTTYHVSINTTPFNIVYGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMA
        +                                    V+  YN   H +   TPF IV+  +  P +S  E    +D  ++      +    +KEHL   
Subjt:  W------------------------------------VKYWYNTTYHVSINTTPFNIVYGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMA

Query:  QNLGNEVFLKLR--------------PYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPR--EATIHNVFHASQLNK
         N+  + +  ++                R K+    +  KL+P F GP+ +L++ G   Y L+LP   +    + FH S L K
Subjt:  QNLGNEVFLKLR--------------PYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPR--EATIHNVFHASQLNK

P0CT35 Transposon Tf2-2 polyprotein2.8e-6625.71Show/hide
Query:  AEPEEERLPVMIQSLLERNGDIFELSRGLPPKRDAPINVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATI
        AE   E+LP  I+  LE   ++ + +  LP        +R Y     +   +   + + L +GIIR SK+  +  V+ V KK+G  R  VDY+ +N+   
Subjt:  AEPEEERLPVMIQSLLERNGDIFELSRGLPPKRDAPINVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATI

Query:  AEKFPIPVIEELL---HGASMFSKLDL---------RKHEGH----------YEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDIS
           +P+P+IE+LL    G+++F+KLDL         RK + H          +E+LVMP G+  AP  F   +N +      + ++ + DDIL++S+  S
Subjt:  AEKFPIPVIEELL---HGASMFSKLDL---------RKHEGH----------YEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDIS

Query:  THETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------------
         H  H+  V   L+   L  N+ +C F QS+++++G+ +S++G     E I  +LQW  PKN  +L  FLG   Y                         
Subjt:  THETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------------

Query:  ------TDGVQHAKECDVHTTRASLTKLCFTLHHETDASRMGVGAVLMQES-----RPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGS--K
              T  +++ K+C V             +  ETDAS + VGAVL Q+       P+ Y++  +S      S+ ++E++A++  ++ WRHYL  +   
Subjt:  ------TDGVQHAKECDVHTTRASLTKLCFTLHHETDASRMGVGAVLMQES-----RPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGS--K

Query:  FTVISDQKAL--KFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQNKAADALSRMPTPTE----------IRVLTTTTLVDLEVVQQEVEMDKDLQNII
        F +++D + L  +   E +    +  +W   L +++FEI Y+P   N  ADALSR+   TE          I  +   ++ D    Q   E   D + + 
Subjt:  FTVISDQKAL--KFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQNKAADALSRMPTPTE----------IRVLTTTTLVDLEVVQQEVEMDKDLQNII

Query:  AKLKEDPDSVPKFSLERGRLLYKK---------RYRRPLGRTYKR---------------MTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQP
            ED        L+ G L+  K         +  R + + Y                 +     W+G++K ++EYV++C  CQ NKS    P G LQP
Subjt:  AKLKEDPDSVPKFSLERGRLLYKK---------RYRRPLGRTYKR---------------MTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQP

Query:  LPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVF-VREIAWYG-----------------W--------------LP
        +P  E+ W  ++MDF+  LP+S G +++ VVVDR SK    +  +    A+Q A++F  R IA++G                 W              LP
Subjt:  LPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVF-VREIAWYG-----------------W--------------LP

Query:  W------------------------------------VKYWYNTTYHVSINTTPFNIVYGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMA
        +                                    V+  YN   H +   TPF IV+  +  P +S  E    +D  ++      +    +KEHL   
Subjt:  W------------------------------------VKYWYNTTYHVSINTTPFNIVYGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMA

Query:  QNLGNEVFLKLR--------------PYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPR--EATIHNVFHASQLNK
         N+  + +  ++                R K+    +  KL+P F GP+ +L++ G   Y L+LP   +    + FH S L K
Subjt:  QNLGNEVFLKLR--------------PYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPR--EATIHNVFHASQLNK

P0CT41 Transposon Tf2-12 polyprotein2.8e-6625.71Show/hide
Query:  AEPEEERLPVMIQSLLERNGDIFELSRGLPPKRDAPINVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATI
        AE   E+LP  I+  LE   ++ + +  LP        +R Y     +   +   + + L +GIIR SK+  +  V+ V KK+G  R  VDY+ +N+   
Subjt:  AEPEEERLPVMIQSLLERNGDIFELSRGLPPKRDAPINVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATI

Query:  AEKFPIPVIEELL---HGASMFSKLDL---------RKHEGH----------YEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDIS
           +P+P+IE+LL    G+++F+KLDL         RK + H          +E+LVMP G+  AP  F   +N +      + ++ + DDIL++S+  S
Subjt:  AEKFPIPVIEELL---HGASMFSKLDL---------RKHEGH----------YEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDIS

Query:  THETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------------
         H  H+  V   L+   L  N+ +C F QS+++++G+ +S++G     E I  +LQW  PKN  +L  FLG   Y                         
Subjt:  THETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------------

Query:  ------TDGVQHAKECDVHTTRASLTKLCFTLHHETDASRMGVGAVLMQES-----RPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGS--K
              T  +++ K+C V             +  ETDAS + VGAVL Q+       P+ Y++  +S      S+ ++E++A++  ++ WRHYL  +   
Subjt:  ------TDGVQHAKECDVHTTRASLTKLCFTLHHETDASRMGVGAVLMQES-----RPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGS--K

Query:  FTVISDQKAL--KFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQNKAADALSRMPTPTE----------IRVLTTTTLVDLEVVQQEVEMDKDLQNII
        F +++D + L  +   E +    +  +W   L +++FEI Y+P   N  ADALSR+   TE          I  +   ++ D    Q   E   D + + 
Subjt:  FTVISDQKAL--KFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQNKAADALSRMPTPTE----------IRVLTTTTLVDLEVVQQEVEMDKDLQNII

Query:  AKLKEDPDSVPKFSLERGRLLYKK---------RYRRPLGRTYKR---------------MTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQP
            ED        L+ G L+  K         +  R + + Y                 +     W+G++K ++EYV++C  CQ NKS    P G LQP
Subjt:  AKLKEDPDSVPKFSLERGRLLYKK---------RYRRPLGRTYKR---------------MTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQP

Query:  LPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVF-VREIAWYG-----------------W--------------LP
        +P  E+ W  ++MDF+  LP+S G +++ VVVDR SK    +  +    A+Q A++F  R IA++G                 W              LP
Subjt:  LPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVF-VREIAWYG-----------------W--------------LP

Query:  W------------------------------------VKYWYNTTYHVSINTTPFNIVYGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMA
        +                                    V+  YN   H +   TPF IV+  +  P +S  E    +D  ++      +    +KEHL   
Subjt:  W------------------------------------VKYWYNTTYHVSINTTPFNIVYGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMA

Query:  QNLGNEVFLKLR--------------PYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPR--EATIHNVFHASQLNK
         N+  + +  ++                R K+    +  KL+P F GP+ +L++ G   Y L+LP   +    + FH S L K
Subjt:  QNLGNEVFLKLR--------------PYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPR--EATIHNVFHASQLNK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.9e-6827.19Show/hide
Query:  LPPKRDAPI-------NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEELLH---GASM
        +P K D  I        ++PY      + EI K+V ++L    I PSKSP SS V+LV KKDG +R CVDYR +N+ATI++ FP+P I+ LL     A +
Subjt:  LPPKRDAPI-------NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEELLH---GASM

Query:  FSKLDLRK--HE-----------------GHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFAN
        F+ LDL    H+                 G YE+ VMP GL+NAP+TF   M + F+     F+ V+ DDIL++SE    H  HL  V   L+   L   
Subjt:  FSKLDLRK--HE-----------------GHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFAN

Query:  RKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------TDGVQHAKECD--VHTTRASL------
        +K+C F+    ++LG+ +  + +     K  A+  +PTPK V +   FLG+  Y                    D  Q  ++ D  +   +A+L      
Subjt:  RKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------TDGVQHAKECD--VHTTRASL------

Query:  ----TKLCFTLHHETDASRMGVGAVLMQESRP------IAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQ
             K  + L   TDAS+ G+GAVL +          + YF+++L +  +     E EL+ ++  +  +R+ L G  FT+ +D  +L  L  + E   +
Subjt:  ----TKLCFTLHHETDASRMGVGAVLMQESRP------IAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQ

Query:  FQKWLTKLLEYDFEILYQPELQNKAADALSR---MPTPTEIRVLTT--------------TTLVDLEVVQQEVEMDKDL---QNIIAKLKEDPDSVPKFS
         Q+WL  L  YDF + Y    +N  ADA+SR     TP   R + T                L+ ++ + Q     +D+   ++   KL+        +S
Subjt:  FQKWLTKLLEYDFEILYQPELQNKAADALSR---MPTPTEIRVLTT--------------TTLVDLEVVQQEVEMDKDL---QNIIAKLKEDPDSVPKFS

Query:  LERGRLLYKKRYRRPLGR----------------------TYKRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFV
        LE   + Y+ R   P+ +                      T  +++   YW  ++  + +Y+ +CV CQ  KS      GLLQPLP+ E  W DI+MDFV
Subjt:  LERGRLLYKKRYRRPLGR----------------------TYKRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFV

Query:  EGL-PKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFVREI------------------------------------------------------
         GL P S   + I+VVVDR SK  HFIA     +A Q+  +  R I                                                      
Subjt:  EGL-PKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFVREI------------------------------------------------------

Query:  --------------AWYGWLPWVKYWYNTTYHVSINTTPFNIVYG-TAALPLISYEENKVSNDAVEQQLMARDKAL-VALKEHLGMAQ------------
                       W+ +LP +++ YN+T   ++  +PF I  G     P I  ++   +      +L    KAL +  KE L  AQ            
Subjt:  --------------AWYGWLPWVKYWYNTTYHVSINTTPFNIVYG-TAALPLISYEENKVSNDAVEQQLMARDKAL-VALKEHLGMAQ------------

Query:  ----NLGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPREATIHNVFHASQL
            N+G+ V +    +R     K    K+   + GP+R++++I   AY L+L      H V +   L
Subjt:  ----NLGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPREATIHNVFHASQL

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.3e-6827.2Show/hide
Query:  LPPKRDAPI-------NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEELLH---GASM
        +P K D  I        ++PY      + EI K+V ++L    I PSKSP SS V+LV KKDG +R CVDYR +N+ATI++ FP+P I+ LL     A +
Subjt:  LPPKRDAPI-------NVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEELLH---GASM

Query:  FSKLDLRK--HE-----------------GHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFAN
        F+ LDL    H+                 G YE+ VMP GL+NAP+TF   M + F+     F+ V+ DDIL++SE    H  HL  V   L+   L   
Subjt:  FSKLDLRK--HE-----------------GHYEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFAN

Query:  RKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------TDGVQHAKECDVHTTRASLTKLC----
        +K+C F+    ++LG+ +  + +     K  A+  +PTPK V +   FLG+  Y                    D  Q  ++ D    +     LC    
Subjt:  RKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY-------------------TDGVQHAKECDVHTTRASLTKLC----

Query:  -------FTLHHETDASRMGVGAVLMQESRP------IAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQF
                     TDAS+ G+GAVL +          + YF+++L +  +     E EL+ ++  +  +R+ L G  FT+ +D  +L  L  + E   + 
Subjt:  -------FTLHHETDASRMGVGAVLMQESRP------IAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKALKFLIEQQEVQPQF

Query:  QKWLTKLLEYDFEILYQPELQNKAADALSRMP---TPTEIRVLTT--------------TTLVDLEVVQQEVEMDKDL---QNIIAKLKEDPDSVPKFSL
        Q+WL  L  YDF + Y    +N  ADA+SR     TP   R + T                L+ ++ + Q     +D+   ++   KL+        +SL
Subjt:  QKWLTKLLEYDFEILYQPELQNKAADALSRMP---TPTEIRVLTT--------------TTLVDLEVVQQEVEMDKDL---QNIIAKLKEDPDSVPKFSL

Query:  ERGRLLYKKRYRRPLGR----------------------TYKRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVE
        E   + Y+ R   P+ +                      T  +++   YW  ++  + +Y+ +CV CQ  KS      GLLQPLP+ E  W DI+MDFV 
Subjt:  ERGRLLYKKRYRRPLGR----------------------TYKRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVE

Query:  GL-PKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFVREI-------------------------------------------------------
        GL P S   + I+VVVDR SK  HFIA     +A Q+  +  R I                                                       
Subjt:  GL-PKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFVREI-------------------------------------------------------

Query:  -------------AWYGWLPWVKYWYNTTYHVSINTTPFNIVYG-TAALPLISYEENKVSNDAVEQQLMARDKAL-VALKEHLGMAQ-------------
                      W+ +LP +++ YN+T   ++  +PF I  G     P I  ++   +      +L    KAL +  KE L  AQ             
Subjt:  -------------AWYGWLPWVKYWYNTTYHVSINTTPFNIVYG-TAALPLISYEENKVSNDAVEQQLMARDKAL-VALKEHLGMAQ-------------

Query:  ---NLGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPREATIHNVFHASQLNKKV-RPE
           N+G+ V +    +R     K    K+   + GP+R++++I   AY L+L      H V +   L K V RP+
Subjt:  ---NLGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIGQVAYRLELPREATIHNVFHASQLNKKV-RPE

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.1e-1227.09Show/hide
Query:  QSPMTGVKP-GAKGGLSKKRETPMRRLTDAKFKEKRDKEGLCFKELNVIIAGEVEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGI
        Q+ +  +KP G     S++ E     L  AK    + K+G         +  E+EELE     +   +E+  +++  N              + M+  G 
Subjt:  QSPMTGVKP-GAKGGLSKKRETPMRRLTDAKFKEKRDKEGLCFKELNVIIAGEVEELEDIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGI

Query:  VGGKEAIVLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGK--MDVILGMAWLSTTGFIGVHWPS
        +   + +V ID GAT NFI  +LA  L L  + T    +++G  + I+  G C  + + +Q + I  +FL L+L K  +DVILG  WLS  G   V+W +
Subjt:  VGGKEAIVLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGK--MDVILGMAWLSTTGFIGVHWPS

Query:  LTMTFAVGEDQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE
           +F+  +  +TL  +    + +V+ KV  K+    +QE + E +N + E
Subjt:  LTMTFAVGEDQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAE

AT3G30770.1 Eukaryotic aspartyl protease family protein4.2e-0927.44Show/hide
Query:  QLALRSVYGFSAPRTMKMKGIVGGKEAIVLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGK--M
        Q+  +S   F+  + M+  G +   + +V+ID GAT+NFI  +LA  L L  + T    +++G  + I+  G C  + +++Q + I  +FL L+L K  +
Subjt:  QLALRSVYGFSAPRTMKMKGIVGGKEAIVLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVNDFLPLELGK--M

Query:  DVILGMAWLSTTGFIGVHWPSLTMTFAVGEDQVTL-KGDPSLTKTEVSLKVLTKTWETEDQEYL
        DVILG           + W +   +F   +  VTL   D  L +    +K+ ++  + +   YL
Subjt:  DVILGMAWLSTTGFIGVHWPSLTMTFAVGEDQVTL-KGDPSLTKTEVSLKVLTKTWETEDQEYL

ATMG00850.1 DNA/RNA polymerases superfamily protein1.4e-0452.5Show/hide
Query:  VQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEW
        +++  ++  +GEML   II+PS SP+SS VLLV+KKDG W
Subjt:  VQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEW

ATMG00860.1 DNA/RNA polymerases superfamily protein1.3e-1552.05Show/hide
Query:  HLAVVFNVLRENKLFANRKRCIFSQSRIQYLG--HWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY
        HL +V  +  +++ +ANRK+C F Q +I YLG  H +S EGV AD  K++AM+ WP PKN T+L  FLGLTGY
Subjt:  HLAVVFNVLRENKLFANRKRCIFSQSRIQYLG--HWVSKEGVEADGEKIKAMLQWPTPKNVTKLCSFLGLTGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTGTAAGGGGAAGTGGGCTTCTGTTGGGCCGGGTATCCGACAGTTATCTGGGAATTTGGGGAAGGGAGAGGCCAGCACTCTAGAACTTGCTGGGTATCTTGTTCT
TCTATTTTGTGTCTTCTTGTGGATACCTAACAAATTGGTATCAAAGCTCAGTTCAATGCTAGGAATCGCGAGACAAATGGCACAGAAAATGGCGGAGGAAAGATTCGAAG
CTCATGAGAAGGAGTTGACGGTGTTGAGGGACTTGGTAGTTGCGACGTGCAAGAACGTGGAGAGAATGTCAGAGGACCTGCGGACAAAGAACAAGGAAGCAGAGGTCGAA
GAAGGCGAGTCATCACAGGGGAGGAATGGGGGATTCAACCGGAGCAAATATAAGAAGCTCAATATGCCCGTTTTCGATGGAGAAGGCCCCGACGCGTGGATATTTCGGGC
GGAGACTTTCTTCGAGATTCATGAACTAGGTAGTCGATTGGTACCGCTGGGCAAACAACCGGAAGCCGATCCGAAATTGGAAGGAGATGAAAGAGAGAATGCTGGAGTGA
TGGAGATCCACGCTAGAAGGGACTCTCACGAATCAGTTGCTGGCGATAAAGCAAGAGGGAGCTTATGCAGACTACCGCCAAAAGAGAGTGTTTTAGTGACCGCGTTTCTG
AATGGGCTGCATCCATCACTGAAAGCCGAGGTCATGAGTAGGAGACTCATGGGCCTGGACCAGTGTATGTTAGAAGCCCAATCCATAGATGACCGAGATATTGCGCTGAA
GTTAGCCCTCGAAGAAGTCAAGTTTCAGGAGTCCGATCGAAAAAGGGAGCAGTCTCCAATGACCGGAGTGAAACCAGGAGCCAAGGGGGGTCTGAGCAAGAAGAGAGAAA
CGCCGATGAGGAGATTGACTGATGCCAAGTTTAAGGAGAAGAGGGATAAGGAGGGGCTTTGTTTCAAGGAGCTGAATGTGATCATAGCCGGCGAAGTAGAGGAACTTGAA
GATATCCAAGAAGAAATGGAGGCTAAGATAGAAAAAAAGGCAATGGAAGTAGCGGAAAATGTGCAGCTAGCATTGAGATCCGTTTATGGGTTTTCGGCACCAAGAACAAT
GAAGATGAAGGGAATTGTGGGAGGGAAGGAAGCCATCGTCCTCATCGACTGCGGGGCGACCCACAACTTTATCCACCAGAAACTAGCAGATGAACTAAACCTCCAAGTAG
CCGAGACTTTGAATTATGGTATTGTGGTGGGGAATGGAAAGGCCATTCGAGGCCAAGGCGTGTGCAAGTCGGTTATGGTAATGCTCCAAGGGATCACGATAGTCAACGAT
TTCCTACCCTTAGAGCTGGGCAAGATGGATGTTATATTGGGGATGGCATGGCTGAGTACCACCGGATTTATAGGGGTGCATTGGCCCTCATTAACCATGACTTTCGCCGT
GGGAGAGGACCAGGTAACGCTAAAAGGGGACCCGTCATTAACCAAGACAGAAGTGAGTTTGAAGGTTTTGACGAAGACGTGGGAAACAGAGGACCAGGAGTATTTAGTTG
AAATTCAGAACAAGGAGGCGGAAGAAGGGGAAGATAAGGAGCCGACGATAGCTGAACCCGAGGAAGAGCGGTTGCCCGTGATGATTCAAAGCCTTTTGGAGCGCAATGGT
GATATATTTGAACTGTCGAGGGGATTACCCCCCAAGCGGGATGCACCAATCAATGTAAGGCCATACAAGTATGGGCATGTTCAGAAGGATGAGATAGAGAAACTCGTAGG
AGAAATGTTGTCAACGGGAATTATAAGGCCAAGTAAAAGTCCATTTTCGAGTCTGGTACTCCTAGTTAAGAAAAAGGATGGGGAATGGAGATTTTGTGTGGATTATCGTA
AGGTGAACCAGGCCACCATTGCGGAAAAATTCCCGATTCCGGTGATTGAAGAGCTGTTACATGGGGCTAGCATGTTTTCCAAGCTAGATTTGAGAAAGCACGAAGGGCAC
TACGAGTTCTTGGTCATGCCGCTTGGATTGATCAACGCACCAACAACTTTTCTATCCCTAATGAACGAGGTTTTTAAACCGTTTCTAAGAACGTTCATTTTGGTATTCTT
TGACGACATCCTTGTCTATAGTGAGGATATTTCTACACATGAGACTCATTTGGCGGTGGTTTTCAACGTCTTGAGGGAGAACAAGTTGTTTGCCAACAGGAAGAGGTGTA
TTTTCAGCCAGTCAAGGATACAATATTTGGGGCATTGGGTTTCTAAGGAAGGCGTGGAGGCCGATGGTGAGAAAATTAAAGCCATGTTGCAATGGCCTACCCCTAAAAAT
GTGACCAAGCTGTGTAGTTTTTTGGGGCTCACAGGTTATACAGATGGCGTTCAACATGCTAAAGAATGCGATGTCCACACTACCCGTGCTAGCCTTACCAAACTTTGCTT
TACCCTTCACCATGAGACTGATGCTTCTAGGATGGGGGTAGGGGCGGTTCTGATGCAGGAATCTCGGCCAATAGCTTACTTCAACCAAGCTCTATCAACAAGAGCGAGGG
CTAAATCGATATACGAAAGAGAATTGATGGCAATGGTGCTCGTCATGCAAAAATGGAGGCATTATTTGTTGGGTAGTAAGTTTACCGTCATTTCTGACCAGAAGGCCCTC
AAGTTCCTCATAGAGCAACAAGAAGTACAACCCCAATTCCAGAAGTGGTTAACGAAATTACTGGAGTACGACTTTGAAATCCTATATCAACCTGAATTACAGAATAAGGC
GGCGGATGCTTTGTCGAGGATGCCAACACCGACGGAGATTCGAGTGCTAACTACTACGACATTGGTGGATCTGGAGGTAGTGCAGCAAGAGGTCGAGATGGATAAGGACC
TACAGAATATTATTGCCAAGCTAAAGGAGGATCCCGACTCGGTCCCAAAATTCTCACTTGAGAGGGGAAGATTACTTTACAAGAAACGTTATAGGAGGCCACTCGGGCGT
ACATACAAGAGAATGACGGGGGAACTTTATTGGAGGGGCATGAAGAAGGATGTAAAAGAATACGTTGAAAGTTGTGTAATTTGCCAGAAAAACAAGTCAGAAGCAGTCAG
TCCAGCCGGGTTGTTGCAGCCTCTTCCGGTCCCAGAGAAAATATGGGGAGATATCACCATGGATTTTGTGGAAGGATTACCCAAGTCACAAGGGAAGGACTCCATCATGG
TGGTGGTGGATAGACTGAGCAAATATGGGCACTTCATAGCCCTCAGCCACCCATATAATGCGAAACAGGTAGCTCAAGTCTTTGTTAGGGAAATTGCATGGTATGGGTGG
CTACCATGGGTCAAGTATTGGTATAATACTACCTATCATGTTTCAATCAACACCACCCCTTTCAACATTGTTTATGGCACAGCAGCCCTGCCATTGATTTCTTATGAGGA
GAACAAGGTATCGAATGACGCAGTGGAACAACAACTAATGGCGAGAGATAAGGCCTTGGTGGCTCTCAAGGAACACTTGGGAATGGCCCAAAACCTGGGAAATGAGGTCT
TTTTGAAATTGAGACCATATCGCCAAAAGTCGTTAGCCAAGAAGAGATGCGAGAAACTTTCCCCCAAGTTTTACGGGCCATATAGGATGCTTGAAAGAATTGGGCAAGTG
GCCTATCGTCTTGAGCTTCCAAGGGAGGCTACAATCCACAATGTGTTCCACGCATCACAATTGAATAAGAAGGTTCGTCCGGAGGATATATTGGGGGTGCATTGGAATTC
AAAGTTGAGCAAAGAGGAGTGGTTAATCAAATGGGAGAAGTTTCCGGAAAGCGAAGCGACATGGGAGGAAGCAGCATTCATCAAGTACCAATTCCCCGAACTTCACCTTG
AGGACAAGGGGAAGTGGGCTTCGGTTGGGCCGGGAATTCGGCACTTATCTAGGAATTCGGGGAGGGGTTGGCAACCGTTGGGAAGCACTCTAGAACTTGTTGAGTATCTT
GTTCTTCTATTTTGTGTCTTCTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTGTAAGGGGAAGTGGGCTTCTGTTGGGCCGGGTATCCGACAGTTATCTGGGAATTTGGGGAAGGGAGAGGCCAGCACTCTAGAACTTGCTGGGTATCTTGTTCT
TCTATTTTGTGTCTTCTTGTGGATACCTAACAAATTGGTATCAAAGCTCAGTTCAATGCTAGGAATCGCGAGACAAATGGCACAGAAAATGGCGGAGGAAAGATTCGAAG
CTCATGAGAAGGAGTTGACGGTGTTGAGGGACTTGGTAGTTGCGACGTGCAAGAACGTGGAGAGAATGTCAGAGGACCTGCGGACAAAGAACAAGGAAGCAGAGGTCGAA
GAAGGCGAGTCATCACAGGGGAGGAATGGGGGATTCAACCGGAGCAAATATAAGAAGCTCAATATGCCCGTTTTCGATGGAGAAGGCCCCGACGCGTGGATATTTCGGGC
GGAGACTTTCTTCGAGATTCATGAACTAGGTAGTCGATTGGTACCGCTGGGCAAACAACCGGAAGCCGATCCGAAATTGGAAGGAGATGAAAGAGAGAATGCTGGAGTGA
TGGAGATCCACGCTAGAAGGGACTCTCACGAATCAGTTGCTGGCGATAAAGCAAGAGGGAGCTTATGCAGACTACCGCCAAAAGAGAGTGTTTTAGTGACCGCGTTTCTG
AATGGGCTGCATCCATCACTGAAAGCCGAGGTCATGAGTAGGAGACTCATGGGCCTGGACCAGTGTATGTTAGAAGCCCAATCCATAGATGACCGAGATATTGCGCTGAA
GTTAGCCCTCGAAGAAGTCAAGTTTCAGGAGTCCGATCGAAAAAGGGAGCAGTCTCCAATGACCGGAGTGAAACCAGGAGCCAAGGGGGGTCTGAGCAAGAAGAGAGAAA
CGCCGATGAGGAGATTGACTGATGCCAAGTTTAAGGAGAAGAGGGATAAGGAGGGGCTTTGTTTCAAGGAGCTGAATGTGATCATAGCCGGCGAAGTAGAGGAACTTGAA
GATATCCAAGAAGAAATGGAGGCTAAGATAGAAAAAAAGGCAATGGAAGTAGCGGAAAATGTGCAGCTAGCATTGAGATCCGTTTATGGGTTTTCGGCACCAAGAACAAT
GAAGATGAAGGGAATTGTGGGAGGGAAGGAAGCCATCGTCCTCATCGACTGCGGGGCGACCCACAACTTTATCCACCAGAAACTAGCAGATGAACTAAACCTCCAAGTAG
CCGAGACTTTGAATTATGGTATTGTGGTGGGGAATGGAAAGGCCATTCGAGGCCAAGGCGTGTGCAAGTCGGTTATGGTAATGCTCCAAGGGATCACGATAGTCAACGAT
TTCCTACCCTTAGAGCTGGGCAAGATGGATGTTATATTGGGGATGGCATGGCTGAGTACCACCGGATTTATAGGGGTGCATTGGCCCTCATTAACCATGACTTTCGCCGT
GGGAGAGGACCAGGTAACGCTAAAAGGGGACCCGTCATTAACCAAGACAGAAGTGAGTTTGAAGGTTTTGACGAAGACGTGGGAAACAGAGGACCAGGAGTATTTAGTTG
AAATTCAGAACAAGGAGGCGGAAGAAGGGGAAGATAAGGAGCCGACGATAGCTGAACCCGAGGAAGAGCGGTTGCCCGTGATGATTCAAAGCCTTTTGGAGCGCAATGGT
GATATATTTGAACTGTCGAGGGGATTACCCCCCAAGCGGGATGCACCAATCAATGTAAGGCCATACAAGTATGGGCATGTTCAGAAGGATGAGATAGAGAAACTCGTAGG
AGAAATGTTGTCAACGGGAATTATAAGGCCAAGTAAAAGTCCATTTTCGAGTCTGGTACTCCTAGTTAAGAAAAAGGATGGGGAATGGAGATTTTGTGTGGATTATCGTA
AGGTGAACCAGGCCACCATTGCGGAAAAATTCCCGATTCCGGTGATTGAAGAGCTGTTACATGGGGCTAGCATGTTTTCCAAGCTAGATTTGAGAAAGCACGAAGGGCAC
TACGAGTTCTTGGTCATGCCGCTTGGATTGATCAACGCACCAACAACTTTTCTATCCCTAATGAACGAGGTTTTTAAACCGTTTCTAAGAACGTTCATTTTGGTATTCTT
TGACGACATCCTTGTCTATAGTGAGGATATTTCTACACATGAGACTCATTTGGCGGTGGTTTTCAACGTCTTGAGGGAGAACAAGTTGTTTGCCAACAGGAAGAGGTGTA
TTTTCAGCCAGTCAAGGATACAATATTTGGGGCATTGGGTTTCTAAGGAAGGCGTGGAGGCCGATGGTGAGAAAATTAAAGCCATGTTGCAATGGCCTACCCCTAAAAAT
GTGACCAAGCTGTGTAGTTTTTTGGGGCTCACAGGTTATACAGATGGCGTTCAACATGCTAAAGAATGCGATGTCCACACTACCCGTGCTAGCCTTACCAAACTTTGCTT
TACCCTTCACCATGAGACTGATGCTTCTAGGATGGGGGTAGGGGCGGTTCTGATGCAGGAATCTCGGCCAATAGCTTACTTCAACCAAGCTCTATCAACAAGAGCGAGGG
CTAAATCGATATACGAAAGAGAATTGATGGCAATGGTGCTCGTCATGCAAAAATGGAGGCATTATTTGTTGGGTAGTAAGTTTACCGTCATTTCTGACCAGAAGGCCCTC
AAGTTCCTCATAGAGCAACAAGAAGTACAACCCCAATTCCAGAAGTGGTTAACGAAATTACTGGAGTACGACTTTGAAATCCTATATCAACCTGAATTACAGAATAAGGC
GGCGGATGCTTTGTCGAGGATGCCAACACCGACGGAGATTCGAGTGCTAACTACTACGACATTGGTGGATCTGGAGGTAGTGCAGCAAGAGGTCGAGATGGATAAGGACC
TACAGAATATTATTGCCAAGCTAAAGGAGGATCCCGACTCGGTCCCAAAATTCTCACTTGAGAGGGGAAGATTACTTTACAAGAAACGTTATAGGAGGCCACTCGGGCGT
ACATACAAGAGAATGACGGGGGAACTTTATTGGAGGGGCATGAAGAAGGATGTAAAAGAATACGTTGAAAGTTGTGTAATTTGCCAGAAAAACAAGTCAGAAGCAGTCAG
TCCAGCCGGGTTGTTGCAGCCTCTTCCGGTCCCAGAGAAAATATGGGGAGATATCACCATGGATTTTGTGGAAGGATTACCCAAGTCACAAGGGAAGGACTCCATCATGG
TGGTGGTGGATAGACTGAGCAAATATGGGCACTTCATAGCCCTCAGCCACCCATATAATGCGAAACAGGTAGCTCAAGTCTTTGTTAGGGAAATTGCATGGTATGGGTGG
CTACCATGGGTCAAGTATTGGTATAATACTACCTATCATGTTTCAATCAACACCACCCCTTTCAACATTGTTTATGGCACAGCAGCCCTGCCATTGATTTCTTATGAGGA
GAACAAGGTATCGAATGACGCAGTGGAACAACAACTAATGGCGAGAGATAAGGCCTTGGTGGCTCTCAAGGAACACTTGGGAATGGCCCAAAACCTGGGAAATGAGGTCT
TTTTGAAATTGAGACCATATCGCCAAAAGTCGTTAGCCAAGAAGAGATGCGAGAAACTTTCCCCCAAGTTTTACGGGCCATATAGGATGCTTGAAAGAATTGGGCAAGTG
GCCTATCGTCTTGAGCTTCCAAGGGAGGCTACAATCCACAATGTGTTCCACGCATCACAATTGAATAAGAAGGTTCGTCCGGAGGATATATTGGGGGTGCATTGGAATTC
AAAGTTGAGCAAAGAGGAGTGGTTAATCAAATGGGAGAAGTTTCCGGAAAGCGAAGCGACATGGGAGGAAGCAGCATTCATCAAGTACCAATTCCCCGAACTTCACCTTG
AGGACAAGGGGAAGTGGGCTTCGGTTGGGCCGGGAATTCGGCACTTATCTAGGAATTCGGGGAGGGGTTGGCAACCGTTGGGAAGCACTCTAGAACTTGTTGAGTATCTT
GTTCTTCTATTTTGTGTCTTCTTGTAA
Protein sequenceShow/hide protein sequence
MGCKGKWASVGPGIRQLSGNLGKGEASTLELAGYLVLLFCVFLWIPNKLVSKLSSMLGIARQMAQKMAEERFEAHEKELTVLRDLVVATCKNVERMSEDLRTKNKEAEVE
EGESSQGRNGGFNRSKYKKLNMPVFDGEGPDAWIFRAETFFEIHELGSRLVPLGKQPEADPKLEGDERENAGVMEIHARRDSHESVAGDKARGSLCRLPPKESVLVTAFL
NGLHPSLKAEVMSRRLMGLDQCMLEAQSIDDRDIALKLALEEVKFQESDRKREQSPMTGVKPGAKGGLSKKRETPMRRLTDAKFKEKRDKEGLCFKELNVIIAGEVEELE
DIQEEMEAKIEKKAMEVAENVQLALRSVYGFSAPRTMKMKGIVGGKEAIVLIDCGATHNFIHQKLADELNLQVAETLNYGIVVGNGKAIRGQGVCKSVMVMLQGITIVND
FLPLELGKMDVILGMAWLSTTGFIGVHWPSLTMTFAVGEDQVTLKGDPSLTKTEVSLKVLTKTWETEDQEYLVEIQNKEAEEGEDKEPTIAEPEEERLPVMIQSLLERNG
DIFELSRGLPPKRDAPINVRPYKYGHVQKDEIEKLVGEMLSTGIIRPSKSPFSSLVLLVKKKDGEWRFCVDYRKVNQATIAEKFPIPVIEELLHGASMFSKLDLRKHEGH
YEFLVMPLGLINAPTTFLSLMNEVFKPFLRTFILVFFDDILVYSEDISTHETHLAVVFNVLRENKLFANRKRCIFSQSRIQYLGHWVSKEGVEADGEKIKAMLQWPTPKN
VTKLCSFLGLTGYTDGVQHAKECDVHTTRASLTKLCFTLHHETDASRMGVGAVLMQESRPIAYFNQALSTRARAKSIYERELMAMVLVMQKWRHYLLGSKFTVISDQKAL
KFLIEQQEVQPQFQKWLTKLLEYDFEILYQPELQNKAADALSRMPTPTEIRVLTTTTLVDLEVVQQEVEMDKDLQNIIAKLKEDPDSVPKFSLERGRLLYKKRYRRPLGR
TYKRMTGELYWRGMKKDVKEYVESCVICQKNKSEAVSPAGLLQPLPVPEKIWGDITMDFVEGLPKSQGKDSIMVVVDRLSKYGHFIALSHPYNAKQVAQVFVREIAWYGW
LPWVKYWYNTTYHVSINTTPFNIVYGTAALPLISYEENKVSNDAVEQQLMARDKALVALKEHLGMAQNLGNEVFLKLRPYRQKSLAKKRCEKLSPKFYGPYRMLERIGQV
AYRLELPREATIHNVFHASQLNKKVRPEDILGVHWNSKLSKEEWLIKWEKFPESEATWEEAAFIKYQFPELHLEDKGKWASVGPGIRHLSRNSGRGWQPLGSTLELVEYL
VLLFCVFL