; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0013230 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0013230
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr1:48586147..48594044
RNA-Seq ExpressionLag0013230
SyntenyLag0013230
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR000953 - Chromo/chromo shadow domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0056.33Show/hide
Query:  MGLEAIMKAAQRIEDKNRAIES---KTSFGPGRVWKNPIISHANPPKLVEQN-----PIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        +GL  +MK A +IE++    +     +++      +     + N   ++E       P++TITL         +E P +RLTD+E Q RREKGLC+RC E
Subjt:  MGLEAIMKAAQRIEDKNRAIES---KTSFGPGRVWKNPIISHANPPKLVEQN-----PIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+H GH+CK    KEL++LLV +   E    +E+    E  ++  E+ +VE    ELSIN+VVG ++PGTMKVKGR+  +EVV+L+DCGATHNFI++KL+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
        E +++P+ ET NYG+I+G+ T++KGKG+C K+ + L    V + FLP+EL GVDAILGMQWL ++G T +DWK LT+      +K+T+KGDPSLTKT++S
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK + ++W   DQG+LVE R +     +    D             +L ++ DVF  P+RLPP R I+H I LK G+ PINVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPST----SPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPST    SPVLLV+K+DGSWRFCVDYRALN  TVPDKFPIPV+EEL DEL+GA +FSK+DL+ GYHQIR+H EDI KTAFRTHEGHYEF+V
Subjt:  EMLVAGVIRPST----SPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E H +HL  VL ++R++ L  N  KC FAK RI YLGH+IS  G+E D +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRELRGFLGLTGYYR+FV+NYG+ + PLTQL +  A++WTE+ + AFE LK+AMMTLPVLA+PDF+  F +E+DAS  G+GAVL Q+++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        + LST+ +++P+YE+ELMA+V AVQ+WRPYLLG++F ++TDQ++L+FL++Q VIQP+YQ+WI KLLGY+FEV Y+PGLENKAADALSR+ P  HL  L+A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA+LDV++I++EV +D  L+EI+  +       P ++ HQG L +KGRLV+S  S LIP ++H+YHDSV GGHSGFLRTYKR+  ELYW+GMK  +++Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQ+NK+  LSPAGLL PL IP+ IW DISMDF+EGLPKS G++ I VVVDRLSKYAHFL L HP++AK+VA +F+KEVVRLHGFP+SIVSDRDKI
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW EMF+L GT+L RS+SYHPQTDGQTE VNK +E YLRCFCGE+P+ W  WL WAEYWYNTTYH S+  TPF+ VYGR PP L+ YG+ +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD  +  L+  L  AQ+RMKK+ADL+RR+ +++ G++V++KLRPYRQ+++  +RNEKLSPKYFGPY+ILE+IG +AY+L+LP  A+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G+ + ++ L P + E  EWI +PE++ AYR N  T EWE LVQWKGLP  E TWE+    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

KAA0050511.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0057.42Show/hide
Query:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        GL  +MK A +IE++           A +SKT   P +  KN I +           P++TITL         +E P +RL+D+E Q RREKGLC+RC E
Subjt:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+  GH+CK    KEL++L+V +   E    +E+    E  +++V+   VE    ELS+N+VVG ++PGTMKVKG++  +EVVIL+DCGATHNFI++ L+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
          + + + ET NYG+I+G+ TAVKGKG+C+ + + L+G  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSLTK  VS
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK L ++WG  DQGFLVE R I     +    D  P   N   +  LL+++A VF  PS LPP R IDH I LK+G+ P+NVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPS    +SPVLLV+KRDGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSK+DL+ GYHQIR+  EDI KTAFRTHEGHYEFLV
Subjt:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E HL+HL  VL ++++  L VNM KC FAK RI YLGH+IS  G+EAD +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRE+RGFLGLTGYYR+FV+NYG  A PLTQL ++ A++W E+ + AF  LKRAMMTLPVL +PDFS  F +E+DAS  G+GAVLTQ ++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        ++LS + +S+P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L++L++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR+ P   L  ++A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA++DV+I+K E  QD  L+EI+R +       P ++L QG L +KGRLV+S++S L+P +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  + +Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQRNK+  L+PAGLL PL IP+ IW DISMDF+EGLPKS+G+D I VVVDRLSKY+HFL L HPF+AK VA  FIKEVVRLHG+PRSIVSDRDK+
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P++W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD T+  L+  L  AQ+RMKK AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPKYFGPYR+LE+IG +AYRL+LP EA+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G  E V+ L P + EN EWI +PE+V  YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

KAA0055700.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0057.3Show/hide
Query:  MGLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCD
        +GL  +MK A +IE++           A +SK      +  K+PI +           P++TITL         +E P +RL+D+E Q RREKGLC+RC 
Subjt:  MGLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCD

Query:  EKFHVGHKCKG---KELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKL
        EK+  GH+CK    KEL++L+V +   E    +E+    E  ++  E P VE    ELS+N+VVG ++PGTMKVKG++ ++EVVIL+DCGATHNFI++ L
Subjt:  EKFHVGHKCKG---KELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKL

Query:  IEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEV
        +  + + + ET NYG+I+G+ TAVKGKG+C+ + + LDG  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSL KT V
Subjt:  IEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEV

Query:  SLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMI
        SLK L +TWG  DQGFLVE R +     +             PI   LL+Q+A VF  P+ LPP R I+H I LK+G+ P+NVRPYRY H QKEE+E+++
Subjt:  SLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMI

Query:  REMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFL
         EML +G+IRPS    +SPVLLV+K+DGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSKLDL+ GYHQIR+  EDI KTAFRTHEGHYEFL
Subjt:  REMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFL

Query:  VMPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVE
        VMPFGLTNAPSTFQA                           ++ H++HL  VL +++D  L VNM KC FAK RI YLGH+IS  G+EAD +K R++ E
Subjt:  VMPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVE

Query:  WPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYF
        WP PTNVRE+RGFLGLTGYYR+FV+NYGA A PLTQL ++  ++W  +A++AF  LK+AMMTLPVL +PDF+  F +E+DAS +G+GAVLTQ ++P+AYF
Subjt:  WPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYF

Query:  SQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLS
        S++LS + +++P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L+FL++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR PPAVHL+ ++
Subjt:  SQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLS

Query:  APAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQY
        AP ++D++IIK E   D  LQEI R L     + P ++L QG L +KGRLV+ ++S LIP +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  I +Y
Subjt:  APAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQY

Query:  VEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDK
         E+C  CQRNK+  LSPAGLL PL IP+ IW DISMDF+EGLPKS+G+D IFVVVDRLSKY HFL L HPFSAK VA  F+KEVVRLHG+PRSIVSDRDK
Subjt:  VEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDK

Query:  IFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNA
        +F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P +W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+
Subjt:  IFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNA

Query:  TLDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFH
        TLD+QL +RD T+  L+  L  AQ+RMKK+AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPK+FGPYR+LE+IG +AY+L+LP  A+IHPVFH
Subjt:  TLDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFH

Query:  VSQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        VSQL++A+G  E V+ L P +  N EWI  PE+V +YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  VSQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

TYK06572.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0057.27Show/hide
Query:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        GL  +MK A +IE++           A +SKT   P +  KN I +           P++TITL         +E P +RL+D+E Q RREKGLC+RC E
Subjt:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+  GH+CK    KEL++L+V +   E    +E+    E  +++V+   VE    ELS+N+VVG ++PGTMKVKG++  +EVVIL+DCGATHNFI++ L+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
          + + + ET NYG+I+G+ TAVKGKG+C+ + + L+G  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSLTK  VS
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK L ++WG  DQGFLVE R I     +    D  P   N   +  LL+++A VF  PS LPP R IDH I +K+G+ P+NVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPS    +SPVLLV+KRDGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSK+DL+ GYHQIR+  EDI KTAFRTHEGHYEFLV
Subjt:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E H +HL  VL +++   L VNM KC FAK RI YLGH+IS  G+EAD +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRE+RGFLGLTGYYR+FV+NYG  A PLTQL ++ A++W E+ + AF  LKRAMMTLPVL +PDFS  F +E+DAS  G+GAVLTQ ++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        ++LS + +S+P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L++L++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR+ P   L  ++A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA++DV+I+K E  QD  L+EI+R +       P ++L QG L +KGRLV+S++S L+P +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  + +Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQRNK+  L+PAGLL PL IP+ IW DISMDF+EGLPKS+G+D I VVVDRLSKY+HFL L HPF+AK VA  FIKEVVRLHG+PRSIVSDRDK+
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P++W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD T+  L+  L  AQ+RMKK AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPKYFGPYR+LE+IG +AYRL+LP EA+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G  E V+ L P + EN EWI +PE+V  YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

TYK24654.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0057.27Show/hide
Query:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        GL  +MK A +IE++           A +SKT   P +  KN I +           P++TITL         +E P +RL+D+E Q RREKGLC+RC E
Subjt:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+  GH+CK    KEL++L+V +   E    +E+    E  +++V+   VE    ELS+N+VVG ++PGTMKVKG++  +EVVIL+DCGATHNFI++ L+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
          + + + ET NYG+I+G+ TAVKGKG+C+ + + L+G  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSLTK  VS
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK L ++WG  DQGFLVE R I     +    D  P   N   +  LL+++A VF  PS LPP R IDH I +K+G+ P+NVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPS    +SPVLLV+KRDGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSK+DL+ GYHQIR+  EDI KTAFRTHEGHYEFLV
Subjt:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E H +HL  VL +++   L VNM KC FAK RI YLGH+IS  G+EAD +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRE+RGFLGLTGYYR+FV+NYG  A PLTQL ++ A++W E+ + AF  LKRAMMTLPVL +PDFS  F +E+DAS  G+GAVLTQ ++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        ++LS + +S+P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L++L++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR+ P   L  ++A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA++DV+I+K E  QD  L+EI+R +       P ++L QG L +KGRLV+S++S L+P +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  + +Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQRNK+  L+PAGLL PL IP+ IW DISMDF+EGLPKS+G+D I VVVDRLSKY+HFL L HPF+AK VA  FIKEVVRLHG+PRSIVSDRDK+
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P++W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD T+  L+  L  AQ+RMKK AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPKYFGPYR+LE+IG +AYRL+LP EA+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G  E V+ L P + EN EWI +PE+V  YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

TrEMBL top hitse value%identityAlignment
A0A5A7U2S1 Ty3/gypsy retrotransposon protein0.0e+0056.33Show/hide
Query:  MGLEAIMKAAQRIEDKNRAIES---KTSFGPGRVWKNPIISHANPPKLVEQN-----PIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        +GL  +MK A +IE++    +     +++      +     + N   ++E       P++TITL         +E P +RLTD+E Q RREKGLC+RC E
Subjt:  MGLEAIMKAAQRIEDKNRAIES---KTSFGPGRVWKNPIISHANPPKLVEQN-----PIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+H GH+CK    KEL++LLV +   E    +E+    E  ++  E+ +VE    ELSIN+VVG ++PGTMKVKGR+  +EVV+L+DCGATHNFI++KL+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
        E +++P+ ET NYG+I+G+ T++KGKG+C K+ + L    V + FLP+EL GVDAILGMQWL ++G T +DWK LT+      +K+T+KGDPSLTKT++S
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK + ++W   DQG+LVE R +     +    D             +L ++ DVF  P+RLPP R I+H I LK G+ PINVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPST----SPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPST    SPVLLV+K+DGSWRFCVDYRALN  TVPDKFPIPV+EEL DEL+GA +FSK+DL+ GYHQIR+H EDI KTAFRTHEGHYEF+V
Subjt:  EMLVAGVIRPST----SPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E H +HL  VL ++R++ L  N  KC FAK RI YLGH+IS  G+E D +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRELRGFLGLTGYYR+FV+NYG+ + PLTQL +  A++WTE+ + AFE LK+AMMTLPVLA+PDF+  F +E+DAS  G+GAVL Q+++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        + LST+ +++P+YE+ELMA+V AVQ+WRPYLLG++F ++TDQ++L+FL++Q VIQP+YQ+WI KLLGY+FEV Y+PGLENKAADALSR+ P  HL  L+A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA+LDV++I++EV +D  L+EI+  +       P ++ HQG L +KGRLV+S  S LIP ++H+YHDSV GGHSGFLRTYKR+  ELYW+GMK  +++Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQ+NK+  LSPAGLL PL IP+ IW DISMDF+EGLPKS G++ I VVVDRLSKYAHFL L HP++AK+VA +F+KEVVRLHGFP+SIVSDRDKI
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW EMF+L GT+L RS+SYHPQTDGQTE VNK +E YLRCFCGE+P+ W  WL WAEYWYNTTYH S+  TPF+ VYGR PP L+ YG+ +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD  +  L+  L  AQ+RMKK+ADL+RR+ +++ G++V++KLRPYRQ+++  +RNEKLSPKYFGPY+ILE+IG +AY+L+LP  A+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G+ + ++ L P + E  EWI +PE++ AYR N  T EWE LVQWKGLP  E TWE+    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

A0A5A7UAE4 Ty3/gypsy retrotransposon protein0.0e+0057.42Show/hide
Query:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        GL  +MK A +IE++           A +SKT   P +  KN I +           P++TITL         +E P +RL+D+E Q RREKGLC+RC E
Subjt:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+  GH+CK    KEL++L+V +   E    +E+    E  +++V+   VE    ELS+N+VVG ++PGTMKVKG++  +EVVIL+DCGATHNFI++ L+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
          + + + ET NYG+I+G+ TAVKGKG+C+ + + L+G  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSLTK  VS
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK L ++WG  DQGFLVE R I     +    D  P   N   +  LL+++A VF  PS LPP R IDH I LK+G+ P+NVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPS    +SPVLLV+KRDGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSK+DL+ GYHQIR+  EDI KTAFRTHEGHYEFLV
Subjt:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E HL+HL  VL ++++  L VNM KC FAK RI YLGH+IS  G+EAD +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRE+RGFLGLTGYYR+FV+NYG  A PLTQL ++ A++W E+ + AF  LKRAMMTLPVL +PDFS  F +E+DAS  G+GAVLTQ ++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        ++LS + +S+P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L++L++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR+ P   L  ++A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA++DV+I+K E  QD  L+EI+R +       P ++L QG L +KGRLV+S++S L+P +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  + +Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQRNK+  L+PAGLL PL IP+ IW DISMDF+EGLPKS+G+D I VVVDRLSKY+HFL L HPF+AK VA  FIKEVVRLHG+PRSIVSDRDK+
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P++W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD T+  L+  L  AQ+RMKK AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPKYFGPYR+LE+IG +AYRL+LP EA+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G  E V+ L P + EN EWI +PE+V  YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

A0A5A7UKN8 Ty3/gypsy retrotransposon protein0.0e+0057.3Show/hide
Query:  MGLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCD
        +GL  +MK A +IE++           A +SK      +  K+PI +           P++TITL         +E P +RL+D+E Q RREKGLC+RC 
Subjt:  MGLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCD

Query:  EKFHVGHKCKG---KELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKL
        EK+  GH+CK    KEL++L+V +   E    +E+    E  ++  E P VE    ELS+N+VVG ++PGTMKVKG++ ++EVVIL+DCGATHNFI++ L
Subjt:  EKFHVGHKCKG---KELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKL

Query:  IEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEV
        +  + + + ET NYG+I+G+ TAVKGKG+C+ + + LDG  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSL KT V
Subjt:  IEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEV

Query:  SLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMI
        SLK L +TWG  DQGFLVE R +     +             PI   LL+Q+A VF  P+ LPP R I+H I LK+G+ P+NVRPYRY H QKEE+E+++
Subjt:  SLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMI

Query:  REMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFL
         EML +G+IRPS    +SPVLLV+K+DGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSKLDL+ GYHQIR+  EDI KTAFRTHEGHYEFL
Subjt:  REMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFL

Query:  VMPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVE
        VMPFGLTNAPSTFQA                           ++ H++HL  VL +++D  L VNM KC FAK RI YLGH+IS  G+EAD +K R++ E
Subjt:  VMPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVE

Query:  WPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYF
        WP PTNVRE+RGFLGLTGYYR+FV+NYGA A PLTQL ++  ++W  +A++AF  LK+AMMTLPVL +PDF+  F +E+DAS +G+GAVLTQ ++P+AYF
Subjt:  WPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYF

Query:  SQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLS
        S++LS + +++P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L+FL++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR PPAVHL+ ++
Subjt:  SQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLS

Query:  APAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQY
        AP ++D++IIK E   D  LQEI R L     + P ++L QG L +KGRLV+ ++S LIP +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  I +Y
Subjt:  APAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQY

Query:  VEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDK
         E+C  CQRNK+  LSPAGLL PL IP+ IW DISMDF+EGLPKS+G+D IFVVVDRLSKY HFL L HPFSAK VA  F+KEVVRLHG+PRSIVSDRDK
Subjt:  VEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDK

Query:  IFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNA
        +F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P +W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+
Subjt:  IFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNA

Query:  TLDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFH
        TLD+QL +RD T+  L+  L  AQ+RMKK+AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPK+FGPYR+LE+IG +AY+L+LP  A+IHPVFH
Subjt:  TLDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFH

Query:  VSQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        VSQL++A+G  E V+ L P +  N EWI  PE+V +YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  VSQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

A0A5D3C5N7 Ty3/gypsy retrotransposon protein0.0e+0057.27Show/hide
Query:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        GL  +MK A +IE++           A +SKT   P +  KN I +           P++TITL         +E P +RL+D+E Q RREKGLC+RC E
Subjt:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+  GH+CK    KEL++L+V +   E    +E+    E  +++V+   VE    ELS+N+VVG ++PGTMKVKG++  +EVVIL+DCGATHNFI++ L+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
          + + + ET NYG+I+G+ TAVKGKG+C+ + + L+G  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSLTK  VS
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK L ++WG  DQGFLVE R I     +    D  P   N   +  LL+++A VF  PS LPP R IDH I +K+G+ P+NVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPS    +SPVLLV+KRDGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSK+DL+ GYHQIR+  EDI KTAFRTHEGHYEFLV
Subjt:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E H +HL  VL +++   L VNM KC FAK RI YLGH+IS  G+EAD +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRE+RGFLGLTGYYR+FV+NYG  A PLTQL ++ A++W E+ + AF  LKRAMMTLPVL +PDFS  F +E+DAS  G+GAVLTQ ++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        ++LS + +S+P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L++L++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR+ P   L  ++A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA++DV+I+K E  QD  L+EI+R +       P ++L QG L +KGRLV+S++S L+P +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  + +Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQRNK+  L+PAGLL PL IP+ IW DISMDF+EGLPKS+G+D I VVVDRLSKY+HFL L HPF+AK VA  FIKEVVRLHG+PRSIVSDRDK+
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P++W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD T+  L+  L  AQ+RMKK AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPKYFGPYR+LE+IG +AYRL+LP EA+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G  E V+ L P + EN EWI +PE+V  YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

A0A5D3DM31 Ty3/gypsy retrotransposon protein0.0e+0057.27Show/hide
Query:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE
        GL  +MK A +IE++           A +SKT   P +  KN I +           P++TITL         +E P +RL+D+E Q RREKGLC+RC E
Subjt:  GLEAIMKAAQRIEDKN---------RAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPN-NPSHPPKETPLRRLTDSEMQLRREKGLCYRCDE

Query:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI
        K+  GH+CK    KEL++L+V +   E    +E+    E  +++V+   VE    ELS+N+VVG ++PGTMKVKG++  +EVVIL+DCGATHNFI++ L+
Subjt:  KFHVGHKCK---GKELKVLLVADEEPETPFSKEK-ETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLI

Query:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS
          + + + ET NYG+I+G+ TAVKGKG+C+ + + L+G  V + FLP++L GVD ILGMQWL +LGVT +DWK L +       K+ ++GDPSLTK  VS
Subjt:  EDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVS

Query:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR
        LK L ++WG  DQGFLVE R I     +    D  P   N   +  LL+++A VF  PS LPP R IDH I +K+G+ P+NVRPYRY H QKEE+E+++ 
Subjt:  LKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIR

Query:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV
        EML +G+IRPS    +SPVLLV+KRDGSWRFCVDYRALN  T+PDKFPIPVIEEL DEL GA +FSK+DL+ GYHQIR+  EDI KTAFRTHEGHYEFLV
Subjt:  EMLVAGVIRPS----TSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLV

Query:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW
        MPFGLTNAPSTFQA                           +E H +HL  VL +++   L VNM KC FAK RI YLGH+IS  G+EAD +K R++ EW
Subjt:  MPFGLTNAPSTFQA-------------------------PDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADADKFRSMVEW

Query:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS
        P P NVRE+RGFLGLTGYYR+FV+NYG  A PLTQL ++ A++W E+ + AF  LKRAMMTLPVL +PDFS  F +E+DAS  G+GAVLTQ ++P+AYFS
Subjt:  PRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFS

Query:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA
        ++LS + +S+P+YE+EL+A+V AVQ+WRPYLLG++F ++TDQ++L++L++Q V+QP+YQKW++KLLGY+FEV Y+PGLENKAADALSR+ P   L  ++A
Subjt:  QSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSA

Query:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV
        PA++DV+I+K E  QD  L+EI+R +       P ++L QG L +KGRLV+S++S L+P +LH+YHDSV GGHSGFLRTYKR+T E+YW+GMK  + +Y 
Subjt:  PAILDVQIIKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYV

Query:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI
        E+C  CQRNK+  L+PAGLL PL IP+ IW DISMDF+EGLPKS+G+D I VVVDRLSKY+HFL L HPF+AK VA  FIKEVVRLHG+PRSIVSDRDK+
Subjt:  EDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKI

Query:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT
        F+SHFW E+FRL GT+L RS+SYHPQ+DGQTE VNK +ETYLRCFCGE+P++W  WL WAEYWYNTTYHSS+  TPF+ VYGR PP L++YGD +T N+T
Subjt:  FVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNAT

Query:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV
        LD+QL +RD T+  L+  L  AQ+RMKK AD +RRE +++EG+ V++KLRPYRQ+++  +RNEKLSPKYFGPYR+LE+IG +AYRL+LP EA+IHPVFHV
Subjt:  LDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEGEWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHV

Query:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV
        SQL++A+G  E V+ L P + EN EWI +PE+V  YR NP T EWE L+ WKGLP  E TWES    + QFP F LEDKV
Subjt:  SQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKV

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.3e-13429.76Show/hide
Query:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS
        L+D GA  N I+++ +   K+P T   +  +I G     K      K+ I+L+GI++  +FL ++     A         +  TT+        I++  S
Subjt:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS

Query:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN
        K TL                             ++  ++  V +P + D             + +++ D+ +E +  +LP P + ++  + L   +  + 
Subjt:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN

Query:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT
        +R Y  P  + + +   I + L +G+IR S +    PV+ V K++G+ R  VDY+ LN+   P+ +P+P+IE+LL ++ G+ IF+KLDL+  YH IRV  
Subjt:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT

Query:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW
         D  K AFR   G +E+LVMP+G++ AP+ FQ                         +     H++H+  VL  +++ +L +N  KC F + ++ ++G+ 
Subjt:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW

Query:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA
        IS  G     +    +++W +P N +ELR FLG   Y RKF+        PL  L ++D  ++WT    +A EN+K+ +++ PVL   DFS+  ++ETDA
Subjt:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA

Query:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE
        S + +GAVL+Q        P+ Y+S  +S    +  + +KE++AI+K+++ WR YL    + F I TD + L  R   +         +W   L  +NFE
Subjt:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE

Query:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI
        + YRPG  N  ADALSR+       P      S   +  + I       +  E   D +L  ++ +   D  +     L  G L+  K ++++ ++++L 
Subjt:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI

Query:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS
          ++  YH+     H G       I     W+G++  I++YV++C TCQ NK+    P G LQP+P  ER WE +SMDF+  LP+S GY+ +FVVVDR S
Subjt:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS

Query:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS
        K A  +P +   +A+  A +F + V+   G P+ I++D D IF S  W +        ++ S  Y PQTDGQTE  N+ +E  LRC C   P  W   +S
Subjt:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS

Query:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP
          +  YN   HS+   TPF +V+   P   P  L     KT+         N  +TI + +T    L     +MKKY D++ +E ++++ G+ V +K   
Subjt:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP

Query:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE
         R  T    ++ KL+P + GP+ +L+K GP  Y LDLP+         FHVS L +   ++E
Subjt:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE

P0CT35 Transposon Tf2-2 polyprotein4.3e-13429.76Show/hide
Query:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS
        L+D GA  N I+++ +   K+P T   +  +I G     K      K+ I+L+GI++  +FL ++     A         +  TT+        I++  S
Subjt:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS

Query:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN
        K TL                             ++  ++  V +P + D             + +++ D+ +E +  +LP P + ++  + L   +  + 
Subjt:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN

Query:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT
        +R Y  P  + + +   I + L +G+IR S +    PV+ V K++G+ R  VDY+ LN+   P+ +P+P+IE+LL ++ G+ IF+KLDL+  YH IRV  
Subjt:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT

Query:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW
         D  K AFR   G +E+LVMP+G++ AP+ FQ                         +     H++H+  VL  +++ +L +N  KC F + ++ ++G+ 
Subjt:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW

Query:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA
        IS  G     +    +++W +P N +ELR FLG   Y RKF+        PL  L ++D  ++WT    +A EN+K+ +++ PVL   DFS+  ++ETDA
Subjt:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA

Query:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE
        S + +GAVL+Q        P+ Y+S  +S    +  + +KE++AI+K+++ WR YL    + F I TD + L  R   +         +W   L  +NFE
Subjt:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE

Query:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI
        + YRPG  N  ADALSR+       P      S   +  + I       +  E   D +L  ++ +   D  +     L  G L+  K ++++ ++++L 
Subjt:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI

Query:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS
          ++  YH+     H G       I     W+G++  I++YV++C TCQ NK+    P G LQP+P  ER WE +SMDF+  LP+S GY+ +FVVVDR S
Subjt:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS

Query:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS
        K A  +P +   +A+  A +F + V+   G P+ I++D D IF S  W +        ++ S  Y PQTDGQTE  N+ +E  LRC C   P  W   +S
Subjt:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS

Query:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP
          +  YN   HS+   TPF +V+   P   P  L     KT+         N  +TI + +T    L     +MKKY D++ +E ++++ G+ V +K   
Subjt:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP

Query:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE
         R  T    ++ KL+P + GP+ +L+K GP  Y LDLP+         FHVS L +   ++E
Subjt:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE

P0CT36 Transposon Tf2-3 polyprotein4.3e-13429.76Show/hide
Query:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS
        L+D GA  N I+++ +   K+P T   +  +I G     K      K+ I+L+GI++  +FL ++     A         +  TT+        I++  S
Subjt:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS

Query:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN
        K TL                             ++  ++  V +P + D             + +++ D+ +E +  +LP P + ++  + L   +  + 
Subjt:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN

Query:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT
        +R Y  P  + + +   I + L +G+IR S +    PV+ V K++G+ R  VDY+ LN+   P+ +P+P+IE+LL ++ G+ IF+KLDL+  YH IRV  
Subjt:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT

Query:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW
         D  K AFR   G +E+LVMP+G++ AP+ FQ                         +     H++H+  VL  +++ +L +N  KC F + ++ ++G+ 
Subjt:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW

Query:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA
        IS  G     +    +++W +P N +ELR FLG   Y RKF+        PL  L ++D  ++WT    +A EN+K+ +++ PVL   DFS+  ++ETDA
Subjt:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA

Query:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE
        S + +GAVL+Q        P+ Y+S  +S    +  + +KE++AI+K+++ WR YL    + F I TD + L  R   +         +W   L  +NFE
Subjt:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE

Query:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI
        + YRPG  N  ADALSR+       P      S   +  + I       +  E   D +L  ++ +   D  +     L  G L+  K ++++ ++++L 
Subjt:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI

Query:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS
          ++  YH+     H G       I     W+G++  I++YV++C TCQ NK+    P G LQP+P  ER WE +SMDF+  LP+S GY+ +FVVVDR S
Subjt:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS

Query:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS
        K A  +P +   +A+  A +F + V+   G P+ I++D D IF S  W +        ++ S  Y PQTDGQTE  N+ +E  LRC C   P  W   +S
Subjt:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS

Query:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP
          +  YN   HS+   TPF +V+   P   P  L     KT+         N  +TI + +T    L     +MKKY D++ +E ++++ G+ V +K   
Subjt:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP

Query:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE
         R  T    ++ KL+P + GP+ +L+K GP  Y LDLP+         FHVS L +   ++E
Subjt:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE

P0CT37 Transposon Tf2-4 polyprotein4.3e-13429.76Show/hide
Query:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS
        L+D GA  N I+++ +   K+P T   +  +I G     K      K+ I+L+GI++  +FL ++     A         +  TT+        I++  S
Subjt:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS

Query:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN
        K TL                             ++  ++  V +P + D             + +++ D+ +E +  +LP P + ++  + L   +  + 
Subjt:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN

Query:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT
        +R Y  P  + + +   I + L +G+IR S +    PV+ V K++G+ R  VDY+ LN+   P+ +P+P+IE+LL ++ G+ IF+KLDL+  YH IRV  
Subjt:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT

Query:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW
         D  K AFR   G +E+LVMP+G++ AP+ FQ                         +     H++H+  VL  +++ +L +N  KC F + ++ ++G+ 
Subjt:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW

Query:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA
        IS  G     +    +++W +P N +ELR FLG   Y RKF+        PL  L ++D  ++WT    +A EN+K+ +++ PVL   DFS+  ++ETDA
Subjt:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA

Query:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE
        S + +GAVL+Q        P+ Y+S  +S    +  + +KE++AI+K+++ WR YL    + F I TD + L  R   +         +W   L  +NFE
Subjt:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE

Query:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI
        + YRPG  N  ADALSR+       P      S   +  + I       +  E   D +L  ++ +   D  +     L  G L+  K ++++ ++++L 
Subjt:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI

Query:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS
          ++  YH+     H G       I     W+G++  I++YV++C TCQ NK+    P G LQP+P  ER WE +SMDF+  LP+S GY+ +FVVVDR S
Subjt:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS

Query:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS
        K A  +P +   +A+  A +F + V+   G P+ I++D D IF S  W +        ++ S  Y PQTDGQTE  N+ +E  LRC C   P  W   +S
Subjt:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS

Query:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP
          +  YN   HS+   TPF +V+   P   P  L     KT+         N  +TI + +T    L     +MKKY D++ +E ++++ G+ V +K   
Subjt:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP

Query:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE
         R  T    ++ KL+P + GP+ +L+K GP  Y LDLP+         FHVS L +   ++E
Subjt:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE

P0CT41 Transposon Tf2-12 polyprotein4.3e-13429.76Show/hide
Query:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS
        L+D GA  N I+++ +   K+P T   +  +I G     K      K+ I+L+GI++  +FL ++     A         +  TT+        I++  S
Subjt:  LVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDS

Query:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN
        K TL                             ++  ++  V +P + D             + +++ D+ +E +  +LP P + ++  + L   +  + 
Subjt:  KITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPIVDQLLEQYADVFSEPS--RLP-PHREIDHRIPLKTGSQPIN

Query:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT
        +R Y  P  + + +   I + L +G+IR S +    PV+ V K++G+ R  VDY+ LN+   P+ +P+P+IE+LL ++ G+ IF+KLDL+  YH IRV  
Subjt:  VRPYRYPHAQKEEIEKMIREMLVAGVIRPSTS----PVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLDELHGAQIFSKLDLRPGYHQIRVHT

Query:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW
         D  K AFR   G +E+LVMP+G++ AP+ FQ                         +     H++H+  VL  +++ +L +N  KC F + ++ ++G+ 
Subjt:  EDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQ-------------------------APDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHW

Query:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA
        IS  G     +    +++W +P N +ELR FLG   Y RKF+        PL  L ++D  ++WT    +A EN+K+ +++ PVL   DFS+  ++ETDA
Subjt:  ISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDA-FEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDA

Query:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE
        S + +GAVL+Q        P+ Y+S  +S    +  + +KE++AI+K+++ WR YL    + F I TD + L  R   +         +W   L  +NFE
Subjt:  SSMGLGAVLTQNQ-----RPIAYFSQSLSTQAQSKPIYEKELMAIVKAVQKWRPYLLG--QRFLIRTDQQAL--RFLVDQTVIQPEYQKWISKLLGYNFE

Query:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI
        + YRPG  N  ADALSR+       P      S   +  + I       +  E   D +L  ++ +   D  +     L  G L+  K ++++ ++++L 
Subjt:  VQYRPGLENKAADALSRLPPAVHLAPL-----SAPAILDVQI-------IKREVAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLL-YKGRLVMSSESRLI

Query:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS
          ++  YH+     H G       I     W+G++  I++YV++C TCQ NK+    P G LQP+P  ER WE +SMDF+  LP+S GY+ +FVVVDR S
Subjt:  PAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQPLPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLS

Query:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS
        K A  +P +   +A+  A +F + V+   G P+ I++D D IF S  W +        ++ S  Y PQTDGQTE  N+ +E  LRC C   P  W   +S
Subjt:  KYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTEAVNKCLETYLRCFCGERPKQWEHWLS

Query:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP
          +  YN   HS+   TPF +V+   P   P  L     KT+         N  +TI + +T    L     +MKKY D++ +E ++++ G+ V +K   
Subjt:  WAEYWYNTTYHSSLNTTPFRVVYGRPP---PTLLFYGDQKTNNATLDEQLLNRDKTIDLLRT---QLLKAQDRMKKYADLQRRE-KKYEEGEWVYIKLRP

Query:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE
         R  T    ++ KL+P + GP+ +L+K GP  Y LDLP+         FHVS L +   ++E
Subjt:  YRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEAS--IHPVFHVSQLRQAMGSNE

Arabidopsis top hitse value%identityAlignment
AT1G11720.1 starch synthase 34.6e-0628.03Show/hide
Query:  RFMNRHDETLSHLIFGGCDIILCQSLHDPVMQVPLKALRYGAAPIAITSS---DNGFGHFPDHDYETTKSAMFINSTFGY---------LSFSQALDEMN
        R +  +DE LSHLI+ G D IL  S+ +P     L A+RYGA P+   +    D  F    DHD E  ++ +   + F +          + ++A+    
Subjt:  RFMNRHDETLSHLIFGGCDIILCQSLHDPVMQVPLKALRYGAAPIAITSS---DNGFGHFPDHDYETTKSAMFINSTFGY---------LSFSQALDEMN

Query:  NNPSEWNHKVFDAMTKDFSWDAECCDIYLSAY
        +    +N      M +D+SW+    + YL  Y
Subjt:  NNPSEWNHKVFDAMTKDFSWDAECCDIYLSAY

AT3G29750.1 Eukaryotic aspartyl protease family protein1.6e-1127.07Show/hide
Query:  HKCKGKELKVLLVADEEPETPFSKEKETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLIEDMKIPVTE
        ++ +  EL  L +   + +    K+    E    E +  T+   M +L    V+  +    M+  G I D +VV+ +D GAT NFI  +L   +K+P + 
Subjt:  HKCKGKELKVLLVADEEPETPFSKEKETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLIEDMKIPVTE

Query:  TMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIEL--SGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITL
        T    +++G    ++  G C  I + +  + + E+FL ++L  + VD ILG +WL  LG T ++W+            ITL
Subjt:  TMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIEL--SGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITL

AT3G30770.1 Eukaryotic aspartyl protease family protein1.1e-0732.29Show/hide
Query:  FSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIEL--SGVDAILG
        F+    M+  G I   +VV+++D GAT+NFIS +L   +K+P + T    +++G    ++  G C  I + +  + + E+FL ++L  + VD ILG
Subjt:  FSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIVIALDGITVAEDFLPIEL--SGVDAILG

AT4G18240.1 starch synthase 45.3e-1023.93Show/hide
Query:  NPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKVAYTPGGIVGCFLSDASDVDTENLRAIVQYGTRMGVQFIFMTTGEMTSRHKELESLQIQ
        NP T+ + L  Q+    + +G  E+    R+Q      E +       +VGC            +R  +     +G QF+ + +  +    +E E ++ Q
Subjt:  NPVTNEWELLVQWKGLPEFEGTWESLEHFRQQFPTFPLEDKVAYTPGGIVGCFLSDASDVDTENLRAIVQYGTRMGVQFIFMTTGEMTSRHKELESLQIQ

Query:  IED-KNVRFMNRHDETLSHLIFGGCDIILCQSLHDPVMQVPLKALRYGAAPIAITSS--DNGFGHFPDHDYETTKSAMFINSTFGYLSFSQALDEMNN--
         +   +VR + ++DE LSH I+   D+ +  S+ +P     + A+RYG+ PIA  +   ++      D    T     F   T     F+ AL+   N  
Subjt:  IED-KNVRFMNRHDETLSHLIFGGCDIILCQSLHDPVMQVPLKALRYGAAPIAITSS--DNGFGHFPDHDYETTKSAMFINSTFGYLSFSQALDEMNN--

Query:  --NPSEWNHKVFDAMTKDFSWDAECCDIYLSAYT
          +  +W   V   M+ DFSW +     Y   YT
Subjt:  --NPSEWNHKVFDAMTKDFSWDAECCDIYLSAYT

ATMG00860.1 DNA/RNA polymerases superfamily protein7.6e-3351.52Show/hide
Query:  LRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLG--HWISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFE
        + HLG VL +   +    N +KC F + +I YLG  H IS  GV AD  K  +MV WP P N  ELRGFLGLTGYYR+FV+NYG    PLT+L ++++ +
Subjt:  LRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLG--HWISAAGVEADADKFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFE

Query:  WTEQAQEAFENLKRAMMTLPVLALPDFSQTFI
        WTE A  AF+ LK A+ TLPVLALPD    F+
Subjt:  WTEQAQEAFENLKRAMMTLPVLALPDFSQTFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCTGGAAGCCATCATGAAGGCGGCCCAAAGAATAGAAGACAAGAATCGGGCCATTGAATCTAAGACTAGTTTCGGCCCAGGAAGAGTGTGGAAAAATCCCATCAT
CAGCCATGCAAATCCACCAAAGTTAGTCGAACAAAACCCCATTAAAACCATTACTTTACCCAATAACCCGTCCCATCCACCGAAAGAAACACCACTTAGACGCTTAACCG
ATTCTGAAATGCAATTAAGAAGGGAAAAAGGATTGTGTTACAGGTGTGATGAAAAGTTCCACGTGGGACACAAATGTAAAGGAAAGGAGTTAAAGGTACTCCTGGTAGCA
GACGAGGAACCAGAGACGCCATTTTCCAAAGAAAAAGAAACCACAGAGTGTCGGGTAGAAGAAGTCGAAGATCCTACTGTGGAAGTCGACATGGCCGAACTTTCCATTAA
CACGGTAGTTGGATTTTCTTCTCCTGGAACGATGAAAGTAAAAGGGAGAATCGAAGACAAAGAGGTGGTTATTCTCGTAGATTGTGGTGCAACACACAATTTTATTTCAC
AAAAGCTGATCGAAGATATGAAGATACCAGTGACAGAAACGATGAACTATGGCATTATCATGGGAACATGGACCGCAGTGAAAGGAAAAGGTATTTGTAAAAAGATTGTG
ATCGCATTAGATGGAATTACCGTTGCTGAAGACTTCTTACCCATAGAGTTGAGTGGGGTTGATGCAATATTAGGCATGCAGTGGCTGAGAACTCTAGGAGTAACGACTAT
TGACTGGAAAACCCTAACGATGGAGATCAAAGTAGGAGATTCCAAGATTACCCTCAAGGGAGATCCCTCCTTGACCAAGACAGAAGTATCGCTTAAACAACTTAAGCGAA
CATGGGGCAAACATGATCAGGGCTTCTTAGTTGAACTAAGAGCAATTACAGCAGCAGTAGGTGACCCGGCCATTTTTGACGCAGTTCCACCATCACCGAACTTACCCATT
GTGGACCAATTATTAGAACAGTATGCTGATGTGTTCTCCGAACCTTCTCGACTTCCTCCTCACAGAGAAATCGATCATCGAATCCCACTGAAGACCGGAAGTCAACCAAT
TAACGTCCGTCCATATCGATATCCTCATGCCCAAAAGGAAGAAATCGAGAAAATGATCAGGGAAATGTTAGTCGCAGGGGTGATTCGACCCAGTACCAGCCCAGTTCTGC
TCGTGAAGAAAAGAGACGGTAGTTGGAGGTTTTGCGTAGACTATCGTGCTTTGAATGAGGCCACTGTGCCAGACAAGTTTCCAATCCCAGTAATTGAAGAACTGTTAGAC
GAACTACACGGGGCGCAGATTTTTTCTAAACTGGACCTACGACCCGGATATCATCAAATCCGTGTACATACTGAAGATATCCCAAAAACGGCGTTTAGGACGCACGAAGG
TCACTACGAATTCCTGGTAATGCCCTTTGGTCTCACTAATGCCCCGTCAACTTTCCAAGCTCCGGATTTAGAGACACACTTAAGACACCTTGGGGCTGTATTGAATGTGA
TGAGGGATAACTCACTAAAAGTAAACATGAGAAAATGTGTGTTTGCGAAAGATCGCATAGACTACTTAGGCCATTGGATTTCGGCGGCAGGCGTAGAAGCTGATGCTGAC
AAATTCAGATCGATGGTAGAATGGCCAAGACCGACAAATGTCAGGGAATTGAGAGGATTTCTAGGGCTCACGGGCTATTATCGAAAGTTTGTGGAAAATTATGGTGCCAA
AGCATTTCCTTTAACACAGTTAACTCGCCGAGATGCGTTCGAATGGACCGAACAAGCCCAAGAAGCGTTTGAGAACTTGAAACGGGCTATGATGACCCTCCCAGTCTTGG
CACTACCTGATTTTTCGCAGACCTTCATAGTAGAAACTGATGCCTCCAGTATGGGTTTGGGAGCTGTATTGACTCAGAACCAAAGGCCAATAGCTTATTTCAGTCAATCA
TTGTCAACACAGGCCCAGTCCAAACCAATCTATGAGAAAGAATTAATGGCAATCGTAAAGGCGGTACAAAAGTGGAGGCCATATCTGTTGGGCCAACGTTTTTTGATCCG
GACAGACCAACAAGCTTTGCGATTCCTTGTAGACCAAACCGTCATACAACCCGAATACCAAAAATGGATCTCAAAGTTGTTGGGCTACAACTTCGAAGTACAGTATCGAC
CCGGATTAGAAAATAAAGCAGCGGACGCTTTATCTCGCCTTCCACCTGCAGTCCATCTTGCACCGTTATCAGCACCGGCCATTCTAGACGTACAAATAATCAAGAGAGAA
GTAGCCCAAGACCACAGGTTGCAAGAGATAGTGCGAGATCTCTTTGTTGATCCCTCCCTTCATCCAAAATTTTCCCTCCATCAAGGAACACTGTTGTATAAGGGAAGATT
AGTAATGTCAAGCGAATCCAGATTAATTCCAGCGGTTCTTCACTCCTATCACGATTCTGTTATAGGCGGACATTCAGGGTTCTTAAGAACCTATAAAAGAATTACGGCTG
AATTATACTGGCAGGGGATGAAAGGAGCGATAAAACAATATGTGGAAGACTGCCAAACCTGCCAACGAAACAAGACCCTAACTCTATCTCCGGCCGGACTTCTGCAGCCC
TTACCCATTCCGGAGCGGATATGGGAAGACATTTCGATGGATTTTGTAGAAGGCTTACCCAAATCCAGAGGATATGACACAATCTTTGTGGTCGTGGACAGGCTCAGCAA
ATATGCACACTTCTTACCCTTAAGCCACCCTTTTTCTGCAAAGAGCGTGGCAGCCTTATTCATAAAAGAAGTGGTACGACTGCACGGATTCCCAAGATCCATAGTTTCAG
ACCGAGATAAAATATTTGTGAGTCATTTTTGGGGAGAAATGTTTCGTCTACAGGGTACTCAATTGAGGAGAAGCACTTCCTATCATCCACAAACAGACGGGCAAACTGAA
GCGGTCAATAAATGCCTTGAAACGTATTTGAGATGCTTTTGTGGTGAAAGACCAAAGCAATGGGAACATTGGTTGTCCTGGGCGGAGTACTGGTATAACACAACATATCA
CTCTTCTCTCAACACAACACCGTTCCGAGTGGTTTATGGCAGACCACCTCCAACACTTTTATTCTACGGGGACCAGAAGACAAACAATGCGACATTAGATGAACAGCTGC
TTAACCGAGACAAAACCATTGATTTGCTCAGGACTCAGTTACTCAAAGCTCAAGATCGAATGAAGAAATACGCTGACCTACAAAGAAGAGAAAAAAAATATGAAGAAGGA
GAATGGGTGTATATCAAGCTCCGTCCTTACCGACAGTCAACAGTGGCTCACAGAAGAAATGAAAAACTATCTCCAAAATATTTTGGGCCTTATCGAATTCTCGAGAAAAT
TGGACCACTAGCCTACCGACTCGACCTGCCAGAAGAAGCTTCCATTCATCCGGTTTTTCATGTATCACAACTTAGGCAGGCGATGGGAAGCAATGAGCGCGTCGAACGAC
TTCCACCCATCTTAACAGAAAACTTTGAATGGATAGCTGAACCAGAGGATGTGATGGCATACCGCACAAATCCAGTCACCAACGAATGGGAACTTTTAGTACAGTGGAAG
GGCCTACCAGAATTCGAGGGAACCTGGGAATCCTTAGAACACTTCAGACAACAATTTCCAACCTTTCCACTTGAGGACAAGGTGGCTTACACCCCCGGGGGTATTGTTGG
ATGCTTTTTATCTGATGCGTCAGATGTTGACACAGAGAACCTGAGAGCAATTGTTCAGTATGGCACCAGAATGGGTGTCCAGTTCATCTTCATGACAACTGGCGAAATGA
CAAGTAGACACAAGGAATTAGAATCTCTTCAAATACAAATTGAGGATAAAAATGTGAGGTTCATGAACAGACATGATGAGACATTGTCACATCTCATTTTTGGAGGTTGC
GATATTATCTTGTGTCAGTCTCTTCACGATCCTGTTATGCAAGTGCCATTGAAGGCTTTGAGATATGGAGCTGCTCCAATAGCAATCACCTCCAGTGACAACGGATTCGG
ACACTTTCCTGATCATGACTACGAGACTACTAAATCAGCCATGTTCATTAACTCTACCTTCGGATATTTGTCCTTCAGCCAAGCCTTGGATGAAATGAATAACAATCCAT
CCGAGTGGAATCATAAAGTGTTTGATGCTATGACAAAGGACTTCTCTTGGGATGCAGAGTGCTGTGATATATATCTTTCTGCTTACACAGCTATGAAGAGCTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCTGGAAGCCATCATGAAGGCGGCCCAAAGAATAGAAGACAAGAATCGGGCCATTGAATCTAAGACTAGTTTCGGCCCAGGAAGAGTGTGGAAAAATCCCATCAT
CAGCCATGCAAATCCACCAAAGTTAGTCGAACAAAACCCCATTAAAACCATTACTTTACCCAATAACCCGTCCCATCCACCGAAAGAAACACCACTTAGACGCTTAACCG
ATTCTGAAATGCAATTAAGAAGGGAAAAAGGATTGTGTTACAGGTGTGATGAAAAGTTCCACGTGGGACACAAATGTAAAGGAAAGGAGTTAAAGGTACTCCTGGTAGCA
GACGAGGAACCAGAGACGCCATTTTCCAAAGAAAAAGAAACCACAGAGTGTCGGGTAGAAGAAGTCGAAGATCCTACTGTGGAAGTCGACATGGCCGAACTTTCCATTAA
CACGGTAGTTGGATTTTCTTCTCCTGGAACGATGAAAGTAAAAGGGAGAATCGAAGACAAAGAGGTGGTTATTCTCGTAGATTGTGGTGCAACACACAATTTTATTTCAC
AAAAGCTGATCGAAGATATGAAGATACCAGTGACAGAAACGATGAACTATGGCATTATCATGGGAACATGGACCGCAGTGAAAGGAAAAGGTATTTGTAAAAAGATTGTG
ATCGCATTAGATGGAATTACCGTTGCTGAAGACTTCTTACCCATAGAGTTGAGTGGGGTTGATGCAATATTAGGCATGCAGTGGCTGAGAACTCTAGGAGTAACGACTAT
TGACTGGAAAACCCTAACGATGGAGATCAAAGTAGGAGATTCCAAGATTACCCTCAAGGGAGATCCCTCCTTGACCAAGACAGAAGTATCGCTTAAACAACTTAAGCGAA
CATGGGGCAAACATGATCAGGGCTTCTTAGTTGAACTAAGAGCAATTACAGCAGCAGTAGGTGACCCGGCCATTTTTGACGCAGTTCCACCATCACCGAACTTACCCATT
GTGGACCAATTATTAGAACAGTATGCTGATGTGTTCTCCGAACCTTCTCGACTTCCTCCTCACAGAGAAATCGATCATCGAATCCCACTGAAGACCGGAAGTCAACCAAT
TAACGTCCGTCCATATCGATATCCTCATGCCCAAAAGGAAGAAATCGAGAAAATGATCAGGGAAATGTTAGTCGCAGGGGTGATTCGACCCAGTACCAGCCCAGTTCTGC
TCGTGAAGAAAAGAGACGGTAGTTGGAGGTTTTGCGTAGACTATCGTGCTTTGAATGAGGCCACTGTGCCAGACAAGTTTCCAATCCCAGTAATTGAAGAACTGTTAGAC
GAACTACACGGGGCGCAGATTTTTTCTAAACTGGACCTACGACCCGGATATCATCAAATCCGTGTACATACTGAAGATATCCCAAAAACGGCGTTTAGGACGCACGAAGG
TCACTACGAATTCCTGGTAATGCCCTTTGGTCTCACTAATGCCCCGTCAACTTTCCAAGCTCCGGATTTAGAGACACACTTAAGACACCTTGGGGCTGTATTGAATGTGA
TGAGGGATAACTCACTAAAAGTAAACATGAGAAAATGTGTGTTTGCGAAAGATCGCATAGACTACTTAGGCCATTGGATTTCGGCGGCAGGCGTAGAAGCTGATGCTGAC
AAATTCAGATCGATGGTAGAATGGCCAAGACCGACAAATGTCAGGGAATTGAGAGGATTTCTAGGGCTCACGGGCTATTATCGAAAGTTTGTGGAAAATTATGGTGCCAA
AGCATTTCCTTTAACACAGTTAACTCGCCGAGATGCGTTCGAATGGACCGAACAAGCCCAAGAAGCGTTTGAGAACTTGAAACGGGCTATGATGACCCTCCCAGTCTTGG
CACTACCTGATTTTTCGCAGACCTTCATAGTAGAAACTGATGCCTCCAGTATGGGTTTGGGAGCTGTATTGACTCAGAACCAAAGGCCAATAGCTTATTTCAGTCAATCA
TTGTCAACACAGGCCCAGTCCAAACCAATCTATGAGAAAGAATTAATGGCAATCGTAAAGGCGGTACAAAAGTGGAGGCCATATCTGTTGGGCCAACGTTTTTTGATCCG
GACAGACCAACAAGCTTTGCGATTCCTTGTAGACCAAACCGTCATACAACCCGAATACCAAAAATGGATCTCAAAGTTGTTGGGCTACAACTTCGAAGTACAGTATCGAC
CCGGATTAGAAAATAAAGCAGCGGACGCTTTATCTCGCCTTCCACCTGCAGTCCATCTTGCACCGTTATCAGCACCGGCCATTCTAGACGTACAAATAATCAAGAGAGAA
GTAGCCCAAGACCACAGGTTGCAAGAGATAGTGCGAGATCTCTTTGTTGATCCCTCCCTTCATCCAAAATTTTCCCTCCATCAAGGAACACTGTTGTATAAGGGAAGATT
AGTAATGTCAAGCGAATCCAGATTAATTCCAGCGGTTCTTCACTCCTATCACGATTCTGTTATAGGCGGACATTCAGGGTTCTTAAGAACCTATAAAAGAATTACGGCTG
AATTATACTGGCAGGGGATGAAAGGAGCGATAAAACAATATGTGGAAGACTGCCAAACCTGCCAACGAAACAAGACCCTAACTCTATCTCCGGCCGGACTTCTGCAGCCC
TTACCCATTCCGGAGCGGATATGGGAAGACATTTCGATGGATTTTGTAGAAGGCTTACCCAAATCCAGAGGATATGACACAATCTTTGTGGTCGTGGACAGGCTCAGCAA
ATATGCACACTTCTTACCCTTAAGCCACCCTTTTTCTGCAAAGAGCGTGGCAGCCTTATTCATAAAAGAAGTGGTACGACTGCACGGATTCCCAAGATCCATAGTTTCAG
ACCGAGATAAAATATTTGTGAGTCATTTTTGGGGAGAAATGTTTCGTCTACAGGGTACTCAATTGAGGAGAAGCACTTCCTATCATCCACAAACAGACGGGCAAACTGAA
GCGGTCAATAAATGCCTTGAAACGTATTTGAGATGCTTTTGTGGTGAAAGACCAAAGCAATGGGAACATTGGTTGTCCTGGGCGGAGTACTGGTATAACACAACATATCA
CTCTTCTCTCAACACAACACCGTTCCGAGTGGTTTATGGCAGACCACCTCCAACACTTTTATTCTACGGGGACCAGAAGACAAACAATGCGACATTAGATGAACAGCTGC
TTAACCGAGACAAAACCATTGATTTGCTCAGGACTCAGTTACTCAAAGCTCAAGATCGAATGAAGAAATACGCTGACCTACAAAGAAGAGAAAAAAAATATGAAGAAGGA
GAATGGGTGTATATCAAGCTCCGTCCTTACCGACAGTCAACAGTGGCTCACAGAAGAAATGAAAAACTATCTCCAAAATATTTTGGGCCTTATCGAATTCTCGAGAAAAT
TGGACCACTAGCCTACCGACTCGACCTGCCAGAAGAAGCTTCCATTCATCCGGTTTTTCATGTATCACAACTTAGGCAGGCGATGGGAAGCAATGAGCGCGTCGAACGAC
TTCCACCCATCTTAACAGAAAACTTTGAATGGATAGCTGAACCAGAGGATGTGATGGCATACCGCACAAATCCAGTCACCAACGAATGGGAACTTTTAGTACAGTGGAAG
GGCCTACCAGAATTCGAGGGAACCTGGGAATCCTTAGAACACTTCAGACAACAATTTCCAACCTTTCCACTTGAGGACAAGGTGGCTTACACCCCCGGGGGTATTGTTGG
ATGCTTTTTATCTGATGCGTCAGATGTTGACACAGAGAACCTGAGAGCAATTGTTCAGTATGGCACCAGAATGGGTGTCCAGTTCATCTTCATGACAACTGGCGAAATGA
CAAGTAGACACAAGGAATTAGAATCTCTTCAAATACAAATTGAGGATAAAAATGTGAGGTTCATGAACAGACATGATGAGACATTGTCACATCTCATTTTTGGAGGTTGC
GATATTATCTTGTGTCAGTCTCTTCACGATCCTGTTATGCAAGTGCCATTGAAGGCTTTGAGATATGGAGCTGCTCCAATAGCAATCACCTCCAGTGACAACGGATTCGG
ACACTTTCCTGATCATGACTACGAGACTACTAAATCAGCCATGTTCATTAACTCTACCTTCGGATATTTGTCCTTCAGCCAAGCCTTGGATGAAATGAATAACAATCCAT
CCGAGTGGAATCATAAAGTGTTTGATGCTATGACAAAGGACTTCTCTTGGGATGCAGAGTGCTGTGATATATATCTTTCTGCTTACACAGCTATGAAGAGCTTGTGA
Protein sequenceShow/hide protein sequence
MGLEAIMKAAQRIEDKNRAIESKTSFGPGRVWKNPIISHANPPKLVEQNPIKTITLPNNPSHPPKETPLRRLTDSEMQLRREKGLCYRCDEKFHVGHKCKGKELKVLLVA
DEEPETPFSKEKETTECRVEEVEDPTVEVDMAELSINTVVGFSSPGTMKVKGRIEDKEVVILVDCGATHNFISQKLIEDMKIPVTETMNYGIIMGTWTAVKGKGICKKIV
IALDGITVAEDFLPIELSGVDAILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAAVGDPAIFDAVPPSPNLPI
VDQLLEQYADVFSEPSRLPPHREIDHRIPLKTGSQPINVRPYRYPHAQKEEIEKMIREMLVAGVIRPSTSPVLLVKKRDGSWRFCVDYRALNEATVPDKFPIPVIEELLD
ELHGAQIFSKLDLRPGYHQIRVHTEDIPKTAFRTHEGHYEFLVMPFGLTNAPSTFQAPDLETHLRHLGAVLNVMRDNSLKVNMRKCVFAKDRIDYLGHWISAAGVEADAD
KFRSMVEWPRPTNVRELRGFLGLTGYYRKFVENYGAKAFPLTQLTRRDAFEWTEQAQEAFENLKRAMMTLPVLALPDFSQTFIVETDASSMGLGAVLTQNQRPIAYFSQS
LSTQAQSKPIYEKELMAIVKAVQKWRPYLLGQRFLIRTDQQALRFLVDQTVIQPEYQKWISKLLGYNFEVQYRPGLENKAADALSRLPPAVHLAPLSAPAILDVQIIKRE
VAQDHRLQEIVRDLFVDPSLHPKFSLHQGTLLYKGRLVMSSESRLIPAVLHSYHDSVIGGHSGFLRTYKRITAELYWQGMKGAIKQYVEDCQTCQRNKTLTLSPAGLLQP
LPIPERIWEDISMDFVEGLPKSRGYDTIFVVVDRLSKYAHFLPLSHPFSAKSVAALFIKEVVRLHGFPRSIVSDRDKIFVSHFWGEMFRLQGTQLRRSTSYHPQTDGQTE
AVNKCLETYLRCFCGERPKQWEHWLSWAEYWYNTTYHSSLNTTPFRVVYGRPPPTLLFYGDQKTNNATLDEQLLNRDKTIDLLRTQLLKAQDRMKKYADLQRREKKYEEG
EWVYIKLRPYRQSTVAHRRNEKLSPKYFGPYRILEKIGPLAYRLDLPEEASIHPVFHVSQLRQAMGSNERVERLPPILTENFEWIAEPEDVMAYRTNPVTNEWELLVQWK
GLPEFEGTWESLEHFRQQFPTFPLEDKVAYTPGGIVGCFLSDASDVDTENLRAIVQYGTRMGVQFIFMTTGEMTSRHKELESLQIQIEDKNVRFMNRHDETLSHLIFGGC
DIILCQSLHDPVMQVPLKALRYGAAPIAITSSDNGFGHFPDHDYETTKSAMFINSTFGYLSFSQALDEMNNNPSEWNHKVFDAMTKDFSWDAECCDIYLSAYTAMKSL