; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G14820 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G14820
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr5:15424035..15442177
RNA-Seq ExpressionCSPI05G14820
SyntenyCSPI05G14820
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0034645 - cellular macromolecule biosynthetic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0051536 - iron-sulfur cluster binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR025977 - Nuclear condensin complex subunit 3, C-terminal domain
IPR021109 - Aspartic peptidase domain superfamily
IPR016197 - Chromo-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
IPR000953 - Chromo/chromo shadow domain
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061073.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0060.84Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------
        +M K++E L +E+KE+   K++EES   S+GS +K KGK  ET+  +   G  +DRSKYKKLEMP+F G+NPESWVYRAEH+F+IN  ++ EKV      
Subjt:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------

Query:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE
                                K  MFE F+  G +SLGARLIRI+Q+GSY +Y+KKF+ YSAPL  MAES+L+DAFVTGLE +LQAEV+SRHP TLE
Subjt:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE

Query:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY
        DCM+EAQ+V+DR+ A+KL+  E G        +     K K   + +K   + +++ MKQ++IPIKGN+ + EP VK LS++EFRARLD+GLCFRCN+KY
Subjt:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY

Query:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL
        SPGHRCK K  RELMFFI NEEEE EE + +EE   ETVEL+TLE+  +T I L+T+   +SKG +                              EL L
Subjt:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL

Query:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------
         ++  T FG TIG+ T   G G+C+RVEVKL E+ I+ADFLA++LG +D VLGMQWL T G + ++WP++TM+F  G  Q                    
Subjt:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------

Query:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML
                        +E+E ++   ++ +EEGDEA++PMIR LL ++K +            +DHRILT+  QKPINVRPYKYG+IQK EIEKLV EML
Subjt:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML

Query:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF
        Q G+IRPSR  YSSPVLL++K DGG RFCVDYRKLNQ T+SDKFPI VIEELLDEL+GA VFSKLDL+SGYHQI+MKEEDVEKTAFRTHEGHYEFLVMPF
Subjt:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF

Query:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP
        GLTNAPATFQSLMNQ+FKPFLRR VLVFFDD+L+YS DI E E+HLGMVF V++DNQL+AN KKCV  HS++ YLGH ISK GVEAD +K+++MVNWP+P
Subjt:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP

Query:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL
         DV+ELRGFLGLTGYYRRFVKGY NIA PLTKLLQ+N F WNEDA  AF  LK AM ++ VLALPD+SLPF I TDASG GLGAVLSQ G PIA++SQKL
Subjt:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL

Query:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL
        S RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPG LNKAADALSR+ P +EL+ + +  +
Subjt:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL

Query:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC
        +D+ +V EEV +DE LQ I+  LK+  E    ++W + +LLYKG+LVL +TSSLIP LLHTFHDSVLGGHSGFLRTYK M+GE++W GMKND+KKYVEQC
Subjt:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC

Query:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS
        E CQRNK +A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRL+K A+FITLKHPF+AKQVA  F  +I+ +HGIPNSI++DRDK+FLS
Subjt:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS

Query:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ
         FW+ELF +MG  LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KW + IPWAELWYNT FH+  KTTPF+ VYGR PP L+ YGD+KT NN VE 
Subjt:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ

Query:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL
        +LKERD  +SALKENL  AQNRMKK ADL+RRELK KVG+EVYLKL+PYRQRSLA+K+ EKLAP++YGPY+++EEIG VAYRLDLP EA IHNVFHISQL
Subjt:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL

Query:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        K KLG  Q +QH  P LTE+FELQ++PE VLG+RWN E+GAN+WL         +ATWESVY MNQQ+PSFHLEDK   +P
Subjt:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

KAA0066550.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0065.35Show/hide
Query:  LTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV---------------
        +TEEVKE  R K REES ASEGS LK KGK GET+    FT  SSDRSKYKKL +P+F G+NPE+W+YRAEHYFDINE  DEEKV               
Subjt:  LTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV---------------

Query:  ---------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLEDCMQEAQMV
                       K+ MFEHFK+PG  SLGARLIRIKQDG Y DYLKKFL YSAPL EMAES+LIDAFVTGLETNLQ EV SRHPVTLE+C       
Subjt:  ---------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLEDCMQEAQMV

Query:  SDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPS--VKWLSNSEFRARLDKGLCFRCNEKYSPGHRCK
                       S  P                       K +++ MKQ+++PIKGN+ ++EP   VK LS+SEFRARLDKGLCF C E+        
Subjt:  SDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPS--VKWLSNSEFRARLDKGLCFRCNEKYSPGHRCK

Query:  GKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVINELNLPVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKI
                  +  E+E+ +EV         T E E   + G  +            GV+ EL LP+E + KFGVTIGD TALEG G+CK VE+KLPEL I
Subjt:  GKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVINELNLPVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKI

Query:  VADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTT-----------------------------------QKIEIEENNEEDSEREEEGDEA
        VADFL I+LG +DVVLGMQW  T GF+ +HWP+MTM FM G++                                   QKIEIE  +E +SE+EEEG+E+
Subjt:  VADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTT-----------------------------------QKIEIEENNEEDSEREEEGDEA

Query:  NLPMIRKLLHRF-------KGL----VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLN
        NLPMIR LL +        KGL     VDHRIL  +GQ PINVRPYKYGYIQK EIE+LV+EMLQA IIRPSR  YSSPVLL++K DGG RFCVDY KLN
Subjt:  NLPMIRKLLHRF-------KGL----VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLN

Query:  QVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYS
        QVTV+DKFPI VIEELLDELHGAEVFSKLDLRSGYHQI+MKEED+EKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNQIF+PFLRRFVL          
Subjt:  QVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYS

Query:  TDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQ
                          DNQLFANEKKCVIGHSR+NYL                                GFLGLTGYYRRFVK YGNIAAPLTKLLQ+
Subjt:  TDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQ

Query:  NGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLSPRAQAKSIYERELMAVVMA----------------
        NGF+W EDA  AFESLKQAMISV VLALPDFSLPF I TDASG GLGAVLSQN RPIAYFSQKLSP  QAKSIYERELMAVVMA                
Subjt:  NGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLSPRAQAKSIYERELMAVVMA----------------

Query:  --------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLLDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWV
                            RWLTKLLGYDFEILYQPGL NKAADALSR+    E ++L +  LLD+++V  EV++DEELQGIIE+LKE+PEGKANYQW 
Subjt:  --------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLLDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWV

Query:  SDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIE
        +  L YKG+LVLS+ S+LIP+LLHTFHDSVLGGH GFLRTYK MNGEIHW+GMKNDVKKYVEQCE CQRNKT+AL+PAGLLQPLPLPNLILEDWTMDFIE
Subjt:  SDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIE

Query:  GLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLE
        GLPKAGG+DSIMVVVDRLSKMAHFI LKHPF+AKQVAEKF +EIISKHGI NSIVTDRDKVFLS FWKELFTAMG SLKRSTAFHPQTDGQTE+VNRCLE
Subjt:  GLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLE

Query:  TYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQADLRRRELKF
        TYL CFCNEQPTKWHKCIPWAELWYNTTFH+ AKTTPFQ+VYGRPPP L+ YGD K+NNNSVEQLLKERDLVISALKENL+ AQNRMKKQADL R+ELKF
Subjt:  TYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQADLRRRELKF

Query:  KVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWN
        KVGDEVYLKLRPYRQRSLA+KRCEKLAPKFYGPYR++EEIGEVAYRL+LP EA+IHNVFH+SQLKLKLG +  +QH+PPALTE+FELQVEPEAVLG+RWN
Subjt:  KVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWN

Query:  SEMGANDWL---------EATWESVYAMNQQYPSFHLEDK
        +++GAN+WL         EATWE V AMNQQYPSFHLEDK
Subjt:  SEMGANDWL---------EATWESVYAMNQQYPSFHLEDK

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0060.88Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----
        +M KS++ L +E+++ S  K++EES  S+GS +K KGK  ETD+         DRSKYKKLEMP+F G+NPESWVYRAEH+F+IN   + EKVK      
Subjt:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----

Query:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED
                                  MFE F+  G +SLGARLIRI+Q+GSY DY+KKF+NYSAPL  MAES+L DAF+TGLE  LQAEV+SRHP TLED
Subjt:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED

Query:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS
        CM  AQ+V+DR+LA+KLA  E G   P+ SE+       K+  N EK   + +E+ MKQ++IP+KG+Y + EP VK LS++EFRARLDKGLCFRCNEKYS
Subjt:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS

Query:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP
         GHRCK K  RELM FI NEEE  EE  G E    E VE+  L    +T I  R I   T+KG                             ++ E  +P
Subjt:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP

Query:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------
        +   T+FG+TIGD T+ +G G+C +VE++L  L++V D L + LG IDVVLGMQWL T G + +HWP++TM F        + G                
Subjt:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------

Query:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ
                     Q+ EIE  + +       GDE  LPMI+ LLH++  +            +DHRILT+ GQKPINVRPYKYG+ QKEEIEKLV EMLQ
Subjt:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ

Query:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG
         GIIRPS   +SSPVLL++K DGG RFCVDYRKLN++T++DKFPI VIEELLDELHGA VFSKLDL+SGYHQI+M+EED+EKTAFRTHEGHYEF+VMPFG
Subjt:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK
        LTNAPATFQSLMNQ+FKPFLRR VLVFFDD+L+YS+DITEHE+HLGMVF  ++DNQL+AN KKCV  HS+++YLGH ISK GVEAD +KVK+M+ WP+PK
Subjt:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK

Query:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS
        DV+ LRGFLGLTGYYRRFVKGYG IAAPLTKLLQ+N F W+E+A  AFESLK AM ++ VLALPD+SLPF I TDASG GLGAVLSQN  PIA+FSQKLS
Subjt:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS

Query:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL
         RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPGL NKAADALSRM+  +EL  L +  ++
Subjt:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL

Query:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE
        D+E+V +EV++DEELQ +I+ L+ NP  +  Y   +  L+YKG++VLSK+SS+IP+LLHTFHDS+LGGHSGFLRTYK M+GE+ W GMK D+KKYVEQCE
Subjt:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE

Query:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ
         CQRNK++A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRLSK A+F+T+KHPF+AKQVA +F  +I+ +HGIP SI++DRDK+F+S 
Subjt:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ

Query:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL
        FWKELF AM   LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KWH+ IPWAELWYNTTFHS  +TTPFQ VYGRPPP LI YGD+KT N+ VE L
Subjt:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL

Query:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK
        LKERDL ISALKENL  AQNRMKK AD +RRELKFKVGDEVYLKLRPYRQRSLA+KR EKLAPK+YGPYR+ E IGEVAYRLDLP EA IHNVFHISQLK
Subjt:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK

Query:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        LKLG   ++Q   P LT +FELQ+ PE VLG+RW+ E+GAN+WL         EATWESVY+MNQQ+PSFHLEDK + +P
Subjt:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

TYJ99303.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0060.7Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------
        +M K++E L +E+KE+   K++EES   S+GS +K KGK  ET+  +   G  +DR+KYKKLEMP+F G+NPESWVYRAEH+F+IN  ++ EKV      
Subjt:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------

Query:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE
                                K  MFE F+  G +SLGARLIRI+Q+GSY +Y+KKF+ YSAPL  MAES+L+DAFVTGLE +LQAEV+SRHP TLE
Subjt:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE

Query:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY
        DCM+EAQ+V+DR+ A+KL+  E G        +     K K   + +K   + +++ MKQ++IPIKGN+ + EP VK LS++EFRARLD+GLCFRCN+KY
Subjt:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY

Query:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL
        SPGHRCK K  RELMFFI NEEEE EE + +EE   ETVEL+TLE+  +  I L+T+   +SKG +                              +L L
Subjt:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL

Query:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------
         ++  T FG TIG+ T   G G+C+RVEVKL E+ I+ADFLA++LG +D VLGMQWL T G + +HWP++TM+F  G  Q                    
Subjt:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------

Query:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRF-------KGLV----VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML
                        +++E ++   ++ +EEGDEA++PMIR LL ++       KGL     +DHRILT+  QKPINVRPYKYG+IQK EIEKLV EML
Subjt:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRF-------KGLV----VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML

Query:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF
        Q G+IRPSR  YSSPVLL++K +GG RFCVDYRKLNQ T+SDKFPI VIEELLDEL+GA VFSKLDL+SGYHQI+MKEED+EKTAFRTHEGHYEFLVMPF
Subjt:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF

Query:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP
        GLTNAPATFQSLMNQ+FKPFLRR VLVFF D+L+YS DITEHE+HLGMVF V++DNQL+AN KKCV  HS++ YLGH ISK GVEAD +K+++MVNWP+P
Subjt:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP

Query:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL
         DV+ELRGFL LTGYYRRFVKGY NIA PLTKLLQ+N F WNEDA  AF  LK AM ++ VLALPD+SLPF I TDASG GLGAVLSQ G PIA++SQKL
Subjt:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL

Query:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL
        S RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPG LNKAADALSR+ P +EL+ + +  +
Subjt:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL

Query:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC
        +D+ +V EEV +DE LQ I+  LK+  E    ++W + +LLYKG+LVL +TSSLIP LLHTFHDSVLGGHSGFLRTYK M+GE++W GMKND+KKYVEQC
Subjt:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC

Query:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS
        E CQRNK +A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRL+K A+FITLKHPF+AKQVA  F  +I+ +HGIPNSI++DRDK+FLS
Subjt:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS

Query:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ
         FW+ELF +MG  LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KW + IPWAELWYNTTFHS  KTTPF+ VYGR PP L+ YGD+K  NN VE 
Subjt:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ

Query:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL
        +LKERD  +SALKENL  AQNRMKK ADL+RRELK KVG+EVYLKL+PYRQRSLA+K+ EKLAP++YGPY+++EEIG VAYRLDLP EA IHNVFHISQL
Subjt:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL

Query:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        K KLG  Q +QH  P LTE+FELQ++PE VLG+RWN E+GAN+WL         +ATWESVY MNQQ+PSFHLEDK   +P
Subjt:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

TYK21035.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0060.81Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----
        +M KS++ L +E+++ S  K++EES  S+GS +K KGK  ETD+         DRSKYKKLEMP+F G+NPESWVYRAEH+F+IN   + EKVK      
Subjt:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----

Query:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED
                                  MFE F+  G  SLGARLIRI+Q+GSY DY+KKF+NYSAPL  MAES+L DAF+TGLE  LQAEV+SRHP TLED
Subjt:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED

Query:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS
        CM  AQ+V+DR+LA+KLA  E G   P+ SE+       K+  N +K   + +E+ MKQ++IP+KG+Y + EP VK LS++EFRARLDKGLCFRCNEKYS
Subjt:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS

Query:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP
         GHRCK K  RELM FI NEEE  EE  G E      VE+  LE   +T I  R I   T+KG                             ++ E  +P
Subjt:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP

Query:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------
        +   T+FG+TIGD T+ +G G+C +VE++L  L++V D L + LG IDVVLGMQWL T G + +HWP++TM F        + G                
Subjt:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------

Query:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGL-----------VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ
                     Q+ EIE  + +     + GDE  LPMI+ LLH++  +           ++DHRILT+ GQKPINVRPYKYG+ QKEEIEKLV EMLQ
Subjt:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGL-----------VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ

Query:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG
         GIIRPS   +SSPVLL++K DGG RFCVDYRKLN++T++DKFPI VIEELLDELHGA VFSKLDL+SGYHQI+M+EED+EKTAFRTHEGHYEF+VMPFG
Subjt:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK
        LTNAPATFQSLMNQ+FKPFLRR VLVFFDD+L+YS+DITEHE+HLGMVF  ++DNQL+AN KKCV  HS+++YLGH ISK GVEAD +KVK+M+ WP+PK
Subjt:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK

Query:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS
        DV+ LRGFLGLTGYYRRFVKGYG IAAPLTKLLQ+N F W+E+A  AFESLK AM ++ VLALPD+SLPF I TDASG GLGAVLSQN  PIA+FSQKLS
Subjt:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS

Query:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL
         RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPGL NKAADALSRM+  +EL  L +  ++
Subjt:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL

Query:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE
        D+E+V +EV++DEELQ +I+ L+ NP  +  Y   +  L+YKG++VLSK+SS+IP+LLHTFHDS+LGGHSGFLRTYK M+GE+ W GMK D+KKYVEQCE
Subjt:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE

Query:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ
         CQRNK++A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRLSK A+F+T+KHPF+AKQVA +F  +I+ +HGIP SI++DRDK+F+S 
Subjt:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ

Query:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL
        FWKELF AM   LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KWH+ IPWAELWYNTTFHS  +TTPFQ VYGRPPP LI YGD+KT N+ VE L
Subjt:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL

Query:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK
        LKERDL ISALKENL  AQNRMKK AD +RRELKFKVGDEVYLKLRPYRQRSLA+KR EKLAPK+YGPYR+ E IGEVAYRLDLP EA IHNVFHISQLK
Subjt:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK

Query:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        LKLG   ++Q   P LT +FELQ+ PE VLG+RW+ E+GAN+WL         EATWESVY+MNQQ+PSFHLEDK   +P
Subjt:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

TrEMBL top hitse value%identityAlignment
A0A5A7UYM1 Ty3/gypsy retrotransposon protein0.0e+0060.84Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------
        +M K++E L +E+KE+   K++EES   S+GS +K KGK  ET+  +   G  +DRSKYKKLEMP+F G+NPESWVYRAEH+F+IN  ++ EKV      
Subjt:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------

Query:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE
                                K  MFE F+  G +SLGARLIRI+Q+GSY +Y+KKF+ YSAPL  MAES+L+DAFVTGLE +LQAEV+SRHP TLE
Subjt:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE

Query:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY
        DCM+EAQ+V+DR+ A+KL+  E G        +     K K   + +K   + +++ MKQ++IPIKGN+ + EP VK LS++EFRARLD+GLCFRCN+KY
Subjt:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY

Query:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL
        SPGHRCK K  RELMFFI NEEEE EE + +EE   ETVEL+TLE+  +T I L+T+   +SKG +                              EL L
Subjt:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL

Query:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------
         ++  T FG TIG+ T   G G+C+RVEVKL E+ I+ADFLA++LG +D VLGMQWL T G + ++WP++TM+F  G  Q                    
Subjt:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------

Query:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML
                        +E+E ++   ++ +EEGDEA++PMIR LL ++K +            +DHRILT+  QKPINVRPYKYG+IQK EIEKLV EML
Subjt:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML

Query:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF
        Q G+IRPSR  YSSPVLL++K DGG RFCVDYRKLNQ T+SDKFPI VIEELLDEL+GA VFSKLDL+SGYHQI+MKEEDVEKTAFRTHEGHYEFLVMPF
Subjt:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF

Query:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP
        GLTNAPATFQSLMNQ+FKPFLRR VLVFFDD+L+YS DI E E+HLGMVF V++DNQL+AN KKCV  HS++ YLGH ISK GVEAD +K+++MVNWP+P
Subjt:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP

Query:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL
         DV+ELRGFLGLTGYYRRFVKGY NIA PLTKLLQ+N F WNEDA  AF  LK AM ++ VLALPD+SLPF I TDASG GLGAVLSQ G PIA++SQKL
Subjt:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL

Query:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL
        S RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPG LNKAADALSR+ P +EL+ + +  +
Subjt:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL

Query:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC
        +D+ +V EEV +DE LQ I+  LK+  E    ++W + +LLYKG+LVL +TSSLIP LLHTFHDSVLGGHSGFLRTYK M+GE++W GMKND+KKYVEQC
Subjt:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC

Query:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS
        E CQRNK +A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRL+K A+FITLKHPF+AKQVA  F  +I+ +HGIPNSI++DRDK+FLS
Subjt:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS

Query:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ
         FW+ELF +MG  LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KW + IPWAELWYNT FH+  KTTPF+ VYGR PP L+ YGD+KT NN VE 
Subjt:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ

Query:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL
        +LKERD  +SALKENL  AQNRMKK ADL+RRELK KVG+EVYLKL+PYRQRSLA+K+ EKLAP++YGPY+++EEIG VAYRLDLP EA IHNVFHISQL
Subjt:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL

Query:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        K KLG  Q +QH  P LTE+FELQ++PE VLG+RWN E+GAN+WL         +ATWESVY MNQQ+PSFHLEDK   +P
Subjt:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

A0A5A7VHA7 Ty3/gypsy retrotransposon protein0.0e+0065.35Show/hide
Query:  LTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV---------------
        +TEEVKE  R K REES ASEGS LK KGK GET+    FT  SSDRSKYKKL +P+F G+NPE+W+YRAEHYFDINE  DEEKV               
Subjt:  LTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV---------------

Query:  ---------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLEDCMQEAQMV
                       K+ MFEHFK+PG  SLGARLIRIKQDG Y DYLKKFL YSAPL EMAES+LIDAFVTGLETNLQ EV SRHPVTLE+C       
Subjt:  ---------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLEDCMQEAQMV

Query:  SDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPS--VKWLSNSEFRARLDKGLCFRCNEKYSPGHRCK
                       S  P                       K +++ MKQ+++PIKGN+ ++EP   VK LS+SEFRARLDKGLCF C E+        
Subjt:  SDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPS--VKWLSNSEFRARLDKGLCFRCNEKYSPGHRCK

Query:  GKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVINELNLPVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKI
                  +  E+E+ +EV         T E E   + G  +            GV+ EL LP+E + KFGVTIGD TALEG G+CK VE+KLPEL I
Subjt:  GKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVINELNLPVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKI

Query:  VADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTT-----------------------------------QKIEIEENNEEDSEREEEGDEA
        VADFL I+LG +DVVLGMQW  T GF+ +HWP+MTM FM G++                                   QKIEIE  +E +SE+EEEG+E+
Subjt:  VADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTT-----------------------------------QKIEIEENNEEDSEREEEGDEA

Query:  NLPMIRKLLHRF-------KGL----VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLN
        NLPMIR LL +        KGL     VDHRIL  +GQ PINVRPYKYGYIQK EIE+LV+EMLQA IIRPSR  YSSPVLL++K DGG RFCVDY KLN
Subjt:  NLPMIRKLLHRF-------KGL----VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLN

Query:  QVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYS
        QVTV+DKFPI VIEELLDELHGAEVFSKLDLRSGYHQI+MKEED+EKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNQIF+PFLRRFVL          
Subjt:  QVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYS

Query:  TDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQ
                          DNQLFANEKKCVIGHSR+NYL                                GFLGLTGYYRRFVK YGNIAAPLTKLLQ+
Subjt:  TDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQ

Query:  NGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLSPRAQAKSIYERELMAVVMA----------------
        NGF+W EDA  AFESLKQAMISV VLALPDFSLPF I TDASG GLGAVLSQN RPIAYFSQKLSP  QAKSIYERELMAVVMA                
Subjt:  NGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLSPRAQAKSIYERELMAVVMA----------------

Query:  --------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLLDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWV
                            RWLTKLLGYDFEILYQPGL NKAADALSR+    E ++L +  LLD+++V  EV++DEELQGIIE+LKE+PEGKANYQW 
Subjt:  --------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLLDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWV

Query:  SDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIE
        +  L YKG+LVLS+ S+LIP+LLHTFHDSVLGGH GFLRTYK MNGEIHW+GMKNDVKKYVEQCE CQRNKT+AL+PAGLLQPLPLPNLILEDWTMDFIE
Subjt:  SDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIE

Query:  GLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLE
        GLPKAGG+DSIMVVVDRLSKMAHFI LKHPF+AKQVAEKF +EIISKHGI NSIVTDRDKVFLS FWKELFTAMG SLKRSTAFHPQTDGQTE+VNRCLE
Subjt:  GLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLE

Query:  TYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQADLRRRELKF
        TYL CFCNEQPTKWHKCIPWAELWYNTTFH+ AKTTPFQ+VYGRPPP L+ YGD K+NNNSVEQLLKERDLVISALKENL+ AQNRMKKQADL R+ELKF
Subjt:  TYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQADLRRRELKF

Query:  KVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWN
        KVGDEVYLKLRPYRQRSLA+KRCEKLAPKFYGPYR++EEIGEVAYRL+LP EA+IHNVFH+SQLKLKLG +  +QH+PPALTE+FELQVEPEAVLG+RWN
Subjt:  KVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWN

Query:  SEMGANDWL---------EATWESVYAMNQQYPSFHLEDK
        +++GAN+WL         EATWE V AMNQQYPSFHLEDK
Subjt:  SEMGANDWL---------EATWESVYAMNQQYPSFHLEDK

A0A5D3BBH7 Ty3/gypsy retrotransposon protein0.0e+0060.88Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----
        +M KS++ L +E+++ S  K++EES  S+GS +K KGK  ETD+         DRSKYKKLEMP+F G+NPESWVYRAEH+F+IN   + EKVK      
Subjt:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----

Query:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED
                                  MFE F+  G +SLGARLIRI+Q+GSY DY+KKF+NYSAPL  MAES+L DAF+TGLE  LQAEV+SRHP TLED
Subjt:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED

Query:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS
        CM  AQ+V+DR+LA+KLA  E G   P+ SE+       K+  N EK   + +E+ MKQ++IP+KG+Y + EP VK LS++EFRARLDKGLCFRCNEKYS
Subjt:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS

Query:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP
         GHRCK K  RELM FI NEEE  EE  G E    E VE+  L    +T I  R I   T+KG                             ++ E  +P
Subjt:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP

Query:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------
        +   T+FG+TIGD T+ +G G+C +VE++L  L++V D L + LG IDVVLGMQWL T G + +HWP++TM F        + G                
Subjt:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------

Query:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ
                     Q+ EIE  + +       GDE  LPMI+ LLH++  +            +DHRILT+ GQKPINVRPYKYG+ QKEEIEKLV EMLQ
Subjt:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGLV-----------VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ

Query:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG
         GIIRPS   +SSPVLL++K DGG RFCVDYRKLN++T++DKFPI VIEELLDELHGA VFSKLDL+SGYHQI+M+EED+EKTAFRTHEGHYEF+VMPFG
Subjt:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK
        LTNAPATFQSLMNQ+FKPFLRR VLVFFDD+L+YS+DITEHE+HLGMVF  ++DNQL+AN KKCV  HS+++YLGH ISK GVEAD +KVK+M+ WP+PK
Subjt:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK

Query:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS
        DV+ LRGFLGLTGYYRRFVKGYG IAAPLTKLLQ+N F W+E+A  AFESLK AM ++ VLALPD+SLPF I TDASG GLGAVLSQN  PIA+FSQKLS
Subjt:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS

Query:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL
         RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPGL NKAADALSRM+  +EL  L +  ++
Subjt:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL

Query:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE
        D+E+V +EV++DEELQ +I+ L+ NP  +  Y   +  L+YKG++VLSK+SS+IP+LLHTFHDS+LGGHSGFLRTYK M+GE+ W GMK D+KKYVEQCE
Subjt:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE

Query:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ
         CQRNK++A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRLSK A+F+T+KHPF+AKQVA +F  +I+ +HGIP SI++DRDK+F+S 
Subjt:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ

Query:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL
        FWKELF AM   LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KWH+ IPWAELWYNTTFHS  +TTPFQ VYGRPPP LI YGD+KT N+ VE L
Subjt:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL

Query:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK
        LKERDL ISALKENL  AQNRMKK AD +RRELKFKVGDEVYLKLRPYRQRSLA+KR EKLAPK+YGPYR+ E IGEVAYRLDLP EA IHNVFHISQLK
Subjt:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK

Query:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        LKLG   ++Q   P LT +FELQ+ PE VLG+RW+ E+GAN+WL         EATWESVY+MNQQ+PSFHLEDK + +P
Subjt:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

A0A5D3BJ50 Ty3/gypsy retrotransposon protein0.0e+0060.7Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------
        +M K++E L +E+KE+   K++EES   S+GS +K KGK  ET+  +   G  +DR+KYKKLEMP+F G+NPESWVYRAEH+F+IN  ++ EKV      
Subjt:  DMHKSIENLTEEVKESSRTKQREES-CASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKV------

Query:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE
                                K  MFE F+  G +SLGARLIRI+Q+GSY +Y+KKF+ YSAPL  MAES+L+DAFVTGLE +LQAEV+SRHP TLE
Subjt:  ------------------------KKGMFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLE

Query:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY
        DCM+EAQ+V+DR+ A+KL+  E G        +     K K   + +K   + +++ MKQ++IPIKGN+ + EP VK LS++EFRARLD+GLCFRCN+KY
Subjt:  DCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKY

Query:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL
        SPGHRCK K  RELMFFI NEEEE EE + +EE   ETVEL+TLE+  +  I L+T+   +SKG +                              +L L
Subjt:  SPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN-----------------------------ELNL

Query:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------
         ++  T FG TIG+ T   G G+C+RVEVKL E+ I+ADFLA++LG +D VLGMQWL T G + +HWP++TM+F  G  Q                    
Subjt:  PVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQ--------------------

Query:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRF-------KGLV----VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML
                        +++E ++   ++ +EEGDEA++PMIR LL ++       KGL     +DHRILT+  QKPINVRPYKYG+IQK EIEKLV EML
Subjt:  ---------------KIEIEENNEEDSEREEEGDEANLPMIRKLLHRF-------KGLV----VDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEML

Query:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF
        Q G+IRPSR  YSSPVLL++K +GG RFCVDYRKLNQ T+SDKFPI VIEELLDEL+GA VFSKLDL+SGYHQI+MKEED+EKTAFRTHEGHYEFLVMPF
Subjt:  QAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPF

Query:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP
        GLTNAPATFQSLMNQ+FKPFLRR VLVFF D+L+YS DITEHE+HLGMVF V++DNQL+AN KKCV  HS++ YLGH ISK GVEAD +K+++MVNWP+P
Subjt:  GLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQP

Query:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL
         DV+ELRGFL LTGYYRRFVKGY NIA PLTKLLQ+N F WNEDA  AF  LK AM ++ VLALPD+SLPF I TDASG GLGAVLSQ G PIA++SQKL
Subjt:  KDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKL

Query:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL
        S RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPG LNKAADALSR+ P +EL+ + +  +
Subjt:  SPRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRL

Query:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC
        +D+ +V EEV +DE LQ I+  LK+  E    ++W + +LLYKG+LVL +TSSLIP LLHTFHDSVLGGHSGFLRTYK M+GE++W GMKND+KKYVEQC
Subjt:  LDVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQC

Query:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS
        E CQRNK +A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRL+K A+FITLKHPF+AKQVA  F  +I+ +HGIPNSI++DRDK+FLS
Subjt:  ETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLS

Query:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ
         FW+ELF +MG  LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KW + IPWAELWYNTTFHS  KTTPF+ VYGR PP L+ YGD+K  NN VE 
Subjt:  QFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQ

Query:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL
        +LKERD  +SALKENL  AQNRMKK ADL+RRELK KVG+EVYLKL+PYRQRSLA+K+ EKLAP++YGPY+++EEIG VAYRLDLP EA IHNVFHISQL
Subjt:  LLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQL

Query:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        K KLG  Q +QH  P LTE+FELQ++PE VLG+RWN E+GAN+WL         +ATWESVY MNQQ+PSFHLEDK   +P
Subjt:  KLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

A0A5D3E325 Ty3/gypsy retrotransposon protein0.0e+0060.81Show/hide
Query:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----
        +M KS++ L +E+++ S  K++EES  S+GS +K KGK  ETD+         DRSKYKKLEMP+F G+NPESWVYRAEH+F+IN   + EKVK      
Subjt:  DMHKSIENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKG----

Query:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED
                                  MFE F+  G  SLGARLIRI+Q+GSY DY+KKF+NYSAPL  MAES+L DAF+TGLE  LQAEV+SRHP TLED
Subjt:  --------------------------MFEHFKTPGGESLGARLIRIKQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLED

Query:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS
        CM  AQ+V+DR+LA+KLA  E G   P+ SE+       K+  N +K   + +E+ MKQ++IP+KG+Y + EP VK LS++EFRARLDKGLCFRCNEKYS
Subjt:  CMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYVMKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYS

Query:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP
         GHRCK K  RELM FI NEEE  EE  G E      VE+  LE   +T I  R I   T+KG                             ++ E  +P
Subjt:  PGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKG-----------------------------VINELNLP

Query:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------
        +   T+FG+TIGD T+ +G G+C +VE++L  L++V D L + LG IDVVLGMQWL T G + +HWP++TM F        + G                
Subjt:  VEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTF--------MAGTT--------------

Query:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGL-----------VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ
                     Q+ EIE  + +     + GDE  LPMI+ LLH++  +           ++DHRILT+ GQKPINVRPYKYG+ QKEEIEKLV EMLQ
Subjt:  -------------QKIEIEENNEEDSEREEEGDEANLPMIRKLLHRFKGL-----------VVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQ

Query:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG
         GIIRPS   +SSPVLL++K DGG RFCVDYRKLN++T++DKFPI VIEELLDELHGA VFSKLDL+SGYHQI+M+EED+EKTAFRTHEGHYEF+VMPFG
Subjt:  AGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFG

Query:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK
        LTNAPATFQSLMNQ+FKPFLRR VLVFFDD+L+YS+DITEHE+HLGMVF  ++DNQL+AN KKCV  HS+++YLGH ISK GVEAD +KVK+M+ WP+PK
Subjt:  LTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPK

Query:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS
        DV+ LRGFLGLTGYYRRFVKGYG IAAPLTKLLQ+N F W+E+A  AFESLK AM ++ VLALPD+SLPF I TDASG GLGAVLSQN  PIA+FSQKLS
Subjt:  DVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNGRPIAYFSQKLS

Query:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL
         RAQAKSIYERELMAVV++                                    +WLTKLLGYDFEILYQPGL NKAADALSRM+  +EL  L +  ++
Subjt:  PRAQAKSIYERELMAVVMA------------------------------------RWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLL

Query:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE
        D+E+V +EV++DEELQ +I+ L+ NP  +  Y   +  L+YKG++VLSK+SS+IP+LLHTFHDS+LGGHSGFLRTYK M+GE+ W GMK D+KKYVEQCE
Subjt:  DVEMVMEEVKRDEELQGIIEVLKENPEGKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCE

Query:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ
         CQRNK++A  PAG+LQPLP+P+ ILEDWTMDFIEGLPKAGG + IMVVVDRLSK A+F+T+KHPF+AKQVA +F  +I+ +HGIP SI++DRDK+F+S 
Subjt:  TCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQ

Query:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL
        FWKELF AM   LKRSTAFHPQTDGQTE+VN+CLETYL CFCNEQP KWH+ IPWAELWYNTTFHS  +TTPFQ VYGRPPP LI YGD+KT N+ VE L
Subjt:  FWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQL

Query:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK
        LKERDL ISALKENL  AQNRMKK AD +RRELKFKVGDEVYLKLRPYRQRSLA+KR EKLAPK+YGPYR+ E IGEVAYRLDLP EA IHNVFHISQLK
Subjt:  LKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLK

Query:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP
        LKLG   ++Q   P LT +FELQ+ PE VLG+RW+ E+GAN+WL         EATWESVY+MNQQ+PSFHLEDK   +P
Subjt:  LKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWL---------EATWESVYAMNQQYPSFHLEDKKAAQP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein8.1e-12432.33Show/hide
Query:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL
        KL    KGL  +  +   + + PI   P   G +Q    E  + + L++GIIR S+ + + PV+ + K +G  R  VDY+ LN+    + +P+ +IE+LL
Subjt:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL

Query:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM
         ++ G+ +F+KLDL+S YH I++++ D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DD+LI+S   +EH +H+  V   +
Subjt:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM

Query:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL
        K+  L  N+ KC    S+V ++G+ IS+KG     E +  ++ W QPK+  ELR FLG   Y R+F+     +  PL  LL+++  + W     +A E++
Subjt:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL

Query:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------
        KQ ++S  VL   DFS    + TDAS V +GAVLSQ        P+ Y+S K+S      S+ ++E++A++                             
Subjt:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------

Query:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA
                   +ARW   L  ++FEI Y+PG  N  ADALSR+     PI +      +N +  + + D     V+ E   D +L  ++    E+   + 
Subjt:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA

Query:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED
        N Q + D LL   K +++L   + L  T++  +H+     H G       +     W G++  +++YV+ C TCQ NK++   P G LQP+P      E 
Subjt:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED

Query:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE
         +MDFI  LP++ G++++ VVVDR SKMA  +      TA+Q A  F + +I+  G P  I+ D D +F SQ WK+        +K S  + PQTDGQTE
Subjt:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE

Query:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ
        + N+ +E  L C C+  P  W   I   +  YN   HS  + TPF+IV+   P   P  +     KT+ NS E +      V   +KE+L T   +MKK 
Subjt:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ

Query:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL
         D++ +E+ +F+ GD V +K    R ++    +  KLAP F GP+ V+++ G   Y LDLP  ++ M  + FH+S L+    K +H   L  A  ++ ++
Subjt:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL

P0CT35 Transposon Tf2-2 polyprotein8.1e-12432.33Show/hide
Query:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL
        KL    KGL  +  +   + + PI   P   G +Q    E  + + L++GIIR S+ + + PV+ + K +G  R  VDY+ LN+    + +P+ +IE+LL
Subjt:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL

Query:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM
         ++ G+ +F+KLDL+S YH I++++ D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DD+LI+S   +EH +H+  V   +
Subjt:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM

Query:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL
        K+  L  N+ KC    S+V ++G+ IS+KG     E +  ++ W QPK+  ELR FLG   Y R+F+     +  PL  LL+++  + W     +A E++
Subjt:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL

Query:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------
        KQ ++S  VL   DFS    + TDAS V +GAVLSQ        P+ Y+S K+S      S+ ++E++A++                             
Subjt:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------

Query:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA
                   +ARW   L  ++FEI Y+PG  N  ADALSR+     PI +      +N +  + + D     V+ E   D +L  ++    E+   + 
Subjt:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA

Query:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED
        N Q + D LL   K +++L   + L  T++  +H+     H G       +     W G++  +++YV+ C TCQ NK++   P G LQP+P      E 
Subjt:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED

Query:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE
         +MDFI  LP++ G++++ VVVDR SKMA  +      TA+Q A  F + +I+  G P  I+ D D +F SQ WK+        +K S  + PQTDGQTE
Subjt:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE

Query:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ
        + N+ +E  L C C+  P  W   I   +  YN   HS  + TPF+IV+   P   P  +     KT+ NS E +      V   +KE+L T   +MKK 
Subjt:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ

Query:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL
         D++ +E+ +F+ GD V +K    R ++    +  KLAP F GP+ V+++ G   Y LDLP  ++ M  + FH+S L+    K +H   L  A  ++ ++
Subjt:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL

P0CT36 Transposon Tf2-3 polyprotein8.1e-12432.33Show/hide
Query:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL
        KL    KGL  +  +   + + PI   P   G +Q    E  + + L++GIIR S+ + + PV+ + K +G  R  VDY+ LN+    + +P+ +IE+LL
Subjt:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL

Query:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM
         ++ G+ +F+KLDL+S YH I++++ D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DD+LI+S   +EH +H+  V   +
Subjt:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM

Query:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL
        K+  L  N+ KC    S+V ++G+ IS+KG     E +  ++ W QPK+  ELR FLG   Y R+F+     +  PL  LL+++  + W     +A E++
Subjt:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL

Query:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------
        KQ ++S  VL   DFS    + TDAS V +GAVLSQ        P+ Y+S K+S      S+ ++E++A++                             
Subjt:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------

Query:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA
                   +ARW   L  ++FEI Y+PG  N  ADALSR+     PI +      +N +  + + D     V+ E   D +L  ++    E+   + 
Subjt:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA

Query:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED
        N Q + D LL   K +++L   + L  T++  +H+     H G       +     W G++  +++YV+ C TCQ NK++   P G LQP+P      E 
Subjt:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED

Query:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE
         +MDFI  LP++ G++++ VVVDR SKMA  +      TA+Q A  F + +I+  G P  I+ D D +F SQ WK+        +K S  + PQTDGQTE
Subjt:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE

Query:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ
        + N+ +E  L C C+  P  W   I   +  YN   HS  + TPF+IV+   P   P  +     KT+ NS E +      V   +KE+L T   +MKK 
Subjt:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ

Query:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL
         D++ +E+ +F+ GD V +K    R ++    +  KLAP F GP+ V+++ G   Y LDLP  ++ M  + FH+S L+    K +H   L  A  ++ ++
Subjt:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL

P0CT37 Transposon Tf2-4 polyprotein8.1e-12432.33Show/hide
Query:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL
        KL    KGL  +  +   + + PI   P   G +Q    E  + + L++GIIR S+ + + PV+ + K +G  R  VDY+ LN+    + +P+ +IE+LL
Subjt:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL

Query:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM
         ++ G+ +F+KLDL+S YH I++++ D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DD+LI+S   +EH +H+  V   +
Subjt:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM

Query:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL
        K+  L  N+ KC    S+V ++G+ IS+KG     E +  ++ W QPK+  ELR FLG   Y R+F+     +  PL  LL+++  + W     +A E++
Subjt:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL

Query:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------
        KQ ++S  VL   DFS    + TDAS V +GAVLSQ        P+ Y+S K+S      S+ ++E++A++                             
Subjt:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------

Query:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA
                   +ARW   L  ++FEI Y+PG  N  ADALSR+     PI +      +N +  + + D     V+ E   D +L  ++    E+   + 
Subjt:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA

Query:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED
        N Q + D LL   K +++L   + L  T++  +H+     H G       +     W G++  +++YV+ C TCQ NK++   P G LQP+P      E 
Subjt:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED

Query:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE
         +MDFI  LP++ G++++ VVVDR SKMA  +      TA+Q A  F + +I+  G P  I+ D D +F SQ WK+        +K S  + PQTDGQTE
Subjt:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE

Query:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ
        + N+ +E  L C C+  P  W   I   +  YN   HS  + TPF+IV+   P   P  +     KT+ NS E +      V   +KE+L T   +MKK 
Subjt:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ

Query:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL
         D++ +E+ +F+ GD V +K    R ++    +  KLAP F GP+ V+++ G   Y LDLP  ++ M  + FH+S L+    K +H   L  A  ++ ++
Subjt:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL

P0CT41 Transposon Tf2-12 polyprotein8.1e-12432.33Show/hide
Query:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL
        KL    KGL  +  +   + + PI   P   G +Q    E  + + L++GIIR S+ + + PV+ + K +G  R  VDY+ LN+    + +P+ +IE+LL
Subjt:  KLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELL

Query:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM
         ++ G+ +F+KLDL+S YH I++++ D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DD+LI+S   +EH +H+  V   +
Subjt:  DELHGAEVFSKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVM

Query:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL
        K+  L  N+ KC    S+V ++G+ IS+KG     E +  ++ W QPK+  ELR FLG   Y R+F+     +  PL  LL+++  + W     +A E++
Subjt:  KDNQLFANEKKCVIGHSRVNYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQN-GFNWNEDAIEAFESL

Query:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------
        KQ ++S  VL   DFS    + TDAS V +GAVLSQ        P+ Y+S K+S      S+ ++E++A++                             
Subjt:  KQAMISVLVLALPDFSLPFNIITDASGVGLGAVLSQNG-----RPIAYFSQKLSPRAQAKSIYERELMAVV-----------------------------

Query:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA
                   +ARW   L  ++FEI Y+PG  N  ADALSR+     PI +      +N +  + + D     V+ E   D +L  ++    E+   + 
Subjt:  -----------MARWLTKLLGYDFEILYQPGLLNKAADALSRM----NPIVE------LNTLISLRLLD--VEMVMEEVKRDEELQGIIEVLKENPEGKA

Query:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED
        N Q + D LL   K +++L   + L  T++  +H+     H G       +     W G++  +++YV+ C TCQ NK++   P G LQP+P      E 
Subjt:  NYQWVSDKLLY--KGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILED

Query:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE
         +MDFI  LP++ G++++ VVVDR SKMA  +      TA+Q A  F + +I+  G P  I+ D D +F SQ WK+        +K S  + PQTDGQTE
Subjt:  WTMDFIEGLPKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTE

Query:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ
        + N+ +E  L C C+  P  W   I   +  YN   HS  + TPF+IV+   P   P  +     KT+ NS E +      V   +KE+L T   +MKK 
Subjt:  QVNRCLETYLCCFCNEQPTKWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPP---PSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQ

Query:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL
         D++ +E+ +F+ GD V +K    R ++    +  KLAP F GP+ V+++ G   Y LDLP  ++ M  + FH+S L+    K +H   L  A  ++ ++
Subjt:  ADLRRREL-KFKVGDEVYLKLRPYRQRSLAKKRCEKLAPKFYGPYRVMEEIGEVAYRLDLP--LEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFEL

Arabidopsis top hitse value%identityAlignment
AT5G37630.1 ARM repeat superfamily protein1.5e-24255.85Show/hide
Query:  SIKQRTTILQRGLADRSQAVSKECLKLMTDEWLHNCCHGNPVELLEYLDVETYERVGESVMGALLGASLLKLHDDASIQHYIQTSSSATEGDSPHCSPTI
        SIK RTTILQRGLADR+  VS ECLKLM ++WL N C G+P+  L+YLDVETYE V ES +  LL   L+   DD SIQ YI ++   T  +S   +P+I
Subjt:  SIKQRTTILQRGLADRSQAVSKECLKLMTDEWLHNCCHGNPVELLEYLDVETYERVGESVMGALLGASLLKLHDDASIQHYIQTSSSATEGDSPHCSPTI

Query:  QLMEPEVSLYWRMICKHILTEAHAKGSDAAASMGAEAAVYAAEASEKNDLLEKILPATISDYVGLVQAHINAGSSYRFASRQLLLLGTMLDYSDNANRKI
        QLMEPE++LYWR+IC+ +   A AKGSDAA +MGAEAAVYAAEAS+ NDLLE+ILPAT+SDYV LV+AHI AG ++ FASRQLLLLGTMLD+SD    K 
Subjt:  QLMEPEVSLYWRMICKHILTEAHAKGSDAAASMGAEAAVYAAEASEKNDLLEKILPATISDYVGLVQAHINAGSSYRFASRQLLLLGTMLDYSDNANRKI

Query:  AGAFLQEVLHISPDHELDDDGNLVVHGDGINLGGDRDWAISVSGLVRKVHAAAGEFEEIVLEVIEELARPCRERTANCVQWMHCLAVTSLLLENAKSLNF
          +F+QE+L    + ELD+DGN +V GDGINLGGD+DWA +VS L +KVHAA GE+EE++L V+EE+ARPCRERTA+ +QWMH L++TSLLLEN KSL+ 
Subjt:  AGAFLQEVLHISPDHELDDDGNLVVHGDGINLGGDRDWAISVSGLVRKVHAAAGEFEEIVLEVIEELARPCRERTANCVQWMHCLAVTSLLLENAKSLNF

Query:  INGKISGPAELLDSILLPGAKHVHLDVQRISIRCLGLYGLLNKRPNEKILKQLRHSFIKGLPPINIMACKALFDLVLWHGPQEVDKALGQDHILQSSFDK
        + GK   P E+L ++LLPGAKH HLDVQRI+I+ LGL+GLL K+P+E++++QLR +F +  PPI+IMACKAL DL +WH P EVDKA+GQD + Q   D 
Subjt:  INGKISGPAELLDSILLPGAKHVHLDVQRISIRCLGLYGLLNKRPNEKILKQLRHSFIKGLPPINIMACKALFDLVLWHGPQEVDKALGQDHILQSSFDK

Query:  TSFSSINLSEADEDWTTGSLDLLYAGFDNDEKYSSSATNEIESVQTIVTEGFAKILLLSENYPSIPASLHPPLLRKLVNVYFSSE-KDLERLKQCLSVFF
          F+ I+LS A+ED     LDLLYAG ++D+  +S+ ++E ESV+  V EGFAK+LLL E YP++PAS +P +L KL+ +YFS E K+  R KQCLSVFF
Subjt:  TSFSSINLSEADEDWTTGSLDLLYAGFDNDEKYSSSATNEIESVQTIVTEGFAKILLLSENYPSIPASLHPPLLRKLVNVYFSSE-KDLERLKQCLSVFF

Query:  EHYPSLTVSHKRWISESFVPVMRSMWPGMNNNVGGSAAEVANMRKRAVQASRFMLQMMQAPLYGNDTERKEEDGCMGNQEVAGSIGEPPLECTEDGLAIQ
        EHY SL+  HK ++S++FVP++RSMWPG++ N   S+  V+N RKRAVQ SRF+LQMMQ PLY  +T  + E     N+    SI + PL CTE+GLAI+
Subjt:  EHYPSLTVSHKRWISESFVPVMRSMWPGMNNNVGGSAAEVANMRKRAVQASRFMLQMMQAPLYGNDTERKEEDGCMGNQEVAGSIGEPPLECTEDGLAIQ

Query:  IAVEVASFRGKKTPAQKSYVSALCRVLVLLHFRPSEQGAIRVMRRLLCYLVDTASWDKDLVKELKRMGEHLTAIDKQPDLEVTQDQADLILDQLKLEFSF
        IA+E+ SF+ KKT  +K+YV+ALC++LVLLH +PSEQ   +++++LL  L D+   +KDL+KE+K + +HL ++D  P  E+TQDQA+ I + L + ++ 
Subjt:  IAVEVASFRGKKTPAQKSYVSALCRVLVLLHFRPSEQGAIRVMRRLLCYLVDTASWDKDLVKELKRMGEHLTAIDKQPDLEVTQDQADLILDQLKLEFSF

Query:  ---DAEIPPQTPVPCSTKPTRSRRRVKPESSSSDEAMSPTSVPNIVGTIGTRSERASKTAALTRIMNSALKTNNVVDEEDECEDSDDDGDEDDED
           +    PQTP PCSTKP RSRRR + E +SSDE    +  P+   T+ TRS RASK AAL +IM S +K +N VDE+DE E+   D   DD D
Subjt:  ---DAEIPPQTPVPCSTKPTRSRRRVKPESSSSDEAMSPTSVPNIVGTIGTRSERASKTAALTRIMNSALKTNNVVDEEDECEDSDDDGDEDDED

ATMG00860.1 DNA/RNA polymerases superfamily protein1.7e-3656.59Show/hide
Query:  HLGMVFNVMKDNQLFANEKKCVIGHSRVNYLG--HWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWN
        HLGMV  + + +Q +AN KKC  G  ++ YLG  H IS +GV AD  K++AMV WP+PK+ +ELRGFLGLTGYYRRFVK YG I  PLT+LL++N   W 
Subjt:  HLGMVFNVMKDNQLFANEKKCVIGHSRVNYLG--HWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWN

Query:  EDAIEAFESLKQAMISVLVLALPDFSLPF
        E A  AF++LK A+ ++ VLALPD  LPF
Subjt:  EDAIEAFESLKQAMISVLVLALPDFSLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTTATAACCGACTTTCTAGTGGTGGAACTGGGAAATATTGACGTGGTTTTAGGGATGCAATGGCTAGATACAACCGGCACAATGAAGGTACATTGGCCATCATT
AACCATGACCTTTTGGGCAAAAGACAAGCAAATCGTGTTAAGAGGAGATCCATCCTTAGTGAAAGCTGAATGTTCCTTGAAGACATTGGAAAAAACGTGGGATATAGAAG
ATCAAGGCTTCCTCTTAGAATTCCAGAATTATGAGATGGAAGTAGAAGATGAATATGAAACGGAAACAAAAGGAGATGAGGAAATACCTATGATCCGATCCCTGCTCAAG
TATTATGCGGAGATATTTGAGACACCGAATAAACTACCTCCAAAAAGAGCTATTGATCACCGAATATTAACCTTATCGGACCGTAAACCTATCAACGTAAGACCTTACAA
ATACGGTCATGTGCATAAAGAAGAAATCGAAAAACTTGTGGTCGAGATGCTTCAAGTAGGGGTTATTCGCCCTAGTCATAGCCCTTACTCCAGCCCGGTGTTGTTAGTGG
AAAAGAAAAATGAGGAATTATTAGATGAGCTGAATGGGGCAACGGTTTTCTCGAAATTAGATCTGAAATCTGGTTATCACCAGATTAGAATGAAAGAGGAAGATGTGGAG
AAAACTGCGTTCCGCACACATGAAGGGCACTACGAGTTTCTAGTATTTAAGCATTTCCATCGGCGCTGTGTTTTGGTGTTCTTTGATGATGTACTGGTTTATAGTGCGGA
TATAGATGAACACGCAAAGCATTTGGGCATGGAAGTTGAGAATGATGCTGAACTTAAGAAAATTATAGAAAATCTGAAGAAGAATGCGAACGAAGGAAGCAAGTTTCAAG
GAAGTGTCTTAACAAAGTGGTATCAGAGCCGCAAACATCCTGGGGAGGACACGAGCAATGACACAAAAACAGTGGGAGGAGAGATTGGAAACGATATGCATAAGAGTATA
GAAAATCTGACGGAAGAAGTGAAGGAAAGCAGTCGAACTAAGCAGAGAGAAGAATCTTGTGCCTCAGAGGGTTCTGGGCTTAAAAAGAAAGGAAAGGAGGGAGAGACGGA
CATGAACTCAGTTTTCACTGGAGCATCATCAGATCGAAGTAAATACAAGAAGTTAGAGATGCCTATCTTCACGGGTGATAATCCGGAATCATGGGTCTATAGGGCAGAGC
ATTATTTTGATATCAATGAGTTCGCTGACGAGGAGAAAGTGAAAAAAGGGATGTTCGAACACTTCAAGACACCCGGTGGAGAAAGTTTGGGGGCTCGTCTCATTCGGATT
AAACAAGATGGATCTTACCCCGATTATTTGAAGAAATTCCTGAATTATTCGGCGCCGTTACAGGAGATGGCGGAGAGCATCCTCATCGATGCATTTGTTACAGGTTTGGA
GACGAATTTACAAGCCGAAGTATTAAGTCGTCATCCCGTTACTTTGGAGGACTGTATGCAGGAAGCCCAAATGGTGAGTGATCGTGACTTGGCAATTAAGTTAGCCCTAA
ATGAATGTGGGAGTAATGGGCCAAGGGCATCTGAAGCCCAAACCCAAATGGAAAAAAAAAAGATGACGATAAATGCTGAAAAAAAAGAAGGAAAAGGCAGCGAATACGTA
ATGAAGCAAGTATCGATTCCGATTAAGGGAAATTATTCGAGAGAAGAACCGTCGGTGAAATGGCTATCAAATAGCGAATTTCGTGCAAGGCTAGACAAGGGGCTATGTTT
TCGATGTAACGAAAAATATTCACCAGGGCACCGGTGCAAGGGCAAAACGAATCGCGAGCTGATGTTTTTTATTGCGAATGAAGAAGAAGAATTGGAAGAAGTCAACGGGA
AGGAGGAAGGCGAATTGGAAACTGTTGAACTCGAAACCCTAGAAATCGAGGGGAAGACAGAAATAGCGCTTCGCACCATCTTGGGGTTTACATCAAAAGGAGTGATTAAT
GAGCTAAACTTACCCGTGGAGGAAAAAACTAAATTCGGAGTGACTATTGGCGACGAAACTGCACTAGAGGGAATAGGGGTTTGCAAGAGAGTCGAGGTTAAGCTGCCGGA
GCTAAAAATTGTGGCAGACTTCTTAGCAATCAAATTGGGTAGAATTGATGTGGTTTTGGGAATGCAGTGGCTCTGCACGATTGGATTCATTGGGGTTCATTGGCCGACGA
TGACAATGACGTTCATGGCGGGGACCACTCAAAAGATTGAAATTGAAGAAAATAATGAGGAAGACAGTGAAAGGGAGGAAGAGGGGGATGAAGCAAATTTACCTATGATT
CGCAAGCTGTTGCATAGGTTCAAGGGGCTGGTAGTAGATCATCGAATTCTGACTATCGACGGGCAAAAACCCATCAATGTACGCCCTTACAAGTATGGGTACATACAGAA
AGAAGAGATAGAGAAACTGGTGACAGAAATGCTTCAAGCAGGAATTATCCGGCCAAGCCGAAGACTATATTCCAGTCCGGTATTGCTAATAAGGAAACACGACGGTGGGC
GGAGATTCTGCGTAGACTACCGTAAATTGAATCAGGTGACTGTCTCCGATAAATTCCCCATTCTTGTAATCGAAGAACTGCTAGATGAGTTACATGGTGCTGAAGTTTTT
TCAAAGCTGGATTTGCGTTCGGGGTATCACCAAATTCAGATGAAGGAGGAGGATGTTGAAAAAACCGCGTTTCGCACTCACGAAGGGCATTATGAATTTCTAGTTATGCC
TTTCGGGTTAACCAATGCGCCGGCCACTTTTCAGTCCCTAATGAATCAAATTTTCAAGCCCTTCCTAAGACGGTTTGTGTTGGTCTTCTTCGACGATGTCCTGATTTATA
GCACTGATATTACCGAACACGAGCAACACTTGGGAATGGTGTTCAATGTAATGAAAGACAACCAGTTATTTGCCAATGAAAAGAAATGCGTAATAGGGCATTCACGAGTC
AATTATTTGGGGCACTGGATCTCTAAAAAAGGTGTAGAAGCGGACGGGGAAAAGGTGAAAGCAATGGTAAACTGGCCCCAACCCAAAGATGTTTCTGAATTAAGGGGATT
CCTCGGCCTCACGGGATATTATCGAAGATTTGTCAAAGGATATGGGAACATTGCCGCTCCCCTGACCAAATTGTTGCAACAAAACGGGTTCAATTGGAACGAAGATGCCA
TAGAAGCTTTCGAGTCACTGAAACAAGCAATGATCTCGGTGCTGGTCTTAGCACTCCCCGACTTTTCCTTACCATTTAACATTATAACTGACGCATCAGGTGTAGGGCTG
GGAGCAGTTTTATCACAGAATGGCCGACCAATAGCGTATTTTAGCCAGAAATTATCTCCTAGAGCACAAGCAAAGTCTATTTACGAAAGGGAACTGATGGCAGTGGTAAT
GGCGAGATGGTTGACTAAATTATTGGGTTATGATTTTGAAATTTTGTACCAACCCGGGCTGTTGAATAAAGCAGCAGACGCACTATCTCGCATGAACCCTATTGTGGAAT
TAAATACTTTAATCTCTCTGAGGCTGCTAGATGTTGAGATGGTGATGGAAGAAGTTAAAAGGGATGAGGAATTACAAGGAATAATTGAAGTCCTGAAGGAAAATCCAGAA
GGGAAAGCAAACTATCAATGGGTATCAGACAAATTGCTCTACAAGGGAAAGTTAGTCTTGTCGAAAACCTCTTCTCTTATTCCAACCTTACTCCATACTTTCCACGATTC
AGTGTTAGGGGGCCATTCGGGGTTTCTCAGAACATACAAGCATATGAATGGAGAGATACATTGGGTGGGCATGAAGAATGACGTAAAGAAATATGTCGAACAGTGTGAAA
CATGCCAAAGAAATAAAACAAAAGCACTCTCTCCAGCTGGATTATTACAGCCCTTGCCTTTACCCAACTTAATCCTTGAAGACTGGACGATGGACTTCATAGAAGGGTTG
CCTAAGGCGGGAGGTTTTGATAGTATCATGGTGGTGGTTGACCGACTAAGTAAAATGGCCCACTTCATTACCTTGAAGCACCCATTTACGGCAAAGCAAGTTGCCGAAAA
ATTTAGTAAAGAAATTATTAGCAAACATGGAATTCCAAACTCAATAGTAACGGACCGAGACAAAGTCTTCTTGAGTCAATTTTGGAAAGAATTATTTACAGCCATGGGGG
CTTCGCTGAAAAGAAGTACTGCATTTCACCCTCAAACGGATGGGCAGACTGAGCAGGTGAACCGATGCCTCGAAACGTATCTATGTTGTTTCTGTAACGAACAACCTACG
AAGTGGCATAAATGCATCCCTTGGGCTGAATTGTGGTACAACACCACTTTTCACTCGTTGGCAAAAACAACCCCGTTTCAAATTGTGTATGGGCGCCCCCCACCTTCGCT
GATTCTGTATGGGGATCGAAAGACCAATAATAACAGTGTAGAACAATTATTAAAAGAAAGGGATCTGGTTATTAGTGCATTAAAGGAGAATTTGCTGACAGCCCAGAATA
GAATGAAGAAACAAGCAGATCTAAGGCGTCGGGAACTGAAATTCAAAGTGGGAGATGAGGTTTATTTGAAATTACGACCTTACAGACAACGATCCTTAGCTAAAAAGCGC
TGCGAGAAGTTAGCTCCCAAGTTTTATGGGCCCTATCGTGTAATGGAGGAAATTGGGGAAGTAGCATATCGTTTGGACCTTCCACTTGAGGCAATGATCCATAACGTGTT
TCATATTTCACAGCTAAAACTTAAGTTGGGTAAAACACAACATATACAACACCTACCACCTGCATTGACTGAAGATTTTGAGCTGCAAGTGGAACCGGAAGCAGTGCTGG
GGGTTCGCTGGAACAGCGAGATGGGAGCTAATGATTGGTTGGAAGCGACTTGGGAATCTGTTTATGCTATGAACCAACAGTATCCTTCGTTTCACCTTGAAGACAAGAAG
GCAGCTCAGCCAATCGAATTAAATGAAATGAATAATTGGGTGTGGGGCCCTCCTGGGGATAGAGGGAAGTGTCTTAACAGAATGGTAGTATCTCAAACTTCTACACTTAT
ACCCAAGATCCTACATACTTTCCACGATTCTATCCTCAGGGGACATTCTGGATTTTTAAGAACTTACAAGAGGATGAGTGGAGAACTATATTGGAGGGGTATGAAGGCCG
AAGTAAAGAAATATGTGGAACAATGCGAGATCTGCCAGAGAAACAAATATGAAGCGACCAAGCCAGCTGGGGTACTTCAACCAATTCCAATTCCTGACAGAATCTTAGAA
GATTGGACCATGGACTTTATTGAGGGGTTGCCACTAGCTGGAGGGGTTAATGTGATCATGGTGGTAGTTGATCGGCTGAGTAAGTATTCTTACTTTATATCATTGAGACA
TCCGTTTTCAGCGAAGCAAGTTGCTTCCAAATTCATCGACAGAATAGTAAGCAAGCACGGCATTCCTAAGTCGATTATCACAGATCGGGATAAAATCTTTCTTAGTAATT
TTTGGAAAGAACTGTTCGCAACTATGGGGACTATTCTAAAGAGGAGTACCGCGTTTCACCGACAAACAGATGGGCAGACAGAAAGGACGGTATACGGCAGACCCCCTCCT
CCCCCGTTATCCTATGGTTACAAGAAAACCCCTAATAATAAGGTTGACACACTGTTGAAGGAACGTGACTTAGCCATCAATGCCTTAAAGGAAAACTTACCTTACCGGCA
ACGCTCCTTGGCTCGGAAGAAGTGTGAAAAGTTAGCACCTAAATATTATGGACCCTACAAAATTATTGAAGAAATAGGAGCAGTTGCGTACAGATTGATGCTACGTCCAG
AAGCCAAGTTTGAGTTACAATTATGGCCGGAAATAGTGTTGGGAGTCCGTTGGAATCGGGAATTGGGAGGAAATGAATGGTTAATCAAATGGAAAGGGCTGCCGGACAGT
GAAACAACTTGGGAATCTGTTTTCCAAATGAACCAACAATTCCCTGGCTTTCACCTTGAGAACAAGGTGAACCTAGAACCGAGGGAGGCTGCTAGGTCTAGGAAGGACTA
CTATCTTTTCCTGGGGAGCATCAAACAAAGAACAACAATTTTACAGAGAGGGCTTGCTGATCGTTCTCAAGCTGTTTCAAAGGAGTGCTTAAAATTGATGACAGATGAGT
GGCTACACAACTGCTGCCATGGAAATCCTGTAGAATTGCTAGAGTATCTTGATGTCGAAACCTATGAACGAGTTGGTGAGTCTGTTATGGGGGCTTTATTAGGAGCTAGT
TTGTTGAAACTGCATGACGATGCAAGTATCCAACATTATATACAAACTTCCAGCAGCGCGACAGAAGGAGACTCACCACATTGCAGTCCAACTATACAACTGATGGAACC
AGAAGTTTCTCTTTACTGGAGGATGATTTGTAAGCATATCCTAACAGAAGCACATGCAAAAGGTTCTGATGCTGCAGCTTCTATGGGTGCTGAAGCCGCAGTATATGCAG
CCGAAGCTTCTGAAAAAAATGACCTTTTAGAGAAAATTCTTCCTGCCACTATTTCTGATTATGTAGGCTTAGTCCAAGCTCATATTAATGCTGGGTCCAGTTATCGATTT
GCATCAAGACAGCTACTTTTGCTTGGAACCATGCTTGATTATTCTGATAATGCAAATAGGAAGATTGCTGGTGCATTTCTGCAGGAAGTGTTGCATATATCTCCAGATCA
TGAATTGGATGATGATGGGAACTTGGTTGTTCATGGAGATGGAATCAATCTTGGAGGAGATAGAGATTGGGCAATTTCTGTATCTGGGTTGGTTAGGAAAGTCCATGCTG
CAGCTGGTGAATTTGAAGAAATTGTTCTTGAGGTGATTGAAGAACTTGCTCGACCATGCAGGGAAAGAACTGCAAATTGTGTGCAGTGGATGCACTGTCTTGCTGTGACA
AGCCTTCTCCTGGAAAACGCTAAGTCATTGAATTTTATTAATGGAAAAATCTCTGGACCTGCTGAACTGCTGGATTCAATATTGCTTCCAGGGGCAAAACACGTTCATTT
GGATGTTCAGAGAATTAGTATCCGCTGTCTTGGTCTGTATGGATTGCTGAATAAAAGACCAAATGAGAAAATTCTTAAACAGTTGAGGCATTCCTTCATTAAGGGGCTGC
CTCCAATTAACATAATGGCCTGCAAGGCATTATTTGATTTGGTATTGTGGCATGGTCCCCAGGAGGTTGACAAGGCTCTAGGACAGGATCACATCCTCCAGTCTTCATTT
GATAAGACATCTTTTAGTTCTATAAACCTATCTGAAGCAGATGAAGATTGGACTACGGGATCACTTGATCTCTTATATGCTGGATTTGACAATGATGAGAAGTACAGCTC
TTCAGCAACCAATGAAATTGAGTCCGTTCAAACCATTGTTACAGAGGGGTTTGCAAAGATTCTTCTGCTGAGTGAAAACTATCCAAGCATACCAGCATCTCTACATCCTC
CACTGTTAAGGAAGCTAGTAAACGTTTATTTTTCAAGCGAGAAAGATCTGGAGAGGTTGAAACAATGTCTTTCTGTATTCTTTGAGCATTATCCATCCCTCACCGTTTCT
CATAAGAGGTGGATATCTGAGTCTTTTGTCCCAGTGATGCGTTCAATGTGGCCAGGCATGAACAACAATGTCGGAGGTTCTGCTGCCGAGGTAGCGAATATGCGTAAACG
TGCAGTCCAGGCATCACGCTTTATGCTGCAAATGATGCAGGCTCCTTTATATGGAAATGATACTGAAAGGAAGGAAGAAGATGGATGCATGGGAAATCAGGAAGTTGCTG
GTAGTATTGGAGAACCTCCTCTTGAGTGCACCGAAGACGGGCTTGCCATTCAAATAGCTGTGGAGGTTGCAAGCTTCCGTGGAAAGAAGACACCTGCACAAAAGTCATAT
GTTTCTGCTTTATGCCGGGTTCTTGTGTTGCTTCATTTTCGTCCTTCAGAACAAGGTGCTATAAGGGTAATGAGAAGGCTACTATGCTATTTGGTTGATACTGCATCGTG
GGATAAGGATCTTGTCAAGGAGTTAAAGCGGATGGGAGAGCATCTCACAGCAATTGACAAACAACCAGATCTTGAAGTTACGCAAGATCAAGCTGATCTAATTTTAGATC
AACTAAAACTGGAGTTCAGTTTTGACGCTGAAATTCCTCCACAAACACCAGTCCCATGTTCAACCAAACCTACGCGTTCCAGGAGACGAGTGAAACCTGAGTCTTCATCT
TCCGATGAAGCTATGTCCCCTACATCTGTTCCCAACATTGTTGGGACAATTGGTACACGCTCAGAGAGGGCAAGCAAAACTGCGGCATTGACTAGAATCATGAACAGTGC
ACTTAAGACCAACAACGTAGTTGATGAGGAAGATGAATGTGAAGATTCAGATGACGATGGTGATGAAGACGATGAAGATTCAGATTCAGATGTGACAGAGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAGTTATAACCGACTTTCTAGTGGTGGAACTGGGAAATATTGACGTGGTTTTAGGGATGCAATGGCTAGATACAACCGGCACAATGAAGGTACATTGGCCATCATT
AACCATGACCTTTTGGGCAAAAGACAAGCAAATCGTGTTAAGAGGAGATCCATCCTTAGTGAAAGCTGAATGTTCCTTGAAGACATTGGAAAAAACGTGGGATATAGAAG
ATCAAGGCTTCCTCTTAGAATTCCAGAATTATGAGATGGAAGTAGAAGATGAATATGAAACGGAAACAAAAGGAGATGAGGAAATACCTATGATCCGATCCCTGCTCAAG
TATTATGCGGAGATATTTGAGACACCGAATAAACTACCTCCAAAAAGAGCTATTGATCACCGAATATTAACCTTATCGGACCGTAAACCTATCAACGTAAGACCTTACAA
ATACGGTCATGTGCATAAAGAAGAAATCGAAAAACTTGTGGTCGAGATGCTTCAAGTAGGGGTTATTCGCCCTAGTCATAGCCCTTACTCCAGCCCGGTGTTGTTAGTGG
AAAAGAAAAATGAGGAATTATTAGATGAGCTGAATGGGGCAACGGTTTTCTCGAAATTAGATCTGAAATCTGGTTATCACCAGATTAGAATGAAAGAGGAAGATGTGGAG
AAAACTGCGTTCCGCACACATGAAGGGCACTACGAGTTTCTAGTATTTAAGCATTTCCATCGGCGCTGTGTTTTGGTGTTCTTTGATGATGTACTGGTTTATAGTGCGGA
TATAGATGAACACGCAAAGCATTTGGGCATGGAAGTTGAGAATGATGCTGAACTTAAGAAAATTATAGAAAATCTGAAGAAGAATGCGAACGAAGGAAGCAAGTTTCAAG
GAAGTGTCTTAACAAAGTGGTATCAGAGCCGCAAACATCCTGGGGAGGACACGAGCAATGACACAAAAACAGTGGGAGGAGAGATTGGAAACGATATGCATAAGAGTATA
GAAAATCTGACGGAAGAAGTGAAGGAAAGCAGTCGAACTAAGCAGAGAGAAGAATCTTGTGCCTCAGAGGGTTCTGGGCTTAAAAAGAAAGGAAAGGAGGGAGAGACGGA
CATGAACTCAGTTTTCACTGGAGCATCATCAGATCGAAGTAAATACAAGAAGTTAGAGATGCCTATCTTCACGGGTGATAATCCGGAATCATGGGTCTATAGGGCAGAGC
ATTATTTTGATATCAATGAGTTCGCTGACGAGGAGAAAGTGAAAAAAGGGATGTTCGAACACTTCAAGACACCCGGTGGAGAAAGTTTGGGGGCTCGTCTCATTCGGATT
AAACAAGATGGATCTTACCCCGATTATTTGAAGAAATTCCTGAATTATTCGGCGCCGTTACAGGAGATGGCGGAGAGCATCCTCATCGATGCATTTGTTACAGGTTTGGA
GACGAATTTACAAGCCGAAGTATTAAGTCGTCATCCCGTTACTTTGGAGGACTGTATGCAGGAAGCCCAAATGGTGAGTGATCGTGACTTGGCAATTAAGTTAGCCCTAA
ATGAATGTGGGAGTAATGGGCCAAGGGCATCTGAAGCCCAAACCCAAATGGAAAAAAAAAAGATGACGATAAATGCTGAAAAAAAAGAAGGAAAAGGCAGCGAATACGTA
ATGAAGCAAGTATCGATTCCGATTAAGGGAAATTATTCGAGAGAAGAACCGTCGGTGAAATGGCTATCAAATAGCGAATTTCGTGCAAGGCTAGACAAGGGGCTATGTTT
TCGATGTAACGAAAAATATTCACCAGGGCACCGGTGCAAGGGCAAAACGAATCGCGAGCTGATGTTTTTTATTGCGAATGAAGAAGAAGAATTGGAAGAAGTCAACGGGA
AGGAGGAAGGCGAATTGGAAACTGTTGAACTCGAAACCCTAGAAATCGAGGGGAAGACAGAAATAGCGCTTCGCACCATCTTGGGGTTTACATCAAAAGGAGTGATTAAT
GAGCTAAACTTACCCGTGGAGGAAAAAACTAAATTCGGAGTGACTATTGGCGACGAAACTGCACTAGAGGGAATAGGGGTTTGCAAGAGAGTCGAGGTTAAGCTGCCGGA
GCTAAAAATTGTGGCAGACTTCTTAGCAATCAAATTGGGTAGAATTGATGTGGTTTTGGGAATGCAGTGGCTCTGCACGATTGGATTCATTGGGGTTCATTGGCCGACGA
TGACAATGACGTTCATGGCGGGGACCACTCAAAAGATTGAAATTGAAGAAAATAATGAGGAAGACAGTGAAAGGGAGGAAGAGGGGGATGAAGCAAATTTACCTATGATT
CGCAAGCTGTTGCATAGGTTCAAGGGGCTGGTAGTAGATCATCGAATTCTGACTATCGACGGGCAAAAACCCATCAATGTACGCCCTTACAAGTATGGGTACATACAGAA
AGAAGAGATAGAGAAACTGGTGACAGAAATGCTTCAAGCAGGAATTATCCGGCCAAGCCGAAGACTATATTCCAGTCCGGTATTGCTAATAAGGAAACACGACGGTGGGC
GGAGATTCTGCGTAGACTACCGTAAATTGAATCAGGTGACTGTCTCCGATAAATTCCCCATTCTTGTAATCGAAGAACTGCTAGATGAGTTACATGGTGCTGAAGTTTTT
TCAAAGCTGGATTTGCGTTCGGGGTATCACCAAATTCAGATGAAGGAGGAGGATGTTGAAAAAACCGCGTTTCGCACTCACGAAGGGCATTATGAATTTCTAGTTATGCC
TTTCGGGTTAACCAATGCGCCGGCCACTTTTCAGTCCCTAATGAATCAAATTTTCAAGCCCTTCCTAAGACGGTTTGTGTTGGTCTTCTTCGACGATGTCCTGATTTATA
GCACTGATATTACCGAACACGAGCAACACTTGGGAATGGTGTTCAATGTAATGAAAGACAACCAGTTATTTGCCAATGAAAAGAAATGCGTAATAGGGCATTCACGAGTC
AATTATTTGGGGCACTGGATCTCTAAAAAAGGTGTAGAAGCGGACGGGGAAAAGGTGAAAGCAATGGTAAACTGGCCCCAACCCAAAGATGTTTCTGAATTAAGGGGATT
CCTCGGCCTCACGGGATATTATCGAAGATTTGTCAAAGGATATGGGAACATTGCCGCTCCCCTGACCAAATTGTTGCAACAAAACGGGTTCAATTGGAACGAAGATGCCA
TAGAAGCTTTCGAGTCACTGAAACAAGCAATGATCTCGGTGCTGGTCTTAGCACTCCCCGACTTTTCCTTACCATTTAACATTATAACTGACGCATCAGGTGTAGGGCTG
GGAGCAGTTTTATCACAGAATGGCCGACCAATAGCGTATTTTAGCCAGAAATTATCTCCTAGAGCACAAGCAAAGTCTATTTACGAAAGGGAACTGATGGCAGTGGTAAT
GGCGAGATGGTTGACTAAATTATTGGGTTATGATTTTGAAATTTTGTACCAACCCGGGCTGTTGAATAAAGCAGCAGACGCACTATCTCGCATGAACCCTATTGTGGAAT
TAAATACTTTAATCTCTCTGAGGCTGCTAGATGTTGAGATGGTGATGGAAGAAGTTAAAAGGGATGAGGAATTACAAGGAATAATTGAAGTCCTGAAGGAAAATCCAGAA
GGGAAAGCAAACTATCAATGGGTATCAGACAAATTGCTCTACAAGGGAAAGTTAGTCTTGTCGAAAACCTCTTCTCTTATTCCAACCTTACTCCATACTTTCCACGATTC
AGTGTTAGGGGGCCATTCGGGGTTTCTCAGAACATACAAGCATATGAATGGAGAGATACATTGGGTGGGCATGAAGAATGACGTAAAGAAATATGTCGAACAGTGTGAAA
CATGCCAAAGAAATAAAACAAAAGCACTCTCTCCAGCTGGATTATTACAGCCCTTGCCTTTACCCAACTTAATCCTTGAAGACTGGACGATGGACTTCATAGAAGGGTTG
CCTAAGGCGGGAGGTTTTGATAGTATCATGGTGGTGGTTGACCGACTAAGTAAAATGGCCCACTTCATTACCTTGAAGCACCCATTTACGGCAAAGCAAGTTGCCGAAAA
ATTTAGTAAAGAAATTATTAGCAAACATGGAATTCCAAACTCAATAGTAACGGACCGAGACAAAGTCTTCTTGAGTCAATTTTGGAAAGAATTATTTACAGCCATGGGGG
CTTCGCTGAAAAGAAGTACTGCATTTCACCCTCAAACGGATGGGCAGACTGAGCAGGTGAACCGATGCCTCGAAACGTATCTATGTTGTTTCTGTAACGAACAACCTACG
AAGTGGCATAAATGCATCCCTTGGGCTGAATTGTGGTACAACACCACTTTTCACTCGTTGGCAAAAACAACCCCGTTTCAAATTGTGTATGGGCGCCCCCCACCTTCGCT
GATTCTGTATGGGGATCGAAAGACCAATAATAACAGTGTAGAACAATTATTAAAAGAAAGGGATCTGGTTATTAGTGCATTAAAGGAGAATTTGCTGACAGCCCAGAATA
GAATGAAGAAACAAGCAGATCTAAGGCGTCGGGAACTGAAATTCAAAGTGGGAGATGAGGTTTATTTGAAATTACGACCTTACAGACAACGATCCTTAGCTAAAAAGCGC
TGCGAGAAGTTAGCTCCCAAGTTTTATGGGCCCTATCGTGTAATGGAGGAAATTGGGGAAGTAGCATATCGTTTGGACCTTCCACTTGAGGCAATGATCCATAACGTGTT
TCATATTTCACAGCTAAAACTTAAGTTGGGTAAAACACAACATATACAACACCTACCACCTGCATTGACTGAAGATTTTGAGCTGCAAGTGGAACCGGAAGCAGTGCTGG
GGGTTCGCTGGAACAGCGAGATGGGAGCTAATGATTGGTTGGAAGCGACTTGGGAATCTGTTTATGCTATGAACCAACAGTATCCTTCGTTTCACCTTGAAGACAAGAAG
GCAGCTCAGCCAATCGAATTAAATGAAATGAATAATTGGGTGTGGGGCCCTCCTGGGGATAGAGGGAAGTGTCTTAACAGAATGGTAGTATCTCAAACTTCTACACTTAT
ACCCAAGATCCTACATACTTTCCACGATTCTATCCTCAGGGGACATTCTGGATTTTTAAGAACTTACAAGAGGATGAGTGGAGAACTATATTGGAGGGGTATGAAGGCCG
AAGTAAAGAAATATGTGGAACAATGCGAGATCTGCCAGAGAAACAAATATGAAGCGACCAAGCCAGCTGGGGTACTTCAACCAATTCCAATTCCTGACAGAATCTTAGAA
GATTGGACCATGGACTTTATTGAGGGGTTGCCACTAGCTGGAGGGGTTAATGTGATCATGGTGGTAGTTGATCGGCTGAGTAAGTATTCTTACTTTATATCATTGAGACA
TCCGTTTTCAGCGAAGCAAGTTGCTTCCAAATTCATCGACAGAATAGTAAGCAAGCACGGCATTCCTAAGTCGATTATCACAGATCGGGATAAAATCTTTCTTAGTAATT
TTTGGAAAGAACTGTTCGCAACTATGGGGACTATTCTAAAGAGGAGTACCGCGTTTCACCGACAAACAGATGGGCAGACAGAAAGGACGGTATACGGCAGACCCCCTCCT
CCCCCGTTATCCTATGGTTACAAGAAAACCCCTAATAATAAGGTTGACACACTGTTGAAGGAACGTGACTTAGCCATCAATGCCTTAAAGGAAAACTTACCTTACCGGCA
ACGCTCCTTGGCTCGGAAGAAGTGTGAAAAGTTAGCACCTAAATATTATGGACCCTACAAAATTATTGAAGAAATAGGAGCAGTTGCGTACAGATTGATGCTACGTCCAG
AAGCCAAGTTTGAGTTACAATTATGGCCGGAAATAGTGTTGGGAGTCCGTTGGAATCGGGAATTGGGAGGAAATGAATGGTTAATCAAATGGAAAGGGCTGCCGGACAGT
GAAACAACTTGGGAATCTGTTTTCCAAATGAACCAACAATTCCCTGGCTTTCACCTTGAGAACAAGGTGAACCTAGAACCGAGGGAGGCTGCTAGGTCTAGGAAGGACTA
CTATCTTTTCCTGGGGAGCATCAAACAAAGAACAACAATTTTACAGAGAGGGCTTGCTGATCGTTCTCAAGCTGTTTCAAAGGAGTGCTTAAAATTGATGACAGATGAGT
GGCTACACAACTGCTGCCATGGAAATCCTGTAGAATTGCTAGAGTATCTTGATGTCGAAACCTATGAACGAGTTGGTGAGTCTGTTATGGGGGCTTTATTAGGAGCTAGT
TTGTTGAAACTGCATGACGATGCAAGTATCCAACATTATATACAAACTTCCAGCAGCGCGACAGAAGGAGACTCACCACATTGCAGTCCAACTATACAACTGATGGAACC
AGAAGTTTCTCTTTACTGGAGGATGATTTGTAAGCATATCCTAACAGAAGCACATGCAAAAGGTTCTGATGCTGCAGCTTCTATGGGTGCTGAAGCCGCAGTATATGCAG
CCGAAGCTTCTGAAAAAAATGACCTTTTAGAGAAAATTCTTCCTGCCACTATTTCTGATTATGTAGGCTTAGTCCAAGCTCATATTAATGCTGGGTCCAGTTATCGATTT
GCATCAAGACAGCTACTTTTGCTTGGAACCATGCTTGATTATTCTGATAATGCAAATAGGAAGATTGCTGGTGCATTTCTGCAGGAAGTGTTGCATATATCTCCAGATCA
TGAATTGGATGATGATGGGAACTTGGTTGTTCATGGAGATGGAATCAATCTTGGAGGAGATAGAGATTGGGCAATTTCTGTATCTGGGTTGGTTAGGAAAGTCCATGCTG
CAGCTGGTGAATTTGAAGAAATTGTTCTTGAGGTGATTGAAGAACTTGCTCGACCATGCAGGGAAAGAACTGCAAATTGTGTGCAGTGGATGCACTGTCTTGCTGTGACA
AGCCTTCTCCTGGAAAACGCTAAGTCATTGAATTTTATTAATGGAAAAATCTCTGGACCTGCTGAACTGCTGGATTCAATATTGCTTCCAGGGGCAAAACACGTTCATTT
GGATGTTCAGAGAATTAGTATCCGCTGTCTTGGTCTGTATGGATTGCTGAATAAAAGACCAAATGAGAAAATTCTTAAACAGTTGAGGCATTCCTTCATTAAGGGGCTGC
CTCCAATTAACATAATGGCCTGCAAGGCATTATTTGATTTGGTATTGTGGCATGGTCCCCAGGAGGTTGACAAGGCTCTAGGACAGGATCACATCCTCCAGTCTTCATTT
GATAAGACATCTTTTAGTTCTATAAACCTATCTGAAGCAGATGAAGATTGGACTACGGGATCACTTGATCTCTTATATGCTGGATTTGACAATGATGAGAAGTACAGCTC
TTCAGCAACCAATGAAATTGAGTCCGTTCAAACCATTGTTACAGAGGGGTTTGCAAAGATTCTTCTGCTGAGTGAAAACTATCCAAGCATACCAGCATCTCTACATCCTC
CACTGTTAAGGAAGCTAGTAAACGTTTATTTTTCAAGCGAGAAAGATCTGGAGAGGTTGAAACAATGTCTTTCTGTATTCTTTGAGCATTATCCATCCCTCACCGTTTCT
CATAAGAGGTGGATATCTGAGTCTTTTGTCCCAGTGATGCGTTCAATGTGGCCAGGCATGAACAACAATGTCGGAGGTTCTGCTGCCGAGGTAGCGAATATGCGTAAACG
TGCAGTCCAGGCATCACGCTTTATGCTGCAAATGATGCAGGCTCCTTTATATGGAAATGATACTGAAAGGAAGGAAGAAGATGGATGCATGGGAAATCAGGAAGTTGCTG
GTAGTATTGGAGAACCTCCTCTTGAGTGCACCGAAGACGGGCTTGCCATTCAAATAGCTGTGGAGGTTGCAAGCTTCCGTGGAAAGAAGACACCTGCACAAAAGTCATAT
GTTTCTGCTTTATGCCGGGTTCTTGTGTTGCTTCATTTTCGTCCTTCAGAACAAGGTGCTATAAGGGTAATGAGAAGGCTACTATGCTATTTGGTTGATACTGCATCGTG
GGATAAGGATCTTGTCAAGGAGTTAAAGCGGATGGGAGAGCATCTCACAGCAATTGACAAACAACCAGATCTTGAAGTTACGCAAGATCAAGCTGATCTAATTTTAGATC
AACTAAAACTGGAGTTCAGTTTTGACGCTGAAATTCCTCCACAAACACCAGTCCCATGTTCAACCAAACCTACGCGTTCCAGGAGACGAGTGAAACCTGAGTCTTCATCT
TCCGATGAAGCTATGTCCCCTACATCTGTTCCCAACATTGTTGGGACAATTGGTACACGCTCAGAGAGGGCAAGCAAAACTGCGGCATTGACTAGAATCATGAACAGTGC
ACTTAAGACCAACAACGTAGTTGATGAGGAAGATGAATGTGAAGATTCAGATGACGATGGTGATGAAGACGATGAAGATTCAGATTCAGATGTGACAGAGAATTAAACCT
CGTTACCATGATAGGTATCTTGAATGTTGTACTAAGCTGTATGATTGGATTATGTGTAGGAGCATCAGTTGTACTGTATATATATGTGTTTCAACTGTAGCAAGTTATAA
AAGTTTGCTTTGTTTGTAAAATGAATGTATAATTTGATTAGTATCTATCTTAGTTGCTTGCTTATTCCTAAAGAATAAGCTCTCTTTTACATT
Protein sequenceShow/hide protein sequence
MTVITDFLVVELGNIDVVLGMQWLDTTGTMKVHWPSLTMTFWAKDKQIVLRGDPSLVKAECSLKTLEKTWDIEDQGFLLEFQNYEMEVEDEYETETKGDEEIPMIRSLLK
YYAEIFETPNKLPPKRAIDHRILTLSDRKPINVRPYKYGHVHKEEIEKLVVEMLQVGVIRPSHSPYSSPVLLVEKKNEELLDELNGATVFSKLDLKSGYHQIRMKEEDVE
KTAFRTHEGHYEFLVFKHFHRRCVLVFFDDVLVYSADIDEHAKHLGMEVENDAELKKIIENLKKNANEGSKFQGSVLTKWYQSRKHPGEDTSNDTKTVGGEIGNDMHKSI
ENLTEEVKESSRTKQREESCASEGSGLKKKGKEGETDMNSVFTGASSDRSKYKKLEMPIFTGDNPESWVYRAEHYFDINEFADEEKVKKGMFEHFKTPGGESLGARLIRI
KQDGSYPDYLKKFLNYSAPLQEMAESILIDAFVTGLETNLQAEVLSRHPVTLEDCMQEAQMVSDRDLAIKLALNECGSNGPRASEAQTQMEKKKMTINAEKKEGKGSEYV
MKQVSIPIKGNYSREEPSVKWLSNSEFRARLDKGLCFRCNEKYSPGHRCKGKTNRELMFFIANEEEELEEVNGKEEGELETVELETLEIEGKTEIALRTILGFTSKGVIN
ELNLPVEEKTKFGVTIGDETALEGIGVCKRVEVKLPELKIVADFLAIKLGRIDVVLGMQWLCTIGFIGVHWPTMTMTFMAGTTQKIEIEENNEEDSEREEEGDEANLPMI
RKLLHRFKGLVVDHRILTIDGQKPINVRPYKYGYIQKEEIEKLVTEMLQAGIIRPSRRLYSSPVLLIRKHDGGRRFCVDYRKLNQVTVSDKFPILVIEELLDELHGAEVF
SKLDLRSGYHQIQMKEEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQIFKPFLRRFVLVFFDDVLIYSTDITEHEQHLGMVFNVMKDNQLFANEKKCVIGHSRV
NYLGHWISKKGVEADGEKVKAMVNWPQPKDVSELRGFLGLTGYYRRFVKGYGNIAAPLTKLLQQNGFNWNEDAIEAFESLKQAMISVLVLALPDFSLPFNIITDASGVGL
GAVLSQNGRPIAYFSQKLSPRAQAKSIYERELMAVVMARWLTKLLGYDFEILYQPGLLNKAADALSRMNPIVELNTLISLRLLDVEMVMEEVKRDEELQGIIEVLKENPE
GKANYQWVSDKLLYKGKLVLSKTSSLIPTLLHTFHDSVLGGHSGFLRTYKHMNGEIHWVGMKNDVKKYVEQCETCQRNKTKALSPAGLLQPLPLPNLILEDWTMDFIEGL
PKAGGFDSIMVVVDRLSKMAHFITLKHPFTAKQVAEKFSKEIISKHGIPNSIVTDRDKVFLSQFWKELFTAMGASLKRSTAFHPQTDGQTEQVNRCLETYLCCFCNEQPT
KWHKCIPWAELWYNTTFHSLAKTTPFQIVYGRPPPSLILYGDRKTNNNSVEQLLKERDLVISALKENLLTAQNRMKKQADLRRRELKFKVGDEVYLKLRPYRQRSLAKKR
CEKLAPKFYGPYRVMEEIGEVAYRLDLPLEAMIHNVFHISQLKLKLGKTQHIQHLPPALTEDFELQVEPEAVLGVRWNSEMGANDWLEATWESVYAMNQQYPSFHLEDKK
AAQPIELNEMNNWVWGPPGDRGKCLNRMVVSQTSTLIPKILHTFHDSILRGHSGFLRTYKRMSGELYWRGMKAEVKKYVEQCEICQRNKYEATKPAGVLQPIPIPDRILE
DWTMDFIEGLPLAGGVNVIMVVVDRLSKYSYFISLRHPFSAKQVASKFIDRIVSKHGIPKSIITDRDKIFLSNFWKELFATMGTILKRSTAFHRQTDGQTERTVYGRPPP
PPLSYGYKKTPNNKVDTLLKERDLAINALKENLPYRQRSLARKKCEKLAPKYYGPYKIIEEIGAVAYRLMLRPEAKFELQLWPEIVLGVRWNRELGGNEWLIKWKGLPDS
ETTWESVFQMNQQFPGFHLENKVNLEPREAARSRKDYYLFLGSIKQRTTILQRGLADRSQAVSKECLKLMTDEWLHNCCHGNPVELLEYLDVETYERVGESVMGALLGAS
LLKLHDDASIQHYIQTSSSATEGDSPHCSPTIQLMEPEVSLYWRMICKHILTEAHAKGSDAAASMGAEAAVYAAEASEKNDLLEKILPATISDYVGLVQAHINAGSSYRF
ASRQLLLLGTMLDYSDNANRKIAGAFLQEVLHISPDHELDDDGNLVVHGDGINLGGDRDWAISVSGLVRKVHAAAGEFEEIVLEVIEELARPCRERTANCVQWMHCLAVT
SLLLENAKSLNFINGKISGPAELLDSILLPGAKHVHLDVQRISIRCLGLYGLLNKRPNEKILKQLRHSFIKGLPPINIMACKALFDLVLWHGPQEVDKALGQDHILQSSF
DKTSFSSINLSEADEDWTTGSLDLLYAGFDNDEKYSSSATNEIESVQTIVTEGFAKILLLSENYPSIPASLHPPLLRKLVNVYFSSEKDLERLKQCLSVFFEHYPSLTVS
HKRWISESFVPVMRSMWPGMNNNVGGSAAEVANMRKRAVQASRFMLQMMQAPLYGNDTERKEEDGCMGNQEVAGSIGEPPLECTEDGLAIQIAVEVASFRGKKTPAQKSY
VSALCRVLVLLHFRPSEQGAIRVMRRLLCYLVDTASWDKDLVKELKRMGEHLTAIDKQPDLEVTQDQADLILDQLKLEFSFDAEIPPQTPVPCSTKPTRSRRRVKPESSS
SDEAMSPTSVPNIVGTIGTRSERASKTAALTRIMNSALKTNNVVDEEDECEDSDDDGDEDDEDSDSDVTEN