; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G191697 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G191697
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCla97Chr10:14331564..14341662
RNA-Seq ExpressionCla97C10G191697
SyntenyCla97C10G191697
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AFK13856.1 Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris]0.0e+0044.96Show/hide
Query:  LSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPES---KRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTR
        + E  L  KF+ GL  E+Q E+R+ +   L   M   +L   +EE  ++   R   P S         P    S   +  S     A           T+
Subjt:  LSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPES---KRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTR

Query:  NSAVSSNTVNFCDTNGK-----TFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLD
        + A++SN       N K     +   G  RRL + +LQ +R KGLC++CDEK+  GH+C++KEL++    + +    E+E++  A+   E  P+  E + 
Subjt:  NSAVSSNTVNFCDTNGK-----TFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLD

Query:  VEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLP-NLTIVNDF
         E+        +LNS+ GL +PKTMKL G + N +VVV+ID  A+HNF+    +  L +  +    +G+ LG GQ+VR TGIC+ V L L   L +V DF
Subjt:  VEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLP-NLTIVNDF

Query:  FPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGT---QDVRPEIRE
         PL LG++DVILGVQWL TLG V  +W T +M F++G     L GD ++ +S++SLK+M++ +R EG G+ LE N ++      ++ +   Q++ P ++E
Subjt:  FPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGT---QDVRPEIRE

Query:  LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDN
        L+++F+ +F     LPP R H+HA+ LK    PV V  YRYPQFQKDEIERL+ +ML A IIQPSTS FSSPV+LVKKKDGSWRFC+DYRALN  TVPD 
Subjt:  LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDN

Query:  FPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHM
        +PIPV+DELLDEL   T+F K+DL++GYHQI V+ +D HKTAFRTHEGHYEFLVMPFGL NAP+T                             S +EH+
Subjt:  FPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHM

Query:  EHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNE
         HL  VL +L+ + L  N  KC F    + YLGHVIS  GVA D +K++A+ +W VP                RKFVA Y  I  PLT+ L+KD F W+ 
Subjt:  EHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNE

Query:  ATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLK
            AF  LK AMV  PVLA+P+F+ +FVVETDASG G+GAVLMQ+ RP+AY+S+ L    QLK VYE EL+AI  AVQKW+ YLLGR F  RTDQ+SL+
Subjt:  ATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLK

Query:  FLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPT-LVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMY
        ++ +QR I  ++QKW++KL+GYDFEI YK GL N  ADALSR     VEL A   V GV  A    ++  D  L ++RK +Q+ G+ P  +++    +++
Subjt:  FLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPT-LVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMY

Query:  KGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSE
        KGR V+P +S  IP LL E+H+ P+GGH   +K Y R+  E+ W GM+  V  +V  C +CQQ K     P GLL PLP+  ++WEDISMDFI+GLP S+
Subjt:  KGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSE

Query:  GYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCF
        G ++ILV+VDRLSKYAHF+ LRHP+TA+MVA +F+KE+V LHG P SIVSDRD                   KRS+AYHPQTDGQTE++NR LE YLRCF
Subjt:  GYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCF

Query:  TMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLREL--------RRMKRLADAKRREEEYAVNDWV
            P  WAKWL W E++YNTS HTS KM+ + ++YGR  P ++   +G  +V++++ +L++RD ++ +L        +RMK  AD  R E E+ V D V
Subjt:  TMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLREL--------RRMKRLADAKRREEEYAVNDWV

Query:  YIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITS
        +++L+PYRQ +L      KLA +F GPF ++ RIG  AYKL LPP+   HP+FHVS L+KVVG+      +PP + V     ++P  +L VR++     +
Subjt:  YIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITS

Query:  LMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG----SDRPPIFKKYMRRRKREGR
            E L++W GL   +ATWE+   I  +FP FHLEDKV  WGAG      + P    Y RR  ++G+
Subjt:  LMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG----SDRPPIFKKYMRRRKREGR

KFK28310.1 hypothetical protein AALP_AA8G499800 [Arabis alpina]0.0e+0046.13Show/hide
Query:  DLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTRNS
        ++ E  LE  F++GLK  I+  ++ F    L+  M +A+L+    E+       +  P SK +R  PT+       +++T   +   + LN    P    
Subjt:  DLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTRNS

Query:  AVSSNTVNFCDTNGKTFSAGS-----FRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVE
          S NT +F  T  +  +  +     +RRL   ++  R+  GLC+RCDEK+   H+C KKE+N+ +  E DG  +  E D D          T  T    
Subjt:  AVSSNTVNFCDTNGKTFSAGS-----FRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVE

Query:  IEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPL
         E AE   L+LNS+ G+ SP TMKL GT+   +VVVLIDS ASHNF+ E LV  L L  +   SYG++ G G +VR  G+C+G+VL L  L I +DF PL
Subjt:  IEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPL

Query:  SLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKG-TQDVRPEIRELLQKF
         LGSADVILG++WL +LG+++ +W    M+F +G     LQGD     S ISLKS+M+ V+ +GVG+L+E N +Q+ +  ++ G T +V   +  ++ +F
Subjt:  SLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKG-TQDVRPEIRELLQKF

Query:  QSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPV
          +F     LPP R   H + L+     V+V  +RYPQ QK EIE+ V  ML A IIQ STS+FSSPVLLVKKKDGSWRFCIDYRALN +T+PD+FPIP+
Subjt:  QSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPV

Query:  VDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLST
        +D+LLDEL   T+F K+DLKSGYHQI V+ ++V KTAFRTH+GHYEFLVMPFGL NAP+T                            SSLQEH EHL  
Subjt:  VDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLST

Query:  VLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSA
        VL +L   +L AN  KC F +SSIEYLGHVIS EGV+ADP K++AM  WP+P                R+FV GYGSI  PLT LL+KDKF W+E    A
Subjt:  VLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSA

Query:  FDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQ
        F+ LK AM  VPVLAL DF + FVVE+DASG+G+GAVL+Q  +P+AYFSQAL    +LK VYE EL+AIV A+QKWR YLLGR+F  RTDQKSLKFLLEQ
Subjt:  FDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQ

Query:  RIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVL
        R +  +YQ+W+ K++G++F+I YK GLEN AADALSR+  L +L A S    +       +VD++   ++I++ +        GYSV    ++Y G++VL
Subjt:  RIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVL

Query:  PPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSIL
        P  S  I +LL EFHN+ +GGH  V+K  + +   F W GM A ++ FV +C VCQ+ KY ++AP+GLL PLP+   +WEDIS+DF++GLP+SEG+++IL
Subjt:  PPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSIL

Query:  VVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPV
        VVVDRL+KYAHFI L+HP+ A  +A++FI+EIV LHG P ++VSDRD                     STAYHPQTDGQTEV NRGLE  LRCFT   P 
Subjt:  VVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPV

Query:  KWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRP
        KWA +L W E+ YN+SYH++++MT +  +YGR  P+LL ++ G+     ++  L+ERD M+  L++        MK  AD  RRE E+ V D V++KL+P
Subjt:  KWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRP

Query:  YRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVR---RVGSTITSLMD
        YRQ +L    + KLA +F GP+ ++ R+G VAY+L LP +   H  FHVSQL+  VGS      LPP L   +    +P A +GVR   R G        
Subjt:  YRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVR---RVGSTITSLMD

Query:  FEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG-----SDRPPIFKKYMRRRK--REGR
         EVL++W GL   D+TWE    I  QFP F LEDK +F  AG     S++ P+  +Y RR+K  ++GR
Subjt:  FEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG-----SDRPPIFKKYMRRRK--REGR

PKU67907.1 putative mitochondrial protein [Dendrobium catenatum]0.0e+0049.48Show/hide
Query:  DTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVEIEAAEKGVLTLNSLA
        + +G T SAGSF++L + ++Q +R KGLC+RC+EK+  GHRCK + L     + +D E  EE  D D   EE++        D ++E AE   ++LNS+ 
Subjt:  DTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVEIEAAEKGVLTLNSLA

Query:  GLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPLSLGSADVILGVQWLM
        G     TMK++G +  ++VVVLIDS A+HNFI   +   L +  +   SYG+M+GTG+   +TGICKGV ++L  + +V DF PL LGS DVILG++WL 
Subjt:  GLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPLSLGSADVILGVQWLM

Query:  TLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRELLQKFQSIFRPIDELPPARRH
        TLG+ + +W T  M+  V    +KL+G                                                 ++  LQ+F+ +F+P   LPP R  
Subjt:  TLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRELLQKFQSIFRPIDELPPARRH

Query:  DHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPVVDELLDELTCVTIFLK
        +H + LK    P++V  YRYPQ QKDEIE+LV +ML   IIQPS S FSSPVLLVKKKDGSWRFC+DYRALN  TVPD FPIPV+DEL+DEL    +F K
Subjt:  DHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPVVDELLDELTCVTIFLK

Query:  IDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLSTVLHVLSDNKLVANLNK
        IDLKSGYHQIR++ +DV KTAFRTHEGHYEFLVMPFGL NAP+T                             + +EH+EHL  VL VL +++L AN  K
Subjt:  IDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLSTVLHVLSDNKLVANLNK

Query:  CTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSAFDTLKQAMVEVPVLAL
        C FA + +EYLGHVIS EGVAAD  KIEAM  WP P T              R+FV GY +I  PLT+LL+KD F W EA   AF+ LK+AM  VPVLAL
Subjt:  CTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSAFDTLKQAMVEVPVLAL

Query:  PDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQRIIVGDYQKWIAKLIG
        PDF Q FV+ETDASG G+GAVLMQN R +AYFSQ L    +LK VYE EL+AIVLA+QKWRPYLLGR F  RTDQ+SLK+LLEQR++  ++Q+W++KL+G
Subjt:  PDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQRIIVGDYQKWIAKLIG

Query:  YDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVLPPTSPTIPLLLQEFHN
        YDFEI Y+ GLEN AADALSR     +  A S  + ++      +  Q+E LR I++ +       PGYSV GE + Y+GR V+P +S  IP LLQEFH 
Subjt:  YDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVLPPTSPTIPLLLQEFHN

Query:  NPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLR
        + +GGH+ V K Y RI  E  W GM+  +   V  C  CQ+ K+M+MAP GLL PL + + +WE+I+MDFIDGLP+SEGY  I VVVDRLSKYAHFIPLR
Subjt:  NPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLR

Query:  HPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTS
        HPYTAV VA+ F++EIV LHG+P+SIVSDRD                   KRSTAYHPQTDGQTEV+NR LE YLRCF  ++P +WAKWLAW EY YNTS
Subjt:  HPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTS

Query:  YHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRPYRQSTLFIHKHPKLAL
        YH++ KMT + ++YGR  P L+ Y   +  V AVD  L+ERD++L EL+R        MK+ AD+KRR+ ++ V D VY+KLRPYRQ T+   ++ KLA 
Subjt:  YHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRPYRQSTLFIHKHPKLAL

Query:  KFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITSLMDFEVLVQWDGLAVEDATWEN
        ++ GPF + ++IG VAY+L LPP    HP+FHVSQLRKVVG       LP TL+     +++P A+ GVR     I    + EV ++W GL   +ATWE 
Subjt:  KFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITSLMDFEVLVQWDGLAVEDATWEN

Query:  RQQIISQFPHFHLEDKVVFWGAGSDRP---------PIFKKYMRRRKREGR
         ++I  QFP FHLEDKVV W  G+  P         P       RR+ +GR
Subjt:  RQQIISQFPHFHLEDKVVFWGAGSDRP---------PIFKKYMRRRKREGR

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0043.89Show/hide
Query:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLK------WGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNP
        +SD+ +  +E  F+SGL   I+ E+ +    GL   M  AQL++D E  R          G++ +P S  ++    Q         + +    RTITL  
Subjt:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLK------WGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNP

Query:  TLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCK---KKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATL
                            +G+T   G+ +RL D++ Q RREKGLC++C+EKY+A H+CK   ++EL +F+  + +     EE+++  + E ET  A L
Subjt:  TLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCK---KKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATL

Query:  ETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIV
         T++V  +A     L++NS+ GL+ P TMK++GT+  K+VV+LID  A+HNF+ E LVT+L L     + YG++LG+G +++  GIC+ + + + + T+ 
Subjt:  ETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIV

Query:  NDFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRE
         DF PL LG  DVILG+QWL +LG   CDW    + F   +  I ++GD S+ K+++SLK+++K       G L+E  ++  +    +   ++ + EI E
Subjt:  NDFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRE

Query:  ----LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHIT
            +L +F+ IF   ++LPP R  +H + LK    PVNV  YRY   QK+E+ERLV +ML + II+PS S +SSPVLLVKKKDGSWRFC+DYRALN++T
Subjt:  ----LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHIT

Query:  VPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSL
        VPD FPIPVV+EL DEL   ++F KIDLK+GYHQIR+   D+ KTAFRTHEGHYEFLVMPFGL NAP+T                             +L
Subjt:  VPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSL

Query:  QEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKF
        ++H++H+ TV  VL  ++L AN  KC+F  + +EYLGH+IS +GV  DP+KI+A+T WP PT+              RKFV  YG++  PLTQLL+K  F
Subjt:  QEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKF

Query:  GWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQ
         WN   + AF+ LK+AM+ +PVLALP F + F +ETDASG GVGAVL+QN RP+A++S  L +  + + VYE EL+A+VLAVQ+WRPYLLG RF  RTDQ
Subjt:  GWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQ

Query:  KSLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREI------RKTIQDGGQVPPGY
        KSLKFLLEQR++   YQ+W+AKL+GY F++EYK G+EN AADALSR+   V++   +  + ++  +   +V++D +L +I        T+QD       +
Subjt:  KSLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREI------RKTIQDGGQVPPGY

Query:  SVKGELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMD
         +   ++ YK R+V+  TS  IP +L  +H++ VGGH+  ++ Y+RI  E  W GMKA V+ +  +C +CQQ K + ++PAGLLLPL +  +IW DISMD
Subjt:  SVKGELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMD

Query:  FIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRDK-------------------RSTAYHPQTDGQTEVMNR
        F++GLP++ G+  I VVVDRLSKY HFIPL+HPY+A  VA +F+KE+V LHG P SIVSDRD+                   RS+AYHPQ+DGQTEV+NR
Subjt:  FIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRDK-------------------RSTAYHPQTDGQTEVMNR

Query:  GLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRRE
        G+E YLRCF    P +W KW+ W EY YNT++  +L MT + +VYGR  P LLSY         +D  L+ERDEM+  LR        +MK+ AD +RR+
Subjt:  GLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRRE

Query:  EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSI---QPAAI
         EY V D V++K+RPYRQ +L   ++ KL+ K+ GP+ I++RIGPVAYKL+LP +   HP+FHVSQL+K+VG       + PT+Q  D   +    P   
Subjt:  EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSI---QPAAI

Query:  LGVRRVGSTITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAGSDRPPIFKKYMRRRKRE
        L  RR       + ++EV+++WDGL++ + TWE    +  ++P FHLEDKV   G  + RPPI  +Y R+ KR+
Subjt:  LGVRRVGSTITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAGSDRPPIFKKYMRRRKRE

XP_010680400.1 PREDICTED: uncharacterized protein LOC104895554 isoform X1 [Beta vulgaris subsp. vulgaris]0.0e+0045.38Show/hide
Query:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIK-DMEEARQLKWGR-----NPSPESKRSRSNPTQ--AKSTTMIASSTSSMTARTITL
        + ++ E+ +  +F+ GLKE I+ E+ M   + +   M +A   +  +     L   R      P P   RS+ +P     K T++     +  T ++   
Subjt:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIK-DMEEARQLKWGR-----NPSPESKRSRSNPTQ--AKSTTMIASSTSSMTARTITL

Query:  NPTLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLE
         P +  T+NS  +  T                RRL + +LQ RRE GLC+RCD+K++ GHRC+KKE+++ +    +    EEE       EEE   A+ +
Subjt:  NPTLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLE

Query:  TLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVN
         +  E+   E   L+LNS+ GL SP+TMKL G +  ++VVV++D  A+HNFI    V  L +     +++G+ LGTG  V+  G C+GV+L +  L I  
Subjt:  TLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVN

Query:  DFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEI---
        +F PL LG++D+ILGVQWL  LG V  +W +  M+F++G   + LQGD S+ +++ISLK+M++ +R EG G+L+E+N I+ ++     G  D+  E+   
Subjt:  DFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEI---

Query:  -RELLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITV
         + LL ++  +F     LPP+R  +H++TLK    PV+V  YRYP  QK EIERLV  ML A IIQPSTS FSSPVLLVKKKDGSWRFC+DYRALN  TV
Subjt:  -RELLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITV

Query:  PDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST-SSL---------------------------Q
        PD +PIPV+DELLDEL    +F K+DLKSGYHQIRV+ +D+HKTAFRTHEGHYEFLVMPFGL NAP+T  SL                           +
Subjt:  PDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST-SSL---------------------------Q

Query:  EHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFG
         H  HL  VLH+L++N L ANL KC F    + YLGHVISA+GVAAD  KI+AM +WP+P T              RKF+A Y  +  PLT  LRKD + 
Subjt:  EHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFG

Query:  WNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQK
        W  A   AF+ LK+AMV  PVLA+PDF Q FV+E DASG G+GAVLMQN RP+A++S  L    +LK +YE EL+AIV+AVQKWR YLLGRRF  RTDQK
Subjt:  WNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQK

Query:  SLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRL-PTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGEL
        SLKF++EQR +  +YQ+W++KL+G++FEI YK G+ N  ADALSR  P   EL A     G +     +Q+  D  +++I   +Q  G    G+SV+  L
Subjt:  SLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRL-PTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGEL

Query:  VMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLP
        VMYKGRIVLPP SP    LL+ +H++P GGH+  +K Y R+  E+ W GM+ +V  +V+DC +CQQ K  +  PAGLL PLP    +WEDI+MDF++GLP
Subjt:  VMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLP

Query:  RSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYL
         S G ++ILVVVDR +K+AHF+ L+HP+TA  VA  FIKEIV LHG P SI+SDRD                   KRSTAYHPQTDGQ+E +N+ LE YL
Subjt:  RSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYL

Query:  RCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRREEEYAVN
        RCF    P KWA WL W E+ YNTS H S KMT +  +YGR  P L+        VD++D  L+ERD +L +LR        +MK  AD +RR+    V 
Subjt:  RCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRREEEYAVN

Query:  DWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGST
         +VY+KL+PYRQ +L    + KLA ++ GP+ +++RIG VAY+LDLP     HP+FHVSQL+   G++     LP  L       ++P A+L VR     
Subjt:  DWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGST

Query:  ITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG----SDRPPIFKKYMRRRKREGRIF
               EVL++W  L   +ATWE+   ++ +FP FHLEDKV  W AG    + +PP+   Y RR+KR    F
Subjt:  ITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG----SDRPPIFKKYMRRRKREGRIF

TrEMBL top hitse value%identityAlignment
A0A087GEK8 Uncharacterized protein0.0e+0046.13Show/hide
Query:  DLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTRNS
        ++ E  LE  F++GLK  I+  ++ F    L+  M +A+L+    E+       +  P SK +R  PT+       +++T   +   + LN    P    
Subjt:  DLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTRNS

Query:  AVSSNTVNFCDTNGKTFSAGS-----FRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVE
          S NT +F  T  +  +  +     +RRL   ++  R+  GLC+RCDEK+   H+C KKE+N+ +  E DG  +  E D D          T  T    
Subjt:  AVSSNTVNFCDTNGKTFSAGS-----FRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVE

Query:  IEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPL
         E AE   L+LNS+ G+ SP TMKL GT+   +VVVLIDS ASHNF+ E LV  L L  +   SYG++ G G +VR  G+C+G+VL L  L I +DF PL
Subjt:  IEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPL

Query:  SLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKG-TQDVRPEIRELLQKF
         LGSADVILG++WL +LG+++ +W    M+F +G     LQGD     S ISLKS+M+ V+ +GVG+L+E N +Q+ +  ++ G T +V   +  ++ +F
Subjt:  SLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKG-TQDVRPEIRELLQKF

Query:  QSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPV
          +F     LPP R   H + L+     V+V  +RYPQ QK EIE+ V  ML A IIQ STS+FSSPVLLVKKKDGSWRFCIDYRALN +T+PD+FPIP+
Subjt:  QSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPV

Query:  VDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLST
        +D+LLDEL   T+F K+DLKSGYHQI V+ ++V KTAFRTH+GHYEFLVMPFGL NAP+T                            SSLQEH EHL  
Subjt:  VDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLST

Query:  VLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSA
        VL +L   +L AN  KC F +SSIEYLGHVIS EGV+ADP K++AM  WP+P                R+FV GYGSI  PLT LL+KDKF W+E    A
Subjt:  VLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSA

Query:  FDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQ
        F+ LK AM  VPVLAL DF + FVVE+DASG+G+GAVL+Q  +P+AYFSQAL    +LK VYE EL+AIV A+QKWR YLLGR+F  RTDQKSLKFLLEQ
Subjt:  FDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQ

Query:  RIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVL
        R +  +YQ+W+ K++G++F+I YK GLEN AADALSR+  L +L A S    +       +VD++   ++I++ +        GYSV    ++Y G++VL
Subjt:  RIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVL

Query:  PPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSIL
        P  S  I +LL EFHN+ +GGH  V+K  + +   F W GM A ++ FV +C VCQ+ KY ++AP+GLL PLP+   +WEDIS+DF++GLP+SEG+++IL
Subjt:  PPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSIL

Query:  VVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPV
        VVVDRL+KYAHFI L+HP+ A  +A++FI+EIV LHG P ++VSDRD                     STAYHPQTDGQTEV NRGLE  LRCFT   P 
Subjt:  VVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPV

Query:  KWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRP
        KWA +L W E+ YN+SYH++++MT +  +YGR  P+LL ++ G+     ++  L+ERD M+  L++        MK  AD  RRE E+ V D V++KL+P
Subjt:  KWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRP

Query:  YRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVR---RVGSTITSLMD
        YRQ +L    + KLA +F GP+ ++ R+G VAY+L LP +   H  FHVSQL+  VGS      LPP L   +    +P A +GVR   R G        
Subjt:  YRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVR---RVGSTITSLMD

Query:  FEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG-----SDRPPIFKKYMRRRK--REGR
         EVL++W GL   D+TWE    I  QFP F LEDK +F  AG     S++ P+  +Y RR+K  ++GR
Subjt:  FEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG-----SDRPPIFKKYMRRRK--REGR

A0A2I0VWY1 Putative mitochondrial protein0.0e+0049.48Show/hide
Query:  DTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVEIEAAEKGVLTLNSLA
        + +G T SAGSF++L + ++Q +R KGLC+RC+EK+  GHRCK + L     + +D E  EE  D D   EE++        D ++E AE   ++LNS+ 
Subjt:  DTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLDVEIEAAEKGVLTLNSLA

Query:  GLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPLSLGSADVILGVQWLM
        G     TMK++G +  ++VVVLIDS A+HNFI   +   L +  +   SYG+M+GTG+   +TGICKGV ++L  + +V DF PL LGS DVILG++WL 
Subjt:  GLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPLSLGSADVILGVQWLM

Query:  TLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRELLQKFQSIFRPIDELPPARRH
        TLG+ + +W T  M+  V    +KL+G                                                 ++  LQ+F+ +F+P   LPP R  
Subjt:  TLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRELLQKFQSIFRPIDELPPARRH

Query:  DHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPVVDELLDELTCVTIFLK
        +H + LK    P++V  YRYPQ QKDEIE+LV +ML   IIQPS S FSSPVLLVKKKDGSWRFC+DYRALN  TVPD FPIPV+DEL+DEL    +F K
Subjt:  DHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPVVDELLDELTCVTIFLK

Query:  IDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLSTVLHVLSDNKLVANLNK
        IDLKSGYHQIR++ +DV KTAFRTHEGHYEFLVMPFGL NAP+T                             + +EH+EHL  VL VL +++L AN  K
Subjt:  IDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHMEHLSTVLHVLSDNKLVANLNK

Query:  CTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSAFDTLKQAMVEVPVLAL
        C FA + +EYLGHVIS EGVAAD  KIEAM  WP P T              R+FV GY +I  PLT+LL+KD F W EA   AF+ LK+AM  VPVLAL
Subjt:  CTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNEATQSAFDTLKQAMVEVPVLAL

Query:  PDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQRIIVGDYQKWIAKLIG
        PDF Q FV+ETDASG G+GAVLMQN R +AYFSQ L    +LK VYE EL+AIVLA+QKWRPYLLGR F  RTDQ+SLK+LLEQR++  ++Q+W++KL+G
Subjt:  PDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQRIIVGDYQKWIAKLIG

Query:  YDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVLPPTSPTIPLLLQEFHN
        YDFEI Y+ GLEN AADALSR     +  A S  + ++      +  Q+E LR I++ +       PGYSV GE + Y+GR V+P +S  IP LLQEFH 
Subjt:  YDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMYKGRIVLPPTSPTIPLLLQEFHN

Query:  NPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLR
        + +GGH+ V K Y RI  E  W GM+  +   V  C  CQ+ K+M+MAP GLL PL + + +WE+I+MDFIDGLP+SEGY  I VVVDRLSKYAHFIPLR
Subjt:  NPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLR

Query:  HPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTS
        HPYTAV VA+ F++EIV LHG+P+SIVSDRD                   KRSTAYHPQTDGQTEV+NR LE YLRCF  ++P +WAKWLAW EY YNTS
Subjt:  HPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTS

Query:  YHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRPYRQSTLFIHKHPKLAL
        YH++ KMT + ++YGR  P L+ Y   +  V AVD  L+ERD++L EL+R        MK+ AD+KRR+ ++ V D VY+KLRPYRQ T+   ++ KLA 
Subjt:  YHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRR--------MKRLADAKRREEEYAVNDWVYIKLRPYRQSTLFIHKHPKLAL

Query:  KFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITSLMDFEVLVQWDGLAVEDATWEN
        ++ GPF + ++IG VAY+L LPP    HP+FHVSQLRKVVG       LP TL+     +++P A+ GVR     I    + EV ++W GL   +ATWE 
Subjt:  KFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITSLMDFEVLVQWDGLAVEDATWEN

Query:  RQQIISQFPHFHLEDKVVFWGAGSDRP---------PIFKKYMRRRKREGR
         ++I  QFP FHLEDKVV W  G+  P         P       RR+ +GR
Subjt:  RQQIISQFPHFHLEDKVVFWGAGSDRP---------PIFKKYMRRRKREGR

A0A5A7VJA0 Ty3/gypsy retrotransposon protein0.0e+0043.89Show/hide
Query:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLK------WGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNP
        +SD+ +  +E  F+SGL   I+ E+ +    GL   M  AQL++D E  R          G++ +P S  ++    Q         + +    RTITL  
Subjt:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLK------WGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNP

Query:  TLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCK---KKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATL
                            +G+T   G+ +RL D++ Q RREKGLC++C+EKY+A H+CK   ++EL +F+  + +     EE+++  + E ET  A L
Subjt:  TLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCK---KKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATL

Query:  ETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIV
         T++V  +A     L++NS+ GL+ P TMK++GT+  K+VV+LID  A+HNF+ E LVT+L L     + YG++LG+G +++  GIC+ + + + + T+ 
Subjt:  ETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIV

Query:  NDFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRE
         DF PL LG  DVILG+QWL +LG   CDW    + F   +  I ++GD S+ K+++SLK+++K       G L+E  ++  +    +   ++ + EI E
Subjt:  NDFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRE

Query:  ----LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHIT
            +L +F+ IF   ++LPP R  +H + LK    PVNV  YRY   QK+E+ERLV +ML + II+PS S +SSPVLLVKKKDGSWRFC+DYRALN++T
Subjt:  ----LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHIT

Query:  VPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSL
        VPD FPIPVV+EL DEL   ++F KIDLK+GYHQIR+   D+ KTAFRTHEGHYEFLVMPFGL NAP+T                             +L
Subjt:  VPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSL

Query:  QEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKF
        ++H++H+ TV  VL  ++L AN  KC+F  + +EYLGH+IS +GV  DP+KI+A+T WP PT+              RKFV  YG++  PLTQLL+K  F
Subjt:  QEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKF

Query:  GWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQ
         WN   + AF+ LK+AM+ +PVLALP F + F +ETDASG GVGAVL+QN RP+A++S  L +  + + VYE EL+A+VLAVQ+WRPYLLG RF  RTDQ
Subjt:  GWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQ

Query:  KSLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREI------RKTIQDGGQVPPGY
        KSLKFLLEQR++   YQ+W+AKL+GY F++EYK G+EN AADALSR+   V++   +  + ++  +   +V++D +L +I        T+QD       +
Subjt:  KSLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREI------RKTIQDGGQVPPGY

Query:  SVKGELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMD
         +   ++ YK R+V+  TS  IP +L  +H++ VGGH+  ++ Y+RI  E  W GMKA V+ +  +C +CQQ K + ++PAGLLLPL +  +IW DISMD
Subjt:  SVKGELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMD

Query:  FIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRDK-------------------RSTAYHPQTDGQTEVMNR
        F++GLP++ G+  I VVVDRLSKY HFIPL+HPY+A  VA +F+KE+V LHG P SIVSDRD+                   RS+AYHPQ+DGQTEV+NR
Subjt:  FIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRDK-------------------RSTAYHPQTDGQTEVMNR

Query:  GLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRRE
        G+E YLRCF    P +W KW+ W EY YNT++  +L MT + +VYGR  P LLSY         +D  L+ERDEM+  LR        +MK+ AD +RR+
Subjt:  GLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRRE

Query:  EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSI---QPAAI
         EY V D V++K+RPYRQ +L   ++ KL+ K+ GP+ I++RIGPVAYKL+LP +   HP+FHVSQL+K+VG       + PT+Q  D   +    P   
Subjt:  EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSI---QPAAI

Query:  LGVRRVGSTITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAGSDRPPIFKKYMRRRKRE
        L  RR       + ++EV+++WDGL++ + TWE    +  ++P FHLEDKV   G  + RPPI  +Y R+ KR+
Subjt:  LGVRRVGSTITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAGSDRPPIFKKYMRRRKRE

A0A5D3BEL2 Ty3/gypsy retrotransposon protein0.0e+0043.89Show/hide
Query:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLK------WGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNP
        +SD+ +  +E  F+SGL   I+ E+ +    GL   M  AQL++D E  R          G++ +P S  ++    Q         + +    RTITL  
Subjt:  VSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLK------WGRNPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNP

Query:  TLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCK---KKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATL
                            +G+T   G+ +RL D++ Q RREKGLC++C+EKY+A H+CK   ++EL +F+  + +     EE+++  + E ET  A L
Subjt:  TLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCK---KKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATL

Query:  ETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIV
         T++V  +A     L++NS+ GL+ P TMK++GT+  K+VV+LID  A+HNF+ E LVT+L L     + YG++LG+G +++  GIC+ + + + + T+ 
Subjt:  ETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIV

Query:  NDFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRE
         DF PL LG  DVILG+QWL +LG   CDW    + F   +  I ++GD S+ K+++SLK+++K       G L+E  ++  +    +   ++ + EI E
Subjt:  NDFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVRPEIRE

Query:  ----LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHIT
            +L +F+ IF   ++LPP R  +H + LK    PVNV  YRY   QK+E+ERLV +ML + II+PS S +SSPVLLVKKKDGSWRFC+DYRALN++T
Subjt:  ----LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHIT

Query:  VPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSL
        VPD FPIPVV+EL DEL   ++F KIDLK+GYHQIR+   D+ KTAFRTHEGHYEFLVMPFGL NAP+T                             +L
Subjt:  VPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSL

Query:  QEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKF
        ++H++H+ TV  VL  ++L AN  KC+F  + +EYLGH+IS +GV  DP+KI+A+T WP PT+              RKFV  YG++  PLTQLL+K  F
Subjt:  QEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKF

Query:  GWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQ
         WN   + AF+ LK+AM+ +PVLALP F + F +ETDASG GVGAVL+QN RP+A++S  L +  + + VYE EL+A+VLAVQ+WRPYLLG RF  RTDQ
Subjt:  GWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQ

Query:  KSLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREI------RKTIQDGGQVPPGY
        KSLKFLLEQR++   YQ+W+AKL+GY F++EYK G+EN AADALSR+   V++   +  + ++  +   +V++D +L +I        T+QD       +
Subjt:  KSLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREI------RKTIQDGGQVPPGY

Query:  SVKGELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMD
         +   ++ YK R+V+  TS  IP +L  +H++ VGGH+  ++ Y+RI  E  W GMKA V+ +  +C +CQQ K + ++PAGLLLPL +  +IW DISMD
Subjt:  SVKGELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMD

Query:  FIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRDK-------------------RSTAYHPQTDGQTEVMNR
        F++GLP++ G+  I VVVDRLSKY HFIPL+HPY+A  VA +F+KE+V LHG P SIVSDRD+                   RS+AYHPQ+DGQTEV+NR
Subjt:  FIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRDK-------------------RSTAYHPQTDGQTEVMNR

Query:  GLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRRE
        G+E YLRCF    P +W KW+ W EY YNT++  +L MT + +VYGR  P LLSY         +D  L+ERDEM+  LR        +MK+ AD +RR+
Subjt:  GLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELR--------RMKRLADAKRRE

Query:  EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSI---QPAAI
         EY V D V++K+RPYRQ +L   ++ KL+ K+ GP+ I++RIGPVAYKL+LP +   HP+FHVSQL+K+VG       + PT+Q  D   +    P   
Subjt:  EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSI---QPAAI

Query:  LGVRRVGSTITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAGSDRPPIFKKYMRRRKRE
        L  RR       + ++EV+++WDGL++ + TWE    +  ++P FHLEDKV   G  + RPPI  +Y R+ KR+
Subjt:  LGVRRVGSTITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAGSDRPPIFKKYMRRRKRE

J3SDF5 Ty3/gypsy retrotransposon protein0.0e+0044.96Show/hide
Query:  LSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPES---KRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTR
        + E  L  KF+ GL  E+Q E+R+ +   L   M   +L   +EE  ++   R   P S         P    S   +  S     A           T+
Subjt:  LSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGRNPSPES---KRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTR

Query:  NSAVSSNTVNFCDTNGK-----TFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLD
        + A++SN       N K     +   G  RRL + +LQ +R KGLC++CDEK+  GH+C++KEL++    + +    E+E++  A+   E  P+  E + 
Subjt:  NSAVSSNTVNFCDTNGK-----TFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHEVDGEGVEEEIDVDAIREEETMPATLETLD

Query:  VEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLP-NLTIVNDF
         E+        +LNS+ GL +PKTMKL G + N +VVV+ID  A+HNF+    +  L +  +    +G+ LG GQ+VR TGIC+ V L L   L +V DF
Subjt:  VEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLP-NLTIVNDF

Query:  FPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGT---QDVRPEIRE
         PL LG++DVILGVQWL TLG V  +W T +M F++G     L GD ++ +S++SLK+M++ +R EG G+ LE N ++      ++ +   Q++ P ++E
Subjt:  FPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGT---QDVRPEIRE

Query:  LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDN
        L+++F+ +F     LPP R H+HA+ LK    PV V  YRYPQFQKDEIERL+ +ML A IIQPSTS FSSPV+LVKKKDGSWRFC+DYRALN  TVPD 
Subjt:  LLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDN

Query:  FPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHM
        +PIPV+DELLDEL   T+F K+DL++GYHQI V+ +D HKTAFRTHEGHYEFLVMPFGL NAP+T                             S +EH+
Subjt:  FPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPST----------------------------SSLQEHM

Query:  EHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNE
         HL  VL +L+ + L  N  KC F    + YLGHVIS  GVA D +K++A+ +W VP                RKFVA Y  I  PLT+ L+KD F W+ 
Subjt:  EHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLRKDKFGWNE

Query:  ATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLK
            AF  LK AMV  PVLA+P+F+ +FVVETDASG G+GAVLMQ+ RP+AY+S+ L    QLK VYE EL+AI  AVQKW+ YLLGR F  RTDQ+SL+
Subjt:  ATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLLGRRFTERTDQKSLK

Query:  FLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPT-LVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMY
        ++ +QR I  ++QKW++KL+GYDFEI YK GL N  ADALSR     VEL A   V GV  A    ++  D  L ++RK +Q+ G+ P  +++    +++
Subjt:  FLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPT-LVELSAFSCVIGVNTAVFASQVDQDERLREIRKTIQDGGQVPPGYSVKGELVMY

Query:  KGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSE
        KGR V+P +S  IP LL E+H+ P+GGH   +K Y R+  E+ W GM+  V  +V  C +CQQ K     P GLL PLP+  ++WEDISMDFI+GLP S+
Subjt:  KGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSE

Query:  GYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCF
        G ++ILV+VDRLSKYAHF+ LRHP+TA+MVA +F+KE+V LHG P SIVSDRD                   KRS+AYHPQTDGQTE++NR LE YLRCF
Subjt:  GYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD-------------------KRSTAYHPQTDGQTEVMNRGLEAYLRCF

Query:  TMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLREL--------RRMKRLADAKRREEEYAVNDWV
            P  WAKWL W E++YNTS HTS KM+ + ++YGR  P ++   +G  +V++++ +L++RD ++ +L        +RMK  AD  R E E+ V D V
Subjt:  TMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLREL--------RRMKRLADAKRREEEYAVNDWV

Query:  YIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITS
        +++L+PYRQ +L      KLA +F GPF ++ RIG  AYKL LPP+   HP+FHVS L+KVVG+      +PP + V     ++P  +L VR++     +
Subjt:  YIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITS

Query:  LMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG----SDRPPIFKKYMRRRKREGR
            E L++W GL   +ATWE+   I  +FP FHLEDKV  WGAG      + P    Y RR  ++G+
Subjt:  LMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG----SDRPPIFKKYMRRRKREGR

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.0e-10730.44Show/hide
Query:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL
        PE+ ++ ++F+ I      ++LP P +  +  V L  E   + +  Y  P  +   +   + + L + II+ S +  + PV+ V KK+G+ R  +DY+ L
Subjt:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL

Query:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------
        N    P+ +P+P++++LL ++   TIF K+DLKS YH IRV+  D HK AFR   G +E+LVMP+G+  AP+                            
Subjt:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------

Query:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR
        + S  EH++H+  VL  L +  L+ N  KC F  S ++++G+ IS +G     + I+ + QW  P                RKF+     + HPL  LL+
Subjt:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR

Query:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL
        KD ++ W      A + +KQ +V  PVL   DF +  ++ETDAS V VGAVL Q        P+ Y+S  +        V + E++AI+ +++ WR YL 
Subjt:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL

Query:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV
               +  +  K L + R ++G              +W   L  ++FEI Y+ G  N  ADALSR       +P   E ++ + V  ++ T  F +QV
Subjt:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV

Query:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ
              D +L  +     +  +V     +K G L+  K +I+LP  +     +++++H      H  +  +   I + F W G++  ++ +V++C  CQ 
Subjt:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ

Query:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------
         K  +  P G L P+P  +  WE +SMDFI  LP S GYN++ VVVDR SK A  +P     TA   A +F + ++   G PK I++D D          
Subjt:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------

Query:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER
                 K S  Y PQTDGQTE  N+ +E  LRC     P  W   ++  + +YN + H++ +MT + IV+ R +P L   +  + + D  D   +E 
Subjt:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER

Query:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK
         ++ + ++        +MK+  D K +E EE+   D V +K    R  T F+HK  KLA  F GPF+++ + GP  Y+LDLP ++   F   FHVS L K
Subjt:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK

P0CT35 Transposon Tf2-2 polyprotein1.0e-10730.44Show/hide
Query:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL
        PE+ ++ ++F+ I      ++LP P +  +  V L  E   + +  Y  P  +   +   + + L + II+ S +  + PV+ V KK+G+ R  +DY+ L
Subjt:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL

Query:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------
        N    P+ +P+P++++LL ++   TIF K+DLKS YH IRV+  D HK AFR   G +E+LVMP+G+  AP+                            
Subjt:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------

Query:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR
        + S  EH++H+  VL  L +  L+ N  KC F  S ++++G+ IS +G     + I+ + QW  P                RKF+     + HPL  LL+
Subjt:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR

Query:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL
        KD ++ W      A + +KQ +V  PVL   DF +  ++ETDAS V VGAVL Q        P+ Y+S  +        V + E++AI+ +++ WR YL 
Subjt:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL

Query:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV
               +  +  K L + R ++G              +W   L  ++FEI Y+ G  N  ADALSR       +P   E ++ + V  ++ T  F +QV
Subjt:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV

Query:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ
              D +L  +     +  +V     +K G L+  K +I+LP  +     +++++H      H  +  +   I + F W G++  ++ +V++C  CQ 
Subjt:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ

Query:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------
         K  +  P G L P+P  +  WE +SMDFI  LP S GYN++ VVVDR SK A  +P     TA   A +F + ++   G PK I++D D          
Subjt:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------

Query:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER
                 K S  Y PQTDGQTE  N+ +E  LRC     P  W   ++  + +YN + H++ +MT + IV+ R +P L   +  + + D  D   +E 
Subjt:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER

Query:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK
         ++ + ++        +MK+  D K +E EE+   D V +K    R  T F+HK  KLA  F GPF+++ + GP  Y+LDLP ++   F   FHVS L K
Subjt:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK

P0CT36 Transposon Tf2-3 polyprotein1.0e-10730.44Show/hide
Query:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL
        PE+ ++ ++F+ I      ++LP P +  +  V L  E   + +  Y  P  +   +   + + L + II+ S +  + PV+ V KK+G+ R  +DY+ L
Subjt:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL

Query:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------
        N    P+ +P+P++++LL ++   TIF K+DLKS YH IRV+  D HK AFR   G +E+LVMP+G+  AP+                            
Subjt:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------

Query:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR
        + S  EH++H+  VL  L +  L+ N  KC F  S ++++G+ IS +G     + I+ + QW  P                RKF+     + HPL  LL+
Subjt:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR

Query:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL
        KD ++ W      A + +KQ +V  PVL   DF +  ++ETDAS V VGAVL Q        P+ Y+S  +        V + E++AI+ +++ WR YL 
Subjt:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL

Query:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV
               +  +  K L + R ++G              +W   L  ++FEI Y+ G  N  ADALSR       +P   E ++ + V  ++ T  F +QV
Subjt:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV

Query:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ
              D +L  +     +  +V     +K G L+  K +I+LP  +     +++++H      H  +  +   I + F W G++  ++ +V++C  CQ 
Subjt:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ

Query:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------
         K  +  P G L P+P  +  WE +SMDFI  LP S GYN++ VVVDR SK A  +P     TA   A +F + ++   G PK I++D D          
Subjt:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------

Query:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER
                 K S  Y PQTDGQTE  N+ +E  LRC     P  W   ++  + +YN + H++ +MT + IV+ R +P L   +  + + D  D   +E 
Subjt:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER

Query:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK
         ++ + ++        +MK+  D K +E EE+   D V +K    R  T F+HK  KLA  F GPF+++ + GP  Y+LDLP ++   F   FHVS L K
Subjt:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK

P0CT37 Transposon Tf2-4 polyprotein1.0e-10730.44Show/hide
Query:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL
        PE+ ++ ++F+ I      ++LP P +  +  V L  E   + +  Y  P  +   +   + + L + II+ S +  + PV+ V KK+G+ R  +DY+ L
Subjt:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL

Query:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------
        N    P+ +P+P++++LL ++   TIF K+DLKS YH IRV+  D HK AFR   G +E+LVMP+G+  AP+                            
Subjt:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------

Query:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR
        + S  EH++H+  VL  L +  L+ N  KC F  S ++++G+ IS +G     + I+ + QW  P                RKF+     + HPL  LL+
Subjt:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR

Query:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL
        KD ++ W      A + +KQ +V  PVL   DF +  ++ETDAS V VGAVL Q        P+ Y+S  +        V + E++AI+ +++ WR YL 
Subjt:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL

Query:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV
               +  +  K L + R ++G              +W   L  ++FEI Y+ G  N  ADALSR       +P   E ++ + V  ++ T  F +QV
Subjt:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV

Query:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ
              D +L  +     +  +V     +K G L+  K +I+LP  +     +++++H      H  +  +   I + F W G++  ++ +V++C  CQ 
Subjt:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ

Query:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------
         K  +  P G L P+P  +  WE +SMDFI  LP S GYN++ VVVDR SK A  +P     TA   A +F + ++   G PK I++D D          
Subjt:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------

Query:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER
                 K S  Y PQTDGQTE  N+ +E  LRC     P  W   ++  + +YN + H++ +MT + IV+ R +P L   +  + + D  D   +E 
Subjt:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER

Query:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK
         ++ + ++        +MK+  D K +E EE+   D V +K    R  T F+HK  KLA  F GPF+++ + GP  Y+LDLP ++   F   FHVS L K
Subjt:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK

P0CT41 Transposon Tf2-12 polyprotein1.0e-10730.44Show/hide
Query:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL
        PE+ ++ ++F+ I      ++LP P +  +  V L  E   + +  Y  P  +   +   + + L + II+ S +  + PV+ V KK+G+ R  +DY+ L
Subjt:  PEIRELLQKFQSIFRP--IDELP-PARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRAL

Query:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------
        N    P+ +P+P++++LL ++   TIF K+DLKS YH IRV+  D HK AFR   G +E+LVMP+G+  AP+                            
Subjt:  NHITVPDNFPIPVVDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPS----------------------------

Query:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR
        + S  EH++H+  VL  L +  L+ N  KC F  S ++++G+ IS +G     + I+ + QW  P                RKF+     + HPL  LL+
Subjt:  TSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAADPKKIEAMTQWPVPTT--------------RKFVAGYGSIVHPLTQLLR

Query:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL
        KD ++ W      A + +KQ +V  PVL   DF +  ++ETDAS V VGAVL Q        P+ Y+S  +        V + E++AI+ +++ WR YL 
Subjt:  KD-KFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNG-----RPLAYFSQALPLTHQLKVVYEWELIAIVLAVQKWRPYLL

Query:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV
               +  +  K L + R ++G              +W   L  ++FEI Y+ G  N  ADALSR       +P   E ++ + V  ++ T  F +QV
Subjt:  GRRFTERTDQKSLKFLLEQRIIVG-----------DYQKWIAKLIGYDFEIEYKKGLENTAADALSR-------LPTLVELSAFSCVIGVN-TAVFASQV

Query:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ
              D +L  +     +  +V     +K G L+  K +I+LP  +     +++++H      H  +  +   I + F W G++  ++ +V++C  CQ 
Subjt:  ----DQDERLREIRKTIQDGGQVPPGYSVK-GELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQ

Query:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------
         K  +  P G L P+P  +  WE +SMDFI  LP S GYN++ VVVDR SK A  +P     TA   A +F + ++   G PK I++D D          
Subjt:  AKYMSMAPAGLLLPLPVLQMIWEDISMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRD----------

Query:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER
                 K S  Y PQTDGQTE  N+ +E  LRC     P  W   ++  + +YN + H++ +MT + IV+ R +P L   +  + + D  D   +E 
Subjt:  ---------KRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEYNYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRER

Query:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK
         ++ + ++        +MK+  D K +E EE+   D V +K    R  T F+HK  KLA  F GPF+++ + GP  Y+LDLP ++   F   FHVS L K
Subjt:  DEMLRELR--------RMKRLADAKRRE-EEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIGPVAYKLDLPPNVN--FHPMFHVSQLRK

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein8.1e-1228.71Show/hide
Query:  VDAIREEETMPATLETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGI
        +D +++++ +   LE L+ +     +G+  L  +  L   K M+  G + + +VVV IDS A+ NFI   L  SL L  S  +   ++LG  Q +++ G 
Subjt:  VDAIREEETMPATLETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGI

Query:  CKGVVLNLPNLTIVNDFFPLSLG--SADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQE
        C G+ L +  + I  +F  L L     DVILG +WL  LG+   +W   +  F   +  I L  +   ++ Q++ K  MK           E   I+ Q 
Subjt:  CKGVVLNLPNLTIVNDFFPLSLG--SADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQE

Query:  NN
        NN
Subjt:  NN

AT3G30770.1 Eukaryotic aspartyl protease family protein1.2e-0729.17Show/hide
Query:  KTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPLSL--GSADVILGVQWLMTLG
        K M+  G +   +VVV+IDS A++NFI + L   L L  S  +   ++LG  Q ++T G C G+ L +  + I  +F  L L     DVILG      L 
Subjt:  KTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTGICKGVVLNLPNLTIVNDFFPLSL--GSADVILGVQWLMTLG

Query:  KVECDWSTSEMQ-FRVGEWNIKLQGDRSV--IKSQISLKSMMKQ
        +    W   +   F   +W      D+ +  + +++ +KS  +Q
Subjt:  KVECDWSTSEMQ-FRVGEWNIKLQGDRSV--IKSQISLKSMMKQ

ATMG00850.1 DNA/RNA polymerases superfamily protein3.9e-0644.83Show/hide
Query:  TLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSW
        TL+ E+  +   L      ++  ++  + +ML ARIIQPS S +SSPVLLV+KKDG W
Subjt:  TLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein4.1e-2446.97Show/hide
Query:  MEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLG--HVISAEGVAADPKKIEAMTQWPVP--TT------------RKFVAGYGSIVHPLTQLLRKDKFG
        M HL  VL +   ++  AN  KC F    I YLG  H+IS EGV+ADP K+EAM  WP P  TT            R+FV  YG IV PLT+LL+K+   
Subjt:  MEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLG--HVISAEGVAADPKKIEAMTQWPVP--TT------------RKFVAGYGSIVHPLTQLLRKDKFG

Query:  WNEATQSAFDTLKQAMVEVPVLALPDFKQSFV
        W E    AF  LK A+  +PVLALPD K  FV
Subjt:  WNEATQSAFDTLKQAMVEVPVLALPDFKQSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGAAGGCGAAGCGTTGGAATGGCATCAATGGGAGAAGGAGAGGAATCCGATGACATCGTGGGCGGACTTCAAAGAACAACTATTATTGTGATTTCTATCGGCGA
AGGAAGAAGATCGGAAGGCGCAATTCTTCAACCTGAAGCAGGACGGCACAGTGTGAGCGATCTGTCGGAAGCTACCTTGGAGAGCAAATTCGTCAGCGGACTCAAGGAAG
AAATCCAGATGGAGATGAGGATGTTTAGTCTGGTGGGCCTCAAGCCCAAAATGCTCATGGCCCAACTAATCAAAGACATGGAGGAGGCCCGACAACTCAAGTGGGGCAGA
AACCCGAGTCCAGAAAGTAAACGGAGCAGGTCAAACCCGACTCAAGCGAAATCGACAACTATGATCGCCTCCAGCACGAGTTCGATGACAGCCCGCACGATTACTCTCAA
CCCTACTCTCAACCCGACTCGTAATTCTGCTGTTTCTTCAAACACAGTCAATTTTTGCGACACTAATGGTAAGACTTTTTCGGCAGGAAGCTTTCGCAGGTTGATGGATA
GCAAATTGCAAAGTAGAAGGGAGAAAGGGCTATGTTATCGCTGCGATGAAAAGTATACGGCGGGTCACCGTTGTAAGAAGAAGGAGTTGAATCTTTTTATCCAACACGAA
GTCGACGGCGAAGGAGTGGAAGAGGAGATCGACGTTGACGCTATTAGAGAGGAGGAGACGATGCCGGCCACGCTGGAGACATTGGACGTCGAAATAGAGGCAGCTGAAAA
GGGGGTGTTAACTTTGAATTCTTTGGCCGGGCTAGATTCACCAAAAACAATGAAGTTGCAAGGGACTGTGGGCAATAAGCAAGTGGTTGTACTAATTGACAGCTGCGCAT
CACACAATTTCATTGACGAATCACTGGTAACATCGTTGAATCTACTGCAATCACCAATGTCAAGCTATGGGATAATGCTAGGTACTGGACAATCCGTTCGAACAACCGGA
ATTTGTAAAGGTGTTGTTCTAAATCTCCCAAACTTAACAATTGTTAATGATTTCTTTCCTTTATCGTTGGGTAGTGCGGATGTAATCTTGGGAGTACAGTGGTTGATGAC
GCTGGGAAAGGTTGAATGTGATTGGAGTACGTCAGAGATGCAATTTCGAGTGGGTGAATGGAACATAAAACTCCAGGGAGATAGAAGTGTAATAAAGTCACAAATTTCTC
TGAAATCTATGATGAAGCAGGTGAGAGGGGAAGGAGTGGGGATACTACTGGAACTCAATGCAATCCAGACCCAGGAGAACAATCGGATGAAGGGAACTCAAGATGTTCGC
CCAGAAATTCGTGAGCTCTTGCAGAAATTCCAGTCAATTTTTAGACCCATTGACGAATTACCACCAGCAAGGCGCCACGATCATGCTGTTACGTTAAAACCAGAAGAGGG
ACCTGTCAATGTCTGTCTGTATCGATATCCCCAATTCCAAAAAGATGAGATTGAACGCTTGGTTGCAAAAATGTTGACGGCAAGAATAATCCAGCCGAGTACAAGCTCAT
TCTCAAGTCCCGTGCTCTTAGTCAAGAAGAAAGATGGAAGCTGGAGATTTTGTATCGATTACAGGGCCCTCAACCATATTACTGTGCCAGATAACTTCCCGATACCAGTG
GTAGATGAATTGCTAGATGAGTTAACATGTGTGACGATATTTTTGAAAATCGACCTCAAGTCTGGGTACCATCAAATACGGGTTCAGCTCAAAGATGTGCATAAAACGGC
GTTCCGAACACATGAAGGGCATTATGAGTTTCTGGTTATGCCATTCGGACTACGGAATGCCCCGTCAACTTCTTCTTTGCAGGAACATATGGAACATCTATCCACAGTGT
TGCACGTATTGTCTGACAATAAATTGGTGGCCAATTTGAACAAATGCACGTTTGCGACGTCCTCCATTGAGTATCTTGGTCACGTAATCTCTGCAGAAGGGGTGGCGGCA
GACCCAAAGAAAATCGAAGCAATGACGCAATGGCCTGTGCCCACGACACGTAAGTTTGTAGCCGGATATGGGTCGATTGTCCACCCTCTTACACAACTGCTGAGGAAGGA
TAAATTTGGATGGAATGAGGCCACTCAATCTGCCTTTGATACACTCAAACAGGCAATGGTGGAGGTGCCAGTCCTAGCGCTACCAGATTTTAAGCAATCATTTGTGGTGG
AGACCGATGCATCTGGCGTCGGAGTTGGTGCGGTTTTAATGCAAAACGGGAGACCGTTGGCGTATTTTAGTCAGGCGTTACCACTCACTCATCAGTTGAAGGTAGTATAC
GAATGGGAGTTGATTGCCATAGTCCTAGCGGTGCAGAAATGGCGGCCATATCTCTTGGGACGTAGGTTTACGGAACGTACCGACCAAAAGAGTTTGAAATTCCTGTTGGA
GCAGCGGATCATTGTAGGAGACTATCAAAAGTGGATAGCCAAGTTGATAGGATATGACTTTGAGATCGAATACAAAAAGGGATTAGAAAATACGGCAGCCGATGCTCTAT
CTCGATTGCCCACCCTTGTCGAATTATCGGCCTTCAGCTGTGTAATTGGAGTTAATACAGCAGTTTTTGCATCTCAGGTTGATCAGGATGAACGATTACGGGAGATTCGA
AAGACAATTCAAGATGGGGGCCAAGTTCCACCGGGGTATTCAGTGAAAGGTGAGTTGGTGATGTATAAAGGTAGAATCGTGCTGCCGCCAACTTCGCCTACCATTCCGCT
TTTGTTGCAAGAGTTCCATAATAACCCAGTAGGGGGTCACAATGAGGTGGTCAAAGTTTATCAAAGGATCAAGAAGGAATTTATTTGGCCCGGGATGAAAGCATCAGTAC
GTGCATTTGTAGAAGATTGCTCAGTCTGCCAACAGGCCAAGTACATGTCGATGGCACCCGCAGGGTTGTTACTACCGCTCCCTGTGCTGCAGATGATTTGGGAGGATATC
TCAATGGACTTTATTGACGGATTGCCCCGTTCAGAAGGGTACAATTCTATCCTCGTTGTCGTGGATCGGCTTTCAAAATACGCGCACTTTATCCCGCTTAGGCACCCTTA
CACAGCAGTGATGGTTGCTTCGATATTTATTAAAGAGATTGTCCATCTTCATGGTATCCCTAAGAGCATCGTATCTGATCGAGATAAGCGAAGTACGGCCTACCATCCTC
AGACGGATGGGCAAACAGAAGTAATGAATCGAGGATTGGAAGCCTACCTACGTTGTTTCACGATGCAGTCCCCGGTCAAATGGGCAAAATGGTTAGCGTGGACAGAATAC
AATTATAATACATCTTATCACACGTCGTTAAAGATGACCTCGTATGGAATAGTTTATGGTCGATCAGCACCAACGCTACTGTCTTATGATCAAGGCACAGCGGCAGTTGA
TGCTGTGGACTTCTTACTGCGAGAAAGAGATGAGATGTTAAGAGAATTGAGACGGATGAAACGCTTGGCAGACGCTAAGCGACGGGAAGAAGAGTATGCGGTTAATGACT
GGGTCTATATTAAATTGCGACCTTACAGACAGAGCACATTATTCATACACAAACATCCCAAACTCGCTCTGAAATTCGTCGGGCCCTTTCACATTATCGATCGGATCGGG
CCAGTAGCTTACAAACTGGATCTGCCACCTAATGTTAATTTTCATCCAATGTTCCATGTGTCTCAACTTCGTAAGGTGGTCGGTTCTGTCTTGCCCAAGTTTTTGTTACC
ACCTACACTACAAGTCCATGACCCGCCATCCATTCAACCAGCAGCTATTTTGGGGGTTCGGCGTGTCGGATCTACCATAACCTCACTAATGGATTTCGAAGTATTAGTGC
AATGGGATGGACTTGCAGTAGAAGACGCAACATGGGAGAACAGGCAACAAATCATCTCTCAATTCCCACACTTTCACCTTGAGGACAAGGTGGTTTTTTGGGGGGCGGGT
AGTGATAGGCCTCCAATATTTAAGAAATATATGAGAAGAAGGAAAAGAGAAGGGAGAATTTTTCTTGAATCAACGAGCTCTTCTTCGTCTCAAGGGGAGCTTGAGACGTT
AATTGCGGAGTCTCTTACAATACGAACTGCTGAAGGAATTCTTCTTGAATCGACGTGCTCTTCTTTGTCTCAAGGGAAGCTTGAGACGTTAATTGTGATTGGAAAGATAT
CTTCAAGAAGAAAGGGAGTCTCTTACAATACAGAGTGCCGAAGTATTATTGTTCCTAATGTTAGGAATGCCCCTAAGCTTAAAGCCAAATGGATTGATGCTGAGGACAAA
GAATCCATAGGGAATTATCGTGCTCTTAATGCTATTTTTAATGGTGTGGATAGAAATATCTTTCGTCTGATTAGCACATGTGTCTCTAGCAAGGAAGCTTGGGACATTTT
GGTTGTTGCATACGAGGGAACATCAAGAGTTAAGATGTCCAAATTTCAATTGTTGACTACCAAGTTCGAAAATTTGAAAATGCTTGAAGAAGGAAAACTGGTTAGAAAAA
TTCTTAAATATTTGCCGAAGAGGTTTGACATGAAGGTGACTATCATTGAAGAAGCACAGGATATTTCAACTATGAAAGTTGATGAGTTGTTCGGATCCTTACTTACGTTT
GAAATGCCTCTCGACAAAGTTTTGAAGCATTTTAAAAATAGATCTAGGAATTATGTGCCTCGGAATGTTAAAGCAAATGGAGCCTCATCCGAGAAACTAAAAAAATCTCA
AATTCAGAAGAAGGATGTTGACAACTCCAGCATAAGTGATCAAAATAAATACCATTTTTGTTGTAGGGAATGTGAAGGTTTCAGTCACTATCAGTCTGAATGCCCCAATT
TCCTTAAACGACGAAGTAAGAACAAAGTATTATCTTTGTTTGAAGAAGATTTTGAAGATAGCAGTGAATTTGAAGATGTTAAAGCTCTTGTCAACAACATTACTAAGGAC
ATTCATTTGTGCAAAAGTCAAGAAGAAGATTATAGTGGCTGTGGTGAAGAAGAAGATATGCTTGAATCAATTGAAAAGTCTTACCGTATGTTACATCAACAATGGCTTGA
GGATGTAAAAGTCCTGGATGTTCAAAAAGAAAGAATAAGACTCTTCTTACGAACAATCATAGACTCACGACCACTTGATCTAACAGGTTTTGGATACTCCGGAATGAAGA
GAAGAGTTGATGATCAACCTATTTATAAGGGAAAAGGAAAAATTACTATTGGGTGTGATCAGAAGCAGATGTTTGCACCCTTATTCTCAACATTTGCTAAAACATCCTTT
TACAAGATTGTAAACTATGCCAAGACGATATGGGAAAGGCTCTGGTCAAAAGTCTTTAACCCCTCAGTTGAAATAGGATGGAAGCTCCAACACATGACTGGAGAAAAGAA
TTACATTACTGACCTCAAGTCTGTGGAACTTGGCCAAGTGTCTTTTGGTGATGGTGCAACTACAAAGATCATTAGAAAAGGAAAACTAAAGTATCCTGGTCTTTCATTGC
TTGAAGATGTATTGCTTGTTGATGGACTTTCTACTAATCTTATCAGCATAAGTCAATTGTGTGACCAATGTCTAAATGTTAGCTTCTCTAAAGACAAATGGCTTGACACA
AATGATGCCAAAACTACCATCATGACAAGTACACGATATGTTTATAATTGTTACTTGTGGACAGCTGATGTTGCAGATCATAATGCTTCCACTCCACCTATAGCCTATGA
TGTTCGATTGCCTTGCCTTTCTTTCTCTCTCGGTGGTATCGTCAATGTGAATTCATGTGTTAACAAGAAGAGAATTGACTCGATGAAAAACAAGAAAATTGGGAAGCAAA
TGTGTGATGTTAGACATGTTATTATGACCATTGAAGACCTTACATGTCATGTCACTGAGTCCATTGGGTATGTCTCTATAGATACTTCAAATCAATTCACTGCCTCTCTC
AGAAAACCTGAAAGCAATACATCTACTATGATACCAAATCCTAATCGGTGGAAAGGATCAGGTTTACTCTTGAACTCGAAAAGGGAGCAAGGTAGCAGTTCGACTCTCAT
TCGTGGAAAGAGTCAAAGCATTGAGATGAAGAGGAGGAAGAAAAGTAAATGTGCCAACTTGAGTTTTGGGTTAAGAGTGCGCCATAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCGAAGGCGAAGCGTTGGAATGGCATCAATGGGAGAAGGAGAGGAATCCGATGACATCGTGGGCGGACTTCAAAGAACAACTATTATTGTGATTTCTATCGGCGA
AGGAAGAAGATCGGAAGGCGCAATTCTTCAACCTGAAGCAGGACGGCACAGTGTGAGCGATCTGTCGGAAGCTACCTTGGAGAGCAAATTCGTCAGCGGACTCAAGGAAG
AAATCCAGATGGAGATGAGGATGTTTAGTCTGGTGGGCCTCAAGCCCAAAATGCTCATGGCCCAACTAATCAAAGACATGGAGGAGGCCCGACAACTCAAGTGGGGCAGA
AACCCGAGTCCAGAAAGTAAACGGAGCAGGTCAAACCCGACTCAAGCGAAATCGACAACTATGATCGCCTCCAGCACGAGTTCGATGACAGCCCGCACGATTACTCTCAA
CCCTACTCTCAACCCGACTCGTAATTCTGCTGTTTCTTCAAACACAGTCAATTTTTGCGACACTAATGGTAAGACTTTTTCGGCAGGAAGCTTTCGCAGGTTGATGGATA
GCAAATTGCAAAGTAGAAGGGAGAAAGGGCTATGTTATCGCTGCGATGAAAAGTATACGGCGGGTCACCGTTGTAAGAAGAAGGAGTTGAATCTTTTTATCCAACACGAA
GTCGACGGCGAAGGAGTGGAAGAGGAGATCGACGTTGACGCTATTAGAGAGGAGGAGACGATGCCGGCCACGCTGGAGACATTGGACGTCGAAATAGAGGCAGCTGAAAA
GGGGGTGTTAACTTTGAATTCTTTGGCCGGGCTAGATTCACCAAAAACAATGAAGTTGCAAGGGACTGTGGGCAATAAGCAAGTGGTTGTACTAATTGACAGCTGCGCAT
CACACAATTTCATTGACGAATCACTGGTAACATCGTTGAATCTACTGCAATCACCAATGTCAAGCTATGGGATAATGCTAGGTACTGGACAATCCGTTCGAACAACCGGA
ATTTGTAAAGGTGTTGTTCTAAATCTCCCAAACTTAACAATTGTTAATGATTTCTTTCCTTTATCGTTGGGTAGTGCGGATGTAATCTTGGGAGTACAGTGGTTGATGAC
GCTGGGAAAGGTTGAATGTGATTGGAGTACGTCAGAGATGCAATTTCGAGTGGGTGAATGGAACATAAAACTCCAGGGAGATAGAAGTGTAATAAAGTCACAAATTTCTC
TGAAATCTATGATGAAGCAGGTGAGAGGGGAAGGAGTGGGGATACTACTGGAACTCAATGCAATCCAGACCCAGGAGAACAATCGGATGAAGGGAACTCAAGATGTTCGC
CCAGAAATTCGTGAGCTCTTGCAGAAATTCCAGTCAATTTTTAGACCCATTGACGAATTACCACCAGCAAGGCGCCACGATCATGCTGTTACGTTAAAACCAGAAGAGGG
ACCTGTCAATGTCTGTCTGTATCGATATCCCCAATTCCAAAAAGATGAGATTGAACGCTTGGTTGCAAAAATGTTGACGGCAAGAATAATCCAGCCGAGTACAAGCTCAT
TCTCAAGTCCCGTGCTCTTAGTCAAGAAGAAAGATGGAAGCTGGAGATTTTGTATCGATTACAGGGCCCTCAACCATATTACTGTGCCAGATAACTTCCCGATACCAGTG
GTAGATGAATTGCTAGATGAGTTAACATGTGTGACGATATTTTTGAAAATCGACCTCAAGTCTGGGTACCATCAAATACGGGTTCAGCTCAAAGATGTGCATAAAACGGC
GTTCCGAACACATGAAGGGCATTATGAGTTTCTGGTTATGCCATTCGGACTACGGAATGCCCCGTCAACTTCTTCTTTGCAGGAACATATGGAACATCTATCCACAGTGT
TGCACGTATTGTCTGACAATAAATTGGTGGCCAATTTGAACAAATGCACGTTTGCGACGTCCTCCATTGAGTATCTTGGTCACGTAATCTCTGCAGAAGGGGTGGCGGCA
GACCCAAAGAAAATCGAAGCAATGACGCAATGGCCTGTGCCCACGACACGTAAGTTTGTAGCCGGATATGGGTCGATTGTCCACCCTCTTACACAACTGCTGAGGAAGGA
TAAATTTGGATGGAATGAGGCCACTCAATCTGCCTTTGATACACTCAAACAGGCAATGGTGGAGGTGCCAGTCCTAGCGCTACCAGATTTTAAGCAATCATTTGTGGTGG
AGACCGATGCATCTGGCGTCGGAGTTGGTGCGGTTTTAATGCAAAACGGGAGACCGTTGGCGTATTTTAGTCAGGCGTTACCACTCACTCATCAGTTGAAGGTAGTATAC
GAATGGGAGTTGATTGCCATAGTCCTAGCGGTGCAGAAATGGCGGCCATATCTCTTGGGACGTAGGTTTACGGAACGTACCGACCAAAAGAGTTTGAAATTCCTGTTGGA
GCAGCGGATCATTGTAGGAGACTATCAAAAGTGGATAGCCAAGTTGATAGGATATGACTTTGAGATCGAATACAAAAAGGGATTAGAAAATACGGCAGCCGATGCTCTAT
CTCGATTGCCCACCCTTGTCGAATTATCGGCCTTCAGCTGTGTAATTGGAGTTAATACAGCAGTTTTTGCATCTCAGGTTGATCAGGATGAACGATTACGGGAGATTCGA
AAGACAATTCAAGATGGGGGCCAAGTTCCACCGGGGTATTCAGTGAAAGGTGAGTTGGTGATGTATAAAGGTAGAATCGTGCTGCCGCCAACTTCGCCTACCATTCCGCT
TTTGTTGCAAGAGTTCCATAATAACCCAGTAGGGGGTCACAATGAGGTGGTCAAAGTTTATCAAAGGATCAAGAAGGAATTTATTTGGCCCGGGATGAAAGCATCAGTAC
GTGCATTTGTAGAAGATTGCTCAGTCTGCCAACAGGCCAAGTACATGTCGATGGCACCCGCAGGGTTGTTACTACCGCTCCCTGTGCTGCAGATGATTTGGGAGGATATC
TCAATGGACTTTATTGACGGATTGCCCCGTTCAGAAGGGTACAATTCTATCCTCGTTGTCGTGGATCGGCTTTCAAAATACGCGCACTTTATCCCGCTTAGGCACCCTTA
CACAGCAGTGATGGTTGCTTCGATATTTATTAAAGAGATTGTCCATCTTCATGGTATCCCTAAGAGCATCGTATCTGATCGAGATAAGCGAAGTACGGCCTACCATCCTC
AGACGGATGGGCAAACAGAAGTAATGAATCGAGGATTGGAAGCCTACCTACGTTGTTTCACGATGCAGTCCCCGGTCAAATGGGCAAAATGGTTAGCGTGGACAGAATAC
AATTATAATACATCTTATCACACGTCGTTAAAGATGACCTCGTATGGAATAGTTTATGGTCGATCAGCACCAACGCTACTGTCTTATGATCAAGGCACAGCGGCAGTTGA
TGCTGTGGACTTCTTACTGCGAGAAAGAGATGAGATGTTAAGAGAATTGAGACGGATGAAACGCTTGGCAGACGCTAAGCGACGGGAAGAAGAGTATGCGGTTAATGACT
GGGTCTATATTAAATTGCGACCTTACAGACAGAGCACATTATTCATACACAAACATCCCAAACTCGCTCTGAAATTCGTCGGGCCCTTTCACATTATCGATCGGATCGGG
CCAGTAGCTTACAAACTGGATCTGCCACCTAATGTTAATTTTCATCCAATGTTCCATGTGTCTCAACTTCGTAAGGTGGTCGGTTCTGTCTTGCCCAAGTTTTTGTTACC
ACCTACACTACAAGTCCATGACCCGCCATCCATTCAACCAGCAGCTATTTTGGGGGTTCGGCGTGTCGGATCTACCATAACCTCACTAATGGATTTCGAAGTATTAGTGC
AATGGGATGGACTTGCAGTAGAAGACGCAACATGGGAGAACAGGCAACAAATCATCTCTCAATTCCCACACTTTCACCTTGAGGACAAGGTGGTTTTTTGGGGGGCGGGT
AGTGATAGGCCTCCAATATTTAAGAAATATATGAGAAGAAGGAAAAGAGAAGGGAGAATTTTTCTTGAATCAACGAGCTCTTCTTCGTCTCAAGGGGAGCTTGAGACGTT
AATTGCGGAGTCTCTTACAATACGAACTGCTGAAGGAATTCTTCTTGAATCGACGTGCTCTTCTTTGTCTCAAGGGAAGCTTGAGACGTTAATTGTGATTGGAAAGATAT
CTTCAAGAAGAAAGGGAGTCTCTTACAATACAGAGTGCCGAAGTATTATTGTTCCTAATGTTAGGAATGCCCCTAAGCTTAAAGCCAAATGGATTGATGCTGAGGACAAA
GAATCCATAGGGAATTATCGTGCTCTTAATGCTATTTTTAATGGTGTGGATAGAAATATCTTTCGTCTGATTAGCACATGTGTCTCTAGCAAGGAAGCTTGGGACATTTT
GGTTGTTGCATACGAGGGAACATCAAGAGTTAAGATGTCCAAATTTCAATTGTTGACTACCAAGTTCGAAAATTTGAAAATGCTTGAAGAAGGAAAACTGGTTAGAAAAA
TTCTTAAATATTTGCCGAAGAGGTTTGACATGAAGGTGACTATCATTGAAGAAGCACAGGATATTTCAACTATGAAAGTTGATGAGTTGTTCGGATCCTTACTTACGTTT
GAAATGCCTCTCGACAAAGTTTTGAAGCATTTTAAAAATAGATCTAGGAATTATGTGCCTCGGAATGTTAAAGCAAATGGAGCCTCATCCGAGAAACTAAAAAAATCTCA
AATTCAGAAGAAGGATGTTGACAACTCCAGCATAAGTGATCAAAATAAATACCATTTTTGTTGTAGGGAATGTGAAGGTTTCAGTCACTATCAGTCTGAATGCCCCAATT
TCCTTAAACGACGAAGTAAGAACAAAGTATTATCTTTGTTTGAAGAAGATTTTGAAGATAGCAGTGAATTTGAAGATGTTAAAGCTCTTGTCAACAACATTACTAAGGAC
ATTCATTTGTGCAAAAGTCAAGAAGAAGATTATAGTGGCTGTGGTGAAGAAGAAGATATGCTTGAATCAATTGAAAAGTCTTACCGTATGTTACATCAACAATGGCTTGA
GGATGTAAAAGTCCTGGATGTTCAAAAAGAAAGAATAAGACTCTTCTTACGAACAATCATAGACTCACGACCACTTGATCTAACAGGTTTTGGATACTCCGGAATGAAGA
GAAGAGTTGATGATCAACCTATTTATAAGGGAAAAGGAAAAATTACTATTGGGTGTGATCAGAAGCAGATGTTTGCACCCTTATTCTCAACATTTGCTAAAACATCCTTT
TACAAGATTGTAAACTATGCCAAGACGATATGGGAAAGGCTCTGGTCAAAAGTCTTTAACCCCTCAGTTGAAATAGGATGGAAGCTCCAACACATGACTGGAGAAAAGAA
TTACATTACTGACCTCAAGTCTGTGGAACTTGGCCAAGTGTCTTTTGGTGATGGTGCAACTACAAAGATCATTAGAAAAGGAAAACTAAAGTATCCTGGTCTTTCATTGC
TTGAAGATGTATTGCTTGTTGATGGACTTTCTACTAATCTTATCAGCATAAGTCAATTGTGTGACCAATGTCTAAATGTTAGCTTCTCTAAAGACAAATGGCTTGACACA
AATGATGCCAAAACTACCATCATGACAAGTACACGATATGTTTATAATTGTTACTTGTGGACAGCTGATGTTGCAGATCATAATGCTTCCACTCCACCTATAGCCTATGA
TGTTCGATTGCCTTGCCTTTCTTTCTCTCTCGGTGGTATCGTCAATGTGAATTCATGTGTTAACAAGAAGAGAATTGACTCGATGAAAAACAAGAAAATTGGGAAGCAAA
TGTGTGATGTTAGACATGTTATTATGACCATTGAAGACCTTACATGTCATGTCACTGAGTCCATTGGGTATGTCTCTATAGATACTTCAAATCAATTCACTGCCTCTCTC
AGAAAACCTGAAAGCAATACATCTACTATGATACCAAATCCTAATCGGTGGAAAGGATCAGGTTTACTCTTGAACTCGAAAAGGGAGCAAGGTAGCAGTTCGACTCTCAT
TCGTGGAAAGAGTCAAAGCATTGAGATGAAGAGGAGGAAGAAAAGTAAATGTGCCAACTTGAGTTTTGGGTTAAGAGTGCGCCATAGGTGA
Protein sequenceShow/hide protein sequence
MSRRRSVGMASMGEGEESDDIVGGLQRTTIIVISIGEGRRSEGAILQPEAGRHSVSDLSEATLESKFVSGLKEEIQMEMRMFSLVGLKPKMLMAQLIKDMEEARQLKWGR
NPSPESKRSRSNPTQAKSTTMIASSTSSMTARTITLNPTLNPTRNSAVSSNTVNFCDTNGKTFSAGSFRRLMDSKLQSRREKGLCYRCDEKYTAGHRCKKKELNLFIQHE
VDGEGVEEEIDVDAIREEETMPATLETLDVEIEAAEKGVLTLNSLAGLDSPKTMKLQGTVGNKQVVVLIDSCASHNFIDESLVTSLNLLQSPMSSYGIMLGTGQSVRTTG
ICKGVVLNLPNLTIVNDFFPLSLGSADVILGVQWLMTLGKVECDWSTSEMQFRVGEWNIKLQGDRSVIKSQISLKSMMKQVRGEGVGILLELNAIQTQENNRMKGTQDVR
PEIRELLQKFQSIFRPIDELPPARRHDHAVTLKPEEGPVNVCLYRYPQFQKDEIERLVAKMLTARIIQPSTSSFSSPVLLVKKKDGSWRFCIDYRALNHITVPDNFPIPV
VDELLDELTCVTIFLKIDLKSGYHQIRVQLKDVHKTAFRTHEGHYEFLVMPFGLRNAPSTSSLQEHMEHLSTVLHVLSDNKLVANLNKCTFATSSIEYLGHVISAEGVAA
DPKKIEAMTQWPVPTTRKFVAGYGSIVHPLTQLLRKDKFGWNEATQSAFDTLKQAMVEVPVLALPDFKQSFVVETDASGVGVGAVLMQNGRPLAYFSQALPLTHQLKVVY
EWELIAIVLAVQKWRPYLLGRRFTERTDQKSLKFLLEQRIIVGDYQKWIAKLIGYDFEIEYKKGLENTAADALSRLPTLVELSAFSCVIGVNTAVFASQVDQDERLREIR
KTIQDGGQVPPGYSVKGELVMYKGRIVLPPTSPTIPLLLQEFHNNPVGGHNEVVKVYQRIKKEFIWPGMKASVRAFVEDCSVCQQAKYMSMAPAGLLLPLPVLQMIWEDI
SMDFIDGLPRSEGYNSILVVVDRLSKYAHFIPLRHPYTAVMVASIFIKEIVHLHGIPKSIVSDRDKRSTAYHPQTDGQTEVMNRGLEAYLRCFTMQSPVKWAKWLAWTEY
NYNTSYHTSLKMTSYGIVYGRSAPTLLSYDQGTAAVDAVDFLLRERDEMLRELRRMKRLADAKRREEEYAVNDWVYIKLRPYRQSTLFIHKHPKLALKFVGPFHIIDRIG
PVAYKLDLPPNVNFHPMFHVSQLRKVVGSVLPKFLLPPTLQVHDPPSIQPAAILGVRRVGSTITSLMDFEVLVQWDGLAVEDATWENRQQIISQFPHFHLEDKVVFWGAG
SDRPPIFKKYMRRRKREGRIFLESTSSSSSQGELETLIAESLTIRTAEGILLESTCSSLSQGKLETLIVIGKISSRRKGVSYNTECRSIIVPNVRNAPKLKAKWIDAEDK
ESIGNYRALNAIFNGVDRNIFRLISTCVSSKEAWDILVVAYEGTSRVKMSKFQLLTTKFENLKMLEEGKLVRKILKYLPKRFDMKVTIIEEAQDISTMKVDELFGSLLTF
EMPLDKVLKHFKNRSRNYVPRNVKANGASSEKLKKSQIQKKDVDNSSISDQNKYHFCCRECEGFSHYQSECPNFLKRRSKNKVLSLFEEDFEDSSEFEDVKALVNNITKD
IHLCKSQEEDYSGCGEEEDMLESIEKSYRMLHQQWLEDVKVLDVQKERIRLFLRTIIDSRPLDLTGFGYSGMKRRVDDQPIYKGKGKITIGCDQKQMFAPLFSTFAKTSF
YKIVNYAKTIWERLWSKVFNPSVEIGWKLQHMTGEKNYITDLKSVELGQVSFGDGATTKIIRKGKLKYPGLSLLEDVLLVDGLSTNLISISQLCDQCLNVSFSKDKWLDT
NDAKTTIMTSTRYVYNCYLWTADVADHNASTPPIAYDVRLPCLSFSLGGIVNVNSCVNKKRIDSMKNKKIGKQMCDVRHVIMTIEDLTCHVTESIGYVSIDTSNQFTASL
RKPESNTSTMIPNPNRWKGSGLLLNSKREQGSSSTLIRGKSQSIEMKRRKKSKCANLSFGLRVRHR