; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012840 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012840
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr1:44846479..44849273
RNA-Seq ExpressionLag0012840
SyntenyLag0012840
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI10421.1 hypothetical protein PRUPE_4G046500 [Prunus persica]2.6e-13935.35Show/hide
Query:  INFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHNPNI-E
        I F Y G  V  P  +YI G       +D DR+S +E++ M+ +  V N + FYY  PG+  + G   L  D  ++  +  +P +R + +YVEH+ ++ E
Subjt:  INFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHNPNI-E

Query:  VL-DLTEPIVMERPI--------------------------------FLEWRPIGQQNPLQ---------NQIEFD----------DFVEVDVIGESSLS
        V+ DLT+P + +  I                                F E R        Q         N+ E+D          ++ E D  G +S S
Subjt:  VL-DLTEPIVMERPI--------------------------------FLEWRPIGQQNPLQ---------NQIEFD----------DFVEVDVIGESSLS

Query:  E---------SGSSSLSSESVASL------------NINVNVEEEEVESDIGSYSN--LNTSIGSDEEEAVRG-PEFRVETDMNQVEFRVGMRFTSPKVL
        E             +   + V +L            N   +++ E++  D   Y++  L+++  SDEE   R   E  ++TDM+  +F VGM+F S KVL
Subjt:  E---------SGSSSLSSESVASL------------NINVNVEEEEVESDIGSYSN--LNTSIGSDEEEAVRG-PEFRVETDMNQVEFRVGMRFTSPKVL

Query:  KDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDF
        K AI+ Y   +++  K +KND                             K+Y   H+CS+ F N+N+ S ++++ Y+ R +  P      +  EV ++ 
Subjt:  KDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDF

Query:  QVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLM
           +S  Q Y+A+RKA   I G+  +QY+KLW+Y  EIR++N GS++ ++ D       P F+R YVC  ACK+GF+AGCRP I +DACHLKG  QGQL+
Subjt:  QVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLM

Query:  AAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCN
         ++G+D +D++YPIA+AV E E+KDSW WFL LL  D+G  S  G+TFISDQQKGL   FE   P  SHR+CVRHLYGNF+++FKG ALK   W AA  +
Subjt:  AAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCN

Query:  NMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILK
        N  DFN  M ++  L   AY +L       W+RH F    KCD LLNNL E FN+ +L AR+KPI+ + E IR  IM R+  K + I++ +  I  +I  
Subjt:  NMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILK

Query:  KLEKSIRVSRNAIPLWAGNGKFEV-ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNT
        KLEK+  +S    P++ GNG F+V ++   Q+ V+L  RTC+C  W+++GIP  HAV  I++ +++ E FVD  Y  +A +  YS  + P+     WP  
Subjt:  KLEKSIRVSRNAIPLWAGNGKFEV-ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNT

Query:  TYEPILPPVIRRPPGRPRKQR--KQDDDEAGTSSTNISRK---FRCGRCRLSGHNSRSCK
           P+LPP++++ PGRP+K+R  K  + E     T + +K     C +C   GHN  +CK
Subjt:  TYEPILPPVIRRPPGRPRKQR--KQDDDEAGTSSTNISRK---FRCGRCRLSGHNSRSCK

XP_020080379.1 uncharacterized protein LOC109704055 [Ananas comosus]2.9e-13534.76Show/hide
Query:  FHINFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEE---FYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHN
        F +  H+ G   F+P   Y+G +       D D +SL++++    D   +  EE   +Y+ +PG +L+ G K L  D DV +M     ++  + +Y+E  
Subjt:  FHINFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEE---FYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHN

Query:  PNIEVLDLTEPIV---------MERPIFLEWRPIGQQNPLQ---------------------------------NQIEFDDFVEVDVI---------GE-
        P I  ++ ++ I+          E+ I  +    G+Q   Q                                 N+   DD V  + +         GE 
Subjt:  PNIEVLDLTEPIV---------MERPIFLEWRPIGQQNPLQ---------------------------------NQIEFDDFVEVDVI---------GE-

Query:  --SSLSESGSSSLSSESVASL---------NINVNVE----------------EEEVESDIGSYSNLNTSIGSDEEEAV---RGPEFRVETDMNQVEFRV
          ++ S+  S SL  ES   L         N++ +VE                +   ++D      L++   S E++ V   R PEFR ETDM   +F +
Subjt:  --SSLSESGSSSLSSESVASL---------NINVNVE----------------EEEVESDIGSYSNLNTSIGSDEEEAV---RGPEFRVETDMNQVEFRV

Query:  GMRFTSPKVLKDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLA
        GM F S K  + AI+EY+++  +NIK +KN+                             K Y  +H C + F+N+ + S+W+AK Y++RFR+ P W L 
Subjt:  GMRFTSPKVLKDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLA

Query:  EIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACH
           E V  D  V +SR++ YRA+RKA R I GS  EQY  LWDYA E+++SN GST  I+CD         F+  YVC  ACK GFLAGCRP I+LD C 
Subjt:  EIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACH

Query:  LKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALK
        LK    GQL+AA+GVD +D ++PIA+AVV+ E+  SW WFL  L  D+   ++  +TF+SD+QKGL+   E+  P   HRFCVRHLY NFQ  +KG  LK
Subjt:  LKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALK

Query:  TLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRM
           W AAR + +  F   M  I      AY++L+  P + WSR  F   PKCD LLNN+ E FN  +L AR KPI+ + E IR  +M R+  K E   + 
Subjt:  TLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRM

Query:  DKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPM
        +  I  +I+KK EK    S    P  AG+GKF+V S   Q+VVDL  +TC+C  W ++G+PCPHA+  + +   +PED+V D Y  +     Y+H + P+
Subjt:  DKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPM

Query:  NGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDEAGTSSTNISR---KFRCGRCRLSGHNSRSCKGPVKRRSRKGSSSTFMGFNIPCQNIGTQESATN
        NG D+W  T  EPI PP  ++ PGRP+  RK+   E  TS+  + R   +  C +C   GHN  +C     RRSR G         +P +    +   TN
Subjt:  NGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDEAGTSSTNISR---KFRCGRCRLSGHNSRSCKGPVKRRSRKGSSSTFMGFNIPCQNIGTQESATN

Query:  PSTPHGTPS
         +TP  TP+
Subjt:  PSTPHGTPS

XP_020410029.1 uncharacterized protein LOC18790135 [Prunus persica]4.0e-13238.55Show/hide
Query:  NVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND----------------------
        +VE +  +S+   Y+  N    SDEE     PEF   TDM   +F +G+ F+  KV + A++EY++ +  ++ FI+N+                      
Subjt:  NVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND----------------------

Query:  --------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSN
                KTY+G+HTC++ +EN  + S W++K Y+   +S P W +    E V+KD+   +SR Q YRA+ KA + I GS NEQY ++WDY  E+R++N
Subjt:  --------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSN

Query:  VGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFS
         G+T  ++CD   +  +P F+R YVC  A KEGF AGCRP I LD C LK    GQL+ A+GVD ++  + IA+AVVE+E K+SW WFL LL  D    +
Subjt:  VGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFS

Query:  TKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLS--GIPIKYWSRHAFPCAP
          G+TFISD+QKGLV  FE+  P   HRFC RHL+ N++  FK  +L+  FW+A+    +  F  AM ++ +LS  AY++L+    P ++WS+  F    
Subjt:  TKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLS--GIPIKYWSRHAFPCAP

Query:  KCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVES-GTSQYVVDLDVRT
        KCD LLNNL ESFNA +L  R++PI+ +FE IR ++M RI  + +K+   +K I  RI KK+E +   +   + +W+G GKF+V + G +QY+VDL++RT
Subjt:  KCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVES-GTSQYVVDLDVRT

Query:  CTCGLWQISGIPCPHAVQCIYYV-KKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDE----------AG
        C+C  W + G PC HA+  I Y       DFVDD Y K   + TY + +LPMNG D+W  + + P LPP     PGRPRK R ++ +E            
Subjt:  CTCGLWQISGIPCPHAVQCIYYV-KKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDE----------AG

Query:  TSSTNISR------KFRCGRCRLSGHNSRSCKGPVKRRSRKGSSSTFMGFNIPCQNIGTQESATNP---STPHGTPS
         S T +SR        +CG+C   GHN R+C   +  +++  +           +  GT+ + T P   ST  G PS
Subjt:  TSSTNISR------KFRCGRCRLSGHNSRSCKGPVKRRSRKGSSSTFMGFNIPCQNIGTQESATNP---STPHGTPS

XP_020419098.1 uncharacterized protein LOC18777017 isoform X1 [Prunus persica]5.2e-13238.4Show/hide
Query:  NVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND----------------------
        +VE +  +S+   Y+  N    SDEE     PEF   TDM   +F +G+ F+  KV + A++EY++ +  ++ FI+N+                      
Subjt:  NVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND----------------------

Query:  --------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSN
                KTY+G+HTC++ +EN  + S W++K Y+   +S P W +    E V+KD+   +SR Q YRA+ KA + I GS NEQY ++WDY  E+R++N
Subjt:  --------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSN

Query:  VGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFS
         G+T  ++CD   +  +P F+R YVC  A KEGF AGCRP I LD C LK    GQL+ A+G+D ++  + IA+AVVE+E K+SW WFL LL  D    +
Subjt:  VGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFS

Query:  TKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLS--GIPIKYWSRHAFPCAP
          G+TFISD+QKGLV  FE+  P   HRFC RHL+ N++  FK  +L+  FW+A+    +  F  AM ++ +LS  AY++L+    P ++WS+  F    
Subjt:  TKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLS--GIPIKYWSRHAFPCAP

Query:  KCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVES-GTSQYVVDLDVRT
        KCD LLNNL ESFNA +L  R++PI+ +FE IR ++M RI  + +K+   +K I  RI KK+E +   +   + +W+G GKF+V + G +QY+VDL++RT
Subjt:  KCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVES-GTSQYVVDLDVRT

Query:  CTCGLWQISGIPCPHAVQCIYYV-KKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDE----------AG
        C+C  W + G PC HA+  I Y       DFVDD Y K   + TY + +LPMNG D+W  + + P LPP     PGRPRK R ++ +E            
Subjt:  CTCGLWQISGIPCPHAVQCIYYV-KKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDE----------AG

Query:  TSSTNISR------KFRCGRCRLSGHNSRSCKGPVKRRSRKGSSSTFMGFNIPCQNIGTQESATNP---STPHGTPS
         S T +SR        +CG+C   GHN R+C   +  +++  +           +  GT+ + T P   ST  G PS
Subjt:  TSSTNISR------KFRCGRCRLSGHNSRSCKGPVKRRSRKGSSSTFMGFNIPCQNIGTQESATNP---STPHGTPS

XP_020419099.1 uncharacterized protein LOC18777017 isoform X2 [Prunus persica]1.0e-13238.69Show/hide
Query:  NVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND----------------------
        +VE +  +S+   Y+  N    SDEE     PEF   TDM   +F +G+ F+  KV + A++EY++ +  ++ FI+N+                      
Subjt:  NVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND----------------------

Query:  --------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSN
                KTY+G+HTC++ +EN  + S W++K Y+   +S P W +    E V+KD+   +SR Q YRA+ KA + I GS NEQY ++WDY  E+R++N
Subjt:  --------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSN

Query:  VGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFS
         G+T  ++CD   +  +P F+R YVC  A KEGF AGCRP I LD C LK    GQL+ A+G+D ++  + IA+AVVE+E K+SW WFL LL  D    +
Subjt:  VGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFS

Query:  TKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLS--GIPIKYWSRHAFPCAP
          G+TFISD+QKGLV  FE+  P   HRFC RHL+ N++  FK  +L+  FW+A+    +  F  AM ++ +LS  AY++L+    P ++WS+  F    
Subjt:  TKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLS--GIPIKYWSRHAFPCAP

Query:  KCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVES-GTSQYVVDLDVRT
        KCD LLNNL ESFNA +L  R++PI+ +FE IR ++M RI  + +K+   +K I  RI KK+E +   +   + +W+G GKF+V + G +QY+VDL++RT
Subjt:  KCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVES-GTSQYVVDLDVRT

Query:  CTCGLWQISGIPCPHAVQCIYYV-KKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDE----------AG
        C+C  W + G PC HA+  I Y       DFVDD Y K   + TY + +LPMNG D+W  + + P LPP     PGRPRK R ++ +E            
Subjt:  CTCGLWQISGIPCPHAVQCIYYV-KKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDE----------AG

Query:  TSSTNISR------KFRCGRCRLSGHNSRSC------KGPVKRRSRKGSSSTFMGFNIP
         S T +SR        +CG+C   GHN R+C      K     + ++G+ +T +    P
Subjt:  TSSTNISR------KFRCGRCRLSGHNSRSC------KGPVKRRSRKGSSSTFMGFNIP

TrEMBL top hitse value%identityAlignment
A0A251PFQ5 ZnF_PMZ domain-containing protein1.2e-13935.35Show/hide
Query:  INFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHNPNI-E
        I F Y G  V  P  +YI G       +D DR+S +E++ M+ +  V N + FYY  PG+  + G   L  D  ++  +  +P +R + +YVEH+ ++ E
Subjt:  INFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHNPNI-E

Query:  VL-DLTEPIVMERPI--------------------------------FLEWRPIGQQNPLQ---------NQIEFD----------DFVEVDVIGESSLS
        V+ DLT+P + +  I                                F E R        Q         N+ E+D          ++ E D  G +S S
Subjt:  VL-DLTEPIVMERPI--------------------------------FLEWRPIGQQNPLQ---------NQIEFD----------DFVEVDVIGESSLS

Query:  E---------SGSSSLSSESVASL------------NINVNVEEEEVESDIGSYSN--LNTSIGSDEEEAVRG-PEFRVETDMNQVEFRVGMRFTSPKVL
        E             +   + V +L            N   +++ E++  D   Y++  L+++  SDEE   R   E  ++TDM+  +F VGM+F S KVL
Subjt:  E---------SGSSSLSSESVASL------------NINVNVEEEEVESDIGSYSN--LNTSIGSDEEEAVRG-PEFRVETDMNQVEFRVGMRFTSPKVL

Query:  KDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDF
        K AI+ Y   +++  K +KND                             K+Y   H+CS+ F N+N+ S ++++ Y+ R +  P      +  EV ++ 
Subjt:  KDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDF

Query:  QVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLM
           +S  Q Y+A+RKA   I G+  +QY+KLW+Y  EIR++N GS++ ++ D       P F+R YVC  ACK+GF+AGCRP I +DACHLKG  QGQL+
Subjt:  QVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLM

Query:  AAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCN
         ++G+D +D++YPIA+AV E E+KDSW WFL LL  D+G  S  G+TFISDQQKGL   FE   P  SHR+CVRHLYGNF+++FKG ALK   W AA  +
Subjt:  AAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCN

Query:  NMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILK
        N  DFN  M ++  L   AY +L       W+RH F    KCD LLNNL E FN+ +L AR+KPI+ + E IR  IM R+  K + I++ +  I  +I  
Subjt:  NMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILK

Query:  KLEKSIRVSRNAIPLWAGNGKFEV-ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNT
        KLEK+  +S    P++ GNG F+V ++   Q+ V+L  RTC+C  W+++GIP  HAV  I++ +++ E FVD  Y  +A +  YS  + P+     WP  
Subjt:  KLEKSIRVSRNAIPLWAGNGKFEV-ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNT

Query:  TYEPILPPVIRRPPGRPRKQR--KQDDDEAGTSSTNISRK---FRCGRCRLSGHNSRSCK
           P+LPP++++ PGRP+K+R  K  + E     T + +K     C +C   GHN  +CK
Subjt:  TYEPILPPVIRRPPGRPRKQR--KQDDDEAGTSSTNISRK---FRCGRCRLSGHNSRSCK

A0A2N9FZE3 SWIM-type domain-containing protein1.5e-14835.83Show/hide
Query:  EVFHINFHYNGRM-VFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMIS-LVPESRLIHIYVEH
        + F +  HYNG M        YIGG   V   ++ D  S+ E+  M  +L V+N  +FYYV PG  LS G + L TD+ +++M   L P +  + +YVEH
Subjt:  EVFHINFHYNGRM-VFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMIS-LVPESRLIHIYVEH

Query:  NPNIEVLDLTEPIVMERPIFLE--------------------------------WRPIGQQNPL-----QNQIEFDDFVEVDVIGESSL-----------
               +   P+ M++PI +                                    +   N +      + + F+   +V +  + S            
Subjt:  NPNIEVLDLTEPIVMERPIFLE--------------------------------WRPIGQQNPL-----QNQIEFDDFVEVDVIGESSL-----------

Query:  --SESGSSSLSSESVASLNINVNVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND
          S+       +     ++I+  +++  + SD GS   LN+   SD+E     P+FR  TD++   F +GM F     LK+A+  Y ++    ++F KN+
Subjt:  --SESGSSSLSSESVASLNINVNVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKND

Query:  KTYI----------------GD-------------HTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRIN
        +  +                GD             HTCSR F +R ++S WVAK Y++ FR  P+  L E+I +V KD +V +SRS+AYRA++ A   I 
Subjt:  KTYI----------------GD-------------HTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRIN

Query:  GSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEA
        GS  EQY+++ DY  E+ R+N G+T K+      +     F+R YVC   CK+GFLAGCRP I LDACHLKG   GQL+AA+GVDG++ +YPIA+AV EA
Subjt:  GSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEA

Query:  ETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQ
        E+ ++WTWFL  L  DIG     G+ F+SDQQKGLVP  ++      HRFCVRHL+ NF+K  KG  LK L W AAR + +++F+  M E+  +S  A+ 
Subjt:  ETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQ

Query:  YLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGK
         L G  +  W+RHAFP  PKCD LLNNL E+FN+ ++ AR KPII + ETIR+ +M RI K  + + +    I  RI +KL+K    S+   P W G  +
Subjt:  YLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGK

Query:  FEVESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRK
        +EV S   +Y+VD+  ++C C  W ++GIPC HAV+ I Y     ED+VDD ++KK  +  YSH + P NGPD WP    +P+LPP+ RR PGRP++  +
Subjt:  FEVESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRK

Query:  QDDDEAGTSSTNISR---KFRCGRCRLSGHNSRSCK---------GPVKRR---------SRKGSSS--------TFMGFNIPCQNIGTQESA
        + D +   +S  + R     +CG+C   GHN RSCK         G  + R         S KG+           FMGF IP   +  +  A
Subjt:  QDDDEAGTSSTNISR---KFRCGRCRLSGHNSRSCK---------GPVKRR---------SRKGSSS--------TFMGFNIPCQNIGTQESA

A0A2N9HT52 SWIM-type domain-containing protein1.2e-14535.08Show/hide
Query:  EVFHINFHYNGRM-VFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMIS-LVPESRLIHIYVEH
        + F +  HYNG M        YIGG   V   ++ D  S+ E+  M  +L V+N  +FYYV PG  LS G + L TD+ +++M   L P +  + +YVEH
Subjt:  EVFHINFHYNGRM-VFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMIS-LVPESRLIHIYVEH

Query:  NPNIEVLDLTEPIVMERPIFL-EWRPIGQQNPLQNQIEFDD------FVEVDVIGESSLSESGSSSLSSESVASLNINVN--------------------
               +   P+ M++PI + ++      + +  ++  D        + + ++ +  +S    S  S   +AS + ++                     
Subjt:  NPNIEVLDLTEPIVMERPIFL-EWRPIGQQNPLQNQIEFDD------FVEVDVIGESSLSESGSSSLSSESVASLNINVN--------------------

Query:  ----------VEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKNDKTYI---------
                  +++  + SD GS   LN+   SD+E     P+FR  TD++   F VGM F     LK+A+  Y ++    ++F KN++  +         
Subjt:  ----------VEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKAHNIKFIKNDKTYI---------

Query:  -------GD-------------HTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDY
               GD             HTCSR F +R ++S WVAK Y++ FR  P+  L E+I +V KD +V +SRS+AYRA++ A   I GS  EQY+++ DY
Subjt:  -------GD-------------HTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDY

Query:  AAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLL
          E+ R+N G+T K+      +     F+R YVC   CK+GFLAGCRP I LDACHLKG   GQL+AA+GVDG++ +YPIA+AV EAE+ ++WTWFL  L
Subjt:  AAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLL

Query:  QTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRH
          DIG     G+ F+SDQQKGLVP  ++      HRFCVRHL+ NF+K  KG  LK L W AAR + +++F+  M E+  +S  A+  L G  +  W+RH
Subjt:  QTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRH

Query:  AFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVESGTSQYVVD
        AFP  PKCD LLNNL E+FN+ ++ AR KPII + ETIR+ +M RI K  + + +    I  RI +KL+K    S+   P W G  ++EV S   +Y+VD
Subjt:  AFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVESGTSQYVVD

Query:  LDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDEAGTSSTNI
        +  ++C C  W ++GIPC HAV+ I Y     ED+VDD ++KK  +  YSH + P NGPD WP    +P+LPP+ RR PGRP++  ++ D +   +S  +
Subjt:  LDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDEAGTSSTNI

Query:  SR---KFRCGRCRLSGHNSRSCK---------GPVKRR---------SRKGSSS--------TFMGFNIPCQNIGTQESATNPSTPHGTPSATNTSVAGT
         R     +CG+C   GHN RSCK         G  + R         S KG+           FMGF IP   +  +  A        +        A  
Subjt:  SR---KFRCGRCRLSGHNSRSCK---------GPVKRR---------SRKGSSS--------TFMGFNIPCQNIGTQESATNPSTPHGTPSATNTSVAGT

Query:  PFAATSSARTPFAAATPSA---GTPFAATTPSAGT
            +   R P   ++  +   G+P A  + +  T
Subjt:  PFAATSSARTPFAAATPSA---GTPFAATTPSAGT

A0A2N9I0N2 SWIM-type domain-containing protein6.4e-14434.66Show/hide
Query:  EVFHINFHYNGRM-VFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMIS-LVPESRLIHIYVEH
        + F +  HYNG M        YIGG   V   ++ D  S+ E+  M  +L V+N  +FYYV PG  LS G + L TD+ +++M   L P +  + +YVEH
Subjt:  EVFHINFHYNGRM-VFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMIS-LVPESRLIHIYVEH

Query:  NPNIEVLDLTEPIVMERPIFLE--------------------------------WRPIGQQNPL-----QNQIEFDDFVEVDVIGESSL-----------
               +   P+ M++PI +                                    +   N +      + + F+   +V +  + S            
Subjt:  NPNIEVLDLTEPIVMERPIFLE--------------------------------WRPIGQQNPL-----QNQIEFDDFVEVDVIGESSL-----------

Query:  -----------SESGSSSLSSESVASLNINVNVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKA
                   S+       +     ++I+  +++  + SD GS   LN+   SD+E     P+FR  TD++   F +GM F     LK+A+  Y ++  
Subjt:  -----------SESGSSSLSSESVASLNINVNVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVGMRFTSPKVLKDAIKEYAVRKA

Query:  HNIKFIKNDKTYIGD----HTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAE
          ++    +   I      HTCSR F +R ++S WVAK Y++ FR  P+  L E+I +V KD +V +SRS+AYRA++ A   I GS  EQY+++ DY  E
Subjt:  HNIKFIKNDKTYIGD----HTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAE

Query:  IRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTD
        + R+N G+T K+      +     F+R YVC   CK+GFLAGCRP I LDACHLKG   GQL+AA+GVDG++ +YPIA+AV EAE+ ++WTWFL  L  D
Subjt:  IRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTD

Query:  IGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFP
        IG     G+ F+SDQQKGLVP  ++      HRFCVRHL+ NF+K  KG  LK L W AAR + +++F+  M E+  +S  A+  L G  +  W+RHAFP
Subjt:  IGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFP

Query:  CAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVESGTSQYVVDLDV
          PKCD LLNNL E+FN+ ++ AR KPII + ETIR+ +M RI K  + + +    I  RI +KL+K    S+   P W G  ++EV S   +Y+VD+  
Subjt:  CAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVESGTSQYVVDLDV

Query:  RTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDEAGTSSTNISR-
        ++C C  W ++GIPC HAV+ I Y     ED+VDD ++KK  +  YSH + P NGPD WP    +P+LPP+ RR PGRP++  ++ D +   +S  + R 
Subjt:  RTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDEAGTSSTNISR-

Query:  --KFRCGRCRLSGHNSRSCK---------GPVKRR---------SRKGSSS--------TFMGFNIPCQNIGTQESATNPSTPHGTPSATNTSVAGTPFA
            +CG+C   GHN RSCK         G  + R         S KG+           FMGF IP   +  +  A        +        A     
Subjt:  --KFRCGRCRLSGHNSRSCK---------GPVKRR---------SRKGSSS--------TFMGFNIPCQNIGTQESATNPSTPHGTPSATNTSVAGTPFA

Query:  ATSSARTPFAAATPSA---GTPFAATTPSAGT
         +   R P   ++  +   G+P A  + +  T
Subjt:  ATSSARTPFAAATPSA---GTPFAATTPSAGT

M5X0G1 ZnF_PMZ domain-containing protein (Fragment)1.2e-13935.35Show/hide
Query:  INFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHNPNI-E
        I F Y G  V  P  +YI G       +D DR+S +E++ M+ +  V N + FYY  PG+  + G   L  D  ++  +  +P +R + +YVEH+ ++ E
Subjt:  INFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHNPNI-E

Query:  VL-DLTEPIVMERPI--------------------------------FLEWRPIGQQNPLQ---------NQIEFD----------DFVEVDVIGESSLS
        V+ DLT+P + +  I                                F E R        Q         N+ E+D          ++ E D  G +S S
Subjt:  VL-DLTEPIVMERPI--------------------------------FLEWRPIGQQNPLQ---------NQIEFD----------DFVEVDVIGESSLS

Query:  E---------SGSSSLSSESVASL------------NINVNVEEEEVESDIGSYSN--LNTSIGSDEEEAVRG-PEFRVETDMNQVEFRVGMRFTSPKVL
        E             +   + V +L            N   +++ E++  D   Y++  L+++  SDEE   R   E  ++TDM+  +F VGM+F S KVL
Subjt:  E---------SGSSSLSSESVASL------------NINVNVEEEEVESDIGSYSN--LNTSIGSDEEEAVRG-PEFRVETDMNQVEFRVGMRFTSPKVL

Query:  KDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDF
        K AI+ Y   +++  K +KND                             K+Y   H+CS+ F N+N+ S ++++ Y+ R +  P      +  EV ++ 
Subjt:  KDAIKEYAVRKAHNIKFIKND-----------------------------KTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDF

Query:  QVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLM
           +S  Q Y+A+RKA   I G+  +QY+KLW+Y  EIR++N GS++ ++ D       P F+R YVC  ACK+GF+AGCRP I +DACHLKG  QGQL+
Subjt:  QVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLM

Query:  AAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCN
         ++G+D +D++YPIA+AV E E+KDSW WFL LL  D+G  S  G+TFISDQQKGL   FE   P  SHR+CVRHLYGNF+++FKG ALK   W AA  +
Subjt:  AAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCN

Query:  NMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILK
        N  DFN  M ++  L   AY +L       W+RH F    KCD LLNNL E FN+ +L AR+KPI+ + E IR  IM R+  K + I++ +  I  +I  
Subjt:  NMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILK

Query:  KLEKSIRVSRNAIPLWAGNGKFEV-ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNT
        KLEK+  +S    P++ GNG F+V ++   Q+ V+L  RTC+C  W+++GIP  HAV  I++ +++ E FVD  Y  +A +  YS  + P+     WP  
Subjt:  KLEKSIRVSRNAIPLWAGNGKFEV-ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNT

Query:  TYEPILPPVIRRPPGRPRKQR--KQDDDEAGTSSTNISRK---FRCGRCRLSGHNSRSCK
           P+LPP++++ PGRP+K+R  K  + E     T + +K     C +C   GHN  +CK
Subjt:  TYEPILPPVIRRPPGRPRKQR--KQDDDEAGTSSTNISRK---FRCGRCRLSGHNSRSCK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase1.5e-2323.55Show/hide
Query:  GDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRI-------SRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTV
        G H C  E  N + ++  +        R QP  S AE+ +  +K F   +       S      A+ KA +R  G  ++ ++ +    + +  SN G  V
Subjt:  GDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRI-------SRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTV

Query:  KIQCDRATED-SRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGY
          Q D  T D     FR  +  F    +GF   CRP I +D  +L G  + +LM A   D  +  +P+A+AV +  + DSW WFL  ++  +     +G 
Subjt:  KIQCDRATED-SRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGY

Query:  TFISDQQKGLVPTFEE-----FFPTVSHRFCVRHLYGNFQKEFKGLALKTLFW--KAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCA
          IS     ++    E       P   HRFC+ HL         G      F   +A   +  ++F+  M EI   +  A+++L   P   W+       
Subjt:  TFISDQQKGLVPTFEE-----FFPTVSHRFCVRHLYGNFQKEFKGLALKTLFW--KAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCA

Query:  PKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKL-----IMIRITKKHE------KINRMDKEIGN----RILKKLEK-----------SIRVSRNAI
             L ++    +   ++    + +  + +  RK+     +M+   +  +      K++R   + G+     +++KLE+              + R+A 
Subjt:  PKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKL-----IMIRITKKHE------KINRMDKEIGN----RILKKLEK-----------SIRVSRNAI

Query:  PLWAGNGK----FEVESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEP-ILPPV
         +     K        + ++  +V L+  TCTCG +Q +  PC HA+     +K  P  +VDD Y  +    TYS    P+     WP     P ++PPV
Subjt:  PLWAGNGK----FEVESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEP-ILPPV

Query:  IRRPPGRPRKQRKQDDDE
        I  PP +   + K+ D E
Subjt:  IRRPPGRPRKQRKQDDDE

AT1G64255.1 MuDR family transposase2.0e-2825.1Show/hide
Query:  YIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCD
        Y G HTC       +  S +         R  P  +++E+ +  KK     +  S    A+ KA +R+ G  ++ ++      + +  SN G  V  + D
Subjt:  YIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCD

Query:  RATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQ
             +   F   +  F    EGF   CRP I +D  +L    Q +LM A GVD  +  +P+A+AV +  + D W WFL  ++  +     KG   IS  
Subjt:  RATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQ

Query:  QKGLVPTFEEF-----FPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWS-------RHA-FPCA
           ++    E       P   HRF + H Y  F + F    L     +A   +   +F   M +I   +  A ++L   P   W+       R+      
Subjt:  QKGLVPTFEEF-----FPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWS-------RHA-FPCA

Query:  PKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLE--KSIRVSRNAIPLWAGNGKFEVESG--TSQYVVDL
         K  F + N  E    +V  +    ++ LF+ +R       +     +N  D      ++ KLE  ++  V+ + I     N  F+V +     + +V L
Subjt:  PKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLE--KSIRVSRNAIPLWAGNGKFEVESG--TSQYVVDL

Query:  DVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEP-ILPPVIRRPPGRP
           +CTCG +Q    PC HA+     +K  P  +VDD Y  + L  TY+     +     WP  +  P +LPPVI  PP  P
Subjt:  DVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEP-ILPPVIRRPPGRP

AT1G64260.1 MuDR family transposase4.1e-3423.94Show/hide
Query:  IGQQNPLQNQIEF-------DDFVEVDVIGESSLSESGSSSLSSESVASLNINVNVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFR
        + + +P  N +EF        D V V V   S L ES  SS   + +         ++E V+++ GS   L   + S   +    P   ++ D    +  
Subjt:  IGQQNPLQNQIEF-------DDFVEVDVIGESSLSESGSSSLSSESVASLNINVNVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFR

Query:  VGMRFTSPKVLKDAIKEYAVRKAHNIKFIKNDK------------------------------TYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWS
        +G+ F     LK A+  + +R+  N    + +K                               Y G HTCS E+ N +  S + A       R QP  S
Subjt:  VGMRFTSPKVLKDAIKEYAVRKAHNIKFIKNDK------------------------------TYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWS

Query:  LAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDA
        +AE+ +  K+     +  S+    + +  +R+ G  ++ ++ +    +    SN G  V  Q D         FR  +  F    EGF   CRP I +D 
Subjt:  LAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDA

Query:  CHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEE-----FFPTVSHRFCVRHLYGNFQKE
          L G  Q +LM A GVD  +  +P+A+AV +  + DSW WF   ++  +     K    IS   + +V    E       P   H+FC+ HL   F   
Subjt:  CHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGCFSTKGYTFISDQQKGLVPTFEE-----FFPTVSHRFCVRHLYGNFQKE

Query:  FKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKP---------IIQLFETIRK
        F+   L++L  +A   N  ++F+  M +I   +  A+++L  IP     RH +  A         +     AL    R  P         ++ +F+ +R 
Subjt:  FKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLSESFNALVLPARNKP---------IIQLFETIRK

Query:  LIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEV--ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDD
             ++  +  +NR         + KLE+ +  S   +        F+V   S   +++V L+V TCTC  +Q    PC HA+     +K  P  +VD+
Subjt:  LIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEV--ESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIYYVKKKPEDFVDD

Query:  VYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPP
         Y  +    TY+    P+     WP     P L P
Subjt:  VYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAGGTGTTTCATATCAATTTTCATTATAATGGTCGTATGGTATTCCTGCCGAGTCTTAATTACATTGGTGGTAGGCAAGCAGTTATTCAAGGGATAGATTGTGA
TAGGGTCTCGTTGATTGAGTTACGCATGATGCTGACTGATCTTAATGTATTCAATGTTGAGGAGTTTTATTATGTTAGGCCTGGATATACACTGTCTACTGGGTTCAAGG
TTTTGTTAACTGATGCTGATGTTGTACAAATGATTAGTTTAGTTCCTGAAAGTAGACTGATCCACATTTATGTTGAACATAATCCTAACATTGAAGTTTTAGACCTAACT
GAACCGATTGTAATGGAAAGACCAATTTTTTTGGAGTGGCGACCTATTGGTCAGCAAAACCCGCTTCAAAACCAAATAGAATTTGATGATTTTGTTGAGGTAGATGTGAT
TGGAGAGTCTTCGCTAAGTGAGAGTGGAAGTTCTAGTCTGTCTAGTGAGTCCGTTGCTTCACTGAATATCAATGTCAATGTTGAAGAAGAGGAAGTAGAGTCAGATATTG
GGTCATATTCAAATCTCAATACATCTATAGGGTCTGACGAAGAAGAGGCTGTTAGGGGTCCTGAATTTAGGGTCGAGACGGATATGAACCAGGTAGAATTTAGGGTAGGG
ATGAGGTTTACCAGCCCGAAGGTCTTAAAAGATGCTATAAAAGAATATGCTGTAAGAAAGGCTCACAATATAAAATTCATAAAGAATGATAAGACATACATTGGGGACCA
TACATGCAGTAGGGAATTTGAAAACAGGAACATGAACTCCAATTGGGTTGCCAAGATGTATTTGGAACGGTTTAGGAGTCAACCTAATTGGTCTCTTGCTGAAATTATTG
AGGAGGTGAAGAAGGACTTTCAAGTAAGAATTTCTCGAAGTCAAGCCTATAGGGCAAGGCGTAAGGCAAACAGAAGGATCAACGGTAGCCTAAATGAACAATATAAGAAA
CTTTGGGATTATGCTGCAGAGATTAGACGATCCAATGTTGGATCAACTGTGAAAATTCAATGTGATAGGGCTACTGAAGATTCTCGCCCGGTGTTTAGGAGGTGTTATGT
CTGTTTTAAGGCTTGCAAGGAAGGGTTCTTGGCGGGTTGCAGGCCATTTATTGCCTTAGATGCTTGCCATCTAAAAGGTCCTTGTCAAGGACAATTAATGGCTGCAATTG
GTGTTGACGGGGACGACAGTTTGTATCCTATTGCGTGGGCAGTGGTTGAGGCCGAGACGAAGGATAGTTGGACATGGTTCCTGGTTTTGCTACAGACAGACATCGGTTGT
TTCTCGACTAAAGGATACACTTTCATCTCAGATCAACAAAAGGGTTTAGTTCCGACATTTGAGGAGTTTTTTCCAACTGTATCTCACCGTTTCTGTGTGCGACATCTATA
CGGAAATTTTCAAAAAGAATTTAAAGGGTTGGCTTTGAAGACTTTGTTCTGGAAGGCTGCTAGGTGTAACAATATGAAAGATTTTAATGATGCTATGACTGAAATTGGTA
GATTGAGTGTGGGGGCTTACCAATATTTGAGTGGTATTCCTATCAAGTATTGGTCTAGGCATGCATTTCCATGTGCTCCGAAGTGTGATTTTTTGCTTAATAATTTGTCA
GAGTCATTTAATGCACTTGTATTGCCAGCAAGAAACAAGCCCATAATTCAATTGTTTGAAACCATTAGGAAATTGATAATGATTAGGATTACAAAGAAGCATGAAAAAAT
TAATAGGATGGATAAGGAGATTGGAAATAGAATTCTGAAGAAGCTTGAGAAGAGTATAAGGGTGTCTAGGAATGCAATACCTTTGTGGGCTGGAAATGGGAAATTTGAGG
TGGAGTCAGGTACATCTCAGTATGTTGTAGACCTTGATGTGCGAACTTGTACCTGTGGGTTGTGGCAAATAAGTGGGATCCCTTGTCCACATGCTGTGCAATGTATATAT
TATGTGAAAAAGAAGCCCGAAGATTTTGTTGATGATGTATACAGAAAAAAAGCACTAATGGACACATACTCCCATTTTCTTCTTCCTATGAATGGTCCTGACCTTTGGCC
GAATACAACCTACGAGCCCATTCTCCCACCAGTTATCCGAAGACCACCTGGCCGACCAAGAAAACAGAGGAAACAAGATGATGATGAAGCTGGGACTTCGAGTACAAACA
TATCTAGAAAGTTTAGATGTGGTAGATGCAGGTTGTCCGGACACAACTCCAGATCTTGCAAAGGTCCAGTGAAGAGGAGATCAAGGAAGGGATCAAGTTCTACATTCATG
GGATTCAATATTCCATGTCAAAATATTGGGACTCAAGAGTCTGCTACAAACCCATCTACTCCACATGGAACTCCATCTGCTACAAATACGTCTGTTGCTGGGACTCCATT
TGCTGCAACTTCATCTGCTAGGACTCCATTTGCTGCTGCAACTCCATCTGCTGGTACACCATTTGCTGCTACAACTCCATCTGCTGGGACTCAAAGAAAAGGAAAGAAGC
GTGTGTTAAAGAAGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAGGTGTTTCATATCAATTTTCATTATAATGGTCGTATGGTATTCCTGCCGAGTCTTAATTACATTGGTGGTAGGCAAGCAGTTATTCAAGGGATAGATTGTGA
TAGGGTCTCGTTGATTGAGTTACGCATGATGCTGACTGATCTTAATGTATTCAATGTTGAGGAGTTTTATTATGTTAGGCCTGGATATACACTGTCTACTGGGTTCAAGG
TTTTGTTAACTGATGCTGATGTTGTACAAATGATTAGTTTAGTTCCTGAAAGTAGACTGATCCACATTTATGTTGAACATAATCCTAACATTGAAGTTTTAGACCTAACT
GAACCGATTGTAATGGAAAGACCAATTTTTTTGGAGTGGCGACCTATTGGTCAGCAAAACCCGCTTCAAAACCAAATAGAATTTGATGATTTTGTTGAGGTAGATGTGAT
TGGAGAGTCTTCGCTAAGTGAGAGTGGAAGTTCTAGTCTGTCTAGTGAGTCCGTTGCTTCACTGAATATCAATGTCAATGTTGAAGAAGAGGAAGTAGAGTCAGATATTG
GGTCATATTCAAATCTCAATACATCTATAGGGTCTGACGAAGAAGAGGCTGTTAGGGGTCCTGAATTTAGGGTCGAGACGGATATGAACCAGGTAGAATTTAGGGTAGGG
ATGAGGTTTACCAGCCCGAAGGTCTTAAAAGATGCTATAAAAGAATATGCTGTAAGAAAGGCTCACAATATAAAATTCATAAAGAATGATAAGACATACATTGGGGACCA
TACATGCAGTAGGGAATTTGAAAACAGGAACATGAACTCCAATTGGGTTGCCAAGATGTATTTGGAACGGTTTAGGAGTCAACCTAATTGGTCTCTTGCTGAAATTATTG
AGGAGGTGAAGAAGGACTTTCAAGTAAGAATTTCTCGAAGTCAAGCCTATAGGGCAAGGCGTAAGGCAAACAGAAGGATCAACGGTAGCCTAAATGAACAATATAAGAAA
CTTTGGGATTATGCTGCAGAGATTAGACGATCCAATGTTGGATCAACTGTGAAAATTCAATGTGATAGGGCTACTGAAGATTCTCGCCCGGTGTTTAGGAGGTGTTATGT
CTGTTTTAAGGCTTGCAAGGAAGGGTTCTTGGCGGGTTGCAGGCCATTTATTGCCTTAGATGCTTGCCATCTAAAAGGTCCTTGTCAAGGACAATTAATGGCTGCAATTG
GTGTTGACGGGGACGACAGTTTGTATCCTATTGCGTGGGCAGTGGTTGAGGCCGAGACGAAGGATAGTTGGACATGGTTCCTGGTTTTGCTACAGACAGACATCGGTTGT
TTCTCGACTAAAGGATACACTTTCATCTCAGATCAACAAAAGGGTTTAGTTCCGACATTTGAGGAGTTTTTTCCAACTGTATCTCACCGTTTCTGTGTGCGACATCTATA
CGGAAATTTTCAAAAAGAATTTAAAGGGTTGGCTTTGAAGACTTTGTTCTGGAAGGCTGCTAGGTGTAACAATATGAAAGATTTTAATGATGCTATGACTGAAATTGGTA
GATTGAGTGTGGGGGCTTACCAATATTTGAGTGGTATTCCTATCAAGTATTGGTCTAGGCATGCATTTCCATGTGCTCCGAAGTGTGATTTTTTGCTTAATAATTTGTCA
GAGTCATTTAATGCACTTGTATTGCCAGCAAGAAACAAGCCCATAATTCAATTGTTTGAAACCATTAGGAAATTGATAATGATTAGGATTACAAAGAAGCATGAAAAAAT
TAATAGGATGGATAAGGAGATTGGAAATAGAATTCTGAAGAAGCTTGAGAAGAGTATAAGGGTGTCTAGGAATGCAATACCTTTGTGGGCTGGAAATGGGAAATTTGAGG
TGGAGTCAGGTACATCTCAGTATGTTGTAGACCTTGATGTGCGAACTTGTACCTGTGGGTTGTGGCAAATAAGTGGGATCCCTTGTCCACATGCTGTGCAATGTATATAT
TATGTGAAAAAGAAGCCCGAAGATTTTGTTGATGATGTATACAGAAAAAAAGCACTAATGGACACATACTCCCATTTTCTTCTTCCTATGAATGGTCCTGACCTTTGGCC
GAATACAACCTACGAGCCCATTCTCCCACCAGTTATCCGAAGACCACCTGGCCGACCAAGAAAACAGAGGAAACAAGATGATGATGAAGCTGGGACTTCGAGTACAAACA
TATCTAGAAAGTTTAGATGTGGTAGATGCAGGTTGTCCGGACACAACTCCAGATCTTGCAAAGGTCCAGTGAAGAGGAGATCAAGGAAGGGATCAAGTTCTACATTCATG
GGATTCAATATTCCATGTCAAAATATTGGGACTCAAGAGTCTGCTACAAACCCATCTACTCCACATGGAACTCCATCTGCTACAAATACGTCTGTTGCTGGGACTCCATT
TGCTGCAACTTCATCTGCTAGGACTCCATTTGCTGCTGCAACTCCATCTGCTGGTACACCATTTGCTGCTACAACTCCATCTGCTGGGACTCAAAGAAAAGGAAAGAAGC
GTGTGTTAAAGAAGATGTGA
Protein sequenceShow/hide protein sequence
MDEVFHINFHYNGRMVFLPSLNYIGGRQAVIQGIDCDRVSLIELRMMLTDLNVFNVEEFYYVRPGYTLSTGFKVLLTDADVVQMISLVPESRLIHIYVEHNPNIEVLDLT
EPIVMERPIFLEWRPIGQQNPLQNQIEFDDFVEVDVIGESSLSESGSSSLSSESVASLNINVNVEEEEVESDIGSYSNLNTSIGSDEEEAVRGPEFRVETDMNQVEFRVG
MRFTSPKVLKDAIKEYAVRKAHNIKFIKNDKTYIGDHTCSREFENRNMNSNWVAKMYLERFRSQPNWSLAEIIEEVKKDFQVRISRSQAYRARRKANRRINGSLNEQYKK
LWDYAAEIRRSNVGSTVKIQCDRATEDSRPVFRRCYVCFKACKEGFLAGCRPFIALDACHLKGPCQGQLMAAIGVDGDDSLYPIAWAVVEAETKDSWTWFLVLLQTDIGC
FSTKGYTFISDQQKGLVPTFEEFFPTVSHRFCVRHLYGNFQKEFKGLALKTLFWKAARCNNMKDFNDAMTEIGRLSVGAYQYLSGIPIKYWSRHAFPCAPKCDFLLNNLS
ESFNALVLPARNKPIIQLFETIRKLIMIRITKKHEKINRMDKEIGNRILKKLEKSIRVSRNAIPLWAGNGKFEVESGTSQYVVDLDVRTCTCGLWQISGIPCPHAVQCIY
YVKKKPEDFVDDVYRKKALMDTYSHFLLPMNGPDLWPNTTYEPILPPVIRRPPGRPRKQRKQDDDEAGTSSTNISRKFRCGRCRLSGHNSRSCKGPVKRRSRKGSSSTFM
GFNIPCQNIGTQESATNPSTPHGTPSATNTSVAGTPFAATSSARTPFAAATPSAGTPFAATTPSAGTQRKGKKRVLKKM