; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G018860 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G018860
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionCysteine/Histidine-rich C1 domain family protein
Genome locationCG_Chr05:31118644..31151516
RNA-Seq ExpressionClCG05G018860
SyntenyClCG05G018860
Gene Ontology termsGO:0009611 - response to wounding (biological process)
GO:0031347 - regulation of defense response (biological process)
GO:0035556 - intracellular signal transduction (biological process)
GO:2000022 - regulation of jasmonic acid mediated signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030246 - carbohydrate binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001229 - Jacalin-like lectin domain
IPR001841 - Zinc finger, RING-type
IPR001965 - Zinc finger, PHD-type
IPR002219 - Protein kinase C-like, phorbol ester/diacylglycerol-binding domain
IPR004146 - DC1
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR036404 - Jacalin-like lectin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041587.1 putative Cysteine/Histidine-rich C1 domain family protein [Cucumis melo var. makuwa]0.0e+0087.3Show/hide
Query:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC
        MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAF+C + DCNFHIHQSCLHLP QIHSPFHPFHPLL KTNN+FCT CWQMPSGDVYRCRKC
Subjt:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC

Query:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS
        NFQIDIKCVL DTKSSGLRR SGDQFRHFSHPHPLTLQLEENR  NRVVCFVCDLLIKS PSYFCSQCDTHFHQ+CAELPRE YD+ FHQHPLFLLPNLS
Subjt:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS

Query:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL
        FANFLCDSCNNNCRKFVY+CPHPR C+FNLHVACLQSFNH+HNF  FRNAMDSFDCR+CGKKG+GFPWFCEICHVLAHRKCAKSP  LRT GHH HDL L
Subjt:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL

Query:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG
        TYFR+   N+ RYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQR D QSTP+VDSTMD+ S+ NDE DNEIQCSVHSHNLN  L   I+ KGDRICDG
Subjt:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG

Query:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE
        C+KGLLS SYGCQQCDF+VHKECAKLPK KTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCK CLSTFDIRCTSIKIPFKHPGH HPLSLDRTNEE
Subjt:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE

Query:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH
        H+CE CGEGVKNRVSFRCV CNFYLDAKCATLPLTVRYR + HPLNLTFVEEEE DEYYCDVCEEERE W+W YSCR C F  HLGCVLGEFPFVKSKIH
Subjt:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH

Query:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP
        EAH+HPLSMVMKGKE+      NCGSC EWCGENLAFQCGTCKFNIHAIGRCY QQLKQGKLAYT   FYSRGVELYEQPT Y P+RV  RL+GGKGGNP
Subjt:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP

Query:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE
        WEEKVFSTIR+F+VYH+ CVHAIQIYYEKNGKAVWS KHGGDGGTKYE
Subjt:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE

KAG7024193.1 hypothetical protein SDJN02_13007, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0049.69Show/hide
Query:  MEKLKLLEQPHPHPLTYIEEKTDSREDMFCCVCKKALYPPAFICSPCKFYIHQSCIYFPSRIRSRFHPQHSLSLTETNSDHCHCCWQMPRDCFYTCTLQP
        MEK+KLLE+PHPHPL +IEE+TDSREDMFCCVCKK L+PPAFICS CKFYIHQSCI+ P +I SRFHP H LSLTETNSDHCHCCWQMPRDCFY CT   
Subjt:  MEKLKLLEQPHPHPLTYIEEKTDSREDMFCCVCKKALYPPAFICSPCKFYIHQSCIYFPSRIRSRFHPQHSLSLTETNSDHCHCCWQMPRDCFYTCTLQP

Query:  HCRFIIDIKCTLADTRSLGLNSIGKHFSHSHPLFLEKELATRKLVVCHLCGMLIVSGPAYFCSKCNIRFHKGCAELPQEILQLNQHHHPLFLFPHAKPHS
        HC+FI+DIKC LADT+  GLN +GKHFSHSHPL L  E+ T KLV CH+CG+L+V GP YFCSKC+IRFHK CAELPQEIL+ ++HHHPLFL+PHA    
Subjt:  HCRFIIDIKCTLADTRSLGLNSIGKHFSHSHPLFLEKELATRKLVVCHLCGMLIVSGPAYFCSKCNIRFHKGCAELPQEILQLNQHHHPLFLFPHAKPHS

Query:  FCNSCKNQCLQFVYSCVQCNFNLHVTCRAFSNHKHNFTRLRTIISFQCLLCGWWGRDFPWFCSICHLLAHEKCAELPPSLLVVGHDCPLSFTYIHPFGNQ
        FCN+CKNQC +FVYSCV+C+FNLHV C A S+HKH+F R RTII+F+CLLCGW G  FPWFCSICHLLAHE CAELPPSLLVVGHDCPL+FTY HPF + 
Subjt:  FCNSCKNQCLQFVYSCVQCNFNLHVTCRAFSNHKHNFTRLRTIISFQCLLCGWWGRDFPWFCSICHLLAHEKCAELPPSLLVVGHDCPLSFTYIHPFGNQ

Query:  SKLACDICRKKVEPQFAAYSCSKCMYVVHLNCAGKKYLRGLQQHDGRLYTGGSGRLQTSSQTYINPLGDEDATSNNFSKKMFEEVGSIEILHSNCEKELI
        SKLAC+ICRKKVEPQFAAYSCS+C YVVHL+CAGKKY+RG  + D                  INPLGDE   SNN SKK FEEVGS+EILHSNCE++LI
Subjt:  SKLACDICRKKVEPQFAAYSCSKCMYVVHLNCAGKKYLRGLQQHDGRLYTGGSGRLQTSSQTYINPLGDEDATSNNFSKKMFEEVGSIEILHSNCEKELI

Query:  LCKEEGDNDKQCHGCMQSFSVTKPSYSYSCVKCGFFLHKHCADFPITKRHPLHKHPLTLIATQNVAFQCHACLQFCHGYAYHCEECLYTLDIRCVLIKTK
        LCKEEGD DKQC  CMQ FSV KPSYSYSCVKCGFFLHK CAD P+TKRHPLHKHPLTLI T++VAFQCHACLQFCHG+AYHCEECLYTLDIRCVLI+T+
Subjt:  LCKEEGDNDKQCHGCMQSFSVTKPSYSYSCVKCGFFLHKHCADFPITKRHPLHKHPLTLIATQNVAFQCHACLQFCHGYAYHCEECLYTLDIRCVLIKTK

Query:  KLKHPSHQHLLSLAQNHEDQKCRGCGQSNKTVFECDEGCNNFSLDYRCATLPQKARCKFDGSLLDLTFSVEDETGEYYCDVCEEERNPAMCFYCCKTCRL
        KLKHPSHQHLLSLAQNHED+ C GCG+SN+ VFECD+GCNNFSLDYRCATLPQKARC+FDG L+DL+FSVED+TGEYYCDVCEEERNP +CFY CKTCRL
Subjt:  KLKHPSHQHLLSLAQNHEDQKCRGCGQSNKTVFECDEGCNNFSLDYRCATLPQKARCKFDGSLLDLTFSVEDETGEYYCDVCEEERNPAMCFYCCKTCRL

Query:  AAHPECILGEYPWLKYGSYKTHKHLLALVTEGKMDYSDCDHCGKPCVGNLAYECRRCKFNYFAV--------------------------------HSLQ
        AAHPECILGEYPWLKYGSY+THKHLL+LV EG+ DYSDC+HCGKPC GNLAYECRRCKFN  A+                                 +LQ
Subjt:  AAHPECILGEYPWLKYGSYKTHKHLLALVTEGKMDYSDCDHCGKPCVGNLAYECRRCKFNYFAV--------------------------------HSLQ

Query:  RMEDE-----VVVKIGIRGCEEGARRWDDGAHSTVRQIVINHEKCIYSVNIEYDNNGESIWKPKHGGNKGSISK---------HFSLGEYGGEGGEPWSE
        RM+DE       VKI I GC+EG   WDDGA+ST+R+++I H   I S+++EYD NG SIW  KHGGN+GS+S+           S+  Y G  G  +  
Subjt:  RMEDE-----VVVKIGIRGCEEGARRWDDGAHSTVRQIVINHEKCIYSVNIEYDNNGESIWKPKHGGNKGSISK---------HFSLGEYGGEGGEPWSE

Query:  TFQAIKQLVIHSDEHWIVSIQMEYVNENGDFIMEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPF
            I+ L++ ++        ME   +    IM   +V    +   F +  G  I           QP+                    + L +    PF
Subjt:  TFQAIKQLVIHSDEHWIVSIQMEYVNENGDFIMEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPF

Query:  -----HPFHPLLRKTNNHFCTACWQMPSGDVYRCRKCNFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRGNRVVCFVCDLLIKSAPSYF
             HP+  + R                        N +  I  +  + +    +                 L   +  G+R      +++++S   +F
Subjt:  -----HPFHPLLRKTNNHFCTACWQMPSGDVYRCRKCNFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRGNRVVCFVCDLLIKSAPSYF

Query:  CSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLSFANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGD
         S    + H       R + D      P  ++ +L+FA         N R +           F        SF            M +    V G+ G 
Subjt:  CSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLSFANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGD

Query:  GFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGLTYFRDNKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQND
           W+ +                          +GL Y    + +Y    G+  E  F                   +RL  +    +DS          
Subjt:  GFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGLTYFRDNKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQND

Query:  EQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDGCMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKK
             IQ      N N     +  G G            SPS                  +    F  +HL+++                 HG       
Subjt:  EQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDGCMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKK

Query:  CLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEEHSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTV--RYRLEEHPLNLTFVEEEEGDEYYCDVCEE
                                                               +Y D +   L +TV     LE +         E+G ++       
Subjt:  CLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEEHSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTV--RYRLEEHPLNLTFVEEEEGDEYYCDVCEE

Query:  EREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIHEAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYT
                                  +P V SK+   H             SG +    G                    +H I R             +
Subjt:  EREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIHEAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYT

Query:  HTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNPWEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYEEDDGSITWDDGVYSAIR
         +NF  R                           P  + +  TI    +++   +  +Q   E  G   W+P               S TWDDGVYS IR
Subjt:  HTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNPWEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYEEDDGSITWDDGVYSAIR

Query:  RFVVYEREWICSIQIEYDQNGESIWSPKHGENDGSISE-----PEP---------------------------QSKHLNMGPYGGKGGDHWEETFQTIRR
        R VVYEREWICSIQIEYDQNGES WSPKHGE++GS SE     P+                            ++   + GP+G + G+ +     T  +
Subjt:  RFVVYEREWICSIQIEYDQNGESIWSPKHGENDGSISE-----PEP---------------------------QSKHLNMGPYGGKGGDHWEETFQTIRR

Query:  LVIYHG---LWIDSIQMEYEDENQTLLWSEKNGVRPEKFSLGEYGGEGGDPWNESFKTIKQLVINHGMWIDSIQMEYEDENGELVWSKRHGGNGGFQSEV
        +V +HG    ++++I +  +   +  L  E    +P+  +LG+YGG+GGDPW E+F+TI++LVI HG+WIDSIQMEYEDEN  L+WS++HGG+GGF+SEV
Subjt:  LVIYHG---LWIDSIQMEYEDENQTLLWSEKNGVRPEKFSLGEYGGEGGDPWNESFKTIKQLVINHGMWIDSIQMEYEDENGELVWSKRHGGNGGFQSEV

Query:  GPE---------------------------------KFSLGEYGGEGGDPWDENFRTVKQLFGLR----------GMVEMEVLNQRNGIGPEKFSFGKYG
          E                                 K + G +G E G  +   F  +K + G+           G+       Q  G+G EKFS G+ G
Subjt:  GPE---------------------------------KFSLGEYGGEGGDPWDENFRTVKQLFGLR----------GMVEMEVLNQRNGIGPEKFSFGKYG

Query:  GEGGNPWNENFRTIRQLVINHGQWIDSIQMEYENENGELVWSERHGGNGGSQSKCIATLQRMLDEEGSPMTTVKIEIGGGKHGGGPWDDGAYSTIRRLLI
        GEGG PW   FR IRQLVI+HGQWIDSIQMEYE+ENGELVWSE+HGG+GGS+S+ +      LD     + T+           G +DD  Y  +   +I
Subjt:  GEGGNPWNENFRTIRQLVINHGQWIDSIQMEYENENGELVWSERHGGNGGSQSKCIATLQRMLDEEGSPMTTVKIEIGGGKHGGGPWDDGAYSTIRRLLI

Query:  YHKQWICSLHVEYDKNGHSVWGSKRGGNEG--SVSEAKGGIHGSMSSGRSDGLLLIMNNGSTPFNWNMRIRMESWYGPRSMVTQMELPNQSLKTEPEPAA
               SL +E +   +  +G + G      +V     G+HG  S    D + L            + + ++ +Y   + +TQ    +          +
Subjt:  YHKQWICSLHVEYDKNGHSVWGSKRGGNEG--SVSEAKGGIHGSMSSGRSDGLLLIMNNGSTPFNWNMRIRMESWYGPRSMVTQMELPNQSLKTEPEPAA

Query:  APPPQIQVEHIKPRQYGGEGGDGWEDMFRTIKRFVVRHGLWIDSIQIQYEDDNGNLVWSRQHGGDGGSKSENKKEIAMDFDLLNNPHPHPLFFIEEGKHD
        AP                      +  F  I    + H  +   + +Q+                     + K+EIAMDFDLLNNPHPHPLFFIEEGK+D
Subjt:  APPPQIQVEHIKPRQYGGEGGDGWEDMFRTIKRFVVRHGLWIDSIQIQYEDDNGNLVWSRQHGGDGGSKSENKKEIAMDFDLLNNPHPHPLFFIEEGKHD

Query:  EVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSCIDLPPQIHNRFHPQHTLSRTTNNYMCTACSQMPSGDVYRCYGCGFQIDVKCAIADTKATGVRRTIGSE
        EVVFC RCRR+L PPAFSCSDS CNFHIHQSCIDLPP+IHN FHPQH LSRTTNN++C AC QMPSGDVY C  CGFQIDVKCAIADTKA+GVR+T  +E
Subjt:  EVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSCIDLPPQIHNRFHPQHTLSRTTNNYMCTACSQMPSGDVYRCYGCGFQIDVKCAIADTKATGVRRTIGSE

Query:  FRHFSHPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLL-PFSSPQTICNSCKNDCGEFVYNCSLCG
        FRHFSHPHTLTLQ+EQN  TNEIVCVVCGLLIKSGSSYYFCS CDAHFHQQCAELPREMLN DFH+HPLFLL   +  QTICNSCKNDCGEF+YNCS C 
Subjt:  FRHFSHPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLL-PFSSPQTICNSCKNDCGEFVYNCSLCG

Query:  FNLHIACLQSFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYA
        FNLH+ACLQSFNH HTF K+RN+ QFVCRACGEKG+GFSWYCTICHLSVHKECAE PLTLR F HRLHDLSLTYFRDG+DFVGNK+DCK CG++I TKYA
Subjt:  FNLHIACLQSFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYA

Query:  AYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEELGQDRVCDGCMKRLSGPSYGCEECDFFVHKECLE
        AYGCYK  C YFVHL CAR Q    N T+D LDS+DDE+ KIE+SGSEIQHFIHHH L  F  EE+L QDRVCDGCMKRLSGPSYGCEEC FF HKECLE
Subjt:  AYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEELGQDRVCDGCMKRLSGPSYGCEECDFFVHKECLE

Query:  LPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKCEGCGEGVKHKVAFRCVDCNFYL
        LPRKKRNFLHQHRLNLISIP+FVFQCKACL +FNGFAYHCK CLS FDTRCASIKIPF+HP HQHPLS DR+N+DH CEGC EGVK+K+AFRCVDCNF+L
Subjt:  LPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKCEGCGEGVKHKVAFRCVDCNFYL

Query:  DAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAHKHTMKLGVKGKEEDCVACGESC
        DAGCATLPLGVRYRFDPHPLDL F+EDEE EEYCCDICEEERE GPWFY CQKCSFAAHLDC VGMFP++KLKKHEAHKHT+KLG+KGKEEDCVACGESC
Subjt:  DAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAHKHTMKLGVKGKEEDCVACGESC

Query:  AEDLAYECISNCKFKVHAIGLCYHRQLVQGSLAFTNRK----------------------------GGDAWEEKAFTRIRAFLIWHKEWIYSFQIHYEKN
        AE+LAYECISNCKFKVHAIGLCYHRQ+VQGSLAFTNR                             GG AW EK FT+IRAF I H+E IYS QI YEK+
Subjt:  AEDLAYECISNCKFKVHAIGLCYHRQLVQGSLAFTNRK----------------------------GGDAWEEKAFTRIRAFLIWHKEWIYSFQIHYEKN

Query:  GELIWSMKHGGDGGYRSEIHFD
        G+L WS  HG DGG RSE+ FD
Subjt:  GELIWSMKHGGDGGYRSEIHFD

KGN60121.2 hypothetical protein Csa_000943 [Cucumis sativus]0.0e+0065.04Show/hide
Query:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC
        MEFDLVNRPHQHPLFFNEDGRKINGEVA+CSRCRQPLRPPAF+CS+ DCNFHIHQSCLHLP QIHSPFHP HPL  +TNN+FCT CWQMPSGDVYRCRKC
Subjt:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC

Query:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRG-NRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS
        NFQIDIKCVL DTKSSGLRR SGDQFRHFSHPHPLTLQLEENRG NRVVCFVCDLLIKS PSYFCSQCD HFHQ CAELPRELYD+ FHQHPLFLLPNLS
Subjt:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRG-NRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS

Query:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL
        FANFLCDSCNNNCRKFVY+CPHPR CKFNLHVACLQSFNH+HNF  FRNAMDSFDCRVCGKKG+GFPWFCEICH+LAHRKCAKSPL LRT GHH HDL L
Subjt:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL

Query:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG
        TYFRD   N  RYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQR D QSTP+VD T ++ S+ N+E DNEIQCSVHSHNLN  L   I  KGDRICDG
Subjt:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG

Query:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE
        C+KGLLS SYGCQQCDF+VHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEAC EYFHGFAYHCK CLSTFDIRCTSIKIPFKHPGH HPLSLDRTNEE
Subjt:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE

Query:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH
        H+CE CGEGV+NRVSFRCV CNFYLDAKCATLPLTVRYR + HPLNLTFVEEEE DEYYCDVCEEERE W+W YSCR C F  HLGCVLGEFPFVKSKIH
Subjt:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH

Query:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP
        EAH+HPLSMVMKGKE+      NCGSC EWC ENLAFQCGTCKFN+HAIGRCY QQLKQGKLAYT   FYSRGVELYE+PTIY P+R PLRL+GGKGGN 
Subjt:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP

Query:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYEE--------------------------------------------DDG----
        WEEKVF+T+R+FVVYH++CVHAIQIYYEKNGKA+WS KHGGDGGTKYE                                             +DG    
Subjt:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYEE--------------------------------------------DDG----

Query:  ------------------------------------------------------------SITWDDGVYSAIRRFVVYEREWICSIQIEYDQNGESIWSP
                                                                    S TWDDG YS IRR VVYE+EWICSIQIEYD NGESI S 
Subjt:  ------------------------------------------------------------SITWDDGVYSAIRRFVVYEREWICSIQIEYDQNGESIWSP

Query:  KHGENDGSISE-----PEP---------------------------QSKHLNMGPYGGKGGDHWEETFQTIRRLVIYHGL---WIDSIQMEYEDENQTLL
         HGEN+GS+SE     P+                            +S     GP+G + G +++    T  +++ +HG+   ++++I +      QT+ 
Subjt:  KHGENDGSISE-----PEP---------------------------QSKHLNMGPYGGKGGDHWEETFQTIRRLVIYHGL---WIDSIQMEYEDENQTLL

Query:  WSEKNGVRPE---KFSLGEYGGEGGDPWNESFKTIKQLVINHGMWIDSIQMEYE--DENGELVWSKRHGGNGGFQSEVGPE-------------------
          +K G++PE     ++G+YGG+GG+PW E+F+TIK++ I HG+WIDS Q++YE  DE G LVW++ +GG GGF + V  E                   
Subjt:  WSEKNGVRPE---KFSLGEYGGEGGDPWNESFKTIKQLVINHGMWIDSIQMEYE--DENGELVWSKRHGGNGGFQSEVGPE-------------------

Query:  --------------KFSLGEYGGEGGDPWDENFRTVKQLFGLRGMVEMEV------LNQRNGIGPEKFSFGKYGGEGGNPWNENFRTIRQLVINHGQWID
                      + + G +G E G  +   F+ +K L G  G   + +      L      G EKFS G+ GGEGG+PW+ENF TIR+LVINHGQWID
Subjt:  --------------KFSLGEYGGEGGDPWDENFRTVKQLFGLRGMVEMEV------LNQRNGIGPEKFSFGKYGGEGGNPWNENFRTIRQLVINHGQWID

Query:  SIQMEYENENGELVWSERHGGNGGSQSKCI
        SIQMEYE+ENGE+V SE+HGGNGGS+S+ +
Subjt:  SIQMEYENENGELVWSERHGGNGGSQSKCI

XP_008466678.2 PREDICTED: uncharacterized protein LOC103504031 [Cucumis melo]0.0e+0087.03Show/hide
Query:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC
        MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAF+C + DCNFHIHQSCLHLP QIHSPFHPFHPLL KTNN+FCT CWQMPSGDVYRCRKC
Subjt:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC

Query:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS
        NFQIDIKCVL DTKSSGLRR SGDQFRHFSHPH LTLQLEENR  NRVVCFVCDLLIKS PSYFCSQCDTHFHQ+CAELPRE YD+ FHQHPLFLLPNLS
Subjt:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS

Query:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL
        FANFLCDSCNNNCRKFVY+CPHPR C+FNLHVACLQSFNH+HNF  FRNAMDSFDCR+CGKKG+GFPWFCEICHVLAHRKCAKSP  LRT GHH HDL L
Subjt:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL

Query:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG
        TYFR+   N+ RYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQR D QSTP+VDSTMD+ S+ NDE DNEIQCSVHSHNLN  L   I+ KGDRICDG
Subjt:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG

Query:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE
        C+KGLLS SYGCQQCDF+VHKECAKLPK KTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCK CLSTFDIRCTSIKIPFKHPGH HPLSLDRTNEE
Subjt:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE

Query:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH
        H+CE CGEGVKNRVSFRCV CNFYLDAKCATLPLTVRYR + HPLNLTFVEEEE DEYYCDVCEEERE W+W YSCR C F  HLGCVLGEFPFVKSKIH
Subjt:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH

Query:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP
        EAH+HPLSMVMKGKE+      NCGSC EWCGENLAFQCGTCKFNIHAIGRCY QQLKQGKLAYT   FYSRGVELYEQPT Y P+R   RL+GGKGGNP
Subjt:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP

Query:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE
        WEEKVFSTIR+F+VYH+ CVHAIQIYYEKNGKAVWS KHGGDGGTKYE
Subjt:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE

XP_038898644.1 uncharacterized protein LOC120086188 [Benincasa hispida]0.0e+0084.07Show/hide
Query:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC
        MEFDL+NRPHQHPLFFNEDGRKI+GEVAYCSRCRQPLRPPAF+CSDSDCNFHIHQSCLHLP QIHSPFHPFHPLL KTNNHFCT CWQMPSG VYRCRKC
Subjt:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC

Query:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR----GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLP
        NFQIDIKCVL DTKSSGLR  SGDQFRHFSH HPLTLQLEENR     NR+VCFVCDL+IKS P YFCSQCDTHFHQ+CAELPR+LYD+ FHQHPLFLLP
Subjt:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR----GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLP

Query:  NLSFANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHD
        NLSFANFLCDSCNNNCRKFVY+CPHPR CKFNLHVACLQSFNH+HNF TFRNAMDSFDCRVCGKKG+GFPWFCEICHVLAHRKCAKSPL LRTFGHHFHD
Subjt:  NLSFANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHD

Query:  LGLTYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRI
        L LTYFRD   N+IRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDST DYSSTQN+EQDNEI+CSVHSHNLNFFLPEEI GKG RI
Subjt:  LGLTYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRI

Query:  CDGCMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRT
        CDGC+KGLLS SYGC QCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCK CLSTFDIRCTSIKIPFKHPGH HPLS+DRT
Subjt:  CDGCMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRT

Query:  NEEHSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKS
        NEEH+CE CG GVKNRVSFRCV CNFYLDAKCATLPLTVRYR ++HPLNLTFVEEE  DEYYCDVCEEERE W+W YSCR C+FV HL CV+GEFPFVKS
Subjt:  NEEHSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKS

Query:  KIHEAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKG
        KIHEAHRHPLSMVMKGKE+     KNCGSC E C ENLAFQCGTCKFN+HAIGRCY QQLKQGKLAYT  NFYSRGVELYEQPTIY PVRVPLR YGGKG
Subjt:  KIHEAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKG

Query:  GNPWEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE----EDDGSITWDDGVYSAIRRFVVYEREWICSIQIEYDQNGESIWSPK
        GNPWEEKVFSTIR+FVVYH+ECVHAIQIYYEKNGKAVWS KHGGDGGTKYE      D  +    G YS +      ER  I S+ +E +     I+ P 
Subjt:  GNPWEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE----EDDGSITWDDGVYSAIRRFVVYEREWICSIQIEYDQNGESIWSPK

Query:  HGENDGSISEPEPQSK
          EN    S P  + K
Subjt:  HGENDGSISEPEPQSK

TrEMBL top hitse value%identityAlignment
A0A0A0LDB7 Uncharacterized protein0.0e+0086.23Show/hide
Query:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC
        MEFDLVNRPHQHPLFFNEDGRKINGEVA+CSRCRQPLRPPAF+CS+ DCNFHIHQSCLHLP QIHSPFHP HPL  +TNN+FCT CWQMPSGDVYRCRKC
Subjt:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC

Query:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRG-NRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS
        NFQIDIKCVL DTKSSGLRR SGDQFRHFSHPHPLTLQLEENRG NRVVCFVCDLLIKS PSYFCSQCD HFHQ CAELPRELYD+ FHQHPLFLLPNLS
Subjt:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRG-NRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS

Query:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL
        FANFLCDSCNNNCRKFVY+CPHPR CKFNLHVACLQSFNH+HNF  FRNAMDSFDCRVCGKKG+GFPWFCEICH+LAHRKCAKSPL LRT GHH HDL L
Subjt:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL

Query:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG
        TYFRD   N  RYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQR D QSTP+VD T ++ S+ N+E DNEIQCSVHSHNLN  L   I  KGDRICDG
Subjt:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG

Query:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE
        C+KGLLS SYGCQQCDF+VHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEAC EYFHGFAYHCK CLSTFDIRCTSIKIPFKHPGH HPLSLDRTNEE
Subjt:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE

Query:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH
        H+CE CGEGV+NRVSFRCV CNFYLDAKCATLPLTVRYR + HPLNLTFVEEEE DEYYCDVCEEERE W+W YSCR C F  HLGCVLGEFPFVKSKIH
Subjt:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH

Query:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP
        EAH+HPLSMVMKGKE+      NCGSC EWC ENLAFQCGTCKFN+HAIGRCY QQLKQGKLAYT   FYSRGVELYE+PTIY P+R PLRL+GGKGGN 
Subjt:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP

Query:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE
        WEEKVF+T+R+FVVYH++CVHAIQIYYEKNGKA+WS KHGGDGGTKYE
Subjt:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE

A0A1S3CRI5 uncharacterized protein LOC103503932 isoform X10.0e+0082.39Show/hide
Query:  MDFDLLNNPHPHPLFFIEEGKHDEVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSCIDLPPQIHNRFHPQHTLSRTTNNYMCTACSQMPSGDVYRCYGCGF
        MDFDLLNNPHPHPLFF E+G + EVVFC RCRR LRPPAFSCSDS CNFHIHQSCIDLPPQIHNRFHPQH LSRTTNNY+CT C QMPSGDVY C  CGF
Subjt:  MDFDLLNNPHPHPLFFIEEGKHDEVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSCIDLPPQIHNRFHPQHTLSRTTNNYMCTACSQMPSGDVYRCYGCGF

Query:  QIDVKCAIADTKATGVRRTIGSEFRHFSHPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSP
        QIDVKCAIADTKA+G+R   G++FRHFSHPHTLTL++EQNR T+EI C+VCGLLIKSGSSYYFC +CD++FHQQCAELPREMLN DFH+HPLFLLP SSP
Subjt:  QIDVKCAIADTKATGVRRTIGSEFRHFSHPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSP

Query:  QTICNSCKNDCGEFVYNCSLCGFNLHIACLQSFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDG
        QTICNSCKNDCGEFVYNCSLC FNLHIACLQSF H H+F +YRNRTQFVCRACGEKG+GFSWYC ICHLSVHK+CA+ PLTLRIF HRLHDLSLTYFRD 
Subjt:  QTICNSCKNDCGEFVYNCSLCGFNLHIACLQSFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDG

Query:  VDFVGNKIDCKICGDKISTKYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEELGQDRVCDGCMK
        VDFVGNKIDCKICG+KI TKYAAYGCYKY CNYFVHL CARTQ I FNST+D LDST+DE+VKIEISGSEIQHFIHHHSLNL+SPEEELGQDRVCDGCMK
Subjt:  VDFVGNKIDCKICGDKISTKYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEELGQDRVCDGCMK

Query:  RLSGPSYGCEECDFFVHKECLELPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKC
        RLS PSYGCEECDFFVHKECLELPRKKRNFLHQH L+LISIPNFVFQC+ACL YFNGFAYHCK CLS FDTRC SIKIPFKHP+HQHPLS DRTNEDHKC
Subjt:  RLSGPSYGCEECDFFVHKECLELPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKC

Query:  EGCGEGVKHKVAFRCVDCNFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAH
        EGCGEGVKHKVAFRCVDCNF+LDAGCATLPLGVRYRFDPHPLDL F+E+EE+EEYCC+ICEEERE GPWFYGCQKC+FAAHLDC VGMFPY+KLKKHEAH
Subjt:  EGCGEGVKHKVAFRCVDCNFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAH

Query:  KHTMKLGVKGKEEDCVACGESCAEDLAYECISNCKFKVHAIGLCYHRQLVQGSLAFTNR----------------------------KGGDAWEEKAFTR
        KHTMKLGVKGKEEDCVAC ESCAEDLAYECISNCKFKVHA G CYH Q+V GSLAFTNR                             GG+AW+EK FT 
Subjt:  KHTMKLGVKGKEEDCVACGESCAEDLAYECISNCKFKVHAIGLCYHRQLVQGSLAFTNR----------------------------KGGDAWEEKAFTR

Query:  IRAFLIWHKEWIYSFQIHYEKNGELIWSMKHGGDGGYRSEIHFD
        I+ F I H  WIYSFQ HYEK GELIWS+KHGGDGG +SE+ FD
Subjt:  IRAFLIWHKEWIYSFQIHYEKNGELIWSMKHGGDGGYRSEIHFD

A0A1S3CT35 uncharacterized protein LOC1035040310.0e+0087.03Show/hide
Query:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC
        MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAF+C + DCNFHIHQSCLHLP QIHSPFHPFHPLL KTNN+FCT CWQMPSGDVYRCRKC
Subjt:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC

Query:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS
        NFQIDIKCVL DTKSSGLRR SGDQFRHFSHPH LTLQLEENR  NRVVCFVCDLLIKS PSYFCSQCDTHFHQ+CAELPRE YD+ FHQHPLFLLPNLS
Subjt:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS

Query:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL
        FANFLCDSCNNNCRKFVY+CPHPR C+FNLHVACLQSFNH+HNF  FRNAMDSFDCR+CGKKG+GFPWFCEICHVLAHRKCAKSP  LRT GHH HDL L
Subjt:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL

Query:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG
        TYFR+   N+ RYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQR D QSTP+VDSTMD+ S+ NDE DNEIQCSVHSHNLN  L   I+ KGDRICDG
Subjt:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG

Query:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE
        C+KGLLS SYGCQQCDF+VHKECAKLPK KTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCK CLSTFDIRCTSIKIPFKHPGH HPLSLDRTNEE
Subjt:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE

Query:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH
        H+CE CGEGVKNRVSFRCV CNFYLDAKCATLPLTVRYR + HPLNLTFVEEEE DEYYCDVCEEERE W+W YSCR C F  HLGCVLGEFPFVKSKIH
Subjt:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH

Query:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP
        EAH+HPLSMVMKGKE+      NCGSC EWCGENLAFQCGTCKFNIHAIGRCY QQLKQGKLAYT   FYSRGVELYEQPT Y P+R   RL+GGKGGNP
Subjt:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP

Query:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE
        WEEKVFSTIR+F+VYH+ CVHAIQIYYEKNGKAVWS KHGGDGGTKYE
Subjt:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE

A0A1S4E6E7 uncharacterized protein LOC103503932 isoform X20.0e+0082.17Show/hide
Query:  MDFDLLNNPHPHPLFFIEEGKHDEVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSCIDLPPQIHNRFHPQHTLSRTTNNYMCTACSQMPSGDVYRCYGCGF
        MDFDLLNNPHPHPLFF E+G + EVVFC RCRR LRPPAFSCSDS CNFHIHQSCIDLPPQIHNRFHPQH LSRTTNNY+CT C QMPSGDVY C  CGF
Subjt:  MDFDLLNNPHPHPLFFIEEGKHDEVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSCIDLPPQIHNRFHPQHTLSRTTNNYMCTACSQMPSGDVYRCYGCGF

Query:  QIDVKCAIADTKATGVRRTIGSEFRHFSHPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSP
        QIDVKCAIADTKA+G+R   G++FRHFSHPHTLTL++EQNR T+EI C+VCGLLIKSGSSYYFC +CD++FHQQCAELPREMLN DFH+HPLFLLP SSP
Subjt:  QIDVKCAIADTKATGVRRTIGSEFRHFSHPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSP

Query:  QTICNSCKNDCGEFVYNCSLCGFNLHIACLQSFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDG
        QTICNSCKNDCGEFVYNCSLC FNLHIACLQSF H H+F +YRNRTQFVCRACGEKG+GFSWYC ICHLSVHK+CA+ PLTLRIF HRLHDLSLTYFRD 
Subjt:  QTICNSCKNDCGEFVYNCSLCGFNLHIACLQSFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDG

Query:  VDFVGNKIDCKICGDKISTKYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEELGQDRVCDGCMK
        VDFVGNKIDCKICG+KI TKYAAYGCYKY CNYFVHL CARTQ I FNST+D LDST+DE+VKIEISGSEIQHFIHHHSLNL+SPEEELGQDRVCDGCMK
Subjt:  VDFVGNKIDCKICGDKISTKYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEELGQDRVCDGCMK

Query:  RLSGPSYGCEECDFFVHKECLELPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKC
        RLS PSYGCEECDFFVHKECLELPRKKRNFLHQH L+LISIPNFVFQC+ACL YFNGFAYHCK CLS FDTRC SIKIPFKHP+HQHPLS DRTNEDHKC
Subjt:  RLSGPSYGCEECDFFVHKECLELPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKC

Query:  EGCGEGVKHKVAFRCVDCNFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAH
        EGCGEGVKHKVAFRCVDCNF+LDAGCATLPLGVRYRFDPHPLDL F+E+EE+EEYCC+ICEEERE GPWFYGCQKC+FAAHLDC VGMFPY+KLKKHEAH
Subjt:  EGCGEGVKHKVAFRCVDCNFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAH

Query:  KHTMKLGVKGKEEDCVACGESCAEDLAYECISNCKFKVHAIGLCYHRQLVQGSLAFTNR----------------------------KGGDAWEEKAFTR
        KHTMKLGVKGKEEDCVAC ESCAEDLAYECISNCKFKVHA G CYH Q+V GSLAFTNR                             GG+AW+EK FT 
Subjt:  KHTMKLGVKGKEEDCVACGESCAEDLAYECISNCKFKVHAIGLCYHRQLVQGSLAFTNR----------------------------KGGDAWEEKAFTR

Query:  IRAFLIWHKEWIYSFQIHYEKNGELIWSMKHGGDGGYRSEIHFDSS
        I+ F I H  WIYSFQ HYEK GELIWS+KHGGDGG +SE   D S
Subjt:  IRAFLIWHKEWIYSFQIHYEKNGELIWSMKHGGDGGYRSEIHFDSS

A0A5A7TIW3 Putative Cysteine/Histidine-rich C1 domain family protein0.0e+0087.3Show/hide
Query:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC
        MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAF+C + DCNFHIHQSCLHLP QIHSPFHPFHPLL KTNN+FCT CWQMPSGDVYRCRKC
Subjt:  MEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDVYRCRKC

Query:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS
        NFQIDIKCVL DTKSSGLRR SGDQFRHFSHPHPLTLQLEENR  NRVVCFVCDLLIKS PSYFCSQCDTHFHQ+CAELPRE YD+ FHQHPLFLLPNLS
Subjt:  NFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENR-GNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLS

Query:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL
        FANFLCDSCNNNCRKFVY+CPHPR C+FNLHVACLQSFNH+HNF  FRNAMDSFDCR+CGKKG+GFPWFCEICHVLAHRKCAKSP  LRT GHH HDL L
Subjt:  FANFLCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGL

Query:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG
        TYFR+   N+ RYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQR D QSTP+VDSTMD+ S+ NDE DNEIQCSVHSHNLN  L   I+ KGDRICDG
Subjt:  TYFRD---NKIRYCKICGEKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDG

Query:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE
        C+KGLLS SYGCQQCDF+VHKECAKLPK KTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCK CLSTFDIRCTSIKIPFKHPGH HPLSLDRTNEE
Subjt:  CMKGLLSPSYGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEE

Query:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH
        H+CE CGEGVKNRVSFRCV CNFYLDAKCATLPLTVRYR + HPLNLTFVEEEE DEYYCDVCEEERE W+W YSCR C F  HLGCVLGEFPFVKSKIH
Subjt:  HSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRYRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIH

Query:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP
        EAH+HPLSMVMKGKE+      NCGSC EWCGENLAFQCGTCKFNIHAIGRCY QQLKQGKLAYT   FYSRGVELYEQPT Y P+RV  RL+GGKGGNP
Subjt:  EAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHAIGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNP

Query:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE
        WEEKVFSTIR+F+VYH+ CVHAIQIYYEKNGKAVWS KHGGDGGTKYE
Subjt:  WEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYE

SwissProt top hitse value%identityAlignment
F4HQX1 Jacalin-related lectin 37.2e-0850.98Show/hide
Query:  GKHGGGPWDDGAYSTIRRLLIYHKQWICSLHVEYDKNGHSVWGSKRGGNEG
        G   G  WDDG Y+T+++++I H   I S+ +EYDKNG SVW  KRGG  G
Subjt:  GKHGGGPWDDGAYSTIRRLLIYHKQWICSLHVEYDKNGHSVWGSKRGGNEG

P82859 Agglutinin2.8e-0425.48Show/hide
Query:  EKFSLGEYGGEGGDPWNESFKT--IKQLVINHGMWIDSIQMEYEDENGELVWSKRHGGNG-GFQSEV----GPEKFSLGEYGGEGGDPWDENFRTVKQLF
        E  ++G +GGEGGD W+       I  + I H   I SI  +  DE G L  S++ GG G G++++      PE++ L    G   D W           
Subjt:  EKFSLGEYGGEGGDPWNESFKT--IKQLVINHGMWIDSIQMEYEDENGELVWSKRHGGNG-GFQSEV----GPEKFSLGEYGGEGGDPWDENFRTVKQLF

Query:  GLRGMVEMEVLNQRNGIGPEKFSFGKYGGEGGNPWNENFRTIRQLVINHGQWIDSIQMEYENENGELVWSERHGGNGGSQSKCIATLQRMLDEEGSPMTT
             + +  ++ +   G E   +G YG   G P++                       Y  E G +V    HG +G       A ++    ++ +    
Subjt:  GLRGMVEMEVLNQRNGIGPEKFSFGKYGGEGGNPWNENFRTIRQLVINHGQWIDSIQMEYENENGELVWSERHGGNGGSQSKCIATLQRMLDEEGSPMTT

Query:  VKIEIGGGK---HGGGPWDDGAYSTIRRLLIY-HKQWICSLHVEY-DKNGHSVWGSKRGGNEG
        + +  G G    HGG  WDDG +  IR L +Y     I ++ V Y  K+G  +   K GG  G
Subjt:  VKIEIGGGK---HGGGPWDDGAYSTIRRLLIY-HKQWICSLHVEY-DKNGHSVWGSKRGGNEG

Q5XF82 Jacalin-related lectin 112.8e-0427.27Show/hide
Query:  PEPAAAPPPQIQVEHIKPRQYGGEGGDGWED-MFRTIKRFVVRHG-LWIDSIQIQYEDDNGNLVWSRQHGGDGGSKSENKKEIAMDFDLLNNPHPHPLFF
        P P + PPP+      K +  GG+GGD W+D  F+ +K+  V  G + I +++ +YE     ++ + +HG          KE  + ++     +P     
Subjt:  PEPAAAPPPQIQVEHIKPRQYGGEGGDGWED-MFRTIKRFVVRHG-LWIDSIQIQYEDDNGNLVWSRQHGGDGGSKSENKKEIAMDFDLLNNPHPHPLFF

Query:  IEEGKHDEVV
          EG HD+V+
Subjt:  IEEGKHDEVV

Arabidopsis top hitse value%identityAlignment
AT2G19660.1 Cysteine/Histidine-rich C1 domain family protein1.0e-7028.38Show/hide
Query:  HQHPLF----FNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSG--DVYRCRKCNFQ
        H HPL     F+    K  G V Y            + C++  C    H+ C     +I+   HP HPL    N     +C Q PS   + Y C  C+F+
Subjt:  HQHPLF----FNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSG--DVYRCRKCNFQ

Query:  IDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRGNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLSFANF
        +D +C         L  ++        H HP+ L      G   +C  C   +  A  Y C++CD  FH +C +L  +   +   QHPL LL      ++
Subjt:  IDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRGNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLSFANF

Query:  LCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQS---------FNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHF
          + C     KF     H  +C F +   C+++           H+H        +  F C  CG + D  P+FC  C+ + H +C   P ++     H 
Subjt:  LCDSCNNNCRKFVYTCPHPRLCKFNLHVACLQS---------FNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHF

Query:  HDLGLTYFRDNKIRYCKICGEKLEMKFAGYGCYEC-NYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDE-----QDNEIQCSVHSHNLNFFLPEEIEG
        H +  T    +    C +C +K++  + GY C +C NY  H  CA     D     M++        + D       DN I+   H HNL       I  
Subjt:  HDLGLTYFRDNKIRYCKICGEKLEMKFAGYGCYEC-NYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDE-----QDNEIQCSVHSHNLNFFLPEEIEG

Query:  KGDRICDGCMKGLLSPS-YGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLI-SIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPH
        +   +C  C+  + S + Y C+ CDF +H++CA LP+ K H       TL+ +  +    C+ C + F GF Y C   + T D+RC SI+ P  H  H H
Subjt:  KGDRICDGCMKGLLSPS-YGCQQCDFFVHKECAKLPKTKTHFLHQHLLTLI-SIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPH

Query:  PLSLDRTNEEHSCEGCGEGVKNRV-SFRCVGCNFYLDAKCATLPLTVR-YRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCV
        PL   + + + SC  CG+ +   + SF C  C++ LD KCA LP  V+ +R ++HPL L+  E     EY+C+ CE +  +  W Y+C  C  + H+ CV
Subjt:  PLSLDRTNEEHSCEGCGEGVKNRV-SFRCVGCNFYLDAKCATLPLTVR-YRLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCV

Query:  LGEFPF
        +G+F +
Subjt:  LGEFPF

AT3G27500.1 Cysteine/Histidine-rich C1 domain family protein1.0e-7030.87Show/hide
Query:  HPHPLFFIEEGKHDEVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSC---IDLPPQIHNRFHPQHTLSRTT------NNYMCTACSQMPSGDVYRCYGCGF
        H H    +   KH + + C  C R L    +SC  S+C F+IH +C    D    + +  H  H+L   T       +  C  C       ++ C  C  
Subjt:  HPHPLFFIEEGKHDEVVFCIRCRRLLRPPAFSCSDSDCNFHIHQSC---IDLPPQIHNRFHPQHTLSRTT------NNYMCTACSQMPSGDVYRCYGCGF

Query:  QIDVKCAIADTKATGVRRTIGSEFRHFS-HPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQC-AELPREMLNPDFHKHPLFLLPFS
         +D+ C IAD         +G E+ +   HPH L      +R      C  C    +     Y C  C    H++C  EL     +P   +HPL LL   
Subjt:  QIDVKCAIADTKATGVRRTIGSEFRHFS-HPHTLTLQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQC-AELPREMLNPDFHKHPLFLLPFS

Query:  SPQTI---CNSCKNDCGEFVYNCSLCGFNLHIACLQSFN----------HIHTFAKYRNRTQFVCRACGEKGDGFSWYCTIC-HLSVHKECAEFPLTLRI
        +P      C+ C  D G  +Y+C +C FNL + C    +          H HT         FVC ACG KGD   + C  C  ++ H++CA  P  + +
Subjt:  SPQTI---CNSCKNDCGEFVYNCSLCGFNLHIACLQSFN----------HIHTFAKYRNRTQFVCRACGEKGDGFSWYCTIC-HLSVHKECAEFPLTLRI

Query:  FQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDAL-DSTDDEDVKIEISGSEIQHFIHHHSLNLF
          H  H +S  Y     D+      C +C D+I   Y AY C      Y +H  CA    ++    +D + +  +D +       + I HF H H  N  
Subjt:  FQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDAL-DSTDDEDVKIEISGSEIQHFIHHHSLNLF

Query:  SPEEELGQDRVCDGCMKRLSGPS--YGCEECDFFVHKECLELPRKKRNFLHQHRLNL----ISIPNFVFQCKACLNYF-NGFAYHCKKCLSIFDTRCASI
        S ++   +   C  C   +   S  Y C EC F +H+ C  LP KKR+FL    L L    +     V  C+AC   F  GF Y        FD  C+SI
Subjt:  SPEEELGQDRVCDGCMKRLSGPS--YGCEECDFFVHKECLELPRKKRNFLHQHRLNL----ISIPNFVFQCKACLNYF-NGFAYHCKKCLSIFDTRCASI

Query:  KIPFKHPSHQHPLSHDRTNEDH--KCEGCGEGVKHKVAFRCVDCNFYLDAGCATLPLGVRY-RFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGC
         +PF H SH H L + +    +   C+GCG   K +VA  C  CN++LD  CATLPL V   R+D HPL L + +++   +Y CDICE E     WFY C
Subjt:  KIPFKHPSHQHPLSHDRTNEDH--KCEGCGEGVKHKVAFRCVDCNFYLDAGCATLPLGVRY-RFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGC

Query:  QKCSFAAHLDCVVGMFPYIKLK
          C    H+ CVVG   Y K K
Subjt:  QKCSFAAHLDCVVGMFPYIKLK

AT5G37620.1 Cysteine/Histidine-rich C1 domain family protein4.7e-7129.03Show/hide
Query:  FSCSDSDCNF-HIHQSCIDLPPQIHNRFHPQHTLSRTTNN--YMCTACS--QMPSGDVYRCYGCGFQIDVKCAIADTKATGVRRTIGSEFRHFSHPHTLT
        + C++  C     H+ C +   +I +  HP H L+   N+    C  C    +    +Y C  C F++D+ CA            I   +    HP  L 
Subjt:  FSCSDSDCNF-HIHQSCIDLPPQIHNRFHPQHTLSRTTNN--YMCTACS--QMPSGDVYRCYGCGFQIDVKCAIADTKATGVRRTIGSEFRHFSHPHTLT

Query:  LQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFS-SPQTICNSC-----KNDCGEFVYNCSLCGFNLHIA
           +  +      C  C          Y C  C   FH  CAE   E  +    +HPL LL F+ +P      C     K D  + +++C +C F++  A
Subjt:  LQQEQNRGTNEIVCVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFS-SPQTICNSC-----KNDCGEFVYNCSLCGFNLHIA

Query:  CLQS---------FNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKIST
        C+++           H H       R  F C ACG  G+   ++C  C+  +H++C + P   R+     HD  ++Y R        K  CK+C   +  
Subjt:  CLQS---------FNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKIST

Query:  KYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEE--LGQDRVCDGC-MKRLSGPSYGCEECDFFV
         Y  Y C K   ++ +H  CA  + ++    ++     D+      I  + I+HF H H+L + +  +   L ++ VC+ C ++ LS P Y C++C+F +
Subjt:  KYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIHHHSLNLFSPEEE--LGQDRVCDGC-MKRLSGPSYGCEECDFFV

Query:  HKECLELPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKCEGCGEGVKHKVAFRCV
        H++C   PRKK +        L++  N +FQC  CL  FNGF Y       + D RCA+I    ++ SHQH L +  T   H C  CG   K    FRC 
Subjt:  HKECLELPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHKCEGCGEGVKHKVAFRCV

Query:  DCNFYLDAGCATLPLGV-RYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAHKHTMKLGVKGKEEDC
        +C++ L   CA LP  V   R+D HPL L F E   D EY C+ CE +  S  WFY C  C    H+ CVVG F YI   KH +H    K          
Subjt:  DCNFYLDAGCATLPLGV-RYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKHEAHKHTMKLGVKGKEEDC

Query:  VACGESCAEDLAYECISNCK
        V    S      + C S CK
Subjt:  VACGESCAEDLAYECISNCK

AT5G40320.1 Cysteine/Histidine-rich C1 domain family protein8.0e-7132.46Show/hide
Query:  CVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSPQTI----CNSCKNDCGEFVYNCSLCGFNLHIACLQS---------FN
        C  C  L ++G   Y C+ C    H++CAE   E+ +P   +HPL LL    PQ      C  C    G+ VY+CS+C F+L + C ++           
Subjt:  CVVCGLLIKSGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSPQTI----CNSCKNDCGEFVYNCSLCGFNLHIACLQS---------FN

Query:  HIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYAAYGCYKYECNYF
        H HT        +F C ACG + D   + C  C   +H++C   P  + I +H  H +S TYF    D+      C +C  ++  +Y AY C     +Y 
Subjt:  HIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYAAYGCYKYECNYF

Query:  VHLGCARTQRIYFNSTMDALDSTDD-EDVK------IEISGSEIQHFIH-HHSLNLFSPEEELGQDRV-CDGCMKRL-SGPSYGCEEC-DFFVHKECLEL
        VH  CA  + ++     D L+  D+ ED++        +S   I HF H  H L L         + + C  C++ + +   Y C +C DF +H+ C  L
Subjt:  VHLGCARTQRIYFNSTMDALDSTDD-EDVK------IEISGSEIQHFIH-HHSLNLFSPEEELGQDRV-CDGCMKRL-SGPSYGCEEC-DFFVHKECLEL

Query:  PRKKRNFLHQHRLNLISIPNF---------VFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHK-CEGCGEGVKHKVAF
        PRKKR+ LH H+L L    N          VF C AC    +GF Y C  C    D RC SI  PF +  H HPL   +T+ D K CE C E  +     
Subjt:  PRKKRNFLHQHRLNLISIPNF---------VFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQHPLSHDRTNEDHK-CEGCGEGVKHKVAF

Query:  RCVDCNFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEE-DEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKH-EAHKHTMKLGVKG-
         C+DC++ LD  CATLP  VRY++D HPL L +  D++    Y C+ICE+E +    FY C+      H++CV+G F Y+K + H E +K   ++ + G 
Subjt:  RCVDCNFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEE-DEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKKH-EAHKHTMKLGVKG-

Query:  KEEDCVACGESCA---------EDLAYECISNCKFK
            C  CG  C           D++Y C   C +K
Subjt:  KEEDCVACGESCA---------EDLAYECISNCKFK

AT5G45730.1 Cysteine/Histidine-rich C1 domain family protein5.0e-7331.78Show/hide
Query:  CVVCGLLIKSGSSY-YFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSPQTI------CNSCKNDCGEFVYNCSLCGFNLHIAC---------LQ
        C  CG      S Y Y+C  C+   H  C   P  + +P   +HPL  +   SP+TI      C  C++   + +Y+CS+C F++ + C          +
Subjt:  CVVCGLLIKSGSSY-YFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSPQTI------CNSCKNDCGEFVYNCSLCGFNLHIAC---------LQ

Query:  SFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYAAYGCYKYEC
          +H HT      +  F C ACG  GD   + C  C   +HK C   P  + I +H  H +S TY     D+     +C +C  K+     A+ C +   
Subjt:  SFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTICHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYAAYGCYKYEC

Query:  NYFVHLGCARTQRIYFNSTM-DALDSTDDEDVKIEISGSEIQHFIH-HHSLNLFSPEEELGQDRV-CDGCMKRL-SGPSYGCEECDFFVHKECLELPRKK
        NY VH  CA  + ++    + D  +   +ED    I+  EI HF H  H+L L   +     +++ CD C++ + S P + C EC FF+HK C  LPRKK
Subjt:  NYFVHLGCARTQRIYFNSTM-DALDSTDDEDVKIEISGSEIQHFIH-HHSLNLFSPEEELGQDRV-CDGCMKRL-SGPSYGCEECDFFVHKECLELPRKK

Query:  RNFLHQH--RLNLISIPNFVFQCKACLNYFNGFAYHC---KKCLS---IFDTRCASIKIPFKHPSHQHPLSHDRTNEDHK-CEGCGEGVKHKVAFRCVDC
        RN LH H  RL         F+C +CL YF+GF Y C     C+    +FD RC+SI  PF+H  H HPL   RT+++HK C  CGE  ++ ++  C+ C
Subjt:  RNFLHQH--RLNLISIPNFVFQCKACLNYFNGFAYHC---KKCLS---IFDTRCASIKIPFKHPSHQHPLSHDRTNEDHK-CEGCGEGVKHKVAFRCVDC

Query:  NFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIK-----LKKHEAHKHTMKLGVKGKEE
        +F L   CATLP  V++R D H L L       + +  CDICE + ++  W+YGC +C    H++CV+G   Y+K     L       H M   +     
Subjt:  NFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIK-----LKKHEAHKHTMKLGVKGKEE

Query:  DCVACGESCA------------EDLAYECISNCKFKVH
         C+ C + C               + Y C   C ++ H
Subjt:  DCVACGESCA------------EDLAYECISNCKFKVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGTTGAAGCTTTTGGAGCAGCCACACCCGCACCCATTGACATACATAGAAGAAAAAACAGACAGCCGAGAAGACATGTTCTGTTGTGTCTGCAAAAAAGCATT
GTATCCCCCAGCTTTCATTTGCTCCCCCTGCAAATTCTACATTCATCAATCCTGCATCTACTTCCCTTCTCGAATCCGATCCCGCTTCCATCCCCAACATTCTCTTTCCC
TCACCGAAACCAACTCCGACCATTGCCATTGTTGCTGGCAGATGCCCCGCGATTGCTTCTACACCTGCACCCTGCAGCCCCATTGCAGATTCATCATTGACATCAAATGC
ACCCTCGCTGACACTAGAAGTCTTGGCTTGAACTCGATAGGCAAACATTTCAGCCATTCACATCCCTTGTTCCTCGAGAAGGAGCTTGCTACAAGGAAGCTGGTTGTTTG
CCATCTTTGTGGAATGCTAATTGTTTCTGGTCCTGCTTACTTTTGCTCTAAATGTAATATCCGTTTTCATAAAGGCTGCGCCGAGCTGCCGCAAGAGATATTGCAACTCA
ACCAACACCATCACCCGCTCTTCCTTTTTCCCCATGCCAAGCCTCATTCTTTCTGCAATAGCTGTAAAAATCAATGCTTGCAATTCGTTTACAGCTGTGTTCAGTGTAAT
TTCAATCTCCATGTAACATGTCGAGCTTTTTCAAATCATAAACACAACTTCACCAGACTGCGTACTATAATTAGCTTTCAATGCCTGTTATGTGGTTGGTGGGGACGTGA
TTTTCCATGGTTTTGTAGCATCTGCCATCTTTTGGCTCATGAGAAATGCGCTGAATTGCCCCCATCCCTCTTGGTTGTTGGACACGACTGTCCTCTCAGTTTCACATACA
TTCACCCTTTTGGGAATCAGAGCAAGTTAGCCTGCGATATCTGCAGAAAGAAGGTTGAACCACAGTTCGCAGCATATAGCTGTTCCAAATGCATGTATGTAGTTCATTTG
AATTGTGCTGGGAAAAAGTACTTGCGGGGACTACAACAACATGATGGGCGTTTATATACAGGAGGAAGTGGCAGACTACAAACTTCATCTCAAACCTACATCAATCCGTT
AGGCGATGAAGATGCAACATCCAATAATTTTTCAAAGAAAATGTTTGAAGAAGTTGGTTCGATTGAGATTCTCCATTCAAATTGTGAAAAGGAGTTGATCCTCTGCAAAG
AAGAGGGTGACAATGATAAACAATGCCATGGATGCATGCAAAGCTTCTCAGTGACTAAGCCATCTTACTCTTACAGTTGTGTTAAATGCGGATTCTTTCTCCACAAACAT
TGTGCTGATTTTCCAATTACAAAGAGGCACCCACTTCATAAACATCCTTTAACTCTGATCGCAACCCAAAATGTGGCATTTCAGTGCCATGCTTGTCTGCAGTTCTGCCA
TGGCTATGCCTACCACTGTGAAGAATGTCTCTACACGCTCGACATTCGCTGTGTTTTGATCAAAACCAAGAAACTGAAGCATCCAAGTCACCAGCACTTACTGTCTCTCG
CACAGAACCACGAGGATCAAAAATGTAGGGGCTGTGGCCAGAGCAACAAGACAGTGTTCGAATGCGACGAAGGCTGCAATAACTTCTCTTTGGACTACAGATGTGCAACC
CTACCGCAGAAAGCAAGATGCAAATTCGATGGTAGTTTGTTGGATCTGACATTCTCTGTGGAAGATGAAACAGGTGAATATTACTGTGATGTATGCGAAGAAGAGAGGAA
CCCAGCCATGTGTTTTTATTGCTGCAAGACATGTCGATTAGCTGCCCACCCTGAGTGCATTCTTGGGGAGTATCCATGGCTCAAGTATGGATCATATAAAACTCATAAGC
ATCTATTGGCTCTTGTGACAGAGGGGAAGATGGATTACTCTGATTGTGATCACTGTGGGAAACCTTGTGTTGGCAACTTGGCCTATGAATGTCGCCGCTGCAAGTTCAAT
TATTTTGCAGTGCATAGTCTACAGAGGATGGAAGATGAGGTGGTGGTGAAGATTGGGATTCGAGGGTGCGAAGAGGGTGCGCGTCGTTGGGATGACGGAGCTCATTCCAC
CGTCAGACAGATTGTGATTAATCATGAAAAGTGTATCTACTCGGTGAACATCGAGTATGACAACAATGGGGAATCAATTTGGAAGCCCAAGCATGGCGGAAACAAGGGTT
CCATTTCCAAGCATTTTAGTTTGGGAGAATACGGTGGGGAAGGCGGGGAACCATGGAGTGAGACCTTTCAAGCAATCAAACAGTTGGTGATTCATAGTGATGAACACTGG
ATTGTTTCTATTCAAATGGAATATGTGAATGAGAATGGAGACTTTATAATGGAGTTTGACCTTGTGAATCGTCCACACCAACACCCATTGTTCTTCAACGAAGACGGAAG
AAAAATCAATGGAGAAGTAGCTTACTGTTCCCGATGCCGTCAACCACTGCGGCCGCCGGCGTTCAGCTGCTCCGACTCCGACTGCAACTTCCACATCCATCAATCATGCC
TTCACCTCCCTTCTCAAATCCATTCCCCCTTCCACCCCTTCCACCCTCTTCTTCGAAAAACCAACAACCATTTCTGTACTGCCTGCTGGCAAATGCCTTCCGGCGACGTT
TACAGATGCCGTAAATGCAATTTTCAAATAGACATCAAATGCGTCCTCGCCGATACAAAATCCAGCGGCTTACGCCGTACCTCCGGCGATCAGTTTCGACATTTCAGCCA
CCCACATCCATTAACTCTTCAACTAGAAGAAAACAGAGGAAACAGAGTGGTTTGTTTTGTCTGTGATTTGCTTATAAAATCAGCCCCTTCTTACTTTTGCTCTCAATGTG
ACACTCATTTCCATCAACAATGCGCGGAATTACCACGCGAGTTGTACGATCTTGGATTTCATCAACACCCTTTATTTCTTCTTCCCAATCTTAGCTTTGCTAATTTCCTC
TGCGATAGCTGCAATAACAACTGCAGAAAGTTTGTTTATACTTGTCCTCATCCTCGCCTCTGCAAATTCAATCTCCATGTAGCTTGTTTACAATCTTTCAATCACCAACA
CAATTTCATCACGTTTCGGAATGCAATGGATTCTTTTGATTGTCGAGTTTGTGGTAAGAAAGGCGACGGATTCCCTTGGTTTTGTGAGATTTGCCATGTTTTAGCGCATA
GAAAATGTGCTAAATCGCCACTAAAGCTAAGAACATTTGGACACCATTTTCATGATCTTGGCCTCACCTACTTCCGCGACAATAAAATTCGTTATTGTAAGATCTGTGGC
GAGAAACTAGAGATGAAATTTGCTGGGTATGGTTGTTACGAATGCAATTACTTTACCCATTTGGATTGTGCTGAAACCCAACGCCTCGACCCACAATCCACGCCCATGGT
TGATTCGACAATGGATTATTCTTCTACCCAAAATGACGAACAGGACAATGAGATTCAATGTTCTGTTCATTCCCATAACTTGAATTTCTTCCTCCCAGAAGAGATTGAGG
GAAAAGGAGATAGAATTTGTGATGGGTGTATGAAGGGCCTTTTAAGTCCATCTTATGGTTGTCAACAATGTGATTTCTTTGTCCACAAAGAATGTGCTAAATTGCCAAAA
ACCAAAACCCATTTCCTCCATCAGCATTTGCTCACTCTCATCTCAATCCCAAATTTCATCTTCCACTGCGAAGCTTGTCGCGAATACTTCCATGGCTTCGCCTACCATTG
TAAGAAATGCCTCTCCACATTCGACATTCGTTGCACTTCCATTAAAATCCCATTTAAACACCCGGGTCATCCACATCCCCTGTCTCTTGACAGAACAAATGAAGAACACA
GTTGCGAGGGTTGTGGGGAAGGAGTGAAAAACAGAGTATCGTTCCGATGTGTCGGCTGCAACTTTTATTTGGACGCGAAATGTGCGACACTGCCACTTACGGTAAGATAC
AGATTGGAGGAGCATCCGTTGAATTTGACATTTGTAGAGGAAGAGGAAGGAGATGAATATTACTGCGATGTTTGTGAAGAAGAAAGAGAGGCGTGGATTTGGTGTTATAG
CTGTCGAAGGTGCAATTTTGTGGGGCATTTGGGTTGTGTTCTTGGAGAGTTTCCGTTTGTGAAATCAAAGATTCATGAAGCTCATAGGCATCCGTTGAGTATGGTAATGA
AAGGGAAGGAAGAAAGTGGCAAGCATAGCAAGAATTGTGGGAGTTGCGGTGAATGGTGTGGTGAGAATTTGGCGTTTCAATGTGGAACTTGCAAGTTTAATATCCATGCA
ATTGGGCGTTGTTACCATCAGCAGCTAAAACAGGGGAAGCTGGCCTATACACACACAAATTTTTACTCTCGAGGAGTTGAACTATATGAACAACCAACTATATATTACCC
TGTTCGTGTACCATTGAGGCTGTATGGAGGCAAAGGGGGAAATCCATGGGAAGAAAAGGTTTTCTCAACAATTAGGTCATTTGTTGTGTATCATAAAGAATGTGTCCATG
CCATTCAAATTTACTACGAGAAGAATGGGAAGGCTGTTTGGTCGCCTAAGCATGGCGGAGATGGTGGAACCAAATACGAGGAAGACGACGGTTCTATCACTTGGGACGAC
GGAGTTTATTCGGCGATCAGACGGTTCGTAGTTTACGAGAGAGAGTGGATCTGTTCCATTCAGATTGAATATGATCAGAATGGAGAATCAATTTGGTCGCCCAAACATGG
TGAAAACGATGGTTCTATTTCGGAGCCTGAGCCTCAGTCGAAGCACTTAAACATGGGACCATATGGAGGCAAAGGTGGAGATCATTGGGAAGAGACTTTTCAAACAATCA
GACGGCTGGTGATTTATCATGGCCTTTGGATCGACTCCATTCAAATGGAATACGAAGATGAGAATCAAACGTTATTATGGTCCGAGAAAAATGGAGTTAGGCCTGAGAAA
TTTAGCTTAGGAGAATATGGAGGCGAAGGTGGAGATCCTTGGAACGAGAGTTTTAAGACAATCAAGCAATTGGTGATCAATCATGGAATGTGGATCGACTCCATTCAAAT
GGAATATGAAGATGAGAATGGGGAGTTGGTGTGGTCTAAGAGGCATGGTGGAAATGGAGGTTTCCAATCAGAGGTTGGACCTGAGAAATTTAGCTTGGGAGAATATGGAG
GCGAAGGTGGAGATCCTTGGGATGAGAATTTTAGGACAGTCAAGCAATTGTTTGGTCTAAGAGGCATGGTGGAAATGGAGGTTCTCAATCAGAGAAATGGGATTGGGCCA
GAGAAATTTAGCTTCGGAAAATATGGAGGTGAAGGTGGAAATCCTTGGAATGAGAATTTTAGGACAATTAGACAATTGGTGATTAATCATGGACAGTGGATCGACTCCAT
TCAAATGGAATATGAAAATGAAAATGGGGAGTTGGTGTGGTCTGAGAGGCATGGTGGAAATGGAGGTTCCCAATCAAAGTGCATCGCTACTCTACAGAGGATGTTAGACG
AAGAGGGTTCCCCGATGACGACGGTGAAGATAGAGATTGGTGGAGGCAAACACGGTGGGGGACCTTGGGATGATGGAGCTTATTCCACCATCAGACGCCTTCTAATTTAT
CACAAACAGTGGATCTGTTCCCTTCATGTCGAGTATGATAAGAACGGCCATTCAGTTTGGGGTTCCAAGCGTGGCGGAAACGAGGGTTCCGTTTCTGAGGCAAAGGGGGG
CATCCATGGGAGTATGTCTTCCGGTCGATCAGACGGTTTGTTGTTGATTATGAACAATGGATCCACTCCATTCAATTGGAATATGAGGATAAGAATGGAAAGTTGGTATG
GTCCAAGAAGCATGGTGACACAGATGGAACTTCCAAATCAGAGCTTAAAGACTGAGCCGGAGCCTGCGGCTGCACCTCCACCCCAAATCCAAGTGGAGCACATTAAACCG
AGACAATATGGAGGTGAAGGTGGGGATGGTTGGGAAGATATGTTTCGGACAATCAAACGATTTGTGGTTCGTCATGGATTGTGGATCGACTCCATTCAAATTCAATATGA
AGATGATAATGGAAACCTAGTGTGGTCTAGGCAGCATGGTGGAGATGGAGGATCCAAATCAGAGAACAAGAAGGAGATAGCCATGGATTTTGACCTTCTGAACAACCCAC
ATCCACACCCATTGTTCTTCATAGAAGAGGGGAAGCATGATGAAGTCGTTTTCTGCATTAGATGCCGTCGACTGTTGCGTCCGCCGGCGTTCAGCTGCTCCGACTCCGAC
TGCAACTTCCATATCCATCAATCTTGTATCGACCTTCCTCCTCAAATCCACAACCGCTTCCACCCCCAACATACTCTTTCTCGGACCACCAACAACTATATGTGTACTGC
CTGTTCGCAAATGCCGTCGGGTGATGTTTATCGGTGCTATGGATGCGGTTTTCAGATTGACGTCAAATGCGCCATCGCCGACACAAAAGCCACCGGTGTACGGCGGACGA
TAGGTAGCGAGTTTCGACATTTCAGCCATCCTCATACATTAACCCTTCAGCAAGAACAAAACAGAGGAACCAATGAAATTGTTTGTGTTGTCTGTGGATTGCTTATAAAA
TCAGGTTCTTCTTATTACTTCTGCTCTTATTGTGATGCCCATTTTCATCAACAATGCGCCGAGCTGCCGCGCGAGATGCTAAACCCTGATTTTCATAAGCACCCTTTATT
TCTTCTTCCCTTTAGCTCTCCCCAAACCATCTGTAATAGTTGCAAAAATGACTGTGGAGAGTTCGTCTATAACTGTTCCTTGTGTGGATTCAACCTTCATATCGCTTGCT
TACAATCTTTCAATCACATACACACGTTCGCCAAATATAGGAACCGGACACAGTTTGTTTGTCGAGCATGTGGTGAGAAAGGGGATGGATTCTCATGGTATTGCACCATT
TGCCATCTTTCGGTTCATAAAGAATGCGCTGAATTTCCGTTAACTTTAAGGATATTTCAACACCGACTCCATGATCTTAGCCTCACCTATTTTCGTGATGGAGTTGATTT
TGTTGGCAACAAGATTGACTGTAAGATTTGCGGGGACAAAATAAGTACCAAATATGCTGCATATGGTTGCTACAAGTATGAATGCAACTACTTTGTCCATTTGGGTTGTG
CTCGAACCCAACGCATCTACTTTAACTCGACAATGGATGCTCTTGATTCTACAGATGATGAAGACGTTAAGATTGAGATTTCTGGCTCTGAGATTCAACATTTCATTCAT
CATCATAGCTTAAATTTGTTTTCCCCTGAGGAGGAGCTTGGACAGGACAGAGTTTGTGATGGTTGTATGAAGCGCCTTTCGGGTCCATCTTATGGCTGTGAGGAGTGTGA
TTTCTTTGTCCACAAAGAATGTCTTGAATTGCCTAGAAAGAAAAGGAACTTCCTCCATCAACATAGGCTCAATCTCATATCAATCCCAAATTTCGTCTTCCAATGCAAAG
CTTGTCTCAATTATTTCAATGGCTTTGCCTACCATTGCAAAAAATGTCTCTCCATATTTGACACCCGATGTGCCTCAATCAAAATCCCATTTAAACACCCTAGTCACCAA
CACCCCTTATCTCATGACCGCACAAACGAAGACCACAAATGTGAAGGTTGTGGGGAGGGAGTGAAGCATAAAGTAGCATTTCGATGTGTCGACTGCAACTTCTATTTGGA
TGCAGGATGTGCGACACTGCCACTTGGAGTAAGATACAGATTTGACCCACATCCTCTAGACCTGAAATTTATAGAGGACGAAGAAGATGAAGAGTATTGTTGTGATATCT
GTGAGGAAGAAAGAGAGTCAGGGCCGTGGTTCTACGGCTGCCAAAAGTGCAGTTTTGCTGCACATTTGGACTGTGTTGTTGGGATGTTTCCTTACATAAAGTTAAAGAAG
CATGAAGCTCATAAGCACACAATGAAACTGGGGGTGAAAGGGAAGGAAGAGGATTGTGTGGCTTGTGGTGAATCATGTGCTGAGGATTTGGCCTATGAATGCATTTCCAA
TTGCAAGTTCAAGGTGCATGCCATTGGGCTGTGTTACCACAGGCAGCTAGTGCAGGGGAGCCTAGCTTTCACCAACCGTAAAGGTGGAGATGCTTGGGAAGAAAAAGCTT
TCACAAGAATCAGAGCATTTCTTATTTGGCATAAAGAATGGATCTACTCCTTTCAAATTCATTATGAGAAGAATGGCGAGTTGATATGGTCAATGAAGCACGGCGGAGAT
GGCGGTTATAGATCTGAGATACATTTTGACTCGTCTCCGCAGCAGCAACCAGCTGGAAGTTCTTCACCGAGGCAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGTTGAAGCTTTTGGAGCAGCCACACCCGCACCCATTGACATACATAGAAGAAAAAACAGACAGCCGAGAAGACATGTTCTGTTGTGTCTGCAAAAAAGCATT
GTATCCCCCAGCTTTCATTTGCTCCCCCTGCAAATTCTACATTCATCAATCCTGCATCTACTTCCCTTCTCGAATCCGATCCCGCTTCCATCCCCAACATTCTCTTTCCC
TCACCGAAACCAACTCCGACCATTGCCATTGTTGCTGGCAGATGCCCCGCGATTGCTTCTACACCTGCACCCTGCAGCCCCATTGCAGATTCATCATTGACATCAAATGC
ACCCTCGCTGACACTAGAAGTCTTGGCTTGAACTCGATAGGCAAACATTTCAGCCATTCACATCCCTTGTTCCTCGAGAAGGAGCTTGCTACAAGGAAGCTGGTTGTTTG
CCATCTTTGTGGAATGCTAATTGTTTCTGGTCCTGCTTACTTTTGCTCTAAATGTAATATCCGTTTTCATAAAGGCTGCGCCGAGCTGCCGCAAGAGATATTGCAACTCA
ACCAACACCATCACCCGCTCTTCCTTTTTCCCCATGCCAAGCCTCATTCTTTCTGCAATAGCTGTAAAAATCAATGCTTGCAATTCGTTTACAGCTGTGTTCAGTGTAAT
TTCAATCTCCATGTAACATGTCGAGCTTTTTCAAATCATAAACACAACTTCACCAGACTGCGTACTATAATTAGCTTTCAATGCCTGTTATGTGGTTGGTGGGGACGTGA
TTTTCCATGGTTTTGTAGCATCTGCCATCTTTTGGCTCATGAGAAATGCGCTGAATTGCCCCCATCCCTCTTGGTTGTTGGACACGACTGTCCTCTCAGTTTCACATACA
TTCACCCTTTTGGGAATCAGAGCAAGTTAGCCTGCGATATCTGCAGAAAGAAGGTTGAACCACAGTTCGCAGCATATAGCTGTTCCAAATGCATGTATGTAGTTCATTTG
AATTGTGCTGGGAAAAAGTACTTGCGGGGACTACAACAACATGATGGGCGTTTATATACAGGAGGAAGTGGCAGACTACAAACTTCATCTCAAACCTACATCAATCCGTT
AGGCGATGAAGATGCAACATCCAATAATTTTTCAAAGAAAATGTTTGAAGAAGTTGGTTCGATTGAGATTCTCCATTCAAATTGTGAAAAGGAGTTGATCCTCTGCAAAG
AAGAGGGTGACAATGATAAACAATGCCATGGATGCATGCAAAGCTTCTCAGTGACTAAGCCATCTTACTCTTACAGTTGTGTTAAATGCGGATTCTTTCTCCACAAACAT
TGTGCTGATTTTCCAATTACAAAGAGGCACCCACTTCATAAACATCCTTTAACTCTGATCGCAACCCAAAATGTGGCATTTCAGTGCCATGCTTGTCTGCAGTTCTGCCA
TGGCTATGCCTACCACTGTGAAGAATGTCTCTACACGCTCGACATTCGCTGTGTTTTGATCAAAACCAAGAAACTGAAGCATCCAAGTCACCAGCACTTACTGTCTCTCG
CACAGAACCACGAGGATCAAAAATGTAGGGGCTGTGGCCAGAGCAACAAGACAGTGTTCGAATGCGACGAAGGCTGCAATAACTTCTCTTTGGACTACAGATGTGCAACC
CTACCGCAGAAAGCAAGATGCAAATTCGATGGTAGTTTGTTGGATCTGACATTCTCTGTGGAAGATGAAACAGGTGAATATTACTGTGATGTATGCGAAGAAGAGAGGAA
CCCAGCCATGTGTTTTTATTGCTGCAAGACATGTCGATTAGCTGCCCACCCTGAGTGCATTCTTGGGGAGTATCCATGGCTCAAGTATGGATCATATAAAACTCATAAGC
ATCTATTGGCTCTTGTGACAGAGGGGAAGATGGATTACTCTGATTGTGATCACTGTGGGAAACCTTGTGTTGGCAACTTGGCCTATGAATGTCGCCGCTGCAAGTTCAAT
TATTTTGCAGTGCATAGTCTACAGAGGATGGAAGATGAGGTGGTGGTGAAGATTGGGATTCGAGGGTGCGAAGAGGGTGCGCGTCGTTGGGATGACGGAGCTCATTCCAC
CGTCAGACAGATTGTGATTAATCATGAAAAGTGTATCTACTCGGTGAACATCGAGTATGACAACAATGGGGAATCAATTTGGAAGCCCAAGCATGGCGGAAACAAGGGTT
CCATTTCCAAGCATTTTAGTTTGGGAGAATACGGTGGGGAAGGCGGGGAACCATGGAGTGAGACCTTTCAAGCAATCAAACAGTTGGTGATTCATAGTGATGAACACTGG
ATTGTTTCTATTCAAATGGAATATGTGAATGAGAATGGAGACTTTATAATGGAGTTTGACCTTGTGAATCGTCCACACCAACACCCATTGTTCTTCAACGAAGACGGAAG
AAAAATCAATGGAGAAGTAGCTTACTGTTCCCGATGCCGTCAACCACTGCGGCCGCCGGCGTTCAGCTGCTCCGACTCCGACTGCAACTTCCACATCCATCAATCATGCC
TTCACCTCCCTTCTCAAATCCATTCCCCCTTCCACCCCTTCCACCCTCTTCTTCGAAAAACCAACAACCATTTCTGTACTGCCTGCTGGCAAATGCCTTCCGGCGACGTT
TACAGATGCCGTAAATGCAATTTTCAAATAGACATCAAATGCGTCCTCGCCGATACAAAATCCAGCGGCTTACGCCGTACCTCCGGCGATCAGTTTCGACATTTCAGCCA
CCCACATCCATTAACTCTTCAACTAGAAGAAAACAGAGGAAACAGAGTGGTTTGTTTTGTCTGTGATTTGCTTATAAAATCAGCCCCTTCTTACTTTTGCTCTCAATGTG
ACACTCATTTCCATCAACAATGCGCGGAATTACCACGCGAGTTGTACGATCTTGGATTTCATCAACACCCTTTATTTCTTCTTCCCAATCTTAGCTTTGCTAATTTCCTC
TGCGATAGCTGCAATAACAACTGCAGAAAGTTTGTTTATACTTGTCCTCATCCTCGCCTCTGCAAATTCAATCTCCATGTAGCTTGTTTACAATCTTTCAATCACCAACA
CAATTTCATCACGTTTCGGAATGCAATGGATTCTTTTGATTGTCGAGTTTGTGGTAAGAAAGGCGACGGATTCCCTTGGTTTTGTGAGATTTGCCATGTTTTAGCGCATA
GAAAATGTGCTAAATCGCCACTAAAGCTAAGAACATTTGGACACCATTTTCATGATCTTGGCCTCACCTACTTCCGCGACAATAAAATTCGTTATTGTAAGATCTGTGGC
GAGAAACTAGAGATGAAATTTGCTGGGTATGGTTGTTACGAATGCAATTACTTTACCCATTTGGATTGTGCTGAAACCCAACGCCTCGACCCACAATCCACGCCCATGGT
TGATTCGACAATGGATTATTCTTCTACCCAAAATGACGAACAGGACAATGAGATTCAATGTTCTGTTCATTCCCATAACTTGAATTTCTTCCTCCCAGAAGAGATTGAGG
GAAAAGGAGATAGAATTTGTGATGGGTGTATGAAGGGCCTTTTAAGTCCATCTTATGGTTGTCAACAATGTGATTTCTTTGTCCACAAAGAATGTGCTAAATTGCCAAAA
ACCAAAACCCATTTCCTCCATCAGCATTTGCTCACTCTCATCTCAATCCCAAATTTCATCTTCCACTGCGAAGCTTGTCGCGAATACTTCCATGGCTTCGCCTACCATTG
TAAGAAATGCCTCTCCACATTCGACATTCGTTGCACTTCCATTAAAATCCCATTTAAACACCCGGGTCATCCACATCCCCTGTCTCTTGACAGAACAAATGAAGAACACA
GTTGCGAGGGTTGTGGGGAAGGAGTGAAAAACAGAGTATCGTTCCGATGTGTCGGCTGCAACTTTTATTTGGACGCGAAATGTGCGACACTGCCACTTACGGTAAGATAC
AGATTGGAGGAGCATCCGTTGAATTTGACATTTGTAGAGGAAGAGGAAGGAGATGAATATTACTGCGATGTTTGTGAAGAAGAAAGAGAGGCGTGGATTTGGTGTTATAG
CTGTCGAAGGTGCAATTTTGTGGGGCATTTGGGTTGTGTTCTTGGAGAGTTTCCGTTTGTGAAATCAAAGATTCATGAAGCTCATAGGCATCCGTTGAGTATGGTAATGA
AAGGGAAGGAAGAAAGTGGCAAGCATAGCAAGAATTGTGGGAGTTGCGGTGAATGGTGTGGTGAGAATTTGGCGTTTCAATGTGGAACTTGCAAGTTTAATATCCATGCA
ATTGGGCGTTGTTACCATCAGCAGCTAAAACAGGGGAAGCTGGCCTATACACACACAAATTTTTACTCTCGAGGAGTTGAACTATATGAACAACCAACTATATATTACCC
TGTTCGTGTACCATTGAGGCTGTATGGAGGCAAAGGGGGAAATCCATGGGAAGAAAAGGTTTTCTCAACAATTAGGTCATTTGTTGTGTATCATAAAGAATGTGTCCATG
CCATTCAAATTTACTACGAGAAGAATGGGAAGGCTGTTTGGTCGCCTAAGCATGGCGGAGATGGTGGAACCAAATACGAGGAAGACGACGGTTCTATCACTTGGGACGAC
GGAGTTTATTCGGCGATCAGACGGTTCGTAGTTTACGAGAGAGAGTGGATCTGTTCCATTCAGATTGAATATGATCAGAATGGAGAATCAATTTGGTCGCCCAAACATGG
TGAAAACGATGGTTCTATTTCGGAGCCTGAGCCTCAGTCGAAGCACTTAAACATGGGACCATATGGAGGCAAAGGTGGAGATCATTGGGAAGAGACTTTTCAAACAATCA
GACGGCTGGTGATTTATCATGGCCTTTGGATCGACTCCATTCAAATGGAATACGAAGATGAGAATCAAACGTTATTATGGTCCGAGAAAAATGGAGTTAGGCCTGAGAAA
TTTAGCTTAGGAGAATATGGAGGCGAAGGTGGAGATCCTTGGAACGAGAGTTTTAAGACAATCAAGCAATTGGTGATCAATCATGGAATGTGGATCGACTCCATTCAAAT
GGAATATGAAGATGAGAATGGGGAGTTGGTGTGGTCTAAGAGGCATGGTGGAAATGGAGGTTTCCAATCAGAGGTTGGACCTGAGAAATTTAGCTTGGGAGAATATGGAG
GCGAAGGTGGAGATCCTTGGGATGAGAATTTTAGGACAGTCAAGCAATTGTTTGGTCTAAGAGGCATGGTGGAAATGGAGGTTCTCAATCAGAGAAATGGGATTGGGCCA
GAGAAATTTAGCTTCGGAAAATATGGAGGTGAAGGTGGAAATCCTTGGAATGAGAATTTTAGGACAATTAGACAATTGGTGATTAATCATGGACAGTGGATCGACTCCAT
TCAAATGGAATATGAAAATGAAAATGGGGAGTTGGTGTGGTCTGAGAGGCATGGTGGAAATGGAGGTTCCCAATCAAAGTGCATCGCTACTCTACAGAGGATGTTAGACG
AAGAGGGTTCCCCGATGACGACGGTGAAGATAGAGATTGGTGGAGGCAAACACGGTGGGGGACCTTGGGATGATGGAGCTTATTCCACCATCAGACGCCTTCTAATTTAT
CACAAACAGTGGATCTGTTCCCTTCATGTCGAGTATGATAAGAACGGCCATTCAGTTTGGGGTTCCAAGCGTGGCGGAAACGAGGGTTCCGTTTCTGAGGCAAAGGGGGG
CATCCATGGGAGTATGTCTTCCGGTCGATCAGACGGTTTGTTGTTGATTATGAACAATGGATCCACTCCATTCAATTGGAATATGAGGATAAGAATGGAAAGTTGGTATG
GTCCAAGAAGCATGGTGACACAGATGGAACTTCCAAATCAGAGCTTAAAGACTGAGCCGGAGCCTGCGGCTGCACCTCCACCCCAAATCCAAGTGGAGCACATTAAACCG
AGACAATATGGAGGTGAAGGTGGGGATGGTTGGGAAGATATGTTTCGGACAATCAAACGATTTGTGGTTCGTCATGGATTGTGGATCGACTCCATTCAAATTCAATATGA
AGATGATAATGGAAACCTAGTGTGGTCTAGGCAGCATGGTGGAGATGGAGGATCCAAATCAGAGAACAAGAAGGAGATAGCCATGGATTTTGACCTTCTGAACAACCCAC
ATCCACACCCATTGTTCTTCATAGAAGAGGGGAAGCATGATGAAGTCGTTTTCTGCATTAGATGCCGTCGACTGTTGCGTCCGCCGGCGTTCAGCTGCTCCGACTCCGAC
TGCAACTTCCATATCCATCAATCTTGTATCGACCTTCCTCCTCAAATCCACAACCGCTTCCACCCCCAACATACTCTTTCTCGGACCACCAACAACTATATGTGTACTGC
CTGTTCGCAAATGCCGTCGGGTGATGTTTATCGGTGCTATGGATGCGGTTTTCAGATTGACGTCAAATGCGCCATCGCCGACACAAAAGCCACCGGTGTACGGCGGACGA
TAGGTAGCGAGTTTCGACATTTCAGCCATCCTCATACATTAACCCTTCAGCAAGAACAAAACAGAGGAACCAATGAAATTGTTTGTGTTGTCTGTGGATTGCTTATAAAA
TCAGGTTCTTCTTATTACTTCTGCTCTTATTGTGATGCCCATTTTCATCAACAATGCGCCGAGCTGCCGCGCGAGATGCTAAACCCTGATTTTCATAAGCACCCTTTATT
TCTTCTTCCCTTTAGCTCTCCCCAAACCATCTGTAATAGTTGCAAAAATGACTGTGGAGAGTTCGTCTATAACTGTTCCTTGTGTGGATTCAACCTTCATATCGCTTGCT
TACAATCTTTCAATCACATACACACGTTCGCCAAATATAGGAACCGGACACAGTTTGTTTGTCGAGCATGTGGTGAGAAAGGGGATGGATTCTCATGGTATTGCACCATT
TGCCATCTTTCGGTTCATAAAGAATGCGCTGAATTTCCGTTAACTTTAAGGATATTTCAACACCGACTCCATGATCTTAGCCTCACCTATTTTCGTGATGGAGTTGATTT
TGTTGGCAACAAGATTGACTGTAAGATTTGCGGGGACAAAATAAGTACCAAATATGCTGCATATGGTTGCTACAAGTATGAATGCAACTACTTTGTCCATTTGGGTTGTG
CTCGAACCCAACGCATCTACTTTAACTCGACAATGGATGCTCTTGATTCTACAGATGATGAAGACGTTAAGATTGAGATTTCTGGCTCTGAGATTCAACATTTCATTCAT
CATCATAGCTTAAATTTGTTTTCCCCTGAGGAGGAGCTTGGACAGGACAGAGTTTGTGATGGTTGTATGAAGCGCCTTTCGGGTCCATCTTATGGCTGTGAGGAGTGTGA
TTTCTTTGTCCACAAAGAATGTCTTGAATTGCCTAGAAAGAAAAGGAACTTCCTCCATCAACATAGGCTCAATCTCATATCAATCCCAAATTTCGTCTTCCAATGCAAAG
CTTGTCTCAATTATTTCAATGGCTTTGCCTACCATTGCAAAAAATGTCTCTCCATATTTGACACCCGATGTGCCTCAATCAAAATCCCATTTAAACACCCTAGTCACCAA
CACCCCTTATCTCATGACCGCACAAACGAAGACCACAAATGTGAAGGTTGTGGGGAGGGAGTGAAGCATAAAGTAGCATTTCGATGTGTCGACTGCAACTTCTATTTGGA
TGCAGGATGTGCGACACTGCCACTTGGAGTAAGATACAGATTTGACCCACATCCTCTAGACCTGAAATTTATAGAGGACGAAGAAGATGAAGAGTATTGTTGTGATATCT
GTGAGGAAGAAAGAGAGTCAGGGCCGTGGTTCTACGGCTGCCAAAAGTGCAGTTTTGCTGCACATTTGGACTGTGTTGTTGGGATGTTTCCTTACATAAAGTTAAAGAAG
CATGAAGCTCATAAGCACACAATGAAACTGGGGGTGAAAGGGAAGGAAGAGGATTGTGTGGCTTGTGGTGAATCATGTGCTGAGGATTTGGCCTATGAATGCATTTCCAA
TTGCAAGTTCAAGGTGCATGCCATTGGGCTGTGTTACCACAGGCAGCTAGTGCAGGGGAGCCTAGCTTTCACCAACCGTAAAGGTGGAGATGCTTGGGAAGAAAAAGCTT
TCACAAGAATCAGAGCATTTCTTATTTGGCATAAAGAATGGATCTACTCCTTTCAAATTCATTATGAGAAGAATGGCGAGTTGATATGGTCAATGAAGCACGGCGGAGAT
GGCGGTTATAGATCTGAGATACATTTTGACTCGTCTCCGCAGCAGCAACCAGCTGGAAGTTCTTCACCGAGGCAA
Protein sequenceShow/hide protein sequence
MEKLKLLEQPHPHPLTYIEEKTDSREDMFCCVCKKALYPPAFICSPCKFYIHQSCIYFPSRIRSRFHPQHSLSLTETNSDHCHCCWQMPRDCFYTCTLQPHCRFIIDIKC
TLADTRSLGLNSIGKHFSHSHPLFLEKELATRKLVVCHLCGMLIVSGPAYFCSKCNIRFHKGCAELPQEILQLNQHHHPLFLFPHAKPHSFCNSCKNQCLQFVYSCVQCN
FNLHVTCRAFSNHKHNFTRLRTIISFQCLLCGWWGRDFPWFCSICHLLAHEKCAELPPSLLVVGHDCPLSFTYIHPFGNQSKLACDICRKKVEPQFAAYSCSKCMYVVHL
NCAGKKYLRGLQQHDGRLYTGGSGRLQTSSQTYINPLGDEDATSNNFSKKMFEEVGSIEILHSNCEKELILCKEEGDNDKQCHGCMQSFSVTKPSYSYSCVKCGFFLHKH
CADFPITKRHPLHKHPLTLIATQNVAFQCHACLQFCHGYAYHCEECLYTLDIRCVLIKTKKLKHPSHQHLLSLAQNHEDQKCRGCGQSNKTVFECDEGCNNFSLDYRCAT
LPQKARCKFDGSLLDLTFSVEDETGEYYCDVCEEERNPAMCFYCCKTCRLAAHPECILGEYPWLKYGSYKTHKHLLALVTEGKMDYSDCDHCGKPCVGNLAYECRRCKFN
YFAVHSLQRMEDEVVVKIGIRGCEEGARRWDDGAHSTVRQIVINHEKCIYSVNIEYDNNGESIWKPKHGGNKGSISKHFSLGEYGGEGGEPWSETFQAIKQLVIHSDEHW
IVSIQMEYVNENGDFIMEFDLVNRPHQHPLFFNEDGRKINGEVAYCSRCRQPLRPPAFSCSDSDCNFHIHQSCLHLPSQIHSPFHPFHPLLRKTNNHFCTACWQMPSGDV
YRCRKCNFQIDIKCVLADTKSSGLRRTSGDQFRHFSHPHPLTLQLEENRGNRVVCFVCDLLIKSAPSYFCSQCDTHFHQQCAELPRELYDLGFHQHPLFLLPNLSFANFL
CDSCNNNCRKFVYTCPHPRLCKFNLHVACLQSFNHQHNFITFRNAMDSFDCRVCGKKGDGFPWFCEICHVLAHRKCAKSPLKLRTFGHHFHDLGLTYFRDNKIRYCKICG
EKLEMKFAGYGCYECNYFTHLDCAETQRLDPQSTPMVDSTMDYSSTQNDEQDNEIQCSVHSHNLNFFLPEEIEGKGDRICDGCMKGLLSPSYGCQQCDFFVHKECAKLPK
TKTHFLHQHLLTLISIPNFIFHCEACREYFHGFAYHCKKCLSTFDIRCTSIKIPFKHPGHPHPLSLDRTNEEHSCEGCGEGVKNRVSFRCVGCNFYLDAKCATLPLTVRY
RLEEHPLNLTFVEEEEGDEYYCDVCEEEREAWIWCYSCRRCNFVGHLGCVLGEFPFVKSKIHEAHRHPLSMVMKGKEESGKHSKNCGSCGEWCGENLAFQCGTCKFNIHA
IGRCYHQQLKQGKLAYTHTNFYSRGVELYEQPTIYYPVRVPLRLYGGKGGNPWEEKVFSTIRSFVVYHKECVHAIQIYYEKNGKAVWSPKHGGDGGTKYEEDDGSITWDD
GVYSAIRRFVVYEREWICSIQIEYDQNGESIWSPKHGENDGSISEPEPQSKHLNMGPYGGKGGDHWEETFQTIRRLVIYHGLWIDSIQMEYEDENQTLLWSEKNGVRPEK
FSLGEYGGEGGDPWNESFKTIKQLVINHGMWIDSIQMEYEDENGELVWSKRHGGNGGFQSEVGPEKFSLGEYGGEGGDPWDENFRTVKQLFGLRGMVEMEVLNQRNGIGP
EKFSFGKYGGEGGNPWNENFRTIRQLVINHGQWIDSIQMEYENENGELVWSERHGGNGGSQSKCIATLQRMLDEEGSPMTTVKIEIGGGKHGGGPWDDGAYSTIRRLLIY
HKQWICSLHVEYDKNGHSVWGSKRGGNEGSVSEAKGGIHGSMSSGRSDGLLLIMNNGSTPFNWNMRIRMESWYGPRSMVTQMELPNQSLKTEPEPAAAPPPQIQVEHIKP
RQYGGEGGDGWEDMFRTIKRFVVRHGLWIDSIQIQYEDDNGNLVWSRQHGGDGGSKSENKKEIAMDFDLLNNPHPHPLFFIEEGKHDEVVFCIRCRRLLRPPAFSCSDSD
CNFHIHQSCIDLPPQIHNRFHPQHTLSRTTNNYMCTACSQMPSGDVYRCYGCGFQIDVKCAIADTKATGVRRTIGSEFRHFSHPHTLTLQQEQNRGTNEIVCVVCGLLIK
SGSSYYFCSYCDAHFHQQCAELPREMLNPDFHKHPLFLLPFSSPQTICNSCKNDCGEFVYNCSLCGFNLHIACLQSFNHIHTFAKYRNRTQFVCRACGEKGDGFSWYCTI
CHLSVHKECAEFPLTLRIFQHRLHDLSLTYFRDGVDFVGNKIDCKICGDKISTKYAAYGCYKYECNYFVHLGCARTQRIYFNSTMDALDSTDDEDVKIEISGSEIQHFIH
HHSLNLFSPEEELGQDRVCDGCMKRLSGPSYGCEECDFFVHKECLELPRKKRNFLHQHRLNLISIPNFVFQCKACLNYFNGFAYHCKKCLSIFDTRCASIKIPFKHPSHQ
HPLSHDRTNEDHKCEGCGEGVKHKVAFRCVDCNFYLDAGCATLPLGVRYRFDPHPLDLKFIEDEEDEEYCCDICEEERESGPWFYGCQKCSFAAHLDCVVGMFPYIKLKK
HEAHKHTMKLGVKGKEEDCVACGESCAEDLAYECISNCKFKVHAIGLCYHRQLVQGSLAFTNRKGGDAWEEKAFTRIRAFLIWHKEWIYSFQIHYEKNGELIWSMKHGGD
GGYRSEIHFDSSPQQQPAGSSSPRQ