; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012602 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012602
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionENTH domain-containing protein
Genome locationscaffold63:1966720..1967808
RNA-Seq ExpressionMS012602
SyntenyMS012602
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015410.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.2e-15177.53Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MVRT+KLS LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP  K L+ LLSLGKTSRATAAAA+EVLMDRLQST NSAVALKCL+A+HHI+
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVL  SRILG FFV SSSS  EREKK+EQIS  FNSDLL++TESL+GLI
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI

Query:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE
        EE +K PH LHLN N LVD+I  FV +DYLSA KEIS RVTEF QRL CLSFGESVELVC LKRLEDCKEKQS GIS  +EIL+ GFWG I E +NLIGE
Subjt:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE

Query:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI
        +K+ R+     GKL  T +RMSDSGRFM++  A +YR S+RFGSERFDF CK IPV+GITE YL+
Subjt:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI

XP_022136401.1 putative clathrin assembly protein At4g40080 [Momordica charantia]1.0e-190100Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE
        KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE

Query:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET
        ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET
Subjt:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET

Query:  KENRDGTGAGGKLIMTTARMSDSGRFMERANIYRDSIRFGSERFDF
        KENRDGTGAGGKLIMTTARMSDSGRFMERANIYRDSIRFGSERFDF
Subjt:  KENRDGTGAGGKLIMTTARMSDSGRFMERANIYRDSIRFGSERFDF

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata]1.4e-15077.53Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MVRT+KLS LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP  K L+ LLSLGKTSRATAAAA+EVLMDRLQST NSAVALKCL+A+HHI+
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVL  SRILG FFV SSSS  EREKK+EQIS  FNSDLL++TESL+GLI
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI

Query:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE
        EE +K PH LHLN N LVD+I  FV +DYLSA KEIS RVTEF QRL CLSFGESVELVC LKRLEDCKEKQS GIS  +EIL+ GFWG I E +NLIGE
Subjt:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE

Query:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI
        +K+ R+     GKL  T +RMSDSGRFM++  A +YR S+RFGSERFDF CK IPV+GITE YL+
Subjt:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]8.1e-15177.26Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MVRT+KLS LIGLIKDKASQSKAAL+AKPNI+SFQLALLRATTHDP+APP  K L+ LLSLGKTSRATAAAA+EVLMDRLQST NSAVALKCL+A+HHI+
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVL  SRILG FFV SSSS  EREKK+EQIS  FNSDLL++TESL+GLI
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI

Query:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE
        EE +K PH LHLN N LVD+I  FV +DYLSA KEIS RVTEF QRL CLSFGESVELVC LKRLEDCKEKQS GIS  +EIL+ GFWG I E +NLIGE
Subjt:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE

Query:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI
        +K++R+     GKL  T +RMSDSGRFM++  A +YR S+RFGSERFDF CK IPV+GITE YL+
Subjt:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]1.2e-15979.95Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MV T+ LS LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP +KHL  LLSLGKTSRATAAAA+EVLMDRLQ+T NSAVALKCL+AVHHI+
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE
        K+GGFILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVLS SRILGFFV SS+S EE+EKK+EQIS + NSDLL++TESLVGLIE
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE

Query:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET
        ET+K PH LHLN N+L D+I  FV DDYLSAMKEIS+RVTEFHQRLSCLSFGESVELVC LKRLEDCKEKQS GIS+KYE+LMD FWG I ETKNLIGE+
Subjt:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET

Query:  KENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI
        KEN++    GGKL  T +RMSDSGRFMER  A  YRDS+RFGSERFD  CK  PV G  E Y +
Subjt:  KENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI

TrEMBL top hitse value%identityAlignment
A0A1S3C1C0 putative clathrin assembly protein At4g400801.0e-14677.78Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        M+ T++LS LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP DKHL+ALLSLGKTSRATAAAA+EVLMDRLQ+THNSAVALKCL+AVHHI 
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE
        K+GGFILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVLS SRILGF V SSSS EE E+K+EQIS ++NS+LL+DTESLVGLIE
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE

Query:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET
        E +K P  LHLN N+LVD+I GFV DDYL+AMKEIS+RVTEFH RL CLSFGESVELVC LKRL+D KEKQSLGI A+YE+LMDGFW  I ETKNLIG +
Subjt:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET

Query:  KENRDGTGAGGKLIMTTARMSDSGRFMERANI--YRDSIRFGSERFDFNCK
        KENRDG     KL     R+SDSGRF+ER+N   Y D + F SERF    K
Subjt:  KENRDGTGAGGKLIMTTARMSDSGRFMERANI--YRDSIRFGSERFDFNCK

A0A5A7TT50 Putative clathrin assembly protein1.3e-14677.21Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        M+ T++LS LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP DKHL+ALLSLGKTSRATAAAA+EVLMDRLQ+THNSAVALKCL+AVHHI 
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE
        K+GGFILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVLS SR LGF V SSSS EE E+K+EQIS ++NS+LL+DTESLVGLIE
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE

Query:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET
        E +K P  LHLN N+LVD+I GFV DDYL+AMK+IS+RVTEFH RL CLSFGESVELVC LKRL+DCKEKQS+GI A+YE+LMDGFW  I ETKNLIG +
Subjt:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET

Query:  KENRDGTGAGGKLIMTTARMSDSGRFMERANI--YRDSIRFGSERFDFNCK
        KENRDG     KL     R+SDSGRF+ER+N   Y D + F SERF    K
Subjt:  KENRDGTGAGGKLIMTTARMSDSGRFMERANI--YRDSIRFGSERFDFNCK

A0A6J1C3E8 putative clathrin assembly protein At4g400805.1e-191100Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE
        KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIE

Query:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET
        ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET
Subjt:  ETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGET

Query:  KENRDGTGAGGKLIMTTARMSDSGRFMERANIYRDSIRFGSERFDF
        KENRDGTGAGGKLIMTTARMSDSGRFMERANIYRDSIRFGSERFDF
Subjt:  KENRDGTGAGGKLIMTTARMSDSGRFMERANIYRDSIRFGSERFDF

A0A6J1EP16 putative clathrin assembly protein At4g400806.7e-15177.53Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MVRT+KLS LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP  K L+ LLSLGKTSRATAAAA+EVLMDRLQST NSAVALKCL+A+HHI+
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVL  SRILG FFV SSSS  EREKK+EQIS  FNSDLL++TESL+GLI
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI

Query:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE
        EE +K PH LHLN N LVD+I  FV +DYLSA KEIS RVTEF QRL CLSFGESVELVC LKRLEDCKEKQS GIS  +EIL+ GFWG I E +NLIGE
Subjt:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE

Query:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI
        +K+ R+     GKL  T +RMSDSGRFM++  A +YR S+RFGSERFDF CK IPV+GITE YL+
Subjt:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI

A0A6J1JCT4 putative clathrin assembly protein At4g400805.3e-14876.44Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL
        MV T+KLS LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP  K L+ LLS GKTSRATAAAA+EVLMDRLQST NSAVALKCL+A+HHI+
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHIL

Query:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVL  SRILG FFV SSSS  EREKK+EQIS  FNSDLL++TESL+GLI
Subjt:  KHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILG-FFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLI

Query:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE
        EE +K PH LHLN N LVD+I  FV +DYLSA KEIS RVTEF  RL CLSFGESVELVC LKRLEDCKEKQS GIS  +EIL+ GFWG I E +NLIGE
Subjt:  EETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGE

Query:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI
        +K+ R+     GKL  T +RMSDSGRFM++  A + R S+RFGSERFDF CK IPV+GITE YL+
Subjt:  TKENRDGTGAGGKLIMTTARMSDSGRFMER--ANIYRDSIRFGSERFDFNCKRIPVVGITEPYLI

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026508.8e-1535.37Show/hide
Query:  KLSGLIGLIKDKASQSKAALIAKPNILS-FQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGG
        KL   IG +KD+ S   A +  + + L+  ++A+++AT HD Y P  DK++  +L L   SR   +A +  L  RL  T N +VALK L+ +  +L  G 
Subjt:  KLSGLIGLIKDKASQSKAALIAKPNILS-FQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGG

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIE
           + ++  F    G   L +SDFRD S   SW+ S++VR YA Y++
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIE

Q8H0W9 Putative clathrin assembly protein At5g104102.3e-3131.13Show/hide
Query:  LIGLIKDKASQSKAALI---AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P  PP+  +++A++S   ++   A AA    + RL+ T N+ VA K L+ +H ++K     
Subjt:  LIGLIKDKASQSKAALI---AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIEETTKKP
         +D+        GRN LKL++F D SS ++ ELS W+RWY QY++ +    ++LG F     + +++ ++ +++S+     ++R T+SLV   E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIEETTKKP

Query:  HSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRL---SCLSFGE--SVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGETK
            +  NK+VD I   V +DY   ++ + VR+    +RL        G+    +   +L RL +CKE  S G+  +   L D FW L+ E      E K
Subjt:  HSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRL---SCLSFGE--SVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGETK

Query:  ENRDGTGAGGKLIMTTAR
         N+      G L+ TT +
Subjt:  ENRDGTGAGGKLIMTTAR

Q8L936 Putative clathrin assembly protein At4g400802.9e-9051.42Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIA---KPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVH
        M R    + LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP ++HLA +LS G  SRATA++A+E +M+RL +T ++ VALK L+ +H
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIA---KPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVH

Query:  HILKHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVG
        HI+KHG FILQDQLSVFP +GGRNYLKLS FRD  SP+ WELSSWVRWYA Y+E +LSTSRI+GFF++S+SS   +E+  E +S++ NSDLLR+ ++LVG
Subjt:  HILKHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVG

Query:  LIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEI-LMDGFWGLIWETKNL
        L+EE  K P         L D+I   V +DY+S++ E+  R  EF +R + LSFG+++ELVC LKRLE CKE+ S      ++   +DGFWGL+ E K +
Subjt:  LIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEI-LMDGFWGLIWETKNL

Query:  IGETKENRDGTGAGGKLIMTTARMS---DSGRFMERANI-YRDSIRFGSERF
        IG  ++N    G   K I+   +     +S RF +R  I Y + +RF S RF
Subjt:  IGETKENRDGTGAGGKLIMTTARMS---DSGRFMERANI-YRDSIRFGSERF

Q8LBH2 Putative clathrin assembly protein At2g016006.1e-1635.71Show/hide
Query:  GLIKDKASQSKAALI-AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATA--AAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFILQ
        G +KD    +K  L+          +A+++AT H    PP D+HL  + +    +RA A  A  I  L  RL  T N  VALK L+ +H +L+ G    +
Subjt:  GLIKDKASQSKAALI-AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATA--AAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFILQ

Query:  DQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGF
        ++L  F   G    L+LS+F+D+SSPI+W+ S+WVR YA ++E  L   R+L +
Subjt:  DQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGF

Q9FKQ2 Putative clathrin assembly protein At5g653701.3e-3131.56Show/hide
Query:  KLSGLIGLIKDKASQSK---AALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILK-
        KL+ L G++KD+ASQ K     L +  N  +  LALL+AT+H    PP DK++  L S   T        ++ ++ RL+ T +  VA KCL+ +H ++K 
Subjt:  KLSGLIGLIKDKASQSK---AALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILK-

Query:  HGGFILQDQL------SVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESL
          G+  +D L          +T G + LKL+D   NSS  + EL+ WV+WY QY++  LS + +LG         E++  +++++S+     +L+  + L
Subjt:  HGGFILQDQL------SVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESL

Query:  VGLIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKN
        V L E  + +P +     NK+V  +   +  DY SA++ + +R  E + R++     +  ELV VL++LE+CKE  S   S + + L+  FW L+ + K+
Subjt:  VGLIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKN

Query:  L
        +
Subjt:  L

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein4.3e-1735.71Show/hide
Query:  GLIKDKASQSKAALI-AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATA--AAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFILQ
        G +KD    +K  L+          +A+++AT H    PP D+HL  + +    +RA A  A  I  L  RL  T N  VALK L+ +H +L+ G    +
Subjt:  GLIKDKASQSKAALI-AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATA--AAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFILQ

Query:  DQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGF
        ++L  F   G    L+LS+F+D+SSPI+W+ S+WVR YA ++E  L   R+L +
Subjt:  DQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGF

AT4G02650.1 ENTH/ANTH/VHS superfamily protein6.2e-1635.37Show/hide
Query:  KLSGLIGLIKDKASQSKAALIAKPNILS-FQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGG
        KL   IG +KD+ S   A +  + + L+  ++A+++AT HD Y P  DK++  +L L   SR   +A +  L  RL  T N +VALK L+ +  +L  G 
Subjt:  KLSGLIGLIKDKASQSKAALIAKPNILS-FQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGG

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIE
           + ++  F    G   L +SDFRD S   SW+ S++VR YA Y++
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIE

AT4G40080.1 ENTH/ANTH/VHS superfamily protein2.0e-9151.42Show/hide
Query:  MVRTRKLSGLIGLIKDKASQSKAALIA---KPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVH
        M R    + LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP ++HLA +LS G  SRATA++A+E +M+RL +T ++ VALK L+ +H
Subjt:  MVRTRKLSGLIGLIKDKASQSKAALIA---KPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVH

Query:  HILKHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVG
        HI+KHG FILQDQLSVFP +GGRNYLKLS FRD  SP+ WELSSWVRWYA Y+E +LSTSRI+GFF++S+SS   +E+  E +S++ NSDLLR+ ++LVG
Subjt:  HILKHGGFILQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVG

Query:  LIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEI-LMDGFWGLIWETKNL
        L+EE  K P         L D+I   V +DY+S++ E+  R  EF +R + LSFG+++ELVC LKRLE CKE+ S      ++   +DGFWGL+ E K +
Subjt:  LIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEI-LMDGFWGLIWETKNL

Query:  IGETKENRDGTGAGGKLIMTTARMS---DSGRFMERANI-YRDSIRFGSERF
        IG  ++N    G   K I+   +     +S RF +R  I Y + +RF S RF
Subjt:  IGETKENRDGTGAGGKLIMTTARMS---DSGRFMERANI-YRDSIRFGSERF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein1.6e-3231.13Show/hide
Query:  LIGLIKDKASQSKAALI---AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P  PP+  +++A++S   ++   A AA    + RL+ T N+ VA K L+ +H ++K     
Subjt:  LIGLIKDKASQSKAALI---AKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIEETTKKP
         +D+        GRN LKL++F D SS ++ ELS W+RWY QY++ +    ++LG F     + +++ ++ +++S+     ++R T+SLV   E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIEETTKKP

Query:  HSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRL---SCLSFGE--SVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGETK
            +  NK+VD I   V +DY   ++ + VR+    +RL        G+    +   +L RL +CKE  S G+  +   L D FW L+ E      E K
Subjt:  HSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRL---SCLSFGE--SVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGETK

Query:  ENRDGTGAGGKLIMTTAR
         N+      G L+ TT +
Subjt:  ENRDGTGAGGKLIMTTAR

AT5G65370.1 ENTH/ANTH/VHS superfamily protein9.6e-3331.56Show/hide
Query:  KLSGLIGLIKDKASQSK---AALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILK-
        KL+ L G++KD+ASQ K     L +  N  +  LALL+AT+H    PP DK++  L S   T        ++ ++ RL+ T +  VA KCL+ +H ++K 
Subjt:  KLSGLIGLIKDKASQSK---AALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILK-

Query:  HGGFILQDQL------SVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESL
          G+  +D L          +T G + LKL+D   NSS  + EL+ WV+WY QY++  LS + +LG         E++  +++++S+     +L+  + L
Subjt:  HGGFILQDQL------SVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESL

Query:  VGLIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKN
        V L E  + +P +     NK+V  +   +  DY SA++ + +R  E + R++     +  ELV VL++LE+CKE  S   S + + L+  FW L+ + K+
Subjt:  VGLIEETTKKPHSLHLNANKLVDRILGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKN

Query:  L
        +
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGCACAAGAAAGCTCAGCGGCCTTATCGGACTCATCAAAGACAAGGCCTCTCAGAGCAAAGCCGCGCTAATCGCCAAGCCCAACATTCTCTCCTTCCAACTCGC
CCTCCTACGCGCCACCACCCACGACCCCTACGCGCCGCCCCACGACAAGCACCTCGCCGCCCTTCTCTCCCTCGGCAAAACCTCACGCGCCACCGCCGCCGCCGCCATCG
AAGTCCTCATGGACCGCCTCCAGAGCACCCACAACTCCGCCGTCGCCCTCAAGTGCCTCCTCGCCGTCCACCACATCCTCAAACACGGCGGCTTCATTCTCCAAGACCAG
CTCTCTGTTTTCCCCTTCACCGGCGGCAGAAATTATCTCAAACTCTCCGATTTCCGGGACAATTCCAGCCCGATTTCTTGGGAGCTTTCCTCTTGGGTCCGCTGGTACGC
CCAATACATCGAAACCGTCTTGTCCACCTCCCGAATTTTGGGCTTTTTCGTTGCCTCTTCCTCTTCAATTGAGGAGAGGGAGAAAAAATCAGAGCAGATTTCCGCCGTTT
TCAACTCCGATTTGCTCAGAGACACCGAATCTTTGGTGGGTTTAATCGAAGAAACCACCAAGAAGCCTCATTCTTTGCATCTCAACGCCAACAAATTGGTGGACCGGATT
CTCGGATTTGTCGCCGACGATTACTTGTCGGCTATGAAGGAAATTTCCGTCCGAGTTACTGAGTTCCACCAGCGCTTGAGTTGCCTGAGTTTCGGCGAATCGGTCGAGTT
GGTTTGCGTCTTGAAACGCCTCGAGGATTGTAAAGAGAAGCAATCGTTGGGTATTTCTGCAAAATATGAGATTTTGATGGATGGGTTCTGGGGCTTGATTTGGGAAACCA
AGAACTTGATTGGGGAGACCAAGGAAAATCGAGACGGCACCGGCGCCGGCGGTAAATTGATCATGACCACGGCCAGGATGAGCGACTCGGGCCGGTTTATGGAGCGAGCT
AATATTTACCGCGACTCGATCCGGTTCGGTTCGGAGAGGTTCGATTTCAACTGCAAACGGATTCCGGTCGTGGGCATAACGGAACCGTATTTAATTAAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGCACAAGAAAGCTCAGCGGCCTTATCGGACTCATCAAAGACAAGGCCTCTCAGAGCAAAGCCGCGCTAATCGCCAAGCCCAACATTCTCTCCTTCCAACTCGC
CCTCCTACGCGCCACCACCCACGACCCCTACGCGCCGCCCCACGACAAGCACCTCGCCGCCCTTCTCTCCCTCGGCAAAACCTCACGCGCCACCGCCGCCGCCGCCATCG
AAGTCCTCATGGACCGCCTCCAGAGCACCCACAACTCCGCCGTCGCCCTCAAGTGCCTCCTCGCCGTCCACCACATCCTCAAACACGGCGGCTTCATTCTCCAAGACCAG
CTCTCTGTTTTCCCCTTCACCGGCGGCAGAAATTATCTCAAACTCTCCGATTTCCGGGACAATTCCAGCCCGATTTCTTGGGAGCTTTCCTCTTGGGTCCGCTGGTACGC
CCAATACATCGAAACCGTCTTGTCCACCTCCCGAATTTTGGGCTTTTTCGTTGCCTCTTCCTCTTCAATTGAGGAGAGGGAGAAAAAATCAGAGCAGATTTCCGCCGTTT
TCAACTCCGATTTGCTCAGAGACACCGAATCTTTGGTGGGTTTAATCGAAGAAACCACCAAGAAGCCTCATTCTTTGCATCTCAACGCCAACAAATTGGTGGACCGGATT
CTCGGATTTGTCGCCGACGATTACTTGTCGGCTATGAAGGAAATTTCCGTCCGAGTTACTGAGTTCCACCAGCGCTTGAGTTGCCTGAGTTTCGGCGAATCGGTCGAGTT
GGTTTGCGTCTTGAAACGCCTCGAGGATTGTAAAGAGAAGCAATCGTTGGGTATTTCTGCAAAATATGAGATTTTGATGGATGGGTTCTGGGGCTTGATTTGGGAAACCA
AGAACTTGATTGGGGAGACCAAGGAAAATCGAGACGGCACCGGCGCCGGCGGTAAATTGATCATGACCACGGCCAGGATGAGCGACTCGGGCCGGTTTATGGAGCGAGCT
AATATTTACCGCGACTCGATCCGGTTCGGTTCGGAGAGGTTCGATTTCAACTGCAAACGGATTCCGGTCGTGGGCATAACGGAACCGTATTTAATTAAA
Protein sequenceShow/hide protein sequence
MVRTRKLSGLIGLIKDKASQSKAALIAKPNILSFQLALLRATTHDPYAPPHDKHLAALLSLGKTSRATAAAAIEVLMDRLQSTHNSAVALKCLLAVHHILKHGGFILQDQ
LSVFPFTGGRNYLKLSDFRDNSSPISWELSSWVRWYAQYIETVLSTSRILGFFVASSSSIEEREKKSEQISAVFNSDLLRDTESLVGLIEETTKKPHSLHLNANKLVDRI
LGFVADDYLSAMKEISVRVTEFHQRLSCLSFGESVELVCVLKRLEDCKEKQSLGISAKYEILMDGFWGLIWETKNLIGETKENRDGTGAGGKLIMTTARMSDSGRFMERA
NIYRDSIRFGSERFDFNCKRIPVVGITEPYLIK