; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023028 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023028
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionENTH domain-containing protein
Genome locationtig00000729:2116330..2117403
RNA-Seq ExpressionSgr023028
SyntenySgr023028
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152749.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]1.1e-13674.17Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M + Q KKL NL+ ALKDKAS+IKATFSI+RRSSSI+VAVVRATTH A NPPS+ RV+AVLALGNDFR STA ACIEAL +RLH+TS+AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIED--KEGKISGGVNMDLSGELDVL
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEMSD SSWVRWYA VVE NVIV RKLD++LYFRSRN EI++  ++GK+      DLS EL VL
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIED--KEGKISGGVNMDLSGELDVL

Query:  VGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKL
        VGFVERI EVPESLHLQ+ DLVYEVVRLV++NYRLVQ+EI +RV+EIG+R E L++DEL++LVG LTR ENCR +V++LFVNRGK+E+FWEL+KKT+ KL
Subjt:  VGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKL

Query:  VEEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
         E+K++KEEKR+IMV    +SVESTR  NPFVEPGQL+W+P      GPALLPLTVSTVG
Subjt:  VEEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

XP_008445571.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]2.3e-14275.98Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M + Q KKLKNL  ALKDKASIIKA FSI+RRSSSI+VAVVRATTH A NPPS+ RVAAVLALGNDFR STA ACIEAL +RLH+TS+AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEM+D SSWVRWYA VVE NVIV RKLD++LYFRSRN EI++   K      +DL+ EL VLVG
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG

Query:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE
        FVERI EVPESLHLQ+ DLVYEVVRLV+ENYRLVQREI +RV+EIG+R E L++DEL++LVG L R ENCR +V++LFVNRGKNE+FWEL+K TK K+ E
Subjt:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE

Query:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        +K++KEEKR++MV    DSVESTR WNPFVEPGQL+W+P GDGP+GPALLPLTVSTVG
Subjt:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

XP_022131457.1 putative clathrin assembly protein At4g40080 [Momordica charantia]2.3e-15881.28Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDF-RSTACACIEALTDRLHSTSNAAVAMKSLFTLH
        MG+DQ+KKLKNLIDALKDKASIIKATFS HRRSSSI++AVVRATTHD SNPPS+ R+AAVLALGNDF RSTA ACI+ + DRLH+TS+A VAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDF-RSTACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG
        I+VIRGPF+LRDQV FCPYYGGRNFLNLSAFRDVSDSEMSD SSWVRWYA VVE NVIV R+LD++LY RS N +IEDK+GKIS    +DL GELDVLVG
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG

Query:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE
        FVE I E P+SLHLQ+N++VYEVVRLV+ENYRLVQREIS+RVR IGDRA+SL+LDELTQLV  LTRFENCRR++T+LFVNR KNED WEL+K TKAKLVE
Subjt:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE

Query:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        +KQMKEEKR+IMVEIRA+SVE TR WNPFVEPGQLLW+PSGD PLGPALLPLTVSTVG
Subjt:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

XP_023546879.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]5.4e-12369.64Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M +DQ KKLKNLIDA KD+ASIIKATFSIHRRSSSI+VAVVRATTH A NPPS+ R+AA+LALGNDFR STA  CI+AL +RLH+T++AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEI-EDKEGKISGGVNMDLSGELDVLV
        II IRGPFNL+ +VAF PYYGGRN+LNLSAFRDVSDSEMS+ S WVRWYA VVE N    RKLD++LYFRSRN EI E K+ KI      +L  ELDVLV
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEI-EDKEGKISGGVNMDLSGELDVLV

Query:  GFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLV
        GF+ERISEVPESLH+Q++DLVYEVVRLV+E+YRLVQREI +RV EIG+R E L+ DELT+ V  LTR ENCRR+V++LFVNRGKNE+ WEL+  TK KLV
Subjt:  GFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLV

Query:  EEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        E +     +R+ M        ESTR WNPF+EPG L        PLGPALLPLTVSTVG
Subjt:  EEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

XP_038884022.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]1.8e-14777.37Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M +DQ KKLKNL  ALKDKASIIKAT SI RRSSSI+VAVVRATTH + NPPS+ RVAAVLALGNDFR STA ACIEAL +RLH+TS+AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG
        IIVIRGPFNLRDQVA+ P YGGRNFLNLS FRDVSDSEM+D SSWVRWYA VVE NVIV RKLD++LYFRSRN EI +++ K      +D+  EL+VLVG
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG

Query:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE
        FVERI EVPESL+LQ+ DLVYEVVRLV+ENYRLVQREI +RV+EIGDR ESL+LDELT+LVG +TR ENCRR++++LFVNRGKNE+FWEL+K TK KL E
Subjt:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE

Query:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        +K+MKEEKR+IMVE++A+S ESTR WNPFVEPGQLLW+P+GDGP+GPALLPLTVSTVG
Subjt:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

TrEMBL top hitse value%identityAlignment
A0A0A0LLA1 ENTH domain-containing protein5.4e-13774.17Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M + Q KKL NL+ ALKDKAS+IKATFSI+RRSSSI+VAVVRATTH A NPPS+ RV+AVLALGNDFR STA ACIEAL +RLH+TS+AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIED--KEGKISGGVNMDLSGELDVL
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEMSD SSWVRWYA VVE NVIV RKLD++LYFRSRN EI++  ++GK+      DLS EL VL
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIED--KEGKISGGVNMDLSGELDVL

Query:  VGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKL
        VGFVERI EVPESLHLQ+ DLVYEVVRLV++NYRLVQ+EI +RV+EIG+R E L++DEL++LVG LTR ENCR +V++LFVNRGK+E+FWEL+KKT+ KL
Subjt:  VGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKL

Query:  VEEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
         E+K++KEEKR+IMV    +SVESTR  NPFVEPGQL+W+P      GPALLPLTVSTVG
Subjt:  VEEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

A0A1S3BDW7 putative clathrin assembly protein At4g400801.1e-14275.98Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M + Q KKLKNL  ALKDKASIIKA FSI+RRSSSI+VAVVRATTH A NPPS+ RVAAVLALGNDFR STA ACIEAL +RLH+TS+AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEM+D SSWVRWYA VVE NVIV RKLD++LYFRSRN EI++   K      +DL+ EL VLVG
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG

Query:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE
        FVERI EVPESLHLQ+ DLVYEVVRLV+ENYRLVQREI +RV+EIG+R E L++DEL++LVG L R ENCR +V++LFVNRGKNE+FWEL+K TK K+ E
Subjt:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE

Query:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        +K++KEEKR++MV    DSVESTR WNPFVEPGQL+W+P GDGP+GPALLPLTVSTVG
Subjt:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

A0A5A7VCW1 Putative clathrin assembly protein1.1e-14275.98Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M + Q KKLKNL  ALKDKASIIKA FSI+RRSSSI+VAVVRATTH A NPPS+ RVAAVLALGNDFR STA ACIEAL +RLH+TS+AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEM+D SSWVRWYA VVE NVIV RKLD++LYFRSRN EI++   K      +DL+ EL VLVG
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG

Query:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE
        FVERI EVPESLHLQ+ DLVYEVVRLV+ENYRLVQREI +RV+EIG+R E L++DEL++LVG L R ENCR +V++LFVNRGKNE+FWEL+K TK K+ E
Subjt:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE

Query:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        +K++KEEKR++MV    DSVESTR WNPFVEPGQL+W+P GDGP+GPALLPLTVSTVG
Subjt:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

A0A6J1BR25 putative clathrin assembly protein At4g400801.1e-15881.28Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDF-RSTACACIEALTDRLHSTSNAAVAMKSLFTLH
        MG+DQ+KKLKNLIDALKDKASIIKATFS HRRSSSI++AVVRATTHD SNPPS+ R+AAVLALGNDF RSTA ACI+ + DRLH+TS+A VAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDF-RSTACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG
        I+VIRGPF+LRDQV FCPYYGGRNFLNLSAFRDVSDSEMSD SSWVRWYA VVE NVIV R+LD++LY RS N +IEDK+GKIS    +DL GELDVLVG
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVG

Query:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE
        FVE I E P+SLHLQ+N++VYEVVRLV+ENYRLVQREIS+RVR IGDRA+SL+LDELTQLV  LTRFENCRR++T+LFVNR KNED WEL+K TKAKLVE
Subjt:  FVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVE

Query:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        +KQMKEEKR+IMVEIRA+SVE TR WNPFVEPGQLLW+PSGD PLGPALLPLTVSTVG
Subjt:  EKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

A0A6J1K8Z0 putative clathrin assembly protein At4g400804.4e-12369.64Show/hide
Query:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH
        M +DQ KKLKNLIDA KD+ASIIKATFSIHRRSSSI+VAVVRATTH A NPPS+ R+AA+LALGNDFR STA  CI+AL +RLH+T++AAVAMKSLFTLH
Subjt:  MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFR-STACACIEALTDRLHSTSNAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEI-EDKEGKISGGVNMDLSGELDVLV
        IIVIRGPFNLRD+VAF PYYGGRNFLNLSAFRDVSDSEMS+ S WVRWYA VVE      RKLD +LYFRSRN EI E K+ KI      +L  ELDVL+
Subjt:  IIVIRGPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEI-EDKEGKISGGVNMDLSGELDVLV

Query:  GFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLV
        GF+ERISEVPESLH+Q++DLVYEVVRLV+E+YRLVQREI +RV EIG+R E L+ DELT+ V  LTR ENCRR+V++LFVNRGKN++ WEL+  TK KLV
Subjt:  GFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLV

Query:  EEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG
        E ++                 ESTR WNPFVEPG L        PLGPALLPLTVSTVG
Subjt:  EEKQMKEEKRLIMVEIRADSVESTRFWNPFVEPGQLLWIPSGDGPLGPALLPLTVSTVG

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026502.1e-1333.7Show/hide
Query:  KLKNLIDALKDKASIIKATFSIHRRSSS---IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR
        KLK  I A+KD+ S+  A   +  RSSS   + +AVV+AT HD   P  +  +  +L L +  R+   AC+  L+ RL+ T N +VA+K+L  +  ++  
Subjt:  KLKNLIDALKDKASIIKATFSIHRRSSS---IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR

Query:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGG
        G      ++ F    G R  LN+S FRD S S+  D+S++VR YA            LD+ L +R +    + K G   GG
Subjt:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGG

Q8H0W9 Putative clathrin assembly protein At5g104103.7e-1828.88Show/hide
Query:  NLIDALKDKASIIKATFSIHRRSSS----IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGP
        ++I   KDKASI KA   +H   S+    I +A++++TT   + PP+ + V+AV++  N     A A   A   RL  T NA VA KSL  +H ++    
Subjt:  NLIDALKDKASIIKATFSIHRRSSS----IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGP

Query:  FNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISE
         + RD+  F     GRN L L+ F D S +   + S W+RWY   +++   V + L           +  +++ ++S      +  + D LV F E I  
Subjt:  FNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISE

Query:  VPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDR--------AESLNLDELTQLVGALTRFENCRRRVTLLFVN-RGKNEDFWELIKKTKAKL
         PE   + +N +V E+  LV+E+Y  + R + +R++ + +R           L L++ + L   L R   C+  ++ LF   R   +DFW L++  KA+ 
Subjt:  VPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDR--------AESLNLDELTQLVGALTRFENCRRRVTLLFVN-RGKNEDFWELIKKTKAKL

Query:  VEE--KQMKEEKRLIMVEIRAD
         ++  KQM E   L+   ++ D
Subjt:  VEE--KQMKEEKRLIMVEIRAD

Q8L936 Putative clathrin assembly protein At4g400802.6e-4335.4Show/hide
Query:  NLIDALKDKASIIKATF---SIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGPF
        +LI  +KDKAS  KA     +   ++ S  ++V+RATTHD S PP    +A +L+ G   R+TA + +E++ +RLH+T +A VA+KSL  +H IV  G F
Subjt:  NLIDALKDKASIIKATF---SIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGPF

Query:  NLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISEV
         L+DQ++  P  GGRN+L LSAFRD     M + SSWVRWYA  +E  +   R +   +   S     E+ E  +S   N DL  E+D LVG +E   ++
Subjt:  NLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISEV

Query:  PESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLF---VNRGKNEDFWELIKKTKAKL--VEEKQ
        P+        L  ++ +LV E+Y     E+  R  E  +R+ +L+  +  +LV AL R E+C+ R++ +      RG  + FW L+ + K  +  +E+  
Subjt:  PESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLF---VNRGKNEDFWELIKKTKAKL--VEEKQ

Query:  MKEEKRLIMVEIRADSVESTRF
         + EK ++    R    ES RF
Subjt:  MKEEKRLIMVEIRADSVESTRF

Q9FKQ2 Putative clathrin assembly protein At5g653702.5e-1427.54Show/hide
Query:  KLKNLIDALKDKASIIKATFSIHRRSS----SIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACAC-----IEALTDRLHSTSNAAVAMKSLFTL
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H ++NPPS+  V  +       +ST   C     ++A+  RL  T++  VA K L  L
Subjt:  KLKNLIDALKDKASIIKATFSIHRRSS----SIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACAC-----IEALTDRLHSTSNAAVAMKSLFTL

Query:  HIIV-----IRGPFNLRDQV--AFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLS
        H +V       G  +LR+ +      Y  G + L L+     S     + + WV+WY   ++  + +   L      + +N +   +  ++S      + 
Subjt:  HIIV-----IRGPFNLRDQV--AFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLS

Query:  GELDVLVGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKN--EDFWEL
         ++D LV   E IS+ P++   + N +V E+  L++++Y    R + IR  E+  R    N     +LV  L + ENC+  ++  F  R K    DFW L
Subjt:  GELDVLVGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKN--EDFWEL

Query:  IKKTK
        + K K
Subjt:  IKKTK

Q9SA65 Putative clathrin assembly protein At1g030504.4e-1134.21Show/hide
Query:  KLKNLIDALKDKASIIKATFSIHRRSSSIR---VAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR
        K K  I A+KD+ S+  A   ++ RS+S+    VA+V+AT H+   P  E  +  +L+L +  RS   AC+  L+ RL+ T    VA+K+L  +  ++  
Subjt:  KLKNLIDALKDKASIIKATFSIHRRSSSIR---VAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR

Query:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNV
        G      ++ F    G R  LN+S FRDVS S   D+S++VR YA  +++ +
Subjt:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNV

Arabidopsis top hitse value%identityAlignment
AT1G03050.1 ENTH/ANTH/VHS superfamily protein3.1e-1234.21Show/hide
Query:  KLKNLIDALKDKASIIKATFSIHRRSSSIR---VAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR
        K K  I A+KD+ S+  A   ++ RS+S+    VA+V+AT H+   P  E  +  +L+L +  RS   AC+  L+ RL+ T    VA+K+L  +  ++  
Subjt:  KLKNLIDALKDKASIIKATFSIHRRSSSIR---VAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR

Query:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNV
        G      ++ F    G R  LN+S FRDVS S   D+S++VR YA  +++ +
Subjt:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNV

AT4G02650.1 ENTH/ANTH/VHS superfamily protein1.5e-1433.7Show/hide
Query:  KLKNLIDALKDKASIIKATFSIHRRSSS---IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR
        KLK  I A+KD+ S+  A   +  RSSS   + +AVV+AT HD   P  +  +  +L L +  R+   AC+  L+ RL+ T N +VA+K+L  +  ++  
Subjt:  KLKNLIDALKDKASIIKATFSIHRRSSS---IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIR

Query:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGG
        G      ++ F    G R  LN+S FRD S S+  D+S++VR YA            LD+ L +R +    + K G   GG
Subjt:  GPFNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGG

AT4G40080.1 ENTH/ANTH/VHS superfamily protein1.8e-4435.4Show/hide
Query:  NLIDALKDKASIIKATF---SIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGPF
        +LI  +KDKAS  KA     +   ++ S  ++V+RATTHD S PP    +A +L+ G   R+TA + +E++ +RLH+T +A VA+KSL  +H IV  G F
Subjt:  NLIDALKDKASIIKATF---SIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGPF

Query:  NLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISEV
         L+DQ++  P  GGRN+L LSAFRD     M + SSWVRWYA  +E  +   R +   +   S     E+ E  +S   N DL  E+D LVG +E   ++
Subjt:  NLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISEV

Query:  PESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLF---VNRGKNEDFWELIKKTKAKL--VEEKQ
        P+        L  ++ +LV E+Y     E+  R  E  +R+ +L+  +  +LV AL R E+C+ R++ +      RG  + FW L+ + K  +  +E+  
Subjt:  PESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLF---VNRGKNEDFWELIKKTKAKL--VEEKQ

Query:  MKEEKRLIMVEIRADSVESTRF
         + EK ++    R    ES RF
Subjt:  MKEEKRLIMVEIRADSVESTRF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein2.7e-1928.88Show/hide
Query:  NLIDALKDKASIIKATFSIHRRSSS----IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGP
        ++I   KDKASI KA   +H   S+    I +A++++TT   + PP+ + V+AV++  N     A A   A   RL  T NA VA KSL  +H ++    
Subjt:  NLIDALKDKASIIKATFSIHRRSSS----IRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGP

Query:  FNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISE
         + RD+  F     GRN L L+ F D S +   + S W+RWY   +++   V + L           +  +++ ++S      +  + D LV F E I  
Subjt:  FNLRDQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISE

Query:  VPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDR--------AESLNLDELTQLVGALTRFENCRRRVTLLFVN-RGKNEDFWELIKKTKAKL
         PE   + +N +V E+  LV+E+Y  + R + +R++ + +R           L L++ + L   L R   C+  ++ LF   R   +DFW L++  KA+ 
Subjt:  VPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDR--------AESLNLDELTQLVGALTRFENCRRRVTLLFVN-RGKNEDFWELIKKTKAKL

Query:  VEE--KQMKEEKRLIMVEIRAD
         ++  KQM E   L+   ++ D
Subjt:  VEE--KQMKEEKRLIMVEIRAD

AT5G65370.1 ENTH/ANTH/VHS superfamily protein1.8e-1527.54Show/hide
Query:  KLKNLIDALKDKASIIKATFSIHRRSS----SIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACAC-----IEALTDRLHSTSNAAVAMKSLFTL
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H ++NPPS+  V  +       +ST   C     ++A+  RL  T++  VA K L  L
Subjt:  KLKNLIDALKDKASIIKATFSIHRRSS----SIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACAC-----IEALTDRLHSTSNAAVAMKSLFTL

Query:  HIIV-----IRGPFNLRDQV--AFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLS
        H +V       G  +LR+ +      Y  G + L L+     S     + + WV+WY   ++  + +   L      + +N +   +  ++S      + 
Subjt:  HIIV-----IRGPFNLRDQV--AFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLS

Query:  GELDVLVGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKN--EDFWEL
         ++D LV   E IS+ P++   + N +V E+  L++++Y    R + IR  E+  R    N     +LV  L + ENC+  ++  F  R K    DFW L
Subjt:  GELDVLVGFVERISEVPESLHLQRNDLVYEVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKN--EDFWEL

Query:  IKKTK
        + K K
Subjt:  IKKTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGTAGACCAACGGAAGAAGCTCAAGAACCTTATAGACGCTCTCAAAGACAAGGCCTCCATAATCAAAGCCACCTTCTCGATTCATCGTCGTTCCTCTTCCATCAG
GGTCGCCGTCGTCCGCGCCACCACTCACGACGCTTCCAATCCCCCTTCCGAAAACCGCGTTGCCGCCGTCCTAGCCCTTGGGAATGACTTCCGTTCCACTGCATGCGCCT
GCATCGAAGCCCTCACGGACCGTCTTCATTCCACCTCCAACGCCGCCGTAGCCATGAAATCGCTTTTCACTCTACATATTATTGTAATCCGAGGCCCGTTCAATCTCAGG
GATCAGGTTGCCTTCTGCCCCTATTACGGAGGCCGAAATTTTCTCAACTTGTCCGCATTTCGCGACGTATCGGACTCGGAGATGAGCGACTGGTCTTCTTGGGTGAGATG
GTATGCCGCCGTTGTGGAGCAGAACGTGATTGTTTGGAGGAAATTGGATCAGCTTTTATATTTCCGTTCAAGAAATTACGAAATCGAAGATAAAGAAGGCAAGATTTCGG
GGGGAGTGAATATGGATTTATCGGGGGAGTTGGATGTTCTTGTGGGTTTCGTCGAGCGAATTAGCGAAGTTCCTGAGTCGTTGCATCTGCAGAGGAACGACTTGGTTTAC
GAGGTGGTGAGGTTGGTGATGGAGAATTACAGGTTGGTTCAGCGTGAGATTTCGATCCGAGTTAGGGAAATCGGAGACAGAGCGGAGAGTTTGAATCTGGACGAGTTGAC
GCAGTTGGTGGGTGCCTTGACGCGGTTTGAGAATTGCAGAAGAAGAGTGACGTTGCTGTTTGTGAACAGAGGAAAGAACGAGGATTTTTGGGAATTGATAAAGAAAACGA
AAGCAAAACTGGTGGAGGAGAAGCAAATGAAGGAGGAGAAGAGGTTGATCATGGTGGAAATCAGAGCGGACTCGGTCGAGTCGACTCGGTTCTGGAATCCGTTTGTGGAA
CCGGGTCAGTTGCTTTGGATCCCATCGGGCGATGGACCCTTGGGCCCGGCCCTGCTTCCACTGACCGTTTCAACGGTAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGTAGACCAACGGAAGAAGCTCAAGAACCTTATAGACGCTCTCAAAGACAAGGCCTCCATAATCAAAGCCACCTTCTCGATTCATCGTCGTTCCTCTTCCATCAG
GGTCGCCGTCGTCCGCGCCACCACTCACGACGCTTCCAATCCCCCTTCCGAAAACCGCGTTGCCGCCGTCCTAGCCCTTGGGAATGACTTCCGTTCCACTGCATGCGCCT
GCATCGAAGCCCTCACGGACCGTCTTCATTCCACCTCCAACGCCGCCGTAGCCATGAAATCGCTTTTCACTCTACATATTATTGTAATCCGAGGCCCGTTCAATCTCAGG
GATCAGGTTGCCTTCTGCCCCTATTACGGAGGCCGAAATTTTCTCAACTTGTCCGCATTTCGCGACGTATCGGACTCGGAGATGAGCGACTGGTCTTCTTGGGTGAGATG
GTATGCCGCCGTTGTGGAGCAGAACGTGATTGTTTGGAGGAAATTGGATCAGCTTTTATATTTCCGTTCAAGAAATTACGAAATCGAAGATAAAGAAGGCAAGATTTCGG
GGGGAGTGAATATGGATTTATCGGGGGAGTTGGATGTTCTTGTGGGTTTCGTCGAGCGAATTAGCGAAGTTCCTGAGTCGTTGCATCTGCAGAGGAACGACTTGGTTTAC
GAGGTGGTGAGGTTGGTGATGGAGAATTACAGGTTGGTTCAGCGTGAGATTTCGATCCGAGTTAGGGAAATCGGAGACAGAGCGGAGAGTTTGAATCTGGACGAGTTGAC
GCAGTTGGTGGGTGCCTTGACGCGGTTTGAGAATTGCAGAAGAAGAGTGACGTTGCTGTTTGTGAACAGAGGAAAGAACGAGGATTTTTGGGAATTGATAAAGAAAACGA
AAGCAAAACTGGTGGAGGAGAAGCAAATGAAGGAGGAGAAGAGGTTGATCATGGTGGAAATCAGAGCGGACTCGGTCGAGTCGACTCGGTTCTGGAATCCGTTTGTGGAA
CCGGGTCAGTTGCTTTGGATCCCATCGGGCGATGGACCCTTGGGCCCGGCCCTGCTTCCACTGACCGTTTCAACGGTAGGATAG
Protein sequenceShow/hide protein sequence
MGVDQRKKLKNLIDALKDKASIIKATFSIHRRSSSIRVAVVRATTHDASNPPSENRVAAVLALGNDFRSTACACIEALTDRLHSTSNAAVAMKSLFTLHIIVIRGPFNLR
DQVAFCPYYGGRNFLNLSAFRDVSDSEMSDWSSWVRWYAAVVEQNVIVWRKLDQLLYFRSRNYEIEDKEGKISGGVNMDLSGELDVLVGFVERISEVPESLHLQRNDLVY
EVVRLVMENYRLVQREISIRVREIGDRAESLNLDELTQLVGALTRFENCRRRVTLLFVNRGKNEDFWELIKKTKAKLVEEKQMKEEKRLIMVEIRADSVESTRFWNPFVE
PGQLLWIPSGDGPLGPALLPLTVSTVG