; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033793 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033793
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionENTH domain-containing protein
Genome locationscaffold13:34100048..34101136
RNA-Seq ExpressionSpg033793
SyntenySpg033793
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598708.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]3.4e-13374.29Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M IDQ KK KNLI A KDQASIIKATFSIHRRSSSIKVAVVRATTH   NPPSD R+AA+LA GNDFRSSTAF CI+ALM+RLHTT+SAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-DLSGELDVLVGFVER
        II IRGPFNLR +VAF P+YGGRNFLNLSAFRDVSDSEMS+LS WVRWYA VVEHN    RKLD ILY RSRN EI + + RK+ +L  ELDVLVGF ER
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-DLSGELDVLVGFVER

Query:  ICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKER
        I EVPESLH+Q++DLVYEVVRLV+E+YRLVQREIWVRV EIG+RVE +S DELT+ V ILTR ENCRRK+SVLFVNRGKNE+LWELV  TKGKLVE++ R
Subjt:  ICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKER

Query:  KEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
                  M     ESTR WNPFVEPG L        P GPA LPLTVSTVG
Subjt:  KEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

XP_004152749.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]3.6e-15181.97Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M I QNKKL NL+HALKD+AS+IKATFSI+RRSSSIKVAVVRATTH   NPPSD RV+AVLALGNDFRSSTAFACIEALM RLHTTSSAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYA VVEHNVIV RKLD ILY RSRNCEI D++GR  KVDLS EL VLVGFVE
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE

Query:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE
        RICEVPESLHLQ+ DLVYEVVRLV++NYRLVQ+EIWVRVKEIG+RVE LS+DEL++LVGILTR ENCR K+SVLFVNRGK+E+ WELVK T+GKL E+K 
Subjt:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE

Query:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
         KEEKRMI+V    +SVESTR  NPFVEPGQL+WVP      GPALLPLTVSTVG
Subjt:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

XP_008445571.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]4.4e-15783.66Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M I Q+KKLKNL HALKD+ASIIKA FSI+RRSSSIKVAVVRATTH   NPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYA VVEHNVIV RKLD ILY RSRNCEI D+ GR  KVDL+ EL VLVGFVE
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE

Query:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE
        RICEVPESLHLQ+ DLVYEVVRLV+ENYRLVQREIWVRVKEIG+RVE LS+DEL++LVGIL R ENCR K+SVLFVNRGKNE+ WELVK TKGK+ E+K 
Subjt:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE

Query:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
         KEEKRM++V    DSVESTR WNPFVEPGQL+WVP GDGP+GPALLPLTVSTVG
Subjt:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

XP_022131457.1 putative clathrin assembly protein At4g40080 [Momordica charantia]2.1e-16283.66Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        MGIDQ KKLKNLI ALKD+ASIIKATFS HRRSSSIK+AVVRATTHDP NPPSD R+AAVLALGNDF  STA ACI+ +M RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE
        I+VIRGPF+LRDQV FCP+YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYA VVEHNVIV+R+LD ILYLRS NC+IEDK+G+  ++DL GELDVLVGFVE
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE

Query:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE
         ICE P+SLHLQ+N++VYEVVRLV+ENYRLVQREI VRV+ IGDR +SLSLDELTQLV ILTRFENCRRKL+VLFVNR KNEDLWELVKNTK KLVEQK+
Subjt:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE

Query:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
         KEEKRMI+VE+RA+SVE TR WNPFVEPGQLLWVP+GD PLGPALLPLTVSTVG
Subjt:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

XP_038884022.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]1.4e-16385.59Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M IDQNKKLKNL HALKD+ASIIKAT SI RRSSSIKVAVVRATTH   NPPSD RVAAVLALGNDFRSSTAFACIEALM+RLHTTSSAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEI-EDKEGRKVDLSGELDVLVGFVER
        IIVIRGPFNLRDQVA+ P YGGRNFLNLS FRDVSDSEM+DLSSWVRWYA VVE NVIV RKLD ILY RSRNCEI E++  RK+D+  EL+VLVGFVER
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEI-EDKEGRKVDLSGELDVLVGFVER

Query:  ICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKER
        ICEVPESL+LQ+ DLVYEVVRLV+ENYRLVQREIWVRVKEIGDRVESLSLDELT+LVGI+TR ENCRRKLSVLFVNRGKNE+ WELVK TKGKL E+K  
Subjt:  ICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKER

Query:  KEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
        KEEKRMI+VEM+A+S ESTR WNPFVEPGQLLWVP GDGP+GPALLPLTVSTVG
Subjt:  KEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

TrEMBL top hitse value%identityAlignment
A0A0A0LLA1 ENTH domain-containing protein1.8e-15181.97Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M I QNKKL NL+HALKD+AS+IKATFSI+RRSSSIKVAVVRATTH   NPPSD RV+AVLALGNDFRSSTAFACIEALM RLHTTSSAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYA VVEHNVIV RKLD ILY RSRNCEI D++GR  KVDLS EL VLVGFVE
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE

Query:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE
        RICEVPESLHLQ+ DLVYEVVRLV++NYRLVQ+EIWVRVKEIG+RVE LS+DEL++LVGILTR ENCR K+SVLFVNRGK+E+ WELVK T+GKL E+K 
Subjt:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE

Query:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
         KEEKRMI+V    +SVESTR  NPFVEPGQL+WVP      GPALLPLTVSTVG
Subjt:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

A0A1S3BDW7 putative clathrin assembly protein At4g400802.1e-15783.66Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M I Q+KKLKNL HALKD+ASIIKA FSI+RRSSSIKVAVVRATTH   NPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYA VVEHNVIV RKLD ILY RSRNCEI D+ GR  KVDL+ EL VLVGFVE
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE

Query:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE
        RICEVPESLHLQ+ DLVYEVVRLV+ENYRLVQREIWVRVKEIG+RVE LS+DEL++LVGIL R ENCR K+SVLFVNRGKNE+ WELVK TKGK+ E+K 
Subjt:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE

Query:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
         KEEKRM++V    DSVESTR WNPFVEPGQL+WVP GDGP+GPALLPLTVSTVG
Subjt:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

A0A5A7VCW1 Putative clathrin assembly protein2.1e-15783.66Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M I Q+KKLKNL HALKD+ASIIKA FSI+RRSSSIKVAVVRATTH   NPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE
        IIVIRGPFNLRDQV+F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYA VVEHNVIV RKLD ILY RSRNCEI D+ GR  KVDL+ EL VLVGFVE
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE

Query:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE
        RICEVPESLHLQ+ DLVYEVVRLV+ENYRLVQREIWVRVKEIG+RVE LS+DEL++LVGIL R ENCR K+SVLFVNRGKNE+ WELVK TKGK+ E+K 
Subjt:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE

Query:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
         KEEKRM++V    DSVESTR WNPFVEPGQL+WVP GDGP+GPALLPLTVSTVG
Subjt:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

A0A6J1BR25 putative clathrin assembly protein At4g400809.9e-16383.66Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        MGIDQ KKLKNLI ALKD+ASIIKATFS HRRSSSIK+AVVRATTHDP NPPSD R+AAVLALGNDF  STA ACI+ +M RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE
        I+VIRGPF+LRDQV FCP+YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYA VVEHNVIV+R+LD ILYLRS NC+IEDK+G+  ++DL GELDVLVGFVE
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGR--KVDLSGELDVLVGFVE

Query:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE
         ICE P+SLHLQ+N++VYEVVRLV+ENYRLVQREI VRV+ IGDR +SLSLDELTQLV ILTRFENCRRKL+VLFVNR KNEDLWELVKNTK KLVEQK+
Subjt:  RICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKE

Query:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
         KEEKRMI+VE+RA+SVE TR WNPFVEPGQLLWVP+GD PLGPALLPLTVSTVG
Subjt:  RKEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

A0A6J1K8Z0 putative clathrin assembly protein At4g400802.2e-13374.86Show/hide
Query:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH
        M IDQ KKLKNLI A KDQASIIKATFSIHRRSSSIKVAVVRATTH   NPPSD R+AA+LALGNDFRSSTAF CI+ALM+RLHTT+SAAVAMKSLFTLH
Subjt:  MGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLH

Query:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-DLSGELDVLVGFVER
        IIVIRGPFNLRD+VAF P+YGGRNFLNLSAFRDVSDSEMS+LS WVRWYA VVEH     RKLD ILY RSRN EI + + RK+ +L  ELDVL+GF+ER
Subjt:  IIVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-DLSGELDVLVGFVER

Query:  ICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKER
        I EVPESLH+Q++DLVYEVVRLV+E+YRLVQREI VRV EIG+RVE LS DELT+ V ILTR ENCRRK+SVLFVNRGKN++LWELV  TKGKLVE++ R
Subjt:  ICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKER

Query:  KEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG
                        ESTR WNPFVEPG L        PLGPALLPLTVSTVG
Subjt:  KEEKRMIVVEMRADSVESTRFWNPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026502.6e-1134.84Show/hide
Query:  NKKLKNLIHALKDQASIIKATFSIHRRSSS---IKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII
        + KLK  I A+KDQ S+  A   +  RSSS   +++AVV+AT HD   P  D  +  +L L +  R+  + AC+  L +RL+ T + +VA+K+L  +  +
Subjt:  NKKLKNLIHALKDQASIIKATFSIHRRSSS---IKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII

Query:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV
        +  G      ++ F    G R  LN+S FRD S S+  D S++VR YA  ++  +
Subjt:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV

Q8H0W9 Putative clathrin assembly protein At5g104101.3e-2130.99Show/hide
Query:  NLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP
        ++I   KD+ASI KA       S+++K   +A++++TT  P  PP+   V+AV++  N   +  AF+   A + RL  T +A VA KSL  +H ++    
Subjt:  NLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP

Query:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGI-LYLRSRNCEIEDKEGRKVDLSG----ELDVLVGFVERICE
         + RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L      L +   ++E+K+      +G    + D LV F E IC 
Subjt:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGI-LYLRSRNCEIEDKEGRKVDLSG----ELDVLVGFVERICE

Query:  VPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDR--------VESLSLDELTQLVGILTRFENCRRKLSVLFVN-RGKNEDLWELVKNTKGKL
         PE   + +N +V E+  LV+E+Y  + R + VR++ + +R        +  L L++ + L   L R   C+  LS LF   R   +D W LV+  K   
Subjt:  VPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDR--------VESLSLDELTQLVGILTRFENCRRKLSVLFVN-RGKNEDLWELVKNTKGKL

Query:  VEQKERKEEKRMI
          + E+K  K+MI
Subjt:  VEQKERKEEKRMI

Q8L936 Putative clathrin assembly protein At4g400801.2e-4336.31Show/hide
Query:  NLIHALKDQASIIKATF---SIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP
        +LI  +KD+AS  KA     +   ++ S  ++V+RATTHDP  PP +  +A +L+ G   R +TA + +E++M+RLHTT  A VA+KSL  +H IV  G 
Subjt:  NLIHALKDQASIIKATF---SIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP

Query:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-------DLSGELDVLVGFVERI
        F L+DQ++  P  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +    ++ S +  I  +E  ++       DL  E+D LVG +E  
Subjt:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-------DLSGELDVLVGFVERI

Query:  CEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLF---VNRGKNEDLWELVKNTKGKL--VE
        C++P+        L  ++ +LV E+Y     E++ R  E  +R  +LS  +  +LV  L R E+C+ +LS +      RG  +  W LV   KG +  +E
Subjt:  CEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLF---VNRGKNEDLWELVKNTKGKL--VE

Query:  QKERKEEKRMIVVEMRADSVESTRF
            + EK ++    R    ES RF
Subjt:  QKERKEEKRMIVVEMRADSVESTRF

Q9FKQ2 Putative clathrin assembly protein At5g653702.3e-1529.35Show/hide
Query:  KLKNLIHALKDQASIIKATFSIHRRSS----SIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMQRLHTTSSAAVAMKSLFT
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H   NPPSD  V         F  ST   C     ++A++ RL  T+   VA K L  
Subjt:  KLKNLIHALKDQASIIKATFSIHRRSS----SIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMQRLHTTSSAAVAMKSLFT

Query:  LHIIV-----IRGPFNLRDQV---AFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDK--EGRKVD---
        LH +V       G  +LR+ +         GG N L L+     S     +L+ WV+WY   ++  + +   L     ++ +N   EDK  E ++V    
Subjt:  LHIIV-----IRGPFNLRDQV---AFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDK--EGRKVD---

Query:  ---LSGELDVLVGFVERICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKN--E
           +  ++D LV   E I + P++   + N +V E+  L++++Y    R + +R +E+  RV      +  +LV +L + ENC+  LS  F  R K    
Subjt:  ---LSGELDVLVGFVERICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKN--E

Query:  DLWELVKNTK
        D W LV   K
Subjt:  DLWELVKNTK

Q9SA65 Putative clathrin assembly protein At1g030501.9e-0932.9Show/hide
Query:  NKKLKNLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII
        + K K  I A+KDQ S+  A   ++ RS+S+    VA+V+AT H+   P  +  +  +L+L   +  S   AC+  L +RL+ T    VA+K+L  +  +
Subjt:  NKKLKNLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII

Query:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV
        +  G      ++ F    G R  LN+S FRDVS S   D S++VR YA  ++  +
Subjt:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV

Arabidopsis top hitse value%identityAlignment
AT1G03050.1 ENTH/ANTH/VHS superfamily protein1.3e-1032.9Show/hide
Query:  NKKLKNLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII
        + K K  I A+KDQ S+  A   ++ RS+S+    VA+V+AT H+   P  +  +  +L+L   +  S   AC+  L +RL+ T    VA+K+L  +  +
Subjt:  NKKLKNLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII

Query:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV
        +  G      ++ F    G R  LN+S FRDVS S   D S++VR YA  ++  +
Subjt:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV

AT4G02650.1 ENTH/ANTH/VHS superfamily protein1.9e-1234.84Show/hide
Query:  NKKLKNLIHALKDQASIIKATFSIHRRSSS---IKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII
        + KLK  I A+KDQ S+  A   +  RSSS   +++AVV+AT HD   P  D  +  +L L +  R+  + AC+  L +RL+ T + +VA+K+L  +  +
Subjt:  NKKLKNLIHALKDQASIIKATFSIHRRSSS---IKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHII

Query:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV
        +  G      ++ F    G R  LN+S FRD S S+  D S++VR YA  ++  +
Subjt:  VIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNV

AT4G40080.1 ENTH/ANTH/VHS superfamily protein8.3e-4536.31Show/hide
Query:  NLIHALKDQASIIKATF---SIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP
        +LI  +KD+AS  KA     +   ++ S  ++V+RATTHDP  PP +  +A +L+ G   R +TA + +E++M+RLHTT  A VA+KSL  +H IV  G 
Subjt:  NLIHALKDQASIIKATF---SIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP

Query:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-------DLSGELDVLVGFVERI
        F L+DQ++  P  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +    ++ S +  I  +E  ++       DL  E+D LVG +E  
Subjt:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKV-------DLSGELDVLVGFVERI

Query:  CEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLF---VNRGKNEDLWELVKNTKGKL--VE
        C++P+        L  ++ +LV E+Y     E++ R  E  +R  +LS  +  +LV  L R E+C+ +LS +      RG  +  W LV   KG +  +E
Subjt:  CEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLF---VNRGKNEDLWELVKNTKGKL--VE

Query:  QKERKEEKRMIVVEMRADSVESTRF
            + EK ++    R    ES RF
Subjt:  QKERKEEKRMIVVEMRADSVESTRF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein8.9e-2330.99Show/hide
Query:  NLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP
        ++I   KD+ASI KA       S+++K   +A++++TT  P  PP+   V+AV++  N   +  AF+   A + RL  T +A VA KSL  +H ++    
Subjt:  NLIHALKDQASIIKATFSIHRRSSSIK---VAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHIIVIRGP

Query:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGI-LYLRSRNCEIEDKEGRKVDLSG----ELDVLVGFVERICE
         + RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L      L +   ++E+K+      +G    + D LV F E IC 
Subjt:  FNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGI-LYLRSRNCEIEDKEGRKVDLSG----ELDVLVGFVERICE

Query:  VPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDR--------VESLSLDELTQLVGILTRFENCRRKLSVLFVN-RGKNEDLWELVKNTKGKL
         PE   + +N +V E+  LV+E+Y  + R + VR++ + +R        +  L L++ + L   L R   C+  LS LF   R   +D W LV+  K   
Subjt:  VPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDR--------VESLSLDELTQLVGILTRFENCRRKLSVLFVN-RGKNEDLWELVKNTKGKL

Query:  VEQKERKEEKRMI
          + E+K  K+MI
Subjt:  VEQKERKEEKRMI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein1.6e-1629.35Show/hide
Query:  KLKNLIHALKDQASIIKATFSIHRRSS----SIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMQRLHTTSSAAVAMKSLFT
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H   NPPSD  V         F  ST   C     ++A++ RL  T+   VA K L  
Subjt:  KLKNLIHALKDQASIIKATFSIHRRSS----SIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMQRLHTTSSAAVAMKSLFT

Query:  LHIIV-----IRGPFNLRDQV---AFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDK--EGRKVD---
        LH +V       G  +LR+ +         GG N L L+     S     +L+ WV+WY   ++  + +   L     ++ +N   EDK  E ++V    
Subjt:  LHIIV-----IRGPFNLRDQV---AFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDK--EGRKVD---

Query:  ---LSGELDVLVGFVERICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKN--E
           +  ++D LV   E I + P++   + N +V E+  L++++Y    R + +R +E+  RV      +  +LV +L + ENC+  LS  F  R K    
Subjt:  ---LSGELDVLVGFVERICEVPESLHLQRNDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKN--E

Query:  DLWELVKNTK
        D W LV   K
Subjt:  DLWELVKNTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGCTTCTGTGAAAAAATATGCGATGGGCATAGACCAAAACAAGAAGCTCAAGAATCTCATACACGCTCTTAAAGATCAGGCCTCAATAATCAAAGCCACTTTCTC
GATTCATCGCCGATCATCTTCCATTAAAGTCGCCGTCGTCCGCGCCACCACCCACGACCCCGGAAACCCCCCTTCCGACGGCCGAGTCGCCGCCGTGCTCGCCCTTGGAA
ATGACTTCCGTTCCTCAACTGCATTCGCCTGCATCGAAGCGCTTATGCAGCGTCTTCACACAACTTCCAGCGCCGCCGTGGCCATGAAATCGCTTTTCACTCTGCATATA
ATCGTAATTCGAGGTCCGTTCAACCTGAGGGATCAAGTGGCGTTTTGCCCCTTTTACGGAGGGAGAAACTTTCTAAACTTGTCCGCGTTTCGCGACGTATCGGACTCGGA
GATGAGCGACTTGTCGTCTTGGGTGAGATGGTACGCGGCCGTCGTGGAGCATAACGTGATCGTCCAGAGGAAATTGGATGGGATTCTGTATCTGCGTTCAAGAAATTGCG
AAATCGAAGATAAAGAAGGGAGGAAGGTTGATTTATCGGGGGAATTGGACGTTCTTGTGGGTTTTGTGGAGCGGATTTGCGAAGTTCCCGAGTCGTTGCATCTTCAGAGG
AACGATTTGGTTTACGAGGTGGTGAGATTGGTTATGGAGAATTACAGGTTGGTTCAACGGGAGATTTGGGTCCGAGTTAAGGAAATCGGAGACAGAGTGGAGAGTTTGAG
TCTGGACGAGTTGACTCAGTTGGTGGGTATCTTGACGCGGTTCGAGAATTGCAGAAGGAAACTCAGTGTGTTGTTTGTGAACAGAGGGAAGAACGAGGATTTGTGGGAAT
TGGTGAAGAACACGAAAGGGAAACTGGTGGAGCAGAAGGAAAGGAAAGAGGAGAAGAGGATGATCGTGGTGGAGATGAGAGCGGACTCGGTTGAGTCGACTCGGTTCTGG
AACCCGTTTGTTGAACCGGGTCAGTTGCTGTGGGTCCCAACCGGTGATGGACCGCTGGGCCCGGCTCTGCTTCCACTGACCGTTTCAACGGTAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGATTGCTTCTGTGAAAAAATATGCGATGGGCATAGACCAAAACAAGAAGCTCAAGAATCTCATACACGCTCTTAAAGATCAGGCCTCAATAATCAAAGCCACTTTCTC
GATTCATCGCCGATCATCTTCCATTAAAGTCGCCGTCGTCCGCGCCACCACCCACGACCCCGGAAACCCCCCTTCCGACGGCCGAGTCGCCGCCGTGCTCGCCCTTGGAA
ATGACTTCCGTTCCTCAACTGCATTCGCCTGCATCGAAGCGCTTATGCAGCGTCTTCACACAACTTCCAGCGCCGCCGTGGCCATGAAATCGCTTTTCACTCTGCATATA
ATCGTAATTCGAGGTCCGTTCAACCTGAGGGATCAAGTGGCGTTTTGCCCCTTTTACGGAGGGAGAAACTTTCTAAACTTGTCCGCGTTTCGCGACGTATCGGACTCGGA
GATGAGCGACTTGTCGTCTTGGGTGAGATGGTACGCGGCCGTCGTGGAGCATAACGTGATCGTCCAGAGGAAATTGGATGGGATTCTGTATCTGCGTTCAAGAAATTGCG
AAATCGAAGATAAAGAAGGGAGGAAGGTTGATTTATCGGGGGAATTGGACGTTCTTGTGGGTTTTGTGGAGCGGATTTGCGAAGTTCCCGAGTCGTTGCATCTTCAGAGG
AACGATTTGGTTTACGAGGTGGTGAGATTGGTTATGGAGAATTACAGGTTGGTTCAACGGGAGATTTGGGTCCGAGTTAAGGAAATCGGAGACAGAGTGGAGAGTTTGAG
TCTGGACGAGTTGACTCAGTTGGTGGGTATCTTGACGCGGTTCGAGAATTGCAGAAGGAAACTCAGTGTGTTGTTTGTGAACAGAGGGAAGAACGAGGATTTGTGGGAAT
TGGTGAAGAACACGAAAGGGAAACTGGTGGAGCAGAAGGAAAGGAAAGAGGAGAAGAGGATGATCGTGGTGGAGATGAGAGCGGACTCGGTTGAGTCGACTCGGTTCTGG
AACCCGTTTGTTGAACCGGGTCAGTTGCTGTGGGTCCCAACCGGTGATGGACCGCTGGGCCCGGCTCTGCTTCCACTGACCGTTTCAACGGTAGGATAG
Protein sequenceShow/hide protein sequence
MIASVKKYAMGIDQNKKLKNLIHALKDQASIIKATFSIHRRSSSIKVAVVRATTHDPGNPPSDGRVAAVLALGNDFRSSTAFACIEALMQRLHTTSSAAVAMKSLFTLHI
IVIRGPFNLRDQVAFCPFYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAAVVEHNVIVQRKLDGILYLRSRNCEIEDKEGRKVDLSGELDVLVGFVERICEVPESLHLQR
NDLVYEVVRLVMENYRLVQREIWVRVKEIGDRVESLSLDELTQLVGILTRFENCRRKLSVLFVNRGKNEDLWELVKNTKGKLVEQKERKEEKRMIVVEMRADSVESTRFW
NPFVEPGQLLWVPTGDGPLGPALLPLTVSTVG