CuGenDBv2

HG10004471 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10004471
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: ENTH domain-containing protein
Genome location: Chr08:17389924..17390997
RNA-Seq Expression: HG10004471
Synteny: HG10004471
Gene Ontology terms:
GO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domains:
IPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain
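
As a quick consistency check on the Genome location field, the following Python sketch parses the coordinate string and computes the length of the locus. It assumes the coordinates are 1-based and inclusive, which is a common convention for records like this but is not stated on the page; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr08:17389924..17390997' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr08:17389924..17390997")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr08 17389924 17390997 1074
```

Under that assumption the locus spans 1074 bp, which can be compared with the length of the CDS listed under Sequences.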


Homology
GenBank top hits (e value, % identity, alignment)
KAG6598708.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 2.0e-130; identity 73.74%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAIDQ KK KNLI   KD+ASIIKATFSIHRRSSSIKVAVVRATTHG+RNPPSD R+AA+LA GNDFRSSTAF CI+ALMERLHTT+SAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS
        II IRGPFNLR +VAF P YGGRNFLNLSAFRDVSDSEM++LS WVRWYAGVVEHN    RKLDRILYFRSRN EIV    EG+ RKI +L EEL+VLV 
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS

Query:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE
        F ERI EVPESLH+QK DLVYEVV+LVLE+YRLVQREIWVRV EIG+RV+ +S DELTE V IL R+ENCRR +SVLFVNRGKNEE WELV  TKGKL E
Subjt:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE

Query:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
         +            M     ESTR WNPFVE G L          GPA LPLTVSTVG
Subjt:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_004152749.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]; e value 1.1e-157; identity 84.03%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI QNKKL NL+H LKDKAS+IKATFSI+RRSSSIKVAVVRATTHG+RNPPSD RV+AVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI ED   GRK K+DLSEEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVL+NYRLVQ+EIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGK+EEFWELVK T+GKL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRMIMV    +SVESTR  NPFVE GQL+WVP      GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_008445571.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]; e value 7.4e-165; identity 86.55%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI Q+KKLKNL H LKDKASIIKA FSI+RRSSSIKVAVVRATTHG+RNPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI E    GRK K+DL+EEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVLENYRLVQREIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGKNEEFWELVKITKGK+AE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRM+MV    DSVESTR WNPFVE GQL+WVP GDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_022131457.1 putative clathrin assembly protein At4g40080 [Momordica charantia]; e value 1.6e-146; identity 77.87%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        M IDQ KKLKNLI  LKDKASIIKATFS HRRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I  + K+G+  ++DL  EL+VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VE ICE P+SLHLQK ++VYEVV+LVLENYRLVQREI VRV+ IGDR DSLSLDELT+LV IL R ENCRR L+VLFVNR KNE+ WELVK TK KL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K MKEEKRMIMVE+RA+SVE TR WNPFVE GQLLWVP+GD  LGPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_038884022.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]; e value 3.0e-174; identity 91.04%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAIDQNKKLKNL H LKDKASIIKAT SI RRSSSIKVAVVRATTHGSRNPPSD RVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQVA+FPCYGGRNFLNLS FRDVSDSEMNDLSSWVRWYAGVVE NVIVDRKLDRILYFRSRNCEIVE   E RKRKID+ EELEVLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESL+LQKKDLVYEVV+LVLENYRLVQREIWVRVKEIGDRV+SLSLDELTELVGI+ RLENCRR LSVLFVNRGKNEEFWELVKITKGKLAE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K MKEEKRMIMVEM+A+S ESTR WNPFVE GQLLWVPAGDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LLA1 ENTH domain-containing protein; e value 5.6e-158; identity 84.03%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI QNKKL NL+H LKDKAS+IKATFSI+RRSSSIKVAVVRATTHG+RNPPSD RV+AVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI ED   GRK K+DLSEEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVL+NYRLVQ+EIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGK+EEFWELVK T+GKL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRMIMV    +SVESTR  NPFVE GQL+WVP      GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A1S3BDW7 putative clathrin assembly protein At4g40080; e value 3.6e-165; identity 86.55%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI Q+KKLKNL H LKDKASIIKA FSI+RRSSSIKVAVVRATTHG+RNPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI E    GRK K+DL+EEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVLENYRLVQREIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGKNEEFWELVKITKGK+AE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRM+MV    DSVESTR WNPFVE GQL+WVP GDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A5A7VCW1 Putative clathrin assembly protein; e value 3.6e-165; identity 86.55%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI Q+KKLKNL H LKDKASIIKA FSI+RRSSSIKVAVVRATTHG+RNPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI E    GRK K+DL+EEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVLENYRLVQREIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGKNEEFWELVKITKGK+AE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRM+MV    DSVESTR WNPFVE GQL+WVP GDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A6J1BR25 putative clathrin assembly protein At4g40080; e value 7.5e-147; identity 77.87%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        M IDQ KKLKNLI  LKDKASIIKATFS HRRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I  + K+G+  ++DL  EL+VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VE ICE P+SLHLQK ++VYEVV+LVLENYRLVQREI VRV+ IGDR DSLSLDELT+LV IL R ENCRR L+VLFVNR KNE+ WELVK TK KL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
        K MKEEKRMIMVE+RA+SVE TR WNPFVE GQLLWVP+GD  LGPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A6J1HF54 putative clathrin assembly protein At4g40080; e value 1.3e-130; identity 74.3%
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAIDQ KKLKNLI   KD+ASIIKATFSIHRRSSSIKVAVVRATTHG+RNPPSD R+AA+LA GNDFRSSTAF CI+ALMERLHTT+SAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS
        II IRGPFNL+ +VAF P YGGRNFLNLSAFRD+SDSEM++LS WVRWYAGVVEHN    RKLDRILYFRSRN EIV    EG+ RKI +L EEL+VLV 
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS

Query:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE
        F ERI EVPESLH+QK DLVYEVV+LVLE+YRLVQREIWVRV EIG+RV+ LS DELTE V IL R+ENCR  +SVLFVNRGKNEE WELV  TKGKL  
Subjt:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE

Query:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG
              E+R  M  M     ESTR WNPFVE G L         LGPA LPLTVSTVG
Subjt:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVETGQLLWVPAGDGHLGPAPLPLTVSTVG

SwissProt top hits (e value, % identity, alignment)
Q8GX47 Putative clathrin assembly protein At4g02650; e value 8.4e-10; identity 33.95%
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + KLK  I  +KD+ S+  A   +  RSSS   +++AVV+AT H    P  D  +  +L L +  R+  + AC+  L  RL+ T + +VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRD S S+  D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

Q8H0W9 Putative clathrin assembly protein At5g10410; e value 7.5e-19; identity 30.99%
Query:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        ++I   KDKASI KA       S+++K   +A++++TT     PP+   V+AV++  N   +  AF+   A + RL  T +A VA KSL  +H ++    
Subjt:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE
         + RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L           + VE+       +   +  + + LVSF E IC 
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE

Query:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL
         PE   + +  +V E+ +LV+E+Y  + R + VR++ + +R        +  L L++ + L   L RL  C+ +LS LF   R   ++FW LV++ K   
Subjt:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL

Query:  AETKGMKEEKRMI
        AET+  K  K+MI
Subjt:  AETKGMKEEKRMI

Q8L936 Putative clathrin assembly protein At4g40080; e value 7.5e-43; identity 37.5%
Query:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        +LI  +KDKAS  KA     +   ++ S  ++V+RATTH    PP +  +A +L+ G   R +TA + +E++MERLHTT  A VA+KSL  +H IV  G 
Subjt:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV
        F L+DQ++ FP  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +    +F S     +  HKE  +  +      DL  E++ LV  +
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV

Query:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA
        E  C++P+      K L  ++ +LV E+Y     E++ R  E  +R ++LS  +  ELV  L+RLE+C+  LS +      RG  + FW LV   KG + 
Subjt:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA

Query:  --ETKGMKEEKRMIMVEMRADSVESTRF
          E    + EK ++    R    ES RF
Subjt:  --ETKGMKEEKRMIMVEMRADSVESTRF

Q9FKQ2 Putative clathrin assembly protein At5g65370; e value 1.9e-14; identity 28.66%
Query:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H S NPPSD  V         F  ST   C     ++A++ RL  T+   VA K L  
Subjt:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT

Query:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-
        LH +V       G  +LR+ +         GG N L L+     S     +L+ WV+WY   ++  + +   L      + +N +   + +      +D 
Subjt:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-

Query:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW
        + ++++ LV   E I + P++   +   +V E+ +L++++Y    R + +R +E+  RV      +  ELV +L +LENC+  LS  F  R K    +FW
Subjt:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW

Query:  ELVKITK
         LV   K
Subjt:  ELVKITK

Q9SA65 Putative clathrin assembly protein At1g03050; e value 1.6e-08; identity 32.72%
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + K K  I  +KD+ S+  A   ++ RS+S+    VA+V+AT H    P  +  +  +L+L   +  S   AC+  L  RL+ T    VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRDVS S   D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

Arabidopsis top hits (e value, % identity, alignment)
AT1G03050.1 ENTH/ANTH/VHS superfamily protein; e value 1.1e-09; identity 32.72%
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + K K  I  +KD+ S+  A   ++ RS+S+    VA+V+AT H    P  +  +  +L+L   +  S   AC+  L  RL+ T    VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRDVS S   D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

AT4G02650.1 ENTH/ANTH/VHS superfamily protein; e value 5.9e-11; identity 33.95%
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + KLK  I  +KD+ S+  A   +  RSSS   +++AVV+AT H    P  D  +  +L L +  R+  + AC+  L  RL+ T + +VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRD S S+  D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

AT4G40080.1 ENTH/ANTH/VHS superfamily protein; e value 5.3e-44; identity 37.5%
Query:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        +LI  +KDKAS  KA     +   ++ S  ++V+RATTH    PP +  +A +L+ G   R +TA + +E++MERLHTT  A VA+KSL  +H IV  G 
Subjt:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV
        F L+DQ++ FP  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +    +F S     +  HKE  +  +      DL  E++ LV  +
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV

Query:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA
        E  C++P+      K L  ++ +LV E+Y     E++ R  E  +R ++LS  +  ELV  L+RLE+C+  LS +      RG  + FW LV   KG + 
Subjt:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA

Query:  --ETKGMKEEKRMIMVEMRADSVESTRF
          E    + EK ++    R    ES RF
Subjt:  --ETKGMKEEKRMIMVEMRADSVESTRF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein; e value 5.4e-20; identity 30.99%
Query:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        ++I   KDKASI KA       S+++K   +A++++TT     PP+   V+AV++  N   +  AF+   A + RL  T +A VA KSL  +H ++    
Subjt:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE
         + RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L           + VE+       +   +  + + LVSF E IC 
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE

Query:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL
         PE   + +  +V E+ +LV+E+Y  + R + VR++ + +R        +  L L++ + L   L RL  C+ +LS LF   R   ++FW LV++ K   
Subjt:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL

Query:  AETKGMKEEKRMI
        AET+  K  K+MI
Subjt:  AETKGMKEEKRMI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein; e value 1.4e-15; identity 28.66%
Query:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H S NPPSD  V         F  ST   C     ++A++ RL  T+   VA K L  
Subjt:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT

Query:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-
        LH +V       G  +LR+ +         GG N L L+     S     +L+ WV+WY   ++  + +   L      + +N +   + +      +D 
Subjt:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-

Query:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW
        + ++++ LV   E I + P++   +   +V E+ +L++++Y    R + +R +E+  RV      +  ELV +L +LENC+  LS  F  R K    +FW
Subjt:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW

Query:  ELVKITK
         LV   K
Subjt:  ELVKITK
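
The alignment blocks above place a match row between each Query and Subjt row: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The Python sketch below counts identities in one segment from that match row alone. The function name `count_identities` and the fixed 8-character label width are assumptions made for illustration, and the input is the final segment of the At5g10410 alignment as it appears on this page.

```python
PREFIX_WIDTH = 8  # width of the "Query:  " label that precedes each alignment row


def count_identities(query_row, match_row):
    """Return (identical, columns) for one alignment segment.

    The match row repeats the residue letter where query and subject agree,
    shows '+' for a conservative substitution, and a space otherwise.
    """
    query = query_row[PREFIX_WIDTH:]
    # Pad the match row: trailing spaces are often lost in plain-text copies.
    match = match_row[PREFIX_WIDTH:].ljust(len(query))
    identical = sum(1 for ch in match if ch.isalpha())
    return identical, len(query)


# Final segment of the alignment against At5g10410, copied from this page.
query_row = "Query:  AETKGMKEEKRMI"
match_row = "        AET+  K  K+MI"
identical, columns = count_identities(query_row, match_row)
print(f"{identical}/{columns} identical = {100 * identical / columns:.2f}%")  # 7/13 identical = 53.85%
```

Summing the two counts over every segment of a hit should approximate the reported % identity, assuming that value is defined as identical positions divided by alignment columns; the page does not state the definition.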


Sequences
CDS sequence
ATGGCCATAGACCAAAACAAGAAGCTCAAGAATCTCATACACACTCTCAAAGACAAGGCCTCAATAATCAAAGCTACTTTCTCGATTCATCGCCGTTCATCTTCCATCAA
AGTCGCCGTCGTCCGGGCCACCACCCACGGTTCTCGAAACCCACCTTCCGACGGCCGAGTCGCCGCCGTGCTAGCCCTTGGGAATGACTTCCGTTCCTCGACTGCATTCG
CCTGCATCGAAGCCCTAATGGAGCGGCTTCATACGACCTCTAGCGCCGCCGTGGCTATCAAATCGCTTTTCACTCTGCATATAATTGTAATTCGAGGTCCGTTCAATCTG
AGGGATCAGGTGGCGTTTTTCCCCTGTTACGGAGGGCGAAATTTTCTCAACTTATCCGCGTTTCGCGACGTATCGGACTCGGAGATGAACGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGGGTTGTGGAGCACAACGTGATTGTTGACAGGAAATTGGATCGAATTCTGTATTTCCGTTCAAGAAATTGCGAAATTGTTGAAGATCATAAAGAAGGGA
GGAAGAGGAAGATTGATTTATCCGAGGAATTGGAGGTTCTTGTGAGTTTTGTAGAGAGAATTTGCGAAGTTCCAGAATCGCTGCATCTTCAGAAGAAGGATTTGGTTTAC
GAGGTGGTTAAATTGGTTCTTGAGAATTACAGGTTGGTTCAGAGGGAGATTTGGGTCCGAGTTAAAGAAATCGGAGACAGAGTCGATAGTTTGAGTCTGGACGAGTTGAC
TGAGTTGGTGGGTATTTTGAGACGGTTGGAAAATTGCAGAAGGAACTTGAGTGTGTTGTTTGTAAACAGAGGGAAGAACGAGGAATTTTGGGAATTGGTGAAAATTACGA
AAGGGAAACTGGCGGAGACGAAGGGGATGAAAGAGGAGAAGAGGATGATAATGGTGGAGATGAGAGCGGACTCGGTCGAGTCGACTCGGTTCTGGAATCCATTTGTTGAA
ACGGGTCAATTGCTGTGGGTCCCAGCGGGTGATGGACACTTGGGCCCGGCTCCGCTTCCACTGACCGTTTCAACGGTAGGATAG
mRNA sequence
ATGGCCATAGACCAAAACAAGAAGCTCAAGAATCTCATACACACTCTCAAAGACAAGGCCTCAATAATCAAAGCTACTTTCTCGATTCATCGCCGTTCATCTTCCATCAA
AGTCGCCGTCGTCCGGGCCACCACCCACGGTTCTCGAAACCCACCTTCCGACGGCCGAGTCGCCGCCGTGCTAGCCCTTGGGAATGACTTCCGTTCCTCGACTGCATTCG
CCTGCATCGAAGCCCTAATGGAGCGGCTTCATACGACCTCTAGCGCCGCCGTGGCTATCAAATCGCTTTTCACTCTGCATATAATTGTAATTCGAGGTCCGTTCAATCTG
AGGGATCAGGTGGCGTTTTTCCCCTGTTACGGAGGGCGAAATTTTCTCAACTTATCCGCGTTTCGCGACGTATCGGACTCGGAGATGAACGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGGGTTGTGGAGCACAACGTGATTGTTGACAGGAAATTGGATCGAATTCTGTATTTCCGTTCAAGAAATTGCGAAATTGTTGAAGATCATAAAGAAGGGA
GGAAGAGGAAGATTGATTTATCCGAGGAATTGGAGGTTCTTGTGAGTTTTGTAGAGAGAATTTGCGAAGTTCCAGAATCGCTGCATCTTCAGAAGAAGGATTTGGTTTAC
GAGGTGGTTAAATTGGTTCTTGAGAATTACAGGTTGGTTCAGAGGGAGATTTGGGTCCGAGTTAAAGAAATCGGAGACAGAGTCGATAGTTTGAGTCTGGACGAGTTGAC
TGAGTTGGTGGGTATTTTGAGACGGTTGGAAAATTGCAGAAGGAACTTGAGTGTGTTGTTTGTAAACAGAGGGAAGAACGAGGAATTTTGGGAATTGGTGAAAATTACGA
AAGGGAAACTGGCGGAGACGAAGGGGATGAAAGAGGAGAAGAGGATGATAATGGTGGAGATGAGAGCGGACTCGGTCGAGTCGACTCGGTTCTGGAATCCATTTGTTGAA
ACGGGTCAATTGCTGTGGGTCCCAGCGGGTGATGGACACTTGGGCCCGGCTCCGCTTCCACTGACCGTTTCAACGGTAGGATAG
Protein sequence
MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGPFNL
RDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSFVERICEVPESLHLQKKDLVY
EVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAETKGMKEEKRMIMVEMRADSVESTRFWNPFVE
TGQLLWVPAGDGHLGPAPLPLTVSTVG
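
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The Python sketch below does this with the standard genetic code, on the assumption that the standard nuclear code applies to this gene; `translate` is a hypothetical helper, and only the first 30 and last 9 nucleotides of the CDS are used as examples.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCCATAGACCAAAACAAGAAGCTCAAG"))  # MAIDQNKKLK
print(translate("GTAGGATAG"))  # VG
```

Both fragments agree with the start (MAIDQNKKLK) and the end (VG, followed by a TAG stop codon) of the protein sequence listed above.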