; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G009920 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G009920
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionENTH domain-containing protein
Genome locationchr08:18391061..18392134
RNA-Seq ExpressionLsi08G009920
SyntenyLsi08G009920
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598708.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]3.2e-13174.02Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAIDQ KK KNLI   KD+ASIIKATFSIHRRSSSIKVAVVRATTHG+RNPPSD R+AA+LA GNDFRSSTAF CI+ALMERLHTT+SAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS
        II IRGPFNLR +VAF P YGGRNFLNLSAFRDVSDSEM++LS WVRWYAGVVEHN    RKLDRILYFRSRN EIV    EG+ RKI +L EEL+VLV 
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS

Query:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE
        F ERI EVPESLH+QK DLVYEVV+LVLE+YRLVQREIWVRV EIG+RV+ +S DELTE V IL R+ENCRR +SVLFVNRGKNEE WELV  TKGKL E
Subjt:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE

Query:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
         +            M     ESTR WNPFVEPG L          GPA LPLTVSTVG
Subjt:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_004152749.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]1.8e-15884.31Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI QNKKL NL+H LKDKAS+IKATFSI+RRSSSIKVAVVRATTHG+RNPPSD RV+AVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI ED   GRK K+DLSEEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVL+NYRLVQ+EIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGK+EEFWELVK T+GKL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRMIMV    +SVESTR  NPFVEPGQL+WVP      GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_008445571.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]1.1e-16586.83Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI Q+KKLKNL H LKDKASIIKA FSI+RRSSSIKVAVVRATTHG+RNPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI E    GRK K+DL+EEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVLENYRLVQREIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGKNEEFWELVKITKGK+AE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRM+MV    DSVESTR WNPFVEPGQL+WVP GDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_022131457.1 putative clathrin assembly protein At4g40080 [Momordica charantia]2.4e-14778.15Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        M IDQ KKLKNLI  LKDKASIIKATFS HRRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I  + K+G+  ++DL  EL+VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VE ICE P+SLHLQK ++VYEVV+LVLENYRLVQREI VRV+ IGDR DSLSLDELT+LV IL R ENCRR L+VLFVNR KNE+ WELVK TK KL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K MKEEKRMIMVE+RA+SVE TR WNPFVEPGQLLWVP+GD  LGPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

XP_038884022.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]4.6e-17591.32Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAIDQNKKLKNL H LKDKASIIKAT SI RRSSSIKVAVVRATTHGSRNPPSD RVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQVA+FPCYGGRNFLNLS FRDVSDSEMNDLSSWVRWYAGVVE NVIVDRKLDRILYFRSRNCEIVE   E RKRKID+ EELEVLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESL+LQKKDLVYEVV+LVLENYRLVQREIWVRVKEIGDRV+SLSLDELTELVGI+ RLENCRR LSVLFVNRGKNEEFWELVKITKGKLAE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K MKEEKRMIMVEM+A+S ESTR WNPFVEPGQLLWVPAGDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

TrEMBL top hitse value%identityAlignment
A0A0A0LLA1 ENTH domain-containing protein8.6e-15984.31Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI QNKKL NL+H LKDKAS+IKATFSI+RRSSSIKVAVVRATTHG+RNPPSD RV+AVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI ED   GRK K+DLSEEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVL+NYRLVQ+EIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGK+EEFWELVK T+GKL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRMIMV    +SVESTR  NPFVEPGQL+WVP      GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A1S3BDW7 putative clathrin assembly protein At4g400805.6e-16686.83Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI Q+KKLKNL H LKDKASIIKA FSI+RRSSSIKVAVVRATTHG+RNPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI E    GRK K+DL+EEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVLENYRLVQREIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGKNEEFWELVKITKGK+AE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRM+MV    DSVESTR WNPFVEPGQL+WVP GDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A5A7VCW1 Putative clathrin assembly protein5.6e-16686.83Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAI Q+KKLKNL H LKDKASIIKA FSI+RRSSSIKVAVVRATTHG+RNPPSD RVAAVLALGNDFRSSTAFACIEALM RLHTTSSAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        IIVIRGPFNLRDQV+FFP YGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEI E    GRK K+DL+EEL VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VERICEVPESLHLQKKDLVYEVV+LVLENYRLVQREIWVRVKEIG+RV+ LS+DEL+ELVGIL RLENCR  +SVLFVNRGKNEEFWELVKITKGK+AE 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K +KEEKRM+MV    DSVESTR WNPFVEPGQL+WVP GDG +GPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A6J1BR25 putative clathrin assembly protein At4g400801.2e-14778.15Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        M IDQ KKLKNLI  LKDKASIIKATFS HRRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I  + K+G+  ++DL  EL+VLV F
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSF

Query:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET
        VE ICE P+SLHLQK ++VYEVV+LVLENYRLVQREI VRV+ IGDR DSLSLDELT+LV IL R ENCRR L+VLFVNR KNE+ WELVK TK KL E 
Subjt:  VERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAET

Query:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
        K MKEEKRMIMVE+RA+SVE TR WNPFVEPGQLLWVP+GD  LGPA LPLTVSTVG
Subjt:  KGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

A0A6J1HF54 putative clathrin assembly protein At4g400802.0e-13174.58Show/hide
Query:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH
        MAIDQ KKLKNLI   KD+ASIIKATFSIHRRSSSIKVAVVRATTHG+RNPPSD R+AA+LA GNDFRSSTAF CI+ALMERLHTT+SAAVA+KSLFTLH
Subjt:  MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLH

Query:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS
        II IRGPFNL+ +VAF P YGGRNFLNLSAFRD+SDSEM++LS WVRWYAGVVEHN    RKLDRILYFRSRN EIV    EG+ RKI +L EEL+VLV 
Subjt:  IIVIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI-DLSEELEVLVS

Query:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE
        F ERI EVPESLH+QK DLVYEVV+LVLE+YRLVQREIWVRV EIG+RV+ LS DELTE V IL R+ENCR  +SVLFVNRGKNEE WELV  TKGKL  
Subjt:  FVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAE

Query:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG
              E+R  M  M     ESTR WNPFVEPG L         LGPA LPLTVSTVG
Subjt:  TKGMKEEKRMIMVEMRADSVESTRFWNPFVEPGQLLWVPAGDGHLGPAPLPLTVSTVG

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026508.4e-1033.95Show/hide
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + KLK  I  +KD+ S+  A   +  RSSS   +++AVV+AT H    P  D  +  +L L +  R+  + AC+  L  RL+ T + +VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRD S S+  D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

Q8H0W9 Putative clathrin assembly protein At5g104107.5e-1930.99Show/hide
Query:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        ++I   KDKASI KA       S+++K   +A++++TT     PP+   V+AV++  N   +  AF+   A + RL  T +A VA KSL  +H ++    
Subjt:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE
         + RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L           + VE+       +   +  + + LVSF E IC 
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE

Query:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL
         PE   + +  +V E+ +LV+E+Y  + R + VR++ + +R        +  L L++ + L   L RL  C+ +LS LF   R   ++FW LV++ K   
Subjt:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL

Query:  AETKGMKEEKRMI
        AET+  K  K+MI
Subjt:  AETKGMKEEKRMI

Q8L936 Putative clathrin assembly protein At4g400807.5e-4337.5Show/hide
Query:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        +LI  +KDKAS  KA     +   ++ S  ++V+RATTH    PP +  +A +L+ G   R +TA + +E++MERLHTT  A VA+KSL  +H IV  G 
Subjt:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV
        F L+DQ++ FP  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +    +F S     +  HKE  +  +      DL  E++ LV  +
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV

Query:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA
        E  C++P+      K L  ++ +LV E+Y     E++ R  E  +R ++LS  +  ELV  L+RLE+C+  LS +      RG  + FW LV   KG + 
Subjt:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA

Query:  --ETKGMKEEKRMIMVEMRADSVESTRF
          E    + EK ++    R    ES RF
Subjt:  --ETKGMKEEKRMIMVEMRADSVESTRF

Q9FKQ2 Putative clathrin assembly protein At5g653701.9e-1428.66Show/hide
Query:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H S NPPSD  V         F  ST   C     ++A++ RL  T+   VA K L  
Subjt:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT

Query:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-
        LH +V       G  +LR+ +         GG N L L+     S     +L+ WV+WY   ++  + +   L      + +N +   + +      +D 
Subjt:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-

Query:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW
        + ++++ LV   E I + P++   +   +V E+ +L++++Y    R + +R +E+  RV      +  ELV +L +LENC+  LS  F  R K    +FW
Subjt:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW

Query:  ELVKITK
         LV   K
Subjt:  ELVKITK

Q9SA65 Putative clathrin assembly protein At1g030501.6e-0832.72Show/hide
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + K K  I  +KD+ S+  A   ++ RS+S+    VA+V+AT H    P  +  +  +L+L   +  S   AC+  L  RL+ T    VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRDVS S   D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

Arabidopsis top hitse value%identityAlignment
AT1G03050.1 ENTH/ANTH/VHS superfamily protein1.1e-0932.72Show/hide
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + K K  I  +KD+ S+  A   ++ RS+S+    VA+V+AT H    P  +  +  +L+L   +  S   AC+  L  RL+ T    VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRDVS S   D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

AT4G02650.1 ENTH/ANTH/VHS superfamily protein5.9e-1133.95Show/hide
Query:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII
        + KLK  I  +KD+ S+  A   +  RSSS   +++AVV+AT H    P  D  +  +L L +  R+  + AC+  L  RL+ T + +VA+K+L  +  +
Subjt:  NKKLKNLIHTLKDKASIIKATFSIHRRSSS---IKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHII

Query:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD
        +  G     +Q  FF    G   LN+S FRD S S+  D S++VR YA      + +D +LD
Subjt:  VIRGPFNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLD

AT4G40080.1 ENTH/ANTH/VHS superfamily protein5.3e-4437.5Show/hide
Query:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        +LI  +KDKAS  KA     +   ++ S  ++V+RATTH    PP +  +A +L+ G   R +TA + +E++MERLHTT  A VA+KSL  +H IV  G 
Subjt:  NLIHTLKDKASIIKATF---SIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV
        F L+DQ++ FP  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +    +F S     +  HKE  +  +      DL  E++ LV  +
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKI------DLSEELEVLVSFV

Query:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA
        E  C++P+      K L  ++ +LV E+Y     E++ R  E  +R ++LS  +  ELV  L+RLE+C+  LS +      RG  + FW LV   KG + 
Subjt:  ERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLF---VNRGKNEEFWELVKITKGKLA

Query:  --ETKGMKEEKRMIMVEMRADSVESTRF
          E    + EK ++    R    ES RF
Subjt:  --ETKGMKEEKRMIMVEMRADSVESTRF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein5.4e-2030.99Show/hide
Query:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP
        ++I   KDKASI KA       S+++K   +A++++TT     PP+   V+AV++  N   +  AF+   A + RL  T +A VA KSL  +H ++    
Subjt:  NLIHTLKDKASIIKATFSIHRRSSSIK---VAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGP

Query:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE
         + RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L           + VE+       +   +  + + LVSF E IC 
Subjt:  FNLRDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-LSEELEVLVSFVERICE

Query:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL
         PE   + +  +V E+ +LV+E+Y  + R + VR++ + +R        +  L L++ + L   L RL  C+ +LS LF   R   ++FW LV++ K   
Subjt:  VPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDR--------VDSLSLDELTELVGILRRLENCRRNLSVLFVN-RGKNEEFWELVKITKGKL

Query:  AETKGMKEEKRMI
        AET+  K  K+MI
Subjt:  AETKGMKEEKRMI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein1.4e-1528.66Show/hide
Query:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT
        KL  L   LKD+AS +K    +H  SS    +I +A+++AT+H S NPPSD  V         F  ST   C     ++A++ RL  T+   VA K L  
Subjt:  KLKNLIHTLKDKASIIKATFSIHRRSS----SIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFAC-----IEALMERLHTTSSAAVAIKSLFT

Query:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-
        LH +V       G  +LR+ +         GG N L L+     S     +L+ WV+WY   ++  + +   L      + +N +   + +      +D 
Subjt:  LHIIV-----IRGPFNLRDQV---AFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKID-

Query:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW
        + ++++ LV   E I + P++   +   +V E+ +L++++Y    R + +R +E+  RV      +  ELV +L +LENC+  LS  F  R K    +FW
Subjt:  LSEELEVLVSFVERICEVPESLHLQKKDLVYEVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKN--EEFW

Query:  ELVKITK
         LV   K
Subjt:  ELVKITK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATAGACCAAAACAAGAAGCTCAAGAATCTCATACACACTCTCAAAGACAAGGCCTCAATAATCAAAGCTACTTTCTCGATTCATCGCCGTTCATCTTCCATCAA
AGTCGCCGTCGTCCGGGCCACCACCCACGGTTCTCGAAACCCACCTTCCGACGGCCGAGTCGCCGCCGTGCTAGCCCTTGGGAATGACTTCCGTTCCTCGACTGCATTCG
CCTGCATCGAAGCCCTAATGGAGCGGCTTCATACGACCTCTAGCGCCGCCGTGGCTATCAAATCGCTTTTCACTCTGCATATAATTGTAATTCGAGGTCCGTTCAATCTG
AGGGATCAGGTGGCGTTTTTCCCCTGTTACGGAGGGCGAAATTTTCTCAACTTATCCGCGTTTCGCGACGTATCGGACTCGGAGATGAACGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGGGTTGTGGAGCACAACGTGATTGTTGACAGGAAATTGGATCGAATTCTGTATTTCCGTTCAAGAAATTGCGAAATTGTTGAAGATCATAAAGAAGGGA
GGAAGAGGAAGATTGATTTATCCGAGGAATTGGAGGTTCTTGTGAGTTTTGTAGAGAGAATTTGCGAAGTTCCAGAATCGCTGCATCTTCAGAAGAAGGATTTGGTTTAC
GAGGTGGTTAAATTGGTTCTTGAGAATTACAGGTTGGTTCAGAGGGAGATTTGGGTCCGAGTTAAAGAAATCGGAGACAGAGTCGATAGTTTGAGTCTGGACGAGTTGAC
TGAGTTGGTGGGTATTTTGAGACGGTTGGAAAATTGCAGAAGGAACTTGAGTGTGTTGTTTGTAAACAGAGGGAAGAACGAGGAATTTTGGGAATTGGTGAAAATTACGA
AAGGGAAACTGGCGGAGACGAAGGGGATGAAAGAGGAGAAGAGGATGATAATGGTGGAGATGAGAGCGGACTCGGTCGAGTCGACTCGGTTCTGGAATCCATTTGTTGAA
CCGGGTCAATTGCTGTGGGTCCCAGCGGGTGATGGACACTTGGGCCCGGCTCCGCTTCCACTGACCGTTTCAACGGTAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCATAGACCAAAACAAGAAGCTCAAGAATCTCATACACACTCTCAAAGACAAGGCCTCAATAATCAAAGCTACTTTCTCGATTCATCGCCGTTCATCTTCCATCAA
AGTCGCCGTCGTCCGGGCCACCACCCACGGTTCTCGAAACCCACCTTCCGACGGCCGAGTCGCCGCCGTGCTAGCCCTTGGGAATGACTTCCGTTCCTCGACTGCATTCG
CCTGCATCGAAGCCCTAATGGAGCGGCTTCATACGACCTCTAGCGCCGCCGTGGCTATCAAATCGCTTTTCACTCTGCATATAATTGTAATTCGAGGTCCGTTCAATCTG
AGGGATCAGGTGGCGTTTTTCCCCTGTTACGGAGGGCGAAATTTTCTCAACTTATCCGCGTTTCGCGACGTATCGGACTCGGAGATGAACGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGGGTTGTGGAGCACAACGTGATTGTTGACAGGAAATTGGATCGAATTCTGTATTTCCGTTCAAGAAATTGCGAAATTGTTGAAGATCATAAAGAAGGGA
GGAAGAGGAAGATTGATTTATCCGAGGAATTGGAGGTTCTTGTGAGTTTTGTAGAGAGAATTTGCGAAGTTCCAGAATCGCTGCATCTTCAGAAGAAGGATTTGGTTTAC
GAGGTGGTTAAATTGGTTCTTGAGAATTACAGGTTGGTTCAGAGGGAGATTTGGGTCCGAGTTAAAGAAATCGGAGACAGAGTCGATAGTTTGAGTCTGGACGAGTTGAC
TGAGTTGGTGGGTATTTTGAGACGGTTGGAAAATTGCAGAAGGAACTTGAGTGTGTTGTTTGTAAACAGAGGGAAGAACGAGGAATTTTGGGAATTGGTGAAAATTACGA
AAGGGAAACTGGCGGAGACGAAGGGGATGAAAGAGGAGAAGAGGATGATAATGGTGGAGATGAGAGCGGACTCGGTCGAGTCGACTCGGTTCTGGAATCCATTTGTTGAA
CCGGGTCAATTGCTGTGGGTCCCAGCGGGTGATGGACACTTGGGCCCGGCTCCGCTTCCACTGACCGTTTCAACGGTAGGATAG
Protein sequenceShow/hide protein sequence
MAIDQNKKLKNLIHTLKDKASIIKATFSIHRRSSSIKVAVVRATTHGSRNPPSDGRVAAVLALGNDFRSSTAFACIEALMERLHTTSSAAVAIKSLFTLHIIVIRGPFNL
RDQVAFFPCYGGRNFLNLSAFRDVSDSEMNDLSSWVRWYAGVVEHNVIVDRKLDRILYFRSRNCEIVEDHKEGRKRKIDLSEELEVLVSFVERICEVPESLHLQKKDLVY
EVVKLVLENYRLVQREIWVRVKEIGDRVDSLSLDELTELVGILRRLENCRRNLSVLFVNRGKNEEFWELVKITKGKLAETKGMKEEKRMIMVEMRADSVESTRFWNPFVE
PGQLLWVPAGDGHLGPAPLPLTVSTVG