; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0220 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0220
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionENTH domain-containing protein
Genome locationMC08:1567750..1568814
RNA-Seq ExpressionMC08g0220
SyntenyMC08g0220
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152749.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]9.75e-17474.37Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKL NL+ ALKDKAS+IKATFS +RRSSSIK+AVVRATTH   NPPSD R++AVLALGNDF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVL+NYRLVQ+EI VRV+ IG+R + LS+DEL++LV ILTR ENCR K++VLFVNR K+E+ WELVK T+ KL E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRMIMV    ESVE TRL NPFVEPGQL+WVP G     PALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_008445571.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]2.36e-18075.77Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKLKNL  ALKDKASIIKA FS +RRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVLENYRLVQREI VRV+ IG+R + LS+DEL++LV IL R ENCR K++VLFVNR KNE+ WELVK TK K+ E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRM+MV    +SVE TRLWNPFVEPGQL+WVP GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_022131457.1 putative clathrin assembly protein At4g40080 [Momordica charantia]5.78e-25199.72Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
        GICEFPDSLHLQKN+MVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_023546879.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]4.66e-15770.51Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M IDQ KKLKNLIDA KD+ASIIKATFS HRRSSSIK+AVVRATTH   NPPSD RLAA+LALGNDF  STA  CIQ +M+RLHTT+SA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI-EDKQGKISELDLWGELDVLVGFV
        I+ IRGPF+L+ +V F PYYGGRN+LNLSAFRDVSDSEMS+LS WVRWYAGVVEHN    R+LDRILY RS N +I E K  KI EL    ELDVLVGF+
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI-EDKQGKISELDLWGELDVLVGFV

Query:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK
        E I E P+SLH+QK+D+VYEVVRLVLE+YRLVQREI VRV  IG+R + LS DELT+ V ILTR ENCRRK++VLFVNR KNE+LWELV  TK KLVE++
Subjt:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK

Query:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
             +RM M        E TRLWNPF+EPG L        PLGPALLPLTVSTVG
Subjt:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_038884022.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]3.73e-18777.75Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M IDQ KKLKNL  ALKDKASIIKAT S  RRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV + P YGGRNFLNLS FRDVSDSEM+DLSSWVRWYAGVVE NVIV R+LDRILY RS NC+I ++Q K  ++D+  EL+VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SL+LQK D+VYEVVRLVLENYRLVQREI VRV+ IGDR +SLSLDELT+LV I+TR ENCRRKL+VLFVNR KNE+ WELVK TK KL E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        MKEEKRMIMVE++A S E TRLWNPFVEPGQLLWVP+GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

TrEMBL top hitse value%identityAlignment
A0A0A0LLA1 ENTH domain-containing protein4.72e-17474.37Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKL NL+ ALKDKAS+IKATFS +RRSSSIK+AVVRATTH   NPPSD R++AVLALGNDF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVL+NYRLVQ+EI VRV+ IG+R + LS+DEL++LV ILTR ENCR K++VLFVNR K+E+ WELVK T+ KL E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRMIMV    ESVE TRL NPFVEPGQL+WVP G     PALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A1S3BDW7 putative clathrin assembly protein At4g400801.14e-18075.77Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKLKNL  ALKDKASIIKA FS +RRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVLENYRLVQREI VRV+ IG+R + LS+DEL++LV IL R ENCR K++VLFVNR KNE+ WELVK TK K+ E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRM+MV    +SVE TRLWNPFVEPGQL+WVP GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A5A7VCW1 Putative clathrin assembly protein1.14e-18075.77Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKLKNL  ALKDKASIIKA FS +RRSSSIK+AVVRATTH   NPPSD R+AAVLALGNDF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NC+I D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVLENYRLVQREI VRV+ IG+R + LS+DEL++LV IL R ENCR K++VLFVNR KNE+ WELVK TK K+ E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRM+MV    +SVE TRLWNPFVEPGQL+WVP GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A6J1BR25 putative clathrin assembly protein At4g400802.80e-25199.72Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
        IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
        GICEFPDSLHLQKN+MVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A6J1K8Z0 putative clathrin assembly protein At4g400802.53e-15670.51Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M IDQ KKLKNLIDA KD+ASIIKATFS HRRSSSIK+AVVRATTH   NPPSD RLAA+LALGNDF  STA  CIQ +M+RLHTT+SA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI-EDKQGKISELDLWGELDVLVGFV
        I+VIRGPF+LRD+V F PYYGGRNFLNLSAFRDVSDSEMS+LS WVRWYAGVVEH     R+LD ILY RS N +I E K  KI EL    ELDVL+GF+
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI-EDKQGKISELDLWGELDVLVGFV

Query:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK
        E I E P+SLH+QK+D+VYEVVRLVLE+YRLVQREI VRV  IG+R + LS DELT+ V ILTR ENCRRK++VLFVNR KN++LWELV  TK KLVE++
Subjt:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK

Query:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
             +RM          E TRLWNPFVEPG L        PLGPALLPLTVSTVG
Subjt:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026502.0e-1133.14Show/hide
Query:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG
        KLK  I A+KD+ S+  A       S + +++AVV+AT HD   P  D+ +  +L L   + R+   AC+ T+  RL+ T +  VA+K+L  +  ++  G
Subjt:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG

Query:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCK
              ++ F    G R  LN+S FRD S S+  D S++VR YA      + +  RLD  +  R G  K
Subjt:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCK

Q8H0W9 Putative clathrin assembly protein At5g104103.0e-2029.75Show/hide
Query:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        ++I   KDKASI KA       S+++K   LA++++TT  P+ PP+   ++AV++  N      A A     + RL  T +A+VA KSL  +H ++    
Subjt:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRI-LYLRSGNCKIEDKQGKISELD---LWGELDVLVGFVEGIC
           RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L      L +   K+E+K  ++S      +  + D LV F E IC
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRI-LYLRSGNCKIEDKQGKISELD---LWGELDVLVGFVEGIC

Query:  EFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVE
          P+   + +N +V E+  LV+E+Y  + R + VR++ + +R     +       L    ++L R   C+  L+ LF   R   +D W LV+  KA+  +
Subjt:  EFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVE

Query:  Q--KQMKEEKRMIMVEIR--AESVEL
        +  KQM E   ++   ++   E +EL
Subjt:  Q--KQMKEEKRMIMVEIR--AESVEL

Q8L936 Putative clathrin assembly protein At4g400801.5e-4336.65Show/hide
Query:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        +LI  +KDKAS  KA     +T  ++ S  L+V+RATTHDPS PP +R LA +L+ G    R+TA + +++IM+RLHTT  A VA+KSL  +H +V  G 
Subjt:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISEL---DLWGELDVLVGFVEGICE
        F L+DQ+   P  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +   +   S     E+ +  +S L   DL  E+D LVG +E  C+
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISEL---DLWGELDVLVGFVEGICE

Query:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK
         PD        +  ++ +LV E+Y     E+  R     +R+++LS  +  +LV  L R E+C+ +L+ +     K    +  W LV   K  +  +E  
Subjt:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK

Query:  QMKEEKRMIMVEIRAESVELTR
          + EK ++    R +  E  R
Subjt:  QMKEEKRMIMVEIRAESVELTR

Q8LF20 Putative clathrin assembly protein At2g254302.6e-1129.76Show/hide
Query:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF
        ++  I A+KD+ SI  A  +++  +  +++A+V+AT+HD  +P S++ +  +L L     R   +AC+ ++  RL  T   VVA+K+L  +H ++  G  
Subjt:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF

Query:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI
          ++++++    G R  LN+S FRD + S   D S++VR YAG ++      +RL+  L+ R     +
Subjt:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI

Q9FKQ2 Putative clathrin assembly protein At5g653701.8e-1226.62Show/hide
Query:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL
        KL  L   LKD+AS +K       +   + +I LA+++AT+H  +NPPSD+ +         F +ST   C     +  I+ RL  T+   VA K L  L
Subjt:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL

Query:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELD-----
        H +V       G   LR+ +      Y  G + L L+     S     +L+ WV+WY   ++  + +   L     ++  N   EDK+ +   +      
Subjt:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELD-----

Query:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL
         +  ++D LV   E I + P +   + N +V E+  L++++Y    R + +R   +  R     + +  +LV +L + ENC+  L+  F  R+K    D 
Subjt:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL

Query:  WELVKNTK
        W LV   K
Subjt:  WELVKNTK

Arabidopsis top hitse value%identityAlignment
AT2G25430.1 epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly protein-related1.8e-1229.76Show/hide
Query:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF
        ++  I A+KD+ SI  A  +++  +  +++A+V+AT+HD  +P S++ +  +L L     R   +AC+ ++  RL  T   VVA+K+L  +H ++  G  
Subjt:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF

Query:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI
          ++++++    G R  LN+S FRD + S   D S++VR YAG ++      +RL+  L+ R     +
Subjt:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKI

AT4G02650.1 ENTH/ANTH/VHS superfamily protein1.4e-1233.14Show/hide
Query:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG
        KLK  I A+KD+ S+  A       S + +++AVV+AT HD   P  D+ +  +L L   + R+   AC+ T+  RL+ T +  VA+K+L  +  ++  G
Subjt:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG

Query:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCK
              ++ F    G R  LN+S FRD S S+  D S++VR YA      + +  RLD  +  R G  K
Subjt:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCK

AT4G40080.1 ENTH/ANTH/VHS superfamily protein1.1e-4436.65Show/hide
Query:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        +LI  +KDKAS  KA     +T  ++ S  L+V+RATTHDPS PP +R LA +L+ G    R+TA + +++IM+RLHTT  A VA+KSL  +H +V  G 
Subjt:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISEL---DLWGELDVLVGFVEGICE
        F L+DQ+   P  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +   +   S     E+ +  +S L   DL  E+D LVG +E  C+
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISEL---DLWGELDVLVGFVEGICE

Query:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK
         PD        +  ++ +LV E+Y     E+  R     +R+++LS  +  +LV  L R E+C+ +L+ +     K    +  W LV   K  +  +E  
Subjt:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK

Query:  QMKEEKRMIMVEIRAESVELTR
          + EK ++    R +  E  R
Subjt:  QMKEEKRMIMVEIRAESVELTR

AT5G10410.1 ENTH/ANTH/VHS superfamily protein2.2e-2129.75Show/hide
Query:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        ++I   KDKASI KA       S+++K   LA++++TT  P+ PP+   ++AV++  N      A A     + RL  T +A+VA KSL  +H ++    
Subjt:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRI-LYLRSGNCKIEDKQGKISELD---LWGELDVLVGFVEGIC
           RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L      L +   K+E+K  ++S      +  + D LV F E IC
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRI-LYLRSGNCKIEDKQGKISELD---LWGELDVLVGFVEGIC

Query:  EFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVE
          P+   + +N +V E+  LV+E+Y  + R + VR++ + +R     +       L    ++L R   C+  L+ LF   R   +D W LV+  KA+  +
Subjt:  EFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVE

Query:  Q--KQMKEEKRMIMVEIR--AESVEL
        +  KQM E   ++   ++   E +EL
Subjt:  Q--KQMKEEKRMIMVEIR--AESVEL

AT5G65370.1 ENTH/ANTH/VHS superfamily protein1.3e-1326.62Show/hide
Query:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL
        KL  L   LKD+AS +K       +   + +I LA+++AT+H  +NPPSD+ +         F +ST   C     +  I+ RL  T+   VA K L  L
Subjt:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL

Query:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELD-----
        H +V       G   LR+ +      Y  G + L L+     S     +L+ WV+WY   ++  + +   L     ++  N   EDK+ +   +      
Subjt:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELD-----

Query:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL
         +  ++D LV   E I + P +   + N +V E+  L++++Y    R + +R   +  R     + +  +LV +L + ENC+  L+  F  R+K    D 
Subjt:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL

Query:  WELVKNTK
        W LV   K
Subjt:  WELVKNTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATAGACCAAAAGAAGAAGCTCAAGAATCTCATAGACGCTCTCAAAGACAAGGCCTCCATAATCAAAGCCACCTTCTCCACTCATCGCCGTTCCTCTTCCATAAA
ACTCGCCGTCGTCCGCGCCACCACTCACGACCCCTCAAACCCCCCTTCCGACCGCCGCCTCGCCGCCGTATTAGCCCTTGGGAATGACTTTGGCCGGTCCACTGCAATCG
CCTGCATCCAAACCATCATGGACCGTCTCCACACCACCTCCAGCGCCGTCGTCGCCATGAAATCGCTTTTCACTCTGCATATTGTCGTAATCCGAGGTCCGTTCGATCTC
AGGGATCAGGTGGTCTTCTGCCCCTATTACGGAGGGCGGAATTTTCTCAACTTGTCCGCGTTTCGCGACGTATCGGACTCGGAGATGAGCGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGCGTCGTGGAGCATAACGTTATCGTCCGGAGGAGATTGGATCGGATTTTGTACTTACGGTCAGGAAATTGCAAAATCGAAGATAAACAGGGCAAGATTT
CGGAACTTGATTTGTGGGGCGAATTGGATGTTCTTGTGGGTTTTGTTGAGGGAATTTGCGAATTTCCGGATTCGTTGCATCTTCAGAAGAACGATATGGTTTACGAGGTG
GTGAGATTGGTGCTGGAGAATTACAGATTGGTTCAGAGGGAGATTTCTGTCCGAGTCAGGGGAATCGGAGACAGAGCGGATAGTTTGAGTCTGGACGAGTTGACTCAATT
GGTGGTCATCTTGACGCGGTTCGAAAATTGTAGAAGAAAACTGACGGTGCTGTTTGTGAACAGGGCGAAGAACGAGGATTTGTGGGAATTGGTAAAGAATACGAAGGCAA
AACTGGTGGAGCAGAAGCAGATGAAAGAGGAGAAGAGGATGATCATGGTGGAGATCAGAGCGGAATCGGTCGAGTTGACTCGGTTATGGAACCCGTTTGTTGAACCGGGT
CAGTTGCTGTGGGTCCCATCGGGTGACGAACCTTTGGGCCCGGCCCTGCTTCCACTGACCGTTTCAACGGTAGGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCATAGACCAAAAGAAGAAGCTCAAGAATCTCATAGACGCTCTCAAAGACAAGGCCTCCATAATCAAAGCCACCTTCTCCACTCATCGCCGTTCCTCTTCCATAAA
ACTCGCCGTCGTCCGCGCCACCACTCACGACCCCTCAAACCCCCCTTCCGACCGCCGCCTCGCCGCCGTATTAGCCCTTGGGAATGACTTTGGCCGGTCCACTGCAATCG
CCTGCATCCAAACCATCATGGACCGTCTCCACACCACCTCCAGCGCCGTCGTCGCCATGAAATCGCTTTTCACTCTGCATATTGTCGTAATCCGAGGTCCGTTCGATCTC
AGGGATCAGGTGGTCTTCTGCCCCTATTACGGAGGGCGGAATTTTCTCAACTTGTCCGCGTTTCGCGACGTATCGGACTCGGAGATGAGCGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGCGTCGTGGAGCATAACGTTATCGTCCGGAGGAGATTGGATCGGATTTTGTACTTACGGTCAGGAAATTGCAAAATCGAAGATAAACAGGGCAAGATTT
CGGAACTTGATTTGTGGGGCGAATTGGATGTTCTTGTGGGTTTTGTTGAGGGAATTTGCGAATTTCCGGATTCGTTGCATCTTCAGAAGAACGATATGGTTTACGAGGTG
GTGAGATTGGTGCTGGAGAATTACAGATTGGTTCAGAGGGAGATTTCTGTCCGAGTCAGGGGAATCGGAGACAGAGCGGATAGTTTGAGTCTGGACGAGTTGACTCAATT
GGTGGTCATCTTGACGCGGTTCGAAAATTGTAGAAGAAAACTGACGGTGCTGTTTGTGAACAGGGCGAAGAACGAGGATTTGTGGGAATTGGTAAAGAATACGAAGGCAA
AACTGGTGGAGCAGAAGCAGATGAAAGAGGAGAAGAGGATGATCATGGTGGAGATCAGAGCGGAATCGGTCGAGTTGACTCGGTTATGGAACCCGTTTGTTGAACCGGGT
CAGTTGCTGTGGGTCCCATCGGGTGACGAACCTTTGGGCCCGGCCCTGCTTCCACTGACCGTTTCAACGGTAGGA
Protein sequenceShow/hide protein sequence
MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHDPSNPPSDRRLAAVLALGNDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPFDL
RDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCKIEDKQGKISELDLWGELDVLVGFVEGICEFPDSLHLQKNDMVYEV
VRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQMKEEKRMIMVEIRAESVELTRLWNPFVEPG
QLLWVPSGDEPLGPALLPLTVSTVG