; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001115 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001115
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionENTH domain-containing protein
Genome locationscaffold36:1540668..1541732
RNA-Seq ExpressionMS001115
SyntenyMS001115
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152749.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]2.1e-13574.08Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKL NL+ ALKDKAS+IKATFS +RRSSSIK+AVVRATTH   NPPSD R++AVLALG+DF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIV R+LDRILY RS NCE  D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVL+NYRLVQ+EI VRV+ IG+R + LS+DEL++LV ILTR ENCR K++VLFVNR K+E+ WELVK T+ KL E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRMIMV    ESVE TRL NPFVEPGQL+WVP      GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_008445571.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]2.2e-14075.49Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKLKNL  ALKDKASIIKA FS +RRSSSIK+AVVRATTH   NPPSD R+AAVLALG+DF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NCE  D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVLENYRLVQREI VRV+ IG+R + LS+DEL++LV IL R ENCR K++VLFVNR KNE+ WELVK TK K+ E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRM+MV    +SVE TRLWNPFVEPGQL+WVP GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_022131457.1 putative clathrin assembly protein At4g40080 [Momordica charantia]1.1e-19298.59Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTH PSNPPSDRRLAAVLALG+DFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNC+ EDKQGKISELDLWGELDVLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
        GICEFPDSLHLQKN+MVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_023546879.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]2.0e-12270.22Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M IDQ KKLKNLIDA KD+ASIIKATFS HRRSSSIK+AVVRATTH   NPPSD RLAA+LALG+DF  STA  CIQ +M+RLHTT+SA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCE-NEDKQGKISELDLWGELDVLVGFV
        I+ IRGPF+L+ +V F PYYGGRN+LNLSAFRDVSDSEMS+LS WVRWYAGVVEHN    R+LDRILY RS N E  E K  KI E  L  ELDVLVGF+
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCE-NEDKQGKISELDLWGELDVLVGFV

Query:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK
        E I E P+SLH+QK+D+VYEVVRLVLE+YRLVQREI VRV  IG+R + LS DELT+ V ILTR ENCRRK++VLFVNR KNE+LWELV  TK KLVE++
Subjt:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK

Query:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
             +RM M        E TRLWNPF+EPG L        PLGPALLPLTVSTVG
Subjt:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

XP_038884022.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]1.3e-14577.46Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M IDQ KKLKNL  ALKDKASIIKAT S  RRSSSIK+AVVRATTH   NPPSD R+AAVLALG+DF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV + P YGGRNFLNLS FRDVSDSEM+DLSSWVRWYAGVVE NVIV R+LDRILY RS NCE  ++Q K  ++D+  EL+VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SL+LQK D+VYEVVRLVLENYRLVQREI VRV+ IGDR +SLSLDELT+LV I+TR ENCRRKL+VLFVNR KNE+ WELVK TK KL E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        MKEEKRMIMVE++A S E TRLWNPFVEPGQLLWVP+GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

TrEMBL top hitse value%identityAlignment
A0A0A0LLA1 ENTH domain-containing protein1.0e-13574.08Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKL NL+ ALKDKAS+IKATFS +RRSSSIK+AVVRATTH   NPPSD R++AVLALG+DF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIV R+LDRILY RS NCE  D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVL+NYRLVQ+EI VRV+ IG+R + LS+DEL++LV ILTR ENCR K++VLFVNR K+E+ WELVK T+ KL E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRMIMV    ESVE TRL NPFVEPGQL+WVP      GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A1S3BDW7 putative clathrin assembly protein At4g400801.0e-14075.49Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKLKNL  ALKDKASIIKA FS +RRSSSIK+AVVRATTH   NPPSD R+AAVLALG+DF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NCE  D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVLENYRLVQREI VRV+ IG+R + LS+DEL++LV IL R ENCR K++VLFVNR KNE+ WELVK TK K+ E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRM+MV    +SVE TRLWNPFVEPGQL+WVP GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A5A7VCW1 Putative clathrin assembly protein1.0e-14075.49Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M I Q KKLKNL  ALKDKASIIKA FS +RRSSSIK+AVVRATTH   NPPSD R+AAVLALG+DF  STA ACI+ +M+RLHTTSSA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        I+VIRGPF+LRDQV F P YGGRNFLNLSAFRDVSDSEM+DLSSWVRWYAGVVEHNVIV R+LDRILY RS NCE  D+ G+  ++DL  EL VLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
         ICE P+SLHLQK D+VYEVVRLVLENYRLVQREI VRV+ IG+R + LS+DEL++LV IL R ENCR K++VLFVNR KNE+ WELVK TK K+ E+K+
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        +KEEKRM+MV    +SVE TRLWNPFVEPGQL+WVP GD P+GPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A6J1BR25 putative clathrin assembly protein At4g400805.3e-19398.59Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTH PSNPPSDRRLAAVLALG+DFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE
        IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNC+ EDKQGKISELDLWGELDVLVGFVE
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVE

Query:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
        GICEFPDSLHLQKN+MVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ
Subjt:  GICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQ

Query:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
        MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
Subjt:  MKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

A0A6J1K8Z0 putative clathrin assembly protein At4g400806.4e-12270.22Show/hide
Query:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH
        M IDQ KKLKNLIDA KD+ASIIKATFS HRRSSSIK+AVVRATTH   NPPSD RLAA+LALG+DF  STA  CIQ +M+RLHTT+SA VAMKSLFTLH
Subjt:  MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLH

Query:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCE-NEDKQGKISELDLWGELDVLVGFV
        I+VIRGPF+LRD+V F PYYGGRNFLNLSAFRDVSDSEMS+LS WVRWYAGVVEH     R+LD ILY RS N E  E K  KI E  L  ELDVL+GF+
Subjt:  IVVIRGPFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCE-NEDKQGKISELDLWGELDVLVGFV

Query:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK
        E I E P+SLH+QK+D+VYEVVRLVLE+YRLVQREI VRV  IG+R + LS DELT+ V ILTR ENCRRK++VLFVNR KN++LWELV  TK KLVE++
Subjt:  EGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQK

Query:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG
             +RM  +       E TRLWNPFVEPG L        PLGPALLPLTVSTVG
Subjt:  QMKEEKRMIMVEIRAESVELTRLWNPFVEPGQLLWVPSGDEPLGPALLPLTVSTVG

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104105.2e-2028.92Show/hide
Query:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        ++I   KDKASI KA       S+++K   LA++++TT  P+ PP+   ++AV++  +      A A     + RL  T +A+VA KSL  +H ++    
Subjt:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRL---DRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVEGICE
           RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L     +L       E +D+        +  + D LV F E IC 
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRL---DRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVEGICE

Query:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVEQ
         P+   + +N +V E+  LV+E+Y  + R + VR++ + +R     +       L    ++L R   C+  L+ LF   R   +D W LV+  KA+  ++
Subjt:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVEQ

Query:  --KQMKEEKRMIMVEIR--AESVEL
          KQM E   ++   ++   E +EL
Subjt:  --KQMKEEKRMIMVEIR--AESVEL

Q8L936 Putative clathrin assembly protein At4g400807.4e-4336.34Show/hide
Query:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        +LI  +KDKAS  KA     +T  ++ S  L+V+RATTH PS PP +R LA +L+ G    R+TA + +++IM+RLHTT  A VA+KSL  +H +V  G 
Subjt:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISEL---DLWGELDVLVGFVEGICE
        F L+DQ+   P  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +   +   S     E+ +  +S L   DL  E+D LVG +E  C+
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISEL---DLWGELDVLVGFVEGICE

Query:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK
         PD        +  ++ +LV E+Y     E+  R     +R+++LS  +  +LV  L R E+C+ +L+ +     K    +  W LV   K  +  +E  
Subjt:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK

Query:  QMKEEKRMIMVEIRAESVELTR
          + EK ++    R +  E  R
Subjt:  QMKEEKRMIMVEIRAESVELTR

Q8LF20 Putative clathrin assembly protein At2g254302.2e-1028.49Show/hide
Query:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF
        ++  I A+KD+ SI  A  +++  +  +++A+V+AT+ H  +P S++ +  +L L     R   +AC+ ++  RL  T   VVA+K+L  +H ++  G  
Subjt:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF

Query:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLR------------SGNCENEDKQGK
          ++++++    G R  LN+S FRD + S   D S++VR YAG ++      +RL+  L+ R            S +  N+D+ G+
Subjt:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLR------------SGNCENEDKQGK

Q9FKQ2 Putative clathrin assembly protein At5g653701.4e-1226.62Show/hide
Query:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL
        KL  L   LKD+AS +K       +   + +I LA+++AT+H  +NPPSD+ +         F +ST   C     +  I+ RL  T+   VA K L  L
Subjt:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL

Query:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELD-----
        H +V       G   LR+ +      Y  G + L L+     S     +L+ WV+WY   ++  + +   L     ++    +NEDK+ +   +      
Subjt:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELD-----

Query:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL
         +  ++D LV   E I + P +   + N +V E+  L++++Y    R + +R   +  R     + +  +LV +L + ENC+  L+  F  R+K    D 
Subjt:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL

Query:  WELVKNTK
        W LV   K
Subjt:  WELVKNTK

Q9SA65 Putative clathrin assembly protein At1g030504.4e-1130.6Show/hide
Query:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG
        K K  I A+KD+ S+  A  +    S S + +A+V+A T H   P  ++ +  +L+L   + RS   AC+ T+  RL+ T    VA+K+L  +  ++  G
Subjt:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG

Query:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVR---RRLDRILYLRSGNCENEDKQGKISEL
              ++ F    G R  LN+S FRDVS S   D S++VR YA  ++  +  R   R   R +Y   G  + E++    ++L
Subjt:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVR---RRLDRILYLRSGNCENEDKQGKISEL

Arabidopsis top hitse value%identityAlignment
AT1G03050.1 ENTH/ANTH/VHS superfamily protein3.1e-1230.6Show/hide
Query:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG
        K K  I A+KD+ S+  A  +    S S + +A+V+A T H   P  ++ +  +L+L   + RS   AC+ T+  RL+ T    VA+K+L  +  ++  G
Subjt:  KLKNLIDALKDKASIIKATFSTHRRS-SSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRG

Query:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVR---RRLDRILYLRSGNCENEDKQGKISEL
              ++ F    G R  LN+S FRDVS S   D S++VR YA  ++  +  R   R   R +Y   G  + E++    ++L
Subjt:  PFDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVR---RRLDRILYLRSGNCENEDKQGKISEL

AT2G25430.1 epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly protein-related1.6e-1128.49Show/hide
Query:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF
        ++  I A+KD+ SI  A  +++  +  +++A+V+AT+ H  +P S++ +  +L L     R   +AC+ ++  RL  T   VVA+K+L  +H ++  G  
Subjt:  LKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPF

Query:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLR------------SGNCENEDKQGK
          ++++++    G R  LN+S FRD + S   D S++VR YAG ++      +RL+  L+ R            S +  N+D+ G+
Subjt:  DLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLR------------SGNCENEDKQGK

AT4G40080.1 ENTH/ANTH/VHS superfamily protein5.3e-4436.34Show/hide
Query:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        +LI  +KDKAS  KA     +T  ++ S  L+V+RATTH PS PP +R LA +L+ G    R+TA + +++IM+RLHTT  A VA+KSL  +H +V  G 
Subjt:  NLIDALKDKASIIKATF---STHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISEL---DLWGELDVLVGFVEGICE
        F L+DQ+   P  GGRN+L LSAFRD     M +LSSWVRWYA  +EH +   R +   +   S     E+ +  +S L   DL  E+D LVG +E  C+
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISEL---DLWGELDVLVGFVEGICE

Query:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK
         PD        +  ++ +LV E+Y     E+  R     +R+++LS  +  +LV  L R E+C+ +L+ +     K    +  W LV   K  +  +E  
Subjt:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN---EDLWELVKNTKAKL--VEQK

Query:  QMKEEKRMIMVEIRAESVELTR
          + EK ++    R +  E  R
Subjt:  QMKEEKRMIMVEIRAESVELTR

AT5G10410.1 ENTH/ANTH/VHS superfamily protein3.7e-2128.92Show/hide
Query:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP
        ++I   KDKASI KA       S+++K   LA++++TT  P+ PP+   ++AV++  +      A A     + RL  T +A+VA KSL  +H ++    
Subjt:  NLIDALKDKASIIKATFSTHRRSSSIK---LAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGP

Query:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRL---DRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVEGICE
           RD+  F     GRN L L+ F D S +   +LS W+RWY   ++    V + L     +L       E +D+        +  + D LV F E IC 
Subjt:  FDLRDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRL---DRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVEGICE

Query:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVEQ
         P+   + +N +V E+  LV+E+Y  + R + VR++ + +R     +       L    ++L R   C+  L+ LF   R   +D W LV+  KA+  ++
Subjt:  FPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDE-----LTQLVVILTRFENCRRKLTVLFVN-RAKNEDLWELVKNTKAKLVEQ

Query:  --KQMKEEKRMIMVEIR--AESVEL
          KQM E   ++   ++   E +EL
Subjt:  --KQMKEEKRMIMVEIR--AESVEL

AT5G65370.1 ENTH/ANTH/VHS superfamily protein9.7e-1426.62Show/hide
Query:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL
        KL  L   LKD+AS +K       +   + +I LA+++AT+H  +NPPSD+ +         F +ST   C     +  I+ RL  T+   VA K L  L
Subjt:  KLKNLIDALKDKASIIKAT---FSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIAC-----IQTIMDRLHTTSSAVVAMKSLFTL

Query:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELD-----
        H +V       G   LR+ +      Y  G + L L+     S     +L+ WV+WY   ++  + +   L     ++    +NEDK+ +   +      
Subjt:  HIVV-----IRGPFDLRDQVVF--CPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELD-----

Query:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL
         +  ++D LV   E I + P +   + N +V E+  L++++Y    R + +R   +  R     + +  +LV +L + ENC+  L+  F  R+K    D 
Subjt:  -LWGELDVLVGFVEGICEFPDSLHLQKNDMVYEVVRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKN--EDL

Query:  WELVKNTK
        W LV   K
Subjt:  WELVKNTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATAGACCAAAAGAAGAAGCTCAAGAATCTCATAGACGCTCTCAAAGACAAGGCCTCCATAATCAAAGCCACCTTCTCCACTCATCGCCGTTCCTCTTCCATAAA
ACTCGCCGTCGTCCGCGCCACCACTCACCACCCCTCAAACCCCCCTTCCGACCGCCGCCTCGCCGCCGTATTAGCCCTTGGGCATGACTTTGGCCGGTCCACTGCAATCG
CCTGCATCCAAACCATCATGGACCGTCTCCACACCACCTCCAGCGCCGTCGTCGCCATGAAATCGCTTTTCACTCTGCATATTGTCGTAATCCGAGGTCCGTTCGATCTC
AGGGATCAGGTGGTCTTCTGCCCCTATTACGGAGGGCGGAATTTTCTCAACTTGTCCGCGTTTCGCGACGTATCGGACTCGGAGATGAGCGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGCGTCGTGGAGCATAACGTTATCGTCCGGAGGAGATTGGATCGGATTTTGTACTTACGGTCAGGAAATTGCGAAAACGAAGATAAACAGGGCAAGATTT
CGGAACTTGATTTGTGGGGCGAATTGGATGTTCTTGTGGGTTTTGTTGAGGGAATTTGCGAATTTCCGGATTCGTTGCATCTTCAGAAGAACGATATGGTTTACGAGGTG
GTGAGATTGGTGCTGGAGAATTACAGATTGGTTCAGAGGGAGATTTCTGTCCGAGTCAGGGGAATCGGAGACAGAGCGGATAGTTTGAGTCTGGACGAGTTGACTCAATT
GGTGGTCATCTTGACGCGGTTCGAAAATTGTAGAAGAAAACTGACGGTGCTGTTTGTGAACAGGGCGAAGAACGAGGATTTGTGGGAATTGGTAAAGAATACGAAGGCAA
AACTGGTGGAGCAGAAGCAGATGAAAGAGGAGAAGAGGATGATCATGGTGGAGATCAGAGCGGAATCGGTCGAGTTGACTCGGTTATGGAACCCGTTTGTTGAACCGGGT
CAGTTGCTGTGGGTCCCATCGGGTGACGAACCTTTGGGCCCGGCCCTGCTTCCACTGACCGTTTCAACGGTAGGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCATAGACCAAAAGAAGAAGCTCAAGAATCTCATAGACGCTCTCAAAGACAAGGCCTCCATAATCAAAGCCACCTTCTCCACTCATCGCCGTTCCTCTTCCATAAA
ACTCGCCGTCGTCCGCGCCACCACTCACCACCCCTCAAACCCCCCTTCCGACCGCCGCCTCGCCGCCGTATTAGCCCTTGGGCATGACTTTGGCCGGTCCACTGCAATCG
CCTGCATCCAAACCATCATGGACCGTCTCCACACCACCTCCAGCGCCGTCGTCGCCATGAAATCGCTTTTCACTCTGCATATTGTCGTAATCCGAGGTCCGTTCGATCTC
AGGGATCAGGTGGTCTTCTGCCCCTATTACGGAGGGCGGAATTTTCTCAACTTGTCCGCGTTTCGCGACGTATCGGACTCGGAGATGAGCGACTTGTCGTCTTGGGTGAG
ATGGTACGCGGGCGTCGTGGAGCATAACGTTATCGTCCGGAGGAGATTGGATCGGATTTTGTACTTACGGTCAGGAAATTGCGAAAACGAAGATAAACAGGGCAAGATTT
CGGAACTTGATTTGTGGGGCGAATTGGATGTTCTTGTGGGTTTTGTTGAGGGAATTTGCGAATTTCCGGATTCGTTGCATCTTCAGAAGAACGATATGGTTTACGAGGTG
GTGAGATTGGTGCTGGAGAATTACAGATTGGTTCAGAGGGAGATTTCTGTCCGAGTCAGGGGAATCGGAGACAGAGCGGATAGTTTGAGTCTGGACGAGTTGACTCAATT
GGTGGTCATCTTGACGCGGTTCGAAAATTGTAGAAGAAAACTGACGGTGCTGTTTGTGAACAGGGCGAAGAACGAGGATTTGTGGGAATTGGTAAAGAATACGAAGGCAA
AACTGGTGGAGCAGAAGCAGATGAAAGAGGAGAAGAGGATGATCATGGTGGAGATCAGAGCGGAATCGGTCGAGTTGACTCGGTTATGGAACCCGTTTGTTGAACCGGGT
CAGTTGCTGTGGGTCCCATCGGGTGACGAACCTTTGGGCCCGGCCCTGCTTCCACTGACCGTTTCAACGGTAGGA
Protein sequenceShow/hide protein sequence
MGIDQKKKLKNLIDALKDKASIIKATFSTHRRSSSIKLAVVRATTHHPSNPPSDRRLAAVLALGHDFGRSTAIACIQTIMDRLHTTSSAVVAMKSLFTLHIVVIRGPFDL
RDQVVFCPYYGGRNFLNLSAFRDVSDSEMSDLSSWVRWYAGVVEHNVIVRRRLDRILYLRSGNCENEDKQGKISELDLWGELDVLVGFVEGICEFPDSLHLQKNDMVYEV
VRLVLENYRLVQREISVRVRGIGDRADSLSLDELTQLVVILTRFENCRRKLTVLFVNRAKNEDLWELVKNTKAKLVEQKQMKEEKRMIMVEIRAESVELTRLWNPFVEPG
QLLWVPSGDEPLGPALLPLTVSTVG