; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020654 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020654
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionENTH domain-containing protein
Genome locationtig00153552:893354..894408
RNA-Seq ExpressionSgr020654
SyntenySgr020654
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577320.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]4.1e-11465.65Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVR K  S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP  K L+VLLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI
        KNG FILQDQLSVFP TGGRNYLKLSDFRDSS+P++WELSSWVRWYAQYIET+L  SR+LG FF+GSS+S+ +REKK++QIS  L+SDLL+E ES++GLI
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI

Query:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE
        EE +K PH LHL GN LV+KI  FVG+DYLSA KEIS RVTE  +RL CLS                         S N     +  W    E +N+IGE
Subjt:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE

Query:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL
        SK+ RE GKL   +S +S+SGRFM++  A LY  S+RF S RFD +CK IPV+    SYLL
Subjt:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL

KAG7015410.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-11365.37Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVR K  S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP  K L+VLLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI
        KNG FILQDQLSVFP TGGRNYLKLSDFRDSS+P++WELSSWVRWYAQYIET+L  SR+LG FF+GSS+S+ +REKK++QIS   +SDLL+E ES++GLI
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI

Query:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE
        EE +K PH LHL GN LV+KI  FVG+DYLSA KEIS RVTE  +RL CLS                         S N     +  W    E +N+IGE
Subjt:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE

Query:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL
        SK+ RE GKL   +S +S+SGRFM++  A LY  S+RF S RFD +CK IPV+    SYLL
Subjt:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL

XP_022136401.1 putative clathrin assembly protein At4g40080 [Momordica charantia]3.3e-12470.43Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVR +  SGL+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP++KHLA LLSLGKTSRATAAAAIEVLMDRLQST NSAVALKCL+AVHHI+
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE
        K+GGFILQDQLSVFP TGGRNYLKLSDFRD+SSP++WELSSWVRWYAQYIET+LSTSR+LGFF+ SS+S E+REKKS+QISA+ +SDLLR+ ES+VGLIE
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE

Query:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSANRSS----------------------------------WETKNMIGES
        E+TKKPH LHL  NKLV++ILGFV DDYLSAMKEISVRVTE H+RLSCLS                                        WETKN+IGE+
Subjt:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSANRSS----------------------------------WETKNMIGES

Query:  KEKRE----GGKLRST---ISNSGRFMERANLYGDSLRFSSARFD
        KE R+    GGKL  T   +S+SGRFMERAN+Y DS+RF S RFD
Subjt:  KEKRE----GGKLRST---ISNSGRFMERANLYGDSLRFSSARFD

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata]2.6e-11365.37Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVR K  S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP  K L+VLLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI
        KNG FILQDQLSVFP TGGRNYLKLSDFRDSS+P++WELSSWVRWYAQYIET+L  SR+LG FF+GSS+S+ +REKK++QIS   +SDLL+E ES++GLI
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI

Query:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE
        EE +K PH LHL GN LV+KI  FVG+DYLSA KEIS RVTE  +RL CLS                         S N     +  W    E +N+IGE
Subjt:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE

Query:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL
        SK+ RE GKL   +S +S+SGRFM++  A LY  S+RF S RFD +CK IPV+    SYLL
Subjt:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]7.9e-12671.51Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV  KN S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP EKHL VLLSLGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIAVHHIV
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE
        KNGGFILQDQLSVFP TGGRNYLKLSDFRDSS+P++WELSSWVRWYAQYIET+LS SR+LGFF+GSSTS+E++EKK++QIS IL+SDLL+E ES+VGLIE
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE

Query:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS---------------------SANRSS---------W----ETKNMIGES
        E++K PH LHL GN+L +KI  FVGDDYLSAMKEIS+RVTE H+RLSCLS                     S   SS         W    ETKN+IGES
Subjt:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS---------------------SANRSS---------W----ETKNMIGES

Query:  KEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPV
        KE +EGGKL   +S +S+SGRFMER  A  Y DSLRF S RFDL+CK  PV
Subjt:  KEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPV

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein1.4e-11266.09Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV  K  S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHD  APP++KHL+ LLSLGKTSRATAA A+EVLMDRLQ+T NSAVALKCLIAVHHI 
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE
        K+G FILQDQLSVFP TGGRNYLKLSDFRDSS+P++W+LSSWVRWYAQYIET+LS SR+LGFF+GSS S+E++E+K++QIS IL+SDLL+E ES+VGLIE
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE

Query:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSAN-------------------------------------RSSWETKNMI
        E +K PH LHL  N+LV+KI  FVGDDYLSAMKEIS+RVTE H RL  LS A                                      RS  ETKN+ 
Subjt:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSAN-------------------------------------RSSWETKNMI

Query:  GESKEKREGGKLRST---ISNSGRFMERANL--YGDSLRFSSARFDLS
        GESKE REGGKL  T   +S+SGRFMER N   Y D LRF S RF L+
Subjt:  GESKEKREGGKLRST---ISNSGRFMERANL--YGDSLRFSSARFDLS

A0A1S3C1C0 putative clathrin assembly protein At4g400801.3e-11064.84Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        M+  K  S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP++KHL+ LLSLGKTSRATAAAA+EVLMDRLQ+T NSAVALKCLIAVHHI 
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE
        KNGGFILQDQLSVFP TGGRNYLKLSDFRDSS+P++WELSSWVRWYAQYIET+LS SR+LGF +GSS+S+E+ E+K++QIS I +S+LL++ ES+VGLIE
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE

Query:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSAN----------------------------------RSSWETKNMIGES
        E +K P  LHL  N+LV+KI GFVGDDYL+AMKEIS+RVTE H RL CLS                                      S  ETKN+IG S
Subjt:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSAN----------------------------------RSSWETKNMIGES

Query:  KEKREGGKL---RSTISNSGRFMERANL--YGDSLRFSSARFDLSCK
        KE R+G KL      IS+SGRF+ER+N   Y D L F S RF L+ K
Subjt:  KEKREGGKL---RSTISNSGRFMERANL--YGDSLRFSSARFDLSCK

A0A6J1C3E8 putative clathrin assembly protein At4g400801.6e-12470.43Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVR +  SGL+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP++KHLA LLSLGKTSRATAAAAIEVLMDRLQST NSAVALKCL+AVHHI+
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE
        K+GGFILQDQLSVFP TGGRNYLKLSDFRD+SSP++WELSSWVRWYAQYIET+LSTSR+LGFF+ SS+S E+REKKS+QISA+ +SDLLR+ ES+VGLIE
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIE

Query:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSANRSS----------------------------------WETKNMIGES
        E+TKKPH LHL  NKLV++ILGFV DDYLSAMKEISVRVTE H+RLSCLS                                        WETKN+IGE+
Subjt:  ESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLSSANRSS----------------------------------WETKNMIGES

Query:  KEKRE----GGKLRST---ISNSGRFMERANLYGDSLRFSSARFD
        KE R+    GGKL  T   +S+SGRFMERAN+Y DS+RF S RFD
Subjt:  KEKRE----GGKLRST---ISNSGRFMERANLYGDSLRFSSARFD

A0A6J1EP16 putative clathrin assembly protein At4g400801.3e-11365.37Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVR K  S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP  K L+VLLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI
        KNG FILQDQLSVFP TGGRNYLKLSDFRDSS+P++WELSSWVRWYAQYIET+L  SR+LG FF+GSS+S+ +REKK++QIS   +SDLL+E ES++GLI
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI

Query:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE
        EE +K PH LHL GN LV+KI  FVG+DYLSA KEIS RVTE  +RL CLS                         S N     +  W    E +N+IGE
Subjt:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE

Query:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL
        SK+ RE GKL   +S +S+SGRFM++  A LY  S+RF S RFD +CK IPV+    SYLL
Subjt:  SKEKREGGKL---RSTISNSGRFMER--ANLYGDSLRFSSARFDLSCKRIPVV----SYLL

A0A6J1JCT4 putative clathrin assembly protein At4g400803.5e-11164.82Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV  K  S L+G+IKDKASQSKAAL+AKPNILSFQLALLRATTHDP APP  K L+VLLS GKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHIV
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI
        KNG FILQDQLSVFP TGGRNYLKLSDFRDSS+P++WELSSWVRWYAQYIET+L  SR+LG FF+GSS+S+ +REKK++QIS   +SDLL+E ES++GLI
Subjt:  KNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLG-FFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI

Query:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE
        EE +K PH LHL GN LV+KI  FVG+DYLSA KEIS RVTE   RL CLS                         S N     +  W    E +N+IGE
Subjt:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS-------------------------SAN-----RSSW----ETKNMIGE

Query:  SKEKREGGKL---RSTISNSGRFMERANLYGD--SLRFSSARFDLSCKRIPVV----SYLL
        SK+ RE GKL   +S +S+SGRFM++ N   D  S+RF S RFD +CK IPV+    SYLL
Subjt:  SKEKREGGKL---RSTISNSGRFMERANLYGD--SLRFSSARFDLSCKRIPVV----SYLL

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104102.5e-2933.75Show/hide
Query:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFI
        ++G  KDKAS  KA LV       +    LALL++TT  P  PPN  +++ ++S   ++   A AA    + RL+ T+N+ VA K LI +H ++K+    
Subjt:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFI

Query:  LQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIEESTKKP
         +D+     L  GRN LKL++F D SS +T ELS W+RWY QY++ +    +VLG F     + +D+ ++ D++S+  +  ++R+ +S+V   E    +P
Subjt:  LQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIEESTKKP

Query:  HYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERL
            +  NK+V++I   V +DY   ++ + VR+    ERL
Subjt:  HYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERL

Q8L936 Putative clathrin assembly protein At4g400802.8e-7356.92Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVA---KPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVH
        M R  +F+ L+G IKDKASQSKAALV+   K   LSF L++LRATTHDP  PP  +HLAV+LS G  SRATA++A+E +M+RL +T ++ VALK LI +H
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVA---KPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVH

Query:  HIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  SP+ WELSSWVRWYA Y+E +LSTSR++GFFI S++S+  +E+  + +S++ +SDLLREI+++VG
Subjt:  HIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVG

Query:  LIEESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS
        L+EE+ K P      G  L +KI   VG+DY+S++ E+  R  E  ER + LS
Subjt:  LIEESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS

Q8LBH2 Putative clathrin assembly protein At2g016001.5e-1535.71Show/hide
Query:  GMIKDKASQSKAALV-AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATA--AAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFILQ
        G +KD    +K  LV          +A+++AT H    PP ++HL  + +    +RA A  A  I  L  RL  T+N  VALK LI +H +++ G    +
Subjt:  GMIKDKASQSKAALV-AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATA--AAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFILQ

Query:  DQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGF
        ++L  F   G    L+LS+F+D SSP+ W+ S+WVR YA ++E  L   RVL +
Subjt:  DQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGF

Q8VYT2 Putative clathrin assembly protein At4g259402.2e-1431.37Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPN--ILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKT--SRATAAAAIEVLMDRLQSTQNSAVALKCLIAV
        M    +F   +G IKD  + S    +AK N       +A+++AT H   A P E+H+  + S       RA  A  I  L  RL  T+N  VA+K LI +
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPN--ILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKT--SRATAAAAIEVLMDRLQSTQNSAVALKCLIAV

Query:  HHIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFI-------GSSTSSEDREKKSDQI--SAILSSD
        H  ++ G    +++L  +   G  + L++S+F+D +SP+ W+ S+W+R YA ++E  L   RVL + I       GS  SS++ +  + Q   + +LS +
Subjt:  HHIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFI-------GSSTSSEDREKKSDQI--SAILSSD

Query:  LLRE
         L E
Subjt:  LLRE

Q9FKQ2 Putative clathrin assembly protein At5g653702.5e-2631.43Show/hide
Query:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVK-----
        L G++KD+ASQ K  +V   +  N  +  LALL+AT+H    PP++K++  L S   T        ++ ++ RL+ T +  VA KCLI +H +VK     
Subjt:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVK-----

Query:  NGGFILQDQLSVFPL--TGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI
        NG   L++ ++   L  T G + LKL+D   +SS  T EL+ WV+WY QY++  LS + VLG        +ED+  ++ ++S+     +L++I+ +V L 
Subjt:  NGGFILQDQLSVFPL--TGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI

Query:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMK-------EISVRVTESHERL-------SCLSSANRSSWETKNMIGE
        E  + +P     K NK+V ++   +  DY SA++       E++VRV + +E +       +C    +  SW +K +I +
Subjt:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMK-------EISVRVTESHERL-------SCLSSANRSSWETKNMIGE

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein1.1e-1635.71Show/hide
Query:  GMIKDKASQSKAALV-AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATA--AAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFILQ
        G +KD    +K  LV          +A+++AT H    PP ++HL  + +    +RA A  A  I  L  RL  T+N  VALK LI +H +++ G    +
Subjt:  GMIKDKASQSKAALV-AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATA--AAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFILQ

Query:  DQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGF
        ++L  F   G    L+LS+F+D SSP+ W+ S+WVR YA ++E  L   RVL +
Subjt:  DQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGF

AT4G25940.1 ENTH/ANTH/VHS superfamily protein1.6e-1531.37Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVAKPN--ILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKT--SRATAAAAIEVLMDRLQSTQNSAVALKCLIAV
        M    +F   +G IKD  + S    +AK N       +A+++AT H   A P E+H+  + S       RA  A  I  L  RL  T+N  VA+K LI +
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVAKPN--ILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKT--SRATAAAAIEVLMDRLQSTQNSAVALKCLIAV

Query:  HHIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFI-------GSSTSSEDREKKSDQI--SAILSSD
        H  ++ G    +++L  +   G  + L++S+F+D +SP+ W+ S+W+R YA ++E  L   RVL + I       GS  SS++ +  + Q   + +LS +
Subjt:  HHIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFI-------GSSTSSEDREKKSDQI--SAILSSD

Query:  LLRE
         L E
Subjt:  LLRE

AT4G40080.1 ENTH/ANTH/VHS superfamily protein2.0e-7456.92Show/hide
Query:  MVRAKNFSGLMGMIKDKASQSKAALVA---KPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVH
        M R  +F+ L+G IKDKASQSKAALV+   K   LSF L++LRATTHDP  PP  +HLAV+LS G  SRATA++A+E +M+RL +T ++ VALK LI +H
Subjt:  MVRAKNFSGLMGMIKDKASQSKAALVA---KPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVH

Query:  HIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  SP+ WELSSWVRWYA Y+E +LSTSR++GFFI S++S+  +E+  + +S++ +SDLLREI+++VG
Subjt:  HIVKNGGFILQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVG

Query:  LIEESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS
        L+EE+ K P      G  L +KI   VG+DY+S++ E+  R  E  ER + LS
Subjt:  LIEESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERLSCLS

AT5G10410.1 ENTH/ANTH/VHS superfamily protein1.7e-3033.75Show/hide
Query:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFI
        ++G  KDKAS  KA LV       +    LALL++TT  P  PPN  +++ ++S   ++   A AA    + RL+ T+N+ VA K LI +H ++K+    
Subjt:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFI

Query:  LQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIEESTKKP
         +D+     L  GRN LKL++F D SS +T ELS W+RWY QY++ +    +VLG F     + +D+ ++ D++S+  +  ++R+ +S+V   E    +P
Subjt:  LQDQLSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIEESTKKP

Query:  HYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERL
            +  NK+V++I   V +DY   ++ + VR+    ERL
Subjt:  HYLHLKGNKLVEKILGFVGDDYLSAMKEISVRVTESHERL

AT5G65370.1 ENTH/ANTH/VHS superfamily protein1.8e-2731.43Show/hide
Query:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVK-----
        L G++KD+ASQ K  +V   +  N  +  LALL+AT+H    PP++K++  L S   T        ++ ++ RL+ T +  VA KCLI +H +VK     
Subjt:  LMGMIKDKASQSKAALV---AKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVK-----

Query:  NGGFILQDQLSVFPL--TGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI
        NG   L++ ++   L  T G + LKL+D   +SS  T EL+ WV+WY QY++  LS + VLG        +ED+  ++ ++S+     +L++I+ +V L 
Subjt:  NGGFILQDQLSVFPL--TGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLI

Query:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMK-------EISVRVTESHERL-------SCLSSANRSSWETKNMIGE
        E  + +P     K NK+V ++   +  DY SA++       E++VRV + +E +       +C    +  SW +K +I +
Subjt:  EESTKKPHYLHLKGNKLVEKILGFVGDDYLSAMK-------EISVRVTESHERL-------SCLSSANRSSWETKNMIGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCGCGCTAAAAATTTCAGTGGGCTCATGGGAATGATCAAAGACAAAGCTTCCCAGAGCAAGGCGGCGCTGGTGGCCAAGCCCAACATTCTCTCCTTCCAACTCGC
CCTCCTCAGAGCCACCACCCACGACCCCTTGGCGCCGCCGAACGAAAAGCACCTCGCTGTCCTTCTCTCTCTCGGGAAAACCTCACGCGCCACCGCCGCCGCCGCCATTG
AAGTCTTGATGGACCGCCTTCAAAGCACCCAAAACTCCGCCGTCGCCCTTAAGTGCCTCATCGCAGTCCACCACATCGTTAAGAACGGTGGCTTCATTCTACAAGACCAA
CTCTCTGTTTTTCCGTTAACCGGCGGCAGAAATTACCTCAAGCTCTCCGACTTCCGGGACAGTTCGAGTCCGGTAACGTGGGAGCTTTCTTCTTGGGTCCGATGGTACGC
CCAGTACATCGAAACCATCCTCTCTACCTCACGTGTTTTGGGGTTCTTCATTGGTTCTTCAACTTCAAGCGAAGATAGGGAGAAAAAATCCGACCAGATCTCGGCGATTT
TGAGCTCCGATTTACTCAGAGAGATCGAATCTATGGTGGGTTTAATCGAAGAAAGTACGAAAAAGCCTCATTATTTGCATCTGAAAGGCAACAAGTTGGTGGAGAAAATC
CTCGGCTTTGTCGGTGACGATTACTTGTCGGCTATGAAAGAAATTTCTGTCCGAGTTACCGAGTCTCATGAGCGGCTGAGTTGCCTGAGTTCGGCGAATCGGTCGAGTTG
GGAGACGAAGAATATGATTGGGGAGTCCAAGGAAAAGCGAGAGGGCGGTAAATTGAGGAGCACGATAAGCAATTCGGGTCGGTTCATGGAGCGGGCTAATCTTTACGGCG
ACTCGCTCCGGTTCAGTTCGGCGAGGTTCGATCTCAGCTGCAAACGGATTCCGGTTGTATCATATTTGCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCGCGCTAAAAATTTCAGTGGGCTCATGGGAATGATCAAAGACAAAGCTTCCCAGAGCAAGGCGGCGCTGGTGGCCAAGCCCAACATTCTCTCCTTCCAACTCGC
CCTCCTCAGAGCCACCACCCACGACCCCTTGGCGCCGCCGAACGAAAAGCACCTCGCTGTCCTTCTCTCTCTCGGGAAAACCTCACGCGCCACCGCCGCCGCCGCCATTG
AAGTCTTGATGGACCGCCTTCAAAGCACCCAAAACTCCGCCGTCGCCCTTAAGTGCCTCATCGCAGTCCACCACATCGTTAAGAACGGTGGCTTCATTCTACAAGACCAA
CTCTCTGTTTTTCCGTTAACCGGCGGCAGAAATTACCTCAAGCTCTCCGACTTCCGGGACAGTTCGAGTCCGGTAACGTGGGAGCTTTCTTCTTGGGTCCGATGGTACGC
CCAGTACATCGAAACCATCCTCTCTACCTCACGTGTTTTGGGGTTCTTCATTGGTTCTTCAACTTCAAGCGAAGATAGGGAGAAAAAATCCGACCAGATCTCGGCGATTT
TGAGCTCCGATTTACTCAGAGAGATCGAATCTATGGTGGGTTTAATCGAAGAAAGTACGAAAAAGCCTCATTATTTGCATCTGAAAGGCAACAAGTTGGTGGAGAAAATC
CTCGGCTTTGTCGGTGACGATTACTTGTCGGCTATGAAAGAAATTTCTGTCCGAGTTACCGAGTCTCATGAGCGGCTGAGTTGCCTGAGTTCGGCGAATCGGTCGAGTTG
GGAGACGAAGAATATGATTGGGGAGTCCAAGGAAAAGCGAGAGGGCGGTAAATTGAGGAGCACGATAAGCAATTCGGGTCGGTTCATGGAGCGGGCTAATCTTTACGGCG
ACTCGCTCCGGTTCAGTTCGGCGAGGTTCGATCTCAGCTGCAAACGGATTCCGGTTGTATCATATTTGCTGTAA
Protein sequenceShow/hide protein sequence
MVRAKNFSGLMGMIKDKASQSKAALVAKPNILSFQLALLRATTHDPLAPPNEKHLAVLLSLGKTSRATAAAAIEVLMDRLQSTQNSAVALKCLIAVHHIVKNGGFILQDQ
LSVFPLTGGRNYLKLSDFRDSSSPVTWELSSWVRWYAQYIETILSTSRVLGFFIGSSTSSEDREKKSDQISAILSSDLLREIESMVGLIEESTKKPHYLHLKGNKLVEKI
LGFVGDDYLSAMKEISVRVTESHERLSCLSSANRSSWETKNMIGESKEKREGGKLRSTISNSGRFMERANLYGDSLRFSSARFDLSCKRIPVVSYLL