; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G08880 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G08880
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionENTH domain-containing protein
Genome locationChr4:6643185..6644623
RNA-Seq ExpressionCSPI04G08880
SyntenyCSPI04G08880
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045306.1 putative clathrin assembly protein [Cucumis melo var. makuwa]1.1e-16687.54Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISR LGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMK+ISIRVTEFHHRLG LSFGESVELVCALKRL+DCKEKQSMGIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ERSNASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ

XP_004137285.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]4.0e-19899.45Show/hide
Query:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
        MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
Subjt:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC

Query:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
        LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
Subjt:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET

Query:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
        ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSF ESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
Subjt:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS

Query:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ
        IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMER NASSYRDLLRFGSERFVLTYDGFQ
Subjt:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ

XP_008455635.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]3.1e-16687.54Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMKEISIRVTEFHHRLG LSFGESVELVCALKRL+D KEKQS+GIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ERSNASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]2.5e-15279.72Show/hide
Query:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
        +  S+ LSMV TKKLSSLIGLIKDKASQSKAALLAKPNI+SFQLALLRATTHD HAPP+ K LS LLSLGKTSRATAA A+EVLMDRLQ+T NSAVALKC
Subjt:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC

Query:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILG-FFVGSSRSNEEKERKTEQISGILNSDLLKE
        LIA+HHI K+GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVL ISRILG FFVGSS SN E+E+KTEQISG  NSDLLKE
Subjt:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILG-FFVGSSRSNEEKERKTEQISGILNSDLLKE

Query:  TESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIR
        TESL+GLIEE+SK+PHCLHLN N LVDKIY+FVG+DYLSA KEIS RVTEF  RLG LSFGESVELVCALKRLEDCKEKQS GI   +E+L+ G WGSIR
Subjt:  TESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIR

Query:  SIQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDG
           E +NL GESK+HRE GKL +TK R+SDSGRFM++ NA  YR  +RFGSERF  T  G
Subjt:  SIQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDG

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]3.7e-16487.22Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        MV+TK LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPP +KHL  LLSLGKTSRATAA AVEVLMDRLQTT NSAVALKCLIAVHHI 
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGFFVGSS SNEEKE+KTEQISGILNSDLLKETESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        E SKMPHCLHLN NRL DKIY+FVGDDYLSAMKEISIRVTEFH RL  LSFGESVELVCALKRLEDCKEKQS GI +KYEVL+D  WGSIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGF
        GESKE++EGGKL +TK R+SDSGRFMER+ A SYRD LRFGSERF LT  GF
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGF

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein1.9e-19899.45Show/hide
Query:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
        MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
Subjt:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC

Query:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
        LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
Subjt:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET

Query:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
        ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSF ESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
Subjt:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS

Query:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ
        IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMER NASSYRDLLRFGSERFVLTYDGFQ
Subjt:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ

A0A1S3C1C0 putative clathrin assembly protein At4g400801.5e-16687.54Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMKEISIRVTEFHHRLG LSFGESVELVCALKRL+D KEKQS+GIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ERSNASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ

A0A5A7TT50 Putative clathrin assembly protein5.1e-16787.54Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISR LGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMK+ISIRVTEFHHRLG LSFGESVELVCALKRL+DCKEKQSMGIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ERSNASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDGFQ

A0A6J1EP16 putative clathrin assembly protein At4g400807.9e-15280Show/hide
Query:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
        +  S  LSMV TKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPP+ K LS LLSLGKTSRATAA A+EVLMDRLQ+T NSAVALKC
Subjt:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC

Query:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILG-FFVGSSRSNEEKERKTEQISGILNSDLLKE
        LIA+HHI K+GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVL ISRILG FFVGSS SN E+E+KTEQISG  NSDLLKE
Subjt:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILG-FFVGSSRSNEEKERKTEQISGILNSDLLKE

Query:  TESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIR
        TESL+GLIEE+SKMPHCLHLN N LVDKIY+FVG+DYLSA KEIS RVTEF  RLG LSFGESVELVCALKRLEDCKEKQS GI   +E+L+ G WGSIR
Subjt:  TESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIR

Query:  SIQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDG
           E +NL GESK+ RE GKL +TK R+SDSGRFM++ NA  YR  +RFGSERF  T  G
Subjt:  SIQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDG

A0A6J1JCT4 putative clathrin assembly protein At4g400803.9e-15179.72Show/hide
Query:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
        ++ S  LSMV+TKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPP  K LS LLS GKTSRATAA A+EVLMDRLQ+T NSAVALKC
Subjt:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC

Query:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILG-FFVGSSRSNEEKERKTEQISGILNSDLLKE
        LIA+HHI K+GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVL ISRILG FFVGSS SN E+E+KTEQISG  NSDLLKE
Subjt:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILG-FFVGSSRSNEEKERKTEQISGILNSDLLKE

Query:  TESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIR
        TESL+GLIEE+SKMPHCLHLN N LVDKIY+FVG+DYLSA KEIS RVTEF HRLG LSFGESVELVCALKRLEDCKEKQS GI   +E+L+ G WGSIR
Subjt:  TESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIR

Query:  SIQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDG
           E +NL GESK+ RE GKL +TK R+SDSGRFM++ NA   R  +RFGSERF  T  G
Subjt:  SIQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERSNASSYRDLLRFGSERFVLTYDG

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104101.1e-3030.67Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI
        +IG  KDKAS  KA L+       +    LALL++TT   + PP+  ++SA++S   +  A A  A    + RL+ T N+ VA K LI +H + K     
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP
         +D+        GRN LKL++F D S+ ++ +LS W+RWY QY++ +  + ++LG F     + ++K  + +++S      ++++T+SLV   E I   P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP

Query:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL---GWLSFGE--SVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---ETKN
            + +N++VD+I   V +DY   ++ + +R+     RL   G    G+    +    L RL +CKE  S G+F +   L D  W  +  ++   E KN
Subjt:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL---GWLSFGE--SVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---ETKN

Q8L936 Putative clathrin assembly protein At4g400801.7e-8250Show/hide
Query:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD
        + LIG IKDKASQSKAAL++   K   LSF L++LRATTHD   PP ++HL+ +LS G  SRATA+ AVE +M+RL TT ++ VALK LI +HHI K G 
Subjt:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK
        FILQDQLSVFP +GGRNYLKLS FRD  +P+ W+LSSWVRWYA Y+E +LS SRI+GFF+ S+ S   KE   E +S + NSDLL+E ++LVGL+EE  K
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK

Query:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES
        +P         L DKI   VG+DY+S++ E+  R  EF  R   LSFG+++ELVCALKRLE CKE+ S      ++   IDG WG    + E K + G  
Subjt:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES

Query:  KEHREGGKLCKT------KRRVSDSGRFMERSNASSYRDLLRFGSERF
        +++   G++ K+      + +  +S RF +R     Y + +RF S RF
Subjt:  KEHREGGKLCKT------KRRVSDSGRFMERSNASSYRDLLRFGSERF

Q8LBH2 Putative clathrin assembly protein At2g016008.4e-1831.61Show/hide
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD
        G +KD  S     +          +A+++AT H +  PP D+HL  + +    +RA A  A  +  L  RL  T N  VALK LI +H + ++GD   ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+WD S+WVR YA ++E  L   R+L +   + R   SN  +++   +   +   +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI

Q9FKQ2 Putative clathrin assembly protein At5g653705.5e-3333.94Show/hide
Query:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPSDK+++ L S   T        V+ ++ RL+ T +  VA KCLI +H + K 
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-

Query:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL
            +G+  L++ ++     +T G + LKL+D   +S+  + +L+ WV+WY QY++  LSI+ +LG        NE+K  +T+++S      +LK+ + L
Subjt:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL

Query:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKE
        V L E IS  P       N++V ++   +  DY SA++ + IR  E + R+      +  ELV  L++LE+CKE
Subjt:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKE

Q9LVD8 Putative clathrin assembly protein At5g572003.2e-1730.46Show/hide
Query:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL
        G +KD  +      LAK N       +A+++AT H + +PP ++H+  + S       RA  A  +  L  RL  T N  VA+K LI +H   ++GD   
Subjt:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL

Query:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI
        +++L    ++  R+ L++S+F+D ++P++WD S+WVR YA ++E  L   R+L + + + R         K  +T  +SG    DLL++  +L  L+
Subjt:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein6.0e-1931.61Show/hide
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD
        G +KD  S     +          +A+++AT H +  PP D+HL  + +    +RA A  A  +  L  RL  T N  VALK LI +H + ++GD   ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+WD S+WVR YA ++E  L   R+L +   + R   SN  +++   +   +   +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI

AT4G40080.1 ENTH/ANTH/VHS superfamily protein1.2e-8350Show/hide
Query:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD
        + LIG IKDKASQSKAAL++   K   LSF L++LRATTHD   PP ++HL+ +LS G  SRATA+ AVE +M+RL TT ++ VALK LI +HHI K G 
Subjt:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK
        FILQDQLSVFP +GGRNYLKLS FRD  +P+ W+LSSWVRWYA Y+E +LS SRI+GFF+ S+ S   KE   E +S + NSDLL+E ++LVGL+EE  K
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK

Query:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES
        +P         L DKI   VG+DY+S++ E+  R  EF  R   LSFG+++ELVCALKRLE CKE+ S      ++   IDG WG    + E K + G  
Subjt:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES

Query:  KEHREGGKLCKT------KRRVSDSGRFMERSNASSYRDLLRFGSERF
        +++   G++ K+      + +  +S RF +R     Y + +RF S RF
Subjt:  KEHREGGKLCKT------KRRVSDSGRFMERSNASSYRDLLRFGSERF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein8.1e-3230.67Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI
        +IG  KDKAS  KA L+       +    LALL++TT   + PP+  ++SA++S   +  A A  A    + RL+ T N+ VA K LI +H + K     
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP
         +D+        GRN LKL++F D S+ ++ +LS W+RWY QY++ +  + ++LG F     + ++K  + +++S      ++++T+SLV   E I   P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP

Query:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL---GWLSFGE--SVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---ETKN
            + +N++VD+I   V +DY   ++ + +R+     RL   G    G+    +    L RL +CKE  S G+F +   L D  W  +  ++   E KN
Subjt:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL---GWLSFGE--SVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---ETKN

AT5G57200.1 ENTH/ANTH/VHS superfamily protein2.3e-1830.46Show/hide
Query:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL
        G +KD  +      LAK N       +A+++AT H + +PP ++H+  + S       RA  A  +  L  RL  T N  VA+K LI +H   ++GD   
Subjt:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL

Query:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI
        +++L    ++  R+ L++S+F+D ++P++WD S+WVR YA ++E  L   R+L + + + R         K  +T  +SG    DLL++  +L  L+
Subjt:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein3.9e-3433.94Show/hide
Query:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPSDK+++ L S   T        V+ ++ RL+ T +  VA KCLI +H + K 
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-

Query:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL
            +G+  L++ ++     +T G + LKL+D   +S+  + +L+ WV+WY QY++  LSI+ +LG        NE+K  +T+++S      +LK+ + L
Subjt:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL

Query:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKE
        V L E IS  P       N++V ++   +  DY SA++ + IR  E + R+      +  ELV  L++LE+CKE
Subjt:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGATTTCTCTCAGCCTTTCAATGGTGAACACAAAAAAACTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCCCTTCTGGCAAAGCC
AAACATACTGTCCTTTCAACTCGCTCTCCTCCGAGCCACCACTCACGACCTCCACGCGCCACCCAGCGACAAGCACCTCTCTGCCCTTCTCTCTCTTGGCAAAACCTCAC
GCGCCACCGCCGCTCCTGCCGTTGAAGTCTTAATGGACCGCCTCCAAACCACCCATAACTCCGCCGTCGCTCTCAAGTGCCTTATCGCCGTCCATCACATCTTCAAGGAT
GGCGACTTTATTCTTCAAGACCAGCTCTCTGTTTTTCCCTTCACTGGTGGTAGAAACTACCTCAAACTCTCTGATTTCCGCGACAGTTCCAATCCCATCTCTTGGGACCT
TTCCTCTTGGGTCCGATGGTACGCTCAGTACATCGAAACTGTTTTGTCTATTTCCCGGATTTTGGGGTTTTTTGTCGGTTCTTCAAGGTCCAACGAAGAGAAGGAGAGAA
AAACAGAACAGATTTCAGGGATTTTGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAATTTCGAAAATGCCTCACTGTTTGCATCTGAAT
AGAAACAGATTGGTGGATAAGATCTACAGCTTTGTCGGTGATGATTATTTGTCGGCCATGAAGGAAATTTCAATCCGAGTTACCGAGTTTCACCACCGGCTCGGTTGGCT
CAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATTTTTGCAAAGTACGAAGTTTTGATAGATGGAC
TTTGGGGTTCCATCCGTTCCATCCAAGAGACCAAGAATTTGACTGGGGAATCGAAGGAACATCGAGAGGGCGGTAAATTGTGCAAGACGAAGAGGAGGGTCAGCGACTCG
GGCCGGTTTATGGAGCGGTCTAATGCTAGTTCTTATCGTGACCTTCTTAGATTCGGGTCGGAACGGTTCGTTTTAACCTACGACGGTTTCCAGTAA
mRNA sequenceShow/hide mRNA sequence
GAAGACCAAAACCGACTTTTGAAAGGCCAAATATTAAAATACATCCAATTGAAAATTAATAATAAATAAAAGAAATTGTACTCATTTCCAAATCAAATCAATTTCTCCAT
AAGTAAAAATAATTTATTCTCTGTTTGTTAGATGTTGATTTCTCTCAGCCTTTCAATGGTGAACACAAAAAAACTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCT
CTCAAAGCAAAGCCGCCCTTCTGGCAAAGCCAAACATACTGTCCTTTCAACTCGCTCTCCTCCGAGCCACCACTCACGACCTCCACGCGCCACCCAGCGACAAGCACCTC
TCTGCCCTTCTCTCTCTTGGCAAAACCTCACGCGCCACCGCCGCTCCTGCCGTTGAAGTCTTAATGGACCGCCTCCAAACCACCCATAACTCCGCCGTCGCTCTCAAGTG
CCTTATCGCCGTCCATCACATCTTCAAGGATGGCGACTTTATTCTTCAAGACCAGCTCTCTGTTTTTCCCTTCACTGGTGGTAGAAACTACCTCAAACTCTCTGATTTCC
GCGACAGTTCCAATCCCATCTCTTGGGACCTTTCCTCTTGGGTCCGATGGTACGCTCAGTACATCGAAACTGTTTTGTCTATTTCCCGGATTTTGGGGTTTTTTGTCGGT
TCTTCAAGGTCCAACGAAGAGAAGGAGAGAAAAACAGAACAGATTTCAGGGATTTTGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAAT
TTCGAAAATGCCTCACTGTTTGCATCTGAATAGAAACAGATTGGTGGATAAGATCTACAGCTTTGTCGGTGATGATTATTTGTCGGCCATGAAGGAAATTTCAATCCGAG
TTACCGAGTTTCACCACCGGCTCGGTTGGCTCAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATT
TTTGCAAAGTACGAAGTTTTGATAGATGGACTTTGGGGTTCCATCCGTTCCATCCAAGAGACCAAGAATTTGACTGGGGAATCGAAGGAACATCGAGAGGGCGGTAAATT
GTGCAAGACGAAGAGGAGGGTCAGCGACTCGGGCCGGTTTATGGAGCGGTCTAATGCTAGTTCTTATCGTGACCTTCTTAGATTCGGGTCGGAACGGTTCGTTTTAACCT
ACGACGGTTTCCAGTAATACCTATACCGGAATCGTAGTTACTACTTGCTACCAAAATATAATTATGGAAAAATGAGGTATGTAGCATCCATTTAAGTTTCATTTTGATAT
GTGACAAAATATAGTAACCTTCTGAATTTTAGCCAAATTTATGTATGTCACGTGGACTACCAATTTATCTCTTGGTTGTCCCTGCAGTGTAATTATTTTAATATTTCCTC
TCATGAATG
Protein sequenceShow/hide protein sequence
MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKD
GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMPHCLHLN
RNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFGESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLTGESKEHREGGKLCKTKRRVSDS
GRFMERSNASSYRDLLRFGSERFVLTYDGFQ