; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G008700 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G008700
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionENTH domain-containing protein
Genome locationGy14Chr4:6720334..6721785
RNA-Seq ExpressionCsGy4G008700
SyntenyCsGy4G008700
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045306.1 putative clathrin assembly protein [Cucumis melo var. makuwa]2.63e-21186.97Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISR LGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMK+ISIRVTEFHHRLG LSF ESVELVCALKRL+DCKEKQSMGIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ER NASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ

XP_004137285.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]8.97e-256100Show/hide
Query:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
        MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
Subjt:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC

Query:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
        LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
Subjt:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET

Query:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
        ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
Subjt:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS

Query:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
        IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
Subjt:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ

XP_008455635.1 PREDICTED: putative clathrin assembly protein At4g40080 [Cucumis melo]1.07e-21086.97Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMKEISIRVTEFHHRLG LSF ESVELVCALKRL+D KEKQS+GIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ER NASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]8.47e-19380.11Show/hide
Query:  SLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIA
        S+ LSMV TKKLSSLIGLIKDKASQSKAALLAKPNI+SFQLALLRATTHD HAPP+ K LS LLSLGKTSRATAA A+EVLMDRLQ+T NSAVALKCLIA
Subjt:  SLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIA

Query:  VHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFF-VGSSRSNEEKERKTEQISGILNSDLLKETES
        +HHI K+GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVL ISRILGFF VGSS SN E+E+KTEQISG  NSDLLKETES
Subjt:  VHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFF-VGSSRSNEEKERKTEQISGILNSDLLKETES

Query:  LVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ
        L+GLIEE+SK+PHCLHLN N LVDKIY+FVG+DYLSA KEIS RVTEF  RLG LSF ESVELVCALKRLEDCKEKQS GI   +E+L+ G WGSIR I+
Subjt:  LVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ

Query:  ETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDG
           NL GESK+HRE GKL +TK R+SDSGRFM++ NA  YR  +RFGSERF  T  G
Subjt:  ETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDG

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]4.54e-20886.93Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        MV+TK LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPP +KHL  LLSLGKTSRATAA AVEVLMDRLQTT NSAVALKCLIAVHHI 
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGFFVGSS SNEEKE+KTEQISGILNSDLLKETESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        E SKMPHCLHLN NRL DKIY+FVGDDYLSAMKEISIRVTEFH RL  LSF ESVELVCALKRLEDCKEKQS GI +KYEVL+D  WGSIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGF
        GESKE++EGGKL +TK R+SDSGRFMER  A SYRD LRFGSERF LT  GF
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGF

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein4.34e-256100Show/hide
Query:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
        MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC
Subjt:  MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKC

Query:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
        LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET
Subjt:  LIAVHHIFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKET

Query:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
        ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS
Subjt:  ESLVGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRS

Query:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
        IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
Subjt:  IQETKNLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ

A0A1S3C1C0 putative clathrin assembly protein At4g400805.17e-21186.97Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMKEISIRVTEFHHRLG LSF ESVELVCALKRL+D KEKQS+GIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ER NASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ

A0A5A7TT50 Putative clathrin assembly protein1.27e-21186.97Show/hide
Query:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF
        M+NTK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPSDKHLSALLSLGKTSRATAA AVEVLMDRLQTTHNSAVALKCLIAVHHIF
Subjt:  MVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIF

Query:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISR LGF VGSS SNEE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIE

Query:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT
        EISKMP CLHLNRNRLVDKIY FVGDDYL+AMK+ISIRVTEFHHRLG LSF ESVELVCALKRL+DCKEKQSMGIFA+YEVL+DG W SIR   ETKNL 
Subjt:  EISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLT

Query:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ
        G SKE+R+G KL + +RR+SDSGRF+ER NASSY D+L F SERF LTY GFQ
Subjt:  GESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDGFQ

A0A6J1EP16 putative clathrin assembly protein At4g400804.76e-19280.79Show/hide
Query:  LSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHH
        LSMV TKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPP+ K LS LLSLGKTSRATAA A+EVLMDRLQ+T NSAVALKCLIA+HH
Subjt:  LSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHH

Query:  IFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFF-VGSSRSNEEKERKTEQISGILNSDLLKETESLVG
        I K+GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVL ISRILGFF VGSS SN E+E+KTEQISG  NSDLLKETESL+G
Subjt:  IFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFF-VGSSRSNEEKERKTEQISGILNSDLLKETESLVG

Query:  LIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETK
        LIEE+SKMPHCLHLN N LVDKIY+FVG+DYLSA KEIS RVTEF  RLG LSF ESVELVCALKRLEDCKEKQS GI   +E+L+ G WGSIR I+   
Subjt:  LIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETK

Query:  NLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDG
        NL GESK+ RE GKL +TK R+SDSGRFM++ NA  YR  +RFGSERF  T  G
Subjt:  NLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDG

A0A6J1JCT4 putative clathrin assembly protein At4g400809.40e-19180.51Show/hide
Query:  LSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHH
        LSMV+TKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPP  K LS LLS GKTSRATAA A+EVLMDRLQ+T NSAVALKCLIA+HH
Subjt:  LSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHH

Query:  IFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFF-VGSSRSNEEKERKTEQISGILNSDLLKETESLVG
        I K+GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVL ISRILGFF VGSS SN E+E+KTEQISG  NSDLLKETESL+G
Subjt:  IFKDGDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFF-VGSSRSNEEKERKTEQISGILNSDLLKETESLVG

Query:  LIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETK
        LIEE+SKMPHCLHLN N LVDKIY+FVG+DYLSA KEIS RVTEF HRLG LSF ESVELVCALKRLEDCKEKQS GI   +E+L+ G WGSIR I+   
Subjt:  LIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETK

Query:  NLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDG
        NL GESK+ RE GKL +TK R+SDSGRFM++ NA   R  +RFGSERF  T  G
Subjt:  NLTGESKEHREGGKLCKTKRRVSDSGRFMERPNASSYRDLLRFGSERFVLTYDG

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104105.6e-3030.69Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI
        +IG  KDKAS  KA L+       +    LALL++TT   + PP+  ++SA++S   +  A A  A    + RL+ T N+ VA K LI +H + K     
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP
         +D+        GRN LKL++F D S+ ++ +LS W+RWY QY++ +  + ++LG F     + ++K  + +++S      ++++T+SLV   E I   P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP

Query:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL--------GWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---E
            + +N++VD+I   V +DY   ++ + +R+     RL        G L   +   L   L RL +CKE  S G+F +   L D  W  +  ++   E
Subjt:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL--------GWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---E

Query:  TKN
         KN
Subjt:  TKN

Q8L936 Putative clathrin assembly protein At4g400801.1e-8149.71Show/hide
Query:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD
        + LIG IKDKASQSKAAL++   K   LSF L++LRATTHD   PP ++HL+ +LS G  SRATA+ AVE +M+RL TT ++ VALK LI +HHI K G 
Subjt:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK
        FILQDQLSVFP +GGRNYLKLS FRD  +P+ W+LSSWVRWYA Y+E +LS SRI+GFF+ S+ S   KE   E +S + NSDLL+E ++LVGL+EE  K
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK

Query:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES
        +P         L DKI   VG+DY+S++ E+  R  EF  R   LSF +++ELVCALKRLE CKE+ S      ++   IDG WG    + E K + G  
Subjt:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES

Query:  KEHREGGKLCKT------KRRVSDSGRFMERPNASSYRDLLRFGSERF
        +++   G++ K+      + +  +S RF +R     Y + +RF S RF
Subjt:  KEHREGGKLCKT------KRRVSDSGRFMERPNASSYRDLLRFGSERF

Q8LBH2 Putative clathrin assembly protein At2g016008.4e-1831.61Show/hide
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD
        G +KD  S     +          +A+++AT H +  PP D+HL  + +    +RA A  A  +  L  RL  T N  VALK LI +H + ++GD   ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+WD S+WVR YA ++E  L   R+L +   + R   SN  +++   +   +   +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI

Q9FKQ2 Putative clathrin assembly protein At5g653702.4e-3334.31Show/hide
Query:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPSDK+++ L S   T        V+ ++ RL+ T +  VA KCLI +H + K 
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-

Query:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL
            +G+  L++ ++     +T G + LKL+D   +S+  + +L+ WV+WY QY++  LSI+ +LG        NE+K  +T+++S      +LK+ + L
Subjt:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL

Query:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKE
        V L E IS  P       N++V ++   +  DY SA++ + IR  E + R+     A+  ELV  L++LE+CKE
Subjt:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKE

Q9LVD8 Putative clathrin assembly protein At5g572003.2e-1730.46Show/hide
Query:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL
        G +KD  +      LAK N       +A+++AT H + +PP ++H+  + S       RA  A  +  L  RL  T N  VA+K LI +H   ++GD   
Subjt:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL

Query:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI
        +++L    ++  R+ L++S+F+D ++P++WD S+WVR YA ++E  L   R+L + + + R         K  +T  +SG    DLL++  +L  L+
Subjt:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein6.0e-1931.61Show/hide
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD
        G +KD  S     +          +A+++AT H +  PP D+HL  + +    +RA A  A  +  L  RL  T N  VALK LI +H + ++GD   ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATA--APAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+WD S+WVR YA ++E  L   R+L +   + R   SN  +++   +   +   +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR---SNEEKERKTEQISGILNSDLLKETESLVGLI

AT4G40080.1 ENTH/ANTH/VHS superfamily protein7.7e-8349.71Show/hide
Query:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD
        + LIG IKDKASQSKAAL++   K   LSF L++LRATTHD   PP ++HL+ +LS G  SRATA+ AVE +M+RL TT ++ VALK LI +HHI K G 
Subjt:  SSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGD

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK
        FILQDQLSVFP +GGRNYLKLS FRD  +P+ W+LSSWVRWYA Y+E +LS SRI+GFF+ S+ S   KE   E +S + NSDLL+E ++LVGL+EE  K
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISK

Query:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES
        +P         L DKI   VG+DY+S++ E+  R  EF  R   LSF +++ELVCALKRLE CKE+ S      ++   IDG WG    + E K + G  
Subjt:  MPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYE-VLIDGLWGSIRSIQETKNLTGES

Query:  KEHREGGKLCKT------KRRVSDSGRFMERPNASSYRDLLRFGSERF
        +++   G++ K+      + +  +S RF +R     Y + +RF S RF
Subjt:  KEHREGGKLCKT------KRRVSDSGRFMERPNASSYRDLLRFGSERF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein4.0e-3130.69Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI
        +IG  KDKAS  KA L+       +    LALL++TT   + PP+  ++SA++S   +  A A  A    + RL+ T N+ VA K LI +H + K     
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP
         +D+        GRN LKL++F D S+ ++ +LS W+RWY QY++ +  + ++LG F     + ++K  + +++S      ++++T+SLV   E I   P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMP

Query:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL--------GWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---E
            + +N++VD+I   V +DY   ++ + +R+     RL        G L   +   L   L RL +CKE  S G+F +   L D  W  +  ++   E
Subjt:  HCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRL--------GWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQ---E

Query:  TKN
         KN
Subjt:  TKN

AT5G57200.1 ENTH/ANTH/VHS superfamily protein2.3e-1830.46Show/hide
Query:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL
        G +KD  +      LAK N       +A+++AT H + +PP ++H+  + S       RA  A  +  L  RL  T N  VA+K LI +H   ++GD   
Subjt:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDLHAPPSDKHLSALLSLGKT--SRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKDGDFIL

Query:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI
        +++L    ++  R+ L++S+F+D ++P++WD S+WVR YA ++E  L   R+L + + + R         K  +T  +SG    DLL++  +L  L+
Subjt:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSR-----SNEEKERKTEQISGILNSDLLKETESLVGLI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein1.7e-3434.31Show/hide
Query:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPSDK+++ L S   T        V+ ++ RL+ T +  VA KCLI +H + K 
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFK-

Query:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL
            +G+  L++ ++     +T G + LKL+D   +S+  + +L+ WV+WY QY++  LSI+ +LG        NE+K  +T+++S      +LK+ + L
Subjt:  ----DGDFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESL

Query:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKE
        V L E IS  P       N++V ++   +  DY SA++ + IR  E + R+     A+  ELV  L++LE+CKE
Subjt:  VGLIEEISKMPHCLHLNRNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGATTTCTCTCAGCCTTTCAATGGTGAACACAAAAAAACTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCCCTTCTGGCAAAGCC
AAACATACTGTCCTTTCAACTCGCTCTCCTCCGAGCCACCACTCACGACCTCCACGCGCCACCCAGCGACAAGCACCTCTCTGCCCTTCTCTCTCTTGGCAAAACCTCAC
GCGCCACCGCCGCTCCTGCCGTTGAAGTCTTAATGGACCGCCTCCAAACCACCCATAACTCCGCCGTCGCTCTCAAGTGCCTTATCGCCGTCCATCACATCTTCAAGGAT
GGCGACTTTATTCTTCAAGACCAGCTCTCTGTTTTTCCCTTCACCGGTGGTAGAAACTACCTCAAACTCTCTGATTTCCGCGACAGTTCCAATCCCATCTCTTGGGACCT
TTCCTCTTGGGTCCGATGGTACGCTCAGTACATCGAAACTGTTTTGTCTATTTCCCGGATTTTGGGGTTTTTTGTGGGTTCTTCAAGGTCCAACGAAGAGAAGGAGAGAA
AAACAGAACAGATTTCAGGGATTTTGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAATTTCGAAAATGCCTCACTGTTTGCATCTGAAT
AGAAACAGATTGGTGGATAAGATCTACAGCTTTGTCGGTGATGATTATTTGTCGGCCATGAAGGAAATTTCAATCCGAGTTACCGAGTTTCACCACCGGCTCGGTTGGCT
CAGTTTCGCCGAATCGGTCGAGTTGGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATCTTTGCAAAGTACGAAGTTTTGATAGATGGAC
TTTGGGGTTCCATCCGTTCCATCCAAGAGACCAAGAATTTGACTGGGGAATCGAAGGAACATCGAGAGGGCGGTAAATTGTGCAAGACGAAGAGGAGGGTCAGCGACTCA
GGCCGGTTTATGGAGCGGCCTAATGCTAGTTCTTATCGTGACCTTCTTAGATTCGGGTCGGAACGGTTCGTTTTAACCTACGACGGTTTCCAGTAA
mRNA sequenceShow/hide mRNA sequence
AAGAAATTGTACTCATTTCCAAATCAAATCAATTTCTCCATAAGTAAAAAAAAATTATTCTCTGTTTGTTAGATGTTGATTTCTCTCAGCCTTTCAATGGTGAACACAAA
AAAACTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCCCTTCTGGCAAAGCCAAACATACTGTCCTTTCAACTCGCTCTCCTCCGAGCCA
CCACTCACGACCTCCACGCGCCACCCAGCGACAAGCACCTCTCTGCCCTTCTCTCTCTTGGCAAAACCTCACGCGCCACCGCCGCTCCTGCCGTTGAAGTCTTAATGGAC
CGCCTCCAAACCACCCATAACTCCGCCGTCGCTCTCAAGTGCCTTATCGCCGTCCATCACATCTTCAAGGATGGCGACTTTATTCTTCAAGACCAGCTCTCTGTTTTTCC
CTTCACCGGTGGTAGAAACTACCTCAAACTCTCTGATTTCCGCGACAGTTCCAATCCCATCTCTTGGGACCTTTCCTCTTGGGTCCGATGGTACGCTCAGTACATCGAAA
CTGTTTTGTCTATTTCCCGGATTTTGGGGTTTTTTGTGGGTTCTTCAAGGTCCAACGAAGAGAAGGAGAGAAAAACAGAACAGATTTCAGGGATTTTGAACTCCGATTTG
CTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAATTTCGAAAATGCCTCACTGTTTGCATCTGAATAGAAACAGATTGGTGGATAAGATCTACAGCTTTGTCGG
TGATGATTATTTGTCGGCCATGAAGGAAATTTCAATCCGAGTTACCGAGTTTCACCACCGGCTCGGTTGGCTCAGTTTCGCCGAATCGGTCGAGTTGGTTTGCGCGTTGA
AACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATCTTTGCAAAGTACGAAGTTTTGATAGATGGACTTTGGGGTTCCATCCGTTCCATCCAAGAGACCAAGAAT
TTGACTGGGGAATCGAAGGAACATCGAGAGGGCGGTAAATTGTGCAAGACGAAGAGGAGGGTCAGCGACTCAGGCCGGTTTATGGAGCGGCCTAATGCTAGTTCTTATCG
TGACCTTCTTAGATTCGGGTCGGAACGGTTCGTTTTAACCTACGACGGTTTCCAGTAATACCTATACCGGAATCGTAGTTACTACTTGCTACCAAAATATAATTATGGAA
AAATGAGGTATGTAGCATCCATTTAAGTTTCATTTTGATATATAGTAACCTTCTGAATTTTAGCCAAATTTATGTATGTCACGTGGACTACCAATTTATCTCTTGGTTGT
CCCTGCAGTGTAATTATTTTAATATTTCCTCTCATGAATGATGATAGTGTAATTAATTACTTGGATATATGTGTTCCCATTGTCACTACAATGTAAATAAAATCAATTTA
TGCAACAATTTTTATAACGTCT
Protein sequenceShow/hide protein sequence
MLISLSLSMVNTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDLHAPPSDKHLSALLSLGKTSRATAAPAVEVLMDRLQTTHNSAVALKCLIAVHHIFKD
GDFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWDLSSWVRWYAQYIETVLSISRILGFFVGSSRSNEEKERKTEQISGILNSDLLKETESLVGLIEEISKMPHCLHLN
RNRLVDKIYSFVGDDYLSAMKEISIRVTEFHHRLGWLSFAESVELVCALKRLEDCKEKQSMGIFAKYEVLIDGLWGSIRSIQETKNLTGESKEHREGGKLCKTKRRVSDS
GRFMERPNASSYRDLLRFGSERFVLTYDGFQ