; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011785 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011785
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionENTH domain-containing protein
Genome locationChr01:11770556..11771644
RNA-Seq ExpressionHG10011785
SyntenyHG10011785
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577320.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]5.6e-16885.95Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHII
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQISG LNSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE
        EE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE

Query:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
        SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY LLK
Subjt:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK

KAG7015410.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-16785.67Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHII
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQISG  NSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE
        EE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE

Query:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
        SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY LLK
Subjt:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata]2.8e-16785.67Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHII
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQISG  NSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE
        EE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE

Query:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
        SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY LLK
Subjt:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]8.1e-16785.12Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MVRTKKLSSLIGLIKDKASQ+KAALL KPNI+SFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHII
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQISG  NSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE
        EE SK+PHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE

Query:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
        SK++RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY LLK
Subjt:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]1.2e-18392.54Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MV TK LSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP EKHL VLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHI+
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVLSISRILG FVGSS+SNEEKEKK EQISGILNSDLLKETESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES
        ETSKMPHCLHLNGNRL DKIYAFVGDDYLS +KEISIRV EFHQRL CLSFGESVELVCALKRLEDCKEKQS GIS+KYEVL+DEFWGSIRETKNLIGES
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES

Query:  KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
        KEN+EGGKLARTKS+MSDSGRFMERA AGSYRDS+RFGSERFDLT KGFPV G  ESYFLLK
Subjt:  KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein5.3e-16486.93Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MV TKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHD HAPPS+KHLS LLSLGKTSRATAA AVEVLMDRLQTT NSAVALKCLIAVHHI 
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+SNPISW+LSSWVRWYAQYIETVLSISRILG FVGSS SNEEKE+K EQISGILNSDLLKETESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIR---ETKNLI
        E SKMPHCLHLN NRLVDKIY+FVGDDYLS +KEISIRV EFH RLG LSF ESVELVCALKRLEDCKEKQSMGI AKYEVL+D  WGSIR   ETKNL 
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIR---ETKNLI

Query:  GESKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF
        GESKE+REGGKL +TK ++SDSGRFMER NA SYRD +RFGSERF LTY GF
Subjt:  GESKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF

A0A1S3C1C0 putative clathrin assembly protein At4g400804.6e-16084.53Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        M+ TK+LSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATAAAAVEVLMDRLQTT NSAVALKCLIAVHHI 
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVLSISRILG  VGSSSSNEE E+K EQISGI NS+LLK+TESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES
        E SKMP CLHLN NRLVDKIY FVGDDYL+ +KEISIRV EFH RLGCLSFGESVELVCALKRL+D KEKQS+GI A+YEVL+D FW SIRETKNLIG S
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES

Query:  KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF
        KENR+G KL++ + ++SDSGRF+ER+NA SY D + F SERF LTYKGF
Subjt:  KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF

A0A5A7TT50 Putative clathrin assembly protein1.6e-16084.53Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        M+ TK+LSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATAAAAVEVLMDRLQTT NSAVALKCLIAVHHI 
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVLSISR LG  VGSSSSNEE E+K EQISGI NS+LLK+TESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES
        E SKMP CLHLN NRLVDKIY FVGDDYL+ +K+ISIRV EFH RLGCLSFGESVELVCALKRL+DCKEKQSMGI A+YEVL+D FW SIRETKNLIG S
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGES

Query:  KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF
        KENR+G KL++ + ++SDSGRF+ER+NA SY D + F SERF LTYKGF
Subjt:  KENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGF

A0A6J1EP16 putative clathrin assembly protein At4g400801.3e-16785.67Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MVRTKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHII
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQISG  NSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE
        EE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV EF QRLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE

Query:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
        SK+ RE GKL RTKS+MSDSGRFM++ NA  YR S+RFGSERFD T KG PVLGITESY LLK
Subjt:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK

A0A6J1JCT4 putative clathrin assembly protein At4g400802.4e-16484.3Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII
        MV TKKLSSLIGLIKDKASQ+KAALL KPNILSFQLALLRATTHDPHAPP  K LSVLLS GKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLIA+HHI+
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHII

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRD+SNPISWELSSWVRWYAQYIETVL ISRILG  FVGSSSSN E+EKK EQISG  NSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILG-AFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE
        EE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RV EF  RLGCLSFGESVELVCALKRLEDCKEKQS GIS  +E+LL  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGE

Query:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK
        SK+ RE GKL RTKS+MSDSGRFM++ NA   R S+RFGSERFD T KG PVLGITESY LLK
Subjt:  SKENREGGKLARTKSKMSDSGRFMERANAGSYRDSIRFGSERFDLTYKGFPVLGITESYFLLK

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104104.6e-3231.07Show/hide
Query:  LIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H +IK+    
Subjt:  LIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIEETSKMP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG+F     + ++K ++ +++S      ++++T+SLV   E     P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIEETSKMP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRL---GCLSFGE--SVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESK
            +  N++VD+I   V +DY  +++ + +R+    +RL   G    G+    +    L RL +CKE  S G+  +   L D+FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRL---GCLSFGE--SVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESK

Query:  ENREGGKLA
         N++  +LA
Subjt:  ENREGGKLA

Q8L936 Putative clathrin assembly protein At4g400802.8e-9051.37Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVH
        M R    + LIG IKDKASQ+KAAL+   TK   LSF L++LRATTHDP  PP  +HL+V+LS G  SRATA++AVE +M+RL TT ++ VALK LI +H
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVH

Query:  HIIKNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVG
        HI+K+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+G F+ S+SS   KE+  E +S + NSDLL+E ++LVG
Subjt:  HIIKNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVG

Query:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYE-VLLDEFWGSIRETKNL
        L+EE  K+P      G  L DKI   VG+DY+S I E+  R  EF +R   LSFG+++ELVCALKRLE CKE+ S      ++   +D FWG + E K +
Subjt:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYE-VLLDEFWGSIRETKNL

Query:  IGESKENREGGKLART------KSKMSDSGRFMERANAGSYRDSIRFGSERF-DLTYKGFPVLG
        IG  ++N   G++ ++      + K  +S RF +R   G Y + +RF S RF ++    FPV G
Subjt:  IGESKENREGGKLART------KSKMSDSGRFMERANAGSYRDSIRFGSERF-DLTYKGFPVLG

Q8LBH2 Putative clathrin assembly protein At2g016005.1e-1530.93Show/hide
Query:  GLIKDKASQTKAALL-TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ
        G +KD    TK  L+          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    +
Subjt:  GLIKDKASQTKAALL-TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ

Query:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSS---SSNEEKEKKAEQISGILNSDLLKETESLVGLI
        ++L  F   G    L+LS+F+D+S+PI+W+ S+WVR YA ++E  L   R+L     +     SN  ++K   +   +   +LL++  +L  L+
Subjt:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSS---SSNEEKEKKAEQISGILNSDLLKETESLVGLI

Q8LF20 Putative clathrin assembly protein At2g254302.5e-1434.9Show/hide
Query:  IGLIKDKAS--QTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ
        IG +KD+ S    K A    P++   ++A+++AT+HD   P SEK++  +L+L   SR    A V  +  RL  T++  VALK L+ VH ++  G  I Q
Subjt:  IGLIKDKAS--QTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ

Query:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSIS
        +++ ++    G   L +SDFRD ++  SW+ S++VR YA Y++  L ++
Subjt:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSIS

Q9FKQ2 Putative clathrin assembly protein At5g653706.4e-3432.89Show/hide
Query:  KLSSLIGLIKDKASQTK---AALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKN
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H ++K+
Subjt:  KLSSLIGLIKDKASQTK---AALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESL
          G+  +D L          +T G + LKL+D   NS+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  + +++S      +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESL

Query:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKN
        V L E  S  P       N++V ++   +  DY S I+ + IR  E + R+      +  ELV  L++LE+CKE  S   S + + L+ +FW  + + K+
Subjt:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKN

Query:  L
        +
Subjt:  L

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein3.6e-1630.93Show/hide
Query:  GLIKDKASQTKAALL-TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ
        G +KD    TK  L+          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    +
Subjt:  GLIKDKASQTKAALL-TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ

Query:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSS---SSNEEKEKKAEQISGILNSDLLKETESLVGLI
        ++L  F   G    L+LS+F+D+S+PI+W+ S+WVR YA ++E  L   R+L     +     SN  ++K   +   +   +LL++  +L  L+
Subjt:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSS---SSNEEKEKKAEQISGILNSDLLKETESLVGLI

AT2G25430.1 epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly protein-related1.8e-1534.9Show/hide
Query:  IGLIKDKAS--QTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ
        IG +KD+ S    K A    P++   ++A+++AT+HD   P SEK++  +L+L   SR    A V  +  RL  T++  VALK L+ VH ++  G  I Q
Subjt:  IGLIKDKAS--QTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQ

Query:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSIS
        +++ ++    G   L +SDFRD ++  SW+ S++VR YA Y++  L ++
Subjt:  DQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSIS

AT4G40080.1 ENTH/ANTH/VHS superfamily protein2.0e-9151.37Show/hide
Query:  MVRTKKLSSLIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVH
        M R    + LIG IKDKASQ+KAAL+   TK   LSF L++LRATTHDP  PP  +HL+V+LS G  SRATA++AVE +M+RL TT ++ VALK LI +H
Subjt:  MVRTKKLSSLIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVH

Query:  HIIKNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVG
        HI+K+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+G F+ S+SS   KE+  E +S + NSDLL+E ++LVG
Subjt:  HIIKNGGFILQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVG

Query:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYE-VLLDEFWGSIRETKNL
        L+EE  K+P      G  L DKI   VG+DY+S I E+  R  EF +R   LSFG+++ELVCALKRLE CKE+ S      ++   +D FWG + E K +
Subjt:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYE-VLLDEFWGSIRETKNL

Query:  IGESKENREGGKLART------KSKMSDSGRFMERANAGSYRDSIRFGSERF-DLTYKGFPVLG
        IG  ++N   G++ ++      + K  +S RF +R   G Y + +RF S RF ++    FPV G
Subjt:  IGESKENREGGKLART------KSKMSDSGRFMERANAGSYRDSIRFGSERF-DLTYKGFPVLG

AT5G10410.1 ENTH/ANTH/VHS superfamily protein3.3e-3331.07Show/hide
Query:  LIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H +IK+    
Subjt:  LIGLIKDKASQTKAALL---TKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIEETSKMP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG+F     + ++K ++ +++S      ++++T+SLV   E     P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIEETSKMP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRL---GCLSFGE--SVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESK
            +  N++VD+I   V +DY  +++ + +R+    +RL   G    G+    +    L RL +CKE  S G+  +   L D+FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRL---GCLSFGE--SVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESK

Query:  ENREGGKLA
         N++  +LA
Subjt:  ENREGGKLA

AT5G65370.1 ENTH/ANTH/VHS superfamily protein4.6e-3532.89Show/hide
Query:  KLSSLIGLIKDKASQTK---AALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKN
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H ++K+
Subjt:  KLSSLIGLIKDKASQTK---AALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESL
          G+  +D L          +T G + LKL+D   NS+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  + +++S      +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESL

Query:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKN
        V L E  S  P       N++V ++   +  DY S I+ + IR  E + R+      +  ELV  L++LE+CKE  S   S + + L+ +FW  + + K+
Subjt:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKN

Query:  L
        +
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCGCACAAAAAAGTTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAACCAAAGCCGCGCTTCTCACTAAGCCCAACATTCTCTCCTTTCAACTCGC
CCTCCTCCGAGCCACCACTCACGATCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGCGCCACCGCCGCTGCTGCCGTTG
AAGTCTTAATGGACCGCCTTCAAACCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCGCTGTCCACCACATCATCAAGAACGGCGGCTTCATTCTACAAGACCAG
CTCTCTGTTTTTCCCTTCACCGGCGGCAGAAACTACCTTAAACTCTCCGATTTCCGCGACAATTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGC
TCAGTACATCGAAACTGTCTTGTCTATTTCCCGAATTTTGGGGGCTTTTGTTGGTTCTTCTAGCTCGAATGAAGAGAAGGAGAAAAAAGCAGAGCAGATTTCGGGGATTT
TGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATC
TACGCCTTTGTCGGTGACGATTACTTGTCGGTTATTAAGGAAATTTCAATCCGAGTTGCAGAGTTTCACCAGCGGCTCGGTTGCCTGAGTTTCGGCGAATCGGTCGAGTT
GGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATTTCTGCAAAGTACGAAGTTTTGTTGGATGAATTTTGGGGTTCCATTAGAGAGACCA
AGAATTTGATTGGGGAATCCAAGGAAAATCGAGAGGGCGGTAAATTGGCCAGGACGAAGAGCAAAATGAGCGACTCGGGCCGGTTTATGGAGCGGGCTAATGCTGGTTCT
TATCGCGACTCAATTCGGTTCGGTTCTGAGCGATTCGATTTAACCTACAAAGGGTTTCCAGTCCTAGGTATAACGGAATCGTACTTTCTGCTAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCGCACAAAAAAGTTGAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCTCAAACCAAAGCCGCGCTTCTCACTAAGCCCAACATTCTCTCCTTTCAACTCGC
CCTCCTCCGAGCCACCACTCACGATCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGCGCCACCGCCGCTGCTGCCGTTG
AAGTCTTAATGGACCGCCTTCAAACCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCGCTGTCCACCACATCATCAAGAACGGCGGCTTCATTCTACAAGACCAG
CTCTCTGTTTTTCCCTTCACCGGCGGCAGAAACTACCTTAAACTCTCCGATTTCCGCGACAATTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGC
TCAGTACATCGAAACTGTCTTGTCTATTTCCCGAATTTTGGGGGCTTTTGTTGGTTCTTCTAGCTCGAATGAAGAGAAGGAGAAAAAAGCAGAGCAGATTTCGGGGATTT
TGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATC
TACGCCTTTGTCGGTGACGATTACTTGTCGGTTATTAAGGAAATTTCAATCCGAGTTGCAGAGTTTCACCAGCGGCTCGGTTGCCTGAGTTTCGGCGAATCGGTCGAGTT
GGTTTGCGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATCCATGGGAATTTCTGCAAAGTACGAAGTTTTGTTGGATGAATTTTGGGGTTCCATTAGAGAGACCA
AGAATTTGATTGGGGAATCCAAGGAAAATCGAGAGGGCGGTAAATTGGCCAGGACGAAGAGCAAAATGAGCGACTCGGGCCGGTTTATGGAGCGGGCTAATGCTGGTTCT
TATCGCGACTCAATTCGGTTCGGTTCTGAGCGATTCGATTTAACCTACAAAGGGTTTCCAGTCCTAGGTATAACGGAATCGTACTTTCTGCTAAAATGA
Protein sequenceShow/hide protein sequence
MVRTKKLSSLIGLIKDKASQTKAALLTKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLIAVHHIIKNGGFILQDQ
LSVFPFTGGRNYLKLSDFRDNSNPISWELSSWVRWYAQYIETVLSISRILGAFVGSSSSNEEKEKKAEQISGILNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKI
YAFVGDDYLSVIKEISIRVAEFHQRLGCLSFGESVELVCALKRLEDCKEKQSMGISAKYEVLLDEFWGSIRETKNLIGESKENREGGKLARTKSKMSDSGRFMERANAGS
YRDSIRFGSERFDLTYKGFPVLGITESYFLLK