; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G191830 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G191830
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionENTH domain-containing protein
Genome locationCla97Chr10:16517916..16523272
RNA-Seq ExpressionCla97C10G191830
SyntenyCla97C10G191830
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577320.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]1.9e-16585.59Show/hide
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        MVRTKKLSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATA AA+EVLMDRLQ+TQNSAVALKCLI++HHI+
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQISGFLNSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGE
        EE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEISTRVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +E+L+  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGE

Query:  SKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG
        SK+ RE GKL RTKSRMSDSGRFM++ NA  YR S+RFGS+RFD T KG PVLG
Subjt:  SKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG

XP_004137285.1 putative clathrin assembly protein At4g40080 [Cucumis sativus]2.2e-16686.87Show/hide
Query:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLI
        +SLSL+MV TKKLSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHD HAPPS+KHLS LLSLGKTSRATA  AVEVLMDRLQTT NSAVALKCLI
Subjt:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLI

Query:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES
        +VHHI K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGFFVGSS SNEEKERK EQISG LNSDLLKETES
Subjt:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES

Query:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---
        LVGLIEE SKMPHCLHLN NRLVDKIY+FVGDDYLS MKEIS RVTEFH RLG LSF ESVELVC LKRLEDCKEKQ MGI AKYEVL+D  WGSIR   
Subjt:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---

Query:  ETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF
        ETKNL GESKE+RE GKL +TK R+SDSGRFMER NASSYRD LRFGS+RF LTY GF
Subjt:  ETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata]4.9e-16682.35Show/hide
Query:  PINFTSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRL
        PI   S+L+C     S  L+MVRTKKLSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATA AA+EVLMDRL
Subjt:  PINFTSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRL

Query:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ
        Q+TQNSAVALKCLI++HHI+KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQ
Subjt:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ

Query:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY
        ISGF NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +
Subjt:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY

Query:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG
        E+L+  FWGSIRE +NLIGESK+ RE GKL RTKSRMSDSGRFM++ NA  YR S+RFGS+RFD T KG PVLG
Subjt:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]3.8e-16682.09Show/hide
Query:  PINFTSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRL
        PI   S+L+C     S+ L+MVRTKKLSSLIGLIKDKASQSKAALLAKPN++SFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATA AA+EVLMDRL
Subjt:  PINFTSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRL

Query:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ
        Q+TQNSAVALKCLI++HHI+KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQ
Subjt:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ

Query:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY
        ISGF NSDLLKETESL+GLIEE SK+PHCLHLNGN LVDKIYAFVG+DYLS  KEISTRVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +
Subjt:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY

Query:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG
        E+L+  FWGSIRE +NLIGESK++RE GKL RTKSRMSDSGRFM++ NA  YR S+RFGS+RFD T KG PVLG
Subjt:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]2.7e-18092.68Show/hide
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        MV TK LSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHDPHAPP EKHL VLLSLGKTSRATA AAVEVLMDRLQTTQNSAVALKCLI+VHHIV
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS+SNEEKE+K EQISG LNSDLLKETESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES
        ETSKMPHCLHLNGNRL DKIYAFVGDDYLS MKEIS RVTEFHQRL CLSFGESVELVC LKRLEDCKEKQ  GIS+KYEVLMDEFWGSIRETKNLIGES
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES

Query:  KENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLGTR
        KEN+E GKLARTKSRMSDSGRFMERA A SYRDSLRFGS+RFDLT KGFPV GTR
Subjt:  KENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLGTR

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein1.1e-16686.87Show/hide
Query:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLI
        +SLSL+MV TKKLSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHD HAPPS+KHLS LLSLGKTSRATA  AVEVLMDRLQTT NSAVALKCLI
Subjt:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLI

Query:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES
        +VHHI K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGFFVGSS SNEEKERK EQISG LNSDLLKETES
Subjt:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES

Query:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---
        LVGLIEE SKMPHCLHLN NRLVDKIY+FVGDDYLS MKEIS RVTEFH RLG LSF ESVELVC LKRLEDCKEKQ MGI AKYEVL+D  WGSIR   
Subjt:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---

Query:  ETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF
        ETKNL GESKE+RE GKL +TK R+SDSGRFMER NASSYRD LRFGS+RF LTY GF
Subjt:  ETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF

A0A1S3C1C0 putative clathrin assembly protein At4g400803.9e-16185.1Show/hide
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        M+ TK+LSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATA AAVEVLMDRLQTT NSAVALKCLI+VHHI 
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGF VGSSSSNEE ERK EQISG  NS+LLK+TESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES
        E SKMP CLHLN NRLVDKIY FVGDDYL+ MKEIS RVTEFH RLGCLSFGESVELVC LKRL+D KEKQ +GI A+YEVLMD FW SIRETKNLIG S
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES

Query:  KENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF
        KENR+  KL++ + R+SDSGRF+ER+NASSY D L F S+RF LTYKGF
Subjt:  KENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF

A0A5A7TT50 Putative clathrin assembly protein1.3e-16185.1Show/hide
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        M+ TK+LSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATA AAVEVLMDRLQTT NSAVALKCLI+VHHI 
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISR LGF VGSSSSNEE ERK EQISG  NS+LLK+TESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES
        E SKMP CLHLN NRLVDKIY FVGDDYL+ MK+IS RVTEFH RLGCLSFGESVELVC LKRL+DCKEKQ MGI A+YEVLMD FW SIRETKNLIG S
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES

Query:  KENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF
        KENR+  KL++ + R+SDSGRF+ER+NASSY D L F S+RF LTYKGF
Subjt:  KENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGF

A0A6J1EP16 putative clathrin assembly protein At4g400802.4e-16682.35Show/hide
Query:  PINFTSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRL
        PI   S+L+C     S  L+MVRTKKLSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATA AA+EVLMDRL
Subjt:  PINFTSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRL

Query:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ
        Q+TQNSAVALKCLI++HHI+KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQ
Subjt:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ

Query:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY
        ISGF NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +
Subjt:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY

Query:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG
        E+L+  FWGSIRE +NLIGESK+ RE GKL RTKSRMSDSGRFM++ NA  YR S+RFGS+RFD T KG PVLG
Subjt:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG

A0A6J1JCT4 putative clathrin assembly protein At4g400806.5e-16482.38Show/hide
Query:  SSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQN
        S+L+C  +  S  L+MV TKKLSSLIGLIKDKASQSKAALLAKPN+LSFQLALLRATTHDPHAPP  K LSVLLS GKTSRATA AA+EVLMDRLQ+TQN
Subjt:  SSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQN

Query:  SAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQISGFL
        SAVALKCLI++HHIVKNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQISGF 
Subjt:  SAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQISGFL

Query:  NSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMD
        NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEISTRVTEF  RLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +E+L+ 
Subjt:  NSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMD

Query:  EFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG
         FWGSIRE +NLIGESK+ RE GKL RTKSRMSDSGRFM++ NA   R S+RFGS+RFD T KG PVLG
Subjt:  EFWGSIRETKNLIGESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLG

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026503.1e-1434.23Show/hide
Query:  TKKLSSLIGLIKDKASQSKAALLAKPNVLS-FQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN
        + KL   IG +KD+ S   A +  + + L+  ++A+++AT HD + P  +K++  +L L   SR    A V  L  RL  T+N +VALK LI +  ++ +
Subjt:  TKKLSSLIGLIKDKASQSKAALLAKPNVLS-FQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN

Query:  GGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIE
        G    + ++  F    G   L +SDFRD+S   SW+ S++VR YA Y++
Subjt:  GGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIE

Q8H0W9 Putative clathrin assembly protein At5g104101.6e-3130.74Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI
        +IG  KDKAS  KA L+       V    LALL++TT  P+ PP+  ++S ++S   +  A   AA    + RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG F     + ++K  + +++S +    ++++T+SLV   E     P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK
            +  N++VD+I   V +DY   ++ +  R+    +RL   G    G+    +   +L RL +CKE  + G+  +   L D+FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK

Query:  ENREDGKLA
         N++  +LA
Subjt:  ENREDGKLA

Q8L936 Putative clathrin assembly protein At4g400801.0e-8950.68Show/hide
Query:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVH
        M R    + LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+V+LS G  SRATA +AVE +M+RL TT ++ VALK LI +H
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVH

Query:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+GFF+ S+SS   KE   E +S   NSDLL+E ++LVG
Subjt:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG

Query:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL
        L+EE  K+P      G  L DKI   VG+DY+S + E+ TR  EF +R   LSFG+++ELVC LKRLE CKE+        ++   +D FWG + E K +
Subjt:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL

Query:  IGESKENREDGKLART------KSRMSDSGRFMERANASSYRDSLRFGSQRF-DLTYKGFPVLGTRATC
        IG  ++N   G++ ++      + +  +S RF +R     Y + +RF S RF ++    FPV G R  C
Subjt:  IGESKENREDGKLART------KSRMSDSGRFMERANASSYRDSLRFGSQRF-DLTYKGFPVLGTRATC

Q8LBH2 Putative clathrin assembly protein At2g016001.4e-1429.53Show/hide
Query:  GLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAA--VEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD
        G +KD  S     +          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    ++
Subjt:  GLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAA--VEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   R+L +   +     SN  +++   +       +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI

Q9FKQ2 Putative clathrin assembly protein At5g653707.9e-3432.23Show/hide
Query:  KLSSLIGLIKDKASQSK---AALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK+
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL
          G+  +D L          +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  + +++S +    +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL

Query:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN
        V L E  S  P       N++V ++   +  DY S ++ +  R  E + R+      +  ELV VL++LE+CKE  +   S + + L+ +FW  + + K+
Subjt:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN

Query:  L
        +
Subjt:  L

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein9.9e-1629.53Show/hide
Query:  GLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAA--VEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD
        G +KD  S     +          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    ++
Subjt:  GLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAA--VEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   R+L +   +     SN  +++   +       +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI

AT4G02650.1 ENTH/ANTH/VHS superfamily protein2.2e-1534.23Show/hide
Query:  TKKLSSLIGLIKDKASQSKAALLAKPNVLS-FQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN
        + KL   IG +KD+ S   A +  + + L+  ++A+++AT HD + P  +K++  +L L   SR    A V  L  RL  T+N +VALK LI +  ++ +
Subjt:  TKKLSSLIGLIKDKASQSKAALLAKPNVLS-FQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN

Query:  GGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIE
        G    + ++  F    G   L +SDFRD+S   SW+ S++VR YA Y++
Subjt:  GGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIE

AT4G40080.1 ENTH/ANTH/VHS superfamily protein7.2e-9150.68Show/hide
Query:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVH
        M R    + LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+V+LS G  SRATA +AVE +M+RL TT ++ VALK LI +H
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVH

Query:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+GFF+ S+SS   KE   E +S   NSDLL+E ++LVG
Subjt:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG

Query:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL
        L+EE  K+P      G  L DKI   VG+DY+S + E+ TR  EF +R   LSFG+++ELVC LKRLE CKE+        ++   +D FWG + E K +
Subjt:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL

Query:  IGESKENREDGKLART------KSRMSDSGRFMERANASSYRDSLRFGSQRF-DLTYKGFPVLGTRATC
        IG  ++N   G++ ++      + +  +S RF +R     Y + +RF S RF ++    FPV G R  C
Subjt:  IGESKENREDGKLART------KSRMSDSGRFMERANASSYRDSLRFGSQRF-DLTYKGFPVLGTRATC

AT5G10410.1 ENTH/ANTH/VHS superfamily protein1.2e-3230.74Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI
        +IG  KDKAS  KA L+       V    LALL++TT  P+ PP+  ++S ++S   +  A   AA    + RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG F     + ++K  + +++S +    ++++T+SLV   E     P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK
            +  N++VD+I   V +DY   ++ +  R+    +RL   G    G+    +   +L RL +CKE  + G+  +   L D+FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK

Query:  ENREDGKLA
         N++  +LA
Subjt:  ENREDGKLA

AT5G65370.1 ENTH/ANTH/VHS superfamily protein5.6e-3532.23Show/hide
Query:  KLSSLIGLIKDKASQSK---AALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK+
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL
          G+  +D L          +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  + +++S +    +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL

Query:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN
        V L E  S  P       N++V ++   +  DY S ++ +  R  E + R+      +  ELV VL++LE+CKE  +   S + + L+ +FW  + + K+
Subjt:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN

Query:  L
        +
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGTAATAAAAGAAAAGCTATCCAATTCAATTCCAATTAATTTCACTTCTTCTCTCTGGTGTGTCTTTGTAAATGTTTCTCTCAGCCTTGCAATGGTTCGCACAAA
AAAATTGAGTTCCTTAATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCGCTTCTCGCCAAGCCCAACGTTCTCTCCTTTCAACTCGCTCTCCTCCGAGCCA
CCACTCACGACCCCCACGCCCCGCCCAGCGAGAAACACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGTGCGACCGCCGGTGCCGCCGTTGAAGTCCTAATGGAC
CGCCTCCAAACCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCTCCGTTCACCATATCGTCAAGAACGGCGGCTTCATTCTGCAAGACCAGCTCTCTGTTTTTCC
CTTCACCGGCGGCAGAAACTACCTTAAACTCTCGGATTTCCGCGACAGTTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGCTCAGTACATCGAAA
CTGTTTTGTCTATTTCCCGAATTTTGGGGTTTTTTGTTGGTTCATCAAGCTCGAATGAAGAGAAGGAGAGAAAAGCAGAGCAGATTTCGGGGTTTTTGAACTCCGATTTG
CTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATCTACGCCTTTGTCGG
TGACGATTACTTGTCAGGTATGAAGGAAATTTCAACCCGAGTTACAGAGTTTCACCAGCGGCTCGGTTGCTTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGTGTTGA
AACGGCTCGAGGATTGCAAAGAAAAGCAAATCATGGGAATTTCTGCAAAGTACGAAGTTTTGATGGATGAATTCTGGGGATCCATTAGAGAGACCAAGAATTTGATTGGG
GAGTCGAAGGAAAATCGAGAGGACGGTAAATTGGCCAGGACGAAGAGCAGGATGAGCGACTCGGGCCGGTTTATGGAGCGGGCTAATGCTAGTTCTTATCGCGACTCGCT
TCGGTTCGGTTCGCAGCGGTTCGATTTAACCTACAAAGGGTTTCCGGTTCTAGGAACGCGAGCAACTTGTGGATGCATTGACAACCATTTAACACAGGTTGATGTGTTAG
CAACCCATAGATGTGTTGACATACTTGAACACAGGAACCCAAAGTTATCTATGGACGCATATCAACCAGAAACGCAAGGCAGCAGAGAATCAGATAAGAATCTTGAGAGA
TTGTCGGGAGATTGA
mRNA sequenceShow/hide mRNA sequence
CCAAAAACGACTTTAGAACGCCAAATATTAAAATACCTCCAATTGGAAAAAATAATAATGATAGTAATAAAAGAAAAGCTATCCAATTCAATTCCAATTAATTTCACTTC
TTCTCTCTGGTGTGTCTTTGTAAATGTTTCTCTCAGCCTTGCAATGGTTCGCACAAAAAAATTGAGTTCCTTAATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAG
CCGCGCTTCTCGCCAAGCCCAACGTTCTCTCCTTTCAACTCGCTCTCCTCCGAGCCACCACTCACGACCCCCACGCCCCGCCCAGCGAGAAACACCTCTCTGTTCTTCTC
TCTCTTGGCAAAACCTCTCGTGCGACCGCCGGTGCCGCCGTTGAAGTCCTAATGGACCGCCTCCAAACCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCTCCGT
TCACCATATCGTCAAGAACGGCGGCTTCATTCTGCAAGACCAGCTCTCTGTTTTTCCCTTCACCGGCGGCAGAAACTACCTTAAACTCTCGGATTTCCGCGACAGTTCCA
ATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGCTCAGTACATCGAAACTGTTTTGTCTATTTCCCGAATTTTGGGGTTTTTTGTTGGTTCATCAAGCTCG
AATGAAGAGAAGGAGAGAAAAGCAGAGCAGATTTCGGGGTTTTTGAACTCCGATTTGCTTAAAGAGACCGAATCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCC
TCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATCTACGCCTTTGTCGGTGACGATTACTTGTCAGGTATGAAGGAAATTTCAACCCGAGTTACAGAGTTTC
ACCAGCGGCTCGGTTGCTTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGTGTTGAAACGGCTCGAGGATTGCAAAGAAAAGCAAATCATGGGAATTTCTGCAAAGTAC
GAAGTTTTGATGGATGAATTCTGGGGATCCATTAGAGAGACCAAGAATTTGATTGGGGAGTCGAAGGAAAATCGAGAGGACGGTAAATTGGCCAGGACGAAGAGCAGGAT
GAGCGACTCGGGCCGGTTTATGGAGCGGGCTAATGCTAGTTCTTATCGCGACTCGCTTCGGTTCGGTTCGCAGCGGTTCGATTTAACCTACAAAGGGTTTCCGGTTCTAG
GAACGCGAGCAACTTGTGGATGCATTGACAACCATTTAACACAGGTTGATGTGTTAGCAACCCATAGATGTGTTGACATACTTGAACACAGGAACCCAAAGTTATCTATG
GACGCATATCAACCAGAAACGCAAGGCAGCAGAGAATCAGATAAGAATCTTGAGAGATTGTCGGGAGATTGATATGCTTGATATGCGGCGAGAGTGGTGGATCTAAATTA
AACCGGAAATCACGCGATTAGGAGAGATGGACGCAAGGCAGAAATTCAGCCTCCAATCATGCAGATTTGCTAGTGATTGCAACGCAACATCTATAAAAGGGTAGACCTGA
AGACAAAAATGTTGTTGAACCTCTCGGAAGAATTTCTAAGTGACAACAGAGTTCTCTCTAAACGACAACCAGAGACAAATAGCTCGAAAGAGAAGAGTCTTCCCTCCGCC
TGACCACTTTAC
Protein sequenceShow/hide protein sequence
MIVIKEKLSNSIPINFTSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNVLSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAGAAVEVLMD
RLQTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDL
LKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISTRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIG
ESKENREDGKLARTKSRMSDSGRFMERANASSYRDSLRFGSQRFDLTYKGFPVLGTRATCGCIDNHLTQVDVLATHRCVDILEHRNPKLSMDAYQPETQGSRESDKNLER
LSGD