CuGenDBv2

CcUC10G193600 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC10G193600
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: ENTH domain-containing protein
Genome location: CicolChr10:14430496..14431647
RNA-Seq Expression: CcUC10G193600
Synteny: CcUC10G193600
Gene Ontology terms:
GO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domains:
IPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain
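
The genome location in this record has the form sequence:start..end. A minimal Python sketch for splitting such a string and computing the span it covers follows. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CicolChr10:14430496..14431647")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span is the difference of the two coordinates plus one.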


Homology
GenBank top hits | e value | % identity
KAG6577320.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia] | 3.3e-163 | 85.55
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRLQ+TQNSAVALKCLI++HHI+
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQISGFLNSDLLKETESL+GLI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLI

Query:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGE
        EE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +E+L+  FWGSIRE +NLIGE
Subjt:  EETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGE

Query:  SKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL
        SK+ RE GKL RTKSRMS SGRFM++ NA  YR S+RFGSERFD T  G PVL
Subjt:  SKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL

XP_004137285.1 putative clathrin assembly protein At4g40080 [Cucumis sativus] | 1.1e-166 | 87.71
Query:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLI
        +SLSL+MV TKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPS+KHLS LLSLGKTSRATAA AVEVLMDRLQTT NSAVALKCLI
Subjt:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLI

Query:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES
        +VHHI K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGFFVGSS SNEEKERK EQISG LNSDLLKETES
Subjt:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES

Query:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---
        LVGLIEE SKMPHCLHLN NRLVDKIY+FVGDDYLS MKEISIRVTEFH RLG LSF ESVELVC LKRLEDCKEKQ MGI AKYEVL+D  WGSIR   
Subjt:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---

Query:  ETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF
        ETKNL GESKE+RE GKL +TK R+S SGRFMER NASSYRD LRFGSERF LTY GF
Subjt:  ETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata] | 5.1e-164 | 82.57
Query:  PINFSSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRL
        PI   S+L+C     S  L+MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRL
Subjt:  PINFSSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRL

Query:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ
        Q+TQNSAVALKCLI++HHI+KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQ
Subjt:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ

Query:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY
        ISGF NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +
Subjt:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY

Query:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL
        E+L+  FWGSIRE +NLIGESK+ RE GKL RTKSRMS SGRFM++ NA  YR S+RFGSERFD T  G PVL
Subjt:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo] | 8.7e-164 | 82.04
Query:  PINFSSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRL
        PI   S+L+C     S+ L+MVRTKKLSSLIGLIKDKASQSKAALLAKPNI+SFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRL
Subjt:  PINFSSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRL

Query:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ
        Q+TQNSAVALKCLI++HHI+KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQ
Subjt:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ

Query:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY
        ISGF NSDLLKETESL+GLIEE SK+PHCLHLNGN LVDKIYAFVG+DYLS  KEIS RVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +
Subjt:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY

Query:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL
        E+L+  FWGSIRE +NLIGESK++RE GKL RTKSRMS SGRFM++ NA  YR S+RFGSERFD T  G PVL
Subjt:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida] | 1.6e-178 | 93.45
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        MV TK LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP EKHL VLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLI+VHHIV
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS+SNEEKE+K EQISG LNSDLLKETESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES
        ETSKMPHCLHLNGNRL DKIYAFVGDDYLS MKEISIRVTEFHQRL CLSFGESVELVC LKRLEDCKEKQ  GIS+KYEVLMDEFWGSIRETKNLIGES
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES

Query:  KENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPV
        KEN+E GKLARTKSRMS SGRFMERA A SYRDSLRFGSERFDLT  GFPV
Subjt:  KENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPV

TrEMBL top hits | e value | % identity
A0A0A0KXU4 ENTH domain-containing protein | 5.3e-167 | 87.71
Query:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLI
        +SLSL+MV TKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPS+KHLS LLSLGKTSRATAA AVEVLMDRLQTT NSAVALKCLI
Subjt:  VSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLI

Query:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES
        +VHHI K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGFFVGSS SNEEKERK EQISG LNSDLLKETES
Subjt:  SVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETES

Query:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---
        LVGLIEE SKMPHCLHLN NRLVDKIY+FVGDDYLS MKEISIRVTEFH RLG LSF ESVELVC LKRLEDCKEKQ MGI AKYEVL+D  WGSIR   
Subjt:  LVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIR---

Query:  ETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF
        ETKNL GESKE+RE GKL +TK R+S SGRFMER NASSYRD LRFGSERF LTY GF
Subjt:  ETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF

A0A1S3C1C0 putative clathrin assembly protein At4g40080 | 9.7e-161 | 85.67
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        M+ TK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATAAAAVEVLMDRLQTT NSAVALKCLI+VHHI 
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGF VGSSSSNEE ERK EQISG  NS+LLK+TESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES
        E SKMP CLHLN NRLVDKIY FVGDDYL+ MKEISIRVTEFH RLGCLSFGESVELVC LKRL+D KEKQ +GI A+YEVLMD FW SIRETKNLIG S
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES

Query:  KENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF
        KENR+  KL++ + R+S SGRF+ER+NASSY D L F SERF LTY GF
Subjt:  KENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF

A0A5A7TT50 Putative clathrin assembly protein | 3.3e-161 | 85.67
Query:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV
        M+ TK+LSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATAAAAVEVLMDRLQTT NSAVALKCLI+VHHI 
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIV

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISR LGF VGSSSSNEE ERK EQISG  NS+LLK+TESLVGLIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIE

Query:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES
        E SKMP CLHLN NRLVDKIY FVGDDYL+ MK+ISIRVTEFH RLGCLSFGESVELVC LKRL+DCKEKQ MGI A+YEVLMD FW SIRETKNLIG S
Subjt:  ETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGES

Query:  KENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF
        KENR+  KL++ + R+S SGRF+ER+NASSY D L F SERF LTY GF
Subjt:  KENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGF

A0A6J1EP16 putative clathrin assembly protein At4g40080 | 2.5e-164 | 82.57
Query:  PINFSSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRL
        PI   S+L+C     S  L+MVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP+ K LSVLLSLGKTSRATAAAA+EVLMDRL
Subjt:  PINFSSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRL

Query:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ
        Q+TQNSAVALKCLI++HHI+KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+K EQ
Subjt:  QTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKERKAEQ

Query:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY
        ISGF NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RVTEF QRLGCLSFGESVELVC LKRLEDCKEKQ  GIS  +
Subjt:  ISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKY

Query:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL
        E+L+  FWGSIRE +NLIGESK+ RE GKL RTKSRMS SGRFM++ NA  YR S+RFGSERFD T  G PVL
Subjt:  EVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL

A0A6J1JCT4 putative clathrin assembly protein At4g40080 | 1.5e-161 | 81.17
Query:  PINFS----SSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVL
        PI +S    S+L+C  +  S  L+MV TKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP  K LSVLLS GKTSRATAAAA+EVL
Subjt:  PINFS----SSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVL

Query:  MDRLQTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKER
        MDRLQ+TQNSAVALKCLI++HHIVKNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSSN E+E+
Subjt:  MDRLQTTQNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSNEEKER

Query:  KAEQISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGI
        K EQISGF NSDLLKETESL+GLIEE SKMPHCLHLNGN LVDKIYAFVG+DYLS  KEIS RVTEF  RLGCLSFGESVELVC LKRLEDCKEKQ  GI
Subjt:  KAEQISGFLNSDLLKETESLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGI

Query:  SAKYEVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL
        S  +E+L+  FWGSIRE +NLIGESK+ RE GKL RTKSRMS SGRFM++ NA   R S+RFGSERFD T  G PVL
Subjt:  SAKYEVLMDEFWGSIRETKNLIGESKENREDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL

SwissProt top hits | e value | % identity
Q8H0W9 Putative clathrin assembly protein At5g10410 | 1.4e-31 | 30.42
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG F     + ++K  + +++S +    ++++T+SLV   E     P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK
            +  N++VD+I   V +DY   ++ + +R+    +RL   G    G+    +   +L RL +CKE  + G+  +   L D+FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK

Query:  ENREDGKLA
         N++  +LA
Subjt:  ENREDGKLA

Q8L936 Putative clathrin assembly protein At4g40080 | 4.8e-88 | 51.39
Query:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVH
        M R    + LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+V+LS G  SRATA++AVE +M+RL TT ++ VALK LI +H
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVH

Query:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+GFF+ S+SS   KE   E +S   NSDLL+E ++LVG
Subjt:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG

Query:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL
        L+EE  K+P      G  L DKI   VG+DY+S + E+  R  EF +R   LSFG+++ELVC LKRLE CKE+        ++   +D FWG + E K +
Subjt:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL

Query:  IGESKENR---EDGKLARTKSRMSY-SGRFMERANASSYRDSLRFGSERF-DLTYTGFPV
        IG  ++N    E   +   K    Y S RF +R     Y + +RF S RF ++    FPV
Subjt:  IGESKENR---EDGKLARTKSRMSY-SGRFMERANASSYRDSLRFGSERF-DLTYTGFPV

Q8LBH2 Putative clathrin assembly protein At2g01600 | 2.7e-14 | 29.53
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD
        G +KD  S     +          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   R+L +   +     SN  +++   +       +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI

Q9FKQ2 Putative clathrin assembly protein At5g65370 | 4.0e-34 | 32.56
Query:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK+
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL
          G+  +D L          +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  + +++S +    +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL

Query:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN
        V L E  S  P       N++V ++   +  DY S ++ + IR  E + R+      +  ELV VL++LE+CKE  +   S + + L+ +FW  + + K+
Subjt:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN

Query:  L
        +
Subjt:  L

Q9LVD8 Putative clathrin assembly protein At5g57200 | 1.2e-14 | 29.44
Query:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKT--SRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFIL
        G +KD  +      LAK N       +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  ++ G    
Subjt:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKT--SRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFIL

Query:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGS-----SSSNEEKERKAEQISGFLNSDLLKETESLVGLI
        +++L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   R+L + + +     +S    K  +   +SG    DLL++  +L  L+
Subjt:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGS-----SSSNEEKERKAEQISGFLNSDLLKETESLVGLI

Arabidopsis top hits | e value | % identity
AT2G01600.1 ENTH/ANTH/VHS superfamily protein | 1.9e-15 | 29.53
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD
        G +KD  S     +          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATA--AAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   R+L +   +     SN  +++   +       +LL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS---SSNEEKERKAEQISGFLNSDLLKETESLVGLI

AT4G40080.1 ENTH/ANTH/VHS superfamily protein | 3.4e-89 | 51.39
Query:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVH
        M R    + LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+V+LS G  SRATA++AVE +M+RL TT ++ VALK LI +H
Subjt:  MVRTKKLSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVH

Query:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+GFF+ S+SS   KE   E +S   NSDLL+E ++LVG
Subjt:  HIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVG

Query:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL
        L+EE  K+P      G  L DKI   VG+DY+S + E+  R  EF +R   LSFG+++ELVC LKRLE CKE+        ++   +D FWG + E K +
Subjt:  LIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYE-VLMDEFWGSIRETKNL

Query:  IGESKENR---EDGKLARTKSRMSY-SGRFMERANASSYRDSLRFGSERF-DLTYTGFPV
        IG  ++N    E   +   K    Y S RF +R     Y + +RF S RF ++    FPV
Subjt:  IGESKENR---EDGKLARTKSRMSY-SGRFMERANASSYRDSLRFGSERF-DLTYTGFPV

AT5G10410.1 ENTH/ANTH/VHS superfamily protein | 1.0e-32 | 30.42
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG F     + ++K  + +++S +    ++++T+SLV   E     P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESLVGLIEETSKMP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK
            +  N++VD+I   V +DY   ++ + +R+    +RL   G    G+    +   +L RL +CKE  + G+  +   L D+FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRL---GCLSFGE--SVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESK

Query:  ENREDGKLA
         N++  +LA
Subjt:  ENREDGKLA

AT5G57200.1 ENTH/ANTH/VHS superfamily protein | 8.5e-16 | 29.44
Query:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKT--SRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFIL
        G +KD  +      LAK N       +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  ++ G    
Subjt:  GLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKT--SRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKNGGFIL

Query:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGS-----SSSNEEKERKAEQISGFLNSDLLKETESLVGLI
        +++L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   R+L + + +     +S    K  +   +SG    DLL++  +L  L+
Subjt:  QDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGS-----SSSNEEKERKAEQISGFLNSDLLKETESLVGLI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein | 2.8e-35 | 32.56
Query:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN
        KL++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK+
Subjt:  KLSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTTQNSAVALKCLISVHHIVKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL
          G+  +D L          +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ +LG        NE+K  + +++S +    +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETESL

Query:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN
        V L E  S  P       N++V ++   +  DY S ++ + IR  E + R+      +  ELV VL++LE+CKE  +   S + + L+ +FW  + + K+
Subjt:  VGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKN

Query:  L
        +
Subjt:  L
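
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts these for one block. It assumes the `Query:` prefix has already been stripped and that both strings cover the same columns. The function name `identity_stats` is hypothetical, and the page's reported % identity is presumably computed over the whole alignment rather than over a single block.

```python
def identity_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    `query` and `midline` are the residue columns only (no 'Query:' prefix).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # A letter in the midline marks an identical residue.
    identities = sum(1 for char in midline if char.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the Q8H0W9 alignment shown above.
ident, pos, length = identity_stats("ENREDGKLA", " N++  +LA")
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%), {pos}/{length} positive")
```

Gap columns (`-` in the query line) count toward the aligned length here, so summing the three values over all blocks of a hit gives an identity fraction over the full alignment.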


Sequences
CDS sequence
ATGATACTATCCAATTCAATTCCAATTAATTTCTCTTCTTCTCTCTGGTGTGTCTTTGTAAATGTTTCTCTCAGCCTTGCAATGGTTCGCACAAAAAAGTTGAGTTCCTT
AATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCGCTTCTCGCCAAGCCCAACATTCTCTCCTTTCAACTCGCTCTCCTCCGAGCCACCACTCACGATCCCC
ACGCCCCGCCCAGCGAGAAACACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGTGCGACCGCCGCTGCTGCCGTTGAAGTCCTAATGGACCGCCTCCAAACCACC
CAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCTCCGTTCACCATATCGTCAAGAACGGCGGCTTCATTCTACAAGATCAGCTCTCTGTTTTTCCCTTCACCGGCGGCAG
AAACTACCTTAAACTCTCGGATTTCCGCGACAGTTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGCTCAGTACATCGAAACTGTTTTGTCTATTT
CCCGAATTTTGGGGTTTTTTGTTGGTTCATCAAGCTCGAATGAAGAGAAGGAGAGAAAAGCAGAGCAGATTTCGGGGTTTTTGAACTCCGATTTGCTTAAAGAGACCGAA
TCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAAATCTACGCCTTTGTCGGTGACGATTACTTGTC
AGGTATGAAGGAAATTTCAATCCGAGTTACAGAGTTTCACCAGCGGCTCGGTTGCTTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGTGTTGAAACGGCTCGAGGATT
GCAAAGAAAAGCAAATCATGGGAATTTCTGCAAAGTACGAAGTTTTGATGGATGAATTCTGGGGTTCCATTAGAGAGACCAAGAATTTGATTGGGGAGTCGAAGGAAAAT
CGAGAGGACGGTAAATTGGCCAGGACGAAGAGCAGGATGAGCTACTCGGGCCGGTTTATGGAGCGGGCTAATGCTAGTTCTTATCGCGACTCGCTTCGGTTCGGTTCAGA
GCGGTTCGATTTAACCTACACAGGGTTTCCGGTTCTA
mRNA sequence
ATGATACTATCCAATTCAATTCCAATTAATTTCTCTTCTTCTCTCTGGTGTGTCTTTGTAAATGTTTCTCTCAGCCTTGCAATGGTTCGCACAAAAAAGTTGAGTTCCTT
AATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCGCTTCTCGCCAAGCCCAACATTCTCTCCTTTCAACTCGCTCTCCTCCGAGCCACCACTCACGATCCCC
ACGCCCCGCCCAGCGAGAAACACCTCTCTGTTCTTCTCTCTCTTGGCAAAACCTCTCGTGCGACCGCCGCTGCTGCCGTTGAAGTCCTAATGGACCGCCTCCAAACCACC
CAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCTCCGTTCACCATATCGTCAAGAACGGCGGCTTCATTCTACAAGATCAGCTCTCTGTTTTTCCCTTCACCGGCGGCAG
AAACTACCTTAAACTCTCGGATTTCCGCGACAGTTCCAATCCCATTTCTTGGGAGCTTTCCTCTTGGGTTCGATGGTACGCTCAGTACATCGAAACTGTTTTGTCTATTT
CCCGAATTTTGGGGTTTTTTGTTGGTTCATCAAGCTCGAATGAAGAGAAGGAGAGAAAAGCAGAGCAGATTTCGGGGTTTTTGAACTCCGATTTGCTTAAAGAGACCGAA
TCTTTGGTGGGTTTAATCGAAGAAACTTCGAAAATGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAAATCTACGCCTTTGTCGGTGACGATTACTTGTC
AGGTATGAAGGAAATTTCAATCCGAGTTACAGAGTTTCACCAGCGGCTCGGTTGCTTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGCGTGTTGAAACGGCTCGAGGATT
GCAAAGAAAAGCAAATCATGGGAATTTCTGCAAAGTACGAAGTTTTGATGGATGAATTCTGGGGTTCCATTAGAGAGACCAAGAATTTGATTGGGGAGTCGAAGGAAAAT
CGAGAGGACGGTAAATTGGCCAGGACGAAGAGCAGGATGAGCTACTCGGGCCGGTTTATGGAGCGGGCTAATGCTAGTTCTTATCGCGACTCGCTTCGGTTCGGTTCAGA
GCGGTTCGATTTAACCTACACAGGGTTTCCGGTTCTA
Protein sequence
MILSNSIPINFSSSLWCVFVNVSLSLAMVRTKKLSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSVLLSLGKTSRATAAAAVEVLMDRLQTT
QNSAVALKCLISVHHIVKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSNEEKERKAEQISGFLNSDLLKETE
SLVGLIEETSKMPHCLHLNGNRLVDKIYAFVGDDYLSGMKEISIRVTEFHQRLGCLSFGESVELVCVLKRLEDCKEKQIMGISAKYEVLMDEFWGSIRETKNLIGESKEN
REDGKLARTKSRMSYSGRFMERANASSYRDSLRFGSERFDLTYTGFPVL
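
The protein sequence can be checked against the CDS by translating it codon by codon. The Python sketch below builds the standard genetic code (NCBI translation table 1) from its compact TCAG-ordered string and translates the first ten codons of the CDS listed above. Using the standard table for this nuclear gene is an assumption, and `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS sequence above.
print(translate("ATGATACTATCCAATTCAATTCCAATTAAT"))  # MILSNSIPIN
```

The output matches the first ten residues of the protein sequence. Joining all CDS lines into one string before calling `translate` extends the same check to the full record.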