; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018586 (gene) of Snake gourd v1 genome

Gene IDTan0018586
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionENTH domain-containing protein
Genome locationLG01:14443529..14446673
RNA-Seq ExpressionTan0018586
SyntenyTan0018586
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577320.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]2.7e-15783.57Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MVRT+K+SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHDPHAPP+ K LS LLSLGK SRATAA+A+EVLMDRLQSTQNSAVALK LIA+HHI+
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSS  ERE+K EQIS  LNSDLLKETESL+ LI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI

Query:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE
        EE +K PHCLHLNGN LVDKIYAFVG+DYLSA KEIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFWGSIRE +NLIGE
Subjt:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE

Query:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL
        SK+ R+ GKL RTKSRMSDSGRF+ + N  LYR SVRFGS RFDF+CK IPVL
Subjt:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL

KAG7015410.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-15783.29Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MVRT+K+SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHDPHAPP+ K LS LLSLGK SRATAA+A+EVLMDRLQSTQNSAVALK LIA+HHI+
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSS  ERE+K EQIS   NSDLLKETESL+ LI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI

Query:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE
        EE +K PHCLHLNGN LVDKIYAFVG+DYLSA KEIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFWGSIRE +NLIGE
Subjt:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE

Query:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL
        SK+ R+ GKL RTKSRMSDSGRF+ + N  LYR SVRFGS RFDF+CK IPVL
Subjt:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL

XP_022136401.1 putative clathrin assembly protein At4g40080 [Momordica charantia]7.2e-15883.24Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MVRT+K+S LIGLIKDKASQSKAAL++KPNILSFQLALLRATTHDP+APP +KHL+ LLSLGK SRATAA+A+EVLMDRLQST NSAVALK L+AVHHIL
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE
        K+GGFILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVLS SRILGFFV SSSSIEERE+K EQISA+ NSDLL++TESLV LIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE

Query:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES
        ETTKKPH LHLN N+LVD+I  FV DDYLSA+KEIS+RVTEFH+RLSCLSFGESVELVC LKRLEDCKEKQ +GISAKYEILMDGFWG I ETKNLIGE+
Subjt:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES

Query:  KENRD----GGKLARTKSRMSDSGRFLVRPNLYRDSVRFGSARFDF
        KENRD    GGKL  T +RMSDSGRF+ R N+YRDS+RFGS RFDF
Subjt:  KENRD----GGKLARTKSRMSDSGRFLVRPNLYRDSVRFGSARFDF

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata]1.4e-15683.29Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MVRT+K+SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHDPHAPP+ K LS LLSLGK SRATAA+A+EVLMDRLQSTQNSAVALK LIA+HHI+
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSS  ERE+K EQIS   NSDLLKETESL+ LI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI

Query:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE
        EE +K PHCLHLNGN LVDKIYAFVG+DYLSA KEIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFWGSIRE +NLIGE
Subjt:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE

Query:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL
        SK+ R+ GKL RTKSRMSDSGRF+ + N  LYR SVRFGS RFDF+CK IPVL
Subjt:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]8.5e-16787.18Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MV T+ +SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHDPHAPP EKHL  LLSLGK SRATAA+AVEVLMDRLQ+TQNSAVALK LIAVHHI+
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSS+S EE+E+K EQIS ILNSDLLKETESLV LIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE

Query:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES
        ET+K PHCLHLNGNRL DKIYAFVGDDYLSA+KEISIRVTEFH+RLSCLSFGESVELVCALKRLEDCKEKQ  GIS+KYE+LMD FWGSIRETKNLIGES
Subjt:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES

Query:  KENRDGGKLARTKSRMSDSGRFLVR--PNLYRDSVRFGSARFDFSCKEIPV
        KEN++GGKLARTKSRMSDSGRF+ R     YRDS+RFGS RFD +CK  PV
Subjt:  KENRDGGKLARTKSRMSDSGRFLVR--PNLYRDSVRFGSARFDFSCKEIPV

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein5.2e-15484.06Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MV T+K+SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHD HAPPS+KHLS LLSLGK SRATAA AVEVLMDRLQ+T NSAVALK LIAVHHI 
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIETVLSISRILGFFVGSS S EE+ERK EQIS ILNSDLLKETESLV LIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE

Query:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIR---ETKNLI
        E +K PHCLHLN NRLVDKIY+FVGDDYLSA+KEISIRVTEFH RL  LSF ESVELVCALKRLEDCKEKQ MGI AKYE+L+DG WGSIR   ETKNL 
Subjt:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIR---ETKNLI

Query:  GESKENRDGGKLARTKSRMSDSGRFLVRPNL--YRDSVRFGSARF
        GESKE+R+GGKL +TK R+SDSGRF+ RPN   YRD +RFGS RF
Subjt:  GESKENRDGGKLARTKSRMSDSGRFLVRPNL--YRDSVRFGSARF

A0A5A7TT50 Putative clathrin assembly protein1.2e-15081.27Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        M+ T+++SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHDPHAPPS+KHLS LLSLGK SRATAA+AVEVLMDRLQ+T NSAVALK LIAVHHI 
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE
        KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISR LGF VGSSSS EE ERK EQIS I NS+LLK+TESLV LIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE

Query:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES
        E +K P CLHLN NRLVDKIY FVGDDYL+A+K+ISIRVTEFH RL CLSFGESVELVCALKRL+DCKEKQ MGI A+YE+LMDGFW SIRETKNLIG S
Subjt:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES

Query:  KENRDGGKLARTKSRMSDSGRFLVRPNL--YRDSVRFGSARFDFSCK
        KENRDG KL++ + R+SDSGRF+ R N   Y D + F S RF  + K
Subjt:  KENRDGGKLARTKSRMSDSGRFLVRPNL--YRDSVRFGSARFDFSCK

A0A6J1C3E8 putative clathrin assembly protein At4g400803.5e-15883.24Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MVRT+K+S LIGLIKDKASQSKAAL++KPNILSFQLALLRATTHDP+APP +KHL+ LLSLGK SRATAA+A+EVLMDRLQST NSAVALK L+AVHHIL
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE
        K+GGFILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIETVLS SRILGFFV SSSSIEERE+K EQISA+ NSDLL++TESLV LIE
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIE

Query:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES
        ETTKKPH LHLN N+LVD+I  FV DDYLSA+KEIS+RVTEFH+RLSCLSFGESVELVC LKRLEDCKEKQ +GISAKYEILMDGFWG I ETKNLIGE+
Subjt:  ETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGES

Query:  KENRD----GGKLARTKSRMSDSGRFLVRPNLYRDSVRFGSARFDF
        KENRD    GGKL  T +RMSDSGRF+ R N+YRDS+RFGS RFDF
Subjt:  KENRD----GGKLARTKSRMSDSGRFLVRPNLYRDSVRFGSARFDF

A0A6J1EP16 putative clathrin assembly protein At4g400806.6e-15783.29Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MVRT+K+SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHDPHAPP+ K LS LLSLGK SRATAA+A+EVLMDRLQSTQNSAVALK LIA+HHI+
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSS  ERE+K EQIS   NSDLLKETESL+ LI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI

Query:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE
        EE +K PHCLHLNGN LVDKIYAFVG+DYLSA KEIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFWGSIRE +NLIGE
Subjt:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE

Query:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL
        SK+ R+ GKL RTKSRMSDSGRF+ + N  LYR SVRFGS RFDF+CK IPVL
Subjt:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL

A0A6J1JCT4 putative clathrin assembly protein At4g400805.2e-15482.44Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL
        MV T+K+SSLIGLIKDKASQSKAALL+KPNILSFQLALLRATTHDPHAPP  K LS LLS GK SRATAA+A+EVLMDRLQSTQNSAVALK LIA+HHI+
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHIL

Query:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVL ISRILG FFVGSSSS  ERE+K EQIS   NSDLLKETESL+ LI
Subjt:  KNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILG-FFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLI

Query:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE
        EE +K PHCLHLNGN LVDKIYAFVG+DYLSA KEIS RVTEF  RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFWGSIRE +NLIGE
Subjt:  EETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGE

Query:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL
        SK+ R+ GKL RTKSRMSDSGRF+ + N  L R SVRFGS RFDF+CK IPVL
Subjt:  SKENRDGGKLARTKSRMSDSGRFLVRPN--LYRDSVRFGSARFDFSCKEIPVL

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104101.6e-3030.74Show/hide
Query:  LIGLIKDKASQSKAALL---SKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA KSLI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---SKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIEETTKKP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG F     + +++  + +++S+     ++++T+SLVS  E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIEETTKKP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGESK
            +  N++VD+I   V +DY   ++ + +R+    ERL        G+    +    L RL +CKE    G+  +   L D FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGESK

Query:  ENRDGGKLA
         N+   +LA
Subjt:  ENRDGGKLA

Q8L936 Putative clathrin assembly protein At4g400802.5e-8951.85Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLS---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVH
        M R    + LIG IKDKASQSKAAL+S   K   LSF L++LRATTHDP  PP  +HL+ +LS G  SRATA+SAVE +M+RL +T ++ VALKSLI +H
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLS---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVH

Query:  HILKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVS
        HI+K+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+GFF+ S+SS   +E   E +S++ NSDLL+E ++LV 
Subjt:  HILKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVS

Query:  LIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEI-LMDGFWGSIRETKNL
        L+EE  K P      G  L DKI   VG+DY+S++ E+  R  EF ER + LSFG+++ELVCALKRLE CKE+        ++   +DGFWG + E K +
Subjt:  LIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEI-LMDGFWGSIRETKNL

Query:  IGESKENRDGGKLART------KSRMSDSGRFLVRPNL-YRDSVRFGSARF
        IG  ++N   G++ ++      + +  +S RF  R  + Y + VRF S RF
Subjt:  IGESKENRDGGKLART------KSRMSDSGRFLVRPNL-YRDSVRFGSARF

Q8LBH2 Putative clathrin assembly protein At2g016003.2e-1533.99Show/hide
Query:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATA--ASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD
        G +KD        + S+       +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK+LI +H +L+ G    ++
Subjt:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATA--ASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGF
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   R+L +
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGF

Q9FKQ2 Putative clathrin assembly protein At5g653707.8e-3030.56Show/hide
Query:  KISSLIGLIKDKASQSK---AALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKN
        K+++L G++KD+ASQ K     L S  N  +  LALL+AT+H  + PPS+K+++ L S            V+ ++ RL+ T +  VA K LI +H ++K+
Subjt:  KISSLIGLIKDKASQSK---AALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESL
          G+  +D L          +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ +LG         E++  + +++S+     +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESL

Query:  VSLIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKN
        V L E  + +P       N++V ++   +  DY SA++ + IR  E + R++     +  ELV  L++LE+CKE      S + + L+  FW  + + K+
Subjt:  VSLIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKN

Query:  L
        +
Subjt:  L

Q9LVD8 Putative clathrin assembly protein At5g572001.2e-1429.15Show/hide
Query:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKA--SRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD
        G +KD  +   A + S+       +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  L+ G    ++
Subjt:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKA--SRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISA---------ILNSDLLKETESLVSLI
        +L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   R+L + +       E ER P+   A         +   DLL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISA---------ILNSDLLKETESLVSLI

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein2.2e-1633.99Show/hide
Query:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATA--ASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD
        G +KD        + S+       +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK+LI +H +L+ G    ++
Subjt:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATA--ASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGF
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   R+L +
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGF

AT4G40080.1 ENTH/ANTH/VHS superfamily protein1.8e-9051.85Show/hide
Query:  MVRTQKISSLIGLIKDKASQSKAALLS---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVH
        M R    + LIG IKDKASQSKAAL+S   K   LSF L++LRATTHDP  PP  +HL+ +LS G  SRATA+SAVE +M+RL +T ++ VALKSLI +H
Subjt:  MVRTQKISSLIGLIKDKASQSKAALLS---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVH

Query:  HILKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVS
        HI+K+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E +LS SRI+GFF+ S+SS   +E   E +S++ NSDLL+E ++LV 
Subjt:  HILKNGGFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVS

Query:  LIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEI-LMDGFWGSIRETKNL
        L+EE  K P      G  L DKI   VG+DY+S++ E+  R  EF ER + LSFG+++ELVCALKRLE CKE+        ++   +DGFWG + E K +
Subjt:  LIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEI-LMDGFWGSIRETKNL

Query:  IGESKENRDGGKLART------KSRMSDSGRFLVRPNL-YRDSVRFGSARF
        IG  ++N   G++ ++      + +  +S RF  R  + Y + VRF S RF
Subjt:  IGESKENRDGGKLART------KSRMSDSGRFLVRPNL-YRDSVRFGSARF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein1.1e-3130.74Show/hide
Query:  LIGLIKDKASQSKAALL---SKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA KSLI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---SKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIEETTKKP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ +  + ++LG F     + +++  + +++S+     ++++T+SLVS  E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIEETTKKP

Query:  HCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGESK
            +  N++VD+I   V +DY   ++ + +R+    ERL        G+    +    L RL +CKE    G+  +   L D FW  + E      E K
Subjt:  HCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGESK

Query:  ENRDGGKLA
         N+   +LA
Subjt:  ENRDGGKLA

AT5G57200.1 ENTH/ANTH/VHS superfamily protein8.5e-1629.15Show/hide
Query:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKA--SRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD
        G +KD  +   A + S+       +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  L+ G    ++
Subjt:  GLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKA--SRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISA---------ILNSDLLKETESLVSLI
        +L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   R+L + +       E ER P+   A         +   DLL++  +L  L+
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISA---------ILNSDLLKETESLVSLI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein5.5e-3130.56Show/hide
Query:  KISSLIGLIKDKASQSK---AALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKN
        K+++L G++KD+ASQ K     L S  N  +  LALL+AT+H  + PPS+K+++ L S            V+ ++ RL+ T +  VA K LI +H ++K+
Subjt:  KISSLIGLIKDKASQSK---AALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKN

Query:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESL
          G+  +D L          +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ +LG         E++  + +++S+     +LK+ + L
Subjt:  -GGFILQDQL------SVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESL

Query:  VSLIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKN
        V L E  + +P       N++V ++   +  DY SA++ + IR  E + R++     +  ELV  L++LE+CKE      S + + L+  FW  + + K+
Subjt:  VSLIEETTKKPHCLHLNGNRLVDKIYAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKN

Query:  L
        +
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCGGACTCAAAAAATCAGTTCTCTAATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCCCTTCTCTCCAAGCCCAACATTCTCTCCTTCCAACTCGC
CCTCCTCCGCGCCACCACCCACGACCCCCACGCGCCGCCCAGCGAGAAGCATCTATCCACCCTTCTTTCTCTCGGCAAAGCCTCACGCGCCACCGCTGCTTCCGCCGTTG
AAGTCTTAATGGACCGCCTCCAAAGCACCCAGAACTCCGCCGTCGCCCTCAAGTCCTTAATTGCCGTCCACCATATTCTCAAGAACGGCGGCTTCATTCTACAAGACCAG
CTCTCTGTTTTCCCCTTCACCGGCGGCCGAAATTACCTCAAACTCTCCGATTTCCGCGACAGTTCGAACCCGATTTCTTGGGAGCTTTCCTCTTGGGTCCGATGGTACGC
CCAATACATCGAAACTGTTTTGTCCATTTCGCGAATTCTGGGGTTTTTTGTTGGTTCTTCAAGCTCAATCGAAGAGAGGGAGAGAAAACCAGAGCAGATTTCGGCAATTT
TGAACTCCGATTTGCTCAAAGAGACCGAATCCTTGGTGAGTTTAATCGAAGAAACTACAAAAAAGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATC
TACGCCTTTGTCGGCGACGATTACTTGTCGGCTCTGAAGGAAATTTCAATCCGAGTTACTGAGTTTCACGAGCGGCTCAGTTGCCTGAGTTTCGGCGAATCGGTCGAGTT
GGTTTGTGCATTGAAACGGCTCGAGGATTGCAAAGAAAAGCAATGGATGGGTATTTCTGCGAAGTACGAAATTTTGATGGATGGGTTTTGGGGTTCAATTCGAGAGACAA
AGAACTTGATTGGGGAGTCTAAGGAAAATCGAGACGGCGGTAAATTGGCGAGGACGAAGAGCAGGATGAGCGACTCGGGCCGGTTTTTGGTGCGGCCTAATCTTTATCGC
GACTCGGTCCGGTTCGGTTCGGCGCGGTTCGATTTCAGCTGTAAGGAGATTCCGGTTCTAGTCAAAGTCAACTTCCTCGGCTTCTGGTTTTTGTCAATCAACAATTCAAT
TCCATCGAAGATTGGGCCAGGCCCCAAGGCCCAACCTTTCATTTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAACAGAGAAAAACAATCATAGAGAATCAAACAGGTATTAAAGAATAAAGAGGCAAAAACGACTTGAGAGGCCAAATATTAGCTCGATTTCTGGTGCCTCCAAAAG
CTATCCTAATTACAATTCTCATTTTCTCCATAACCAAAATCTCCTCTACTCTGTGTAGCTGTGAATGTTTCTCTCAGTCTTTCAATGGTGCGGACTCAAAAAATCAGTTC
TCTAATTGGACTCATCAAAGACAAAGCCTCTCAAAGCAAAGCCGCCCTTCTCTCCAAGCCCAACATTCTCTCCTTCCAACTCGCCCTCCTCCGCGCCACCACCCACGACC
CCCACGCGCCGCCCAGCGAGAAGCATCTATCCACCCTTCTTTCTCTCGGCAAAGCCTCACGCGCCACCGCTGCTTCCGCCGTTGAAGTCTTAATGGACCGCCTCCAAAGC
ACCCAGAACTCCGCCGTCGCCCTCAAGTCCTTAATTGCCGTCCACCATATTCTCAAGAACGGCGGCTTCATTCTACAAGACCAGCTCTCTGTTTTCCCCTTCACCGGCGG
CCGAAATTACCTCAAACTCTCCGATTTCCGCGACAGTTCGAACCCGATTTCTTGGGAGCTTTCCTCTTGGGTCCGATGGTACGCCCAATACATCGAAACTGTTTTGTCCA
TTTCGCGAATTCTGGGGTTTTTTGTTGGTTCTTCAAGCTCAATCGAAGAGAGGGAGAGAAAACCAGAGCAGATTTCGGCAATTTTGAACTCCGATTTGCTCAAAGAGACC
GAATCCTTGGTGAGTTTAATCGAAGAAACTACAAAAAAGCCTCACTGTTTGCATCTGAATGGAAACAGATTGGTGGATAAGATCTACGCCTTTGTCGGCGACGATTACTT
GTCGGCTCTGAAGGAAATTTCAATCCGAGTTACTGAGTTTCACGAGCGGCTCAGTTGCCTGAGTTTCGGCGAATCGGTCGAGTTGGTTTGTGCATTGAAACGGCTCGAGG
ATTGCAAAGAAAAGCAATGGATGGGTATTTCTGCGAAGTACGAAATTTTGATGGATGGGTTTTGGGGTTCAATTCGAGAGACAAAGAACTTGATTGGGGAGTCTAAGGAA
AATCGAGACGGCGGTAAATTGGCGAGGACGAAGAGCAGGATGAGCGACTCGGGCCGGTTTTTGGTGCGGCCTAATCTTTATCGCGACTCGGTCCGGTTCGGTTCGGCGCG
GTTCGATTTCAGCTGTAAGGAGATTCCGGTTCTAGTCAAAGTCAACTTCCTCGGCTTCTGGTTTTTGTCAATCAACAATTCAATTCCATCGAAGATTGGGCCAGGCCCCA
AGGCCCAACCTTTCATTTAATCGGATTACTGTAGAAAATTTTGGGCTATTAAAAAGTAGGGATGCACATCAGTCAGAAGCAACTCTAGAAGGAATTATAAGGGGAGTTCA
AAAACTCATTAAAGAAGGAAGCTCTATGGGGGATTACCTAGAGAGAACTGTCCGAGTCACGGCCACGGGTGTTGGACCCTGCCCCCTCTCGACATTTGGCAATTGAACTC
GGATGCCTTGTGGATTGAGGAGAGCAGCAGTGTTAGGCCGATTGGGTTGGATAGTCCGTGACTCTGAGGGATCTCTAATCTGCACCGGTTGCAAGAAATTCTTAGCTTGG
AAGCGAAAGCTGTGTTGGAAGGATTAACTCACCTCCATTGGTCGAGGGGCGAGGATTGTTGTGAAGCTGGATGCCTTGGGGGTTGTCAACCTGCTAAATGGGAAGGAAAG
GGACCCGACTGAGATTGCTTACACCATCTCTGCAATCAGGGAAATGGCACCATGCATCTTTATGAGATGTGTCCTTTTCCTACTGCCCGAGAATGGAAATCCATCGTGAA
TTTGGGTGGCGTACTGTTGAGGATTCCACATTAGACAAGCGAAGATACACCTCACAATATACATATAAGATATATGAGCTATTCCTTTTATTGCCAATTATTTGTTTTGA
GATAGAACCATATATAATCTAATATGGTATCTTATCTTTATTTTCCAATATTTGTATATAGAAATGTCTTAAGATTATATGTCATATCAATAATGTCCTTGCTAATCTAT
ATTAAAATTTACATCAC
Protein sequenceShow/hide protein sequence
MVRTQKISSLIGLIKDKASQSKAALLSKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKASRATAASAVEVLMDRLQSTQNSAVALKSLIAVHHILKNGGFILQDQ
LSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETVLSISRILGFFVGSSSSIEERERKPEQISAILNSDLLKETESLVSLIEETTKKPHCLHLNGNRLVDKI
YAFVGDDYLSALKEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQWMGISAKYEILMDGFWGSIRETKNLIGESKENRDGGKLARTKSRMSDSGRFLVRPNLYR
DSVRFGSARFDFSCKEIPVLVKVNFLGFWFLSINNSIPSKIGPGPKAQPFI