; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030056 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030056
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionENTH domain-containing protein
Genome locationscaffold6:11793265..11794314
RNA-Seq ExpressionSpg030056
SyntenySpg030056
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577320.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]9.2e-16083.52Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGS SS  ERE+KTEQISG LNSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNGNGLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

KAG7015410.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.7e-15983.24Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGS SS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNGNGLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata]4.6e-15983.24Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGS SS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNGNGLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]4.6e-15982.95Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKPNI+SFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGS SS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNGNGLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK++R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]4.4e-17088.32Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV TK  SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP EKHL  LLSLGKTSRATAAAAVEVLMDRLQ+TQNSAVALKCLIAVHHIV
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+LSISR+LGFFVGS +S EE+E+KTEQISGILNSDLLKETESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS
        E +K PHCLHLNGN L DKIYAFVGDDYLSAM EISIRVTEFH+RLSCLSFGESVELVCALKRLEDCKEKQ  GIS+KYE+LMD FW SIRETKNLIG S
Subjt:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS

Query:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        KEN++GGKL RTKSRMSDSGR+MER  A  YRD LRFGS+RFDL+CKG PV
Subjt:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein1.2e-15784.33Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV TKK SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHD HAPPS+KHLS LLSLGKTSRATAA AVEVLMDRLQ+T NSAVALKCLIAVHHI 
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIET+LSISR+LGFFVGS  S EE+ERKTEQISGILNSDLLKETESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIR---ETKNLI
        E +K PHCLHLN N LVDKIY+FVGDDYLSAM EISIRVTEFH RL  LSF ESVELVCALKRLEDCKEKQ MGI AKYE+L+DG W SIR   ETKNL 
Subjt:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIR---ETKNLI

Query:  GHSKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG
        G SKE+R+GGKL +TK R+SDSGR+MER  AS YRDLLRFGS+RF L+  G
Subjt:  GHSKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG

A0A5A7TT50 Putative clathrin assembly protein1.5e-15583.33Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        M+ TK+ SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATAAAAVEVLMDRLQ+T NSAVALKCLIAVHHI 
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+LSISR LGF VGS SS EE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS
        E +K P CLHLN N LVDKIY FVGDDYL+AM +ISIRVTEFH RL CLSFGESVELVCALKRL+DCKEKQ MGI A+YE+LMDGFW+SIRETKNLIG S
Subjt:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS

Query:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG
        KENRDG KL + + R+SDSGR++ER  AS Y D+L F S+RF L+ KG
Subjt:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG

A0A6J1C3E8 putative clathrin assembly protein At4g400804.6e-15782.03Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRT+K S LIGLIKDKASQSKAAL+AKPNILSFQLALLRATTHDP+APP +KHL+ LLSLGKTSRATAAAA+EVLMDRLQST NSAVALKCL+AVHHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIET+LS SR+LGFFV S SS+EERE+K+EQIS + NSDLL++TESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS
        E TKKPH LHLN N LVD+I  FV DDYLSAM EIS+RVTEFH+RLSCLSFGESVELVC LKRLEDCKEKQ +GISAKYEILMDGFW  I ETKNLIG +
Subjt:  EATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS

Query:  KENRD----GGKLVRTKSRMSDSGRYMERASIYRDLLRFGSDRFD
        KENRD    GGKL+ T +RMSDSGR+MERA+IYRD +RFGS+RFD
Subjt:  KENRD----GGKLVRTKSRMSDSGRYMERASIYRDLLRFGSDRFD

A0A6J1EP16 putative clathrin assembly protein At4g400802.2e-15983.24Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGS SS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNGNGLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

A0A6J1JCT4 putative clathrin assembly protein At4g400801.0e-15682.67Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV TKK SSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPP  K LS LLS GKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHIV
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGS SS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNGNGLVDKIYAFVG+DYLSA  EIS RVTEF  RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A + R  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104108.4e-3131.47Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ L  + +VLG F   + + +++  + +++S      ++++T+SLV   E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP

Query:  HCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW
            +  N +VD+I   V +DY   +  + +R+    ERL        G+    +    L RL +CKE  L G+  +   L D FW
Subjt:  HCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW

Q8L936 Putative clathrin assembly protein At4g400809.4e-9151.85Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH
        M R   F+ LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+ +LS G  SRATA++AVE +M+RL +T ++ VALK LI +H
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH

Query:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E LLS SR++GFF+ S SS   +E   E +S + NSDLL+E ++LVG
Subjt:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVG

Query:  LIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL
        L+EEA K P      G  L DKI   VG+DY+S++NE+  R  EF ER + LSFG+++ELVCALKRLE CKE+        ++   +DGFW  + E K +
Subjt:  LIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL

Query:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF
        IG+ ++N   G++ ++      + +  +S R+ +R  I Y + +RF S RF
Subjt:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF

Q8LBH2 Putative clathrin assembly protein At2g016003.8e-1533.99Show/hide
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATA--AAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQD
        G +KD  S     +          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATA--AAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGF
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   RVL +
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGF

Q9FKQ2 Putative clathrin assembly protein At5g653707.6e-3232.23Show/hide
Query:  KFSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-
        K ++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK 
Subjt:  KFSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-

Query:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESL
            NG   L++ ++     +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ VLG         E++  +T+++S      +LK+ + L
Subjt:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESL

Query:  VGLIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN
        V L E  + +P       N +V ++   +  DY SA+  + IR  E + R++     +  ELV  L++LE+CKE  L   S + + L+  FW  + + K+
Subjt:  VGLIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN

Query:  L
        +
Subjt:  L

Q9LVD8 Putative clathrin assembly protein At5g572005.8e-1629.7Show/hide
Query:  FSSLIGLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKN
        F    G +KD  +      LAK N       +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  ++ 
Subjt:  FSSLIGLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKN

Query:  GSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----MSSVEERERKTEQISGILNSDLLKETESLVG
        G    +++L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   RVL + + +      S    +  +T  +SG    DLL++  +L  
Subjt:  GSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----MSSVEERERKTEQISGILNSDLLKETESLVG

Query:  LI
        L+
Subjt:  LI

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein2.7e-1633.99Show/hide
Query:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATA--AAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQD
        G +KD  S     +          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    ++
Subjt:  GLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATA--AAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGF
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   RVL +
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGF

AT4G40080.1 ENTH/ANTH/VHS superfamily protein6.7e-9251.85Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH
        M R   F+ LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+ +LS G  SRATA++AVE +M+RL +T ++ VALK LI +H
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH

Query:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E LLS SR++GFF+ S SS   +E   E +S + NSDLL+E ++LVG
Subjt:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVG

Query:  LIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL
        L+EEA K P      G  L DKI   VG+DY+S++NE+  R  EF ER + LSFG+++ELVCALKRLE CKE+        ++   +DGFW  + E K +
Subjt:  LIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL

Query:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF
        IG+ ++N   G++ ++      + +  +S R+ +R  I Y + +RF S RF
Subjt:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein6.0e-3231.47Show/hide
Query:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI
        +IG  KDKAS  KA L+       +    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ L  + +VLG F   + + +++  + +++S      ++++T+SLV   E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP

Query:  HCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW
            +  N +VD+I   V +DY   +  + +R+    ERL        G+    +    L RL +CKE  L G+  +   L D FW
Subjt:  HCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW

AT5G57200.1 ENTH/ANTH/VHS superfamily protein4.1e-1729.7Show/hide
Query:  FSSLIGLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKN
        F    G +KD  +      LAK N       +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  ++ 
Subjt:  FSSLIGLIKDKASQSKAALLAKPN--ILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKN

Query:  GSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----MSSVEERERKTEQISGILNSDLLKETESLVG
        G    +++L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   RVL + + +      S    +  +T  +SG    DLL++  +L  
Subjt:  GSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----MSSVEERERKTEQISGILNSDLLKETESLVG

Query:  LI
        L+
Subjt:  LI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein5.4e-3332.23Show/hide
Query:  KFSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-
        K ++L G++KD+ASQ K     L +  N  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK 
Subjt:  KFSSLIGLIKDKASQSK---AALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-

Query:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESL
            NG   L++ ++     +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ VLG         E++  +T+++S      +LK+ + L
Subjt:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESL

Query:  VGLIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN
        V L E  + +P       N +V ++   +  DY SA+  + IR  E + R++     +  ELV  L++LE+CKE  L   S + + L+  FW  + + K+
Subjt:  VGLIEEATKKPHCLHLNGNGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN

Query:  L
        +
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCGCACAAAAAAGTTCAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCACAGAGCAAAGCCGCCCTTCTCGCCAAGCCCAATATTCTCTCCTTCCAGCTCGC
CCTCCTCCGCGCCACCACCCACGACCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCCACCCTTCTCTCTCTCGGAAAAACCTCACGCGCCACCGCCGCTGCCGCCGTCG
AAGTCTTAATGGACCGCCTCCAAAGCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTTATCGCCGTTCACCACATCGTCAAAAACGGCAGCTTCATTCTACAAGACCAG
CTCTCTGTTTTTCCTTTCACCGGCGGCAGAAATTACCTCAAACTCTCCGATTTCCGCGACAGTTCGAATCCGATTTCTTGGGAGCTTTCCTCTTGGGTCCGATGGTACGC
CCAATACATCGAAACTCTTTTGTCCATTTCCCGAGTTTTGGGGTTTTTTGTTGGATCTATGAGCTCAGTCGAAGAGAGGGAGAGAAAAACAGAGCAGATTTCGGGGATTT
TGAACTCCGATTTGCTCAAAGAGACCGAATCCTTGGTGGGTTTAATCGAAGAAGCTACGAAAAAGCCTCACTGTTTGCATCTGAATGGAAACGGATTGGTGGATAAGATC
TACGCCTTTGTCGGTGACGATTACTTGTCGGCTATGAATGAAATTTCAATCCGAGTTACTGAGTTTCACGAGCGGCTCAGTTGCCTCAGTTTCGGTGAATCGGTTGAGTT
GGTTTGTGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAACAATTGATGGGTATTTCTGCAAAATACGAAATTTTGATGGATGGGTTTTGGAATTCGATTCGAGAGACGA
AGAATTTGATTGGGCACTCAAAGGAAAATCGAGACGGCGGTAAATTGGTGAGGACGAAGAGCCGGATGAGCGACTCGGGCCGGTATATGGAGCGGGCTAGTATTTATCGC
GACTTGCTCCGGTTCGGTTCGGATCGGTTCGACTTGAGCTGTAAAGGAATTCCGGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGCGCACAAAAAAGTTCAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCACAGAGCAAAGCCGCCCTTCTCGCCAAGCCCAATATTCTCTCCTTCCAGCTCGC
CCTCCTCCGCGCCACCACCCACGACCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCCACCCTTCTCTCTCTCGGAAAAACCTCACGCGCCACCGCCGCTGCCGCCGTCG
AAGTCTTAATGGACCGCCTCCAAAGCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTTATCGCCGTTCACCACATCGTCAAAAACGGCAGCTTCATTCTACAAGACCAG
CTCTCTGTTTTTCCTTTCACCGGCGGCAGAAATTACCTCAAACTCTCCGATTTCCGCGACAGTTCGAATCCGATTTCTTGGGAGCTTTCCTCTTGGGTCCGATGGTACGC
CCAATACATCGAAACTCTTTTGTCCATTTCCCGAGTTTTGGGGTTTTTTGTTGGATCTATGAGCTCAGTCGAAGAGAGGGAGAGAAAAACAGAGCAGATTTCGGGGATTT
TGAACTCCGATTTGCTCAAAGAGACCGAATCCTTGGTGGGTTTAATCGAAGAAGCTACGAAAAAGCCTCACTGTTTGCATCTGAATGGAAACGGATTGGTGGATAAGATC
TACGCCTTTGTCGGTGACGATTACTTGTCGGCTATGAATGAAATTTCAATCCGAGTTACTGAGTTTCACGAGCGGCTCAGTTGCCTCAGTTTCGGTGAATCGGTTGAGTT
GGTTTGTGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAACAATTGATGGGTATTTCTGCAAAATACGAAATTTTGATGGATGGGTTTTGGAATTCGATTCGAGAGACGA
AGAATTTGATTGGGCACTCAAAGGAAAATCGAGACGGCGGTAAATTGGTGAGGACGAAGAGCCGGATGAGCGACTCGGGCCGGTATATGGAGCGGGCTAGTATTTATCGC
GACTTGCTCCGGTTCGGTTCGGATCGGTTCGACTTGAGCTGTAAAGGAATTCCGGTCTAG
Protein sequenceShow/hide protein sequence
MVRTKKFSSLIGLIKDKASQSKAALLAKPNILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQDQ
LSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSMSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKPHCLHLNGNGLVDKI
YAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHSKENRDGGKLVRTKSRMSDSGRYMERASIYR
DLLRFGSDRFDLSCKGIPV