; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018973 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018973
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionENTH domain-containing protein
Genome locationchr5:37194889..37195938
RNA-Seq ExpressionLag0018973
SyntenyLag0018973
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577320.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]1.3e-15883.24Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGSSSS  ERE+KTEQISG LNSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNG GLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

KAG7015410.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-15882.95Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGSSSS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNG GLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

XP_022929539.1 putative clathrin assembly protein At4g40080 [Cucurbita moschata]6.6e-15882.95Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGSSSS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNG GLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

XP_023552000.1 putative clathrin assembly protein At4g40080 [Cucurbita pepo subsp. pepo]6.6e-15882.67Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKP+I+SFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGSSSS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNG GLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK++R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

XP_038903242.1 putative clathrin assembly protein At4g40080 [Benincasa hispida]6.4e-16988.03Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV TK  SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHDPHAPP EKHL  LLSLGKTSRATAAAAVEVLMDRLQ+TQNSAVALKCLIAVHHIV
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+LSISR+LGFFVGSS+S EE+E+KTEQISGILNSDLLKETESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS
        E +K PHCLHLNG  L DKIYAFVGDDYLSAM EISIRVTEFH+RLSCLSFGESVELVCALKRLEDCKEKQ  GIS+KYE+LMD FW SIRETKNLIG S
Subjt:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS

Query:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        KEN++GGKL RTKSRMSDSGR+MER  A  YRD LRFGS+RFDL+CKG PV
Subjt:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

TrEMBL top hitse value%identityAlignment
A0A0A0KXU4 ENTH domain-containing protein1.7e-15684.05Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV TKK SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHD HAPPS+KHLS LLSLGKTSRATAA AVEVLMDRLQ+T NSAVALKCLIAVHHI 
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISW+LSSWVRWYAQYIET+LSISR+LGFFVGSS S EE+ERKTEQISGILNSDLLKETESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIR---ETKNLI
        E +K PHCLHLN   LVDKIY+FVGDDYLSAM EISIRVTEFH RL  LSF ESVELVCALKRLEDCKEKQ MGI AKYE+L+DG W SIR   ETKNL 
Subjt:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIR---ETKNLI

Query:  GHSKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG
        G SKE+R+GGKL +TK R+SDSGR+MER  AS YRDLLRFGS+RF L+  G
Subjt:  GHSKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG

A0A5A7TT50 Putative clathrin assembly protein2.1e-15483.05Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        M+ TK+ SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHDPHAPPS+KHLS LLSLGKTSRATAAAAVEVLMDRLQ+T NSAVALKCLIAVHHI 
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+LSISR LGF VGSSSS EE ERKTEQISGI NS+LLK+TESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS
        E +K P CLHLN   LVDKIY FVGDDYL+AM +ISIRVTEFH RL CLSFGESVELVCALKRL+DCKEKQ MGI A+YE+LMDGFW+SIRETKNLIG S
Subjt:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS

Query:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG
        KENRDG KL + + R+SDSGR++ER  AS Y D+L F S+RF L+ KG
Subjt:  KENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKG

A0A6J1C3E8 putative clathrin assembly protein At4g400808.7e-15681.74Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRT+K S LIGLIKDKASQSKAAL+AKP+ILSFQLALLRATTHDP+APP +KHL+ LLSLGKTSRATAAAA+EVLMDRLQST NSAVALKCL+AVHHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE
        K+G FILQDQLSVFPFTGGRNYLKLSDFRD+S+PISWELSSWVRWYAQYIET+LS SR+LGFFV SSSS+EERE+K+EQIS + NSDLL++TESLVGLIE
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIE

Query:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS
        E TKKPH LHLN   LVD+I  FV DDYLSAM EIS+RVTEFH+RLSCLSFGESVELVC LKRLEDCKEKQ +GISAKYEILMDGFW  I ETKNLIG +
Subjt:  EATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHS

Query:  KENRD----GGKLVRTKSRMSDSGRYMERASIYRDLLRFGSDRFD
        KENRD    GGKL+ T +RMSDSGR+MERA+IYRD +RFGS+RFD
Subjt:  KENRD----GGKLVRTKSRMSDSGRYMERASIYRDLLRFGSDRFD

A0A6J1EP16 putative clathrin assembly protein At4g400803.2e-15882.95Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MVRTKK SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHDPHAPP+ K LS LLSLGKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHI+
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGSSSS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNG GLVDKIYAFVG+DYLSA  EIS RVTEF +RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A +YR  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

A0A6J1JCT4 putative clathrin assembly protein At4g400801.5e-15582.39Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV
        MV TKK SSLIGLIKDKASQSKAALLAKP+ILSFQLALLRATTHDPHAPP  K LS LLS GKTSRATAAAA+EVLMDRLQSTQNSAVALKCLIA+HHIV
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIV

Query:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI
        KNG FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIET+L ISR+LG FFVGSSSS  ERE+KTEQISG  NSDLLKETESL+GLI
Subjt:  KNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLG-FFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLI

Query:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH
        EE +K PHCLHLNG GLVDKIYAFVG+DYLSA  EIS RVTEF  RL CLSFGESVELVCALKRLEDCKEKQ  GIS  +EIL+ GFW SIRE +NLIG 
Subjt:  EEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGH

Query:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV
        SK+ R+ GKL RTKSRMSDSGR+M++  A + R  +RFGS+RFD +CKGIPV
Subjt:  SKENRDGGKLVRTKSRMSDSGRYMER--ASIYRDLLRFGSDRFDLSCKGIPV

SwissProt top hitse value%identityAlignment
Q8H0W9 Putative clathrin assembly protein At5g104101.6e-2931.12Show/hide
Query:  LIGLIKDKASQSKAALL---AKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI
        +IG  KDKAS  KA L+      ++    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ L  + +VLG F     + +++  + +++S      ++++T+SLV   E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP

Query:  HCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW
            +    +VD+I   V +DY   +  + +R+    ERL        G+    +    L RL +CKE  L G+  +   L D FW
Subjt:  HCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW

Q8L936 Putative clathrin assembly protein At4g400802.1e-9051.85Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH
        M R   F+ LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+ +LS G  SRATA++AVE +M+RL +T ++ VALK LI +H
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH

Query:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E LLS SR++GFF+ S+SS   +E   E +S + NSDLL+E ++LVG
Subjt:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVG

Query:  LIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL
        L+EEA K P      G  L DKI   VG+DY+S++NE+  R  EF ER + LSFG+++ELVCALKRLE CKE+        ++   +DGFW  + E K +
Subjt:  LIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL

Query:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF
        IG+ ++N   G++ ++      + +  +S R+ +R  I Y + +RF S RF
Subjt:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF

Q8VYT2 Putative clathrin assembly protein At4g259404.9e-1527.49Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHH
        M     F   +G IKD  + S A +          +A+++AT H   A P E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H 
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHH

Query:  IVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFV----------GSSSSVEERERKTEQISGILNSDL
         ++ G    +++L  +   G  + L++S+F+D ++P++W+ S+W+R YA ++E  L   RVL + +           SS +V+    +T +   + + +L
Subjt:  IVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFV----------GSSSSVEERERKTEQISGILNSDL

Query:  LKETESLVGLI
        L++  +L  L+
Subjt:  LKETESLVGLI

Q9FKQ2 Putative clathrin assembly protein At5g653703.2e-3031.56Show/hide
Query:  KFSSLIGLIKDKASQSK---AALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-
        K ++L G++KD+ASQ K     L +  +  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK 
Subjt:  KFSSLIGLIKDKASQSK---AALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-

Query:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESL
            NG   L++ ++     +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ VLG         E++  +T+++S      +LK+ + L
Subjt:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESL

Query:  VGLIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN
        V L E  + +P         +V ++   +  DY SA+  + IR  E + R++     +  ELV  L++LE+CKE  L   S + + L+  FW  + + K+
Subjt:  VGLIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN

Query:  L
        +
Subjt:  L

Q9LVD8 Putative clathrin assembly protein At5g572001.0e-1528.5Show/hide
Query:  FSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGS
        F    G +KD  +   A +          +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  ++ G 
Subjt:  FSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGS

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----SSSVEERERKTEQISGILNSDLLKETESLVGLI
           +++L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   RVL + + +     +S    +  +T  +SG    DLL++  +L  L+
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----SSSVEERERKTEQISGILNSDLLKETESLVGLI

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein3.5e-1633.99Show/hide
Query:  GLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATA--AAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQD
        G +KD  S     +          +A+++AT H    PP ++HL  + +    +RA A  A  +  L  RL  T+N  VALK LI +H +++ G    ++
Subjt:  GLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATA--AAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQD

Query:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGF
        +L  F   G    L+LS+F+D S+PI+W+ S+WVR YA ++E  L   RVL +
Subjt:  QLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGF

AT4G40080.1 ENTH/ANTH/VHS superfamily protein1.5e-9151.85Show/hide
Query:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH
        M R   F+ LIG IKDKASQSKAAL++   K   LSF L++LRATTHDP  PP  +HL+ +LS G  SRATA++AVE +M+RL +T ++ VALK LI +H
Subjt:  MVRTKKFSSLIGLIKDKASQSKAALLA---KPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVH

Query:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVG
        HIVK+G FILQDQLSVFP +GGRNYLKLS FRD  +P+ WELSSWVRWYA Y+E LLS SR++GFF+ S+SS   +E   E +S + NSDLL+E ++LVG
Subjt:  HIVKNGSFILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVG

Query:  LIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL
        L+EEA K P      G  L DKI   VG+DY+S++NE+  R  EF ER + LSFG+++ELVCALKRLE CKE+        ++   +DGFW  + E K +
Subjt:  LIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEI-LMDGFWNSIRETKNL

Query:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF
        IG+ ++N   G++ ++      + +  +S R+ +R  I Y + +RF S RF
Subjt:  IGHSKENRDGGKLVRT------KSRMSDSGRYMERASI-YRDLLRFGSDRF

AT5G10410.1 ENTH/ANTH/VHS superfamily protein1.1e-3031.12Show/hide
Query:  LIGLIKDKASQSKAALL---AKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI
        +IG  KDKAS  KA L+      ++    LALL++TT  P+ PP+  ++S ++S   +  A AA +  +   RL+ T+N+ VA K LI +H ++K+    
Subjt:  LIGLIKDKASQSKAALL---AKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFI

Query:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP
         +D+        GRN LKL++F D S+ ++ ELS W+RWY QY++ L  + +VLG F     + +++  + +++S      ++++T+SLV   E    +P
Subjt:  LQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKP

Query:  HCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW
            +    +VD+I   V +DY   +  + +R+    ERL        G+    +    L RL +CKE  L G+  +   L D FW
Subjt:  HCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERL---SCLSFGE--SVELVCALKRLEDCKEKQLMGISAKYEILMDGFW

AT5G57200.1 ENTH/ANTH/VHS superfamily protein7.1e-1728.5Show/hide
Query:  FSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGS
        F    G +KD  +   A +          +A+++AT H   +PP E+H+  + S       RA  A  +  L  RL  T+N  VA+K LI +H  ++ G 
Subjt:  FSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKT--SRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGS

Query:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----SSSVEERERKTEQISGILNSDLLKETESLVGLI
           +++L    ++  R+ L++S+F+D ++P++W+ S+WVR YA ++E  L   RVL + + +     +S    +  +T  +SG    DLL++  +L  L+
Subjt:  FILQDQLSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGS-----SSSVEERERKTEQISGILNSDLLKETESLVGLI

AT5G65370.1 ENTH/ANTH/VHS superfamily protein2.3e-3131.56Show/hide
Query:  KFSSLIGLIKDKASQSK---AALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-
        K ++L G++KD+ASQ K     L +  +  +  LALL+AT+H  + PPS+K+++ L S   T        V+ ++ RL+ T +  VA KCLI +H +VK 
Subjt:  KFSSLIGLIKDKASQSK---AALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVK-

Query:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESL
            NG   L++ ++     +T G + LKL+D   +S+  + EL+ WV+WY QY++  LSI+ VLG         E++  +T+++S      +LK+ + L
Subjt:  ----NGSFILQDQLS--VFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESL

Query:  VGLIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN
        V L E  + +P         +V ++   +  DY SA+  + IR  E + R++     +  ELV  L++LE+CKE  L   S + + L+  FW  + + K+
Subjt:  VGLIEEATKKPHCLHLNGYGLVDKIYAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKN

Query:  L
        +
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCGCACAAAAAAGTTCAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCACAGAGCAAAGCCGCCCTTCTCGCCAAGCCCAGTATTCTCTCCTTCCAGCTCGC
CCTCCTCCGCGCCACCACCCACGATCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCCACCCTTCTCTCTCTCGGCAAAACCTCACGCGCCACCGCTGCTGCCGCCGTCG
AAGTCTTAATGGACCGCCTCCAAAGCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCGCCGTTCACCACATCGTCAAAAACGGCAGCTTCATTCTACAAGACCAG
CTCTCTGTTTTTCCTTTCACCGGCGGCAGAAATTACCTCAAACTCTCCGATTTCCGCGACAGTTCGAATCCGATTTCTTGGGAGCTGTCCTCTTGGGTCCGATGGTACGC
CCAATACATCGAAACTCTTTTGTCCATTTCCCGAGTTTTGGGGTTCTTTGTTGGATCTTCGAGCTCGGTCGAAGAGAGGGAGAGAAAAACAGAGCAGATTTCGGGGATTT
TGAACTCCGATTTGCTCAAAGAGACCGAATCCTTGGTGGGTTTAATCGAAGAAGCTACGAAAAAGCCTCACTGTTTGCATCTGAATGGATACGGATTGGTGGATAAGATC
TACGCCTTTGTCGGTGACGATTACTTGTCGGCTATGAATGAAATTTCGATCCGAGTTACTGAGTTTCACGAGCGGCTCAGTTGCCTCAGTTTCGGTGAATCGGTTGAGTT
GGTTTGTGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAACAATTGATGGGTATTTCTGCAAAATACGAAATTTTGATGGATGGGTTTTGGAATTCGATTCGAGAGACGA
AGAATTTGATTGGGCACTCAAAGGAAAATCGAGACGGCGGTAAATTGGTCAGGACGAAGAGCCGGATGAGCGACTCGGGCCGGTATATGGAGCGGGCTAGTATTTATCGC
GACTTGCTCCGGTTCGGTTCAGATCGATTCGATTTGAGCTGTAAAGGAATTCCGGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGCGCACAAAAAAGTTCAGTTCCCTAATTGGACTCATCAAAGACAAAGCCTCACAGAGCAAAGCCGCCCTTCTCGCCAAGCCCAGTATTCTCTCCTTCCAGCTCGC
CCTCCTCCGCGCCACCACCCACGATCCCCACGCGCCGCCCAGCGAGAAGCACCTCTCCACCCTTCTCTCTCTCGGCAAAACCTCACGCGCCACCGCTGCTGCCGCCGTCG
AAGTCTTAATGGACCGCCTCCAAAGCACCCAAAACTCCGCCGTCGCCCTCAAGTGTCTAATCGCCGTTCACCACATCGTCAAAAACGGCAGCTTCATTCTACAAGACCAG
CTCTCTGTTTTTCCTTTCACCGGCGGCAGAAATTACCTCAAACTCTCCGATTTCCGCGACAGTTCGAATCCGATTTCTTGGGAGCTGTCCTCTTGGGTCCGATGGTACGC
CCAATACATCGAAACTCTTTTGTCCATTTCCCGAGTTTTGGGGTTCTTTGTTGGATCTTCGAGCTCGGTCGAAGAGAGGGAGAGAAAAACAGAGCAGATTTCGGGGATTT
TGAACTCCGATTTGCTCAAAGAGACCGAATCCTTGGTGGGTTTAATCGAAGAAGCTACGAAAAAGCCTCACTGTTTGCATCTGAATGGATACGGATTGGTGGATAAGATC
TACGCCTTTGTCGGTGACGATTACTTGTCGGCTATGAATGAAATTTCGATCCGAGTTACTGAGTTTCACGAGCGGCTCAGTTGCCTCAGTTTCGGTGAATCGGTTGAGTT
GGTTTGTGCGTTGAAACGGCTCGAGGATTGCAAAGAAAAACAATTGATGGGTATTTCTGCAAAATACGAAATTTTGATGGATGGGTTTTGGAATTCGATTCGAGAGACGA
AGAATTTGATTGGGCACTCAAAGGAAAATCGAGACGGCGGTAAATTGGTCAGGACGAAGAGCCGGATGAGCGACTCGGGCCGGTATATGGAGCGGGCTAGTATTTATCGC
GACTTGCTCCGGTTCGGTTCAGATCGATTCGATTTGAGCTGTAAAGGAATTCCGGTCTAG
Protein sequenceShow/hide protein sequence
MVRTKKFSSLIGLIKDKASQSKAALLAKPSILSFQLALLRATTHDPHAPPSEKHLSTLLSLGKTSRATAAAAVEVLMDRLQSTQNSAVALKCLIAVHHIVKNGSFILQDQ
LSVFPFTGGRNYLKLSDFRDSSNPISWELSSWVRWYAQYIETLLSISRVLGFFVGSSSSVEERERKTEQISGILNSDLLKETESLVGLIEEATKKPHCLHLNGYGLVDKI
YAFVGDDYLSAMNEISIRVTEFHERLSCLSFGESVELVCALKRLEDCKEKQLMGISAKYEILMDGFWNSIRETKNLIGHSKENRDGGKLVRTKSRMSDSGRYMERASIYR
DLLRFGSDRFDLSCKGIPV