; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G004140 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G004140
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionHyccin
Genome locationCmo_Chr04:2046440..2047699
RNA-Seq ExpressionCmoCh04G004140
SyntenyCmoCh04G004140
Gene Ontology termsGO:0046854 - phosphatidylinositol phosphorylation (biological process)
GO:0072659 - protein localization to plasma membrane (biological process)
GO:0005829 - cytosol (cellular component)
GO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR018619 - Hyccin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600228.1 Family With Sequence Similarity 126 Member B-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.2e-23299.52Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
        MDFHRNPSVNNRHSPSP SSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPH DSFFLSSLADSDRPALSLLSSHDAYSVISSALS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS

Query:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
        SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
Subjt:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK

Query:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
        PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
Subjt:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED

Query:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
        VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
Subjt:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA

Query:  TNAYTPSKDKKSEILLVSK
        TNAYTPSKDKKSEILLVSK
Subjt:  TNAYTPSKDKKSEILLVSK

KAG7030887.1 hypothetical protein SDJN02_04924, partial [Cucurbita argyrosperma subsp. argyrosperma]7.5e-20992.09Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
        MDFHRNPSVNNRHSPSP SSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPH DSFFLSSLADSDRPALSLLSSHDAYSVISSALS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS

Query:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
        SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
Subjt:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK

Query:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
        PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKED                     
Subjt:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED

Query:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
                 VETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
Subjt:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA

Query:  TNAYTPSKDKKSEILLV
        TNAYTPSKDKKSEILL+
Subjt:  TNAYTPSKDKKSEILLV

XP_022943163.1 uncharacterized protein LOC111447976 [Cucurbita moschata]4.0e-234100Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
        MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS

Query:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
        SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
Subjt:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK

Query:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
        PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
Subjt:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED

Query:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
        VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
Subjt:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA

Query:  TNAYTPSKDKKSEILLVSK
        TNAYTPSKDKKSEILLVSK
Subjt:  TNAYTPSKDKKSEILLVSK

XP_022990402.1 uncharacterized protein LOC111487267 [Cucurbita maxima]9.5e-22897.85Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
        MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTA ADNDPMHSWWESVSKARSRIHALSSILPPH DSFFLSSLADSDRPALSLLSSHDAYSVISSALS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS

Query:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
        SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLV IPDLSQPSLYHSPLNK
Subjt:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK

Query:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
        PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRA IIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFD EDGLDIDG SEKRALEYGDEI+D
Subjt:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED

Query:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
        VSVKMGNLQVETCGNN DDSEPKGFRIPLPWELLQPL+RILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
Subjt:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA

Query:  TNAYTPSKDKKSEILLVSK
        TNAYTPSKDKKSEILLVSK
Subjt:  TNAYTPSKDKKSEILLVSK

XP_023523369.1 uncharacterized protein LOC111787583 [Cucurbita pepo subsp. pepo]3.5e-23099.05Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
        MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPH DSFFLSSLADSDRPALSLLSSHDAYSVISSALS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS

Query:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
        SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
Subjt:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK

Query:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
        PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRA IIGVALDCYYKQISQMPSWSKLELC SAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
Subjt:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED

Query:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
        VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAA AATA
Subjt:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA

Query:  TNAYTPSKDKKSEILLVSK
        TNAYTPSKDKKSEILLVSK
Subjt:  TNAYTPSKDKKSEILLVSK

TrEMBL top hitse value%identityAlignment
A0A0A0KZ90 Uncharacterized protein1.1e-20588.94Show/hide
Query:  MDFHRNPSVNNRHSPSP-TSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSAL
        MDFHRNPS+NNRHS SP +SS SSTT  HNP+A TA+AD DPMHSWWESVSKARSRIHALSSILPPH DSFFLSS+ADSDRPALSLLSSHDAYSVISSAL
Subjt:  MDFHRNPSVNNRHSPSP-TSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSAL

Query:  SSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLN
        SSS+SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVKSRAGKPV+V+IPDLSQPSLYHSP+N
Subjt:  SSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLN

Query:  KPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIE
        KPNS AQAQ RPSVGVL PSLEPQNAVKSTKRACI+GVALDCYYKQISQMPSWSKLE CRSAASWAGQDCCC REFDKEDG D+ GFSEKRALEY DEIE
Subjt:  KPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIE

Query:  DVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAAT
        D S +MG LQ+E CGNNS+DSEPKG RIPLPWELLQP+LRILGHCLLAPLNSQDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAA 
Subjt:  DVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAAT

Query:  A-----TNAYTPSKDKKSEILLVSK
        A     +NA TPSKDKK EILLVSK
Subjt:  A-----TNAYTPSKDKKSEILLVSK

A0A1S3C1D0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034957695.1e-20388.32Show/hide
Query:  MDFHRNPSVNNRHSPSP-TSSVSSTTVPH--NPSAT---TATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSV
        MDFHRNPS+NNRHS SP +SS SSTT P   NP+AT   +A+AD DPMHSWWESVSKARSRIHALSSILPPH DSFFLSS+ADSDRPALSLLSSHDAYSV
Subjt:  MDFHRNPSVNNRHSPSP-TSSVSSTTVPH--NPSAT---TATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLY
        ISSALSSS SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVKSRAGKPV+V+IPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLY

Query:  HSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEY
        HSPLNKPNS AQAQ RPSVGVL PSLEPQNAVKSTKRACI+GVALDCYYKQISQMPSWSKL  CRSAASWAGQDCCC REFDKEDGLD+ GFSEKRALEY
Subjt:  HSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEY

Query:  GDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAA
         DEIED S +MG LQ+E CGNNS+DSEPKG RIPLPWELLQP+LRILGHCLL PLNSQDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAA
Subjt:  GDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAA

Query:  AKAATA---TNAYTPSKDKKSEILLVSK
        AKAA A   +NA TPSKDKK EILLVSK
Subjt:  AKAATA---TNAYTPSKDKKSEILLVSK

A0A6J1FWG4 uncharacterized protein LOC1114479761.9e-234100Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
        MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS

Query:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
        SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
Subjt:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK

Query:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
        PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
Subjt:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED

Query:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
        VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
Subjt:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA

Query:  TNAYTPSKDKKSEILLVSK
        TNAYTPSKDKKSEILLVSK
Subjt:  TNAYTPSKDKKSEILLVSK

A0A6J1J9X7 uncharacterized protein LOC1114826045.1e-20387.41Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSAT------TATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSV
        MDFHRNPS+NNRHS SP+SS SSTT  HNPSA+      TATAD+DPMHSWWESVSKARSRIHALSSILPPH DSFFLSS+ADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSAT------TATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVKSRAGKPVLV+IPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLY

Query:  HSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEY
        HSP NKP+SVAQAQ RPSVGVL PSLEPQNAVKSTKRACI+GVALDCYYKQI  MPSWSKLE CRSAASWAGQDCCCKREFDKED L+I GFSEKRALE+
Subjt:  HSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEY

Query:  GDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAA
         DEIEDVS +MG LQ+E  G+NSDDSEPK FRIPLPWELLQP+LRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR RAA
Subjt:  GDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAA

Query:  AKAA----TATNAYTPSKDKKSEILLVSK
        AKAA    +++NA TPSKDKK EILLVSK
Subjt:  AKAA----TATNAYTPSKDKKSEILLVSK

A0A6J1JIL2 uncharacterized protein LOC1114872674.6e-22897.85Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS
        MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTA ADNDPMHSWWESVSKARSRIHALSSILPPH DSFFLSSLADSDRPALSLLSSHDAYSVISSALS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALS

Query:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK
        SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLV IPDLSQPSLYHSPLNK
Subjt:  SSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNK

Query:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED
        PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRA IIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFD EDGLDIDG SEKRALEYGDEI+D
Subjt:  PNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIED

Query:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
        VSVKMGNLQVETCGNN DDSEPKGFRIPLPWELLQPL+RILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA
Subjt:  VSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA

Query:  TNAYTPSKDKKSEILLVSK
        TNAYTPSKDKKSEILLVSK
Subjt:  TNAYTPSKDKKSEILLVSK

SwissProt top hitse value%identityAlignment
Q5R977 Protein FAM126B4.9e-0931.01Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQ
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +   IP LS+PS+YH P +   S+A 
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQ

Query:  AQ----FRPSVGVLCPSLEPQ-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR
         +        + V+   L PQ     +  R  ++   + CY   I  MP+ S   LCR
Subjt:  AQ----FRPSVGVLCPSLEPQ-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR

Q5ZM13 Hyccin4.6e-0729.9Show/hide
Query:  LSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSS
        +SS A + +   +L+SS   Y VI    S  +     +P+CH L++ + S +  L    L FLP L   YL+   S S D  S   +   EA+LL +Y+ 
Subjt:  LSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSS

Query:  EVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSV----AQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR
        E+  + G  K +   IP LS+PS+YH P +  +      A +Q   S  V       +  + +  R  ++   L CY   +S MP+ S   LC+
Subjt:  EVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSV----AQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR

Q6P9N1 Hyccin4.6e-0729.94Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSV--
        +P+CH L++ + S +  L    L FLP L   YL+      S S    S    EA+LL +Y+ E+  + G  K +   IP LS+PS+YH P +  +    
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSV--

Query:  --AQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR
          A +Q   S  V       +  + +  R  ++   L CY   ++ MPS S   LC+
Subjt:  --AQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR

Q8C729 Protein FAM126B6.4e-0931.01Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQ
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +   IP LS+PS+YH P +   S+A 
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQ

Query:  AQ----FRPSVGVLCPSLEPQ-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR
         +        + V+   L PQ     +  R  ++   + CY   I  MP+ S   LCR
Subjt:  AQ----FRPSVGVLCPSLEPQ-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR

Q8IXS8 Protein FAM126B4.9e-0931.01Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQ
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +   IP LS+PS+YH P +   S+A 
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQ

Query:  AQ----FRPSVGVLCPSLEPQ-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR
         +        + V+   L PQ     +  R  ++   + CY   I  MP+ S   LCR
Subjt:  AQ----FRPSVGVLCPSLEPQ-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR

Arabidopsis top hitse value%identityAlignment
AT5G21050.1 LOCATED IN: chloroplast1.5e-5339.73Show/hide
Query:  TSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPA-LSLLSSHDAYS-VISSALSSSVSGSGSDPLCHWL
        + S SS   P +P A T  ++    ++  ES +K ++ I +LS+I            + +++ P+ +++L   +A S  ISS L    SG+G + LC WL
Subjt:  TSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPA-LSLLSSHDAYS-VISSALSSSVSGSGSDPLCHWL

Query:  YDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSP--LNKPNSVAQAQFRPSV
        YDTF S++P L+L+VL F+PL++ LYLSRV       P     AGFEAVLLALY+ E  SRAG+ + V IPDLS PS+YH    L + N+        ++
Subjt:  YDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSP--LNKPNSVAQAQFRPSV

Query:  GVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIEDVSVKMGNLQVETC
         V+  +L+P   V+ST+RA I+GVAL+ YY +IS+MP  SKL  C S   WAGQ+     E ++     I   S+    +   E E+V++          
Subjt:  GVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIEDVSVKMGNLQVETC

Query:  GNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLA-PLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQL
              SE    RIPLPWELLQP+LRILGHCLL   +  +++ +AA+ A + LY R+ HD+ P+ ILAT SL++L
Subjt:  GNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLA-PLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQL

AT5G64090.1 FUNCTIONS IN: molecular_function unknown5.5e-12557.86Show/hide
Query:  MDFHRNPSVNNRHSPSPTSSVSSTTVPH------NPSAT---------TATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFF-------LSSLADS
        MDF   PS     SPSP+SS SS+T PH       P+AT         +A AD DPMHSWWESVSK RSRI +LSS+L    DS F       +SSLADS
Subjt:  MDFHRNPSVNNRHSPSPTSSVSSTTVPH------NPSAT---------TATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFF-------LSSLADS

Query:  DRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG
        DRPALSLLSS  AYS+IS++L +  SGSGSDPLC WLY+T+LSSDP LRLVVLSF PLL  +YLSR+H  SSDS S PSL+GFEAVLLA+Y++EVK+RAG
Subjt:  DRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG

Query:  KPVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKE
        KP+LV IPDLSQPSLYH+P N  +    +    SVGVL P LEPQ AVKSTKRA I+GV L CY+K+ISQMP+WSKLE C+ +ASWAGQDC CK + D++
Subjt:  KPVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKE

Query:  DGLDI---DGF--------SEKRALEYGDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYA
        +   +   +GF        S  R+LE  ++ + ++++    Q+ +  N       +G RIPLPWEL QP LRILGHCLL+PLN++DVKDAAS AVR LYA
Subjt:  DGLDI---DGF--------SEKRALEYGDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYA

Query:  RASHDLVPQVILATRSLIQLDNRTRAAAKAATA------TNAYTPSKDKKSEILLVSK
        RASHDL PQ ILATRSL+ LD   R + K   A      +N  TPSK KK EILL SK
Subjt:  RASHDLVPQVILATRSLIQLDNRTRAAAKAATA------TNAYTPSKDKKSEILLVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTCCACCGGAATCCTTCCGTCAACAACCGCCATTCCCCCTCCCCCACCTCCTCTGTCTCCTCCACCACCGTCCCACACAACCCCTCCGCCACCACCGCCACTGC
CGATAACGACCCTATGCACTCATGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCTCTTTCCTCCATCCTTCCCCCTCATTGCGACTCGTTTTTTCTCTCCT
CTCTCGCCGATTCCGACCGGCCGGCCCTCTCTCTTTTGTCCTCTCACGATGCTTACTCCGTTATCTCCTCTGCTCTCTCCTCCTCCGTCTCTGGATCTGGTTCTGACCCT
CTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCCTTTCTTCCACTTCTTTCCTCTTTGTATCTCTCTCGCGTTCATTC
TACTTCCTCCGATTCCCCTTCTCCTCCTTCTCTCGCCGGCTTTGAGGCTGTGCTTCTCGCGCTTTATTCCTCTGAGGTTAAGTCTCGGGCTGGGAAGCCTGTTCTTGTCG
CGATTCCTGATCTTTCGCAGCCTTCTCTTTACCATTCTCCTCTGAATAAGCCCAATTCTGTTGCCCAAGCTCAATTCAGGCCATCCGTTGGAGTTCTTTGCCCTTCGCTT
GAACCACAGAACGCGGTGAAGTCAACCAAAAGAGCTTGTATCATTGGCGTCGCTCTCGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCTTGAACT
CTGTCGCTCTGCGGCGTCGTGGGCTGGGCAAGATTGTTGCTGCAAGAGAGAATTTGATAAAGAAGATGGTTTGGATATTGATGGGTTTTCGGAGAAAAGGGCTTTGGAGT
ATGGGGATGAAATAGAGGATGTTTCAGTAAAAATGGGTAACCTACAAGTTGAGACGTGTGGGAACAATTCCGATGATTCAGAACCTAAGGGGTTCAGAATTCCGCTTCCA
TGGGAGCTTTTGCAGCCATTACTTAGAATTTTAGGACATTGTTTATTGGCTCCTTTGAATTCACAAGATGTTAAGGATGCAGCTTCCGTTGCTGTAAGGTGTTTATATGC
AAGGGCATCTCATGATTTAGTACCGCAGGTGATATTGGCAACTCGGAGTCTTATTCAGCTTGACAACAGAACTCGAGCGGCTGCAAAGGCTGCAACAGCAACAAATGCTT
ACACACCCAGCAAAGATAAGAAATCAGAAATCTTATTGGTCTCAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTCCACCGGAATCCTTCCGTCAACAACCGCCATTCCCCCTCCCCCACCTCCTCTGTCTCCTCCACCACCGTCCCACACAACCCCTCCGCCACCACCGCCACTGC
CGATAACGACCCTATGCACTCATGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCTCTTTCCTCCATCCTTCCCCCTCATTGCGACTCGTTTTTTCTCTCCT
CTCTCGCCGATTCCGACCGGCCGGCCCTCTCTCTTTTGTCCTCTCACGATGCTTACTCCGTTATCTCCTCTGCTCTCTCCTCCTCCGTCTCTGGATCTGGTTCTGACCCT
CTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCCTTTCTTCCACTTCTTTCCTCTTTGTATCTCTCTCGCGTTCATTC
TACTTCCTCCGATTCCCCTTCTCCTCCTTCTCTCGCCGGCTTTGAGGCTGTGCTTCTCGCGCTTTATTCCTCTGAGGTTAAGTCTCGGGCTGGGAAGCCTGTTCTTGTCG
CGATTCCTGATCTTTCGCAGCCTTCTCTTTACCATTCTCCTCTGAATAAGCCCAATTCTGTTGCCCAAGCTCAATTCAGGCCATCCGTTGGAGTTCTTTGCCCTTCGCTT
GAACCACAGAACGCGGTGAAGTCAACCAAAAGAGCTTGTATCATTGGCGTCGCTCTCGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCTTGAACT
CTGTCGCTCTGCGGCGTCGTGGGCTGGGCAAGATTGTTGCTGCAAGAGAGAATTTGATAAAGAAGATGGTTTGGATATTGATGGGTTTTCGGAGAAAAGGGCTTTGGAGT
ATGGGGATGAAATAGAGGATGTTTCAGTAAAAATGGGTAACCTACAAGTTGAGACGTGTGGGAACAATTCCGATGATTCAGAACCTAAGGGGTTCAGAATTCCGCTTCCA
TGGGAGCTTTTGCAGCCATTACTTAGAATTTTAGGACATTGTTTATTGGCTCCTTTGAATTCACAAGATGTTAAGGATGCAGCTTCCGTTGCTGTAAGGTGTTTATATGC
AAGGGCATCTCATGATTTAGTACCGCAGGTGATATTGGCAACTCGGAGTCTTATTCAGCTTGACAACAGAACTCGAGCGGCTGCAAAGGCTGCAACAGCAACAAATGCTT
ACACACCCAGCAAAGATAAGAAATCAGAAATCTTATTGGTCTCAAAATAA
Protein sequenceShow/hide protein sequence
MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDP
LCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSL
EPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLP
WELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATATNAYTPSKDKKSEILLVSK