; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G9635 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G9635
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationctg1673:3810191..3815283
RNA-Seq ExpressionCucsat.G9635
SyntenyCucsat.G9635
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK00280.1 protein CHUP1 [Cucumis melo var. makuwa]0.091.92Show/hide
Query:  IRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM
        ++GL  EEEEEEEA+SI+D TSQVNGRTSDLEDGDHSSDE QVLL PQRNSENWLL   +KEEKVPEFL EN+KIE ERLLKL+MELEERKVKLEGEL+M
Subjt:  IRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM

Query:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVK
        CDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEILKGALMKKELEE RGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVK
Subjt:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVK

Query:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
        KEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKARIKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
Subjt:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI

Query:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ
        NACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQ
Subjt:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ

Query:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG
        GG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEHHKLAIGSEKQIKEK E+E+AKSSG
Subjt:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG

Query:  DSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS
        DSSS NLEY DISMRK +ATL LKLAQMK NKISCEPDSQ DN+STN IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA 
Subjt:  DSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS

Query:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR
        FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISR
Subjt:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR

Query:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK
        YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHN LSGP+KEPNREFLLFQGVRFASRVHK
Subjt:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK

XP_008467205.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]0.091.04Show/hide
Query:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLP
        MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPT NCSENGED KKN          GL  EEEEEEEA+SI+D TSQVNGRTSDLEDGDHSSDE QVLL P
Subjt:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLP

Query:  QRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEIL
        QRNSENWLL   +KEEKVPEFL E++KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEIL
Subjt:  QRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEIL

Query:  KGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARI
        KGALMKKELEE R KIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKARI
Subjt:  KGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARI

Query:  KTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIG
        KTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G
Subjt:  KTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIG

Query:  EAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKS
        + ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKS
Subjt:  EAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKS

Query:  VNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTN
        VNSVATSFQLMSKSVE+SLQQKYSTYKEH+KLAIGSEKQIKEK E+E+AKSSGDSSS NLEY DISMRK +ATL LKLAQMK NKISCEPDSQ DN+STN
Subjt:  VNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTN

Query:  FISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFG
         IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFG
Subjt:  FISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFG

Query:  YQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEK
        YQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGP+K
Subjt:  YQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEK

Query:  EPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        EPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELR+RVHTETGQ+N
Subjt:  EPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

XP_011655490.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
Subjt:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

XP_031741050.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]0.099.88Show/hide
Query:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFA
        REFLLFQGVRFASRVHK A
Subjt:  REFLLFQGVRFASRVHKFA

XP_031741051.1 protein CHUP1, chloroplastic isoform X3 [Cucumis sativus]0.096.8Show/hide
Query:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAG                           KEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
Subjt:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

TrEMBL top hitse value%identityAlignment
A0A061ECQ9 Hydroxyproline-rich glycoprotein family protein isoform 42.16e-24450.48Show/hide
Query:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE----------DVKKNVKQVHQKIIR--GLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSD
        R+  VVA SIAA+A+KQL +++  S        SENGE          D KK     +  + +  G +EEEEE+   IS   ++VNG   D+ D D    
Subjt:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE----------DVKKNVKQVHQKIIR--GLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSD

Query:  EFQVLLLPQRNSENWLLDDN---RKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQ
        EF+ LL  +   E  L  D     + EK+ E  + N+  ELERL  L+ ELEER+VKLEGEL+   G+K  E+D+ EL++QL  K  +I MLN TISSLQ
Subjt:  EFQVLLLPQRNSENWLLDDN---RKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQ

Query:  AERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQEL
        +ERK L+E+I  GA +KKELE  R KIKELQRQIQLDANQTK +LL LKQ+VS LQAKE+EA+K +AE+ KK KA K+ E+E  EL+ KN+ELQHE +EL
Subjt:  AERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQEL

Query:  TSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAK
        T KL+  +A+I  L+ MTETEI  + REE   L+  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q P G+ SAR LNKS SPKS+E AK
Subjt:  TSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAK

Query:  QLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPP---------STIDSSHRWKDPLEAVM
        QL+LEYAG E G+ +TD ESNFSHP S+  ++L+N SI SS SR SS  +KP+    LKK  R++  SSAVS P         S I  S   + PLEA+M
Subjt:  QLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPP---------STIDSSHRWKDPLEAVM

Query:  ALSA--------------------ETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSN
          +A                    ET T+  +R QVSS  S NSVATSF LMS+SV+ SL++KY  YK+ HKLA+  EKQIK+K +  RA+  GD S+ +
Subjt:  ALSA--------------------ETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSN

Query:  LEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFISSPTSS---------------------GGEVHRGSELVQFNRKMMKP-----------
         + E     K   L  KLAQ+K   +    DS   +N    + S T S                      G    G       +    P           
Subjt:  LEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFISSPTSS---------------------GGEVHRGSELVQFNRKMMKP-----------

Query:  ---------------EVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSS
                        VK  +ETQ D  V +LA E+R ASF+++ED+V+FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ L+KL +++SS
Subjt:  ---------------EVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSS

Query:  FVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVR
        FVD+P L CE AL KM  LL+KVEQSVYALL+TRD  ISRY+E GIP++WLLD GVVGKIK+  V+LARKYMKR+  E + L+GPEKEPNREF+L QG+R
Subjt:  FVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVR

Query:  FASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        FA RVH+FAGGFD++SMKAFEELRSRVH++ G+ N
Subjt:  FASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

A0A0A0KT25 Uncharacterized protein0.099.17Show/hide
Query:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQ       GLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
Subjt:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

A0A1S3CSZ9 protein CHUP1, chloroplastic-like0.091.04Show/hide
Query:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLP
        MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPT NCSENGED KKN          GL  EEEEEEEA+SI+D TSQVNGRTSDLEDGDHSSDE QVLL P
Subjt:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLP

Query:  QRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEIL
        QRNSENWLL   +KEEKVPEFL E++KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEIL
Subjt:  QRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEIL

Query:  KGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARI
        KGALMKKELEE R KIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKARI
Subjt:  KGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARI

Query:  KTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIG
        KTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G
Subjt:  KTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIG

Query:  EAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKS
        + ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKS
Subjt:  EAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKS

Query:  VNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTN
        VNSVATSFQLMSKSVE+SLQQKYSTYKEH+KLAIGSEKQIKEK E+E+AKSSGDSSS NLEY DISMRK +ATL LKLAQMK NKISCEPDSQ DN+STN
Subjt:  VNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTN

Query:  FISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFG
         IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFG
Subjt:  FISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFG

Query:  YQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEK
        YQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGP+K
Subjt:  YQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEK

Query:  EPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        EPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELR+RVHTETGQ+N
Subjt:  EPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

A0A5D3BMR7 Protein CHUP10.091.92Show/hide
Query:  IRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM
        ++GL  EEEEEEEA+SI+D TSQVNGRTSDLEDGDHSSDE QVLL PQRNSENWLL   +KEEKVPEFL EN+KIE ERLLKL+MELEERKVKLEGEL+M
Subjt:  IRGL--EEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM

Query:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVK
        CDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEILKGALMKKELEE RGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVK
Subjt:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVK

Query:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
        KEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKARIKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
Subjt:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI

Query:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ
        NACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQ
Subjt:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ

Query:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG
        GG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEHHKLAIGSEKQIKEK E+E+AKSSG
Subjt:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG

Query:  DSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS
        DSSS NLEY DISMRK +ATL LKLAQMK NKISCEPDSQ DN+STN IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA 
Subjt:  DSSSSNLEYEDISMRK-NATLVLKLAQMKMNKISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS

Query:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR
        FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISR
Subjt:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR

Query:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK
        YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHN LSGP+KEPNREFLLFQGVRFASRVHK
Subjt:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK

A0A6J1DWY5 protein CHUP1, chloroplastic-like3.06e-30462.66Show/hide
Query:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        M ++ V+VAVSIAAYAIKQLTIRSW+S  LPT NCSENGE  +KN          GL+ EE++  NSI+   SQV+G +SD E  +         LLP R
Subjt:  MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEE-KVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILK
        +SE+ LLD N+KEE KVPE  +EN+KIEL+RLLKL+MELEERKVKLE EL+M D +K  ++D  EL K+L+AK++D+SMLN TISSLQAERK L+EEI+K
Subjt:  NSENWLLDDNRKEE-KVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILK

Query:  GALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIK
        GA MKKELEE +GKIKELQRQ+QLDANQTKE L  LK+RVSTLQAKEEEAVKKEA+LY+K KAAK FE+E GELK KNR+LQ E +ELTSKLEVM+ARI 
Subjt:  GALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIK

Query:  TLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGE
        TLT +TE+EIIT+EREE +KL+  NE+L KQLEGLQMNRFSEVEELVYLRW+NACLRYELR+N+   GESA  L+KS SPKSKEKAKQLMLEYAG   G+
Subjt:  TLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGE

Query:  AETDHESNFSHP-FSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTL-SEVRLQVSSRK
         ETDHESNFSHP FSS I++ +NTS  SSRSRTSSFR                                 WKDPLEA +A S ETLT  SEV+ QVSSR 
Subjt:  AETDHESNFSHP-FSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTL-SEVRLQVSSRK

Query:  SVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGS--EKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNS
        SVNSVATSFQ MS+S E+S++QKYS YKEHHKL IG   EKQIKEK E ER K                                N    EP        
Subjt:  SVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGS--EKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNS

Query:  TNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDG-MEILEHFDWPKRKTDALREA
                          E V+F++K+MK EVK  MET+ D LVM L M+V+  SF+NMED+VSFVIWLD+K SSLVD  + ILEHFDWP+ K+DALREA
Subjt:  TNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDG-MEILEHFDWPKRKTDALREA

Query:  AFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSG
        A  YQ LMKL EEVSSFVD+PKLT EVAL  M+SLL K+EQSV+A+L+ R+  IS+YEELGIP+DWLLD GVVGK+KVL VELARKYMKRI+ E NALSG
Subjt:  AFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSG

Query:  PEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQ
        P KEPNREFLL QGVRFASRVH+FAGGFD +SMKAFEELR+R+HTE GQ
Subjt:  PEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQ

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.1e-18845.98Show/hide
Query:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLL-
        RI  VVA SIAA  +K+L ++         +  S+NGE  D +++V     ++ K ++  EEEEEEE   I+   +Q  G  SD  D D    EF+ LL 
Subjt:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLL-

Query:  ----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKI
             P  + +N L  +  ++E+  E  +  +  ELERL +L+ ELEER+VKLEGEL+   G+K  E+D++EL++QL  K  +I MLN TI+SLQAERK 
Subjt:  ----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKI

Query:  LKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLE
        L+EE+ +  +++KELE  R KIKELQRQIQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ KL+
Subjt:  LKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLE

Query:  VMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLE
          +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LMLE
Subjt:  VMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLE

Query:  YAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMALSA
        YAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M  +A
Subjt:  YAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMALSA

Query:  --------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK----------
                            ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+          
Subjt:  --------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK----------

Query:  -----------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PTSS
                         ++GD S+ + E  +    +NA  V K+  + + K         P S     STN  S+                     P   
Subjt:  -----------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PTSS

Query:  GG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVREA
        GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EVR +
Subjt:  GG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVREA

Query:  SFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTIS
        SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD  IS
Subjt:  SFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTIS

Query:  RYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        RY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  RYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein3.0e-4538.71Show/hide
Query:  NNSTNFISSPTSSGGEVHRGSELVQ---FNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTD
        +NS N   S   +  +V+     +     NR      +K  +ET +   +  L  +V    FS+MED++ FV WLD++L++L D   +L+HF WP++K D
Subjt:  NNSTNFISSPTSSGGEVHRGSELVQ---FNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTD

Query:  ALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEH
         L+EAA  Y++L KL +E+SS+ D+P +   VAL KM +LLDK EQ +  L++ R +++  Y++  IP++W+LD G++ KIK   ++LA+ YM R+  E 
Subjt:  ALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEH

Query:  NALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRV
         +    ++E  +E LL QGVRFA R H+FAGG D +++ A EE++ RV
Subjt:  NALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein7.6e-19045.98Show/hide
Query:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLL-
        RI  VVA SIAA  +K+L ++         +  S+NGE  D +++V     ++ K ++  EEEEEEE   I+   +Q  G  SD  D D    EF+ LL 
Subjt:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLL-

Query:  ----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKI
             P  + +N L  +  ++E+  E  +  +  ELERL +L+ ELEER+VKLEGEL+   G+K  E+D++EL++QL  K  +I MLN TI+SLQAERK 
Subjt:  ----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKI

Query:  LKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLE
        L+EE+ +  +++KELE  R KIKELQRQIQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ KL+
Subjt:  LKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLE

Query:  VMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLE
          +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LMLE
Subjt:  VMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLE

Query:  YAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMALSA
        YAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M  +A
Subjt:  YAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMALSA

Query:  --------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK----------
                            ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+          
Subjt:  --------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK----------

Query:  -----------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PTSS
                         ++GD S+ + E  +    +NA  V K+  + + K         P S     STN  S+                     P   
Subjt:  -----------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PTSS

Query:  GG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVREA
        GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EVR +
Subjt:  GG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVREA

Query:  SFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTIS
        SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD  IS
Subjt:  SFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTIS

Query:  RYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        RY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  RYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein7.6e-19045.98Show/hide
Query:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLL-
        RI  VVA SIAA  +K+L ++         +  S+NGE  D +++V     ++ K ++  EEEEEEE   I+   +Q  G  SD  D D    EF+ LL 
Subjt:  RISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLL-

Query:  ----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKI
             P  + +N L  +  ++E+  E  +  +  ELERL +L+ ELEER+VKLEGEL+   G+K  E+D++EL++QL  K  +I MLN TI+SLQAERK 
Subjt:  ----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKI

Query:  LKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLE
        L+EE+ +  +++KELE  R KIKELQRQIQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ KL+
Subjt:  LKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLE

Query:  VMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLE
          +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LMLE
Subjt:  VMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLE

Query:  YAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMALSA
        YAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M  +A
Subjt:  YAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMALSA

Query:  --------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK----------
                            ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+          
Subjt:  --------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK----------

Query:  -----------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PTSS
                         ++GD S+ + E  +    +NA  V K+  + + K         P S     STN  S+                     P   
Subjt:  -----------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PTSS

Query:  GG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVREA
        GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EVR +
Subjt:  GG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVREA

Query:  SFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTIS
        SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD  IS
Subjt:  SFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTIS

Query:  RYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        RY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  RYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein2.7e-16347.37Show/hide
Query:  KILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSK
        K L+EE+ +  +++KELE  R KIKELQRQIQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ K
Subjt:  KILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSK

Query:  LEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLM
        L+  +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LM
Subjt:  LEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLM

Query:  LEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMAL
        LEYAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M  
Subjt:  LEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMAL

Query:  SA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK--------
        +A                    ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+        
Subjt:  SA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK--------

Query:  -------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PT
                           ++GD S+ + E  +    +NA  V K+  + + K         P S     STN  S+                     P 
Subjt:  -------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCE-----PDSQYDNNSTNFISS---------------------PT

Query:  SSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVR
          GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EVR
Subjt:  SSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVR

Query:  EASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTT
         +SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD  
Subjt:  EASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTT

Query:  ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        ISRY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-4129.23Show/hide
Query:  HENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS
        H      S   V+    + + ++ E E + K      KL  E+  +I  LE    ++  E++E   LR   A L     +N+    E  R        ++
Subjt:  HENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS

Query:  KEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS-
        +EK  +  +    K +     DH  + S  F   +D    +++  S  R  S R  P      + +   +  + ++S     D     KD +E+    S 
Subjt:  KEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS-

Query:  AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHH--------KLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLV
        +E LT S     V SR  V  V       S S+  S + +     +             +  +      V              +L      +R+   +V
Subjt:  AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHH--------KLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLV

Query:  -LKLAQMKMNKISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSL
            + M+ +  +   DS    N+        S+  ++    E    NR +    +K  +ETQ D  +  L  EV  A+FS++ED+V FV WLD++LS L
Subjt:  -LKLAQMKMNKISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSL

Query:  VDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIK
        VD   +L+HF+WP++K DALREAAF Y  L KL  E S F ++P+ +   AL KM +L +K+E  VY+L + R++  ++++   IP+DW+L+ G+  +IK
Subjt:  VDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIK

Query:  VLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHT
        +  V+LA KYMKR+  E  A+ G    P  E L+ QGVRFA RVH+FAGGFD+++MKAFEELR +  +
Subjt:  VLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAGAATAAGTGTTGTTGTTGCTGTTTCAATTGCAGCTTATGCAATTAAGCAGCTCACAATCAGATCATGGACCTCATTTTTCTTGCCTACCACCAACTGTTCAGA
AAATGGAGAGGACGTGAAGAAGAATGTAAAGCAAGTACACCAAAAAATTATTCGTGGATTGGAGGAAGAAGAAGAAGAGGAAGCTAATTCAATAAGTGATACAACTAGTC
AAGTTAATGGTAGAACCTCTGATCTTGAAGATGGAGATCATAGTTCAGATGAGTTTCAAGTACTTCTATTACCACAACGAAATTCCGAAAATTGGTTGCTTGATGACAAT
AGGAAAGAAGAAAAAGTGCCTGAATTCCTTATAGAAAACAGTAAGATTGAATTGGAAAGGTTGCTGAAATTATTGATGGAATTGGAGGAGAGGAAAGTGAAACTTGAAGG
TGAATTGATCATGTGCGACGGAATAAAATACAGTGAAACCGACGTCATGGAGTTACGAAAGCAGCTGGACGCCAAGAATGATGATATCAGTATGCTTAATAACACAATCA
GCTCTTTGCAAGCTGAGAGAAAGATTCTAAAAGAAGAGATATTGAAGGGAGCATTGATGAAGAAGGAACTAGAGGAGGGTAGGGGCAAGATTAAGGAGCTGCAGAGGCAA
ATTCAGCTTGATGCAAACCAAACAAAAGAACGTTTATTATTACTCAAACAACGAGTTTCCACTTTGCAGGCAAAAGAGGAAGAGGCTGTCAAGAAAGAAGCAGAACTTTA
TAAGAAACAGAAAGCGGCTAAGGATTTCGAGGTAGAATTTGGGGAGCTTAAGTGGAAGAATCGAGAGCTTCAACACGAAAATCAGGAACTAACTTCTAAACTAGAAGTTA
TGAAGGCCAGAATCAAAACACTGACTAAGATGACAGAGACTGAAATAATCACCAAAGAAAGAGAAGAGGCTCAAAAGTTAAAGAGTGAAAATGAGGATCTCATAAAGCAA
CTTGAAGGGCTCCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTGTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGAAACAACCAGATACCGGCAGG
TGAATCCGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCATGTTAGAGTATGCAGGAAAGGAGATTGGAGAAGCAGAAACAGATC
ATGAAAGCAACTTTTCTCATCCATTCTCTTCTGAGATCGATAATCTTGAAAACACTTCAATTGATAGTTCAAGAAGCAGAACAAGTAGTTTCAGAGAGAAGCCTAATTCA
AATTTGAGTTTGAAGAAATTGATCAGAAACCAAGGTGGTTCGAGTGCTGTTTCGCCTCCCAGCACAATCGATTCGAGCCACAGATGGAAGGATCCTTTAGAAGCAGTGAT
GGCTCTGAGTGCTGAAACTTTAACTCTCTCAGAGGTTAGATTGCAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACATCATTCCAACTGATGTCTAAATCAGTTG
AACAAAGTCTACAACAGAAATATTCAACTTATAAAGAGCATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGTTGAGAACGAAAGGGCAAAAAGTTCT
GGCGACTCTTCAAGTTCGAATTTAGAATACGAAGACATTAGCATGAGAAAGAATGCAACTTTAGTCCTTAAACTTGCCCAAATGAAGATGAATAAAATTTCTTGTGAACC
AGATAGCCAATATGATAATAATAGCACAAATTTTATATCCAGTCCAACTTCTTCTGGAGGTGAAGTTCATCGAGGATCCGAGTTAGTTCAATTCAATCGAAAAATGATGA
AACCTGAGGTGAAAGATCATATGGAAACTCAAAGAGATCACTTGGTTATGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCTCCAACATGGAAGATATCGTCTCCTTC
GTTATATGGCTAGACGAAAAGTTATCTTCTTTGGTTGATGGGATGGAGATTCTCGAACATTTCGATTGGCCTAAGCGAAAAACAGATGCTTTAAGAGAAGCAGCCTTTGG
ATATCAGAAGCTGATGAAACTAAGAGAAGAAGTTTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAGATAAGGTGG
AGCAGAGTGTGTATGCATTGTTACAAACAAGGGACACGACGATTTCGCGATACGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGTGGAGTTGTTGGGAAGATA
AAGGTGTTATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACACAATGCATTGAGTGGACCTGAAAAAGAACCAAATAGAGAATTTTTGCTTTTCCA
AGGAGTGCGTTTTGCTTCTCGCGTTCATAAGTTTGCAGGAGGCTTTGATTCGAAGAGCATGAAAGCTTTTGAAGAACTGAGAAGTCGTGTTCATACAGAGACAGGACAAC
AAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAGAATAAGTGTTGTTGTTGCTGTTTCAATTGCAGCTTATGCAATTAAGCAGCTCACAATCAGATCATGGACCTCATTTTTCTTGCCTACCACCAACTGTTCAGA
AAATGGAGAGGACGTGAAGAAGAATGTAAAGCAAGTACACCAAAAAATTATTCGTGGATTGGAGGAAGAAGAAGAAGAGGAAGCTAATTCAATAAGTGATACAACTAGTC
AAGTTAATGGTAGAACCTCTGATCTTGAAGATGGAGATCATAGTTCAGATGAGTTTCAAGTACTTCTATTACCACAACGAAATTCCGAAAATTGGTTGCTTGATGACAAT
AGGAAAGAAGAAAAAGTGCCTGAATTCCTTATAGAAAACAGTAAGATTGAATTGGAAAGGTTGCTGAAATTATTGATGGAATTGGAGGAGAGGAAAGTGAAACTTGAAGG
TGAATTGATCATGTGCGACGGAATAAAATACAGTGAAACCGACGTCATGGAGTTACGAAAGCAGCTGGACGCCAAGAATGATGATATCAGTATGCTTAATAACACAATCA
GCTCTTTGCAAGCTGAGAGAAAGATTCTAAAAGAAGAGATATTGAAGGGAGCATTGATGAAGAAGGAACTAGAGGAGGGTAGGGGCAAGATTAAGGAGCTGCAGAGGCAA
ATTCAGCTTGATGCAAACCAAACAAAAGAACGTTTATTATTACTCAAACAACGAGTTTCCACTTTGCAGGCAAAAGAGGAAGAGGCTGTCAAGAAAGAAGCAGAACTTTA
TAAGAAACAGAAAGCGGCTAAGGATTTCGAGGTAGAATTTGGGGAGCTTAAGTGGAAGAATCGAGAGCTTCAACACGAAAATCAGGAACTAACTTCTAAACTAGAAGTTA
TGAAGGCCAGAATCAAAACACTGACTAAGATGACAGAGACTGAAATAATCACCAAAGAAAGAGAAGAGGCTCAAAAGTTAAAGAGTGAAAATGAGGATCTCATAAAGCAA
CTTGAAGGGCTCCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTGTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGAAACAACCAGATACCGGCAGG
TGAATCCGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCATGTTAGAGTATGCAGGAAAGGAGATTGGAGAAGCAGAAACAGATC
ATGAAAGCAACTTTTCTCATCCATTCTCTTCTGAGATCGATAATCTTGAAAACACTTCAATTGATAGTTCAAGAAGCAGAACAAGTAGTTTCAGAGAGAAGCCTAATTCA
AATTTGAGTTTGAAGAAATTGATCAGAAACCAAGGTGGTTCGAGTGCTGTTTCGCCTCCCAGCACAATCGATTCGAGCCACAGATGGAAGGATCCTTTAGAAGCAGTGAT
GGCTCTGAGTGCTGAAACTTTAACTCTCTCAGAGGTTAGATTGCAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACATCATTCCAACTGATGTCTAAATCAGTTG
AACAAAGTCTACAACAGAAATATTCAACTTATAAAGAGCATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGTTGAGAACGAAAGGGCAAAAAGTTCT
GGCGACTCTTCAAGTTCGAATTTAGAATACGAAGACATTAGCATGAGAAAGAATGCAACTTTAGTCCTTAAACTTGCCCAAATGAAGATGAATAAAATTTCTTGTGAACC
AGATAGCCAATATGATAATAATAGCACAAATTTTATATCCAGTCCAACTTCTTCTGGAGGTGAAGTTCATCGAGGATCCGAGTTAGTTCAATTCAATCGAAAAATGATGA
AACCTGAGGTGAAAGATCATATGGAAACTCAAAGAGATCACTTGGTTATGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCTCCAACATGGAAGATATCGTCTCCTTC
GTTATATGGCTAGACGAAAAGTTATCTTCTTTGGTTGATGGGATGGAGATTCTCGAACATTTCGATTGGCCTAAGCGAAAAACAGATGCTTTAAGAGAAGCAGCCTTTGG
ATATCAGAAGCTGATGAAACTAAGAGAAGAAGTTTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAGATAAGGTGG
AGCAGAGTGTGTATGCATTGTTACAAACAAGGGACACGACGATTTCGCGATACGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGTGGAGTTGTTGGGAAGATA
AAGGTGTTATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACACAATGCATTGAGTGGACCTGAAAAAGAACCAAATAGAGAATTTTTGCTTTTCCA
AGGAGTGCGTTTTGCTTCTCGCGTTCATAAGTTTGCAGGAGGCTTTGATTCGAAGAGCATGAAAGCTTTTGAAGAACTGAGAAGTCGTGTTCATACAGAGACAGGACAAC
AAAATTAA
Protein sequenceShow/hide protein sequence
MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDN
RKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQ
IQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQ
LEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNS
NLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSS
GDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF
VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKI
KVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN