; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G20150 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G20150
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationChr5:21330740..21335759
RNA-Seq ExpressionCSPI05G20150
SyntenyCSPI05G20150
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK00280.1 protein CHUP1 [Cucumis melo var. makuwa]0.0e+0091.79Show/hide
Query:  IRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM
        ++GL  EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LLPQRNSENWLL   +KEEKVPEFL EN+KIE ERLLKL+MELEERKVKLEGEL+M
Subjt:  IRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM

Query:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVK
        CDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEILKGALMKKELEE RGKIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVK
Subjt:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVK

Query:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
        KEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKARIKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
Subjt:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI

Query:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ
        NACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQ
Subjt:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ

Query:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG
        GG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEHHKLAIGSEKQIKEK E+E+AKSSG
Subjt:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG

Query:  DSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS
        DSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ DN+STN IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA 
Subjt:  DSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS

Query:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR
        FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISR
Subjt:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR

Query:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK
        YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHN LSGP+KEPNREFLLFQGVRFASRVHK
Subjt:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK

XP_008467205.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]0.0e+0090.81Show/hide
Query:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLL
        MMNRISVVVAVSIA YAIKQLTIRSWTSFFLP TNCSENGED KKN          GL  EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LL
Subjt:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLL

Query:  PQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEI
        PQRNSENWLL   +KEEKVPEFL E++KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEI
Subjt:  PQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEI

Query:  LKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKAR
        LKGALMKKELEE R KIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKAR
Subjt:  LKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKAR

Query:  IKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEI
        IKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E 
Subjt:  IKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEI

Query:  GEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRK
        G+ ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRK
Subjt:  GEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRK

Query:  SVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNST
        SVNSVATSFQLMSKSVE+SLQQKYSTYKEH+KLAIGSEKQIKEK E+E+AKSSGDSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ DN+ST
Subjt:  SVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNST

Query:  NFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAF
        N IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAF
Subjt:  NFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAF

Query:  GYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPE
        GYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGP+
Subjt:  GYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPE

Query:  KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELR+RVHTETGQ+N
Subjt:  KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

XP_011655490.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]0.0e+0099.53Show/hide
Query:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
Subjt:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

XP_031741050.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]0.0e+0099.39Show/hide
Query:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFA
        REFLLFQGVRFASRVHK A
Subjt:  REFLLFQGVRFASRVHKFA

XP_031741051.1 protein CHUP1, chloroplastic isoform X3 [Cucumis sativus]0.0e+0096.33Show/hide
Query:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIP                           AGKEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
Subjt:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

TrEMBL top hitse value%identityAlignment
A0A061ECQ9 Hydroxyproline-rich glycoprotein family protein isoform 41.3e-20050.32Show/hide
Query:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE----------DVKKNVKQVHQKIIR--GLEEEEEEEANSISDATSQVNGRTSDLEDGDH
        M+ R+  VVA SIA +A+KQL +++  S        SENGE          D KK     +  + +  G +EEEEE+   IS   ++VNG   D+ D D 
Subjt:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE----------DVKKNVKQVHQKIIR--GLEEEEEEEANSISDATSQVNGRTSDLEDGDH

Query:  SSDEFQVLLLPQRNSENWLLDD---NRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTIS
           EF+ LL      E  L  D     + EK+ E  + N+  ELERL  L+ ELEER+VKLEGEL+   G+K  E+D+ EL++QL  K  +I MLN TIS
Subjt:  SSDEFQVLLLPQRNSENWLLDD---NRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTIS

Query:  SLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHEN
        SLQ+ERK L+E+I  GA +KKELE  R KIKELQR+IQLDANQTK +LL LKQ+VS LQAKE+EA+K +AE+ KK KA K+ E+E  EL+ KN+ELQHE 
Subjt:  SLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHEN

Query:  QELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKE
        +ELT KL+  +A+I  L+ MTETEI  + REE   L+  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q P G+ SAR LNKS SPKS+E
Subjt:  QELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKE

Query:  KAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVS---------PPSTIDSSHRWKDPLE
         AKQL+LEYAG E G+ +TD ESNFSHP S+  ++L+N SI SS SR SS  +KP+    LKK  R++  SSAVS          PS I  S   + PLE
Subjt:  KAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVS---------PPSTIDSSHRWKDPLE

Query:  AVMALSA--------------------ETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSS
        A+M  +A                    ET T+  +R QVSS  S NSVATSF LMS+SV+ SL++KY  YK+ HKLA+  EKQIK+K +  RA+  GD S
Subjt:  AVMALSA--------------------ETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSS

Query:  SSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSS---------------------GGEVHRGSELVQFNRKMMKP--------
        + + + E     K   L  KLAQ+K  R     DS   +N    + S T S                      G    G       +    P        
Subjt:  SSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSS---------------------GGEVHRGSELVQFNRKMMKP--------

Query:  ------------------EVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREE
                           VK  +ETQ D  V +LA E+R ASF+++ED+V+FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ L+KL ++
Subjt:  ------------------EVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREE

Query:  VSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQ
        +SSFVD+P L CE AL KM  LL+KVEQSVYALL+TRD  ISRY+E GIP++WLLD GVVGKIK+  V+LARKYMKR+  E + L+GPEKEPNREF+L Q
Subjt:  VSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQ

Query:  GVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        G+RFA RVH+FAGGFD++SMKAFEELRSRVH++ G+ N
Subjt:  GVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

A0A0A0KT25 Uncharacterized protein0.0e+0098.7Show/hide
Query:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR
        MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQ       GLEEEEEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQR
Subjt:  MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQR

Query:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
        NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG
Subjt:  NSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKG

Query:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
        ALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Subjt:  ALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT

Query:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
        LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA
Subjt:  LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEA

Query:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
        ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN
Subjt:  ETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVN

Query:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS
        SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Subjt:  SVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS

Query:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
        SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK
Subjt:  SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQK

Query:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
        LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN
Subjt:  LMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPN

Query:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
Subjt:  REFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

A0A1S3CSZ9 protein CHUP1, chloroplastic-like0.0e+0090.81Show/hide
Query:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLL
        MMNRISVVVAVSIA YAIKQLTIRSWTSFFLP TNCSENGED KKN          GL  EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LL
Subjt:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLL

Query:  PQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEI
        PQRNSENWLL   +KEEKVPEFL E++KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEI
Subjt:  PQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEI

Query:  LKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKAR
        LKGALMKKELEE R KIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKAR
Subjt:  LKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKAR

Query:  IKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEI
        IKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E 
Subjt:  IKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEI

Query:  GEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRK
        G+ ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRK
Subjt:  GEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRK

Query:  SVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNST
        SVNSVATSFQLMSKSVE+SLQQKYSTYKEH+KLAIGSEKQIKEK E+E+AKSSGDSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ DN+ST
Subjt:  SVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNST

Query:  NFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAF
        N IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAF
Subjt:  NFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAF

Query:  GYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPE
        GYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGP+
Subjt:  GYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPE

Query:  KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELR+RVHTETGQ+N
Subjt:  KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

A0A5D3BMR7 Protein CHUP10.0e+0091.79Show/hide
Query:  IRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM
        ++GL  EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LLPQRNSENWLL   +KEEKVPEFL EN+KIE ERLLKL+MELEERKVKLEGEL+M
Subjt:  IRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIM

Query:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVK
        CDGIKYSETDVMELRKQLDAKN+DISMLNNTISSLQAERKILKEEILKGALMKKELEE RGKIKELQR+IQLDANQTKERLLLLKQRVSTLQAKEEEAVK
Subjt:  CDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVK

Query:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
        KEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKARIKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
Subjt:  KEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI

Query:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ
        NACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHPFS  IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQ
Subjt:  NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQ

Query:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG
        GG SAVSPP    SSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEHHKLAIGSEKQIKEK E+E+AKSSG
Subjt:  GGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSG

Query:  DSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS
        DSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ DN+STN IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA 
Subjt:  DSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS

Query:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR
        FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISR
Subjt:  FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISR

Query:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK
        YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHN LSGP+KEPNREFLLFQGVRFASRVHK
Subjt:  YEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK

A0A6J1DWY5 protein CHUP1, chloroplastic-like6.4e-24562.4Show/hide
Query:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQ
        +M ++ V+VAVSIA YAIKQLTIRSW+S  LP TNCSENGE  +KN          GL + EE++ NSI+ A SQV+G +SD E  +         LLP 
Subjt:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQ

Query:  RNSENWLLDDNRKEE-KVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEIL
        R+SE+ LLD N+KEE KVPE  +EN+KIEL+RLLKL+MELEERKVKLE EL+M D +K  ++D  EL K+L+AK++D+SMLN TISSLQAERK L+EEI+
Subjt:  RNSENWLLDDNRKEE-KVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEIL

Query:  KGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARI
        KGA MKKELEE +GKIKELQR++QLDANQTKE L  LK+RVSTLQAKEEEAVKKEA+LY+K KAAK FE+E GELK KNR+LQ E +ELTSKLEVM+ARI
Subjt:  KGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARI

Query:  KTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIG
         TLT +TE+EIIT+EREE +KL+  NE+L KQLEGLQMNRFSEVEELVYLRW+NACLRYELR+N+   GESA  L+KS SPKSKEKAKQLMLEYAG   G
Subjt:  KTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIG

Query:  EAETDHESNFSHP-FSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTL-SEVRLQVSSR
        + ETDHESNFSHP FSS I++ +NTS  SSRSRTSSF                                 RWKDPLEA +A S ETLT  SEV+ QVSSR
Subjt:  EAETDHESNFSHP-FSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTL-SEVRLQVSSR

Query:  KSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAI--GSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNN
         SVNSVATSFQ MS+S E+S++QKYS YKEHHKL I  G EKQIKEK E ER K                                N    EP       
Subjt:  KSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAI--GSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNN

Query:  STNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVD-GMEILEHFDWPKRKTDALRE
                           E V+F++K+MK EVK  MET+ D LVM L M+V+  SF+NMED+VSFVIWLD+K SSLVD  + ILEHFDWP+ K+DALRE
Subjt:  STNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVD-GMEILEHFDWPKRKTDALRE

Query:  AAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALS
        AA  YQ LMKL EEVSSFVD+PKLT EVAL  M+SLL K+EQSV+A+L+ R+  IS+YEELGIP+DWLLD GVVGK+KVL VELARKYMKRI+ E NALS
Subjt:  AAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALS

Query:  GPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQ
        GP KEPNREFLL QGVRFASRVH+FAGGFD +SMKAFEELR+R+HTE GQ+
Subjt:  GPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQ

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic9.0e-18845.65Show/hide
Query:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQV
        M  RI  VVA SIA   +K+L ++         +  S+NGE  D +++V     ++ K ++  EEEEEEE   I+   +Q  G  SD  D D    EF+ 
Subjt:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQV

Query:  LL-----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAE
        LL      P  + +N L  +  ++E+  E  +  +  ELERL +L+ ELEER+VKLEGEL+   G+K  E+D++EL++QL  K  +I MLN TI+SLQAE
Subjt:  LL-----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAE

Query:  RKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS
        RK L+EE+ +  +++KELE  R KIKELQR+IQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ 
Subjt:  RKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS

Query:  KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQL
        KL+  +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+L
Subjt:  KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQL

Query:  MLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMA
        MLEYAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M 
Subjt:  MLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMA

Query:  LSA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK-------
         +A                    ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+       
Subjt:  LSA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK-------

Query:  --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------P
                            ++GD S+ + E  +    +NA  V K+  + + +         P S     STN  S+                     P
Subjt:  --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------P

Query:  TSSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEV
           GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EV
Subjt:  TSSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEV

Query:  REASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT
        R +SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD 
Subjt:  REASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT

Query:  TISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
         ISRY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  TISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein2.3e-4538.71Show/hide
Query:  NNSTNFISSPTSSGGEVHRGSELVQ---FNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTD
        +NS N   S   +  +V+     +     NR      +K  +ET +   +  L  +V    FS+MED++ FV WLD++L++L D   +L+HF WP++K D
Subjt:  NNSTNFISSPTSSGGEVHRGSELVQ---FNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTD

Query:  ALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEH
         L+EAA  Y++L KL +E+SS+ D+P +   VAL KM +LLDK EQ +  L++ R +++  Y++  IP++W+LD G++ KIK   ++LA+ YM R+  E 
Subjt:  ALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEH

Query:  NALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRV
         +    ++E  +E LL QGVRFA R H+FAGG D +++ A EE++ RV
Subjt:  NALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein6.4e-18945.65Show/hide
Query:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQV
        M  RI  VVA SIA   +K+L ++         +  S+NGE  D +++V     ++ K ++  EEEEEEE   I+   +Q  G  SD  D D    EF+ 
Subjt:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQV

Query:  LL-----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAE
        LL      P  + +N L  +  ++E+  E  +  +  ELERL +L+ ELEER+VKLEGEL+   G+K  E+D++EL++QL  K  +I MLN TI+SLQAE
Subjt:  LL-----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAE

Query:  RKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS
        RK L+EE+ +  +++KELE  R KIKELQR+IQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ 
Subjt:  RKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS

Query:  KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQL
        KL+  +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+L
Subjt:  KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQL

Query:  MLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMA
        MLEYAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M 
Subjt:  MLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMA

Query:  LSA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK-------
         +A                    ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+       
Subjt:  LSA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK-------

Query:  --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------P
                            ++GD S+ + E  +    +NA  V K+  + + +         P S     STN  S+                     P
Subjt:  --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------P

Query:  TSSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEV
           GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EV
Subjt:  TSSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEV

Query:  REASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT
        R +SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD 
Subjt:  REASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT

Query:  TISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
         ISRY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  TISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein6.4e-18945.65Show/hide
Query:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQV
        M  RI  VVA SIA   +K+L ++         +  S+NGE  D +++V     ++ K ++  EEEEEEE   I+   +Q  G  SD  D D    EF+ 
Subjt:  MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQV

Query:  LL-----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAE
        LL      P  + +N L  +  ++E+  E  +  +  ELERL +L+ ELEER+VKLEGEL+   G+K  E+D++EL++QL  K  +I MLN TI+SLQAE
Subjt:  LL-----LPQRNSENWLLDDNRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAE

Query:  RKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS
        RK L+EE+ +  +++KELE  R KIKELQR+IQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ 
Subjt:  RKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS

Query:  KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQL
        KL+  +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+L
Subjt:  KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQL

Query:  MLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMA
        MLEYAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M 
Subjt:  MLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMA

Query:  LSA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK-------
         +A                    ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+       
Subjt:  LSA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK-------

Query:  --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------P
                            ++GD S+ + E  +    +NA  V K+  + + +         P S     STN  S+                     P
Subjt:  --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------P

Query:  TSSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEV
           GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EV
Subjt:  TSSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEV

Query:  REASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT
        R +SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD 
Subjt:  REASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT

Query:  TISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
         ISRY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  TISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein7.9e-16347.12Show/hide
Query:  KILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSK
        K L+EE+ +  +++KELE  R KIKELQR+IQLDANQTK +LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+  ELK KNRELQHE +EL+ K
Subjt:  KILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSK

Query:  LEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLM
        L+  +ARI TL+ MTE++ + K REE   LK  NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LM
Subjt:  LEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLM

Query:  LEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMAL
        LEYAG E G+ +TD ESN+S P S   D+ +N S+DSS SR SSF +KP     LKK  +++  SS  S PS             S ++ + PLE++M  
Subjt:  LEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMAL

Query:  SA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK--------
        +A                    ET  L  +R Q    S  + +NSVA SF +MSKSV+  L +KY  YK+ HKLA+  EK IK K +  RA+        
Subjt:  SA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK--------

Query:  -------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------PT
                           ++GD S+ + E  +    +NA  V K+  + + +         P S     STN  S+                     P 
Subjt:  -------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS---------------------PT

Query:  SSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVR
          GG                   +VHR  ELV+F + +MK E                                     VK  +ETQ D  V +LA EVR
Subjt:  SSGG-------------------EVHRGSELVQFNRKMMKPE-------------------------------------VKDHMETQRDHLVMALAMEVR

Query:  EASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTT
         +SF+++ED+++FV WLDE+LS LVD   +L+HFDWP+ K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM  LL+KVEQSVYALL+TRD  
Subjt:  EASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTT

Query:  ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN
        ISRY+E GIP+DWL D GVVGKIK+  V+LA+KYMKR+  E +++SG +K+PNREFLL QGVRFA RVH+FAGGFD++SMKAFEELRSR  TE+G  N
Subjt:  ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.0e-4229.23Show/hide
Query:  HENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS
        H      S   V+    + + ++ E E + K      KL  E+  +I  LE    ++  E++E   LR   A L     +N+    E  R        ++
Subjt:  HENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKS

Query:  KEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS-
        +EK  +  +    K +     DH  + S  F   +D    +++  S  R  S R  P      + +   +  + ++S     D     KD +E+    S 
Subjt:  KEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS-

Query:  AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHH--------KLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLV
        +E LT S     V SR  V  V       S S+  S + +     +             +  +      V              +L      +R+   +V
Subjt:  AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHH--------KLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLV

Query:  -LKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSL
            + M+ +  +   DS    N+        S+  ++    E    NR +    +K  +ETQ D  +  L  EV  A+FS++ED+V FV WLD++LS L
Subjt:  -LKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSL

Query:  VDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIK
        VD   +L+HF+WP++K DALREAAF Y  L KL  E S F ++P+ +   AL KM +L +K+E  VY+L + R++  ++++   IP+DW+L+ G+  +IK
Subjt:  VDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIK

Query:  VLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHT
        +  V+LA KYMKR+  E  A+ G    P  E L+ QGVRFA RVH+FAGGFD+++MKAFEELR +  +
Subjt:  VLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAACAGAATAAGTGTTGTTGTTGCTGTTTCAATTGCAGTTTATGCAATTAAGCAGCTCACAATCAGATCATGGACCTCATTTTTCTTGCCTACCACCAACTGTTC
AGAAAATGGAGAGGACGTGAAGAAGAATGTAAAGCAAGTACACCAAAAAATTATTCGTGGATTGGAGGAAGAAGAAGAAGAGGAAGCTAATTCAATAAGTGATGCAACTA
GTCAAGTTAATGGTAGAACCTCTGATCTTGAAGATGGAGATCATAGTTCAGATGAGTTTCAAGTACTTCTATTACCACAACGAAATTCCGAAAATTGGTTGCTTGATGAC
AATAGGAAAGAAGAAAAAGTGCCTGAATTCCTTATAGAAAACAGTAAGATTGAATTGGAAAGGTTGCTGAAATTATTGATGGAATTGGAGGAGAGGAAAGTGAAACTTGA
AGGTGAATTGATCATGTGCGACGGAATAAAATACAGTGAAACCGACGTCATGGAGTTACGAAAGCAGCTGGACGCCAAGAATGATGATATCAGTATGCTTAATAACACAA
TCAGCTCTTTGCAAGCTGAGAGAAAGATTCTAAAAGAAGAGATATTGAAGGGAGCATTGATGAAGAAGGAACTAGAGGAGGGTAGGGGCAAGATTAAGGAGCTGCAGAGG
AAAATTCAGCTTGATGCAAACCAAACAAAAGAACGTTTATTATTACTCAAACAACGAGTTTCCACTTTGCAGGCAAAAGAGGAAGAGGCTGTCAAGAAAGAAGCAGAACT
TTATAAGAAACAGAAAGCGGCTAAGGATTTCGAGGTAGAATTTGGGGAGCTTAAGTGGAAGAATCGAGAGCTTCAACACGAAAATCAGGAACTAACTTCTAAACTAGAAG
TTATGAAGGCCAGAATCAAAACACTGACTAAGATGACAGAGACTGAAATAATCACCAAGGAAAGAGAAGAGGCTCAAAAGTTAAAGAGTGAAAATGAGGATCTCATAAAG
CAACTTGAAGGGCTCCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTGTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGAAACAACCAGATACCGGC
AGGTGAATCCGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCATGTTAGAGTATGCAGGAAAGGAGATTGGAGAAGCAGAAACAG
ATCATGAAAGCAACTTTTCTCATCCATTCTCTTCTGAGATCGATAATCTTGAAAACACTTCAATTGATAGTTCAAGAAGCAGAACAAGTAGTTTCAGAGAGAAGCCTAAT
TCAAATTTGAGTTTGAAGAAATTGATCAGAAACCAAGGTGGTTCGAGTGCTGTTTCGCCTCCCAGCACAATCGATTCGAGCCACAGATGGAAGGATCCTTTAGAAGCAGT
GATGGCTCTGAGTGCTGAAACTTTAACTCTCTCAGAGGTTAGATTGCAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACATCATTCCAACTGATGTCTAAATCAG
TTGAACAAAGTCTACAACAGAAATATTCAACTTATAAAGAGCATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGTTGAGAACGAAAGGGCAAAAAGT
TCTGGCGACTCTTCAAGTTCGAATTTAGAATACGAAGACATTAGCATGAGAAAGAATGCAACTTTAGTCCTTAAACTTGCCCAAATGAAGATGAATAGAATTTCTTGTGA
ACCAGATAGCCAATATGATAATAATAGCACAAATTTTATATCCAGTCCAACTTCTTCTGGAGGTGAAGTTCATCGAGGATCCGAGTTAGTTCAATTCAATCGAAAAATGA
TGAAACCTGAGGTGAAAGATCATATGGAAACTCAAAGAGATCACTTGGTTATGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCTCCAACATGGAAGATATCGTCTCC
TTCGTTATATGGCTAGACGAAAAGCTATCTTCTTTGGTTGATGGGATGGAGATTCTCGAACATTTCGATTGGCCTAAGCGAAAAACAGATGCTTTAAGAGAAGCAGCCTT
TGGATATCAGAAGCTGATGAAACTAAGAGAAGAAGTTTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAGATAAGG
TGGAGCAGAGTGTGTATGCATTGTTACAAACAAGGGACACGACGATTTCGCGATACGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGTGGAGTTGTTGGGAAG
ATAAAGGTGTTATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACACAATGCATTGAGTGGACCTGAAAAAGAACCAAATAGAGAATTTTTGCTTTT
CCAAGGAGTGCGTTTTGCTTCTCGCGTTCATAAGTTTGCAGGAGGCTTTGATTCGAAGAGCATGAAAGCTTTTGAAGAACTGAGAAGTCGTGTTCATACAGAGACAGGAC
AACAAAATTAA
mRNA sequenceShow/hide mRNA sequence
CCTGTCCTGAGAAGAAGACAAAAACACCACCAGAACCTTAATTCTCACTTTTGTTCTTTCTGTCCACTTTCAAGATCTCTAGCTAAGTTAAAGATGATGAACAGAATAAG
TGTTGTTGTTGCTGTTTCAATTGCAGTTTATGCAATTAAGCAGCTCACAATCAGATCATGGACCTCATTTTTCTTGCCTACCACCAACTGTTCAGAAAATGGAGAGGACG
TGAAGAAGAATGTAAAGCAAGTACACCAAAAAATTATTCGTGGATTGGAGGAAGAAGAAGAAGAGGAAGCTAATTCAATAAGTGATGCAACTAGTCAAGTTAATGGTAGA
ACCTCTGATCTTGAAGATGGAGATCATAGTTCAGATGAGTTTCAAGTACTTCTATTACCACAACGAAATTCCGAAAATTGGTTGCTTGATGACAATAGGAAAGAAGAAAA
AGTGCCTGAATTCCTTATAGAAAACAGTAAGATTGAATTGGAAAGGTTGCTGAAATTATTGATGGAATTGGAGGAGAGGAAAGTGAAACTTGAAGGTGAATTGATCATGT
GCGACGGAATAAAATACAGTGAAACCGACGTCATGGAGTTACGAAAGCAGCTGGACGCCAAGAATGATGATATCAGTATGCTTAATAACACAATCAGCTCTTTGCAAGCT
GAGAGAAAGATTCTAAAAGAAGAGATATTGAAGGGAGCATTGATGAAGAAGGAACTAGAGGAGGGTAGGGGCAAGATTAAGGAGCTGCAGAGGAAAATTCAGCTTGATGC
AAACCAAACAAAAGAACGTTTATTATTACTCAAACAACGAGTTTCCACTTTGCAGGCAAAAGAGGAAGAGGCTGTCAAGAAAGAAGCAGAACTTTATAAGAAACAGAAAG
CGGCTAAGGATTTCGAGGTAGAATTTGGGGAGCTTAAGTGGAAGAATCGAGAGCTTCAACACGAAAATCAGGAACTAACTTCTAAACTAGAAGTTATGAAGGCCAGAATC
AAAACACTGACTAAGATGACAGAGACTGAAATAATCACCAAGGAAAGAGAAGAGGCTCAAAAGTTAAAGAGTGAAAATGAGGATCTCATAAAGCAACTTGAAGGGCTCCA
GATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTGTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGAAACAACCAGATACCGGCAGGTGAATCCGCTCGTT
ATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCATGTTAGAGTATGCAGGAAAGGAGATTGGAGAAGCAGAAACAGATCATGAAAGCAACTTT
TCTCATCCATTCTCTTCTGAGATCGATAATCTTGAAAACACTTCAATTGATAGTTCAAGAAGCAGAACAAGTAGTTTCAGAGAGAAGCCTAATTCAAATTTGAGTTTGAA
GAAATTGATCAGAAACCAAGGTGGTTCGAGTGCTGTTTCGCCTCCCAGCACAATCGATTCGAGCCACAGATGGAAGGATCCTTTAGAAGCAGTGATGGCTCTGAGTGCTG
AAACTTTAACTCTCTCAGAGGTTAGATTGCAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACATCATTCCAACTGATGTCTAAATCAGTTGAACAAAGTCTACAA
CAGAAATATTCAACTTATAAAGAGCATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGTTGAGAACGAAAGGGCAAAAAGTTCTGGCGACTCTTCAAG
TTCGAATTTAGAATACGAAGACATTAGCATGAGAAAGAATGCAACTTTAGTCCTTAAACTTGCCCAAATGAAGATGAATAGAATTTCTTGTGAACCAGATAGCCAATATG
ATAATAATAGCACAAATTTTATATCCAGTCCAACTTCTTCTGGAGGTGAAGTTCATCGAGGATCCGAGTTAGTTCAATTCAATCGAAAAATGATGAAACCTGAGGTGAAA
GATCATATGGAAACTCAAAGAGATCACTTGGTTATGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCTCCAACATGGAAGATATCGTCTCCTTCGTTATATGGCTAGA
CGAAAAGCTATCTTCTTTGGTTGATGGGATGGAGATTCTCGAACATTTCGATTGGCCTAAGCGAAAAACAGATGCTTTAAGAGAAGCAGCCTTTGGATATCAGAAGCTGA
TGAAACTAAGAGAAGAAGTTTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAGATAAGGTGGAGCAGAGTGTGTAT
GCATTGTTACAAACAAGGGACACGACGATTTCGCGATACGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGTGGAGTTGTTGGGAAGATAAAGGTGTTATGTGT
AGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACACAATGCATTGAGTGGACCTGAAAAAGAACCAAATAGAGAATTTTTGCTTTTCCAAGGAGTGCGTTTTG
CTTCTCGCGTTCATAAGTTTGCAGGAGGCTTTGATTCGAAGAGCATGAAAGCTTTTGAAGAACTGAGAAGTCGTGTTCATACAGAGACAGGACAACAAAATTAATCATAT
TTTTCTCACTTTTTTTTTTGTTCGTATTGTATAATTAATCTTACCTTATTTTATGTAGATATAATCTCAATAAATTTTATGAAAAGTTGATGGATGATCACAAATATAAG
AAAGAGAGAGTAGGGAGGGTG
Protein sequenceShow/hide protein sequence
MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDD
NRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQR
KIQLDANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIK
QLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPN
SNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKS
SGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVS
FVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGK
IKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTETGQQN