; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G06090 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G06090
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationClcChr01:5823318..5830695
RNA-Seq ExpressionClc01G06090
SyntenyClc01G06090
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008467205.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]0.0e+0082.51Show/hide
Query:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL-QR
        MMN+  ++VAVS+AAYAI+QLTIRSW+S FLPTNCSENGED +KN         GLD   EEEEE +S+NDA+S+VNGR  DLEDGD +SDE QVLL QR
Subjt:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL-QR

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG
         S+N LL   KKEEKVPE   E++KIE ERL+KLVMELEERKVKLEGEL+MCDGIKYSETDV ELRKQL+AK +DISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG

Query:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEEAR KIK+LQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KKQKAAKD EVELGELK KNRELQHEKQELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA
        LTK+TESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKS+EKAKQLMLEYAG+EFGQ 
Subjt:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFS GI++L+NTSIDSSRSRTSSFSEKPNSNLSLKKLIRN+ G SAVS P I GSSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVEESL+QKYSTY+E++KLAIGSEKQIKEKAE+E+AKSSGD+SS NLEY+D SMR K A LP+KLAQMK NK SC+PDSQ D DS N+I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ
        SNPT SGGEVHRG EL++FN+KM+KPEVKA METQGD LVVALAMEVREA F++MED+VSF I LDEKLSSLVD MEILEHFDWP  KTDALREAAF YQ
Subjt:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS   L  TRD  ISRYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+ P+KEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP

Query:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        NREFLL QG         FAG FD +SMKAFEELR+RVHTE GQK+
Subjt:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

XP_011655490.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]0.0e+0083.45Show/hide
Query:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QR
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q +++I+ GL +EEEEE NS++D +S+VNGR  DLEDGD +SDEFQVLL  QR
Subjt:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QR

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG
         S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL+AK DDISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG

Query:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEE RGKIK+LQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA
        LTK+TE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKSKEKAKQLMLEYAG E G+A
Subjt:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ GSSAVS P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY D SMR K A L +KLAQMKMNK SC+PDSQYD +S N I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ
        S+PT SGGEVHRG EL++FNRKM+KPEVK  METQ D LV+ALAMEVREASF++MED+VSF IWLDEKLSSLVD MEILEHFDWPK KTDALREAAF YQ
Subjt:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TISRYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+ PEKEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP

Query:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        NREFLL QG         FAG FD +SMKAFEELRSRVHTE GQ++
Subjt:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

XP_031741050.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]0.0e+0082.83Show/hide
Query:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QR
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q +++I+ GL +EEEEE NS++D +S+VNGR  DLEDGD +SDEFQVLL  QR
Subjt:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QR

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG
         S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL+AK DDISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG

Query:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEE RGKIK+LQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA
        LTK+TE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKSKEKAKQLMLEYAG E G+A
Subjt:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ GSSAVS P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY D SMR K A L +KLAQMKMNK SC+PDSQYD +S N I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ
        S+PT SGGEVHRG EL++FNRKM+KPEVK  METQ D LV+ALAMEVREASF++MED+VSF IWLDEKLSSLVD MEILEHFDWPK KTDALREAAF YQ
Subjt:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TISRYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+ PEKEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP

Query:  NREFLLLQGFAGNFDFESMKAFEELRSRVHTEA
        NREFLL QG                 SRVH EA
Subjt:  NREFLLLQGFAGNFDFESMKAFEELRSRVHTEA

XP_031741051.1 protein CHUP1, chloroplastic isoform X3 [Cucumis sativus]0.0e+0080.26Show/hide
Query:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QR
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q +++I+ GL +EEEEE NS++D +S+VNGR  DLEDGD +SDEFQVLL  QR
Subjt:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QR

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG
         S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL+AK DDISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG

Query:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEE RGKIK+LQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA
        LTK+TE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AG+                           E G+A
Subjt:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ GSSAVS P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY D SMR K A L +KLAQMKMNK SC+PDSQYD +S N I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ
        S+PT SGGEVHRG EL++FNRKM+KPEVK  METQ D LV+ALAMEVREASF++MED+VSF IWLDEKLSSLVD MEILEHFDWPK KTDALREAAF YQ
Subjt:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TISRYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+ PEKEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP

Query:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        NREFLL QG         FAG FD +SMKAFEELRSRVHTE GQ++
Subjt:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

XP_038906491.1 protein CHUP1, chloroplastic-like isoform X1 [Benincasa hispida]0.0e+0085.37Show/hide
Query:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLLQRESQN
        MMN+ G+LVAVS+ AYAIRQLTIRSWSSLF P NCSENGED +KN         GLDEEEEE NS+ND +S+VNGR  D+EDGD  SDEF+VLL RES+N
Subjt:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLLQRESQN

Query:  RLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGALMK
          LDDNKKEEKVPEIQIEN+KIELERLVKLVMELEERK KLEGEL+MCD IKYSETDVTELRKQL+AK DDISMLN TISSLQAERKILQEEI+KGALMK
Subjt:  RLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGALMK

Query:  KELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKI
        KELE ARGKIK+LQRQIQLDANQTKEHLLLLKQRVSALQAKEEEA+KKEAELYKKQKAAKD EVELGELKRKNRELQHEK EL SKLEVMKARIKTLTK+
Subjt:  KELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKI

Query:  TESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAETDH
        TESEI+TKEREEAQKLKSENEDLIK LERLQMNRF+EVEELVYLRWINACLRYELRDNEIS GESARYLNKS SPKSKEKAKQLMLEYAGLE GQ ETDH
Subjt:  TESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAETDH

Query:  ESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVAT
        ESNFSHPFSSGIED+DNTSIDSSRSRTSSF EKPNSNLSLKKLIRN  GSSAVSSP IIGSSHRWKDPLEAVMALSAETLTLSEVRL+VSS KSVNSVAT
Subjt:  ESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVAT

Query:  SFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPT
        SFQLMSKSV+ESLKQKYSTY+E+ KLA+GSEKQIKEKA NERAKSSGDA S   EY+DT++R KPAILP++L QMKMN+TS DPDSQ+D DSKNMISNPT
Subjt:  SFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPT

Query:  -SGGEVHRGPELLRFNRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKL
         SGGEVHRGPEL+RFNRK++KPEV AD+ETQGDLVVALAMEVREASF++MEDVVSF I LDEK  SLV+ MEIL+HFDWPK KTDAL EAAF YQKLMKL
Subjt:  -SGGEVHRGPELLRFNRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKL

Query:  KEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFL
        +EEVSSFVDNPKLTCEVALNKMNSL+DKVEQS YGLF TRD TIS+YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIV EHNAL+ PEKEP+REFL
Subjt:  KEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFL

Query:  LLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        L QG         FAG FDFESMKAFEELRSRVHTEAGQK+
Subjt:  LLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

TrEMBL top hitse value%identityAlignment
A0A061ECQ9 Hydroxyproline-rich glycoprotein family protein isoform 41.0e-20250.8Show/hide
Query:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE-------DMEKNVRQYRRILCGL-------DEEEEETNSLNDASSEVNGRIFDLEDGDRN
        M+ + G +VA S+AA+A++QL +++  S       SENGE       +   N +Q+      L       +EEEE+   ++   + VNG   D+ D D  
Subjt:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE-------DMEKNVRQYRRILCGL-------DEEEEETNSLNDASSEVNGRIFDLEDGDRN

Query:  SDEFQVLLQRESQNRLLDD---NKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQ
          EF+ LL  E +  L  D     + EK+ E ++ N+  ELERL  LV ELEER+VKLEGEL+   G+K  E+D+ EL++QL+ KT +I MLN TISSLQ
Subjt:  SDEFQVLLQRESQNRLLDD---NKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQ

Query:  AERKILQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQEL
        +ERK LQE+I  GA +KKELE AR KIK+LQRQIQLDANQTK  LL LKQ+VS LQAKE+EA+K +AE+ KK KA K+ E+E+ EL+RKN+ELQHEK+EL
Subjt:  AERKILQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQEL

Query:  TSKLEVMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAK
        T KL+  +A+I  L+ +TE+EI  + REE   L+  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +   G+ SAR LNKS SPKS+E AK
Subjt:  TSKLEVMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAK

Query:  QLMLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAV
        QL+LEYAG E GQ +TD ESNFSHP S+G EDLDN SI SS SR SS S+KP+    LKK  R+K  SSAVSSP          RI  S H  + PLEA+
Subjt:  QLMLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAV

Query:  MALSA--------------------ETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSP
        M  +A                    ET T+  +R +VSS  S NSVATSF LMS+SV+ SL++KY  Y++ HKLA+  EKQIK+KA+  RA+  GD S+ 
Subjt:  MALSA--------------------ETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSP

Query:  NLEYNDTSMRTKPAILPVKLAQMKMNKT-SCDPDSQYDDSKNMIS---------------------------------NPTSGGEVHRGPELLRFNRKMI
           ++  + R KP ILP KLAQ+K       D   Q +D K + S                                 N T+ G+    P L      + 
Subjt:  NLEYNDTSMRTKPAILPVKLAQMKMNKT-SCDPDSQYDDSKNMIS---------------------------------NPTSGGEVHRGPELLRFNRKMI

Query:  KP--------------EVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSS
         P               VKAD+ETQGD V +LA E+R ASFT +ED+V+F  WLDE+LS LVDE  +L+HFDWP+ K DALREAAFEYQ L+KL++++SS
Subjt:  KP--------------EVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSS

Query:  FVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG--
        FVD+P L CE AL KM  LL+KVEQS Y L  TRD  ISRY+E GIP++WLLD GVVGKIK+S V+LARKYMKR+  E + L  PEKEPNREF+LLQG  
Subjt:  FVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG--

Query:  -------FAGNFDFESMKAFEELRSRVHTEAGQ
               FAG FD ESMKAFEELRSRVH++ G+
Subjt:  -------FAGNFDFESMKAFEELRSRVHTEAGQ

A0A0A0KT25 Uncharacterized protein0.0e+0083.43Show/hide
Query:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQYRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QRE
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q      GL +EEEEE NS++D +S+VNGR  DLEDGD +SDEFQVLL  QR 
Subjt:  MNKFGILVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQYRRILCGL-DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL--QRE

Query:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGA
        S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL+AK DDISMLN TISSLQAERKIL+EEI+KGA
Subjt:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGA

Query:  LMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL
        LMKKELEE RGKIK+LQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKTL
Subjt:  LMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL

Query:  TKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAE
        TK+TE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKSKEKAKQLMLEYAG E G+AE
Subjt:  TKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAE

Query:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS
        TDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ GSSAVS P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVNS
Subjt:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS

Query:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS
        VATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY D SMR K A L +KLAQMKMNK SC+PDSQYD +S N IS
Subjt:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS

Query:  NPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQK
        +PT SGGEVHRG EL++FNRKM+KPEVK  METQ D LV+ALAMEVREASF++MED+VSF IWLDEKLSSLVD MEILEHFDWPK KTDALREAAF YQK
Subjt:  NPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQK

Query:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPN
        LMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TISRYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+ PEKEPN
Subjt:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPN

Query:  REFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        REFLL QG         FAG FD +SMKAFEELRSRVHTE GQ++
Subjt:  REFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

A0A1S3CSZ9 protein CHUP1, chloroplastic-like0.0e+0082.51Show/hide
Query:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL-QR
        MMN+  ++VAVS+AAYAI+QLTIRSW+S FLPTNCSENGED +KN         GLD   EEEEE +S+NDA+S+VNGR  DLEDGD +SDE QVLL QR
Subjt:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL-QR

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG
         S+N LL   KKEEKVPE   E++KIE ERL+KLVMELEERKVKLEGEL+MCDGIKYSETDV ELRKQL+AK +DISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKG

Query:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEEAR KIK+LQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KKQKAAKD EVELGELK KNRELQHEKQELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA
        LTK+TESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKS+EKAKQLMLEYAG+EFGQ 
Subjt:  LTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFS GI++L+NTSIDSSRSRTSSFSEKPNSNLSLKKLIRN+ G SAVS P I GSSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVEESL+QKYSTY+E++KLAIGSEKQIKEKAE+E+AKSSGD+SS NLEY+D SMR K A LP+KLAQMK NK SC+PDSQ D DS N+I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ
        SNPT SGGEVHRG EL++FN+KM+KPEVKA METQGD LVVALAMEVREA F++MED+VSF I LDEKLSSLVD MEILEHFDWP  KTDALREAAF YQ
Subjt:  SNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS   L  TRD  ISRYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+ P+KEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP

Query:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        NREFLL QG         FAG FD +SMKAFEELR+RVHTE GQK+
Subjt:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

A0A5D3BMR7 Protein CHUP10.0e+0085.47Show/hide
Query:  DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL-QRESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSE
        +EEEEE +S+NDA+S+VNGR  DLEDGD +SDE QVLL QR S+N LL   KKEEKVPE   EN+KIE ERL+KLVMELEERKVKLEGEL+MCDGIKYSE
Subjt:  DEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLL-QRESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSE

Query:  TDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKK
        TDV ELRKQL+AK +DISMLN TISSLQAERKIL+EEI+KGALMKKELEEARGKIK+LQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KK
Subjt:  TDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKK

Query:  QKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYEL
        QKAAKD EVELGELK KNRELQHEKQELTSKLEVMKARIKTLTK+TESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYEL
Subjt:  QKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYEL

Query:  RDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSS
        R+N+I AGESARYLNKSSSPKS+EKAKQLMLEYAG+EFGQ ETDHESNFSHPFS GI++L+NTSIDSSRSRTSSFSEKPNSNLSLKKLIRN+ G SAVS 
Subjt:  RDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSS

Query:  PRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLE
        P I GSSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVNSVATSFQLMSKSVEESL+QKYSTY+E+HKLAIGSEKQIKEKAE+E+AKSSGD+SS NLE
Subjt:  PRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLE

Query:  YNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVV
        Y+D SMR K A LP+KLAQMK NK SC+PDSQ D DS N+ISNPT SGGEVHRG EL++FN+KM+KPEVKA METQGD LVVALAMEVREA F++MED+V
Subjt:  YNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPT-SGGEVHRGPELLRFNRKMIKPEVKADMETQGD-LVVALAMEVREASFTHMEDVV

Query:  SFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPI
        SF I LDEKLSSLVD MEILEHFDWP  KTDALREAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS   L  TRD  ISRYEELGIPI
Subjt:  SFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPI

Query:  DWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG
        DWLLDCGVVGKIKV CVELARKYMKRIVKEHN L+ P+KEPNREFLL QG
Subjt:  DWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG

A0A6J1DWY5 protein CHUP1, chloroplastic-like2.0e-25463.91Show/hide
Query:  MMMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLLQRESQ
        M+M K G+LVAVS+AAYAI+QLTIRSWSS  LPTNCSENGE  EKN         GLD EE++ NS+N A+S+V+G   D         E + LL R+S+
Subjt:  MMMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLLQRESQ

Query:  NRLLDDNKKEE-KVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGAL
        +RLLD NKKEE KVPE  +EN+KIEL+RL+KLVMELEERKVKLE EL+M D +K  ++D TEL K+LEAK +D+SMLN TISSLQAERK LQEEIVKGA 
Subjt:  NRLLDDNKKEE-KVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGAL

Query:  MKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLT
        MKKELEEA+GKIK+LQRQ+QLDANQTKEHL  LK+RVS LQAKEEEAVKKEA+LY+K KAAK  E+ELGELK+KNR+LQ EK+ELTSKLEVM+ARI TLT
Subjt:  MKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLT

Query:  KITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAET
         +TESEIIT+EREE +KL+  NE+L KQLE LQMNRFSEVEELVYLRW+NACLRYELRDNE   GESA  L+KS SPKSKEKAKQLMLEYAGL FGQ ET
Subjt:  KITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAET

Query:  DHESNFSHP-FSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTL-SEVRLKVSSRKSVN
        DHESNFSHP FSSGIED DNTS  SSRSRTSSF                                 RWKDPLEA +A S ETLT  SEV+ +VSSR SVN
Subjt:  DHESNFSHP-FSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTL-SEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAI--GSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNM
        SVATSFQ MS+S EES+KQKYS Y+E+HKL I  G EKQIKEKAE ER K+S                                                
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAI--GSEKQIKEKAENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNM

Query:  ISNPTSGGEVHRGPELLRFNRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDE-MEILEHFDWPKEKTDALREAAFEYQ
                  +  PE +RF++K++K EVKADMET+GDLV+ L M+V+  SFT+MEDVVSF IWLD+K SSLVDE + ILEHFDWP+ K+DALREAA EYQ
Subjt:  ISNPTSGGEVHRGPELLRFNRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDE-MEILEHFDWPKEKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP
         LMKL EEVSSFVD+PKLT EVAL  M+SLL K+EQS + +   R+  IS+YEELGIP+DWLLD GVVGK+KV  VELARKYMKRI+ E NAL+ P KEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEP

Query:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQK
        NREFLLLQG         FAG FD ESMKAFEELR+R+HTEAGQK
Subjt:  NREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQK

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic7.1e-18545.23Show/hide
Query:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVL
        M  + G +VA S+AA  +++L ++       P+  S+NGE  D E++V        + +    +EEEEE   +N   ++  G   D  D D    EF+ L
Subjt:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVL

Query:  LQRESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKI
        L  E +  L DD+   EK       E+++  +  ELERL +LV ELEER+VKLEGEL+   G+K  E+D+ EL++QL+ KT +I MLN TI+SLQAERK 
Subjt:  LQRESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKI

Query:  LQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLE
        LQEE+ +  +++KELE AR KIK+LQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL+
Subjt:  LQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLE

Query:  VMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMLE
          +ARI TL+ +TES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LMLE
Subjt:  VMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMLE

Query:  YAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMALSA
        YAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K  SS  SSP          R+  S ++ + PLE++M  +A
Subjt:  YAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMALSA

Query:  --------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK----------
                            ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+          
Subjt:  --------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK----------

Query:  -----------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NPTS
                         ++GD S+ + E N+       A +  +KL  ++    +    P       K  N+ S                       P  
Subjt:  -----------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NPTS

Query:  GG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVREAS
        GG                   +VHR PEL+ F + ++K E                                     VKAD+ETQGD V +LA EVR +S
Subjt:  GG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVREAS

Query:  FTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISR
        FT +ED+++F  WLDE+LS LVDE  +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TRD  ISR
Subjt:  FTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISR

Query:  YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        Y+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++  +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  +
Subjt:  YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein3.6e-4341.28Show/hide
Query:  NRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTC
        NR      +KAD+ET+G+ +  L  +V    F+ MEDV+ F  WLD++L++L DE  +L+HF WP++K D L+EAA EY++L KL++E+SS+ D+P +  
Subjt:  NRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTC

Query:  EVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FA
         VAL KM +LLDK EQ    L   R +++  Y++  IP++W+LD G++ KIK + ++LA+ YM R+  E  +    ++E  +E LLLQG         FA
Subjt:  EVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FA

Query:  GNFDFESMKAFEELRSRV
        G  D E++ A EE++ RV
Subjt:  GNFDFESMKAFEELRSRV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein5.1e-18645.23Show/hide
Query:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVL
        M  + G +VA S+AA  +++L ++       P+  S+NGE  D E++V        + +    +EEEEE   +N   ++  G   D  D D    EF+ L
Subjt:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVL

Query:  LQRESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKI
        L  E +  L DD+   EK       E+++  +  ELERL +LV ELEER+VKLEGEL+   G+K  E+D+ EL++QL+ KT +I MLN TI+SLQAERK 
Subjt:  LQRESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKI

Query:  LQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLE
        LQEE+ +  +++KELE AR KIK+LQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL+
Subjt:  LQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLE

Query:  VMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMLE
          +ARI TL+ +TES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LMLE
Subjt:  VMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMLE

Query:  YAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMALSA
        YAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K  SS  SSP          R+  S ++ + PLE++M  +A
Subjt:  YAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMALSA

Query:  --------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK----------
                            ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+          
Subjt:  --------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK----------

Query:  -----------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NPTS
                         ++GD S+ + E N+       A +  +KL  ++    +    P       K  N+ S                       P  
Subjt:  -----------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NPTS

Query:  GG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVREAS
        GG                   +VHR PEL+ F + ++K E                                     VKAD+ETQGD V +LA EVR +S
Subjt:  GG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVREAS

Query:  FTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISR
        FT +ED+++F  WLDE+LS LVDE  +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TRD  ISR
Subjt:  FTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISR

Query:  YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        Y+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++  +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  +
Subjt:  YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein5.1e-18645.23Show/hide
Query:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVL
        M  + G +VA S+AA  +++L ++       P+  S+NGE  D E++V        + +    +EEEEE   +N   ++  G   D  D D    EF+ L
Subjt:  MMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVL

Query:  LQRESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKI
        L  E +  L DD+   EK       E+++  +  ELERL +LV ELEER+VKLEGEL+   G+K  E+D+ EL++QL+ KT +I MLN TI+SLQAERK 
Subjt:  LQRESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKI

Query:  LQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLE
        LQEE+ +  +++KELE AR KIK+LQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL+
Subjt:  LQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLE

Query:  VMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMLE
          +ARI TL+ +TES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LMLE
Subjt:  VMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLMLE

Query:  YAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMALSA
        YAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K  SS  SSP          R+  S ++ + PLE++M  +A
Subjt:  YAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMALSA

Query:  --------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK----------
                            ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+          
Subjt:  --------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK----------

Query:  -----------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NPTS
                         ++GD S+ + E N+       A +  +KL  ++    +    P       K  N+ S                       P  
Subjt:  -----------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NPTS

Query:  GG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVREAS
        GG                   +VHR PEL+ F + ++K E                                     VKAD+ETQGD V +LA EVR +S
Subjt:  GG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVREAS

Query:  FTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISR
        FT +ED+++F  WLDE+LS LVDE  +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TRD  ISR
Subjt:  FTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISR

Query:  YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        Y+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++  +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  +
Subjt:  YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein1.3e-15746.68Show/hide
Query:  KILQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSK
        K LQEE+ +  +++KELE AR KIK+LQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ K
Subjt:  KILQEEIVKGALMKKELEEARGKIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSK

Query:  LEVMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLM
        L+  +ARI TL+ +TES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+LM
Subjt:  LEVMKARIKTLTKITESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLM

Query:  LEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMAL
        LEYAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K  SS  SSP          R+  S ++ + PLE++M  
Subjt:  LEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKSGSSAVSSP----------RIIGSSHRWKDPLEAVMAL

Query:  SA--------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK--------
        +A                    ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+        
Subjt:  SA--------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAK--------

Query:  -------------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NP
                           ++GD S+ + E N+       A +  +KL  ++    +    P       K  N+ S                       P
Subjt:  -------------------SSGDASSPNLEYNDTSMRTKPA-ILPVKLAQMKMN--KTSCDPDSQYDDSK--NMIS----------------------NP

Query:  TSGG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVRE
          GG                   +VHR PEL+ F + ++K E                                     VKAD+ETQGD V +LA EVR 
Subjt:  TSGG-------------------EVHRGPELLRFNRKMIKPE-------------------------------------VKADMETQGDLVVALAMEVRE

Query:  ASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATI
        +SFT +ED+++F  WLDE+LS LVDE  +L+HFDWP+ K DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TRD  I
Subjt:  ASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATI

Query:  SRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS
        SRY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++  +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  +
Subjt:  SRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FAGNFDFESMKAFEELRSRVHTEAGQKS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.6e-4644.09Show/hide
Query:  NRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTC
        NR +    +K D+ETQGD +  L  EV  A+F+ +EDVV F  WLD++LS LVDE  +L+HF+WP++K DALREAAF Y  L KL  E S F ++P+ + 
Subjt:  NRKMIKPEVKADMETQGDLVVALAMEVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTC

Query:  EVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FA
          AL KM +L +K+E   Y L   R++  ++++   IP+DW+L+ G+  +IK++ V+LA KYMKR+  E  A+      P  E L++QG         FA
Subjt:  EVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQG---------FA

Query:  GNFDFESMKAFEELRSRVHT
        G FD E+MKAFEELR +  +
Subjt:  GNFDFESMKAFEELRSRVHT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATCAAATTTGCTATTTAACCGTTATTTTATGATGATGAACAAATTTGGTATTCTTGTTGCTGTTTCAGTTGCAGCTTATGCAATTAGGCAGCTCACAATCAGATC
ATGGAGCTCATTATTCTTGCCTACCAACTGTTCAGAAAATGGAGAAGACATGGAGAAGAACGTAAGGCAATACCGAAGAATTCTTTGTGGATTGGATGAAGAAGAAGAGG
AAACTAATTCATTAAATGATGCAAGTAGTGAAGTTAATGGTAGGATCTTTGATCTTGAAGATGGAGATCGTAATTCAGATGAGTTTCAAGTACTTTTACAACGAGAGTCC
CAAAATCGGTTGCTTGATGACAATAAGAAAGAAGAAAAAGTGCCTGAAATCCAGATAGAAAACAGTAAGATTGAATTGGAAAGGTTGGTGAAACTAGTGATGGAATTGGA
GGAGAGGAAAGTGAAACTTGAAGGTGAATTGGTCATGTGCGACGGAATAAAATACAGTGAAACCGACGTCACAGAGTTACGAAAGCAGCTCGAGGCCAAGACTGATGATA
TCAGTATGCTTAATACCACAATCAGCTCTTTGCAGGCTGAGAGAAAGATTCTACAAGAAGAGATAGTGAAGGGAGCACTGATGAAGAAGGAACTAGAGGAGGCTAGGGGA
AAGATTAAGGATCTGCAAAGGCAAATTCAGCTTGATGCAAACCAAACAAAAGAACACTTATTATTACTCAAACAACGAGTTTCTGCTTTGCAGGCGAAAGAGGAAGAGGC
CGTCAAGAAAGAAGCAGAACTCTATAAGAAACAGAAAGCGGCGAAGGATTCCGAGGTAGAACTTGGGGAGCTTAAGCGGAAGAATCGAGAGCTTCAACACGAAAAGCAGG
AACTAACTTCTAAACTAGAAGTTATGAAGGCCAGAATCAAAACACTGACTAAAATAACAGAGAGTGAAATAATCACTAAGGAAAGAGAAGAAGCTCAAAAGTTAAAGAGT
GAAAATGAGGATCTGATAAAGCAACTTGAAAGGCTTCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTCTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCT
TCGAGACAACGAGATATCGGCAGGCGAATCTGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCATGTTAGAGTATGCAGGATTGG
AGTTTGGACAAGCAGAAACAGATCATGAAAGCAACTTTTCTCATCCATTCTCCTCTGGGATTGAAGATCTTGACAATACTTCAATTGATAGTTCAAGGAGCAGAACTAGT
AGTTTCAGTGAGAAGCCTAATTCAAATTTGAGTTTGAAGAAATTGATCAGAAACAAAAGTGGTTCGAGTGCTGTTTCGTCTCCACGCATAATCGGTTCGAGCCACAGATG
GAAGGATCCTTTAGAAGCAGTGATGGCTCTGAGTGCTGAAACTCTAACTCTCTCAGAAGTTAGATTAAAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACGTCAT
TCCAACTCATGTCTAAATCAGTTGAAGAAAGTCTAAAACAGAAATATTCAACATATGAAGAGAATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGCT
GAGAATGAAAGGGCAAAAAGTTCTGGTGATGCTTCGAGTCCGAATTTAGAATACAATGACACTAGCATGAGAACGAAGCCTGCAATTTTACCCGTTAAACTTGCTCAAAT
GAAGATGAATAAAACTTCTTGTGATCCAGATAGCCAATATGATGATAGCAAAAATATGATCTCCAATCCAACTTCTGGAGGTGAAGTTCATCGAGGACCGGAGTTACTTC
GGTTCAATCGAAAAATGATTAAACCTGAGGTGAAAGCTGATATGGAAACTCAAGGAGACTTGGTTGTGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCACACACATG
GAAGACGTCGTCTCCTTCGCTATATGGCTCGATGAAAAGTTATCTTCTTTGGTTGATGAGATGGAGATTCTCGAGCATTTCGATTGGCCCAAGGAAAAAACAGATGCATT
AAGAGAAGCAGCGTTTGAATATCAGAAGCTGATGAAACTAAAAGAAGAAGTCTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACT
CTTTGCTAGATAAGGTGGAGCAGAGTGCCTATGGATTGTTTCATACAAGGGATGCAACGATTTCGCGATATGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGT
GGAGTTGTTGGCAAGATAAAGGTGTCATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACATAATGCACTAAATAGACCTGAGAAAGAACCAAACAG
AGAGTTTTTGCTCTTACAAGGATTTGCAGGAAACTTTGATTTCGAGAGCATGAAGGCTTTTGAAGAGCTGAGAAGTCGTGTCCATACAGAGGCAGGACAAAAAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACATCAAATTTGCTATTTAACCGTTATTTTATGATGATGAACAAATTTGGTATTCTTGTTGCTGTTTCAGTTGCAGCTTATGCAATTAGGCAGCTCACAATCAGATC
ATGGAGCTCATTATTCTTGCCTACCAACTGTTCAGAAAATGGAGAAGACATGGAGAAGAACGTAAGGCAATACCGAAGAATTCTTTGTGGATTGGATGAAGAAGAAGAGG
AAACTAATTCATTAAATGATGCAAGTAGTGAAGTTAATGGTAGGATCTTTGATCTTGAAGATGGAGATCGTAATTCAGATGAGTTTCAAGTACTTTTACAACGAGAGTCC
CAAAATCGGTTGCTTGATGACAATAAGAAAGAAGAAAAAGTGCCTGAAATCCAGATAGAAAACAGTAAGATTGAATTGGAAAGGTTGGTGAAACTAGTGATGGAATTGGA
GGAGAGGAAAGTGAAACTTGAAGGTGAATTGGTCATGTGCGACGGAATAAAATACAGTGAAACCGACGTCACAGAGTTACGAAAGCAGCTCGAGGCCAAGACTGATGATA
TCAGTATGCTTAATACCACAATCAGCTCTTTGCAGGCTGAGAGAAAGATTCTACAAGAAGAGATAGTGAAGGGAGCACTGATGAAGAAGGAACTAGAGGAGGCTAGGGGA
AAGATTAAGGATCTGCAAAGGCAAATTCAGCTTGATGCAAACCAAACAAAAGAACACTTATTATTACTCAAACAACGAGTTTCTGCTTTGCAGGCGAAAGAGGAAGAGGC
CGTCAAGAAAGAAGCAGAACTCTATAAGAAACAGAAAGCGGCGAAGGATTCCGAGGTAGAACTTGGGGAGCTTAAGCGGAAGAATCGAGAGCTTCAACACGAAAAGCAGG
AACTAACTTCTAAACTAGAAGTTATGAAGGCCAGAATCAAAACACTGACTAAAATAACAGAGAGTGAAATAATCACTAAGGAAAGAGAAGAAGCTCAAAAGTTAAAGAGT
GAAAATGAGGATCTGATAAAGCAACTTGAAAGGCTTCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTCTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCT
TCGAGACAACGAGATATCGGCAGGCGAATCTGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCATGTTAGAGTATGCAGGATTGG
AGTTTGGACAAGCAGAAACAGATCATGAAAGCAACTTTTCTCATCCATTCTCCTCTGGGATTGAAGATCTTGACAATACTTCAATTGATAGTTCAAGGAGCAGAACTAGT
AGTTTCAGTGAGAAGCCTAATTCAAATTTGAGTTTGAAGAAATTGATCAGAAACAAAAGTGGTTCGAGTGCTGTTTCGTCTCCACGCATAATCGGTTCGAGCCACAGATG
GAAGGATCCTTTAGAAGCAGTGATGGCTCTGAGTGCTGAAACTCTAACTCTCTCAGAAGTTAGATTAAAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACGTCAT
TCCAACTCATGTCTAAATCAGTTGAAGAAAGTCTAAAACAGAAATATTCAACATATGAAGAGAATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGCT
GAGAATGAAAGGGCAAAAAGTTCTGGTGATGCTTCGAGTCCGAATTTAGAATACAATGACACTAGCATGAGAACGAAGCCTGCAATTTTACCCGTTAAACTTGCTCAAAT
GAAGATGAATAAAACTTCTTGTGATCCAGATAGCCAATATGATGATAGCAAAAATATGATCTCCAATCCAACTTCTGGAGGTGAAGTTCATCGAGGACCGGAGTTACTTC
GGTTCAATCGAAAAATGATTAAACCTGAGGTGAAAGCTGATATGGAAACTCAAGGAGACTTGGTTGTGGCATTGGCAATGGAAGTTAGAGAAGCGTCTTTCACACACATG
GAAGACGTCGTCTCCTTCGCTATATGGCTCGATGAAAAGTTATCTTCTTTGGTTGATGAGATGGAGATTCTCGAGCATTTCGATTGGCCCAAGGAAAAAACAGATGCATT
AAGAGAAGCAGCGTTTGAATATCAGAAGCTGATGAAACTAAAAGAAGAAGTCTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACT
CTTTGCTAGATAAGGTGGAGCAGAGTGCCTATGGATTGTTTCATACAAGGGATGCAACGATTTCGCGATATGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGT
GGAGTTGTTGGCAAGATAAAGGTGTCATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACATAATGCACTAAATAGACCTGAGAAAGAACCAAACAG
AGAGTTTTTGCTCTTACAAGGATTTGCAGGAAACTTTGATTTCGAGAGCATGAAGGCTTTTGAAGAGCTGAGAAGTCGTGTCCATACAGAGGCAGGACAAAAAAGTTGAA
TATTATATGTTCCTCAGTTTTTCTTTTACTATTTGTATTGTATAATTATCTCTACCTTACTTTATGTAGATAATTTCAAGAAATTTTATGAATGTATTAGGAAAAAAAGA
AAGAAAAGAAATGAAAAATGATGGATAAGAATAATAATCAAAGTTGATGAGGCCTAATCTAAAACCTTGAGGGGCAAAGAATTGTAACTAAGGCCTCTTCAAGACACAAA
ATATAAGAAAGAGAGAGTAGAGAGGGTGATGATCAATACAGACAGTGTCAACTATTCAAAAATGTCAACAAAAAGGG
Protein sequenceShow/hide protein sequence
MTSNLLFNRYFMMMNKFGILVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEETNSLNDASSEVNGRIFDLEDGDRNSDEFQVLLQRES
QNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLEAKTDDISMLNTTISSLQAERKILQEEIVKGALMKKELEEARG
KIKDLQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKITESEIITKEREEAQKLKS
ENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLMLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTS
SFSEKPNSNLSLKKLIRNKSGSSAVSSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKA
ENERAKSSGDASSPNLEYNDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMISNPTSGGEVHRGPELLRFNRKMIKPEVKADMETQGDLVVALAMEVREASFTHM
EDVVSFAIWLDEKLSSLVDEMEILEHFDWPKEKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATISRYEELGIPIDWLLDC
GVVGKIKVSCVELARKYMKRIVKEHNALNRPEKEPNREFLLLQGFAGNFDFESMKAFEELRSRVHTEAGQKS