; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G006030 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G006030
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationCicolChr01:6109270..6116656
RNA-Seq ExpressionCcUC01G006030
SyntenyCcUC01G006030
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008467205.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo]0.0e+0082.25Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL-QLE
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLPTNCSENGED +KN         GLD   EEEEEA+S+NDA+S+VNGR  DLEDGD +S E QVLL Q  
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL-QLE

Query:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGA
        S+N LL   KKEEKVPE   E++KIE ERL+KLVMELEERKVKLEGEL+MCDGIKYSETDV ELRKQL AK +DISMLN TISSLQAERKIL+EEI+KGA
Subjt:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGA

Query:  LMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL
        LMKKELEEAR KIKELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KKQKAAKD EVELGELK KNRELQHEKQELTSKLEVMKARIKTL
Subjt:  LMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL

Query:  TKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAE
        TKMTESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKS+EKAKQL+LEYAG+EFGQ E
Subjt:  TKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAE

Query:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS
        TDHESNFSHPFS GI++L+NTSIDSSRSRTSSFSEKPNSNLSLKKLIRN+ G SAV  P I GSSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVNS
Subjt:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS

Query:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS
        VATSFQLMSKSVEESL+QKYSTY+E++KLAIGSEKQIKEKAE+E+AKSSGD+SS NLEY D SMR K A LP+KLAQMK NK SC+PDSQ D DS N+IS
Subjt:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS

Query:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQK
        NPTSSGGEVHRG EL+ FN+KMM+PEVKA METQGD LVVALAM+VREA F++MED+VSF I LDEKLSSLVD MEIL+HFDWP  KTDALREAAF YQK
Subjt:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQK

Query:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN
        LMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS   L  TRD  I RYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+GP+KEPN
Subjt:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN

Query:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        REFLL QG         FAG FD +SMKAFEELR+RVHTE GQK+
Subjt:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

XP_011655490.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus]0.0e+0083.1Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QL
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q +++I+ GL +EEEEEANS++D +S+VNGR  DLEDGD +S EFQVLL  Q 
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QL

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKG
         S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL AK DDISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKG

Query:  ALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEE RGKIKELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQA
        LTKMTE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKSKEKAKQL+LEYAG E G+A
Subjt:  LTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ G SAV  P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY+D SMR K A L +KLAQMKMNK SC+PDSQYD +S N I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQ
        S+PTSSGGEVHRG EL+ FNRKMM+PEVK  METQ D LV+ALAM+VREASF++MED+VSF IWLDEKLSSLVD MEIL+HFDWPK KTDALREAAF YQ
Subjt:  SNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TI RYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+GPEKEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEP

Query:  NREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        NREFLL QG         FAG FD +SMKAFEELRSRVHTE GQ++
Subjt:  NREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

XP_031741050.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus]0.0e+0082.47Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QL
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q +++I+ GL +EEEEEANS++D +S+VNGR  DLEDGD +S EFQVLL  Q 
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QL

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKG
         S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL AK DDISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKG

Query:  ALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEE RGKIKELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQA
        LTKMTE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKSKEKAKQL+LEYAG E G+A
Subjt:  LTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ G SAV  P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY+D SMR K A L +KLAQMKMNK SC+PDSQYD +S N I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQ
        S+PTSSGGEVHRG EL+ FNRKMM+PEVK  METQ D LV+ALAM+VREASF++MED+VSF IWLDEKLSSLVD MEIL+HFDWPK KTDALREAAF YQ
Subjt:  SNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TI RYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+GPEKEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEP

Query:  NREFLLLQGFAGDFDFESMKAFEELRSRVHTEA
        NREFLL QG                 SRVH EA
Subjt:  NREFLLLQGFAGDFDFESMKAFEELRSRVHTEA

XP_031741051.1 protein CHUP1, chloroplastic isoform X3 [Cucumis sativus]0.0e+0080.02Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QL
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q +++I+ GL +EEEEEANS++D +S+VNGR  DLEDGD +S EFQVLL  Q 
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQ-YRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QL

Query:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKG
         S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL AK DDISMLN TISSLQAERKIL+EEI+KG
Subjt:  ESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKG

Query:  ALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT
        ALMKKELEE RGKIKELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKT
Subjt:  ALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKT

Query:  LTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQA
        LTKMTE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AG+                           E G+A
Subjt:  LTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQA

Query:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN
        ETDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ G SAV  P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVN
Subjt:  ETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVN

Query:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI
        SVATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY+D SMR K A L +KLAQMKMNK SC+PDSQYD +S N I
Subjt:  SVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMI

Query:  SNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQ
        S+PTSSGGEVHRG EL+ FNRKMM+PEVK  METQ D LV+ALAM+VREASF++MED+VSF IWLDEKLSSLVD MEIL+HFDWPK KTDALREAAF YQ
Subjt:  SNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQ

Query:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEP
        KLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TI RYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+GPEKEP
Subjt:  KLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEP

Query:  NREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        NREFLL QG         FAG FD +SMKAFEELRSRVHTE GQ++
Subjt:  NREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

XP_038906491.1 protein CHUP1, chloroplastic-like isoform X1 [Benincasa hispida]0.0e+0085.12Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQLESQNR
        MN+ G++VAVS+ AYAIRQLTIRSWSSLF P NCSENGED +KN         GLDEEEEEANS+ND +S+VNGR  D+EDGD  S EF+VLL  ES+N 
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQLESQNR

Query:  LLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGALMKK
         LDDNKKEEKVPEIQIEN+KIELERLVKLVMELEERK KLEGEL+MCD IKYSETDVTELRKQLKAK DDISMLN TISSLQAERKIL+EEI+KGALMKK
Subjt:  LLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGALMKK

Query:  ELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMT
        ELE ARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEA+KKEAELYKKQKAAKD EVELGELKRKNRELQHEK EL SKLEVMKARIKTLTKMT
Subjt:  ELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMT

Query:  ESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAETDHE
        ESEI+TKEREEAQKLKSENEDLIK LERLQMNRF+EVEELVYLRWINACLRYELRDNEIS GESARYLNKS SPKSKEKAKQL+LEYAGLE GQ ETDHE
Subjt:  ESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAETDHE

Query:  SNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATS
        SNFSHPFSSGIED+DNTSIDSSRSRTSSF EKPNSNLSLKKLIRN  G SAV SP IIGSSHRWKDPLEAVMALSAETLTLSEVRL+VSS KSVNSVATS
Subjt:  SNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATS

Query:  FQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPTS
        FQLMSKSV+ESLKQKYSTY+E+ KLA+GSEKQIKEKA NERAKSSGDA S   EYDDT++R KPAILP++L QMKMN+TS DPDSQ+D DSKNMISNPTS
Subjt:  FQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPTS

Query:  SGGEVHRGPELLWFNRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLK
        SGGEVHRGPEL+ FNRK+M+PEV AD+ETQGDLVVALAM+VREASF++MEDVVSF I LDEK  SLV+ MEIL+HFDWPKGKTDAL EAAF YQKLMKL+
Subjt:  SGGEVHRGPELLWFNRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLK

Query:  EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLL
        EEVSSFVDNPKLTCEVALNKMNSL+DKVEQS YGLF TRD TI +YEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIV EHNAL+GPEKEP+REFLL
Subjt:  EEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLL

Query:  LQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
         QG         FAG FDFESMKAFEELRSRVHTEAGQK+
Subjt:  LQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

TrEMBL top hitse value%identityAlignment
A0A061ECQ9 Hydroxyproline-rich glycoprotein family protein isoform 44.1e-20450.75Show/hide
Query:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE-------DMEKNVRQYRRILCGL-------DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHE
        + G +VA S+AA+A++QL +++  S       SENGE       +   N +Q+      L       +EEEE+   ++   + VNG   D+ D D+   E
Subjt:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE-------DMEKNVRQYRRILCGL-------DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHE

Query:  FQVLLQLESQNRLLDD---NKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAER
        F+ LL  E +  L  D     + EK+ E ++ N+  ELERL  LV ELEER+VKLEGEL+   G+K  E+D+ EL++QLK KT +I MLN TISSLQ+ER
Subjt:  FQVLLQLESQNRLLDD---NKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAER

Query:  KILREEIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSK
        K L+E+I  GA +KKELE AR KIKELQRQIQLDANQTK  LL LKQ+VS LQAKE+EA+K +AE+ KK KA K+ E+E+ EL+RKN+ELQHEK+ELT K
Subjt:  KILREEIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSK

Query:  LEVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLV
        L+  +A+I  L+ MTE+EI  + REE   L+  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +   G+ SAR LNKS SPKS+E AKQL+
Subjt:  LEVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLV

Query:  LEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMAL
        LEYAG E GQ +TD ESNFSHP S+G EDLDN SI SS SR SS S+KP+    LKK  R+K   SAV SP          RI  S H  + PLEA+M  
Subjt:  LEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMAL

Query:  SA--------------------ETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLE
        +A                    ET T+  +R +VSS  S NSVATSF LMS+SV+ SL++KY  Y++ HKLA+  EKQIK+KA+  RA+  GD S+    
Subjt:  SA--------------------ETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLE

Query:  YDDTSMRTKPAILPVKLAQMKMNKT-SCDPDSQYDDSKNMISNP--------------------------------TSSGGEVHRGPELLWFNRKMMRP-
        +   + R KP ILP KLAQ+K       D   Q +D K + S                                  T++ G+    P L      +  P 
Subjt:  YDDTSMRTKPAILPVKLAQMKMNKT-SCDPDSQYDDSKNMISNP--------------------------------TSSGGEVHRGPELLWFNRKMMRP-

Query:  -------------EVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVD
                      VKAD+ETQGD V +LA ++R ASFT +ED+V+F  WLDE+LS LVDE  +L+HFDWP+GK DALREAAFEYQ L+KL++++SSFVD
Subjt:  -------------EVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVD

Query:  NPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG-----
        +P L CE AL KM  LL+KVEQS Y L  TRD  I RY+E GIP++WLLD GVVGKIK+S V+LARKYMKR+  E + L GPEKEPNREF+LLQG     
Subjt:  NPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG-----

Query:  ----FAGDFDFESMKAFEELRSRVHTEAGQ
            FAG FD ESMKAFEELRSRVH++ G+
Subjt:  ----FAGDFDFESMKAFEELRSRVHTEAGQ

A0A0A0KT25 Uncharacterized protein0.0e+0083.08Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQYRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QLE
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLP TNCSENGED++KNV+Q      GL +EEEEEANS++D +S+VNGR  DLEDGD +S EFQVLL  Q  
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLP-TNCSENGEDMEKNVRQYRRILCGL-DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL--QLE

Query:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGA
        S+N LLDDN+KEEKVPE  IENSKIELERL+KL+MELEERKVKLEGEL+MCDGIKYSETDV ELRKQL AK DDISMLN TISSLQAERKIL+EEI+KGA
Subjt:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGA

Query:  LMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL
        LMKKELEE RGKIKELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAELYKKQKAAKD EVE GELK KNRELQHE QELTSKLEVMKARIKTL
Subjt:  LMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL

Query:  TKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAE
        TKMTE+EIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKSKEKAKQL+LEYAG E G+AE
Subjt:  TKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAE

Query:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS
        TDHESNFSHPFSS I++L+NTSIDSSRSRTSSF EKPNSNLSLKKLIRN+ G SAV  P  I SSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVNS
Subjt:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS

Query:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS
        VATSFQLMSKSVE+SL+QKYSTY+E+HKLAIGSEKQIKEK ENERAKSSGD+SS NLEY+D SMR K A L +KLAQMKMNK SC+PDSQYD +S N IS
Subjt:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS

Query:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQK
        +PTSSGGEVHRG EL+ FNRKMM+PEVK  METQ D LV+ALAM+VREASF++MED+VSF IWLDEKLSSLVD MEIL+HFDWPK KTDALREAAF YQK
Subjt:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQK

Query:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN
        LMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS Y L  TRD TI RYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+GPEKEPN
Subjt:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN

Query:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        REFLL QG         FAG FD +SMKAFEELRSRVHTE GQ++
Subjt:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

A0A1S3CSZ9 protein CHUP1, chloroplastic-like0.0e+0082.25Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL-QLE
        MN+  ++VAVS+AAYAI+QLTIRSW+S FLPTNCSENGED +KN         GLD   EEEEEA+S+NDA+S+VNGR  DLEDGD +S E QVLL Q  
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLD---EEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL-QLE

Query:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGA
        S+N LL   KKEEKVPE   E++KIE ERL+KLVMELEERKVKLEGEL+MCDGIKYSETDV ELRKQL AK +DISMLN TISSLQAERKIL+EEI+KGA
Subjt:  SQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGA

Query:  LMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL
        LMKKELEEAR KIKELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KKQKAAKD EVELGELK KNRELQHEKQELTSKLEVMKARIKTL
Subjt:  LMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTL

Query:  TKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAE
        TKMTESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYELR+N+I AGESARYLNKSSSPKS+EKAKQL+LEYAG+EFGQ E
Subjt:  TKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAE

Query:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS
        TDHESNFSHPFS GI++L+NTSIDSSRSRTSSFSEKPNSNLSLKKLIRN+ G SAV  P I GSSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVNS
Subjt:  TDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNS

Query:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS
        VATSFQLMSKSVEESL+QKYSTY+E++KLAIGSEKQIKEKAE+E+AKSSGD+SS NLEY D SMR K A LP+KLAQMK NK SC+PDSQ D DS N+IS
Subjt:  VATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMIS

Query:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQK
        NPTSSGGEVHRG EL+ FN+KMM+PEVKA METQGD LVVALAM+VREA F++MED+VSF I LDEKLSSLVD MEIL+HFDWP  KTDALREAAF YQK
Subjt:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQK

Query:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN
        LMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS   L  TRD  I RYEELGIPIDWLLDCGVVGKIKV CVELARKYMKRIVKEHNAL+GP+KEPN
Subjt:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN

Query:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        REFLL QG         FAG FD +SMKAFEELR+RVHTE GQK+
Subjt:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

A0A5D3BMR7 Protein CHUP10.0e+0085.2Show/hide
Query:  DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL-QLESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSE
        +EEEEEA+S+NDA+S+VNGR  DLEDGD +S E QVLL Q  S+N LL   KKEEKVPE   EN+KIE ERL+KLVMELEERKVKLEGEL+MCDGIKYSE
Subjt:  DEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLL-QLESQNRLLDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSE

Query:  TDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKK
        TDV ELRKQL AK +DISMLN TISSLQAERKIL+EEI+KGALMKKELEEARGKIKELQRQIQLDANQTKE LLLLKQRVS LQAKEEEAVKKEAEL+KK
Subjt:  TDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKK

Query:  QKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYEL
        QKAAKD EVELGELK KNRELQHEKQELTSKLEVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLE LQMNRFSEVEELVYLRWINACLRYEL
Subjt:  QKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYEL

Query:  RDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLS
        R+N+I AGESARYLNKSSSPKS+EKAKQL+LEYAG+EFGQ ETDHESNFSHPFS GI++L+NTSIDSSRSRTSSFSEKPNSNLSLKKLIRN+ G SAV  
Subjt:  RDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLS

Query:  PRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLE
        P I GSSHRWKDPLEAVMALSAETLTLSEVRL+VSSRKSVNSVATSFQLMSKSVEESL+QKYSTY+E+HKLAIGSEKQIKEKAE+E+AKSSGD+SS NLE
Subjt:  PRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGDASSPNLE

Query:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVV
        Y D SMR K A LP+KLAQMK NK SC+PDSQ D DS N+ISNPTSSGGEVHRG EL+ FN+KMM+PEVKA METQGD LVVALAM+VREA F++MED+V
Subjt:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYD-DSKNMISNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGD-LVVALAMKVREASFTHMEDVV

Query:  SFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPI
        SF I LDEKLSSLVD MEIL+HFDWP  KTDALREAAF YQKLMKL+EEVSSFVDNPKLTCEVALNKMNSLLDKVEQS   L  TRD  I RYEELGIPI
Subjt:  SFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPI

Query:  DWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG
        DWLLDCGVVGKIKV CVELARKYMKRIVKEHN L+GP+KEPNREFLL QG
Subjt:  DWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG

A0A6J1DWY5 protein CHUP1, chloroplastic-like1.4e-25263.27Show/hide
Query:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQLESQNR
        M K G++VAVS+AAYAI+QLTIRSWSS  LPTNCSENGE  EKN         GLD EE++ NS+N A+S+V+G   D         E + LL  +S++R
Subjt:  MNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQLESQNR

Query:  LLDDNKKEE-KVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGALMK
        LLD NKKEE KVPE  +EN+KIEL+RL+KLVMELEERKVKLE EL+M D +K  ++D TEL K+L+AK +D+SMLN TISSLQAERK L+EEIVKGA MK
Subjt:  LLDDNKKEE-KVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGALMK

Query:  KELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKM
        KELEEA+GKIKELQRQ+QLDANQTKEHL  LK+RVS LQAKEEEAVKKEA+LY+K KAAK  E+ELGELK+KNR+LQ EK+ELTSKLEVM+ARI TLT +
Subjt:  KELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKM

Query:  TESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAETDH
        TESEIIT+EREE +KL+  NE+L KQLE LQMNRFSEVEELVYLRW+NACLRYELRDNE   GESA  L+KS SPKSKEKAKQL+LEYAGL FGQ ETDH
Subjt:  TESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAETDH

Query:  ESNFSHP-FSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTL-SEVRLKVSSRKSVNSV
        ESNFSHP FSSGIED DNTS  SSRSRTSSF                                 RWKDPLEA +A S ETLT  SEV+ +VSSR SVNSV
Subjt:  ESNFSHP-FSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTL-SEVRLKVSSRKSVNSV

Query:  ATSFQLMSKSVEESLKQKYSTYEENHKLAI--GSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS
        ATSFQ MS+S EES+KQKYS Y+E+HKL I  G EKQIKEKAE ER K+S                                                  
Subjt:  ATSFQLMSKSVEESLKQKYSTYEENHKLAI--GSEKQIKEKAENERAKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS

Query:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDE-MEILQHFDWPKGKTDALREAAFEYQK
                 +  PE + F++K+M+ EVKADMET+GDLV+ L M V+  SFT+MEDVVSF IWLD+K SSLVDE + IL+HFDWP+GK+DALREAA EYQ 
Subjt:  NPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDE-MEILQHFDWPKGKTDALREAAFEYQK

Query:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN
        LMKL EEVSSFVD+PKLT EVAL  M+SLL K+EQS + +   R+  I +YEELGIP+DWLLD GVVGK+KV  VELARKYMKRI+ E NAL+GP KEPN
Subjt:  LMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPN

Query:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQK
        REFLLLQG         FAG FD ESMKAFEELR+R+HTEAGQK
Subjt:  REFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQK

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic2.9e-18644.68Show/hide
Query:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQL
        + G +VA S+AA  +++L ++       P+  S+NGE  D E++V        + +    +EEEEE   +N   ++  G   D  D D+   EF+ LL  
Subjt:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQL

Query:  ESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILRE
        E +  L DD+   EK       E+++  +  ELERL +LV ELEER+VKLEGEL+   G+K  E+D+ EL++QLK KT +I MLN TI+SLQAERK L+E
Subjt:  ESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILRE

Query:  EIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMK
        E+ +  +++KELE AR KIKELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL+  +
Subjt:  EIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMK

Query:  ARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLVLEYAG
        ARI TL+ MTES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+L+LEYAG
Subjt:  ARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLVLEYAG

Query:  LEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMALSA---
         E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K   S   SP          R+  S ++ + PLE++M  +A   
Subjt:  LEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMALSA---

Query:  -----------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--ASSPNLE
                         ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+  G   A  P L 
Subjt:  -----------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--ASSPNLE

Query:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG---------------------------------
             ++ K  ++P  +          +  ++   S+N  +                    P S+GG                                 
Subjt:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG---------------------------------

Query:  ----------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAMKVREA
                              +VHR PEL+ F + +M+ E                                     VKAD+ETQGD V +LA +VR +
Subjt:  ----------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAMKVREA

Query:  SFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATIL
        SFT +ED+++F  WLDE+LS LVDE  +L+HFDWP+GK DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TRD  I 
Subjt:  SFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATIL

Query:  RYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        RY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++G +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  +
Subjt:  RYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein5.6e-4438Show/hide
Query:  SQYDDSKNMISNPTSSGGEVHRGPELL---WFNRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGK
        ++ D+S+N+  +   +  +V+     +     NR      +KAD+ET+G+ +  L  KV    F+ MEDV+ F  WLD++L++L DE  +L+HF WP+ K
Subjt:  SQYDDSKNMISNPTSSGGEVHRGPELL---WFNRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGK

Query:  TDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVK
         D L+EAA EY++L KL++E+SS+ D+P +   VAL KM +LLDK EQ    L   R +++  Y++  IP++W+LD G++ KIK + ++LA+ YM R+  
Subjt:  TDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVK

Query:  EHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRV
        E  +    ++E  +E LLLQG         FAG  D E++ A EE++ RV
Subjt:  EHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein2.0e-18744.68Show/hide
Query:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQL
        + G +VA S+AA  +++L ++       P+  S+NGE  D E++V        + +    +EEEEE   +N   ++  G   D  D D+   EF+ LL  
Subjt:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQL

Query:  ESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILRE
        E +  L DD+   EK       E+++  +  ELERL +LV ELEER+VKLEGEL+   G+K  E+D+ EL++QLK KT +I MLN TI+SLQAERK L+E
Subjt:  ESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILRE

Query:  EIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMK
        E+ +  +++KELE AR KIKELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL+  +
Subjt:  EIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMK

Query:  ARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLVLEYAG
        ARI TL+ MTES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+L+LEYAG
Subjt:  ARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLVLEYAG

Query:  LEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMALSA---
         E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K   S   SP          R+  S ++ + PLE++M  +A   
Subjt:  LEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMALSA---

Query:  -----------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--ASSPNLE
                         ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+  G   A  P L 
Subjt:  -----------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--ASSPNLE

Query:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG---------------------------------
             ++ K  ++P  +          +  ++   S+N  +                    P S+GG                                 
Subjt:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG---------------------------------

Query:  ----------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAMKVREA
                              +VHR PEL+ F + +M+ E                                     VKAD+ETQGD V +LA +VR +
Subjt:  ----------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAMKVREA

Query:  SFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATIL
        SFT +ED+++F  WLDE+LS LVDE  +L+HFDWP+GK DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TRD  I 
Subjt:  SFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATIL

Query:  RYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        RY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++G +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  +
Subjt:  RYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein2.0e-18744.68Show/hide
Query:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQL
        + G +VA S+AA  +++L ++       P+  S+NGE  D E++V        + +    +EEEEE   +N   ++  G   D  D D+   EF+ LL  
Subjt:  KFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGE--DMEKNV-----RQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQL

Query:  ESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILRE
        E +  L DD+   EK       E+++  +  ELERL +LV ELEER+VKLEGEL+   G+K  E+D+ EL++QLK KT +I MLN TI+SLQAERK L+E
Subjt:  ESQNRLLDDNKKEEKVP-----EIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILRE

Query:  EIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMK
        E+ +  +++KELE AR KIKELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ KL+  +
Subjt:  EIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMK

Query:  ARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLVLEYAG
        ARI TL+ MTES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+L+LEYAG
Subjt:  ARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLVLEYAG

Query:  LEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMALSA---
         E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K   S   SP          R+  S ++ + PLE++M  +A   
Subjt:  LEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMALSA---

Query:  -----------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--ASSPNLE
                         ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+  G   A  P L 
Subjt:  -----------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--ASSPNLE

Query:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG---------------------------------
             ++ K  ++P  +          +  ++   S+N  +                    P S+GG                                 
Subjt:  YDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG---------------------------------

Query:  ----------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAMKVREA
                              +VHR PEL+ F + +M+ E                                     VKAD+ETQGD V +LA +VR +
Subjt:  ----------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAMKVREA

Query:  SFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATIL
        SFT +ED+++F  WLDE+LS LVDE  +L+HFDWP+GK DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TRD  I 
Subjt:  SFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATIL

Query:  RYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS
        RY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++G +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  +
Subjt:  RYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQKS

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein1.3e-15745.82Show/hide
Query:  KILREEIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSK
        K L+EE+ +  +++KELE AR KIKELQRQIQLDANQTK  LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV++ ELKRKNRELQHEK+EL+ K
Subjt:  KILREEIVKGALMKKELEEARGKIKELQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSK

Query:  LEVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLV
        L+  +ARI TL+ MTES+ + K REE   LK  NEDL+KQ+E LQMNRFSEVEELVYLRW+NACLRYELR+ +  AG+ SAR L+K+ SPKS+ KAK+L+
Subjt:  LEVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGE-SARYLNKSSSPKSKEKAKQLV

Query:  LEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMAL
        LEYAG E GQ +TD ESN+S P S G +D DN S+DSS SR SSFS+KP     LKK  ++K   S   SP          R+  S ++ + PLE++M  
Subjt:  LEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSEKPNSNLSLKKLIRNKCGWSAVLSP----------RIIGSSHRWKDPLEAVMAL

Query:  SA--------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--AS
        +A                    ET  L  +R +    S  + +NSVA SF +MSKSV+  L +KY  Y++ HKLA+  EK IK KA+  RA+  G   A 
Subjt:  SA--------------------ETLTLSEVRLK---VSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENERAKSSGD--AS

Query:  SPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG----------------------------
         P L      ++ K  ++P  +          +  ++   S+N  +                    P S+GG                            
Subjt:  SPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMIS-------------------NPTSSGG----------------------------

Query:  ---------------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAM
                                   +VHR PEL+ F + +M+ E                                     VKAD+ETQGD V +LA 
Subjt:  ---------------------------EVHRGPELLWFNRKMMRPE-------------------------------------VKADMETQGDLVVALAM

Query:  KVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTR
        +VR +SFT +ED+++F  WLDE+LS LVDE  +L+HFDWP+GK DALREAAFEYQ LMKL+++V+SFVD+P L+CE AL KM  LL+KVEQS Y L  TR
Subjt:  KVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTR

Query:  DATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQK
        D  I RY+E GIP+DWL D GVVGKIK+S V+LA+KYMKR+  E ++++G +K+PNREFLLLQG         FAG FD ESMKAFEELRSR  TE+G  
Subjt:  DATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FAGDFDFESMKAFEELRSRVHTEAGQK

Query:  S
        +
Subjt:  S

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-4644.09Show/hide
Query:  NRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTC
        NR +    +K D+ETQGD +  L  +V  A+F+ +EDVV F  WLD++LS LVDE  +L+HF+WP+ K DALREAAF Y  L KL  E S F ++P+ + 
Subjt:  NRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDVVSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTC

Query:  EVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FA
          AL KM +L +K+E   Y L   R++   +++   IP+DW+L+ G+  +IK++ V+LA KYMKR+  E  A+ G    P  E L++QG         FA
Subjt:  EVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVVGKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQG---------FA

Query:  GDFDFESMKAFEELRSRVHT
        G FD E+MKAFEELR +  +
Subjt:  GDFDFESMKAFEELRSRVHT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGATCAAATTTGCTATTTAAGAAGATGAACAAATTTGGTATTATTGTTGCTGTTTCAGTTGCAGCTTATGCAATTAGGCAGCTCACAATCAGATCATGGAGCTCATT
ATTCTTGCCTACCAACTGTTCAGAAAATGGAGAAGACATGGAGAAGAACGTAAGGCAATACCGAAGAATTCTTTGTGGATTGGATGAAGAAGAAGAGGAAGCTAATTCAT
TAAATGATGCAAGTAGTGAAGTTAATGGTAGGATCTTTGATCTTGAAGATGGAGATCTTAATTCACATGAGTTTCAAGTACTTTTACAACTAGAGTCCCAAAATCGGTTG
CTTGATGACAATAAGAAAGAAGAAAAAGTGCCTGAAATCCAGATAGAAAACAGTAAGATTGAATTGGAAAGGTTGGTGAAACTAGTGATGGAATTGGAGGAGAGGAAAGT
GAAACTTGAAGGTGAATTGGTCATGTGCGACGGAATAAAATACAGTGAAACCGACGTCACAGAGTTACGAAAGCAGCTCAAGGCCAAGACCGATGATATCAGTATGCTTA
ATACCACAATCAGCTCTTTGCAGGCTGAGAGAAAGATTCTACGAGAAGAGATAGTGAAGGGAGCACTGATGAAGAAGGAACTAGAGGAGGCTAGGGGAAAGATTAAGGAG
CTGCAAAGGCAAATTCAGCTTGATGCAAACCAAACAAAAGAACACTTATTATTACTCAAACAACGAGTTTCTGCTTTGCAGGCGAAAGAGGAAGAGGCCGTCAAGAAAGA
AGCAGAACTTTATAAGAAACAGAAAGCGGCGAAGGATTCCGAGGTAGAACTTGGGGAGCTTAAGCGGAAGAATCGAGAGCTTCAACACGAAAAGCAGGAACTAACTTCTA
AACTAGAAGTTATGAAGGCCAGAATCAAAACACTGACTAAAATGACAGAGAGTGAAATAATCACTAAGGAAAGAGAAGAAGCTCAAAAGTTAAAGAGTGAAAATGAGGAT
CTGATAAAGCAACTTGAAAGGCTTCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTCTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGAGACAACGA
GATATCAGCAGGCGAATCTGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCGTGTTAGAGTATGCAGGATTGGAGTTCGGACAAG
CAGAAACAGATCATGAAAGCAACTTTTCTCATCCATTCTCCTCTGGGATTGAAGATCTTGACAATACTTCAATTGATAGTTCAAGGAGCAGAACTAGTAGTTTCAGTGAG
AAGCCTAATTCAAATTTGAGTTTGAAGAAATTGATCAGAAACAAGTGTGGTTGGAGTGCTGTTTTGTCTCCGCGCATAATCGGTTCGAGCCACAGATGGAAGGATCCTTT
AGAAGCAGTGATGGCTCTGAGTGCTGAAACTCTAACTCTCTCAGAAGTTAGATTGAAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACGTCATTCCAACTCATGT
CTAAATCAGTTGAAGAAAGTCTAAAACAGAAATATTCAACATATGAAGAGAATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGCTGAGAATGAAAGG
GCAAAAAGTTCTGGTGATGCTTCGAGTCCGAATTTAGAATATGATGACACTAGCATGAGAACGAAGCCTGCAATTTTACCCGTTAAACTTGCTCAAATGAAGATGAATAA
AACTTCTTGTGATCCAGATAGCCAATATGATGATAGCAAAAATATGATCTCCAATCCAACTTCTTCTGGAGGTGAAGTTCATCGAGGACCGGAGTTACTTTGGTTCAATC
GAAAAATGATGAGACCTGAGGTGAAAGCTGATATGGAAACTCAAGGAGACTTGGTTGTGGCATTGGCAATGAAAGTTAGAGAAGCGTCTTTCACACACATGGAAGATGTT
GTCTCCTTCGCTATATGGCTCGATGAAAAGTTATCTTCTTTGGTTGATGAGATGGAGATTCTCCAGCATTTCGATTGGCCCAAGGGAAAAACAGATGCATTAAGAGAAGC
AGCGTTTGAATATCAGAAGCTGATGAAACTAAAAGAAGAAGTCTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAG
ATAAGGTGGAGCAGAGTGCCTATGGATTGTTTCATACAAGGGATGCAACGATTTTGCGATACGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGTGGAGTTGTT
GGCAAGATAAAGGTGTCATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACATAATGCACTAAATGGACCTGAGAAAGAACCAAACAGAGAGTTTTT
GCTCTTACAAGGATTTGCAGGAGACTTTGATTTCGAGAGCATGAAGGCTTTTGAAGAGCTGAGAAGTCGTGTCCATACAGAGGCAGGACAAAAAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGATCAAATTTGCTATTTAAGAAGATGAACAAATTTGGTATTATTGTTGCTGTTTCAGTTGCAGCTTATGCAATTAGGCAGCTCACAATCAGATCATGGAGCTCATT
ATTCTTGCCTACCAACTGTTCAGAAAATGGAGAAGACATGGAGAAGAACGTAAGGCAATACCGAAGAATTCTTTGTGGATTGGATGAAGAAGAAGAGGAAGCTAATTCAT
TAAATGATGCAAGTAGTGAAGTTAATGGTAGGATCTTTGATCTTGAAGATGGAGATCTTAATTCACATGAGTTTCAAGTACTTTTACAACTAGAGTCCCAAAATCGGTTG
CTTGATGACAATAAGAAAGAAGAAAAAGTGCCTGAAATCCAGATAGAAAACAGTAAGATTGAATTGGAAAGGTTGGTGAAACTAGTGATGGAATTGGAGGAGAGGAAAGT
GAAACTTGAAGGTGAATTGGTCATGTGCGACGGAATAAAATACAGTGAAACCGACGTCACAGAGTTACGAAAGCAGCTCAAGGCCAAGACCGATGATATCAGTATGCTTA
ATACCACAATCAGCTCTTTGCAGGCTGAGAGAAAGATTCTACGAGAAGAGATAGTGAAGGGAGCACTGATGAAGAAGGAACTAGAGGAGGCTAGGGGAAAGATTAAGGAG
CTGCAAAGGCAAATTCAGCTTGATGCAAACCAAACAAAAGAACACTTATTATTACTCAAACAACGAGTTTCTGCTTTGCAGGCGAAAGAGGAAGAGGCCGTCAAGAAAGA
AGCAGAACTTTATAAGAAACAGAAAGCGGCGAAGGATTCCGAGGTAGAACTTGGGGAGCTTAAGCGGAAGAATCGAGAGCTTCAACACGAAAAGCAGGAACTAACTTCTA
AACTAGAAGTTATGAAGGCCAGAATCAAAACACTGACTAAAATGACAGAGAGTGAAATAATCACTAAGGAAAGAGAAGAAGCTCAAAAGTTAAAGAGTGAAAATGAGGAT
CTGATAAAGCAACTTGAAAGGCTTCAGATGAATAGGTTTAGTGAAGTTGAAGAGTTAGTCTACCTTCGTTGGATCAATGCTTGCTTGAGGTATGAGCTTCGAGACAACGA
GATATCAGCAGGCGAATCTGCTCGTTATCTCAATAAGAGTTCAAGTCCAAAGTCAAAAGAAAAGGCAAAACAACTCGTGTTAGAGTATGCAGGATTGGAGTTCGGACAAG
CAGAAACAGATCATGAAAGCAACTTTTCTCATCCATTCTCCTCTGGGATTGAAGATCTTGACAATACTTCAATTGATAGTTCAAGGAGCAGAACTAGTAGTTTCAGTGAG
AAGCCTAATTCAAATTTGAGTTTGAAGAAATTGATCAGAAACAAGTGTGGTTGGAGTGCTGTTTTGTCTCCGCGCATAATCGGTTCGAGCCACAGATGGAAGGATCCTTT
AGAAGCAGTGATGGCTCTGAGTGCTGAAACTCTAACTCTCTCAGAAGTTAGATTGAAAGTTTCTTCAAGAAAATCTGTCAACTCAGTTGCAACGTCATTCCAACTCATGT
CTAAATCAGTTGAAGAAAGTCTAAAACAGAAATATTCAACATATGAAGAGAATCATAAGTTGGCCATAGGGAGTGAGAAGCAAATCAAGGAAAAGGCTGAGAATGAAAGG
GCAAAAAGTTCTGGTGATGCTTCGAGTCCGAATTTAGAATATGATGACACTAGCATGAGAACGAAGCCTGCAATTTTACCCGTTAAACTTGCTCAAATGAAGATGAATAA
AACTTCTTGTGATCCAGATAGCCAATATGATGATAGCAAAAATATGATCTCCAATCCAACTTCTTCTGGAGGTGAAGTTCATCGAGGACCGGAGTTACTTTGGTTCAATC
GAAAAATGATGAGACCTGAGGTGAAAGCTGATATGGAAACTCAAGGAGACTTGGTTGTGGCATTGGCAATGAAAGTTAGAGAAGCGTCTTTCACACACATGGAAGATGTT
GTCTCCTTCGCTATATGGCTCGATGAAAAGTTATCTTCTTTGGTTGATGAGATGGAGATTCTCCAGCATTTCGATTGGCCCAAGGGAAAAACAGATGCATTAAGAGAAGC
AGCGTTTGAATATCAGAAGCTGATGAAACTAAAAGAAGAAGTCTCTTCTTTTGTTGACAATCCCAAACTCACATGTGAAGTTGCTTTAAACAAAATGAACTCCTTGCTAG
ATAAGGTGGAGCAGAGTGCCTATGGATTGTTTCATACAAGGGATGCAACGATTTTGCGATACGAGGAACTCGGAATTCCCATTGATTGGTTGTTGGATTGTGGAGTTGTT
GGCAAGATAAAGGTGTCATGTGTAGAATTGGCAAGGAAGTACATGAAACGCATAGTAAAAGAACATAATGCACTAAATGGACCTGAGAAAGAACCAAACAGAGAGTTTTT
GCTCTTACAAGGATTTGCAGGAGACTTTGATTTCGAGAGCATGAAGGCTTTTGAAGAGCTGAGAAGTCGTGTCCATACAGAGGCAGGACAAAAAAGTTGAATATTATATG
TTCCTCAGTTTTTCTTTTACTATTTGTATTGTATAATTATCTCTAGCTTACTTTATCTAGATAATCTCAAGAAATTTTATGAATGTAATAGGAAAAAAAGAATGAAAAGA
AATGAAAAATGATGGATAAGAATAATAATCAAAGTTGATGAGGCCTAATCTAAAACCTTGAGGGCAAAGAATTGTAACTAAGGCCTCTTCAAGACTCAAAATATAAGAAA
GAGAGAGTAGAGAGGGCGATGATCAATAGACACAGTGTCAACTATTCAAAAATGTCACCAAAAAGGG
Protein sequenceShow/hide protein sequence
MRSNLLFKKMNKFGIIVAVSVAAYAIRQLTIRSWSSLFLPTNCSENGEDMEKNVRQYRRILCGLDEEEEEANSLNDASSEVNGRIFDLEDGDLNSHEFQVLLQLESQNRL
LDDNKKEEKVPEIQIENSKIELERLVKLVMELEERKVKLEGELVMCDGIKYSETDVTELRKQLKAKTDDISMLNTTISSLQAERKILREEIVKGALMKKELEEARGKIKE
LQRQIQLDANQTKEHLLLLKQRVSALQAKEEEAVKKEAELYKKQKAAKDSEVELGELKRKNRELQHEKQELTSKLEVMKARIKTLTKMTESEIITKEREEAQKLKSENED
LIKQLERLQMNRFSEVEELVYLRWINACLRYELRDNEISAGESARYLNKSSSPKSKEKAKQLVLEYAGLEFGQAETDHESNFSHPFSSGIEDLDNTSIDSSRSRTSSFSE
KPNSNLSLKKLIRNKCGWSAVLSPRIIGSSHRWKDPLEAVMALSAETLTLSEVRLKVSSRKSVNSVATSFQLMSKSVEESLKQKYSTYEENHKLAIGSEKQIKEKAENER
AKSSGDASSPNLEYDDTSMRTKPAILPVKLAQMKMNKTSCDPDSQYDDSKNMISNPTSSGGEVHRGPELLWFNRKMMRPEVKADMETQGDLVVALAMKVREASFTHMEDV
VSFAIWLDEKLSSLVDEMEILQHFDWPKGKTDALREAAFEYQKLMKLKEEVSSFVDNPKLTCEVALNKMNSLLDKVEQSAYGLFHTRDATILRYEELGIPIDWLLDCGVV
GKIKVSCVELARKYMKRIVKEHNALNGPEKEPNREFLLLQGFAGDFDFESMKAFEELRSRVHTEAGQKS