CuGenDBv2

Sgr009290 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr009290
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein CHUP1, chloroplastic
Genome location: tig00007747:5..3311
RNA-Seq Expression: Sgr009290
Synteny: Sgr009290
Gene Ontology terms:
GO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like


Homology
GenBank top hits (e value, % identity, alignment)
ADN34042.1 hydroxyproline-rich glycoprotein family protein [Cucumis melo subsp. melo] (e value: 1.1e-145; % identity: 79.62)
Query:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS
        MEQ+GKSTA   S + S  S  GRVS KAMESPK +VSV  V+STPQS VKKQSSRVSRSLT NAP         KK RDGE  GVSARTVNRGGLKQVS
Subjt:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS

Query:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS
         RR  S AGSC NV+DCNGVKS LQEKL FAE+LIKDLQSQL+ LKEEL+KSQ+LN+ELQSQNDLLVRDLAAAEAK A+ASNN +R++V+E  Q   +D+
Subjt:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS

Query:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL
        Q LENGKL+ QPSSSC+NVR+L+CKA PPR  PPPPPPPLP QS+PRA ATQKSPDL+RLFHSLRKKEGKRDPPLLGKP AINAHNSIVGEIQNRSAHLL
Subjt:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL

Query:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        AIKADIETKGEFINGLIDKVLVAA+TDIED+LKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

XP_004147632.1 protein CHUP1, chloroplastic [Cucumis sativus] (e value: 2.3e-146; % identity: 79.14)
Query:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS
        MEQ+GKS A   S + S  S  GRVS KAMESPK +VSV AV+STPQS VKKQSS+VSRSLT N P         KK RDGE  GVSARTVNRGGLKQV 
Subjt:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS

Query:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS
         RR  SGAGSC NV+DCNGVKS LQEKLCFAE+LIKDLQSQL+ LKEEL KSQ+LN ELQSQNDLLVRDLAAAEAK A+ SNN +R++V+E  Q +A+D+
Subjt:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS

Query:  QILENGKLQAQPSSSCQNVRNLECKAPPRR--PPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHL
        Q LENGKL+ QPSSSC+NVR+L+CK PP R  PPPPPPPPLP QS+PRA ATQKSPDL+RLFHSLRKKEGKRDPPLLGKP AINAHNSIVGEIQNRSAHL
Subjt:  QILENGKLQAQPSSSCQNVRNLECKAPPRR--PPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHL

Query:  LAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        LAIKADIETKGEFINGLIDKVLVAA+TDIED+LKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  LAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

XP_008439003.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo] (e value: 1.1e-145; % identity: 79.62)
Query:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS
        MEQ+GKSTA   S + S  S  GRVS KAMESPK +VSV  V+STPQS VKKQSSRVSRSLT NAP         KK RDGE  GVSARTVNRGGLKQVS
Subjt:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS

Query:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS
         RR  S AGSC NV+DCNGVKS LQEKL FAE+LIKDLQSQL+ LKEEL+KSQ+LN+ELQSQNDLLVRDLAAAEAK A+ASNN +R++V+E  Q   +D+
Subjt:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS

Query:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL
        Q LENGKL+ QPSSSC+NVR+L+CKA PPR  PPPPPPPLP QS+PRA ATQKSPDL+RLFHSLRKKEGKRDPPLLGKP AINAHNSIVGEIQNRSAHLL
Subjt:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL

Query:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        AIKADIETKGEFINGLIDKVLVAA+TDIED+LKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

XP_022138255.1 protein CHUP1, chloroplastic isoform X1 [Momordica charantia] (e value: 3.2e-148; % identity: 81.75)
Query:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG
        MEQRGKSTAA TS  NSAM +R GRVS +AME    SPKPLVS  AVQSTPQSVVKKQSSRVSRSL LNAP         KK RDGE   VSARTVNRGG
Subjt:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG

Query:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP
        LKQVSQ RRSSG GSC+ V DC   KSELQEKLCFAE+LIKDL+SQLL+LKEELQKSQ+LN+ELQSQNDLLVRDLAAAE KLANASNN Q+         
Subjt:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP

Query:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR
         AK++Q LENGKLQAQPSSSCQNVR+L CKAPP R  PPPPPPPLP QSLPRAV TQKSPDLIRLFHSLRKKEGKRDPPLLGKP  INAHNSIVGEIQNR
Subjt:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR

Query:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        S+HLLAI+ADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

XP_022138256.1 protein CHUP1, chloroplastic isoform X2 [Momordica charantia] (e value: 7.2e-148; % identity: 81.75)
Query:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG
        MEQRGKSTAA TS  NSAM +R GRVS +AME    SPKPLVS  AVQSTPQSVVKKQSSRVSRSL LNAP         KK RDGE   VSARTVNRGG
Subjt:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG

Query:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP
        LKQVSQ RRSSG GSC+ V DC   KSELQEKLCFAE+LIKDL+SQLL+LKEELQKSQ+LN+ELQSQNDLLVRDLAAAE KLANASNN Q          
Subjt:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP

Query:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR
         AK++Q LENGKLQAQPSSSCQNVR+L CKAPP R  PPPPPPPLP QSLPRAV TQKSPDLIRLFHSLRKKEGKRDPPLLGKP  INAHNSIVGEIQNR
Subjt:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR

Query:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        S+HLLAI+ADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5G9 Uncharacterized protein (e value: 1.1e-146; % identity: 79.14)
Query:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS
        MEQ+GKS A   S + S  S  GRVS KAMESPK +VSV AV+STPQS VKKQSS+VSRSLT N P         KK RDGE  GVSARTVNRGGLKQV 
Subjt:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS

Query:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS
         RR  SGAGSC NV+DCNGVKS LQEKLCFAE+LIKDLQSQL+ LKEEL KSQ+LN ELQSQNDLLVRDLAAAEAK A+ SNN +R++V+E  Q +A+D+
Subjt:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS

Query:  QILENGKLQAQPSSSCQNVRNLECKAPPRR--PPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHL
        Q LENGKL+ QPSSSC+NVR+L+CK PP R  PPPPPPPPLP QS+PRA ATQKSPDL+RLFHSLRKKEGKRDPPLLGKP AINAHNSIVGEIQNRSAHL
Subjt:  QILENGKLQAQPSSSCQNVRNLECKAPPRR--PPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHL

Query:  LAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        LAIKADIETKGEFINGLIDKVLVAA+TDIED+LKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  LAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

A0A1S4DT91 protein CHUP1, chloroplastic (e value: 5.6e-146; % identity: 79.62)
Query:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS
        MEQ+GKSTA   S + S  S  GRVS KAMESPK +VSV  V+STPQS VKKQSSRVSRSLT NAP         KK RDGE  GVSARTVNRGGLKQVS
Subjt:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS

Query:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS
         RR  S AGSC NV+DCNGVKS LQEKL FAE+LIKDLQSQL+ LKEEL+KSQ+LN+ELQSQNDLLVRDLAAAEAK A+ASNN +R++V+E  Q   +D+
Subjt:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS

Query:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL
        Q LENGKL+ QPSSSC+NVR+L+CKA PPR  PPPPPPPLP QS+PRA ATQKSPDL+RLFHSLRKKEGKRDPPLLGKP AINAHNSIVGEIQNRSAHLL
Subjt:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL

Query:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        AIKADIETKGEFINGLIDKVLVAA+TDIED+LKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

A0A6J1C8Y7 protein CHUP1, chloroplastic isoform X1 (e value: 1.6e-148; % identity: 81.75)
Query:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG
        MEQRGKSTAA TS  NSAM +R GRVS +AME    SPKPLVS  AVQSTPQSVVKKQSSRVSRSL LNAP         KK RDGE   VSARTVNRGG
Subjt:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG

Query:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP
        LKQVSQ RRSSG GSC+ V DC   KSELQEKLCFAE+LIKDL+SQLL+LKEELQKSQ+LN+ELQSQNDLLVRDLAAAE KLANASNN Q+         
Subjt:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP

Query:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR
         AK++Q LENGKLQAQPSSSCQNVR+L CKAPP R  PPPPPPPLP QSLPRAV TQKSPDLIRLFHSLRKKEGKRDPPLLGKP  INAHNSIVGEIQNR
Subjt:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR

Query:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        S+HLLAI+ADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

A0A6J1C980 protein CHUP1, chloroplastic isoform X2 (e value: 3.5e-148; % identity: 81.75)
Query:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG
        MEQRGKSTAA TS  NSAM +R GRVS +AME    SPKPLVS  AVQSTPQSVVKKQSSRVSRSL LNAP         KK RDGE   VSARTVNRGG
Subjt:  MEQRGKSTAAATSASNSAMSAR-GRVSSKAME----SPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGG

Query:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP
        LKQVSQ RRSSG GSC+ V DC   KSELQEKLCFAE+LIKDL+SQLL+LKEELQKSQ+LN+ELQSQNDLLVRDLAAAE KLANASNN Q          
Subjt:  LKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQP

Query:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR
         AK++Q LENGKLQAQPSSSCQNVR+L CKAPP R  PPPPPPPLP QSLPRAV TQKSPDLIRLFHSLRKKEGKRDPPLLGKP  INAHNSIVGEIQNR
Subjt:  NAKDSQILENGKLQAQPSSSCQNVRNLECKAPP-RRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNR

Query:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        S+HLLAI+ADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

E5GC44 Hydroxyproline-rich glycoprotein family protein (e value: 5.6e-146; % identity: 79.62)
Query:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS
        MEQ+GKSTA   S + S  S  GRVS KAMESPK +VSV  V+STPQS VKKQSSRVSRSLT NAP         KK RDGE  GVSARTVNRGGLKQVS
Subjt:  MEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLTLNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVS

Query:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS
         RR  S AGSC NV+DCNGVKS LQEKL FAE+LIKDLQSQL+ LKEEL+KSQ+LN+ELQSQNDLLVRDLAAAEAK A+ASNN +R++V+E  Q   +D+
Subjt:  QRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNNVQREAVAEYPQPNAKDS

Query:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL
        Q LENGKL+ QPSSSC+NVR+L+CKA PPR  PPPPPPPLP QS+PRA ATQKSPDL+RLFHSLRKKEGKRDPPLLGKP AINAHNSIVGEIQNRSAHLL
Subjt:  QILENGKLQAQPSSSCQNVRNLECKA-PPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAINAHNSIVGEIQNRSAHLL

Query:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        AIKADIETKGEFINGLIDKVLVAA+TDIED+LKFVDWLDSQLSSLADERAVLKHFKWPEKKAD MREAAIEYR
Subjt:  AIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

SwissProt top hits (e value, % identity, alignment)
Q9LI74 Protein CHUP1, chloroplastic (e value: 2.0e-31; % identity: 47.27)
Query:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET
        PP  PPP           PPPPP P  +L R         ++P+L+  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLA+KAD+ET
Subjt:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET

Query:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        +G+F+  L  +V  +++TDIED+L FV WLD +LS L DERAVLKHF WPE KAD +REAA EY+
Subjt:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

Arabidopsis top hits (e value, % identity, alignment)
AT1G48280.1 hydroxyproline-rich glycoprotein family protein (e value: 1.5e-58; % identity: 41.82)
Query:  SVVKKQSSRVSRSLTLNAPKS------------RDDVYGSKKSRDGERAGVS---ARTVNRGGL--------KQVSQRRRSSGAGSCANVDDCNGVKSEL
        SV+ K  ++    LT   PKS            R  +    KS + E A ++   AR+VNR  +        + +S++   +   + A  D+      EL
Subjt:  SVVKKQSSRVSRSLTLNAPKS------------RDDVYGSKKSRDGERAGVS---ARTVNRGGL--------KQVSQRRRSSGAGSCANVDDCNGVKSEL

Query:  QEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNN--------------VQREAVAEYPQPNAKDSQILENGKLQ-
        +EKL   E+LIKDLQ Q+L LK EL++++N N+EL+  N  L +DL +AEAK+++ S+N              +QR   ++  QP  K    +E+ +L  
Subjt:  QEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAAAEAKLANASNN--------------VQREAVAEYPQPNAKDSQILENGKLQ-

Query:  ------------------AQPSSSC-QNVRNLECKAPPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRD--PPLLGKPPAIN-AHNSI
                            P+SS  +   N    APP  PPPPPPPP   + L +A   QKSP + +LF  L K++  R+    + G    +N AHNSI
Subjt:  ------------------AQPSSSC-QNVRNLECKAPPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRD--PPLLGKPPAIN-AHNSI

Query:  VGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        VGEIQNRSAHL+AIKADIETKGEFIN LI KVL   ++D+EDV+KFVDWLD +L++LADERAVLKHFKWPEKKADT++EAA+EYR
Subjt:  VGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein (e value: 1.4e-32; % identity: 47.27)
Query:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET
        PP  PPP           PPPPP P  +L R         ++P+L+  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLA+KAD+ET
Subjt:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET

Query:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        +G+F+  L  +V  +++TDIED+L FV WLD +LS L DERAVLKHF WPE KAD +REAA EY+
Subjt:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein (e value: 1.4e-32; % identity: 47.27)
Query:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET
        PP  PPP           PPPPP P  +L R         ++P+L+  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLA+KAD+ET
Subjt:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET

Query:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        +G+F+  L  +V  +++TDIED+L FV WLD +LS L DERAVLKHF WPE KAD +REAA EY+
Subjt:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein (e value: 1.4e-32; % identity: 47.27)
Query:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET
        PP  PPP           PPPPP P  +L R         ++P+L+  + SL K+E K++  P L+  G   +  A N+++GEI+NRS  LLA+KAD+ET
Subjt:  PPRRPPP-----------PPPPPLPFQSLPRAVA----TQKSPDLIRLFHSLRKKEGKRD--PPLL--GKPPAINAHNSIVGEIQNRSAHLLAIKADIET

Query:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
        +G+F+  L  +V  +++TDIED+L FV WLD +LS L DERAVLKHF WPE KAD +REAA EY+
Subjt:  KGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.4e-29; % identity: 37.18)
Query:  NLIKDLQ--SQLLVLKEELQKSQNLNMELQS----QNDLLVRDLAAAEAKLAN-------ASNNVQREAVAEYPQPNAKDSQIL-ENGKLQAQPSSSCQN
        NLI+ L+    L  L E +   +N N  + S      D+  +D   + ++ +N       +S +  R  V   P+P  K S  L ++ + +A P      
Subjt:  NLIKDLQ--SQLLVLKEELQKSQNLNMELQS----QNDLLVRDLAAAEAKLAN-------ASNNVQREAVAEYPQPNAKDSQIL-ENGKLQAQPSSSCQN

Query:  VRNLECKAPPRRP---------------PPPPPPPLPFQSLPRAVA-TQKSPDLIRLFHSLRKKE---GKRDPPLLGK--PPAINAHNS---IVGEIQNR
         +++    PP  P               PPPPPPP P +SL  A A  ++ P+++  +HSL +++    +RD    G     AI A+++   ++GEI+NR
Subjt:  VRNLECKAPPRRP---------------PPPPPPPLPFQSLPRAVA-TQKSPDLIRLFHSLRKKE---GKRDPPLLGK--PPAINAHNS---IVGEIQNR

Query:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEY
        S +LLAIK D+ET+G+FI  LI +V  AA++DIEDV+ FV WLD +LS L DERAVLKHF+WPE+KAD +REAA  Y
Subjt:  SAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEY
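
The alignments above follow the usual BLAST pairwise layout: in the middle line, a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. As a minimal sketch of how identity could be tallied for one Query/midline pair, the Python below counts those symbols. The function name `alignment_stats` is hypothetical, and the figures it returns for a single block will not match the % identity values listed for each hit, which cover the whole alignment.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one aligned Query/midline pair.

    Assumes both strings cover the same alignment columns, with the
    leading "Query:  " label and indentation already stripped.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")

    columns = len(query)
    # A letter in the midline that repeats the query residue is an identity.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # "+" marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")

    return {
        "columns": columns,
        "identical": identical,
        "positives": positives,
        "percent_identity": round(100.0 * identical / columns, 2),
    }


if __name__ == "__main__":
    # First six columns of the first GenBank alignment on this page.
    print(alignment_stats("MEQRGK", "MEQ+GK"))
```

Gap columns ("-" in the Query line) are counted in `columns`, which is how alignment length is normally defined when reporting percent identity.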


Sequences
CDS sequence
ATGGTGATGTTGAGCGGCGACCTTCCATCTTCAATTTTGAAAGCTTTTAATCCAAGAGCTGGAAAAAAGTGGTCCATCAATGCCATGATAAACCATTTGATCTTCAAATC
AAATCCAGAATATAGACGACACCCAATTGTAGTGCTTTTAATCGCCATTGTTAATCCTTTCTCTTTCCTTCGGCCTCGAGTTTTGTTTTGTTTTGTTTCGTTTGGTGGGT
GCTTCGCTGGGGTCATCGCTGTGGCTGTCTTTTCAAATATATTCTTCTCTCTCTCTCTCTCCTCTCGTCTTCCATGTCTGTACAGAGCGAATGGGTTTGAGGGTTGGCCG
GTGCATGGCGGCTGCCGGCAGGTAATTCGCCTTTATTGGCCGTTTTACGTCAGACCCAGTACGGCGGCGCCACTCTGCGAGTTGAGAACCTGTCTCTGTCCTTGTCTGAA
ACGTGGAATTGTCGGTTCTTCAATCGAGTCTTTAATGGAGCAGAGAGGGAAATCAACTGCTGCAGCGACGAGCGCGTCGAACTCTGCGATGTCAGCTCGGGGAAGGGTTT
CTTCTAAAGCTATGGAGTCGCCGAAGCCGCTGGTTTCTGTGCCTGCTGTTCAATCGACGCCTCAGTCTGTCGTTAAGAAGCAAAGTTCGAGAGTTAGCAGATCTCTGACG
CTGAATGCTCCCAAGTCTCGAGACGACGTTTATGGAAGTAAGAAGAGTAGGGATGGGGAACGTGCTGGAGTCTCGGCTCGAACGGTCAACCGCGGCGGTCTCAAGCAAGT
TTCGCAGCGGCGGCGTTCTTCTGGTGCTGGTTCGTGTGCGAATGTTGACGATTGTAATGGAGTGAAGAGTGAATTGCAGGAGAAGCTTTGTTTCGCTGAGAATTTGATCA
AAGATTTGCAGTCTCAATTGCTGGTTTTGAAGGAGGAGTTGCAGAAGTCTCAGAACTTGAACATGGAGCTGCAATCGCAGAACGATTTGCTCGTCCGTGACCTAGCCGCC
GCTGAAGCGAAGTTAGCAAATGCTAGCAATAACGTTCAGAGGGAGGCAGTTGCAGAGTACCCTCAACCGAACGCCAAGGACAGTCAGATACTCGAAAATGGAAAGTTGCA
GGCCCAACCTTCTAGTTCGTGTCAGAATGTTAGGAATTTGGAATGCAAGGCCCCACCACGACGGCCACCGCCGCCGCCGCCTCCGCCTCTTCCCTTCCAATCCTTGCCCC
GAGCAGTGGCCACTCAGAAGTCTCCAGACCTCATACGCCTCTTTCACTCGTTGAGAAAGAAAGAGGGGAAGAGAGATCCTCCATTGTTGGGGAAACCACCTGCGATTAAT
GCGCACAATAGCATTGTTGGGGAAATTCAGAATCGTTCCGCGCATCTTTTAGCAATAAAAGCAGACATTGAAACCAAAGGAGAGTTCATCAATGGCCTCATTGACAAAGT
GCTTGTTGCAGCTTATACGGACATTGAAGATGTCCTCAAGTTTGTCGATTGGCTTGATTCCCAACTCTCGTCATTGGCTGACGAGCGGGCTGTGTTGAAGCATTTCAAGT
GGCCTGAGAAGAAAGCCGACACCATGCGAGAAGCAGCGATAGAATACCGATGA
mRNA sequence
ATGGTGATGTTGAGCGGCGACCTTCCATCTTCAATTTTGAAAGCTTTTAATCCAAGAGCTGGAAAAAAGTGGTCCATCAATGCCATGATAAACCATTTGATCTTCAAATC
AAATCCAGAATATAGACGACACCCAATTGTAGTGCTTTTAATCGCCATTGTTAATCCTTTCTCTTTCCTTCGGCCTCGAGTTTTGTTTTGTTTTGTTTCGTTTGGTGGGT
GCTTCGCTGGGGTCATCGCTGTGGCTGTCTTTTCAAATATATTCTTCTCTCTCTCTCTCTCCTCTCGTCTTCCATGTCTGTACAGAGCGAATGGGTTTGAGGGTTGGCCG
GTGCATGGCGGCTGCCGGCAGGTAATTCGCCTTTATTGGCCGTTTTACGTCAGACCCAGTACGGCGGCGCCACTCTGCGAGTTGAGAACCTGTCTCTGTCCTTGTCTGAA
ACGTGGAATTGTCGGTTCTTCAATCGAGTCTTTAATGGAGCAGAGAGGGAAATCAACTGCTGCAGCGACGAGCGCGTCGAACTCTGCGATGTCAGCTCGGGGAAGGGTTT
CTTCTAAAGCTATGGAGTCGCCGAAGCCGCTGGTTTCTGTGCCTGCTGTTCAATCGACGCCTCAGTCTGTCGTTAAGAAGCAAAGTTCGAGAGTTAGCAGATCTCTGACG
CTGAATGCTCCCAAGTCTCGAGACGACGTTTATGGAAGTAAGAAGAGTAGGGATGGGGAACGTGCTGGAGTCTCGGCTCGAACGGTCAACCGCGGCGGTCTCAAGCAAGT
TTCGCAGCGGCGGCGTTCTTCTGGTGCTGGTTCGTGTGCGAATGTTGACGATTGTAATGGAGTGAAGAGTGAATTGCAGGAGAAGCTTTGTTTCGCTGAGAATTTGATCA
AAGATTTGCAGTCTCAATTGCTGGTTTTGAAGGAGGAGTTGCAGAAGTCTCAGAACTTGAACATGGAGCTGCAATCGCAGAACGATTTGCTCGTCCGTGACCTAGCCGCC
GCTGAAGCGAAGTTAGCAAATGCTAGCAATAACGTTCAGAGGGAGGCAGTTGCAGAGTACCCTCAACCGAACGCCAAGGACAGTCAGATACTCGAAAATGGAAAGTTGCA
GGCCCAACCTTCTAGTTCGTGTCAGAATGTTAGGAATTTGGAATGCAAGGCCCCACCACGACGGCCACCGCCGCCGCCGCCTCCGCCTCTTCCCTTCCAATCCTTGCCCC
GAGCAGTGGCCACTCAGAAGTCTCCAGACCTCATACGCCTCTTTCACTCGTTGAGAAAGAAAGAGGGGAAGAGAGATCCTCCATTGTTGGGGAAACCACCTGCGATTAAT
GCGCACAATAGCATTGTTGGGGAAATTCAGAATCGTTCCGCGCATCTTTTAGCAATAAAAGCAGACATTGAAACCAAAGGAGAGTTCATCAATGGCCTCATTGACAAAGT
GCTTGTTGCAGCTTATACGGACATTGAAGATGTCCTCAAGTTTGTCGATTGGCTTGATTCCCAACTCTCGTCATTGGCTGACGAGCGGGCTGTGTTGAAGCATTTCAAGT
GGCCTGAGAAGAAAGCCGACACCATGCGAGAAGCAGCGATAGAATACCGATGA
Protein sequence
MVMLSGDLPSSILKAFNPRAGKKWSINAMINHLIFKSNPEYRRHPIVVLLIAIVNPFSFLRPRVLFCFVSFGGCFAGVIAVAVFSNIFFSLSLSSRLPCLYRANGFEGWP
VHGGCRQVIRLYWPFYVRPSTAAPLCELRTCLCPCLKRGIVGSSIESLMEQRGKSTAAATSASNSAMSARGRVSSKAMESPKPLVSVPAVQSTPQSVVKKQSSRVSRSLT
LNAPKSRDDVYGSKKSRDGERAGVSARTVNRGGLKQVSQRRRSSGAGSCANVDDCNGVKSELQEKLCFAENLIKDLQSQLLVLKEELQKSQNLNMELQSQNDLLVRDLAA
AEAKLANASNNVQREAVAEYPQPNAKDSQILENGKLQAQPSSSCQNVRNLECKAPPRRPPPPPPPPLPFQSLPRAVATQKSPDLIRLFHSLRKKEGKRDPPLLGKPPAIN
AHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAYTDIEDVLKFVDWLDSQLSSLADERAVLKHFKWPEKKADTMREAAIEYR
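
The protein sequence above should correspond to the CDS sequence read codon by codon. As a rough sketch of how that could be checked, the Python below translates a nucleotide string with the standard genetic code. The function name `translate` is hypothetical, and it assumes the standard nuclear code applies and that the input starts in frame. Only the first 30 and last 12 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon.

    Whitespace is removed first, trailing bases that do not fill a
    codon are ignored, and unrecognised codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    # First 30 nt of the CDS listed on this page.
    print(translate("ATGGTGATGTTGAGCGGCGACCTTCCATCT"))  # MVMLSGDLPS
    # Last 12 nt of the CDS, ending in the TGA stop codon.
    print(translate("GAATACCGATGA"))  # EYR*
```

Both fragments agree with the start (MVMLSGDLPS) and end (EYR) of the listed protein sequence. Pasting the full CDS lines into `translate` would extend the same check to the whole record.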