; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr005481 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr005481
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationtig00003788:1154..1822
RNA-Seq ExpressionSgr005481
SyntenySgr005481
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8376458.1 hypothetical protein BUALT_Bualt09G0065400 [Buddleja alternifolia]1.7e-4246.73Show/hide
Query:  LTVTLTGGNNPPVPQSPPPPSPTLNPTPNARRR---QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAE
        L ++L+  ++ P P   PP   TL   P  RRR   Q+LRPGK             RRA +   +  + K       +++C KC+R+  IEYDL +KFAE
Subjt:  LTVTLTGGNNPPVPQSPPPPSPTLNPTPNARRR---QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAE

Query:  VQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPS
        V  FI KN  EMHHRAP  W+ P   +C  C    C KP++ KKR+INWLFLFLGQMIG+C + +LKY+CKHT+NH+TGAK+R++Y+ YM LC+QL+P+
Subjt:  VQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPS

KAG8376469.1 hypothetical protein BUALT_Bualt09G0066800 [Buddleja alternifolia]5.0e-4246.23Show/hide
Query:  LTVTLTGGNNPPVPQSPPPPSPTLNPTPNARRR---QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAE
        L ++L+  ++ P P   PP   T    P+ RRR   Q+LRPGK             RRA +   +  + K       +++C KC+R+  IEYDL +KFAE
Subjt:  LTVTLTGGNNPPVPQSPPPPSPTLNPTPNARRR---QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAE

Query:  VQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPS
        V  FI KN  EMHHRAP  W+ P   +C  C    C KP++ KKR+INWLFLFLGQMIG+C + +LKY+CKHT+NH+TGAK+R++Y+ YM LC+QL+P+
Subjt:  VQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPS

TXG59059.1 hypothetical protein EZV62_016888 [Acer yangbiense]2.2e-4244.88Show/hide
Query:  PPVPQSPPPP-------------SPTLNPTPNARRR--QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKF
        PP P SPPPP             +P   PT   RR   Q  +PGK     S       +RA +   +            +++C +CE+  +IEYDL EKF
Subjt:  PPVPQSPPPP-------------SPTLNPTPNARRR--QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKF

Query:  AEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHP
         E+  FI KN+  MHHRAPAVW+ P    C  C    CV+P+I+KK+++NWLFL LG+M+G C +EQLKY+CKHT+NH+TGAKDR++Y+ Y+ LCKQL P
Subjt:  AEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHP

Query:  SGPYD
        +GP+D
Subjt:  SGPYD

XP_006480845.1 uncharacterized protein LOC102624229 [Citrus sinensis]5.5e-4149.7Show/hide
Query:  QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGC
        Q LRPGK     +       RRA +   E        K   +++C +CE++ EIEYDL  KF EV  FI +N++ MH RAPA+W+ P   +C  C    C
Subjt:  QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGC

Query:  VKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPSGPYD
        VKP++ KK++INWLFL LGQM+G C + +LKY+CKHTRNH+TGAKDR++Y+ Y+SLCKQL P+G YD
Subjt:  VKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPSGPYD

XP_022157692.1 uncharacterized protein LOC111024349 [Momordica charantia]2.5e-4652.36Show/hide
Query:  KPRTSSRSLTVTLTGGNNPPVPQSPPP--PSPTLNP-TPNARRRQLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEY
        K  T S SL+        PP P  PPP  PSP  NP TP  R R LL  GK            + RA+IR  +   R   +K   +++C KC  +S++E+
Subjt:  KPRTSSRSLTVTLTGGNNPPVPQSPPP--PSPTLNP-TPNARRRQLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEY

Query:  DLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSL
        +LTEKF EV+ FI  N+ EMH RAP  W  P  +DC  C G GC +PV  KKR +NWLFL LGQMIG   +E LKY CKHTRNH+TGAKDRLVYIAYM L
Subjt:  DLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSL

Query:  CKQLHPSGPYDL
        CKQLHP+GPYDL
Subjt:  CKQLHPSGPYDL

TrEMBL top hitse value%identityAlignment
A0A2C9U2B7 Uncharacterized protein1.9e-3943.19Show/hide
Query:  PRTSSRSLTVTLTGGNNPPVPQSP--PPPSPTLNPTPNARRR--QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEY
        P  S   ++   T  + P +P SP  PP  P L P P ARR   Q  R GK            +RRA +           +     ++C +CE++ E+E+
Subjt:  PRTSSRSLTVTLTGGNNPPVPQSP--PPPSPTLNPTPNARRR--QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEY

Query:  DLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMS
        +L E+F  V ++I +N+  MH RAPA W+ P+   C+ C+   CVKPVI +KK++INWLFL LG+M+G C ++QLKY+CKHT+NH+TGAKDR++Y+ Y+ 
Subjt:  DLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMS

Query:  LCKQLHPSGPYDL
        LCKQLHP GP+DL
Subjt:  LCKQLHPSGPYDL

A0A2H5NYI9 Uncharacterized protein2.7e-4149.7Show/hide
Query:  QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGC
        Q LRPGK     +       RRA +   E        K   +++C +CE++ EIEYDL  KF EV  FI +N++ MH RAPA+W+ P   +C  C    C
Subjt:  QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGC

Query:  VKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPSGPYD
        VKP++ KK++INWLFL LGQM+G C + +LKY+CKHTRNH+TGAKDR++Y+ Y+SLCKQL P+G YD
Subjt:  VKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPSGPYD

A0A5C7HQG1 Uncharacterized protein1.1e-4244.88Show/hide
Query:  PPVPQSPPPP-------------SPTLNPTPNARRR--QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKF
        PP P SPPPP             +P   PT   RR   Q  +PGK     S       +RA +   +            +++C +CE+  +IEYDL EKF
Subjt:  PPVPQSPPPP-------------SPTLNPTPNARRR--QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKF

Query:  AEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHP
         E+  FI KN+  MHHRAPAVW+ P    C  C    CV+P+I+KK+++NWLFL LG+M+G C +EQLKY+CKHT+NH+TGAKDR++Y+ Y+ LCKQL P
Subjt:  AEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHP

Query:  SGPYD
        +GP+D
Subjt:  SGPYD

A0A6J1DV57 uncharacterized protein LOC1110243491.2e-4652.36Show/hide
Query:  KPRTSSRSLTVTLTGGNNPPVPQSPPP--PSPTLNP-TPNARRRQLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEY
        K  T S SL+        PP P  PPP  PSP  NP TP  R R LL  GK            + RA+IR  +   R   +K   +++C KC  +S++E+
Subjt:  KPRTSSRSLTVTLTGGNNPPVPQSPPP--PSPTLNP-TPNARRRQLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEY

Query:  DLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSL
        +LTEKF EV+ FI  N+ EMH RAP  W  P  +DC  C G GC +PV  KKR +NWLFL LGQMIG   +E LKY CKHTRNH+TGAKDRLVYIAYM L
Subjt:  DLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSL

Query:  CKQLHPSGPYDL
        CKQLHP+GPYDL
Subjt:  CKQLHPSGPYDL

V4SNJ8 Uncharacterized protein1.0e-4049.1Show/hide
Query:  QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGC
        Q LRPGK     +       RRA +   E        K   +++C +CE++ EIEYDL  KF EV  FI +N++ MH RAPA+W+ P   +C  C    C
Subjt:  QLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGC

Query:  VKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPSGPYD
        VKP++ KK++INWLFL LG M+G C + +LKY+CKHTRNH+TGAKDR++Y+ Y+SLCKQL P+G YD
Subjt:  VKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPSGPYD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein1.0e-2940.1Show/hide
Query:  PQSPPPPSPTLNPTPNAR--------RRQLLRPGKKRDNTS-TISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVK
        P    PPS  L P P  R         R      KK D  S       +RR  I+  E     +      +++C  CE+  ++ Y+L E+FAEV +F + 
Subjt:  PQSPPPPSPTLNPTPNAR--------RRQLLRPGKKRDNTS-TISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEYDLTEKFAEVQRFIVK

Query:  NQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHP
         + +M  RA   W  P  R C+ C     VKPVI E+K  INWLFL LGQ +G C +EQLK +CKH++NH+TGAKDR++Y+ YM LCK L P
Subjt:  NQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)5.3e-3449.22Show/hide
Query:  KLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNH
        ++ C  C+R   +EY+L EKF+E+  +I  N+ EM HRAP  W  P+   C  C+    +KPV+ E+K  INWLFL LGQM+G C ++QL+Y+C+    H
Subjt:  KLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNH

Query:  KTGAKDRLVYIAYMSLCKQLHPSGPYDL
        +TG+KDR+VYI Y+SLCKQL P GP++L
Subjt:  KTGAKDRLVYIAYMSLCKQLHPSGPYDL

AT2G16190.2 FUNCTIONS IN: molecular_function unknown2.6e-1745.56Show/hide
Query:  KLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQL
        ++ C  C+R   +EY+L EKF+E+  +I  N+ EM HRAP  W  P+   C  C+    +KPV+ E+K  INWLFL LGQM+G C ++QL
Subjt:  KLECSKCERESEIEYDLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVI-EKKRAINWLFLFLGQMIGLCGIEQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTACCATGCATGGACAAGAGGAAGAGGAAGACCAAGCCGCGAACCTCCTCTCGATCTCTCACTGTCACTCTCACAGGTGGAAACAATCCGCCGGTCCCGCAGTC
GCCGCCGCCACCCTCGCCAACTCTAAACCCTACGCCCAACGCGCGGCGCAGGCAATTACTCCGGCCGGGGAAAAAGCGAGACAATACCAGTACTATATCCATGGGCGACG
ACCGGCGAGCAATGATACGCCAGTCTGAGAGATCTCGTCGCAAACGGAATCAGAAAAATCACCGAAAATTGGAGTGCAGCAAGTGCGAGAGGGAGAGCGAAATCGAATAC
GATCTGACGGAAAAGTTCGCGGAAGTGCAGAGGTTCATCGTGAAGAACCAGTGGGAGATGCACCACCGTGCGCCGGCGGTTTGGTTGGCGCCTAGGCCGCGGGACTGCGA
CGGCTGCCAAGGAATCGGCTGCGTGAAGCCGGTTATTGAGAAGAAGAGAGCGATAAATTGGTTGTTCTTGTTTCTAGGGCAAATGATTGGCCTCTGTGGTATAGAACAAC
TCAAATACTACTGTAAGCATACGAGAAACCATAAGACTGGTGCAAAAGATAGGCTGGTGTATATTGCCTATATGAGTTTGTGCAAGCAACTTCATCCCTCAGGTCCTTAT
GATCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTACCATGCATGGACAAGAGGAAGAGGAAGACCAAGCCGCGAACCTCCTCTCGATCTCTCACTGTCACTCTCACAGGTGGAAACAATCCGCCGGTCCCGCAGTC
GCCGCCGCCACCCTCGCCAACTCTAAACCCTACGCCCAACGCGCGGCGCAGGCAATTACTCCGGCCGGGGAAAAAGCGAGACAATACCAGTACTATATCCATGGGCGACG
ACCGGCGAGCAATGATACGCCAGTCTGAGAGATCTCGTCGCAAACGGAATCAGAAAAATCACCGAAAATTGGAGTGCAGCAAGTGCGAGAGGGAGAGCGAAATCGAATAC
GATCTGACGGAAAAGTTCGCGGAAGTGCAGAGGTTCATCGTGAAGAACCAGTGGGAGATGCACCACCGTGCGCCGGCGGTTTGGTTGGCGCCTAGGCCGCGGGACTGCGA
CGGCTGCCAAGGAATCGGCTGCGTGAAGCCGGTTATTGAGAAGAAGAGAGCGATAAATTGGTTGTTCTTGTTTCTAGGGCAAATGATTGGCCTCTGTGGTATAGAACAAC
TCAAATACTACTGTAAGCATACGAGAAACCATAAGACTGGTGCAAAAGATAGGCTGGTGTATATTGCCTATATGAGTTTGTGCAAGCAACTTCATCCCTCAGGTCCTTAT
GATCTTTGA
Protein sequenceShow/hide protein sequence
MPLPCMDKRKRKTKPRTSSRSLTVTLTGGNNPPVPQSPPPPSPTLNPTPNARRRQLLRPGKKRDNTSTISMGDDRRAMIRQSERSRRKRNQKNHRKLECSKCERESEIEY
DLTEKFAEVQRFIVKNQWEMHHRAPAVWLAPRPRDCDGCQGIGCVKPVIEKKRAINWLFLFLGQMIGLCGIEQLKYYCKHTRNHKTGAKDRLVYIAYMSLCKQLHPSGPY
DL