CuGenDBv2

Sgr024115 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024115
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00001047:3346911..3347474
RNA-Seq Expression: Sgr024115
Synteny: Sgr024115
Gene Ontology terms: GO:0010190 - cytochrome b6f complex assembly (biological process)
                     GO:0009507 - chloroplast (cellular component)
InterPro domains: NA
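The genome location field packs a contig name and a coordinate range into one string. A minimal Python sketch of how such a string could be split and its span computed is shown here. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00001047:3346911..3347474")
print(contig, start, end, span_length(start, end))
```

Under that assumption the span is 564 bp, which appears consistent with a 187-residue protein plus a stop codon (188 codons × 3) and with a single-exon gene whose CDS and mRNA sequences are identical.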


Homology
GenBank top hits (e value, % identity, alignment)
KGN51853.2 hypothetical protein Csa_008870 [Cucumis sativus] (e value: 8.4e-59, identity: 74.71%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P++ TRR RFAVDDGADL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWMVGRR L+QA+KK+KK+K+          +CRRG E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        A A E GG    E+GLPE+S GSGEED   GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN NLW NSN
Subjt:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

XP_008458773.1 PREDICTED: uncharacterized protein LOC103498078 [Cucumis melo] (e value: 2.4e-58, identity: 75.88%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P+R TRR RFAVDDGADL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWMVGRR L+QA+KK+KK+K++R          RRG E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        A A E GG    E+GLPEIS GSGEED   GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN NLW NSN
Subjt:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

XP_011655414.1 uncharacterized protein LOC105435525 [Cucumis sativus] (e value: 8.4e-59, identity: 74.71%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P++ TRR RFAVDDGADL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWMVGRR L+QA+KK+KK+K+          +CRRG E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        A A E GG    E+GLPE+S GSGEED   GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN NLW NSN
Subjt:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

XP_022989907.1 uncharacterized protein LOC111486958 [Cucurbita maxima] (e value: 1.2e-52, identity: 66.67%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P+ PTRR RF VDDG DL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWM+GRR L++A+KK+K              + RR  E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAV-EAGGSPAKEEGLPEISSGS----------GEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        AAA  E G  PA+EEGLPEI  GS           EE+G+GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGN N W NSN
Subjt:  AAAV-EAGGSPAKEEGLPEISSGS----------GEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

XP_038891069.1 uncharacterized protein LOC120080480 [Benincasa hispida] (e value: 1.7e-59, identity: 75.88%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P+RPTRR RFAVDDGADL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALAL+K+PWMVGRR L++A+KK+KK+K++R          RRG E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAVEAGGSPAKEEGLPEISSGSG-EEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        A A E GG  A EEGLPEIS GSG EE+ +GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN NLW NSN
Subjt:  AAAVEAGGSPAKEEGLPEISSGSG-EEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KR58 Uncharacterized protein (e value: 4.1e-59, identity: 74.71%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P++ TRR RFAVDDGADL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWMVGRR L+QA+KK+KK+K+          +CRRG E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        A A E GG    E+GLPE+S GSGEED   GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN NLW NSN
Subjt:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

A0A1S3C8M3 uncharacterized protein LOC103498078 (e value: 1.2e-58, identity: 75.88%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P+R TRR RFAVDDGADL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWMVGRR L+QA+KK+KK+K++R          RRG E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        A A E GG    E+GLPEIS GSGEED   GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN NLW NSN
Subjt:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

A0A5A7T2E5 Uncharacterized protein (e value: 1.2e-58, identity: 75.88%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P+R TRR RFAVDDGADL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWMVGRR L+QA+KK+KK+K++R          RRG E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        A A E GG    E+GLPEIS GSGEED   GNFSARFEAERIWLQLYQVGQLGFGRVSFTGN NLW NSN
Subjt:  AAAVEAGGSPAKEEGLPEISSGSGEED-GMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

A0A6J1HEE2 uncharacterized protein LOC111463206 (e value: 1.1e-51, identity: 67.43%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P+ PTRR RF VDDG DL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWM+GRR L++A+KK+K              + RR  E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAV-EAGGSPAKEEGLPEISSGS-----GEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        AAA  E     A+EEGLPEI  GS      EE+G+GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGN N W NSN
Subjt:  AAAV-EAGGSPAKEEGLPEISSGS-----GEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

A0A6J1JH40 uncharacterized protein LOC111486958 (e value: 5.7e-53, identity: 66.67%)
Query:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG
        +P+ PTRR RF VDDG DL DCS KHCRSCTAGLVADCVA+CCCPCSVVSFLALALVK+PWM+GRR L++A+KK+K              + RR  E DG
Subjt:  TPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSICRRGEESDG

Query:  AAAV-EAGGSPAKEEGLPEISSGS----------GEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
        AAA  E G  PA+EEGLPEI  GS           EE+G+GNFSARFEAERIWLQLYQ+GQLGFGRVSFTGN N W NSN
Subjt:  AAAV-EAGGSPAKEEGLPEISSGS----------GEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G01516.1 unknown protein (e value: 9.3e-24, identity: 46.2%)
Query:  ADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQ-AKKKKKKKKMMRAQR-----------EEMKSICRRGEESDG-----AA
        A CS K CRS  A  +ADCVA+CCCPC+VV+   LA VKVPWM+GR+ + +    KK+ KK+ R  R           E +   C  G + DG       
Subjt:  ADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQ-AKKKKKKKKMMRAQR-----------EEMKSICRRGEESDG-----AA

Query:  AVEAGGSPAKEEGLPEISSGSGEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG
         VE  GS  KEE      + S +E+     SAR EAER+WL+LYQ+G LGFGRVSFTG
Subjt:  AVEAGGSPAKEEGLPEISSGSGEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTG

AT5G14690.1 unknown protein (e value: 1.7e-25, identity: 39.53%)
Query:  MEENPTRAPRK--FGAH-DTATPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKK
        MEENP R  R+   G H          RR R       D   CS K CRS  A  +ADCVA+CCCPC++++ L L LVKVPWM+GRR L    + KKK++
Subjt:  MEENPTRAPRK--FGAH-DTATPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKK

Query:  MMRAQR-------------------------EEMKSIC--------RRGEESDGAAAVEAGGSPAKEEGLPE-ISSGSGEEDGMGNFSARFEAERIWLQL
        ++  ++                         E  K  C          G+  D    VE  GS  KEE   E  +S  GE+      SAR EAER+WL+L
Subjt:  MMRAQR-------------------------EEMKSIC--------RRGEESDGAAAVEAGGSPAKEEGLPE-ISSGSGEEDGMGNFSARFEAERIWLQL

Query:  YQVGQLGFGRVSFTG
        YQ+G LGFGRVSFTG
Subjt:  YQVGQLGFGRVSFTG


Sequences
CDS sequence
ATGGAAGAAAACCCAACTCGAGCTCCTCGCAAATTTGGGGCCCATGACACCGCCACCCCGGCACGGCCCACACGGCGGAGCAGATTCGCCGTCGACGACGGAGCCGATCT
GGCGGACTGCTCCTGCAAGCATTGCCGCTCATGCACCGCCGGCTTGGTAGCCGACTGCGTCGCCATCTGCTGCTGCCCTTGCTCGGTCGTCAGCTTCTTGGCTCTGGCTC
TCGTCAAAGTGCCGTGGATGGTCGGCCGGAGGTGGCTGGAACAGGCCAAGAAGAAGAAGAAAAAGAAGAAGATGATGAGAGCTCAGAGAGAGGAGATGAAGTCGATTTGC
CGGAGAGGGGAAGAGAGTGACGGCGCGGCGGCGGTGGAAGCAGGTGGGAGTCCGGCGAAGGAGGAAGGGTTGCCGGAAATTTCATCGGGGTCCGGCGAGGAAGACGGGAT
GGGGAATTTCAGTGCGAGGTTTGAAGCAGAGAGAATATGGTTGCAATTGTATCAGGTTGGCCAGTTGGGTTTTGGAAGAGTTTCCTTCACTGGGAATCCAAATCTCTGGC
TCAACTCCAATTAG
mRNA sequence
ATGGAAGAAAACCCAACTCGAGCTCCTCGCAAATTTGGGGCCCATGACACCGCCACCCCGGCACGGCCCACACGGCGGAGCAGATTCGCCGTCGACGACGGAGCCGATCT
GGCGGACTGCTCCTGCAAGCATTGCCGCTCATGCACCGCCGGCTTGGTAGCCGACTGCGTCGCCATCTGCTGCTGCCCTTGCTCGGTCGTCAGCTTCTTGGCTCTGGCTC
TCGTCAAAGTGCCGTGGATGGTCGGCCGGAGGTGGCTGGAACAGGCCAAGAAGAAGAAGAAAAAGAAGAAGATGATGAGAGCTCAGAGAGAGGAGATGAAGTCGATTTGC
CGGAGAGGGGAAGAGAGTGACGGCGCGGCGGCGGTGGAAGCAGGTGGGAGTCCGGCGAAGGAGGAAGGGTTGCCGGAAATTTCATCGGGGTCCGGCGAGGAAGACGGGAT
GGGGAATTTCAGTGCGAGGTTTGAAGCAGAGAGAATATGGTTGCAATTGTATCAGGTTGGCCAGTTGGGTTTTGGAAGAGTTTCCTTCACTGGGAATCCAAATCTCTGGC
TCAACTCCAATTAG
Protein sequence
MEENPTRAPRKFGAHDTATPARPTRRSRFAVDDGADLADCSCKHCRSCTAGLVADCVAICCCPCSVVSFLALALVKVPWMVGRRWLEQAKKKKKKKKMMRAQREEMKSIC
RRGEESDGAAAVEAGGSPAKEEGLPEISSGSGEEDGMGNFSARFEAERIWLQLYQVGQLGFGRVSFTGNPNLWLNSN
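
The protein sequence should correspond to a translation of the CDS sequence. The following Python sketch shows one way to check that relationship. It assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly, and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), ASSUMED here.
# Codons are enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last three codons of the CDS listed above.
print(translate("ATGGAAGAAAACCCAACT"))
print(translate("TCCAATTAG"))
```

Passing the full CDS sequence, with its line breaks, to `translate` and comparing the result against the protein sequence joined into a single string would confirm whether the two fields agree.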