CuGenDBv2

Sgr021244 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021244
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: expressed protein localized to the inner membrane of the chloroplast.
Genome location: tig00153653:441194..449140
RNA-Seq Expression: Sgr021244
Synteny: Sgr021244
Gene Ontology terms: GO:0009706 - chloroplast inner membrane (cellular component)
                     GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
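
The genome location field follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The helper name `parse_location` is hypothetical, and the length assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(loc):
    """Split a 'contig:start..end' string into (contig, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    contig, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return contig, start, end, end - start + 1

contig, start, end, length = parse_location("tig00153653:441194..449140")
print(contig, start, end, length)
```

Under that assumption the gene spans 7947 bp on contig tig00153653.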


Homology
GenBank top hits | e value | % identity
KAG6571164.1 hypothetical protein SDJN03_30079, partial [Cucurbita argyrosperma subsp. sororia] | 4.0e-71 | 84.71
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKSISFGQAPKLAIQ+KC RT+HKLSVRAEYNDG RNGGG+FVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RS+LNEDEYGFRRA+RPIYYD+GLEKTRQTLN KI QLNSAIDNVSSRLRGGN TP+VPVEADPEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

KAG7010970.1 hypothetical protein SDJN02_27768, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.1e-71 | 85.29
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKSISFGQAPKLAIQ+KC RTNHKLSVRAEYNDG RNGGG+FVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RS+LNEDEYGFRRA+RPIYYD+GLEKTRQTLN KI QLNSAIDNVSSRLRGGN TP+VPVEADPEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

XP_022148671.1 uncharacterized protein LOC111017273 [Momordica charantia] | 1.1e-71 | 88.24
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKSI+FGQAPKLAIQKKCLRTN KLSVRAEYNDG R GGGDFVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKI QLNSAIDNVSSRLRGGNNTPAVPVEADPE EAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

XP_022944447.1 uncharacterized protein LOC111448897 [Cucurbita moschata] | 1.8e-71 | 85.29
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKSISFGQAPKLAIQ+KC RT+HKLSVRAEYNDG RNGGG+FVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RS+LNEDEYGFRRA+RPIYYD+GLEKTRQTLN KI QLNSAIDNVSSRLRGGN TPAVPVEADPEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

XP_038902580.1 uncharacterized protein LOC120089235 [Benincasa hispida] | 1.3e-72 | 88.24
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKSISFGQAPKLAIQ+K  RTN KLSVRAEYNDG+R+GGGDFVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

TrEMBL top hits | e value | % identity
A0A1S3C9M4 uncharacterized protein LOC103498395 | 2.8e-70 | 83.63
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKS+SFGQ PKLAI++KC +TNHKLSVRAEYNDG R+GGGDFVAGFLLGGAVFGTLAY+FAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPV-EADPEIEATI
        RS+LNEDE+GFRRAKRP+YYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPV EA+PEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPV-EADPEIEATI

A0A5D3CQ66 Uncharacterized protein | 2.8e-70 | 83.63
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKS+SFGQ PKLAI++KC +TNHKLSVRAEYNDG R+GGGDFVAGFLLGGAVFGTLAY+FAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPV-EADPEIEATI
        RS+LNEDE+GFRRAKRP+YYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPV EA+PEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPV-EADPEIEATI

A0A6J1D647 uncharacterized protein LOC111017273 | 5.2e-72 | 88.24
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKSI+FGQAPKLAIQKKCLRTN KLSVRAEYNDG R GGGDFVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKI QLNSAIDNVSSRLRGGNNTPAVPVEADPE EAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

A0A6J1FVP6 uncharacterized protein LOC111448897 | 8.8e-72 | 85.29
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAACFAPSLSVS             GGLIKASDLSSKSISFGQAPKLAIQ+KC RT+HKLSVRAEYNDG RNGGG+FVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RS+LNEDEYGFRRA+RPIYYD+GLEKTRQTLN KI QLNSAIDNVSSRLRGGN TPAVPVEADPEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

A0A6J1JGK4 uncharacterized protein LOC111484314 | 1.3e-70 | 84.12
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR
        MAAC APSLSVS             GGLIKASDLSSKSISFGQAPKLAIQ+KC R+NHKLSVRAEYNDG R+GGG+FVAGFLLGGAVFGTLAYIFAPQIR
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIR

Query:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI
        RS+LNEDEYGFRRA+RPIYYD+GLEKTRQTLN KI QLNSAIDNVSSRLRGGN TPAVPVEADPEIEAT+
Subjt:  RSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATI

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G42960.1 expressed protein localized to the inner membrane of the chloroplast. | 2.7e-17 | 36.63
Query:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGG-GDFVAGFLLGGAVFGTLAYIFAPQI
        MA+  + SLS+   S  L+       G     +    S+SFG      +     RT   L++++ Y D + +G  G FV GF+LGG + G L  ++APQI
Subjt:  MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGG-GDFVAGFLLGGAVFGTLAYIFAPQI

Query:  RRSLLNEDEYGFRRAKRPIYYDE--GLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEAT
         +++   D     R      YDE   LEKTR+ L  KI+QLNSAID+VSS+L+   +TP     +  EIEAT
Subjt:  RRSLLNEDEYGFRRAKRPIYYDE--GLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEAT

AT3G02900.1 unknown protein | 2.4e-37 | 62.6
Query:  LAIQKKCLRTNHKLSVRAEYNDGNRNGG-GDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNV
        L +Q K  R +HKLSV A Y  G++ GG  DFV GFLLG AVFGTLAYIFAPQIRRS+L+E+EYGF++ ++P+YYDEGLE+ R+ LN KI QLNSAID V
Subjt:  LAIQKKCLRTNHKLSVRAEYNDGNRNGG-GDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNV

Query:  SSRLRGG-------NNTPAVPVEADPEIEAT
        SSRL+GG        ++P+VPVE D E EAT
Subjt:  SSRLRGG-------NNTPAVPVEADPEIEAT

AT3G02900.2 unknown protein | 2.4e-37 | 62.6
Query:  LAIQKKCLRTNHKLSVRAEYNDGNRNGG-GDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNV
        L +Q K  R +HKLSV A Y  G++ GG  DFV GFLLG AVFGTLAYIFAPQIRRS+L+E+EYGF++ ++P+YYDEGLE+ R+ LN KI QLNSAID V
Subjt:  LAIQKKCLRTNHKLSVRAEYNDGNRNGG-GDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNV

Query:  SSRLRGG-------NNTPAVPVEADPEIEAT
        SSRL+GG        ++P+VPVE D E EAT
Subjt:  SSRLRGG-------NNTPAVPVEADPEIEAT

AT5G16660.1 unknown protein | 8.5e-43 | 61.02
Query:  MAACFAPS-LSVSGKSFLLYLKIAVQGGLIKASDLS--SKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNG-GGDFVAGFLLGGAVFGTLAYIFA
        MA+C A + LS+SG S         Q   +KA+ LS  +K  S  +   L I KK  RT  K SV A Y DG+R+G  GDF+AGFLLGGAVFG +AYIFA
Subjt:  MAACFAPS-LSVSGKSFLLYLKIAVQGGLIKASDLS--SKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNG-GGDFVAGFLLGGAVFGTLAYIFA

Query:  PQIRRSLLN-EDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEIEAT
        PQIRRS+LN EDEYGF + K+P YYDEGLEKTR+TLN KI QLNSAIDNVSSRLRG   NT +  VPVE DPE+EAT
Subjt:  PQIRRSLLN-EDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEIEAT

AT5G16660.2 unknown protein | 1.0e-40 | 60.45
Query:  MAACFAPS-LSVSGKSFLLYLKIAVQGGLIKASDLS--SKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNG-GGDFVAGFLLGGAVFGTLAYIFA
        MA+C A + LS+SG S         Q   +KA+ LS  +K  S  +   L I KK  RT  K SV A   DG+R+G  GDF+AGFLLGGAVFG +AYIFA
Subjt:  MAACFAPS-LSVSGKSFLLYLKIAVQGGLIKASDLS--SKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNG-GGDFVAGFLLGGAVFGTLAYIFA

Query:  PQIRRSLLN-EDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEIEAT
        PQIRRS+LN EDEYGF + K+P YYDEGLEKTR+TLN KI QLNSAIDNVSSRLRG   NT +  VPVE DPE+EAT
Subjt:  PQIRRSLLN-EDEYGFRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRG-GNNTPA--VPVEADPEIEAT


Sequences
CDS sequence
ATGGCGGCCTGCTTCGCTCCTTCGCTGTCCGTGTCTGGTAAATCTTTTCTTCTCTATCTTAAGATAGCAGTTCAAGGGGGATTGATCAAGGCCTCAGATCTCTCCTCAAA
GTCCATTTCCTTTGGGCAAGCACCCAAACTCGCCATTCAAAAGAAGTGCTTGAGAACCAACCACAAGTTATCAGTTCGTGCAGAGTACAATGATGGTAATAGAAATGGAG
GTGGGGACTTTGTTGCTGGTTTTCTTCTTGGGGGTGCAGTATTTGGAACTTTAGCTTATATTTTTGCTCCGCAGATCAGGAGATCTCTACTGAATGAAGATGAGTACGGT
TTTAGGAGGGCTAAGCGTCCAATCTACTACGATGAAGGTTTAGAGAAAACCAGACAGACGTTGAATGCAAAAATAAGCCAATTGAATTCTGCCATTGACAATGTATCTTC
ACGTCTGAGAGGTGGCAACAATACTCCAGCTGTGCCAGTTGAAGCCGATCCTGAGATAGAAGCTACCATAATTCTGAAACAAGGACTCAGTTCATTTCATGGCCACCGGA
ATGTCTTACCTTCATTGTATTCTGATGAAAAACCAACAGCGCCGCCACCAGCACCGCCAATCGCAGCTATTACCGGCCCCTCCGACCACCCATCGGAGAAATTGGGCATT
TTATGGGAAGGTGGGTTGGTCTTGGCTTCTATGTGA
mRNA sequence
ATGGCGGCCTGCTTCGCTCCTTCGCTGTCCGTGTCTGGTAAATCTTTTCTTCTCTATCTTAAGATAGCAGTTCAAGGGGGATTGATCAAGGCCTCAGATCTCTCCTCAAA
GTCCATTTCCTTTGGGCAAGCACCCAAACTCGCCATTCAAAAGAAGTGCTTGAGAACCAACCACAAGTTATCAGTTCGTGCAGAGTACAATGATGGTAATAGAAATGGAG
GTGGGGACTTTGTTGCTGGTTTTCTTCTTGGGGGTGCAGTATTTGGAACTTTAGCTTATATTTTTGCTCCGCAGATCAGGAGATCTCTACTGAATGAAGATGAGTACGGT
TTTAGGAGGGCTAAGCGTCCAATCTACTACGATGAAGGTTTAGAGAAAACCAGACAGACGTTGAATGCAAAAATAAGCCAATTGAATTCTGCCATTGACAATGTATCTTC
ACGTCTGAGAGGTGGCAACAATACTCCAGCTGTGCCAGTTGAAGCCGATCCTGAGATAGAAGCTACCATAATTCTGAAACAAGGACTCAGTTCATTTCATGGCCACCGGA
ATGTCTTACCTTCATTGTATTCTGATGAAAAACCAACAGCGCCGCCACCAGCACCGCCAATCGCAGCTATTACCGGCCCCTCCGACCACCCATCGGAGAAATTGGGCATT
TTATGGGAAGGTGGGTTGGTCTTGGCTTCTATGTGA
Protein sequence
MAACFAPSLSVSGKSFLLYLKIAVQGGLIKASDLSSKSISFGQAPKLAIQKKCLRTNHKLSVRAEYNDGNRNGGGDFVAGFLLGGAVFGTLAYIFAPQIRRSLLNEDEYG
FRRAKRPIYYDEGLEKTRQTLNAKISQLNSAIDNVSSRLRGGNNTPAVPVEADPEIEATIILKQGLSSFHGHRNVLPSLYSDEKPTAPPPAPPIAAITGPSDHPSEKLGI
LWEGGLVLASM
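
The protein sequence should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates that check on the first 36 nucleotides of the CDS listed above, assuming the standard nuclear genetic code (NCBI translation table 1). The record does not state which table the database used, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt of the Sgr021244 CDS shown above.
cds_prefix = "ATGGCGGCCTGCTTCGCTCCTTCGCTGTCCGTGTCT"
print(translate(cds_prefix))
```

Passing the full CDS, joined into a single string without line breaks, allows the same comparison against the complete protein sequence.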