CuGenDBv2

Sgr028853 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028853
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Gag-Pol polyprotein/retrotransposon
Genome location: tig00153207:1325121..1329248
RNA-Seq Expression: Sgr028853
Synteny: Sgr028853
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
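
The genome location above follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this record states), the string can be split into its parts and the span length computed as follows. The function name parse_location is hypothetical and used only for illustration.

```python
import re


def parse_location(loc: str):
    """Split a 'contig:start..end' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption made for this
    sketch), so the span length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    contig = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return contig, start, end, end - start + 1


if __name__ == "__main__":
    contig, start, end, length = parse_location("tig00153207:1325121..1329248")
    print(contig, start, end, length)
```

Under that assumption the gene spans 4128 bp on contig tig00153207.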


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.1e-83; % identity: 74.04)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V+  VA              G N+L PKTI R SPPFR +K  NIFV++NP +R CL+NA+I AN  L SEN FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ Q+HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAP P KMGY AAAGRFLKIMATVWAGSQVTKLARAAGALA+APFV+RGLSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        VKYNF+SQGKA +AIVGFC GL +LLFIAVTLLSA
Subjt:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

KAG7031890.1 hypothetical protein SDJN02_05931, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.5e-76; % identity: 70.21)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V                +  G N+L PKTI R SPPFR +K  NIFV++NP +R CLNNA+I AN  L SEN FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ Q+HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAP P KMGY         IMATVWAGSQVTKLARAAGALA+APFV+R LSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        VKYNF+SQGKA MAIVGFC GL +LLFIAVTLLSA
Subjt:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

XP_022956520.1 uncharacterized protein LOC111458238 [Cucurbita moschata] (e value: 2.3e-82; % identity: 73.62)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V+  VA              G N+L PKTI   SPPFR +K  NIFV++NP +R CLNNA+I AN  L SEN FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ Q+HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAP P KMGY AAAGRFLKIMATVWAGSQVTKLARAAGALA+APFV+R LSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        VKYNF+SQGKA +AIVGFC GL +LLFIAVTLLSA
Subjt:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima] (e value: 4.2e-84; % identity: 74.04)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V+  VA              G NEL PKTI R SPPFR +K  NIFV++NP +R CLNNA+I AN  L SE+ FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ ++HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAPAP KMGY AAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFV+RGLSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        VKYNF+SQGKA +AIVGFC GL +LLFIAVTLLSA
Subjt:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 9.4e-84; % identity: 74.04)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V+  VA              G N+L PKTI R SPPFR +K  NIFV++NP +R CLNNA+I AN  L SEN FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ ++HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAP P KMGY AAAGRFLKIMATVWAGSQVTKLARAAGALA+APFV+RGLSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        VKYNF+SQGKA +AIVGFC GL +LLFIAVTLLSA
Subjt:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X5 (e value: 2.3e-72; % identity: 77.95)
Query:  PKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTF
        P+   R SPP    +     V++NP VRLCL+NA I AN  L SE+ FSNHETEGSMEKNE+ Q+HP KSN+VLDKLRRYG+SGILSYGLLNT YYLTTF
Subjt:  PKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTF

Query:  LFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFTVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        L VWFYIAPAP KMGY AAAGRFLKIMATVWAGSQVTKLARAAGALALAPFV+RGLSWFTV YNFESQGKA MAIVGFC GL +LLFI VTLLSA
Subjt:  LFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFTVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

A0A6J1CD92 uncharacterized protein LOC111010142 (e value: 5.4e-69; % identity: 66.95)
Query:  MPTPLKLAN-GDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGA-NQLNSENCFSNHETEGSMEK
        M +P+KL N G+ RWV                IL G N+LLP+TISR+SPPFRA   KN+FVNQNPY+RLC +NA IGA + LNSE+ FSNHE EGSMEK
Subjt:  MPTPLKLAN-GDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGA-NQLNSENCFSNHETEGSMEK

Query:  NESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWF
        NESSQRH  KSN VLDKLRR                         FYIAPAP KMGY AAAGRFLKIMATV AGSQVTKLARAAGALA+APFV+RGLSWF
Subjt:  NESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWF

Query:  TVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        TVKYNFESQGKA MAIVGFCFGL +LLFIAVTLLSA
Subjt:  TVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

A0A6J1GY06 uncharacterized protein LOC111458238 (e value: 1.1e-82; % identity: 73.62)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V+  VA              G N+L PKTI   SPPFR +K  NIFV++NP +R CLNNA+I AN  L SEN FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ Q+HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAP P KMGY AAAGRFLKIMATVWAGSQVTKLARAAGALA+APFV+R LSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        VKYNF+SQGKA +AIVGFC GL +LLFIAVTLLSA
Subjt:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X2 (e value: 2.0e-84; % identity: 74.04)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V+  VA              G NEL PKTI R SPPFR +K  NIFV++NP +R CLNNA+I AN  L SE+ FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ ++HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAPAP KMGY AAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFV+RGLSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        VKYNF+SQGKA +AIVGFC GL +LLFIAVTLLSA
Subjt:  VKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X1 (e value: 5.0e-75; % identity: 73.33)
Query:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN
        MPTPLKL NG I  V+  VA              G NEL PKTI R SPPFR +K  NIFV++NP +R CLNNA+I AN  L SE+ FSNHETEGSMEKN
Subjt:  MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQ-LNSENCFSNHETEGSMEKN

Query:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT
        E+ ++HP KS +VLDKLRRYG+SGILSYGLLNTVYYLTTFL VWFYIAPAP KMGY AAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFV+RGLSWFT
Subjt:  ESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFT

Query:  VKYNFESQGK
        VKYNF+SQGK
Subjt:  VKYNFESQGK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G38695.1 unknown protein (e value: 5.2e-48; % identity: 69.93)
Query:  EGSM-EKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFV
        EG M +KN  S+++P  S ++L KL+RYG+SGILSYGLLNTVYY T FL VWFY+APAP KMGY AAA RFLK+MA VWAGSQVTKL R  GA+ALAP V
Subjt:  EGSM-EKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFV

Query:  ERGLSWFTVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
        +RGLSWFTVK NFESQGKA  A+VG C G+ ++LFI VTLL A
Subjt:  ERGLSWFTVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA

AT2G38695.2 unknown protein (e value: 7.8e-28; % identity: 69.57)
Query:  EGSM-EKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAG
        EG M +KN  S+++P  S ++L KL+RYG+SGILSYGLLNTVYY T FL VWFY+APAP KMGY AAA RFLK+MA VWAGSQVTKL R  G
Subjt:  EGSM-EKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAG

AT2G38695.3 unknown protein (e value: 3.6e-41; % identity: 52.36)
Query:  EGSM-EKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAG--------
        EG M +KN  S+++P  S ++L KL+RYG+SGILSYGLLNTVYY T FL VWFY+APAP KMGY AAA RFLK+MA VWAGSQVTKL R  G        
Subjt:  EGSM-EKNESSQRHPSKSNKVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAG--------

Query:  ----------------------------------------ALALAPFVERGLSWFTVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA
                                                A+ALAP V+RGLSWFTVK NFESQGKA  A+VG C G+ ++LFI VTLL A
Subjt:  ----------------------------------------ALALAPFVERGLSWFTVKYNFESQGKACMAIVGFCFGLVILLFIAVTLLSA


Sequences
CDS sequence
ATGCCAACTCCACTCAAGCTCGCGAATGGCGACATCCGCTGGGTCCTTCTCCGGGTAGCGCTACGAGCTTCACTCAATTTCGAAAATCTGAAAATTCTCACAGGTTTGAA
CGAATTGCTGCCGAAGACCATTTCCAGAGTTTCCCCGCCATTTCGCGCCGCAAAGCTGAAGAACATTTTTGTCAATCAGAACCCTTACGTCCGGCTCTGCCTCAACAATG
CCGATATCGGCGCCAATCAATTGAATTCTGAAAATTGCTTTTCCAATCATGAAACTGAAGGTTCAATGGAAAAAAATGAAAGTAGTCAAAGACATCCATCAAAATCGAAC
AAGGTACTGGATAAATTGAGGAGATATGGAATTTCTGGAATATTGTCGTACGGATTACTGAATACTGTATACTATCTTACAACATTTCTCTTTGTGTGGTTCTACATTGC
ACCAGCACCTGTGAAAATGGGTTATGCTGCTGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCTGGGG
CTCTTGCTCTAGCACCATTCGTCGAGAGAGGGTTGTCGTGGTTCACAGTCAAATACAACTTCGAGTCTCAGGGGAAGGCATGTATGGCAATTGTTGGGTTCTGCTTTGGA
TTGGTCATCTTGTTATTCATTGCTGTGACTCTGCTTTCAGCATAA
mRNA sequence
ATGCCAACTCCACTCAAGCTCGCGAATGGCGACATCCGCTGGGTCCTTCTCCGGGTAGCGCTACGAGCTTCACTCAATTTCGAAAATCTGAAAATTCTCACAGGTTTGAA
CGAATTGCTGCCGAAGACCATTTCCAGAGTTTCCCCGCCATTTCGCGCCGCAAAGCTGAAGAACATTTTTGTCAATCAGAACCCTTACGTCCGGCTCTGCCTCAACAATG
CCGATATCGGCGCCAATCAATTGAATTCTGAAAATTGCTTTTCCAATCATGAAACTGAAGGTTCAATGGAAAAAAATGAAAGTAGTCAAAGACATCCATCAAAATCGAAC
AAGGTACTGGATAAATTGAGGAGATATGGAATTTCTGGAATATTGTCGTACGGATTACTGAATACTGTATACTATCTTACAACATTTCTCTTTGTGTGGTTCTACATTGC
ACCAGCACCTGTGAAAATGGGTTATGCTGCTGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCTGGGG
CTCTTGCTCTAGCACCATTCGTCGAGAGAGGGTTGTCGTGGTTCACAGTCAAATACAACTTCGAGTCTCAGGGGAAGGCATGTATGGCAATTGTTGGGTTCTGCTTTGGA
TTGGTCATCTTGTTATTCATTGCTGTGACTCTGCTTTCAGCATAA
Protein sequence
MPTPLKLANGDIRWVLLRVALRASLNFENLKILTGLNELLPKTISRVSPPFRAAKLKNIFVNQNPYVRLCLNNADIGANQLNSENCFSNHETEGSMEKNESSQRHPSKSN
KVLDKLRRYGISGILSYGLLNTVYYLTTFLFVWFYIAPAPVKMGYAAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVERGLSWFTVKYNFESQGKACMAIVGFCFG
LVILLFIAVTLLSA