; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019595 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019595
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPatatin
Genome locationtig00153349:234640..237536
RNA-Seq ExpressionSgr019595
SyntenySgr019595
Gene Ontology termsGO:0000162 - tryptophan biosynthetic process (biological process)
GO:0005737 - cytoplasm (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584077.1 hypothetical protein SDJN03_20009, partial [Cucurbita argyrosperma subsp. sororia]4.0e-4078.63Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKIC-SAFDVSESNRLNTI
        MAATA VAIGTRGT+GSL+KKEI+YFAKIELERCSS SSQRPQ P   DMA+S  SSSPPTFW++VMSWRRKKKR+ +RFVPKIC SAFDVSESN++N I
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKIC-SAFDVSESNRLNTI

Query:  SGFNYKILQNDFNSLHI
        SGFNY ILQN+F+SLH+
Subjt:  SGFNYKILQNDFNSLHI

KAG7019678.1 hypothetical protein SDJN02_18641, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-3977.78Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKIC-SAFDVSESNRLNTI
        MAATA VAIGTRGT+GSL+KKEI+YFAKIELERCSS SSQRPQ P   DMA+S  SS PPTFW++VMSWRRKKKR+ +RFVPKIC SAFDVSESN++N I
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKIC-SAFDVSESNRLNTI

Query:  SGFNYKILQNDFNSLHI
        SGFNY ILQN+F+SLH+
Subjt:  SGFNYKILQNDFNSLHI

KGN64543.2 hypothetical protein Csa_013063 [Cucumis sativus]9.6e-3472.41Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS
        MAA APVAIGTRGT+GSLVKKEI+YFAKIELE  +S SSQR QGP   +MASS   SSPPTFW ++MSWRRK K TSNRFV K+CS FD S SNR+N IS
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS

Query:  GFNYKILQNDFNSLHI
        G +Y ILQNDF+SLH+
Subjt:  GFNYKILQNDFNSLHI

XP_016898893.1 PREDICTED: uncharacterized protein LOC103485409 [Cucumis melo]2.3e-3572.41Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS
        MAATAPVAIGTRGT+GSL+KKEI+YFAKIELE  +S SSQR QGP   +MASS   SSPPTFW ++MSWRRKKK TSNRF+ K+CS FD S SNR+N IS
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS

Query:  GFNYKILQNDFNSLHI
        G +Y ILQNDF+SLH+
Subjt:  GFNYKILQNDFNSLHI

XP_022140128.1 uncharacterized protein LOC111010862 [Momordica charantia]4.5e-3979.49Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSS-RSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTI
        MAATAPVAIGTRGTVGSLVKKEI+YFAKIE ERCS            NDMASSS RSSSPPTFW+TVMSWRRKKKR  NRF+ KICSAFDVS SNRLN I
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSS-RSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTI

Query:  SGFNYKILQNDFNSLHI
        SGFNY ILQNDFNSLH+
Subjt:  SGFNYKILQNDFNSLHI

TrEMBL top hitse value%identityAlignment
A0A0A0LX43 Uncharacterized protein4.7e-3472.41Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS
        MAA APVAIGTRGT+GSLVKKEI+YFAKIELE  +S SSQR QGP   +MASS   SSPPTFW ++MSWRRK K TSNRFV K+CS FD S SNR+N IS
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS

Query:  GFNYKILQNDFNSLHI
        G +Y ILQNDF+SLH+
Subjt:  GFNYKILQNDFNSLHI

A0A1S4DSE9 uncharacterized protein LOC1034854091.1e-3572.41Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS
        MAATAPVAIGTRGT+GSL+KKEI+YFAKIELE  +S SSQR QGP   +MASS   SSPPTFW ++MSWRRKKK TSNRF+ K+CS FD S SNR+N IS
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS

Query:  GFNYKILQNDFNSLHI
        G +Y ILQNDF+SLH+
Subjt:  GFNYKILQNDFNSLHI

A0A314Z3X1 Uncharacterized protein4.1e-3061.54Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERC-SSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTI
        MAA AP+AIGTRGTVGSLV+KEIEYF+K+EL+R   SSSS++PQG  V+  +SS   SS P+FW+ +M+W+RKK+R+S RF+  ICSA  V+E+NRLN I
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERC-SSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTI

Query:  SGFNYKILQNDFNSLHI
         GFNY+IL++D N+L I
Subjt:  SGFNYKILQNDFNSLHI

A0A5A7UTL1 Uncharacterized protein2.6e-2973.27Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS
        MAATAPVAIGTRGT+GSL+KKEI+YFAKIELE  +S SSQR QGP   +MASS   SSPPTFW ++MSWRRKKK TSNRF+ K+CS FD S SNR+N IS
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTIS

Query:  G
        G
Subjt:  G

A0A6J1CHB1 uncharacterized protein LOC1110108622.2e-3979.49Show/hide
Query:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSS-RSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTI
        MAATAPVAIGTRGTVGSLVKKEI+YFAKIE ERCS            NDMASSS RSSSPPTFW+TVMSWRRKKKR  NRF+ KICSAFDVS SNRLN I
Subjt:  MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSS-RSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTI

Query:  SGFNYKILQNDFNSLHI
        SGFNY ILQNDFNSLH+
Subjt:  SGFNYKILQNDFNSLHI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G21780.1 unknown protein1.4e-1441.28Show/hide
Query:  APVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRT---SNRFVPKICSAFDVSESNRLNTISG
        AP+AIGTRGT+GSLV+KEI+YF             +       N      RSSS    W++   WR+KK++T     +F P +CSA +VS  NR   + G
Subjt:  APVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRT---SNRFVPKICSAFDVSESNRLNTISG

Query:  FNYKILQND
        FNY+IL++D
Subjt:  FNYKILQND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCAACTGCTCCTGTCGCCATAGGAACTCGAGGCACCGTGGGCTCGCTCGTCAAGAAGGAAATTGAATATTTCGCCAAGATTGAGCTAGAGAGATGTAGCAGCAG
CAGCTCACAGAGGCCTCAAGGACCCGCTGTAAATGACATGGCTTCTTCCAGCCGCAGCAGTTCCCCACCCACATTCTGGTACACGGTAATGTCCTGGCGAAGGAAGAAGA
AGAGAACCAGCAATCGTTTTGTCCCGAAAATTTGCTCGGCTTTTGATGTTTCAGAAAGCAATCGGCTAAATACGATTTCTGGGTTCAATTATAAGATCCTTCAGAACGAT
TTCAACAGCTTACACATCTTCAATTTCATTCCACTTCTCAACATTCTTCGAGAACCTCTCCTTGATTTGCCCAATAAGTTGCTTTGCCTCTTCGACCTTCGAAATGCTAA
CCAGCCCGTCGACAAGAGACTTCATTGTAGAAATATTTGGAACCCACCCCTTCTCCATGCTTTCCACACAGATCTTGAATGCTGTATCATAATCTCCTCCCCGGCAGAGG
AAGTAAACCAAAGTGAAATAGCAATCACTGTCCGGTTTGCAACCGCTGTTGATCATTCTCTTAAAAAGCGTCTTAGCTTCATCCAGATTTCCTTCCTTACAGAATCCATG
AATCAATTCGCAAGAACAATAGCGTGGCAAGCGAAACGCTCGTTTTTCAGGTCGGGGCGAGATTTCGATTCCTCGAGTAACCGGCTAATCCCATCGAAGTGCTTGGACCC
TGAAAGCTTGGAAATGGCAATAGAGAAAGCGATGCGATCGAGTTGTGATTCTGGAGTGAGAGCAGCGGCGCGGCAAATATCAATGATACGTTCAGGGATTAGTGGAGCCG
GGGGCAAGGATTGTCGAGAGAGAACGGTAGTGAAGACGATAGTTCGTGTAAATCGAGTTCGAAGGGAAAGCATAGCGGAGGCGATAGAGAAGGCCATGGCGAACCGAGAA
TGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCAACTGCTCCTGTCGCCATAGGAACTCGAGGCACCGTGGGCTCGCTCGTCAAGAAGGAAATTGAATATTTCGCCAAGATTGAGCTAGAGAGATGTAGCAGCAG
CAGCTCACAGAGGCCTCAAGGACCCGCTGTAAATGACATGGCTTCTTCCAGCCGCAGCAGTTCCCCACCCACATTCTGGTACACGGTAATGTCCTGGCGAAGGAAGAAGA
AGAGAACCAGCAATCGTTTTGTCCCGAAAATTTGCTCGGCTTTTGATGTTTCAGAAAGCAATCGGCTAAATACGATTTCTGGGTTCAATTATAAGATCCTTCAGAACGAT
TTCAACAGCTTACACATCTTCAATTTCATTCCACTTCTCAACATTCTTCGAGAACCTCTCCTTGATTTGCCCAATAAGTTGCTTTGCCTCTTCGACCTTCGAAATGCTAA
CCAGCCCGTCGACAAGAGACTTCATTGTAGAAATATTTGGAACCCACCCCTTCTCCATGCTTTCCACACAGATCTTGAATGCTGTATCATAATCTCCTCCCCGGCAGAGG
AAGTAAACCAAAGTGAAATAGCAATCACTGTCCGGTTTGCAACCGCTGTTGATCATTCTCTTAAAAAGCGTCTTAGCTTCATCCAGATTTCCTTCCTTACAGAATCCATG
AATCAATTCGCAAGAACAATAGCGTGGCAAGCGAAACGCTCGTTTTTCAGGTCGGGGCGAGATTTCGATTCCTCGAGTAACCGGCTAATCCCATCGAAGTGCTTGGACCC
TGAAAGCTTGGAAATGGCAATAGAGAAAGCGATGCGATCGAGTTGTGATTCTGGAGTGAGAGCAGCGGCGCGGCAAATATCAATGATACGTTCAGGGATTAGTGGAGCCG
GGGGCAAGGATTGTCGAGAGAGAACGGTAGTGAAGACGATAGTTCGTGTAAATCGAGTTCGAAGGGAAAGCATAGCGGAGGCGATAGAGAAGGCCATGGCGAACCGAGAA
TGA
Protein sequenceShow/hide protein sequence
MAATAPVAIGTRGTVGSLVKKEIEYFAKIELERCSSSSSQRPQGPAVNDMASSSRSSSPPTFWYTVMSWRRKKKRTSNRFVPKICSAFDVSESNRLNTISGFNYKILQND
FNSLHIFNFIPLLNILREPLLDLPNKLLCLFDLRNANQPVDKRLHCRNIWNPPLLHAFHTDLECCIIISSPAEEVNQSEIAITVRFATAVDHSLKKRLSFIQISFLTESM
NQFARTIAWQAKRSFFRSGRDFDSSSNRLIPSKCLDPESLEMAIEKAMRSSCDSGVRAAARQISMIRSGISGAGGKDCRERTVVKTIVRVNRVRRESIAEAIEKAMANRE