; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012909 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012909
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153600:220169..220786
RNA-Seq ExpressionSgr012909
SyntenySgr012909
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]1.4e-2344.62Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW
        SNP L  ILNQ+T++K+DR N+LLW+ +AL IL+ +KL+GHLT +TPCP   ++    ++ ++   G           TP I NP ++ W+  D LL+GW
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW

Query:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY
        LYNSMT  VA Q+M +   +DLW A Q+F+
Subjt:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo]1.4e-2344.62Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW
        SNP L  ILNQ+T++K+DR N+LLW+ +AL IL+ +KL+GHLT +TPCP   ++    ++ ++   G           TP I NP ++ W+  D LL+GW
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW

Query:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY
        LYNSMT  VA Q+M +   +DLW A Q+F+
Subjt:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY

XP_016902205.1 PREDICTED: uncharacterized protein LOC107991581 isoform X5 [Cucumis melo]1.4e-2344.62Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW
        SNP L  ILNQ+T++K+DR N+LLW+ +AL IL+ +KL+GHLT +TPCP   ++    ++ ++   G           TP I NP ++ W+  D LL+GW
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW

Query:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY
        LYNSMT  VA Q+M +   +DLW A Q+F+
Subjt:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]5.4e-2852.89Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSII---VPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKV
        ++P L  +LNQ+TSIKMDR NFLLWQN+AL ILRS+KL  +LTG  PCP   ++    P   + S  +  +P  NP Y+ WI VD+LL+GWLYNSM + V
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSII---VPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKV

Query:  ASQVMRYDTAKDLWTALQEFY
        A QVM + T+++LWTA+QE +
Subjt:  ASQVMRYDTAKDLWTALQEFY

XP_031745012.1 uncharacterized protein LOC116405217 [Cucumis sativus]1.1e-2550.85Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKVASQ
        SNP L  ILNQ+ S+K+DR N+LLWQ +AL IL+S+KL GHLT +  CP    I+ P    S  +  T   NP++D W+  D LL+GW+YNSMT +VA Q
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKVASQ

Query:  VMRYDTAKDLWTALQEFY
        +M ++TAKDLW A+Q+ +
Subjt:  VMRYDTAKDLWTALQEFY

TrEMBL top hitse value%identityAlignment
A0A1S4DY80 uncharacterized protein LOC1079911166.7e-2444.27Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPC-PDLSIIVPPENDDSLDAIGTP------------IPNPEYDMWIAVDQLLVG
        +NP L  ILNQ+T+IK+DR N+LLW+ +AL IL+S+KL+ HL G++PC P + ++    N+  ++  G P              NP+Y+ WI  D LL+G
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPC-PDLSIIVPPENDDSLDAIGTP------------IPNPEYDMWIAVDQLLVG

Query:  WLYNSMTSKVASQVMRYDTAKDLWTALQEFY
        WLYNSMT +V  Q+M +  AKDLW A Q+ +
Subjt:  WLYNSMTSKVASQVMRYDTAKDLWTALQEFY

A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X16.7e-2444.62Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW
        SNP L  ILNQ+T++K+DR N+LLW+ +AL IL+ +KL+GHLT +TPCP   ++    ++ ++   G           TP I NP ++ W+  D LL+GW
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW

Query:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY
        LYNSMT  VA Q+M +   +DLW A Q+F+
Subjt:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY

A0A1S4E1U9 uncharacterized protein LOC107991581 isoform X46.7e-2444.62Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW
        SNP L  ILNQ+T++K+DR N+LLW+ +AL IL+ +KL+GHLT +TPCP   ++    ++ ++   G           TP I NP ++ W+  D LL+GW
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIG-----------TP-IPNPEYDMWIAVDQLLVGW

Query:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY
        LYNSMT  VA Q+M +   +DLW A Q+F+
Subjt:  LYNSMTSKVASQVMRYDTAKDLWTALQEFY

A0A5A7VPY0 Uncharacterized protein6.7e-2444.27Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPC-PDLSIIVPPENDDSLDAIGTP------------IPNPEYDMWIAVDQLLVG
        +NP L  ILNQ+T+IK+DR N+LLW+ +AL IL+S+KL+ HL G++PC P + ++    N+  ++  G P              NP+Y+ WI  D LL+G
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPC-PDLSIIVPPENDDSLDAIGTP------------IPNPEYDMWIAVDQLLVG

Query:  WLYNSMTSKVASQVMRYDTAKDLWTALQEFY
        WLYNSMT +V  Q+M +  AKDLW A Q+ +
Subjt:  WLYNSMTSKVASQVMRYDTAKDLWTALQEFY

A0A6J1DCW4 uncharacterized protein LOC1110195982.6e-2852.89Show/hide
Query:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSII---VPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKV
        ++P L  +LNQ+TSIKMDR NFLLWQN+AL ILRS+KL  +LTG  PCP   ++    P   + S  +  +P  NP Y+ WI VD+LL+GWLYNSM + V
Subjt:  SNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSII---VPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKV

Query:  ASQVMRYDTAKDLWTALQEFY
        A QVM + T+++LWTA+QE +
Subjt:  ASQVMRYDTAKDLWTALQEFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).6.2e-0627.78Show/hide
Query:  VTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKVASQVMRYDTAKDLW
        +  +  D  N++ W+    S LR  K  G + G  P PD                  P  +P Y  W   + +++ WL NSMT K+   VM  +TA  +W
Subjt:  VTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGWLYNSMTSKVASQVMRYDTAKDLW

Query:  TALQEFYV
          L+  +V
Subjt:  TALQEFYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCACTCGATTCCTGATATTTCCAAGCAGACCAAGGAAGGGTATTTTGCTTTGCTCCCACAAAGTAATCCTTTTCTTGCAACCATCCTCAACCAAGTAACCTCCAT
CAAAATGGATCGCACAAATTTTCTTCTTTGGCAAAATATTGCTCTCTCTATTCTTAGAAGTCACAAGCTAGATGGTCATCTCACTGGTAAAACTCCTTGCCCAGATCTCT
CTATTATAGTTCCACCTGAGAATGATGACTCTCTAGACGCCATTGGAACACCTATTCCTAATCCGGAGTACGACATGTGGATTGCTGTTGATCAGTTGCTTGTGGGCTGG
CTGTACAACTCGATGACCAGCAAAGTGGCTTCACAAGTTATGCGTTATGATACGGCAAAAGATCTCTGGACAGCTCTGCAGGAATTCTATGTTTTACTGATAATCTCCAG
CTTGCAAGTTAACCCATGCCCACAAGAAGTTTCATCTCGTATGTCTTTGCGGGATTGGATGAAGAACATAACCCTATT
mRNA sequenceShow/hide mRNA sequence
ATGGCTCACTCGATTCCTGATATTTCCAAGCAGACCAAGGAAGGGTATTTTGCTTTGCTCCCACAAAGTAATCCTTTTCTTGCAACCATCCTCAACCAAGTAACCTCCAT
CAAAATGGATCGCACAAATTTTCTTCTTTGGCAAAATATTGCTCTCTCTATTCTTAGAAGTCACAAGCTAGATGGTCATCTCACTGGTAAAACTCCTTGCCCAGATCTCT
CTATTATAGTTCCACCTGAGAATGATGACTCTCTAGACGCCATTGGAACACCTATTCCTAATCCGGAGTACGACATGTGGATTGCTGTTGATCAGTTGCTTGTGGGCTGG
CTGTACAACTCGATGACCAGCAAAGTGGCTTCACAAGTTATGCGTTATGATACGGCAAAAGATCTCTGGACAGCTCTGCAGGAATTCTATGTTTTACTGATAATCTCCAG
CTTGCAAGTTAACCCATGCCCACAAGAAGTTTCATCTCGTATGTCTTTGCGGGATTGGATGAAGAACATAACCCTATT
Protein sequenceShow/hide protein sequence
MAHSIPDISKQTKEGYFALLPQSNPFLATILNQVTSIKMDRTNFLLWQNIALSILRSHKLDGHLTGKTPCPDLSIIVPPENDDSLDAIGTPIPNPEYDMWIAVDQLLVGW
LYNSMTSKVASQVMRYDTAKDLWTALQEFYVLLIISSLQVNPCPQEVSSRMSLRDWMKNITLX