; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr000853 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr000853
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF674)
Genome locationtig00000568:45831..46512
RNA-Seq ExpressionSgr000853
SyntenySgr000853
Gene Ontology termsNA
InterPro domainsIPR007750 - Protein of unknown function DUF674


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147723.1 uncharacterized protein LOC101207526 [Cucumis sativus]9.9e-5263.1Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL
        M G L NLY SVEALN+ YL PN+ KD+LLKPKVSF  ST+LLPNI+S A     YLC +     C  +V   P  +CP CR  MS+   FV PPSA   
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL

Query:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKLPI
           D   + GGFVKG+VTYMVMDDL+VKPMSTISSITLLNKF++KEVGALEEKVVTLDV++G+KLLK SL SKTVLTDVF+ R L +
Subjt:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKLPI

XP_008461735.1 PREDICTED: uncharacterized protein LOC103500268 [Cucumis melo]4.7e-5465.22Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL
        MVG L NLYESVEALN+ YL PN+ KD LLKPKVSF  ST+LLPNI+S A    FYLC +     C  +V   P  +CP+CR  MS+    V PP+A T 
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL

Query:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK
           D   + GGFVKG+VTYMVMDDL+VKPMSTISSITLLNKF++KEVGALEEKV+TLDVN+GVKLL+ SL SKTVLTDVF+ RK
Subjt:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK

XP_022138964.1 uncharacterized protein LOC111010013 [Momordica charantia]3.7e-7580.65Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCG--STMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSAR
        MVGCLGNLYESVE LN+ YL PN+ KD LLKPKVSFCG  STMLLPNID S AA TFYLCNST +ANCRRSV DGPN ICP C V M+QVGTFV+PPSA 
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCG--STMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSAR

Query:  TLTQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK
          T A  ++D+GGFVKG+VTYMVMDDL+VKPMSTISSI LLNKF+VKEVGALEEKVVTLDVNEGVKLLK SLHSKTVLTDVFI+RK
Subjt:  TLTQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK

XP_022139195.1 uncharacterized protein LOC111010165 [Momordica charantia]2.0e-5766.33Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID---SPAAA--TFYLCNSTV-YANCRRSVFDGPNEICPTCRVLMSQVGTFVQPP
        MVGCLGNLYESVE LN+ YL   + KD LLKPK S   ST+LLPN+D    PAAA  T YLC S+  YA+CR SV D PN ICPT +  MSQVGTFV+P 
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID---SPAAA--TFYLCNSTV-YANCRRSVFDGPNEICPTCRVLMSQVGTFVQPP

Query:  SARTLTQAD-------TRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKLPI
        SA     A        T +D+GGFVKG+VTYMVM+DLTVKPMSTISSI LLNK +VKEVG+L+EKVVT  V+EGVKLLK SLHSKTVLTDVF++RKL I
Subjt:  SARTLTQAD-------TRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKLPI

XP_022139199.1 uncharacterized protein LOC111010168 [Momordica charantia]4.9e-5966.67Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL
        MVGCLGNLY+SVE LN+ YL   + K+TLL PKVS CGSTMLLP+++ S AA TFY C+   Y NCR  V DGPN  CP C+  M+QV T+VQPPS    
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL

Query:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKR
               DQGG+VK +VTYMVMDDLTVKPMSTISSITLLNKF++KEVGALEEK++T+D N+GVKLLK SL SKTVLTDVF+K+
Subjt:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKR

TrEMBL top hitse value%identityAlignment
A0A1S3CGQ2 uncharacterized protein LOC1035002682.3e-5465.22Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL
        MVG L NLYESVEALN+ YL PN+ KD LLKPKVSF  ST+LLPNI+S A    FYLC +     C  +V   P  +CP+CR  MS+    V PP+A T 
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL

Query:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK
           D   + GGFVKG+VTYMVMDDL+VKPMSTISSITLLNKF++KEVGALEEKV+TLDVN+GVKLL+ SL SKTVLTDVF+ RK
Subjt:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK

A0A5A7U8V2 DUF674 domain-containing protein2.3e-5465.22Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL
        MVG L NLYESVEALN+ YL PN+ KD LLKPKVSF  ST+LLPNI+S A    FYLC +     C  +V   P  +CP+CR  MS+    V PP+A T 
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPA-AATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL

Query:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK
           D   + GGFVKG+VTYMVMDDL+VKPMSTISSITLLNKF++KEVGALEEKV+TLDVN+GVKLL+ SL SKTVLTDVF+ RK
Subjt:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK

A0A6J1CBJ8 uncharacterized protein LOC1110100131.8e-7580.65Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCG--STMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSAR
        MVGCLGNLYESVE LN+ YL PN+ KD LLKPKVSFCG  STMLLPNID S AA TFYLCNST +ANCRRSV DGPN ICP C V M+QVGTFV+PPSA 
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCG--STMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSAR

Query:  TLTQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK
          T A  ++D+GGFVKG+VTYMVMDDL+VKPMSTISSI LLNKF+VKEVGALEEKVVTLDVNEGVKLLK SLHSKTVLTDVFI+RK
Subjt:  TLTQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK

A0A6J1CBM5 uncharacterized protein LOC1110101659.9e-5866.33Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID---SPAAA--TFYLCNSTV-YANCRRSVFDGPNEICPTCRVLMSQVGTFVQPP
        MVGCLGNLYESVE LN+ YL   + KD LLKPK S   ST+LLPN+D    PAAA  T YLC S+  YA+CR SV D PN ICPT +  MSQVGTFV+P 
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID---SPAAA--TFYLCNSTV-YANCRRSVFDGPNEICPTCRVLMSQVGTFVQPP

Query:  SARTLTQAD-------TRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKLPI
        SA     A        T +D+GGFVKG+VTYMVM+DLTVKPMSTISSI LLNK +VKEVG+L+EKVVT  V+EGVKLLK SLHSKTVLTDVF++RKL I
Subjt:  SARTLTQAD-------TRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKLPI

A0A6J1CC90 uncharacterized protein LOC1110101682.4e-5966.67Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL
        MVGCLGNLY+SVE LN+ YL   + K+TLL PKVS CGSTMLLP+++ S AA TFY C+   Y NCR  V DGPN  CP C+  M+QV T+VQPPS    
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNID-SPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTL

Query:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKR
               DQGG+VK +VTYMVMDDLTVKPMSTISSITLLNKF++KEVGALEEK++T+D N+GVKLLK SL SKTVLTDVF+K+
Subjt:  TQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09110.1 Protein of unknown function (DUF674)3.2e-0827.17Show/hide
Query:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGS--TMLLPNIDSPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSART
        +VGCL NLY+SV  ++        CK  LL P+ S  GS    L  NID   A  F++C + V     R +F   + +   C   M +            
Subjt:  MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGS--TMLLPNIDSPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSART

Query:  LTQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKR
            + +Q  G F     ++++ DDL V   S    + +LN F       L+E ++ +   E + LL     S+  LTD F+++
Subjt:  LTQADTRQDQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKR

AT3G09140.1 Protein of unknown function (DUF674)1.3e-0435.71Show/hide
Query:  GFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVF
        GF+K    ++V DDL +KP + +S+I+LL      +   +EE V+T+   E + LL+ SL + + LT  F
Subjt:  GFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVF

AT5G01150.1 Protein of unknown function (DUF674)4.6e-0726.84Show/hide
Query:  VGCLGNLYESVEALNEMYLPPNECKDTLLKPK-VSFCGSTMLLPNIDSPAAATFYLCNS--TVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSART
        +GC  NLY SV  +         CK  L+ PK V       L  NI+       + C+S   +Y+N   S           CR      G F+       
Subjt:  VGCLGNLYESVEALNEMYLPPNECKDTLLKPK-VSFCGSTMLLPNIDSPAAATFYLCNS--TVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSART

Query:  LTQADTRQ----DQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKL
          + D  +    D G FV G  ++++ DDL V   ST   +  L      +VG L E+++ + V E + LL     S   L D+F+ +K+
Subjt:  LTQADTRQ----DQGGFVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKL

AT5G43240.1 Protein of unknown function (DUF674)1.1e-0826.09Show/hide
Query:  VGCLGNLYESVEALNEMYLPPNECKDTLLKP-KVSFCGSTMLLPNIDSPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTLT
        +GC  N+Y SV ++   +     CK  LL P  ++      L   +D   A  +++C   V        +   N    +C VLM++V    Q      L 
Subjt:  VGCLGNLYESVEALNEMYLPPNECKDTLLKP-KVSFCGSTMLLPNIDSPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTLT

Query:  QADTRQDQGGFVKG-MVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK
         A    + G FV+    ++M+ DDL V+  S   ++ +L      +   L+EK+  +++ E   LL+    S   LTD F+K+K
Subjt:  QADTRQDQGGFVKG-MVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK

AT5G43240.3 Protein of unknown function (DUF674)1.1e-0826.09Show/hide
Query:  VGCLGNLYESVEALNEMYLPPNECKDTLLKP-KVSFCGSTMLLPNIDSPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTLT
        +GC  N+Y SV ++   +     CK  LL P  ++      L   +D   A  +++C   V        +   N    +C VLM++V    Q      L 
Subjt:  VGCLGNLYESVEALNEMYLPPNECKDTLLKP-KVSFCGSTMLLPNIDSPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTLT

Query:  QADTRQDQGGFVKG-MVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK
         A    + G FV+    ++M+ DDL V+  S   ++ +L      +   L+EK+  +++ E   LL+    S   LTD F+K+K
Subjt:  QADTRQDQGGFVKG-MVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGGTGCTTGGGAAATTTGTACGAGAGTGTGGAAGCTTTGAACGAGATGTACTTGCCGCCAAATGAGTGCAAAGACACCCTTTTGAAGCCCAAAGTCTCGTTTTG
TGGTTCCACCATGCTTTTGCCTAACATCGATTCTCCAGCTGCAGCTACATTTTATTTGTGCAATTCAACTGTTTATGCTAATTGCCGCCGTTCAGTTTTTGATGGTCCTA
ATGAAATTTGTCCAACATGTAGGGTTCTCATGAGCCAAGTGGGCACATTTGTGCAGCCACCAAGTGCAAGGACACTCACACAAGCAGATACTAGGCAAGATCAGGGAGGA
TTTGTAAAAGGAATGGTGACTTATATGGTGATGGATGACTTGACTGTGAAGCCCATGTCCACCATTTCCAGCATAACCCTTTTGAACAAGTTCGATGTCAAGGAAGTGGG
TGCTTTGGAGGAGAAAGTTGTTACTTTGGATGTCAATGAGGGTGTGAAATTGTTGAAGGTTTCTCTTCACTCCAAGACTGTTCTCACCGATGTCTTCATTAAAAGAAAGC
TTCCAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGGGTGCTTGGGAAATTTGTACGAGAGTGTGGAAGCTTTGAACGAGATGTACTTGCCGCCAAATGAGTGCAAAGACACCCTTTTGAAGCCCAAAGTCTCGTTTTG
TGGTTCCACCATGCTTTTGCCTAACATCGATTCTCCAGCTGCAGCTACATTTTATTTGTGCAATTCAACTGTTTATGCTAATTGCCGCCGTTCAGTTTTTGATGGTCCTA
ATGAAATTTGTCCAACATGTAGGGTTCTCATGAGCCAAGTGGGCACATTTGTGCAGCCACCAAGTGCAAGGACACTCACACAAGCAGATACTAGGCAAGATCAGGGAGGA
TTTGTAAAAGGAATGGTGACTTATATGGTGATGGATGACTTGACTGTGAAGCCCATGTCCACCATTTCCAGCATAACCCTTTTGAACAAGTTCGATGTCAAGGAAGTGGG
TGCTTTGGAGGAGAAAGTTGTTACTTTGGATGTCAATGAGGGTGTGAAATTGTTGAAGGTTTCTCTTCACTCCAAGACTGTTCTCACCGATGTCTTCATTAAAAGAAAGC
TTCCAATTTGA
Protein sequenceShow/hide protein sequence
MVGCLGNLYESVEALNEMYLPPNECKDTLLKPKVSFCGSTMLLPNIDSPAAATFYLCNSTVYANCRRSVFDGPNEICPTCRVLMSQVGTFVQPPSARTLTQADTRQDQGG
FVKGMVTYMVMDDLTVKPMSTISSITLLNKFDVKEVGALEEKVVTLDVNEGVKLLKVSLHSKTVLTDVFIKRKLPI