; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021936 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021936
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionsmall acidic protein-like isoform X2
Genome locationtig00153841:1435819..1436936
RNA-Seq ExpressionSgr021936
SyntenySgr021936
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR036020 - WW domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039848.1 uncharacterized protein E6C27_scaffold122G001230 [Cucumis melo var. makuwa]4.1e-6675.76Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV
        MAA+TDSL+QSFRNFSLN RL+SA   A SS GVRR     SSSSSSSDDE HL    H+RFDT LELNSHISLPPFWEQCLDLKTGEVYY NCRTGMKV
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV

Query:  KEDPRTAEAHGQDFYWEDEG-----SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        KEDPRTA AH +D Y ED+G     SSSDG  E S SSS  GGSR QY A + E+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRSD+ NGF
Subjt:  KEDPRTAEAHGQDFYWEDEG-----SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

KAG6576106.1 hypothetical protein SDJN03_26745, partial [Cucurbita argyrosperma subsp. sororia]6.3e-6776.92Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR
        MAA+TDSL+QSFR FSLN RL S    AA S GVRRSSSSSSS DE HL    HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYY NCRTGM+V EDPR
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR

Query:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        TAEAH +D Y E       DE SS DGSEES SSSS  G SR QY  G EE+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRS+D NGF
Subjt:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

XP_022953156.1 uncharacterized protein LOC111455780 [Cucurbita moschata]8.2e-6776.92Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR
        MAA+TDSL+QSFR FSLN RL S    AA S GVRRSSSSSSS DE HL    HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYY NCRTGM+V EDPR
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR

Query:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        TAEAH +D Y E       DE SS DGSEES SSSS  G SR QY  G EE+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRS+D NGF
Subjt:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

XP_022991817.1 uncharacterized protein LOC111488350 [Cucurbita maxima]2.4e-6675.38Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDE-----HHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDP
        MAA+TDSL+QSFR FSLN RL S     A S GVRRSSSSSSS DE     HH HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYY NCRTGM+V EDP
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDE-----HHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDP

Query:  RTAEAHGQDFYWEDEG------SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        RTAEAHG+D Y ED+       SSSD S E S SSS  G SR QY  G EE+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRS+D NGF
Subjt:  RTAEAHGQDFYWEDEG------SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

XP_023547639.1 uncharacterized protein LOC111806522 [Cucurbita pepo subsp. pepo]5.3e-6675.9Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR
        MAA+TDSL+QSFR FSLN RL S    A  S GVRRSSSSSSS D+ HL    HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYY NCRTGM+V EDPR
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR

Query:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        TAEAH +D Y E       DE SS DGSEES SSSS  G SR QY  G EE+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRS+D NGF
Subjt:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

TrEMBL top hitse value%identityAlignment
A0A0A0KF89 Uncharacterized protein1.3e-6574.87Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV
        MAA+TDSL+QSFRNFSLN RL+SA   A SS GVRR     SSSSSSSDDE HL    H+RFDT LELNSHISLPPFWEQCLDLKTGEVYY NCRTGMKV
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV

Query:  KEDPRTAEAHGQDFYWEDEG------SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        KEDPRTA AH +D Y ED+       SSSDG  E S SSS  GGSR QY A + E+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRSD+ NGF
Subjt:  KEDPRTAEAHGQDFYWEDEG------SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

A0A1S4E3A8 uncharacterized protein LOC1034988821.3e-6575.77Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV
        MAA+TDSL+QSFRNFSLN RL+SA   A SS GVRR     SSSSSSSDDE HL    H+RFDT LELNSHISLPPFWEQCLDLKTGEVYY NCRTGMKV
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV

Query:  KEDPRTAEAHGQDFYWEDEG-----SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD
        KEDPRTA AH +D Y ED+G     SSSDG  E S SSS  GGSR QY A + E+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRSD+
Subjt:  KEDPRTAEAHGQDFYWEDEG-----SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD

A0A5A7TA84 Uncharacterized protein2.0e-6675.76Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV
        MAA+TDSL+QSFRNFSLN RL+SA   A SS GVRR     SSSSSSSDDE HL    H+RFDT LELNSHISLPPFWEQCLDLKTGEVYY NCRTGMKV
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRR-----SSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKV

Query:  KEDPRTAEAHGQDFYWEDEG-----SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        KEDPRTA AH +D Y ED+G     SSSDG  E S SSS  GGSR QY A + E+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRSD+ NGF
Subjt:  KEDPRTAEAHGQDFYWEDEG-----SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

A0A6J1GM82 uncharacterized protein LOC1114557804.0e-6776.92Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR
        MAA+TDSL+QSFR FSLN RL S    AA S GVRRSSSSSSS DE HL    HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYY NCRTGM+V EDPR
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHL----HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR

Query:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        TAEAH +D Y E       DE SS DGSEES SSSS  G SR QY  G EE+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRS+D NGF
Subjt:  TAEAHGQDFYWE-------DEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

A0A6J1JMX9 uncharacterized protein LOC1114883501.2e-6675.38Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDE-----HHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDP
        MAA+TDSL+QSFR FSLN RL S     A S GVRRSSSSSSS DE     HH HHRFDTTLELNSHISLPPFWEQCLDLKTGEVYY NCRTGM+V EDP
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDE-----HHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDP

Query:  RTAEAHGQDFYWEDEG------SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF
        RTAEAHG+D Y ED+       SSSD S E S SSS  G SR QY  G EE+VLVVAGCKRCFMYFMVPKQVEDCPKC SSRLVHFDRS+D NGF
Subjt:  RTAEAHGQDFYWEDEG------SSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKC-SSRLVHFDRSDD-NGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G28070.1 unknown protein7.2e-2943.85Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPRTAEA
        MA IT+ L++S +N SL  R +S                   SD+   +  RF   LEL+SH S+P   EQCLDLKTGE+YY +  +GM+VKEDPR + +
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPRTAEA

Query:  HGQ-----------DFYWEDEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKCSSRLVHFDR
         G              +  +E SS   SEESSS SS    SR  ++   +E+VLVVAGCK C MYFMVPK  +DCPKC+++L+HFD+
Subjt:  HGQ-----------DFYWEDEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKCSSRLVHFDR

AT2G33510.1 CONTAINS InterPro DOMAIN/s: WW/Rsp5/WWP (InterPro:IPR001202)1.9e-3751.32Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR---T
        M  IT+SL++S  N SLN R     G+     G  RSSS+       H+    D TLELNSH+SLP  WEQCLDLKTGE+YY+N + GM+VKEDPR    
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPR---T

Query:  AEAHGQDFY---WEDEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEE-------VLVVAGCKRCFMYFMVPKQVEDCPKCSSRLVHFDR
        A+    D Y     +E SS   SEESSS SS      ++ +   EEE       VLVVAGCK CFMYFMVPK VEDCPKC+++L+HFDR
Subjt:  AEAHGQDFY---WEDEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEE-------VLVVAGCKRCFMYFMVPKQVEDCPKCSSRLVHFDR

AT2G33510.2 unknown protein2.0e-3447.55Show/hide
Query:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHLHHRFDTTLELNSHISLPPFWEQCLDLK---------------TGEVYYMNC
        M  IT+SL++S  N SLN R     G+     G  RSSS+       H+    D TLELNSH+SLP  WEQCLDLK               TGE+YY+N 
Subjt:  MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHLHHRFDTTLELNSHISLPPFWEQCLDLK---------------TGEVYYMNC

Query:  RTGMKVKEDPR---TAEAHGQDFY---WEDEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEE-------VLVVAGCKRCFMYFMVPKQVEDCPKCSSRLV
        + GM+VKEDPR    A+    D Y     +E SS   SEESSS SS      ++ +   EEE       VLVVAGCK CFMYFMVPK VEDCPKC+++L+
Subjt:  RTGMKVKEDPR---TAEAHGQDFY---WEDEGSSSDGSEESSSSSSCGGGSRNQYQAGSEEE-------VLVVAGCKRCFMYFMVPKQVEDCPKCSSRLV

Query:  HFDR
        HFDR
Subjt:  HFDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCGATTACTGATTCTCTACAGCAGTCTTTCCGTAACTTCTCCCTCAACCAGCGCCTCACCTCCGCTTCCGGTGAGGCCGCGTCCTCTCCCGGGGTTCGGAGGTC
GTCGTCTTCTTCTTCTTCCGACGACGAACACCATCTTCATCATCGCTTCGACACCACCTTGGAGCTCAACTCTCACATCTCTCTCCCTCCCTTCTGGGAACAATGTCTCG
ATTTAAAGACAGGGGAAGTTTACTACATGAACTGCCGGACCGGAATGAAAGTAAAGGAAGATCCGAGGACGGCGGAAGCACACGGCCAAGATTTCTACTGGGAAGACGAG
GGGAGCTCGTCGGACGGCAGCGAGGAGTCGTCTTCTTCGTCGTCCTGCGGTGGTGGTAGTAGAAATCAATACCAGGCGGGAAGCGAGGAGGAGGTGTTGGTGGTGGCCGG
CTGCAAGAGATGCTTCATGTACTTCATGGTGCCGAAACAGGTGGAAGATTGCCCCAAATGCAGCAGTCGTCTTGTTCATTTCGATCGCTCCGACGACAATGGCTTCGCAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCGATTACTGATTCTCTACAGCAGTCTTTCCGTAACTTCTCCCTCAACCAGCGCCTCACCTCCGCTTCCGGTGAGGCCGCGTCCTCTCCCGGGGTTCGGAGGTC
GTCGTCTTCTTCTTCTTCCGACGACGAACACCATCTTCATCATCGCTTCGACACCACCTTGGAGCTCAACTCTCACATCTCTCTCCCTCCCTTCTGGGAACAATGTCTCG
ATTTAAAGACAGGGGAAGTTTACTACATGAACTGCCGGACCGGAATGAAAGTAAAGGAAGATCCGAGGACGGCGGAAGCACACGGCCAAGATTTCTACTGGGAAGACGAG
GGGAGCTCGTCGGACGGCAGCGAGGAGTCGTCTTCTTCGTCGTCCTGCGGTGGTGGTAGTAGAAATCAATACCAGGCGGGAAGCGAGGAGGAGGTGTTGGTGGTGGCCGG
CTGCAAGAGATGCTTCATGTACTTCATGGTGCCGAAACAGGTGGAAGATTGCCCCAAATGCAGCAGTCGTCTTGTTCATTTCGATCGCTCCGACGACAATGGCTTCGCAT
GA
Protein sequenceShow/hide protein sequence
MAAITDSLQQSFRNFSLNQRLTSASGEAASSPGVRRSSSSSSSDDEHHLHHRFDTTLELNSHISLPPFWEQCLDLKTGEVYYMNCRTGMKVKEDPRTAEAHGQDFYWEDE
GSSSDGSEESSSSSSCGGGSRNQYQAGSEEEVLVVAGCKRCFMYFMVPKQVEDCPKCSSRLVHFDRSDDNGFA