CuGenDBv2

Sgr024097 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024097
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: NHL domain protein
Genome location: tig00001047:3179245..3179709
RNA-Seq Expression: Sgr024097
Synteny: Sgr024097
Gene Ontology terms: NA
InterPro domains: NA
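
The Genome location field above can be unpacked with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates follow the common `contig:start..end` convention with 1-based, inclusive positions.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'contig:start..end' into (contig, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00001047:3179245..3179709")
span = end - start + 1  # inclusive span, under the assumption above
print(contig, span, span % 3 == 0)  # tig00001047 465 True
```

Under that assumption the gene spans 465 bp, which is what a 154-residue protein plus a stop codon would require (155 codons), so the record appears consistent with a single uninterrupted coding region.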


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602448.1 hypothetical protein SDJN03_07681, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.7e-70 | % identity: 80
Query:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF
        MSV+QSPDLS D +G+H DD +HEAAFASRGCCFW+PCLRSNPS++   WWERIRAA+N+D+WWLRGWKR R+WSEIVAGPKWKTFIRQF+KNR+RQ+T+
Subjt:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF

Query:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        RYDPLSY+LNFDEGP  DDP SED++RRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

XP_022961939.1 uncharacterized protein LOC111462557 [Cucurbita moschata] | e value: 1.5e-69 | % identity: 80
Query:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF
        MSV QSPDLS D +G+H DD +HEAAFASRGCCFW+PCLRSNPS++   WWERIRAA+N+D+WWLRGWKR R+WSEIVAGPKWKTFIRQF+KNR+RQ+T+
Subjt:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF

Query:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        RYDPLSY+LNFDEGP  DDP SED++RRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

XP_023533538.1 uncharacterized protein LOC111795380 [Cucurbita pepo subsp. pepo] | e value: 3.3e-69 | % identity: 79.35
Query:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF
        MSV+QSPDLS D +G+H DD +HEAAFASRGCCFW+PCLRS+PS++   WWERIRAA+N+D+WWLRGWKR R+WSEIVAGPKWKTFIRQF+KNR+RQ+T+
Subjt:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF

Query:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        RYDPLSY+LNFDEGP  DDP SED++RRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

XP_031741765.1 uncharacterized protein LOC116403956 [Cucumis sativus] | e value: 1.5e-69 | % identity: 79.35
Query:  MSVAQSPDLSPDGDGY-HGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF
        MS+A SPDLS D +G+ H DD+H+A FA+RGCC WIPCLRSN SQ+   WWERIRAA+N+D+WWL+GWKR REWSEIVAGPKWKTFIRQF+KNRNRQ+TF
Subjt:  MSVAQSPDLSPDGDGY-HGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF

Query:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        RYDPLSYSLNFDEGP HDDP ++D++RRDFSTRFAAIPASAKSSMDLGKD PSFI
Subjt:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

XP_038889212.1 uncharacterized protein LOC120079097 [Benincasa hispida] | e value: 2.2e-73 | % identity: 82.47
Query:  MSVAQSPDLSPDGDGYHGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATFR
        MS+AQSPDLS D + +HGDD+H+AAF SRGCC W+PCLRSNPSQ+   WWERIRAA+N+D+WWLRGWKR REWSEI+AGPKWKTFIRQFNKNRNRQ+TFR
Subjt:  MSVAQSPDLSPDGDGYHGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATFR

Query:  YDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        YDPLSYSLNFDEGP HDDP S+D +RRDFSTRFAAIPASAKSSMDLGKDGP FI
Subjt:  YDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KRY7 Uncharacterized protein | e value: 7.2e-70 | % identity: 79.35
Query:  MSVAQSPDLSPDGDGY-HGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF
        MS+A SPDLS D +G+ H DD+H+A FA+RGCC WIPCLRSN SQ+   WWERIRAA+N+D+WWL+GWKR REWSEIVAGPKWKTFIRQF+KNRNRQ+TF
Subjt:  MSVAQSPDLSPDGDGY-HGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF

Query:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        RYDPLSYSLNFDEGP HDDP ++D++RRDFSTRFAAIPASAKSSMDLGKD PSFI
Subjt:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

A0A1S4E267 uncharacterized protein LOC107991628 | e value: 3.9e-68 | % identity: 77.27
Query:  MSVAQSPDLSPDGDGYHGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATFR
        MS+AQSPD S D +G+H +D+H+A FA+ GCCFWIPCLRSN SQ+   WWERIRAA+N+D+WWLRGWKR REWSEIVAGPKWKTFIRQF+KNRNRQ+TFR
Subjt:  MSVAQSPDLSPDGDGYHGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATFR

Query:  YDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        YDPLSYSLNFDEGP H+D  ++D++RRDFSTRFAAIPASAKSSMDL KD PS I
Subjt:  YDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

A0A5A7T228 Putative NHL domain-containing protein | e value: 2.2e-66 | % identity: 77.85
Query:  MSVAQSPDLSPDGDGYHGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATFR
        MS+AQSPD S D +G+H +D+H+A FA+ GCCFWIPCLRSN SQ+   WWERIRAA+N+D+WWLRGWKR REWSEIVAGPKWKTFIRQF+KNRNRQ+TFR
Subjt:  MSVAQSPDLSPDGDGYHGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATFR

Query:  YDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKD
        YDPLSYSLNFDEGP H+D  ++D++RRDFSTRFAAIPASAKSSMDL KD
Subjt:  YDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKD

A0A6J1HFG4 uncharacterized protein LOC111462557 | e value: 7.2e-70 | % identity: 80
Query:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF
        MSV QSPDLS D +G+H DD +HEAAFASRGCCFW+PCLRSNPS++   WWERIRAA+N+D+WWLRGWKR R+WSEIVAGPKWKTFIRQF+KNR+RQ+T+
Subjt:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF

Query:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        RYDPLSY+LNFDEGP  DDP SED++RRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

A0A6J1JTQ3 uncharacterized protein LOC111487422 | e value: 4.6e-69 | % identity: 78.71
Query:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF
        MSV+QSPDLS D +G+H DD +HEAAFASRGCCFW+PCLRSNPS+    WWERIRAA+N+D+WWLRGWKR R+WSEIVAGPKWKTFIRQF+KNR+RQ+T+
Subjt:  MSVAQSPDLSPDGDGYHGDD-IHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATF

Query:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI
        RYDPLSY+LNFD+GP  DDP  ED++RRDFS RFA+IPASAKSSMDLGKDGPSFI
Subjt:  RYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) | e value: 2.2e-34 | % identity: 46.06
Query:  DDIHEAAFASRGCCFWIPCLRSN--PSQAGSPWWERIRAA---ENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNR--------------------
        DD+HEA FA RGCCF +PCL S+   ++ GS WW+RI      E +++WW+RGW+R+REWSE+VAGP+WKT+IR+F ++                     
Subjt:  DDIHEAAFASRGCCFWIPCLRSN--PSQAGSPWWERIRAA---ENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNR--------------------

Query:  --NR---QATFRYDPLSYSLNFDEG--PGHDDPVSEDYLRRDFSTRFAA--IPASAKSSMDLGKD
          NR   Q  FRYD LSYSLNFD+G   GH D   +++  RD+S RFAA  +P S K S+D   D
Subjt:  --NR---QATFRYDPLSYSLNFDEG--PGHDDPVSEDYLRRDFSTRFAA--IPASAKSSMDLGKD

AT3G48020.1 unknown protein | e value: 7.9e-21 | % identity: 45.95
Query:  SPWWERI-RAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNR------QATFRYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIP-A
        S WW+RI R    E +WW+R + ++REWSEIVAGP+WKTFIR+FN++  R         FRYDP+SY+L+F++    DD  +     R FS R+A++P A
Subjt:  SPWWERI-RAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNR------QATFRYDPLSYSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIP-A

Query:  SAKSSMDLGKD
        S KS   +  D
Subjt:  SAKSSMDLGKD

AT5G14890.1 NHL domain-containing protein | e value: 1.3e-34 | % identity: 48.68
Query:  DDIHEAAFASRGCCFWIPCLRSNPSQA--GSPWWERIRAA---ENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNK------------NRNRQATFRY
        D++HEA FA RGCCF +PCL S+      GS WW+RIR     E +++WW+ GW ++REWSEIVAGPKWKTFIR+F +            NR    +FRY
Subjt:  DDIHEAAFASRGCCFWIPCLRSNPSQA--GSPWWERIRAA---ENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNK------------NRNRQATFRY

Query:  DPLSYSLNFDEG--PGHDDPVSEDYLRRDFSTRFAA--IPASAKSSMDLGKD
        D  SYSLNFD+G   GH     +++  RD+S RFAA  +P S K S+D   D
Subjt:  DPLSYSLNFDEG--PGHDDPVSEDYLRRDFSTRFAA--IPASAKSSMDLGKD

AT5G25240.1 unknown protein | e value: 1.4e-09 | % identity: 32.8
Query:  LSPDGDGYHGDDIHEAAFASRGCCF-----WIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNR---NRQATFR
        ++ D +    DD  E A A  GC +     +    R +        W      E    W     K L+E SE +AGPKWK FIR F+  R    R   F 
Subjt:  LSPDGDGYHGDDIHEAAFASRGCCF-----WIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNR---NRQATFR

Query:  YDPLSYSLNFDEGPGHDDPVSEDYL
        YD  +YSLNFD+G    D   E ++
Subjt:  YDPLSYSLNFDEGPGHDDPVSEDYL

AT5G62865.1 unknown protein | e value: 4.2e-22 | % identity: 43.94
Query:  EAAFASRGCCFWIPCLRSNPSQ--AGSPWWERIRAAE---------NEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNR------QATFRYDPLS
        +  +  R CCF  P  R + S    G   W RIR  +         +E +WW+R   ++REWSEIVAGP+WKTFIR+FN++  R         F+YDPLS
Subjt:  EAAFASRGCCFWIPCLRSNPSQ--AGSPWWERIRAAE---------NEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNR------QATFRYDPLS

Query:  YSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIP
        YSLNFD+    D+ V    L R FSTRFA++P
Subjt:  YSLNFDEGPGHDDPVSEDYLRRDFSTRFAAIP
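
The % identity values reported for the hits above can be related to the alignment midline (the unlabeled line between Query and Subjt). The following is a rough sketch only: the function name `percent_identity` is hypothetical, and it assumes the usual BLAST display convention in which the midline repeats the residue at identical positions, shows `+` for a conservative substitution, and is blank otherwise. For an alignment printed in several blocks, the blocks would need to be concatenated first, with the `Query:` label and leading padding removed.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns marked identical on the midline.

    Assumption: a letter on the midline means an identical residue;
    '+' or a space means a non-identical column. Gap columns in the
    query count toward the alignment length.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # Trailing spaces on the midline are often lost in copied text,
    # so pad it back to the query length before counting.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / columns


# Toy five-column alignment: three identical positions out of five.
print(percent_identity("MSVAQ", "MS+ Q"))  # 60.0
```

The figure obtained this way may differ slightly from the tabulated value, since the database may compute identity over a different alignment length.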


Sequences
CDS sequence
ATGTCTGTTGCTCAATCTCCCGACCTCTCGCCCGACGGAGATGGCTATCACGGCGACGACATTCACGAGGCGGCGTTTGCGAGCCGCGGTTGCTGCTTCTGGATCCCTTG
CCTGAGGTCGAATCCGTCGCAAGCCGGGTCGCCTTGGTGGGAGCGGATTCGGGCGGCGGAGAACGAAGACCAGTGGTGGCTCAGGGGGTGGAAGAGGCTCCGCGAGTGGT
CGGAGATTGTCGCCGGTCCGAAATGGAAAACGTTCATCCGTCAGTTCAATAAGAATCGCAACCGGCAAGCCACATTCCGCTACGATCCTCTCAGTTATTCTCTCAACTTC
GATGAAGGTCCCGGCCACGACGATCCTGTCAGCGAAGACTATTTACGCCGCGATTTCTCCACTCGGTTCGCCGCGATTCCGGCTTCGGCCAAGTCCTCCATGGACCTCGG
CAAGGACGGGCCATCCTTCATATGA
mRNA sequence
ATGTCTGTTGCTCAATCTCCCGACCTCTCGCCCGACGGAGATGGCTATCACGGCGACGACATTCACGAGGCGGCGTTTGCGAGCCGCGGTTGCTGCTTCTGGATCCCTTG
CCTGAGGTCGAATCCGTCGCAAGCCGGGTCGCCTTGGTGGGAGCGGATTCGGGCGGCGGAGAACGAAGACCAGTGGTGGCTCAGGGGGTGGAAGAGGCTCCGCGAGTGGT
CGGAGATTGTCGCCGGTCCGAAATGGAAAACGTTCATCCGTCAGTTCAATAAGAATCGCAACCGGCAAGCCACATTCCGCTACGATCCTCTCAGTTATTCTCTCAACTTC
GATGAAGGTCCCGGCCACGACGATCCTGTCAGCGAAGACTATTTACGCCGCGATTTCTCCACTCGGTTCGCCGCGATTCCGGCTTCGGCCAAGTCCTCCATGGACCTCGG
CAAGGACGGGCCATCCTTCATATGA
Protein sequence
MSVAQSPDLSPDGDGYHGDDIHEAAFASRGCCFWIPCLRSNPSQAGSPWWERIRAAENEDQWWLRGWKRLREWSEIVAGPKWKTFIRQFNKNRNRQATFRYDPLSYSLNF
DEGPGHDDPVSEDYLRRDFSTRFAAIPASAKSSMDLGKDGPSFI