CuGenDBv2

Sgr022957 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022957
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Exostosin domain-containing protein
Genome location: tig00000729:1332172..1335635
RNA-Seq Expression: Sgr022957
Synteny: Sgr022957
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits: e value, % identity, alignment
XP_004150287.1 uncharacterized protein LOC101205851 isoform X2 [Cucumis sativus] (e value: 1.7e-73; % identity: 82.69)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE
        MV++ K  ++K+KP   DGGSRLKLE+SD+KKKI+SS KNSI  SK KSVS+VTK+EVKSKTISSSSKTTTKTTT TT TAK REKKVFNLPGQKYDPPE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE

Query:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI
        EREPLRIFYESLSKQIP SEMAEFWMMEHGMLSPEKAK+AYEKKLRRQKEQRTGTPIK  S K LSRPESSQ+ Q  SKNGD+KAKKKI+NDS DDDDFI
Subjt:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI

Query:  LSPKRRKM
        LSPKRRKM
Subjt:  LSPKRRKM

XP_008445018.1 PREDICTED: uncharacterized protein LOC103488185 isoform X1 [Cucumis melo] (e value: 3.4e-74; % identity: 83.17)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE
        MV++ K  ++K+KP   DGGSRLKLE+SD+KKKI+SS KNSI  SK KSVS+VTKSEVKSKTISSSSKTTTKTTT TT TAK REKK+FNLPGQKYDPPE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE

Query:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI
        EREPLRIFYESLSKQIP SEMAEFWMMEHGMLSPEKAK+AYEKKLRRQKEQRTGTPIK  S K LSRPESSQ+ Q  SKNGD+KAKKKIMNDS DDDDFI
Subjt:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI

Query:  LSPKRRKM
        LSPKRRKM
Subjt:  LSPKRRKM

XP_022132016.1 uncharacterized protein LOC111004987 isoform X1 [Momordica charantia] (e value: 2.2e-81; % identity: 86.83)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE
        MVAETKP V+K+KPG QDGGSRLKLE+SDNK+KIESSTK+SIGSK KS SV+ KSEVKSK  SSSSKTT+K TTT TTT K REKKV+NL GQKYDPPEE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE

Query:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP
        REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTP+KSPKL S+PESSQKQQ SSKNGDLKAKKKI ND S+DDDFILSP
Subjt:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP

Query:  KRRKM
        KRRKM
Subjt:  KRRKM

XP_022132017.1 uncharacterized protein LOC111004987 isoform X2 [Momordica charantia] (e value: 6.0e-79; % identity: 85.85)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE
        MVAETKP V+K+KPG QDGGSRLKLE+SDNK+KIESSTK+SIGSK KS SV+ KSE  SK  SSSSKTT+K TTT TTT K REKKV+NL GQKYDPPEE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE

Query:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP
        REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTP+KSPKL S+PESSQKQQ SSKNGDLKAKKKI ND S+DDDFILSP
Subjt:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP

Query:  KRRKM
        KRRKM
Subjt:  KRRKM

XP_038885128.1 uncharacterized protein LOC120075630 isoform X1 [Benincasa hispida] (e value: 3.9e-78; % identity: 84.88)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITTTAKAREKKVFNLPGQKYDPPEE
        MV+++KP ++K+KP   DGGSRLKLE+SD+KKKI+SS KNSI  SK KS+S+VTKSEVKSKTISSSSKTTTKTTT TTTAK REKKVFNLPGQKYDPPEE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITTTAKAREKKVFNLPGQKYDPPEE

Query:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFILSP
        REPLRIFYESLSKQIP SEMAEFWMMEHGMLSPEKAK+AY+KKLRRQKEQRTGTPIKS K  SRPESSQK QQ SKNGD+KAKKKIMNDS DDDDFILSP
Subjt:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFILSP

Query:  KRRKM
        KRRKM
Subjt:  KRRKM

TrEMBL top hits: e value, % identity, alignment
A0A0A0LLY4 Uncharacterized protein (e value: 8.3e-74; % identity: 82.69)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE
        MV++ K  ++K+KP   DGGSRLKLE+SD+KKKI+SS KNSI  SK KSVS+VTK+EVKSKTISSSSKTTTKTTT TT TAK REKKVFNLPGQKYDPPE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE

Query:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI
        EREPLRIFYESLSKQIP SEMAEFWMMEHGMLSPEKAK+AYEKKLRRQKEQRTGTPIK  S K LSRPESSQ+ Q  SKNGD+KAKKKI+NDS DDDDFI
Subjt:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI

Query:  LSPKRRKM
        LSPKRRKM
Subjt:  LSPKRRKM

A0A1S3BCG5 uncharacterized protein LOC103488185 isoform X1 (e value: 1.7e-74; % identity: 83.17)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE
        MV++ K  ++K+KP   DGGSRLKLE+SD+KKKI+SS KNSI  SK KSVS+VTKSEVKSKTISSSSKTTTKTTT TT TAK REKK+FNLPGQKYDPPE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSI-GSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITT-TAKAREKKVFNLPGQKYDPPE

Query:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI
        EREPLRIFYESLSKQIP SEMAEFWMMEHGMLSPEKAK+AYEKKLRRQKEQRTGTPIK  S K LSRPESSQ+ Q  SKNGD+KAKKKIMNDS DDDDFI
Subjt:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIK--SPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDS-DDDDFI

Query:  LSPKRRKM
        LSPKRRKM
Subjt:  LSPKRRKM

A0A6J1BR23 uncharacterized protein LOC111004987 isoform X1 (e value: 1.1e-81; % identity: 86.83)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE
        MVAETKP V+K+KPG QDGGSRLKLE+SDNK+KIESSTK+SIGSK KS SV+ KSEVKSK  SSSSKTT+K TTT TTT K REKKV+NL GQKYDPPEE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE

Query:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP
        REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTP+KSPKL S+PESSQKQQ SSKNGDLKAKKKI ND S+DDDFILSP
Subjt:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP

Query:  KRRKM
        KRRKM
Subjt:  KRRKM

A0A6J1BSP0 uncharacterized protein LOC111004987 isoform X2 (e value: 2.9e-79; % identity: 85.85)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE
        MVAETKP V+K+KPG QDGGSRLKLE+SDNK+KIESSTK+SIGSK KS SV+ KSE  SK  SSSSKTT+K TTT TTT K REKKV+NL GQKYDPPEE
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTK-TTTITTTAKAREKKVFNLPGQKYDPPEE

Query:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP
        REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTP+KSPKL S+PESSQKQQ SSKNGDLKAKKKI ND S+DDDFILSP
Subjt:  REPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMND-SDDDDFILSP

Query:  KRRKM
        KRRKM
Subjt:  KRRKM

A0A6J1GJC9 uncharacterized protein LOC111454380 isoform X1 (e value: 1.1e-70; % identity: 80.49)
Query:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITTTAKAREKKVFNLPGQKYDPPEER
        MV++ KP V+KVK G+QDGGSRLKLE+SDNKK   S +     SK KSVSV+ KSEVKSKTI+SSSKTTTK T  T+TAK REKKVFNL GQKYDPPEER
Subjt:  MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITTTAKAREKKVFNLPGQKYDPPEER

Query:  EPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDSDDD--DFILSP
        EPLRIFYESLSKQI TSEMAEFWMMEHGMLSPEKAK+AYEKKLRRQKEQRTGTPIKS K  SRPESSQ+ QQ SKNGDLKAKKKI N+SDDD  DFILSP
Subjt:  EPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDSDDD--DFILSP

Query:  KRRKM
        KRRKM
Subjt:  KRRKM

SwissProt top hits: e value, % identity, alignment
No hits found

Arabidopsis top hits: e value, % identity, alignment
AT1G19990.1 unknown protein (e value: 1.2e-19; % identity: 39.91)
Query:  AKVKP--GYQDGGSRLKLETS----DNKKKIESSTKNSIGSKQKSVSVVTKSEVK---SKTISS--SSKTTTKTTTITTTAKAREKKVFNLPGQKYDPPE
        AK KP  G   G  +LK E +    D+ K I+SS   S     K    + K + K   SK  SS   SK   K        K RE+KV++LPGQK + P+
Subjt:  AKVKP--GYQDGGSRLKLETS----DNKKKIESSTKNSIGSKQKSVSVVTKSEVK---SKTISS--SSKTTTKTTTITTTAKAREKKVFNLPGQKYDPPE

Query:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKS-------------------PKLLSRPESSQKQQQSSKNGDL
        ER+PLRIFYESL KQIPTS+MA+ W+ME G+L  EKAK+  EKKL  QK  +  +P+KS                    K  S   S++K+   SK    
Subjt:  EREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKS-------------------PKLLSRPESSQKQQQSSKNGDL

Query:  KAKKKIMNDSDDDDFILSPKRRK
        K KK   +D  DDDF+ S   +K
Subjt:  KAKKKIMNDSDDDDFILSPKRRK

AT5G11600.1 unknown protein (e value: 2.5e-38; % identity: 49.59)
Query:  MVAETKPAVAKVKP--GYQDGGSRLKLETS-DNKKKIESSTK--NSIGSKQKSVSVVT-KSEVKS-------KTISSSS---------------------
        M  + +P+ AK +P        SR+K++ S  +KKKI +S++   S    + SVS VT KSE K        KTI+++S                     
Subjt:  MVAETKPAVAKVKP--GYQDGGSRLKLETS-DNKKKIESSTK--NSIGSKQKSVSVVT-KSEVKS-------KTISSSS---------------------

Query:  -KTTTKTTTITTTAKAREKKVFNLPGQKYDPPEEREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKS-PKLLSRP
         KTT+ T ++      REKKV++L GQK+DPPEEREPLRIFYESLSKQIP SEMAEFW+MEHGMLSPEKAKRA+EKK R+ K+ R GTP KS P   S+ 
Subjt:  -KTTTKTTTITTTAKAREKKVFNLPGQKYDPPEEREPLRIFYESLSKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKS-PKLLSRP

Query:  ESSQKQQQSSKNG-DLKAKKKIM--NDSDDDDFILSPKRRKM
        ESSQ+   S  NG D + KKK++  +D DDDDFILS KRRK+
Subjt:  ESSQKQQQSSKNG-DLKAKKKIM--NDSDDDDFILSPKRRKM


Sequences

CDS sequence
ATGGTTGCAGAGACCAAACCCGCTGTTGCGAAGGTCAAACCTGGTTATCAAGATGGCGGCTCCAGATTAAAGCTCGAAACTTCTGACAATAAGAAGAAGATTGAGAGCTC
CACCAAGAACTCAATTGGCTCCAAACAGAAATCCGTGTCTGTTGTTACCAAATCCGAGGTAAAGTCAAAGACAATATCAAGTTCTTCAAAAACGACAACAAAAACTACTA
CTATTACTACTACTGCCAAAGCGAGAGAAAAGAAAGTGTTCAATTTGCCCGGTCAGAAATATGATCCACCCGAAGAGAGAGAGCCCCTTCGGATATTTTATGAGTCCTTA
TCGAAACAGATACCAACAAGTGAAATGGCAGAGTTTTGGATGATGGAGCATGGCATGTTATCTCCCGAAAAGGCAAAAAGGGCATACGAGAAGAAACTGAGAAGACAAAA
GGAACAGAGGACGGGGACTCCAATTAAATCACCGAAACTGCTGAGCAGACCAGAGAGTTCACAGAAGCAGCAGCAGTCATCAAAGAATGGTGATCTAAAAGCGAAGAAAA
AGATCATGAACGATAGCGACGACGACGACTTCATTTTAAGCCCCAAGAGAAGGAAAATGTAG
mRNA sequence
ATGGTTGCAGAGACCAAACCCGCTGTTGCGAAGGTCAAACCTGGTTATCAAGATGGCGGCTCCAGATTAAAGCTCGAAACTTCTGACAATAAGAAGAAGATTGAGAGCTC
CACCAAGAACTCAATTGGCTCCAAACAGAAATCCGTGTCTGTTGTTACCAAATCCGAGGTAAAGTCAAAGACAATATCAAGTTCTTCAAAAACGACAACAAAAACTACTA
CTATTACTACTACTGCCAAAGCGAGAGAAAAGAAAGTGTTCAATTTGCCCGGTCAGAAATATGATCCACCCGAAGAGAGAGAGCCCCTTCGGATATTTTATGAGTCCTTA
TCGAAACAGATACCAACAAGTGAAATGGCAGAGTTTTGGATGATGGAGCATGGCATGTTATCTCCCGAAAAGGCAAAAAGGGCATACGAGAAGAAACTGAGAAGACAAAA
GGAACAGAGGACGGGGACTCCAATTAAATCACCGAAACTGCTGAGCAGACCAGAGAGTTCACAGAAGCAGCAGCAGTCATCAAAGAATGGTGATCTAAAAGCGAAGAAAA
AGATCATGAACGATAGCGACGACGACGACTTCATTTTAAGCCCCAAGAGAAGGAAAATGTAG
Protein sequence
MVAETKPAVAKVKPGYQDGGSRLKLETSDNKKKIESSTKNSIGSKQKSVSVVTKSEVKSKTISSSSKTTTKTTTITTTAKAREKKVFNLPGQKYDPPEEREPLRIFYESL
SKQIPTSEMAEFWMMEHGMLSPEKAKRAYEKKLRRQKEQRTGTPIKSPKLLSRPESSQKQQQSSKNGDLKAKKKIMNDSDDDDFILSPKRRKM