CuGenDBv2

Sgr029314 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029314
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Small and basic intrinsic protein 2
Genome location: tig00153293:1133670..1135789
RNA-Seq Expression: Sgr029314
Synteny: Sgr029314
Gene Ontology terms: NA
InterPro domains: NA
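
The genome location field can be split into a contig name and a coordinate pair. The sketch below is a minimal illustration, not part of CuGenDB: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153293:1133670..1135789")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene spans 2120 bp on contig tig00153293.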


Homology
GenBank top hits (e value, % identity, alignment)
KAG9439246.1 hypothetical protein H6P81_019411 [Aristolochia fimbriata] (e value: 4.1e-13; % identity: 34.48)
Query:  VHIDDGRGVGVTLDAAASELAPEI-THRFVTMG--GIKIQK-----VIRRRYAAGKNFSDDGNLVTLERKADGSGGGSESANTVTEIKIQR----RGINH
        V +D G G+GV LDA        I  H  +     G+++++     V+ RR       SD G++V      +G  GG+ +     E++IQR    RG++ 
Subjt:  VHIDDGRGVGVTLDAAASELAPEI-THRFVTMG--GIKIQK-----VIRRRYAAGKNFSDDGNLVTLERKADGSGGGSESANTVTEIKIQR----RGINH

Query:  QS------LLPARGLNRATLTGALIVVILV-----KVKKILGG-----QREIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRP
         +      + P  GL+ A L   L V ++V     +V ++ GG     + EIGLGLGA+EL+   +VG G LL  VE  +P+ ALLARI DL  V  PRP
Subjt:  QS------LLPARGLNRATLTGALIVVILV-----KVKKILGG-----QREIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRP

Query:  LPHSSKMHFLNVSAVT--LDDNRLLPRRHLSQ
        LPH ++++ L+ S +    D   L PRRH+ +
Subjt:  LPHSSKMHFLNVSAVT--LDDNRLLPRRHLSQ

PON37874.1 hypothetical protein PanWU01x14_316950 [Parasponia andersonii] (e value: 3.7e-30; % identity: 40.57)
Query:  HTSGKNSGAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTLD--AAASELAPEITHRFVTM------------GGIKIQKVIRRRYAAGKNFSDDGNLV
        H   KNS  F   +V+I+ E S+R +++ V IDDG  V V LD  A A     EI +R   +             G  ++  I      G++  D  +  
Subjt:  HTSGKNSGAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTLD--AAASELAPEITHRFVTM------------GGIKIQKVIRRRYAAGKNFSDDGNLV

Query:  TLERKADGSGGGSESANTVTEIKIQ-RRGINHQSLLPARGLNRATLTGALIVVILVKVKKI-------LGGQREIGLGLGAAELSSGSIVGSGRLLRRVE
         L  K +  GG    A +V EIKIQ RR + HQS L ARGL+R  LT ALI  + V +  +       LG + E+GLGLGAAE+S GS+VG+G     VE
Subjt:  TLERKADGSGGGSESANTVTEIKIQ-RRGINHQSLLPARGLNRATLTGALIVVILVKVKKI-------LGGQREIGLGLGAAELSSGSIVGSGRLLRRVE

Query:  GSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDN----RLLPRRHL---SQNPD------QIRSNQIKSKKR
        G EP+PALLA + DL GV  PRPLPHS +MH L+V AV  +      RL+ RRH     +NPD      +I++N+ K KKR
Subjt:  GSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDN----RLLPRRHL---SQNPD------QIRSNQIKSKKR

RWW50265.1 hypothetical protein BHE74_00043572 [Ensete ventricosum] (e value: 1.7e-11; % identity: 33.47)
Query:  WRGKVQIDSE----ASSRLEVLAVHIDDGRGVGVTLDAAA---------------SELAPEITHRFVTMGGIKIQKV--IRRRYAAGKNFSDDGNLVTLE
        WRGKV+++ E       R  V  V I DG GV V ++AA                 E+      RF      ++++   + RR+  G     DG  V LE
Subjt:  WRGKVQIDSE----ASSRLEVLAVHIDDGRGVGVTLDAAA---------------SELAPEITHRFVTMGGIKIQKV--IRRRYAAGKNFSDDGNLVTLE

Query:  RKADGSGGGSESANTVTEIKIQR---RGINHQ-SLLPARGLNRATL-TGALIVVILVKVKKILGGQREIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNP
         K        + A TV  I+++R   R +  +    P  GL+RA L +GA  V ++++         E+GLGLG AEL+ G +V   RLL  VEG EP+P
Subjt:  RKADGSGGGSESANTVTEIKIQR---RGINHQ-SLLPARGLNRATL-TGALIVVILVKVKKILGGQREIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNP

Query:  ALLARISDLSGVSPPRPLPHSSKMHFL-NVSAVTLDDNRLLPRRH
        ALL R+  L  V PP PLP+  ++  L   S+ +  D +LL  RH
Subjt:  ALLARISDLSGVSPPRPLPHSSKMHFL-NVSAVTLDDNRLLPRRH

RWW50829.1 hypothetical protein BHE74_00042874 [Ensete ventricosum] (e value: 2.9e-11; % identity: 40)
Query:  DGNLVTLERKADGSGGGSESANTVTEIKIQ--RRGINHQSLLPARGLNRATLTGALIVV-----------ILVKVKKILGGQR--EIGLGLGAAELSSGS
        DG +V LE KA        +A TV E++I+  R   + +   P R L    L GA +               V    +  G R  E+GLGLG+AEL+ G 
Subjt:  DGNLVTLERKADGSGGGSESANTVTEIKIQ--RRGINHQSLLPARGLNRATLTGALIVV-----------ILVKVKKILGGQR--EIGLGLGAAELSSGS

Query:  IVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDNRLLPRRH--------LSQNP
        +VG   LLRRVE  EP+ ALLA +++L  V+PPRPLPHSS+M  L  S  T    RL  RRH        LS NP
Subjt:  IVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDNRLLPRRH--------LSQNP

RZR89617.1 hypothetical protein BHM03_00017370 [Ensete ventricosum] (e value: 1.1e-10; % identity: 33.06)
Query:  WRGKVQIDSE----ASSRLEVLAVHIDDGRGVGVTLDAAA---------------SELAPEITHRFVTMGGIKIQKV--IRRRYAAGKNFSDDGNLVTLE
        WRGKV+++ E       R  V  V I DG GV   ++AA                 E+      RF      ++++   + RR+  G     DG  V LE
Subjt:  WRGKVQIDSE----ASSRLEVLAVHIDDGRGVGVTLDAAA---------------SELAPEITHRFVTMGGIKIQKV--IRRRYAAGKNFSDDGNLVTLE

Query:  RKADGSGGGSESANTVTEIKIQR---RGINHQ-SLLPARGLNRATL-TGALIVVILVKVKKILGGQREIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNP
         K        + A TV  I+++R   R +  +    P  GL+RA L +GA  V ++++         E+GLGLG AEL+ G +V   RLL  VEG EP+P
Subjt:  RKADGSGGGSESANTVTEIKIQR---RGINHQ-SLLPARGLNRATL-TGALIVVILVKVKKILGGQREIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNP

Query:  ALLARISDLSGVSPPRPLPHSSKMHFL-NVSAVTLDDNRLLPRRH
        ALL R+  L  V PP PLP+  ++  L   S+ +  D +LL  RH
Subjt:  ALLARISDLSGVSPPRPLPHSSKMHFL-NVSAVTLDDNRLLPRRH

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AMW1 Uncharacterized protein (e value: 1.8e-30; % identity: 40.57)
Query:  HTSGKNSGAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTLD--AAASELAPEITHRFVTM------------GGIKIQKVIRRRYAAGKNFSDDGNLV
        H   KNS  F   +V+I+ E S+R +++ V IDDG  V V LD  A A     EI +R   +             G  ++  I      G++  D  +  
Subjt:  HTSGKNSGAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTLD--AAASELAPEITHRFVTM------------GGIKIQKVIRRRYAAGKNFSDDGNLV

Query:  TLERKADGSGGGSESANTVTEIKIQ-RRGINHQSLLPARGLNRATLTGALIVVILVKVKKI-------LGGQREIGLGLGAAELSSGSIVGSGRLLRRVE
         L  K +  GG    A +V EIKIQ RR + HQS L ARGL+R  LT ALI  + V +  +       LG + E+GLGLGAAE+S GS+VG+G     VE
Subjt:  TLERKADGSGGGSESANTVTEIKIQ-RRGINHQSLLPARGLNRATLTGALIVVILVKVKKI-------LGGQREIGLGLGAAELSSGSIVGSGRLLRRVE

Query:  GSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDN----RLLPRRHL---SQNPD------QIRSNQIKSKKR
        G EP+PALLA + DL GV  PRPLPHS +MH L+V AV  +      RL+ RRH     +NPD      +I++N+ K KKR
Subjt:  GSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDN----RLLPRRHL---SQNPD------QIRSNQIKSKKR

A0A427A686 Uncharacterized protein (e value: 1.2e-10; % identity: 29.52)
Query:  LPVCQFAHLPSQLYLKLYMGKAKS-------SALDIDDRRPAPPPPPPRKIKNKKKERENRQPRIPTPPSVVVPISTAISRRVRKTPLKLSHSLPSAAAT
        LP C  +H  +   L+   G+  S       SA D    R A P  PP    +  K       R P+PP          SR     P  L HS+PS A  
Subjt:  LPVCQFAHLPSQLYLKLYMGKAKS-------SALDIDDRRPAPPPPPPRKIKNKKKERENRQPRIPTPPSVVVPISTAISRRVRKTPLKLSHSLPSAAAT

Query:  HYVKRNNCATAAGAFLSIHVSFVFSPARTTDRSVFGFKDTIP--HTSGKNS------GAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTL-DAAASEL
            R   + A     +IH            ++   +  + P    SG  S      G     + ++  +   R  V+ V ID G  VGV L D     L
Subjt:  HYVKRNNCATAAGAFLSIHVSFVFSPARTTDRSVFGFKDTIP--HTSGKNS------GAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTL-DAAASEL

Query:  APEITHR------FVTMGGIKIQKVIRRRYAAGKNFSD-------DGNLVTLERKADGSGGGSESANTVTEIKIQ--RRGINHQSLLPARGLNRATLTGA
             HR       V  G  ++    R    +G    +       DG +V LE KA        +A TV E++I+  R   + +   P R L    L GA
Subjt:  APEITHR------FVTMGGIKIQKVIRRRYAAGKNFSD-------DGNLVTLERKADGSGGGSESANTVTEIKIQ--RRGINHQSLLPARGLNRATLTGA

Query:  LIVV-----------ILVKVKKILGGQR--EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVS
         +               V    +  G R  E+GLGLG+AEL+ G +VG   LLRRVE  EP+ ALLA +++L  V+PPRPLPHSS+M  L  S
Subjt:  LIVV-----------ILVKVKKILGGQR--EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVS

A0A444C4E4 Uncharacterized protein (e value: 1.7e-09; % identity: 29.35)
Query:  LPVCQFAHLPSQLYLKLYMGKAKS-------SALDIDDRRPAPPPPPPRKIKNKKKERENRQPRIPTPPSVVVPISTAISRRVRKTPLKLSHSLPSAAAT
        LP C   H  +   L+   G+  S       SA D    R A P  PP    +  K       R P+PP          SR     P  L HS+PS    
Subjt:  LPVCQFAHLPSQLYLKLYMGKAKS-------SALDIDDRRPAPPPPPPRKIKNKKKERENRQPRIPTPPSVVVPISTAISRRVRKTPLKLSHSLPSAAAT

Query:  HYVKRNNCATAAGAFLSIHVSF--VFSPARTTDRSVFGFKDTIPHTSGKNSGAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTLDAAASELAPEITHR
         + + ++             +   V    +   R  +G +  +    G + G   R  V +  +   RLE    H    R V V            + HR
Subjt:  HYVKRNNCATAAGAFLSIHVSF--VFSPARTTDRSVFGFKDTIPHTSGKNSGAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTLDAAASELAPEITHR

Query:  FVTMGGIKIQKV--IRRRYAAGKNFSDDGNLVTLERKADGSGGGSESANTVTEIKIQ--RRGINHQSLLPARGLNRATLTGALIVV-----------ILV
             G ++++   +RRR         DG +V LE KA        +A TV E++I+  R   + +   P R L    L GA +               V
Subjt:  FVTMGGIKIQKV--IRRRYAAGKNFSDDGNLVTLERKADGSGGGSESANTVTEIKIQ--RRGINHQSLLPARGLNRATLTGALIVV-----------ILV

Query:  KVKKILGGQR--EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDNRLLPRRH--------LSQ
            +  G R  E+GLGLG+AEL+ G +VG   LLRRVE  EP+ ALLA +++L  V+PPRPLPHSS+M  L  S  T    RL  RRH        LS 
Subjt:  KVKKILGGQR--EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDNRLLPRRH--------LSQ

Query:  NP
        NP
Subjt:  NP

A0A445BBK6 Uncharacterized protein (e value: 1.2e-10; % identity: 45.22)
Query:  LPARGLNRATLTGALIVVIL---VKVKKILGGQR-EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVS--
        +PA   + A L GA +VV+    V +  ++G ++ E+GL L A ELSSG +V +  +L  VEG+EPN ALLA ISDLS V  P PLP++ KM+  N    
Subjt:  LPARGLNRATLTGALIVVIL---VKVKKILGGQR-EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVS--

Query:  -AVTLDDNRLLPRRH
         AV L+ +RL+P  H
Subjt:  -AVTLDDNRLLPRRH

A0A445BBL8 Uncharacterized protein (e value: 1.2e-10; % identity: 45.22)
Query:  LPARGLNRATLTGALIVVIL---VKVKKILGGQR-EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVS--
        +PA   + A L GA +VV+    V +  ++G ++ E+GL L A ELSSG +V +  +L  VEG+EPN ALLA ISDLS V  P PLP++ KM+  N    
Subjt:  LPARGLNRATLTGALIVVIL---VKVKKILGGQR-EIGLGLGAAELSSGSIVGSGRLLRRVEGSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVS--

Query:  -AVTLDDNRLLPRRH
         AV L+ +RL+P  H
Subjt:  -AVTLDDNRLLPRRH

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGATTGGGCATGGGGGGGATGTCCACGGTACACGTACACCACATCTGTTTGCCAGTTTGCCAGTTTGCCCACTTACCCTCCCAATTATATTTAAAGCTATATATGGG
AAAAGCGAAATCTTCCGCCTTAGATATCGACGACCGTAGACCGGCCCCACCCCCACCTCCACCCCGCAAAATAAAAAATAAAAAAAAAGAGAGAGAAAATCGGCAGCCGC
GCATCCCCACCCCCCCTTCCGTTGTGGTCCCCATCTCGACCGCCATTTCGCGTCGTGTGCGTAAAACGCCTTTAAAGCTTTCACACTCGCTGCCCTCCGCGGCCGCCACG
CACTATGTGAAACGCAACAACTGTGCCACAGCCGCCGGCGCATTTCTTTCAATACACGTGTCCTTTGTGTTTTCTCCAGCTCGAACAACTGATCGATCAGTCTTCGGATT
TAAGGACACGATTCCACATACGTCGGGAAAAAATTCAGGCGCTTTCTGGAGGGGGAAAGTTCAGATCGATTCCGAGGCCTCCTCGAGGCTTGAGGTCCTGGCCGTTCATA
TCGACGACGGACGAGGAGTCGGAGTCACTCTGGACGCCGCCGCTTCCGAACTCGCGCCGGAAATCACTCACCGCTTCGTGACGATGGGCGGCATCAAAATACAGAAGGTG
ATTCGACGTCGGTATGCCGCGGGCAAAAACTTCTCCGATGACGGGAATTTGGTAACGCTGGAACGGAAAGCTGATGGCTCCGGCGGCGGCAGCGAGTCCGCCAATACCGT
GACCGAGATTAAGATTCAAAGGCGAGGAATCAACCATCAGAGCTTGCTCCCGGCTAGAGGACTCAACCGTGCTACTCTGACTGGGGCTCTGATTGTTGTTATTCTTGTTA
AGGTGAAGAAGATCCTCGGAGGGCAAAGGGAAATTGGTCTTGGCCTTGGCGCCGCGGAACTCTCTAGCGGCAGTATCGTAGGCTCGGGCCGCCTCCTCCGCCGTGTCGAA
GGTTCCGAGCCAAACCCGGCTCTTCTTGCACGGATCTCTGATCTCAGCGGCGTATCTCCCCCAAGGCCTCTTCCTCACTCCTCTAAAATGCACTTCCTTAACGTTTCCGC
CGTTACCCTTGACGACAACCGCCTTCTCCCTCGGCGCCATTTATCCCAAAATCCAGATCAGATCAGATCAAATCAAATCAAATCAAAGAAACGAAAAGAAGCTCCGTTTG
ATTCTCGAGAAAATAATTCCTCTTTTCCTCCTTTCGGGTGA
mRNA sequence
ATGGGATTGGGCATGGGGGGGATGTCCACGGTACACGTACACCACATCTGTTTGCCAGTTTGCCAGTTTGCCCACTTACCCTCCCAATTATATTTAAAGCTATATATGGG
AAAAGCGAAATCTTCCGCCTTAGATATCGACGACCGTAGACCGGCCCCACCCCCACCTCCACCCCGCAAAATAAAAAATAAAAAAAAAGAGAGAGAAAATCGGCAGCCGC
GCATCCCCACCCCCCCTTCCGTTGTGGTCCCCATCTCGACCGCCATTTCGCGTCGTGTGCGTAAAACGCCTTTAAAGCTTTCACACTCGCTGCCCTCCGCGGCCGCCACG
CACTATGTGAAACGCAACAACTGTGCCACAGCCGCCGGCGCATTTCTTTCAATACACGTGTCCTTTGTGTTTTCTCCAGCTCGAACAACTGATCGATCAGTCTTCGGATT
TAAGGACACGATTCCACATACGTCGGGAAAAAATTCAGGCGCTTTCTGGAGGGGGAAAGTTCAGATCGATTCCGAGGCCTCCTCGAGGCTTGAGGTCCTGGCCGTTCATA
TCGACGACGGACGAGGAGTCGGAGTCACTCTGGACGCCGCCGCTTCCGAACTCGCGCCGGAAATCACTCACCGCTTCGTGACGATGGGCGGCATCAAAATACAGAAGGTG
ATTCGACGTCGGTATGCCGCGGGCAAAAACTTCTCCGATGACGGGAATTTGGTAACGCTGGAACGGAAAGCTGATGGCTCCGGCGGCGGCAGCGAGTCCGCCAATACCGT
GACCGAGATTAAGATTCAAAGGCGAGGAATCAACCATCAGAGCTTGCTCCCGGCTAGAGGACTCAACCGTGCTACTCTGACTGGGGCTCTGATTGTTGTTATTCTTGTTA
AGGTGAAGAAGATCCTCGGAGGGCAAAGGGAAATTGGTCTTGGCCTTGGCGCCGCGGAACTCTCTAGCGGCAGTATCGTAGGCTCGGGCCGCCTCCTCCGCCGTGTCGAA
GGTTCCGAGCCAAACCCGGCTCTTCTTGCACGGATCTCTGATCTCAGCGGCGTATCTCCCCCAAGGCCTCTTCCTCACTCCTCTAAAATGCACTTCCTTAACGTTTCCGC
CGTTACCCTTGACGACAACCGCCTTCTCCCTCGGCGCCATTTATCCCAAAATCCAGATCAGATCAGATCAAATCAAATCAAATCAAAGAAACGAAAAGAAGCTCCGTTTG
ATTCTCGAGAAAATAATTCCTCTTTTCCTCCTTTCGGGTGA
Protein sequence
MGLGMGGMSTVHVHHICLPVCQFAHLPSQLYLKLYMGKAKSSALDIDDRRPAPPPPPPRKIKNKKKERENRQPRIPTPPSVVVPISTAISRRVRKTPLKLSHSLPSAAAT
HYVKRNNCATAAGAFLSIHVSFVFSPARTTDRSVFGFKDTIPHTSGKNSGAFWRGKVQIDSEASSRLEVLAVHIDDGRGVGVTLDAAASELAPEITHRFVTMGGIKIQKV
IRRRYAAGKNFSDDGNLVTLERKADGSGGGSESANTVTEIKIQRRGINHQSLLPARGLNRATLTGALIVVILVKVKKILGGQREIGLGLGAAELSSGSIVGSGRLLRRVE
GSEPNPALLARISDLSGVSPPRPLPHSSKMHFLNVSAVTLDDNRLLPRRHLSQNPDQIRSNQIKSKKRKEAPFDSRENNSSFPPFG
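
The protein sequence appears to be the CDS translated with the standard genetic code. The sketch below illustrates that relationship on the first 30 nucleotides of the CDS only. The function name `translate` and the compact codon-table construction are illustrative choices, not anything CuGenDB provides.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS sequence listed in this record.
cds_prefix = "ATGGGATTGGGCATGGGGGGGATGTCCACG"
print(translate(cds_prefix))  # MGLGMGGMST
```

The output matches the first ten residues of the protein sequence. Running the same function over the full CDS is a quick consistency check for the record.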