; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019441 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019441
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCotton fiber protein
Genome locationtig00153347:632156..632785
RNA-Seq ExpressionSgr019441
SyntenySgr019441
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8653186.1 hypothetical protein Csa_020019 [Cucumis sativus]2.1e-1846.93Show/hide
Query:  MSSNSNS-PPNLLVADS---FLPYE-PKRHPY-KCAN---VAFFSSLAFLLSIFFC-ISIF-NLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-R
        MS NS S P NL+V ++   FLPY+ PK  PY K  N   +AFFSSL FLLSIF   ISIF NLS SSLFNS+ FWF ISNTLI IIA DYG+FS S+ +
Subjt:  MSSNSNS-PPNLLVADS---FLPYE-PKRHPY-KCAN---VAFFSSLAFLLSIFFC-ISIF-NLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-R

Query:  QLDFYEDYISKPKQTQDQSSSLHLDEQQPKKNLTPKQ---ERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNE-
            YEDY   P                P   LT  Q     LVV D++RE P+EKL+ +V+ +    + +   +      RTY R KSEK KR VS E 
Subjt:  QLDFYEDYISKPKQTQDQSSSLHLDEQQPKKNLTPKQ---ERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNE-

Query:  -SKIMA-RRSESVKYK-KQREKKMTFQR
          K+M  RRSESVK + K+ E +  F +
Subjt:  -SKIMA-RRSESVKYK-KQREKKMTFQR

XP_022927149.1 uncharacterized protein LOC111434084 [Cucurbita moschata]3.6e-1845Show/hide
Query:  MSSNSNSPPNLLVADSFLPYEPKRHPYKCAN------VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY
        MSS+SN    + V + F+PY+PK  P  C        +AFFSSL  LLS  FCI   NLS SSLFNS+ FWF ISN LI IIAADY  FS S+ ++   Y
Subjt:  MSSNSNSPPNLLVADSFLPYEPKRHPYKCAN------VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY

Query:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA
        E Y    PK T  Q Q+SS                   VV D++RE P+  LQ  V  R D     K         RTYRRSKSEK KRSVS ES K MA
Subjt:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA

Query:  RRSESVKYKKQREKKMTFQR
        +RSES KY+++  ++  + +
Subjt:  RRSESVKYKKQREKKMTFQR

XP_023001467.1 uncharacterized protein LOC111495593 [Cucurbita maxima]7.2e-1947.85Show/hide
Query:  MSSNSNSPPNLLVADSFLPYEPKRHP---YKCAN---VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY
        MSS+SN    + + + F+PY+PK  P    K  N   +AFFSSL FLLS  FCI   NLS SSLFNS+ FWF ISNTLI IIAADY  FS S+ ++  FY
Subjt:  MSSNSNSPPNLLVADSFLPYEPKRHP---YKCAN---VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY

Query:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA
        E Y    PK T  Q Q+SS                   VV +++RE P+  LQ IV+ R D     K         RTY+RSKSEK KR+VS ES K MA
Subjt:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA

Query:  RRSESVKYK
        +RSES KY+
Subjt:  RRSESVKYK

XP_030973139.1 uncharacterized protein LOC115993012 [Quercus lobata]3.0e-1738.76Show/hide
Query:  SNSPPNLLVADSFLPY-EPKRHPYKCANVAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQT
        + +PPN LV DS  PY   K +P K   ++F++SL  +      + IFNLS S+LF ++ FWF +SNTLILIIA DYG +S S  + D Y++Y+   K+T
Subjt:  SNSPPNLLVADSFLPY-EPKRHPYKCANVAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQT

Query:  QDQSSSLHLDE-QQPKKNLTPKQ-------ERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEM--KIQTRTYRRSKSEKAKRSVSNESKIMARRSE
        Q ++    + + Q+  K  TPKQ       +R V+V + +  PE  LQ++++   +     K  E++  KI+ +TYRRSKSE+AKR V +ESK +  RS 
Subjt:  QDQSSSLHLDE-QQPKKNLTPKQ-------ERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEM--KIQTRTYRRSKSEKAKRSVSNESKIMARRSE

Query:  SVKYKKQRE
          +  ++ E
Subjt:  SVKYKKQRE

XP_038877172.1 uncharacterized protein LOC120069472 [Benincasa hispida]2.7e-2148.42Show/hide
Query:  MSSNSNSPPNLLVAD-SFLPY-EPKRHP--YKCAN---VAFFSSLAFLLSIFFC-ISIF-NLSLSSLFNSSNFWFCISNTLILIIAADYGVFS-PSKRQL
        MSSNS SP NLLV D SFLPY +PK      K AN   +AFFSSL FLLSIF   ISIF NLSL SLFNS+ FWF ISNTLI IIA DYGVFS P  +  
Subjt:  MSSNSNSPPNLLVAD-SFLPY-EPKRHP--YKCAN---VAFFSSLAFLLSIFFC-ISIF-NLSLSSLFNSSNFWFCISNTLILIIAADYGVFS-PSKRQL

Query:  DFYEDYI-SKPKQTQDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIM
          YED+  S PK    +  S                  LVV D+++E   EKL+++V+GR+                RT RR KSEK K  VS ES K+M
Subjt:  DFYEDYI-SKPKQTQDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIM

Query:  ARRSESVKYKKQREKKMTFQR
         +RSESVKY+ +  ++  F++
Subjt:  ARRSESVKYKKQREKKMTFQR

TrEMBL top hitse value%identityAlignment
A0A2P5EQV2 Cotton fiber protein3.6e-1638.64Show/hide
Query:  YEPKRHPYKCANVAFFSSLAFLLSIFFCISI---FNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFYEDYISKPKQTQDQSSSLHLD--
        Y   ++ YK   ++FF   AF+ S+F  IS+   FNLS S+LFN++ FWF ISNTLILIIAADYG FS S   + D Y++Y++  +  + +SSSL  +  
Subjt:  YEPKRHPYKCANVAFFSSLAFLLSIFFCISI---FNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFYEDYISKPKQTQDQSSSLHLD--

Query:  ---------------EQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEG--------------REDIFQRN-KRCEEMKIQ---TRTYRRSKSEKAKRS
                        Q+    +  K+E  VV D+E E PE+KLQI+ +                EDI ++    CEE K      +TYRRSKSEK KR 
Subjt:  ---------------EQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEG--------------REDIFQRN-KRCEEMKIQ---TRTYRRSKSEKAKRS

Query:  VSNES------KIMARRSES
        V +ES      +++ RRSE+
Subjt:  VSNES------KIMARRSES

A0A6J1EH73 uncharacterized protein LOC1114340841.7e-1845Show/hide
Query:  MSSNSNSPPNLLVADSFLPYEPKRHPYKCAN------VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY
        MSS+SN    + V + F+PY+PK  P  C        +AFFSSL  LLS  FCI   NLS SSLFNS+ FWF ISN LI IIAADY  FS S+ ++   Y
Subjt:  MSSNSNSPPNLLVADSFLPYEPKRHPYKCAN------VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY

Query:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA
        E Y    PK T  Q Q+SS                   VV D++RE P+  LQ  V  R D     K         RTYRRSKSEK KRSVS ES K MA
Subjt:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA

Query:  RRSESVKYKKQREKKMTFQR
        +RSES KY+++  ++  + +
Subjt:  RRSESVKYKKQREKKMTFQR

A0A6J1KGM0 uncharacterized protein LOC1114955933.5e-1947.85Show/hide
Query:  MSSNSNSPPNLLVADSFLPYEPKRHP---YKCAN---VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY
        MSS+SN    + + + F+PY+PK  P    K  N   +AFFSSL FLLS  FCI   NLS SSLFNS+ FWF ISNTLI IIAADY  FS S+ ++  FY
Subjt:  MSSNSNSPPNLLVADSFLPYEPKRHP---YKCAN---VAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSK-RQLDFY

Query:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA
        E Y    PK T  Q Q+SS                   VV +++RE P+  LQ IV+ R D     K         RTY+RSKSEK KR+VS ES K MA
Subjt:  EDYI-SKPKQT--QDQSSSLHLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNES-KIMA

Query:  RRSESVKYK
        +RSES KY+
Subjt:  RRSESVKYK

A0A6P3ZKV5 uncharacterized protein LOC1074125091.1e-1538.36Show/hide
Query:  PYEPKRHPYKCANVAFFSSLAFLLSIFFCIS---IFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQTQDQSSSLHLDEQ
        PY+   +P K   ++FF   AF+ SIF  IS   IFNLS S++FN++ FWF ISNTLILIIA DYG FS SK + D Y++Y+   + +Q +S++ H    
Subjt:  PYEPKRHPYKCANVAFFSSLAFLLSIFFCIS---IFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQTQDQSSSLHLDEQ

Query:  QP-------------KKNLTPKQERLV-------VVDKERENPEEKLQIIVEGR--------------EDIFQR----------NKRCEEMKIQT---RT
         P             K N+  KQE  V       +   + +NP+ KLQI+ +                EDI ++           K C E   +T   +T
Subjt:  QP-------------KKNLTPKQERLV-------VVDKERENPEEKLQIIVEGR--------------EDIFQR----------NKRCEEMKIQT---RT

Query:  YRRSKSEKAKRSVSNESK-IMARRSESVKYKK
        YRRSKSEKAKR V +E K I  RRSE+ K K+
Subjt:  YRRSKSEKAKRSVSNESK-IMARRSESVKYKK

A0A7N2LQF9 Uncharacterized protein1.5e-1738.76Show/hide
Query:  SNSPPNLLVADSFLPY-EPKRHPYKCANVAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQT
        + +PPN LV DS  PY   K +P K   ++F++SL  +      + IFNLS S+LF ++ FWF +SNTLILIIA DYG +S S  + D Y++Y+   K+T
Subjt:  SNSPPNLLVADSFLPY-EPKRHPYKCANVAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQT

Query:  QDQSSSLHLDE-QQPKKNLTPKQ-------ERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEM--KIQTRTYRRSKSEKAKRSVSNESKIMARRSE
        Q ++    + + Q+  K  TPKQ       +R V+V + +  PE  LQ++++   +     K  E++  KI+ +TYRRSKSE+AKR V +ESK +  RS 
Subjt:  QDQSSSLHLDE-QQPKKNLTPKQ-------ERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEM--KIQTRTYRRSKSEKAKRSVSNESKIMARRSE

Query:  SVKYKKQRE
          +  ++ E
Subjt:  SVKYKKQRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30190.1 unknown protein2.2e-0536.64Show/hide
Query:  YEPKRHPYKCANVAFFSSLAFLLSIFFCISIFN---LSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQTQDQSSSLHLDEQQ
        +EPK+  Y  +     S L   L IF  I IF+   +SLSS+F  +   F ISNTLILIIAADYG FS  + Q DFY +Y       ++++ +       
Subjt:  YEPKRHPYKCANVAFFSSLAFLLSIFFCISIFN---LSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQTQDQSSSLHLDEQQ

Query:  PKKNLTPKQERLVVVDKERENPEEKLQIIVE
          +  T   E     D E  NPEE+ + +V+
Subjt:  PKKNLTPKQERLVVVDKERENPEEKLQIIVE

AT2G34610.1 unknown protein6.3e-0529.33Show/hide
Query:  PPNLLVADSFLPYEPKRHPYKCANVAFFSSLAFLLSIFFCISIF---NLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDY--------
        P  ++    F PY+PK+  YK     + S L  +LSIF  I IF   ++S  S+FN +   F ISN LI+IIAADYG F+  K  LDFY +Y        
Subjt:  PPNLLVADSFLPYEPKRHPYKCANVAFFSSLAFLLSIFFCISIF---NLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDY--------

Query:  ------------ISKPKQTQDQS------SSLHLDEQ---QPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIF-QRNKRCEEMKI-QTRTYRRSK
                    +S  ++T+++        +  L +Q     K  +T +  + V  ++ R    EK + + E    I   + + C    +  ++ Y RSK
Subjt:  ------------ISKPKQTQDQS------SSLHLDEQ---QPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIF-QRNKRCEEMKI-QTRTYRRSK

Query:  SEKAKRSVSNESKIMARRSESVKYK
        S+KA+ SV ++     RR + +K++
Subjt:  SEKAKRSVSNESKIMARRSESVKYK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCTAATTCAAATTCACCTCCAAATCTACTGGTTGCAGATTCATTTCTGCCCTACGAACCCAAACGACACCCTTATAAATGTGCAAACGTAGCCTTCTTCTCTTC
TCTCGCCTTCCTCCTCTCCATTTTCTTCTGCATTTCCATCTTCAACCTCTCTCTTTCATCTCTCTTCAACAGCAGCAACTTCTGGTTCTGCATTTCCAACACCCTCATCC
TTATCATTGCTGCTGATTATGGCGTTTTCTCTCCCTCCAAACGCCAACTCGATTTTTATGAAGATTACATATCAAAGCCAAAGCAAACTCAGGATCAAAGTTCTTCACTG
CACCTTGATGAACAACAGCCCAAGAAAAATCTCACTCCCAAACAAGAACGACTGGTAGTAGTTGATAAAGAGAGAGAAAACCCAGAAGAGAAATTGCAAATCATCGTTGA
AGGACGTGAAGATATATTCCAACGGAACAAACGCTGCGAAGAGATGAAGATTCAGACGAGAACTTACCGGCGAAGCAAGTCGGAGAAAGCGAAAAGAAGTGTGTCAAATG
AGAGCAAAATCATGGCGAGGAGGTCGGAAAGTGTAAAGTACAAGAAGCAAAGGGAGAAGAAAATGACTTTTCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCTAATTCAAATTCACCTCCAAATCTACTGGTTGCAGATTCATTTCTGCCCTACGAACCCAAACGACACCCTTATAAATGTGCAAACGTAGCCTTCTTCTCTTC
TCTCGCCTTCCTCCTCTCCATTTTCTTCTGCATTTCCATCTTCAACCTCTCTCTTTCATCTCTCTTCAACAGCAGCAACTTCTGGTTCTGCATTTCCAACACCCTCATCC
TTATCATTGCTGCTGATTATGGCGTTTTCTCTCCCTCCAAACGCCAACTCGATTTTTATGAAGATTACATATCAAAGCCAAAGCAAACTCAGGATCAAAGTTCTTCACTG
CACCTTGATGAACAACAGCCCAAGAAAAATCTCACTCCCAAACAAGAACGACTGGTAGTAGTTGATAAAGAGAGAGAAAACCCAGAAGAGAAATTGCAAATCATCGTTGA
AGGACGTGAAGATATATTCCAACGGAACAAACGCTGCGAAGAGATGAAGATTCAGACGAGAACTTACCGGCGAAGCAAGTCGGAGAAAGCGAAAAGAAGTGTGTCAAATG
AGAGCAAAATCATGGCGAGGAGGTCGGAAAGTGTAAAGTACAAGAAGCAAAGGGAGAAGAAAATGACTTTTCAAAGATGA
Protein sequenceShow/hide protein sequence
MSSNSNSPPNLLVADSFLPYEPKRHPYKCANVAFFSSLAFLLSIFFCISIFNLSLSSLFNSSNFWFCISNTLILIIAADYGVFSPSKRQLDFYEDYISKPKQTQDQSSSL
HLDEQQPKKNLTPKQERLVVVDKERENPEEKLQIIVEGREDIFQRNKRCEEMKIQTRTYRRSKSEKAKRSVSNESKIMARRSESVKYKKQREKKMTFQR