; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028025 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028025
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionExtensin-like protein
Genome locationtig00153056:2681878..2685193
RNA-Seq ExpressionSgr028025
SyntenySgr028025
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649619.1 hypothetical protein Csa_012837 [Cucumis sativus]4.6e-5372.46Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP
        MD DEFYRQPAAVPFKWEIKPGVPRNHH+LR  P+  P Q     QKL+PPPA SHF  P +    SLH   RT+S+RWRF RS     Q+ S GCFPSP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP

Query:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWA
        LP RKS KSV RK PEPDYSS+L+TLSRWS+SSRKSISPFR SVSSSPSSFSSY+SSPRPTSDTEWA
Subjt:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWA

XP_004142634.1 uncharacterized protein LOC101220757 [Cucumis sativus]9.3e-5472.62Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP
        MD DEFYRQPAAVPFKWEIKPGVPRNHH+LR  P+  P Q     QKL+PPPA SHF  P +    SLH   RT+S+RWRF RS     Q+ S GCFPSP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP

Query:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG
        LP RKS KSV RK PEPDYSS+L+TLSRWS+SSRKSISPFR SVSSSPSSFSSY+SSPRPTSDTEWAG
Subjt:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG

XP_008444194.1 PREDICTED: uncharacterized protein LOC103487607 [Cucumis melo]1.7e-5272.19Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP
        MD DEFYR+PAAVPFKWEIKPGVPRNHH+ RQ P+  P Q     QKL+PPPA SHF  PS+    SLH   RTRSDRWRF RS     Q+ S GCFPSP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP

Query:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESV-SSSPSSFSSYRSSPRPTSDTEWAG
        LP RKS K++ RK PEPDYSS+L+TLSRWS+SSRKSISPFR SV SSSPSSFSSY+SSPRPTSDTEWAG
Subjt:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESV-SSSPSSFSSYRSSPRPTSDTEWAG

XP_022131529.1 uncharacterized protein DKFZp434B061-like [Momordica charantia]2.7e-6176.33Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSP--GCFP
        MD DEFYR+PAAVPFKWEIKPGVPR HH+L   PS       PPPQKL+PPP  SHF RPS+S + SLH   RTRSDRWRFARS LAEP  VSP  GCFP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSP--GCFP

Query:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG
        SP P RKSGKS+ RKPEP+Y++ELETLSRWS+SSRKSISPFR+SVSSSPSSFSSY+SSPRPTSDTEWAG
Subjt:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG

XP_038899347.1 uncharacterized protein LOC120086669 [Benincasa hispida]1.1e-5471.86Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP
        MDVDEFYRQPAAVPFKWEIKPGVP+NHH+LR  P+  P   P   QKL+PPP+ S+FL PS+    SLH   RTRSDRWRF     ++P+ VS GCFPSP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP

Query:  LPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG
        LP RKS KS+ R PEPDYSS LE+LSRWS+SSRKSISPFR SVSSSPSS+SSY SSPRPTSDTEWAG
Subjt:  LPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG

TrEMBL top hitse value%identityAlignment
A0A1S3BAM4 uncharacterized protein LOC1034876078.5e-5372.19Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP
        MD DEFYR+PAAVPFKWEIKPGVPRNHH+ RQ P+  P Q     QKL+PPPA SHF  PS+    SLH   RTRSDRWRF RS     Q+ S GCFPSP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSP

Query:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESV-SSSPSSFSSYRSSPRPTSDTEWAG
        LP RKS K++ RK PEPDYSS+L+TLSRWS+SSRKSISPFR SV SSSPSSFSSY+SSPRPTSDTEWAG
Subjt:  LPKRKSGKSVVRK-PEPDYSSELETLSRWSISSRKSISPFRESV-SSSPSSFSSYRSSPRPTSDTEWAG

A0A6J1BQH3 uncharacterized protein DKFZp434B061-like1.3e-6176.33Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSP--GCFP
        MD DEFYR+PAAVPFKWEIKPGVPR HH+L   PS       PPPQKL+PPP  SHF RPS+S + SLH   RTRSDRWRFARS LAEP  VSP  GCFP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSP--GCFP

Query:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG
        SP P RKSGKS+ RKPEP+Y++ELETLSRWS+SSRKSISPFR+SVSSSPSSFSSY+SSPRPTSDTEWAG
Subjt:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG

A0A6J1FHC7 uncharacterized protein LOC1114457752.7e-5169.23Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPA--ASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFP
        MDVDEFYRQPAAVPFKWEIKPGVPRNHH+L Q P+  PQQ     +KL+PPPA  A+ F R S+S        LRTRSDRW   +S+LAEP+ VS GCF 
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPA--ASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFP

Query:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG
        SPLP RK+ K V RKPEPDY+SELETL RWS+SS+KSISPFR SVSS  SS SSY+SSPRPTSD+EWAG
Subjt:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG

A0A6J1ISY3 uncharacterized protein LOC1114803251.5e-4967.84Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPA--ASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFP
        MDVDEFYRQPAAVPFKWEIKPGVPRNHH L   P+  PQQ     +KL+PPPA  A+ F R S+S        LRTRSDRW  ++S+LAEP+ VS GCF 
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPA--ASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFP

Query:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSS--SPSSFSSYRSSPRPTSDTEWAG
        SPLP RK+ K + RKPEPD +SELETL RWS+SS+KSISPFR SVSS  SPSS SSY+SSPRPTSD+EWAG
Subjt:  SPLPKRKSGKSVVRKPEPDYSSELETLSRWSISSRKSISPFRESVSS--SPSSFSSYRSSPRPTSDTEWAG

A0A7N2MK80 Uncharacterized protein7.0e-3953.4Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNH-----HQLRQLPSQCPQQR------------PP--PPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRF
        M +DE   +P A+PFKWEIKPGVP+ H     H+ +Q P + P+ R            PP   PQKLRPPP+ SHF+ P +  TRS     RTRS+RWRF
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNH-----HQLRQLPSQCPQQR------------PP--PPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRF

Query:  AR-SRLAEPQLVSPGCFPSPLPKRKSGKSVVRKP----EPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG
         R +    P++V+PGCF S   KRKS K+ ++KP    EPDYSS+LE LSRWS+SSR+S+SPFR S  S  SSFSSY+SSPRP SD EWAG
Subjt:  AR-SRLAEPQLVSPGCFPSPLPKRKSGKSVVRKP----EPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77400.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF688 (InterPro:IPR007789)4.5e-1432.33Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNH----------------HQLRQLPSQCPQQRPPP-----------------------------PQKLRP---PP
        +DVD+ +++P  +PF WEI+PGVP+                    LR  P    Q   PP                             P KL+P   P 
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNH----------------HQLRQLPSQCPQQRPPP-----------------------------PQKLRP---PP

Query:  AASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQ----------LVSPGCFPSP---LPKRKSG----KSVVRKPEPDYSSELETLSRWSISSRKS
        + S F  P  S   S     R  S+RW+  R     P+          +   GCFPSP   L K KSG    KS  R     Y S++ET+S W++SSR+S
Subjt:  AASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQ----------LVSPGCFPSP---LPKRKSG----KSVVRKPEPDYSSELETLSRWSISSRKS

Query:  ISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG
        +SP  E   S  SSFSS R SPR  ++ EW G
Subjt:  ISPFRESVSSSPSSFSSYRSSPRPTSDTEWAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCGATGAATTTTACAGGCAACCGGCTGCTGTTCCTTTCAAATGGGAGATCAAACCCGGCGTCCCCAGGAATCACCACCAACTCCGGCAGTTGCCAAGTCAGTG
TCCTCAACAACGTCCGCCGCCTCCGCAAAAGCTGAGGCCTCCTCCTGCTGCATCCCACTTCCTCCGACCTTCCGATTCCATGACCCGCTCCCTCCACTTGCCCTTGCGAA
CACGGTCTGATCGCTGGCGGTTCGCCCGGTCCAGGCTCGCCGAACCCCAGTTAGTTTCGCCCGGATGCTTCCCGTCGCCTCTGCCGAAGCGGAAATCGGGCAAGAGTGTG
GTCCGGAAACCCGAACCCGATTATTCCTCCGAACTGGAGACCCTGTCCCGGTGGTCCATTTCCAGCAGGAAGTCGATTTCTCCGTTCCGGGAGTCAGTTTCGTCGTCGCC
GTCGTCGTTCTCATCGTACCGGTCATCGCCCCGTCCAACTAGTGATACTGAGTGGGCCGGAAAGGCCCAGTTGTCGCCCCATCTAGCTAGTGATTCTCGGTGGGCCGGAA
CGTCGACTACTTGTTTGGAATGCCTGAAATCTCCGTGCAACAGAAAACAACGGCGCCATTTTCAACTCCGGCGTCATGCTAACAGAGAGCCATCGAACCGTATATTCCAG
CCTCTGATGTGTCACATCAATGAGTTCGATCGAAGTCAATCACAGATCGGAGACGGAGAATCTCCAATTGCATGGAACGGCCGCCACGAAAAGACCAGACTCCACCATTC
GTTCCACGTTCCGCTCACTTCTTCGTCCCCGTTGAAGTGCTTTACGTTTTCACCAATAAAATCACCAACGGTGCCACGGCGATGTTTAGAGTCTCAACCAGGGAACAATA
CAATCGGAATGCGAGCCGATCTTGAAGGCATCCGAACCAAAAGTGAAATGATCGCCATGGATGGAGGCAGAAGAGAAACGGCCGCTGCTCTATTGTACATGGCAATGACA
ATGGCATGGACTGAGTCTCTGAGGGAGGAGAGCACTACTTCAGCAACCAACGAGTCAATGACTTGGCCTCTGAGCGCCAAAATTAAGTTCGTTCAAATTTTTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCGATGAATTTTACAGGCAACCGGCTGCTGTTCCTTTCAAATGGGAGATCAAACCCGGCGTCCCCAGGAATCACCACCAACTCCGGCAGTTGCCAAGTCAGTG
TCCTCAACAACGTCCGCCGCCTCCGCAAAAGCTGAGGCCTCCTCCTGCTGCATCCCACTTCCTCCGACCTTCCGATTCCATGACCCGCTCCCTCCACTTGCCCTTGCGAA
CACGGTCTGATCGCTGGCGGTTCGCCCGGTCCAGGCTCGCCGAACCCCAGTTAGTTTCGCCCGGATGCTTCCCGTCGCCTCTGCCGAAGCGGAAATCGGGCAAGAGTGTG
GTCCGGAAACCCGAACCCGATTATTCCTCCGAACTGGAGACCCTGTCCCGGTGGTCCATTTCCAGCAGGAAGTCGATTTCTCCGTTCCGGGAGTCAGTTTCGTCGTCGCC
GTCGTCGTTCTCATCGTACCGGTCATCGCCCCGTCCAACTAGTGATACTGAGTGGGCCGGAAAGGCCCAGTTGTCGCCCCATCTAGCTAGTGATTCTCGGTGGGCCGGAA
CGTCGACTACTTGTTTGGAATGCCTGAAATCTCCGTGCAACAGAAAACAACGGCGCCATTTTCAACTCCGGCGTCATGCTAACAGAGAGCCATCGAACCGTATATTCCAG
CCTCTGATGTGTCACATCAATGAGTTCGATCGAAGTCAATCACAGATCGGAGACGGAGAATCTCCAATTGCATGGAACGGCCGCCACGAAAAGACCAGACTCCACCATTC
GTTCCACGTTCCGCTCACTTCTTCGTCCCCGTTGAAGTGCTTTACGTTTTCACCAATAAAATCACCAACGGTGCCACGGCGATGTTTAGAGTCTCAACCAGGGAACAATA
CAATCGGAATGCGAGCCGATCTTGAAGGCATCCGAACCAAAAGTGAAATGATCGCCATGGATGGAGGCAGAAGAGAAACGGCCGCTGCTCTATTGTACATGGCAATGACA
ATGGCATGGACTGAGTCTCTGAGGGAGGAGAGCACTACTTCAGCAACCAACGAGTCAATGACTTGGCCTCTGAGCGCCAAAATTAAGTTCGTTCAAATTTTTTTTTAA
Protein sequenceShow/hide protein sequence
MDVDEFYRQPAAVPFKWEIKPGVPRNHHQLRQLPSQCPQQRPPPPQKLRPPPAASHFLRPSDSMTRSLHLPLRTRSDRWRFARSRLAEPQLVSPGCFPSPLPKRKSGKSV
VRKPEPDYSSELETLSRWSISSRKSISPFRESVSSSPSSFSSYRSSPRPTSDTEWAGKAQLSPHLASDSRWAGTSTTCLECLKSPCNRKQRRHFQLRRHANREPSNRIFQ
PLMCHINEFDRSQSQIGDGESPIAWNGRHEKTRLHHSFHVPLTSSSPLKCFTFSPIKSPTVPRRCLESQPGNNTIGMRADLEGIRTKSEMIAMDGGRRETAAALLYMAMT
MAWTESLREESTTSATNESMTWPLSAKIKFVQIFF