; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr004619 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr004619
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationtig00003114:46251..54306
RNA-Seq ExpressionSgr004619
SyntenySgr004619
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053047.1 uncharacterized protein E6C27_scaffold344G001630 [Cucumis melo var. makuwa]5.6e-1746.22Show/hide
Query:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------
        VN  +NGD +TR FVYWTA+  A TGCY+M CQGFVQV+ +     PL+P+STYQGQQYDYQFT+ +I                W  E            
Subjt:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------

Query:  ---AFAKPSPYGMSPPLGN
             AKPS  GMSP LG+
Subjt:  ---AFAKPSPYGMSPPLGN

KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]3.1e-1559.42Show/hide
Query:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK
        VN  +NGD +TR FVYWTA+    TGCY+M CQGFVQV+ +     PL P+STY+GQQYDYQFT+ +I+
Subjt:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]5.6e-1746.22Show/hide
Query:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------
        VN  +NGD +TR FVYWTA+  A TGCY+M CQGFVQV+ +     PL+P+STYQGQQYDYQFT+ +I                W  E            
Subjt:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------

Query:  ---AFAKPSPYGMSPPLGN
             AKPS  GMSP LG+
Subjt:  ---AFAKPSPYGMSPPLGN

XP_022145287.1 uncharacterized protein LOC111014775 [Momordica charantia]5.4e-2047.15Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK------------------WHGEAF------
        QVN  INGDS TR+FVYWTA+    TG Y+M C+ F+Q + +  PN PLYPSSTYQG+QYDY FTV++ +                  W  E F      
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK------------------WHGEAF------

Query:  ---------AKPSPYGMSPPLGN
                 AKPSP GMSPPLGN
Subjt:  ---------AKPSPYGMSPPLGN

XP_031738648.1 uncharacterized protein LOC105435061 [Cucumis sativus]3.1e-1559.42Show/hide
Query:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK
        VN  +NGD +TR FVYWTA+    TGCY+M CQGFVQV+ +     PL P+STY+GQQYDYQFT+ +I+
Subjt:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK

TrEMBL top hitse value%identityAlignment
A0A072TRZ0 Carboxyl-terminal peptidase1.5e-1242.7Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHGEAFAKPSPYGMSPPLG
        QVN  + GD+L RLF+YWTA+   +TGCYD+LC GFVQ ++ I     + P+STY G QY+    +Y+   +G  + +   YG+  P+G
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHGEAFAKPSPYGMSPPLG

A0A0A0L400 Neprosin domain-containing protein1.6e-1455.56Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWH
        QVN A+NGD+L R FVYWT +    TGCY+MLCQGFV V+  I     + P+S YQGQQYDYQF++ +   H
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWH

A0A5A7UEV4 Uncharacterized protein2.7e-1746.22Show/hide
Query:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------
        VN  +NGD +TR FVYWTA+  A TGCY+M CQGFVQV+ +     PL+P+STYQGQQYDYQFT+ +I                W  E            
Subjt:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------

Query:  ---AFAKPSPYGMSPPLGN
             AKPS  GMSP LG+
Subjt:  ---AFAKPSPYGMSPPLGN

A0A5D3CJM0 Neprosin 22.7e-1746.22Show/hide
Query:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------
        VN  +NGD +TR FVYWTA+  A TGCY+M CQGFVQV+ +     PL+P+STYQGQQYDYQFT+ +I                W  E            
Subjt:  VNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK---------------WHGE------------

Query:  ---AFAKPSPYGMSPPLGN
             AKPS  GMSP LG+
Subjt:  ---AFAKPSPYGMSPPLGN

A0A6J1CW60 uncharacterized protein LOC1110147752.6e-2047.15Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK------------------WHGEAF------
        QVN  INGDS TR+FVYWTA+    TG Y+M C+ F+Q + +  PN PLYPSSTYQG+QYDY FTV++ +                  W  E F      
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIK------------------WHGEAF------

Query:  ---------AKPSPYGMSPPLGN
                 AKPSP GMSPPLGN
Subjt:  ---------AKPSPYGMSPPLGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G70550.1 Protein of Unknown Function (DUF239)9.8e-1238.36Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHG
        Q++  + GD+  R F YWT++    TGCY++LC GFVQ +R I     + P S+Y+G Q+D    +++   HG
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHG

AT1G70550.2 Protein of Unknown Function (DUF239)9.8e-1238.36Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHG
        Q++  + GD+  R F YWT++    TGCY++LC GFVQ +R I     + P S+Y+G Q+D    +++   HG
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHG

AT2G44210.1 Protein of Unknown Function (DUF239)1.3e-1142.65Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYR
        QV+  + GD+ TRLF YWT++    TGCY++LC GFVQ++R I     + P S Y   QYD    +++
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYR

AT2G44210.2 Protein of Unknown Function (DUF239)1.3e-1142.65Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYR
        QV+  + GD+ TRLF YWT++    TGCY++LC GFVQ++R I     + P S Y   QYD    +++
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYR

AT5G50150.1 Protein of Unknown Function (DUF239)4.4e-1231.25Show/hide
Query:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHGEAFAKPSPYGMSPPLGNVWIVGTKGCL
        QV+  + GD+  R F YWT +    TGCY++LC GFVQ +  I     + P S+Y G+Q+D    +++   HG  + +         LGN  +VG     
Subjt:  QVNLAINGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHGEAFAKPSPYGMSPPLGNVWIVGTKGCL

Query:  IASPLEDQVERI
        + S L      +
Subjt:  IASPLEDQVERI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATATAAAACGCCGTCTTGTGCGACGGAAGCCGGTTTAGCGGTTGGTAGCCCCAACGAATGGGTGGCTATTGAAGGCGATTTTGTCGGAGAGACGGCCACCAAAGG
CAGTGAGGAGGGTGGTTTTTTCGCTGTCGGAAGGCCGGAGCATGGGGAGAATTTCGCCGGGGGAAGACGACGCTGCTTAACCAAAATGGTTTTTCTCTCTACTCACCCAA
TGATGAACTGCTGCTCCTCAAATCCTCAAACCCACACTGTTGACTATGCACAAACTTCATCCTATCTGGCACAGTTTATTACTATTTTTTGCCAAGTGAATCTGGCAATC
AACGGTGATAGTCTCACTAGATTGTTTGTGTACTGGACGGCTAATGAAGCTGCTAAAACAGGATGCTACGATATGCTTTGTCAAGGTTTTGTACAAGTAGATCGAACAAT
TACTCCAAACTTCCCTCTTTACCCATCCTCCACCTATCAAGGGCAACAATATGACTATCAATTTACAGTTTATCGGATCAAGTGGCATGGGGAGGCATTTGCAAAGCCTT
CACCATATGGAATGAGCCCTCCCTTAGGCAACGTGTGGATTGTGGGCACGAAAGGTTGTCTTATTGCTTCACCTTTGGAGGACCAGGTGGAAAGAATTGTAGAGCCAATT
AGGCTACATGTTGAAGCTCTTGTATGGGATTTTTCCCAATGCCTTCGCCAAAGGTTCGCTCTGCGTCAACACGCTCTTCAACTTCATTTACGACGATCCGATCCGTCTTA
TGGCCGTATCGTGCTAGACTACGGGTTAGTTAGAGGTCGATGCTATGGGATCGGTTCCTCGACCAAAGTGATCCACCTCGGCATCTCGGCACAGGCTCATAGGCACCTCA
AATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATATAAAACGCCGTCTTGTGCGACGGAAGCCGGTTTAGCGGTTGGTAGCCCCAACGAATGGGTGGCTATTGAAGGCGATTTTGTCGGAGAGACGGCCACCAAAGG
CAGTGAGGAGGGTGGTTTTTTCGCTGTCGGAAGGCCGGAGCATGGGGAGAATTTCGCCGGGGGAAGACGACGCTGCTTAACCAAAATGGTTTTTCTCTCTACTCACCCAA
TGATGAACTGCTGCTCCTCAAATCCTCAAACCCACACTGTTGACTATGCACAAACTTCATCCTATCTGGCACAGTTTATTACTATTTTTTGCCAAGTGAATCTGGCAATC
AACGGTGATAGTCTCACTAGATTGTTTGTGTACTGGACGGCTAATGAAGCTGCTAAAACAGGATGCTACGATATGCTTTGTCAAGGTTTTGTACAAGTAGATCGAACAAT
TACTCCAAACTTCCCTCTTTACCCATCCTCCACCTATCAAGGGCAACAATATGACTATCAATTTACAGTTTATCGGATCAAGTGGCATGGGGAGGCATTTGCAAAGCCTT
CACCATATGGAATGAGCCCTCCCTTAGGCAACGTGTGGATTGTGGGCACGAAAGGTTGTCTTATTGCTTCACCTTTGGAGGACCAGGTGGAAAGAATTGTAGAGCCAATT
AGGCTACATGTTGAAGCTCTTGTATGGGATTTTTCCCAATGCCTTCGCCAAAGGTTCGCTCTGCGTCAACACGCTCTTCAACTTCATTTACGACGATCCGATCCGTCTTA
TGGCCGTATCGTGCTAGACTACGGGTTAGTTAGAGGTCGATGCTATGGGATCGGTTCCTCGACCAAAGTGATCCACCTCGGCATCTCGGCACAGGCTCATAGGCACCTCA
AATAA
Protein sequenceShow/hide protein sequence
MGYKTPSCATEAGLAVGSPNEWVAIEGDFVGETATKGSEEGGFFAVGRPEHGENFAGGRRRCLTKMVFLSTHPMMNCCSSNPQTHTVDYAQTSSYLAQFITIFCQVNLAI
NGDSLTRLFVYWTANEAAKTGCYDMLCQGFVQVDRTITPNFPLYPSSTYQGQQYDYQFTVYRIKWHGEAFAKPSPYGMSPPLGNVWIVGTKGCLIASPLEDQVERIVEPI
RLHVEALVWDFSQCLRQRFALRQHALQLHLRRSDPSYGRIVLDYGLVRGRCYGIGSSTKVIHLGISAQAHRHLK