CuGenDBv2

Sgr024417 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024417
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: tig00001291:2599642..2601981
RNA-Seq Expression: Sgr024417
Synteny: Sgr024417
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin
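
The genome location string can be split into contig, start and end for downstream use. The following is a minimal sketch, not part of the database record: the names `Location` and `parse_location` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so add 1 (assumption, see lead-in).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string as shown in the record."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)))


loc = parse_location("tig00001291:2599642..2601981")
print(loc.contig, loc.length)  # tig00001291 2340
```

Under that assumption the gene spans 2,340 bp on the contig, which is longer than the 480-nt CDS listed in this record.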


Homology
GenBank top hits (e value, % identity, alignment)
KAG2681134.1 hypothetical protein I3760_11G130700 [Carya illinoinensis] (e value: 1.1e-14; % identity: 49.37)
Query:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF
        E  P MGSGHFPE G+ K+AY+NQI+VV + +  FVDP +    +  +   CY+AIH   +   WGHH++FGG GNC F
Subjt:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF

KAG7956575.1 hypothetical protein I3843_11G130800, partial [Carya illinoinensis] (e value: 4.1e-14; % identity: 49.35)
Query:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC
        E  P MGSGHFPE G+ K+AY+NQI+VV + +  FVDP +    +  +   CY+AIH   +   WGHH++FGG GNC
Subjt:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC

XP_022158434.1 uncharacterized protein LOC111024921 [Momordica charantia] (e value: 6.5e-20; % identity: 30.2)
Query:  SINKFPAGSVSIRRTRKEDLIAAKYLKPLRLHPLTDNLRPDTTIDANG-HFET------------------------QFSITSMGLDEDSRNQENSVQ--
        +IN  P GSV IRRT KEDLIAAK  KPL  H   D+ RP TTIDANG H  T                        QFS  S+ L    R+Q+N++Q  
Subjt:  SINKFPAGSVSIRRTRKEDLIAAKYLKPLRLHPLTDNLRPDTTIDANG-HFET------------------------QFSITSMGLDEDSRNQENSVQ--

Query:  --------AGWG-------PEPRPAMGSGHFPEGGFTKTAYMNQIQVV-YKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC
                A WG        E  PAMGSGHFPE GF K+A++NQIQVV Y   + FVDP D +L + ++   C+  I+KFT  GNWG HIFFGG   C
Subjt:  --------AGWG-------PEPRPAMGSGHFPEGGFTKTAYMNQIQVV-YKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC

XP_040986169.1 uncharacterized protein LOC121234329 isoform X2 [Juglans microcarpa x Juglans regia] (e value: 4.1e-14; % identity: 51.28)
Query:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVY-KGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC
        E  P MGSGHFPE G+ K+AY+NQI+VV  K +  FVDP D    +  +   CY+AIH   +   WGHH++FGG GNC
Subjt:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVY-KGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC

XP_042950336.1 uncharacterized protein LOC122282452, partial [Carya illinoinensis] (e value: 1.1e-14; % identity: 49.37)
Query:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF
        E  P MGSGHFPE G+ K+AY+NQI+VV + +  FVDP +    +  +   CY+AIH   +   WGHH++FGG GNC F
Subjt:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF

TrEMBL top hits (e value, % identity, alignment)
A0A2I4H8L6 uncharacterized protein LOC109014471 isoform X1 (e value: 2.2e-13; % identity: 48.1)
Query:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF
        E  P MGSG FPE G+ K+AY+ QI+VV      FVDP D    +  +   CY+AIH   +   WGHH++FGG GNC F
Subjt:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF

A0A2N9FVS7 Uncharacterized protein (e value: 1.1e-12; % identity: 31.06)
Query:  PAGSVSIRRTRKEDLIAAKYLKPL-RLHPLTDNLRPDTTIDANGHFETQ-----------------------FSITSMGLDEDSRN--------------
        P GSV IRRT KE+LI AKYLK L R +P   +    +TID  G+   Q                       F++   G  + S N              
Subjt:  PAGSVSIRRTRKEDLIAAKYLKPL-RLHPLTDNLRPDTTIDANGHFETQ-----------------------FSITSMGLDEDSRN--------------

Query:  --------------------------------------QENSVQAGWGP-------EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLV
                                               E +    WG        E  PAMGSGHFPE  + K+AY++QIQ++    A FVDP D  L 
Subjt:  --------------------------------------QENSVQAGWGP-------EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLV

Query:  LSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF
        L V+   CY+AI    +   WGH+I+FGG GNC F
Subjt:  LSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF

A0A2P5EGA7 Uncharacterized protein (e value: 2.4e-12; % identity: 40.32)
Query:  ANGHFETQFSITSMGL---DEDSRNQENSVQAGWGPEP-------RPAMGSGHFPEGGFTKTAYMNQIQVVYKGLAS--FVDPVDLQLVLSVNYRTCYDA
        A+GH+   F    +G    +   R  E +    WG E         PAMG GHFPE GF K AY+ QI+VV     S  F DP D  L    N   CY+A
Subjt:  ANGHFETQFSITSMGL---DEDSRNQENSVQAGWGPEP-------RPAMGSGHFPEGGFTKTAYMNQIQVVYKGLAS--FVDPVDLQLVLSVNYRTCYDA

Query:  IHKFTEAGNWGHHIFFGGRGNCDF
         + +  AG WG+++FFGG GNC F
Subjt:  IHKFTEAGNWGHHIFFGGRGNCDF

A0A6J1DZE3 uncharacterized protein LOC111024921 (e value: 3.2e-20; % identity: 30.2)
Query:  SINKFPAGSVSIRRTRKEDLIAAKYLKPLRLHPLTDNLRPDTTIDANG-HFET------------------------QFSITSMGLDEDSRNQENSVQ--
        +IN  P GSV IRRT KEDLIAAK  KPL  H   D+ RP TTIDANG H  T                        QFS  S+ L    R+Q+N++Q  
Subjt:  SINKFPAGSVSIRRTRKEDLIAAKYLKPLRLHPLTDNLRPDTTIDANG-HFET------------------------QFSITSMGLDEDSRNQENSVQ--

Query:  --------AGWG-------PEPRPAMGSGHFPEGGFTKTAYMNQIQVV-YKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC
                A WG        E  PAMGSGHFPE GF K+A++NQIQVV Y   + FVDP D +L + ++   C+  I+KFT  GNWG HIFFGG   C
Subjt:  --------AGWG-------PEPRPAMGSGHFPEGGFTKTAYMNQIQVV-YKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC

A0A6P9E585 uncharacterized protein LOC109014471 isoform X2 (e value: 2.2e-13; % identity: 48.1)
Query:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF
        E  P MGSG FPE G+ K+AY+ QI+VV      FVDP D    +  +   CY+AIH   +   WGHH++FGG GNC F
Subjt:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G20170.1 Protein of Unknown Function (DUF239) (e value: 1.2e-08; % identity: 37.84)
Query:  PAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC
        P MGSGHFP+ GF K A++N ++V+ + +     P    L L  N   CY    K      W   IF+GG G C
Subjt:  PAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC

AT2G20170.2 Protein of Unknown Function (DUF239) (e value: 1.2e-08; % identity: 37.84)
Query:  PAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC
        P MGSGHFP+ GF K A++N ++V+ + +     P    L L  N   CY    K      W   IF+GG G C
Subjt:  PAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC

AT2G20170.3 Protein of Unknown Function (DUF239) (e value: 1.2e-08; % identity: 37.84)
Query:  PAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC
        P MGSGHFP+ GF K A++N ++V+ + +     P    L L  N   CY    K      W   IF+GG G C
Subjt:  PAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNC

AT2G44250.1 Protein of Unknown Function (DUF239) (e value: 6.7e-07; % identity: 41.43)
Query:  MGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRG
        MGSGHF E GF K AY+N I+ + K     + P    L  SV    CY+   K   +  WG +IF+GG G
Subjt:  MGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRG

AT4G23390.1 Protein of Unknown Function (DUF239) (e value: 4.7e-08; % identity: 36.25)
Query:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGN-WGHHIFFGGRGNCDF
        E  P+MGSGHFP+ GF K AY+N ++++         P+   L    +   CY+ + K    G  W   I FGG G C F
Subjt:  EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGN-WGHHIFFGGRGNCDF
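
The % identity values in the hit tables can be checked against the middle line of each alignment. The sketch below is an illustration, not the database's own method. It assumes the usual pairwise-alignment convention (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap), that the displayed segment is the whole aligned region, and that the alignment length equals the number of query columns. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    The query line sets the alignment length, so trailing blanks
    trimmed from the midline do not change the result.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # Letters in the midline mark identical residues (assumed convention).
    identical = sum(ch.isalpha() for ch in midline[:columns])
    return round(100.0 * identical / columns, 2)


# Query and midline of the first GenBank hit (KAG2681134.1) in this record.
QUERY = "EPRPAMGSGHFPEGGFTKTAYMNQIQVVYKGLASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDF"
MIDLINE = "E  P MGSGHFPE G+ K+AY+NQI+VV + +  FVDP +    +  +   CY+AIH   +   WGHH++FGG GNC F"

print(percent_identity(QUERY, MIDLINE))  # 49.37
```

For that hit, 39 identical positions over 79 columns gives 49.37, which agrees with the value listed in the table.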


Sequences
CDS sequence
ATGAGCATTAACAAGTTTCCTGCAGGATCAGTATCCATAAGACGGACTAGAAAGGAAGATTTGATAGCAGCAAAATATTTAAAGCCCTTGCGATTGCATCCTTTGACAGA
TAACCTGCGGCCAGACACCACAATCGATGCAAATGGTCATTTTGAAACGCAATTTAGCATCACTAGTATGGGGCTTGACGAGGACTCTAGAAATCAAGAGAACAGTGTAC
AAGCTGGTTGGGGACCTGAACCGAGGCCTGCAATGGGGAGTGGCCATTTTCCTGAAGGGGGATTTACAAAAACTGCATACATGAATCAGATCCAAGTAGTGTATAAAGGT
TTGGCGTCGTTTGTTGATCCAGTTGATTTGCAACTCGTTCTGTCAGTAAACTATCGTACCTGCTATGATGCCATTCATAAATTTACTGAAGCTGGAAACTGGGGGCATCA
TATATTCTTTGGTGGGCGTGGGAATTGTGATTTCAAGTGA
mRNA sequence
ATGAGCATTAACAAGTTTCCTGCAGGATCAGTATCCATAAGACGGACTAGAAAGGAAGATTTGATAGCAGCAAAATATTTAAAGCCCTTGCGATTGCATCCTTTGACAGA
TAACCTGCGGCCAGACACCACAATCGATGCAAATGGTCATTTTGAAACGCAATTTAGCATCACTAGTATGGGGCTTGACGAGGACTCTAGAAATCAAGAGAACAGTGTAC
AAGCTGGTTGGGGACCTGAACCGAGGCCTGCAATGGGGAGTGGCCATTTTCCTGAAGGGGGATTTACAAAAACTGCATACATGAATCAGATCCAAGTAGTGTATAAAGGT
TTGGCGTCGTTTGTTGATCCAGTTGATTTGCAACTCGTTCTGTCAGTAAACTATCGTACCTGCTATGATGCCATTCATAAATTTACTGAAGCTGGAAACTGGGGGCATCA
TATATTCTTTGGTGGGCGTGGGAATTGTGATTTCAAGTGA
Protein sequence
MSINKFPAGSVSIRRTRKEDLIAAKYLKPLRLHPLTDNLRPDTTIDANGHFETQFSITSMGLDEDSRNQENSVQAGWGPEPRPAMGSGHFPEGGFTKTAYMNQIQVVYKG
LASFVDPVDLQLVLSVNYRTCYDAIHKFTEAGNWGHHIFFGGRGNCDFK
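
The relationship between the CDS and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch assuming the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base. The names `CODON_TABLE` and `translate` are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed in this record.
print(translate("ATGAGCATTAACAAGTTTCCTGCAGGATCA"))  # MSINKFPAGS
```

Applied to the full 480-nt CDS above, the same function should reproduce the 159-residue protein sequence listed in this record, ending at the TGA stop codon.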