; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021399 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021399
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionaspartic proteinase CDR1-like
Genome locationtig00153666:689044..690104
RNA-Seq ExpressionSgr021399
SyntenySgr021399
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582237.1 Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia]2.3e-5741.97Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------
        MV T VGFTARLIH DSPLSP Y+H   + AR++                             +PTLVHEGGEYLMSF IGNP S+V+G           
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------

Query:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG
                                                                L YED S TSG LSSDSFSFDT+DGK VDVGYLNFGCS+APL G
Subjt:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG

Query:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT
        G +S                                                     +YPN DAY+VKVLG SV  D+  LD V DVYDV DGWIIDS T
Subjt:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT

Query:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL
        TYSSL TDAFD LL KF TLP+L ++K+DPRN+FELC AANAND+E+FPD +  L
Subjt:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL

KAG7018636.1 Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. argyrosperma]6.2e-5842.25Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVKT-----------------------------PTLVHEGGEYLMSFYIGNPPSRVLG-----------
        MV T VGFTARLIH DSPLSP Y+H   + AR++                              PTLVHEGGEYLMSF IGNP S+V+G           
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVKT-----------------------------PTLVHEGGEYLMSFYIGNPPSRVLG-----------

Query:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG
                                                                L YED S TSG LSSDSFSFDT+DGK VDVGYLNFGCS+APLTG
Subjt:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG

Query:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT
        G +S                                                     +YPN DAY+VKVLG SV  D+  LD V DVYDV DGWIIDS T
Subjt:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT

Query:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL
        TYSSL TDAFD LL KF TLP+L ++K+DPRN+FELC AANAND+E+FPD +  L
Subjt:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL

XP_022137990.1 aspartic proteinase CDR1-like [Momordica charantia]2.2e-6344Show/hide
Query:  MVL-TRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG---------
        MVL T VGFTA LIH DSPLSP YNH+  D AR++                              +PTLVHEGGEYLMSF+IGNPPSRV+G         
Subjt:  MVL-TRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG---------

Query:  ---------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLT
                                                                 LEYED S T+GILSSDSFSFDTSDGKLVDVGYLNFGCSDAPL 
Subjt:  ---------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLT

Query:  GGSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSR
        GG +S                                                     +YPNLDAY+VKV+G S+  DELYLD VSDV+DVGDGWI+DS 
Subjt:  GGSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSR

Query:  TTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFP
         TYSSL TDAFD L+DK +  P LPKRKDDPRN+FE+C A N +DLES P
Subjt:  TTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFP

XP_022955985.1 aspartic proteinase CDR1-like [Cucurbita moschata]3.6e-5842.54Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------
        MV T VGFTARLIH DSPLSP YNH   + AR++                             +PTLVHEGGEYLMSF IGNP S+V+G           
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------

Query:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG
                                                                L YED S TSG LSSDSFSFDT+DGK VDVGYLNFGCS+APLTG
Subjt:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG

Query:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT
        G +S                                                     +YPN DAY+VKVLG SV  D+  LD V DVYDV DGWIIDS T
Subjt:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT

Query:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL
        TYSSL TDAFD LL KF TLP+L ++K+DPRN+FELC AANAND+E+FPD +  L
Subjt:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL

XP_023528351.1 aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]9.5e-5942.25Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------
        MV T VGFTARLIH DSP+SP Y+H   + A+++                             +PTLVHEGGEYLMSF IGNPPS+V+G           
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------

Query:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG
                                                                L YED S TSG LSSDSFSFDT+DGK VDVGYLNFGCS+APLTG
Subjt:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG

Query:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT
        G +S                                                     +YPN DAY+VKVLG SV  D+  LD V DVYDV DGWIIDS T
Subjt:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT

Query:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL
        TYSSL TDAFD LL KF+TLPDL ++K+DPRN+FELC AANAND+E+FPD +  L
Subjt:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL

TrEMBL top hitse value%identityAlignment
A0A0A0L7U3 Peptidase A1 domain-containing protein9.6e-5741.81Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG----------
        MV   VGFTARLIHHDSPLSP YNHT  D AR++                              +PTLV+EGGEYLMSF IGNP S+V+G          
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG----------

Query:  ----------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPL
                                                                  L Y D  ATSGILSSDSF FDTSDG LVDVG+LNFGCS+APL
Subjt:  ----------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPL

Query:  T-----------------------------------------------------GGSKSSVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIID
        T                                                     GG    +YPN DAY+VKVLG S+ NDE + D V DVY+V DGWIID
Subjt:  T-----------------------------------------------------GGSKSSVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIID

Query:  SRTTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCL-AANANDLESFPD
        +  TYSSL TDAFDSLL KFLTL D P+RKDDP+ +FELC    NANDLESFPD
Subjt:  SRTTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCL-AANANDLESFPD

A0A5D3CXD4 Aspartic proteinase CDR1-like2.1e-5641.53Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG----------
        MV   VGFTARLIHHDSPLSP YNH     AR++                              +PTLV+EGGEYLMSF IGNPPS+V+G          
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG----------

Query:  ----------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPL
                                                                  L Y D  ATSGILSSDSF FDTSDGKLVDVG+LNFGCS+APL
Subjt:  ----------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPL

Query:  T-----------------------------------------------------GGSKSSVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIID
        T                                                     GG    +YPN DAY+VKVLG S+ NDE + D V DVYDV DGWIID
Subjt:  T-----------------------------------------------------GGSKSSVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIID

Query:  SRTTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCL-AANANDLESFPD
        +  TYSSL TDAFDSLL KFL L + P+RK+DP+++FELC   ANANDLESFPD
Subjt:  SRTTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCL-AANANDLESFPD

A0A6J1C870 aspartic proteinase CDR1-like1.1e-6344Show/hide
Query:  MVL-TRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG---------
        MVL T VGFTA LIH DSPLSP YNH+  D AR++                              +PTLVHEGGEYLMSF+IGNPPSRV+G         
Subjt:  MVL-TRVGFTARLIHHDSPLSPLYNHTTKDKARVK------------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG---------

Query:  ---------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLT
                                                                 LEYED S T+GILSSDSFSFDTSDGKLVDVGYLNFGCSDAPL 
Subjt:  ---------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLT

Query:  GGSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSR
        GG +S                                                     +YPNLDAY+VKV+G S+  DELYLD VSDV+DVGDGWI+DS 
Subjt:  GGSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSR

Query:  TTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFP
         TYSSL TDAFD L+DK +  P LPKRKDDPRN+FE+C A N +DLES P
Subjt:  TTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFP

A0A6J1GWK9 aspartic proteinase CDR1-like1.8e-5842.54Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------
        MV T VGFTARLIH DSPLSP YNH   + AR++                             +PTLVHEGGEYLMSF IGNP S+V+G           
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVK-----------------------------TPTLVHEGGEYLMSFYIGNPPSRVLG-----------

Query:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG
                                                                L YED S TSG LSSDSFSFDT+DGK VDVGYLNFGCS+APLTG
Subjt:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG

Query:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT
        G +S                                                     +YPN DAY+VKVLG SV  D+  LD V DVYDV DGWIIDS T
Subjt:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT

Query:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL
        TYSSL TDAFD LL KF TLP+L ++K+DPRN+FELC AANAND+E+FPD +  L
Subjt:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFL

A0A6J1IXB1 aspartic proteinase CDR1-like1.5e-5742.69Show/hide
Query:  MVLTRVGFTARLIHHDSPLSPLYNHT-----------------------------TKDKARVKTPTLVHEGGEYLMSFYIGNPPSRVLG-----------
        MV T VGFTARLIH DSPLSP Y+H                              T D     +PTLVHEGGEYLMSF IGNPPS+V+G           
Subjt:  MVLTRVGFTARLIHHDSPLSPLYNHT-----------------------------TKDKARVKTPTLVHEGGEYLMSFYIGNPPSRVLG-----------

Query:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG
                                                                L YED S TSG LSSDSFSFDT+DGK VDVGYLNFGCS+APLTG
Subjt:  --------------------------------------------------------LEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTG

Query:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT
        G +S                                                     +YPN DAY+VKVLG SV  D+  L+ V DVYDV DGWIIDS T
Subjt:  GSKS----------------------------------------------------SVYPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRT

Query:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFP
        TYSSL TDAFD LL KF+TLPDL ++K+DPRN+FELC AANAND+E+FP
Subjt:  TYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFP

SwissProt top hitse value%identityAlignment
Q6XBF8 Aspartic proteinase CDR15.3e-0434.18Show/hide
Query:  RVGFTARLIHHDSPLSPLYN----------------------HTTKDKARVKTPTLVHEGGEYLMSFYIGNPPSRVLGL
        ++GFTA LIH DSP SP YN                       T KD        L    GEYLM+  IG PP  ++ +
Subjt:  RVGFTARLIHHDSPLSPLYN----------------------HTTKDKARVKTPTLVHEGGEYLMSFYIGNPPSRVLGL

Arabidopsis top hitse value%identityAlignment
AT1G31450.1 Eukaryotic aspartyl protease family protein5.3e-0742.11Show/hide
Query:  RVGFTARLIHHDSPLSPLYN--HTTKDK------------ARVKTPT-----LVHEGGEYLMSFYIGNPPSRVLGL
        R   T  LIH DSP SPLYN  HT  D+             R  T T     L+  GGEY MS  IG PPS+V  +
Subjt:  RVGFTARLIHHDSPLSPLYN--HTTKDK------------ARVKTPT-----LVHEGGEYLMSFYIGNPPSRVLGL

AT1G64830.1 Eukaryotic aspartyl protease family protein9.3e-0433.33Show/hide
Query:  GFTARLIHHDSPLSPLYNHT---------------------TKDKARVKTPT--LVHEGGEYLMSFYIGNPPSRVLGL
        GFT  LIH DSP SP YN                       + D A   +P   +    GEYLM+  IG PP  +L +
Subjt:  GFTARLIHHDSPLSPLYNHT---------------------TKDKARVKTPT--LVHEGGEYLMSFYIGNPPSRVLGL

AT5G33340.1 Eukaryotic aspartyl protease family protein3.8e-0534.18Show/hide
Query:  RVGFTARLIHHDSPLSPLYN----------------------HTTKDKARVKTPTLVHEGGEYLMSFYIGNPPSRVLGL
        ++GFTA LIH DSP SP YN                       T KD        L    GEYLM+  IG PP  ++ +
Subjt:  RVGFTARLIHHDSPLSPLYN----------------------HTTKDKARVKTPTLVHEGGEYLMSFYIGNPPSRVLGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACTAACTAGAGTTGGCTTCACTGCACGTTTGATTCACCATGACTCACCTTTATCACCGCTCTACAATCACACCACGAAAGACAAGGCACGGGTCAAGACACCCAC
ATTGGTTCATGAAGGTGGTGAGTACCTTATGAGTTTCTACATTGGAAATCCTCCAAGTCGAGTGCTAGGATTAGAATATGAAGATACTTCTGCAACAAGTGGAATTCTGT
CATCTGATAGTTTTAGTTTTGATACCTCAGATGGGAAACTTGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGATGCTCCTTTAACAGGAGGGTCAAAATCCTCTGTA
TATCCCAATTTAGATGCTTATCATGTGAAGGTTCTGGGAAATAGTGTCAGCAATGATGAGCTCTATTTAGATAGAGTTTCTGACGTATATGATGTCGGAGATGGATGGAT
CATAGATTCAAGAACAACATACTCAAGTCTTAGAACAGATGCATTTGATAGTTTGCTAGATAAATTCCTTACACTACCAGATTTACCAAAGAGAAAAGATGACCCTAGAA
ACAAATTTGAATTGTGCTTGGCAGCAAATGCAAATGATTTGGAGTCATTTCCAGATTTCAGTTCATTTTTATGGTGCAAATTCAGTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTACTAACTAGAGTTGGCTTCACTGCACGTTTGATTCACCATGACTCACCTTTATCACCGCTCTACAATCACACCACGAAAGACAAGGCACGGGTCAAGACACCCAC
ATTGGTTCATGAAGGTGGTGAGTACCTTATGAGTTTCTACATTGGAAATCCTCCAAGTCGAGTGCTAGGATTAGAATATGAAGATACTTCTGCAACAAGTGGAATTCTGT
CATCTGATAGTTTTAGTTTTGATACCTCAGATGGGAAACTTGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGATGCTCCTTTAACAGGAGGGTCAAAATCCTCTGTA
TATCCCAATTTAGATGCTTATCATGTGAAGGTTCTGGGAAATAGTGTCAGCAATGATGAGCTCTATTTAGATAGAGTTTCTGACGTATATGATGTCGGAGATGGATGGAT
CATAGATTCAAGAACAACATACTCAAGTCTTAGAACAGATGCATTTGATAGTTTGCTAGATAAATTCCTTACACTACCAGATTTACCAAAGAGAAAAGATGACCCTAGAA
ACAAATTTGAATTGTGCTTGGCAGCAAATGCAAATGATTTGGAGTCATTTCCAGATTTCAGTTCATTTTTATGGTGCAAATTCAGTTCTTAA
Protein sequenceShow/hide protein sequence
MVLTRVGFTARLIHHDSPLSPLYNHTTKDKARVKTPTLVHEGGEYLMSFYIGNPPSRVLGLEYEDTSATSGILSSDSFSFDTSDGKLVDVGYLNFGCSDAPLTGGSKSSV
YPNLDAYHVKVLGNSVSNDELYLDRVSDVYDVGDGWIIDSRTTYSSLRTDAFDSLLDKFLTLPDLPKRKDDPRNKFELCLAANANDLESFPDFSSFLWCKFSS