CuGenDBv2

Sgr021234 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021234
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Phytocyanin domain-containing protein
Genome location: tig00153653:326425..329985
RNA-Seq Expression: Sgr021234
Synteny: Sgr021234
Gene Ontology terms:
GO:0022900 - electron transport chain (biological process)
GO:0009055 - electron transfer activity (molecular function)
InterPro domains:
IPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin
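The genome location field above follows a contig:start..end pattern. As a minimal sketch, it can be split into its parts and a span length like this. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(loc):
    """Split 'contig:start..end' into (contig, start, end, length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    contig, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span
    # length is end - start + 1.
    return contig, start, end, end - start + 1


print(parse_location("tig00153653:326425..329985"))
```

Under that assumption the gene spans 3561 bp on contig tig00153653.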


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7023921.1 hypothetical protein SDJN02_14949, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.7e-51; % identity: 59.03)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPP---------PPPS-------PPPP------PPSPPPPSPP--P
        MASI+S  I+L  +ACMST+SSA  GWF   N T+PF   HK P   PPPS  LP P         PPPS       PPPP      PPSPPPP  P  P
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPP---------PPPS-------PPPP------PPSPPPPSPP--P

Query:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP
        P P     Q+ RKIIVGGS++WR GFDYNDW  KNGPFYLNDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K 
Subjt:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP

Query:  YYFACGEGNGFHCRNGTMKFFVTPKVR
        YYFAC EGNGFHC  G+MKF +TP+ R
Subjt:  YYFACGEGNGFHCRNGTMKFFVTPKVR

KAG7035126.1 hypothetical protein SDJN02_01921, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.7e-51; % identity: 59.03)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPP---------PPPS-------PPPP------PPSPPPPSPP--P
        MASI+S  I+L  +ACMST+SSA  GWF   N T+PF   HK P   PPPS  LP P         PPPS       PPPP      PPSPPPP  P  P
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPP---------PPPS-------PPPP------PPSPPPPSPP--P

Query:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP
        P P     Q+ RKIIVGGS++WR GFDYNDW  KNGPFYLNDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K 
Subjt:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP

Query:  YYFACGEGNGFHCRNGTMKFFVTPKVR
        YYFAC EGNGFHC  G+MKF +TP+ R
Subjt:  YYFACGEGNGFHCRNGTMKFFVTPKVR

XP_022148634.1 leucine-rich repeat extensin-like protein 3 [Momordica charantia] (e value: 3.8e-51; % identity: 75.95)
Query:  PPPPPS-----LPPPPPPSPPPPPPSPPPPSPPPP----LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSA-LPHSVYLLSNM
        PPPPP       PPPPPPSP PPPP P PPS PPP     P QS RKIIVGGSE+W  GFDY++WALKNGPF+LNDILVFKYDPPT A +PHSVYLLSNM
Subjt:  PPPPPS-----LPPPPPPSPPPPPPSPPPPSPPPP----LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSA-LPHSVYLLSNM

Query:  RSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        +SFSNCDLRRA KL N TQG GDGF+FVLKQ+K YYFACGEGNGFHC+NG+MKF VTP
Subjt:  RSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

XP_022947110.1 extensin-like isoform X5 [Cucurbita moschata] (e value: 2.7e-52; % identity: 63.05)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPPPPPSPPPPSPPPPLPQQSSRKIIVGGSEHWRF
        MASI+S  I+L  +ACMST+SSA   WF   N T+ F   HK P  PPPPS  LP PP    PPSPPP      PPS P P P Q+ RKIIVGGS++WR 
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPPPPPSPPPPSPPPPLPQQSSRKIIVGGSEHWRF

Query:  GFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        GFDYNDW LKNGPFY+NDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K YYFACGEGNGFHC  G+MKF +TP
Subjt:  GFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

Query:  KVR
        + R
Subjt:  KVR

XP_023533130.1 extensin-like [Cucurbita pepo subsp. pepo] (e value: 4.5e-52; % identity: 56.12)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS------------------LPPPP------------------PPSPPPPP
        MASI+S  I+L  +ACMST+SSA+ GWF   N T+PF   HK P  PPPPS                  LPP P                  PPS P  P
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS------------------LPPPP------------------PPSPPPPP

Query:  PSPPPPSPPPPLP----QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPT-SALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDG
         +PPPP   PPLP     Q+ RKIIVGGS++WR GFDYNDW LKNGPFYLNDILVFKYDPP  S  PH+VYLL NM+S + CD RRA+ + NSTQG+G+G
Subjt:  PSPPPPSPPPPLP----QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPT-SALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDG

Query:  FEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR
        F FVLKQ+K YYFACGEGNGFHC  G+MKF +TP+ R
Subjt:  FEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1D4K0 leucine-rich repeat extensin-like protein 3 (e value: 1.9e-51; % identity: 75.95)
Query:  PPPPPS-----LPPPPPPSPPPPPPSPPPPSPPPP----LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSA-LPHSVYLLSNM
        PPPPP       PPPPPPSP PPPP P PPS PPP     P QS RKIIVGGSE+W  GFDY++WALKNGPF+LNDILVFKYDPPT A +PHSVYLLSNM
Subjt:  PPPPPS-----LPPPPPPSPPPPPPSPPPPSPPPP----LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSA-LPHSVYLLSNM

Query:  RSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        +SFSNCDLRRA KL N TQG GDGF+FVLKQ+K YYFACGEGNGFHC+NG+MKF VTP
Subjt:  RSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

A0A6J1FUQ3 alpha carbonic anhydrase 8-like (e value: 2.4e-51; % identity: 54.92)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPP----------PSLPPPPP-------------PSPPPP----PPSPPPP---
        M SI S+ ++L   AC+ST+SSA++ WFW +NC+ PF   H  P PPPP           SLPP PP             PSPPPP    PP PPPP   
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPP----------PSLPPPPP-------------PSPPPP----PPSPPPP---

Query:  -------------------SPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALP-HSVYLLSNMRSFSNCDLRRAQKLG
                           SPPPP P   SRKIIVGGSEHW  GFDYNDWALKNGPF++NDILVFKYDPP S  P HSVY L NMRSF NCDL +A+ L 
Subjt:  -------------------SPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALP-HSVYLLSNMRSFSNCDLRRAQKLG

Query:  NSTQGAGD-GFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        NSTQG+ + GFEF LK + PYYFACGE NGFHC+ G+MKF +TP
Subjt:  NSTQGAGD-GFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

A0A6J1G5X5 extensin-like isoform X5 (e value: 1.3e-52; % identity: 63.05)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPPPPPSPPPPSPPPPLPQQSSRKIIVGGSEHWRF
        MASI+S  I+L  +ACMST+SSA   WF   N T+ F   HK P  PPPPS  LP PP    PPSPPP      PPS P P P Q+ RKIIVGGS++WR 
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPPPPPSPPPPSPPPPLPQQSSRKIIVGGSEHWRF

Query:  GFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        GFDYNDW LKNGPFY+NDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K YYFACGEGNGFHC  G+MKF +TP
Subjt:  GFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

Query:  KVR
        + R
Subjt:  KVR

A0A6J1L0M6 extensin-like isoform X1 (e value: 7.8e-50; % identity: 56.07)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPP----------------------------PPPS
        MASI+S  I+L  +ACMST+SSA  GWF   N T+ F   HK P  PPP S  LP PP    PPSPPP                            PPP 
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPP----------------------------PPPS

Query:  PPPPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG
          PPSPP       PPLP  Q+ RKIIVGGS++WR GFDYNDW LKNGPFYLNDILVFKYD P S+  PH+VYLL NM+SF+ CD RRA+ + N TQG+G
Subjt:  PPPPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG

Query:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR
        +GF FVLKQ+K YYFACGEG GFHC  G+MKF +TP+ R
Subjt:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR

A0A6J1L7S6 extensin-like isoform X3 (e value: 7.8e-50; % identity: 56.07)
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPP----------------------------PPPS
        MASI+S  I+L  +ACMST+SSA  GWF   N T+ F   HK P  PPP S  LP PP    PPSPPP                            PPP 
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS--LPPPP----PPSPPP----------------------------PPPS

Query:  PPPPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG
          PPSPP       PPLP  Q+ RKIIVGGS++WR GFDYNDW LKNGPFYLNDILVFKYD P S+  PH+VYLL NM+SF+ CD RRA+ + N TQG+G
Subjt:  PPPPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG

Query:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR
        +GF FVLKQ+K YYFACGEG GFHC  G+MKF +TP+ R
Subjt:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G15770.1 Cupredoxin superfamily protein (e value: 1.2e-31; % identity: 47.2)
Query:  LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACG
        L +++ +KIIVGGS+ W+ G DY DWA KN PFY+ND+LVFKYD  ++   ++VYL  +  S+ NCD++ A+K+G++ +G+ + F F LK+ +PY+FA G
Subjt:  LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACG

Query:  EGNGFHCRNGTMKFFVTPKVRVEAK
        E +G +CRN  MKF + P + V  K
Subjt:  EGNGFHCRNGTMKFFVTPKVRVEAK

AT2G15780.1 Cupredoxin superfamily protein (e value: 8.8e-38; % identity: 60.71)
Query:  RKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFH
        RKIIVGG + W +GF+Y DWA K  PF+LNDILVFKY+PP +   HSVYLL N  S+  CD+++ + + +  QGAG GFEFVLKQ KPYY +CGE +G H
Subjt:  RKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFH

Query:  CRNGTMKFFVTP
        C NGTMKF V P
Subjt:  CRNGTMKFFVTP

AT4G33930.1 Cupredoxin superfamily protein (e value: 2.5e-16; % identity: 38.89)
Query:  PQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKY---DPPTSALPH------SVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQR
        P  + RKI V     W+ G+ Y +W  K+ PFY++D+LVFKY   D   S   H       VYLL +M+SF  C++ R +KL      +  GF+ +L++ 
Subjt:  PQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKY---DPPTSALPH------SVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQR

Query:  KPYYFACGEGNGFHCRNGTMKFFVTP
        + YYFA G+ N   C N  MKF V P
Subjt:  KPYYFACGEGNGFHCRNGTMKFFVTP

AT4G34300.1 Cupredoxin superfamily protein (e value: 1.1e-14; % identity: 38.39)
Query:  WRFGFDYNDWALKNGPFYLNDILVFKY---DPPTSALPH-------SVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFH
        W+ G+ Y +W  K+ PFY+ND+LVF Y   D   S   H        VYLL +M+SF  C++ R +KL      +  GF+ +L++   YYF  G+ N   
Subjt:  WRFGFDYNDWALKNGPFYLNDILVFKY---DPPTSALPH-------SVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFH

Query:  CRNGTMKFFVTP
        C N  MKF V P
Subjt:  CRNGTMKFFVTP


Sequences
CDS sequence
ATGGCTTCCATATATTCACAAGCAATCCTCCTCTTGACAGTAGCCTGCATGTCCACTGTAAGCTCAGCCCACAGGGGTTGGTTTTGGGGGTATAATTGCACTTGGCCTTT
CAAACATGGGCACAAGAGTCCCCTGCCTCCGCCGCCACCATCACTGCCGCCCCCACCGCCACCGTCACCACCACCACCGCCTCCGTCACCACCACCGCCATCACCGCCTC
CACCACTGCCCCAACAAAGTTCTAGAAAGATCATAGTGGGTGGTTCCGAGCATTGGCGTTTTGGCTTTGACTATAACGATTGGGCACTTAAGAATGGTCCCTTTTATTTA
AACGATATTCTCGTCTTCAAATACGATCCTCCAACCAGTGCACTTCCTCATAGTGTTTACTTGCTATCAAACATGCGAAGCTTCTCCAACTGTGATTTGAGAAGAGCTCA
AAAGCTGGGAAACTCGACACAAGGAGCTGGAGATGGGTTCGAGTTCGTGCTCAAACAACGGAAGCCATATTACTTTGCATGTGGTGAAGGCAATGGCTTTCATTGCAGAA
ATGGAACCATGAAGTTCTTCGTCACTCCCAAAGTTCGGGTCGAAGCCAAAATGCCAGTGCTCGGAACCACCAACGACGATCTTGTTGGGACTTTGTGGCGGTGGCGGTGG
TGGCAGCAGATGGCGATAACCTGGCCAAGTGTAATTGAACCCATAATACCAACCCCTGTGGGCAGAGCATACAGCCAACATGCAGCATAG
mRNA sequence
ATGGCTTCCATATATTCACAAGCAATCCTCCTCTTGACAGTAGCCTGCATGTCCACTGTAAGCTCAGCCCACAGGGGTTGGTTTTGGGGGTATAATTGCACTTGGCCTTT
CAAACATGGGCACAAGAGTCCCCTGCCTCCGCCGCCACCATCACTGCCGCCCCCACCGCCACCGTCACCACCACCACCGCCTCCGTCACCACCACCGCCATCACCGCCTC
CACCACTGCCCCAACAAAGTTCTAGAAAGATCATAGTGGGTGGTTCCGAGCATTGGCGTTTTGGCTTTGACTATAACGATTGGGCACTTAAGAATGGTCCCTTTTATTTA
AACGATATTCTCGTCTTCAAATACGATCCTCCAACCAGTGCACTTCCTCATAGTGTTTACTTGCTATCAAACATGCGAAGCTTCTCCAACTGTGATTTGAGAAGAGCTCA
AAAGCTGGGAAACTCGACACAAGGAGCTGGAGATGGGTTCGAGTTCGTGCTCAAACAACGGAAGCCATATTACTTTGCATGTGGTGAAGGCAATGGCTTTCATTGCAGAA
ATGGAACCATGAAGTTCTTCGTCACTCCCAAAGTTCGGGTCGAAGCCAAAATGCCAGTGCTCGGAACCACCAACGACGATCTTGTTGGGACTTTGTGGCGGTGGCGGTGG
TGGCAGCAGATGGCGATAACCTGGCCAAGTGTAATTGAACCCATAATACCAACCCCTGTGGGCAGAGCATACAGCCAACATGCAGCATAG
Protein sequence
MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSLPPPPPPSPPPPPPSPPPPSPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYL
NDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVRVEAKMPVLGTTNDDLVGTLWRWRW
WQQMAITWPSVIEPIIPTPVGRAYSQHAA
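
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate wrapped lines
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt of the CDS; matches the start of the protein sequence.
print(translate("ATGGCTTCCATATATTCACAAGCAATC"))  # MASIYSQAI
# Last 18 nt of the CDS, ending in the TAG stop codon.
print(translate("AGCCAACATGCAGCATAG"))  # SQHAA
```

Passing the full wrapped CDS text to `translate` should reproduce the protein sequence if the two fields are consistent.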