; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr005126 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr005126
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPhytocyanin domain-containing protein
Genome locationtig00003578:9196..12809
RNA-Seq ExpressionSgr005126
SyntenySgr005126
Gene Ontology termsGO:0022900 - electron transport chain (biological process)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023921.1 hypothetical protein SDJN02_14949, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-5059.03Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS---PPPP--------PPPS-------PPPP------PPSPPPPSPP--P
        MASI+S  I+L  +ACMST+SSA  GWF   N T+PF   HK P   PPPS   P PP        PPPS       PPPP      PPSPPPP  P  P
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS---PPPP--------PPPS-------PPPP------PPSPPPPSPP--P

Query:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP
        P P     Q+ RKIIVGGS++WR GFDYNDW  KNGPFYLNDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K 
Subjt:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP

Query:  YYFACGEGNGFHCRNGTMKFFVTPKVR
        YYFAC EGNGFHC  G+MKF +TP+ R
Subjt:  YYFACGEGNGFHCRNGTMKFFVTPKVR

KAG7035126.1 hypothetical protein SDJN02_01921, partial [Cucurbita argyrosperma subsp. argyrosperma]2.0e-5059.03Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS---PPPP--------PPPS-------PPPP------PPSPPPPSPP--P
        MASI+S  I+L  +ACMST+SSA  GWF   N T+PF   HK P   PPPS   P PP        PPPS       PPPP      PPSPPPP  P  P
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS---PPPP--------PPPS-------PPPP------PPSPPPPSPP--P

Query:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP
        P P     Q+ RKIIVGGS++WR GFDYNDW  KNGPFYLNDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K 
Subjt:  PLPQ----QSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKP

Query:  YYFACGEGNGFHCRNGTMKFFVTPKVR
        YYFAC EGNGFHC  G+MKF +TP+ R
Subjt:  YYFACGEGNGFHCRNGTMKFFVTPKVR

XP_022944052.1 alpha carbonic anhydrase 8-like [Cucurbita moschata]5.7e-5054.92Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSP----------PPPPP-------------PSPPPP----PPSPPPP---
        M SI S+ ++L   AC+ST+SSA++ WFW +NC+ PF   H  P PPPP SP          PP PP             PSPPPP    PP PPPP   
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSP----------PPPPP-------------PSPPPP----PPSPPPP---

Query:  -------------------SPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALP-HSVYLLSNMRSFSNCDLRRAQKLG
                           SPPPP P   SRKIIVGGSEHW  GFDYNDWALKNGPF++NDILVFKYDPP S  P HSVY L NMRSF NCDL +A+ L 
Subjt:  -------------------SPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALP-HSVYLLSNMRSFSNCDLRRAQKLG

Query:  NSTQGAGD-GFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        NSTQG+ + GFEF LK + PYYFACGE NGFHC+ G+MKF +TP
Subjt:  NSTQGAGD-GFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

XP_022947110.1 extensin-like isoform X5 [Cucurbita moschata]3.4e-5062.87Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSPPPPPPP---SPPPPPPSPPPPSPP-PPLP-QQSSRKIIVGGSEHWRFG
        MASI+S  I+L  +ACMST+SSA   WF   N T+ F   HK P  PPPPS   P PP    PP PPPS  P +PP  PLP  Q+ RKIIVGGS++WR G
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSPPPPPPP---SPPPPPPSPPPPSPP-PPLP-QQSSRKIIVGGSEHWRFG

Query:  FDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPK
        FDYNDW LKNGPFY+NDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K YYFACGEGNGFHC  G+MKF +TP+
Subjt:  FDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPK

Query:  VR
         R
Subjt:  VR

XP_023533130.1 extensin-like [Cucurbita pepo subsp. pepo]4.4e-5055.7Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS-------------------PPPP-----------------PPPSPPPPP
        MASI+S  I+L  +ACMST+SSA+ GWF   N T+PF   HK P  PPPPS                   PP P                  PPS P  P
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPS-------------------PPPP-----------------PPPSPPPPP

Query:  PSPPPPSPPPPLP----QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPT-SALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDG
         +PPPP   PPLP     Q+ RKIIVGGS++WR GFDYNDW LKNGPFYLNDILVFKYDPP  S  PH+VYLL NM+S + CD RRA+ + NSTQG+G+G
Subjt:  PSPPPPSPPPPLP----QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPT-SALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDG

Query:  FEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR
        F FVLKQ+K YYFACGEGNGFHC  G+MKF +TP+ R
Subjt:  FEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR

TrEMBL top hitse value%identityAlignment
A0A6J1D4K0 leucine-rich repeat extensin-like protein 33.6e-5075.95Show/hide
Query:  PPPPPS-----PPPPPPPSPPPPPPSPPPPSPPPP----LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSA-LPHSVYLLSNM
        PPPPP       PPPPPPSP PPPP P PPS PPP     P QS RKIIVGGSE+W  GFDY++WALKNGPF+LNDILVFKYDPPT A +PHSVYLLSNM
Subjt:  PPPPPS-----PPPPPPPSPPPPPPSPPPPSPPPP----LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSA-LPHSVYLLSNM

Query:  RSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        +SFSNCDLRRA KL N TQG GDGF+FVLKQ+K YYFACGEGNGFHC+NG+MKF VTP
Subjt:  RSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

A0A6J1FUQ3 alpha carbonic anhydrase 8-like2.8e-5054.92Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSP----------PPPPP-------------PSPPPP----PPSPPPP---
        M SI S+ ++L   AC+ST+SSA++ WFW +NC+ PF   H  P PPPP SP          PP PP             PSPPPP    PP PPPP   
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSP----------PPPPP-------------PSPPPP----PPSPPPP---

Query:  -------------------SPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALP-HSVYLLSNMRSFSNCDLRRAQKLG
                           SPPPP P   SRKIIVGGSEHW  GFDYNDWALKNGPF++NDILVFKYDPP S  P HSVY L NMRSF NCDL +A+ L 
Subjt:  -------------------SPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALP-HSVYLLSNMRSFSNCDLRRAQKLG

Query:  NSTQGAGD-GFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP
        NSTQG+ + GFEF LK + PYYFACGE NGFHC+ G+MKF +TP
Subjt:  NSTQGAGD-GFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTP

A0A6J1G5X5 extensin-like isoform X51.6e-5062.87Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSPPPPPPP---SPPPPPPSPPPPSPP-PPLP-QQSSRKIIVGGSEHWRFG
        MASI+S  I+L  +ACMST+SSA   WF   N T+ F   HK P  PPPPS   P PP    PP PPPS  P +PP  PLP  Q+ RKIIVGGS++WR G
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSPPPPPPP---SPPPPPPSPPPPSPP-PPLP-QQSSRKIIVGGSEHWRFG

Query:  FDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPK
        FDYNDW LKNGPFY+NDILVFKYDPP S+  PH+VYLL NM+S + CD RRA+ + N TQG+G+GF FVLKQ+K YYFACGEGNGFHC  G+MKF +TP+
Subjt:  FDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPK

Query:  VR
         R
Subjt:  VR

A0A6J1L0M6 extensin-like isoform X14.4e-4856.49Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHK-SPLPPP-------------PPSPPP--------PP--------PPSPPPPPPSPP--
        MASI+S  I+L  +ACMST+SSA  GWF   N T+ F   HK  P PPP             PPSPPP        PP        PPS P  P +PP  
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHK-SPLPPP-------------PPSPPP--------PP--------PPSPPPPPPSPP--

Query:  --PPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG
          PPSPP       PPLP  Q+ RKIIVGGS++WR GFDYNDW LKNGPFYLNDILVFKYD P S+  PH+VYLL NM+SF+ CD RRA+ + N TQG+G
Subjt:  --PPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG

Query:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR
        +GF FVLKQ+K YYFACGEG GFHC  G+MKF +TP+ R
Subjt:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR

A0A6J1L7S6 extensin-like isoform X34.4e-4856.49Show/hide
Query:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHK-SPLPPP-------------PPSPPP--------PP--------PPSPPPPPPSPP--
        MASI+S  I+L  +ACMST+SSA  GWF   N T+ F   HK  P PPP             PPSPPP        PP        PPS P  P +PP  
Subjt:  MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHK-SPLPPP-------------PPSPPP--------PP--------PPSPPPPPPSPP--

Query:  --PPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG
          PPSPP       PPLP  Q+ RKIIVGGS++WR GFDYNDW LKNGPFYLNDILVFKYD P S+  PH+VYLL NM+SF+ CD RRA+ + N TQG+G
Subjt:  --PPSPP-------PPLP-QQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSAL-PHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAG

Query:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR
        +GF FVLKQ+K YYFACGEG GFHC  G+MKF +TP+ R
Subjt:  DGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15770.1 Cupredoxin superfamily protein1.3e-3147.2Show/hide
Query:  LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACG
        L +++ +KIIVGGS+ W+ G DY DWA KN PFY+ND+LVFKYD  ++   ++VYL  +  S+ NCD++ A+K+G++ +G+ + F F LK+ +PY+FA G
Subjt:  LPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACG

Query:  EGNGFHCRNGTMKFFVTPKVRVEAK
        E +G +CRN  MKF + P + V  K
Subjt:  EGNGFHCRNGTMKFFVTPKVRVEAK

AT2G15780.1 Cupredoxin superfamily protein9.1e-3860.71Show/hide
Query:  RKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFH
        RKIIVGG + W +GF+Y DWA K  PF+LNDILVFKY+PP +   HSVYLL N  S+  CD+++ + + +  QGAG GFEFVLKQ KPYY +CGE +G H
Subjt:  RKIIVGGSEHWRFGFDYNDWALKNGPFYLNDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFH

Query:  CRNGTMKFFVTP
        C NGTMKF V P
Subjt:  CRNGTMKFFVTP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATATATTCACAAGCAATCCTCCTCTTGACAGTAGCCTGCATGTCCACTGTAAGCTCAGCCCATAGGGGTTGGTTTTGGGGGTATAATTGCACTTGGCCTTT
CAAACATGGGCACAAGAGTCCCCTGCCTCCGCCGCCACCATCACCGCCGCCTCCACCGCCACCGTCACCACCACCACCGCCTCCGTCACCACCACCGCCATCACCACCTC
CACCACTGCCCCAACAGAGTTCTAGAAAGATCATAGTGGGTGGTTCCGAGCATTGGCGTTTTGGCTTTGACTATAACGATTGGGCACTTAAGAATGGTCCCTTTTATTTA
AATGATATTCTCGTCTTCAAATATGATCCTCCAACCAGTGCACTTCCTCATAGTGTTTACTTGCTATCAAACATGCGAAGCTTCTCCAACTGTGATTTGAGAAGAGCTCA
AAAGCTGGGAAACTCGACACAAGGAGCTGGAGATGGGTTCGAGTTCGTGCTCAAACAACGGAAGCCTTATTACTTTGCATGTGGTGAAGGCAATGGCTTTCATTGCAGAA
ATGGAACCATGAAGTTCTTCGTCACTCCCAAAGTTCGGGTCGAAGCCAAAATGCCAGTGCTCGGAACCACCAACGACGATCTTGTTGGGACTTTGTGGCGGTGGCGGTGG
TGGCAGCAGATGGCGATAACCTGGCCAAGTGTAATTGAACCCATAATACCAACCCCTGTGGGCAGAGCATACAGCCAACATGCAGCAAAGCATGAGCAGGAGGATTGCTT
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCATATATTCACAAGCAATCCTCCTCTTGACAGTAGCCTGCATGTCCACTGTAAGCTCAGCCCATAGGGGTTGGTTTTGGGGGTATAATTGCACTTGGCCTTT
CAAACATGGGCACAAGAGTCCCCTGCCTCCGCCGCCACCATCACCGCCGCCTCCACCGCCACCGTCACCACCACCACCGCCTCCGTCACCACCACCGCCATCACCACCTC
CACCACTGCCCCAACAGAGTTCTAGAAAGATCATAGTGGGTGGTTCCGAGCATTGGCGTTTTGGCTTTGACTATAACGATTGGGCACTTAAGAATGGTCCCTTTTATTTA
AATGATATTCTCGTCTTCAAATATGATCCTCCAACCAGTGCACTTCCTCATAGTGTTTACTTGCTATCAAACATGCGAAGCTTCTCCAACTGTGATTTGAGAAGAGCTCA
AAAGCTGGGAAACTCGACACAAGGAGCTGGAGATGGGTTCGAGTTCGTGCTCAAACAACGGAAGCCTTATTACTTTGCATGTGGTGAAGGCAATGGCTTTCATTGCAGAA
ATGGAACCATGAAGTTCTTCGTCACTCCCAAAGTTCGGGTCGAAGCCAAAATGCCAGTGCTCGGAACCACCAACGACGATCTTGTTGGGACTTTGTGGCGGTGGCGGTGG
TGGCAGCAGATGGCGATAACCTGGCCAAGTGTAATTGAACCCATAATACCAACCCCTGTGGGCAGAGCATACAGCCAACATGCAGCAAAGCATGAGCAGGAGGATTGCTT
CTGA
Protein sequenceShow/hide protein sequence
MASIYSQAILLLTVACMSTVSSAHRGWFWGYNCTWPFKHGHKSPLPPPPPSPPPPPPPSPPPPPPSPPPPSPPPPLPQQSSRKIIVGGSEHWRFGFDYNDWALKNGPFYL
NDILVFKYDPPTSALPHSVYLLSNMRSFSNCDLRRAQKLGNSTQGAGDGFEFVLKQRKPYYFACGEGNGFHCRNGTMKFFVTPKVRVEAKMPVLGTTNDDLVGTLWRWRW
WQQMAITWPSVIEPIIPTPVGRAYSQHAAKHEQEDCF