; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0996 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0996
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPhytocyanin domain-containing protein
Genome locationMC02:7982689..7984116
RNA-Seq ExpressionMC02g0996
SyntenyMC02g0996
Gene Ontology termsGO:0022900 - electron transport chain (biological process)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA2978307.1 CUB and sushi domain-containing 3 [Olea europaea subsp. europaea]4.11e-6375.44Show/hide
Query:  NKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNG
        +K+I+VGGSENWRFGF+Y +WA+KNGPFY+NDTLVFK+DPPN TTFPHSVYLLPNL SF  CDLR AQK+A+ TQG G+GFEFVL++ +PYYFACGERNG
Subjt:  NKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNG

Query:  FHCKVGTMKFSLTP
        FHCK G MKF++ P
Subjt:  FHCKVGTMKFSLTP

KAG4168464.1 hypothetical protein ERO13_A12G025700v2, partial [Gossypium hirsutum]1.79e-6167.16Show/hide
Query:  GSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDG
        G GW     P  PP Q    NK I VGGSENW FGF+Y  WA +NGPFY NDTLVFKYDPP++ TFPHSVYLLPNL SF  CDLR A+ +A+ TQGGGDG
Subjt:  GSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDG

Query:  FEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTP
        F FVL + +PYYFACGERNGFHCKVG M+F + P
Subjt:  FEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTP

XP_022148560.1 uncharacterized protein LOC111017194 [Momordica charantia]4.34e-115100Show/hide
Query:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECD
        PPPPQISSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECD
Subjt:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECD

Query:  LRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL
        LRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL
Subjt:  LRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL

XP_022148634.1 leucine-rich repeat extensin-like protein 3 [Momordica charantia]9.17e-6261.73Show/hide
Query:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPP---------PPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLP
        PPPP  S    +      S       PSPP         PP  SP+  KI+VGGSENW  GFDY+NWALKNGPF++ND LVFKYDPP   T PHSVYLL 
Subjt:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPP---------PPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLP

Query:  NLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL
        N+ SFS CDLR A K+ANWTQG GDGF+FVL+Q K YYFACGE NGFHCK G+MKFS+TP L
Subjt:  NLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL

XP_022853657.1 uncharacterized protein LOC111375099 [Olea europaea var. sylvestris]4.11e-6375.44Show/hide
Query:  NKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNG
        +K+I+VGGSENWRFGF+Y +WA+KNGPFY+NDTLVFK+DPPN TTFPHSVYLLPNL SF  CDLR AQK+A+ TQG G+GFEFVL++ +PYYFACGERNG
Subjt:  NKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNG

Query:  FHCKVGTMKFSLTP
        FHCK G MKF++ P
Subjt:  FHCKVGTMKFSLTP

TrEMBL top hitse value%identityAlignment
A0A061E0S0 CUB and sushi domain-containing protein 31.29e-6165.44Show/hide
Query:  GSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDG
        G G+N   + S  P   +   KKI+VGGS+NW+FG +Y +W+LKN PFY NDTLVFKYDPP++TTFPHSVYLLPNL SF  CDLR A+ +AN TQGGG+G
Subjt:  GSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDG

Query:  FEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL
        FEFVL++ +PYYFACGERNGFHCK G MKF++ P L
Subjt:  FEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL

A0A2Z7C2K8 CUB and sushi domain-containing protein 31.00e-6170.34Show/hide
Query:  ISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACG
        +S  +K+I+VGGSENWRFGF+Y +WA+KNGPFY+NDTLVF+YDPPN TTFPHSVYLLPN  SF  CDLR A+K+   T G G+GFE+VL++ +PYYFACG
Subjt:  ISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACG

Query:  ERNGFHCKVGTMKFSLTP
        ERNGFHC VG MKF+L P
Subjt:  ERNGFHCKVGTMKFSLTP

A0A6A3A5X2 Armadillo repeat-containing protein 6-like8.74e-6262.76Show/hide
Query:  SSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQK
        + D  +  +  G GW  Y     PP Q    N  I VGGS+NW FGF+Y  WA +NGPFY NDTLVFKYDPP++TTFPHSVYLLPNL SF  CDLR A+ 
Subjt:  SSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQK

Query:  VANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTP
        +AN TQGGGDGF F L + +PYYFACGERNGFHCKVG MKF + P
Subjt:  VANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTP

A0A6J1D388 uncharacterized protein LOC1110171942.10e-115100Show/hide
Query:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECD
        PPPPQISSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECD
Subjt:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECD

Query:  LRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL
        LRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL
Subjt:  LRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL

A0A6J1D4K0 leucine-rich repeat extensin-like protein 34.44e-6261.73Show/hide
Query:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPP---------PPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLP
        PPPP  S    +      S       PSPP         PP  SP+  KI+VGGSENW  GFDY+NWALKNGPF++ND LVFKYDPP   T PHSVYLL 
Subjt:  PPPPQISSDMIAAVSSAGSGWNQYLFPSPP---------PPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLP

Query:  NLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL
        N+ SFS CDLR A K+ANWTQG GDGF+FVL+Q K YYFACGE NGFHCK G+MKFS+TP L
Subjt:  NLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15770.1 Cupredoxin superfamily protein1.5e-2745.22Show/hide
Query:  KKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGF
        KKI+VGGS+ W+ G DY +WA KN PFY+ND LVFKYD   S    ++VYL  +  S+  CD++ A+K+ +  +G  + F F L++ +PY+FA GE +G 
Subjt:  KKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGF

Query:  HCKVGTMKFSLTPQL
        +C+   MKF++ P L
Subjt:  HCKVGTMKFSLTPQL

AT2G15780.1 Cupredoxin superfamily protein2.3e-3654.78Show/hide
Query:  KKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGF
        +KI+VGG + W +GF+Y +WA K  PF++ND LVFKY+PP    F HSVYLLPN  S+ +CD++  + +A+  QG G GFEFVL+Q KPYY +CGE +G 
Subjt:  KKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGF

Query:  HCKVGTMKFSLTPQL
        HC  GTMKF++ P L
Subjt:  HCKVGTMKFSLTPQL

AT2G27035.1 early nodulin-like protein 204.6e-0532.11Show/hide
Query:  GGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVG
        GG   W    ++++WA  +  FY  D L F +   N T   H++ L  N  S+ +C       + N T+GG D F+  L + KPYYF CG   G+  K  
Subjt:  GGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFHCKVG

Query:  TMKFSLTPQ
         +  ++ PQ
Subjt:  TMKFSLTPQ

AT4G33930.1 Cupredoxin superfamily protein2.2e-1534.68Show/hide
Query:  PKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTF--------PHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKP
        P  +KI V     W+ G+ Y  W  K+ PFY++D LVFKY+  + T           + VYLLP++ SF  C++   +K+         GF+ +L++ + 
Subjt:  PKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTF--------PHSVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKP

Query:  YYFACGERNGFHCKVGTMKFSLTP
        YYFA G+ N   C    MKFS+ P
Subjt:  YYFACGERNGFHCKVGTMKFSLTP

AT4G34300.1 Cupredoxin superfamily protein2.5e-1435.71Show/hide
Query:  WRFGFDYNNWALKNGPFYINDTLVFKY---DPPNSTTFPH------SVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFH
        W+ G+ Y  W  K+ PFY+ND LVF Y   D   S T  H       VYLLP++ SF  C++   +K+         GF+ +L++   YYF  G+ N  +
Subjt:  WRFGFDYNNWALKNGPFYINDTLVFKY---DPPNSTTFPH------SVYLLPNLGSFSECDLRAAQKVANWTQGGGDGFEFVLQQSKPYYFACGERNGFH

Query:  CKVGTMKFSLTP
             MKFS+ P
Subjt:  CKVGTMKFSLTP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CCTCCGCCGCCGCAAATTAGTTCCGACATGATCGCCGCCGTCTCTTCCGCCGGTAGTGGTTGGAATCAGTACCTGTTTCCGTCCCCTCCGCCGCCACAAATTAGTCCCAA
GAACAAGAAGATCATGGTCGGTGGCTCTGAGAATTGGCGTTTCGGCTTTGACTACAATAATTGGGCACTGAAAAATGGACCTTTTTACATAAACGACACCCTTGTTTTCA
AGTACGATCCTCCCAACAGCACAACATTTCCTCATAGTGTGTACTTGCTTCCAAACTTGGGGAGCTTTTCCGAGTGTGATTTGAGGGCGGCTCAAAAGGTGGCGAATTGG
ACGCAAGGAGGGGGAGATGGCTTTGAATTTGTGCTTCAACAATCCAAGCCATACTACTTTGCCTGTGGAGAACGTAACGGCTTTCATTGTAAAGTTGGAACCATGAAGTT
CTCTCTTACCCCACAACTT
mRNA sequenceShow/hide mRNA sequence
CCTCCGCCGCCGCAAATTAGTTCCGACATGATCGCCGCCGTCTCTTCCGCCGGTAGTGGTTGGAATCAGTACCTGTTTCCGTCCCCTCCGCCGCCACAAATTAGTCCCAA
GAACAAGAAGATCATGGTCGGTGGCTCTGAGAATTGGCGTTTCGGCTTTGACTACAATAATTGGGCACTGAAAAATGGACCTTTTTACATAAACGACACCCTTGTTTTCA
AGTACGATCCTCCCAACAGCACAACATTTCCTCATAGTGTGTACTTGCTTCCAAACTTGGGGAGCTTTTCCGAGTGTGATTTGAGGGCGGCTCAAAAGGTGGCGAATTGG
ACGCAAGGAGGGGGAGATGGCTTTGAATTTGTGCTTCAACAATCCAAGCCATACTACTTTGCCTGTGGAGAACGTAACGGCTTTCATTGTAAAGTTGGAACCATGAAGTT
CTCTCTTACCCCACAACTT
Protein sequenceShow/hide protein sequence
PPPPQISSDMIAAVSSAGSGWNQYLFPSPPPPQISPKNKKIMVGGSENWRFGFDYNNWALKNGPFYINDTLVFKYDPPNSTTFPHSVYLLPNLGSFSECDLRAAQKVANW
TQGGGDGFEFVLQQSKPYYFACGERNGFHCKVGTMKFSLTPQL