; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC09G180680 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC09G180680
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionprotein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic
Genome locationCicolChr09:32091528..32096425
RNA-Seq ExpressionCcUC09G180680
SyntenyCcUC09G180680
Gene Ontology termsNA
InterPro domainsIPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004153120.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucumis sativus]5.7e-3469.9Show/hide
Query:  SSASNNIFPYQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQR
        +S S NIFP+   +       +     I+ FAA PT++LLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQR
Subjt:  SSASNNIFPYQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQR

Query:  GER
        GER
Subjt:  GER

XP_008459263.1 PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic [Cucumis melo]3.3e-3470.75Show/hide
Query:  SSASNNIFP---YQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTP
        +S S NIFP   YQ+++     +  IT    + FAA PT++LLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTP
Subjt:  SSASNNIFP---YQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTP

Query:  EQRGER
        EQRGER
Subjt:  EQRGER

XP_022152486.1 uncharacterized protein LOC111020203 [Momordica charantia]3.7e-3366.67Show/hide
Query:  SSASNNIFPYQLLKWPPYYNTTIT------KSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL
        S   N IFP+     PP +   I+      + S ISF A     LPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCPECG GGL
Subjt:  SSASNNIFPYQLLKWPPYYNTTIT------KSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL

Query:  TPEQRGER
        TPEQRGER
Subjt:  TPEQRGER

XP_023521952.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like [Cucurbita pepo subsp. pepo]3.7e-3363.16Show/hide
Query:  WKRGMGSSASNNIFPYQLLKWPPY------YNTTITKSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE
        W     S ++N+ FP+    +PP+       N + +K S I FAA     LPK C+ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+
Subjt:  WKRGMGSSASNNIFPYQLLKWPPY------YNTTITKSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE

Query:  CGKGGLTPEQRGER
        CGKGGLTPEQRGER
Subjt:  CGKGGLTPEQRGER

XP_038901776.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X1 [Benincasa hispida]2.2e-3877.06Show/hide
Query:  MGSSASNNIF---PYQLLKWPPYYNTTITKSSIISFAAPTA--SLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGG
        M  ++SNNIF   PY+L++  P Y T+ TKSS IS AA  A  SLLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGG
Subjt:  MGSSASNNIF---PYQLLKWPPYYNTTITKSSIISFAAPTA--SLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGG

Query:  LTPEQRGER
        LTPEQRGER
Subjt:  LTPEQRGER

TrEMBL top hitse value%identityAlignment
A0A0A0LIM5 Uncharacterized protein2.7e-3469.9Show/hide
Query:  SSASNNIFPYQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQR
        +S S NIFP+   +       +     I+ FAA PT++LLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQR
Subjt:  SSASNNIFPYQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQR

Query:  GER
        GER
Subjt:  GER

A0A1S3C9W1 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic1.6e-3470.75Show/hide
Query:  SSASNNIFP---YQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTP
        +S S NIFP   YQ+++     +  IT    + FAA PT++LLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTP
Subjt:  SSASNNIFP---YQLLKWPPYYNTTITKSSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTP

Query:  EQRGER
        EQRGER
Subjt:  EQRGER

A0A6J1DHW0 uncharacterized protein LOC1110202031.8e-3366.67Show/hide
Query:  SSASNNIFPYQLLKWPPYYNTTIT------KSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL
        S   N IFP+     PP +   I+      + S ISF A     LPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCPECG GGL
Subjt:  SSASNNIFPYQLLKWPPYYNTTIT------KSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL

Query:  TPEQRGER
        TPEQRGER
Subjt:  TPEQRGER

A0A6J1HLG5 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic2.6e-3262.28Show/hide
Query:  WKRGMGSSASNNIFPYQLLKWPPY------YNTTITKSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE
        W     S ++N+  P+    +PP+       N   +K S I FAA     LPK C+ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+
Subjt:  WKRGMGSSASNNIFPYQLLKWPPY------YNTTITKSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE

Query:  CGKGGLTPEQRGER
        CGKGGLTPEQRGER
Subjt:  CGKGGLTPEQRGER

A0A6J1JSI3 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.5e-3262.28Show/hide
Query:  WKRGMGSSASNNIFPYQLLKWPPY------YNTTITKSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE
        W     S ++N+ FP+    +PP+       N + +K S I FAA     LPK C+ CGGKGAIDC GCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+
Subjt:  WKRGMGSSASNNIFPYQLLKWPPY------YNTTITKSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE

Query:  CGKGGLTPEQRGER
        CGKGGLTPEQRGER
Subjt:  CGKGGLTPEQRGER

SwissProt top hitse value%identityAlignment
O64750 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic7.5e-0542.19Show/hide
Query:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG
        SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P  +G
Subjt:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG

Arabidopsis top hitse value%identityAlignment
AT1G22630.1 unknown protein1.6e-3188.71Show/hide
Query:  KSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER
        KSC  CG KGAI+CPGCKGTGKNKKNGN+FERWKCFDCQGFG+KSCP+CGKGGLTPEQRGER
Subjt:  KSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER

AT2G34860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein5.3e-0642.19Show/hide
Query:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG
        SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P  +G
Subjt:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG

AT2G34860.2 DnaJ/Hsp40 cysteine-rich domain superfamily protein5.3e-0642.19Show/hide
Query:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG
        SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P  +G
Subjt:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTAAGAAGAAAGCCCTAATTGAGAGAAAACCTGCAACGTGGAGAGCTGGGTCTCCAAGGACTCGAGGGCATCGCAGATATTCAAAAGTCGTTCCACTTCTT
CATCGTCCTCCGTATCGCCGACCAGAGTTTGGGAGCCAGAACCGCCGTCGTTGGCGGCCCCATTTCCCGTTCCGGCAGATGGGTCTTGCCCGAGTGAGGAGAGAA
TGGGGTGAATTGATCAAAATAGCAACACCAAACCCTCAAAAGATTAATTGGAAAAGAGGGATGGGTAGCTCTGCCTCTAACAACATCTTCCCATATCAACTCCTC
AAATGGCCACCATATTATAATACAACTATAACTAAATCATCCATCATTTCTTTTGCTGCACCCACAGCCTCTTTGCTGCCCAAAAGTTGCCGCAACTGTGGAGGT
AAAGGAGCCATTGATTGTCCTGGATGTAAGGGCACAGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTTGATTGTCAAGGATTTGGATTA
AAGAGTTGTCCGGAATGTGGAAAAGGAGGACTCACCCCCGAACAAAGGGGAGAAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTAAGAAGAAAGCCCTAATTGAGAGAAAACCTGCAACGTGGAGAGCTGGGTCTCCAAGGACTCGAGGGCATCGCAGATATTCAAAAGTCGTTCCACTTCTT
CATCGTCCTCCGTATCGCCGACCAGAGTTTGGGAGCCAGAACCGCCGTCGTTGGCGGCCCCATTTCCCGTTCCGGCAGATGGGTCTTGCCCGAGTGAGGAGAGAA
TGGGGTGAATTGATCAAAATAGCAACACCAAACCCTCAAAAGATTAATTGGAAAAGAGGGATGGGTAGCTCTGCCTCTAACAACATCTTCCCATATCAACTCCTC
AAATGGCCACCATATTATAATACAACTATAACTAAATCATCCATCATTTCTTTTGCTGCACCCACAGCCTCTTTGCTGCCCAAAAGTTGCCGCAACTGTGGAGGT
AAAGGAGCCATTGATTGTCCTGGATGTAAGGGCACAGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTTGATTGTCAAGGATTTGGATTA
AAGAGTTGTCCGGAATGTGGAAAAGGAGGACTCACCCCCGAACAAAGGGGAGAAAGATAATGCATACTTTCCCCAATGCAATATATTTTGTATTTATATCTCCTG
ATTTTTCTAATTTAAGGTTTTATTATGTTTTTTTCTTTTTTTTTTTTTTTTTAGAATTTTAACAAACATTATTAAGAATGGCGAGATTAGACATCTTATTCAAAT
TTTTATGCGTGCTAAAATAAATTATTTTAATT
Protein sequenceShow/hide protein sequence
MCKKKALIERKPATWRAGSPRTRGHRRYSKVVPLLHRPPYRRPEFGSQNRRRWRPHFPFRQMGLARVRREWGELIKIATPNPQKINWKRGMGSSASNNIFPYQLL
KWPPYYNTTITKSSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER