; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G19590 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G19590
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic
Genome locationClcChr09:33077813..33082016
RNA-Seq ExpressionClc09G19590
SyntenyClc09G19590
Gene Ontology termsNA
InterPro domainsIPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033428.1 hypothetical protein SDJN02_07484 [Cucurbita argyrosperma subsp. argyrosperma]5.7e-3464.91Show/hide
Query:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE
        W     S ++N  FPF    +PP+       N   +KPS I FAA     LPK C+ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+
Subjt:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE

Query:  CGKGGLTPEQRGER
        CGKGGLTPEQRGER
Subjt:  CGKGGLTPEQRGER

XP_008459263.1 PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic [Cucumis melo]7.4e-3471.03Show/hide
Query:  SSASNKIFPFQVLKWPPYY---NTTITKPSI-ISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLT
        +S S  IFPF     P Y    + T + P+I + FAA PT++LLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLT
Subjt:  SSASNKIFPFQVLKWPPYY---NTTITKPSI-ISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLT

Query:  PEQRGER
        PEQRGER
Subjt:  PEQRGER

XP_022152486.1 uncharacterized protein LOC111020203 [Momordica charantia]5.7e-3468.52Show/hide
Query:  SSASNKIFPFQVLKWPPYYNTTITKP------SIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL
        S   N IFPF     PP +   I+ P      S ISF A     LPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCPECG GGL
Subjt:  SSASNKIFPFQVLKWPPYYNTTITKP------SIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL

Query:  TPEQRGER
        TPEQRGER
Subjt:  TPEQRGER

XP_023521952.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like [Cucurbita pepo subsp. pepo]3.3e-3464.91Show/hide
Query:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE
        W     S ++N  FPF    +PP+       N + +KPS I FAA     LPK C+ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+
Subjt:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE

Query:  CGKGGLTPEQRGER
        CGKGGLTPEQRGER
Subjt:  CGKGGLTPEQRGER

XP_038901776.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X1 [Benincasa hispida]2.5e-3775.45Show/hide
Query:  MGSSASNKIFPFQ----VLKWPPYYNTTITKPSIISFAAPTA--SLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKG
        M  ++SN IFPF     V   PPY  T+ TK S IS AA  A  SLLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKG
Subjt:  MGSSASNKIFPFQ----VLKWPPYYNTTITKPSIISFAAPTA--SLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKG

Query:  GLTPEQRGER
        GLTPEQRGER
Subjt:  GLTPEQRGER

TrEMBL top hitse value%identityAlignment
A0A0A0LIM5 Uncharacterized protein6.1e-3469.9Show/hide
Query:  SSASNKIFPFQVLKWPPYYNTTITKPSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQR
        +S S  IFPF   +       +     I+ FAA PT++LLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQR
Subjt:  SSASNKIFPFQVLKWPPYYNTTITKPSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQR

Query:  GER
        GER
Subjt:  GER

A0A1S3C9W1 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic3.6e-3471.03Show/hide
Query:  SSASNKIFPFQVLKWPPYY---NTTITKPSI-ISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLT
        +S S  IFPF     P Y    + T + P+I + FAA PT++LLPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLT
Subjt:  SSASNKIFPFQVLKWPPYY---NTTITKPSI-ISFAA-PTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLT

Query:  PEQRGER
        PEQRGER
Subjt:  PEQRGER

A0A6J1DHW0 uncharacterized protein LOC1110202032.7e-3468.52Show/hide
Query:  SSASNKIFPFQVLKWPPYYNTTITKP------SIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL
        S   N IFPF     PP +   I+ P      S ISF A     LPK C+ CGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCPECG GGL
Subjt:  SSASNKIFPFQVLKWPPYYNTTITKP------SIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGL

Query:  TPEQRGER
        TPEQRGER
Subjt:  TPEQRGER

A0A6J1HLG5 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic2.3e-3364.04Show/hide
Query:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE
        W     S ++N   PF    +PP+       N   +KPS I FAA     LPK C+ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+
Subjt:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE

Query:  CGKGGLTPEQRGER
        CGKGGLTPEQRGER
Subjt:  CGKGGLTPEQRGER

A0A6J1JSI3 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.4e-3364.04Show/hide
Query:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE
        W     S ++N  FPF    +PP+       N + +KPS I FAA     LPK C+ CGGKGAIDC GCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+
Subjt:  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE

Query:  CGKGGLTPEQRGER
        CGKGGLTPEQRGER
Subjt:  CGKGGLTPEQRGER

SwissProt top hitse value%identityAlignment
O64750 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic5.7e-0542.19Show/hide
Query:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG
        SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P  +G
Subjt:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG

Arabidopsis top hitse value%identityAlignment
AT1G22630.1 unknown protein1.3e-3188.71Show/hide
Query:  KSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER
        KSC  CG KGAI+CPGCKGTGKNKKNGN+FERWKCFDCQGFG+KSCP+CGKGGLTPEQRGER
Subjt:  KSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER

AT2G34860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein4.1e-0642.19Show/hide
Query:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG
        SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P  +G
Subjt:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG

AT2G34860.2 DnaJ/Hsp40 cysteine-rich domain superfamily protein4.1e-0642.19Show/hide
Query:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG
        SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P  +G
Subjt:  SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTAAGAAGAAAGCCCTAATTGAGAGAAAACCTGCAACGAGGAGAGCTGGGTCTCCAAGGACTCGAGGGCATCGCAGATATTCAAAAGTCGTTCCACTTCTTCATCG
TCCTCCGTATCGCCGACCAGAGTTTGGGAGCCAGAACCGCGGTCGTTGGCGGCCCCATTTCCCGTTTCGGCAGATGGGTCTTGCCCGAGTGAGGAGAGAATGGGATGAAT
TGATGAAAATAGGAACATCAAACCCTGAAAAGAATAATTGGAAAAGAGGGATGGGTAGCTCTGCCTCTAACAAAATCTTCCCATTTCAAGTCCTCAAATGGCCACCATAT
TATAATACAACTATAACTAAACCATCCATCATTTCTTTTGCTGCACCCACAGCCTCTTTGCTGCCCAAAAGTTGCCGCAACTGTGGAGGTAAAGGAGCCATTGATTGTCC
TGGATGTAAGGGCACGGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTTGATTGTCAAGGATTTGGATTAAAGAGTTGTCCGGAATGTGGAAAAG
GAGGACTCACCCCCGAACAAAGGGGAGAAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTAAGAAGAAAGCCCTAATTGAGAGAAAACCTGCAACGAGGAGAGCTGGGTCTCCAAGGACTCGAGGGCATCGCAGATATTCAAAAGTCGTTCCACTTCTTCATCG
TCCTCCGTATCGCCGACCAGAGTTTGGGAGCCAGAACCGCGGTCGTTGGCGGCCCCATTTCCCGTTTCGGCAGATGGGTCTTGCCCGAGTGAGGAGAGAATGGGATGAAT
TGATGAAAATAGGAACATCAAACCCTGAAAAGAATAATTGGAAAAGAGGGATGGGTAGCTCTGCCTCTAACAAAATCTTCCCATTTCAAGTCCTCAAATGGCCACCATAT
TATAATACAACTATAACTAAACCATCCATCATTTCTTTTGCTGCACCCACAGCCTCTTTGCTGCCCAAAAGTTGCCGCAACTGTGGAGGTAAAGGAGCCATTGATTGTCC
TGGATGTAAGGGCACGGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTTGATTGTCAAGGATTTGGATTAAAGAGTTGTCCGGAATGTGGAAAAG
GAGGACTCACCCCCGAACAAAGGGGAGAAAGATAATGCATATTTTCCCCAATGCAATATATATTTTGTATTTATATGTCCTGATTTTTCTAATTTAAGGTTTTATTATGT
TTTTTTTTTCTTTCTTTTTAGAATTTTAACAAACATTATTAAGAATGGTGAAATTAAACGTCTTATTCAAATTTTTATGCGTGCTAAAATAAATTATTCGAATT
Protein sequenceShow/hide protein sequence
MCKKKALIERKPATRRAGSPRTRGHRRYSKVVPLLHRPPYRRPEFGSQNRGRWRPHFPFRQMGLARVRREWDELMKIGTSNPEKNNWKRGMGSSASNKIFPFQVLKWPPY
YNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER