CuGenDBv2

CmoCh04G031580 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G031580
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic
Genome location: Cmo_Chr04:21851746..21852481
RNA-Seq Expression: CmoCh04G031580
Synteny: CmoCh04G031580
Gene Ontology terms: NA
InterPro domains: IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily
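
The genome location in this record has the form chromosome:start..end. A minimal sketch of splitting it into its parts follows. It assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length is end - start + 1.
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("Cmo_Chr04:21851746..21852481")
print(chrom, start, end, length)
```

Under that assumption the gene spans 736 bp on Cmo_Chr04.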


Homology
GenBank top hits (hit | e-value | % identity | alignment)
KAG7033428.1 hypothetical protein SDJN02_07484 [Cucurbita argyrosperma subsp. argyrosperma] | 2.6e-63 | 99.12
Query:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
        MAWCGSSVSKSNNST PFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GKGGLTPEQRGER
        GKGGLTPEQRGER
Subjt:  GKGGLTPEQRGER

XP_022152486.1 uncharacterized protein LOC111020203 [Momordica charantia] | 1.2e-39 | 73.21
Query:  SSVSKSNNSTSPFPF----PPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCG
        S  S + N+  PFPF    PPH   I P NK + + SNI F AA  FSLPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCP+CG
Subjt:  SSVSKSNNSTSPFPF----PPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCG

Query:  KGGLTPEQRGER
         GGLTPEQRGER
Subjt:  KGGLTPEQRGER

XP_022963874.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata] | 5.2e-64 | 100
Query:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
        MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GKGGLTPEQRGER
        GKGGLTPEQRGER
Subjt:  GKGGLTPEQRGER

XP_022990183.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita maxima] | 3.2e-61 | 97.35
Query:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
        MAWCGSSVSKSNNST PFPFPPHGFAILPQNK RSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDC GCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GKGGLTPEQRGER
        GKGGLTPEQRGER
Subjt:  GKGGLTPEQRGER

XP_023521952.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like [Cucurbita pepo subsp. pepo] | 3.7e-62 | 98.23
Query:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
        MAWCGSSVSKSNNST PFPFPPHGFAILPQNK RSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GKGGLTPEQRGER
        GKGGLTPEQRGER
Subjt:  GKGGLTPEQRGER

TrEMBL top hits (hit | e-value | % identity | alignment)
A0A0A0LIM5 Uncharacterized protein | 3.2e-35 | 68.81
Query:  VSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNI--PFAAAPKFS-LPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGG
        ++ ++ S + FPFP +    + ++K +S P++I  PFAA P  + LPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCP CGKGG
Subjt:  VSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNI--PFAAAPKFS-LPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGG

Query:  LTPEQRGER
        LTPEQRGER
Subjt:  LTPEQRGER

A0A1S3C9W1 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic | 1.1e-35 | 67.89
Query:  SVSKSNNSTSPFPFPPHGFAILPQNKCRSKPS-NIPFAAAPKFS-LPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGG
        + + ++ S + FPFP +    + ++K +S P+  +PFAA P  + LPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCP CGKGG
Subjt:  SVSKSNNSTSPFPFPPHGFAILPQNKCRSKPS-NIPFAAAPKFS-LPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGG

Query:  LTPEQRGER
        LTPEQRGER
Subjt:  LTPEQRGER

A0A6J1DHW0 uncharacterized protein LOC111020203 | 5.7e-40 | 73.21
Query:  SSVSKSNNSTSPFPF----PPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCG
        S  S + N+  PFPF    PPH   I P NK + + SNI F AA  FSLPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCP+CG
Subjt:  SSVSKSNNSTSPFPF----PPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCG

Query:  KGGLTPEQRGER
         GGLTPEQRGER
Subjt:  KGGLTPEQRGER

A0A6J1HLG5 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic | 2.5e-64 | 100
Query:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
        MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GKGGLTPEQRGER
        GKGGLTPEQRGER
Subjt:  GKGGLTPEQRGER

A0A6J1JSI3 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic | 1.5e-61 | 97.35
Query:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
        MAWCGSSVSKSNNST PFPFPPHGFAILPQNK RSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDC GCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GKGGLTPEQRGER
        GKGGLTPEQRGER
Subjt:  GKGGLTPEQRGER

SwissProt top hits (hit | e-value | % identity | alignment)
No hits found
Arabidopsis top hits (hit | e-value | % identity | alignment)
AT1G22630.1 unknown protein | 1.6e-31 | 83.08
Query:  SLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQRGER
        S+ K C+TCG KGAI+CPGCKGTG+NKKNGN+FERWKCF+CQGFG+KSCP CGKGGLTPEQRGER
Subjt:  SLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQRGER
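
The % identity values listed for each hit can be checked against the alignment midline. The sketch below assumes the usual BLAST convention: a letter in the midline marks an identical column, `+` marks a conservative substitution, and a space marks a mismatch or gap. It also assumes the midline is copied without losing any spaces. `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline):
    """Percent identity from a BLAST-style midline.

    Letters count as identical columns; '+' and spaces do not.
    The midline length is taken as the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the AT1G22630.1 alignment, copied from this record (65 columns).
midline = "S+ K C+TCG KGAI+CPGCKGTG+NKKNGN+FERWKCF+CQGFG+KSCP CGKGGLTPEQRGER"
print(len(midline), round(percent_identity(midline), 2))
```

For the AT1G22630.1 hit this gives 54 identical columns out of 65, which rounds to the 83.08 listed for that hit.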


Sequences
CDS sequence
ATGGCGTGGTGTGGCAGCTCTGTCTCAAAGAGCAACAACAGCACCTCCCCGTTCCCATTCCCACCCCATGGATTCGCCATACTGCCACAAAATAAATGTAGAAGTAAACC
ATCCAACATCCCCTTCGCTGCAGCGCCAAAATTTTCGCTGCCCAAAAGGTGCCAAACGTGTGGAGGTAAAGGGGCTATCGATTGTCCCGGATGCAAGGGGACTGGGAGAA
ACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTCGAGTGCCAAGGATTTGGATTAAAGAGCTGCCCTGACTGTGGAAAAGGAGGACTCACCCCCGAACAAAGG
GGAGAAAGATAA
mRNA sequence
GCCCAAAGAAAGGAAGATGGCGTGGTGTGGCAGCTCTGTCTCAAAGAGCAACAACAGCACCTCCCCGTTCCCATTCCCACCCCATGGATTCGCCATACTGCCACAAAATA
AATGTAGAAGTAAACCATCCAACATCCCCTTCGCTGCAGCGCCAAAATTTTCGCTGCCCAAAAGGTGCCAAACGTGTGGAGGTAAAGGGGCTATCGATTGTCCCGGATGC
AAGGGGACTGGGAGAAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTCGAGTGCCAAGGATTTGGATTAAAGAGCTGCCCTGACTGTGGAAAAGGAGGACT
CACCCCCGAACAAAGGGGAGAAAGATAATGCACTCGCACCAGTTTATATTTATCTACTTCAACTTGTCTATTTTCTAATGTGATGCATTTGCATTGTATCATATATATAT
ATATATATGGTTTCCAATTCTAAGACTTCACTACATTTTGTTCTAAAAAGTGATAAAAGTAAATTATGGTTCTTTG
Protein sequence
MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQR
GER
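
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used. `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein sequences copied from this record.
CDS = (
    "ATGGCGTGGTGTGGCAGCTCTGTCTCAAAGAGCAACAACAGCACCTCCCCGTTCCCATTCCCACCCCATGGATTCGCCATACTGCCACAAAATAAATGTAGAAGTAAACC"
    "ATCCAACATCCCCTTCGCTGCAGCGCCAAAATTTTCGCTGCCCAAAAGGTGCCAAACGTGTGGAGGTAAAGGGGCTATCGATTGTCCCGGATGCAAGGGGACTGGGAGAA"
    "ACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTCGAGTGCCAAGGATTTGGATTAAAGAGCTGCCCTGACTGTGGAAAAGGAGGACTCACCCCCGAACAAAGG"
    "GGAGAAAGATAA"
)
PROTEIN = (
    "MAWCGSSVSKSNNSTSPFPFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGGKGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQR"
    "GER"
)

print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The 342 nt CDS encodes 113 codons followed by a TAA stop codon, which matches the 113-residue protein sequence listed here.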