CuGenDBv2

CmoCh09G001080 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh09G001080
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: protein SPA, chloroplastic
Genome location: Cmo_Chr09:506922..509528
RNA-Seq Expression: CmoCh09G001080
Synteny: CmoCh09G001080
Gene Ontology terms:
GO:0010206 - photosystem II repair (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003756 - protein disulfide isomerase activity (molecular function)
InterPro domains:
IPR035272 - Protein of unknown function DUF5351
IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG7024161.1 Protein SPA, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma] (e value: 6.1e-81; % identity: 89.14)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGA--
        MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGA  
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGA--

Query:  ----------------QRCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
                        +RCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  ----------------QRCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_008466492.1 PREDICTED: protein SPA, chloroplastic [Cucumis melo] (e value: 2.8e-81; % identity: 95.54)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MTVAPSLSRLHSPFLYCPLKPTSS SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_022142253.1 protein SPA, chloroplastic [Momordica charantia] (e value: 1.0e-80; % identity: 94.9)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MT APSLSRLHSPFLY PLKPTS+ASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQ+
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_022936404.1 protein SPA, chloroplastic [Cucurbita moschata] (e value: 9.2e-85; % identity: 100)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_038899789.1 protein SPA, chloroplastic [Benincasa hispida] (e value: 1.6e-81; % identity: 94.9)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MTVAPS+SRLHSPFLYCPLKPTSS SLS+TFSRNQRSPA+YPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINCDG GTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LJ55 Uncharacterized protein (e value: 1.5e-80; % identity: 93.63)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MTVAPSLSRLHSPFLYCPLKPT S SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGL S+A+GIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A1S3CRE4 protein SPA, chloroplastic (e value: 1.3e-81; % identity: 95.54)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MTVAPSLSRLHSPFLYCPLKPTSS SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A6J1CMU6 protein SPA, chloroplastic (e value: 5.1e-81; % identity: 94.9)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MT APSLSRLHSPFLY PLKPTS+ASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQ+
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A6J1F877 protein SPA, chloroplastic (e value: 4.4e-85; % identity: 100)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A6J1IFD3 protein SPA, chloroplastic (e value: 4.4e-85; % identity: 100)
Query:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
        MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQR

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

SwissProt top hits (e value, % identity, alignment)
K4BVL1 Protein SPA, chloroplastic (e value: 7.1e-64; % identity: 74.84)
Query:  MTVAPSLSRLHSPFLYCPLK-PT-SSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGA
        M  APSLSR  SPF+  PLK PT SS+  +  F +  R   SYP I+A+DLDQNTV+A++VG+ S+A+G+GIPVFYETQIDNAAKRENTQPCFPCTG+GA
Subjt:  MTVAPSLSRLHSPFLYCPLK-PT-SSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGA

Query:  QRCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        Q+CRFCMGTG+VTVELGG E EVSRCINCDG G LTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  QRCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

Q8GSJ6 Protein disulfide-isomerase LQY1, chloroplastic (e value: 1.0e-62; % identity: 75.48)
Query:  APSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQRCR
        APS  RLHSPF++CP+  T S+      +RN RSP+ SYPRI+A +LD NTVVA+SVG+AS+A+GIGIPVFYETQIDNAAKRENTQPCFPC G+GAQ+CR
Subjt:  APSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQRCR

Query:  FCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
         C+G+GNVTVELGG EKEVS CINCDG G+LTCTTCQGSG+QPRYLDRREFKDDD
Subjt:  FCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

Arabidopsis top hits (e value, % identity, alignment)
AT1G75690.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein (e value: 7.3e-64; % identity: 75.48)
Query:  APSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQRCR
        APS  RLHSPF++CP+  T S+      +RN RSP+ SYPRI+A +LD NTVVA+SVG+AS+A+GIGIPVFYETQIDNAAKRENTQPCFPC G+GAQ+CR
Subjt:  APSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQRCR

Query:  FCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
         C+G+GNVTVELGG EKEVS CINCDG G+LTCTTCQGSG+QPRYLDRREFKDDD
Subjt:  FCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

AT2G38000.1 chaperone protein dnaJ-related (e value: 8.5e-04; % identity: 29.67)
Query:  FYETQIDNAAKRENTQPCFPCTGSG--------------------AQRCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGI
        + ETQ+      E  + C  CTG G                      +C  C G G V  + G D    + C NC+G G L C TCQ  G+
Subjt:  FYETQIDNAAKRENTQPCFPCTGSG--------------------AQRCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGI

AT4G13670.1 plastid transcriptionally active 5 (e value: 5.0e-04; % identity: 29.7)
Query:  RAIDLDQNTVVALSVGL---ASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQRCRFCMGTGNVTVELGGDE--KEVSRCINCDGVGTLTCTTCQ
        R +D+ QN V  L        S  +G   PV      D +        C  C G G   C  C GTG   +E    E   E ++C  C+G+G   C  C 
Subjt:  RAIDLDQNTVVALSVGL---ASLAVGIGIPVFYETQIDNAAKRENTQPCFPCTGSGAQRCRFCMGTGNVTVELGGDE--KEVSRCINCDGVGTLTCTTCQ

Query:  G
        G
Subjt:  G
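
The % identity values listed with each hit can be cross-checked against the alignment midline rows (the unlabeled row between Query and Subjt), where a letter marks an identical residue and a "+" or a space marks a substitution or gap. The following is a minimal sketch, assuming that midline convention and assuming each midline segment is kept at its full column width (no trailing spaces trimmed). The function name `percent_identity` and the variable `melon_midline` are illustrative and not part of CuGenDB.

```python
def percent_identity(midline_segments):
    """Percent identity from the midline rows of a BLAST-style alignment.

    Each segment is one midline row. Letters count as identical columns;
    '+' and spaces count as non-identical columns (substitutions or gaps).
    """
    columns = sum(len(seg) for seg in midline_segments)
    if columns == 0:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for seg in midline_segments for ch in seg)
    return 100.0 * identical / columns


# Midline rows copied from the XP_008466492.1 (Cucumis melo) alignment above.
melon_midline = [
    "MTVAPSLSRLHSPFLYCPLKPTSS SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR",
    "CRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD",
]

print(round(percent_identity(melon_midline), 2))
```

For this hit the midline has 150 letters over 157 columns, which agrees with the 95.54 listed for XP_008466492.1.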


Sequences
CDS sequence
ATGGTGCCTCAAGTATTTCTTCAGGAATTAGAACTGAGACCCTCTTTCTATCTCCCTGATTCGGAGCTGCTCTCAGAAGCAAAGCAATCGAATCCAAGGAATTCAGAAAT
GACAGTAGCACCTTCGCTTTCTCGCCTCCACTCCCCATTTCTGTATTGTCCTCTCAAGCCGACTTCTTCTGCCTCTCTATCCCTTACCTTCTCCAGAAATCAACGATCGC
CAGCATCATATCCACGCATCAGAGCTATAGATCTTGACCAAAATACGGTGGTGGCACTTTCAGTGGGGCTAGCGAGTTTGGCAGTTGGAATAGGCATTCCGGTCTTCTAT
GAAACCCAAATTGATAATGCTGCAAAGCGTGAAAATACTCAGCCCTGCTTTCCCTGCACTGGTTCGGGGGCACAGAGATGCAGATTTTGCATGGGAACTGGCAATGTGAC
CGTAGAACTTGGTGGAGATGAAAAGGAAGTCTCCCGATGTATTAATTGTGATGGTGTTGGCACATTGACATGCACTACATGTCAGGGCTCAGGAATTCAACCTCGATACC
TAGATCGCAGAGAATTTAAAGATGATGATTGA
mRNA sequence
ATGGTGCCTCAAGTATTTCTTCAGGAATTAGAACTGAGACCCTCTTTCTATCTCCCTGATTCGGAGCTGCTCTCAGAAGCAAAGCAATCGAATCCAAGGAATTCAGAAAT
GACAGTAGCACCTTCGCTTTCTCGCCTCCACTCCCCATTTCTGTATTGTCCTCTCAAGCCGACTTCTTCTGCCTCTCTATCCCTTACCTTCTCCAGAAATCAACGATCGC
CAGCATCATATCCACGCATCAGAGCTATAGATCTTGACCAAAATACGGTGGTGGCACTTTCAGTGGGGCTAGCGAGTTTGGCAGTTGGAATAGGCATTCCGGTCTTCTAT
GAAACCCAAATTGATAATGCTGCAAAGCGTGAAAATACTCAGCCCTGCTTTCCCTGCACTGGTTCGGGGGCACAGAGATGCAGATTTTGCATGGGAACTGGCAATGTGAC
CGTAGAACTTGGTGGAGATGAAAAGGAAGTCTCCCGATGTATTAATTGTGATGGTGTTGGCACATTGACATGCACTACATGTCAGGGCTCAGGAATTCAACCTCGATACC
TAGATCGCAGAGAATTTAAAGATGATGATTGAAGCCCAGCAGAACCAAATCCAGCAACATCCGCTGATTTTGCTTTCAGTTTGATGGCTTTAAACTTGTTAGTTCCATTG
AATCGTAGAGCTTGGTAACTTCCACCAAGCACTTCTTTTTGGTAACTGTTCGCGTGCATTCTTCTAATGTAGATCTGTGGTTAATAATACAAACAACGTTGTATGAAGAG
AGAAGAAATAACTGTACTATCCCGAGTTTATGACTTCATTCCTCTATTGACTGTCAGTAAAATGGGGGATCAAACTCAATCTGATAATCACTAGCTAGTAATGGCAGCAG
GTGCAGGTGCAGGTGAATCTGATCAATTCAATTATCAAATCAATCTTGCAATCATATCAGCCGATTGGAACTCAGAGAGTCGCTAATAAAAATCCACATAACTCCGTGGA
TTCATATCTCAGCCAAACATACAGGGGTAGAAATACTTTTACAAAGAGAGTCAAACATCCCGGTTAACATGAGGTAGTCTTACTTTTACAACACCTTTCTTCTAACCACC
TCAAACGTTTCTGGTTCCCTTCAAGTGAAATGAATGTCTCCCTTAAATTTGGTTACTCAAAGAGATCCAAAGCAGCAA
Protein sequence
MVPQVFLQELELRPSFYLPDSELLSEAKQSNPRNSEMTVAPSLSRLHSPFLYCPLKPTSSASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLASLAVGIGIPVFY
ETQIDNAAKRENTQPCFPCTGSGAQRCRFCMGTGNVTVELGGDEKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
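
The CDS and protein records can be checked against each other by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene. `translate`, `CODON_TABLE`, `cds_start` and `cds_end` are illustrative names, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGTGCCTCAAGTATTTCTTCAG"  # first 8 codons of the CDS record
cds_end = "AAAGATGATGATTGA"            # last 5 codons, including the stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MVPQVFLQ and KDDD, which match the start and end of the protein record. Passing the full CDS, with its line breaks, to `translate` would allow the whole protein sequence to be compared the same way.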