CuGenDBv2

Clc05G20290 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G20290
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein SPA, chloroplastic
Genome location: ClcChr05:29065933..29067422
RNA-Seq Expression: Clc05G20290
Synteny: Clc05G20290
Gene Ontology terms:
GO:0010206 - photosystem II repair (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003756 - protein disulfide isomerase activity (molecular function)
InterPro domains:
IPR035272 - Protein of unknown function DUF5351
IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily
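
The genome location listed above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say), the location can be split into its parts and the span length computed as shown here. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr05:29065933..29067422")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus covers 1490 positions on ClcChr05.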


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136388.1 protein disulfide-isomerase LQY1, chloroplastic [Cucumis sativus] | e value: 1.8e-76 | % identity: 93.38
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA SLSRLHSPFLYCPLKPT STSLSVTFS NQRSP SYPRIRA+DLDQNTVVALSVGLVSVA+GIGIPVFYETQIDN+AKRENTQPCFPCSGSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGD+KE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

XP_008466492.1 PREDICTED: protein SPA, chloroplastic [Cucumis melo] | e value: 1.6e-77 | % identity: 95.36
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA SLSRLHSPFLYCPLKPTSSTSLSVTFS NQRSP SYPRIRA+DLDQNTVVALSVGLVSVAVGIGIPVFYETQIDN+AKRENTQPCFPCSGSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

XP_022142253.1 protein SPA, chloroplastic [Momordica charantia] | e value: 6.9e-76 | % identity: 93.38
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MT A SLSRLHSPFLY PLKPTS+ SLS+TFSRNQRSPASYPRIRA+DLDQNTVVALSVGLVSVAVGIGIPVFYETQIDN+AKRENTQPCFPCSGSGAQK
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

XP_022936404.1 protein SPA, chloroplastic [Cucurbita moschata] | e value: 8.1e-77 | % identity: 93.38
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA SLSRLHSPFLYCPLKPTSS SLS+TFSRNQRSPASYPRIRA+DLDQNTVVALSVGL S+AVGIGIPVFYETQIDN+AKRENTQPCFPC+GSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

XP_038899789.1 protein SPA, chloroplastic [Benincasa hispida] | e value: 9.6e-78 | % identity: 94.7
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA S+SRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPA+YPRIRA+DLDQNTVVALSVGLVSVAVGIGIPVFYETQIDN+AKRENTQPCFPCSGSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINCDG GTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LJ55 Uncharacterized protein | e value: 8.8e-77 | % identity: 93.38
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA SLSRLHSPFLYCPLKPT STSLSVTFS NQRSP SYPRIRA+DLDQNTVVALSVGLVSVA+GIGIPVFYETQIDN+AKRENTQPCFPCSGSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGD+KE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

A0A1S3CRE4 protein SPA, chloroplastic | e value: 7.9e-78 | % identity: 95.36
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA SLSRLHSPFLYCPLKPTSSTSLSVTFS NQRSP SYPRIRA+DLDQNTVVALSVGLVSVAVGIGIPVFYETQIDN+AKRENTQPCFPCSGSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

A0A6J1CMU6 protein SPA, chloroplastic | e value: 3.3e-76 | % identity: 93.38
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MT A SLSRLHSPFLY PLKPTS+ SLS+TFSRNQRSPASYPRIRA+DLDQNTVVALSVGLVSVAVGIGIPVFYETQIDN+AKRENTQPCFPCSGSGAQK
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

A0A6J1F877 protein SPA, chloroplastic | e value: 3.9e-77 | % identity: 93.38
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA SLSRLHSPFLYCPLKPTSS SLS+TFSRNQRSPASYPRIRA+DLDQNTVVALSVGL S+AVGIGIPVFYETQIDN+AKRENTQPCFPC+GSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

A0A6J1IFD3 protein SPA, chloroplastic | e value: 3.9e-77 | % identity: 93.38
Query:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK
        MTVA SLSRLHSPFLYCPLKPTSS SLS+TFSRNQRSPASYPRIRA+DLDQNTVVALSVGL S+AVGIGIPVFYETQIDN+AKRENTQPCFPC+GSGAQ+
Subjt:  MTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

SwissProt top hits (e value, % identity, alignment)
K4BVL1 Protein SPA, chloroplastic | e value: 4.7e-59 | % identity: 73.2
Query:  MTVASSLSRLHSPFLYCPLK-PT-SSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGA
        M  A SLSR  SPF+  PLK PT SS+  +  F +  R   SYP I+AVDLDQNTV+A++VG++SVA+G+GIPVFYETQIDN+AKRENTQPCFPC+G+GA
Subjt:  MTVASSLSRLHSPFLYCPLK-PT-SSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGA

Query:  QKCRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
        QKCRFCMGTG+VTVELGG E E+SRCINCDG G LTCTTCQGSGIQPRYLDRR
Subjt:  QKCRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

Q8GSJ6 Protein disulfide-isomerase LQY1, chloroplastic | e value: 1.5e-57 | % identity: 73.15
Query:  ASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPA-SYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQKCR
        A S  RLHSPF++CP+  T S+      +RN RSP+ SYPRI+A +LD NTVVA+SVG+ SVA+GIGIPVFYETQIDN+AKRENTQPCFPC+G+GAQKCR
Subjt:  ASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPA-SYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQKCR

Query:  FCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
         C+G+GNVTVELGG EKE+S CINCDG G+LTCTTCQGSG+QPRYLDRR
Subjt:  FCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

Arabidopsis top hits (e value, % identity, alignment)
AT1G75690.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein | e value: 1.1e-58 | % identity: 73.15
Query:  ASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPA-SYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQKCR
        A S  RLHSPF++CP+  T S+      +RN RSP+ SYPRI+A +LD NTVVA+SVG+ SVA+GIGIPVFYETQIDN+AKRENTQPCFPC+G+GAQKCR
Subjt:  ASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPA-SYPRIRAVDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQKCR

Query:  FCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR
         C+G+GNVTVELGG EKE+S CINCDG G+LTCTTCQGSG+QPRYLDRR
Subjt:  FCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRR

AT2G24860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein | e value: 1.0e-05 | % identity: 29.58
Query:  CFPCSGSGAQKCRFCMGTGNVTVELGGDE------KEISRCINCDGVGTLTCTTCQGSGIQPRYLDRRYVP
        C  C+  G  +C++C GTG   +   GD          + C+ C G G+ +C+ C+G+G + ++L++  VP
Subjt:  CFPCSGSGAQKCRFCMGTGNVTVELGGDE------KEISRCINCDGVGTLTCTTCQGSGIQPRYLDRRYVP

AT4G13670.1 plastid transcriptionally active 5 | e value: 3.4e-04 | % identity: 31.68
Query:  RAVDLDQNTVVALSVGL---VSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQKCRFCMGTGNVTVELGGDE--KEISRCINCDGVGTLTCTTCQ
        R VD+ QN V  L        S  +G   PV      D S        C  C G G   C  C GTG   +E    E   E ++C  C+G+G   C  C 
Subjt:  RAVDLDQNTVVALSVGL---VSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQKCRFCMGTGNVTVELGGDE--KEISRCINCDGVGTLTCTTCQ

Query:  G
        G
Subjt:  G
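
The % identity values in the hit lists can be related to the alignment blocks. The middle line of each Query/Subjt triple appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes that convention and uses a hypothetical helper, `percent_identity`, to recompute the figure from the midline segments of one hit.

```python
def percent_identity(midline_segments):
    """Percent identity from the midline segments of one alignment.

    Assumes BLAST-style midlines: letters are identical positions,
    '+' and ' ' are non-identical positions. Segments must keep their
    original spacing, since every character counts as one column.
    """
    midline = "".join(midline_segments)
    if not midline:
        raise ValueError("empty alignment midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Second midline segment of the first GenBank hit, as it appears on the page.
segment = "CRFCMGTGNVTVELGGD+KE+SRCINCDGVGTLTCTTCQGSGIQPRYLDRR"
print(round(percent_identity([segment]), 2))
```

To reproduce a value from the hit list, pass all midline segments of that hit in order. Trailing spaces must not be stripped, because that would shorten the alignment length.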


Sequences
CDS sequence
ATGGCAGGCAGTTGGATGAGTGGTGGGTCTGTAGCAGAGCAGAGGCAGCCAATAACGGATATCCAAATCTGGAATGCGTGGACTGAGACTGACTTTCTTTTTCTTTTTGA
TTCGGAGCTCTCAGAAGCAAAGGAATCGAGTCCAAGGAATTCAGAAATGACAGTAGCATCTTCGCTTTCTCGCCTCCACTCTCCATTTCTGTATTGTCCTCTCAAGCCGA
CTTCATCTACCTCTCTATCTGTCACATTCTCTAGAAATCAACGATCGCCAGCATCATATCCACGCATCAGAGCAGTAGATCTTGACCAAAACACGGTTGTGGCACTTTCA
GTTGGGCTGGTGAGTGTTGCAGTTGGAATAGGCATTCCGGTATTCTATGAAACCCAAATAGATAATTCTGCAAAGCGTGAAAATACTCAGCCCTGCTTTCCCTGCAGTGG
TTCGGGAGCACAGAAATGCAGATTTTGCATGGGAACTGGCAATGTGACCGTAGAACTTGGTGGAGATGAAAAAGAAATCTCCCGGTGTATTAACTGTGATGGTGTTGGCA
CATTGACATGCACTACATGTCAGGGCTCTGGAATTCAACCTCGATATTTAGATCGCAGGTATGTCCCCTTTTATCCATCTTTTGCTTCATTTTTCTTCTTCTGGTACTTG
GAACTCGCTTAA
mRNA sequence
ATGGCAGGCAGTTGGATGAGTGGTGGGTCTGTAGCAGAGCAGAGGCAGCCAATAACGGATATCCAAATCTGGAATGCGTGGACTGAGACTGACTTTCTTTTTCTTTTTGA
TTCGGAGCTCTCAGAAGCAAAGGAATCGAGTCCAAGGAATTCAGAAATGACAGTAGCATCTTCGCTTTCTCGCCTCCACTCTCCATTTCTGTATTGTCCTCTCAAGCCGA
CTTCATCTACCTCTCTATCTGTCACATTCTCTAGAAATCAACGATCGCCAGCATCATATCCACGCATCAGAGCAGTAGATCTTGACCAAAACACGGTTGTGGCACTTTCA
GTTGGGCTGGTGAGTGTTGCAGTTGGAATAGGCATTCCGGTATTCTATGAAACCCAAATAGATAATTCTGCAAAGCGTGAAAATACTCAGCCCTGCTTTCCCTGCAGTGG
TTCGGGAGCACAGAAATGCAGATTTTGCATGGGAACTGGCAATGTGACCGTAGAACTTGGTGGAGATGAAAAAGAAATCTCCCGGTGTATTAACTGTGATGGTGTTGGCA
CATTGACATGCACTACATGTCAGGGCTCTGGAATTCAACCTCGATATTTAGATCGCAGGTATGTCCCCTTTTATCCATCTTTTGCTTCATTTTTCTTCTTCTGGTACTTG
GAACTCGCTTAA
Protein sequence
MAGSWMSGGSVAEQRQPITDIQIWNAWTETDFLFLFDSELSEAKESSPRNSEMTVASSLSRLHSPFLYCPLKPTSSTSLSVTFSRNQRSPASYPRIRAVDLDQNTVVALS
VGLVSVAVGIGIPVFYETQIDNSAKRENTQPCFPCSGSGAQKCRFCMGTGNVTVELGGDEKEISRCINCDGVGTLTCTTCQGSGIQPRYLDRRYVPFYPSFASFFFFWYL
ELA
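
The protein sequence can be related to the CDS by codon-wise translation. Here is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` is hypothetical and not something the database provides.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 36 nucleotides of the CDS listed above.
print(translate("ATGGCAGGCAGTTGGATGAGTGGTGGGTCTGTAGCA"))
```

Passing the full CDS, with or without its line breaks, should reproduce the listed protein sequence if the record uses the standard code.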