; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021857 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021857
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein SPA, chloroplastic
Genome locationscaffold1:796999..798156
RNA-Seq ExpressionMS021857
SyntenyMS021857
Gene Ontology termsGO:0010206 - photosystem II repair (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003756 - protein disulfide isomerase activity (molecular function)
InterPro domainsIPR035272 - Protein of unknown function DUF5351
IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136388.1 protein disulfide-isomerase LQY1, chloroplastic [Cucumis sativus]2.0e-7492.05Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APSLSRLHSPFLY PLKPT + SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGLVSVA+GIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGD+KEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

XP_008466492.1 PREDICTED: protein SPA, chloroplastic [Cucumis melo]1.8e-7594.04Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APSLSRLHSPFLY PLKPTS+ SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

XP_022142253.1 protein SPA, chloroplastic [Momordica charantia]7.2e-80100Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

XP_022936404.1 protein SPA, chloroplastic [Cucurbita moschata]1.7e-7694.7Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APSLSRLHSPFLY PLKPTS+ASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

XP_038899789.1 protein SPA, chloroplastic [Benincasa hispida]1.1e-7593.38Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APS+SRLHSPFLY PLKPTS+ SLS+TFSRNQRSPA+YPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINC+G GTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

TrEMBL top hitse value%identityAlignment
A0A0A0LJ55 Uncharacterized protein9.8e-7592.05Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APSLSRLHSPFLY PLKPT + SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGLVSVA+GIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGD+KEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

A0A1S3CRE4 protein SPA, chloroplastic8.9e-7694.04Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APSLSRLHSPFLY PLKPTS+ SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

A0A6J1CMU6 protein SPA, chloroplastic3.5e-80100Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

A0A6J1F877 protein SPA, chloroplastic8.1e-7794.7Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APSLSRLHSPFLY PLKPTS+ASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

A0A6J1IFD3 protein SPA, chloroplastic8.1e-7794.7Show/hide
Query:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK
        MT APSLSRLHSPFLY PLKPTS+ASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGL S+AVGIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQ+
Subjt:  MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQK

Query:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        CRFCMGTGNVTVELGGDEKEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRR
Subjt:  CRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

SwissProt top hitse value%identityAlignment
K4BVL1 Protein SPA, chloroplastic1.7e-6074.51Show/hide
Query:  MTTAPSLSRLHSPFLYSPLK-PT-SAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGA
        M TAPSLSR  SPF+ SPLK PT S++  +  F +  R   SYP I+A+DLDQNTV+A++VG++SVA+G+GIPVFYETQIDNAAKRENTQPCFPC+G+GA
Subjt:  MTTAPSLSRLHSPFLYSPLK-PT-SAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGA

Query:  QKCRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        QKCRFCMGTG+VTVELGG E EVSRCINC+G G LTCTTCQGSGIQPRYLDRR
Subjt:  QKCRFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

Q8GSJ6 Protein disulfide-isomerase LQY1, chloroplastic5.2e-5773.33Show/hide
Query:  TAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQKC
        +APS  RLHSPF++ P+  T     S   +RN RSP+ SYPRI+A +LD NTVVA+SVG+ SVA+GIGIPVFYETQIDNAAKRENTQPCFPC+G+GAQKC
Subjt:  TAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQKC

Query:  RFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        R C+G+GNVTVELGG EKEVS CINC+G G+LTCTTCQGSG+QPRYLDRR
Subjt:  RFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

Arabidopsis top hitse value%identityAlignment
AT1G75690.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein3.7e-5873.33Show/hide
Query:  TAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQKC
        +APS  RLHSPF++ P+  T     S   +RN RSP+ SYPRI+A +LD NTVVA+SVG+ SVA+GIGIPVFYETQIDNAAKRENTQPCFPC+G+GAQKC
Subjt:  TAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPA-SYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQKC

Query:  RFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR
        R C+G+GNVTVELGG EKEVS CINC+G G+LTCTTCQGSG+QPRYLDRR
Subjt:  RFCMGTGNVTVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRR

AT2G24860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein4.8e-0528.36Show/hide
Query:  CFPCSGSGAQKCRFCMGTGNVTVELGGDE------KEVSRCINCEGVGTLTCTTCQGSGIQPRYLDR
        C  C+  G  +C++C GTG   +   GD          + C+ C G G+ +C+ C+G+G + ++L++
Subjt:  CFPCSGSGAQKCRFCMGTGNVTVELGGDE------KEVSRCINCEGVGTLTCTTCQGSGIQPRYLDR

AT4G13670.1 plastid transcriptionally active 54.0e-0430.69Show/hide
Query:  RAIDLDQNTVVALSVGL---VSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQKCRFCMGTGNVTVELGGDE--KEVSRCINCEGVGTLTCTTCQ
        R +D+ QN V  L        S  +G   PV      D +        C  C G G   C  C GTG   +E    E   E ++C  CEG+G   C  C 
Subjt:  RAIDLDQNTVVALSVGL---VSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQKCRFCMGTGNVTVELGGDE--KEVSRCINCEGVGTLTCTTCQ

Query:  G
        G
Subjt:  G


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACAGCACCTTCGCTTTCTCGCCTCCACTCTCCGTTTCTGTATTCTCCTCTCAAACCCACTTCAGCTGCCTCTCTATCACTCACATTTTCCAGAAATCAACGGTC
TCCAGCATCATATCCACGCATCAGAGCTATAGATCTTGACCAAAACACGGTGGTGGCACTTTCAGTGGGGCTGGTGAGTGTTGCAGTTGGAATAGGCATTCCCGTCTTCT
ATGAAACCCAAATTGATAATGCTGCGAAGCGTGAGAATACCCAGCCTTGCTTTCCCTGCAGTGGTTCGGGAGCACAGAAGTGCAGATTTTGTATGGGAACTGGCAATGTG
ACCGTAGAACTCGGTGGAGATGAAAAGGAAGTTTCCCGTTGTATTAACTGTGAAGGTGTTGGCACATTGACTTGCACTACATGTCAGGGCTCTGGAATTCAACCTCGATA
TCTAGATCGCAGGTATGGTTTTGCATCTTGC
mRNA sequenceShow/hide mRNA sequence
ATGACAACAGCACCTTCGCTTTCTCGCCTCCACTCTCCGTTTCTGTATTCTCCTCTCAAACCCACTTCAGCTGCCTCTCTATCACTCACATTTTCCAGAAATCAACGGTC
TCCAGCATCATATCCACGCATCAGAGCTATAGATCTTGACCAAAACACGGTGGTGGCACTTTCAGTGGGGCTGGTGAGTGTTGCAGTTGGAATAGGCATTCCCGTCTTCT
ATGAAACCCAAATTGATAATGCTGCGAAGCGTGAGAATACCCAGCCTTGCTTTCCCTGCAGTGGTTCGGGAGCACAGAAGTGCAGATTTTGTATGGGAACTGGCAATGTG
ACCGTAGAACTCGGTGGAGATGAAAAGGAAGTTTCCCGTTGTATTAACTGTGAAGGTGTTGGCACATTGACTTGCACTACATGTCAGGGCTCTGGAATTCAACCTCGATA
TCTAGATCGCAGGTATGGTTTTGCATCTTGC
Protein sequenceShow/hide protein sequence
MTTAPSLSRLHSPFLYSPLKPTSAASLSLTFSRNQRSPASYPRIRAIDLDQNTVVALSVGLVSVAVGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQKCRFCMGTGNV
TVELGGDEKEVSRCINCEGVGTLTCTTCQGSGIQPRYLDRRYGFASC