; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000378 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000378
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic
Genome locationscaffold8:47653332..47656987
RNA-Seq ExpressionSpg000378
SyntenySpg000378
Gene Ontology termsGO:0048564 - photosystem I assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0047134 - protein-disulfide reductase activity (molecular function)
InterPro domainsIPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142153.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X1 [Cucumis sativus]1.2e-6679.87Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        AN KP+VL VTSN DDE CSTGDSKTPSKPL                       K YGGT+TEAIANTMDGKP CRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPNS
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

XP_011653566.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X2 [Cucumis sativus]2.6e-6679.25Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        +N KP+VL VTSN DDE CSTGDSKTPSKPL                       K YGGT+TEAIANTMDGKP CRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPNS
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

XP_016900747.1 PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X3 [Cucumis melo]3.8e-6579.75Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        ANSKP+VL +TSN DDE CSTGDS+TPSKPL                       K+YGGT TEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPN
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN

XP_022958131.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata]2.6e-6679.25Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        +NSKP+VLQVTSN DDE  +TGDS TPSKPL                       KDYGGTM EAIANTMDGKP CRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

XP_022996228.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita maxima]4.0e-6779.87Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        +NSKP+VLQVTSN DDE CSTGD  TPSKPL                       KDYGGTM EAIANTMDGKP CRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

TrEMBL top hitse value%identityAlignment
A0A0A0KX46 Uncharacterized protein5.7e-6779.87Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        AN KP+VL VTSN DDE CSTGDSKTPSKPL                       K YGGT+TEAIANTMDGKP CRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPNS
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

A0A1S4DXP4 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X31.8e-6579.75Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        ANSKP+VL +TSN DDE CSTGDS+TPSKPL                       K+YGGT TEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPN
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN

A0A5A7TFL6 Protein EMBRYO SAC DEVELOPMENT ARREST 31.8e-6579.75Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        ANSKP+VL +TSN DDE CSTGDS+TPSKPL                       K+YGGT TEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDAR+LLDKMYNGRLLPN
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN

A0A6J1H182 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.3e-6679.25Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        +NSKP+VLQVTSN DDE  +TGDS TPSKPL                       KDYGGTM EAIANTMDGKP CRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

A0A6J1KA92 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic2.0e-6779.87Show/hide
Query:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR
        +NSKP+VLQVTSN DDE CSTGD  TPSKPL                       KDYGGTM EAIANTMDGKP CRNCGGSGAVLCDMCGGTGKWKALNR
Subjt:  ANSKPIVLQVTSNFDDERCSTGDSKTPSKPL-----------------------KDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNR

Query:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKM+NGRLLPNS
Subjt:  KRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

SwissProt top hitse value%identityAlignment
A0A1D6KL43 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic3.5e-4580.2Show/hide
Query:  GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN
        G   +A+      K VCRNC GSGAV+CDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP+A++LLDKMYNG++LP 
Subjt:  GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPN

Query:  S
        S
Subjt:  S

B5YAR4 Chaperone protein DnaJ7.0e-0631.33Show/hide
Query:  GTMTEAIANTMDGKPVCRNCG---GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNK
        G+  E     ++  P C+  G   G+  V CDMC GTG+ + + +       + T CP C+G G+++   C  C GTG    K
Subjt:  GTMTEAIANTMDGKPVCRNCG---GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNK

O64750 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic7.7e-4587.5Show/hide
Query:  ANTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        A  +D KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+KMYNGRLLP+S
Subjt:  ANTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

Q6YUA8 Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic7.0e-4687.5Show/hide
Query:  IANTMDGKP-VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        +A+ M  KP VCRNC GSGAVLCDMCGGTGKWKALNRKRAKDVY FTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDA+KLLDKMYNG++LP+S
Subjt:  IANTMDGKP-VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

Q7UA76 Chaperone protein DnaJ1.3e-0435.14Show/hide
Query:  CRNCGGSGA------VLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNKGLLR
        C  CGGSGA        C  CGG G+ +   R       +  ECPNC G G+++   C  C G G+   +  LR
Subjt:  CRNCGGSGA------VLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLV---CPVCLGTGLPNNKGLLR

Arabidopsis top hitse value%identityAlignment
AT2G34860.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein5.5e-4687.5Show/hide
Query:  ANTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        A  +D KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+KMYNGRLLP+S
Subjt:  ANTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

AT2G34860.2 DnaJ/Hsp40 cysteine-rich domain superfamily protein5.5e-4687.5Show/hide
Query:  ANTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS
        A  +D KP   CRNC GSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRP AR+LL+KMYNGRLLP+S
Subjt:  ANTMDGKP--VCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS

AT5G03870.1 Glutaredoxin family protein6.8e-0429.41Show/hide
Query:  GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVC
        G + E I     G   CR CGG   ++C +C G+ K +   +K         +C  C   G ++CP+C
Subjt:  GTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVC

AT5G06130.1 chaperone protein dnaJ-related1.4e-0435.82Show/hide
Query:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL
        K  C+ C G+G + C  C  +G   +++   R RA +    V     C NC G GK++CP CL TG+
Subjt:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL

AT5G06130.2 chaperone protein dnaJ-related1.4e-0435.82Show/hide
Query:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL
        K  C+ C G+G + C  C  +G   +++   R RA +    V     C NC G GK++CP CL TG+
Subjt:  KPVCRNCGGSGAVLCDMCGGTGKWKALN---RKRAKD----VYEFTECPNCYGRGKLVCPVCLGTGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGGCGTCTTCTTCTCATCTATCCGCCATTCCCCTGCGCCCCTCTTCCTCCTCTGCAGCTTCCTTATCCCACCGTGAGCACTTTCTCTCTCTCTCTCTCTCTCT
CTCTTCTCTAGCCTCTGCAAATTCTTTCCAGCTTGTGCCCAATTCTCCACTATCCTGCTACTTCCATTTTGTAGAGGGGTTTAATCTCCTCTGCTCAATGTTGTCGGCCG
GACGGATATGCTTTTTCTTTCTGGTGGATGAGATGATTTGCCTCGACTTCGTTTCACTTTTCTGGACTGTTTCTGTATATACTTATCCCTTCCTTATCGAGTTTTGTGCT
GTTGCTTGTCATCCCCTTCTCCAGTTGTTGTTTGGCACTATCTTATTCTTGGCTAACTCAAAGCCCATTGTACTTCAAGTGACATCAAATTTTGACGATGAAAGATGTAG
TACTGGAGATTCGAAAACACCATCAAAACCACTTAAGGATTATGGGGGCACAATGACTGAAGCTATTGCAAATACCATGGATGGAAAACCGGTGTGCCGGAATTGTGGAG
GAAGTGGCGCCGTACTTTGTGACATGTGCGGTGGTACAGGCAAATGGAAAGCTTTGAACAGAAAACGGGCTAAAGATGTCTACGAGTTTACAGAATGTCCAAATTGTTAT
GGTAGAGGAAAACTTGTATGTCCTGTTTGTCTAGGAACTGGTTTACCAAACAACAAAGGTCTTCTAAGAAGGCCCGATGCACGAAAATTGCTTGATAAGATGTACAATGG
TCGCTTGTTACCAAATTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGATGGCGTCTTCTTCTCATCTATCCGCCATTCCCCTGCGCCCCTCTTCCTCCTCTGCAGCTTCCTTATCCCACCGTGAGCACTTTCTCTCTCTCTCTCTCTCTCT
CTCTTCTCTAGCCTCTGCAAATTCTTTCCAGCTTGTGCCCAATTCTCCACTATCCTGCTACTTCCATTTTGTAGAGGGGTTTAATCTCCTCTGCTCAATGTTGTCGGCCG
GACGGATATGCTTTTTCTTTCTGGTGGATGAGATGATTTGCCTCGACTTCGTTTCACTTTTCTGGACTGTTTCTGTATATACTTATCCCTTCCTTATCGAGTTTTGTGCT
GTTGCTTGTCATCCCCTTCTCCAGTTGTTGTTTGGCACTATCTTATTCTTGGCTAACTCAAAGCCCATTGTACTTCAAGTGACATCAAATTTTGACGATGAAAGATGTAG
TACTGGAGATTCGAAAACACCATCAAAACCACTTAAGGATTATGGGGGCACAATGACTGAAGCTATTGCAAATACCATGGATGGAAAACCGGTGTGCCGGAATTGTGGAG
GAAGTGGCGCCGTACTTTGTGACATGTGCGGTGGTACAGGCAAATGGAAAGCTTTGAACAGAAAACGGGCTAAAGATGTCTACGAGTTTACAGAATGTCCAAATTGTTAT
GGTAGAGGAAAACTTGTATGTCCTGTTTGTCTAGGAACTGGTTTACCAAACAACAAAGGTCTTCTAAGAAGGCCCGATGCACGAAAATTGCTTGATAAGATGTACAATGG
TCGCTTGTTACCAAATTCTTGA
Protein sequenceShow/hide protein sequence
MMMASSSHLSAIPLRPSSSSAASLSHREHFLSLSLSLSSLASANSFQLVPNSPLSCYFHFVEGFNLLCSMLSAGRICFFFLVDEMICLDFVSLFWTVSVYTYPFLIEFCA
VACHPLLQLLFGTILFLANSKPIVLQVTSNFDDERCSTGDSKTPSKPLKDYGGTMTEAIANTMDGKPVCRNCGGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCY
GRGKLVCPVCLGTGLPNNKGLLRRPDARKLLDKMYNGRLLPNS