; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G11940 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G11940
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionprotein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic
Genome locationChr2:12237949..12239169
RNA-Seq ExpressionCSPI02G11940
SyntenyCSPI02G11940
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004153120.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucumis sativus]2.0e-57100Show/hide
Query:  MAATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTP
        MAATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTP
Subjt:  MAATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTP

Query:  EQRGER
        EQRGER
Subjt:  EQRGER

XP_008459263.1 PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic [Cucumis melo]9.7e-5295.19Show/hide
Query:  ATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQ
        +TSTSINIFPFPSYQI RSK KSSPN I VPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQ
Subjt:  ATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQ

Query:  RGER
        RGER
Subjt:  RGER

XP_022963874.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata]2.2e-3568.81Show/hide
Query:  MAATSTSINIFPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGG
        ++ ++ S + FPFP +    + ++K +S P++I  PFAA P  + LPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCP CGKGG
Subjt:  MAATSTSINIFPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGG

Query:  LTPEQRGER
        LTPEQRGER
Subjt:  LTPEQRGER

XP_023521952.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like [Cucurbita pepo subsp. pepo]3.7e-3574.75Show/hide
Query:  FPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER
        FPFP +    + ++K +S P++I  PFAA P  + LPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCP CGKGGLTPEQRGER
Subjt:  FPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER

XP_038901776.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X1 [Benincasa hispida]3.9e-3771.82Show/hide
Query:  ATSTSINIFPFPSYQIFR------SKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKG
        A ++S NIFPF  Y++ R      SK KSS   +    AA    +LLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKG
Subjt:  ATSTSINIFPFPSYQIFR------SKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKG

Query:  GLTPEQRGER
        GLTPEQRGER
Subjt:  GLTPEQRGER

TrEMBL top hitse value%identityAlignment
A0A0A0LIM5 Uncharacterized protein9.7e-58100Show/hide
Query:  MAATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTP
        MAATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTP
Subjt:  MAATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTP

Query:  EQRGER
        EQRGER
Subjt:  EQRGER

A0A1S3C9W1 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic4.7e-5295.19Show/hide
Query:  ATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQ
        +TSTSINIFPFPSYQI RSK KSSPN I VPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQ
Subjt:  ATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQ

Query:  RGER
        RGER
Subjt:  RGER

A0A6J1DHW0 uncharacterized protein LOC1110202032.2e-3366.07Show/hide
Query:  AATSTSIN-IFPF------PSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCG
        + +S +IN IFPF      P +++  S          + F A  T + LPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCP+CG
Subjt:  AATSTSIN-IFPF------PSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCG

Query:  KGGLTPEQRGER
         GGLTPEQRGER
Subjt:  KGGLTPEQRGER

A0A6J1HLG5 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.0e-3568.81Show/hide
Query:  MAATSTSINIFPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGG
        ++ ++ S + FPFP +    + ++K +S P++I  PFAA P  + LPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCP CGKGG
Subjt:  MAATSTSINIFPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGG

Query:  LTPEQRGER
        LTPEQRGER
Subjt:  LTPEQRGER

A0A6J1JSI3 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.5e-3473.74Show/hide
Query:  FPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER
        FPFP +    + ++K +S P++I  PFAA P  + LPKRCQ CGGKGAIDC GCKGTG+NKKNGNIFERWKCFECQGFGLKSCP CGKGGLTPEQRGER
Subjt:  FPFPSYQ---IFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22630.1 unknown protein2.6e-3185.48Show/hide
Query:  KRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER
        K C+ CG KGAI+CPGCKGTGKNKKNGN+FERWKCF+CQGFG+KSCP+CGKGGLTPEQRGER
Subjt:  KRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTACCTCTACCTCCATCAACATCTTCCCATTTCCCTCCTATCAAATTTTTAGAAGTAAAAGAAAATCATCACCCAATGACATTATTGTTCCTTTTGCTGCACT
GCCAACATCTAATTTGCTGCCCAAAAGATGCCAGAAGTGTGGAGGTAAAGGGGCCATTGATTGTCCTGGATGTAAGGGAACGGGAAAGAACAAGAAAAACGGAAACATCT
TCGAGCGTTGGAAATGTTTTGAGTGTCAAGGATTTGGATTGAAGAGTTGTCCTCAATGTGGAAAAGGAGGCCTCACTCCAGAGCAAAGGGGAGAAAGATAA
mRNA sequenceShow/hide mRNA sequence
GTAAAAGAGAAAAAAAAGTAGTGGTTCATCATAGAGCTAAGGAAAGAAGTGTGGATAATGTGGAGAGATCAGTTGTGAGAGATAATAGATCAGAAAAGAAAAGTATTAGC
AAAGACCCAAAGGCACACACAAAAGCTGATTAATTGAAAATGGCTGCTACCTCTACCTCCATCAACATCTTCCCATTTCCCTCCTATCAAATTTTTAGAAGTAAAAGAAA
ATCATCACCCAATGACATTATTGTTCCTTTTGCTGCACTGCCAACATCTAATTTGCTGCCCAAAAGATGCCAGAAGTGTGGAGGTAAAGGGGCCATTGATTGTCCTGGAT
GTAAGGGAACGGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTTGAGTGTCAAGGATTTGGATTGAAGAGTTGTCCTCAATGTGGAAAAGGAGGC
CTCACTCCAGAGCAAAGGGGAGAAAGATAAATGCATACAACACAACAGCTCCTAAATAAACCCTTTATATATCTATACATATACTTTCAATTTGATTTTTCTATTTTTCC
TATTGTAATAATGTGTTAATATATATATATATATCTCATTAGCTTTC
Protein sequenceShow/hide protein sequence
MAATSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER