; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsaV3_7G007730 (gene) of Cucumber (Chinese Long) v3 genome

Gene IDCsaV3_7G007730
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
DescriptionPhotosystem I reaction centre subunit N
Genome locationchr7:4808561..4810245
RNA-Seq ExpressionCsaV3_7G007730
SyntenyCsaV3_7G007730
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
InterPro domainsIPR008796 - Photosystem I reaction centre subunit N, chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136934.1 uncharacterized protein LOC101214221 [Cucumis sativus]2.1e-56100Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDY
        MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDY

Query:  FEFVEGSVKNKNELSEAEKGIVEWLKRSK
        FEFVEGSVKNKNELSEAEKGIVEWLKRSK
Subjt:  FEFVEGSVKNKNELSEAEKGIVEWLKRSK

XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo]1.7e-5093.85Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPAS-AAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKD
        MSSIGQSILMALAVTLNKFASSNVQSVQRNK  ATATVSSPIGRR LLLST+APAS AAAA+ VDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKD
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPAS-AAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKD

Query:  YFEFVEGSVKNKNELSEAEKGIVEWLKRSK
        YFEFVEGSVKNKNELSEAEKGIVEWLKR+K
Subjt:  YFEFVEGSVKNKNELSEAEKGIVEWLKRSK

XP_022927848.1 uncharacterized protein LOC111434615 [Cucurbita moschata]1.0e-3974.1Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK----------ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLE
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRNK           +A+ + SS I RR LLL      SAA A+ VDSRTELLKRYLKKSEENKEKNDKERLE
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK----------ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLE

Query:  SYYKRNYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK
        S+YKRNYKDYFEFVEGS+KNK+ELSEAEKGI+EWLKR+K
Subjt:  SYYKRNYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK

XP_022989008.1 uncharacterized protein LOC111486201 [Cucurbita maxima]3.2e-4178.36Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK-----ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKR
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRNK      TAT + SS I RR LLL      SAA A+ VDSRTELLKRYLKKSEENKEKNDKERLES+YKR
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK-----ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKR

Query:  NYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK
        NYKDYFEFVEGS+KNK+ELSEAEKGI+EWLKR+K
Subjt:  NYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida]1.5e-4684.96Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKA----TATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
        MSSIGQSILMALAVTLNKFASSNVQSVQRN+A    TATAT  S IGRR LLLS +A A+A     VDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKA----TATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN

Query:  YKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK
        YKDYFEFVEGSVKNKNELSEAEKGI+EWLKR+K
Subjt:  YKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK

TrEMBL top hitse value%identityAlignment
A0A1S3C176 uncharacterized protein LOC1034953198.2e-5193.85Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPAS-AAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKD
        MSSIGQSILMALAVTLNKFASSNVQSVQRNK  ATATVSSPIGRR LLLST+APAS AAAA+ VDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKD
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPAS-AAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKD

Query:  YFEFVEGSVKNKNELSEAEKGIVEWLKRSK
        YFEFVEGSVKNKNELSEAEKGIVEWLKR+K
Subjt:  YFEFVEGSVKNKNELSEAEKGIVEWLKRSK

A0A2I4HLI8 uncharacterized protein LOC1090192131.8e-3468.66Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK----ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN
        MSSIGQSILMAL VT+N+FASSNVQ+V R +    ++ T T +S IGRR LLLSTL     AA    DSRT+LLK+YLKKSEENK KNDKERL+SYYKRN
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK----ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRN

Query:  YKDYFEFVEGSVK-NKNELSEAEKGIVEWLKRSK
        YKDYFEFVEG+ K N+ +LSEAEKGI++WL+R+K
Subjt:  YKDYFEFVEGSVK-NKNELSEAEKGIVEWLKRSK

A0A6J1D574 uncharacterized protein LOC111017388 isoform X11.9e-3977.52Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDY
        MSSIGQSILMALAVT+NKFASSNVQSV RN++ A A  +S IGRR LL S    A AAA + VDSRTELLKRYLKKSE+NKEKNDKERL+SYYKRNYKDY
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDY

Query:  FEFVEGSVKNKNELSEAEKGIVEWLKRSK
        FEFVEGSV+NK+ELSE EK I+EWL+R+K
Subjt:  FEFVEGSVKNKNELSEAEKGIVEWLKRSK

A0A6J1EM63 uncharacterized protein LOC1114346155.0e-4074.1Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK----------ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLE
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRNK           +A+ + SS I RR LLL      SAA A+ VDSRTELLKRYLKKSEENKEKNDKERLE
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK----------ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLE

Query:  SYYKRNYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK
        S+YKRNYKDYFEFVEGS+KNK+ELSEAEKGI+EWLKR+K
Subjt:  SYYKRNYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK

A0A6J1JNZ7 uncharacterized protein LOC1114862011.6e-4178.36Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK-----ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKR
        MSSIGQ+ILMALA+TLN+FASSNVQSVQRNK      TAT + SS I RR LLL      SAA A+ VDSRTELLKRYLKKSEENKEKNDKERLES+YKR
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNK-----ATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKR

Query:  NYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK
        NYKDYFEFVEGS+KNK+ELSEAEKGI+EWLKR+K
Subjt:  NYKDYFEFVEGSVKNKNELSEAEKGIVEWLKRSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.5e-3160.15Show/hide
Query:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSP--IGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYK
        MSSI QSILMAL VT+NK+ASSNVQ+V+RN     +  + P  +GRR++L S+ +  +AA  S+     +LL++YLKK+EENK KNDKERL+S+YKRNYK
Subjt:  MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSP--IGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYK

Query:  DYFEFVEGSVKNKN--ELSEAEKGIVEWLKRSK
        DYFEFVEGS+K K   ELSE+EK I+EWLK +K
Subjt:  DYFEFVEGSVKNKN--ELSEAEKGIVEWLKRSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCCATTGGCCAAAGCATTCTCATGGCCTTAGCCGTCACTCTCAACAAATTTGCTTCTTCCAACGTTCAATCCGTTCAAAGAAACAAAGCCACCGCCACCGCCAC
CGTCTCTTCACCAATTGGAAGAAGAAGCCTCCTCTTGTCCACCCTTGCCCCCGCCTCCGCCGCCGCCGCCTCCACGGTCGACTCCAGAACAGAGCTGCTAAAAAGGTACC
TCAAGAAGTCTGAAGAGAACAAAGAAAAGAATGACAAGGAGAGATTGGAAAGTTACTACAAGCGAAATTACAAAGATTATTTTGAGTTTGTGGAAGGATCGGTGAAGAAC
AAGAACGAACTTTCCGAAGCTGAAAAAGGTATTGTTGAGTGGCTTAAACGAAGTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCCATTGGCCAAAGCATTCTCATGGCCTTAGCCGTCACTCTCAACAAATTTGCTTCTTCCAACGTTCAATCCGTTCAAAGAAACAAAGCCACCGCCACCGCCAC
CGTCTCTTCACCAATTGGAAGAAGAAGCCTCCTCTTGTCCACCCTTGCCCCCGCCTCCGCCGCCGCCGCCTCCACGGTCGACTCCAGAACAGAGCTGCTAAAAAGGTACC
TCAAGAAGTCTGAAGAGAACAAAGAAAAGAATGACAAGGAGAGATTGGAAAGTTACTACAAGCGAAATTACAAAGATTATTTTGAGTTTGTGGAAGGATCGGTGAAGAAC
AAGAACGAACTTTCCGAAGCTGAAAAAGGTATTGTTGAGTGGCTTAAACGAAGTAAATGA
Protein sequenceShow/hide protein sequence
MSSIGQSILMALAVTLNKFASSNVQSVQRNKATATATVSSPIGRRSLLLSTLAPASAAAASTVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDYFEFVEGSVKN
KNELSEAEKGIVEWLKRSK