; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012162 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012162
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPhotosystem I reaction centre subunit N
Genome locationscaffold797:41040..42048
RNA-Seq ExpressionMS012162
SyntenyMS012162
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
InterPro domainsIPR008796 - Photosystem I reaction centre subunit N, chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136934.1 uncharacterized protein LOC101214221 [Cucumis sativus]2.0e-4079.07Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAA--ASDIGRRGLLFS----AVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDY
        MSSIGQSILMALAVT+NKFASSNVQSV RN++ A A  +S IGRR LL S    A AAA + VDSRTELLKRYLKKSEENKEKNDKERL+SYYKRNYKDY
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAA--ASDIGRRGLLFS----AVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDY

Query:  FEFVEGSVRNKSELSEAEKDIIEWLRRNK
        FEFVEGSV+NK+ELSEAEK I+EWL+R+K
Subjt:  FEFVEGSVRNKSELSEAEKDIIEWLRRNK

XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo]2.1e-4280.47Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAV-----AAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYF
        MSSIGQSILMALAVT+NKFASSNVQSV RN++ A  +S IGRR LL S V     AAA A VDSRTELLKRYLKKSEENKEKNDKERL+SYYKRNYKDYF
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAV-----AAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYF

Query:  EFVEGSVRNKSELSEAEKDIIEWLRRNK
        EFVEGSV+NK+ELSEAEK I+EWL+RNK
Subjt:  EFVEGSVRNKSELSEAEKDIIEWLRRNK

XP_022148828.1 uncharacterized protein LOC111017388 isoform X1 [Momordica charantia]7.0e-5498.37Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYFEFVEG
        MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSE+NKEKNDKERLDSYYKRNYKDYFEFVEG
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYFEFVEG

Query:  SVRNKSELSEAEKDIIEWLRRNK
        SVRNKSELSE EKDIIEWLRRNK
Subjt:  SVRNKSELSEAEKDIIEWLRRNK

XP_022989008.1 uncharacterized protein LOC111486201 [Cucurbita maxima]1.1e-3875.38Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKD
        MSSIGQ+ILMALA+T+N+FASSNVQSV RN+       +  +A+SDI RRGLL SA  AA   VDSRTELLKRYLKKSEENKEKNDKERL+S+YKRNYKD
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKD

Query:  YFEFVEGSVRNKSELSEAEKDIIEWLRRNK
        YFEFVEGS++NK ELSEAEK IIEWL+RNK
Subjt:  YFEFVEGSVRNKSELSEAEKDIIEWLRRNK

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida]2.7e-4280.45Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------SAAAAASDIGRRGLLFSAVAAAVA----PVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRN
        MSSIGQSILMALAVT+NKFASSNVQSV RNQ      + A   S IGRRGLL SAVAAA A     VDSRTELLKRYLKKSEENKEKNDKERL+SYYKRN
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------SAAAAASDIGRRGLLFSAVAAAVA----PVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRN

Query:  YKDYFEFVEGSVRNKSELSEAEKDIIEWLRRNK
        YKDYFEFVEGSV+NK+ELSEAEK IIEWL+RNK
Subjt:  YKDYFEFVEGSVRNKSELSEAEKDIIEWLRRNK

TrEMBL top hitse value%identityAlignment
A0A1S3C176 uncharacterized protein LOC1034953191.0e-4280.47Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAV-----AAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYF
        MSSIGQSILMALAVT+NKFASSNVQSV RN++ A  +S IGRR LL S V     AAA A VDSRTELLKRYLKKSEENKEKNDKERL+SYYKRNYKDYF
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAV-----AAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYF

Query:  EFVEGSVRNKSELSEAEKDIIEWLRRNK
        EFVEGSV+NK+ELSEAEK I+EWL+RNK
Subjt:  EFVEGSVRNKSELSEAEKDIIEWLRRNK

A0A2I4HLI8 uncharacterized protein LOC1090192132.7e-3570Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDY
        MSSIGQSILMAL VTVN+FASSNVQ+VHR +      +     SDIGRR LL S + AA    DSRT+LLK+YLKKSEENK KNDKERLDSYYKRNYKDY
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDY

Query:  FEFVEGSVR-NKSELSEAEKDIIEWLRRNK
        FEFVEG+ + N+ +LSEAEK II+WL+RNK
Subjt:  FEFVEGSVR-NKSELSEAEKDIIEWLRRNK

A0A6J1D574 uncharacterized protein LOC111017388 isoform X13.4e-5498.37Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYFEFVEG
        MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSE+NKEKNDKERLDSYYKRNYKDYFEFVEG
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYFEFVEG

Query:  SVRNKSELSEAEKDIIEWLRRNK
        SVRNKSELSE EKDIIEWLRRNK
Subjt:  SVRNKSELSEAEKDIIEWLRRNK

A0A6J1EM63 uncharacterized protein LOC1114346151.5e-3872.59Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYK
        MSSIGQ+ILMALA+T+N+FASSNVQSV RN+            ++ +A+SDI RRGLL SA  AA   VDSRTELLKRYLKKSEENKEKNDKERL+S+YK
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYK

Query:  RNYKDYFEFVEGSVRNKSELSEAEKDIIEWLRRNK
        RNYKDYFEFVEGS++NK ELSEAEK IIEWL+RNK
Subjt:  RNYKDYFEFVEGSVRNKSELSEAEKDIIEWLRRNK

A0A6J1JNZ7 uncharacterized protein LOC1114862015.2e-3975.38Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKD
        MSSIGQ+ILMALA+T+N+FASSNVQSV RN+       +  +A+SDI RRGLL SA  AA   VDSRTELLKRYLKKSEENKEKNDKERL+S+YKRNYKD
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKD

Query:  YFEFVEGSVRNKSELSEAEKDIIEWLRRNK
        YFEFVEGS++NK ELSEAEK IIEWL+RNK
Subjt:  YFEFVEGSVRNKSELSEAEKDIIEWLRRNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.7e-3564.34Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRN----QSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYFE
        MSSI QSILMAL VTVNK+ASSNVQ+V RN     S  A  +D+GRR +LFS+ +   A + S  +LL++YLKK+EENK KNDKERLDS+YKRNYKDYFE
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRN----QSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYFE

Query:  FVEGSVRNK--SELSEAEKDIIEWLRRNK
        FVEGS++ K  +ELSE+EK I+EWL+ NK
Subjt:  FVEGSVRNK--SELSEAEKDIIEWLRRNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCAATCGGCCAAAGCATTCTGATGGCGCTCGCCGTCACCGTCAACAAATTCGCTTCCTCCAACGTTCAATCCGTCCACAGAAACCAATCAGCCGCCGCCGCCGC
TTCCGACATCGGAAGAAGAGGCCTCCTCTTCTCCGCCGTCGCCGCCGCCGTCGCCCCCGTCGACTCCAGAACCGAGCTGCTCAAAAGGTATCTCAAGAAATCTGAAGAAA
ACAAAGAAAAAAATGACAAGGAGAGATTGGATAGTTACTACAAGCGAAATTACAAAGATTATTTTGAATTCGTTGAAGGATCGGTGAGGAATAAGAGCGAGCTTTCAGAA
GCTGAGAAAGACATTATTGAGTGGCTTCGACGAAACAAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCAATCGGCCAAAGCATTCTGATGGCGCTCGCCGTCACCGTCAACAAATTCGCTTCCTCCAACGTTCAATCCGTCCACAGAAACCAATCAGCCGCCGCCGCCGC
TTCCGACATCGGAAGAAGAGGCCTCCTCTTCTCCGCCGTCGCCGCCGCCGTCGCCCCCGTCGACTCCAGAACCGAGCTGCTCAAAAGGTATCTCAAGAAATCTGAAGAAA
ACAAAGAAAAAAATGACAAGGAGAGATTGGATAGTTACTACAAGCGAAATTACAAAGATTATTTTGAATTCGTTGAAGGATCGGTGAGGAATAAGAGCGAGCTTTCAGAA
GCTGAGAAAGACATTATTGAGTGGCTTCGACGAAACAAG
Protein sequenceShow/hide protein sequence
MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEENKEKNDKERLDSYYKRNYKDYFEFVEGSVRNKSELSE
AEKDIIEWLRRNK