; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0617 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0617
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionINVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages;
Genome locationMC04:5510096..5511547
RNA-Seq ExpressionMC04g0617
SyntenyMC04g0617
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
InterPro domainsIPR008796 - Photosystem I reaction centre subunit N, chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136934.1 uncharacterized protein LOC101214221 [Cucumis sativus]1.25e-5277.52Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAA--ASDIGRRGLLFS----AVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDY
        MSSIGQSILMALAVT+NKFASSNVQSV RN++ A A  +S IGRR LL S    A AAA + VDSRTELLKRYLKKSE+NKEKNDKERL+SYYKRNYKDY
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAA--ASDIGRRGLLFS----AVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDY

Query:  FEFVEGSVRNKSELSETEKDIIEWLRRNK
        FEFVEGSV+NK+ELSE EK I+EWL+R+K
Subjt:  FEFVEGSVRNKSELSETEKDIIEWLRRNK

XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo]3.10e-5578.91Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVA-----AAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYF
        MSSIGQSILMALAVT+NKFASSNVQSV RN++ A  +S IGRR LL S VA     AA A VDSRTELLKRYLKKSE+NKEKNDKERL+SYYKRNYKDYF
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVA-----AAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYF

Query:  EFVEGSVRNKSELSETEKDIIEWLRRNK
        EFVEGSV+NK+ELSE EK I+EWL+RNK
Subjt:  EFVEGSVRNKSELSETEKDIIEWLRRNK

XP_022148828.1 uncharacterized protein LOC111017388 isoform X1 [Momordica charantia]1.11e-72100Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEG
        MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEG
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEG

Query:  SVRNKSELSETEKDIIEWLRRNK
        SVRNKSELSETEKDIIEWLRRNK
Subjt:  SVRNKSELSETEKDIIEWLRRNK

XP_022989008.1 uncharacterized protein LOC111486201 [Cucurbita maxima]1.64e-5073.85Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKD
        MSSIGQ+ILMALA+T+N+FASSNVQSV RN+       +  +A+SDI RRGLL SA  AA   VDSRTELLKRYLKKSE+NKEKNDKERL+S+YKRNYKD
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKD

Query:  YFEFVEGSVRNKSELSETEKDIIEWLRRNK
        YFEFVEGS++NK ELSE EK IIEWL+RNK
Subjt:  YFEFVEGSVRNKSELSETEKDIIEWLRRNK

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida]5.17e-5578.95Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQS------AAAAASDIGRRGLLFSAVAAAVAP----VDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRN
        MSSIGQSILMALAVT+NKFASSNVQSV RNQ+       A   S IGRRGLL SAVAAA A     VDSRTELLKRYLKKSE+NKEKNDKERL+SYYKRN
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQS------AAAAASDIGRRGLLFSAVAAAVAP----VDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRN

Query:  YKDYFEFVEGSVRNKSELSETEKDIIEWLRRNK
        YKDYFEFVEGSV+NK+ELSE EK IIEWL+RNK
Subjt:  YKDYFEFVEGSVRNKSELSETEKDIIEWLRRNK

TrEMBL top hitse value%identityAlignment
A0A1S3C176 uncharacterized protein LOC1034953191.50e-5578.91Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVA-----AAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYF
        MSSIGQSILMALAVT+NKFASSNVQSV RN++ A  +S IGRR LL S VA     AA A VDSRTELLKRYLKKSE+NKEKNDKERL+SYYKRNYKDYF
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVA-----AAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYF

Query:  EFVEGSVRNKSELSETEKDIIEWLRRNK
        EFVEGSV+NK+ELSE EK I+EWL+RNK
Subjt:  EFVEGSVRNKSELSETEKDIIEWLRRNK

A0A2I4HLI8 uncharacterized protein LOC1090192138.90e-4668.46Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDY
        MSSIGQSILMAL VTVN+FASSNVQ+VHR +      +     SDIGRR LL S + AA    DSRT+LLK+YLKKSE+NK KNDKERLDSYYKRNYKDY
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDY

Query:  FEFVEGSVR-NKSELSETEKDIIEWLRRNK
        FEFVEG+ + N+ +LSE EK II+WL+RNK
Subjt:  FEFVEGSVR-NKSELSETEKDIIEWLRRNK

A0A6J1D574 uncharacterized protein LOC111017388 isoform X15.36e-73100Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEG
        MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEG
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEG

Query:  SVRNKSELSETEKDIIEWLRRNK
        SVRNKSELSETEKDIIEWLRRNK
Subjt:  SVRNKSELSETEKDIIEWLRRNK

A0A6J1EM63 uncharacterized protein LOC1114346153.77e-5071.11Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYK
        MSSIGQ+ILMALA+T+N+FASSNVQSV RN+            ++ +A+SDI RRGLL SA  AA   VDSRTELLKRYLKKSE+NKEKNDKERL+S+YK
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYK

Query:  RNYKDYFEFVEGSVRNKSELSETEKDIIEWLRRNK
        RNYKDYFEFVEGS++NK ELSE EK IIEWL+RNK
Subjt:  RNYKDYFEFVEGSVRNKSELSETEKDIIEWLRRNK

A0A6J1JNZ7 uncharacterized protein LOC1114862017.92e-5173.85Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKD
        MSSIGQ+ILMALA+T+N+FASSNVQSV RN+       +  +A+SDI RRGLL SA  AA   VDSRTELLKRYLKKSE+NKEKNDKERL+S+YKRNYKD
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRNQ-------SAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKD

Query:  YFEFVEGSVRNKSELSETEKDIIEWLRRNK
        YFEFVEGS++NK ELSE EK IIEWL+RNK
Subjt:  YFEFVEGSVRNKSELSETEKDIIEWLRRNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.9e-3563.57Show/hide
Query:  MSSIGQSILMALAVTVNKFASSNVQSVHRN----QSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFE
        MSSI QSILMAL VTVNK+ASSNVQ+V RN     S  A  +D+GRR +LFS+ +   A + S  +LL++YLKK+E+NK KNDKERLDS+YKRNYKDYFE
Subjt:  MSSIGQSILMALAVTVNKFASSNVQSVHRN----QSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFE

Query:  FVEGSVRNK--SELSETEKDIIEWLRRNK
        FVEGS++ K  +ELSE+EK I+EWL+ NK
Subjt:  FVEGSVRNK--SELSETEKDIIEWLRRNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCAATCGGCCAAAGCATTCTGATGGCGCTCGCCGTCACCGTCAACAAATTCGCTTCCTCCAACGTTCAATCCGTCCACAGAAACCAATCAGCCGCCGCCGCCGC
TTCCGACATCGGAAGAAGAGGCCTCCTCTTCTCCGCCGTCGCCGCCGCCGTCGCCCCCGTCGACTCCAGAACCGAGCTGCTCAAAAGGTATCTCAAGAAATCTGAACAAA
ACAAAGAAAAAAATGACAAGGAGAGATTGGATAGTTACTACAAGCGAAATTACAAAGATTATTTTGAATTCGTTGAAGGATCGGTGAGGAATAAGAGCGAGCTTTCAGAA
ACTGAGAAAGACATTATTGAGTGGCTTCGACGAAACAAGTGA
mRNA sequenceShow/hide mRNA sequence
AAAGCAAGTATAAAAGGAGTGGGCTTGGGAACTTAAAAAAGTAATAAAAAAAGAATTGAAGAGGCATATGGACCAAGAAAGCCTCATCTCTTCGTTCAACCTCATCCTCA
AAACTCAAAATTGAAATCTGACCGTTAGGTTTCCGATGAGTTCAATCGGCCAAAGCATTCTGATGGCGCTCGCCGTCACCGTCAACAAATTCGCTTCCTCCAACGTTCAA
TCCGTCCACAGAAACCAATCAGCCGCCGCCGCCGCTTCCGACATCGGAAGAAGAGGCCTCCTCTTCTCCGCCGTCGCCGCCGCCGTCGCCCCCGTCGACTCCAGAACCGA
GCTGCTCAAAAGGTATCTCAAGAAATCTGAACAAAACAAAGAAAAAAATGACAAGGAGAGATTGGATAGTTACTACAAGCGAAATTACAAAGATTATTTTGAATTCGTTG
AAGGATCGGTGAGGAATAAGAGCGAGCTTTCAGAAACTGAGAAAGACATTATTGAGTGGCTTCGACGAAACAAGTGAAACTCACGTTCATCTATTAGATTGTTCAATTTT
TAGTCTAGATTTCGTTTAGTAATGATTTTGTTTTTACTTTTATGGCTTTTGAATCTAAGCTTGTTGGATGATATTTTTCGATTGGAAAGAAAATGCAAAAACCATTGAAC
TATAGAAACGGTTATAGCAGTTGGCCCTCCAAATTAGTTTGATAATTATAGGATTTTGTTGCAAATATTTCATAAGTTAGAGAGATAATTACTACTGAATTGAATGTGAG
TATCTTGACTCTAGTG
Protein sequenceShow/hide protein sequence
MSSIGQSILMALAVTVNKFASSNVQSVHRNQSAAAAASDIGRRGLLFSAVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEGSVRNKSELSE
TEKDIIEWLRRNK