; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0728 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0728
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionThylakoid soluble phosphoprotein TSP
Genome locationMC02:5840274..5840567
RNA-Seq ExpressionMC02g0728
SyntenyMC02g0728
Gene Ontology termsNA
InterPro domainsIPR021584 - Thylakoid soluble phosphoprotein TSP9
IPR037244 - Thylakoid soluble phosphoprotein TSP9 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010512.1 hypothetical protein SDJN02_27306, partial [Cucurbita argyrosperma subsp. argyrosperma]1.93e-4279.8Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKN-NGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G LE+DQLLETDP+LQKVEGK+ N G  +G      GRKNSVQVPPKKNGG FGGLFAK+D
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKN-NGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_004143212.1 uncharacterized protein LOC101205268 [Cucumis sativus]1.71e-4681.63Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+   +G G      GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_008464052.1 PREDICTED: uncharacterized protein LOC103502033 [Cucumis melo]2.76e-4579.59Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+  G         RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_023532336.1 uncharacterized protein LOC111794525 [Cucurbita pepo subsp. pepo]3.00e-4278.57Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+L  AFSPAAGRVFAATAAKGAGESKKEKG+ DWILGG+EKDQLLETDP+LQKVEGKN G    G      G+KNSVQVPPKKNG  FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_038901860.1 uncharacterized protein LOC120088545 [Benincasa hispida]1.12e-4480.61Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAA RVFAATAAKGAGESKKE+GL DWI+G L KDQLLETDP+LQKVEGK   G G+G      GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

TrEMBL top hitse value%identityAlignment
A0A0A0KC59 Uncharacterized protein8.30e-4781.63Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+   +G G      GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A1S3CKK5 uncharacterized protein LOC1035020331.34e-4579.59Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+  G         RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A5A7V5S9 Uncharacterized protein1.34e-4579.59Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+  G         RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A6J1G777 uncharacterized protein LOC1114514525.91e-4277.55Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+L  AFSPAAGRVFAATAAKGAGE KKEKG+ DWILGG+EKDQLLETDP+LQKVEGKN G    G      G+KNSVQVPPKKNG  FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A6J1L2X7 uncharacterized protein LOC1114999275.91e-4277.55Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+L  AFSPAAGRVFAATAAKGAGE KKEKG+ DWILGG+EKDQLLETDP+LQKVEGKN G    G      G+KNSVQVPPKKNG  FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47070.1 LOCATED IN: thylakoid, chloroplast thylakoid membrane, chloroplast, chloroplast envelope; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Thylakoid soluble phosphoprotein TSP9 (InterPro:IPR021584); Has 37 Blast hits to 37 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.2e-1144.21Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEK-DQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVP-PKKNGGGFGGL
        +++L ++F+PA  RV+A +   G+G  K+EK   D++LG + K DQ  ET+P+L+KV+ K   G   G     RG KNS   P PKK+ GGFGGL
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEK-DQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVP-PKKNGGGFGGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTCTGCCCATCGCATTCTCTCCGGCCGCCGGACGGGTCTTCGCGGCCACGGCGGCCAAGGGCGCCGGCGAAAGCAAGAAAGAGAAGGGCCTTTTCGATTGGAT
CCTCGGCGGCCTCGAGAAGGATCAGCTTCTCGAAACCGACCCCATTCTTCAAAAGGTCGAGGGGAAGAACAACGGCGGCGCCGGCGCCGGCGCTGCTGACGCTGCCCGTG
GCCGGAAGAACTCCGTCCAAGTCCCGCCCAAGAAGAACGGCGGCGGATTCGGCGGTCTATTTGCCAAGAAAGAC
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCTCTGCCCATCGCATTCTCTCCGGCCGCCGGACGGGTCTTCGCGGCCACGGCGGCCAAGGGCGCCGGCGAAAGCAAGAAAGAGAAGGGCCTTTTCGATTGGAT
CCTCGGCGGCCTCGAGAAGGATCAGCTTCTCGAAACCGACCCCATTCTTCAAAAGGTCGAGGGGAAGAACAACGGCGGCGCCGGCGCCGGCGCTGCTGACGCTGCCCGTG
GCCGGAAGAACTCCGTCCAAGTCCCGCCCAAGAAGAACGGCGGCGGATTCGGCGGTCTATTTGCCAAGAAAGAC
Protein sequenceShow/hide protein sequence
MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD