; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS014218 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS014218
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionThylakoid soluble phosphoprotein TSP
Genome locationscaffold5:1461586..1461879
RNA-Seq ExpressionMS014218
SyntenyMS014218
Gene Ontology termsNA
InterPro domainsIPR021584 - Thylakoid soluble phosphoprotein TSP9
IPR037244 - Thylakoid soluble phosphoprotein TSP9 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010512.1 hypothetical protein SDJN02_27306, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-3079.8Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGK-NNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G LE+DQLLETDP+LQKVEGK +N G  +G      GRKNSVQVPPKKN GGFGGLFAK+D
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGK-NNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_004143212.1 uncharacterized protein LOC101205268 [Cucumis sativus]1.4e-3381.63Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK +  +G G      GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_008464052.1 PREDICTED: uncharacterized protein LOC103502033 [Cucumis melo]1.2e-3279.59Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+  G         RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_023532336.1 uncharacterized protein LOC111794525 [Cucurbita pepo subsp. pepo]1.9e-3078.57Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+L  AFSPAAGRVFAATAAKGAGESKKEKG+ DWILGG+EKDQLLETDP+LQKVEGKN G    G      G+KNSVQVPPKKN G FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

XP_038901860.1 uncharacterized protein LOC120088545 [Benincasa hispida]2.7e-3280.61Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAA RVFAATAAKGAGESKKE+GL DWI+G L KDQLLETDP+LQKVEGK    AG G+     GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

TrEMBL top hitse value%identityAlignment
A0A0A0KC59 Uncharacterized protein6.9e-3481.63Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK +  +G G      GRKNSVQVPPKKNGGGFGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A1S3CKK5 uncharacterized protein LOC1035020335.8e-3379.59Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+  G         RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A5A7V5S9 Uncharacterized protein5.8e-3379.59Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+LPIAFSPAAGRVFAATAAKGAGESKKEKGL DWI+G L KDQLLETDP+LQKVEGK+  G         RGRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A6J1G777 uncharacterized protein LOC1114514522.7e-3077.55Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+L  AFSPAAGRVFAATAAKGAGE KKEKG+ DWILGG+EKDQLLETDP+LQKVEGKN G    G      G+KNSVQVPPKKN G FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

A0A6J1L2X7 uncharacterized protein LOC1114999272.7e-3077.55Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD
        MA+L  AFSPAAGRVFAATAAKGAGE KKEKG+ DWILGG+EKDQLLETDP+LQKVEGKN G    G      G+KNSVQVPPKKN G FGGLFAKKD
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47070.1 LOCATED IN: thylakoid, chloroplast thylakoid membrane, chloroplast, chloroplast envelope; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Thylakoid soluble phosphoprotein TSP9 (InterPro:IPR021584); Has 37 Blast hits to 37 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.2e-1144.21Show/hide
Query:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEK-DQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVP-PKKNGGGFGGL
        +++L ++F+PA  RV+A +   G+G  K+EK   D++LG + K DQ  ET+P+L+KV+ K   G   G     RG KNS   P PKK+ GGFGGL
Subjt:  MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEK-DQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVP-PKKNGGGFGGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTCTGCCCATCGCATTCTCTCCGGCCGCCGGACGGGTCTTCGCGGCCACGGCGGCCAAGGGCGCCGGCGAAAGCAAGAAAGAGAAGGGCCTTTTCGATTGGAT
CCTCGGCGGCCTCGAGAAGGATCAGCTTCTCGAAACCGACCCCATTCTTCAAAAGGTCGAGGGGAAGAACAACGGCGGCGCCGGCGCCGGCGCTGCTGACGCTGCCCGTG
GCCGGAAGAACTCCGTCCAAGTCCCGCCCAAGAAGAACGGCGGCGGATTCGGCGGTCTATTTGCCAAGAAAGAC
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCTCTGCCCATCGCATTCTCTCCGGCCGCCGGACGGGTCTTCGCGGCCACGGCGGCCAAGGGCGCCGGCGAAAGCAAGAAAGAGAAGGGCCTTTTCGATTGGAT
CCTCGGCGGCCTCGAGAAGGATCAGCTTCTCGAAACCGACCCCATTCTTCAAAAGGTCGAGGGGAAGAACAACGGCGGCGCCGGCGCCGGCGCTGCTGACGCTGCCCGTG
GCCGGAAGAACTCCGTCCAAGTCCCGCCCAAGAAGAACGGCGGCGGATTCGGCGGTCTATTTGCCAAGAAAGAC
Protein sequenceShow/hide protein sequence
MAALPIAFSPAAGRVFAATAAKGAGESKKEKGLFDWILGGLEKDQLLETDPILQKVEGKNNGGAGAGAADAARGRKNSVQVPPKKNGGGFGGLFAKKD