; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004835 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004835
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionoral cancer-overexpressed protein 1 homolog
Genome locationscaffold176:890897..891310
RNA-Seq ExpressionMS004835
SyntenyMS004835
Gene Ontology termsGO:0005542 - folic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR019191 - Essential protein Yae1, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047887.1 formimidoyltransferase-cyclodeaminase isoform X2 [Cucumis melo var. makuwa]1.0e-6186.23Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEE HLKEG+A+GY+DGLVAGKEEA+QVGLKVGFEVGEE+GFYRGCVD WNSAI I+PERFSVRVRKSVK +EEL+EKYPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF
        ELMEGLRLKFRAI ATLGVKLEYNGYP+S SDGK+I++
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF

KAG6604815.1 Protein LTO1-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.1e-6289.71Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEETHLKEG+AEGYKDGLVAGKEEAKQVGLKVGFEVGEE+GF+RGCVD W S ILIDPERFS RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI
        ELMEGLRLKFR ICATLGVKLEY+G+PK ASDGKEI
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI

XP_022947461.1 oral cancer-overexpressed protein 1 homolog [Cucurbita moschata]4.1e-6390.44Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEETHLKEG+AEGYKDGLVAGKEEAKQVGLKVGFEVGEE+GF+RGCVD W S ILIDPERFS RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI
        ELMEGLRLKFR ICATLGVKLEY+G+PKSASDGKEI
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI

XP_023532878.1 oral cancer-overexpressed protein 1 homolog [Cucurbita pepo subsp. pepo]1.2e-6289.71Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEETHLKEG+AEGYKDGLVAGKEEAKQVGLKVGFEVGEE+GF+RGCVD W S ILIDPERFS RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI
        ELMEGLRLKFR ICATLGVKLEY+G+PKSASDG+EI
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI

XP_038902357.1 formimidoyltransferase-cyclodeaminase-like [Benincasa hispida]4.4e-6592.03Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEETHLKEGFAEGYKDGLVAGKEEA+QVGLKVGFEVGEELGFYRGCVD WNS I I+PERFS+RVRKSVKQMEELVEKYPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF
        ELMEGLRLKFRAI ATLGVKLEY GYPKS SDGK+IEF
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF

TrEMBL top hitse value%identityAlignment
A0A0A0KE52 Yae1_N domain-containing protein7.6e-6387.68Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEE HLKEG+A+GYKDGLVAGKEEA+QVGLKVGFEVGEELGFYRGCVD WNS I I+PERFS+RVRKSVK MEEL+EKYPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF
        ELMEGLRLKFRA+ ATLGVKLEY+GYPKS SDGK+IEF
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF

A0A5D3BVB4 Formimidoyltransferase-cyclodeaminase isoform X24.9e-6286.23Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEE HLKEG+A+GY+DGLVAGKEEA+QVGLKVGFEVGEE+GFYRGCVD WNSAI I+PERFSVRVRKSVK +EEL+EKYPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF
        ELMEGLRLKFRAI ATLGVKLEYNGYP+S SDGK+I++
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF

A0A6J1CFK0 formimidoyltransferase-cyclodeaminase-like isoform X28.4e-6297.58Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIFASSLNLEETHLKEG+AEGYKDGLVAGKEEAKQVGLKVGF+VGEELGFYRGCVDAWNSAILIDPERFSVRVRKS+KQMEELVEKYPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYN
        ELMEGLRLKFRAICATLGVKLEYN
Subjt:  ELMEGLRLKFRAICATLGVKLEYN

A0A6J1G6H8 oral cancer-overexpressed protein 1 homolog2.0e-6390.44Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEETHLKEG+AEGYKDGLVAGKEEAKQVGLKVGFEVGEE+GF+RGCVD W S ILIDPERFS RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI
        ELMEGLRLKFR ICATLGVKLEY+G+PKSASDGKEI
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI

A0A6J1I414 oral cancer-overexpressed protein 1 homolog1.1e-6188.24Show/hide
Query:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ
        MDDIF SSLNLEETHLKEG+AEGYKDGLVAGKEEAKQVGLKVGFEVGEE+GF+RGCVD WNS I I PERFS RV+KSVKQMEELVE+YPL+DPENEQVQ
Subjt:  MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI
        E+MEGLRLKFR ICATLGVKLEY+G+PKSASDGKEI
Subjt:  ELMEGLRLKFRAICATLGVKLEYNGYPKSASDGKEI

SwissProt top hitse value%identityAlignment
Q75JW3 Protein LTO1 homolog1.2e-0928.79Show/hide
Query:  FASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPE------------RFSVRVRKSVKQMEELVEKYPLQ
        F   L++E         +G  DG   G  E  Q+G + G E+G+E+G+Y+ CV  WN  + I+              +FSVR  ++++++ +L+E Y L 
Subjt:  FASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPE------------RFSVRVRKSVKQMEELVEKYPLQ

Query:  DPENEQVQELMEGLRLKFRAICATLGVKLEYN
        D  +E +   +  +RLKF+     LG++ + N
Subjt:  DPENEQVQELMEGLRLKFRAICATLGVKLEYN

Q8CH62 Protein LTO1 homolog1.5e-1232Show/hide
Query:  DIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQEL
        DIF + +  +E    EG+ EGY++G   G  E K+ G+  G ++G E+G YRG   AW   +         R  K V+ +  L++ +P  DP  E++ E 
Subjt:  DIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQEL

Query:  MEGLRLKFRAICATLGVKLEYNGYP
        ++ +R KFR +C+ L V+ ++   P
Subjt:  MEGLRLKFRAICATLGVKLEYNGYP

Q8WV07 Protein LTO1 homolog1.7e-1127.94Show/hide
Query:  DIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQEL
        DIF + +  +E    EG+ EGY++G   G  E +Q G   G ++G E+G Y+G   AW   +         R  K ++ +  +++K+P  DP  +++ E 
Subjt:  DIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQEL

Query:  MEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF
        ++ +R KF+  C+ L V+ ++    K +++G  + F
Subjt:  MEGLRLKFRAICATLGVKLEYNGYPKSASDGKEIEF

Arabidopsis top hitse value%identityAlignment
AT2G20830.2 transferases;folic acid binding2.2e-3050.43Show/hide
Query:  DDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQE
        +D     + LEETH+++GF EGY++GLV+G+E+A+ +GLK+GFE GE +GFYRGC   WNSA+ IDP RFS ++ K +     L++K PL DPE+E    
Subjt:  DDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQE

Query:  LMEGLRLKFRAICATLG
        + + LR+KF  ICA+LG
Subjt:  LMEGLRLKFRAICATLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGACATCTTCGCTTCATCTCTCAATCTCGAAGAGACCCACCTGAAGGAAGGCTTCGCCGAGGGCTATAAAGACGGTCTAGTCGCTGGCAAAGAGGAGGCTAAACA
AGTAGGCCTTAAAGTTGGGTTCGAGGTCGGCGAGGAGTTGGGATTCTACAGAGGGTGTGTGGACGCGTGGAATTCTGCAATTCTGATCGACCCGGAACGGTTCTCGGTTC
GGGTTCGGAAGAGTGTCAAGCAGATGGAGGAGTTGGTGGAGAAATACCCACTTCAGGACCCTGAGAATGAGCAAGTTCAGGAGCTGATGGAAGGCTTGAGGCTCAAGTTC
AGAGCGATTTGCGCCACTCTGGGTGTCAAATTGGAGTATAATGGCTATCCGAAATCGGCTTCTGATGGAAAAGAGATTGAGTTT
mRNA sequenceShow/hide mRNA sequence
ATGGACGACATCTTCGCTTCATCTCTCAATCTCGAAGAGACCCACCTGAAGGAAGGCTTCGCCGAGGGCTATAAAGACGGTCTAGTCGCTGGCAAAGAGGAGGCTAAACA
AGTAGGCCTTAAAGTTGGGTTCGAGGTCGGCGAGGAGTTGGGATTCTACAGAGGGTGTGTGGACGCGTGGAATTCTGCAATTCTGATCGACCCGGAACGGTTCTCGGTTC
GGGTTCGGAAGAGTGTCAAGCAGATGGAGGAGTTGGTGGAGAAATACCCACTTCAGGACCCTGAGAATGAGCAAGTTCAGGAGCTGATGGAAGGCTTGAGGCTCAAGTTC
AGAGCGATTTGCGCCACTCTGGGTGTCAAATTGGAGTATAATGGCTATCCGAAATCGGCTTCTGATGGAAAAGAGATTGAGTTT
Protein sequenceShow/hide protein sequence
MDDIFASSLNLEETHLKEGFAEGYKDGLVAGKEEAKQVGLKVGFEVGEELGFYRGCVDAWNSAILIDPERFSVRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRLKF
RAICATLGVKLEYNGYPKSASDGKEIEF