CuGenDBv2

Tan0015108 (gene) of Snake gourd v1 genome

Gene ID: Tan0015108
Organism: Trichosanthes anguina (Snake gourd v1)
Description: oral cancer-overexpressed protein 1 homolog
Genome location: LG01:89084324..89085720
RNA-Seq Expression: Tan0015108
Synteny: Tan0015108
Gene Ontology terms: GO:0005542 - folic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR019191 - Essential protein Yae1, N-terminal
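
The genome location field packs a sequence ID and a coordinate range into one string. As a minimal sketch, the snippet below splits it apart and computes the span of the gene. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG01:89084324..89085720")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end - start + 1
print(seqid, start, end, span_bp)
```

Under that assumption the gene spans 1397 bp on LG01.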


Homology
GenBank top hits (e value, % identity, alignment)
KAG6604815.1 Protein LTO1-like protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.4e-59 | % identity: 85.29
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEETHLKEGYAEGYKDGLVAGK+EAK+VGLKVGFEVGEEIGF+RGCVDVW SVI IDP+  S RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI
        +LMEGLRLKFR ICATLGVKLEY+G+PK AS+ KEI
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI

XP_022947461.1 oral cancer-overexpressed protein 1 homolog [Cucurbita moschata] | e value: 1.3e-59 | % identity: 86.03
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEETHLKEGYAEGYKDGLVAGK+EAK+VGLKVGFEVGEEIGF+RGCVDVW SVI IDP+  S RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI
        +LMEGLRLKFR ICATLGVKLEY+G+PKSAS+ KEI
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI

XP_022970845.1 oral cancer-overexpressed protein 1 homolog [Cucurbita maxima] | e value: 2.2e-59 | % identity: 84.56
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEETHLKEGYAEGYKDGLVAGK+EAK+VGLKVGFEVGEE+GF+RGCVDVWNSVIRI P+  S RV+KSVKQMEELVE+YPL+DPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI
        ++MEGLRLKFR ICATLGVKLEY+G+PKSAS+ KEI
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI

XP_023532878.1 oral cancer-overexpressed protein 1 homolog [Cucurbita pepo subsp. pepo] | e value: 3.8e-59 | % identity: 85.29
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEETHLKEGYAEGYKDGLVAGK+EAK+VGLKVGFEVGEEIGF+RGCVDVW SVI IDP+  S RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI
        +LMEGLRLKFR ICATLGVKLEY+G+PKSAS+ +EI
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI

XP_038902357.1 formimidoyltransferase-cyclodeaminase-like [Benincasa hispida] | e value: 8.1e-62 | % identity: 86.96
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEETHLKEG+AEGYKDGLVAGK+EA++VGLKVGFEVGEE+GFYRGCVDVWNSVIRI+P+  SIRVRKSVKQMEELVEKYPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEIEF
        +LMEGLRLKFRAI ATLGVKLEY GYPKS S+ K+IEF
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEIEF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KE52 Yae1_N domain-containing protein | e value: 2.2e-60 | % identity: 84.06
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEE HLKEGYA+GYKDGLVAGK+EA++VGLKVGFEVGEE+GFYRGCVDVWNSVIRI+P+  SIRVRKSVK MEEL+EKYPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEIEF
        +LMEGLRLKFRA+ ATLGVKLEY+GYPKS S+ K+IEF
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEIEF

A0A5D3BVB4 Formimidoyltransferase-cyclodeaminase isoform X2 | e value: 6.9e-59 | % identity: 81.16
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEE HLKEGYA+GY+DGLVAGK+EA++VGLKVGFEVGEEIGFYRGCVDVWNS IRI+P+  S+RVRKSVK +EEL+EKYPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEIEF
        +LMEGLRLKFRAI ATLGVKLEYNGYP+S S+ K+I++
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEIEF

A0A6J1CFK0 formimidoyltransferase-cyclodeaminase-like isoform X2 | e value: 9.4e-56 | % identity: 87.1
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+F SSLNLEETHLKEGYAEGYKDGLVAGK+EAK+VGLKVGF+VGEE+GFYRGCVD WNS I IDP+  S+RVRKS+KQMEELVEKYPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYN
        +LMEGLRLKFRAICATLGVKLEYN
Subjt:  DLMEGLRLKFRAICATLGVKLEYN

A0A6J1G6H8 oral cancer-overexpressed protein 1 homolog | e value: 6.3e-60 | % identity: 86.03
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEETHLKEGYAEGYKDGLVAGK+EAK+VGLKVGFEVGEEIGF+RGCVDVW SVI IDP+  S RV+KSVKQMEELVE+YPLQDPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI
        +LMEGLRLKFR ICATLGVKLEY+G+PKSAS+ KEI
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI

A0A6J1I414 oral cancer-overexpressed protein 1 homolog | e value: 1.1e-59 | % identity: 84.56
Query:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ
        +DD+FDSSLNLEETHLKEGYAEGYKDGLVAGK+EAK+VGLKVGFEVGEE+GF+RGCVDVWNSVIRI P+  S RV+KSVKQMEELVE+YPL+DPENEQVQ
Subjt:  LDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQ

Query:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI
        ++MEGLRLKFR ICATLGVKLEY+G+PKSAS+ KEI
Subjt:  DLMEGLRLKFRAICATLGVKLEYNGYPKSASNEKEI

SwissProt top hits (e value, % identity, alignment)
Q75JW3 Protein LTO1 homolog | e value: 7.5e-10 | % identity: 28.79
Query:  FDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQ------------WCSIRVRKSVKQMEELVEKYPLQ
        FD  L++E         +G  DG   G  E  ++G + G E+G+EIG+Y+ CV VWN ++ I+                S+R  ++++++ +L+E Y L 
Subjt:  FDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQ------------WCSIRVRKSVKQMEELVEKYPLQ

Query:  DPENEQVQDLMEGLRLKFRAICATLGVKLEYN
        D  +E + + +  +RLKF+     LG++ + N
Subjt:  DPENEQVQDLMEGLRLKFRAICATLGVKLEYN

Q8CH62 Protein LTO1 homolog | e value: 2.5e-13 | % identity: 32
Query:  DVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQDL
        D+FD+ +  +E    EGY EGY++G   G  E K  G+  G ++G EIG YRG    W  ++         R  K V+ +  L++ +P  DP  E++ + 
Subjt:  DVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQDL

Query:  MEGLRLKFRAICATLGVKLEYNGYP
        ++ +R KFR +C+ L V+ ++   P
Subjt:  MEGLRLKFRAICATLGVKLEYNGYP

Q8WV07 Protein LTO1 homolog | e value: 1.0e-11 | % identity: 28.1
Query:  DVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQDL
        D+FD+ +  +E    EGY EGY++G   G  E ++ G   G ++G EIG Y+G    W  ++         R  K ++ +  +++K+P  DP  +++ + 
Subjt:  DVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQDL

Query:  MEGLRLKFRAICATLGVKLEY
        ++ +R KF+  C+ L V+ ++
Subjt:  MEGLRLKFRAICATLGVKLEY

Arabidopsis top hits (e value, % identity, alignment)
AT2G20830.2 transferases;folic acid binding | e value: 1.9e-29 | % identity: 48.72
Query:  DDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQD
        +D  D  + LEETH+++G+ EGY++GLV+G+++A+ +GLK+GFE GE IGFYRGC  +WNS +RIDP   S ++ K +     L++K PL DPE+E    
Subjt:  DDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQD

Query:  LMEGLRLKFRAICATLG
        + + LR+KF  ICA+LG
Subjt:  LMEGLRLKFRAICATLG


Sequences
CDS sequence
ATGGACAGAGAGGTTCATTTAGACGACGTCTTCGATTCCTCTCTCAATCTTGAAGAGACCCACTTGAAGGAAGGCTACGCCGAGGGCTACAAAGATGGCCTAGTTGCTGG
CAAACAAGAGGCAAAAGAAGTAGGCCTTAAAGTTGGTTTCGAGGTCGGCGAGGAAATTGGTTTCTACAGAGGGTGTGTGGACGTCTGGAATTCAGTTATTCGGATCGACC
CGCAATGGTGTTCCATTCGGGTTCGGAAGAGTGTGAAGCAGATGGAGGAGTTGGTAGAGAAATACCCACTTCAGGACCCTGAGAATGAGCAAGTTCAGGATCTGATGGAA
GGGTTGAGGCTCAAGTTCAGAGCGATTTGTGCCACTCTTGGTGTCAAATTGGAGTATAATGGCTATCCGAAATCCGCTTCAAATGAAAAAGAAATTGAGTTTTGA
mRNA sequence
GCAGGGCTTCGATTGTCGTACGAGGCGGTGAGAGTTGGAGATCTGGGCTTCGTACAACTGTGGCGGCGAGGGAGTTGTGCAATTCTGAGCGACTGTGTTGGTCGGCGGCG
GTGAGGCTTTCGTGGAGAGGTAGTTTGCTCAGATACTCGGCGCCGGTGAAGACGGATTCCAAATTGCTCACGGGCTAGGGATTTTCTTTTTCTTTCGCGCACGGGGGTTT
CGTACGCACAGAAATTCGCACGTGTTCTGAATCTGAACGCAATTCAGGAGTAGAGTACGCCGGGATTTCGAACGTTGAAGTTTTGCTCACAAGGGGAGCTCGCCGGAGAA
GAAGAGGATGGACAGAGAGGTTCATTTAGACGACGTCTTCGATTCCTCTCTCAATCTTGAAGAGACCCACTTGAAGGAAGGCTACGCCGAGGGCTACAAAGATGGCCTAG
TTGCTGGCAAACAAGAGGCAAAAGAAGTAGGCCTTAAAGTTGGTTTCGAGGTCGGCGAGGAAATTGGTTTCTACAGAGGGTGTGTGGACGTCTGGAATTCAGTTATTCGG
ATCGACCCGCAATGGTGTTCCATTCGGGTTCGGAAGAGTGTGAAGCAGATGGAGGAGTTGGTAGAGAAATACCCACTTCAGGACCCTGAGAATGAGCAAGTTCAGGATCT
GATGGAAGGGTTGAGGCTCAAGTTCAGAGCGATTTGTGCCACTCTTGGTGTCAAATTGGAGTATAATGGCTATCCGAAATCCGCTTCAAATGAAAAAGAAATTGAGTTTT
GATGTGATTTTTGTGAATATTGTGTAATAGGCTCTTTGGAGATTTCACAGTAGTTGATGAAATGATTTGGTCTTATTATATATATCTAAATAAGAAGTCTCTACTTCATC
TTTTGTGGAGAGTTCTTTTGGTTGCGGTTCAAAAGTAGGCTGTATTATACTATTATTTGTATTATGGCATTTAGTTTGTGTATTAGAATTTGAAAACAAAAAATGATGAA
AAGTTATTCAATGTACATGATGTATAGGGTAGTTAAAGGTTCTTAGCAACAAGTAATAGAAAACTTTTGCTTGAAAGTGTTGAGTTTATCATGATTAATGATATTTACTT
ACTTTTCCTTGAGCCAGTGGATTCCTCATTTTTCGCAAGGATGAACTCAAGTGCTGCATTACTTTGCTTTTCAAATAATTTTAGATTGGCTTTCAAAATGTATAAATTTG
ACCATG
Protein sequence
MDREVHLDDVFDSSLNLEETHLKEGYAEGYKDGLVAGKQEAKEVGLKVGFEVGEEIGFYRGCVDVWNSVIRIDPQWCSIRVRKSVKQMEELVEKYPLQDPENEQVQDLME
GLRLKFRAICATLGVKLEYNGYPKSASNEKEIEF
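
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly. The helper name `translate` is hypothetical, and only the first 30 and last 15 bases of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


cds_start = "ATGGACAGAGAGGTTCATTTAGACGACGTC"  # first 30 bases of the CDS
cds_end = "GAAATTGAGTTTTGA"                   # last 15 bases, including the stop codon

print(translate(cds_start))  # MDREVHLDDV
print(translate(cds_end))    # EIEF
```

Both fragments agree with the start (`MDREVHLDDV`) and end (`EIEF`) of the protein sequence listed above. Passing the full CDS to the same function would give the complete comparison.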