CuGenDBv2

CmoCh14G007120 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G007120
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: GBBH-like_N domain-containing protein
Genome location: Cmo_Chr14:3652531..3656965
RNA-Seq Expression: CmoCh14G007120
Synteny: CmoCh14G007120
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
IPR038492 - GBBH-like, N-terminal domain superfamily
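The genome location above uses a `chromosome:start..end` notation. A minimal Python sketch of how such a string could be split and the span length computed follows. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Cmo_Chr14:3652531..3656965")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 4435 bp on Cmo_Chr14; with 0-based half-open coordinates the result would be one base shorter.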


Homology
GenBank top hits | e value | % identity | Alignment
KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma] | 3.8e-69 | 98.53
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQRAFRTIHTTVDAPQ+TSFALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata] | 2.6e-70 | 100
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima] | 2.9e-69 | 97.79
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLAL RAFRTIHTTVD+PQ+TSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_031737737.1 uncharacterized protein LOC101219823 isoform X1 [Cucumis sativus] | 6.7e-66 | 92.65
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        ML L R FRTIHTT+DAPQ+T+FAL APKYVEVKFADGSVFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYF+HLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_038891561.1 uncharacterized protein LOC120080945 isoform X1 [Benincasa hispida] | 3.6e-67 | 94.85
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQR FRTIHTT+DAPQ+T+FALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRS+GGEKVISGRRHV IMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
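The % identity column can be related to the middle line of each alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The Python sketch below recomputes the value for the first GenBank hit (KAG7017806.1) from its two midline segments. The helper name `percent_identity` is hypothetical, and the code assumes identity is the number of identical columns divided by the alignment length, which matches the usual BLAST convention but is not confirmed by this page.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    ASSUMPTION: letters mark identical residues; '+' and ' ' mark
    non-identical columns; the denominator is the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the KAG7017806.1 alignment, both segments concatenated.
midline = (
    "MLALQRAFRTIHTTVDAPQ+TSFALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY"
    "SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK"
)

print(len(midline), round(percent_identity(midline), 2))
```

For this hit, 134 of 136 columns are identical, which rounds to the 98.53 reported in the table.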

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X1 | 2.1e-65 | 91.91
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        ML L R FRTIHTT++APQ+T+FAL APKYVEVKFADGSVFNLSAEFLR+YSPAADAKVRSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A4U5PZX0 GBBH-like_N domain-containing protein | 9.5e-58 | 77.21
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLA+Q+A R +HT++DAP++T F L+APKYVEV++A+GS FNLSAEFLR++SPA D KVRS+GGEKVI GRRHVGIMSAEPVGNYGVR++FDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        +WD+FYHLGSNKFT +RNY+KTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X1 | 2.3e-64 | 89.63
Query:  LALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYS
        +AL RAFRTIHTTV+AP++TSFALRAPK+VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  LALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYS

Query:  WDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        WDYFYHLGSNKFT LRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X1 | 1.3e-70 | 100
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X1 | 1.4e-69 | 97.79
Query:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLAL RAFRTIHTTVD+PQ+TSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink). | 1.6e-49 | 67.65
Query:  MLALQRAF-RTIHTT--VDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT
        M A+ R   R +H T  +  P+++ F++ +PK VEV++ADG+ FN S+EFLR++SPAAD KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR++FDDLHRT
Subjt:  MLALQRAF-RTIHTT--VDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT

Query:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF  +RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink). | 2.5e-26 | 64.13
Query:  MLALQRAF-RTIHTT--VDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI
        M A+ R   R +H T  +  P+++ F++ +PK VEV++ADG+ FN S+EFLR++SPAAD KVRSIGGEKVISGRR+VGIMSAEPVGNYGVRI
Subjt:  MLALQRAF-RTIHTT--VDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink). | 8.5e-43 | 63.24
Query:  MLALQRAF-RTIHTT--VDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT
        M A+ R   R +H T  +  P+++ F++ +PK VEV++ADG+ FN S+EFLR++SPAAD KVRSIGGEKVISGRR+VGIMSAEPVGNYGV        RT
Subjt:  MLALQRAF-RTIHTT--VDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT

Query:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF  +RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP


Sequences
CDS sequence
ATGTTAGCTTTGCAAAGGGCTTTCCGGACGATCCATACCACCGTTGATGCTCCGCAAGTTACCAGTTTCGCTCTTCGCGCCCCCAAATATGTAGAGGTTAAATTTGCAGA
TGGAAGTGTGTTCAACTTGTCAGCTGAATTCTTGAGAGTATATAGCCCAGCTGCTGATGCTAAGGTTCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCACG
TCGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGATGACTTGCATAGAACTGGGATTTATTCGTGGGATTATTTCTATCATCTGGGGAGC
AACAAGTTTACATTCTTGAGAAATTATGTTAAAACACTAAAGAAGCATGGACTGAGCCGAGATCCACCAAAGAGAAAGTGA
mRNA sequence
GTTCGAACCCAATGCACAGCAGTTAGGCGCGAAACTCGAACCAAAATCAGTCGGCCAAACGAATCAGAGAGACGATGTTAGCTTTGCAAAGGGCTTTCCGGACGATCCAT
ACCACCGTTGATGCTCCGCAAGTTACCAGTTTCGCTCTTCGCGCCCCCAAATATGTAGAGGTTAAATTTGCAGATGGAAGTGTGTTCAACTTGTCAGCTGAATTCTTGAG
AGTATATAGCCCAGCTGCTGATGCTAAGGTTCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCACGTCGGTATCATGTCTGCAGAACCAGTTGGGAACTATG
GGGTAAGGATCCTTTTTGATGACTTGCATAGAACTGGGATTTATTCGTGGGATTATTTCTATCATCTGGGGAGCAACAAGTTTACATTCTTGAGAAATTATGTTAAAACA
CTAAAGAAGCATGGACTGAGCCGAGATCCACCAAAGAGAAAGTGATGGACAAATCAACTTTCGTGAGCAGGTTGGAGAAAGATTCAATTGTTGAACTATAAGCATCATCT
GAGACCTCTTGGACTGTGAAGCAAGAACCTGAGCATGGAAATACACATGTTAGTGTTGCTCCTTGCAAGATTATCTATGAATTTAGTTTAATTACAAATTAGGACTTGTA
TTTCGATGGTTGGTTGGCTACTTAGTCCTTAAGTTTTGACTAACAAATTTTGATTATGACCATTGATTTGTTAAGAAGATGCTTGATAAAATAGAGGGGTATTTACACAT
GATGTTAAGTTTTAATGATCAATTGTGTAAAACCACAAAGTTATCTTGTCTCTCAG
Protein sequence
MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFYHLGS
NKFTFLRNYVKTLKKHGLSRDPPKRK
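
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The Python sketch below does this with a hand-built codon table, assuming the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1),
# with codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from this record.
CDS = (
    "ATGTTAGCTTTGCAAAGGGCTTTCCGGACGATCCATACCACCGTTGATGCTCCGCAAGTTACCAGTTTCGCTCTTCGCGCCCCCAAATATGTAGAGGTTAAATTTGCAGA"
    "TGGAAGTGTGTTCAACTTGTCAGCTGAATTCTTGAGAGTATATAGCCCAGCTGCTGATGCTAAGGTTCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCACG"
    "TCGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGATGACTTGCATAGAACTGGGATTTATTCGTGGGATTATTTCTATCATCTGGGGAGC"
    "AACAAGTTTACATTCTTGAGAAATTATGTTAAAACACTAAAGAAGCATGGACTGAGCCGAGATCCACCAAAGAGAAAGTGA"
)
PROTEIN = (
    "MLALQRAFRTIHTTVDAPQVTSFALRAPKYVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFYHLGS"
    "NKFTFLRNYVKTLKKHGLSRDPPKRK"
)

translated = translate(CDS)
print(len(CDS), len(PROTEIN), translated == PROTEIN + "*")
```

The 411-nt CDS corresponds to 136 codons plus a terminal TGA stop codon, consistent with the 136-residue protein sequence.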