; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g1235 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g1235
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionGBBH-like_N domain-containing protein
Genome locationMC08:10644441..10647196
RNA-Seq ExpressionMC08g1235
SyntenyMC08g1235
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
IPR038492 - GBBH-like, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma]6.12e-8589.63Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +AL RAFRTIHTTV+AP++TSFALRAPK VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFT LRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

XP_022154140.1 uncharacterized protein LOC111021468 isoform X1 [Momordica charantia]4.87e-93100Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata]1.50e-8589.63Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +AL RAFRTIHTTV+AP++TSFALRAPK+VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFT LRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima]1.83e-8689.63Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +ALHRAFRTIHTTV++P++TSFALRAPK+VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFT LRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

XP_038891561.1 uncharacterized protein LOC120080945 isoform X1 [Benincasa hispida]2.91e-8386.67Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +AL R FRTIHTT++AP++T+FALRAPK+VEVKFADGS+FNLSAEFLR+YSPA DAKVRS+GGEKVISGRRHV IMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

TrEMBL top hitse value%identityAlignment
A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X13.45e-8286.67Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        + L+R FRTIHTT+ AP++T+FAL APK+VEVKFADGS+FNLSAEFLRIYSPA DAKVRSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

A0A5N5JSF6 GBBH-like_N domain-containing protein1.03e-7679.26Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +A+ +AFR IHT+++APRLT F L+APK VEV++A+GS FNL+AEFLRI+SPAVD KVRS+GGEKVI GRRHVGIMSAEPVGNYGVR++FDDLHKTGIY+
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFTL+RNY++TL+KHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X12.36e-93100Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X17.27e-8689.63Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +AL RAFRTIHTTV+AP++TSFALRAPK+VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFT LRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X18.84e-8789.63Show/hide
Query:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +ALHRAFRTIHTTV++P++TSFALRAPK+VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
        WDYFYHLGSNKFT LRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink).9.3e-5070.87Show/hide
Query:  RTIHTT--VEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY
        R +H T  +  P+L+ F++ +PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR++FDDLH+TGIY WDYFY
Subjt:  RTIHTT--VEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY

Query:  HLGSNKFTLLRNYVRTLQKHGLSRDPP
         LGSNKF L+RNY++TLQKH LSR+PP
Subjt:  HLGSNKFTLLRNYVRTLQKHGLSRDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink).3.2e-2668.67Show/hide
Query:  RTIHTT--VEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI
        R +H T  +  P+L+ F++ +PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVRI
Subjt:  RTIHTT--VEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink).2.2e-4366.93Show/hide
Query:  RTIHTT--VEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY
        R +H T  +  P+L+ F++ +PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR        TGIY WDYFY
Subjt:  RTIHTT--VEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY

Query:  HLGSNKFTLLRNYVRTLQKHGLSRDPP
         LGSNKF L+RNY++TLQKH LSR+PP
Subjt:  HLGSNKFTLLRNYVRTLQKHGLSRDPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTTCACAGAGCTTTCCGGACGATCCATACCACCGTGGAGGCTCCGCGACTTACCAGTTTTGCTCTTCGCGCCCCCAAATTTGTAGAGGTTAAATTTGCAGATGG
AAGTCTGTTCAATTTGTCAGCTGAATTCTTGAGAATATATAGCCCAGCTGTTGATGCTAAGGTCAGATCAATTGGGGGTGAAAAGGTGATTTCTGGGCGGCGTCACGTGG
GTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAGACTGGGATTTATTCATGGGATTATTTCTACCATCTCGGGAGCAAC
AAATTTACACTCTTGAGAAATTATGTTAGAACACTACAGAAGCATGGGCTGAGTCGAGATCCACCTAGGAGAAAATGA
mRNA sequenceShow/hide mRNA sequence
GTATCTTTGTATAATTAAAAAAAGATTTAAGAGAAAATTAACAGGCCGCGGTTCGAACCCGATAACCAGCAAGTAGGCGCGAAAACCCGAACGAAAATCAATCAGGGAGA
TCGAGGCTGTGAACTTGTGAGATATTCGGAGCAACAATGGCGCTTCACAGAGCTTTCCGGACGATCCATACCACCGTGGAGGCTCCGCGACTTACCAGTTTTGCTCTTCG
CGCCCCCAAATTTGTAGAGGTTAAATTTGCAGATGGAAGTCTGTTCAATTTGTCAGCTGAATTCTTGAGAATATATAGCCCAGCTGTTGATGCTAAGGTCAGATCAATTG
GGGGTGAAAAGGTGATTTCTGGGCGGCGTCACGTGGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAGACTGGGATT
TATTCATGGGATTATTTCTACCATCTCGGGAGCAACAAATTTACACTCTTGAGAAATTATGTTAGAACACTACAGAAGCATGGGCTGAGTCGAGATCCACCTAGGAGAAA
ATGATGGATGAGTCAACTGTTCCCGTCTTTTGAGTATCTTTTGTTTCCCAATCGTGAGTGTTCGGGTCAATTTATCTGCCTTTCAGCTAATCAACTATGAGAATCTTGTG
CAATGATCAGAGGGAGTTCACTATTTAGCAAGGAGATAAAGAAGATAGGCTGCCTCAAACTGGCTGATGAGAAAAGATTCAACTACTGTACAAATTTAAATCTTACCTAT
ATGCATTTGGGATCCTTGACTTACTATTGAGTCAAGAACCTTAGCATGAAACTAGATGCTGATTATTGCTTTAGCTCCTTAAACTCTTGAGGTCTGGAGAGTCGGTCTTA
GGACCAGCATGAACTTGTAAAGAAAAAAATAACTATTTTATATTTTGATGAGATATTAAAATATATTTTGATGGTTGTCTATTTAATCTCTAAATTAAATATTTAAAGAT
TTCATTGTTTCGGG
Protein sequenceShow/hide protein sequence
MALHRAFRTIHTTVEAPRLTSFALRAPKFVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSN
KFTLLRNYVRTLQKHGLSRDPPRRK