CuGenDBv2

MS009787 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS009787
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: GBBH-like_N domain-containing protein
Genome location: scaffold173:280340..281680
RNA-Seq Expression: MS009787
Synteny: MS009787
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
                  IPR038492 - GBBH-like, N-terminal domain superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAE8650784.1 hypothetical protein Csa_017608 [Cucumis sativus] | e value: 2.4e-50 | % identity: 91.51
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGS+FNLSAEFLR+YSPA DAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYF+HLGSNKFTLLRNYV+TL+KHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPP+RK
Subjt:  DPPRRK

KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.8e-50 | % identity: 92.45
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYFYHLGSNKFT LRNYV+TL+KHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPP+RK
Subjt:  DPPRRK

XP_022154140.1 uncharacterized protein LOC111021468 isoform X1 [Momordica charantia] | e value: 2.3e-53 | % identity: 100
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPPRRK
Subjt:  DPPRRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata] | e value: 1.8e-50 | % identity: 92.45
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYFYHLGSNKFT LRNYV+TL+KHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPP+RK
Subjt:  DPPRRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima] | e value: 1.8e-50 | % identity: 92.45
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYFYHLGSNKFT LRNYV+TL+KHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPP+RK
Subjt:  DPPRRK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LB75 GBBH-like_N domain-containing protein | e value: 6.8e-51 | % identity: 90.65
Query:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLS
        +VEVKFADGS+FNLSAEFLR+YSPA DAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYF+HLGSNKFTLLRNYV+TL+KHGLS
Subjt:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLS

Query:  RDPPRRK
        RDPP+RK
Subjt:  RDPPRRK

A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X1 | e value: 1.2e-50 | % identity: 93.4
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGS+FNLSAEFLRIYSPA DAKVRSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYFYHLGSNKFTLLRNYV+TL+KHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPP+RK
Subjt:  DPPRRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X1 | e value: 1.1e-53 | % identity: 100
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPPRRK
Subjt:  DPPRRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X1 | e value: 8.8e-51 | % identity: 92.45
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYFYHLGSNKFT LRNYV+TL+KHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPP+RK
Subjt:  DPPRRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X1 | e value: 8.8e-51 | % identity: 92.45
Query:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR
        VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYFYHLGSNKFT LRNYV+TL+KHGLSR
Subjt:  VEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSR

Query:  DPPRRK
        DPP+RK
Subjt:  DPPRRK

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink). | e value: 1.3e-46 | % identity: 79.81
Query:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLS
        QVEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR++FDDLH+TGIY WDYFY LGSNKF L+RNY++TLQKH LS
Subjt:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLS

Query:  RDPP
        R+PP
Subjt:  RDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink). | e value: 4.5e-23 | % identity: 83.33
Query:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI
        QVEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVRI
Subjt:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink). | e value: 3.1e-40 | % identity: 75
Query:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLS
        QVEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR        TGIY WDYFY LGSNKF L+RNY++TLQKH LS
Subjt:  QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLS

Query:  RDPP
        R+PP
Subjt:  RDPP
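
The % identity values in the hit lists above can be cross-checked against the alignment midlines, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch, assuming the midline rows have already been trimmed to the aligned columns; the function name `midline_identity` is hypothetical and not part of CuGenDB or BLAST. It uses the two midline rows of the KAE8650784.1 hit as input.

```python
def midline_identity(midline_rows):
    """Percent identity computed from BLAST-style midline rows.

    Letters count as identical positions; '+' and ' ' count as
    non-identical. Rows are assumed to contain only aligned columns.
    """
    midline = "".join(midline_rows)
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Midline rows copied from the KAE8650784.1 (Cucumis sativus) alignment.
rows = [
    "VEVKFADGS+FNLSAEFLR+YSPA DAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYF+HLGSNKFTLLRNYV+TL+KHGLSR",
    "DPP+RK",
]
identity = midline_identity(rows)
print(f"{identity:.2f}")
```

With 97 identical positions out of 106 aligned columns, this should agree with the 91.51 listed for that hit.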


Sequences
CDS sequence
CAGGTAGAGGTTAAATTTGCAGATGGAAGTCTGTTCAATTTGTCAGCTGAATTCTTGAGAATATATAGCCCAGCTGTTGATGCTAAGGTCAGATCAATTGGGGGTGAAAA
GGTGATTTCTGGGCGGCGTCACGTGGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAGACTGGGATTTATTCATGGG
ATTATTTCTACCATCTCGGGAGCAACAAATTTACACTCTTGAGAAATTATGTTAGAACACTACAGAAGCATGGGCTGAGTCGAGATCCACCTAGGAGAAAA
mRNA sequence
CAGGTAGAGGTTAAATTTGCAGATGGAAGTCTGTTCAATTTGTCAGCTGAATTCTTGAGAATATATAGCCCAGCTGTTGATGCTAAGGTCAGATCAATTGGGGGTGAAAA
GGTGATTTCTGGGCGGCGTCACGTGGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAGACTGGGATTTATTCATGGG
ATTATTTCTACCATCTCGGGAGCAACAAATTTACACTCTTGAGAAATTATGTTAGAACACTACAGAAGCATGGGCTGAGTCGAGATCCACCTAGGAGAAAA
Protein sequence
QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK
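
As a consistency check, the CDS listed above can be translated with the standard genetic code and compared with the listed protein sequence. This is a minimal sketch in plain Python; the names `CODON_TABLE` and `translate` are illustrative helpers, not part of any CuGenDB tooling, and the standard nuclear code is assumed.

```python
from itertools import product

# Standard genetic code, in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; a trailing partial codon is ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# CDS of MS009787, copied from the record above.
CDS = (
    "CAGGTAGAGGTTAAATTTGCAGATGGAAGTCTGTTCAATTTGTCAGCTGAATTCTTGAGAATATATAGCCCAGCTGTTGATGCTAAGGTCAGATCAATTGGGGGTGAAAA"
    "GGTGATTTCTGGGCGGCGTCACGTGGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAGACTGGGATTTATTCATGGG"
    "ATTATTTCTACCATCTCGGGAGCAACAAATTTACACTCTTGAGAAATTATGTTAGAACACTACAGAAGCATGGGCTGAGTCGAGATCCACCTAGGAGAAAA"
)
# Protein sequence of MS009787, copied from the record above.
PROTEIN = (
    "QVEVKFADGSLFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGSNKFTLLRNYVRTLQKHGLSRDPPRRK"
)

print(translate(CDS) == PROTEIN)
```

The CDS as listed is 321 nt (107 codons), starts with CAG (Q) rather than ATG, and contains no stop codon, so the gene model may be partial.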