CuGenDBv2

ClCG05G013170 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G013170
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: GBBH-like_N domain-containing protein
Genome location: CG_Chr05:18474236..18479771
RNA-Seq Expression: ClCG05G013170
Synteny: ClCG05G013170
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
    IPR038492 - GBBH-like, N-terminal domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
XP_008450051.1 PREDICTED: uncharacterized protein LOC103491759 isoform X1 [Cucumis melo] | 2.7e-67 | 94.12
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        ML L R+FRTIHTTL+APQITTFALHAPKYVEVKFADG+VFNLSAEFLR+YSPAADAK+RSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata] | 2.1e-67 | 94.12
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQR FRTIHTT+DAPQ+T+FAL APKYVEVKFADG+VFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima] | 1.4e-66 | 93.38
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLAL R FRTIHTT+D+PQIT+FAL APKYVEVKFADG+VFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

XP_031737737.1 uncharacterized protein LOC101219823 isoform X1 [Cucumis sativus] | 2.5e-68 | 96.32
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        ML L R+FRTIHTTLDAPQITTFALHAPKYVEVKFADG+VFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYF+HLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

XP_038891561.1 uncharacterized protein LOC120080945 isoform X1 [Benincasa hispida] | 1.2e-67 | 95.59
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQRVFRTIHTT+DAPQITTFAL APKYVEVKFADG+VFNLSAEFLRVYSPAADAK+RS+GGEKVISGRRHV IMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X1 | 1.3e-67 | 94.12
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        ML L R+FRTIHTTL+APQITTFALHAPKYVEVKFADG+VFNLSAEFLR+YSPAADAK+RSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

A0A2I4H1X3 uncharacterized protein LOC109012788 isoform X1 | 5.6e-58 | 80.15
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLA++R  R IHT +DAP++T FALHAPK VEV FADG+VFNL+AEFLRV SPA D KIRS+GGEKVISGRRHVGIMSAEPVGNYGVRI+FDDLH+TGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
         WDYFYHLGSNKFTL+++Y+K LKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X1 | 5.7e-63 | 85.93
Query:  LALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYS
        +AL R FRTIHTT++AP++T+FAL APK+VEVKFADG++FNLSAEFLR+YSPA DAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  LALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYS

Query:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X1 | 1.0e-67 | 94.12
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQR FRTIHTT+DAPQ+T+FAL APKYVEVKFADG+VFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X1 | 6.5e-67 | 93.38
Query:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLAL R FRTIHTT+D+PQIT+FAL APKYVEVKFADG+VFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink). | 1.1e-50 | 69.12
Query:  MLALQRVF-RTIHTT--LDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT
        M A+ RV  R +H T  +  P+++ F++ +PK VEV++ADGT FN S+EFLR++SPAAD K+RSIGGEKVISGRR+VGIMSAEPVGNYGVR++FDDLHRT
Subjt:  MLALQRVF-RTIHTT--LDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT

Query:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink). | 6.5e-27 | 65.22
Query:  MLALQRVF-RTIHTT--LDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRI
        M A+ RV  R +H T  +  P+++ F++ +PK VEV++ADGT FN S+EFLR++SPAAD K+RSIGGEKVISGRR+VGIMSAEPVGNYGVRI
Subjt:  MLALQRVF-RTIHTT--LDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink). | 5.9e-44 | 64.71
Query:  MLALQRVF-RTIHTT--LDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT
        M A+ RV  R +H T  +  P+++ F++ +PK VEV++ADGT FN S+EFLR++SPAAD K+RSIGGEKVISGRR+VGIMSAEPVGNYGV        RT
Subjt:  MLALQRVF-RTIHTT--LDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT

Query:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP
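
The % identity values in the hit tables above can be related to the alignment midlines (the unlabeled row between Query and Subjt). In BLAST-style pairwise output, a letter in the midline marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. For the first GenBank hit, 128 of the 136 aligned columns carry a letter, which gives 94.12. The following is a minimal Python sketch of that calculation; the function name percent_identity is hypothetical and not part of CuGenDB or BLAST, and it assumes each midline segment has already been cut to line up with its query segment.

```python
def percent_identity(query_segments, midline_segments):
    """Percent identity from BLAST-style alignment segments.

    query_segments: the Query rows of the alignment (gaps included).
    midline_segments: the matching midline rows; letters mark identities,
    '+' marks similar residues, spaces mark mismatches or gaps.
    """
    # Alignment length is the number of aligned columns, gaps included.
    aligned_length = sum(len(q) for q in query_segments)
    if aligned_length == 0:
        raise ValueError("empty alignment")
    # Only letters in the midline count as identical positions.
    identical = sum(ch.isalpha() for m in midline_segments for ch in m)
    return round(100.0 * identical / aligned_length, 2)


# First 20 columns of the first GenBank hit: 16 of 20 columns are identical.
print(percent_identity(["MLALQRVFRTIHTTLDAPQI"], ["ML L R+FRTIHTTL+APQI"]))  # 80.0
```

The query rows are used only to obtain the alignment length, because trailing spaces in a midline are easily lost when the text is copied.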


Sequences
CDS sequence
ATGTTAGCTTTGCAAAGAGTTTTCCGGACGATCCATACCACTCTCGATGCTCCTCAAATTACCACTTTCGCTCTTCACGCCCCCAAATATGTAGAGGTTAAATTTGCAGA
TGGAACTGTGTTCAACTTGTCAGCTGAATTCTTGAGAGTATATAGTCCAGCTGCTGATGCTAAAATCCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCATG
TCGGTATCATGTCTGCTGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGATGATTTGCATAGAACTGGGATTTATTCGTGGGACTATTTCTATCATCTCGGGAGC
AACAAATTTACACTCTTGAGAAATTATGTTAAAACACTGAAGAAGCATGGGCTGAGCCGAGATCCACCAAAGAGAAAATGA
mRNA sequence
ATGTTAGCTTTGCAAAGAGTTTTCCGGACGATCCATACCACTCTCGATGCTCCTCAAATTACCACTTTCGCTCTTCACGCCCCCAAATATGTAGAGGTTAAATTTGCAGA
TGGAACTGTGTTCAACTTGTCAGCTGAATTCTTGAGAGTATATAGTCCAGCTGCTGATGCTAAAATCCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCATG
TCGGTATCATGTCTGCTGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGATGATTTGCATAGAACTGGGATTTATTCGTGGGACTATTTCTATCATCTCGGGAGC
AACAAATTTACACTCTTGAGAAATTATGTTAAAACACTGAAGAAGCATGGGCTGAGCCGAGATCCACCAAAGAGAAAATGA
Protein sequence
MLALQRVFRTIHTTLDAPQITTFALHAPKYVEVKFADGTVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFYHLGS
NKFTLLRNYVKTLKKHGLSRDPPKRK
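
As a consistency check, the CDS sequence above can be translated and compared with the listed protein sequence. The following is a minimal Python sketch under the assumption that the standard nuclear genetic code applies; the names CODON_TABLE and translate are illustrative and do not come from CuGenDB. Line breaks in the pasted sequence are ignored, translation stops at the first stop codon, and any codon containing a non-ACGT character is reported as "X".

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {}
for i, first in enumerate(BASES):
    for j, second in enumerate(BASES):
        for k, third in enumerate(BASES):
            CODON_TABLE[first + second + third] = AMINO_ACIDS[16 * i + 4 * j + k]


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGTTAGCTTTGCAAAGA"))  # MLALQR
```

Passing the full four-line CDS to translate and comparing the result with the protein sequence (with its line break removed) would confirm that the two entries agree.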