CuGenDBv2

Spg001889 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg001889
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: GBBH-like_N domain-containing protein
Genome location: scaffold10:1652399..1656921
RNA-Seq Expression: Spg001889
Synteny: Spg001889
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
                  IPR038492 - GBBH-like, N-terminal domain superfamily
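
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch of reading it programmatically, the snippet below splits the string and computes the span of the gene on the scaffold. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("scaffold10:1652399..1656921")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 4523 bp of scaffold10.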


Homology
GenBank top hits | e value | % identity
KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma] | 1.1e-65 | 92.65
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL +AFRTIHTTVDAPQITSFALR PK VEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022154140.1 uncharacterized protein LOC111021468 isoform X1 [Momordica charantia] | 8.8e-66 | 91.85
Query:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +ALH+AFRTIHTTV+AP++TSFALR PK+VEVKFADGS+FNLSAEFLRIYSPAVDAKVRSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata] | 1.4e-66 | 92.65
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL +AFRTIHTTVDAPQ+TSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima] | 2.7e-67 | 93.38
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALH+AFRTIHTTVD+PQITSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_031737737.1 uncharacterized protein LOC101219823 isoform X1 [Cucumis sativus] | 4.4e-65 | 89.71
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        ML LH+ FRTIHTT+DAPQIT+FAL  PKYVEVKFADGSVFNLSAEFLR+YSPA DAK+RSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYF+HLGSNKFTLLRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

TrEMBL top hits | e value | % identity
A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X1 | 5.2e-64 | 89.71
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        ML L++ FRTIHTT++APQIT+FAL  PKYVEVKFADGSVFNLSAEFLRIYSPA DAKVRSIGGEKVI GRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A4U5PZX0 GBBH-like_N domain-containing protein | 1.1e-58 | 79.41
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLA+ KA R +HT++DAP++T F L+ PKYVEV++A+GS FNLSAEFLRI+SPAVD KVRS+GGEKVI GRR+VGIMSAEPVGNYGVR++FDDLHKTGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        +WD+FYHLGSNKFTL+RNY+KTLKKHGLSRDPPRRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X1 | 4.2e-66 | 91.85
Query:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +ALH+AFRTIHTTV+AP++TSFALR PK+VEVKFADGS+FNLSAEFLRIYSPAVDAKVRSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X1 | 6.5e-67 | 92.65
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL +AFRTIHTTVDAPQ+TSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X1 | 1.3e-67 | 93.38
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALH+AFRTIHTTVD+PQITSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink). | 7.2e-50 | 70.87
Query:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY
        R +H T  +  P+++ F++  PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRRYVGIMSAEPVGNYGVR++FDDLH+TGIY WDYFY
Subjt:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY

Query:  HLGSNKFTLLRNYVKTLKKHGLSRDPP
         LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  HLGSNKFTLLRNYVKTLKKHGLSRDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink). | 1.5e-26 | 68.67
Query:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRI
        R +H T  +  P+++ F++  PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRRYVGIMSAEPVGNYGVRI
Subjt:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink). | 1.7e-43 | 66.93
Query:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY
        R +H T  +  P+++ F++  PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRRYVGIMSAEPVGNYGVR        TGIY WDYFY
Subjt:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY

Query:  HLGSNKFTLLRNYVKTLKKHGLSRDPP
         LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  HLGSNKFTLLRNYVKTLKKHGLSRDPP
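
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch. As a rough sketch of how the % identity column relates to that midline, the snippet below counts those marks for the second block of the first GenBank hit. The function name `alignment_stats` is hypothetical, and the sketch assumes the midline is exactly as long as the aligned block (no trailing whitespace lost).

```python
def alignment_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned columns) for a BLAST-style midline."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative substitutions
    return identities, positives, len(midline)


# Midline of the second alignment block of hit KAG7017806.1 (copied from the record).
midline = "SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK"
identities, positives, length = alignment_stats(midline)
percent_identity = 100 * identities / length
print(f"{identities}/{length} identical ({percent_identity:.2f}%), {positives} positives")
```

Counted the same way across both blocks of that hit, the midline appears to give 126 identical columns out of 136, which is consistent with the reported 92.65.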


Sequences
CDS sequence
ATGTTAGCTTTGCATAAAGCTTTCCGGACGATCCATACCACCGTCGATGCTCCTCAAATTACCAGTTTCGCTCTTCGCGTCCCCAAATATGTAGAGGTGAAATTTGCAGA
TGGAAGTGTGTTCAACTTGTCAGCTGAATTCTTGAGAATATATAGTCCAGCTGTTGATGCTAAGGTCCGATCAATTGGAGGTGAAAAGGTAATTTCTGGACGGCGTTATG
TTGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAAACTGGGATTTATTCATGGGATTATTTCTACCATCTTGGGAGC
AACAAATTTACGCTCTTGAGAAATTATGTTAAAACACTGAAGAAGCACGGGCTGAGCCGAGATCCACCAAGGAGAAAATGA
mRNA sequence
ATGTTAGCTTTGCATAAAGCTTTCCGGACGATCCATACCACCGTCGATGCTCCTCAAATTACCAGTTTCGCTCTTCGCGTCCCCAAATATGTAGAGGTGAAATTTGCAGA
TGGAAGTGTGTTCAACTTGTCAGCTGAATTCTTGAGAATATATAGTCCAGCTGTTGATGCTAAGGTCCGATCAATTGGAGGTGAAAAGGTAATTTCTGGACGGCGTTATG
TTGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAAACTGGGATTTATTCATGGGATTATTTCTACCATCTTGGGAGC
AACAAATTTACGCTCTTGAGAAATTATGTTAAAACACTGAAGAAGCACGGGCTGAGCCGAGATCCACCAAGGAGAAAATGA
Protein sequence
MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGS
NKFTLLRNYVKTLKKHGLSRDPPRRK
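
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and uses only the first 30 nucleotides and the final wrapped line of the CDS as listed in this record; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS and its final wrapped line, copied from the record.
cds_start = "ATGTTAGCTTTGCATAAAGCTTTCCGGACG"
cds_end = "AACAAATTTACGCTCTTGAGAAATTATGTTAAAACACTGAAGAAGCACGGGCTGAGCCGAGATCCACCAAGGAGAAAATGA"

print(translate(cds_start))  # MLALHKAFRT
print(translate(cds_end))    # NKFTLLRNYVKTLKKHGLSRDPPRRK
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final TGA codon is read as the stop.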