; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027524 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027524
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGBBH-like_N domain-containing protein
Genome locationchr8:1717630..1719727
RNA-Seq ExpressionLag0027524
SyntenyLag0027524
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
IPR038492 - GBBH-like, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma]5.7e-6591.18Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL +AFRTIHTTVDAPQITSFALR PK VEVKFADGSVFNLSAEFLR+YSPA D+KVR+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022154140.1 uncharacterized protein LOC111021468 isoform X1 [Momordica charantia]4.4e-6590.37Show/hide
Query:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +ALH+AFRTIHTTV+AP++TSFALR PK+VEVKFADGS+FNLSAEFLRIYSPAVD+KVR+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata]6.7e-6691.18Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL +AFRTIHTTVDAPQ+TSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA D+KVR+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima]1.4e-6691.91Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALH+AFRTIHTTVD+PQITSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA D+KVR+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_031737737.1 uncharacterized protein LOC101219823 isoform X1 [Cucumis sativus]2.2e-6488.24Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        ML LH+ FRTIHTT+DAPQIT+FAL  PKYVEVKFADGSVFNLSAEFLR+YSPA D+K+R+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYF+HLGSNKFTLLRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

TrEMBL top hitse value%identityAlignment
A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X12.6e-6388.24Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        ML L++ FRTIHTT++APQIT+FAL  PKYVEVKFADGSVFNLSAEFLRIYSPA D+KVR+IGGEKVI GRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X12.1e-6590.37Show/hide
Query:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +ALH+AFRTIHTTV+AP++TSFALR PK+VEVKFADGS+FNLSAEFLRIYSPAVD+KVR+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  LALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X13.2e-6691.18Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL +AFRTIHTTVDAPQ+TSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA D+KVR+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X16.5e-6791.91Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALH+AFRTIHTTVD+PQITSFALR PKYVEVKFADGSVFNLSAEFLR+YSPA D+KVR+IGGEKVISGRR+VGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A7N2KR88 GBBH-like_N domain-containing protein1.9e-5882.35Show/hide
Query:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLA+  A R IHTTVDAP++T FAL  PK VEV FADGS+FNLSAEFLR++SPAVDSK+R+IGGEKVISGRR+VGIMSAEPVGNYGVRI+FDDLHKTGIY
Subjt:  MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFY LGSNKFTL+RNY+KTLKKHGL+RDP RRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink).1.2e-4970.08Show/hide
Query:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY
        R +H T  +  P+++ F++  PK VEV++ADG+ FN S+EFLRI+SPA D KVR+IGGEKVISGRRYVGIMSAEPVGNYGVR++FDDLH+TGIY WDYFY
Subjt:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY

Query:  HLGSNKFTLLRNYVKTLKKHGLSRDPP
         LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  HLGSNKFTLLRNYVKTLKKHGLSRDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink).2.5e-2667.47Show/hide
Query:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRI
        R +H T  +  P+++ F++  PK VEV++ADG+ FN S+EFLRI+SPA D KVR+IGGEKVISGRRYVGIMSAEPVGNYGVRI
Subjt:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink).2.9e-4366.14Show/hide
Query:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY
        R +H T  +  P+++ F++  PK VEV++ADG+ FN S+EFLRI+SPA D KVR+IGGEKVISGRRYVGIMSAEPVGNYGVR        TGIY WDYFY
Subjt:  RTIHTT--VDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFY

Query:  HLGSNKFTLLRNYVKTLKKHGLSRDPP
         LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  HLGSNKFTLLRNYVKTLKKHGLSRDPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGCTTTGCATAAAGCTTTCCGGACGATCCATACCACCGTCGATGCTCCTCAAATTACCAGTTTCGCTCTTCGCGTCCCGAAATATGTAGAGGTGAAATTTGCAGA
TGGAAGTGTGTTCAATTTGTCAGCTGAATTCTTGAGAATATATAGTCCAGCTGTTGATTCTAAGGTCCGAACAATTGGGGGTGAAAAGGTAATTTCTGGACGGCGTTATG
TTGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAAACTGGGATTTATTCATGGGATTATTTCTACCATCTTGGGAGC
AACAAATTTACGCTCTTGAGAAATTATGTTAAAACACTGAAGAAGCACGGGCTGAGCCGAGATCCACCAAGGAGAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGCTTTGCATAAAGCTTTCCGGACGATCCATACCACCGTCGATGCTCCTCAAATTACCAGTTTCGCTCTTCGCGTCCCGAAATATGTAGAGGTGAAATTTGCAGA
TGGAAGTGTGTTCAATTTGTCAGCTGAATTCTTGAGAATATATAGTCCAGCTGTTGATTCTAAGGTCCGAACAATTGGGGGTGAAAAGGTAATTTCTGGACGGCGTTATG
TTGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAAACTGGGATTTATTCATGGGATTATTTCTACCATCTTGGGAGC
AACAAATTTACGCTCTTGAGAAATTATGTTAAAACACTGAAGAAGCACGGGCTGAGCCGAGATCCACCAAGGAGAAAATGA
Protein sequenceShow/hide protein sequence
MLALHKAFRTIHTTVDAPQITSFALRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDSKVRTIGGEKVISGRRYVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLGS
NKFTLLRNYVKTLKKHGLSRDPPRRK