CuGenDBv2

Carg09684 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg09684
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: GBBH-like_N domain-containing protein
Genome location: Carg_Chr14:3535045..3539347
RNA-Seq Expression: Carg09684
Synteny: Carg09684
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
                  IPR038492 - GBBH-like, N-terminal domain superfamily
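
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the `Location` type and `parse_location` helper are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location written as 'seqid:start..end'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("Carg_Chr14:3535045..3539347")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4303 bp on Carg_Chr14.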


Homology
GenBank top hits (e value, % identity, alignment)
KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.2e-70; % identity: 100)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata] (e value: 2.9e-69; % identity: 98.53)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQRAFRTIHTTVDAPQ+TSFALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima] (e value: 1.9e-68; % identity: 97.79)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLAL RAFRTIHTTVD+PQITSFALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_031737737.1 uncharacterized protein LOC101219823 isoform X1 [Cucumis sativus] (e value: 4.4e-65; % identity: 92.65)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        ML L R FRTIHTT+DAPQIT+FAL APK VEVKFADGSVFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYF+HLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

XP_038891561.1 uncharacterized protein LOC120080945 isoform X1 [Benincasa hispida] (e value: 2.3e-66; % identity: 94.85)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQR FRTIHTT+DAPQIT+FALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRS+GGEKVISGRRHV IMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
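
The % identity values reported for each hit can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The sketch below is illustrative only: `percent_identity` is a hypothetical helper, and it assumes identity is computed as identical columns divided by total alignment columns. It uses the Cucurbita moschata hit (XP_022935010.1) from the listing above.

```python
def percent_identity(query_blocks, midline_blocks):
    """Return (identities, alignment_length, percent) from BLAST-style blocks.

    ASSUMPTION: identity = identical columns / all alignment columns.
    """
    query = "".join(query_blocks)
    midline = "".join(midline_blocks)
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identities, len(query), 100.0 * identities / len(query)


# Query and midline blocks copied from the Cucurbita moschata alignment.
QUERY = [
    "MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY",
    "SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK",
]
MIDLINE = [
    "MLALQRAFRTIHTTVDAPQ+TSFALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY",
    "SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK",
]

identities, length, pct = percent_identity(QUERY, MIDLINE)
print(f"{identities}/{length} = {pct:.2f}%")  # prints "134/136 = 98.53%"
```

The result agrees with the 98.53 % identity listed for this hit. The spaces inside the midline strings are significant and must be preserved when copying them.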

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X1 (e value: 1.4e-64; % identity: 91.91)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        ML L R FRTIHTT++APQIT+FAL APK VEVKFADGSVFNLSAEFLR+YSPAADAKVRSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A2I4H1X3 uncharacterized protein LOC109012788 isoform X1 (e value: 6.6e-59; % identity: 80.88)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLA++RA R IHT VDAP++T FAL APKCVEV FADGSVFNL+AEFLRV SPA D K+RS+GGEKVISGRRHVGIMSAEPVGNYGVRI+FDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
         WDYFYHLGSNKFT +++Y+K LKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X1 (e value: 5.2e-64; % identity: 89.63)
Query:  LALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYS
        +AL RAFRTIHTTV+AP++TSFALRAPK VEVKFADGS+FNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYS
Subjt:  LALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYS

Query:  WDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        WDYFYHLGSNKFT LRNYV+TL+KHGLSRDPP+RK
Subjt:  WDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X1 (e value: 1.4e-69; % identity: 98.53)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLALQRAFRTIHTTVDAPQ+TSFALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X1 (e value: 9.1e-69; % identity: 97.79)
Query:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
        MLAL RAFRTIHTTVD+PQITSFALRAPK VEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIY

Query:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
        SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK
Subjt:  SWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPPKRK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink). (e value: 1.6e-49; % identity: 67.65)
Query:  MLALQRAF-RTIHTT--VDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT
        M A+ R   R +H T  +  P+++ F++ +PK VEV++ADG+ FN S+EFLR++SPAAD KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR++FDDLHRT
Subjt:  MLALQRAF-RTIHTT--VDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT

Query:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF  +RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink). (e value: 2.5e-26; % identity: 64.13)
Query:  MLALQRAF-RTIHTT--VDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI
        M A+ R   R +H T  +  P+++ F++ +PK VEV++ADG+ FN S+EFLR++SPAAD KVRSIGGEKVISGRR+VGIMSAEPVGNYGVRI
Subjt:  MLALQRAF-RTIHTT--VDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink). (e value: 8.5e-43; % identity: 63.24)
Query:  MLALQRAF-RTIHTT--VDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT
        M A+ R   R +H T  +  P+++ F++ +PK VEV++ADG+ FN S+EFLR++SPAAD KVRSIGGEKVISGRR+VGIMSAEPVGNYGV        RT
Subjt:  MLALQRAF-RTIHTT--VDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRT

Query:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF  +RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTFLRNYVKTLKKHGLSRDPP


Sequences
CDS sequence
ATGTTAGCTTTGCAAAGGGCTTTCCGGACGATCCATACCACCGTTGATGCTCCGCAAATTACAAGTTTCGCTCTTCGCGCCCCCAAATGTGTAGAGGTTAAATTTGCAGA
TGGAAGTGTGTTCAACTTGTCAGCTGAATTCTTGAGAGTATATAGCCCAGCTGCTGATGCTAAGGTTCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCACG
TCGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGATGACTTACATAGAACTGGGATTTATTCGTGGGATTATTTCTATCATCTGGGGAGC
AACAAGTTTACATTCTTGAGAAATTATGTTAAAACACTAAAGAAGCATGGACTGAGCCGAGATCCACCGAAGAGAAAGTGA
mRNA sequence
GAATAATTGTTGAGAAAAGTTAAGAGGCCGTGGTTCGAACCCAATGCACAGCAGTTAGGCGCGAAACTGGAACCAAAATCAGTCGGCCAAACGAATCAGAGAGACGATGT
TAGCTTTGCAAAGGGCTTTCCGGACGATCCATACCACCGTTGATGCTCCGCAAATTACAAGTTTCGCTCTTCGCGCCCCCAAATGTGTAGAGGTTAAATTTGCAGATGGA
AGTGTGTTCAACTTGTCAGCTGAATTCTTGAGAGTATATAGCCCAGCTGCTGATGCTAAGGTTCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCACGTCGG
TATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGATGACTTACATAGAACTGGGATTTATTCGTGGGATTATTTCTATCATCTGGGGAGCAACA
AGTTTACATTCTTGAGAAATTATGTTAAAACACTAAAGAAGCATGGACTGAGCCGAGATCCACCGAAGAGAAAGTGATGGACAAATCAACTGTCGTGAGCAGGTTGGAGA
AAGATTCAATTGTTGAACTAGAAGCATCATCTGAGACCTCTTGGACTGTGAAGCAAGAACCTGAGCATGGAAATACACATATTAGTGTAGCTCCTTGCAAGTTTATCTAT
GAATTTAGTTTAATTACAGATTAGGACTTGTATTTCGATGGTTGGTTGGCT
Protein sequence
MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFYHLGS
NKFTFLRNYVKTLKKHGLSRDPPKRK
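
The CDS and protein sequences listed above can be checked against each other by translating the CDS. This is a minimal sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), assumes the CDS is in frame from its first base, and `translate` is a hypothetical helper written with the Python standard library only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; a stop codon is rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein copied from the record, joined across the wrapped lines.
CDS = "".join([
    "ATGTTAGCTTTGCAAAGGGCTTTCCGGACGATCCATACCACCGTTGATGCTCCGCAAATTACAAGTTTCGCTCTTCGCGCCCCCAAATGTGTAGAGGTTAAATTTGCAGA",
    "TGGAAGTGTGTTCAACTTGTCAGCTGAATTCTTGAGAGTATATAGCCCAGCTGCTGATGCTAAGGTTCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGACGACGTCACG",
    "TCGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGATGACTTACATAGAACTGGGATTTATTCGTGGGATTATTTCTATCATCTGGGGAGC",
    "AACAAGTTTACATTCTTGAGAAATTATGTTAAAACACTAAAGAAGCATGGACTGAGCCGAGATCCACCGAAGAGAAAGTGA",
])
PROTEIN = "".join([
    "MLALQRAFRTIHTTVDAPQITSFALRAPKCVEVKFADGSVFNLSAEFLRVYSPAADAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFYHLGS",
    "NKFTFLRNYVKTLKKHGLSRDPPKRK",
])

translated = translate(CDS)
# The CDS ends with a stop codon, which the listed protein omits.
print(translated.rstrip("*") == PROTEIN)
```

The 411-nt CDS corresponds to 136 codons plus a terminal TGA stop, matching the 136-residue protein sequence.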