; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008515 (gene) of Snake gourd v1 genome

Gene IDTan0008515
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGBBH-like_N domain-containing protein
Genome locationLG07:65719430..65721751
RNA-Seq ExpressionTan0008515
SyntenyTan0008515
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
IPR038492 - GBBH-like, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma]8.0e-6794.12Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALQRAFRTIHTTVDAPQITSF+LR PK VEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022154140.1 uncharacterized protein LOC111021468 isoform X1 [Momordica charantia]2.0e-6591.85Show/hide
Query:  LALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +AL RAFRTIHTTV+AP++TSF+LR PK+VEVKFADGS+FNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  LALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata]9.4e-6894.12Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALQRAFRTIHTTVDAPQ+TSF+LR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima]6.1e-6793.38Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL RAFRTIHTTVD+PQITSF+LR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

XP_038891561.1 uncharacterized protein LOC120080945 isoform X1 [Benincasa hispida]5.2e-6691.91Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALQR FRTIHTT+DAPQIT+F+LR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRS+GGEKVISGRRHV IMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

TrEMBL top hitse value%identityAlignment
A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X11.8e-6490.44Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        ML L R FRTIHTT++APQIT+F+L  PKYVEVKFADGSVFNLSAEFLRIYSPA DAKVRSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A4U5PZX0 GBBH-like_N domain-containing protein1.3e-5980.15Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLA+Q+A R +HT++DAP++T F+L+ PKYVEV++A+GS FNLSAEFLRI+SPAVD KVRS+GGEKVI GRRHVGIMSAEPVGNYGVR++FDDLHKTGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        +WD+FYHLGSNKFTL+RNY+KTLKKHGLSRDPPRRK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X19.5e-6691.85Show/hide
Query:  LALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
        +AL RAFRTIHTTV+AP++TSF+LR PK+VEVKFADGS+FNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS
Subjt:  LALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYS

Query:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        WDYFYHLGSNKFTLLRNYV+TL+KHGLSRDPPRRK
Subjt:  WDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X14.6e-6894.12Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLALQRAFRTIHTTVDAPQ+TSF+LR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X13.0e-6793.38Show/hide
Query:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY
        MLAL RAFRTIHTTVD+PQITSF+LR PKYVEVKFADGSVFNLSAEFLR+YSPA DAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIY
Subjt:  MLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIY

Query:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK
        SWDYFYHLGSNKFT LRNYVKTLKKHGLSRDPP+RK
Subjt:  SWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPPRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink).1.2e-4968.38Show/hide
Query:  MLALQRAF-RTIHTT--VDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKT
        M A+ R   R +H T  +  P+++ FS+  PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR++FDDLH+T
Subjt:  MLALQRAF-RTIHTT--VDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKT

Query:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink).3.3e-2665.22Show/hide
Query:  MLALQRAF-RTIHTT--VDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI
        M A+ R   R +H T  +  P+++ FS+  PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVRI
Subjt:  MLALQRAF-RTIHTT--VDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink).2.9e-4364.71Show/hide
Query:  MLALQRAF-RTIHTT--VDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKT
        M A+ R   R +H T  +  P+++ FS+  PK VEV++ADG+ FN S+EFLRI+SPA D KVRSIGGEKVISGRR+VGIMSAEPVGNYGVR        T
Subjt:  MLALQRAF-RTIHTT--VDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKT

Query:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP
        GIY WDYFY LGSNKF L+RNY+KTL+KH LSR+PP
Subjt:  GIYSWDYFYHLGSNKFTLLRNYVKTLKKHGLSRDPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTTAGCTTTGCAAAGAGCTTTCCGGACGATCCATACCACCGTCGATGCTCCGCAAATTACCAGTTTTTCTCTTCGCGTCCCCAAATATGTAGAGGTTAAATTCGC
AGATGGAAGTGTGTTCAATTTGTCAGCTGAATTCCTGAGAATATATAGCCCAGCTGTTGATGCTAAGGTCCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGGCGGCGCC
ATGTCGGTATCATGTCTGCAGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAAACTGGGATTTATTCATGGGATTATTTCTACCATCTCGGA
AGCAACAAATTTACACTTCTGAGAAATTATGTTAAAACGCTGAAAAAGCACGGGTTGAGCCGAGATCCACCAAGGAGAAAATGA
mRNA sequenceShow/hide mRNA sequence
CCAATACGAGCAAGTAGGCGGGAATAACTCGAACGAAAATCAATCAGCGAAACGAATCAAATCGAATCGAGTTTGTGAGATATTCAGAAAAATGATGTTAGCTTTGCAAA
GAGCTTTCCGGACGATCCATACCACCGTCGATGCTCCGCAAATTACCAGTTTTTCTCTTCGCGTCCCCAAATATGTAGAGGTTAAATTCGCAGATGGAAGTGTGTTCAAT
TTGTCAGCTGAATTCCTGAGAATATATAGCCCAGCTGTTGATGCTAAGGTCCGGTCAATTGGGGGTGAAAAGGTAATTTCTGGGCGGCGCCATGTCGGTATCATGTCTGC
AGAACCAGTTGGGAACTATGGGGTAAGGATACTTTTTGATGACTTGCATAAAACTGGGATTTATTCATGGGATTATTTCTACCATCTCGGAAGCAACAAATTTACACTTC
TGAGAAATTATGTTAAAACGCTGAAAAAGCACGGGTTGAGCCGAGATCCACCAAGGAGAAAATGATGGACGAGTCAACCGCCCCCAGCTATGAGCATCCATATGTTAGAT
AGTCGTGAGTGTTCGGGTCAACTAACATGCTTCCTAGCAAATCATCTATGAGCATCTTGTGCAACTATTAGATGGAATTCATCACTATTGAGCAGGAAGATAAGCTGTCT
TAGCCCTGGCTGACGAGAAAGACTCGATTGTTGTACAAAGCATCAAAAACAATTTTCTAGTCCCTGTTTCAAATCTTCAATTGAGACCCCTTGAACCAAGGGGAAGAGTA
TGG
Protein sequenceShow/hide protein sequence
MMLALQRAFRTIHTTVDAPQITSFSLRVPKYVEVKFADGSVFNLSAEFLRIYSPAVDAKVRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHKTGIYSWDYFYHLG
SNKFTLLRNYVKTLKKHGLSRDPPRRK