; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G008770 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G008770
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationchr10:13043899..13045178
RNA-Seq ExpressionLsi10G008770
SyntenyLsi10G008770
Gene Ontology termsGO:0009808 - lignin metabolic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0016710 - trans-cinnamate 4-monooxygenase activity (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010693.1 hypothetical protein SDJN02_27489 [Cucurbita argyrosperma subsp. argyrosperma]9.7e-5297.06Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDK SMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

XP_022943358.1 uncharacterized protein LOC111448145 [Cucurbita moschata]1.1e-5298.04Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

XP_022986335.1 uncharacterized protein LOC111484110 [Cucurbita maxima]1.1e-5298.04Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

XP_023511831.1 uncharacterized protein LOC111776733 [Cucurbita pepo subsp. pepo]5.7e-5297.06Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPI VNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

XP_038900400.1 uncharacterized protein LOC120087632 [Benincasa hispida]3.9e-5399.02Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

TrEMBL top hitse value%identityAlignment
A0A0A0KG73 Uncharacterized protein6.1e-5297.06Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDK SMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

A0A1S3BIR2 uncharacterized protein LOC1034905141.8e-5196.08Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFL KSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        +G
Subjt:  NG

A0A5D3C173 DUF789 domain-containing protein1.8e-5196.08Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFL KSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        +G
Subjt:  NG

A0A6J1FWT0 uncharacterized protein LOC1114481455.5e-5398.04Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

A0A6J1JDS8 uncharacterized protein LOC1114841105.5e-5398.04Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWN YDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NG
        NG
Subjt:  NG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G03610.1 Protein of unknown function (DUF789)5.0e-3872.28Show/hide
Query:  MILDKGS-MQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST
        M+  KGS  +SNL  FLHC TP+V  Q LPK+EIR LNRLWHPWER+KVE+F L+DLW+ YDEWSAYGA VPI V NGE+LVQYYVPYLSAIQIFTS+S+
Subjt:  MILDKGS-MQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNST

Query:  V
        +
Subjt:  V

AT4G03420.1 Protein of unknown function (DUF789)2.2e-3872Show/hide
Query:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        M+  KG   SNL  FLHCTTPVV  Q L K+EIR+LNR+WHPWER+KVE+F L+DLW+ YDEWSAYGAGVPI ++NGE+LVQYYVPYLSAIQIFTS S++
Subjt:  MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

AT4G16100.1 Protein of unknown function (DUF789)8.5e-2250.54Show/hide
Query:  GSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNS
        G+  SNLG FL CTTP+V++Q LP +  +     W   E E   YF L DLW+S++EWSAYG GVP+ +N  +++VQYYVPYLS IQ++   S
Subjt:  GSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNS

AT4G28150.1 Protein of unknown function (DUF789)2.8e-3363.64Show/hide
Query:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGMNAVT
        +SNL  FL CTTP+V +  LPK++I+NLN LW+P E + VEYF L D W+ +DEWSAYGAGVPI    GETLVQYYVPYLSAIQIFTS+S +N +   T
Subjt:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGMNAVT

AT4G28150.2 Protein of unknown function (DUF789)2.6e-3163.64Show/hide
Query:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGMNAVT
        +SNL  FL CTTP+V +  LPK  I+NLN LW+P E + VEYF L D W+ +DEWSAYGAGVPI    GETLVQYYVPYLSAIQIFTS+S +N +   T
Subjt:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGMNAVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTTGACAAAGGGTCAATGCAATCCAATTTGGGTTGCTTTCTCCATTGCACCACGCCGGTTGTCAATTCCCAATTCTTGCCTAAGAGTGAGATTAGGAATCTTAA
TCGTCTATGGCATCCATGGGAGAGAGAGAAGGTTGAATATTTCACCCTCGCCGATCTCTGGAATTCTTACGACGAATGGAGCGCTTACGGTGCCGGTGTTCCCATCGCCG
TCAACAACGGCGAGACCCTTGTTCAATACTATGTTCCTTATCTCTCTGCAATCCAAATCTTCACCAGCAATTCCACCGTCAATGGGATGAATGCGGTGACAGTGAAACAA
GGGATTCGTTCAGTGATTCTTGCAGCGATGAGAGTGAAAGTGAAAAATTATGGAGATGGGACGGAAGCTCATCAGAAGAGGGAGGATTCTTAG
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAAGGAAAAGAGGCAGATTCTAATTTTCCCGTGAAATTTGGTGTTTGAAATCCTAATATTAGCAAATCTCTCGCATGTTTTTTGGAGGGGTCTTTTTTATC
TCATTTTGGATCCTATTTTTTTCCTAAAATTTTAAAAGGAAAAAGAAACAAGGTCCGATTATCTTCCCTCTTCCATTTCTGTCTCTCTAGTCTTTTTCGGTGCTCTGTTT
TTCGGTTTGCTTTCCGGGAATCCTCTCTGCTTTTCAGGGTTGGGGGGAAGACTCTGAAAAATTTGTCTCTCTCCCTCTCCTCCAAAATTATGATTCTTGACAAAGGGTCA
ATGCAATCCAATTTGGGTTGCTTTCTCCATTGCACCACGCCGGTTGTCAATTCCCAATTCTTGCCTAAGAGTGAGATTAGGAATCTTAATCGTCTATGGCATCCATGGGA
GAGAGAGAAGGTTGAATATTTCACCCTCGCCGATCTCTGGAATTCTTACGACGAATGGAGCGCTTACGGTGCCGGTGTTCCCATCGCCGTCAACAACGGCGAGACCCTTG
TTCAATACTATGTTCCTTATCTCTCTGCAATCCAAATCTTCACCAGCAATTCCACCGTCAATGGGATGAATGCGGTGACAGTGAAACAAGGGATTCGTTCAGTGATTCTT
GCAGCGATGAGAGTGAAAGTGAAAAATTATGGAGATGGGACGGAAGCTCATCAGAAGAGGGAGGATTCTTAGAGCAAGAAAGTCCTCTGCATCTCAGTGAGAGATTGGGA
TATCTTTACTTTCAGTATTTCGAGAGATCAACTCCATACGGAAGAGTTCCATTAATGGATAAGGTAGAACTAAAACCCCCCCATTTTACTTGGAATTTTGTATTTAGATT
ATGACTAGAAATGCTTAATGAATGGTGTGGAATCATATGGATTTTGTTTATTTATGCTTTTATGTGTTTGGCATCTACTAAAGAAGAAAACCCAAACTTTTTTGACCTTT
GAATATGGTTGCTGAATCTGTGCTATACCCACTTCTATATTGTG
Protein sequenceShow/hide protein sequence
MILDKGSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLADLWNSYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGMNAVTVKQ
GIRSVILAAMRVKVKNYGDGTEAHQKREDS