; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G003790 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G003790
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationchr01:3330108..3332200
RNA-Seq ExpressionLsi01G003790
SyntenyLsi01G003790
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647674.1 hypothetical protein Csa_003691 [Cucumis sativus]7.3e-8287.78Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        +TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCI VRSA +YKGL+ NKQLA GSL+FAW
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        IALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPAGATGHV
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_004134744.2 uncharacterized protein LOC101214369 [Cucumis sativus]7.3e-8287.78Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        +TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCI VRSA +YKGL+ NKQLA GSL+FAW
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        IALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPAGATGHV
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_008439942.1 PREDICTED: uncharacterized protein LOC103484578 [Cucumis melo]7.3e-8287.78Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        +TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCIFVRSA +YKGL+ NKQLAAGSL+FAW
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        IAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPA ATGHV
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_022977369.1 uncharacterized protein LOC111477717 [Cucurbita maxima]1.2e-8189.27Show/hide
Query:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL
        GFVVCILVI+MDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC++VRSA EYKGLT+NKQLAAGSLVFAWIAL
Subjt:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL

Query:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        VVGFSLLISGAMYNTRSRKSCGLAH+ LLSIGGIVCFVHGLFAVAYYVSATA HRE  +    PPPQG+PAGATGHV
Subjt:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_038881391.1 uncharacterized protein LOC120072925 [Benincasa hispida]6.6e-8390Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        +TYGF+VC+L+IIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANLAGG IFVRSA EYKGLT NKQLA GSLVFAW
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        IALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVSATAEHREE K    PPPQGNPA A G V
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

TrEMBL top hitse value%identityAlignment
A0A0A0KLA9 Uncharacterized protein3.5e-8287.78Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        +TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCI VRSA +YKGL+ NKQLA GSL+FAW
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        IALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPAGATGHV
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A1S3AZY5 uncharacterized protein LOC1034845783.5e-8287.78Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        +TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCIFVRSA +YKGL+ NKQLAAGSL+FAW
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        IAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPA ATGHV
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A5D3CMA3 Uncharacterized protein3.5e-8287.78Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        +TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCIFVRSA +YKGL+ NKQLAAGSL+FAW
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        IAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPA ATGHV
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A6J1GE80 uncharacterized protein LOC1114533741.0e-8188.7Show/hide
Query:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL
        GFVVCILVI+MDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC++VRSA EYKGLT+NKQLAAGSLVFAWIA+
Subjt:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL

Query:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        VVGFSLLISGAMYNTRSRKSCGLAHN LLSIGGI CFVHGLFAVAYYVSATA +REE +    PPPQG+PAGATGHV
Subjt:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A6J1IR67 uncharacterized protein LOC1114777176.0e-8289.27Show/hide
Query:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL
        GFVVCILVI+MDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC++VRSA EYKGLT+NKQLAAGSLVFAWIAL
Subjt:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL

Query:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        VVGFSLLISGAMYNTRSRKSCGLAH+ LLSIGGIVCFVHGLFAVAYYVSATA HRE  +    PPPQG+PAGATGHV
Subjt:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05291.1 Protein of unknown function (DUF1218)1.4e-1936.57Show/hide
Query:  VVCILVII-MDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNK---QLAAGSLVFAWI
        +VCI++ + +D+ AG +G+QA+ AQ  V H K+   ECK PS  AF LG+ A   LA AH  AN+  GC           L  NK         L   W+
Subjt:  VVCILVII-MDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNK---QLAAGSLVFAWI

Query:  ALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVS---ATAEHREEMKPPPPPPPQGNP
          + G  +L +G   NT SR  C   +N + SIGG VCF+H + +  YY+S   A A H    KP    P +  P
Subjt:  ALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVS---ATAEHREEMKPPPPPPPQGNP

AT1G11500.1 Protein of unknown function (DUF1218)1.4e-3042.68Show/hide
Query:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFE------C-KDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSL
        GF+V +++I  D+TA +LGI+AEIAQ+K  H             C + PS  AF  G+AA +LL + H +AN+ GGC ++RS  ++K  T NK LA   L
Subjt:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFE------C-KDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSL

Query:  VFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE
        V +WI  VV +S L+ G + N+R+ + C L H     IGGI C  HG+   AYYVSA A  +E+
Subjt:  VFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE

AT2G32280.1 Protein of unknown function (DUF1218)6.9e-4651.28Show/hide
Query:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL
        G +VC++++ +DV A ILGIQAE+AQN+V H ++W+FEC++PS +AF+LGL AA +L +AH + NL GGC+ + S DE++  ++ +Q++   LV  WI  
Subjt:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIAL

Query:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHRE
         VGF  ++ G M N++SR SCG  H+  LSIGGI+CF+H LF VAYYVSATA   E
Subjt:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHRE

AT4G21310.1 Protein of unknown function (DUF1218)3.5e-5865.62Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW
        R  GF +CIL++ MDV+AGILGI+AEIAQNKV H KMWIFEC+DPSY AFK GLAA ILL LAH  AN  GGC+ V S  + +  + NKQLA  SL+F W
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE
        I L + FS+LI G M N+RSRK+CG++H+R+LSIGGI+CFVHGLFAVAYY+SATA  RE+
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACCAGATGACAATTCAAGCAAAAGAATCGTCAATGGCTGTAAACTGTCTCAAGCAAATTCGAAAACAACACGTCCCACAGAGAGAAGACAGAACAAAGAAATC
AAAACCAATTCTCGGCCGATCACTTTCTGTAGTTATGGCTAAGCTGCAGCAGATTAGCCCTTCCTCCTCTGCTCGGCTTCAACCTCGACCAAAGCAGATTCTCGCACTCA
GAACTTATGGTTTTGTTGTCTGCATCTTGGTTATTATTATGGATGTTACAGCCGGAATTCTTGGAATTCAAGCTGAAATAGCTCAAAACAAGGTGAACCATTTCAAAATG
TGGATATTTGAGTGTAAAGACCCAAGCTATAATGCTTTCAAGCTAGGTTTGGCTGCAGCCATACTGCTTGCCCTTGCTCACGCCATTGCCAACCTGGCTGGTGGGTGCAT
TTTTGTTCGATCTGCTGACGAATACAAAGGATTAACCACTAACAAGCAACTTGCTGCGGGTTCACTCGTCTTTGCATGGATTGCGCTAGTGGTCGGATTCTCCTTGCTTA
TTTCCGGGGCGATGTATAACACAAGGTCGAGAAAATCATGCGGACTTGCTCACAATCGTCTTCTGTCTATAGGGGGGATTGTGTGCTTCGTACACGGCTTGTTTGCAGTT
GCATATTACGTTTCTGCCACAGCTGAACACAGGGAGGAGATGAAGCCGCCGCCGCCGCCACCACCACAGGGGAATCCTGCTGGGGCTACAGGCCATGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACCAGATGACAATTCAAGCAAAAGAATCGTCAATGGCTGTAAACTGTCTCAAGCAAATTCGAAAACAACACGTCCCACAGAGAGAAGACAGAACAAAGAAATC
AAAACCAATTCTCGGCCGATCACTTTCTGTAGTTATGGCTAAGCTGCAGCAGATTAGCCCTTCCTCCTCTGCTCGGCTTCAACCTCGACCAAAGCAGATTCTCGCACTCA
GAACTTATGGTTTTGTTGTCTGCATCTTGGTTATTATTATGGATGTTACAGCCGGAATTCTTGGAATTCAAGCTGAAATAGCTCAAAACAAGGTGAACCATTTCAAAATG
TGGATATTTGAGTGTAAAGACCCAAGCTATAATGCTTTCAAGCTAGGTTTGGCTGCAGCCATACTGCTTGCCCTTGCTCACGCCATTGCCAACCTGGCTGGTGGGTGCAT
TTTTGTTCGATCTGCTGACGAATACAAAGGATTAACCACTAACAAGCAACTTGCTGCGGGTTCACTCGTCTTTGCATGGATTGCGCTAGTGGTCGGATTCTCCTTGCTTA
TTTCCGGGGCGATGTATAACACAAGGTCGAGAAAATCATGCGGACTTGCTCACAATCGTCTTCTGTCTATAGGGGGGATTGTGTGCTTCGTACACGGCTTGTTTGCAGTT
GCATATTACGTTTCTGCCACAGCTGAACACAGGGAGGAGATGAAGCCGCCGCCGCCGCCACCACCACAGGGGAATCCTGCTGGGGCTACAGGCCATGTTTAGCTCTTTGA
TCTTCATGTTCTAAGAAAATGCAATGAACTTCACTTTCTTTTTCAGTTCTTCAGCCGTAAGAGTTTGCGCAATTAATTGTTATCAACTTCCGAATGTTGCTTCAATTTGT
AGCTAGTAATACTCCCAAATTCGGGAAATAGTTCTCTCCTTACAAATATAAAATCATATTCCTTCTGTTTGGCGTTAGTAAAATATGAACCATCATGGTTCGTCGAACTC
CGAAGAAAGAGAGAAAGAAGCTTTATTTTATAAGAACTACTGCAGTAGTATGGAATTTGGAAAGGAGGTCATTTGAACAAAATATAGTTTTTAAATATAAAATTTTGTTT
GGCC
Protein sequenceShow/hide protein sequence
MGDQMTIQAKESSMAVNCLKQIRKQHVPQREDRTKKSKPILGRSLSVVMAKLQQISPSSSARLQPRPKQILALRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKM
WIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAV
AYYVSATAEHREEMKPPPPPPPQGNPAGATGHV