CuGenDBv2

HG10003832 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10003832
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Ty3-gypsy retrotransposon protein
Genome location: Chr08:9954021..9955428
RNA-Seq Expression: HG10003832
Synteny: HG10003832
Gene Ontology terms: GO:0090304 - nucleic acid metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: NA
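
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the span can be parsed and measured like this. The function names `parse_location` and `span_length` are hypothetical and are not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Chr08:9954021..9955428")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 1408 bp.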


Homology
GenBank top hits (e value, % identity, alignment)
KAG7017148.1 hypothetical protein SDJN02_22260 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.5e-45; % identity: 81.89)
Query:  MISGKLYPSYSASCIFPPTRT---TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKK
        MISG LYPS S SCIFPP  T   T A++HLPLLKISS LRLISFESVSLSFPTF  SKS+  STR  NSV KVYS+EGQNP +LSDLEDLSENGVVYKK
Subjt:  MISGKLYPSYSASCIFPPTRT---TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKK

Query:  TLAMVECSMFAALNGLVYFLSNSLALE
        TLAMVECSMFAALNGLVYFLSNSLALE
Subjt:  TLAMVECSMFAALNGLVYFLSNSLALE

XP_022929150.1 uncharacterized protein LOC111435817 [Cucurbita moschata] (e value: 1.7e-44; % identity: 81.4)
Query:  MISGKLYPSYSASCIFPPTRT---TLANLH--LPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVY
        MISG LYPS S SCIFPP  T   T A++H  LPLLKISS LRLISFESVSLSFPTF  SKS+  STRF NSV KVYS+EGQNP +LSDLEDLSENGVVY
Subjt:  MISGKLYPSYSASCIFPPTRT---TLANLH--LPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVY

Query:  KKTLAMVECSMFAALNGLVYFLSNSLALE
        KKTLAMVECSMFAALNGLVYFLSNSLALE
Subjt:  KKTLAMVECSMFAALNGLVYFLSNSLALE

XP_022969819.1 uncharacterized protein LOC111468904 [Cucurbita maxima] (e value: 1.8e-46; % identity: 84)
Query:  MISGKLYPSYSASCIFPPTRT-TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTL
        MISG LYPS S SCIFPP RT T A++HLPLLKIS+ LRLISFESVSLSFPTF  SKS+  STRF NSV KVYS+EGQNP +LSDLEDLSENGVVYKKTL
Subjt:  MISGKLYPSYSASCIFPPTRT-TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTL

Query:  AMVECSMFAALNGLVYFLSNSLALE
        AMVECSMFAALNGLVYFLSNSLALE
Subjt:  AMVECSMFAALNGLVYFLSNSLALE

XP_023549519.1 uncharacterized protein LOC111807999 [Cucurbita pepo subsp. pepo] (e value: 1.5e-45; % identity: 81.89)
Query:  MISGKLYPSYSASCIFPPTRT---TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKK
        MISGKLYPS S SCIFPP  T   T  ++HLPLLKISS LRLISF+SVSLSFPTF  SKS+  STRF NSV KVYS+EGQNP +LSDLEDLSENGVVYKK
Subjt:  MISGKLYPSYSASCIFPPTRT---TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKK

Query:  TLAMVECSMFAALNGLVYFLSNSLALE
        TLAMVECSMFAALNGLVYFLSNSLALE
Subjt:  TLAMVECSMFAALNGLVYFLSNSLALE

XP_038885169.1 uncharacterized protein LOC120075651 [Benincasa hispida] (e value: 1.5e-48; % identity: 87.1)
Query:  MISGKLYPSYSASCIFPPTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTLA
        MISGKLYPSYSASCIFPP +T   NLHLPLL+ISSTLRLISF+SVSLSFP+FF SKS+A STRF NSV KVYSYEGQNPI LSDLEDLSE+G VYKKTLA
Subjt:  MISGKLYPSYSASCIFPPTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTLA

Query:  MVECSMFAALNGLVYFLSNSLALE
        MVECSMFAALNGLVYFLSNSLALE
Subjt:  MVECSMFAALNGLVYFLSNSLALE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BEA4 uncharacterized protein LOC103488678 isoform X2 (e value: 1.5e-43; % identity: 79.69)
Query:  MISGKLYPSYSASCIFP----PTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYK
        MISGKLY S S+SCIFP    PT T   NLHL  LKISSTLRLISF+SVSLS P+ F SKS+A STRF NS+ +VYSYEGQN ITLSDL+DLSENGVVYK
Subjt:  MISGKLYPSYSASCIFP----PTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYK

Query:  KTLAMVECSMFAALNGLVYFLSNSLALE
        KTLAMVECSMFAALNGLVYFLSNSLALE
Subjt:  KTLAMVECSMFAALNGLVYFLSNSLALE

A0A1S4DVX6 uncharacterized protein LOC103488678 isoform X7 (e value: 1.5e-43; % identity: 79.69)
Query:  MISGKLYPSYSASCIFP----PTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYK
        MISGKLY S S+SCIFP    PT T   NLHL  LKISSTLRLISF+SVSLS P+ F SKS+A STRF NS+ +VYSYEGQN ITLSDL+DLSENGVVYK
Subjt:  MISGKLYPSYSASCIFP----PTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYK

Query:  KTLAMVECSMFAALNGLVYFLSNSLALE
        KTLAMVECSMFAALNGLVYFLSNSLALE
Subjt:  KTLAMVECSMFAALNGLVYFLSNSLALE

A0A1S4DVX7 uncharacterized protein LOC103488678 isoform X5 (e value: 1.5e-43; % identity: 79.69)
Query:  MISGKLYPSYSASCIFP----PTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYK
        MISGKLY S S+SCIFP    PT T   NLHL  LKISSTLRLISF+SVSLS P+ F SKS+A STRF NS+ +VYSYEGQN ITLSDL+DLSENGVVYK
Subjt:  MISGKLYPSYSASCIFP----PTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYK

Query:  KTLAMVECSMFAALNGLVYFLSNSLALE
        KTLAMVECSMFAALNGLVYFLSNSLALE
Subjt:  KTLAMVECSMFAALNGLVYFLSNSLALE

A0A6J1ETG3 uncharacterized protein LOC111435817 (e value: 8.1e-45; % identity: 81.4)
Query:  MISGKLYPSYSASCIFPPTRT---TLANLH--LPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVY
        MISG LYPS S SCIFPP  T   T A++H  LPLLKISS LRLISFESVSLSFPTF  SKS+  STRF NSV KVYS+EGQNP +LSDLEDLSENGVVY
Subjt:  MISGKLYPSYSASCIFPPTRT---TLANLH--LPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVY

Query:  KKTLAMVECSMFAALNGLVYFLSNSLALE
        KKTLAMVECSMFAALNGLVYFLSNSLALE
Subjt:  KKTLAMVECSMFAALNGLVYFLSNSLALE

A0A6J1I3S1 uncharacterized protein LOC111468904 (e value: 8.7e-47; % identity: 84)
Query:  MISGKLYPSYSASCIFPPTRT-TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTL
        MISG LYPS S SCIFPP RT T A++HLPLLKIS+ LRLISFESVSLSFPTF  SKS+  STRF NSV KVYS+EGQNP +LSDLEDLSENGVVYKKTL
Subjt:  MISGKLYPSYSASCIFPPTRT-TLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTL

Query:  AMVECSMFAALNGLVYFLSNSLALE
        AMVECSMFAALNGLVYFLSNSLALE
Subjt:  AMVECSMFAALNGLVYFLSNSLALE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G26180.1 unknown protein (e value: 1.1e-06; % identity: 35.78)
Query:  FPPTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY
        F PTR+  +++ L     S    +IS  S+S +    F   S + ++ + N          +N  +  + ++     VVY+KTL +VEC+MFAA+ GLVY
Subjt:  FPPTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTLAMVECSMFAALNGLVY

Query:  FLSNSLALE
        FLSNSLA+E
Subjt:  FLSNSLALE
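
The alignments above place a midline between the Query and Subjt rows. As a rough sketch, assuming the usual BLAST-style convention that a letter on the midline marks an identical residue while `+` or a space marks a non-identical column, percent identity for one block can be estimated as follows. The helper `percent_identity` is hypothetical, and the % identity values listed on this page were reported by the database, not computed this way.

```python
def percent_identity(midline, aligned_length=None):
    """Percent of alignment columns whose midline character is a letter.

    aligned_length defaults to len(midline); pass it explicitly when
    trailing spaces of the midline may have been stripped.
    """
    length = len(midline) if aligned_length is None else aligned_length
    if length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


# Final block of the Arabidopsis hit: 8 identical columns out of 9.
print(round(percent_identity("FLSNSLA+E"), 2))
```

A whole-hit value would need the midlines of all blocks concatenated, with gap columns counted in the alignment length.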


Sequences
CDS sequence
ATGATTTCTGGGAAGCTTTATCCATCTTACTCCGCATCATGCATTTTCCCACCAACACGAACAACACTAGCCAATCTTCATCTTCCTCTTCTTAAAATCTCTTCCACACT
CAGATTAATTAGCTTTGAATCCGTCTCCCTCTCTTTTCCAACCTTCTTTCCTTCTAAATCCACTGCCAATTCAACTAGATTTCCGAATTCTGTGCCAAAAGTTTATAGCT
ATGAGGGCCAAAACCCCATTACTTTGTCGGATTTGGAAGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACACTGGCCATGGTGGAGTGCTCCATGTTCGCTGCACTT
AATGGCTTGGTCTACTTCTTGAGCAATTCACTTGCTCTTGAGGAAAGCCCAGAAACCAGTGATAGAATGACATGTGCCAAGGCAAACACCCTAATCAACAGCTGCAGCCT
AAAGTTGAGAGATGAGGCAGAAGCTTTTAAAGGAATATTCTTGGACAAGATTACCCTTAAGACACAAAAGAAAATTATGAACAACAAGCTTGTAAATGGTCTTGGGTCTA
GCAACAAACTGACTTTGACGATGAACAGGGAAATTTATAATTATAACAACTGCCCAATAAACTTTTGA
mRNA sequence
ATGATTTCTGGGAAGCTTTATCCATCTTACTCCGCATCATGCATTTTCCCACCAACACGAACAACACTAGCCAATCTTCATCTTCCTCTTCTTAAAATCTCTTCCACACT
CAGATTAATTAGCTTTGAATCCGTCTCCCTCTCTTTTCCAACCTTCTTTCCTTCTAAATCCACTGCCAATTCAACTAGATTTCCGAATTCTGTGCCAAAAGTTTATAGCT
ATGAGGGCCAAAACCCCATTACTTTGTCGGATTTGGAAGACTTGTCCGAGAATGGAGTTGTCTATAAGAAGACACTGGCCATGGTGGAGTGCTCCATGTTCGCTGCACTT
AATGGCTTGGTCTACTTCTTGAGCAATTCACTTGCTCTTGAGGAAAGCCCAGAAACCAGTGATAGAATGACATGTGCCAAGGCAAACACCCTAATCAACAGCTGCAGCCT
AAAGTTGAGAGATGAGGCAGAAGCTTTTAAAGGAATATTCTTGGACAAGATTACCCTTAAGACACAAAAGAAAATTATGAACAACAAGCTTGTAAATGGTCTTGGGTCTA
GCAACAAACTGACTTTGACGATGAACAGGGAAATTTATAATTATAACAACTGCCCAATAAACTTTTGA
Protein sequence
MISGKLYPSYSASCIFPPTRTTLANLHLPLLKISSTLRLISFESVSLSFPTFFPSKSTANSTRFPNSVPKVYSYEGQNPITLSDLEDLSENGVVYKKTLAMVECSMFAAL
NGLVYFLSNSLALEESPETSDRMTCAKANTLINSCSLKLRDEAEAFKGIFLDKITLKTQKKIMNNKLVNGLGSSNKLTLTMNREIYNYNNCPINF
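
The protein sequence above should correspond to a translation of the CDS. The following is a minimal sketch of how that relationship could be checked, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The function `translate` is a hypothetical helper written for this illustration, not something provided by the database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGATTTCTGGGAAGCTTTATCCATCTTAC"))
```

The first 30 nucleotides of the CDS translate to MISGKLYPSY, which matches the start of the listed protein. Passing the full CDS (line breaks are ignored) allows the same comparison over the whole sequence.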