; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017342 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017342
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionYLP motif-containing protein 1-like
Genome locationChr03:13347042..13347551
RNA-Seq ExpressionHG10017342
SyntenyHG10017342
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603545.1 hypothetical protein SDJN03_04154, partial [Cucurbita argyrosperma subsp. sororia]2.0e-7585.8Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        MLCETSDYP KSLQFKHNDKFFSK+L+RESSRAN+SSRIYYGGLAGAVPFVWESQPGTP+HRFSDDLTPPLTPPPSYFS S++KP K RSKSLSL HIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR
        N KRK DLLTPPVSKSASLPSSGS FDSA GAKF GRR ARR+  G+S SKE EDAAA+GSVLCFGIGR
Subjt:  NSKRKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR

XP_008464261.1 PREDICTED: uncharacterized protein LOC103502186 [Cucumis melo]2.1e-7790Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        M+CETSDYPQKSLQFK NDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDS+KKPLKKRSKSLSLLHIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR
        NSKRKFDLL+ PPVSKSASLPS GS FDSAAGAKFTGRR ARR+PT     KEEE+AAA+GSVLCFG+GR
Subjt:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR

XP_011653717.1 uncharacterized protein LOC105435254 [Cucumis sativus]9.8e-7588.24Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        M+CETS+YPQKSLQFK NDKF SKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDS+KKPLKKRSKSLSLLHIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR
        +SKRKFDLL+ PPVSKS SL SSGS FDSAAGAKFTGRR ARR+PT     KEEE+AAAT SVLCFGIGR
Subjt:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR

XP_022925049.1 uncharacterized protein LOC111432413 [Cucurbita moschata]5.4e-6580.12Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        MLCETSD   +SLQ KHNDKFFSK+L+RESSRA+YSSR+YY GLAGAVPF WESQPGTPIHRFSDDLTPPLTPPPSYFSDS+KKP KKRSKSLSL HIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKF-DLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAA-TGSVLCFGIGR
        N KRK  DLL+PP SKSASLPSSGS FDSAA     GRR  RR P+GIS+SKE+  AAA TGSVLCFGIGR
Subjt:  NSKRKF-DLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAA-TGSVLCFGIGR

XP_038882810.1 uncharacterized protein LOC120073955 [Benincasa hispida]2.7e-8091.72Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        MLCETSDYPQKSLQFK NDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTP LTPPPSYFSDSVKKPLKKRSKSLSLLHI F
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR
        NSKRK DLL+PP SKSASLPSSGS FDSA GAKFTGRR  RR+PTGIS SKEEEDAAATGS+LCFGI R
Subjt:  NSKRKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR

TrEMBL top hitse value%identityAlignment
A0A0A0L1H9 Uncharacterized protein4.8e-7588.24Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        M+CETS+YPQKSLQFK NDKF SKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDS+KKPLKKRSKSLSLLHIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR
        +SKRKFDLL+ PPVSKS SL SSGS FDSAAGAKFTGRR ARR+PT     KEEE+AAAT SVLCFGIGR
Subjt:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR

A0A1S3CL26 uncharacterized protein LOC1035021861.0e-7790Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        M+CETSDYPQKSLQFK NDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDS+KKPLKKRSKSLSLLHIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR
        NSKRKFDLL+ PPVSKSASLPS GS FDSAAGAKFTGRR ARR+PT     KEEE+AAA+GSVLCFG+GR
Subjt:  NSKRKFDLLT-PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR

A0A6J1CSP7 uncharacterized protein LOC1110138652.4e-5079.7Show/hide
Query:  TSDYPQKSLQFKHNDKFFSKILSRESSRANYSSR-IYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFFNSK
        +SDYPQKSLQ K NDKFFSK+LSRESSRAN SSR +YYGG AGAVPF WESQPGTP HRFSD LTPPLTPPPS+FSDS KKP K RSKSL+L HIFFN K
Subjt:  TSDYPQKSLQFKHNDKFFSKILSRESSRANYSSR-IYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFFNSK

Query:  RKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTG
        RKFDLL PP S+SA+LPSSGSA DS A  KF G
Subjt:  RKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTG

A0A6J1EB06 uncharacterized protein LOC1114324132.6e-6580.12Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        MLCETSD   +SLQ KHNDKFFSK+L+RESSRA+YSSR+YY GLAGAVPF WESQPGTPIHRFSDDLTPPLTPPPSYFSDS+KKP KKRSKSLSL HIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKF-DLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAA-TGSVLCFGIGR
        N KRK  DLL+PP SKSASLPSSGS FDSAA     GRR  RR P+GIS+SKE+  AAA TGSVLCFGIGR
Subjt:  NSKRKF-DLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAA-TGSVLCFGIGR

A0A6J1IKR7 uncharacterized protein LOC1114783342.5e-6085.71Show/hide
Query:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF
        MLCETSDYP KSLQFKHNDKFFSK+L+RESSRAN+SSRIYYGGLAGAVPFVWESQPGTP+HRFSDDLTPPLTPPPSYFS S++KP K RSKSLSL HIFF
Subjt:  MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFF

Query:  NSKRKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLA
        N KRK DLLTPPVSKSASLPSSGS FDSA GA    R LA
Subjt:  NSKRKFDLLTPPVSKSASLPSSGSAFDSAAGAKFTGRRLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06930.1 unknown protein2.5e-0732.12Show/hide
Query:  YYGGLAGAVPFVWESQPGTPIH--------------RFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFFNSKRKFDLLTPPVSKSASLPSSGSA
        YYGG + AVPF WESQPGTP                 F+  ++ PLTPPPSYF  S         K  + L     SK +    + P S ++S  SS S+
Subjt:  YYGGLAGAVPFVWESQPGTPIH--------------RFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFFNSKRKFDLLTPPVSKSASLPSSGSA

Query:  FDSA--AGAKFTGRRLARRYPTGISNSKEEEDAAATG
          S+    +  + RR +  + +G S       A ++G
Subjt:  FDSA--AGAKFTGRRLARRYPTGISNSKEEEDAAATG

AT2G40475.1 unknown protein9.3e-1537.34Show/hide
Query:  NDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHR-FSDDL-TPPLTPPPSYFSDSVKKPLKKRSKSLSLLHI-FFNSKRKFDLLTPPVS
        N    SKI+ +ESS+ N SSRIYY G   +VPF+WE++PGTP H  FS+ L  PPLTPPPSY+S S      K SK+ ++    F  +     +  P  S
Subjt:  NDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHR-FSDDL-TPPLTPPPSYFSDSVKKPLKKRSKSLSLLHI-FFNSKRKFDLLTPPVS

Query:  KSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEED----AAATGSVLCFGIG
         S++  SS S++ S++       R  + Y    S  KE+++    +++  S LC+  G
Subjt:  KSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEED----AAATGSVLCFGIG

AT3G56260.2 unknown protein9.0e-1036.43Show/hide
Query:  YYGGLAGAVPFVWESQPGTP-IHRFSDDLTP-PLTPPPSYFSDSVKKPLKKRSKSLSLLHIFFNSKRKFDL--------LTPPVSKSASLPSSGSAFDSA
        YYGG   ++PF+WES+PGTP  H  SD   P PLTPPPSY+S  +    + +SK  S L  F +     DL         T   S S S  SS S+F S+
Subjt:  YYGGLAGAVPFVWESQPGTP-IHRFSDDLTP-PLTPPPSYFSDSVKKPLKKRSKSLSLLHIFFNSKRKFDL--------LTPPVSKSASLPSSGSAFDSA

Query:  AGAKFTGRRL-ARRYPTGISNSKEEEDAAATGSVLCFGIG
               R    ++ P   +N +E+E  ++  S LC   G
Subjt:  AGAKFTGRRL-ARRYPTGISNSKEEEDAAATGSVLCFGIG

AT4G25845.1 BEST Arabidopsis thaliana protein match is: OSBP(oxysterol binding protein)-related protein 4B (TAIR:AT4G25850.2)7.2e-0745Show/hide
Query:  LSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKP
        +SR SS   Y  R+  G     VPF WE QPGTPI+    ++ PPL+PPP+  S  + KP
Subjt:  LSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKP

AT5G01790.1 unknown protein2.8e-1138.46Show/hide
Query:  YYGGLAGAVPFVWESQPGTPIHRFSDDLT-PPLTPPPSYFS---DSVKKPLKKRSKSLSLL---HIFF-----NSKRKFDLLTPPVSKSASLPSSGSAFD
        YY G AGAVPF WES PGTP H  S+  T PPLTPPPS+FS   D +++  KK +K +  L    +F+     N K K    + P S S  +    + +D
Subjt:  YYGGLAGAVPFVWESQPGTPIHRFSDDLT-PPLTPPPSYFS---DSVKKPLKKRSKSLSLL---HIFF-----NSKRKFDLLTPPVSKSASLPSSGSAFD

Query:  SAAGAKF-TGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR
             KF T R + RR+ +   +S  +     + S  CFGI R
Subjt:  SAAGAKF-TGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTTGTGAAACCTCCGATTATCCTCAAAAATCTCTCCAATTCAAGCACAATGACAAATTCTTCTCCAAGATTCTGTCCAGAGAAAGTTCCAGAGCCAATTACTCTTC
CAGAATCTACTACGGCGGATTGGCCGGCGCCGTTCCTTTCGTTTGGGAGTCTCAGCCAGGTACGCCTATTCACCGATTTTCCGATGACCTAACTCCTCCTCTTACTCCTC
CTCCTTCTTACTTTTCTGATTCCGTTAAAAAACCGCTCAAGAAACGATCCAAATCTCTCTCTCTTCTGCATATCTTCTTCAATTCCAAGAGGAAATTCGATCTGTTGACG
CCGCCGGTTTCCAAATCCGCGTCATTACCTTCTTCCGGATCGGCGTTCGACTCGGCTGCCGGCGCGAAGTTCACAGGGCGCCGTTTGGCGCGGCGGTATCCGACTGGAAT
CTCCAATTCGAAGGAGGAAGAGGATGCGGCGGCTACTGGTTCGGTACTTTGTTTCGGAATCGGTAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTTGTGAAACCTCCGATTATCCTCAAAAATCTCTCCAATTCAAGCACAATGACAAATTCTTCTCCAAGATTCTGTCCAGAGAAAGTTCCAGAGCCAATTACTCTTC
CAGAATCTACTACGGCGGATTGGCCGGCGCCGTTCCTTTCGTTTGGGAGTCTCAGCCAGGTACGCCTATTCACCGATTTTCCGATGACCTAACTCCTCCTCTTACTCCTC
CTCCTTCTTACTTTTCTGATTCCGTTAAAAAACCGCTCAAGAAACGATCCAAATCTCTCTCTCTTCTGCATATCTTCTTCAATTCCAAGAGGAAATTCGATCTGTTGACG
CCGCCGGTTTCCAAATCCGCGTCATTACCTTCTTCCGGATCGGCGTTCGACTCGGCTGCCGGCGCGAAGTTCACAGGGCGCCGTTTGGCGCGGCGGTATCCGACTGGAAT
CTCCAATTCGAAGGAGGAAGAGGATGCGGCGGCTACTGGTTCGGTACTTTGTTTCGGAATCGGTAGATGA
Protein sequenceShow/hide protein sequence
MLCETSDYPQKSLQFKHNDKFFSKILSRESSRANYSSRIYYGGLAGAVPFVWESQPGTPIHRFSDDLTPPLTPPPSYFSDSVKKPLKKRSKSLSLLHIFFNSKRKFDLLT
PPVSKSASLPSSGSAFDSAAGAKFTGRRLARRYPTGISNSKEEEDAAATGSVLCFGIGR