CuGenDBv2

HG10000155 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10000155
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1068)
Genome location: Chr09:1541925..1544172
RNA-Seq Expression: HG10000155
Synteny: HG10000155
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068


Homology

GenBank top hits
KAE8652279.1 hypothetical protein Csa_022101 [Cucumis sativus] | e value: 1.5e-81 | % identity: 85.19
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS
        MAVK VGSCSPGLTKVGL+ MALCIA YILGPPLYWHFME L AFSSSS STCPPCFCDCSS TDFAFT        +DCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTA+WETRARQRGWRD+IV SRG +Q S
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

XP_004151898.1 uncharacterized protein LOC101219040 [Cucumis sativus] | e value: 1.5e-81 | % identity: 85.19
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS
        MAVK VGSCSPGLTKVGL+ MALCIA YILGPPLYWHFME L AFSSSS STCPPCFCDCSS TDFAFT        +DCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTA+WETRARQRGWRD+IV SRG +Q S
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

XP_008455917.1 PREDICTED: uncharacterized protein LOC103495987 isoform X1 [Cucumis melo] | e value: 1.7e-80 | % identity: 85.19
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS
        MA K VGS SPGLTKVGL FMA+CIA YILGPPLYWHF E LAAFSSSS STCPPCFCDCSS TDFAFT        +DCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LT +WETRARQRGWRDDIV SRG VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

XP_008455918.1 PREDICTED: uncharacterized protein LOC103495987 isoform X2 [Cucumis melo] | e value: 1.4e-82 | % identity: 88.95
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA
        MA K VGS SPGLTKVGL FMA+CIA YILGPPLYWHF E LAAFSSSS STCPPCFCDCSS TDFAFT+DCVKHDSGMNEETEK+FAELLSEELKLREA
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA

Query:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        E LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LT +WETRARQRGWRDDIV SRG VQTS
Subjt:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

XP_023534558.1 uncharacterized protein LOC111796098 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 8.3e-80 | % identity: 86.96
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA
        MAVK  GSCSPGLTKVGL F+ALCIA YILGPPLYWHFME LA  SSSS STCPPCFCDCSS TDFAFT DCVKHDSGMNEETE+SFAELLSEELKLREA
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA

Query:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS
        E +E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTA+WE RARQRGWRDDIV S   R  VQTS
Subjt:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS

TrEMBL top hits
A0A0A0LQC2 Uncharacterized protein | e value: 7.3e-82 | % identity: 85.19
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS
        MAVK VGSCSPGLTKVGL+ MALCIA YILGPPLYWHFME L AFSSSS STCPPCFCDCSS TDFAFT        +DCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTA+WETRARQRGWRD+IV SRG +Q S
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

A0A1S3C1Z9 uncharacterized protein LOC103495987 isoform X1 | e value: 8.1e-81 | % identity: 85.19
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS
        MA K VGS SPGLTKVGL FMA+CIA YILGPPLYWHF E LAAFSSSS STCPPCFCDCSS TDFAFT        +DCVKHDSGMNEETEK+FAELLS
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFT--------KDCVKHDSGMNEETEKSFAELLS

Query:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        EELKLREAE LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LT +WETRARQRGWRDDIV SRG VQTS
Subjt:  EELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

A0A1S3C2Q4 uncharacterized protein LOC103495987 isoform X2 | e value: 6.6e-83 | % identity: 88.95
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA
        MA K VGS SPGLTKVGL FMA+CIA YILGPPLYWHF E LAAFSSSS STCPPCFCDCSS TDFAFT+DCVKHDSGMNEETEK+FAELLSEELKLREA
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA

Query:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
        E LE+HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LT +WETRARQRGWRDDIV SRG VQTS
Subjt:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS

A0A6J1G609 uncharacterized protein LOC111451120 isoform X2 | e value: 2.0e-79 | % identity: 85.87
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA
        MAVK  GSCSPGLTKVGL F+ALCIA YILGPPLYWHF+E LA  SSSS STCPPCFCDCSS TDFAFT DCVKHDSGMNEETE++FAELLSEELKLREA
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA

Query:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS
        E +E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTA+WE RARQRGWRDDIV S   R  VQTS
Subjt:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS

A0A6J1I5W5 uncharacterized protein LOC111469880 isoform X2 | e value: 9.9e-79 | % identity: 85.33
Query:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA
        MAVK  GSCSPGLTKVGL F+ALCIA YILGPPLYWHFME LA  SSS  STCPPCFCDCSS TDFAFT DCVKHDSGMNEETE+SFAELLSE+LKLREA
Subjt:  MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREA

Query:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS
        + +E HR ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTA+WE RARQRGWRDDIV S   R  VQTS
Subjt:  ETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS---RGNVQTS

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G05070.1 Protein of unknown function (DUF1068) | e value: 1.3e-51 | % identity: 60.34
Query:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTK--------DCVKHDSGMNEETEKSFAELLSEELKLREAETLESH
        K+GL  + L +A YILGPPLYWH  EALAA S+SS   CP C C+CS+++     K        DC KHD  +NE+TEK++AELL+EELKLREAE+LE H
Subjt:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTK--------DCVKHDSGMNEETEKSFAELLSEELKLREAETLESH

Query:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDI----VRSRGNVQTS
        + AD+ LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  LA QKKLT+ WE RARQ+GWR+      V+S+ NVQ +
Subjt:  RGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDI----VRSRGNVQTS

AT2G32580.1 Protein of unknown function (DUF1068) | e value: 2.1e-44 | % identity: 57.4
Query:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSS---------HTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREAETLES
        KVGL  +AL +  YILGPPLYWH  EALA     S ++C  C CDCSS          ++ +FT DC K D  +NE+TEK++AELL+EELK REA ++E 
Subjt:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSS---------HTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREAETLES

Query:  HRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS
        H+  D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D   +S
Subjt:  HRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS

AT2G32580.2 Protein of unknown function (DUF1068) | e value: 2.0e-31 | % identity: 63.46
Query:  DCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD
        +C K D  +NE+TEK++AELL+EELK REA ++E H+  D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QKKLT++WE RARQ+G++D 
Subjt:  DCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD

Query:  IVRS
          +S
Subjt:  IVRS

AT4G04360.1 Protein of unknown function (DUF1068) | e value: 1.3e-38 | % identity: 52.66
Query:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSH---------TDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREAETLES
        KV    M LCI  YI GP LYWH  E +A    S  S+CPPC CDCSS          ++ +F  DC++H+ G +EE+E SF E+++EELKLREA+  E 
Subjt:  KVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSH---------TDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREAETLES

Query:  HRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS
           AD  LL+AKK  SQYQKEADKC+ GMETCE ARE+AEA L  Q++L+ +WE RARQ GW++  V S
Subjt:  HRGADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRS

AT4G30996.1 Protein of unknown function (DUF1068) | e value: 2.4e-29 | % identity: 43.83
Query:  LYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDF---------AFTKDCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRG
        L   A+  A  + GP LYW F +     S+ + S CPPC CDC                   DC   D  + +E EK F +LL+EELKL+EA   E  R 
Subjt:  LYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDF---------AFTKDCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRG

Query:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD
         +++L EAK++ SQYQKEA+KCN+  E CE+ARERAEA L  ++K+T++WE RARQ GW  +
Subjt:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDD
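
In the alignments above, the unlabeled row between each Query and Subjt row appears to follow the usual BLAST midline convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. The Python sketch below tallies those classes for one alignment segment under that assumption. The function name `midline_stats` is hypothetical, and the per-segment ratio it reports need not equal the % identity column, whose denominator is not stated on this page.

```python
def midline_stats(query_row: str, midline: str) -> dict:
    """Tally identities and positives in one BLAST-style alignment segment.

    Assumes the usual midline convention: a letter marks an identical
    residue, '+' a conservative substitution, a blank a mismatch or gap.
    """
    # Trailing blanks of the midline are often stripped in copied text,
    # so pad (or trim) the midline to the width of the query row.
    width = len(query_row)
    midline = midline.ljust(width)[:width]

    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    percent = round(100.0 * identities / width, 2) if width else 0.0
    return {
        "identities": identities,
        "positives": positives,
        "length": width,
        "percent_identity": percent,
    }


# Short excerpt taken from the start of the AT1G05070.1 alignment,
# with the "Query:" prefix and leading indentation removed.
query = "KVGLYFMALCIATYILGPPLYWH"
match = "K+GL  + L +A YILGPPLYWH"
print(midline_stats(query, match))
```

To score a whole hit, the segments of that hit would be concatenated row by row before calling the function, since each hit is wrapped across several Query/Subjt blocks.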


Sequences

CDS sequence
ATGGCAGTGAAGCAAGTGGGTTCGTGCTCTCCAGGCTTGACGAAGGTGGGATTGTATTTTATGGCTCTTTGTATAGCGACTTACATTCTGGGTCCGCCTCTCTACTGGCA
TTTCATGGAGGCCTTGGCCGCTTTCTCTTCTTCCTCTTTCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCTTCTCACACTGACTTTGCCTTCACTAAAGATTGTGTGA
AACATGACTCTGGCATGAATGAGGAAACAGAAAAGAGTTTTGCAGAGTTGTTGTCTGAGGAACTGAAACTGAGGGAAGCTGAAACTTTGGAGAGTCATCGGGGCGCCGAC
ATATCTCTGCTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGGGCTGAAGCTAC
ATTAGCTTCACAAAAGAAGTTAACAGCAATATGGGAGACTAGGGCTCGCCAAAGAGGATGGAGAGACGACATTGTTAGATCCCGTGGTAACGTCCAAACCTCATAA
mRNA sequence
ATGGCAGTGAAGCAAGTGGGTTCGTGCTCTCCAGGCTTGACGAAGGTGGGATTGTATTTTATGGCTCTTTGTATAGCGACTTACATTCTGGGTCCGCCTCTCTACTGGCA
TTTCATGGAGGCCTTGGCCGCTTTCTCTTCTTCCTCTTTCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCTTCTCACACTGACTTTGCCTTCACTAAAGATTGTGTGA
AACATGACTCTGGCATGAATGAGGAAACAGAAAAGAGTTTTGCAGAGTTGTTGTCTGAGGAACTGAAACTGAGGGAAGCTGAAACTTTGGAGAGTCATCGGGGCGCCGAC
ATATCTCTGCTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGGGCTGAAGCTAC
ATTAGCTTCACAAAAGAAGTTAACAGCAATATGGGAGACTAGGGCTCGCCAAAGAGGATGGAGAGACGACATTGTTAGATCCCGTGGTAACGTCCAAACCTCATAA
Protein sequence
MAVKQVGSCSPGLTKVGLYFMALCIATYILGPPLYWHFMEALAAFSSSSFSTCPPCFCDCSSHTDFAFTKDCVKHDSGMNEETEKSFAELLSEELKLREAETLESHRGAD
ISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKKLTAIWETRARQRGWRDDIVRSRGNVQTS
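
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal Python sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB. Only the first 36 nucleotides of the CDS are used in the example call; pasting the full CDS lines in their place should work the same way, because whitespace and line breaks are stripped first.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so multi-line sequences can be pasted as-is.
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS sequence listed above.
cds_start = "ATGGCAGTGAAGCAAGTGGGTTCGTGCTCTCCAGGC"
print(translate(cds_start))
```

Comparing the result of `translate` on the complete CDS with the listed protein sequence is a quick consistency check before using either sequence downstream.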