; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006608 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006608
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationChr07:20287298..20290145
RNA-Seq ExpressionHG10006608
SyntenyHG10006608
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139277.1 uncharacterized protein LOC101212944 [Cucumis sativus]1.4e-8193.02Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKA QLGDSKASCPPCICDCPPPLSLLKISP        DCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_008457359.1 PREDICTED: uncharacterized protein LOC103497063 [Cucumis melo]6.2e-8293.6Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKA QLGDSKASCPPCICDCPPPLSLLKISP        DCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_022143173.1 uncharacterized protein LOC111013108 [Momordica charantia]1.4e-8193.02Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSK SCPPCICDCPPPLSLLKI+P        DCGSNDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

XP_023547119.1 uncharacterized protein LOC111806024 [Cucurbita pepo subsp. pepo]1.3e-7689.94Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSG+CLRCCLVIFAVVSALAVCGPALYWRFKKA  LGDSK SC PCICDCPPPLSLLKI+P        DCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW
        EAVSGE+TRHMNITLFEAKR ASQYQREAEKCIAATETCEEARERAEAL IKERKLTSLWERRARQMGW
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW

XP_038890751.1 uncharacterized protein LOC120080238 [Benincasa hispida]2.1e-8294.19Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKA QLGDSKASCPPCICDCPPPLSLLKISP        DCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

TrEMBL top hitse value%identityAlignment
A0A0A0LG38 Uncharacterized protein6.7e-8293.02Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKA QLGDSKASCPPCICDCPPPLSLLKISP        DCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A1S3C6L6 uncharacterized protein LOC1034970633.0e-8293.6Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKA QLGDSKASCPPCICDCPPPLSLLKISP        DCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A5A7VC79 DUF1068 domain-containing protein3.0e-8293.6Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKA QLGDSKASCPPCICDCPPPLSLLKISP        DCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A6J1CQ21 uncharacterized protein LOC1110131086.7e-8293.02Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSK SCPPCICDCPPPLSLLKI+P        DCGSNDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        EAVSGE+TRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

A0A6J1FI57 uncharacterized protein LOC1114443445.5e-7686.55Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPAL+WRFKKA  LGDSK SC PCICDCPPPLSLLKI+P        DCG NDPDLK+EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG
        EAV+GE+TRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IKERKLTSLWERRARQMGW+G
Subjt:  EAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)6.5e-2943.37Show/hide
Query:  RSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDC--------PPPLSLLKISPDCGSNDPDLKQEMEKQFVDLLTEELKLQEAV
        R  A L+  L +  +  A  + GP LYW   +A     S +SCP C C+C        P  LS    + DC  +DP++ ++ EK + +LLTEELKL+EA 
Subjt:  RSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDC--------PPPLSLLKISPDCGSNDPDLKQEMEKQFVDLLTEELKLQEAV

Query:  SGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW
        S E  +  ++ L EAK+  S YQ+EA+KC +  ETCEEARE+AE  + +++KLTS WE RARQ GW
Subjt:  SGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGW

AT2G24290.1 Protein of unknown function (DUF1068)1.3e-6169.14Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKAS---CPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEEL
        M+RRSG C+R CLVIF+VVSAL VCGPALYW+  K F +G ++++   CPPC+CD PPPLSLL+I+P         CGS+DP+LK+EMEK FVDLLTEEL
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKAS---CPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEEL

Query:  KLQEAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        KLQEAV+ E++RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERA+AL++KERK+T LWERRARQ+GWEGE
Subjt:  KLQEAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE

AT2G32580.1 Protein of unknown function (DUF1068)1.4e-2843.29Show/hide
Query:  ACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKIS--------PDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGE
        A L+  L + A+     + GP LYW   +A  L  S  SC  C+CDC   L LL I          DC   DP++ ++ EK + +LLTEELK +EA S E
Subjt:  ACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKIS--------PDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGE

Query:  YTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
          + ++  L EAK+  S YQ+EA+KC +  ETCEEARE+AE  +++++KLTS+WE+RARQ G++
Subjt:  YTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

AT4G04360.1 Protein of unknown function (DUF1068)9.6e-2541.04Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDS-KASCPPCICDC--PPPLSLLKISPDCGSNDPDL--------KQEMEKQFVDLLTEEL
        M+RR     +   V+  +     + GP+LYW   +   + DS  +SCPPC+CDC   P LS+    PD  SN   L         +E E  F +++ EEL
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDS-KASCPPCICDC--PPPLSLLKISPDCGSNDPDL--------KQEMEKQFVDLLTEEL

Query:  KLQEAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
        KL+EA + E     +  L +AK+AASQYQ+EA+KC    ETCE ARE+AEA + ++R+L+ +WE RARQ GW+
Subjt:  KLQEAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

AT4G30996.1 Protein of unknown function (DUF1068)1.0e-6674.71Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKAS--CPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELK
        M RRSG C+R CLVIFAVVSAL VCGPALYW+F K F +G ++A+  CPPC+CDCPPPLSLL+I+P        DCGS+DP+LKQEMEKQFVDLLTEELK
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKAS--CPPCICDCPPPLSLLKISP--------DCGSNDPDLKQEMEKQFVDLLTEELK

Query:  LQEAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE
        LQEAV+ E++RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERAEAL+IKERK+TSLWE+RARQ GWEGE
Subjt:  LQEAVSGEYTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACGCCGATCTGGGGCTTGCCTAAGGTGTTGTCTCGTGATTTTTGCTGTAGTTTCTGCTCTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAGGCTTT
CCAATTGGGAGATTCCAAAGCCTCCTGTCCTCCTTGCATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTTCCCCTGACTGCGGTAGTAATGACCCAGATCTCA
AGCAGGAGATGGAAAAACAATTTGTGGACCTTTTGACAGAGGAATTGAAGCTTCAAGAAGCAGTTTCTGGCGAATATACTCGCCATATGAACATCACTTTATTCGAGGCA
AAAAGGGCAGCTTCTCAGTATCAGAGGGAGGCTGAAAAGTGCATTGCTGCAACAGAAACTTGTGAAGAGGCCCGAGAACGCGCCGAGGCATTGATGATCAAGGAGAGGAA
GCTTACATCATTGTGGGAGCGACGAGCCCGCCAAATGGGTTGGGAAGGAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCACGCCGATCTGGGGCTTGCCTAAGGTGTTGTCTCGTGATTTTTGCTGTAGTTTCTGCTCTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAGGCTTT
CCAATTGGGAGATTCCAAAGCCTCCTGTCCTCCTTGCATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTTCCCCTGACTGCGGTAGTAATGACCCAGATCTCA
AGCAGGAGATGGAAAAACAATTTGTGGACCTTTTGACAGAGGAATTGAAGCTTCAAGAAGCAGTTTCTGGCGAATATACTCGCCATATGAACATCACTTTATTCGAGGCA
AAAAGGGCAGCTTCTCAGTATCAGAGGGAGGCTGAAAAGTGCATTGCTGCAACAGAAACTTGTGAAGAGGCCCGAGAACGCGCCGAGGCATTGATGATCAAGGAGAGGAA
GCTTACATCATTGTGGGAGCGACGAGCCCGCCAAATGGGTTGGGAAGGAGAATAA
Protein sequenceShow/hide protein sequence
MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKAFQLGDSKASCPPCICDCPPPLSLLKISPDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEYTRHMNITLFEA
KRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWEGE