CuGenDBv2

HG10009129 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10009129
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: Chr06:2782428..2783190
RNA-Seq Expression: HG10009129
Synteny: HG10009129
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin


Homology
GenBank top hits (e value, %identity, alignment)
KAG6688596.1 hypothetical protein I3842_11G133000 [Carya illinoinensis] (e value: 2.4e-31; %identity: 52.99)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL
        D  TGNW FMF DKY+GYWPK +   L  G    +WGG+VYSP  E  P MGSGHFPEEGY KSA++NQI+V +     FVDP  S   I  DKP C+  
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL

Query:  INKFTQAGNWGHHIFFG
        I+   +   WGHH++FG
Subjt:  INKFTQAGNWGHHIFFG

KAG7956575.1 hypothetical protein I3843_11G130800, partial [Carya illinoinensis] (e value: 2.4e-31; %identity: 52.99)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL
        D  TGNW FMF DKY+GYWPK +   L  G    +WGG+VYSP  E  P MGSGHFPEEGY KSA++NQI+V +     FVDP  S   I  DKP C+  
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL

Query:  INKFTQAGNWGHHIFFG
        I+   +   WGHH++FG
Subjt:  INKFTQAGNWGHHIFFG

XP_022158434.1 uncharacterized protein LOC111024921 [Momordica charantia] (e value: 3.9e-50; %identity: 73.08)
Query:  NLHKSL--DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEY-SSLEFVDPVGSQLSIV
        ++H S+  D +TGNW+ MFGDKYIGYWPK ++PGL +GAA+AAWGGEVYSPT+EAGPAMGSGHFPEEG++KSAFVNQIQV +Y SS  FVDP  S+LS+V
Subjt:  NLHKSL--DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEY-SSLEFVDPVGSQLSIV

Query:  LDKPVCFGLINKFTQAGNWGHHIFFGWPRG
        LD+P+CFGLINKFT  GNWG HIFFG P G
Subjt:  LDKPVCFGLINKFTQAGNWGHHIFFGWPRG

XP_040986169.1 uncharacterized protein LOC121234329 isoform X2 [Juglans microcarpa x Juglans regia] (e value: 3.1e-31; %identity: 53.39)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLE-FVDPVGSQLSIVLDKPVCFG
        D  TGNW FMF DKY+GYWPK +   L  GA   +WGG+VYSP  E  P MGSGHFPEEGY KSA++NQI+V     +  FVDP  S   I  DKP C+ 
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLE-FVDPVGSQLSIVLDKPVCFG

Query:  LINKFTQAGNWGHHIFFG
         I+   +   WGHH++FG
Subjt:  LINKFTQAGNWGHHIFFG

XP_042950336.1 uncharacterized protein LOC122282452, partial [Carya illinoinensis] (e value: 2.4e-31; %identity: 52.99)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL
        D  TGNW FMF DKY+GYWPK +   L  G    +WGG+VYSP  E  P MGSGHFPEEGY KSA++NQI+V +     FVDP  S   I  DKP C+  
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL

Query:  INKFTQAGNWGHHIFFG
        I+   +   WGHH++FG
Subjt:  INKFTQAGNWGHHIFFG

TrEMBL top hits (e value, %identity, alignment)
A0A2I4H8L6 uncharacterized protein LOC109014471 isoform X1 (e value: 1.1e-29; %identity: 51.28)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL
        D  TGNW FMF DKY+GYWPK +   L   A   +WGG+VYSP  E  P MGSG FPEEGY KSA++ QI+V       FVDP  S   I  DKP C+  
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL

Query:  INKFTQAGNWGHHIFFG
        I+   +   WGHH++FG
Subjt:  INKFTQAGNWGHHIFFG

A0A2N9FVS7 Uncharacterized protein (e value: 9.2e-29; %identity: 51.2)
Query:  NLHKSLDANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDK
        N+H+  D  TG+W FMF D YIGYWPK +   L+ GA+  +WGGEVYSP  E  PAMGSGHFPEE Y KSA+++QIQ+ +  +  FVDP    L + +DK
Subjt:  NLHKSLDANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDK

Query:  PVCFGLINKFTQAGNWGHHIFFGWP
        P C+  I    +   WGH+I+FG P
Subjt:  PVCFGLINKFTQAGNWGHHIFFGWP

A0A2P5EGA7 Uncharacterized protein (e value: 2.3e-27; %identity: 47.93)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVA--EYSSLEFVDPVGSQLSIVLDKPVCF
        D  +G+W  MF DKY+GYWPK ++P L  GA   +WGGEVYSP     PAMG GHFPEEG+RK+A++ QI+V   + +S +F DP    L    ++P C+
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVA--EYSSLEFVDPVGSQLSIVLDKPVCF

Query:  GLINKFTQAGNWGHHIFFGWP
           N +  AG WG+++FFG P
Subjt:  GLINKFTQAGNWGHHIFFGWP

A0A6J1DZE3 uncharacterized protein LOC111024921 (e value: 1.9e-50; %identity: 73.08)
Query:  NLHKSL--DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEY-SSLEFVDPVGSQLSIV
        ++H S+  D +TGNW+ MFGDKYIGYWPK ++PGL +GAA+AAWGGEVYSPT+EAGPAMGSGHFPEEG++KSAFVNQIQV +Y SS  FVDP  S+LS+V
Subjt:  NLHKSL--DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEY-SSLEFVDPVGSQLSIV

Query:  LDKPVCFGLINKFTQAGNWGHHIFFGWPRG
        LD+P+CFGLINKFT  GNWG HIFFG P G
Subjt:  LDKPVCFGLINKFTQAGNWGHHIFFGWPRG

A0A6P9E585 uncharacterized protein LOC109014471 isoform X2 (e value: 1.1e-29; %identity: 51.28)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL
        D  TGNW FMF DKY+GYWPK +   L   A   +WGG+VYSP  E  P MGSG FPEEGY KSA++ QI+V       FVDP  S   I  DKP C+  
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGL

Query:  INKFTQAGNWGHHIFFG
        I+   +   WGHH++FG
Subjt:  INKFTQAGNWGHHIFFG

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT2G20170.1 Protein of Unknown Function (DUF239) (e value: 4.1e-21; %identity: 40.32)
Query:  DANTGNWLFMFGDKYIGYWPKGL--LPGLENGAAIAAWGGEVYSPTTEA-GPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVC
        D  TGNW F+  ++ IGYWPK L  + GL  GA+   WGGEV+S   ++  P MGSGHFP+EG++K+AFVN ++V +    +   P    L +  + P C
Subjt:  DANTGNWLFMFGDKYIGYWPKGL--LPGLENGAAIAAWGGEVYSPTTEA-GPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVC

Query:  FGLINKFTQAGNWGHHIFFGWPRG
        + +  K      W   IF+G P G
Subjt:  FGLINKFTQAGNWGHHIFFGWPRG

AT2G20170.2 Protein of Unknown Function (DUF239) (e value: 4.1e-21; %identity: 40.32)
Query:  DANTGNWLFMFGDKYIGYWPKGL--LPGLENGAAIAAWGGEVYSPTTEA-GPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVC
        D  TGNW F+  ++ IGYWPK L  + GL  GA+   WGGEV+S   ++  P MGSGHFP+EG++K+AFVN ++V +    +   P    L +  + P C
Subjt:  DANTGNWLFMFGDKYIGYWPKGL--LPGLENGAAIAAWGGEVYSPTTEA-GPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVC

Query:  FGLINKFTQAGNWGHHIFFGWPRG
        + +  K      W   IF+G P G
Subjt:  FGLINKFTQAGNWGHHIFFGWPRG

AT2G20170.3 Protein of Unknown Function (DUF239) (e value: 4.1e-21; %identity: 40.32)
Query:  DANTGNWLFMFGDKYIGYWPKGL--LPGLENGAAIAAWGGEVYSPTTEA-GPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVC
        D  TGNW F+  ++ IGYWPK L  + GL  GA+   WGGEV+S   ++  P MGSGHFP+EG++K+AFVN ++V +    +   P    L +  + P C
Subjt:  DANTGNWLFMFGDKYIGYWPKGL--LPGLENGAAIAAWGGEVYSPTTEA-GPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVC

Query:  FGLINKFTQAGNWGHHIFFGWPRG
        + +  K      W   IF+G P G
Subjt:  FGLINKFTQAGNWGHHIFFGWPRG

AT4G23350.1 Protein of Unknown Function (DUF239) (e value: 2.9e-19; %identity: 35.77)
Query:  DANTGNWLFMFGDKYIGYWPKGLLPGL--ENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCF
        D+ TGNW F+F ++ IGYWP  L       N A  A+WGG+VYSP  E  P MGSGH+P EG+ K+AF++ +++ +    +  +P    + +   +P+C+
Subjt:  DANTGNWLFMFGDKYIGYWPKGLLPGL--ENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCF

Query:  GLINKFTQAGNWGHHIFFGWPRG
                   W   ++FG P G
Subjt:  GLINKFTQAGNWGHHIFFGWPRG

AT4G23390.1 Protein of Unknown Function (DUF239) (e value: 4.8e-22; %identity: 41.13)
Query:  DANTGNWLFMFGDKYIGYWPKGLL--PGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCF
        D  T +W F+  ++ IGYWPK L    GL +GA+   WGGEVYS   E  P+MGSGHFP+EG++K+A+VN +++    + E   P+ S L      P C+
Subjt:  DANTGNWLFMFGDKYIGYWPKGLL--PGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKSAFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCF

Query:  GLINKFTQAGN-WGHHIFFGWPRG
          + K    G  W   I FG P G
Subjt:  GLINKFTQAGN-WGHHIFFGWPRG


Sequences
CDS sequence
ATGGATGGTGAAGGTTTGAGAAAACTGAGGTTTCTGGTGTTTGTAGTGTCTGTTGTTGTTCTAGGTGTAGATGCTTTCACACAATTCCAAATGTTGTCTGAAGAAGAGTT
GGAACTCAACAACCTTCATAAGTCATTGGATGCGAACACGGGGAACTGGTTGTTCATGTTTGGAGACAAATACATTGGGTACTGGCCAAAGGGACTGCTGCCAGGCTTGG
AAAATGGAGCAGCCATTGCAGCATGGGGAGGGGAAGTTTACAGCCCTACAACAGAAGCAGGGCCAGCCATGGGGAGTGGCCATTTTCCTGAAGAGGGTTACAGAAAAAGT
GCTTTTGTGAATCAGATTCAAGTGGCAGAGTATAGTTCATTGGAATTTGTTGATCCAGTGGGTTCGCAGCTCAGTATTGTTTTGGACAAACCTGTCTGTTTTGGGCTCAT
TAATAAGTTTACTCAAGCTGGGAACTGGGGGCATCATATCTTCTTTGGATGGCCAAGAGGAAAACTGATTCTATGGGCAAAAATAAAATAA
mRNA sequence
ATGGATGGTGAAGGTTTGAGAAAACTGAGGTTTCTGGTGTTTGTAGTGTCTGTTGTTGTTCTAGGTGTAGATGCTTTCACACAATTCCAAATGTTGTCTGAAGAAGAGTT
GGAACTCAACAACCTTCATAAGTCATTGGATGCGAACACGGGGAACTGGTTGTTCATGTTTGGAGACAAATACATTGGGTACTGGCCAAAGGGACTGCTGCCAGGCTTGG
AAAATGGAGCAGCCATTGCAGCATGGGGAGGGGAAGTTTACAGCCCTACAACAGAAGCAGGGCCAGCCATGGGGAGTGGCCATTTTCCTGAAGAGGGTTACAGAAAAAGT
GCTTTTGTGAATCAGATTCAAGTGGCAGAGTATAGTTCATTGGAATTTGTTGATCCAGTGGGTTCGCAGCTCAGTATTGTTTTGGACAAACCTGTCTGTTTTGGGCTCAT
TAATAAGTTTACTCAAGCTGGGAACTGGGGGCATCATATCTTCTTTGGATGGCCAAGAGGAAAACTGATTCTATGGGCAAAAATAAAATAA
Protein sequence
MDGEGLRKLRFLVFVVSVVVLGVDAFTQFQMLSEEELELNNLHKSLDANTGNWLFMFGDKYIGYWPKGLLPGLENGAAIAAWGGEVYSPTTEAGPAMGSGHFPEEGYRKS
AFVNQIQVAEYSSLEFVDPVGSQLSIVLDKPVCFGLINKFTQAGNWGHHIFFGWPRGKLILWAKIK