; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0001645 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0001645
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationtig00001329:328704..329311
RNA-Seq ExpressionIVF0001645
SyntenyIVF0001645
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053047.1 uncharacterized protein E6C27_scaffold344G001630 [Cucumis melo var. makuwa]1.86e-124100Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCE
        NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCE
Subjt:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCE

KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]3.27e-12094.12Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        MYCQGFVQVNPSHHVGAPL PTSTY+GQQYDYQFTIIQI GNWWVLVGENLGLGYWPKEL+QNLVDGA+QIAWGGIA+PSIDG+SPMLGSGHKPN+NGDY
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEATIF
        NEGCYIRNIQIISGAATNTY LPTWDNTLSYSSNTSCYDLNPNVNCG DMMEYCFTFGGPGGPNCEATIF
Subjt:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEATIF

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]9.08e-6897.17Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCYI
        NEG + 
Subjt:  NEGCYI

XP_031738648.1 uncharacterized protein LOC105435061 [Cucumis sativus]1.93e-12194.12Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        MYCQGFVQVNPSHHVGAPL PTSTY+GQQYDYQFTIIQI GNWWVLVGENLGLGYWPKEL+QNLVDGA+QIAWGGIA+PSIDG+SPMLGSGHKPN+NGDY
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEATIF
        NEGCYIRNIQIISGAATNTY LPTWDNTLSYSSNTSCYDLNPNVNCG DMMEYCFTFGGPGGPNCEATIF
Subjt:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEATIF

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]1.40e-6158.82Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDG-MSPMLGSGHKPNDNG
        M CQGFV VNP+ HVG+ + P S YQGQQYDYQF+I+Q  G+WWV VG+N +GLGYWP EL  NL+ GA+Q+AWGG A+P++ G  SP LGSGHKPN   
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDG-MSPMLGSGHKPNDNG

Query:  DYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEAT
        D  E  ++RNIQ I  A      +PT +NT++Y SN+SCYDL  N NC  D  +YCFTFGGPGG  CEA+
Subjt:  DYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEAT

TrEMBL top hitse value%identityAlignment
A0A0A0L1W6 Neprosin domain-containing protein4.2e-9594.12Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        MYCQGFVQVNPSHHVGAPL PTSTY+GQQYDYQFTIIQI GNWWVLVGENLGLGYWPKEL+QNLVDGA+QIAWGGIA+PSIDG+SPMLGSGHKPN+NGDY
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEATIF
        NEGCYIRNIQIISGAATNTY LPTWDNTLSYSSNTSCYDLNPNVNCG DMMEYCFTFGGPGGPNCEATIF
Subjt:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEATIF

A0A5A7UEV4 Uncharacterized protein1.4e-98100Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCE
        NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCE
Subjt:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCE

A0A5D3CJM0 Neprosin 27.7e-5798.1Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCY
        NEG +
Subjt:  NEGCY

A0A5D3CJM0 Neprosin 23.5e-4958.24Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDNG
        M CQGFV VNP   VG+ + P S YQG+QYDYQF+I+Q  G+WWV VG++ +GLGYWP EL  NL+ GAEQ+AWGG A+PS+    SP LGSGHKPN   
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDNG

Query:  DYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEAT
        D  E C++RNIQ I  A+     +PT DNT++Y S++SCYDL  N NC  D  +YCFTFGGPGG +C AT
Subjt:  DYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEAT

A0A5D3CJM0 Neprosin 25.3e-5058.82Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDG-MSPMLGSGHKPNDNG
        M CQGFV VNP+ HVG+ + P S YQGQQYDYQF+I+Q  G+WWV VG+N +GLGYWP EL  NL+ GA+Q+AWGG A+P++ G  SP LGSGHKP  NG
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDG-MSPMLGSGHKPNDNG

Query:  DYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEAT
          +E  ++RNIQ I  A      +PT +NT++Y SN+SCYDL  N NC  D  +YCFTFGGPGG  CEA+
Subjt:  DYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEAT

A0A6J1CVW9 uncharacterized protein LOC1110147745.0e-3255.38Show/hide
Query:  GNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYD
        G+WW+ V ++   +GYWPKEL  +L DGAEQ+AWGGIAKPS +GMSP LG+GHKPN NG YNE CY ++I  I G   N    P ++N +S+ SN+ CY 
Subjt:  GNWWVLVGEN-LGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYD

Query:  LNPNV-NCGDDMMEYCFTFGGPGGPNCEAT
        L      C  D M +CFTFGGPGG NC AT
Subjt:  LNPNV-NCGDDMMEYCFTFGGPGGPNCEAT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G20170.1 Protein of Unknown Function (DUF239)2.7e-2235.76Show/hide
Query:  CQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKEL--VQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDN
        C GFVQV+    +G    PTSTY G+QY  Q  I Q  I GNWW L+ +N  +GYWPK L  VQ L  GA ++ WGG    ++    SP++GSGH P + 
Subjt:  CQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKEL--VQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDN

Query:  GDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG
          + +  ++  +++I          P  D  L ++++  CY +      G++     F +GGPGG
Subjt:  GDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG

AT2G20170.2 Protein of Unknown Function (DUF239)2.7e-2235.76Show/hide
Query:  CQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKEL--VQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDN
        C GFVQV+    +G    PTSTY G+QY  Q  I Q  I GNWW L+ +N  +GYWPK L  VQ L  GA ++ WGG    ++    SP++GSGH P + 
Subjt:  CQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKEL--VQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDN

Query:  GDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG
          + +  ++  +++I          P  D  L ++++  CY +      G++     F +GGPGG
Subjt:  GDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG

AT2G20170.3 Protein of Unknown Function (DUF239)2.7e-2235.76Show/hide
Query:  CQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKEL--VQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDN
        C GFVQV+    +G    PTSTY G+QY  Q  I Q  I GNWW L+ +N  +GYWPK L  VQ L  GA ++ WGG    ++    SP++GSGH P + 
Subjt:  CQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKEL--VQNLVDGAEQIAWGGIAKPSI-DGMSPMLGSGHKPNDN

Query:  GDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG
          + +  ++  +++I          P  D  L ++++  CY +      G++     F +GGPGG
Subjt:  GDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG

AT4G23390.1 Protein of Unknown Function (DUF239)1.5e-2033.33Show/hide
Query:  GFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKELV--QNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY
        GFV V+  + +G    P S Y GQQY  + +I Q  +  +WW ++  N  +GYWPK L   Q L DGA  + WGG    S+   SP +GSGH P +   +
Subjt:  GFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKELV--QNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDY

Query:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG
         +  Y+  ++II+   T     P      +++S+ +CY++   +  G +       FGGPGG
Subjt:  NEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGG

AT5G18460.1 Protein of Unknown Function (DUF239)1.5e-2032.16Show/hide
Query:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGG-IAKPSIDG--MSPMLGSGHKPN
        + C GF+Q N    +GA + P ST++G Q+D    I +    GNWW+ +G++  +GYWP EL  +L D A  + WGG +      G   +  +GSGH P+
Subjt:  MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQ--IAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGG-IAKPSIDG--MSPMLGSGHKPN

Query:  DNGDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPG-GPNC
        +   + +  Y RN++++    ++   +P  D  +  + NT CYD+  + +  ++   Y F +GGPG  P C
Subjt:  DNGDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPG-GPNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTGTCAAGGCTTCGTACAAGTAAATCCAAGTCATCATGTAGGCGCTCCTCTTCATCCAACCTCCACCTATCAAGGACAACAATATGACTATCAATTCACCATCAT
TCAAATTGCAGGGAATTGGTGGGTTCTGGTGGGTGAAAATCTGGGATTAGGATATTGGCCAAAGGAGTTGGTTCAAAATCTAGTTGATGGGGCAGAACAAATAGCATGGG
GAGGCATTGCAAAGCCATCAATAGATGGAATGAGCCCTATGTTGGGGAGTGGACACAAGCCAAATGACAATGGTGATTATAATGAAGGCTGTTACATAAGAAACATTCAA
ATCATATCAGGTGCTGCAACGAATACTTATAAACTGCCAACTTGGGATAACACACTAAGTTATTCAAGTAACACTAGTTGTTATGATTTGAACCCTAATGTGAATTGTGG
TGATGATATGATGGAATATTGCTTCACCTTCGGAGGACCAGGTGGACCTAATTGTGAAGCCACCATCTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATTGTCAAGGCTTCGTACAAGTAAATCCAAGTCATCATGTAGGCGCTCCTCTTCATCCAACCTCCACCTATCAAGGACAACAATATGACTATCAATTCACCATCAT
TCAAATTGCAGGGAATTGGTGGGTTCTGGTGGGTGAAAATCTGGGATTAGGATATTGGCCAAAGGAGTTGGTTCAAAATCTAGTTGATGGGGCAGAACAAATAGCATGGG
GAGGCATTGCAAAGCCATCAATAGATGGAATGAGCCCTATGTTGGGGAGTGGACACAAGCCAAATGACAATGGTGATTATAATGAAGGCTGTTACATAAGAAACATTCAA
ATCATATCAGGTGCTGCAACGAATACTTATAAACTGCCAACTTGGGATAACACACTAAGTTATTCAAGTAACACTAGTTGTTATGATTTGAACCCTAATGTGAATTGTGG
TGATGATATGATGGAATATTGCTTCACCTTCGGAGGACCAGGTGGACCTAATTGTGAAGCCACCATCTTTTAA
Protein sequenceShow/hide protein sequence
MYCQGFVQVNPSHHVGAPLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGIAKPSIDGMSPMLGSGHKPNDNGDYNEGCYIRNIQ
IISGAATNTYKLPTWDNTLSYSSNTSCYDLNPNVNCGDDMMEYCFTFGGPGGPNCEATIF