CuGenDBv2

Cp4.1LG04g02230 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene ID: Cp4.1LG04g02230
Organism: Cucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
Description: Plant/F14G9-20 protein
Genome location: Cp4.1LG04:7933630..7935381
RNA-Seq Expression: Cp4.1LG04g02230
Synteny: Cp4.1LG04g02230
Gene Ontology terms:
GO:0016020 - membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domains: NA
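
The genome location field above follows a `sequence:start..end` pattern. A minimal sketch of splitting it into parts and computing the span is shown below. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cp4.1LG04:7933630..7935381")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 1,752 bases on Cp4.1LG04.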


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589314.1 hypothetical protein SDJN03_17879, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.55e-71; % identity: 75.15)
Query:  MPMHSVSLQSSPYPNSKFPLQLSNSNLSVTRFSFIFSDKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEAL-------------------------
        MPM+SVSLQSSPYPNSKFPLQLSNSNLSVTRFSFIFSDKHFHFGRHQRLLPLP ALRESQDYQQAVKRKDLAEAL                         
Subjt:  MPMHSVSLQSSPYPNSKFPLQLSNSNLSVTRFSFIFSDKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEAL-------------------------

Query:  ----------SVQVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLG
                  SVQVDPKKWGLSGSFRYALISFLGGASFLL QDIDIRPNLLAL GLPFLD ILL 
Subjt:  ----------SVQVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLG

KAG7023013.1 hypothetical protein SDJN02_16749, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.95e-98; % identity: 63.3)
Query:  MPMHSVSLQSSPYPNSKFPLQLSNSNLSVTRFSFIFSDKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALS------------------------
        MPM+SVSLQSSPYPNSKFPLQLSNSNLSVTRFSFIFSDKHFHFGRHQRLLPLP ALRESQDYQQAVKRKDLAEALS                        
Subjt:  MPMHSVSLQSSPYPNSKFPLQLSNSNLSVTRFSFIFSDKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALS------------------------

Query:  -------------------------------------------------------------------VQVDPKKWGLSGSFRYALISFLGGASFLLCQDI
                                                                           VQVDPKKWGLSGSFRYALISFLGGASFLL QDI
Subjt:  -------------------------------------------------------------------VQVDPKKWGLSGSFRYALISFLGGASFLLCQDI

Query:  DIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKRLSPSYLVGSSICGVILDAIVAMEVGIYQGR
        DIRPNLLAL GLPFLD ILLGSTCLAQISSYWPPYKRLSPSYLVGS ICGVILDAIVAMEVGIY+GR
Subjt:  DIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKRLSPSYLVGSSICGVILDAIVAMEVGIYQGR

XP_008447096.1 PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo] (e value: 5.41e-40; % identity: 42.51)
Query:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------
        +KHFH  RH  QRLLPL  ALRE QDY++AVKRKDLAEAL                                                            
Subjt:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------

Query:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS
                                               ++ PKKWGLSGS RYALI+FLGG SFLL QDIDIRPNLLAL GL FLD ILLG TCLAQIS
Subjt:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS

Query:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR
        SYWPPY+R         L  +YL+G  I GVILD IVAM++GI QG+
Subjt:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR

XP_022969427.1 uncharacterized protein LOC111468437 isoform X2 [Cucurbita maxima] (e value: 3.41e-40; % identity: 44)
Query:  DKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALSV------------------------------------------------------------
        ++HFH  RHQRLL LP A+RE Q+Y++AVKRKDLAEAL                                                              
Subjt:  DKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALSV------------------------------------------------------------

Query:  -----------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSY
                         ++ PKKWGLSGS RYALI+ LGG SFLL QDIDIRPNL AL GL FLD ILLG TCLAQISS WPPY+R         L  +Y
Subjt:  -----------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSY

Query:  LVGSSICGVILDAIVAMEVGIYQGR
        L+G  I GVILD IVAM++GI QG+
Subjt:  LVGSSICGVILDAIVAMEVGIYQGR

XP_038888049.1 uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida] (e value: 1.04e-39; % identity: 41.63)
Query:  DKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALSV------------------------------------------------------------
        +KHF+  RHQRLLPL  AL E QDY++AVKRKDLAEAL                                                              
Subjt:  DKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALSV------------------------------------------------------------

Query:  -------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSY
                                             ++ PKKWG+SGS RYALI+FLGG SFLL QDIDIRPNLLAL GL FLD ILLG TCLAQISSY
Subjt:  -------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSY

Query:  WPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR
        WPPY+R         L  +YL+G  I GVILD IVAM++GI QG+
Subjt:  WPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K7I5 Uncharacterized protein (e value: 2.00e-39; % identity: 42.11)
Query:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------
        +K+FH  RH  QRLLPL  ALRE QDY++AVKRKDLAEAL                                                            
Subjt:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------

Query:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS
                                               ++ PKKWGLSGS RYALI+FLGG SFLL QDIDIRPNLLAL GL FLD ILLG TCLAQIS
Subjt:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS

Query:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR
        SYWPPY+R         L  +YL+G  I GVILD IVAM++GI QG+
Subjt:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR

A0A1S3BH83 uncharacterized protein LOC103489633 isoform X1 (e value: 2.62e-40; % identity: 42.51)
Query:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------
        +KHFH  RH  QRLLPL  ALRE QDY++AVKRKDLAEAL                                                            
Subjt:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------

Query:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS
                                               ++ PKKWGLSGS RYALI+FLGG SFLL QDIDIRPNLLAL GL FLD ILLG TCLAQIS
Subjt:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS

Query:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR
        SYWPPY+R         L  +YL+G  I GVILD IVAM++GI QG+
Subjt:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR

A0A5A7U732 Uncharacterized protein (e value: 2.62e-40; % identity: 42.51)
Query:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------
        +KHFH  RH  QRLLPL  ALRE QDY++AVKRKDLAEAL                                                            
Subjt:  DKHFHFGRH--QRLLPLPPALRESQDYQQAVKRKDLAEALSV----------------------------------------------------------

Query:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS
                                               ++ PKKWGLSGS RYALI+FLGG SFLL QDIDIRPNLLAL GL FLD ILLG TCLAQIS
Subjt:  ---------------------------------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQIS

Query:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR
        SYWPPY+R         L  +YL+G  I GVILD IVAM++GI QG+
Subjt:  SYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR

A0A6J1D1P2 uncharacterized protein LOC111016783 (e value: 2.75e-39; % identity: 40.51)
Query:  LQLSNSNLSVTRF----SFIFSDK-------HFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALS--------------------------------
        LQ+S+S+L    F    SF F  K       HFH  R QRLL LP ALRE QDY++AVKRKDLAEAL                                 
Subjt:  LQLSNSNLSVTRF----SFIFSDK-------HFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALS--------------------------------

Query:  -----------------------------------------------------------------VQVDPKKWGLSGSFRYALISFLGGASFLLCQDIDI
                                                                          ++ PKKWGLSGS  YALI+FLGG SFLL +DIDI
Subjt:  -----------------------------------------------------------------VQVDPKKWGLSGSFRYALISFLGGASFLLCQDIDI

Query:  RPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR
        RPNLLAL GL FLD ILLG TCLAQISSYWPPY+R         L  +YL+G  I GVILD IVAM++GI QG+
Subjt:  RPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSYLVGSSICGVILDAIVAMEVGIYQGR

A0A6J1HWB9 uncharacterized protein LOC111468437 isoform X2 (e value: 1.65e-40; % identity: 44)
Query:  DKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALSV------------------------------------------------------------
        ++HFH  RHQRLL LP A+RE Q+Y++AVKRKDLAEAL                                                              
Subjt:  DKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALSV------------------------------------------------------------

Query:  -----------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSY
                         ++ PKKWGLSGS RYALI+ LGG SFLL QDIDIRPNL AL GL FLD ILLG TCLAQISS WPPY+R         L  +Y
Subjt:  -----------------QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSY

Query:  LVGSSICGVILDAIVAMEVGIYQGR
        L+G  I GVILD IVAM++GI QG+
Subjt:  LVGSSICGVILDAIVAMEVGIYQGR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G56180.1 unknown protein (e value: 1.6e-25; % identity: 52.88)
Query:  QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSYLVGSSICGVILDAIVAM
        ++ PKKWGLSG    AL + LGG S+LL Q+ID+RPNL  + GL +LD + LG TCLAQ+S YWPP+KR         L  +YL+G  I GVILD +VAM
Subjt:  QVDPKKWGLSGSFRYALISFLGGASFLLCQDIDIRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKR---------LSPSYLVGSSICGVILDAIVAM

Query:  EVGI
        ++G+
Subjt:  EVGI


Sequences
CDS sequence
ATGCCGATGCATTCCGTCTCATTGCAATCCTCACCTTACCCAAACTCCAAATTTCCTCTTCAACTCTCCAATTCCAACCTTTCCGTTACCCGATTCTCTTTCATCTTCAG
CGATAAGCATTTCCATTTCGGGCGCCATCAGCGTCTCCTACCCCTGCCTCCTGCTCTTCGCGAATCACAGGACTACCAACAGGCTGTGAAGCGCAAGGATCTCGCTGAAG
CTCTCAGTGTCCAAGTTGATCCAAAGAAGTGGGGTCTTTCAGGCAGCTTTCGTTATGCTTTGATTTCTTTTCTTGGTGGAGCATCATTTCTTCTCTGCCAGGACATAGAT
ATTAGGCCAAACCTGTTGGCACTGCCGGGGCTGCCATTTTTGGACTTTATCCTCCTTGGTAGTACTTGTCTCGCTCAAATATCAAGCTATTGGCCACCATATAAGCGGTT
GAGTCCTTCTTACCTCGTGGGTAGCTCAATTTGTGGAGTGATTTTGGATGCAATTGTTGCTATGGAAGTGGGGATATACCAGGGCAGGTAA
mRNA sequence
ATGCCGATGCATTCCGTCTCATTGCAATCCTCACCTTACCCAAACTCCAAATTTCCTCTTCAACTCTCCAATTCCAACCTTTCCGTTACCCGATTCTCTTTCATCTTCAG
CGATAAGCATTTCCATTTCGGGCGCCATCAGCGTCTCCTACCCCTGCCTCCTGCTCTTCGCGAATCACAGGACTACCAACAGGCTGTGAAGCGCAAGGATCTCGCTGAAG
CTCTCAGTGTCCAAGTTGATCCAAAGAAGTGGGGTCTTTCAGGCAGCTTTCGTTATGCTTTGATTTCTTTTCTTGGTGGAGCATCATTTCTTCTCTGCCAGGACATAGAT
ATTAGGCCAAACCTGTTGGCACTGCCGGGGCTGCCATTTTTGGACTTTATCCTCCTTGGTAGTACTTGTCTCGCTCAAATATCAAGCTATTGGCCACCATATAAGCGGTT
GAGTCCTTCTTACCTCGTGGGTAGCTCAATTTGTGGAGTGATTTTGGATGCAATTGTTGCTATGGAAGTGGGGATATACCAGGGCAGGTAAGCAATCCTTTCATATGCAT
TTCAAACCATGGTCTTGAATGTTAGGAATGCAAAACAAAGTCCCATATTGGCTACGAAAGGGAGGGATCTAGATATTTAAGTTAGAACAAGTATCTCCATAATCCATTTG
GGCAGAACAAAAAAACAAAATCATGAAGACTTGTCTAAAGTGGACGATATCAGATAGATGGAGGTTTTTTTGTCCTTAGCTATTGAAATGGTGGTAACTGAAAATATTTT
GCACCGAGATATTCATGTGGGTTCTACATCAAGCAGATGGCTTTGTAGGTAAGGGATAGGAATACGATTAAGATAAACTCAATAATAGATCAGAGCAGGGTGGAAAATAT
GCTAAAAAATATAAAATCGATTTGAGAGTAGAGTTAGACTTCCAATAGTTTGCGGCTGATGAACAACTTATCTCATAATTTGGACGGCTAAATGTGAAGTTGTAACACCC
GAGCTTGATAGGTACAAGCGCTCCATTGCTTCCCTCCTTATGATTAA
Protein sequence
MPMHSVSLQSSPYPNSKFPLQLSNSNLSVTRFSFIFSDKHFHFGRHQRLLPLPPALRESQDYQQAVKRKDLAEALSVQVDPKKWGLSGSFRYALISFLGGASFLLCQDID
IRPNLLALPGLPFLDFILLGSTCLAQISSYWPPYKRLSPSYLVGSSICGVILDAIVAMEVGIYQGR
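
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not name explicitly. The function `translate` is a hypothetical helper, not part of CuGenDB. Only the first 21 and the last 12 nucleotides of the CDS are used as inputs.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        first, second, third = (BASES.index(base) for base in cds[i:i + 3])
        residue = AMINO_ACIDS[16 * first + 4 * second + third]
        if residue == "*":  # stop codon: end of the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# First 21 nt and last 12 nt of the CDS listed above.
print(translate("ATGCCGATGCATTCCGTCTCA"))
print(translate("CAGGGCAGGTAA"))
```

The two fragments translate to MPMHSVS and QGR, which match the start and end of the protein sequence listed in this record.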