; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg17954 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg17954
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of unknown function (DUF962)
Genome locationCarg_Chr10:908671..910094
RNA-Seq ExpressionCarg17954
SyntenyCarg17954
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR009305 - 2-hydroxy-palmitic acid dioxygenase Mpo1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587548.1 hypothetical protein SDJN03_16113, partial [Cucurbita argyrosperma subsp. sororia]1.2e-4561.73Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKT  FDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S ++ RLGYSQTWK+VL AQLFCWTNQ I HGVFEKRAPALLDNLAQ FLMAPFFVF EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

KAG6589515.1 hypothetical protein SDJN03_14938, partial [Cucurbita argyrosperma subsp. sororia]4.3e-5987.31Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL-----------------AGSNAALPCLICWVGESILAFRLGYSQTWKIVLTAQLFCWTN
        MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL                 AGSNAALPCLICWVGESILAFRLGYSQTWKIVLTAQLFCWTN
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL-----------------AGSNAALPCLICWVGESILAFRLGYSQTWKIVLTAQLFCWTN

Query:  QLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        QLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
Subjt:  QLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

KAG7023203.1 SPAC16E8.02, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-90100Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTLAGSNAALPCLICWVGESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLD
        MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTLAGSNAALPCLICWVGESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLD
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTLAGSNAALPCLICWVGESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLD

Query:  NLAQPFLMAPFFVFFEVFLLWLFGELLVGVDIEATLLFLMFGVRFFKVCSNMNHTQDLVRVCKRR
        NLAQPFLMAPFFVFFEVFLLWLFGELLVGVDIEATLLFLMFGVRFFKVCSNMNHTQDLVRVCKRR
Subjt:  NLAQPFLMAPFFVFFEVFLLWLFGELLVGVDIEATLLFLMFGVRFFKVCSNMNHTQDLVRVCKRR

XP_022933403.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita moschata]1.2e-4561.73Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKT  FDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S ++ RLGYSQTWK+VL AQLFCWTNQ I HGVFEKRAPALLDNLAQ FLMAPFFVF EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

XP_022972978.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita maxima]1.6e-4561.11Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKT  FDLERH+AFYGAYHSNPVNIFIHVLFVWPIFFTTL                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S ++ RLGYSQTWK+VL AQLFCWTNQ+I HGVFEKRAPALLDNLAQ FLMAPFFVF EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

TrEMBL top hitse value%identityAlignment
A0A0A0LP54 Uncharacterized protein3.2e-4459.88Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKTGLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+L                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S +A +LGYSQTWK+VL AQLFCWTNQ I HGVFEKRAPALLDNLAQ FLMAPFFV  EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

A0A1S3BV50 uncharacterized endoplasmic reticulum membrane protein YGL010W4.2e-4459.88Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKTGLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+L                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S +A +LGYSQTWK+VL AQLFCWTNQ I HGVFEKRAPALLDNLAQ FLMAPFFV  EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

A0A5A7UQ13 Putative endoplasmic reticulum membrane protein4.2e-4459.88Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKTGLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+L                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S +A +LGYSQTWK+VL AQLFCWTNQ I HGVFEKRAPALLDNLAQ FLMAPFFV  EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

A0A6J1F4N0 uncharacterized endoplasmic reticulum membrane protein C16E8.025.9e-4661.73Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKT  FDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S ++ RLGYSQTWK+VL AQLFCWTNQ I HGVFEKRAPALLDNLAQ FLMAPFFVF EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

A0A6J1IBP8 uncharacterized endoplasmic reticulum membrane protein C16E8.027.7e-4661.11Show/hide
Query:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV
        MGKT  FDLERH+AFYGAYHSNPVNIFIHVLFVWPIFFTTL                                             AGS AAL C +CWV
Subjt:  MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL---------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S ++ RLGYSQTWK+VL AQLFCWTNQ+I HGVFEKRAPALLDNLAQ FLMAPFFVF EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

SwissProt top hitse value%identityAlignment
O13737 2-hydroxy-palmitic acid dioxygenase mpo11.6e-1133.56Show/hide
Query:  LERHFAFYGAYHSNPVNIFIH----------------------------------VLFVWPIFFTTLAGSNAALPCLICWVGESILAFRL----GYSQTW
        L R ++FY AYHSNPVNI IH                                  V   + IF+ TL   +  L   + ++   IL  +L      S   
Subjt:  LERHFAFYGAYHSNPVNIFIH----------------------------------VLFVWPIFFTTLAGSNAALPCLICWVGESILAFRL----GYSQTW

Query:  KIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFE
        +      + CW  Q I HGVFEKR PALLDNL Q   +AP F F E
Subjt:  KIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFE

P25338 2-hydroxy-palmitic acid dioxygenase MPO16.0e-1133.57Show/hide
Query:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFT------------------TLAGSNAALPCLI-----CWVGESILAFRLG-----YSQTWKIVLTA
        GL DL     FY  YH NP N+ IH +FV  I F+                   L+   +   CL+        G  +L   L         T+K  L  
Subjt:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFT------------------TLAGSNAALPCLI-----CWVGESILAFRLG-----YSQTWKIVLTA

Query:  QLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFE
            W  Q + HGVFEKR PAL+DNL Q  ++AP+F+ FE
Subjt:  QLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFE

Arabidopsis top hitse value%identityAlignment
AT1G18720.1 Protein of unknown function (DUF962)1.4e-3649.38Show/hide
Query:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL-------------------------------------------------AGSNAALPCLICWV
        GLFDLE+HFAFYGAYHSNP+NI IH++FVWPIFF+ L                                                 +G  AAL C  CWV
Subjt:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL-------------------------------------------------AGSNAALPCLICWV

Query:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        G S LA RLG S   K+ L +QL CWT Q + HGVFEKRAPALLDNL Q FLMAPFFV  EV
Subjt:  GESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV

AT1G74440.1 Protein of unknown function (DUF962)2.3e-3446.34Show/hide
Query:  KTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL-------------------------------------------------AGSNAALPCLIC
        + GL DLE+HFAFYGAYHSNP+NI IH LFVWP  F TL                                                 +G  AAL C  C
Subjt:  KTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTL-------------------------------------------------AGSNAALPCLIC

Query:  WVGESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV
        W+G S LA RLG+S T K+ + +QL CWT Q + HG+FEKRAPALLDNL Q FLM PFFV  EV
Subjt:  WVGESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAPFFVFFEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGACTGGTTTGTTTGATCTGGAGAGGCATTTCGCCTTCTATGGCGCTTATCACAGCAACCCAGTCAACATTTTCATTCATGTTCTGTTTGTGTGGCCAATTTT
CTTTACCACCCTCGCGGGGTCCAATGCCGCTTTGCCTTGCTTGATTTGTTGGGTTGGAGAAAGCATACTCGCCTTTAGACTTGGTTATTCTCAGACCTGGAAGATAGTAC
TGACTGCTCAGTTGTTCTGTTGGACCAATCAGTTAATAGACCATGGAGTATTTGAGAAACGAGCACCGGCTTTGTTAGACAATCTTGCTCAACCTTTTCTAATGGCTCCA
TTCTTTGTATTTTTTGAGGTATTTCTTCTCTGGTTATTCGGAGAGTTACTTGTCGGTGTTGATATTGAAGCAACTTTGTTGTTTCTGATGTTTGGTGTCAGGTTCTTCAA
AGTTTGTTCAAATATGAACCATACCCAGGATTTAGTGCGAGTGTGCAAGCGAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGACTGGTTTGTTTGATCTGGAGAGGCATTTCGCCTTCTATGGCGCTTATCACAGCAACCCAGTCAACATTTTCATTCATGTTCTGTTTGTGTGGCCAATTTT
CTTTACCACCCTCGCGGGGTCCAATGCCGCTTTGCCTTGCTTGATTTGTTGGGTTGGAGAAAGCATACTCGCCTTTAGACTTGGTTATTCTCAGACCTGGAAGATAGTAC
TGACTGCTCAGTTGTTCTGTTGGACCAATCAGTTAATAGACCATGGAGTATTTGAGAAACGAGCACCGGCTTTGTTAGACAATCTTGCTCAACCTTTTCTAATGGCTCCA
TTCTTTGTATTTTTTGAGGTATTTCTTCTCTGGTTATTCGGAGAGTTACTTGTCGGTGTTGATATTGAAGCAACTTTGTTGTTTCTGATGTTTGGTGTCAGGTTCTTCAA
AGTTTGTTCAAATATGAACCATACCCAGGATTTAGTGCGAGTGTGCAAGCGAAGATAAAAGCAGATATCAAAGAGTGGAAAGAACAGAAGGAAAAGCTGACAACCTGCCT
AAAGTTCAAGACAACTCTTGCCTAGATCTCCCTTTGTAGTAGTATGAAGAGAAAAACATTTTGAAATCTTTACGTGATATTTGTTAGGAATCACGAACTTTCACAATGAT
ATGATATTGTCTATTTTGAGCATAAACTCTCATGAAGACATGACTCTGATACCATGTTAGAAATCACGAACCTCCACACTAGTATGATATTGTCCACTGCTCTCATGGCA
TAAGCTCCCTCTCAACAATCCTCGACAATAGACGATGTTTTGCTTTTCAACCTTGATTTGTCATTCTACTATGGTTTGTAATGTAACAGAGTACATAAAATGAATGAAGA
AGACACAGGTGCGATGACTTACCTGTTAAATAAGAACTCCAAACGAATGCACA
Protein sequenceShow/hide protein sequence
MGKTGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTTLAGSNAALPCLICWVGESILAFRLGYSQTWKIVLTAQLFCWTNQLIDHGVFEKRAPALLDNLAQPFLMAP
FFVFFEVFLLWLFGELLVGVDIEATLLFLMFGVRFFKVCSNMNHTQDLVRVCKRR