; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G16730 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G16730
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF1685)
Genome locationClcChr01:29506321..29506881
RNA-Seq ExpressionClc01G16730
SyntenyClc01G16730
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034341.1 hypothetical protein SDJN02_04068, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-6370.97Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSD+ D R PP  L RPFPLYK LSWSPDADREEAWLRRK+QSK+RR+KSVTDDD EELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNK Y RTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVS+ +SPS+++SPAAII                                  GENPQTVKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

TYK12889.1 Membrane insertase YidC [Cucumis melo var. makuwa]5.7e-6873.66Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSDSADHR PP P +RPFPLYKQ SWSPDADR++AW+RRK+QSKMRR+KSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVS+ VSPS++ SPAAII H                                GENPQ VKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

XP_004141915.1 uncharacterized protein LOC101216565 [Cucumis sativus]2.3e-6975.27Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSDSADHR PP PLTRPFPLYKQ SWSPDADR++AWLRRK+QSKMRR+KSVTDDDLEELKAC+ELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVSE VSPS+++SPAAII H                                GENPQ VKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

XP_022950047.1 uncharacterized protein LOC111453248 [Cucurbita moschata]2.3e-6471.51Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSD+ D R PP  L RPFPLYK LSWSPDADREEAWLRRK+QSK+RR+KSVTDDD EELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNK Y RTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVS+F+SPS+++SPAAII                                  GENPQTVKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

XP_038881139.1 uncharacterized protein LOC120072737 [Benincasa hispida]1.1e-7177.96Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSDSADHRPPP  LTRPFPLYKQLSWSPDADREEAWLRRK+QSKMRR+KSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVS+ VSPSS NSPAAII +                                GENPQTVKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

TrEMBL top hitse value%identityAlignment
A0A0A0KIC0 Uncharacterized protein1.1e-6975.27Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSDSADHR PP PLTRPFPLYKQ SWSPDADR++AWLRRK+QSKMRR+KSVTDDDLEELKAC+ELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVSE VSPS+++SPAAII H                                GENPQ VKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

A0A5D3CRC3 Membrane insertase YidC2.8e-6873.66Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSDSADHR PP P +RPFPLYKQ SWSPDADR++AW+RRK+QSKMRR+KSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVS+ VSPS++ SPAAII H                                GENPQ VKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

A0A6J1GDR3 uncharacterized protein LOC1114532481.1e-6471.51Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSD+ D R PP  L RPFPLYK LSWSPDADREEAWLRRK+QSK+RR+KSVTDDD EELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNK Y RTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSPVS+F+SPS+++SPAAII                                  GENPQTVKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

A0A6J1IVC7 uncharacterized protein LOC1114787293.5e-6370.43Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MSD+ D R PP  L RPFPLYK LSWSPDADREEAWLRRK+QSK+RR+KSVTDDD EELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNK Y RTLS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSS SLCSSPVS+ +SPS+++SPAAII                                  GENPQTVKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

A0A6J1KLU9 uncharacterized protein LOC1114968953.9e-6270.43Show/hide
Query:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS
        MS SAD R P  P+ RPFPLYKQLSWSPDADRE+AWLRRK+QSKMRR+KSVTDDDLEELKACVELGFGFNSPEVDPRLCET PAL FY AVNK YN +LS
Subjt:  MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLS

Query:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        NSSASLCSSP S  VSPS+++SPAAII H                                GENPQTVKARLKQWAQVVACSVRQY
Subjt:  NSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15590.1 Protein of unknown function (DUF1685)7.8e-2351.33Show/hide
Query:  ADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQ---SKMRRTKSVTDDDLEELKACVELGFGF--NSPEVDPRLCETFPALGFYHAVNKQYNRTL
        ++H P P     P PL KQ SWSPD  REEAWLR+K +     + R+KSVT+DD+EELK C ELGFGF   SP+++PRL  T PAL  Y AV++QY+  L
Subjt:  ADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQ---SKMRRTKSVTDDDLEELKACVELGFGF--NSPEVDPRLCETFPALGFYHAVNKQYNRTL

Query:  SNSSASLCSSPVS
        S +S+      VS
Subjt:  SNSSASLCSSPVS

AT2G15590.2 Protein of unknown function (DUF1685)2.9e-2539.46Show/hide
Query:  ADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQ---SKMRRTKSVTDDDLEELKACVELGFGF--NSPEVDPRLCETFPALGFYHAVNKQYNRTL
        ++H P P     P PL KQ SWSPD  REEAWLR+K +     + R+KSVT+DD+EELK C ELGFGF   SP+++PRL  T PAL  Y AV++QY+  L
Subjt:  ADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQ---SKMRRTKSVTDDDLEELKACVELGFGF--NSPEVDPRLCETFPALGFYHAVNKQYNRTL

Query:  SNSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVR
        S +S+      V      S+ N+   I+                                  G++ +T+K +LKQWA+VV  SVR
Subjt:  SNSSASLCSSPVSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVR

AT3G50350.1 Protein of unknown function (DUF1685)1.7e-2255.36Show/hide
Query:  PPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSK---MRRTKSVTDDDLEELKACVELGFGFNSPE-VDPRLCETFPALGFYHAVNKQYNRTLSNSSAS
        PPLP T    L KQ SWSPD  REEAW +R+  S+   +RR KS+TD+DL+ELKA  ELGFGF SPE  DPRL  T PAL  Y AV K YN  +SN S +
Subjt:  PPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSK---MRRTKSVTDDDLEELKACVELGFGFNSPE-VDPRLCETFPALGFYHAVNKQYNRTLSNSSAS

Query:  LCSSPVSEFVSP
          SS      SP
Subjt:  LCSSPVSEFVSP

AT3G50350.2 Protein of unknown function (DUF1685)2.9e-2542.44Show/hide
Query:  FPLYKQLSWSPDADREEAWLRRKSQSK---MRRTKSVTDDDLEELKACVELGFGFNSPE-VDPRLCETFPALGFYHAVNKQYNRTLSNSSASLCSSPVSE
        + L  Q SWSPD  REEAW +R+  S+   +RR KS+TD+DL+ELKA  ELGFGF SPE  DPRL  T PAL  Y AV K YN  +SN S +  SS    
Subjt:  FPLYKQLSWSPDADREEAWLRRKSQSK---MRRTKSVTDDDLEELKACVELGFGFNSPE-VDPRLCETFPALGFYHAVNKQYNRTLSNSSASLCSSPVSE

Query:  FVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQ
          SP           H +                             + ++PQTVK +LKQWA+VVAC+V Q
Subjt:  FVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQ

AT4G33985.1 Protein of unknown function (DUF1685)5.2e-3549.71Show/hide
Query:  PFPLYKQLSWSPDADREEAWLRRK---SQSKMRRTKSVTDDDLEELKACVELGFGF--NSPEVDPRLCETFPALGFYHAVNKQYNRTLSNSSASLCSSPV
        P PL KQ SWSPDADREEAWLR+K   S  ++ R+KSVTD+DLEELK C+ELGFGF  +SP++DPRL ET PALG Y AVNKQY+  LS +S+   SS  
Subjt:  PFPLYKQLSWSPDADREEAWLRRK---SQSKMRRTKSVTDDDLEELKACVELGFGF--NSPEVDPRLCETFPALGFYHAVNKQYNRTLSNSSASLCSSPV

Query:  SEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY
        SE     + NS   I+                                  G++P+T+K RLKQWAQVVACSV+Q+
Subjt:  SEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGATTCCGCCGACCACCGCCCACCGCCTCTGCCCTTAACACGGCCTTTCCCACTCTACAAGCAGCTGTCCTGGTCGCCGGATGCCGACCGTGAGGAAGCCTGGCT
CCGCCGCAAATCCCAAAGCAAGATGCGCCGGACCAAGAGCGTGACCGACGATGATCTCGAAGAACTCAAAGCCTGCGTTGAGTTGGGTTTCGGATTCAACTCACCGGAGG
TGGACCCTCGACTCTGTGAAACTTTTCCGGCCTTGGGCTTTTACCACGCCGTTAATAAGCAGTACAATCGCACTCTCTCGAACTCCTCGGCGTCTCTCTGCTCTTCCCCT
GTTTCTGAATTTGTTTCTCCCTCTTCTGAGAACAGCCCCGCCGCCATAATCGGCCACGGTATGTTACCTCCCCTTGTGAAATTGAATGGTGAAAATTGTTTTGATTTTGA
TTTTATCTTTTTGGACGTCGGGAAATTTCGAACTGGGTTTTCAGGGGAGAATCCACAGACGGTGAAAGCAAGGCTGAAACAATGGGCGCAGGTGGTTGCTTGTTCAGTAC
GGCAGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGATTCCGCCGACCACCGCCCACCGCCTCTGCCCTTAACACGGCCTTTCCCACTCTACAAGCAGCTGTCCTGGTCGCCGGATGCCGACCGTGAGGAAGCCTGGCT
CCGCCGCAAATCCCAAAGCAAGATGCGCCGGACCAAGAGCGTGACCGACGATGATCTCGAAGAACTCAAAGCCTGCGTTGAGTTGGGTTTCGGATTCAACTCACCGGAGG
TGGACCCTCGACTCTGTGAAACTTTTCCGGCCTTGGGCTTTTACCACGCCGTTAATAAGCAGTACAATCGCACTCTCTCGAACTCCTCGGCGTCTCTCTGCTCTTCCCCT
GTTTCTGAATTTGTTTCTCCCTCTTCTGAGAACAGCCCCGCCGCCATAATCGGCCACGGTATGTTACCTCCCCTTGTGAAATTGAATGGTGAAAATTGTTTTGATTTTGA
TTTTATCTTTTTGGACGTCGGGAAATTTCGAACTGGGTTTTCAGGGGAGAATCCACAGACGGTGAAAGCAAGGCTGAAACAATGGGCGCAGGTGGTTGCTTGTTCAGTAC
GGCAGTATTGA
Protein sequenceShow/hide protein sequence
MSDSADHRPPPLPLTRPFPLYKQLSWSPDADREEAWLRRKSQSKMRRTKSVTDDDLEELKACVELGFGFNSPEVDPRLCETFPALGFYHAVNKQYNRTLSNSSASLCSSP
VSEFVSPSSENSPAAIIGHGMLPPLVKLNGENCFDFDFIFLDVGKFRTGFSGENPQTVKARLKQWAQVVACSVRQY