CuGenDBv2

Clc09G21000 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G21000
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: DUF761 domain-containing protein
Genome location: ClcChr09:34520881..34521417
RNA-Seq Expression: Clc09G21000
Synteny: Clc09G21000
Gene Ontology terms: NA
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
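
As a quick consistency check on the genome location field, the length of a `start..end` span can be computed as sketched below. The function name `span_length` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def span_length(location: str) -> int:
    """Return the number of bases covered by 'Chr:start..end'.

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


print(span_length("ClcChr09:34520881..34521417"))  # 537
```

Under that assumption the span is 537 bases, which equals 178 codons plus one stop codon and so agrees with the 178-residue protein sequence listed in this record.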


Homology
GenBank top hits | e value | % identity | Alignment
KAG7033099.1 hypothetical protein SDJN02_07152, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.3e-72 | 81.92
Query:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH
        PSSP ILPS+Q+FS+KSS+SSTL  VKFK+F+HTIIFS  CRL RAISRAKSTV+ +LKK+YHY  +NKNKIFFGSFRLHYNWCSSHVMPVPDP+WE GH
Subjt:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH

Query:  FYYDHAAADGSQLSGYLQWLEERKLESET---TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        FYYD A AD SQLSGYLQWLEERKLESET    TT   EMNEID+LAEMFIASCHEKFRLEKQESARRFQEMMARSM
Subjt:  FYYDHAAADGSQLSGYLQWLEERKLESET---TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

XP_008458796.1 PREDICTED: uncharacterized protein LOC103498096 [Cucumis melo] | 2.7e-83 | 89.3
Query:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHYKN--KNKIFFGSFRLHYNWCSSHVMPVPDP
        MKMAI PSSPKILPS QIFSIK SHSSTLIFVKFKTFIHTIIFSQFCRL RAISRAKSTV+HILKKSYHYK+  KNKIFFGSFRLHYNWCSSHVMPVPDP
Subjt:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHYKN--KNKIFFGSFRLHYNWCSSHVMPVPDP

Query:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESE----TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        +WELGHFYYDHA   AADGSQLSGYLQWLEERKLESE    TTT TTAE+NEID+LAEMFIASCHEKFRLEKQESARRFQ MMARSM
Subjt:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESE----TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

XP_011655408.1 uncharacterized protein LOC105435523 [Cucumis sativus] | 3.8e-85 | 91.4
Query:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDP
        MKMAIGPSSP ILPS QIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRL RAISRAKSTV+HILKKSYHY  KNKNKIFFGSFRLHYNWCSSHVMPVPDP
Subjt:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDP

Query:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESETTTT---TTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        +WELGHFYYDHA   AADGSQLSGYLQWLEERKLESET TT   TTAEMNEID+LAEMFIASCHEKFRLEKQESARRFQ MMARSM
Subjt:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESETTTT---TTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

XP_023548199.1 uncharacterized protein LOC111806908 isoform X1 [Cucurbita pepo subsp. pepo] | 4.3e-73 | 82.02
Query:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH
        PSSP ILPS+Q+FS+KSS+SSTL  VKFK+F+HTIIFS  CRL RAISRAKSTV+ +LKK+YHY  +NKNKIFFGSFRLHYNWCSSHVMPVPDP+WE GH
Subjt:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH

Query:  FYYDHAAADGSQLSGYLQWLEERKLESET----TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        FYYD A AD SQLSGYLQWLEERKLESET    TTT   EMNEID+LAEMFIASCHEKFRLEKQESARRFQEMMARSM
Subjt:  FYYDHAAADGSQLSGYLQWLEERKLESET----TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

XP_038890518.1 uncharacterized protein LOC120080048 [Benincasa hispida] | 1.6e-83 | 90.27
Query:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDP
        MKMAI PSSPKILPSVQIFSIKSSHSSTLIFVKFK+FIHTIIFSQ CRL RAISRAKSTV+HILKKSYHY  KNKNKIFFGSFRLHYNWCSSHVMP+PDP
Subjt:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDP

Query:  MWELGHFYYDHAA---ADGSQLSGYLQWLEERKLESE--TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        +WELGHFYYDHAA   ADGSQLSGYLQWLEERKLESE  TTT T  EMNEID+LAEMFIASCHEKFRLEKQESARRFQEMMARSM
Subjt:  MWELGHFYYDHAA---ADGSQLSGYLQWLEERKLESE--TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KP26 Uncharacterized protein | 1.8e-85 | 91.4
Query:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDP
        MKMAIGPSSP ILPS QIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRL RAISRAKSTV+HILKKSYHY  KNKNKIFFGSFRLHYNWCSSHVMPVPDP
Subjt:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDP

Query:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESETTTT---TTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        +WELGHFYYDHA   AADGSQLSGYLQWLEERKLESET TT   TTAEMNEID+LAEMFIASCHEKFRLEKQESARRFQ MMARSM
Subjt:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESETTTT---TTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

A0A1S3C8P2 uncharacterized protein LOC103498096 | 1.3e-83 | 89.3
Query:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHYKN--KNKIFFGSFRLHYNWCSSHVMPVPDP
        MKMAI PSSPKILPS QIFSIK SHSSTLIFVKFKTFIHTIIFSQFCRL RAISRAKSTV+HILKKSYHYK+  KNKIFFGSFRLHYNWCSSHVMPVPDP
Subjt:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHYKN--KNKIFFGSFRLHYNWCSSHVMPVPDP

Query:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESE----TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        +WELGHFYYDHA   AADGSQLSGYLQWLEERKLESE    TTT TTAE+NEID+LAEMFIASCHEKFRLEKQESARRFQ MMARSM
Subjt:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESE----TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

A0A5D3BVA7 DUF761 domain-containing protein | 1.3e-83 | 89.3
Query:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHYKN--KNKIFFGSFRLHYNWCSSHVMPVPDP
        MKMAI PSSPKILPS QIFSIK SHSSTLIFVKFKTFIHTIIFSQFCRL RAISRAKSTV+HILKKSYHYK+  KNKIFFGSFRLHYNWCSSHVMPVPDP
Subjt:  MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHYKN--KNKIFFGSFRLHYNWCSSHVMPVPDP

Query:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESE----TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        +WELGHFYYDHA   AADGSQLSGYLQWLEERKLESE    TTT TTAE+NEID+LAEMFIASCHEKFRLEKQESARRFQ MMARSM
Subjt:  MWELGHFYYDHA---AADGSQLSGYLQWLEERKLESE----TTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

A0A6J1HAU6 uncharacterized protein LOC111462262 | 6.1e-73 | 81.92
Query:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH
        PSSP ILPS+Q+FS+KSS+SSTL  VKFK+F+HTIIFS  CRL RAISRAKSTV+ +LKK+YHY  +NKNKIFFGSFRLHYNWCSSHVMPVPDP+WE GH
Subjt:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH

Query:  FYYDHAAADGSQLSGYLQWLEERKLESET---TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        FYYD A AD SQLSGYLQWLEERKLESET    TT   EMNEID+LAEMFIASCHEKFRLEKQESARRFQEMMARSM
Subjt:  FYYDHAAADGSQLSGYLQWLEERKLESET---TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

A0A6J1JV22 uncharacterized protein LOC111487754 | 9.8e-71 | 79.66
Query:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH
        PSSP ILPS+Q+FS+KSS+SSTL  VKFK+F+HTIIFS  CRL RAISRAKSTV+ +LK++Y Y  +NKNK+FFGSFRLHYNWCSSHVMPVPDP+WE G+
Subjt:  PSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHY--KNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGH

Query:  FYYDHAAADGSQLSGYLQWLEERKLESET---TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        FYYD A ADGSQLSGYLQWLEERKLESET    T    EMNEID+LAEMFIASCHEKFRLEKQESARRFQEMMARSM
Subjt:  FYYDHAAADGSQLSGYLQWLEERKLESET---TTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT2G42180.1 unknown protein | 1.2e-15 | 40.45
Query:  SIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKS---------YHYK---NKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGHFYY
        S  SSHS+ L     KT    +I     RL R++SRA+S +I I K +         Y  K   N++ IFFGS        S  V+PV  P      F  
Subjt:  SIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKS---------YHYK---NKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGHFYY

Query:  DHAAADGSQL-SGYLQWLEERKLES------ETTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        D    D   L S YLQWLEER  E+      ++        ++IDRLA+ FIA CHEKF LEK ES RRFQ+M+ARS+
Subjt:  DHAAADGSQL-SGYLQWLEERKLES------ETTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

AT3G57950.1 unknown protein | 2.2e-30 | 45.45
Query:  SIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYH--------------YKNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGHF
        S  SS SS+   +K KT I  ++     R  RA+++AKS  + I K + +               KN+ KIFFGSFRLHYNWCSSHV+PVP P +     
Subjt:  SIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYH--------------YKNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGHF

Query:  YYDHAAADGSQLSGYLQWLEERK---LESETTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
        Y +    D SQLSGYL+WLE +K   +E           ++ID LA+MFIA+CHEKF LEK ES RRFQEM+ R +
Subjt:  YYDHAAADGSQLSGYLQWLEERK---LESETTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM

AT5G06790.1 unknown protein | 8.6e-27 | 42.78
Query:  SSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHIL-KKSYHY--------------KNKNKIFFGSFRLHYNWCSSHVMPVPDPM--------
        SS S T   +K K+ I T+I SQ CRL R ISR  S ++ +L KK Y++              K KN I FGSFRLHYN+CSSHV+PV  P+        
Subjt:  SSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHIL-KKSYHY--------------KNKNKIFFGSFRLHYNWCSSHVMPVPDPM--------

Query:  --------WELGHFYYDHAAADG--------SQLSGYLQWLEERKLESETTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARS
                WE     Y   + DG        SQLS YL+ LE++  + +   T T  MNEID+LA+ FIA+CHEKF LEK +S RR Q  + RS
Subjt:  --------WELGHFYYDHAAADG--------SQLSGYLQWLEERKLESETTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARS
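
The % identity values in the hit tables above can be related to the alignment midlines with a small calculation. The sketch below assumes BLAST-style midline conventions (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and defines percent identity as identical columns divided by alignment length. The record does not state how its values were computed, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity = identical columns / alignment length * 100.

    Assumption: BLAST-style midline, where only letters mark identities.
    The alignment length is passed separately because leading or trailing
    spaces of a midline are easily lost when text is copied.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)


# Toy example (not taken from the record): 8 aligned columns, 6 identical.
query = "MKMAIGPS"
midline = "MK+AI PS"
print(percent_identity(midline, len(query)))  # 75.0
```

For a multi-block alignment, the midlines of all blocks would be concatenated and the aligned length summed over the blocks before calling the function.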


Sequences
CDS sequence
ATGAAAATGGCCATTGGTCCATCTTCACCAAAGATCCTTCCTTCTGTCCAAATTTTCTCAATCAAATCCTCACATTCTTCCACCTTAATTTTTGTGAAGTTCAAAACTTT
CATCCACACTATCATCTTCTCTCAATTTTGTCGATTGGGTCGGGCGATCTCTCGAGCCAAGTCGACAGTGATTCATATTCTAAAGAAAAGTTATCACTACAAGAACAAGA
ACAAAATCTTTTTCGGGTCATTTAGACTCCATTATAACTGGTGCTCTTCTCATGTGATGCCAGTGCCGGATCCCATGTGGGAGTTGGGGCATTTTTACTACGACCATGCC
GCTGCCGATGGCTCGCAGCTGTCCGGGTATTTGCAGTGGTTGGAGGAGAGAAAATTGGAGAGCGAAACGACAACGACGACAACGGCAGAAATGAATGAGATCGACAGATT
GGCGGAGATGTTCATTGCCAGCTGCCATGAAAAATTCAGGCTAGAGAAACAGGAATCAGCTAGGAGATTTCAAGAGATGATGGCTAGAAGCATGTGA
mRNA sequence
ATGAAAATGGCCATTGGTCCATCTTCACCAAAGATCCTTCCTTCTGTCCAAATTTTCTCAATCAAATCCTCACATTCTTCCACCTTAATTTTTGTGAAGTTCAAAACTTT
CATCCACACTATCATCTTCTCTCAATTTTGTCGATTGGGTCGGGCGATCTCTCGAGCCAAGTCGACAGTGATTCATATTCTAAAGAAAAGTTATCACTACAAGAACAAGA
ACAAAATCTTTTTCGGGTCATTTAGACTCCATTATAACTGGTGCTCTTCTCATGTGATGCCAGTGCCGGATCCCATGTGGGAGTTGGGGCATTTTTACTACGACCATGCC
GCTGCCGATGGCTCGCAGCTGTCCGGGTATTTGCAGTGGTTGGAGGAGAGAAAATTGGAGAGCGAAACGACAACGACGACAACGGCAGAAATGAATGAGATCGACAGATT
GGCGGAGATGTTCATTGCCAGCTGCCATGAAAAATTCAGGCTAGAGAAACAGGAATCAGCTAGGAGATTTCAAGAGATGATGGCTAGAAGCATGTGA
Protein sequence
MKMAIGPSSPKILPSVQIFSIKSSHSSTLIFVKFKTFIHTIIFSQFCRLGRAISRAKSTVIHILKKSYHYKNKNKIFFGSFRLHYNWCSSHVMPVPDPMWELGHFYYDHA
AADGSQLSGYLQWLEERKLESETTTTTTAEMNEIDRLAEMFIASCHEKFRLEKQESARRFQEMMARSM
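
The relationship between the CDS and protein sequences can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear codon table applies and that the CDS contains only A, C, G and T; the `translate` helper is hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 11 codons of the CDS listed above.
print(translate("ATGAAAATGGCCATTGGTCCATCTTCACCAAAG"))  # MKMAIGPSSPK
```

Passing the full CDS (line breaks are ignored) should reproduce the protein sequence shown above, with the terminal TGA read as the stop codon.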