CuGenDBv2

Clc06G16100 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G16100
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: periodic tryptophan protein 1 homolog
Genome location: ClcChr06:26929474..26934165
RNA-Seq Expression: Clc06G16100
Synteny: Clc06G16100
Gene Ontology terms: GO:0006364 - rRNA processing (biological process)
                     GO:0005634 - nucleus (cellular component)
InterPro domains: NA
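
The genome location in this record uses a `chromosome:start..end` notation. As a minimal sketch, the string can be parsed and the span length computed as shown below. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names introduced for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into its three components."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(chrom, start, end)


loc = parse_location("ClcChr06:26929474..26934165")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 4,692 bp on ClcChr06.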


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7021602.1 hypothetical protein SDJN02_15328 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.2e-25 | identity: 78.65%
Query:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
        DPPSKE IDE LK K +  EDSS  S+DEV+EEDMDVED  DEEIANALAVAQ LGKS ETTKS+TKYDD+AEGLKEL MD YDDEDDE
Subjt:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
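
The %identity column can be cross-checked against the middle line of an alignment. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes the midline was captured without losing any spaces. `percent_identity` is a hypothetical helper, not a CuGenDB or BLAST function.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Letters count as identical positions; '+' and spaces do not.
    The alignment length is taken to be the length of the midline,
    so trailing spaces must not have been stripped.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline of the KAG7021602.1 alignment above, with the leading indentation removed.
MIDLINE = (
    "DPPSKE IDE LK K +  EDSS  S+DEV+EEDMDVED  DEEIANALAVAQ LGKS ETTKS+TKYDD+AEGLKEL MD YDDEDDE"
)
print(percent_identity(MIDLINE))
```

For this midline the sketch counts 70 identical positions over 89 alignment columns, which agrees with the 78.65 reported for the hit.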

KAG7021602.1 hypothetical protein SDJN02_15328 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.2e-07 | identity: 84.85%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKWLVT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

XP_008452352.1 PREDICTED: periodic tryptophan protein 1 homolog [Cucumis melo] | e-value: 5.8e-07 | identity: 81.82%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKW VT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

XP_022933466.1 uncharacterized WD repeat-containing protein C17D11.16 [Cucurbita moschata] | e-value: 1.2e-07 | identity: 84.85%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKWLVT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

XP_022933466.1 uncharacterized WD repeat-containing protein C17D11.16 [Cucurbita moschata] | e-value: 1.2e-25 | identity: 78.65%
Query:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
        DPPSKE IDE LK K +  EDSS  S+DEV+EEDMDVED  DEEIANALAVAQ LGKS ETTKS+TKYDD+AEGLKEL MD YDDEDDE
Subjt:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE

XP_023529554.1 uncharacterized WD repeat-containing protein C17D11.16 [Cucurbita pepo subsp. pepo] | e-value: 1.2e-07 | identity: 84.85%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKWLVT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

XP_023529554.1 uncharacterized WD repeat-containing protein C17D11.16 [Cucurbita pepo subsp. pepo] | e-value: 3.0e-24 | identity: 73.33%
Query:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
        L DPPS+E IDE LK    +EDSS  SDDE +EEDMDVED  DEEIANALAVAQ LGKS ETT  +TKYDD+AEGLKEL MD YD+EDDE
Subjt:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE

XP_038890811.1 uncharacterized WD repeat-containing protein C17D11.16-like [Benincasa hispida] | e-value: 5.5e-26 | identity: 66.97%
Query:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDEAR-NPSHSGY
        L DPPSKE IDE LK K  ++DSS  SDDE +E DMDVED  DEEIANALAVAQ LGKS ETTKS TKYDD+AEGLKEL MD YDDEDDE     S +G 
Subjt:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDEAR-NPSHSGY

Query:  KWLVTTDVE
         +  T D++
Subjt:  KWLVTTDVE

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BUS3 periodic tryptophan protein 1 homolog | e-value: 2.8e-07 | identity: 81.82%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKW VT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

A0A1S3BUS3 periodic tryptophan protein 1 homolog | e-value: 2.5e-24 | identity: 75.28%
Query:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
        DPPSKE IDE LK K +  EDSS  S+DEV++EDMDVED  DEEIANALAVAQ LGKS +TT S+TKYDD+AEGLKEL MD YDDEDDE
Subjt:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE

A0A5A7TNX2 Periodic tryptophan protein 1-like protein | e-value: 2.8e-07 | identity: 81.82%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKW VT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

A0A5D3BBF1 Periodic tryptophan protein 1-like protein | e-value: 2.8e-07 | identity: 81.82%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKW VT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

A0A5D3BBF1 Periodic tryptophan protein 1-like protein | e-value: 4.3e-24 | identity: 73.03%
Query:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDD
        L DPPS+E IDE LK    +EDSS  SDDE +EEDMDVED  DEEIANALAVAQ LGKS ETT  +TKYDD+AEGLKEL MD YD+EDD
Subjt:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDD

A0A6J1EZ45 uncharacterized WD repeat-containing protein C17D11.16 | e-value: 6.0e-26 | identity: 78.65%
Query:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
        DPPSKE IDE LK K +  EDSS  S+DEV+EEDMDVED  DEEIANALAVAQ LGKS ETTKS+TKYDD+AEGLKEL MD YDDEDDE
Subjt:  DPPSKEFIDEFLKIK-IFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE

A0A6J1EZ45 uncharacterized WD repeat-containing protein C17D11.16 | e-value: 5.6e-08 | identity: 84.85%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKWLVT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

A0A6J1EZ45 uncharacterized WD repeat-containing protein C17D11.16 | e-value: 1.5e-24 | identity: 73.33%
Query:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
        L DPPS+E IDE LK    +EDSS  SDDE +EEDMDVED  DEEIANALAVAQ LGKS ETT  +TKYDD+AEGLKEL MD YD+EDDE
Subjt:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE

A0A6J1KY07 uncharacterized WD repeat-containing protein C17D11.16 | e-value: 5.6e-08 | identity: 84.85%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + RNPSHSGYKWLVT DVESLAWDP TEHMFV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

A0A6J1KY07 uncharacterized WD repeat-containing protein C17D11.16 | e-value: 4.3e-24 | identity: 73.03%
Query:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDD
        L DPPS+E IDE LK    +EDSS  SDDE +EEDMDVED  DEEIANALAVAQ LGKS ETT  +TKYDD+AEGLKEL MD YD+EDD
Subjt:  LEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDD

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT4G18900.1 Transducin/WD40 repeat-like superfamily protein | e-value: 4.2e-08 | identity: 66.67%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + R PSHSG+KW V +DVESLAWDP +EH FV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

AT4G18900.1 Transducin/WD40 repeat-like superfamily protein | e-value: 1.0e-06 | identity: 40.66%
Query:  EDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPET---TKSKTKYDDLAEGLKELSMDRYDDEDD
        ++ PSKE      +I++  +     S+D   EE  D E+ G  E+A+A AVA+  GKS ++   + S    D++AEGLKEL MD YD+EDD
Subjt:  EDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPET---TKSKTKYDDLAEGLKELSMDRYDDEDD

AT4G18905.1 Transducin/WD40 repeat-like superfamily protein | e-value: 8.6e-09 | identity: 40.22%
Query:  DPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDE--EIANALAVAQTLGKSPET--TKSKTKYDDLAEGLKELSMDRYDDEDDE
        +PPSKE + E ++   F  +    ++DE  E    +E+ G+E  E+ +A AVA+ LGKS ++    S  + D++++GLKEL MD YD+EDDE
Subjt:  DPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDE--EIANALAVAQTLGKSPET--TKSKTKYDDLAEGLKELSMDRYDDEDDE

AT4G18905.1 Transducin/WD40 repeat-like superfamily protein | e-value: 7.2e-08 | identity: 66.67%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + R PSHSG+KW V +DVESLAWDP  EH FV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

AT4G18905.2 Transducin/WD40 repeat-like superfamily protein | e-value: 8.6e-09 | identity: 40.22%
Query:  DPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDE--EIANALAVAQTLGKSPET--TKSKTKYDDLAEGLKELSMDRYDDEDDE
        +PPSKE + E ++   F  +    ++DE  E    +E+ G+E  E+ +A AVA+ LGKS ++    S  + D++++GLKEL MD YD+EDDE
Subjt:  DPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDE--EIANALAVAQTLGKSPET--TKSKTKYDDLAEGLKELSMDRYDDEDDE

AT4G18905.2 Transducin/WD40 repeat-like superfamily protein | e-value: 7.2e-08 | identity: 66.67%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + R PSHSG+KW V +DVESLAWDP  EH FV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

AT4G35370.1 Transducin/WD40 repeat-like superfamily protein | e-value: 4.4e-05 | identity: 54.55%
Query:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL
        + R+PS+SG KW     VE LAWDP +EH FV+
Subjt:  EARNPSHSGYKWLVTTDVESLAWDPRTEHMFVL

AT4G35370.1 Transducin/WD40 repeat-like superfamily protein | e-value: 3.7e-04 | identity: 37.88%
Query:  IRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE
        + ++    EE MD ++  + ++ +A +VA++ GKS   + S T  D++ + LKEL MD YD+EDDE
Subjt:  IRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTKYDDLAEGLKELSMDRYDDEDDE


Sequences
CDS sequence
ATGGGTTCCCAATCATTTCGCACCGTTTTAACTCAGCGGTTCGCCCTACCCGCCGGCGCCACCGCTACCGCCACTCTCTTTCTTTTATTTGTTCTCTTGTTTGGCTGCCA
TTATTGTGAACTCTTCTTGGAAGATCCTCCCTCCAAGGAGTTTATTGATGAATTTCTCAAGATTAAGATTTTCTTGGAGGATTCCAGCATACGTAGTGATGATGAGGTAA
ATGAAGAAGATATGGATGTTGAAGACACCGGTGATGAAGAGATTGCCAATGCACTAGCTGTTGCACAAACACTTGGAAAATCTCCTGAGACCACAAAGTCAAAAACCAAA
TATGACGATCTCGCTGAAGGTTTGAAGGAACTCAGCATGGACCGTTATGATGATGAAGATGATGAAGCAAGAAACCCTTCCCATTCAGGTTACAAGTGGCTAGTTACAAC
AGATGTGGAGAGCTTGGCATGGGATCCACGTACAGAGCACATGTTTGTGCTTTGGCCGATTTCTAGCTTCTGTTTGTCTGTACTCTTCTCCAGCTACTTGCAACTGGATC
TACTGACAAAGTGGTGTGGGATACATTATCTGATGCTGCAGTCTCTTGGAAGTTTGGAAAGTACGGGCAGCCGATATCTTAAATTGTTGTTTGAAAAATTGCTGAGTACC
GGGGCTCCAAATTTGTTTGTATTTGATGTTAGTTATACTGTAAAGAACGTTGGTGAAAATGAAAATTGGATTTCATATTCATAA
mRNA sequence
ATGGGTTCCCAATCATTTCGCACCGTTTTAACTCAGCGGTTCGCCCTACCCGCCGGCGCCACCGCTACCGCCACTCTCTTTCTTTTATTTGTTCTCTTGTTTGGCTGCCA
TTATTGTGAACTCTTCTTGGAAGATCCTCCCTCCAAGGAGTTTATTGATGAATTTCTCAAGATTAAGATTTTCTTGGAGGATTCCAGCATACGTAGTGATGATGAGGTAA
ATGAAGAAGATATGGATGTTGAAGACACCGGTGATGAAGAGATTGCCAATGCACTAGCTGTTGCACAAACACTTGGAAAATCTCCTGAGACCACAAAGTCAAAAACCAAA
TATGACGATCTCGCTGAAGGTTTGAAGGAACTCAGCATGGACCGTTATGATGATGAAGATGATGAAGCAAGAAACCCTTCCCATTCAGGTTACAAGTGGCTAGTTACAAC
AGATGTGGAGAGCTTGGCATGGGATCCACGTACAGAGCACATGTTTGTGCTTTGGCCGATTTCTAGCTTCTGTTTGTCTGTACTCTTCTCCAGCTACTTGCAACTGGATC
TACTGACAAAGTGGTGTGGGATACATTATCTGATGCTGCAGTCTCTTGGAAGTTTGGAAAGTACGGGCAGCCGATATCTTAAATTGTTGTTTGAAAAATTGCTGAGTACC
GGGGCTCCAAATTTGTTTGTATTTGATGTTAGTTATACTGTAAAGAACGTTGGTGAAAATGAAAATTGGATTTCATATTCATAA
Protein sequence
MGSQSFRTVLTQRFALPAGATATATLFLLFVLLFGCHYCELFLEDPPSKEFIDEFLKIKIFLEDSSIRSDDEVNEEDMDVEDTGDEEIANALAVAQTLGKSPETTKSKTK
YDDLAEGLKELSMDRYDDEDDEARNPSHSGYKWLVTTDVESLAWDPRTEHMFVLWPISSFCLSVLFSSYLQLDLLTKWCGIHYLMLQSLGSLESTGSRYLKLLFEKLLST
GAPNLFVFDVSYTVKNVGENENWISYS
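
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS in frame from its first base; `translate` is a hypothetical helper written for illustration, and only the first 24 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, dropping a single trailing stop codon.

    Whitespace and line breaks are ignored. Raises ValueError if the length
    is not a multiple of 3 and KeyError on any base other than A, C, G or T.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 24 nucleotides of the CDS sequence listed above.
cds_start = "ATGGGTTCCCAATCATTTCGCACC"
print(translate(cds_start))
```

The eight codons translate to MGSQSFRT, the first eight residues of the protein sequence in this record. Passing the full CDS, with line breaks left in, applies the same check to the whole sequence.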