CuGenDBv2

Clc09G05950 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G05950
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Protein of unknown function, DUF538
Genome location: ClcChr09:4700644..4702789
RNA-Seq Expression: Clc09G05950
Synteny: Clc09G05950
Gene Ontology terms: NA
InterPro domains: IPR007493 - Protein of unknown function DUF538
                  IPR036758 - At5g01610-like superfamily
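
The genome location value follows a "chromosome:start..end" layout. A minimal sketch of splitting it into its parts and computing the locus span is shown here. The helper names parse_location and span_length are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re

# Hypothetical helper for illustration only. The "chrom:start..end" layout is
# taken from the Genome location value; 1-based inclusive coordinates are an
# assumption, not something the record confirms.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split 'chrom:start..end' into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return match["chrom"], start, end


def span_length(start, end):
    """Number of bases covered, assuming inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr09:4700644..4702789")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus covers 2146 bases; with half-open coordinates the span would be one base shorter.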


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136020.3 uncharacterized protein LOC101206914 [Cucumis sativus] | e value: 1.3e-68 | % identity: 88.05
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MSFSKLMASLLLLA IS S+SLVS  +ALTAYDILQQYGFPVGILP+G TGY+LNRATGEFSL+L+QKCKFKI+SYELEYK T+QGVIS+GRIRKLKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDK-AGSLVSAS
        VKI LLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESP+CGCGFDCD  AGSLVSAS
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDK-AGSLVSAS

XP_008451556.1 PREDICTED: uncharacterized protein LOC103492801 [Cucumis melo] | e value: 4.0e-70 | % identity: 89.31
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MSFSKL+ASLLLLA IS S+SL+S  K LTAYDILQQYGFPVGILP+G TGYELNRATGEFSL+LNQ+CKFKI+SYELEYK TVQGVISQGRIRKLKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDK-AGSLVSAS
        VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESP+CGCGFDCD  AGSLVSAS
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDK-AGSLVSAS

XP_022953728.1 uncharacterized protein LOC111456173 [Cucurbita moschata] | e value: 4.9e-68 | % identity: 84.81
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MSFSKLMASLL LAF+SFSS LVSAQKALTAYDI+QQYGFPVGILP+GVTGY+L+R TG FSLFL+QKCKF I+SYELEYKPTV GVISQG++ KLKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
        VKI++LWLSIVEVV+DGD+L FSVGIASANFP+DSFYESPQCGCGFDC+K GSLVSAS
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS

XP_022991260.1 uncharacterized protein At5g01610-like [Cucurbita maxima] | e value: 9.8e-69 | % identity: 85.44
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MSFSKLMASLL LAF+SFSS LVSAQKALTAYDI+QQYGFPVGILP+GVTGYEL+R TG FSLFL+QKCKF I+SYELEYKPT+ GVISQGR+ KLKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
        VKI++LWLSIVEVV+DGD+L FSVGIASANFP+DSFYESPQCGCGFDC+K GSLVSAS
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS

XP_038898643.1 uncharacterized protein LOC120086187 [Benincasa hispida] | e value: 2.7e-74 | % identity: 93.79
Query:  MSFSKLMAS---LLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLK
        M+FSKLMAS   LLLLAF+SFSS LVSAQK LTAYDILQQYGFPVGILPVGV GYELNRATGEFSLFLNQKCKFKIESYELEYK TVQGVISQGRIRKLK
Subjt:  MSFSKLMAS---LLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLK

Query:  GVSVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
        GVSVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESP+CGCGFDCDKAGSLVSAS
Subjt:  GVSVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BSV3 uncharacterized protein LOC103492801 | e value: 1.9e-70 | % identity: 89.31
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MSFSKL+ASLLLLA IS S+SL+S  K LTAYDILQQYGFPVGILP+G TGYELNRATGEFSL+LNQ+CKFKI+SYELEYK TVQGVISQGRIRKLKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDK-AGSLVSAS
        VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESP+CGCGFDCD  AGSLVSAS
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDK-AGSLVSAS

A0A6J1DLP9 uncharacterized protein At5g01610-like | e value: 6.4e-66 | % identity: 82.39
Query:  FSKLMAS---LLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGV
        +SK MAS   LLL AF+SFS+ LVSAQKALTAYDILQQYGFPVGILPVGVTGYEL+R TGEFSL+LNQKC+F IESY LEYKPT++GVISQGRIR LKGV
Subjt:  FSKLMAS---LLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGV

Query:  SVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
        +VK++LLWL+IVEVVNDG DLQFSVGIASANFP+D FYESPQCGCGFDC KAG LV+AS
Subjt:  SVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS

A0A6J1GP52 uncharacterized protein LOC111456173 | e value: 2.4e-68 | % identity: 84.81
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MSFSKLMASLL LAF+SFSS LVSAQKALTAYDI+QQYGFPVGILP+GVTGY+L+R TG FSLFL+QKCKF I+SYELEYKPTV GVISQG++ KLKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
        VKI++LWLSIVEVV+DGD+L FSVGIASANFP+DSFYESPQCGCGFDC+K GSLVSAS
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS

A0A6J1H965 uncharacterized protein At5g01610-like | e value: 5.1e-63 | % identity: 78.62
Query:  MSFSKLMASLLLLAFISFSS-SLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGV
        MSFSKL+   LLLAFIS SS  L  AQK+L+AYDILQQYGFPVGILPVGVTGYE N+ATGEFSLFLN+KC+F I+SYELEYKPTV+GVISQG I+ LKGV
Subjt:  MSFSKLMASLLLLAFISFSS-SLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGV

Query:  SVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
        SVKI+L+W +IVEVV+D D+L+FS+G+ASANFP++SFYESPQCGCGFDCDK GSLVSAS
Subjt:  SVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS

A0A6J1JSE0 uncharacterized protein At5g01610-like | e value: 4.8e-69 | % identity: 85.44
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MSFSKLMASLL LAF+SFSS LVSAQKALTAYDI+QQYGFPVGILP+GVTGYEL+R TG FSLFL+QKCKF I+SYELEYKPT+ GVISQGR+ KLKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
        VKI++LWLSIVEVV+DGD+L FSVGIASANFP+DSFYESPQCGCGFDC+K GSLVSAS
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G02813.1 Protein of unknown function, DUF538 | e value: 1.9e-33 | % identity: 45.77
Query:  MASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVSVKIVLL
        M+  ++   +  +S  VS QK  + Y +L+ Y  P GILP GV  Y+LNR TG F +  N  C+F I+SY+++YKP + G+I++GR+ +L GVSVK++  
Subjt:  MASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVSVKIVLL

Query:  WLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDC
        W++I EV  DGDD++F VG AS  F    F +SP+CGCGF+C
Subjt:  WLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDC

AT1G02816.1 Protein of unknown function, DUF538 | e value: 7.1e-41 | % identity: 60
Query:  TAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIE-SYELEYKPTVQGVISQGRIRKLKGVSVKIVLLWLSIVEVVNDGDDLQFSVGIAS
        TAY +LQ Y FPVGILP GV  Y+L+++TG+F  + N+ C F ++ SY+L+YK T+ G IS+ +I KL GV VK++ LWL+IVEV+ +GD+L+FSVGI S
Subjt:  TAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIE-SYELEYKPTVQGVISQGRIRKLKGVSVKIVLLWLSIVEVVNDGDDLQFSVGIAS

Query:  ANFPLDSFYESPQCGCGFDC
        ANF +D FYESPQCGCGFDC
Subjt:  ANFPLDSFYESPQCGCGFDC

AT4G02360.1 Protein of unknown function, DUF538 | e value: 2.5e-38 | % identity: 53.69
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS
        MS      S++   F + S + VS QK  TAYD ++ Y  P GILP GV  YELN  TG F ++ N  C+F I+SY+L+YK T+ GVIS G ++ LKGVS
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVS

Query:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCD
        VK++  W++I EV  DG DL FSVGIASA+FP  +F ESPQCGCGFDC+
Subjt:  VKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCD

AT4G02370.1 Protein of unknown function, DUF538 | e value: 1.3e-37 | % identity: 50
Query:  LMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFK-IESYELEYKPTVQGVISQGRIRKLKGVSVKIV
        L+AS L L+ ++ +    +     TAY +LQ Y FPVGILP GV  Y+L+  TG+F  + N  C F  + SY+L YK T+ G IS+ +++KL GV VK++
Subjt:  LMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFK-IESYELEYKPTVQGVISQGRIRKLKGVSVKIV

Query:  LLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDC
         LWL+IVEV+ +GD+++FSVGI SANF +  F ESPQCGCGF+C
Subjt:  LLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDC

AT5G19590.1 Protein of unknown function, DUF538 | e value: 7.6e-19 | % identity: 37.5
Query:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKI--ESYELEYKPTVQGVISQGRIRKLKG
        M  S  + S+L+L  IS        +K   A+  L  +GFP+G+LP+ V  Y LN+ +G+FSLFLN  CK  +  ++Y   Y   V G ISQG+I +L+G
Subjt:  MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKI--ESYELEYKPTVQGVISQGRIRKLKG

Query:  VSVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQC
        + V+      SI  + + GD+L F V   +A +P  +F ES  C
Subjt:  VSVKIVLLWLSIVEVVNDGDDLQFSVGIASANFPLDSFYESPQC


Sequences
CDS sequence
ATGAGTTTCTCTAAACTTATGGCTTCTCTTCTTCTTCTTGCTTTCATCTCATTCTCAAGCTCTCTCGTCTCTGCTCAAAAAGCTCTTACCGCCTACGATATTCTCCAACA
GTACGGCTTCCCCGTCGGCATTCTTCCCGTCGGTGTAACCGGCTACGAATTGAACAGAGCCACCGGAGAATTTAGCCTCTTCTTGAATCAGAAGTGCAAATTCAAGATCG
AATCGTACGAATTGGAGTACAAACCTACCGTCCAGGGCGTGATTTCCCAAGGCAGAATCAGGAAACTGAAAGGTGTTTCTGTTAAGATCGTCCTGCTGTGGCTCAGCATC
GTGGAAGTCGTTAACGATGGTGACGACCTTCAGTTCTCTGTTGGAATTGCCTCTGCCAATTTTCCACTTGACAGTTTCTATGAGTCGCCGCAGTGTGGATGTGGATTTGA
CTGCGATAAGGCTGGAAGCTTGGTTTCGGCTTCTTGA
mRNA sequence
TGCAGTCGAGGTTGTACCAAACTTTTTCTATTTAATAGTTTTAATGGATAATATTGGAGAAATGTTTTATTTTTTGCCCATATATAAATAATAGGAGAAAAAAAAATCCA
TTAAGAAGAGTCATTAAGATCCCCCTTCTTCTTCATTCTCAATCCAAATCAAAAGCAAAACAACTCAGAACTTGAAGAAGAAGAAAGAAGAAGAAAGAAGAAGAAGAAGA
CGAAGAAGAAACCTCAAAAATCGAAGGAAGAAGATGAGTTTCTCTAAACTTATGGCTTCTCTTCTTCTTCTTGCTTTCATCTCATTCTCAAGCTCTCTCGTCTCTGCTCA
AAAAGCTCTTACCGCCTACGATATTCTCCAACAGTACGGCTTCCCCGTCGGCATTCTTCCCGTCGGTGTAACCGGCTACGAATTGAACAGAGCCACCGGAGAATTTAGCC
TCTTCTTGAATCAGAAGTGCAAATTCAAGATCGAATCGTACGAATTGGAGTACAAACCTACCGTCCAGGGCGTGATTTCCCAAGGCAGAATCAGGAAACTGAAAGGTGTT
TCTGTTAAGATCGTCCTGCTGTGGCTCAGCATCGTGGAAGTCGTTAACGATGGTGACGACCTTCAGTTCTCTGTTGGAATTGCCTCTGCCAATTTTCCACTTGACAGTTT
CTATGAGTCGCCGCAGTGTGGATGTGGATTTGACTGCGATAAGGCTGGAAGCTTGGTTTCGGCTTCTTGAAACTAGGTGTTAAATAGAAGGTGAATGTAGCACCGTAATA
CTAGAAATTTTCTCGAGAGAAACCCCTTCTGGAGTTCAAGTTTCAGCCACAGTTTGCATTACCAAATTGTTTTTAGGTAAGTTAATCCAAACTCTATGACAAAATCACTT
GATTTAAATTTCTTGGAAAGTTCGTCAATTTCCCTTCAAATTACTAGCCGGGAATCGTTCCTGGTCGGATGGGAATATTGGAATTCATTGTTTGTTTGTCCCATCTGTTC
TTGGAAATGACCCCTCTATCGAATGGCAAAGTCGTTGCACGTTTTTCATGAGCATGTCCATTCCATAGGATTGATATTTGCAAAAACAGGGTAGATTTTTTTGTT
Protein sequence
MSFSKLMASLLLLAFISFSSSLVSAQKALTAYDILQQYGFPVGILPVGVTGYELNRATGEFSLFLNQKCKFKIESYELEYKPTVQGVISQGRIRKLKGVSVKIVLLWLSI
VEVVNDGDDLQFSVGIASANFPLDSFYESPQCGCGFDCDKAGSLVSAS
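
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the function name translate and the constants CDS_HEAD and CDS_TAIL are illustrative. To keep the example short, only the first 60 and last 36 nucleotides of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases in the order
# T, C, A, G and the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 60 nt and last 36 nt of the CDS listed above (both in frame).
CDS_HEAD = "ATGAGTTTCTCTAAACTTATGGCTTCTCTTCTTCTTCTTGCTTTCATCTCATTCTCAAGC"
CDS_TAIL = "TGCGATAAGGCTGGAAGCTTGGTTTCGGCTTCTTGA"

print(translate(CDS_HEAD))  # MSFSKLMASLLLLAFISFSS
print(translate(CDS_TAIL))  # CDKAGSLVSAS*
```

The two translated fragments agree with the start and end of the protein sequence, and the final TGA codon is the stop. Passing the complete CDS to the same function would extend the check to the full record.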