; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G14470 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G14470
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDUF4228 domain-containing protein
Genome locationChr2:14271088..14271960
RNA-Seq ExpressionCSPI02G14470
SyntenyCSPI02G14470
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045798.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]8.0e-4095.56Show/hide
Query:  MDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR
        MDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQ+GQIYFLLPLSLAHSPLSLPDLCNFAIKASSALRNI S  FR
Subjt:  MDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR

KAE8651930.1 hypothetical protein Csa_006329 [Cucumis sativus]1.0e-66100Show/hide
Query:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS
        MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS
Subjt:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS

Query:  LAHSPLSLPDLCNFAIKASSALRNIQSSLFR
        LAHSPLSLPDLCNFAIKASSALRNIQSSLFR
Subjt:  LAHSPLSLPDLCNFAIKASSALRNIQSSLFR

KAG6571421.1 hypothetical protein SDJN03_30336, partial [Cucurbita argyrosperma subsp. sororia]3.5e-4372.73Show/hide
Query:  MGVCASTQT-PLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPL
        MGVCAST T  ++ NPK G++  +QQQ      NLDSIKV+HMDGF++EFSDPIKASKITS NPN  LCNS++M IGS VPSLS DENLQ+GQIYFLLPL
Subjt:  MGVCASTQT-PLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPL

Query:  SLAHSPLSLPDLCNFAIKASSALRNIQSSLFR
        SLA SPLSLPDLCNFAIKASSAL  +QS  FR
Subjt:  SLAHSPLSLPDLCNFAIKASSALRNIQSSLFR

XP_008457799.2 PREDICTED: uncharacterized protein LOC103497401 [Cucumis melo]1.7e-6191.79Show/hide
Query:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQ---QLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLL
        MGVCASTQTP+TRNPKCG+KF KQQQQQQ   QLLNLDSIKV+HMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQ+GQIYFLL
Subjt:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQ---QLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLL

Query:  PLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR
        PLSLAHSPLSLPDLCNFAIKASSALRNI S  FR
Subjt:  PLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR

XP_022158695.1 uncharacterized protein LOC111025158 [Momordica charantia]9.8e-3866.67Show/hide
Query:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS
        MGVC STQT  T + + G+       Q + +    +IK+VHMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P  
Subjt:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS

Query:  LAHSPLSLPDLCNFAIKASSALRNIQ-SSLFR
         AH+PLSLPDLCN AIKASSALRN + +SLFR
Subjt:  LAHSPLSLPDLCNFAIKASSALRNIQ-SSLFR

TrEMBL top hitse value%identityAlignment
A0A0A0LJJ9 Uncharacterized protein4.9e-67100Show/hide
Query:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS
        MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS
Subjt:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS

Query:  LAHSPLSLPDLCNFAIKASSALRNIQSSLFR
        LAHSPLSLPDLCNFAIKASSALRNIQSSLFR
Subjt:  LAHSPLSLPDLCNFAIKASSALRNIQSSLFR

A0A1S3C6C0 uncharacterized protein LOC1034974018.0e-6291.79Show/hide
Query:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQ---QLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLL
        MGVCASTQTP+TRNPKCG+KF KQQQQQQ   QLLNLDSIKV+HMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQ+GQIYFLL
Subjt:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQ---QLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLL

Query:  PLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR
        PLSLAHSPLSLPDLCNFAIKASSALRNI S  FR
Subjt:  PLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR

A0A5A7TSG7 DUF4228 domain-containing protein3.9e-4095.56Show/hide
Query:  MDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR
        MDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQ+GQIYFLLPLSLAHSPLSLPDLCNFAIKASSALRNI S  FR
Subjt:  MDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSALRNIQSSLFR

A0A6J0ZM23 uncharacterized protein LOC1104100652.7e-2552.38Show/hide
Query:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLL---NLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLL
        MG CAS+    T NPK G        +    +   +  S KVVH+DG ++EF  PI+A  + S+NPN FLC+SE M +G+CVP L  DE LQ GQIYFLL
Subjt:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLL---NLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLL

Query:  PLSLAHSPLSLPDLCNFAIKASSALR
        PLS +  PLSLPDLC+ AIKASS +R
Subjt:  PLSLAHSPLSLPDLCNFAIKASSALR

A0A6J1DWJ0 uncharacterized protein LOC1110251584.7e-3866.67Show/hide
Query:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS
        MGVC STQT  T + + G+       Q + +    +IK+VHMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P  
Subjt:  MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLS

Query:  LAHSPLSLPDLCNFAIKASSALRNIQ-SSLFR
         AH+PLSLPDLCN AIKASSALRN + +SLFR
Subjt:  LAHSPLSLPDLCNFAIKASSALRNIQ-SSLFR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21010.1 unknown protein1.3e-1132.35Show/hide
Query:  SIKVVHMDGFIEEFSDPIKASKI-------------TSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL
        ++K+V ++G + E++ P+ AS++             +SR  ++F+C+S+ +     +P++ S+E LQ  QIYF+LP+S   S L+  D+   A+KAS A+
Subjt:  SIKVVHMDGFIEEFSDPIKASKI-------------TSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL

Query:  RN
        +N
Subjt:  RN

AT2G23690.1 unknown protein5.6e-1542.35Show/hide
Query:  KVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL
        K++  DG + EF+ P+K   +  +NP  F+CNS+ M   + V ++S+DE  Q+GQ+YF LPLS  H  L   ++   A+KASSAL
Subjt:  KVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL

AT3G50800.1 unknown protein8.1e-1438.64Show/hide
Query:  DSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL
        ++ K++  DG ++EFS P+K  +I  +NP  F+CNS+ M     V ++   E+L+ G++YF+LPL+  + PL   ++   A+KASSAL
Subjt:  DSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL

AT4G37240.1 unknown protein2.1e-1442.35Show/hide
Query:  KVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL
        K++  DG + EF++P+K   +  + P  F+CNS+ M     V ++S+DE LQ+GQIYF LPL     PL   ++   A+KASSAL
Subjt:  KVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL

AT5G66580.1 unknown protein1.0e-1640.43Show/hide
Query:  QQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL
        ++ L  DS K++ +DG ++EFS P+K  +I  +NP  F+CNS++M     V +++ +E L+ GQ+YF+LPL+  + PL   ++   A+KASSAL
Subjt:  QQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPDLCNFAIKASSAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGTTTGTGCTTCCACTCAAACGCCCCTAACAAGGAATCCCAAATGCGGAGTCAAATTCCCCAAACAACAACAACAACAGCAACAATTATTGAATCTAGATTCAAT
CAAGGTCGTTCACATGGACGGCTTCATCGAGGAATTTTCCGATCCAATCAAAGCCTCCAAAATCACATCTCGTAACCCTAATTTCTTCCTCTGTAACTCAGAGCAAATGT
TAATCGGTAGCTGTGTTCCTTCTCTTTCCAGCGACGAAAATCTTCAAATGGGTCAAATCTACTTCCTCCTCCCTCTCTCTCTCGCTCATTCCCCTCTTTCTCTCCCTGAT
CTCTGCAATTTCGCCATTAAAGCTTCCTCTGCTCTTCGAAACATTCAATCTTCCTTATTCAGGTAA
mRNA sequenceShow/hide mRNA sequence
TAAAAAGGGAATCTCTTCTTCTCAAAATGAATGAAGAAGAAGAAGACCATAAGAACAATTGATAAAAGCAAAGAGAAAAAATACATCAAATTAAAATCACTTCTTCCTTC
CTTTCCCATTTAGTTGTTCTACAAATCCATACCCAAAATGGGGGTTTGTGCTTCCACTCAAACGCCCCTAACAAGGAATCCCAAATGCGGAGTCAAATTCCCCAAACAAC
AACAACAACAGCAACAATTATTGAATCTAGATTCAATCAAGGTCGTTCACATGGACGGCTTCATCGAGGAATTTTCCGATCCAATCAAAGCCTCCAAAATCACATCTCGT
AACCCTAATTTCTTCCTCTGTAACTCAGAGCAAATGTTAATCGGTAGCTGTGTTCCTTCTCTTTCCAGCGACGAAAATCTTCAAATGGGTCAAATCTACTTCCTCCTCCC
TCTCTCTCTCGCTCATTCCCCTCTTTCTCTCCCTGATCTCTGCAATTTCGCCATTAAAGCTTCCTCTGCTCTTCGAAACATTCAATCTTCCTTATTCAGGTAATCGTGTA
ATTCGTTGGATTTGCTTGCTTGACAATGTGATGGGTTTCGATGTCGGAGTCATTTTAGTTAATTGTTATTATTGTTGATCGGATTATTACAAAGTCAAGTGGTGAATTCA
GAAAATGGATCGATGGATTCGTTCTTTTTTTCATATTTGTTATTATGATTATTGTTCCGTTTTTTTGTTGGCGGAATTAGATTGATTGAAGAATTGTTGTAATAGTTTTG
TTTATTTGATTTGTTAGTGTGTGTGTATGAAAAGGAAATTGAAAAAGCAAAAAGCTCCTTTGGGATTCTCATATACTAAACGACACCGTTAATAATTCAAATC
Protein sequenceShow/hide protein sequence
MGVCASTQTPLTRNPKCGVKFPKQQQQQQQLLNLDSIKVVHMDGFIEEFSDPIKASKITSRNPNFFLCNSEQMLIGSCVPSLSSDENLQMGQIYFLLPLSLAHSPLSLPD
LCNFAIKASSALRNIQSSLFR