CuGenDBv2

CSPI01G04470 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G04470
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DUF4228 domain protein
Genome location: Chr1:2801007..2801768
RNA-Seq Expression: CSPI01G04470
Synteny: CSPI01G04470
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
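
The genome location field can be split into chromosome, start, and end for downstream use. The sketch below is only an illustration: the function names `parse_location` and `span_length` are hypothetical, the `<chromosome>:<start>..<end>` layout is inferred from this single record, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

# Assumed layout of the "Genome location" field: <chromosome>:<start>..<end>
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr1:2801007..2801768")
print(chrom, start, end, span_length(start, end))  # Chr1 2801007 2801768 762
```

Under the inclusive-coordinate assumption the gene spans 762 bp.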


Homology
GenBank top hits | e value | % identity
KAE8652475.1 hypothetical protein Csa_013772 [Cucumis sativus] | 4.6e-72 | 99.3
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQS
        ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQ+
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQS
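
The % identity values in these tables appear to follow from the middle line of each alignment. The sketch below assumes the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides by the aligned length; the function name `percent_identity` is hypothetical. The middle line is padded to the query length because trailing spaces may have been lost when the page was saved.

```python
def percent_identity(query, midline):
    """Percent of aligned columns whose middle-line character is a letter.

    `query` is the concatenated Query lines of one hit and `midline` the
    concatenated middle lines, both without the "Query:" prefix.
    """
    if not query:
        raise ValueError("empty alignment")
    # Restore trailing spaces that text extraction may have stripped.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# Second block of the alignment above: 42 identical columns out of 43.
query_block = "ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQS"
mid_block = "ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQ+"
print(round(percent_identity(query_block, mid_block), 2))
```

Taken together with the first block (100 identical columns out of 100), this hit has 142 identities over 143 aligned columns, which rounds to the 99.3 listed for it.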

XP_004137354.1 uncharacterized protein LOC101203132 [Cucumis sativus] | 6.0e-88 | 100
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT

XP_008453692.1 PREDICTED: uncharacterized protein LOC103494340 [Cucumis melo] | 9.9e-83 | 96.47
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPS+ASNGA KVLSLDG LQSF+KPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDD  ENQSSKAVKRLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT

XP_022135244.1 uncharacterized protein LOC111007254 [Momordica charantia] | 2.1e-69 | 83.04
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPSM SNGA KVLSLDG+L+S++KPV AAELMIE+SGKFLCDS DLKVGHRIQGLLPDEDLE RRLYFLLPMDLLYSVLTLEEMSSLT+IATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDD-DRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALKQGNSSGFGRIFPVLISE C  P++V  LK E  D + EN + K V+RLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDD-DRENQSSKAVKRLMSKQRSWKPALETIAETSCT

XP_038879989.1 uncharacterized protein LOC120071684 [Benincasa hispida] | 9.6e-78 | 91.76
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNS SCAPSMASNGA KVLSLDG+LQSF+KPV AAELMIEHSGKFLCDSSDLK+GHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSL+FIATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALK GNSSGFGRIFPVLISE C SPADV  LKLE D DRENQSSKAVKRLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT

TrEMBL top hits | e value | % identity
A0A0A0LSX3 Uncharacterized protein | 2.9e-88 | 100
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT

A0A1S3BY23 uncharacterized protein LOC103494340 | 4.8e-83 | 96.47
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPS+ASNGA KVLSLDG LQSF+KPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDD  ENQSSKAVKRLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT

A0A5D3E1W3 DUF4228 domain protein | 4.8e-83 | 96.47
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPS+ASNGA KVLSLDG LQSF+KPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDD  ENQSSKAVKRLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT

A0A6J1C0W6 uncharacterized protein LOC111007254 | 1.0e-69 | 83.04
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSASCAPSM SNGA KVLSLDG+L+S++KPV AAELMIE+SGKFLCDS DLKVGHRIQGLLPDEDLE RRLYFLLPMDLLYSVLTLEEMSSLT+IATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDD-DRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALKQGNSSGFGRIFPVLISE C  P++V  LK E  D + EN + K V+RLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDD-DRENQSSKAVKRLMSKQRSWKPALETIAETSCT

A0A6J1HLR9 uncharacterized protein LOC111465363 | 3.0e-69 | 82.94
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MGNSA CAPSMA NGA KVL+LDG LQSF+KPV AAELMIEHSGKFLCDS DLKVGHRI GLL +E LE RRLYFLLPMDLLYSVLT+EEMSSLT+IATK
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
        ALK+GNSSGFGRIFPVLISE C  P+DV GLK  D D   NQSSK V+RLMSKQRSWKPALETIAETSCT
Subjt:  ALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G18290.1 unknown protein | 4.2e-23 | 39.33
Query:  MGNSASCAP----SMASNGAPKVLS-LDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRR-LYFLLPMDLLYSVLTLEEMSSL
        MGN++SCAP    + +S+G  K+L+   G L+ FSKP+  ++++  HSG F+ DS+ L++ HR+  + PDE L  RR LY LLP D+L+SVLT EE+S +
Subjt:  MGNSASCAP----SMASNGAPKVLS-LDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRR-LYFLLPMDLLYSVLTLEEMSSL

Query:  TFIATKALKQGNSSGFGRIFPVLI----SEFCNSPADVKGLKLEDD-DDRENQSSKAVKRLMSKQRSWKPALETIAET
        +  A + L +   +   RIFPV I     +   +P+ V   +  D  + RE    K +    SK  SW+P LETI E+
Subjt:  TFIATKALKQGNSSGFGRIFPVLI----SEFCNSPADVKGLKLEDD-DDRENQSSKAVKRLMSKQRSWKPALETIAET

AT3G03280.1 unknown protein | 1.8e-10 | 33.14
Query:  MGNSASCAPSMASNG-APKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWR--RLYFLLPMDLLYSVLTLEEMSSLTFI
        MGN  SCA +  S+    KV+  DG ++    P  AAELM+E    FL D+  +KVG +   L  D+DL+     +Y   PM    S     +M+ L   
Subjt:  MGNSASCAPSMASNG-APKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWR--RLYFLLPMDLLYSVLTLEEMSSLTFI

Query:  ATKALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAE
          K  K   + G  R+ P       N    + G KL  +D  E  +++ + R+ S  +S KP LETIAE
Subjt:  ATKALKQGNSSGFGRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAE

AT3G50800.1 unknown protein | 3.8e-08 | 32.61
Query:  KVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATKALKQGNSSG
        K++  DG LQ FS PV   +++ ++   F+C+S D+     +  +   EDL    LYF+LP+  L   L  +EM++L   A+ AL +    G
Subjt:  KVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATKALKQGNSSG

AT4G37240.1 unknown protein | 1.2e-09 | 30.84
Query:  CAPSMASNGA-PKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATKALKQG
        C+ S ++  A  K++  DGR+  F+ PV    +++++   F+C+S D+     +  +  DE+L+  ++YF LP+  L   L  EEM++L   A+ AL +G
Subjt:  CAPSMASNGA-PKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATKALKQG

Query:  NSSGFGR
           G  R
Subjt:  NSSGFGR

AT5G66580.1 unknown protein | 4.5e-09 | 33.64
Query:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK
        MG  AS   S+ S+ A K++ LDG LQ FS PV   +++ ++   F+C+S ++     +  +  +E+L   +LYF+LP+  L   L  EEM++L   A+ 
Subjt:  MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATK

Query:  ALKQGNSSGF
        AL +    G+
Subjt:  ALKQGNSSGF


Sequences
CDS sequence
ATGGGGAATTCAGCTTCTTGTGCTCCTTCAATGGCCTCCAATGGCGCCCCAAAGGTTTTATCCTTAGATGGAAGATTACAGAGCTTCTCAAAGCCAGTGACGGCCGCCGA
ACTAATGATCGAGCATTCCGGTAAATTCCTATGCGATTCCAGCGATCTTAAAGTCGGCCATCGGATTCAAGGTCTATTACCGGATGAAGATCTGGAATGGCGGCGATTAT
ACTTTCTTCTTCCGATGGATCTTCTTTACTCTGTTCTAACACTGGAAGAAATGAGTTCTCTGACTTTCATCGCTACAAAGGCTTTGAAACAGGGAAATTCGAGCGGATTT
GGGAGAATTTTTCCTGTTTTAATCAGTGAGTTTTGTAATTCTCCGGCGGATGTGAAGGGATTGAAATTGGAAGATGATGATGATCGAGAGAATCAAAGTTCGAAGGCGGT
AAAGAGATTGATGTCGAAACAGAGATCGTGGAAGCCGGCGCTTGAAACAATTGCTGAAACTTCGTGCACATAG
mRNA sequence
CACTTTTTTTTCCCTCTCTATTTCTCTGTCGTATTCATGTATAAATTGAAGCATTGGGGGGAGAGATTCCATTATCGGCGCAGTTCCCATTTTCTTCCACAACGCTATTT
CAATGGGGAATTCAGCTTCTTGTGCTCCTTCAATGGCCTCCAATGGCGCCCCAAAGGTTTTATCCTTAGATGGAAGATTACAGAGCTTCTCAAAGCCAGTGACGGCCGCC
GAACTAATGATCGAGCATTCCGGTAAATTCCTATGCGATTCCAGCGATCTTAAAGTCGGCCATCGGATTCAAGGTCTATTACCGGATGAAGATCTGGAATGGCGGCGATT
ATACTTTCTTCTTCCGATGGATCTTCTTTACTCTGTTCTAACACTGGAAGAAATGAGTTCTCTGACTTTCATCGCTACAAAGGCTTTGAAACAGGGAAATTCGAGCGGAT
TTGGGAGAATTTTTCCTGTTTTAATCAGTGAGTTTTGTAATTCTCCGGCGGATGTGAAGGGATTGAAATTGGAAGATGATGATGATCGAGAGAATCAAAGTTCGAAGGCG
GTAAAGAGATTGATGTCGAAACAGAGATCGTGGAAGCCGGCGCTTGAAACAATTGCTGAAACTTCGTGCACATAGAAGGAAAACAGAAGGGGAATTCGGAATAATGGGGA
AAAATGTATGAGATTAATTTACATTTACGAATAATTGGGTAGCGTTTTTGGTTGGATCAATGGTAGTTGATAGAGATGAGATGAGTGGATAAATAAACAGAG
Protein sequence
MGNSASCAPSMASNGAPKVLSLDGRLQSFSKPVTAAELMIEHSGKFLCDSSDLKVGHRIQGLLPDEDLEWRRLYFLLPMDLLYSVLTLEEMSSLTFIATKALKQGNSSGF
GRIFPVLISEFCNSPADVKGLKLEDDDDRENQSSKAVKRLMSKQRSWKPALETIAETSCT
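
The protein sequence should correspond to the conceptual translation of the CDS. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 and last 18 nucleotides of the CDS listed above.
print(translate("ATGGGGAATTCAGCTTCTTGTGCTCCTTCAATGGCC"))  # MGNSASCAPSMA
print(translate("GAAACTTCGTGCACATAG"))                    # ETSCT
```

Both fragments agree with the start and end of the protein sequence listed above. Passing the full CDS (with its line breaks) to `translate` allows the same comparison over the whole sequence.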