CuGenDBv2

CSPI01G17810 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G17810
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: UBP1-associated proteins 1C-like isoform X3
Genome location: Chr1:13377283..13378752
RNA-Seq Expression: CSPI01G17810
Synteny: CSPI01G17810
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
                  IPR036236 - Zinc finger C2H2 superfamily
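
As a rough illustration of how the genome location field above could be used programmatically, the following Python sketch splits a `Chr:start..end` string into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

# Pattern for location strings of the form "Chr1:13377283..13378752".
LOCATION_RE = re.compile(r"^(\S+?):(\d+)\.\.(\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a location string into (sequence name, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1


print(parse_location("Chr1:13377283..13378752"))
print(span_length("Chr1:13377283..13378752"))
```

Under the inclusive-coordinate assumption, the span for this gene is 1470 bp.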


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8653058.1 hypothetical protein Csa_020056, partial [Cucumis sativus] (e-value 5.4e-39, identity 98.89%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREI
        MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIR+I
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREI

XP_008443847.1 PREDICTED: uncharacterized protein LOC103487343 [Cucumis melo] (e-value 4.3e-97, identity 68.98%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI
        MNF FRAIDNKSPATAASTS QP+Q                                           DDSL EELWKQRIKEEIT+REIVR+RM+EAEI
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI

Query:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLNLKK--NRSEDVMNRLITLKLEERPDSGKFIGKRKAEGYQ
        RRE+L+ERELAIRR RGQTEGLLSFDNQFLVRFMN  VN I DPSS  ALLAVPGSN SLNLK+     ED MN+LITL   ERPD GKF+GKRKA G Q
Subjt:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLNLKK--NRSEDVMNRLITLKLEERPDSGKFIGKRKAEGYQ

Query:  EATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPMNNIDKNLVVQ
        EA A +GG GERDQ QTIIPTPWIGSKKLAK+EFVCSMCNVKATSEISFNAHINGKKH+AKEGR QVQQTT +EPNQ EED KEKLDHP+ NIDKN  ++
Subjt:  EATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPMNNIDKNLVVQ

Query:  KKK
        K K
Subjt:  KKK

XP_022926958.1 uncharacterized protein LOC111433915 [Cucurbita moschata] (e-value 1.1e-15, identity 60.22%)
Query:  DDSLKEELWKQRIKEEITIREIVRQRMMEAEIRREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLN
        DDS   EL KQRIKEEIT RE   +RM+EAEIRRE+++E+EL+I R  G+TEG L+FD  F +R ++ R+N I D SS R LLAVPGS  SLN
Subjt:  DDSLKEELWKQRIKEEITIREIVRQRMMEAEIRREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLN

XP_037494531.1 uncharacterized protein LOC105633612 [Jatropha curcas] (e-value 3.2e-15, identity 32.75%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI
        M F FRA+D +   T  S+S     G  S               Q F  A  +I+ P        +  ++ L+ E+ K+RI+EEI   EIVR+RM+EAE+
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI

Query:  RREILLERELAIRRVRGQTEGLLSFDNQFLV------------RFMNGRVN---SIPDPSSFRALLAVPGSNPSLNLK--KNRSEDVMNRLITLKLEERP
        RRE+++ERE+A   +R   EG LSF+ +  +            +F N R+      P    F   L  P  + +L     K  SE   +RLI L    +P
Subjt:  RREILLERELAIRRVRGQTEGLLSFDNQFLV------------RFMNGRVN---SIPDPSSFRALLAVPGSNPSLNLK--KNRSEDVMNRLITLKLEERP

Query:  DSGKFIGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTR
        D      KRKA     AT   G  GE   +         G KK  K+E+ C++C V ATSE   N H+ GKKH+AKE R +  +  +
Subjt:  DSGKFIGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTR

XP_038880353.1 zinc finger protein 385B [Benincasa hispida] (e-value 7.2e-44, identity 47.44%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI
        M+F FRA DNKSPATA STS                V   D + Q                        DSL  EL KQR+K+EI IREI  +RM+EAEI
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI

Query:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNG-RVN-SIPDPSSFRALLAVPGSNPS----------LNLKKNRSEDVMNRLITLKLEERPDSGKF
        RRE+++E+ELA RRV G+TEGLL FD+QF VR ++  R+N +I DP  FR LL VPGS+ S           N +    +D  N+LI L    +PD  KF
Subjt:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNG-RVN-SIPDPSSFRALLAVPGSNPS----------LNLKKNRSEDVMNRLITLKLEERPDSGKF

Query:  IGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPM
         GKRKAEG  EA        E D  Q   PT WI SKKLAK+EFVCSMCNV+ TSEISFNAH+ GKKH AKEGR+   QT    P    ED KEKLD+  
Subjt:  IGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPM

Query:  NNIDKNLVVQKK
         + DK   ++ K
Subjt:  NNIDKNLVVQKK

TrEMBL top hits (e-value, % identity, alignment)
A0A067L3P2 Uncharacterized protein (e-value 1.5e-15, identity 32.75%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI
        M F FRA+D +   T  S+S     G  S               Q F  A  +I+ P        +  ++ L+ E+ K+RI+EEI   EIVR+RM+EAE+
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI

Query:  RREILLERELAIRRVRGQTEGLLSFDNQFLV------------RFMNGRVN---SIPDPSSFRALLAVPGSNPSLNLK--KNRSEDVMNRLITLKLEERP
        RRE+++ERE+A   +R   EG LSF+ +  +            +F N R+      P    F   L  P  + +L     K  SE   +RLI L    +P
Subjt:  RREILLERELAIRRVRGQTEGLLSFDNQFLV------------RFMNGRVN---SIPDPSSFRALLAVPGSNPSLNLK--KNRSEDVMNRLITLKLEERP

Query:  DSGKFIGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTR
        D      KRKA     AT   G  GE   +         G KK  K+E+ C++C V ATSE   N H+ GKKH+AKE R +  +  +
Subjt:  DSGKFIGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTR

A0A1S3B9S7 uncharacterized protein LOC103487343 (e-value 2.1e-97, identity 68.98%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI
        MNF FRAIDNKSPATAASTS QP+Q                                           DDSL EELWKQRIKEEIT+REIVR+RM+EAEI
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI

Query:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLNLKK--NRSEDVMNRLITLKLEERPDSGKFIGKRKAEGYQ
        RRE+L+ERELAIRR RGQTEGLLSFDNQFLVRFMN  VN I DPSS  ALLAVPGSN SLNLK+     ED MN+LITL   ERPD GKF+GKRKA G Q
Subjt:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLNLKK--NRSEDVMNRLITLKLEERPDSGKFIGKRKAEGYQ

Query:  EATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPMNNIDKNLVVQ
        EA A +GG GERDQ QTIIPTPWIGSKKLAK+EFVCSMCNVKATSEISFNAHINGKKH+AKEGR QVQQTT +EPNQ EED KEKLDHP+ NIDKN  ++
Subjt:  EATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPMNNIDKNLVVQ

Query:  KKK
        K K
Subjt:  KKK

A0A5D3B800 UBP1-associated proteins 1C-like isoform X3 (e-value 2.1e-97, identity 68.98%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI
        MNF FRAIDNKSPATAASTS QP+Q                                           DDSL EELWKQRIKEEIT+REIVR+RM+EAEI
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEI

Query:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLNLKK--NRSEDVMNRLITLKLEERPDSGKFIGKRKAEGYQ
        RRE+L+ERELAIRR RGQTEGLLSFDNQFLVRFMN  VN I DPSS  ALLAVPGSN SLNLK+     ED MN+LITL   ERPD GKF+GKRKA G Q
Subjt:  RREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLNLKK--NRSEDVMNRLITLKLEERPDSGKFIGKRKAEGYQ

Query:  EATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPMNNIDKNLVVQ
        EA A +GG GERDQ QTIIPTPWIGSKKLAK+EFVCSMCNVKATSEISFNAHINGKKH+AKEGR QVQQTT +EPNQ EED KEKLDHP+ NIDKN  ++
Subjt:  EATATIGGRGERDQSQTIIPTPWIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPMNNIDKNLVVQ

Query:  KKK
        K K
Subjt:  KKK

A0A6J1EJN4 uncharacterized protein LOC111433915 (e-value 5.3e-16, identity 60.22%)
Query:  DDSLKEELWKQRIKEEITIREIVRQRMMEAEIRREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLN
        DDS   EL KQRIKEEIT RE   +RM+EAEIRRE+++E+EL+I R  G+TEG L+FD  F +R ++ R+N I D SS R LLAVPGS  SLN
Subjt:  DDSLKEELWKQRIKEEITIREIVRQRMMEAEIRREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLN

A0A6J1HN47 uncharacterized protein LOC111465139 isoform X1 (e-value 4.2e-13, identity 55.91%)
Query:  DDSLKEELWKQRIKEEITIREIVRQRMMEAEIRREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLN
        DDS   EL KQRIK +IT REI  +RM+EAE R E+++E+EL+I R  G TEG L+FD  F +R ++ R+N I D SS R LLA PGS  SLN
Subjt:  DDSLKEELWKQRIKEEITIREIVRQRMMEAEIRREILLERELAIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLN

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT2G24030.1 zinc ion binding;nucleic acid binding (e-value 1.1e-05, identity 24.43%)
Query:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDP-ITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAE
        M F +RAIDN  P  A  T+  P             + P+ P +++   P   S  Q             +++K E+ K++I++EI I E  R+R + AE
Subjt:  MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDP-ITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAE

Query:  IRREILLERELAIRRVRGQTEGLL-------------------SFDNQFLVRFMNGRVNSIPDPSSFRALLAVP------------GSNPSLNLKKNRSE
        + +E+ +ERE+AIRRV   TE  L                    F+N F  ++    ++S+ +  S+ +LL  P             +  S+      + 
Subjt:  IRREILLERELAIRRVRGQTEGLL-------------------SFDNQFLVRFMNGRVNSIPDPSSFRALLAVP------------GSNPSLNLKKNRSE

Query:  DVMNRLITLKLEERPDSGKFIGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKD
         V+NR   +  + + DS   +   K +   EAT T      ++Q   +     +G K+ A+D
Subjt:  DVMNRLITLKLEERPDSGKFIGKRKAEGYQEATATIGGRGERDQSQTIIPTPWIGSKKLAKD


Sequences
CDS sequence
ATGAATTTCACGTTCCGAGCAATAGATAACAAATCGCCGGCCACCGCCGCTTCCACCTCCCATCAACCGCTTCAAGGTAACTTAAGCTTAATCGAAATGTTAATCTTTGT
TTTACCTTCAGATCCTATTACACAAACATTCCCTCCAGCCTCCATTTCGATTATTCAACCAGTTGCTTATAAGCTTTTTATTGTAATCACTTCAGATGATTCTCTAAAGG
AGGAGCTCTGGAAACAACGGATCAAAGAAGAGATAACGATCCGAGAAATAGTGAGACAAAGAATGATGGAGGCAGAGATAAGGAGAGAGATCCTTTTGGAACGAGAACTG
GCTATTAGAAGAGTTAGGGGCCAGACGGAAGGCTTATTATCGTTTGACAATCAGTTTCTCGTGAGATTTATGAACGGAAGAGTGAATAGCATTCCGGATCCGTCATCGTT
TAGAGCTTTACTGGCGGTTCCAGGTTCCAATCCCTCGCTTAACTTAAAGAAGAACCGAAGCGAGGATGTAATGAACAGGCTAATCACTCTGAAACTGGAGGAGAGGCCAG
ACTCGGGAAAATTTATCGGGAAGAGAAAAGCAGAGGGATATCAAGAAGCAACAGCAACAATAGGAGGGCGGGGTGAAAGAGATCAAAGTCAAACCATCATTCCAACTCCT
TGGATTGGTTCAAAGAAATTAGCAAAAGACGAGTTTGTTTGTAGTATGTGCAATGTTAAGGCCACAAGTGAAATTAGCTTCAATGCCCACATAAATGGGAAAAAACACCA
AGCCAAAGAGGGTCGTACCCAAGTACAACAAACAACTCGTGAGGAACCCAACCAAAGGGAAGAAGATCATAAAGAAAAATTAGACCACCCAATGAACAACATAGATAAAA
ATCTAGTCGTTCAGAAAAAAAAAAGCTAA
mRNA sequence
ATGAATTTCACGTTCCGAGCAATAGATAACAAATCGCCGGCCACCGCCGCTTCCACCTCCCATCAACCGCTTCAAGGTAACTTAAGCTTAATCGAAATGTTAATCTTTGT
TTTACCTTCAGATCCTATTACACAAACATTCCCTCCAGCCTCCATTTCGATTATTCAACCAGTTGCTTATAAGCTTTTTATTGTAATCACTTCAGATGATTCTCTAAAGG
AGGAGCTCTGGAAACAACGGATCAAAGAAGAGATAACGATCCGAGAAATAGTGAGACAAAGAATGATGGAGGCAGAGATAAGGAGAGAGATCCTTTTGGAACGAGAACTG
GCTATTAGAAGAGTTAGGGGCCAGACGGAAGGCTTATTATCGTTTGACAATCAGTTTCTCGTGAGATTTATGAACGGAAGAGTGAATAGCATTCCGGATCCGTCATCGTT
TAGAGCTTTACTGGCGGTTCCAGGTTCCAATCCCTCGCTTAACTTAAAGAAGAACCGAAGCGAGGATGTAATGAACAGGCTAATCACTCTGAAACTGGAGGAGAGGCCAG
ACTCGGGAAAATTTATCGGGAAGAGAAAAGCAGAGGGATATCAAGAAGCAACAGCAACAATAGGAGGGCGGGGTGAAAGAGATCAAAGTCAAACCATCATTCCAACTCCT
TGGATTGGTTCAAAGAAATTAGCAAAAGACGAGTTTGTTTGTAGTATGTGCAATGTTAAGGCCACAAGTGAAATTAGCTTCAATGCCCACATAAATGGGAAAAAACACCA
AGCCAAAGAGGGTCGTACCCAAGTACAACAAACAACTCGTGAGGAACCCAACCAAAGGGAAGAAGATCATAAAGAAAAATTAGACCACCCAATGAACAACATAGATAAAA
ATCTAGTCGTTCAGAAAAAAAAAAGCTAA
Protein sequence
MNFTFRAIDNKSPATAASTSHQPLQGNLSLIEMLIFVLPSDPITQTFPPASISIIQPVAYKLFIVITSDDSLKEELWKQRIKEEITIREIVRQRMMEAEIRREILLEREL
AIRRVRGQTEGLLSFDNQFLVRFMNGRVNSIPDPSSFRALLAVPGSNPSLNLKKNRSEDVMNRLITLKLEERPDSGKFIGKRKAEGYQEATATIGGRGERDQSQTIIPTP
WIGSKKLAKDEFVCSMCNVKATSEISFNAHINGKKHQAKEGRTQVQQTTREEPNQREEDHKEKLDHPMNNIDKNLVVQKKKS
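
To check that a CDS like the one above corresponds to the listed protein sequence, it can be translated codon by codon. The sketch below is a minimal Python version, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. This choice of table is an ASSUMPTION.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted in as-is. Characters other than A, C, G, T raise a KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS shown above.
print(translate("ATGAATTTCACGTTCCGAGCA"))  # MNFTFRA
```

Passing the full CDS to `translate` and comparing the result with the protein sequence gives a quick consistency check between the two fields.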