; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G21010 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G21010
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description101 kDa malaria antigen-like
Genome locationChr1:16543013..16543876
RNA-Seq ExpressionCSPI01G21010
SyntenyCSPI01G21010
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019508.1 hypothetical protein SDJN02_18469, partial [Cucurbita argyrosperma subsp. argyrosperma]6.9e-2850.72Show/hide
Query:  MNIKLPKKFLYLFAKGALFLISF---FFIYFSL----SSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNSP
        M +K  +K L+L+A+ AL  ++F    FIYFS      S LF HTNFWF LSNTLIF+IA  S AFS P + V +A   P+ PQ+NF          NS 
Subjt:  MNIKLPKKFLYLFAKGALFLISF---FFIYFSL----SSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNSP

Query:  IPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTK-EKEEENDEFTKMTDEELNRRVEEFIERFNRQI
         P +N++++  IPL TEIS     NNP K Y RSKSEK I+R V K  K+ MRRSKTM + + T T+ EKEE   E  +M++EELN+RVEEFIERFNRQ+
Subjt:  IPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTK-EKEEENDEFTKMTDEELNRRVEEFIERFNRQI

Query:  RLQEMNE
        RLQ + +
Subjt:  RLQEMNE

XP_008444445.1 PREDICTED: 101 kDa malaria antigen-like [Cucumis melo]2.3e-8480.89Show/hide
Query:  EFSAPKSVVMNIKLPKKFLYLFAKGALFLI----SFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVN
        EFSAPKSVVMNIKLPKKFL+LFAK ALFLI     FFF+YFSLSSD FNHTNFWFFLSNTLIF+IALDSGAFSSPSSF+PAAKPNP+SPQHNFNNTIVVN
Subjt:  EFSAPKSVVMNIKLPKKFLYLFAKGALFLI----SFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVN

Query:  QLPNSPIPAQ------NEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEEND-------EFTKMTD
        +LPNSPIPAQ       EEEE  IPLTTEIS P KFNNPIKPYQRSKSEKDIKRM  KAKK+ M+RSKTMI+Q+ TSTKEKEEE +       EFTKMTD
Subjt:  QLPNSPIPAQ------NEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEEND-------EFTKMTD

Query:  EELNRRVEEFIERFNRQIRLQEMNE
        EELNRRVEEFIERFNRQIRLQ++NE
Subjt:  EELNRRVEEFIERFNRQIRLQEMNE

XP_011657186.2 uncharacterized protein LOC105435817 [Cucumis sativus]3.2e-110100Show/hide
Query:  MITEFSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQ
        MITEFSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQ
Subjt:  MITEFSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQ

Query:  LPNSPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERF
        LPNSPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERF
Subjt:  LPNSPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERF

Query:  NRQIRLQEMNEDENEKEDRF
        NRQIRLQEMNEDENEKEDRF
Subjt:  NRQIRLQEMNEDENEKEDRF

XP_022927241.1 uncharacterized protein LOC111434146 [Cucurbita moschata]1.8e-2850.24Show/hide
Query:  MNIKLPKKFLYLFAKGALFLISF---FFIYFSL----SSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKP--NPSSPQHNFNNTIVVNQLPN
        M +K  +KFL+L+A+ AL   +F    FIYFS      S LF HT FWF LSNTLIF+IA  S AFS P +F  AA     P+ P +NF          N
Subjt:  MNIKLPKKFLYLFAKGALFLISF---FFIYFSL----SSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKP--NPSSPQHNFNNTIVVNQLPN

Query:  SPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTK-EKEEENDEFTKMTDEELNRRVEEFIERFNR
        S  P +N +++  IPL TEIS     NNP K Y RSKSEK I+R V K  K+ MRRSKTM + + TST+ EKEE  +E  +M++EELN++VEEFIERFNR
Subjt:  SPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTK-EKEEENDEFTKMTDEELNRRVEEFIERFNR

Query:  QIRLQEMNEDE
        Q+RLQ + + E
Subjt:  QIRLQEMNEDE

XP_038895571.1 uncharacterized protein LOC120083777 [Benincasa hispida]2.3e-6370.83Show/hide
Query:  FSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNS
        FSA KS+VM IKLPKKFL+LFAK ALFL+S FFIYFSLSS+LFNHTNFWFFL+NTLIF+IA DSGAFS PSSFV A    P+ P+ +FN  +VV + PNS
Subjt:  FSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNS

Query:  PIPAQN-EEEETIIPLTTE-ISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNR
        PI  QN EEEE II LTTE  S P KFNN  K YQRSKSEK+IKR+ EKA K+ M+RSKTMI+ + T+TK+KEEE DEF +MT+EELNRRVEEFIERFNR
Subjt:  PIPAQN-EEEETIIPLTTE-ISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNR

Query:  QIRLQEMNEDENEKED
        QIRLQE+NE  NE  +
Subjt:  QIRLQEMNEDENEKED

TrEMBL top hitse value%identityAlignment
A0A0A0LUF0 Uncharacterized protein1.6e-110100Show/hide
Query:  MITEFSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQ
        MITEFSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQ
Subjt:  MITEFSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQ

Query:  LPNSPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERF
        LPNSPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERF
Subjt:  LPNSPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERF

Query:  NRQIRLQEMNEDENEKEDRF
        NRQIRLQEMNEDENEKEDRF
Subjt:  NRQIRLQEMNEDENEKEDRF

A0A1S3B9V3 101 kDa malaria antigen-like1.1e-8480.89Show/hide
Query:  EFSAPKSVVMNIKLPKKFLYLFAKGALFLI----SFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVN
        EFSAPKSVVMNIKLPKKFL+LFAK ALFLI     FFF+YFSLSSD FNHTNFWFFLSNTLIF+IALDSGAFSSPSSF+PAAKPNP+SPQHNFNNTIVVN
Subjt:  EFSAPKSVVMNIKLPKKFLYLFAKGALFLI----SFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVN

Query:  QLPNSPIPAQ------NEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEEND-------EFTKMTD
        +LPNSPIPAQ       EEEE  IPLTTEIS P KFNNPIKPYQRSKSEKDIKRM  KAKK+ M+RSKTMI+Q+ TSTKEKEEE +       EFTKMTD
Subjt:  QLPNSPIPAQ------NEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEEND-------EFTKMTD

Query:  EELNRRVEEFIERFNRQIRLQEMNE
        EELNRRVEEFIERFNRQIRLQ++NE
Subjt:  EELNRRVEEFIERFNRQIRLQEMNE

A0A6J1CEX1 uncharacterized protein LOC1110108731.2e-1741.46Show/hide
Query:  NIKLPKKFLYLFAKGALFLISFF----FIYFSLSS-----DLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNS
        N KL +++L L A+GA    +FF    F Y S+ +      LF+   FWF +SNTLIF+IA+D GAFS P   + + +   S+ +       +V + PN 
Subjt:  NIKLPKKFLYLFAKGALFLISFF----FIYFSLSS-----DLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNS

Query:  PIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNRQI
                 ET            +  N  K Y+RSKSEK     VEK +K+ MRRSKTM  +     +E  +E++EF +MTDEELNRRVEEFIERFNR+I
Subjt:  PIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNRQI

Query:  RLQEM
        RLQ++
Subjt:  RLQEM

A0A6J1EH50 uncharacterized protein LOC1114341468.8e-2950.24Show/hide
Query:  MNIKLPKKFLYLFAKGALFLISF---FFIYFSL----SSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKP--NPSSPQHNFNNTIVVNQLPN
        M +K  +KFL+L+A+ AL   +F    FIYFS      S LF HT FWF LSNTLIF+IA  S AFS P +F  AA     P+ P +NF          N
Subjt:  MNIKLPKKFLYLFAKGALFLISF---FFIYFSL----SSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKP--NPSSPQHNFNNTIVVNQLPN

Query:  SPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTK-EKEEENDEFTKMTDEELNRRVEEFIERFNR
        S  P +N +++  IPL TEIS     NNP K Y RSKSEK I+R V K  K+ MRRSKTM + + TST+ EKEE  +E  +M++EELN++VEEFIERFNR
Subjt:  SPIPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTK-EKEEENDEFTKMTDEELNRRVEEFIERFNR

Query:  QIRLQEMNEDE
        Q+RLQ + + E
Subjt:  QIRLQEMNEDE

A0A6J1KIQ0 uncharacterized protein LOC1114955971.0e-2448.57Show/hide
Query:  MNIKLPKKFLYLFAKGAL----FLISFF--FIYFSLS-SDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNSP
        M +K   K L+L+A+ AL    FL+S F  F  F+LS S LF H NFWF LSNTL+ +IA  S AFS P +F     P P  P +NF          +S 
Subjt:  MNIKLPKKFLYLFAKGAL----FLISFF--FIYFSLS-SDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNSP

Query:  IPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNRQIR
         P +N+E++  IPL TEI  P + NN  K Y RSKSEK ++R V K  K+ MRRSKTM +       E+EE  +E  +M++EELN+RVEEFIERFNRQIR
Subjt:  IPAQNEEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNRQIR

Query:  LQEMNEDENE
        LQ +  D NE
Subjt:  LQEMNEDENE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30190.1 unknown protein8.8e-0529.1Show/hide
Query:  PKSVVMNIKL-PKK-------FLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSS------FVPAAKP----------
        P  ++ N K  PKK        L +F     +++ F     SLSS +F  T   FF+SNTLI +IA D G+FS   S      +  AA            
Subjt:  PKSVVMNIKL-PKK-------FLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSS------FVPAAKP----------

Query:  ------------------------NPSSPQHNFNNTIVVNQLPNSPIPAQNEEE-------ETIIPLTTEI---SFPCKFN---NPIKPYQRSKSEKDIK
                                NP          I+    P   +   +E++       E   P+T +       C      NP KPY RSKS+K  +
Subjt:  ------------------------NPSSPQHNFNNTIVVNQLPNSPIPAQNEEE-------ETIIPLTTEI---SFPCKFN---NPIKPYQRSKSEKDIK

Query:  RMVEKAKKVRMRRSKTMIKQNGTS---TKEK----EEENDEFTKMTDEELNRRVEEFIERFNRQIRLQ
        + +    +   R+S    K + +      EK    +EE++EF+K+++EELN+RVEEFI+RFNRQIR Q
Subjt:  RMVEKAKKVRMRRSKTMIKQNGTS---TKEK----EEENDEFTKMTDEELNRRVEEFIERFNRQIRLQ

AT2G34610.1 unknown protein5.1e-0543.24Show/hide
Query:  KPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNRQIRLQ
        K Y RS+S+     +V K KK      +   K         +EE++EF+KM++EELNRRVE+FI+RFNR I+ Q
Subjt:  KPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNRQIRLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACAGAATTTTCAGCACCAAAATCAGTAGTAATGAACATAAAATTGCCCAAAAAGTTTCTCTATTTATTTGCAAAGGGAGCTTTGTTTCTTATCTCCTTCTTCTT
CATTTACTTCTCTCTTTCTTCTGATCTCTTCAACCATACCAACTTCTGGTTTTTCCTTTCCAACACTCTCATTTTCGTCATTGCCCTCGATTCCGGCGCCTTCTCTTCGC
CGTCTAGCTTCGTCCCCGCCGCAAAACCCAACCCATCCTCCCCACAACATAACTTTAATAACACTATAGTCGTTAATCAACTTCCTAATTCACCCATACCAGCTCAAAAC
GAAGAAGAAGAAACAATAATCCCACTTACCACAGAAATTTCATTTCCTTGTAAATTCAATAATCCAATTAAACCTTACCAACGAAGCAAGTCAGAGAAAGACATCAAAAG
GATGGTAGAGAAGGCAAAAAAGGTCAGAATGAGGAGATCAAAGACAATGATAAAACAGAATGGGACATCAACGAAAGAGAAAGAAGAAGAAAATGATGAGTTTACAAAAA
TGACAGATGAAGAATTGAATAGAAGAGTGGAAGAATTTATCGAAAGATTCAATAGACAAATAAGACTCCAAGAAATGAATGAAGATGAGAATGAGAAAGAGGATAGATTT
TGA
mRNA sequenceShow/hide mRNA sequence
AACTAATTTTTCAAAACCCCAAAACAAAATCCTTCATGATCACAGAATTTTCAGCACCAAAATCAGTAGTAATGAACATAAAATTGCCCAAAAAGTTTCTCTATTTATTT
GCAAAGGGAGCTTTGTTTCTTATCTCCTTCTTCTTCATTTACTTCTCTCTTTCTTCTGATCTCTTCAACCATACCAACTTCTGGTTTTTCCTTTCCAACACTCTCATTTT
CGTCATTGCCCTCGATTCCGGCGCCTTCTCTTCGCCGTCTAGCTTCGTCCCCGCCGCAAAACCCAACCCATCCTCCCCACAACATAACTTTAATAACACTATAGTCGTTA
ATCAACTTCCTAATTCACCCATACCAGCTCAAAACGAAGAAGAAGAAACAATAATCCCACTTACCACAGAAATTTCATTTCCTTGTAAATTCAATAATCCAATTAAACCT
TACCAACGAAGCAAGTCAGAGAAAGACATCAAAAGGATGGTAGAGAAGGCAAAAAAGGTCAGAATGAGGAGATCAAAGACAATGATAAAACAGAATGGGACATCAACGAA
AGAGAAAGAAGAAGAAAATGATGAGTTTACAAAAATGACAGATGAAGAATTGAATAGAAGAGTGGAAGAATTTATCGAAAGATTCAATAGACAAATAAGACTCCAAGAAA
TGAATGAAGATGAGAATGAGAAAGAGGATAGATTTTGA
Protein sequenceShow/hide protein sequence
MITEFSAPKSVVMNIKLPKKFLYLFAKGALFLISFFFIYFSLSSDLFNHTNFWFFLSNTLIFVIALDSGAFSSPSSFVPAAKPNPSSPQHNFNNTIVVNQLPNSPIPAQN
EEEETIIPLTTEISFPCKFNNPIKPYQRSKSEKDIKRMVEKAKKVRMRRSKTMIKQNGTSTKEKEEENDEFTKMTDEELNRRVEEFIERFNRQIRLQEMNEDENEKEDRF