CuGenDBv2

Lsi02G029770 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G029770
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein CHUP1, chloroplastic
Genome location: chr02:35871782..35876677
RNA-Seq Expression: Lsi02G029770
Synteny: Lsi02G029770
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like
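
The genome location field packs chromosome, start, and end into one string. A minimal sketch of how it could be split and the locus length derived follows. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

# Hypothetical helpers; the "chrom:start..end" layout is taken from the record above.
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("chrom"), int(match.group("start")), int(match.group("end"))


def span_length(text):
    """Length of the locus in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(text)
    return end - start + 1


if __name__ == "__main__":
    location = "chr02:35871782..35876677"
    print(parse_location(location))
    print(span_length(location), "bp")
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` in `span_length` would need to be dropped.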


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0059471.1 protein CHUP1 [Cucumis melo var. makuwa] (e-value: 4.4e-301; identity: 87.73%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSSDDQGNKV+LGRGRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCIPKVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
        VDDSN+GLCPSNKHGV+KDGLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLE QLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        AQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPST KTSSNKIKFI KL+KLLRG
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS
        KGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRAEGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSNS
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS

Query:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        S RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Subjt:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

XP_004141788.1 protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] (e-value: 6.8e-294; identity: 85.56%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        ME+K +L +P+LFKFGV LAISFAG LYSRFRL NKRPPLPPPS SSSDDQGNKV+LGRGRG   RLD QG       + +NVVLFAVDAY+E CIPKVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
         DDSN+GLCPSNKHGV+KDGLL PEFQEL+KEFD +AANA  S KKNV+APR  L+TPKAYKTVE+DEYEQEIR+LKSKVKMLRERERNLEVQLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL SQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV+KLQDQEHKTN+SNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        AQIKLQKIE+LEKEIE+LRKSNLRL+IENSDLGRRLDATQFLANS+LEDQEKESLKEE ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSHTDPGDPDDS  DFPST KT SNKIKFISKLRKLL+G
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS
        KGSQQN+TLLAEKSAASVEDSDSP YS+SNSTGTNATRAEGQ IGY TP  NSS HSMDFHRL +QKEDD K EDSIRRNSDVG VNKRFV+GSD+ S+S
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS

Query:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        SYRSQ+QD ESTEKSELMKYAEVLKDTRGAKNR HRK ASIGSF
Subjt:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

XP_008462405.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo] (e-value: 4.9e-300; identity: 87.58%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSSDDQGNKV+LGRGRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCI KVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
        VDDSN+GLCPSNKHGV+KDGLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLE QLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        AQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPST KTSSNKIKFI KL+KLLRG
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS
        KGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRAEGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSNS
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS

Query:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        S RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Subjt:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

XP_031744947.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] (e-value: 7.6e-293; identity: 85.43%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPS-SSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKV
        ME+K +L +P+LFKFGV LAISFAG LYSRFRL NKRPPLPPPS SSS+DDQGNKV+LGRGRG   RLD QG       + +NVVLFAVDAY+E CIPKV
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPS-SSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKV

Query:  NVDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYG
        N DDSN+GLCPSNKHGV+KDGLL PEFQEL+KEFD +AANA  S KKNV+APR  L+TPKAYKTVE+DEYEQEIR+LKSKVKMLRERERNLEVQLLEYYG
Subjt:  NVDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYG

Query:  LKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK
        LKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL SQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV+KLQDQEHKTN+SNK
Subjt:  LKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK

Query:  DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRY
        DAQIKLQKIE+LEKEIE+LRKSNLRL+IENSDLGRRLDATQFLANS+LEDQEKESLKEE ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRY
Subjt:  DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRY

Query:  ELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLR
        ELRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSHTDPGDPDDS  DFPST KT SNKIKFISKLRKLL+
Subjt:  ELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLR

Query:  GKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN
        GKGSQQN+TLLAEKSAASVEDSDSP YS+SNSTGTNATRAEGQ IGY TP  NSS HSMDFHRL +QKEDD K EDSIRRNSDVG VNKRFV+GSD+ S+
Subjt:  GKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN

Query:  SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        SSYRSQ+QD ESTEKSELMKYAEVLKDTRGAKNR HRK ASIGSF
Subjt:  SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

XP_038898688.1 protein CHUP1, chloroplastic [Benincasa hispida] (e-value: 7.4e-309; identity: 89.92%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        M++KRDLMKP+LFKFG ALAISFAG L S+FRL NKRPPL PPSSSSSDDQ +KVDLGRGRG   RLDNQG+KAAT ASS NVV FAVDAY++ CIPKVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
         DDSN+GL PSNKHGV+KDG LLPEFQELVKEFDF+AANAGL PKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSKVK LRERERNLEVQLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF LKIESLQADNRRL SQVCDHAKSVSDLEAAKAKIKFLKKK+RYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        AQI+LQKIEELEKEIEDLRKSNL+LQIENSDL RRLDATQFLANS+LEDQEKESLKEE ERL+ ENEAL KEIEQLQAHRCAD+EELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEGIEGK IN+ DFDSDQWSSSQASSHTDPGDPDDSAVDFPST KTSSNK+KFISKLRKLLRG
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSI-RRNSDVGYVNKRFVLGSDRSSN
        KGSQQNLTLLAEKSAASVEDS SPRYSSSNS GTNATRAEGQGIGYTTPS+NSSRHSMDFHRL++QKEDDGKTEDSI RRNSDVGYVNK+FVLGSD SSN
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSI-RRNSDVGYVNKRFVLGSDRSSN

Query:  SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        SSYRSQSQD ESTEKSELMKYAEVLKDTRGAKN+  RKAASIGSF
Subjt:  SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K799 Uncharacterized protein (e-value: 3.3e-294; identity: 85.56%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        ME+K +L +P+LFKFGV LAISFAG LYSRFRL NKRPPLPPPS SSSDDQGNKV+LGRGRG   RLD QG       + +NVVLFAVDAY+E CIPKVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
         DDSN+GLCPSNKHGV+KDGLL PEFQEL+KEFD +AANA  S KKNV+APR  L+TPKAYKTVE+DEYEQEIR+LKSKVKMLRERERNLEVQLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL SQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV+KLQDQEHKTN+SNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        AQIKLQKIE+LEKEIE+LRKSNLRL+IENSDLGRRLDATQFLANS+LEDQEKESLKEE ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSHTDPGDPDDS  DFPST KT SNKIKFISKLRKLL+G
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS
        KGSQQN+TLLAEKSAASVEDSDSP YS+SNSTGTNATRAEGQ IGY TP  NSS HSMDFHRL +QKEDD K EDSIRRNSDVG VNKRFV+GSD+ S+S
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS

Query:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        SYRSQ+QD ESTEKSELMKYAEVLKDTRGAKNR HRK ASIGSF
Subjt:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

A0A1S3CGW9 protein CHUP1, chloroplastic (e-value: 2.4e-300; identity: 87.58%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSSDDQGNKV+LGRGRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCI KVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
        VDDSN+GLCPSNKHGV+KDGLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLE QLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        AQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPST KTSSNKIKFI KL+KLLRG
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS
        KGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRAEGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSNS
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS

Query:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        S RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Subjt:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

A0A5A7V182 Protein CHUP1 (e-value: 2.1e-301; identity: 87.73%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSSDDQGNKV+LGRGRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCIPKVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
        VDDSN+GLCPSNKHGV+KDGLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLE QLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        AQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRNFQPPAGKTAARDLSKTLSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPST KTSSNKIKFI KL+KLLRG
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS
        KGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRAEGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSNS
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS

Query:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
        S RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Subjt:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF

A0A6J1D049 protein CHUP1, chloroplastic isoform X3 (e-value: 3.8e-258; identity: 79.22%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN
        ME KR+L KP+L KFGV LAISFAG LYSRFR+  KRP LPPPSSSSS DQGNKVDL RGRG   +LDNQ +K                  +EM IPKVN
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVN

Query:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
        VDDSNVGLCPS+K  V+KDGL LPE QELVKE DF AANAGLS +KNV+A R  L+TPKAY   E D+YEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
Subjt:  VDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD
        KEQETAVMELQNRLKI+NMEAKLF LKIESLQADNRRL SQV DHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQQRV KL DQE+KTNESNKD
Subjt:  KEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKD

Query:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE
        A+IKL++IE+LEKE+EDLR SNLRLQIENSDL RRLDATQ LANSILED EKESLKEERERL  ENE L KEIEQLQAHRCADVEELVYLRWINACLRYE
Subjt:  AQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYE

Query:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG
        LRN+QP  GKTAARDLSKTLSPKS+EKAKKLILEYANTEGIEGKGIN++DFDSDQWSSSQASS T   D DDS VDF +TTK SSNKIKFISKLRKLL+G
Subjt:  LRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRG

Query:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS
        K SQQN  L AEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIG    SQ+SSRHSMDF RL +Q  + GK EDS+RRNSD GY NKR VLGS+R SNS
Subjt:  KGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNS

Query:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRK-AASIGSF
         +++ S D ES+EKSELMKYAEVLKD+ GAKNR HRK AASI S+
Subjt:  SYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRK-AASIGSF

A0A6J1HMC2 protein CHUP1, chloroplastic isoform X1 (e-value: 5.9e-259; identity: 77.9%)
Query:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDD-QGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKV
        ME K DL+KPVLFKFGV LAISFA  +YSRFR+ NKRP L PPSSSSSD+ + NKV+LGRGRG   +LD+Q MK AT ASSN ++L A DAY+EMCI K 
Subjt:  MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDD-QGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKV

Query:  NVDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYG
        N DDS+ G    N H V+++GLLLPEFQELVK+FD +AANAG SPKKN  A RL ++TPKAYK VE D YE EI+HLKSKVKMLRERERNLEVQLLEYYG
Subjt:  NVDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYG

Query:  LKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK
        LKEQETAVMELQNRLKI+NMEAKLF LKIESLQADNRRL SQV D AKS SDLEAA+  IKFLKKKLR+EAEQNR QI+NLQQRV KL DQE K NES K
Subjt:  LKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK

Query:  DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRY
        + QIKLQ IE+LEKEIE+L+K+N RLQ ENSDLGRRLDATQFLANSILEDQEKESLKEER+R A ENE L KEIEQLQAHRCADVEELVYLRWINACLRY
Subjt:  DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRY

Query:  ELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLR
        ELRNFQP AGKTAARDLSKTLSPKS+ KAKKLILEYANTEGIEGK IN+ DFDSDQWSSSQASSHTDPGD D SAVD   T K SSNKIKF+SKLR LLR
Subjt:  ELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLR

Query:  GKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN
        GK +QQ+  LL EKSAA+V D DSPRYSSS+STGTNATRA+G G GYTTPSQNSSR SMDFHRL++QKEDD KTEDS+RRNSDVGY+NKRFV GSDRSSN
Subjt:  GKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN

Query:  SSYRSQSQDAEST---EKSELMKYAEVLKDTRGAKNRPHRKAASIGS
        S YRS SQ+ EST   EKSEL+KYAEVLK++RG KN+  RK A + S
Subjt:  SSYRSQSQDAEST---EKSELMKYAEVLKDTRGAKNRPHRKAASIGS

SwissProt top hits (e-value, % identity, alignment)
Q9LI74 Protein CHUP1, chloroplastic (e-value: 1.4e-52; identity: 38.66%)
Query:  DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIS
        D  +LPEF++L+  E ++        P  + D      +  + Y+ VE    + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIS

Query:  NMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIED
         +E  +  + I SLQA+ ++L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +    + + + KL+ +++LE ++ +
Subjt:  NMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIED

Query:  LRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR
        L++ N  LQ E  +L  +LD+ +      +++ E  +   ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +AR
Subjt:  LRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR

Query:  DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK
        DLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D++++D  ++  +S S K   I KL+K  + K
Subjt:  DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK

Arabidopsis top hits (e-value, % identity, alignment)
AT1G52080.1 actin binding protein family (e-value: 1.7e-93; identity: 40.06%)
Query:  NKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKR-----PPLPPPSSSSS-DDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCI
        +KRD+   VL + G ALA+SFAG L++RFR   KR     PPLPP SS +   D  NK    R  G              T  ++   L  V   +E  +
Subjt:  NKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKR-----PPLPPPSSSSS-DDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCI

Query:  PKVNVDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLE
                            EKD  LLPEF+E  K+ D    +       + + PR  +  P A+ + E+ ++E EI  L++ V+ LRERER LE +LLE
Subjt:  PKVNVDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLE

Query:  YYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNE
        YY LKEQ+   MEL++RLK++ ME K+F  KI+ LQA+N +L ++  +H+K + +L+ AK++++ LKKKL    +Q+  QIL+L+QRV +LQ++E K   
Subjt:  YYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNE

Query:  SNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEK-ESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINA
         + +A   +Q++ +LE EI +L  +N RLQ EN +L  +L++ Q +ANS LE+ E+ E+L+E+  RL  ENE L K++EQLQ  RC D+E+LVYLRWINA
Subjt:  SNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEK-ESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINA

Query:  CLRYELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH--TDPGDPDDSAVDFPSTTKT-SSNKIKFIS
        CLRYELR +QPPAGKT ARDLS TLSP S+EKAK+LILEYA++E          + D D+WSSSQ  S   TD    DDS+VD    TKT  + K K + 
Subjt:  CLRYELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH--TDPGDPDDSAVDFPSTTKT-SSNKIKFIS

Query:  KLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVL
        KL K+L GK ++      ++K A S E        SS++TG            ++TP Q  S HSMDF  L       GK E+   +N  V    K    
Subjt:  KLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVL

Query:  GSDRSSNSSY-RSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAAS
         S+ + +S+Y      + +   K EL+K A+ L  +R  K + H+K+ S
Subjt:  GSDRSSNSSY-RSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAAS

AT2G36650.1 unknown protein (e-value: 3.6e-06; identity: 23.11%)
Query:  DEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLK---
        ++ +QEI  LKS+ + L+ +E  +E+    +  LK+QE  ++E ++ L +   +   F+ ++ +++ +++R  + V  + K V +++  +++   L+   
Subjt:  DEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLK---

Query:  KKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLA
        KKLR +++Q   +++N  ++++ ++ +  K  +   + + K   ++ELE +++D+      LQ E  +L            S     E  S+++ R    
Subjt:  KKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLA

Query:  GENEALAKEIEQLQAHRCADVEELVYLRWINACLRYEL
             + +E E+L+      V+E++ LRW NACLR+E+
Subjt:  GENEALAKEIEQLQAHRCADVEELVYLRWINACLRYEL

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein (e-value: 1.0e-53; identity: 38.66%)
Query:  DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIS
        D  +LPEF++L+  E ++        P  + D      +  + Y+ VE    + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIS

Query:  NMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIED
         +E  +  + I SLQA+ ++L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +    + + + KL+ +++LE ++ +
Subjt:  NMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIED

Query:  LRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR
        L++ N  LQ E  +L  +LD+ +      +++ E  +   ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +AR
Subjt:  LRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR

Query:  DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK
        DLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D++++D  ++  +S S K   I KL+K  + K
Subjt:  DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein (e-value: 1.0e-53; identity: 38.66%)
Query:  DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIS
        D  +LPEF++L+  E ++        P  + D      +  + Y+ VE    + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIS

Query:  NMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIED
         +E  +  + I SLQA+ ++L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +    + + + KL+ +++LE ++ +
Subjt:  NMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIED

Query:  LRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR
        L++ N  LQ E  +L  +LD+ +      +++ E  +   ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +AR
Subjt:  LRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR

Query:  DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK
        DLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D++++D  ++  +S S K   I KL+K  + K
Subjt:  DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein (e-value: 1.9e-36; identity: 38.24%)
Query:  DNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLG
        +++ L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +    + + + KL+ +++LE ++ +L++ N  LQ E  +L 
Subjt:  DNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLG

Query:  RRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKTLSPKSKEKAKK
         +LD+ +      +++ E  +   ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +ARDLSK LSPKS+ KAK+
Subjt:  RRLDATQ---FLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKTLSPKSKEKAKK

Query:  LILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK
        L+LEYA +E   G+G      D+D  S+    S     D D++++D  ++  +S S K   I KL+K  + K
Subjt:  LILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK


Sequences
CDS sequence
ATGGAAAACAAGAGGGATTTGATGAAGCCTGTATTATTCAAATTTGGGGTTGCTCTGGCTATCTCCTTTGCTGGTTTGCTCTATTCCCGATTCAGACTCGGAAATAAGAG
ACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGGGGAAGGGGAAGAAGACTTAGACTTGACAATCAAGGAATGAAGG
CAGCAACAACAGCATCCTCTAATAATGTTGTTCTTTTTGCAGTTGATGCCTATCAAGAAATGTGTATTCCAAAAGTCAATGTTGATGATTCAAATGTTGGTCTCTGTCCT
AGCAATAAGCATGGTGTAGAAAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTGCTGCAGCAAATGCTGGGCTTTCTCCTAAGAAAAA
TGTTGACGCACCAAGGTTGGCGCTCAAAACTCCAAAAGCTTATAAGACAGTTGAGGATGATGAATATGAACAAGAGATCAGACACCTCAAAAGCAAGGTGAAAATGCTGC
GAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCAGTAATGGAGCTCCAAAATAGGTTGAAGATTAGTAACATGGAA
GCCAAGCTTTTCAAACTCAAGATTGAGTCCCTTCAGGCAGATAACCGACGATTAGTGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCCGCAAAAGC
AAAAATTAAGTTCCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAACATAAGA
CAAATGAAAGCAATAAAGATGCCCAAATCAAGTTGCAAAAGATTGAAGAATTGGAGAAAGAGATAGAGGACTTGAGAAAGTCGAATTTGAGATTACAGATAGAAAATTCT
GATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCAGGAGAAAATGA
GGCGTTGGCTAAGGAAATTGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAACTGCGGAATT
TTCAGCCTCCAGCAGGGAAAACAGCAGCAAGAGACCTGAGCAAAACATTAAGTCCCAAATCCAAGGAGAAAGCAAAGAAGCTCATCCTCGAATATGCAAATACAGAAGGA
ATTGAAGGGAAGGGCATTAACGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCATACTGATCCTGGAGATCCGGATGATTCAGCTGTTGATTT
TCCATCAACAACCAAAACAAGTTCAAACAAAATCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCAGAAAAAT
CTGCAGCATCTGTAGAAGATAGTGATTCACCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAACTCCATCT
CAGAATTCATCAAGACATTCAATGGATTTTCACAGATTGCATACCCAAAAGGAAGATGATGGAAAAACTGAGGACTCCATTAGAAGGAATAGTGATGTTGGCTACGTGAA
CAAGAGATTTGTTTTAGGGAGCGACCGATCGAGCAACTCATCATATAGATCTCAAAGTCAGGATGCAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTT
TGAAAGACACTCGAGGAGCTAAGAACCGGCCACATAGGAAGGCTGCATCCATTGGTTCGTTTTGA
mRNA sequence
AAACGACCGAGCGATTAAACAGCTCGTTCACGTAAATCCTCAAAAACGAAAAACCATCCAAAATTAAACCATTTCCATAACAAAACCCATCTTCTTCTTCCTCATAATCC
TCATTTTAATCTACCTTTCATTATCCCTATCTCATTAAACTCTTCCAAATTGGAATCCATTAAGATCCCTTTCTTTCCCTGTTCCATCAACCAAAAATCCTTCTGGGTAG
ACCCGCTCTTGCACGTCAACATCGACACAAAACGTCGCCATTGTTTGCCAAAGAAGGAAGGAAGAAAGAAAGAAAGAAGGGATTTTTAATTGGAATTTTCTACACTTGGA
ACTGTGTTTAATCAGCCTATGACACGGACGAAAAAGCACCCAATAGAAACAACAATCGCAGTGGTCCGGCCAACGTCAAAGCGTTACTCTCCCGCTGATTTCATGCGTTC
CCAAATGCTTTTCTGTTACAAAATCAAAATGGCGGACGATGAAGACGAAGACCTATAATATTTTATGATCCTTTAGTTCATTAATTTGGTTCGATTAAAGGTTCTTGATA
TCCAATAATGGAAAACAAGAGGGATTTGATGAAGCCTGTATTATTCAAATTTGGGGTTGCTCTGGCTATCTCCTTTGCTGGTTTGCTCTATTCCCGATTCAGACTCGGAA
ATAAGAGACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGGGGAAGGGGAAGAAGACTTAGACTTGACAATCAAGGA
ATGAAGGCAGCAACAACAGCATCCTCTAATAATGTTGTTCTTTTTGCAGTTGATGCCTATCAAGAAATGTGTATTCCAAAAGTCAATGTTGATGATTCAAATGTTGGTCT
CTGTCCTAGCAATAAGCATGGTGTAGAAAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTGCTGCAGCAAATGCTGGGCTTTCTCCTA
AGAAAAATGTTGACGCACCAAGGTTGGCGCTCAAAACTCCAAAAGCTTATAAGACAGTTGAGGATGATGAATATGAACAAGAGATCAGACACCTCAAAAGCAAGGTGAAA
ATGCTGCGAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCAGTAATGGAGCTCCAAAATAGGTTGAAGATTAGTAA
CATGGAAGCCAAGCTTTTCAAACTCAAGATTGAGTCCCTTCAGGCAGATAACCGACGATTAGTGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCCG
CAAAAGCAAAAATTAAGTTCCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAA
CATAAGACAAATGAAAGCAATAAAGATGCCCAAATCAAGTTGCAAAAGATTGAAGAATTGGAGAAAGAGATAGAGGACTTGAGAAAGTCGAATTTGAGATTACAGATAGA
AAATTCTGATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCAGGAG
AAAATGAGGCGTTGGCTAAGGAAATTGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAACTG
CGGAATTTTCAGCCTCCAGCAGGGAAAACAGCAGCAAGAGACCTGAGCAAAACATTAAGTCCCAAATCCAAGGAGAAAGCAAAGAAGCTCATCCTCGAATATGCAAATAC
AGAAGGAATTGAAGGGAAGGGCATTAACGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCATACTGATCCTGGAGATCCGGATGATTCAGCTG
TTGATTTTCCATCAACAACCAAAACAAGTTCAAACAAAATCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCA
GAAAAATCTGCAGCATCTGTAGAAGATAGTGATTCACCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAAC
TCCATCTCAGAATTCATCAAGACATTCAATGGATTTTCACAGATTGCATACCCAAAAGGAAGATGATGGAAAAACTGAGGACTCCATTAGAAGGAATAGTGATGTTGGCT
ACGTGAACAAGAGATTTGTTTTAGGGAGCGACCGATCGAGCAACTCATCATATAGATCTCAAAGTCAGGATGCAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCT
GAAGTTTTGAAAGACACTCGAGGAGCTAAGAACCGGCCACATAGGAAGGCTGCATCCATTGGTTCGTTTTGAACACAAAAAAAGCTTGTCTGGCTCTCAATGGCTTCATC
ATCACCTGTTCATATGTGTAAATTAAGGACCATACCGTTTCACAAAAGATAGGAAATGTGTCAATCCTGGATTACCTGTTCTTGAACATTGCAAAAGCAACACAGAAAAC
AAAAAGAGTATTAGCTCAAATATACTCAATGTGAAGAAAAACTCCAAGAAACAATGTTCTATTGCAACATCAAGTTCCAACAGAGTCCAAACAGTCCAATATCAACTGAT
ACAATGAAATAGCAATGAAAAGATAACGAAACAGAAGAACACAATAACCCGGTTATTAGAGAGAACCGTTGCCCTCCTCCTCAAGCTTCCAGCAGCTACAACCTTTCTCT
CTTCAAACCCCTACATAAACCCCTATTCATGGTCCCCTTCTCAGGCACAATGCACACTACAAACTAAAGTTAAATAAATTATGTGAATACCCTTTTTTACCCCCTCTTTA
TGAACGTGCAAATGATCGGAGGTCTTACAGATTTCGACGTAATAATAACTAGTACAAGAACGTGCTGGAGAAACAAGAAATATTTGGAAGAAAGAGTGAATCTAAAACAG
CTCATATATGGTGATGATCAGATGAGAGCATAGCCCTGGACAATCATGGCATCAGCAAGTATCTGATTGGCAGCCTCTGATGGATGAACACTGTCCCAAAACATATATTT
GGTTGCATTAGAACATGTTTCATGAGACTTTGGATTGCACAAAACTGATGCTGTCTCCACTGCCCCAGTGCCACAACACCCTTTTCTCACTTCATCAAATCCTTGAAGGA
AACACAAAAAACACAGAGTGAATACTGTTTTGATTAGCAACCGAGTGTTTGAAAACTATTCTATTATTGATGAAGTTTTTTTGTTTGTGAAGTACCATGATTTGATGGAG
ACATAATAGCGTCATATAAAGGTTTGAAAACGTCGAAGACGACGAGCTTGAGACCGGGAAGCTGCTTTTGAAGAGTTGCAGCAGTAGAGTTGAGTTTCCTGTTGAAGACT
AGGACATCATTGTTGATTGTTCTGACACAACCCTTTTTTTGTTGGTAGTAGCCAAACTGAGTGAGTGCAGCAGGAAAACAGCCTAATGGAGGTAGGGAAGTGACCCCGAT
TTTCCTTGCTCCTAGCCCATGTAAATCCTTCATCAAAGTCACAAATCCTCCATCCAATGTTATTATGAACATAATAATTAGGGTTTTCTTTTCTTTTCGTTTCCAAGCTA
ACAAGTTATATGGAATAAAACAGCAACTTTGT
Protein sequence
MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSDDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCP
SNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNME
AKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENS
DLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEG
IEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPS
QNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNSSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF