CuGenDBv2

MC09g0238 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g0238
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein CHUP1, chloroplastic isoform X3
Genome location: MC09:2201978..2206551
RNA-Seq Expression: MC09g0238
Synteny: MC09g0238
Gene Ontology terms:
GO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like
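
As a rough illustration, the genome location string in this record can be split into its sequence ID and coordinates with a few lines of Python. The helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("MC09:2201978..2206551")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on MC09 (exons plus any introns), so it need not match the CDS length.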


Homology
GenBank top hits (E-value, % identity, alignment)
XP_022147409.1 protein CHUP1, chloroplastic isoform X1 [Momordica charantia]; E-value: 0.0; identity: 96.26%
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIK                  +MYIPKVNVDDS
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS

Query:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
        NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
Subjt:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE

Query:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
        TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
Subjt:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK

Query:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
        LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
Subjt:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY

Query:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
        ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
Subjt:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD

Query:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
        SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
Subjt:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT

Query:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_022147417.1 protein CHUP1, chloroplastic isoform X2 [Momordica charantia]; E-value: 0.0; identity: 96.1%
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS DQGNKVDLSRGRGPKLDNQAIK                  +MYIPKVNVDDS
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS

Query:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
        NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
Subjt:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE

Query:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
        TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
Subjt:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK

Query:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
        LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
Subjt:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY

Query:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
        ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
Subjt:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD

Query:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
        SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
Subjt:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT

Query:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_022147425.1 protein CHUP1, chloroplastic isoform X3 [Momordica charantia]; E-value: 0.0; identity: 97.01%
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIK                  +MYIPKVNVDDS
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS

Query:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
        NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
Subjt:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE

Query:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
        TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
Subjt:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK

Query:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNY
        LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNY
Subjt:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNY

Query:  QPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQ
        QPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQ
Subjt:  QPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQ

Query:  ALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDT
        ALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDT
Subjt:  ALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDT

Query:  ESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        ESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  ESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_022147434.1 protein CHUP1, chloroplastic isoform X4 [Momordica charantia]; E-value: 0.0; identity: 99.09%
Query:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
        MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
Subjt:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ

Query:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
        LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
Subjt:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK

Query:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
        TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
Subjt:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV

Query:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
        YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
Subjt:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI

Query:  SKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
        SKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
Subjt:  SKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG

Query:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_038898688.1 protein CHUP1, chloroplastic [Benincasa hispida]; E-value: 0.0; identity: 79%
Query:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDD
        M++KR+L KPIL KFG  LAISFAGFL S+FR+R KRP L PPSSSSS DQ +KVDL RGRGP+LDNQ +K AT+AS NV+ FA DAY K  IPKVN DD
Subjt:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDD

Query:  SNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQ
        SN+GL PS+K  VDKDGL LPE QELVKE DF AANAGL  +KNVEA RSGL+TPKAY   E D+YEQEIRHLKSKVK LRERERNLEVQLLEYYGLKEQ
Subjt:  SNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQ

Query:  ETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARI
        ETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQV DHAKSVSDLEAA+AKIKFLKKK+RYEAEQNRGQILNLQQRV KL DQE+KTNESNKDA+I
Subjt:  ETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARI

Query:  KLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRN
        +L++IE+LEKE+EDLR SNL+LQIENSDL+RRLDATQ LANS+LED EKESLKEE ERL +ENE L KEIEQLQAHRCAD+EELVYLRWINACLRYELRN
Subjt:  KLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRN

Query:  YQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQ---DDSYVDFQATTKPSSNKIKFISKLRKLLKGKDS
        +QP  GKTAARDLSKTLSPKSEEKAKKLIL+YANTEGIEGK INI DFDSDQWSSSQASS TD    DDS VDF +T K SSNK+KFISKLRKLL+GK S
Subjt:  YQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQ---DDSYVDFQATTKPSSNKIKFISKLRKLLKGKDS

Query:  QQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEY--GKPEDSVRR-NSDGGYTNKRLVLGSNRMSNSPF
        QQN  L AEKSAAS+EDS SPRYSSSNS GTNATRAEGQGIG    S++SSRHSMDF RL+SQ+   GK EDS+RR NSD GY NK+ VLGS+  SNS +
Subjt:  QQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEY--GKPEDSVRR-NSDGGYTNKRLVLGSNRMSNSPF

Query:  KTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        ++ S DTES+EKSELMKYAEVLKD+ GAKN+S RK AASI S+
Subjt:  KTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

TrEMBL top hits (E-value, % identity, alignment)
A0A5A7V182 Protein CHUP1; E-value: 0.0; identity: 78.5%
Query:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDD
        ME+K NL KP+LLKFGVVLAISFA FLYSRFR++ KRP LPPP SSSS DQGNKV+L RGRGP+LDNQ +K AT+AS NV+LFA DAY +M IPKVNVDD
Subjt:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDD

Query:  SNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQ
        SN+GLCPS+K  VDKDGL LPE QE VKE D  AANA  S +KNVEA RSGL+TPKAY   E D+YEQEIRHLKSKVKMLRERERNLE QLLEYYGLKEQ
Subjt:  SNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQ

Query:  ETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARI
        ETAVMELQNRLKINNMEAKLFT KIESL+ADNRRLESQV +HAK+VSDLEAARAKIKFLKKKLR+EAEQNR QILNLQQ+V KL DQE+KTNESNKDA+I
Subjt:  ETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARI

Query:  KLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRN
        KL++IEDLEKE+E+LR  N RLQIENSDL RRLDATQ LANS+LED EKESLKEE ERL QENE L KEIEQLQAHR ADVEELVYLRWINACLRYELRN
Subjt:  KLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRN

Query:  YQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQ---DDSYVDFQATTKPSSNKIKFISKLRKLLKGKDS
        +QP  GKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGKGI++ DFDSDQWSSSQASS TD    DDS  +F +T K SSNKIKFI KL+KLL+GK S
Subjt:  YQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQ---DDSYVDFQATTKPSSNKIKFISKLRKLLKGKDS

Query:  QQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYG--KPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFK
        QQN  L AEKSAAS+EDSDSP YSSSNSTGTNATRAEGQ IG A SS++SSR+S+DF+RL SQ+    K EDS RRNSD GY NKR VLGS++ SNS  +
Subjt:  QQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYG--KPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFK

Query:  TPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        + S DTES+EKSELMKYAEVLKD+ GAKN+SHRK AASI S+
Subjt:  TPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D049 protein CHUP1, chloroplastic isoform X3; E-value: 0.0; identity: 97.01%
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIK                  +MYIPKVNVDDS
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS

Query:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
        NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
Subjt:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE

Query:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
        TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
Subjt:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK

Query:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNY
        LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNY
Subjt:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNY

Query:  QPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQ
        QPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQ
Subjt:  QPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQ

Query:  ALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDT
        ALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDT
Subjt:  ALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDT

Query:  ESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        ESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  ESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D0X5 protein CHUP1, chloroplastic isoform X1; E-value: 0.0; identity: 96.26%
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIK                  +MYIPKVNVDDS
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS

Query:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
        NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
Subjt:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE

Query:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
        TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
Subjt:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK

Query:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
        LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
Subjt:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY

Query:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
        ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
Subjt:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD

Query:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
        SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
Subjt:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT

Query:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D100 protein CHUP1, chloroplastic isoform X4; E-value: 0.0; identity: 99.09%
Query:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
        MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
Subjt:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ

Query:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
        LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
Subjt:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK

Query:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
        TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
Subjt:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV

Query:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
        YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
Subjt:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI

Query:  SKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
        SKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
Subjt:  SKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG

Query:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D2B1 protein CHUP1, chloroplastic isoform X2; E-value: 0.0; identity: 96.1%
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS DQGNKVDLSRGRGPKLDNQAIK                  +MYIPKVNVDDS
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDS

Query:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
        NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE
Subjt:  NVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQE

Query:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
        TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK
Subjt:  TAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIK

Query:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
        LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
Subjt:  LKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY

Query:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
        ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD
Subjt:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKD

Query:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
        SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT
Subjt:  SQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKT

Query:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  PSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

SwissProt top hits (E-value, % identity, alignment)
Q3V6T2 Girdin; E-value: 1.4e-04; identity: 26.73%
Query:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNR-LKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK--
        ++ E E+ HL+ + ++L+++  NL++   +   L EQE + +E +NR LK      K  T ++ESL+ +N +L+ +  +  ++V  L+ A  K+  L+  
Subjt:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNR-LKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK--

Query:  -KKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE
         K+L  E EQ +  +  L+    K    E      + + +   K +E+  K+++ L +    L++EN  L + L+  ++ ++  LE  EKE  SL++E  
Subjt:  -KKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE

Query:  RLGQENENLMKEIEQLQ
        +L ++ + L KE ++L+
Subjt:  RLGQENENLMKEIEQLQ

Q5SNZ0 Girdin; E-value: 2.5e-04; identity: 25.81%
Query:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEA-KLFTLKIESLQADNRRLESQVSDHAKSVSDLEAA---RAKIKFL
        ++ E E+ HL  + ++L+++  NL++   E     EQE + +E +NR     +++ K  T ++ESL+ +N +L+ +  +  +SV  L+ A    A+++  
Subjt:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEA-KLFTLKIESLQADNRRLESQVSDHAKSVSDLEAA---RAKIKFL

Query:  KKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE
         K+L  E EQ R  +  ++    K    E      + + +   K +E+  K+++ L +    L++EN  L + L+  ++ ++  LE  EKE  SL++E  
Subjt:  KKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE

Query:  RLGQENENLMKEIEQLQ
        +L ++ + L KE ++L+
Subjt:  RLGQENENLMKEIEQLQ

Q9LI74 Protein CHUP1, chloroplastic; E-value: 4.9e-53; identity: 36.62%
Query:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN
        D   LPE ++L+  E ++P  +   + EK  +  +   +   AYN       + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN

Query:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED
         +E  +  + I SLQA+ ++L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +
Subjt:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED

Query:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR
        L+  N  LQ E  +L+ +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +AR
Subjt:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR

Query:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA
        DLSK LSPKS+ KAK+L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + 
Subjt:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA

Query:  SMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
        S       R SSS     N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  SMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE

Arabidopsis top hits (E-value, % identity, alignment)
AT1G52080.1 actin binding protein family; E-value: 1.9e-92; identity: 39.41%
Query:  KRNLKPILLKFGVVLAISFAGFLYSRFRIRKKR-----PRLPPPSSSSS-ADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNV
        KR++  ++L+ G  LA+SFAGFL++RFR   KR     P LPP SS +   D  NK    R  G +  +                               
Subjt:  KRNLKPILLKFGVVLAISFAGFLYSRFRIRKKR-----PRLPPPSSSSS-ADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNV

Query:  DDSNVGLCPSSKRSVD-KDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
        +++ +G+ P  +  +D KD   LPE +E  K+ D    +       + E  RS +  P A+ + E  D+E EI  L++ V+ LRERER LE +LLEYY L
Subjt:  DDSNVGLCPSSKRSVD-KDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL

Query:  KEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKD
        KEQ+   MEL++RLK+N ME K+F  KI+ LQA+N +L+++  +H+K + +L+ A+++++ LKKKL    +Q+  QIL+L+QRVA+L ++E K    + +
Subjt:  KEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKD

Query:  ARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEK-ESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY
        A   ++R+ DLE E+ +L ++N RLQ EN +L+ +L++ Q++ANS LE+PE+ E+L+E+  RL  ENE L K++EQLQ  RC D+E+LVYLRWINACLRY
Subjt:  ARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEK-ESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRY

Query:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQ--ASSLTDQ---DDSYVD-FQATTKPSSNKIKFISKLRK
        ELR YQP  GKT ARDLS TLSP SEEKAK+LILEYA++E          + D D+WSSSQ  +S +TD    DDS VD   AT    + K K + KL K
Subjt:  ELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQ--ASSLTDQ---DDSYVD-FQATTKPSSNKIKFISKLRK

Query:  LLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMS
        +L GKD++      ++K A S E        SS++TG ++T             Q  S HSMDF+ L     GK E+   +N       K    GS+   
Subjt:  LLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMS

Query:  NSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAA
            +    +T+ + K EL+K A+ L  S   K + H+K  +
Subjt:  NSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAA

AT2G36650.1 unknown protein; E-value: 6.0e-06; identity: 23.83%
Query:  EQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK---KKL
        +QEI  LKS+ + L+ +E  +E+    +  LK+QE  ++E ++ L +   +   F  ++ +++ +++R ++ V  + K V +++  R++   L+   KKL
Subjt:  EQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK---KKL

Query:  RYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQEN
        R +++Q   +++N  +++  +  +  K  +   +   K   +++LE +V+D+      LQ E  +L            S     E  S+++ R       
Subjt:  RYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQEN

Query:  ENLMKEIEQLQAHRCADVEELVYLRWINACLRYEL
          +++E E+L+      V+E++ LRW NACLR+E+
Subjt:  ENLMKEIEQLQAHRCADVEELVYLRWINACLRYEL

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein; E-value: 3.5e-54; identity: 36.62%
Query:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN
        D   LPE ++L+  E ++P  +   + EK  +  +   +   AYN       + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN

Query:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED
         +E  +  + I SLQA+ ++L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +
Subjt:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED

Query:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR
        L+  N  LQ E  +L+ +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +AR
Subjt:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR

Query:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA
        DLSK LSPKS+ KAK+L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + 
Subjt:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA

Query:  SMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
        S       R SSS     N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  SMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein; E-value: 3.5e-54; identity: 36.62%
Query:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN
        D   LPE ++L+  E ++P  +   + EK  +  +   +   AYN       + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN

Query:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED
         +E  +  + I SLQA+ ++L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +
Subjt:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED

Query:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR
        L+  N  LQ E  +L+ +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +AR
Subjt:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR

Query:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA
        DLSK LSPKS+ KAK+L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + 
Subjt:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA

Query:  SMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
        S       R SSS     N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  SMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein; E-value: 9.5e-36; identity: 35.59%
Query:  DNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLA
        +++ L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +L+  N  LQ E  +L+
Subjt:  DNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLA

Query:  RRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKK
         +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +ARDLSK LSPKS+ KAK+
Subjt:  RRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKK

Query:  LILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNST
        L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + S       R SSS   
Subjt:  LILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNST

Query:  GTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
          N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  GTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE


Sequences
CDS sequence
ATGGAAGAGAAGAGGAATTTAAAGCCTATATTATTAAAATTTGGGGTGGTTCTGGCTATTTCCTTTGCTGGTTTTCTCTATTCCCGCTTTAGAATCAGGAAGAAAAGACC
TCGTCTGCCTCCTCCCTCGTCAAGTTCTTCAGCAGATCAGGGCAATAAAGTTGACTTGAGTAGAGGAAGAGGACCTAAACTTGATAATCAAGCAATAAAGGTAGCAACAT
CTGCCTCCTTTAATGTTATTCTTTTTGCAGCTGATGCCTATTTAAAGATGTATATACCAAAAGTTAATGTTGATGATTCAAATGTTGGTCTCTGCCCTAGCAGTAAGCGT
AGTGTAGATAAAGATGGGTTGTTTCTCCCTGAGCTTCAGGAACTTGTCAAGGAATCTGATTTTCCTGCAGCGAATGCTGGGTTATCTCATGAGAAGAACGTTGAAGCATT
GAGGTCGGGTCTTCAAACTCCGAAAGCATATAACAATTTTGAGACGGATGACTATGAACAAGAGATCAGGCACCTCAAAAGTAAGGTGAAAATGCTTCGAGAGAGGGAGA
GGAACCTTGAGGTTCAGCTACTTGAGTATTATGGCCTGAAAGAGCAGGAAACTGCTGTCATGGAGCTCCAAAATAGGTTGAAGATTAACAATATGGAAGCCAAGCTTTTC
ACCCTCAAGATCGAGTCCCTTCAGGCAGATAATCGAAGATTAGAATCACAAGTTTCTGATCATGCAAAATCAGTGTCTGACCTTGAGGCTGCAAGAGCAAAAATTAAGTT
TCTCAAGAAAAAACTCAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGCTAAGCTGCACGATCAAGAATATAAGACAAATGAAAGTA
ATAAGGATGCCCGAATTAAGCTGAAAAGGATTGAAGATTTGGAGAAAGAGGTAGAGGACTTAAGAAACTCGAATTTGAGATTACAAATAGAAAATTCTGATCTGGCTCGG
AGATTAGATGCTACCCAAGTTCTTGCAAATTCTATTTTGGAAGACCCAGAAAAAGAATCCCTGAAAGAAGAAAGGGAGCGTCTAGGACAAGAAAATGAAAATTTGATGAA
GGAGATTGAGCAACTTCAAGCTCACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAGCTGCGGAATTATCAGCCCCGAC
CAGGGAAAACAGCGGCAAGAGATCTAAGCAAAACGTTAAGCCCCAAATCTGAGGAGAAAGCAAAGAAGCTCATACTCGAATATGCAAATACAGAAGGAATTGAAGGGAAG
GGCATCAACATTATGGATTTTGATTCAGATCAATGGTCATCCTCCCAAGCTTCCTCTCTTACAGATCAGGATGATTCGTATGTTGATTTTCAAGCAACAACAAAACCAAG
TTCAAACAAAATCAAATTCATAAGTAAACTCAGGAAACTCTTGAAGGGAAAAGATAGTCAACAAAACCAGGCTCTGTCAGCAGAAAAATCTGCTGCATCCATGGAAGATA
GTGATTCTCCACGTTACAGTTCGAGTAATTCTACCGGCACCAATGCTACGAGAGCCGAGGGACAGGGTATCGGATCTGCAAATTCATCTCAGAGTTCGTCGAGACATTCA
ATGGATTTTCGCAGATTGAGTAGCCAAGAGTATGGAAAACCTGAAGACTCTGTTAGAAGGAACAGTGATGGTGGATACACTAACAAAAGACTTGTTTTAGGTAGCAACCG
TATGAGCAACTCGCCATTTAAAACTCCTAGTCCGGATACAGAATCTTCTGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACTCTCCGGGAGCAAAGAACC
GGTCGCATAGGAAGGGTGCTGCATCCATTGATTCGTACTGA
mRNA sequence
CGAAGAAATACGAGGGCAAATGATCAAGTTCGCAGTTCGAAGATAAAAAAAACGAAGAACGATTCAACAGCTCGTCCACCAAAATCCCTAAACAAAACACACCTAAATTT
GAACAATCCCATAAACAAAACCCATCGTATCTTCTTCCTCATGATTCCTATTTCTTCTGCTTCGAATTTTCCCTCCTTCATTCTGATTCAGCTTCTCGAGTTGGAATCCA
TCAAGATTCCTTTCTTTTCCTGTTCCATCGACCAAAATCCTTCTGGTAGGCCCGCTCTCGCACGTCACGTCTCGCCATGGCATGGAGAAGCCGACACGGCGTGGCCATTG
TTCGCCAGTTCTAAAAGGAGAAGAAATCAGTTCGAGTTGGAATTTTCTACACTTGGAATTCTGTTTAATCAGTCTATGACACGGACGGAAAAGCACCAAATAGAAACAAA
TTCAGCGGCGATCAGGATCAGCCCAACGTCAAACCGTAACTCTCCCGTTGATTCCATGCGTTCCCGTTCCCAAACGCTTTTCTGTTACAAAATTAAAATGAAGGATGATG
AAGACGACGACCTATGATAACTTCTGGTCATTGGGTTCATTAGTTTGATTAGATTGAAGGTTCTCGATACCAAATAATGGAAGAGAAGAGGAATTTAAAGCCTATATTAT
TAAAATTTGGGGTGGTTCTGGCTATTTCCTTTGCTGGTTTTCTCTATTCCCGCTTTAGAATCAGGAAGAAAAGACCTCGTCTGCCTCCTCCCTCGTCAAGTTCTTCAGCA
GATCAGGGCAATAAAGTTGACTTGAGTAGAGGAAGAGGACCTAAACTTGATAATCAAGCAATAAAGGTAGCAACATCTGCCTCCTTTAATGTTATTCTTTTTGCAGCTGA
TGCCTATTTAAAGATGTATATACCAAAAGTTAATGTTGATGATTCAAATGTTGGTCTCTGCCCTAGCAGTAAGCGTAGTGTAGATAAAGATGGGTTGTTTCTCCCTGAGC
TTCAGGAACTTGTCAAGGAATCTGATTTTCCTGCAGCGAATGCTGGGTTATCTCATGAGAAGAACGTTGAAGCATTGAGGTCGGGTCTTCAAACTCCGAAAGCATATAAC
AATTTTGAGACGGATGACTATGAACAAGAGATCAGGCACCTCAAAAGTAAGGTGAAAATGCTTCGAGAGAGGGAGAGGAACCTTGAGGTTCAGCTACTTGAGTATTATGG
CCTGAAAGAGCAGGAAACTGCTGTCATGGAGCTCCAAAATAGGTTGAAGATTAACAATATGGAAGCCAAGCTTTTCACCCTCAAGATCGAGTCCCTTCAGGCAGATAATC
GAAGATTAGAATCACAAGTTTCTGATCATGCAAAATCAGTGTCTGACCTTGAGGCTGCAAGAGCAAAAATTAAGTTTCTCAAGAAAAAACTCAGATATGAAGCAGAACAG
AACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGCTAAGCTGCACGATCAAGAATATAAGACAAATGAAAGTAATAAGGATGCCCGAATTAAGCTGAAAAGGATTGA
AGATTTGGAGAAAGAGGTAGAGGACTTAAGAAACTCGAATTTGAGATTACAAATAGAAAATTCTGATCTGGCTCGGAGATTAGATGCTACCCAAGTTCTTGCAAATTCTA
TTTTGGAAGACCCAGAAAAAGAATCCCTGAAAGAAGAAAGGGAGCGTCTAGGACAAGAAAATGAAAATTTGATGAAGGAGATTGAGCAACTTCAAGCTCACCGGTGTGCA
GATGTTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAGCTGCGGAATTATCAGCCCCGACCAGGGAAAACAGCGGCAAGAGATCTAAGCAAAAC
GTTAAGCCCCAAATCTGAGGAGAAAGCAAAGAAGCTCATACTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATCAACATTATGGATTTTGATTCAGATCAAT
GGTCATCCTCCCAAGCTTCCTCTCTTACAGATCAGGATGATTCGTATGTTGATTTTCAAGCAACAACAAAACCAAGTTCAAACAAAATCAAATTCATAAGTAAACTCAGG
AAACTCTTGAAGGGAAAAGATAGTCAACAAAACCAGGCTCTGTCAGCAGAAAAATCTGCTGCATCCATGGAAGATAGTGATTCTCCACGTTACAGTTCGAGTAATTCTAC
CGGCACCAATGCTACGAGAGCCGAGGGACAGGGTATCGGATCTGCAAATTCATCTCAGAGTTCGTCGAGACATTCAATGGATTTTCGCAGATTGAGTAGCCAAGAGTATG
GAAAACCTGAAGACTCTGTTAGAAGGAACAGTGATGGTGGATACACTAACAAAAGACTTGTTTTAGGTAGCAACCGTATGAGCAACTCGCCATTTAAAACTCCTAGTCCG
GATACAGAATCTTCTGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACTCTCCGGGAGCAAAGAACCGGTCGCATAGGAAGGGTGCTGCATCCATTGATTC
GTACTGAACATCACGAAGCTTGTCTGCCTTTCCATGGCTTCATCACCTGTTCATATGTGTATTTTAGCTAAGGGGCCGAGTATTTAGTGCCCAATATACCATACCGTTCA
CAGGAGGTAGGTAGAAACTAATTCCGCTGATACCGTTCTATCAGTGACTTATGTTGTAATGTTACTTGTACTATTAAATTGATCTTATCAACTTTCTCTAGCCATTGTGA
GGAAATGTATAACTTATGGATTGCCTGTTAGCAAGACGGAAAACGAAAAGAGTACGTATTCTTTATCGGCCTTCACTTTTTCATTGAGTTTGTCCAACGTTTTAATACTA
AATATGATCAAAAGCATATCATCATGTCTTTAACAAGTAGAAGAGTTTGCTGTGATAAGCACGAAATATTTGGAAAAAAGTGTAAAAGACAGCCCATATGGTGAAGATCA
GATGAGAGCATAGCCTTGGATAAGCATTGCATCAGCAAGAATCTGGTTGGCAGCCTCGGATGGATGGATACTGTCCCAGAACATATATTTGGTCGCATTAGAACATGTTC
TTCCAACTGTCTTTGGATTGCACAAAACCGATGCTGTCTCTACTGCCCCAGTGCCACAGCACCCTCTTCTCACATCATCAAAACCTTGGAGGGAAAAAGTAAAAAAAGTT
ATAACAAGTTTAGTTCTGAACTTTTAACAGTTGTGTCGTGTCTGTGAGATTCCCAAACTTTTAATTCCGTGTCTAATATGTCATGAACTTATTCGATATATTTTGATTTA
TGGATGTGAACTTTATAGGATCTCTCTATCTTTTCAATTTCATGTCTAATAGACCTGTGAATATAAGTCAGGGATGTAACAGATGTAAAATTAAAATTTTAAGGACCTGT
TAAACATAAAATTAAAAGCTCAGAAATCTTTTAAATACTCTTTAAAGTTTAGGAACCAACCTGACACAACTCTGAACTTGTAATTTAATAAAGGAAAAAAATAGAGAGTG
AATGCTACCACTCTTCTCACCGTGATTCGATGGAGAAGCAATAACGTCGTATAGAGGTTTGAAAACGTCGAAGATGACGAGTTTGAGACCTGGAAGCTGCTTTTGGAGAG
CTGCAGCAGTAGAGTTGAGCTTCCTGTTGAAGACGAGGACATGATTGTTGACTGTTCGGACACAGCCCTTACCTTGGAAGCCAAAGAGAGCGAGGGCAGAAGGAAAACAG
CCTAATGCAGGAAGTGAGGTGACCCCAATTCTTCTGGCTCCCACCCCATGTAAATCCTTCATTAAAG
Protein sequence
MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKR
SVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF
TLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLAR
RLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGK
GINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHS
MDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY