CuGenDBv2

MS009536 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS009536
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: protein CHUP1, chloroplastic isoform X3
Genome location: scaffold813:2000195..2002661
RNA-Seq Expression: MS009536
Synteny: MS009536
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process)
  GO:0009707 - chloroplast outer membrane (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like
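
The genome location above uses a `scaffold:start..end` notation. As a minimal sketch, it could be split into its parts as follows. The names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'scaffold:start..end' location string."""
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'scaffold813:2000195..2002661'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end = match.groups()
    return GenomeLocation(scaffold, int(start), int(end))


loc = parse_location("scaffold813:2000195..2002661")
print(loc.scaffold, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on the scaffold, not only the coding sequence.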


Homology
GenBank top hits (e value, % identity, alignment)
XP_022147409.1 protein CHUP1, chloroplastic isoform X1 [Momordica charantia] | e value: 0.0e+00 | % identity: 92.63
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS                        ADQGNKVDLSRGRGPKLDNQAIK      
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS

Query:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
                    +MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
Subjt:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV

Query:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
        KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
Subjt:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL

Query:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ
        QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQ
Subjt:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ

Query:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
        LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
Subjt:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ

Query:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
        ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
Subjt:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN

Query:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_022147417.1 protein CHUP1, chloroplastic isoform X2 [Momordica charantia] | e value: 0.0e+00 | % identity: 92.48
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS                         DQGNKVDLSRGRGPKLDNQAIK      
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS

Query:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
                    +MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
Subjt:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV

Query:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
        KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
Subjt:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL

Query:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ
        QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQ
Subjt:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ

Query:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
        LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
Subjt:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ

Query:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
        ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
Subjt:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN

Query:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_022147425.1 protein CHUP1, chloroplastic isoform X3 [Momordica charantia] | e value: 0.0e+00 | % identity: 93.33
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS                        ADQGNKVDLSRGRGPKLDNQAIK      
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS

Query:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
                    +MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
Subjt:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV

Query:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
        KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
Subjt:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL

Query:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHR
        QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHR
Subjt:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHR

Query:  CADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKP
        CADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKP
Subjt:  CADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKP

Query:  SSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGY
        SSNKIKFISKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGY
Subjt:  SSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGY

Query:  TNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        TNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  TNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_022147434.1 protein CHUP1, chloroplastic isoform X4 [Momordica charantia] | e value: 8.0e-290 | % identity: 98.91
Query:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
        MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
Subjt:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ

Query:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
        LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
Subjt:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK

Query:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
        TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
Subjt:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV

Query:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
        YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
Subjt:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI

Query:  SKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
        SKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
Subjt:  SKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG

Query:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

XP_038898688.1 protein CHUP1, chloroplastic [Benincasa hispida] | e value: 2.9e-255 | % identity: 76.16
Query:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSA
        M++KR+L KPIL KFG  LAISFAGFL S+FR+R KRP L PPSSSSS                         DQ +KVDL RGRGP+LDNQ +K AT+A
Subjt:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSA

Query:  SFNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSK
        S NV+ FA DAY K  IPKVN DDSN+GL PS+K  VDKDGL LPE QELVKE DF AANAGL  +KNVEA RSGL+TPKAY   E D+YEQEIRHLKSK
Subjt:  SFNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSK

Query:  VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILN
        VK LRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQV DHAKSVSDLEAA+AKIKFLKKK+RYEAEQNRGQILN
Subjt:  VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILN

Query:  LQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAH
        LQQRV KL DQE+KTNESNKDA+I+L++IE+LEKE+EDLR SNL+LQIENSDL+RRLDATQ LANS+LED EKESLKEE ERL +ENE L KEIEQLQAH
Subjt:  LQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAH

Query:  RCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLT---DQDDSYVDFQA
        RCAD+EELVYLRWINACLRYELRN+QP  GKTAARDLSKTLSPKSEEKAKKLIL+YANTEGIEGK INI DFDSDQWSSSQASS T   D DDS VDF +
Subjt:  RCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLT---DQDDSYVDFQA

Query:  TTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQ--EYGKPEDSV-R
        T K SSNK+KFISKLRKLL+GK SQQN  L AEKSAAS+EDS SPRYSSSNS GTNATRAEGQGIG    S++SSRHSMDF RL+SQ  + GK EDS+ R
Subjt:  TTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQ--EYGKPEDSV-R

Query:  RNSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        RNSD GY NK+ VLGS+  SNS +++ S DTES+EKSELMKYAEVLKD+ GAKN+S RK AASI S+
Subjt:  RNSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V182 Protein CHUP1 | e value: 4.5e-254 | % identity: 75.83
Query:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSA
        ME+K NL KP+LLKFGVVLAISFA FLYSRFR++ KRP LPPP SSSS                         DQGNKV+L RGRGP+LDNQ +K AT+A
Subjt:  MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSA

Query:  SFNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSK
        S NV+LFA DAY +M IPKVNVDDSN+GLCPS+K  VDKDGL LPE QE VKE D  AANA  S +KNVEA RSGL+TPKAY   E D+YEQEIRHLKSK
Subjt:  SFNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSK

Query:  VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILN
        VKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKINNMEAKLFT KIESL+ADNRRLESQV +HAK+VSDLEAARAKIKFLKKKLR+EAEQNR QILN
Subjt:  VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILN

Query:  LQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAH
        LQQ+V KL DQE+KTNESNKDA+IKL++IEDLEKE+E+LR  N RLQIENSDL RRLDATQ LANS+LED EKESLKEE ERL QENE L KEIEQLQAH
Subjt:  LQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAH

Query:  RCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLT---DQDDSYVDFQA
        R ADVEELVYLRWINACLRYELRN+QP  GKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGKGI++ DFDSDQWSSSQASS T   D DDS  +F +
Subjt:  RCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLT---DQDDSYVDFQA

Query:  TTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYG--KPEDSVRR
        T K SSNKIKFI KL+KLL+GK SQQN  L AEKSAASIEDSDSP YSSSNSTGTNATRAEGQ IG A SS++SSR+S+DF+RL SQ+    K EDS RR
Subjt:  TTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYG--KPEDSVRR

Query:  NSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        NSD GY NKR VLGS++ SNS  ++ S DTES+EKSELMKYAEVLKD+ GAKN+SHRK AASI S+
Subjt:  NSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D049 protein CHUP1, chloroplastic isoform X3 | e value: 0.0e+00 | % identity: 93.33
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS                        ADQGNKVDLSRGRGPKLDNQAIK      
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS

Query:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
                    +MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
Subjt:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV

Query:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
        KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
Subjt:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL

Query:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHR
        QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHR
Subjt:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHR

Query:  CADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKP
        CADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKP
Subjt:  CADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKP

Query:  SSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGY
        SSNKIKFISKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGY
Subjt:  SSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGY

Query:  TNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        TNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  TNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D0X5 protein CHUP1, chloroplastic isoform X1 | e value: 0.0e+00 | % identity: 92.63
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS                        ADQGNKVDLSRGRGPKLDNQAIK      
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS

Query:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
                    +MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
Subjt:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV

Query:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
        KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
Subjt:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL

Query:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ
        QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQ
Subjt:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ

Query:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
        LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
Subjt:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ

Query:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
        ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
Subjt:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN

Query:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D100 protein CHUP1, chloroplastic isoform X4 | e value: 3.9e-290 | % identity: 98.91
Query:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
        MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ
Subjt:  MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQ

Query:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
        LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK
Subjt:  LLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYK

Query:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
        TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV
Subjt:  TNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQLQAHRCADVEELV

Query:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
        YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI
Subjt:  YLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFI

Query:  SKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
        SKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG
Subjt:  SKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLG

Query:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

A0A6J1D2B1 protein CHUP1, chloroplastic isoform X2 | e value: 0.0e+00 | % identity: 92.48
Query:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS
        MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSS                         DQGNKVDLSRGRGPKLDNQAIK      
Subjt:  MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSAS

Query:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
                    +MYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV
Subjt:  FNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKV

Query:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
        KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL
Subjt:  KMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNL

Query:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ
        QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE     KESLKEERERLGQENENLMKEIEQ
Subjt:  QQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPE-----KESLKEERERLGQENENLMKEIEQ

Query:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
        LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ
Subjt:  LQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQ

Query:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
        ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAAS+EDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN
Subjt:  ATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRN

Query:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
        SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
Subjt:  SDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY

SwissProt top hits (e value, % identity, alignment)
Q3V6T2 Girdin | e value: 1.5e-04 | % identity: 26.73
Query:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNR-LKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK--
        ++ E E+ HL+ + ++L+++  NL++   +   L EQE + +E +NR LK      K  T ++ESL+ +N +L+ +  +  ++V  L+ A  K+  L+  
Subjt:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNR-LKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK--

Query:  -KKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE
         K+L  E EQ +  +  L+    K    E      + + +   K +E+  K+++ L +    L++EN  L + L+  ++ ++  LE  EKE  SL++E  
Subjt:  -KKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE

Query:  RLGQENENLMKEIEQLQ
        +L ++ + L KE ++L+
Subjt:  RLGQENENLMKEIEQLQ

Q5SNZ0 Girdin | e value: 2.6e-04 | % identity: 25.81
Query:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEA-KLFTLKIESLQADNRRLESQVSDHAKSVSDLEAA---RAKIKFL
        ++ E E+ HL  + ++L+++  NL++   E     EQE + +E +NR     +++ K  T ++ESL+ +N +L+ +  +  +SV  L+ A    A+++  
Subjt:  DDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEA-KLFTLKIESLQADNRRLESQVSDHAKSVSDLEAA---RAKIKFL

Query:  KKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE
         K+L  E EQ R  +  ++    K    E      + + +   K +E+  K+++ L +    L++EN  L + L+  ++ ++  LE  EKE  SL++E  
Subjt:  KKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKE--SLKEERE

Query:  RLGQENENLMKEIEQLQ
        +L ++ + L KE ++L+
Subjt:  RLGQENENLMKEIEQLQ

Q9LI74 Protein CHUP1, chloroplastic | e value: 3.0e-53 | % identity: 36.62
Query:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN
        D   LPE ++L+  E ++P  +   + EK  +  +   +   AYN       + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN

Query:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED
         +E  +  + I SLQA+ ++L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +
Subjt:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED

Query:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR
        L+  N  LQ E  +L+ +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +AR
Subjt:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR

Query:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA
        DLSK LSPKS+ KAK+L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + 
Subjt:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA

Query:  SIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
        S       R SSS     N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  SIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE

Arabidopsis top hits (e value, % identity, alignment)
AT1G52080.1 actin binding protein family | e value: 5.7e-92 | % identity: 38.4
Query:  KRNLKPILLKFGVVLAISFAGFLYSRFRIRKKR--PRLP--PPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSA
        KR++  ++L+ G  LA+SFAGFL++RFR   KR  P LP  PP SS +G+                       D  NK    R  G +  +         
Subjt:  KRNLKPILLKFGVVLAISFAGFLYSRFRIRKKR--PRLP--PPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSA

Query:  SFNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVD-KDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKS
                              +++ +G+ P  +  +D KD   LPE +E  K+ D    +       + E  RS +  P A+ + E  D+E EI  L++
Subjt:  SFNVILFAADAYLKMYIPKVNVDDSNVGLCPSSKRSVD-KDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKS

Query:  KVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQIL
         V+ LRERER LE +LLEYY LKEQ+   MEL++RLK+N ME K+F  KI+ LQA+N +L+++  +H+K + +L+ A+++++ LKKKL    +Q+  QIL
Subjt:  KVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQIL

Query:  NLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEK-ESLKEERERLGQENENLMKEIEQLQ
        +L+QRVA+L ++E K    + +A   ++R+ DLE E+ +L ++N RLQ EN +L+ +L++ Q++ANS LE+PE+ E+L+E+  RL  ENE L K++EQLQ
Subjt:  NLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEK-ESLKEERERLGQENENLMKEIEQLQ

Query:  AHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQ--ASSLTDQ---DDSYV
          RC D+E+LVYLRWINACLRYELR YQP  GKT ARDLS TLSP SEEKAK+LILEYA++E          + D D+WSSSQ  +S +TD    DDS V
Subjt:  AHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQ--ASSLTDQ---DDSYV

Query:  D-FQATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDS
        D   AT    + K K + KL K+L GKD++      ++K A S E        SS++TG ++T             Q  S HSMDF+ L     GK E+ 
Subjt:  D-FQATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDS

Query:  VRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAA
          +N       K    GS+       +    +T+ + K EL+K A+ L  S   K + H+K  +
Subjt:  VRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAA

AT2G36650.1 unknown protein | e value: 6.2e-06 | % identity: 23.83
Query:  EQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK---KKL
        +QEI  LKS+ + L+ +E  +E+    +  LK+QE  ++E ++ L +   +   F  ++ +++ +++R ++ V  + K V +++  R++   L+   KKL
Subjt:  EQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLK---KKL

Query:  RYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQEN
        R +++Q   +++N  +++  +  +  K  +   +   K   +++LE +V+D+      LQ E  +L            S     E  S+++ R       
Subjt:  RYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQEN

Query:  ENLMKEIEQLQAHRCADVEELVYLRWINACLRYEL
          +++E E+L+      V+E++ LRW NACLR+E+
Subjt:  ENLMKEIEQLQAHRCADVEELVYLRWINACLRYEL

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | e value: 2.1e-54 | % identity: 36.62
Query:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN
        D   LPE ++L+  E ++P  +   + EK  +  +   +   AYN       + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN

Query:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED
         +E  +  + I SLQA+ ++L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +
Subjt:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED

Query:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR
        L+  N  LQ E  +L+ +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +AR
Subjt:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR

Query:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA
        DLSK LSPKS+ KAK+L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + 
Subjt:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA

Query:  SIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
        S       R SSS     N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  SIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | e value: 2.1e-54 | % identity: 36.62
Query:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN
        D   LPE ++L+  E ++P  +   + EK  +  +   +   AYN       + E+  LK  VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI 
Subjt:  DGLFLPELQELVK-ESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKIN

Query:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED
         +E  +  + I SLQA+ ++L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +
Subjt:  NMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVED

Query:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR
        L+  N  LQ E  +L+ +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +AR
Subjt:  LRNSNLRLQIENSDLARRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAAR

Query:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA
        DLSK LSPKS+ KAK+L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + 
Subjt:  DLSKTLSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAA

Query:  SIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
        S       R SSS     N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  SIEDSDSPRYSSSNSTGTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein | e value: 7.5e-36 | %identity: 35.59
Query:  DNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLA
        +++ L+ ++S +     +LE AR KIK L+++++ +A Q +GQ+L L+Q V+ L  +E +    + +   KLK ++DLE +V +L+  N  LQ E  +L+
Subjt:  DNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLA

Query:  RRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKK
         +LD+ +      +++ E  +   ++EE   L   NE+L+K++E LQ +R ++VEELVYLRW+NACLRYELRNYQ   GK +ARDLSK LSPKS+ KAK+
Subjt:  RRLDATQ---VLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTLSPKSEEKAKK

Query:  LILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNST
        L+LEYA +E    +G    D +S+ +S   +    D D++ +D   +   S S K   I KL+K  K KD    Q+      + S       R SSS   
Subjt:  LILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPS-SNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNST

Query:  GTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
          N  R   + +   N+ +S +     F ++  +  G PE
Subjt:  GTNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPE
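
The %identity values listed for each hit can be related to the alignment blocks: in BLAST-style output the middle line repeats a residue where query and subject are identical and prints `+` for a conservative substitution. The following is a minimal Python sketch under that assumption. The function name `midline_stats` is hypothetical, and the example input is only the first 16 columns of the AT3G25690.2 block that begins DLSKTLSPKS, so it does not reproduce the reported whole-alignment percentage.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one aligned segment.

    query   -- aligned query residues (may contain '-' gap characters)
    midline -- the line printed between Query and Subjt, same columns
    """
    if not query:
        raise ValueError("empty query segment")
    # Trailing spaces are often stripped from the midline; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    # A letter in the midline marks an identical residue.
    identities = sum(ch.isalpha() for ch in midline)
    # Positives follow the BLAST convention: identities plus '+' columns.
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "pct_identity": round(100.0 * identities / length, 2),
    }


stats = midline_stats("DLSKTLSPKSEEKAKK", "DLSK LSPKS+ KAK+")
print(stats)
```

To approximate the figure reported for a hit, the counts from every block of that hit would be summed before dividing by the total alignment length.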


Sequences
CDS sequence
ATGGAAGAGAAGAGGAATTTAAAGCCTATATTATTAAAATTTGGGGTGGTTCTGGCTATTTCCTTTGCTGGTTTTCTCTATTCCCGCTTTAGAATCAGGAAGAAAAGACC
TCGTCTGCCTCCTCCCTCGTCAAGTTCTTCAGGTTTTACAGTTAATCTTGTGTGCTTATTTGATTTCTTAAAATTTATTCTTAATTTCATCAATTCCATTTTAGCAGATC
AGGGCAATAAAGTTGACTTGAGTAGAGGAAGAGGACCTAAACTTGATAATCAAGCAATAAAGGTAGCAACATCTGCCTCCTTTAATGTTATTCTTTTTGCAGCTGATGCC
TATTTAAAGATGTATATACCAAAAGTTAATGTTGATGATTCAAATGTTGGTCTCTGCCCTAGCAGTAAGCGTAGTGTAGATAAAGATGGGTTGTTTCTCCCTGAGCTTCA
GGAACTTGTCAAGGAATCTGATTTTCCTGCAGCGAATGCTGGGTTATCTCATGAGAAGAACGTTGAAGCATTGAGGTCGGGTCTTCAAACTCCGAAAGCATATAACAATT
TTGAGACGGATGACTACGAACAAGAGATCAGGCACCTCAAAAGTAAGGTGAAAATGCTTCGAGAGAGGGAGAGGAACCTTGAGGTTCAGCTACTTGAGTATTATGGCCTG
AAAGAGCAGGAAACTGCTGTCATGGAGCTCCAAAATAGGTTGAAGATTAACAATATGGAAGCCAAGCTTTTCACCCTCAAGATCGAGTCCCTTCAGGCGGATAATCGAAG
ATTAGAATCACAAGTTTCTGATCATGCAAAATCAGTGTCTGACCTTGAGGCTGCAAGAGCAAAAATTAAGTTTCTCAAGAAAAAACTCAGATATGAAGCAGAACAGAACA
GGGGACAGATCTTAAATCTTCAGCAAAGAGTTGCTAAGCTGCACGATCAAGAATATAAGACAAATGAAAGTAATAAGGATGCCCGAATTAAGCTGAAAAGGATTGAAGAT
TTGGAGAAAGAGGTAGAGGACTTAAGAAACTCGAATTTGAGATTACAAATAGAAAATTCTGATCTGGCTCGGAGATTAGATGCTACCCAAGTTCTTGCAAATTCTATTTT
GGAAGACCCAGAAAAAGAATCCCTGAAAGAAGAAAGGGAGCGTCTAGGACAAGAAAATGAAAATTTGATGAAGGAGATTGAGCAACTTCAAGCTCACCGGTGTGCAGATG
TTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAGCTGCGAAATTATCAGCCCCGACCAGGGAAAACAGCGGCAAGAGATCTAAGCAAAACGTTA
AGCCCCAAATCTGAGGAGAAAGCAAAGAAGCTCATACTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATCAACATTATGGATTTTGATTCAGATCAATGGTC
ATCCTCCCAAGCTTCCTCTCTTACAGATCAGGATGATTCGTATGTTGATTTTCAAGCAACAACAAAACCAAGTTCAAACAAAATCAAATTCATAAGTAAACTCAGGAAAC
TCTTGAAGGGAAAAGATAGTCAACAAAACCAGGCTCTGTCAGCAGAAAAATCTGCTGCATCCATAGAAGATAGTGATTCTCCACGTTACAGTTCGAGTAATTCTACCGGC
ACCAATGCTACGAGAGCCGAGGGACAGGGTATCGGATCTGCAAATTCATCTCAGAGTTCGTCGAGACATTCAATGGATTTTCGCAGATTGAGTAGCCAAGAGTATGGAAA
ACCTGAAGACTCTGTTAGAAGGAACAGTGATGGTGGATACACTAACAAGAGACTTGTTTTAGGTAGCAACCGTATGAGCAACTCGCCATTTAAAACTCCTAGTCCGGATA
CAGAATCTTCTGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACTCTCCGGGAGCAAAGAACCGGTCGCATAGGAAGGGTGCTGCATCCATTGATTCGTAC
mRNA sequence
ATGGAAGAGAAGAGGAATTTAAAGCCTATATTATTAAAATTTGGGGTGGTTCTGGCTATTTCCTTTGCTGGTTTTCTCTATTCCCGCTTTAGAATCAGGAAGAAAAGACC
TCGTCTGCCTCCTCCCTCGTCAAGTTCTTCAGGTTTTACAGTTAATCTTGTGTGCTTATTTGATTTCTTAAAATTTATTCTTAATTTCATCAATTCCATTTTAGCAGATC
AGGGCAATAAAGTTGACTTGAGTAGAGGAAGAGGACCTAAACTTGATAATCAAGCAATAAAGGTAGCAACATCTGCCTCCTTTAATGTTATTCTTTTTGCAGCTGATGCC
TATTTAAAGATGTATATACCAAAAGTTAATGTTGATGATTCAAATGTTGGTCTCTGCCCTAGCAGTAAGCGTAGTGTAGATAAAGATGGGTTGTTTCTCCCTGAGCTTCA
GGAACTTGTCAAGGAATCTGATTTTCCTGCAGCGAATGCTGGGTTATCTCATGAGAAGAACGTTGAAGCATTGAGGTCGGGTCTTCAAACTCCGAAAGCATATAACAATT
TTGAGACGGATGACTACGAACAAGAGATCAGGCACCTCAAAAGTAAGGTGAAAATGCTTCGAGAGAGGGAGAGGAACCTTGAGGTTCAGCTACTTGAGTATTATGGCCTG
AAAGAGCAGGAAACTGCTGTCATGGAGCTCCAAAATAGGTTGAAGATTAACAATATGGAAGCCAAGCTTTTCACCCTCAAGATCGAGTCCCTTCAGGCGGATAATCGAAG
ATTAGAATCACAAGTTTCTGATCATGCAAAATCAGTGTCTGACCTTGAGGCTGCAAGAGCAAAAATTAAGTTTCTCAAGAAAAAACTCAGATATGAAGCAGAACAGAACA
GGGGACAGATCTTAAATCTTCAGCAAAGAGTTGCTAAGCTGCACGATCAAGAATATAAGACAAATGAAAGTAATAAGGATGCCCGAATTAAGCTGAAAAGGATTGAAGAT
TTGGAGAAAGAGGTAGAGGACTTAAGAAACTCGAATTTGAGATTACAAATAGAAAATTCTGATCTGGCTCGGAGATTAGATGCTACCCAAGTTCTTGCAAATTCTATTTT
GGAAGACCCAGAAAAAGAATCCCTGAAAGAAGAAAGGGAGCGTCTAGGACAAGAAAATGAAAATTTGATGAAGGAGATTGAGCAACTTCAAGCTCACCGGTGTGCAGATG
TTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAGCTGCGAAATTATCAGCCCCGACCAGGGAAAACAGCGGCAAGAGATCTAAGCAAAACGTTA
AGCCCCAAATCTGAGGAGAAAGCAAAGAAGCTCATACTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATCAACATTATGGATTTTGATTCAGATCAATGGTC
ATCCTCCCAAGCTTCCTCTCTTACAGATCAGGATGATTCGTATGTTGATTTTCAAGCAACAACAAAACCAAGTTCAAACAAAATCAAATTCATAAGTAAACTCAGGAAAC
TCTTGAAGGGAAAAGATAGTCAACAAAACCAGGCTCTGTCAGCAGAAAAATCTGCTGCATCCATAGAAGATAGTGATTCTCCACGTTACAGTTCGAGTAATTCTACCGGC
ACCAATGCTACGAGAGCCGAGGGACAGGGTATCGGATCTGCAAATTCATCTCAGAGTTCGTCGAGACATTCAATGGATTTTCGCAGATTGAGTAGCCAAGAGTATGGAAA
ACCTGAAGACTCTGTTAGAAGGAACAGTGATGGTGGATACACTAACAAGAGACTTGTTTTAGGTAGCAACCGTATGAGCAACTCGCCATTTAAAACTCCTAGTCCGGATA
CAGAATCTTCTGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACTCTCCGGGAGCAAAGAACCGGTCGCATAGGAAGGGTGCTGCATCCATTGATTCGTAC
Protein sequence
MEEKRNLKPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSGFTVNLVCLFDFLKFILNFINSILADQGNKVDLSRGRGPKLDNQAIKVATSASFNVILFAADA
YLKMYIPKVNVDDSNVGLCPSSKRSVDKDGLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSKVKMLRERERNLEVQLLEYYGL
KEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLESQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNKDARIKLKRIED
LEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEERERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKTL
SPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLTDQDDSYVDFQATTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASIEDSDSPRYSSSNSTG
TNATRAEGQGIGSANSSQSSSRHSMDFRRLSSQEYGKPEDSVRRNSDGGYTNKRLVLGSNRMSNSPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY
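
The protein sequence above is expected to be the translation of the CDS. The following is a minimal Python sketch for checking that, assuming the standard genetic code (NCBI translation table 1) applies to this gene. The helper name `translate` is hypothetical, and only the first 27 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; unknown codons become 'X'."""
    # Remove line breaks and other whitespace, normalise case.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First nine codons of the CDS listed above.
prefix = translate("ATGGAAGAGAAGAGGAATTTAAAGCCT")
print(prefix)  # MEEKRNLKP, matching the start of the protein sequence
```

Passing the full CDS (with its line breaks left in) and comparing the result to the concatenated protein lines would extend the same check to the whole record.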