; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037324 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037324
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCentromere-associated protein E isoform X1
Genome locationchr2:5161408..5168434
RNA-Seq ExpressionLag0037324
SyntenyLag0037324
Gene Ontology termsGO:0006336 - DNA replication-independent nucleosome assembly (biological process)
GO:0042981 - regulation of apoptotic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031491 - nucleosome binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591632.1 hypothetical protein SDJN03_13978, partial [Cucurbita argyrosperma subsp. sororia]1.4e-19676.62Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDC------------DSDTVDSSASPSTEHYLAAEIDHS
        MDKNKSRSDLLAAGRKKLQQFRKKKDN+G GSQGNSSKNTSKLEQHD D DI TA   + +  C            D DTV SSASPS EH LAAEIDHS
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDC------------DSDTVDSSASPSTEHYLAAEIDHS

Query:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP
        T  VKQEMDLAETSAIDQAEV MQEVGYRED +HPIQN EA+  + S  SLPTDAEEN+NHI NLSSTESS QISSASVEQQGRI EV   CREEELLP 
Subjt:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP

Query:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD
        QSASLLQAREDVGM   EDALMQS Q  ETE +GD QLETGGMNE AAETTFKD  C++EE+I ADV SVSGA TESNSY ISSPGEKL M+NSSSS RD
Subjt:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD

Query:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD
        +WKEE Q HAEDMI SSR +VQYM EDNFADQSEGH+  S+     SD GDA      AHMT+ SDA SGT SSFGQ+SKF  LLER+KEELIVTSFSKD
Subjt:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD

Query:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL
        IFNLQISEQNELQ+ELDN   K TNDM+ LNTS +EVLERN SLVDELSHCRSELKDVST +EELR QLL AE+EIEKLSSR SETENSLEKFHG MFRL
Subjt:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL

Query:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        GKEL+DCKH VTVLEEENERLNGIIT ENENK KL EE+
Subjt:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

KAG7024516.1 hypothetical protein SDJN02_13332 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-19676.62Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDC------------DSDTVDSSASPSTEHYLAAEIDHS
        MDKNKSRSDLLAAGRKKLQQFRKKKDN+G GSQGNSSKNTSKLEQHD D DI TA   + +  C            D DTV SSASPS EH LAAEIDHS
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDC------------DSDTVDSSASPSTEHYLAAEIDHS

Query:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP
        T  VKQEMDLAETSAIDQAEV MQEVGYRED +HPIQN EA+  + S  SLPTDAEEN+NHI NLSSTESS QISSASVEQQGRI EV   CREEELLP 
Subjt:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP

Query:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD
        QSASLLQAREDVGM   EDALMQS Q  ETE +GD QLETGGMNE AAETTFKD  C++EE+I ADV SVSGA TESNSY ISSPGEKL M+NSSSS RD
Subjt:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD

Query:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD
        +WKEE Q HAEDMI SSR +VQYM EDNFADQSEGH+  S+     SD GDA      AHMT+ SDA SGT SSFGQ+SKF  LLER+KEELIVTSFSKD
Subjt:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD

Query:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL
        IFNLQISEQNELQ+ELDN   K TNDM+ LNTS +EVLERN SLVDELSHCRSELKDVST +EELR QLL AE+EIEKLSSR SETENSLEKFHG MFRL
Subjt:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL

Query:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        GKEL+DCKH VTVLEEENERLNGIIT ENENK KL EE+
Subjt:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

XP_022975256.1 golgin subfamily A member 4-like isoform X1 [Cucurbita maxima]2.4e-19675.51Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDCDS------------DTVDSSASPSTEHYLAAEIDHS
        MDKNKSRSDLLAAGRKKLQQFRKKKDN+G GSQGNSSKNTSKLEQHD D DI TA   + +  C +            DTVDSSASPS EH LAAEIDHS
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDCDS------------DTVDSSASPSTEHYLAAEIDHS

Query:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP
        T  VKQEMDLAETSAIDQAEV MQE+ YRED +HPIQN EA+G +    SLPTDAEEN+NHI NLSSTESS QISS SVEQQGRI EV   CREEELLP 
Subjt:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP

Query:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD
        QSASLLQAREDVGM   EDAL+QS QV ETE +GD +LETGGMNE AAETTFKD  C++EE+I  DV SVSGA TESNSY ISSPGEKLGM+NSSSS RD
Subjt:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD

Query:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD
        +WKEE QAHAEDMI SSR +VQYM EDNFADQSEGH+  S+T    SD GDA      AHMT+  DA SGT SSFGQ+SKF DLLER+KEELIVTSFSKD
Subjt:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD

Query:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL
        IFNLQI EQNELQ+ELDN   K TNDM+ LNTS NEVLERN SLV+ELSHCRSELKDVST +EELR+ LL AE+EIEKLS+R SETENSLEKFHG MFRL
Subjt:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL

Query:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        G EL+DCKH VTVLEEENERLNGIIT ENENK KL EE+
Subjt:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

XP_023534886.1 major antigen-like isoform X1 [Cucurbita pepo subsp. pepo]1.9e-20177.55Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAG------------VLASTVDCDSDTVDSSASPSTEHYLAAEIDHS
        MDKNKSRSDLLAAGRKKLQQFRKKKDN+G GSQGNSSKNTSKLEQHD D DI TA              L+ +V  D DTVDSSASPS EH LAAEIDHS
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAG------------VLASTVDCDSDTVDSSASPSTEHYLAAEIDHS

Query:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP
        T  VKQEMDLAETSAIDQAEV MQEVGYRED +HPIQN EA+G +    SLPTDA+EN+NHI NLSSTESS QISSASVEQQGRI EV   CREEELLP 
Subjt:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP

Query:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD
        QSASLLQAREDVGM   EDALMQS QV ETE +GD QLETGGMNE AAETTFKD  C++EE+I ADV SVSGA TESNSY ISSPGEKLGM+NSSSS RD
Subjt:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD

Query:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD
        +WKEE Q HAEDMI SSR +VQYM EDNFADQSEGH+  S+T    SD GDA      AHMT+ SDA SGT SSFGQ+SKF DLLER+KEELIVTSFSKD
Subjt:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD

Query:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL
        IFNLQISEQNELQ+ELDN   K TNDM+ LNTS +EVLERN SLVDELSHCRSELKDVST +EELR QLL AE+EIEKLSSR SETENSLEKFHG MFRL
Subjt:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL

Query:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        GKEL+DCKH VTVLEEENERLNGIIT ENENK KL EE+
Subjt:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

XP_038897845.1 centromere-associated protein E isoform X1 [Benincasa hispida]2.5e-21479.85Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTD--------------IGTAGVLASTVDCDSDTVDSSASPSTEHYLAAEID
        MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQG+SSKN SKLEQHDAD D                T GVL S+VDCD DTVDSSASPSTEH LAAEI+
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTD--------------IGTAGVLASTVDCDSDTVDSSASPSTEHYLAAEID

Query:  HSTVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELL
        HSTV VKQEMDL ETSAIDQAE+ MQEVGYRED D  IQNAEA+G VS  PSLPTDAEENNNHI NLSSTESSSQISSASVEQQG+IVEV   CREEELL
Subjt:  HSTVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELL

Query:  PPQSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSG
        P QS SLLQAREDVGM   EDALMQSGQV ETELAGD QLETGG +E AAETTFK+TCCNEEEVI  DV SVSGA TESNSYSISSPGEKLGMENSSSS 
Subjt:  PPQSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSG

Query:  RDEWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFS
        RD WKEERQ HAE  IHSS  QVQY+ EDNFADQSEGHE  S+TSVKISD GD       AHMT  SDAQSGT SSFGQ+ KFFDLLER+KEELIVTSFS
Subjt:  RDEWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFS

Query:  KDIFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMF
        KDIFN+QI+EQNELQMELDN R K TNDMSLLNTS NEV+ERN SLVDELSHCRSEL DVS  +EELR QLL AE+EIEKLSSR SETENSLEK HG MF
Subjt:  KDIFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMF

Query:  RLGKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        RL KEL++CKH VT+LE ENERLNGIITFENENK KL EE+
Subjt:  RLGKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

TrEMBL top hitse value%identityAlignment
A0A5A7TAW3 Centromere-associated protein E isoform X11.7e-19273.53Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTA------------GVLASTVDCDSDTVDSSASPSTEHYLAAEI-DH
        MDKNK+RSDLLAAGRKKLQQFRKKKD+KGSGSQG+SS+NTSKLEQ DAD DI T             GVLAS+VD +   VDSSAS STEH LAAE  DH
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTA------------GVLASTVDCDSDTVDSSASPSTEHYLAAEI-DH

Query:  STVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLP
        STV VKQEMDLAETSAIDQ E  MQEVGYRE+++HPIQNAEA G VSS PSLPTD EEN+N   NLS  ESSSQISSASVEQQGRIVEV   CREEELL 
Subjt:  STVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLP

Query:  PQSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGR
          S SLLQAREDVGM    DALMQSGQV ETELAGD  L+TGG +E AAETTFK+T C++EE I A+VASVS A TESNSYSISSPGE LGM+NSSSS R
Subjt:  PQSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGR

Query:  DEWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSK
        D+WK+ERQ HAED IHSSRSQV+ + ED+FADQSEGH   S+TSVK+SD  DA       HMT  SDAQSGT SSFGQ+  FFDLLER+KEELIV+SFSK
Subjt:  DEWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSK

Query:  DIFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFR
        +IFN+QI+EQNELQMELDN R+K T D++LLNTS NEV+ERN SLVDELSHCRSEL+DVS  +E+ R QLL AE+EIEKLSS+ SETENSLEK HG MFR
Subjt:  DIFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFR

Query:  LGKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEERSCIS
        L KEL+DCKH VTVLE E ERLNGIITFENENK KL EE+   S
Subjt:  LGKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEERSCIS

A0A6J1BYX1 centromere-associated protein E2.0e-19674.58Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDI------------GTAGVLASTVDCDSDTVDSSASPSTEHYLAAEIDHS
        MDKNKSRSDLLAAGRKKLQQFRKKKDNKG+GSQGNSSKN SKLEQ DADT+I             T GVLAS+ DCD+D VDSSASPS EH  AAEIDHS
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDI------------GTAGVLASTVDCDSDTVDSSASPSTEHYLAAEIDHS

Query:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP
        TV +KQEMDLAETSAID+AE+ ++EVGYRED D  IQNAEA+GV+SS PS+  DAE NNNH+ NLSSTESSSQISSASV++QGR+VEV   CREEE+ P 
Subjt:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP

Query:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD
        +SASLLQAREDVG  M EDA MQS QV ET+LA D QL+TGGM E AAETTFKDT C+EEE+IT DVASVSG AT SN +SIS+PGE LGM++SSSS RD
Subjt:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD

Query:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD
        +WKEERQ HAEDM+H S  QVQYM EDNFA QSEGHE PS+TS KIS+GGDA      AHMTT    QSGT  SFGQES+F DLLERVKEELIVTSFSKD
Subjt:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD

Query:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL
        I NLQISEQN LQMELDNQ  KLTNDMSL+NTS NE+LERN SLVDELS CRSEL+DVS+  EEL++QLL AE+EIE LSSRA+E+E +LEKFH  MFRL
Subjt:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL

Query:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
         KEL+DCKH VT LEEENERLNG+IT ENENKMKL EE+
Subjt:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

A0A6J1F6C6 major antigen-like isoform X12.4e-19475.7Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDC------------DSDTVDSSASPSTEHYLAAEIDHS
        MDKNKSRSDLLAAGRKKLQQFRKKKDN+G GSQGNSSKNTSKLEQHD D DI TA   + +  C            D D VDSSASPS EH LAAEIDHS
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDC------------DSDTVDSSASPSTEHYLAAEIDHS

Query:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP
        T  VKQEMDLAETSAIDQAEV MQEVGY ED +HPIQN EA+  +    SLPTDAEEN+NHI NLSSTESS QISSASVEQQGRI EV   CREEELLP 
Subjt:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP

Query:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD
        QSASLLQAREDVGM   ED LMQS Q  ETE +GD QLETGGMNE AAETTFKD  C+++E+I ADV SVSGA TESNSY ISSPGEKLGM+NSSSS RD
Subjt:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD

Query:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD
        +WKEE Q HAEDMI SSR +VQYM EDNFADQSEGH+  S+T    SD GDA      AHMT+ SDA SGT SSF Q+SKF  LLER+KEELIVTSFSKD
Subjt:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD

Query:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL
        IFNLQISEQNELQ+ELDN   K T+DM+ LNTS +EVLERN SLVDELSHCRSELKDV T +EELR QLL AE+EIEKLSSR SETENSLEKFHG MFRL
Subjt:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL

Query:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        GKEL+DCKH VTVLEEENERLNGIIT ENENK KL EE+
Subjt:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

A0A6J1IDN6 golgin subfamily A member 4-like isoform X11.2e-19675.51Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDCDS------------DTVDSSASPSTEHYLAAEIDHS
        MDKNKSRSDLLAAGRKKLQQFRKKKDN+G GSQGNSSKNTSKLEQHD D DI TA   + +  C +            DTVDSSASPS EH LAAEIDHS
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDCDS------------DTVDSSASPSTEHYLAAEIDHS

Query:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP
        T  VKQEMDLAETSAIDQAEV MQE+ YRED +HPIQN EA+G +    SLPTDAEEN+NHI NLSSTESS QISS SVEQQGRI EV   CREEELLP 
Subjt:  TVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPP

Query:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD
        QSASLLQAREDVGM   EDAL+QS QV ETE +GD +LETGGMNE AAETTFKD  C++EE+I  DV SVSGA TESNSY ISSPGEKLGM+NSSSS RD
Subjt:  QSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRD

Query:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD
        +WKEE QAHAEDMI SSR +VQYM EDNFADQSEGH+  S+T    SD GDA      AHMT+  DA SGT SSFGQ+SKF DLLER+KEELIVTSFSKD
Subjt:  EWKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKD

Query:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL
        IFNLQI EQNELQ+ELDN   K TNDM+ LNTS NEVLERN SLV+ELSHCRSELKDVST +EELR+ LL AE+EIEKLS+R SETENSLEKFHG MFRL
Subjt:  IFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRL

Query:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        G EL+DCKH VTVLEEENERLNGIIT ENENK KL EE+
Subjt:  GKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

A0A6J1J1D8 centrosome-associated protein CEP250-like isoform X12.3e-19274.54Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDI-----------GTAGVLASTVDCDSDTVDSSASPSTEHYLAAEIDHST
        MDKNK+R +LLAAGRKKLQQFRKKKD KG GSQG+SSK+T+KLEQHDAD D             T GVLAS VDC  DTV SSASPSTE  LAAE+DHST
Subjt:  MDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDI-----------GTAGVLASTVDCDSDTVDSSASPSTEHYLAAEIDHST

Query:  VPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPPQ
          VKQE+DLAETS IDQ EV MQEVGYREDYDHPIQNAE++GV SS PSL   AEENN+ IYNLSS+ESSSQISSASVEQQ +IVEV   CR EELL P 
Subjt:  VPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEV---CREEELLPPQ

Query:  SASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRDE
        SASL QAREDVG  MK D LMQSG +  TELA D Q+ETGGMNE AAETTFKDTCC+ +++ITADVASVS A TESNSYSISSPGEKLGM+NSSSSGR++
Subjt:  SASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRDE

Query:  WKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKDI
        WKE RQ HAEDMIHSSRSQVQYM EDNFAD+SE H+ PS+TSVKISDGGD       AHMTT + A SGT SSFGQ SKF DLLERVKEELIVTSFSKDI
Subjt:  WKEERQAHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDA------AHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKDI

Query:  FNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRLG
        FN QISEQNELQM+LD                  EVL RNH+LVDELSHCRSELKDVS   EELR+QLLAAE+EI+KLSSRASETENS EKFHG MFRL 
Subjt:  FNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRLG

Query:  KELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER
        KEL+DCKH V+VLEEENERLNGIITFENENK KL +E+
Subjt:  KELNDCKHSVTVLEEENERLNGIITFENENKMKLVEER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31570.1 CONTAINS InterPro DOMAIN/s: Prefoldin (InterPro:IPR009053)5.0e-2726.54Show/hide
Query:  MDKNKSRSDLLAAGRKKLQQFR---------KKKDNKGSGSQGNSSKNTSKLEQHDADTDIG------------TAGVLASTVDCDSDTVDS---SASPS
        MDK K+R+D LAAGR+KLQQFR         +KKD+KGS SQG SSK ++K E+H+   D              T G   S V+   + VDS   S+   
Subjt:  MDKNKSRSDLLAAGRKKLQQFR---------KKKDNKGSGSQGNSSKNTSKLEQHDADTDIG------------TAGVLASTVDCDSDTVDS---SASPS

Query:  TEHYLAAEIDHSTVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQ---------NAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASV
           Y++    H +      +    T++ D +E R + V    D    +          N+  +G V S  S P D+E+   H  + S+ +     S    
Subjt:  TEHYLAAEIDHSTVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQ---------NAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASV

Query:  EQQGRIVEVCREEELLPPQSASLLQAREDVGM-RMKEDALMQSGQVRETELAGDTQLET-GGMNEPAAE-----------TTFKDTCCNEEEVITADVAS
        E +G  VE        P Q +SL +   DV + R + D +   G+++E ++   ++L    G+++ A E           ++   +  +E   +  D   
Subjt:  EQQGRIVEVCREEELLPPQSASLLQAREDVGM-RMKEDALMQSGQVRETELAGDTQLET-GGMNEPAAE-----------TTFKDTCCNEEEVITADVAS

Query:  VSGAATESNSYSISSPGEKLGMENSSSS-----GRD-----EWKEERQAHAEDMI-------------HSSRSQVQYMSEDNFADQSEGHEGPSKTSVKI
        + G      S  I    E L  E   +S      RD     E +E   A     +             H    ++  ++       +  HEG S + +++
Subjt:  VSGAATESNSYSISSPGEKLGMENSSSS-----GRD-----EWKEERQAHAEDMI-------------HSSRSQVQYMSEDNFADQSEGHEGPSKTSVKI

Query:  SD----GGDAAHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKDIFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDEL
         D     G   +    +  ++ + +  G  S     LER++EEL V+S  +DI ++Q++EQ+ LQ+E D+Q  +   ++S L  S++ V ERN SL +EL
Subjt:  SD----GGDAAHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKDIFNLQISEQNELQMELDNQRTKLTNDMSLLNTSFNEVLERNHSLVDEL

Query:  SHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRLGKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEERSCISMRMK
        S C+S+L   ++    L +QLLA E+++E  +++ +E + SLEK          +L++ K     L+ EN+ L  +I+  N+ K +L+EE+   +  +K
Subjt:  SHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRLGKELNDCKHSVTVLEEENERLNGIITFENENKMKLVEERSCISMRMK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGAATTCGTCATCACCCATGGTCTAGAAGAGTTCAATCATCATCGAAAGAAGATGAAGAGAACGTTTCAACAGATAATTGCTATAAAGTTTCAAATTTCTACTCG
TGTATTAAGAATGAGAGAGAAGCAAAGGTCAAGAGACTTTGGATTGCAGAGGTCGCCTTCCACACTAGAAGGTAATTTTGGACATGCACAAGCAGCTGACGAGGACGATC
GGGCAGAGATAAGACCAGGAAACGACCCAAAGGAAGATCATACCAAAGGGTCGGGCCAACTCAGCCCGACCCATTTGGTCGCCATTGGCCCGAGCTCGTTCGCCTCCATT
CGGTCTCTGCCTCTGGCGGCCTCGGTCCCGCCTGTTACTTTTGTTGATTCTGCCGTGTTTTGCTCTTCAATTCCAGAGCTTCTGCTAGTGGGGGCGAAAAACGGAGTGGA
TCGGATGGACAAGAACAAGAGTCGTTCCGATCTGCTCGCCGCAGGGAGGAAGAAGCTCCAGCAATTCCGCAAGAAGAAGGATAATAAAGGCAGTGGTAGCCAAGGAAATT
CATCAAAAAATACCAGTAAATTGGAACAGCACGATGCAGATACAGACATTGGAACTGCTGGAGTACTTGCATCTACTGTTGATTGCGATTCAGATACTGTAGATTCTTCA
GCATCACCTTCTACAGAACATTATTTGGCAGCTGAGATTGATCATTCAACAGTTCCAGTTAAGCAAGAGATGGATTTAGCTGAAACATCAGCAATTGACCAGGCAGAGGT
TCGGATGCAGGAAGTAGGATATAGGGAGGACTATGATCACCCAATCCAAAATGCTGAGGCTTCTGGAGTCGTATCCTCTAGACCTTCCCTTCCTACTGACGCTGAAGAGA
ATAATAATCACATTTATAATTTGTCTTCGACTGAATCCTCTTCCCAAATTTCTTCTGCTTCTGTTGAGCAGCAGGGAAGAATAGTTGAAGTATGTAGGGAAGAAGAGCTA
TTGCCTCCACAGTCTGCATCTTTGTTGCAAGCAAGGGAAGATGTAGGCATGCGCATGAAGGAGGATGCTTTGATGCAATCTGGTCAAGTCCGTGAAACAGAGCTTGCAGG
AGACACGCAGCTAGAGACTGGTGGCATGAATGAGCCAGCAGCAGAGACTACTTTTAAAGACACATGCTGCAATGAGGAAGAGGTTATTACAGCAGATGTGGCATCTGTAT
CTGGTGCTGCAACTGAGTCAAACAGTTATTCCATTTCTAGTCCGGGAGAAAAATTAGGTATGGAGAATAGTTCTAGTAGTGGTAGAGATGAGTGGAAAGAAGAAAGACAA
GCTCATGCAGAAGATATGATACATTCAAGTAGGAGTCAAGTGCAATATATGTCAGAAGATAATTTTGCAGATCAGTCTGAGGGCCATGAAGGGCCTTCAAAAACAAGTGT
GAAAATTTCTGATGGGGGAGATGCTGCACATATGACCACAATTTCGGATGCACAGTCGGGGACCTGTTCTTCCTTTGGACAAGAGAGTAAATTTTTTGATTTACTGGAGA
GAGTGAAAGAAGAGTTGATAGTAACAAGTTTCTCGAAAGATATCTTTAACTTGCAAATTTCTGAACAGAATGAACTACAAATGGAGCTTGATAACCAGCGTACTAAATTA
ACCAACGATATGTCTCTGCTCAATACCTCATTCAATGAAGTTCTTGAGAGGAATCATAGCCTCGTGGATGAACTTTCTCATTGCAGATCTGAACTTAAGGATGTCTCAAC
TGTGGAGGAGGAGCTCAGAAGTCAGCTGCTAGCTGCAGAGTCTGAGATAGAAAAGCTTTCTTCTAGAGCTAGTGAGACAGAGAATAGCCTGGAGAAGTTCCATGGACATA
TGTTCAGATTGGGAAAGGAGTTGAATGACTGCAAACATTCGGTAACAGTGTTAGAAGAGGAGAATGAAAGATTAAATGGTATTATCACCTTTGAAAACGAAAATAAAATG
AAACTAGTTGAGGAAAGGAGTTGTATATCAATGAGAATGAAAAGATCTTATCAGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGAATTCGTCATCACCCATGGTCTAGAAGAGTTCAATCATCATCGAAAGAAGATGAAGAGAACGTTTCAACAGATAATTGCTATAAAGTTTCAAATTTCTACTCG
TGTATTAAGAATGAGAGAGAAGCAAAGGTCAAGAGACTTTGGATTGCAGAGGTCGCCTTCCACACTAGAAGGTAATTTTGGACATGCACAAGCAGCTGACGAGGACGATC
GGGCAGAGATAAGACCAGGAAACGACCCAAAGGAAGATCATACCAAAGGGTCGGGCCAACTCAGCCCGACCCATTTGGTCGCCATTGGCCCGAGCTCGTTCGCCTCCATT
CGGTCTCTGCCTCTGGCGGCCTCGGTCCCGCCTGTTACTTTTGTTGATTCTGCCGTGTTTTGCTCTTCAATTCCAGAGCTTCTGCTAGTGGGGGCGAAAAACGGAGTGGA
TCGGATGGACAAGAACAAGAGTCGTTCCGATCTGCTCGCCGCAGGGAGGAAGAAGCTCCAGCAATTCCGCAAGAAGAAGGATAATAAAGGCAGTGGTAGCCAAGGAAATT
CATCAAAAAATACCAGTAAATTGGAACAGCACGATGCAGATACAGACATTGGAACTGCTGGAGTACTTGCATCTACTGTTGATTGCGATTCAGATACTGTAGATTCTTCA
GCATCACCTTCTACAGAACATTATTTGGCAGCTGAGATTGATCATTCAACAGTTCCAGTTAAGCAAGAGATGGATTTAGCTGAAACATCAGCAATTGACCAGGCAGAGGT
TCGGATGCAGGAAGTAGGATATAGGGAGGACTATGATCACCCAATCCAAAATGCTGAGGCTTCTGGAGTCGTATCCTCTAGACCTTCCCTTCCTACTGACGCTGAAGAGA
ATAATAATCACATTTATAATTTGTCTTCGACTGAATCCTCTTCCCAAATTTCTTCTGCTTCTGTTGAGCAGCAGGGAAGAATAGTTGAAGTATGTAGGGAAGAAGAGCTA
TTGCCTCCACAGTCTGCATCTTTGTTGCAAGCAAGGGAAGATGTAGGCATGCGCATGAAGGAGGATGCTTTGATGCAATCTGGTCAAGTCCGTGAAACAGAGCTTGCAGG
AGACACGCAGCTAGAGACTGGTGGCATGAATGAGCCAGCAGCAGAGACTACTTTTAAAGACACATGCTGCAATGAGGAAGAGGTTATTACAGCAGATGTGGCATCTGTAT
CTGGTGCTGCAACTGAGTCAAACAGTTATTCCATTTCTAGTCCGGGAGAAAAATTAGGTATGGAGAATAGTTCTAGTAGTGGTAGAGATGAGTGGAAAGAAGAAAGACAA
GCTCATGCAGAAGATATGATACATTCAAGTAGGAGTCAAGTGCAATATATGTCAGAAGATAATTTTGCAGATCAGTCTGAGGGCCATGAAGGGCCTTCAAAAACAAGTGT
GAAAATTTCTGATGGGGGAGATGCTGCACATATGACCACAATTTCGGATGCACAGTCGGGGACCTGTTCTTCCTTTGGACAAGAGAGTAAATTTTTTGATTTACTGGAGA
GAGTGAAAGAAGAGTTGATAGTAACAAGTTTCTCGAAAGATATCTTTAACTTGCAAATTTCTGAACAGAATGAACTACAAATGGAGCTTGATAACCAGCGTACTAAATTA
ACCAACGATATGTCTCTGCTCAATACCTCATTCAATGAAGTTCTTGAGAGGAATCATAGCCTCGTGGATGAACTTTCTCATTGCAGATCTGAACTTAAGGATGTCTCAAC
TGTGGAGGAGGAGCTCAGAAGTCAGCTGCTAGCTGCAGAGTCTGAGATAGAAAAGCTTTCTTCTAGAGCTAGTGAGACAGAGAATAGCCTGGAGAAGTTCCATGGACATA
TGTTCAGATTGGGAAAGGAGTTGAATGACTGCAAACATTCGGTAACAGTGTTAGAAGAGGAGAATGAAAGATTAAATGGTATTATCACCTTTGAAAACGAAAATAAAATG
AAACTAGTTGAGGAAAGGAGTTGTATATCAATGAGAATGAAAAGATCTTATCAGAATTAA
Protein sequenceShow/hide protein sequence
MVEFVITHGLEEFNHHRKKMKRTFQQIIAIKFQISTRVLRMREKQRSRDFGLQRSPSTLEGNFGHAQAADEDDRAEIRPGNDPKEDHTKGSGQLSPTHLVAIGPSSFASI
RSLPLAASVPPVTFVDSAVFCSSIPELLLVGAKNGVDRMDKNKSRSDLLAAGRKKLQQFRKKKDNKGSGSQGNSSKNTSKLEQHDADTDIGTAGVLASTVDCDSDTVDSS
ASPSTEHYLAAEIDHSTVPVKQEMDLAETSAIDQAEVRMQEVGYREDYDHPIQNAEASGVVSSRPSLPTDAEENNNHIYNLSSTESSSQISSASVEQQGRIVEVCREEEL
LPPQSASLLQAREDVGMRMKEDALMQSGQVRETELAGDTQLETGGMNEPAAETTFKDTCCNEEEVITADVASVSGAATESNSYSISSPGEKLGMENSSSSGRDEWKEERQ
AHAEDMIHSSRSQVQYMSEDNFADQSEGHEGPSKTSVKISDGGDAAHMTTISDAQSGTCSSFGQESKFFDLLERVKEELIVTSFSKDIFNLQISEQNELQMELDNQRTKL
TNDMSLLNTSFNEVLERNHSLVDELSHCRSELKDVSTVEEELRSQLLAAESEIEKLSSRASETENSLEKFHGHMFRLGKELNDCKHSVTVLEEENERLNGIITFENENKM
KLVEERSCISMRMKRSYQN