; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g30700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g30700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionSequence-specific DNA binding transcription factor
Genome locationchr5:22960490..22961869
RNA-Seq ExpressionMoc05g30700
SyntenyMoc05g30700
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596531.1 hypothetical protein SDJN03_09711, partial [Cucurbita argyrosperma subsp. sororia]1.3e-19080.68Show/hide
Query:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS
        ME  SLGG GGGGG GGGG+FSGMNSAMLGL+LPLH +  NP N HQLHHP M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA +D 
Subjt:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS

Query:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHIQ----QQQQQRCFHATENATAAAAEGS--K
        CRVVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAE PSH+Q    QQQQQRCFHATE A AAA  G+   
Subjt:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHIQ----QQQQQRCFHATENATAAAAEGS--K

Query:  SEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGG--MTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE
         ED+D+EE+E  E+E++EEEEEIEG+SRG+EEE+E ESRKRARKGG  + AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV YQSQAFELE
Subjt:  SEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGG--MTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE

Query:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KQRLKWLKFR+KKERDMERAKLENEKRRLE ERM L+VKQKEL+ MD+HHYQQ   QHSSN+RGDPSSITG
Subjt:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_022933836.1 uncharacterized protein LOC111441130 [Cucurbita moschata]2.7e-19180.68Show/hide
Query:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPYEHHHHH-HPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS
        ME  SLG  GGGGG GGGG+FSG+NSAMLGL+LPLH +  NP N HQLHH  M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA +D 
Subjt:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPYEHHHHH-HPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS

Query:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEP-------SHIQQQQQQRCFHATENATAAAAEGSK
        CRVVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAEP        H QQQQQQRCFHATE+A AAA  G+ 
Subjt:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEP-------SHIQQQQQQRCFHATENATAAAAEGSK

Query:  SEDDDE-EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE
         +D+DE +EEEESEE+E++EEEEIEG+SRG+EEE+E ESRKRARKGG+  AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV YQSQAFELE
Subjt:  SEDDDE-EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE

Query:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KQRLKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQQ  QHSSN+RGDPSSITG
Subjt:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_023005783.1 ESF1 homolog [Cucurbita maxima]1.3e-19080.68Show/hide
Query:  GSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGD
        G  GGGGGGGG GGGG+FSGMNSAMLGL+LPLH +  NP N HQLHHP M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA +D N D
Subjt:  GSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGD

Query:  AKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV
         KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV
Subjt:  AKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV

Query:  VENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAEGSKSEDD
        VENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAE PSH+     QQQQQQRCFHATE A AAA  G+  +DD
Subjt:  VENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAEGSKSEDD

Query:  DE--EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE
        DE  EE+E  E+E++EEEEEIEG+SRG+EEE+E ESRKRARKGG+     AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV+YQSQAFELE
Subjt:  DE--EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE

Query:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KQRLKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQ   QHSSN+RGDPSSITG
Subjt:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_023539309.1 ESF1 homolog [Cucurbita pepo subsp. pepo]4.6e-19181.2Show/hide
Query:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS
        ME  SLGG GGGGG GGGG+FSGMNSAMLGL+LPLH +  NP N HQLHHP M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA +D 
Subjt:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS

Query:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHIQQ--QQQQRCFHATENATAAAAEGS--KSE
        CRVVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAE PSH+QQ  QQQQRCFHATE A  AA  G+    E
Subjt:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHIQQ--QQQQRCFHATENATAAAAEGS--KSE

Query:  DDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQR
        D+D+EE+E  E+E++EEEEEIEG+SRG+EEE+E ESRKRARKGG+  AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV YQSQAFELEKQR
Subjt:  DDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQR

Query:  LKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        LKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQ   QHSSN+RGDPSSITG
Subjt:  LKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]2.0e-19183.63Show/hide
Query:  MNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRL
        MNS+MLGLELPLHQN  NP NPHQLHHPP+  Y  H  HHH QPPP ++KYPYP KPKPQQ N+SD+EE GFAADDSNGD KKKISPWQRMKWTDMMVRL
Subjt:  MNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQNLLDSMELTPKLKEEV
        LITAVFYIGDEGG+EP DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ LLDSMELTPK KEEV
Subjt:  LITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQNLLDSMELTPKLKEEV

Query:  RKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAA----EGSKSEDDDEEEEE--ESEEEEEEEEEEI
        RKLLNSKHLFFREMCAYHN+CRHGT        HPSPDTAAEPSH+ QQQQQQRCFHATE  T+AAA    E SKS D++EEEEE  ESEEEEEEE+EEI
Subjt:  RKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAA----EGSKSEDDDEEEEE--ESEEEEEEEEEEI

Query:  EGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENE
        EG     EEEEE ESRKRARKGG+TA MQQLSAEV+GVVQDGGRS WEKKQW+KSRLIQLEEQQV YQ+QAFELEKQRLKW+KFR+KKERDMERAKLENE
Subjt:  EGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENE

Query:  KRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KRRLENERM+L+VKQKEL+LM +HHYQQQQ QHSSN+RGDPSSITG
Subjt:  KRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein2.1e-18979.29Show/hide
Query:  MEPGSLG----------GGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDE
        ME  SLG          GGGGGGG GGGG+FSGMNS+MLGLELPLHQN  NPTNPHQLHHPPM  Y +H  HHH QPP  +VKYP+P K KPQQ NLSD+
Subjt:  MEPGSLG----------GGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDE

Query:  EEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EE GFAADDSNGD KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EPVDH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATENATA--
        VNDILGKGTAC+VVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH T        HPSPD A EPSH+  QQQQQQ CFHAT+  T+  
Subjt:  VNDILGKGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATENATA--

Query:  -AAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ
         AA EGSKS D++EEEEEE E EEEEE+EE EG     EEEEE ESRKRARKGGM TA MQQLSAEV+GV+ DGGRS WEKKQW+KSRLIQLEEQQV +Q
Subjt:  -AAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ

Query:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        +QAFELEKQRLKW+KFR+KKERDMERAKLENEKR LENERM+L+VK+ EL+LM + HYQQQQ QHSSN+RGDPSSITG
Subjt:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

A0A1S3B6J6 ESF1 homolog2.1e-18679.74Show/hide
Query:  GSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGD
        G  GGGGG G  GGGG+FSGMNS+MLGLELPLHQN  NPTNPHQLHHPPM  Y +H  HHH QPP  +VKYP+P K KPQQ NLSD+EE GFAADDSNGD
Subjt:  GSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGD

Query:  AKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV
         KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EPVDH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+V
Subjt:  AKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV

Query:  VENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATE---NATAAAAEGSKSEDDD
        VENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH T        HPSPD A EPSH+  QQQQQQ CFHAT+   +A+ AA E SKS D++
Subjt:  VENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATE---NATAAAAEGSKSEDDD

Query:  EEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWL
        +EEEEE E EEEEE+EE EG     EEEEE ESRKRARKGG+TA MQQLSAEV+GV+ DGGRS WEKKQW+KSRLIQLEEQ+V +Q+QAFELEKQRLKW+
Subjt:  EEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWL

Query:  KFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KFR+KKERDMERAKLENEKR LENERM+L+VK+ EL+LM + HYQQQQ QHSSN+RGDPSSITG
Subjt:  KFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411301.3e-19180.68Show/hide
Query:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPYEHHHHH-HPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS
        ME  SLG  GGGGG GGGG+FSG+NSAMLGL+LPLH +  NP N HQLHH  M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA +D 
Subjt:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPYEHHHHH-HPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDS

Query:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEP-------SHIQQQQQQRCFHATENATAAAAEGSK
        CRVVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAEP        H QQQQQQRCFHATE+A AAA  G+ 
Subjt:  CRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEP-------SHIQQQQQQRCFHATENATAAAAEGSK

Query:  SEDDDE-EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE
         +D+DE +EEEESEE+E++EEEEIEG+SRG+EEE+E ESRKRARKGG+  AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV YQSQAFELE
Subjt:  SEDDDE-EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE

Query:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KQRLKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQQ  QHSSN+RGDPSSITG
Subjt:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

A0A6J1J5X4 ribosome quality control complex subunit 2-like4.6e-18980.13Show/hide
Query:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNG
        MEP SLGGGGG      GG+F G++S+MLGLELPLHQ  P+NPHQLHHPPM  Y  H  HH  QPPPAAVK PYPAKPKPQQ N+SDEEE G  ADDSN 
Subjt:  MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNG

Query:  DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACR
        DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EP+DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACR
Subjt:  DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACR

Query:  VVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENA-TAAAAEGSKSEDDDEE
        VVEN  LL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHN+CRHG G++ GGG +  P+T AEPSH+ QQQQQQRCFHATE A  A A EGSKS D++EE
Subjt:  VVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENA-TAAAAEGSKSEDDDEE

Query:  EE------EESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQR
        +E      EE +EEEEEEEEE+EGSSR  EEEEE ES+KR RK G TA +QQ+SAEV+GVVQDGGRS WEKKQW+K RLIQLEEQQV+YQSQ  E+EKQR
Subjt:  EE------EESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQR

Query:  LKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        LKWLKFR KKERDMERAKLENEKRRLENERM+L+VKQKEL+L DLH+YQQQQ QHSSNRRGDPSSITG
Subjt:  LKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

A0A6J1L352 ESF1 homolog6.4e-19180.68Show/hide
Query:  GSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGD
        G  GGGGGGGG GGGG+FSGMNSAMLGL+LPLH +  NP N HQLHHP M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA +D N D
Subjt:  GSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGD

Query:  AKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV
         KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV
Subjt:  AKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRV

Query:  VENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAEGSKSEDD
        VENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAE PSH+     QQQQQQRCFHATE A AAA  G+  +DD
Subjt:  VENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAEGSKSEDD

Query:  DE--EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE
        DE  EE+E  E+E++EEEEEIEG+SRG+EEE+E ESRKRARKGG+     AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV+YQSQAFELE
Subjt:  DE--EEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELE

Query:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KQRLKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQ   QHSSN+RGDPSSITG
Subjt:  KQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors7.5e-5136.12Show/hide
Query:  GGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHH-----------HHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNG
        GG    G   + G +   L   + +H  +  N    H+P   P                HH  Q     +      K + ++ ++SD++E  F  +  +G
Subjt:  GGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHH-----------HHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNG

Query:  -----DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGK
             +   K SPWQR+KWTD MV+LLITAV YIGD+     +D + ++K   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+
Subjt:  -----DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGK

Query:  GTACRVVENQNLLDSM-ELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAEGSKSED
        GT+C+VVEN  LLDS+  L  K K++VRK+++SKHLF+ EMC+YHN          G   H   D A + S +Q   + R  H  +++     E    ED
Subjt:  GTACRVVENQNLLDSM-ELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAEGSKSED

Query:  DDEEEEEESEEEEEE---EEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQ-------------DGGRSSWEKKQWIKSRLIQLEEQQ
         D + +E  E EE+     +  +     G    +++        G   + +  L    + + Q             + GR+   +KQW++SR +QLEEQ+
Subjt:  DDEEEEEESEEEEEE---EEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQ-------------DGGRSSWEKKQWIKSRLIQLEEQQ

Query:  VRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL
        ++ Q +  ELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  VRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)6.0e-4837.05Show/hide
Query:  NLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLN
        ++S+++E    + D    +K+  SPWQR+KW D MV+L+ITA+ YIG++ G++        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFNDLN
Subjt:  NLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLN

Query:  KRYKRVNDILGKGTACRVVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENA
        KRYK++N++LG+GT+C VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN          G   H   D A           QR  H     
Subjt:  KRYKRVNDILGKGTACRVVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENA

Query:  TAAAAEGSKSEDDDEE---EEEESEEEEEEEEEEIEGSSRGN-----EEEEEMESRKRARKGGMTAAMQQLSAEV-IGVVQDGGRSSWEKKQWIKSRLIQ
              GS+ + D++E    + E  +++++ EE+ +G+          + +  E      KG     + +  A+V  G+  D  +++  ++Q I+S+ ++
Subjt:  TAAAAEGSKSEDDDEE---EEEESEEEEEEEEEEIEGSSRGN-----EEEEEMESRKRARKGGMTAAMQQLSAEV-IGVVQDGGRSSWEKKQWIKSRLIQ

Query:  LEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL
        LE ++++ Q++  ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  LEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors3.5e-9650.98Show/hide
Query:  LFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHHHHHHPQPPPAAVKYPYPAKPK---PQQLNLSDEEEAGFAA-------DDSNGDAKKKISPWQ
        +FSG +  ML LE+P +  NP N  Q  HP   PY        QPP  ++ YPY +KPK   P      D+E+ G  +       D +  D K+K+S W 
Subjt:  LFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHHHHHHPQPPPAAVKYPYPAKPK---PQQLNLSDEEEAGFAA-------DDSNGDAKKKISPWQ

Query:  RMKWTDMMVRLLITAVFYIGDEGGA-EPVDHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACR
        RMKWTD MVRLLI AVFYIGDE G  +PVD   KKK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG ACR
Subjt:  RMKWTDMMVRLLITAVFYIGDEGGA-EPVDHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACR

Query:  VVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAEGSKSEDDDEEE
        VVENQ LL+SM+ LTPKLK+EV+KLLNSKHLFFREMCAYHNSC H  G +    P  +P +   PS     QQQ CFHA E    A      +E  + EE
Subjt:  VVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAEGSKSEDDDEEE

Query:  EEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFR
        E ES+  E+ E E         EE EE E+RK+ R   ++ A+++L  E   VV+D G+S WEKK+WI+ +++++EE+++ Y+ +  E+EKQR+KW+++R
Subjt:  EEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFR

Query:  NKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        +KKER+ME+AKL+N++RRLE ERM+L++++ E+EL +L          SS  R DPSS  G
Subjt:  NKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTGGTAGTTTAGGCGGCGGCGGCGGCGGCGGCGGTAATGGTGGTGGAGGGCTGTTTTCCGGGATGAATTCTGCAATGTTGGGATTGGAATTACCTCTTCATCA
GAACAACCCCACAAATCCCCACCAATTACACCACCCCCCAATGGCGCCTTATGAGCATCACCACCACCACCACCCACAACCACCGCCGGCGGCCGTGAAGTACCCTTATC
CGGCGAAGCCGAAGCCGCAGCAGTTGAATCTCAGCGACGAGGAGGAGGCGGGGTTCGCGGCGGACGACAGCAACGGTGACGCGAAGAAGAAAATCTCGCCGTGGCAGAGG
ATGAAGTGGACCGACATGATGGTCCGGCTGCTGATCACGGCGGTTTTCTACATCGGAGACGAGGGCGGGGCGGAGCCGGTGGATCACGCCGGGAAGAAGAAGCCGGTGGG
GCTGCTGCAGAAGAAGGGGAAATGGAAATCGGTGTCCCGCGCGATGATGGAGAAAGGATTCTACGTTTCGCCGCAGCAATGCGAAGACAAATTCAACGATTTAAACAAAC
GATACAAAAGAGTGAACGACATACTCGGAAAAGGAACCGCGTGCAGAGTCGTCGAGAATCAAAACCTCTTGGATTCCATGGAATTAACACCCAAACTGAAGGAAGAAGTC
CGAAAATTACTCAATTCCAAGCACCTGTTCTTCAGGGAAATGTGCGCTTACCACAACAGCTGCCGTCACGGCACCGGCAGCAACAGCGGCGGCGGTCCCCACCCCTCGCC
GGACACGGCGGCGGAACCATCTCACATTCAACAACAACAACAACAGCGGTGTTTCCACGCGACCGAGAACGCCACCGCCGCCGCCGCCGAGGGGTCCAAAAGCGAGGACG
ACGACGAGGAGGAAGAGGAAGAATCGGAGGAAGAAGAGGAGGAGGAGGAGGAGGAGATTGAGGGAAGTTCGCGAGGGAATGAAGAGGAAGAGGAAATGGAGTCGAGGAAG
CGGGCGAGGAAGGGGGGAATGACGGCGGCGATGCAGCAACTAAGCGCGGAGGTGATCGGAGTAGTACAGGACGGGGGGAGGAGCTCGTGGGAGAAGAAGCAGTGGATCAA
GAGCCGGCTGATTCAACTGGAAGAGCAGCAAGTGAGGTACCAATCGCAGGCGTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCAGGAACAAGAAGGAAAGGG
ACATGGAAAGAGCGAAGCTGGAGAACGAGAAGCGGCGGCTCGAGAACGAGCGGATGGTTCTTGTGGTGAAGCAGAAGGAGCTCGAATTGATGGATCTCCACCATTATCAG
CAGCAGCAGCACCAACATTCCTCCAATAGACGAGGGGATCCATCGTCGATTACAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACCTGGTAGTTTAGGCGGCGGCGGCGGCGGCGGCGGTAATGGTGGTGGAGGGCTGTTTTCCGGGATGAATTCTGCAATGTTGGGATTGGAATTACCTCTTCATCA
GAACAACCCCACAAATCCCCACCAATTACACCACCCCCCAATGGCGCCTTATGAGCATCACCACCACCACCACCCACAACCACCGCCGGCGGCCGTGAAGTACCCTTATC
CGGCGAAGCCGAAGCCGCAGCAGTTGAATCTCAGCGACGAGGAGGAGGCGGGGTTCGCGGCGGACGACAGCAACGGTGACGCGAAGAAGAAAATCTCGCCGTGGCAGAGG
ATGAAGTGGACCGACATGATGGTCCGGCTGCTGATCACGGCGGTTTTCTACATCGGAGACGAGGGCGGGGCGGAGCCGGTGGATCACGCCGGGAAGAAGAAGCCGGTGGG
GCTGCTGCAGAAGAAGGGGAAATGGAAATCGGTGTCCCGCGCGATGATGGAGAAAGGATTCTACGTTTCGCCGCAGCAATGCGAAGACAAATTCAACGATTTAAACAAAC
GATACAAAAGAGTGAACGACATACTCGGAAAAGGAACCGCGTGCAGAGTCGTCGAGAATCAAAACCTCTTGGATTCCATGGAATTAACACCCAAACTGAAGGAAGAAGTC
CGAAAATTACTCAATTCCAAGCACCTGTTCTTCAGGGAAATGTGCGCTTACCACAACAGCTGCCGTCACGGCACCGGCAGCAACAGCGGCGGCGGTCCCCACCCCTCGCC
GGACACGGCGGCGGAACCATCTCACATTCAACAACAACAACAACAGCGGTGTTTCCACGCGACCGAGAACGCCACCGCCGCCGCCGCCGAGGGGTCCAAAAGCGAGGACG
ACGACGAGGAGGAAGAGGAAGAATCGGAGGAAGAAGAGGAGGAGGAGGAGGAGGAGATTGAGGGAAGTTCGCGAGGGAATGAAGAGGAAGAGGAAATGGAGTCGAGGAAG
CGGGCGAGGAAGGGGGGAATGACGGCGGCGATGCAGCAACTAAGCGCGGAGGTGATCGGAGTAGTACAGGACGGGGGGAGGAGCTCGTGGGAGAAGAAGCAGTGGATCAA
GAGCCGGCTGATTCAACTGGAAGAGCAGCAAGTGAGGTACCAATCGCAGGCGTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCAGGAACAAGAAGGAAAGGG
ACATGGAAAGAGCGAAGCTGGAGAACGAGAAGCGGCGGCTCGAGAACGAGCGGATGGTTCTTGTGGTGAAGCAGAAGGAGCTCGAATTGATGGATCTCCACCATTATCAG
CAGCAGCAGCACCAACATTCCTCCAATAGACGAGGGGATCCATCGTCGATTACAGGCTGA
Protein sequenceShow/hide protein sequence
MEPGSLGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGDAKKKISPWQR
MKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQNLLDSMELTPKLKEEV
RKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRK
RARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQ
QQQHQHSSNRRGDPSSITG