; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019578 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019578
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSequence-specific DNA binding transcription factor
Genome locationscaffold729:703019..704401
RNA-Seq ExpressionMS019578
SyntenyMS019578
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138003.2 uncharacterized protein DDB_G0290301 [Cucumis sativus]8.6e-19079.71Show/hide
Query:  MEPGSLGG---------GGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDE
        ME  SLGG         GGGGGGG GGGG+FSGMNS+MLGLELPLHQN  NPTNPHQLHHPPM  Y +H  HHH QPP  +VKYP+P K KPQQ NLSD+
Subjt:  MEPGSLGG---------GGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDE

Query:  EEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EE GFAADDSNGD KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EPVDH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATENATAA-
        VNDILGKGTAC+VVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH T        HPSPD A EPSH+  QQQQQQ CFHAT+  T+A 
Subjt:  VNDILGKGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATENATAA-

Query:  -AAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ
         AA EGSKS D++EEEEEE E EEEEE+EE EG     EEEEE ESRKRARKGGM TA MQQLSAEV+GV+ DGGRS WEKKQW+KSRLIQLEEQQV +Q
Subjt:  -AAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ

Query:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        +QAFELEKQRLKW+KFR+KKERDMERAKLENEKR LENERM+L+VK+ EL+LM + HYQQQQ QHSSN+RGDPSSITG
Subjt:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_022982944.1 ribosome quality control complex subunit 2-like [Cucurbita maxima]1.1e-18979.96Show/hide
Query:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN
        MEP SLGGGGG       GG+F G++S+MLGLELPLHQ  P+NPHQLHHPPM  Y  H  HH  QPPPAAVK PYPAKPKPQQ N+SDEEE G  ADDSN
Subjt:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN

Query:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
         DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EP+DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
Subjt:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC

Query:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKSEDDDE
        RVVEN  LL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHN+CRHG G++ GGG +  P+T AEPSH+ QQQQQQRCFHATE A  A A EGSKS D++E
Subjt:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKSEDDDE

Query:  EEE------EESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQ
        E+E      EE +EEEEEEEEE+EGSSR  EEEEE ES+KR RK G TA +QQ+SAEV+GVVQDGGRS WEKKQW+K RLIQLEEQQV+YQSQ  E+EKQ
Subjt:  EEE------EESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQ

Query:  RLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        RLKWLKFR KKERDMERAKLENEKRRLENERM+L+VKQKEL+L DLH+YQQQQ QHSSNRRGDPSSITG
Subjt:  RLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_023005783.1 ESF1 homolog [Cucurbita maxima]5.1e-19080.13Show/hide
Query:  MEPGSLG---GGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFA
        ME  SLG   GGGGGGGG GGGG+FSGMNSAMLGL+LPLH +  NP N HQLHHP M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA
Subjt:  MEPGSLG---GGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFA

Query:  ADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG
         +D N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG
Subjt:  ADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG

Query:  KGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAA
        KGTACRVVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAE PSH+     QQQQQQRCFHATE A AAA  
Subjt:  KGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAA

Query:  EGSKSEDDDEEEEEESEE-EEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ
             +DD+++EE+ESEE E++EEEEEIEG+SRG+EEE+E ESRKRARKGG+     AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV+YQ
Subjt:  EGSKSEDDDEEEEEESEE-EEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ

Query:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        SQAFELEKQRLKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQ   QHSSN+RGDPSSITG
Subjt:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_023527808.1 uncharacterized protein DDB_G0283697-like [Cucurbita pepo subsp. pepo]3.0e-19080.3Show/hide
Query:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN
        MEP SLGGGGG       GG+F G++S+MLGLELPLHQ NP+NPHQLHHPPM  Y  H  HH  QPPPAAVK PYPAKPKPQQ N+SDEEE G  ADDSN
Subjt:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN

Query:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
         DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EP+DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
Subjt:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC

Query:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKSEDDDE
        RVVEN  LL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHN+CRHG G++ G G + SP+  AEPSH+ QQQQQQRCFHATE A AA A EGSKS D++E
Subjt:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKSEDDDE

Query:  EEEEE----SEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRL
        E+E+E     EE+EEEEEE+IEGSSR  EEEEE ES+KR RK G TA +QQ+SAEV+GVVQDGGRS WEKKQW+K RLIQLEEQQV+YQSQ  E+EKQRL
Subjt:  EEEEE----SEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRL

Query:  KWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KWLKFR KKERDMERAKLENEKRRLENERM+L+VKQKEL+L DLH+YQQQQ QHSSNRRGDPSSITG
Subjt:  KWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]2.1e-19183.63Show/hide
Query:  MNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRL
        MNS+MLGLELPLHQN  NP NPHQLHHPP+  Y  H  HHH QPPP ++KYPYP KPKPQQ N+SD+EE GFAADDSNGD KKKISPWQRMKWTDMMVRL
Subjt:  MNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQNLLDSMELTPKLKEEV
        LITAVFYIGDEGG+EP DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ LLDSMELTPK KEEV
Subjt:  LITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQNLLDSMELTPKLKEEV

Query:  RKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAA---AAAEGSKSEDDDEEEEE--ESEEEEEEEEEEI
        RKLLNSKHLFFREMCAYHN+CRHGT        HPSPDTAAEPSH+ QQQQQQRCFHATE  T+A   AA E SKS D++EEEEE  ESEEEEEEE+EEI
Subjt:  RKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAA---AAAEGSKSEDDDEEEEE--ESEEEEEEEEEEI

Query:  EGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENE
        EG     EEEEE ESRKRARKGG+TA MQQLSAEV+GVVQDGGRS WEKKQW+KSRLIQLEEQQV YQ+QAFELEKQRLKW+KFR+KKERDMERAKLENE
Subjt:  EGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENE

Query:  KRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        KRRLENERM+L+VKQKEL+LM +HHYQQQQ QHSSN+RGDPSSITG
Subjt:  KRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein4.2e-19079.71Show/hide
Query:  MEPGSLGG---------GGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDE
        ME  SLGG         GGGGGGG GGGG+FSGMNS+MLGLELPLHQN  NPTNPHQLHHPPM  Y +H  HHH QPP  +VKYP+P K KPQQ NLSD+
Subjt:  MEPGSLGG---------GGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDE

Query:  EEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EE GFAADDSNGD KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EPVDH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATENATAA-
        VNDILGKGTAC+VVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH T        HPSPD A EPSH+  QQQQQQ CFHAT+  T+A 
Subjt:  VNDILGKGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI--QQQQQQRCFHATENATAA-

Query:  -AAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ
         AA EGSKS D++EEEEEE E EEEEE+EE EG     EEEEE ESRKRARKGGM TA MQQLSAEV+GV+ DGGRS WEKKQW+KSRLIQLEEQQV +Q
Subjt:  -AAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ

Query:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        +QAFELEKQRLKW+KFR+KKERDMERAKLENEKR LENERM+L+VK+ EL+LM + HYQQQQ QHSSN+RGDPSSITG
Subjt:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411307.1e-19080.08Show/hide
Query:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPYEHHHHH-HPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADD
        ME  SLG  GGGGGG GGGG+FSG+NSAMLGL+LPLH +  NP N HQLHH  M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA +D
Subjt:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPYEHHHHH-HPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADD

Query:  SNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
         N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  SNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEP-------SHIQQQQQQRCFHATENATAAAAAEG
        ACRVVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAEP        H QQQQQQRCFHATE+A AAA    
Subjt:  ACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEP-------SHIQQQQQQRCFHATENATAAAAAEG

Query:  SKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFEL
           ++D+++EEEESEE+E++EEEEIEG+SRG+EEE+E ESRKRARKGG+  AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV YQSQAFEL
Subjt:  SKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM-TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFEL

Query:  EKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        EKQRLKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQQ  QHSSN+RGDPSSITG
Subjt:  EKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

A0A6J1F2H4 transcription initiation factor TFIID subunit 7-like3.3e-18779.32Show/hide
Query:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN
        MEP SLGGGGG       GG+F G++S+MLGLELPLHQ NP+NPHQLHHPPM  Y  H  HH  QPPPAAVK PYPAKPKPQQ N+SDEEE G  ADDSN
Subjt:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN

Query:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
         DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EP+DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
Subjt:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC

Query:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKS---ED
        RVVEN  LL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHN+CRHG G++SGGG + SP+   EPSH+ QQQQQQRCFHATE A AA A EGSKS   E+
Subjt:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKS---ED

Query:  DDEEEEEESEEEEEE-EEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRL
        +DE+E++E+EE++EE EEEEI+GSS+  EEEEE ES+KR RK G T+ +QQ+ AEV+GVVQDGGRS WEKKQW+K RLIQLEEQQV YQSQ  E+EKQRL
Subjt:  DDEEEEEESEEEEEE-EEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRL

Query:  KWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHY--QQQQHQHSSNRRGDPSSITG
        KWLKFR KKERDMERAKLENEKRRLENERM+L+VKQKEL+L DLH+Y  QQQQ QHSSNRRGDPSSITG
Subjt:  KWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHY--QQQQHQHSSNRRGDPSSITG

A0A6J1J5X4 ribosome quality control complex subunit 2-like5.5e-19079.96Show/hide
Query:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN
        MEP SLGGGGG       GG+F G++S+MLGLELPLHQ  P+NPHQLHHPPM  Y  H  HH  QPPPAAVK PYPAKPKPQQ N+SDEEE G  ADDSN
Subjt:  MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSN

Query:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
         DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+EP+DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
Subjt:  GDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC

Query:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKSEDDDE
        RVVEN  LL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHN+CRHG G++ GGG +  P+T AEPSH+ QQQQQQRCFHATE A  A A EGSKS D++E
Subjt:  RVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHI-QQQQQQRCFHATENATAAAAAEGSKSEDDDE

Query:  EEE------EESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQ
        E+E      EE +EEEEEEEEE+EGSSR  EEEEE ES+KR RK G TA +QQ+SAEV+GVVQDGGRS WEKKQW+K RLIQLEEQQV+YQSQ  E+EKQ
Subjt:  EEE------EESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQ

Query:  RLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        RLKWLKFR KKERDMERAKLENEKRRLENERM+L+VKQKEL+L DLH+YQQQQ QHSSNRRGDPSSITG
Subjt:  RLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

A0A6J1L352 ESF1 homolog2.4e-19080.13Show/hide
Query:  MEPGSLG---GGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFA
        ME  SLG   GGGGGGGG GGGG+FSGMNSAMLGL+LPLH +  NP N HQLHHP M  Y         QPPP AV+YPYPAKPKPQQ NLSD+EE GFA
Subjt:  MEPGSLG---GGGGGGGGNGGGGLFSGMNSAMLGLELPLHQN--NPTNPHQLHHPPMAPY-EHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFA

Query:  ADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG
         +D N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG E  DHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG
Subjt:  ADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG

Query:  KGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAA
        KGTACRVVENQ LLDSMELTPK KEEVRKLLNSKHLFFREMCAYHN+CRH TG+N GGG HPSPDTAAE PSH+     QQQQQQRCFHATE A AAA  
Subjt:  KGTACRVVENQNLLDSMELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAE-PSHI-----QQQQQQRCFHATENATAAAAA

Query:  EGSKSEDDDEEEEEESEE-EEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ
             +DD+++EE+ESEE E++EEEEEIEG+SRG+EEE+E ESRKRARKGG+     AAMQQL+AEVIGV+QDGGRS WEKKQW+KSRLIQLEEQQV+YQ
Subjt:  EGSKSEDDDEEEEEESEE-EEEEEEEEIEGSSRGNEEEEEMESRKRARKGGM----TAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQ

Query:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        SQAFELEKQRLKWLKFR+KKERDMERAKLENEKRRLE ERMVL+VKQKEL+ MD+HHYQQ   QHSSN+RGDPSSITG
Subjt:  SQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors2.9e-5036.24Show/hide
Query:  GGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHH-----------HHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNG
        GG    G   + G +   L   + +H  +  N    H+P   P                HH  Q     +      K + ++ ++SD++E  F  +  +G
Subjt:  GGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHH-----------HHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNG

Query:  -----DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGK
             +   K SPWQR+KWTD MV+LLITAV YIGD+     +D + ++K   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+
Subjt:  -----DAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGK

Query:  GTACRVVENQNLLDSM-ELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAAEGSKSE
        GT+C+VVEN  LLDS+  L  K K++VRK+++SKHLF+ EMC+YHN          G   H   D A + S +Q   + R  H  ++    +     +  
Subjt:  GTACRVVENQNLLDSM-ELTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAAEGSKSE

Query:  DDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRAR------KGGMTAAMQQLSAEVIGVVQ-------------DGGRSSWEKKQWIKSRLIQL
        DD++ + +  E +E EE+    G  R N         K+ R       G   + +  L    + + Q             + GR+   +KQW++SR +QL
Subjt:  DDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRAR------KGGMTAAMQQLSAEVIGVVQ-------------DGGRSSWEKKQWIKSRLIQL

Query:  EEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL
        EEQ+++ Q +  ELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  EEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)7.8e-4836.94Show/hide
Query:  NLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLN
        ++S+++E    + D    +K+  SPWQR+KW D MV+L+ITA+ YIG++ G++        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFNDLN
Subjt:  NLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLN

Query:  KRYKRVNDILGKGTACRVVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENA
        KRYK++N++LG+GT+C VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN          G   H   D A           QR  H     
Subjt:  KRYKRVNDILGKGTACRVVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENA

Query:  TAAAAAEGSKSEDDDEE---EEEESEEEEEEEEEEIEGSSRGN-----EEEEEMESRKRARKGGMTAAMQQLSAEV-IGVVQDGGRSSWEKKQWIKSRLI
               GS+ + D++E    + E  +++++ EE+ +G+          + +  E      KG     + +  A+V  G+  D  +++  ++Q I+S+ +
Subjt:  TAAAAAEGSKSEDDDEE---EEEESEEEEEEEEEEIEGSSRGN-----EEEEEMESRKRARKGGMTAAMQQLSAEV-IGVVQDGGRSSWEKKQWIKSRLI

Query:  QLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL
        +LE ++++ Q++  ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  QLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors6.3e-9850.87Show/hide
Query:  LFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHHHHHHPQPPPAAVKYPYPAKPK---PQQLNLSDEEEAGFAA-------DDSNGDAKKKISPWQ
        +FSG +  ML LE+P +  NP N  Q  HP   PY        QPP  ++ YPY +KPK   P      D+E+ G  +       D +  D K+K+S W 
Subjt:  LFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHHHHHHPQPPPAAVKYPYPAKPK---PQQLNLSDEEEAGFAA-------DDSNGDAKKKISPWQ

Query:  RMKWTDMMVRLLITAVFYIGDEGGA-EPVDHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACR
        RMKWTD MVRLLI AVFYIGDE G  +PVD   KKK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG ACR
Subjt:  RMKWTDMMVRLLITAVFYIGDEGGA-EPVDHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACR

Query:  VVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAAEGSKSEDDDEE
        VVENQ LL+SM+ LTPKLK+EV+KLLNSKHLFFREMCAYHNSC H  G +    P  +P +   PS     QQQ CFHA E    A  A           
Subjt:  VVENQNLLDSME-LTPKLKEEVRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAAEGSKSEDDDEE

Query:  EEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKF
          E  E EEE E +  E S    EE EE E+RK+ R   ++ A+++L  E   VV+D G+S WEKK+WI+ +++++EE+++ Y+ +  E+EKQR+KW+++
Subjt:  EEEESEEEEEEEEEEIEGSSRGNEEEEEMESRKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKF

Query:  RNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG
        R+KKER+ME+AKL+N++RRLE ERM+L++++ E+EL +L          SS  R DPSS  G
Subjt:  RNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHHYQQQQHQHSSNRRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTGGTAGTTTAGGCGGCGGCGGAGGCGGCGGCGGCGGTAATGGTGGTGGAGGGCTGTTTTCCGGGATGAATTCTGCAATGTTGGGATTGGAATTACCTCTTCA
TCAGAACAACCCCACAAATCCCCACCAATTACACCACCCCCCAATGGCGCCTTATGAGCATCACCACCACCACCACCCACAACCACCGCCGGCGGCCGTGAAGTACCCTT
ATCCGGCGAAGCCGAAGCCGCAGCAGTTGAATCTCAGCGACGAGGAGGAGGCGGGGTTCGCGGCGGACGACAGCAACGGCGACGCGAAGAAGAAAATCTCGCCGTGGCAG
AGGATGAAGTGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGAGACGAGGGCGGGGCGGAGCCGGTGGATCACGCCGGGAAGAAGAAGCCGGT
GGGGCTGCTGCAGAAGAAGGGGAAATGGAAATCGGTGTCCCGCGCGATGATGGAGAAAGGATTCTACGTTTCGCCGCAGCAATGCGAAGACAAATTCAACGATTTAAACA
AACGATACAAAAGAGTGAACGACATACTCGGAAAAGGAACCGCGTGCAGAGTCGTCGAGAATCAAAACCTCTTGGATTCCATGGAATTAACACCCAAACTGAAGGAAGAA
GTCCGAAAATTACTCAATTCCAAGCACCTGTTCTTCAGGGAAATGTGCGCTTACCACAACAGCTGCCGTCACGGCACCGGCAGCAACAGCGGCGGCGGTCCCCACCCCTC
GCCGGACACGGCGGCGGAACCATCTCACATTCAACAACAACAACAACAGCGGTGTTTCCACGCGACGGAGAACGCCACCGCCGCCGCCGCCGCCGAAGGGTCCAAAAGCG
AGGACGACGACGAGGAGGAAGAGGAAGAATCGGAGGAAGAAGAGGAGGAGGAGGAGGAGGAGATTGAGGGAAGTTCGCGAGGGAATGAAGAGGAAGAGGAAATGGAGTCG
AGGAAGCGGGCGAGGAAGGGGGGAATGACGGCGGCGATGCAGCAACTAAGCGCGGAGGTGATCGGAGTAGTACAGGACGGGGGGAGGAGCTCGTGGGAGAAGAAGCAGTG
GATCAAGAGCCGGCTGATTCAACTGGAAGAGCAGCAAGTGAGGTACCAATCGCAGGCGTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCAGGAACAAGAAGG
AAAGGGACATGGAAAGAGCGAAGCTGGAGAACGAGAAGCGGCGGCTCGAGAACGAGCGGATGGTTCTTGTGGTGAAGCAGAAGGAGCTCGAATTGATGGATCTCCACCAT
TATCAGCAGCAGCAGCACCAACATTCCTCCAATAGACGAGGGGATCCATCGTCGATTACAGGC
mRNA sequenceShow/hide mRNA sequence
ATGGAACCTGGTAGTTTAGGCGGCGGCGGAGGCGGCGGCGGCGGTAATGGTGGTGGAGGGCTGTTTTCCGGGATGAATTCTGCAATGTTGGGATTGGAATTACCTCTTCA
TCAGAACAACCCCACAAATCCCCACCAATTACACCACCCCCCAATGGCGCCTTATGAGCATCACCACCACCACCACCCACAACCACCGCCGGCGGCCGTGAAGTACCCTT
ATCCGGCGAAGCCGAAGCCGCAGCAGTTGAATCTCAGCGACGAGGAGGAGGCGGGGTTCGCGGCGGACGACAGCAACGGCGACGCGAAGAAGAAAATCTCGCCGTGGCAG
AGGATGAAGTGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGAGACGAGGGCGGGGCGGAGCCGGTGGATCACGCCGGGAAGAAGAAGCCGGT
GGGGCTGCTGCAGAAGAAGGGGAAATGGAAATCGGTGTCCCGCGCGATGATGGAGAAAGGATTCTACGTTTCGCCGCAGCAATGCGAAGACAAATTCAACGATTTAAACA
AACGATACAAAAGAGTGAACGACATACTCGGAAAAGGAACCGCGTGCAGAGTCGTCGAGAATCAAAACCTCTTGGATTCCATGGAATTAACACCCAAACTGAAGGAAGAA
GTCCGAAAATTACTCAATTCCAAGCACCTGTTCTTCAGGGAAATGTGCGCTTACCACAACAGCTGCCGTCACGGCACCGGCAGCAACAGCGGCGGCGGTCCCCACCCCTC
GCCGGACACGGCGGCGGAACCATCTCACATTCAACAACAACAACAACAGCGGTGTTTCCACGCGACGGAGAACGCCACCGCCGCCGCCGCCGCCGAAGGGTCCAAAAGCG
AGGACGACGACGAGGAGGAAGAGGAAGAATCGGAGGAAGAAGAGGAGGAGGAGGAGGAGGAGATTGAGGGAAGTTCGCGAGGGAATGAAGAGGAAGAGGAAATGGAGTCG
AGGAAGCGGGCGAGGAAGGGGGGAATGACGGCGGCGATGCAGCAACTAAGCGCGGAGGTGATCGGAGTAGTACAGGACGGGGGGAGGAGCTCGTGGGAGAAGAAGCAGTG
GATCAAGAGCCGGCTGATTCAACTGGAAGAGCAGCAAGTGAGGTACCAATCGCAGGCGTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCAGGAACAAGAAGG
AAAGGGACATGGAAAGAGCGAAGCTGGAGAACGAGAAGCGGCGGCTCGAGAACGAGCGGATGGTTCTTGTGGTGAAGCAGAAGGAGCTCGAATTGATGGATCTCCACCAT
TATCAGCAGCAGCAGCACCAACATTCCTCCAATAGACGAGGGGATCCATCGTCGATTACAGGC
Protein sequenceShow/hide protein sequence
MEPGSLGGGGGGGGGNGGGGLFSGMNSAMLGLELPLHQNNPTNPHQLHHPPMAPYEHHHHHHPQPPPAAVKYPYPAKPKPQQLNLSDEEEAGFAADDSNGDAKKKISPWQ
RMKWTDMMVRLLITAVFYIGDEGGAEPVDHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQNLLDSMELTPKLKEE
VRKLLNSKHLFFREMCAYHNSCRHGTGSNSGGGPHPSPDTAAEPSHIQQQQQQRCFHATENATAAAAAEGSKSEDDDEEEEEESEEEEEEEEEEIEGSSRGNEEEEEMES
RKRARKGGMTAAMQQLSAEVIGVVQDGGRSSWEKKQWIKSRLIQLEEQQVRYQSQAFELEKQRLKWLKFRNKKERDMERAKLENEKRRLENERMVLVVKQKELELMDLHH
YQQQQHQHSSNRRGDPSSITG