; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G002500 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G002500
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionsequence-specific DNA binding transcription factors
Genome locationCmo_Chr14:1148283..1149671
RNA-Seq ExpressionCmoCh14G002500
SyntenyCmoCh14G002500
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138003.2 uncharacterized protein DDB_G0290301 [Cucumis sativus]3.0e-18275.62Show/hide
Query:  MEPNSLGGGGG--------------ASGGMFPGISSSMLGLELPLHQ---NPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDE
        ME NSLGGGGG                GGMF G++SSMLGLELPLHQ   NP+NPHQLHHPPMVSYV H+ HH QQPP  +VK P+P K KPQQSN+SD+
Subjt:  MEPNSLGGGGG--------------ASGGMFPGISSSMLGLELPLHQ---NPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDE

Query:  EEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQG  ADDSN D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP+DH  KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAAT
        VNDILGKGTAC+VVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH          + SP+A TEPSHLPQQQQQQ+ CFHAT+T  +A+
Subjt:  VNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAAT

Query:  --AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEG-FTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ
          AGEGSKSGDEEEE+E+E    EE +EE E+EE +G    QEEEEETES+KR RK G  T+G+QQ+ AEVMGV+ DGGRSPWEKKQWMK RLIQLEEQQ
Subjt:  --AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEG-FTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ

Query:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        V +Q+Q  E+EKQRLKW+KFR KKERDMERAKLENEKR LENERMMLMVK+ ELDL  + +Y  QQQQQQHSSN+RGDPSSITG
Subjt:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_022934686.1 transcription initiation factor TFIID subunit 7-like [Cucurbita moschata]3.9e-251100Show/hide
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
        ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
Subjt:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR

Query:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_022982944.1 ribosome quality control complex subunit 2-like [Cucurbita maxima]1.6e-23996.12Show/hide
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQ PSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNS GGGANT PE G EPSHLPQQQQQQRCFHATETAP ATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDD--EEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF
        E+EEDD  EE EEEE++GSS+VQEEEEETESKKRGRKEGFT+GIQQM AEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQV+YQSQGLEIEKQRLKWLKF
Subjt:  ETEEDD--EEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF

Query:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNY  QQQQQQHSSNRRGDPSSITG
Subjt:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_023527808.1 uncharacterized protein DDB_G0283697-like [Cucurbita pepo subsp. pepo]1.1e-24096.54Show/hide
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSY PHEAHHQQQPPPAAVK PYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNS G GANTSPEAG EPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
        E+EE+DEE EEE+I+GSS+VQEEEEETESKKRGRKEGFT+GIQQM AEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQV+YQSQGLEIEKQRLKWLKFRR
Subjt:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR

Query:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNY  QQQQQQHSSNRRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]1.5e-18681.11Show/hide
Query:  ISSSMLGLELPLHQNPS---NPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRL
        ++SSMLGLELPLHQNP+   NPHQLHHPP+VSYV H+ HH QQPPP ++K PYP KPKPQQSN+SD+EEQG  ADDSN D KKKISPWQRMKWTDMMVRL
Subjt:  ISSSMLGLELPLHQNPS---NPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHTLLESMELTPKVKEEV
        LITAVFYIGDEGGSEP DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVEN TLL+SMELTPK KEEV
Subjt:  LITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHTLLESMELTPKVKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATET---APAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEE
        RKLLNSKHLFFREMCAYHNTCRHG         + SP+   EPSHLPQQQQQQRCFHATET   A A  AGE SKSGDEEEE E+EDDE+EE++EE E+E
Subjt:  RKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATET---APAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEE

Query:  EIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLE
        EI+G    QEEEEETES+KR RK G T+G+QQ+ AEVMGVVQDGGRSPWEKKQWMK RLIQLEEQQV YQ+Q  E+EKQRLKW+KFR KKERDMERAKLE
Subjt:  EIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLE

Query:  NEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        NEKRRLENERMMLMVKQKELDL  +H+Y  QQQQQQHSSN+RGDPSSITG
Subjt:  NEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein1.4e-18275.62Show/hide
Query:  MEPNSLGGGGG--------------ASGGMFPGISSSMLGLELPLHQ---NPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDE
        ME NSLGGGGG                GGMF G++SSMLGLELPLHQ   NP+NPHQLHHPPMVSYV H+ HH QQPP  +VK P+P K KPQQSN+SD+
Subjt:  MEPNSLGGGGG--------------ASGGMFPGISSSMLGLELPLHQ---NPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDE

Query:  EEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQG  ADDSN D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP+DH  KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAAT
        VNDILGKGTAC+VVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH          + SP+A TEPSHLPQQQQQQ+ CFHAT+T  +A+
Subjt:  VNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAAT

Query:  --AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEG-FTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ
          AGEGSKSGDEEEE+E+E    EE +EE E+EE +G    QEEEEETES+KR RK G  T+G+QQ+ AEVMGV+ DGGRSPWEKKQWMK RLIQLEEQQ
Subjt:  --AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEG-FTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ

Query:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        V +Q+Q  E+EKQRLKW+KFR KKERDMERAKLENEKR LENERMMLMVK+ ELDL  + +Y  QQQQQQHSSN+RGDPSSITG
Subjt:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A1S3B6J6 ESF1 homolog1.8e-18074.38Show/hide
Query:  MEPNSLGGGG---------------GASGGMFPGISSSMLGLELPLHQ---NPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSD
        ME NSLGGGG                  GGMF G++SSMLGLELPLHQ   NP+NPHQLHHPPMVSYV H+ HH QQPP  +VK P+P K KPQQSN+SD
Subjt:  MEPNSLGGGG---------------GASGGMFPGISSSMLGLELPLHQ---NPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSD

Query:  EEEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        +EEQG  ADDSN D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP+DH  KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  EEEQGL-ADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAA
        RVNDILGKGTAC+VVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH          + SP+A TEPSHLPQQQQQQ+ CFHAT+T  +A
Subjt:  RVNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAA

Query:  T--AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ
        +  AGE SKSGDEE+E+E+ED    E +EE E+EE +G    QEEEEETES+KR RK G T+G+QQ+ AEVMGV+ DGGRSPWEKKQWMK RLIQLEEQ+
Subjt:  T--AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ

Query:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        V +Q+Q  E+EKQRLKW+KFR KKERDMERAKLENEKR LENERMML+VK+ ELDL  + +Y   QQQQQHSSN+RGDPSSITG
Subjt:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411301.3e-18075.68Show/hide
Query:  MEPNSL----GGGGGASGGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDS
        ME +SL    GGGGG  GGMF G++S+MLGL+LPLH +P+NP   HQLHH  MVSY P +   QQQPPP AV+ PYPAKPKPQQSN+SD+EEQG A +D 
Subjt:  MEPNSL----GGGGGASGGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDS

Query:  NSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        NSD KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E  DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTE-PSHL-----PQQQQQQRCFHATETAPAATAGEGS
        CRVVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH  GN+ GGGA+ SP+   E PSHL      QQQQQQRCFHATE+A AA A  G+
Subjt:  CRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTE-PSHL-----PQQQQQQRCFHATETAPAATAGEGS

Query:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGF-TSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQG
           DE+E+DE+E+ E +EDD   EEEEI+G+S+  EEE+ETES+KR RK G   + +QQ+ AEV+GV+QDGGRSPWEKKQWMK RLIQLEEQQV YQSQ 
Subjt:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGF-TSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQG

Query:  LEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
         E+EKQRLKWLKFR KKERDMERAKLENEKRRLE ERM+LMVKQKELD  D+H+Y    QQQQHSSN+RGDPSSITG
Subjt:  LEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A6J1F2H4 transcription initiation factor TFIID subunit 7-like1.9e-251100Show/hide
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
        ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
Subjt:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR

Query:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A6J1J5X4 ribosome quality control complex subunit 2-like7.5e-24096.12Show/hide
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQ PSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNS GGGANT PE G EPSHLPQQQQQQRCFHATETAP ATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDD--EEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF
        E+EEDD  EE EEEE++GSS+VQEEEEETESKKRGRKEGFT+GIQQM AEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQV+YQSQGLEIEKQRLKWLKF
Subjt:  ETEEDD--EEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF

Query:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNY  QQQQQQHSSNRRGDPSSITG
Subjt:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors2.5e-5437.34Show/hide
Query:  GGGASGGMFPGISSSMLGLELP-----LHQNPSNPHQLHHP---PMVSYVPHEA-------HHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADD--
        G    GG+    +SS  G +L       HQ+  N    H+P   P+   +P          HHQ Q     +      K + ++++VSD++E    ++  
Subjt:  GGGASGGMFPGISSSMLGLELP-----LHQNPSNPHQLHHP---PMVSYVPHEA-------HHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADD--

Query:  ----SNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
            + ++   K SPWQR+KWTD MV+LLITAV YIGD+     +D +S++K   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+L
Subjt:  ----SNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENHTLLESM-ELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFH-ATETAPAATAGEG
        G+GT+C+VVEN  LL+S+  L  K K++VRK+++SKHLF+ EMC+YHN      GN                 HLP     QR    A  +       + 
Subjt:  GKGTACRVVENHTLLESM-ELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFH-ATETAPAATAGEG

Query:  SKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQ----------------EEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDG---GRSPWEKKQW
         K   E+ +DED D + +E DE  E+    G  +V                   E+          E     + Q+      V Q G   GR+   +KQW
Subjt:  SKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQ----------------EEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDG---GRSPWEKKQW

Query:  MKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL
        M+ R +QLEEQ+++ Q + LE+EKQR +W +F +K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  MKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)2.4e-4936.87Show/hide
Query:  QSNVSDEEEQGLADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDL
        + ++S+++E  L      +  K+ SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFNDL
Subjt:  QSNVSDEEEQGLADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDL

Query:  NKRYKRVNDILGKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATE
        NKRYK++N++LG+GT+C VVEN +LL+ ++ L  K K+EVR++++SKHLF+ EMC+YHN      GN                 HLP     QR  H   
Subjt:  NKRYKRVNDILGKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATE

Query:  TAPAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEV-MGVVQDGGRSPWEKKQWMKRRLIQL
        T  +    +  + G  + ED D+DD+ EED +    +      +  +  E+     +G        + +  A+V  G+  D  ++   ++Q ++ + ++L
Subjt:  TAPAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEV-MGVVQDGGRSPWEKKQWMKRRLIQL

Query:  EEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL
        E ++++ Q++ +E+E+Q+ KW  F +++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  EEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors5.9e-9649.37Show/hide
Query:  SGGMFPGISSSMLGLELPLHQNPSNPH---QLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVS----DEEEQGLA--------DDSNSDAK
        S  MF G S  ML LE+P  QNP NP    Q  HP   +     +  QQ  PP     PY +KPK Q S +S    D+E++G          D + +D K
Subjt:  SGGMFPGISSSMLGLELPLHQNPSNPH---QLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVS----DEEEQGLA--------DDSNSDAK

Query:  KKISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPMDHASKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
        +K+S W RMKWTD MVRLLI AVFYIGDE G ++P+D  +KKK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDIL
Subjt:  KKISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPMDHASKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGS
        GKG ACRVVEN  LLESM+ LTPK+K+EV+KLLNSKHLFFREMCAYHN+C H      GG     P+    P  +P   QQQ CFHA E    A   E  
Subjt:  GKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGS

Query:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGL
             E E+E E D  E+ + E+EE           EEEET  K+R      ++ ++++  E   VV+D G+S WEKK+W++R+++++EE+++ Y+ +G+
Subjt:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGL

Query:  EIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        E+EKQR+KW+++R KKER+ME+AKL+N++RRLE ERM+LM+++ E++L +L            SS  R DPSS  G
Subjt:  EIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCCAATAGTTTAGGCGGCGGCGGCGGTGCTAGTGGAGGGATGTTTCCCGGCATAAGTTCGTCAATGCTGGGATTGGAATTGCCACTTCATCAAAACCCCTCAAA
TCCTCACCAATTACACCACCCCCCAATGGTGTCATATGTCCCACACGAGGCCCACCACCAACAACAACCGCCGCCGGCCGCAGTGAAATGCCCATACCCGGCGAAGCCTA
AGCCGCAGCAGTCGAATGTCAGCGACGAGGAGGAGCAGGGATTAGCGGACGACAGCAACAGCGACGCCAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGAC
ATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGACGAAGGTGGGTCGGAGCCGATGGACCATGCCAGCAAGAAAAAACCAGTGGGGCTTCTGCAAAAGAA
GGGGAAATGGAAATCCGTATCCAGAGCAATGATGGAAAAAGGATTCTACGTTTCACCGCAGCAATGCGAAGACAAATTTAACGATTTAAATAAAAGATACAAACGAGTTA
ACGACATTTTGGGGAAGGGCACCGCCTGCAGAGTCGTCGAGAATCACACCTTATTGGAATCAATGGAATTAACACCGAAAGTGAAAGAAGAAGTCCGAAAATTACTCAAT
TCTAAACATCTTTTCTTCAGAGAAATGTGCGCTTACCACAACACTTGCCGTCATGGCGCCGGCAACAGCAGCGGCGGCGGTGCCAATACCTCACCAGAGGCGGGGACAGA
ACCATCCCACCTTCCACAACAACAACAACAGCAACGATGCTTCCACGCGACGGAGACCGCTCCAGCCGCCACGGCCGGCGAGGGTTCGAAAAGCGGAGATGAGGAAGAGG
AAGACGAGGATGAGGATGATGAAACGGAAGAAGACGATGAGGAGGTGGAGGAGGAGGAGATTGATGGAAGTTCGAAAGTACAGGAAGAGGAGGAGGAAACGGAATCGAAG
AAGAGGGGGAGAAAAGAAGGATTCACATCGGGGATTCAGCAGATGTGTGCGGAGGTGATGGGAGTTGTGCAGGACGGGGGGAGGAGCCCGTGGGAGAAAAAGCAATGGAT
GAAGAGGCGATTGATTCAGCTAGAAGAGCAGCAAGTGGAATACCAATCGCAGGGCTTAGAGATTGAGAAACAGAGACTGAAATGGCTGAAGTTTAGGAGGAAGAAGGAGA
GGGATATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGAAGGCTGGAAAACGAGAGGATGATGCTGATGGTGAAGCAGAAGGAACTCGATTTGACGGATCTGCACAATTAT
CAGCAGCAGCAGCAGCAGCAGCAGCATTCCTCGAACAGGCGAGGTGATCCATCTTCGATCACAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACCCAATAGTTTAGGCGGCGGCGGCGGTGCTAGTGGAGGGATGTTTCCCGGCATAAGTTCGTCAATGCTGGGATTGGAATTGCCACTTCATCAAAACCCCTCAAA
TCCTCACCAATTACACCACCCCCCAATGGTGTCATATGTCCCACACGAGGCCCACCACCAACAACAACCGCCGCCGGCCGCAGTGAAATGCCCATACCCGGCGAAGCCTA
AGCCGCAGCAGTCGAATGTCAGCGACGAGGAGGAGCAGGGATTAGCGGACGACAGCAACAGCGACGCCAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGAC
ATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGACGAAGGTGGGTCGGAGCCGATGGACCATGCCAGCAAGAAAAAACCAGTGGGGCTTCTGCAAAAGAA
GGGGAAATGGAAATCCGTATCCAGAGCAATGATGGAAAAAGGATTCTACGTTTCACCGCAGCAATGCGAAGACAAATTTAACGATTTAAATAAAAGATACAAACGAGTTA
ACGACATTTTGGGGAAGGGCACCGCCTGCAGAGTCGTCGAGAATCACACCTTATTGGAATCAATGGAATTAACACCGAAAGTGAAAGAAGAAGTCCGAAAATTACTCAAT
TCTAAACATCTTTTCTTCAGAGAAATGTGCGCTTACCACAACACTTGCCGTCATGGCGCCGGCAACAGCAGCGGCGGCGGTGCCAATACCTCACCAGAGGCGGGGACAGA
ACCATCCCACCTTCCACAACAACAACAACAGCAACGATGCTTCCACGCGACGGAGACCGCTCCAGCCGCCACGGCCGGCGAGGGTTCGAAAAGCGGAGATGAGGAAGAGG
AAGACGAGGATGAGGATGATGAAACGGAAGAAGACGATGAGGAGGTGGAGGAGGAGGAGATTGATGGAAGTTCGAAAGTACAGGAAGAGGAGGAGGAAACGGAATCGAAG
AAGAGGGGGAGAAAAGAAGGATTCACATCGGGGATTCAGCAGATGTGTGCGGAGGTGATGGGAGTTGTGCAGGACGGGGGGAGGAGCCCGTGGGAGAAAAAGCAATGGAT
GAAGAGGCGATTGATTCAGCTAGAAGAGCAGCAAGTGGAATACCAATCGCAGGGCTTAGAGATTGAGAAACAGAGACTGAAATGGCTGAAGTTTAGGAGGAAGAAGGAGA
GGGATATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGAAGGCTGGAAAACGAGAGGATGATGCTGATGGTGAAGCAGAAGGAACTCGATTTGACGGATCTGCACAATTAT
CAGCAGCAGCAGCAGCAGCAGCAGCATTCCTCGAACAGGCGAGGTGATCCATCTTCGATCACAGGTTGA
Protein sequenceShow/hide protein sequence
MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKISPWQRMKWTD
MMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLN
SKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESK
KRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNY
QQQQQQQQHSSNRRGDPSSITG