CuGenDBv2

Csor.00g095980 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene ID: Csor.00g095980
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: sequence-specific DNA binding transcription factors
Genome location: Csor_Chr14:1150940..1152328
RNA-Seq Expression: Csor.00g095980
Synteny: Csor.00g095980
Gene Ontology terms:
GO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domains: IPR044822 - Myb/SANT-like DNA-binding domain 4
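
The genome location field in the record above can be split into its parts with a few lines of Python. This is only a minimal sketch: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which the record itself does not state.

```python
import re

# Hypothetical helper; the "chrom:start..end" layout is taken from the record above.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split 'Csor_Chr14:1150940..1152328' into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("Csor_Chr14:1150940..1152328")
print(chrom, span_length(start, end))  # Csor_Chr14 1389
```

Under that assumption the gene spans 1389 bp on Csor_Chr14.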


Homology
GenBank top hits (e value, % identity, alignment)
XP_022934686.1 transcription initiation factor TFIID subunit 7-like [Cucurbita moschata] | e value: 0.0 | % identity: 100
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
        ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
Subjt:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR

Query:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_022982944.1 ribosome quality control complex subunit 2-like [Cucurbita maxima] | e value: 6.24e-307 | % identity: 96.12
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQ PSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNS GGGANT PE G EPSHLPQQQQQQRCFHATETAP ATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEE--IDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF
        E+EEDDEE EEEE  ++GSS+VQEEEEETESKKRGRKEGFT+GIQQM AEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQV+YQSQGLEIEKQRLKWLKF
Subjt:  ETEEDDEEVEEEE--IDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF

Query:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQ  HSSNRRGDPSSITG
Subjt:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_023527808.1 uncharacterized protein DDB_G0283697-like [Cucurbita pepo subsp. pepo] | e value: 1.73e-308 | % identity: 96.54
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSY PHEAHHQQQPPPAAVK PYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNS G GANTSPEAG EPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
        E+EE+DEE EEE+I+GSS+VQEEEEETESKKRGRKEGFT+GIQQM AEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQV+YQSQGLEIEKQRLKWLKFRR
Subjt:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR

Query:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQ  HSSNRRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_023539309.1 ESF1 homolog [Cucurbita pepo subsp. pepo] | e value: 3.23e-231 | % identity: 76.74
Query:  MEPNSLGG----GGGASGGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDS
        ME +SLGG    GGG  GGMF G++S+MLGL+LPLH +P+NP   HQLHHP MVSYVP +   QQQPPP AV+ PYPAKPKPQQSN+SD+EEQG A +D 
Subjt:  MEPNSLGG----GGGASGGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDS

Query:  NSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        NSD KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E  DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEP-SHLPQQ-QQQQRCFHATETAPAATAGEGSKSGD
        CRVVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH  GN+ GGGA+ SP+   EP SHL Q  QQQQRCFHATETA  ATA      GD
Subjt:  CRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEP-SHLPQQ-QQQQRCFHATETAPAATAGEGSKSGD

Query:  EEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFT-SGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIE
        +E+ED DE+DE+EED+++ EEEEI+G+S+  EEE+ETES+KR RK G   + +QQ+ AEV+GV+QDGGRSPWEKKQWMK RLIQLEEQQV YQSQ  E+E
Subjt:  EEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFT-SGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIE

Query:  KQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        KQRLKWLKFR KKERDMERAKLENEKRRLE ERM+LMVKQKELD  D+H+YQQQ     HSSN+RGDPSSITG
Subjt:  KQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida] | e value: 2.63e-237 | % identity: 81.11
Query:  ISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDSNSDAKKKISPWQRMKWTDMMVRL
        ++SSMLGLELPLHQNP+NP   HQLHHPP+VSYV H+ HH QQPPP ++K PYP KPKPQQSN+SD+EEQG A DDSN D KKKISPWQRMKWTDMMVRL
Subjt:  ISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDSNSDAKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHTLLESMELTPKVKEEV
        LITAVFYIGDEGGSEP DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVEN TLL+SMELTPK KEEV
Subjt:  LITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHTLLESMELTPKVKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATET---APAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEE
        RKLLNSKHLFFREMCAYHNTCRHG         + SP+   EPSHLPQQQQQQRCFHATET   A A  AGE SKSGDEEEE+E EDDE+EE++EE E+E
Subjt:  RKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATET---APAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEE

Query:  EIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLE
        EI+G    QEEEEETES+KR RK G T+G+QQ+ AEVMGVVQDGGRSPWEKKQWMK RLIQLEEQQV YQ+Q  E+EKQRLKW+KFR KKERDMERAKLE
Subjt:  EIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLE

Query:  NEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        NEKRRLENERMMLMVKQKELDL  +H+YQQQQQQ  HSSN+RGDPSSITG
Subjt:  NEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LDU6 Uncharacterized protein | e value: 8.67e-232 | % identity: 75.62
Query:  MEPNSLGGGGGAS--------------GGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDE
        ME NSLGGGGG                GGMF G++SSMLGLELPLHQNP+NP   HQLHHPPMVSYV H+ HH QQPP  +VK P+P K KPQQSN+SD+
Subjt:  MEPNSLGGGGGAS--------------GGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDE

Query:  EEQGLA-DDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQG A DDSN D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP+DH  KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGLA-DDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAAT
        VNDILGKGTAC+VVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH          + SP+A TEPSHLPQQQQQQ+ CFHAT+T  +A+
Subjt:  VNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAAT

Query:  --AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGF-TSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ
          AGEGSKSGDEEEE+E+E    EE +EE E+EE +G    QEEEEETES+KR RK G  T+G+QQ+ AEVMGV+ DGGRSPWEKKQWMK RLIQLEEQQ
Subjt:  --AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGF-TSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ

Query:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        V +Q+Q  E+EKQRLKW+KFR KKERDMERAKLENEKR LENERMMLMVK+ ELDL  + +YQQQQQQ  HSSN+RGDPSSITG
Subjt:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A1S3B6J6 ESF1 homolog | e value: 4.57e-229 | % identity: 74.79
Query:  MEPNSLGGGG-------------GASGG--MFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSD
        ME NSLGGGG             GA+GG  MF G++SSMLGLELPLHQNP+NP   HQLHHPPMVSYV H+ HH QQPP  +VK P+P K KPQQSN+SD
Subjt:  MEPNSLGGGG-------------GASGG--MFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSD

Query:  EEEQGLA-DDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        +EEQG A DDSN D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP+DH  KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  EEEQGLA-DDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAA
        RVNDILGKGTAC+VVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH          + SP+A TEPSHLPQQQQQQ+ CFHAT+T  +A
Subjt:  RVNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQR-CFHATETAPAA

Query:  T--AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ
        +  AGE SKSGDEE+E+E+ED    E +EE E+EE +G    QEEEEETES+KR RK G T+G+QQ+ AEVMGV+ DGGRSPWEKKQWMK RLIQLEEQ+
Subjt:  T--AGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQ

Query:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        V +Q+Q  E+EKQRLKW+KFR KKERDMERAKLENEKR LENERMML+VK+ ELDL  + +YQQQQQ   HSSN+RGDPSSITG
Subjt:  VEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A6J1F055 uncharacterized protein LOC111441130 | e value: 4.91e-229 | % identity: 75.68
Query:  MEPNSLG----GGGGASGGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDS
        ME +SLG    GGGG  GGMF G++S+MLGL+LPLH +P+NP   HQLHH  MVSY P +   QQQPPP AV+ PYPAKPKPQQSN+SD+EEQG A +D 
Subjt:  MEPNSLG----GGGGASGGMFPGISSSMLGLELPLHQNPSNP---HQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLA-DDS

Query:  NSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        NSD KKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E  DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEP-SHLPQ-----QQQQQRCFHATETAPAATAGEGS
        CRVVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH  GN+ GGGA+ SP+   EP SHL Q     QQQQQRCFHATE+A AA A  G+
Subjt:  CRVVENHTLLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEP-SHLPQ-----QQQQQRCFHATETAPAATAGEGS

Query:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFT-SGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQG
           DE+E+DE+E+ E +EDDEE   EEI+G+S+  EEE+ETES+KR RK G   + +QQ+ AEV+GV+QDGGRSPWEKKQWMK RLIQLEEQQV YQSQ 
Subjt:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFT-SGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQG

Query:  LEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
         E+EKQRLKWLKFR KKERDMERAKLENEKRRLE ERM+LMVKQKELD  D+H+YQQQQ    HSSN+RGDPSSITG
Subjt:  LEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A6J1F2H4 transcription initiation factor TFIID subunit 7-like | e value: 0.0 | % identity: 100
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
        ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR
Subjt:  ETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRR

Query:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

A0A6J1J5X4 ribosome quality control complex subunit 2-like | e value: 3.02e-307 | % identity: 96.12
Query:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
        MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQ PSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI
Subjt:  MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
        SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHT

Query:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD
        LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNS GGGANT PE G EPSHLPQQQQQQRCFHATETAP ATAGEGSKSGDEEEEDEDEDD
Subjt:  LLESMELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDD

Query:  ETEEDDEEVEEEE--IDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF
        E+EEDDEE EEEE  ++GSS+VQEEEEETESKKRGRKEGFT+GIQQM AEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQV+YQSQGLEIEKQRLKWLKF
Subjt:  ETEEDDEEVEEEE--IDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKF

Query:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQ  HSSNRRGDPSSITG
Subjt:  RRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G21200.1 sequence-specific DNA binding transcription factors | e value: 2.5e-54 | % identity: 37.34
Query:  GGGASGGMFPGISSSMLGLELP-----LHQNPSNPHQLHHP---PMVSYVPHEA-------HHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADD--
        G    GG+    +SS  G +L       HQ+  N    H+P   P+   +P          HHQ Q     +      K + ++++VSD++E    ++  
Subjt:  GGGASGGMFPGISSSMLGLELP-----LHQNPSNPHQLHHP---PMVSYVPHEA-------HHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADD--

Query:  ----SNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
            + ++   K SPWQR+KWTD MV+LLITAV YIGD+     +D +S++K   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+L
Subjt:  ----SNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENHTLLESM-ELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFH-ATETAPAATAGEG
        G+GT+C+VVEN  LL+S+  L  K K++VRK+++SKHLF+ EMC+YHN      GN                 HLP     QR    A  +       + 
Subjt:  GKGTACRVVENHTLLESM-ELTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFH-ATETAPAATAGEG

Query:  SKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQ----------------EEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDG---GRSPWEKKQW
         K   E+ +DED D + +E DE  E+    G  +V                   E+          E     + Q+      V Q G   GR+   +KQW
Subjt:  SKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQ----------------EEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDG---GRSPWEKKQW

Query:  MKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL
        M+ R +QLEEQ+++ Q + LE+EKQR +W +F +K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  MKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1) | e value: 2.4e-49 | % identity: 36.87
Query:  QSNVSDEEEQGLADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDL
        + ++S+++E  L      +  K+ SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFNDL
Subjt:  QSNVSDEEEQGLADDSNSDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDL

Query:  NKRYKRVNDILGKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATE
        NKRYK++N++LG+GT+C VVEN +LL+ ++ L  K K+EVR++++SKHLF+ EMC+YHN      GN                 HLP     QR  H   
Subjt:  NKRYKRVNDILGKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATE

Query:  TAPAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEV-MGVVQDGGRSPWEKKQWMKRRLIQL
        T  +    +  + G  + ED D+DD+ EED +    +      +  +  E+     +G        + +  A+V  G+  D  ++   ++Q ++ + ++L
Subjt:  TAPAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEV-MGVVQDGGRSPWEKKQWMKRRLIQL

Query:  EEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL
        E ++++ Q++ +E+E+Q+ KW  F +++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  EEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors | e value: 5.9e-96 | % identity: 49.37
Query:  SGGMFPGISSSMLGLELPLHQNPSNPH---QLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVS----DEEEQGLA--------DDSNSDAK
        S  MF G S  ML LE+P  QNP NP    Q  HP   +     +  QQ  PP     PY +KPK Q S +S    D+E++G          D + +D K
Subjt:  SGGMFPGISSSMLGLELPLHQNPSNPH---QLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVS----DEEEQGLA--------DDSNSDAK

Query:  KKISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPMDHASKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
        +K+S W RMKWTD MVRLLI AVFYIGDE G ++P+D  +KKK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDIL
Subjt:  KKISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPMDHASKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGS
        GKG ACRVVEN  LLESM+ LTPK+K+EV+KLLNSKHLFFREMCAYHN+C H      GG     P+    P  +P   QQQ CFHA E    A   E  
Subjt:  GKGTACRVVENHTLLESME-LTPKVKEEVRKLLNSKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGS

Query:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGL
             E E+E E D  E+ + E+EE           EEEET  K+R      ++ ++++  E   VV+D G+S WEKK+W++R+++++EE+++ Y+ +G+
Subjt:  KSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESKKRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGL

Query:  EIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
        E+EKQR+KW+++R KKER+ME+AKL+N++RRLE ERM+LM+++ E++L +L            SS  R DPSS  G
Subjt:  EIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNYQQQQQQQQHSSNRRGDPSSITG
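
The % identity values listed for the hits above can be approximated from the alignment blocks themselves. The sketch below assumes the usual BLASTP midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that % identity is identities divided by alignment length including gap columns. The function names are hypothetical, and the result may differ slightly from the database's figure if the displayed alignment was truncated or rounded.

```python
def midline_stats(query, midline):
    """Count identities and positives over one aligned query/midline pair."""
    # Pad the midline: trailing spaces are often stripped from saved text.
    midline = midline.ljust(len(query))
    # A midline letter equal to the query residue marks an identity;
    # gap columns ('-' in the query, ' ' in the midline) never match.
    identities = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent_identity(segments):
    """segments: iterable of (query, midline) string pairs for one hit."""
    ident = total = 0
    for query, midline in segments:
        i, _, n = midline_stats(query, midline)
        ident += i
        total += n
    return 100.0 * ident / total


# Toy example, not taken from the record: 4 identities over 6 columns.
print(round(percent_identity([("MEPNSL", "ME+ SL")]), 2))  # 66.67
```

To apply it to a real hit, pass the text after `Query:` and the unlabeled line beneath it for every segment of that hit, with the leading label columns removed.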


Sequences
CDS sequence
ATGGAACCCAATAGTTTAGGCGGCGGCGGCGGTGCTAGTGGAGGGATGTTTCCCGGCATAAGTTCGTCAATGCTGGGATTGGAATTGCCACTTCATCAAAACCCCTCAAA
TCCTCACCAATTACACCACCCCCCAATGGTGTCATATGTCCCACACGAGGCCCACCACCAACAACAACCGCCGCCGGCCGCAGTGAAATGCCCATACCCGGCGAAGCCTA
AGCCGCAGCAGTCGAATGTCAGCGACGAGGAGGAGCAGGGATTAGCGGACGACAGCAACAGCGACGCCAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGAC
ATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGACGAAGGTGGGTCGGAGCCGATGGACCATGCCAGCAAGAAAAAACCAGTGGGGCTTCTGCAAAAGAA
GGGGAAATGGAAATCCGTATCCAGAGCAATGATGGAAAAAGGATTCTACGTTTCACCGCAGCAATGCGAAGACAAATTTAACGATTTAAATAAAAGATACAAACGAGTTA
ACGACATTTTGGGGAAGGGCACCGCCTGCAGAGTCGTCGAGAATCACACCTTATTGGAATCAATGGAATTAACACCGAAAGTGAAAGAAGAAGTCCGAAAATTACTCAAT
TCTAAACATCTTTTCTTCAGAGAAATGTGCGCTTACCACAACACTTGCCGTCATGGCGCCGGCAACAGCAGCGGCGGCGGTGCCAATACCTCACCAGAGGCGGGGACAGA
ACCATCCCACCTTCCACAACAACAACAACAGCAACGATGCTTCCACGCGACGGAGACCGCTCCAGCCGCCACGGCCGGCGAGGGTTCGAAAAGCGGAGATGAGGAAGAGG
AAGACGAGGATGAGGATGATGAAACGGAAGAAGACGATGAGGAGGTGGAGGAGGAGGAGATTGATGGAAGTTCGAAAGTACAGGAAGAGGAGGAGGAAACGGAATCGAAG
AAGAGGGGGAGAAAAGAAGGATTCACATCGGGGATTCAGCAGATGTGTGCGGAGGTGATGGGAGTTGTGCAGGACGGGGGGAGGAGCCCGTGGGAGAAAAAGCAATGGAT
GAAGAGGCGATTGATTCAGCTAGAAGAGCAGCAAGTGGAATACCAATCGCAGGGCTTAGAGATTGAGAAACAGAGACTGAAATGGCTGAAGTTTAGGAGGAAGAAGGAGA
GGGATATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGAAGGCTGGAAAACGAGAGGATGATGCTGATGGTGAAGCAGAAGGAACTCGATTTGACGGATCTGCACAATTAT
CAGCAGCAGCAGCAGCAGCAGCAGCATTCCTCGAACAGGCGAGGTGATCCATCTTCGATCACAGGTTGA
mRNA sequence
ATGGAACCCAATAGTTTAGGCGGCGGCGGCGGTGCTAGTGGAGGGATGTTTCCCGGCATAAGTTCGTCAATGCTGGGATTGGAATTGCCACTTCATCAAAACCCCTCAAA
TCCTCACCAATTACACCACCCCCCAATGGTGTCATATGTCCCACACGAGGCCCACCACCAACAACAACCGCCGCCGGCCGCAGTGAAATGCCCATACCCGGCGAAGCCTA
AGCCGCAGCAGTCGAATGTCAGCGACGAGGAGGAGCAGGGATTAGCGGACGACAGCAACAGCGACGCCAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGAC
ATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGACGAAGGTGGGTCGGAGCCGATGGACCATGCCAGCAAGAAAAAACCAGTGGGGCTTCTGCAAAAGAA
GGGGAAATGGAAATCCGTATCCAGAGCAATGATGGAAAAAGGATTCTACGTTTCACCGCAGCAATGCGAAGACAAATTTAACGATTTAAATAAAAGATACAAACGAGTTA
ACGACATTTTGGGGAAGGGCACCGCCTGCAGAGTCGTCGAGAATCACACCTTATTGGAATCAATGGAATTAACACCGAAAGTGAAAGAAGAAGTCCGAAAATTACTCAAT
TCTAAACATCTTTTCTTCAGAGAAATGTGCGCTTACCACAACACTTGCCGTCATGGCGCCGGCAACAGCAGCGGCGGCGGTGCCAATACCTCACCAGAGGCGGGGACAGA
ACCATCCCACCTTCCACAACAACAACAACAGCAACGATGCTTCCACGCGACGGAGACCGCTCCAGCCGCCACGGCCGGCGAGGGTTCGAAAAGCGGAGATGAGGAAGAGG
AAGACGAGGATGAGGATGATGAAACGGAAGAAGACGATGAGGAGGTGGAGGAGGAGGAGATTGATGGAAGTTCGAAAGTACAGGAAGAGGAGGAGGAAACGGAATCGAAG
AAGAGGGGGAGAAAAGAAGGATTCACATCGGGGATTCAGCAGATGTGTGCGGAGGTGATGGGAGTTGTGCAGGACGGGGGGAGGAGCCCGTGGGAGAAAAAGCAATGGAT
GAAGAGGCGATTGATTCAGCTAGAAGAGCAGCAAGTGGAATACCAATCGCAGGGCTTAGAGATTGAGAAACAGAGACTGAAATGGCTGAAGTTTAGGAGGAAGAAGGAGA
GGGATATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGAAGGCTGGAAAACGAGAGGATGATGCTGATGGTGAAGCAGAAGGAACTCGATTTGACGGATCTGCACAATTAT
CAGCAGCAGCAGCAGCAGCAGCAGCATTCCTCGAACAGGCGAGGTGATCCATCTTCGATCACAGGTTGA
Protein sequence
MEPNSLGGGGGASGGMFPGISSSMLGLELPLHQNPSNPHQLHHPPMVSYVPHEAHHQQQPPPAAVKCPYPAKPKPQQSNVSDEEEQGLADDSNSDAKKKISPWQRMKWTD
MMVRLLITAVFYIGDEGGSEPMDHASKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENHTLLESMELTPKVKEEVRKLLN
SKHLFFREMCAYHNTCRHGAGNSSGGGANTSPEAGTEPSHLPQQQQQQRCFHATETAPAATAGEGSKSGDEEEEDEDEDDETEEDDEEVEEEEIDGSSKVQEEEEETESK
KRGRKEGFTSGIQQMCAEVMGVVQDGGRSPWEKKQWMKRRLIQLEEQQVEYQSQGLEIEKQRLKWLKFRRKKERDMERAKLENEKRRLENERMMLMVKQKELDLTDLHNY
QQQQQQQQHSSNRRGDPSSITG
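
One way to check that the protein sequence above corresponds to the CDS is to translate the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: `translate` is a hypothetical helper, it assumes the standard (nuclear) code applies, and it raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS string; line breaks and whitespace are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    # Drop a single trailing stop codon so the result is comparable
    # to a protein sequence listed without '*'.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First seven codons of the CDS shown above.
print(translate("ATGGAACCCAATAGTTTAGGC"))  # MEPNSLG
```

Pasting the full CDS block into `translate` and comparing the result with the protein block (after joining its lines) would confirm whether the two agree.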