; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022842 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022842
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSequence-specific DNA binding transcription factors isoform 2
Genome locationtig00000589:2319237..2320619
RNA-Seq ExpressionSgr022842
SyntenySgr022842
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596531.1 hypothetical protein SDJN03_09711, partial [Cucurbita argyrosperma subsp. sororia]1.9e-18177.49Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLHP+PTNP N  HQLHHP MVSYV  +       P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GGG   GAH SP+ AAE PSH+QQ Q  Q QQRCFHA  +  +A       A V   G+D++E++E+D+
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD

Query:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF
        SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+SPWEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWLKF
Subjt:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF

Query:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        RSKKERDMERAKLENEKRRLE ERM L+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

KAG7028064.1 hypothetical protein SDJN02_09244, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-18177.49Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLHP+PTNP N  HQLHHP MVSYV  +       P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GGG   GAH SP+ AAE PSH+QQ Q  Q QQRCFHA  +  +A       A V   G+D++E++E+D+
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD

Query:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF
        SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+SPWEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWLKF
Subjt:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF

Query:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        RSKKERDMERAKLENEKRRLE ERM L+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

XP_023005783.1 ESF1 homolog [Cucurbita maxima]2.5e-18177.59Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLHP+PTNP N  HQLHHP MVSYV  +       P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GGG   GAH SP+ AAE PSH+Q  QQQ Q QQRCFHA  +  +A       A V   G+DD+E++E+D
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD

Query:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL
        +SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   AA   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQV Y+SQAFELEKQRLKWL
Subjt:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KFRSKKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

XP_023539309.1 ESF1 homolog [Cucurbita pepo subsp. pepo]6.6e-18277.39Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLHP+PTNP N  HQLHHP MVSYV  +       P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GGG   GAH SP+ AAEP    QQ  Q QQRCFHA  +  +A       A V   G+D++E++E+D+SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        E+EDDE++EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+SPWEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWLKFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]1.2e-18378.48Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+S+MLGLELPLH NPTNP NP HQLHHP +VSYV H  D HHH      ++KY YP KPK QQSN+SD+EE GFAADDSNGD KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRHGT           H SP+ AAEPSH+ QQQ   QQRCFHA  +T SAA +     E SKSG+++EEEEEDD+SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        EEE +E+DEEIE      ++E+ET+SRKR RKGG TA    MQQLSAEV+GV+QDGG+SPWEKKQWMKSRLIQLEEQQV+Y++QAFELEKQRLKW+KFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKRRLENERM+L+VKQKELDL+ +HHYQQQQQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein6.9e-17776.09Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+S+MLGLELPLH NPTNP NP HQLHHP MVSYVQH  D HHH    + +VKY +P K K QQSNLSD+EE GFAADDSNGD KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH T           H SP+AA EPSH+ QQQ Q QQ CFHA  +T SA+       E SKSG+++EEEEE+++SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        EEE+DE+      T    ++E+ET+SRKR RKGG    T  MQQLSAEV+GVI DGG+SPWEKKQWMKSRLIQLEEQQV++++QAFELEKQRLKW+KFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKR LENERM+L+VK+ ELDL+ + HYQQQQQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog5.8e-17675.87Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+S+MLGLELPLH NPTNP NP HQLHHP MVSYVQH  D HHH    + +VKY +P K K QQSNLSD+EE GFAADDSNGD KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH T           H SP+AA EPSH+ QQQ Q QQ CFHA  +T SA+       E SKSG++++EEEE+D+SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        EEE+DE+      T    ++E+ET+SRKR RKGG TA    MQQLSAEV+GVI DGG+SPWEKKQWMKSRLIQLEEQ+V++++QAFELEKQRLKW+KFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKR LENERM+L+VK+ ELDL+ + HY QQQQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A5A7TKS6 ESF1-like protein5.8e-17675.87Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+S+MLGLELPLH NPTNP NP HQLHHP MVSYVQH  D HHH    + +VKY +P K K QQSNLSD+EE GFAADDSNGD KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH T           H SP+AA EPSH+ QQQ Q QQ CFHA  +T SA+       E SKSG++++EEEE+D+SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        EEE+DE+      T    ++E+ET+SRKR RKGG TA    MQQLSAEV+GVI DGG+SPWEKKQWMKSRLIQLEEQ+V++++QAFELEKQRLKW+KFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKR LENERM+L+VK+ ELDL+ + HY QQQQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411303.0e-18077.16Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        ++SAMLGL+LPLHP+PTNP N  HQLHH  MVSY   +       P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQQ---QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEED
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GGG   GAH SP+ AAE PSH+QQ   QQ Q QQRCFHA  S  +A       A V   G+D++E++E+
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQQ---QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEED

Query:  DDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL
        ++SEE+EDDE +EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+SPWEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWL
Subjt:  DDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KFRSKKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY  QQQQHSSNKRGDPSSITG
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog1.2e-18177.59Show/hide
Query:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLHP+PTNP N  HQLHHP MVSYV  +       P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GGG   GAH SP+ AAE PSH+Q  QQQ Q QQRCFHA  +  +A       A V   G+DD+E++E+D
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD

Query:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL
        +SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   AA   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQV Y+SQAFELEKQRLKWL
Subjt:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KFRSKKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.2e-5337.16Show/hide
Query:  NPTNPANPHHQ-LHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNG-----DAKKKISPWQRMKWTDMMVRLLITAV
        N  +  NP+ + LH     + V  ++  HH +      +  S   K +++++++SD++E  F  +  +G     +   K SPWQR+KWTD MV+LLITAV
Subjt:  NPTNPANPHHQ-LHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNG-----DAKKKISPWQRMKWTDMMVRLLITAV

Query:  FYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSM-ELTPKMKEEVRKLL
         YIGD+   +++     ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+C+VVEN +LLDS+  L  K K++VRK++
Subjt:  FYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSM-ELTPKMKEEVRKLL

Query:  NSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDED
        +SKHLF+ EMC+YHN  R               H P   A      Q+  QL  R      +  S  H + D  +    G+ DE +E ++      D   
Subjt:  NSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDED

Query:  DE--------EIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDG---GKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL
        +         +  R S  H+D D                   + Q+      V Q G   G++   +KQWM+SR +QLEEQ++  + +  ELEKQR +W 
Subjt:  DE--------EIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDG---GKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKEL
        +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)6.6e-4736.1Show/hide
Query:  NPANPHHQLHHP--QMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEG
        N  N + + HHP  +  S   +  D  H++                + + ++S+++E    + D    +K+  SPWQR+KW D MV+L+ITA+ YIG++ 
Subjt:  NPANPHHQLHHP--QMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEG

Query:  GSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFF
        GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFNDLNKRYK++N++LG+GT+C VVEN SLLD ++ L  K K+EVR++++SKHLF+
Subjt:  GSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFF

Query:  REMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDE--DDEEIE
         EMC+YHN  R               H P   A             QR  H   + GS     RDD +  + G+   E+ +DDD  EE+ D    D  ++
Subjt:  REMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDE--DDEEIE

Query:  RTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEV-VGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRSKKERDMERAKL
        R  +    ED       P KG       + Q   A+V  G+  D  K+   ++Q ++S+ ++LE +++  +++  ELE+Q+ KW  F  ++E+ + + ++
Subjt:  RTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEV-VGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRSKKERDMERAKL

Query:  ENEKRRLENERMVLLVKQKEL
        ENE+ +LENERM L +K+ EL
Subjt:  ENEKRRLENERMVLLVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors3.5e-8846.17Show/hide
Query:  SSAMLGLELPLHPNPTNPANPHHQLH-HPQMVSYVQHESDHHHHHPQAAPAVK--YSYPAKPKQQ---QSNLSDEEEAGFAA-------DDSNGDAKKKI
        S  ML LE+P   NP NP N     H HP   S  Q          Q  P +K  Y Y +KPKQ         D+E+ G  +       D +  D K+K+
Subjt:  SSAMLGLELPLHPNPTNPANPHHQLH-HPQMVSYVQHESDHHHHHPQAAPAVK--YSYPAKPKQQ---QSNLSDEEEAGFAA-------DDSNGDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEATDHTGKKKPV----------GLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG
        S W RMKWTD MVRLLI AVFYIGDE G    D    KK            G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEATDHTGKKKPV----------GLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG

Query:  TACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLV
         ACRVVENQ LL+SM+ LTPK+K+EV+KLLNSKHLFFREMCAYHNSC    G+ GG        +P +   PS         QQ CFH AA  G  A  +
Subjt:  TACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLV

Query:  RDDAEVSKSGEDDEEEEEDDDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQ
         +  EV +  E D  E+ + + EE E++E                   +RK+ R       +  +++L  E   V++D GKS WEKK+W++ +++++EE+
Subjt:  RDDAEVSKSGEDDEEEEEDDDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQ

Query:  QVNYKSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        ++ Y+ +  E+EKQR+KW+++RSKKER+ME+AKL+N++RRLE ERM+L++++ E++L +L          SS  R DPSS  G
Subjt:  QVNYKSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTGCAATGTTAGGATTGGAATTGCCTCTGCATCCGAACCCCACGAATCCAGCTAATCCTCACCACCAATTACACCACCCCCAGATGGTGTCTTATGTCCAGCA
CGAGTCCGACCACCACCATCACCACCCACAAGCTGCGCCGGCGGTTAAATATTCGTATCCGGCGAAGCCCAAGCAGCAGCAGTCGAATCTCAGCGACGAGGAGGAGGCCG
GGTTTGCGGCGGACGACAGTAATGGCGATGCGAAGAAGAAAATCTCGCCGTGGCAGAGAATGAAGTGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTTTAC
ATCGGAGATGAAGGTGGGTCGGAGGCGACGGACCACACCGGGAAGAAGAAGCCGGTGGGGCTGTTGCAGAAGAAAGGGAAGTGGAAATCGGTGTCTCGGGCGATGATGGA
GAAAGGTTTCTACGTTTCGCCGCAGCAATGCGAAGATAAGTTCAACGATTTGAATAAGAGATACAAGAGAGTGAACGATATTCTCGGGAAGGGCACCGCCTGCAGGGTGG
TGGAGAATCAGAGCTTGTTGGATTCAATGGAGTTGACGCCGAAAATGAAGGAAGAAGTTCGGAAATTACTCAATTCTAAGCATCTTTTCTTCAGGGAAATGTGTGCTTAC
CATAACAGCTGTCGTCACGGCACTGGGAACGGCGGCGGCGGCGCCACGTCTGGCGCTCACCATTCCCCAGAAGCGGCGGCGGAACCATCCCACGTTCAACAACAACAAGC
GCAACTACAGCAAAGGTGTTTCCACGCCGCCGCCTCCACCGGGTCGGCCGCGCATCTGGTCAGAGACGACGCGGAGGTGTCGAAAAGCGGAGAAGACGACGAGGAAGAGG
AGGAAGACGACGATTCGGAGGAAGAAGAAGACGACGAAGACGACGAAGAAATCGAGAGAACTTCACGAGGGCACGACGACGAAGACGAGACGGACTCGAGGAAAAGGCCG
AGAAAGGGGGGATTCACGGCGGCGACGCAGCAAATGCAGCAGCTGAGCGCGGAAGTCGTCGGCGTTATTCAAGACGGCGGGAAGAGTCCATGGGAGAAGAAGCAGTGGAT
GAAGAGCCGGCTGATACAGCTGGAGGAGCAGCAAGTGAACTACAAATCGCAAGCTTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCCGGAGCAAGAAGGAGA
GGGACATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGACGGCTGGAAAACGAGAGAATGGTGCTGCTGGTGAAGCAGAAGGAGCTCGATTTGCTGGATCTCCACCATTAC
CAGCAGCAGCAGCAGCAGCATTCGTCGAACAAGCGAGGCGACCCATCATCGATCACGGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTGCAATGTTAGGATTGGAATTGCCTCTGCATCCGAACCCCACGAATCCAGCTAATCCTCACCACCAATTACACCACCCCCAGATGGTGTCTTATGTCCAGCA
CGAGTCCGACCACCACCATCACCACCCACAAGCTGCGCCGGCGGTTAAATATTCGTATCCGGCGAAGCCCAAGCAGCAGCAGTCGAATCTCAGCGACGAGGAGGAGGCCG
GGTTTGCGGCGGACGACAGTAATGGCGATGCGAAGAAGAAAATCTCGCCGTGGCAGAGAATGAAGTGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTTTAC
ATCGGAGATGAAGGTGGGTCGGAGGCGACGGACCACACCGGGAAGAAGAAGCCGGTGGGGCTGTTGCAGAAGAAAGGGAAGTGGAAATCGGTGTCTCGGGCGATGATGGA
GAAAGGTTTCTACGTTTCGCCGCAGCAATGCGAAGATAAGTTCAACGATTTGAATAAGAGATACAAGAGAGTGAACGATATTCTCGGGAAGGGCACCGCCTGCAGGGTGG
TGGAGAATCAGAGCTTGTTGGATTCAATGGAGTTGACGCCGAAAATGAAGGAAGAAGTTCGGAAATTACTCAATTCTAAGCATCTTTTCTTCAGGGAAATGTGTGCTTAC
CATAACAGCTGTCGTCACGGCACTGGGAACGGCGGCGGCGGCGCCACGTCTGGCGCTCACCATTCCCCAGAAGCGGCGGCGGAACCATCCCACGTTCAACAACAACAAGC
GCAACTACAGCAAAGGTGTTTCCACGCCGCCGCCTCCACCGGGTCGGCCGCGCATCTGGTCAGAGACGACGCGGAGGTGTCGAAAAGCGGAGAAGACGACGAGGAAGAGG
AGGAAGACGACGATTCGGAGGAAGAAGAAGACGACGAAGACGACGAAGAAATCGAGAGAACTTCACGAGGGCACGACGACGAAGACGAGACGGACTCGAGGAAAAGGCCG
AGAAAGGGGGGATTCACGGCGGCGACGCAGCAAATGCAGCAGCTGAGCGCGGAAGTCGTCGGCGTTATTCAAGACGGCGGGAAGAGTCCATGGGAGAAGAAGCAGTGGAT
GAAGAGCCGGCTGATACAGCTGGAGGAGCAGCAAGTGAACTACAAATCGCAAGCTTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCCGGAGCAAGAAGGAGA
GGGACATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGACGGCTGGAAAACGAGAGAATGGTGCTGCTGGTGAAGCAGAAGGAGCTCGATTTGCTGGATCTCCACCATTAC
CAGCAGCAGCAGCAGCAGCATTCGTCGAACAAGCGAGGCGACCCATCATCGATCACGGGATGA
Protein sequenceShow/hide protein sequence
MSSAMLGLELPLHPNPTNPANPHHQLHHPQMVSYVQHESDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFY
IGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMKEEVRKLLNSKHLFFREMCAY
HNSCRHGTGNGGGGATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRP
RKGGFTAATQQMQQLSAEVVGVIQDGGKSPWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHY
QQQQQQHSSNKRGDPSSITG