; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr001060 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr001060
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSequence-specific DNA binding transcription factors isoform 2
Genome locationtig00000688:44995..46377
RNA-Seq ExpressionSgr001060
SyntenySgr001060
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596531.1 hypothetical protein SDJN03_09711, partial [Cucurbita argyrosperma subsp. sororia]1.1e-17977.06Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLH +PTNP N  HQLHHP MVSYV  +P      P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GG    GAH SP+ AAE PSH+QQ Q  Q QQRCFHA  +  +A       A V   G+D++E++E+D+
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD

Query:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF
        SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWLKF
Subjt:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF

Query:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        RSKKERDMERAKLENEKRRLE ERM L+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

KAG7028064.1 hypothetical protein SDJN02_09244, partial [Cucurbita argyrosperma subsp. argyrosperma]8.1e-18077.06Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLH +PTNP N  HQLHHP MVSYV  +P      P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GG    GAH SP+ AAE PSH+QQ Q  Q QQRCFHA  +  +A       A V   G+D++E++E+D+
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQQ-QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDD

Query:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF
        SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWLKF
Subjt:  SEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKF

Query:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        RSKKERDMERAKLENEKRRLE ERM L+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  RSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

XP_023005783.1 ESF1 homolog [Cucurbita maxima]9.5e-18177.37Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLH +PTNP N  HQLHHP MVSYV  +P      P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GG    GAH SP+ AAE PSH+Q  QQQ Q QQRCFHA  +  +A       A V   G+DD+E++E+D
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD

Query:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL
        +SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   AA   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQV Y+SQAFELEKQRLKWL
Subjt:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KFRSKKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

XP_023539309.1 ESF1 homolog [Cucurbita pepo subsp. pepo]4.7e-18076.96Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLH +PTNP N  HQLHHP MVSYV  +P      P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GG    GAH SP+ AAEP    QQ  Q QQRCFHA  +  +A       A V   G+D++E++E+D+SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        E+EDDE++EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWLKFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]2.4e-18478.74Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAP-AVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDM
        M+S+MLGLELPLHQNPTNP NP HQLHHP +VSYV H+P   HHH Q  P ++KY YP KPK QQSN+SD+EE GFAADDSNGD KKKISPWQRMKWTDM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAP-AVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDM

Query:  MVRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKM
        MVRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK 
Subjt:  MVRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKM

Query:  KEEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDS
        KEEVRKLLNSKHLFFREMCAYHN+CRHGT           H SP+ AAEPSH+ QQQ   QQRCFHA  +T SAA +     E SKSG+++EEEEEDD+S
Subjt:  KEEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDS

Query:  EEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFR
        EEEE +E+DEEIE      ++E+ET+SRKR RKGG TA    MQQLSAEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQV+Y++QAFELEKQRLKW+KFR
Subjt:  EEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFR

Query:  SKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        SKKERDMERAKLENEKRRLENERM+L+VKQKELDL+ +HHYQQQQQQHSSNKRGDPSSITG
Subjt:  SKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein4.0e-17776.09Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+S+MLGLELPLHQNPTNP NP HQLHHP MVSYVQH+P  HHH    + +VKY +P K K QQSNLSD+EE GFAADDSNGD KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH T           H SP+AA EPSH+ QQQ Q QQ CFHA  +T SA+       E SKSG+++EEEEE+++SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        EEE+DE+      T    ++E+ET+SRKR RKGG    T  MQQLSAEV+GVI DGG+S WEKKQWMKSRLIQLEEQQV++++QAFELEKQRLKW+KFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKR LENERM+L+VK+ ELDL+ + HYQQQQQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog3.4e-17675.87Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+S+MLGLELPLHQNPTNP NP HQLHHP MVSYVQH+P  HHH    + +VKY +P K K QQSNLSD+EE GFAADDSNGD KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH T           H SP+AA EPSH+ QQQ Q QQ CFHA  +T SA+       E SKSG++++EEEE+D+SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        EEE+DE+      T    ++E+ET+SRKR RKGG TA    MQQLSAEV+GVI DGG+S WEKKQWMKSRLIQLEEQ+V++++QAFELEKQRLKW+KFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKR LENERM+L+VK+ ELDL+ + HY QQQQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A5A7TKS6 ESF1-like protein3.4e-17675.87Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+S+MLGLELPLHQNPTNP NP HQLHHP MVSYVQH+P  HHH    + +VKY +P K K QQSNLSD+EE GFAADDSNGD KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE
        EEVRKLLNSKHLFFREMCAYHN+CRH T           H SP+AA EPSH+ QQQ Q QQ CFHA  +T SA+       E SKSG++++EEEE+D+SE
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSE

Query:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS
        EEE+DE+      T    ++E+ET+SRKR RKGG TA    MQQLSAEV+GVI DGG+S WEKKQWMKSRLIQLEEQ+V++++QAFELEKQRLKW+KFRS
Subjt:  EEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRS

Query:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KKERDMERAKLENEKR LENERM+L+VK+ ELDL+ + HY QQQQQHSSNKRGDPSSITG
Subjt:  KKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411301.6e-17876.72Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        ++SAMLGL+LPLH +PTNP N  HQLHH  MVSY   +P      P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQQ---QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEED
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GG    GAH SP+ AAE PSH+QQ   QQ Q QQRCFHA  S  +A       A V   G+D++E++E+
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQQ---QQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEED

Query:  DDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL
        ++SEE+EDDE +EEIE TSRGH++EDET+SRKR RKGG   A   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQVNY+SQAFELEKQRLKWL
Subjt:  DDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KFRSKKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY  QQQQHSSNKRGDPSSITG
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog4.6e-18177.37Show/hide
Query:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM
        M+SAMLGL+LPLH +PTNP N  HQLHHP MVSYV  +P      P    AV+Y YPAKPK QQSNLSD+EE GFA +D N D KKKISPWQRMKWTDMM
Subjt:  MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMM

Query:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK
        VRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ+LLDSMELTPK K
Subjt:  VRLLITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMK

Query:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD
        EEVRKLLNSKHLFFREMCAYHN+CRH TGN GG    GAH SP+ AAE PSH+Q  QQQ Q QQRCFHA  +  +A       A V   G+DD+E++E+D
Subjt:  EEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAE-PSHVQ--QQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDD

Query:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL
        +SEE+EDDE++EEIE TSRGH++EDET+SRKR RKGG   AA   MQQL+AEV+GV+QDGG+S WEKKQWMKSRLIQLEEQQV Y+SQAFELEKQRLKWL
Subjt:  DSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGF-TAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        KFRSKKERDMERAKLENEKRRLE ERMVL+VKQKELD +D+HHY   QQQHSSNKRGDPSSITG
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.2e-5336.96Show/hide
Query:  HQNPTNPANPHH----QLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNG-----DAKKKISPWQRMKWTDMMVRL
        HQ+  N  + H+     LH     + V  +   HH +      +  S   K +++++++SD++E  F  +  +G     +   K SPWQR+KWTD MV+L
Subjt:  HQNPTNPANPHH----QLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNG-----DAKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSM-ELTPKMKEE
        LITAV YIGD+   +++     ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+C+VVEN +LLDS+  L  K K++
Subjt:  LITAVFYIGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSM-ELTPKMKEE

Query:  VRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEE
        VRK+++SKHLF+ EMC+YHN  R               H P   A      Q+  QL  R      +  S  H + D  +    G+ DE +E ++     
Subjt:  VRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEE

Query:  EDDEDDE--------EIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDG---GKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQ
         D   +         +  R S  H+D D                   + Q+      V Q G   G++   +KQWM+SR +QLEEQ++  + +  ELEKQ
Subjt:  EDDEDDE--------EIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDG---GKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQ

Query:  RLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKEL
        R +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  RLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)5.1e-4736.34Show/hide
Query:  NPANPHHQLHHP--QMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEG
        N  N + + HHP  +  S   +  D  H++                + + ++S+++E    + D    +K+  SPWQR+KW D MV+L+ITA+ YIG++ 
Subjt:  NPANPHHQLHHP--QMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFYIGDEG

Query:  GSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFF
        GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFNDLNKRYK++N++LG+GT+C VVEN SLLD ++ L  K K+EVR++++SKHLF+
Subjt:  GSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFF

Query:  REMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDE--DDEEIE
         EMC+YHN  R               H P   A             QR  H   + GS     RDD +  + G+   E+ +DDD  EE+ D    D  ++
Subjt:  REMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDE--DDEEIE

Query:  RTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEV-VGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRSKKERDMERAKL
        R  +    ED       P KG       + Q   A+V  G+  D  K+A  ++Q ++S+ ++LE +++  +++  ELE+Q+ KW  F  ++E+ + + ++
Subjt:  RTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEV-VGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRSKKERDMERAKL

Query:  ENEKRRLENERMVLLVKQKEL
        ENE+ +LENERM L +K+ EL
Subjt:  ENEKRRLENERMVLLVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors4.1e-8946.38Show/hide
Query:  SSAMLGLELPLHQNPTNPANPHHQLH-HPQMVSYVQHEPDHHHHHPQAAPAVK--YSYPAKPKQQ---QSNLSDEEEAGFAA-------DDSNGDAKKKI
        S  ML LE+P  QNP NP N     H HP   S  Q          Q  P +K  Y Y +KPKQ         D+E+ G  +       D +  D K+K+
Subjt:  SSAMLGLELPLHQNPTNPANPHHQLH-HPQMVSYVQHEPDHHHHHPQAAPAVK--YSYPAKPKQQ---QSNLSDEEEAGFAA-------DDSNGDAKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEATDHTGKKKPV----------GLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG
        S W RMKWTD MVRLLI AVFYIGDE G    D    KK            G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGGSEATDHTGKKKPV----------GLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG

Query:  TACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLV
         ACRVVENQ LL+SM+ LTPK+K+EV+KLLNSKHLFFREMCAYHNSC    G+ GG        +P +   PS         QQ CFH AA  G  A  +
Subjt:  TACRVVENQSLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLV

Query:  RDDAEVSKSGEDDEEEEEDDDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQ
         +  EV +  E D  E+ + + EE E++E                   +RK+ R       +  +++L  E   V++D GKS WEKK+W++ +++++EE+
Subjt:  RDDAEVSKSGEDDEEEEEDDDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRPRKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQ

Query:  QVNYKSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG
        ++ Y+ +  E+EKQR+KW+++RSKKER+ME+AKL+N++RRLE ERM+L++++ E++L +L          SS  R DPSS  G
Subjt:  QVNYKSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHYQQQQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTGCAATGTTAGGATTGGAATTGCCTCTGCATCAGAACCCCACGAATCCAGCTAATCCTCACCACCAATTACACCACCCCCAGATGGTGTCTTACGTCCAGCA
CGAGCCGGACCACCACCATCACCACCCACAAGCTGCGCCGGCGGTTAAATATTCGTATCCGGCGAAGCCCAAGCAGCAGCAGTCGAATCTCAGCGACGAGGAGGAGGCCG
GGTTTGCGGCGGACGACAGTAATGGCGATGCGAAGAAGAAAATCTCGCCGTGGCAGAGAATGAAGTGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTTTAC
ATCGGAGATGAAGGTGGGTCGGAGGCGACGGACCACACCGGTAAGAAGAAGCCGGTGGGGCTGTTGCAGAAGAAAGGGAAGTGGAAATCGGTGTCTCGGGCGATGATGGA
GAAAGGTTTCTACGTTTCGCCGCAGCAATGCGAAGATAAGTTCAACGATTTGAATAAGAGATACAAGAGAGTGAACGATATTCTCGGGAAGGGCACCGCCTGCAGGGTGG
TGGAGAATCAGAGCTTGTTGGATTCAATGGAGTTGACGCCGAAAATGAAGGAAGAAGTTCGGAAATTACTCAATTCTAAGCATCTTTTCTTCAGGGAAATGTGTGCTTAC
CATAACAGCTGTCGTCACGGCACCGGGAACGGCGGCGGCGCCGCCACGTCTGGCGCTCACCATTCCCCAGAAGCGGCGGCGGAACCATCCCACGTTCAACAACAACAAGC
GCAACTACAGCAAAGGTGTTTCCACGCCGCCGCCTCCACCGGGTCGGCCGCGCATCTGGTCAGAGACGACGCGGAGGTGTCGAAAAGCGGAGAAGACGACGAGGAAGAGG
AGGAAGACGACGATTCGGAGGAAGAAGAAGACGACGAAGACGACGAAGAAATCGAGAGAACTTCACGAGGGCACGACGACGAAGACGAGACGGACTCGAGGAAAAGGCCG
AGAAAGGGGGGATTCACGGCGGCGACGCAGCAAATGCAGCAGCTGAGCGCGGAAGTCGTCGGCGTTATTCAAGACGGCGGGAAGAGTGCATGGGAGAAGAAGCAGTGGAT
GAAGAGCCGGCTGATACAGCTGGAGGAGCAGCAAGTGAACTACAAATCGCAAGCTTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCCGGAGCAAGAAGGAGA
GGGACATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGACGGCTGGAAAACGAGAGAATGGTGCTGCTGGTGAAGCAGAAGGAGCTCGATTTGCTGGATCTCCACCATTAC
CAGCAGCAGCAGCAGCAGCATTCGTCGAACAAGCGAGGCGACCCATCATCGATCACGGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTGCAATGTTAGGATTGGAATTGCCTCTGCATCAGAACCCCACGAATCCAGCTAATCCTCACCACCAATTACACCACCCCCAGATGGTGTCTTACGTCCAGCA
CGAGCCGGACCACCACCATCACCACCCACAAGCTGCGCCGGCGGTTAAATATTCGTATCCGGCGAAGCCCAAGCAGCAGCAGTCGAATCTCAGCGACGAGGAGGAGGCCG
GGTTTGCGGCGGACGACAGTAATGGCGATGCGAAGAAGAAAATCTCGCCGTGGCAGAGAATGAAGTGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTTTAC
ATCGGAGATGAAGGTGGGTCGGAGGCGACGGACCACACCGGTAAGAAGAAGCCGGTGGGGCTGTTGCAGAAGAAAGGGAAGTGGAAATCGGTGTCTCGGGCGATGATGGA
GAAAGGTTTCTACGTTTCGCCGCAGCAATGCGAAGATAAGTTCAACGATTTGAATAAGAGATACAAGAGAGTGAACGATATTCTCGGGAAGGGCACCGCCTGCAGGGTGG
TGGAGAATCAGAGCTTGTTGGATTCAATGGAGTTGACGCCGAAAATGAAGGAAGAAGTTCGGAAATTACTCAATTCTAAGCATCTTTTCTTCAGGGAAATGTGTGCTTAC
CATAACAGCTGTCGTCACGGCACCGGGAACGGCGGCGGCGCCGCCACGTCTGGCGCTCACCATTCCCCAGAAGCGGCGGCGGAACCATCCCACGTTCAACAACAACAAGC
GCAACTACAGCAAAGGTGTTTCCACGCCGCCGCCTCCACCGGGTCGGCCGCGCATCTGGTCAGAGACGACGCGGAGGTGTCGAAAAGCGGAGAAGACGACGAGGAAGAGG
AGGAAGACGACGATTCGGAGGAAGAAGAAGACGACGAAGACGACGAAGAAATCGAGAGAACTTCACGAGGGCACGACGACGAAGACGAGACGGACTCGAGGAAAAGGCCG
AGAAAGGGGGGATTCACGGCGGCGACGCAGCAAATGCAGCAGCTGAGCGCGGAAGTCGTCGGCGTTATTCAAGACGGCGGGAAGAGTGCATGGGAGAAGAAGCAGTGGAT
GAAGAGCCGGCTGATACAGCTGGAGGAGCAGCAAGTGAACTACAAATCGCAAGCTTTCGAGCTGGAGAAGCAGCGGCTGAAATGGCTGAAGTTCCGGAGCAAGAAGGAGA
GGGACATGGAGAGAGCGAAGCTGGAGAACGAGAAGAGACGGCTGGAAAACGAGAGAATGGTGCTGCTGGTGAAGCAGAAGGAGCTCGATTTGCTGGATCTCCACCATTAC
CAGCAGCAGCAGCAGCAGCATTCGTCGAACAAGCGAGGCGACCCATCATCGATCACGGGATGA
Protein sequenceShow/hide protein sequence
MSSAMLGLELPLHQNPTNPANPHHQLHHPQMVSYVQHEPDHHHHHPQAAPAVKYSYPAKPKQQQSNLSDEEEAGFAADDSNGDAKKKISPWQRMKWTDMMVRLLITAVFY
IGDEGGSEATDHTGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQSLLDSMELTPKMKEEVRKLLNSKHLFFREMCAY
HNSCRHGTGNGGGAATSGAHHSPEAAAEPSHVQQQQAQLQQRCFHAAASTGSAAHLVRDDAEVSKSGEDDEEEEEDDDSEEEEDDEDDEEIERTSRGHDDEDETDSRKRP
RKGGFTAATQQMQQLSAEVVGVIQDGGKSAWEKKQWMKSRLIQLEEQQVNYKSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLLVKQKELDLLDLHHY
QQQQQQHSSNKRGDPSSITG