; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033573 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033573
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionSequence-specific DNA binding transcription factors isoform 2
Genome locationscaffold5:3609153..3610922
RNA-Seq ExpressionSpg033573
SyntenySpg033573
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596531.1 hypothetical protein SDJN03_09711, partial [Cucurbita argyrosperma subsp. sororia]9.9e-20785.27Show/hide
Query:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE
        ME+ SLGG GGGGGGG        GGGMF GM+SAMLGL+LPLH +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDE
Subjt:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE

Query:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
        EQGFA +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
Subjt:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV

Query:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHLP----QQQQQRCFHATETAAA
        NDILGKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAE PSHL     QQQQQRCFHATETAA 
Subjt:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHLP----QQQQQRCFHATETAAA

Query:  ATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQ
          AAA     GDDEDE+D+EDESE+DE+++++EEIEGTSRGHEEE+ETESRKRARKGG A AAAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQLEEQ
Subjt:  ATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQ

Query:  QVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
        QVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERM LM+KQKELD MDMHHY  QQQHSSNKRGDPSSITG
Subjt:  QVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG

KAG7028064.1 hypothetical protein SDJN02_09244, partial [Cucurbita argyrosperma subsp. argyrosperma]8.9e-21680.6Show/hide
Query:  SFYILSLSLHPTNLSILSLSLSPDSSHSKGFRFLGLLLFLELLTANRTNWGLVFMENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQN
        SF +  + L    + I +  L+   S +KGF FLGL+LFLE +TANR N G+V ME+ SLGG GGGG           GGGMF GM+SAMLGL+LPLH +
Subjt:  SFYILSLSLHPTNLSILSLSLSPDSSHSKGFRFLGLLLFLELLTANRTNWGLVFMENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQN

Query:  QNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGS
           NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGFA +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+
Subjt:  QNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGS

Query:  EPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREM
        E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREM
Subjt:  EPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREM

Query:  CAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHLP----QQQQQRCFHATETAAAATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEE
        CAYHNTCRH TGNN GGGAHPSPDTAAE PSHL     QQQQQRCFHATETAA   AAA     GDDEDE+D+EDESE+DE+++++EEIEGTSRGHEEE+
Subjt:  CAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHLP----QQQQQRCFHATETAAAATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEE

Query:  ETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERM
        ETESRKRARKGG A AAAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERM
Subjt:  ETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERM

Query:  VLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
         LM+KQKELD MDMHHY  QQQHSSNKRGDPSSITG
Subjt:  VLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG

XP_004138003.2 uncharacterized protein DDB_G0290301 [Cucumis sativus]1.9e-20578.19Show/hide
Query:  SSFYILSLS--LHPTNLSILSLSLSPDSSHSKGFRFL--GLLLFLELLTANRTNWGLVFMENSSLGGGGGG--GGGGGGGGGGGAGGGMFPGMSSAMLGL
        S +Y+LSLS  L P+N   + L L  D      F +L  GLLLFL L TANR NWG++ ME +SLGGGGGG  GG GGGGGGG  GGGMF GM+S+MLGL
Subjt:  SSFYILSLS--LHPTNLSILSLSLSPDSSHSKGFRFL--GLLLFLELLTANRTNWGLVFMENSSLGGGGGG--GGGGGGGGGGGAGGGMFPGMSSAMLGL

Query:  ELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFY
        ELPLHQN   NPTNPHQLHHP MVSYV H+PHHHQQPP   VKYP+P K KPQQSNLSDDEEQGFAADD N DGKKKISPWQRMKWTDMMVRLLITAVFY
Subjt:  ELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFY

Query:  IGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSK
        IGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQTLLDSMELTPK KEEVRKLLNSK
Subjt:  IGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSK

Query:  HLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQR--CFHATETAAAAT-AAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGH
        HLFFREMCAYHNTCRH T       HPSPD A EPSHLPQQQQQ+  CFHAT+T  +A+ AA E SKSGD+E+EE++E+ES   EEE++DEE EG     
Subjt:  HLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQR--CFHATETAAAAT-AAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGH

Query:  EEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE
        EEEEETESRKRARKGG    A MQQLSAEV+GV+ DGGRS WEKKQWMKSRLIQLEEQQV++Q+QAFELEKQRLKW+KFRSKKERDMERAKLENEKR LE
Subjt:  EEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE

Query:  NERMVLMMKQKELDLMDMHHY-QQQQQHSSNKRGDPSSITG
        NERM+LM+K+ ELDLM M HY QQQQQHSSNKRGDPSSITG
Subjt:  NERMVLMMKQKELDLMDMHHY-QQQQQHSSNKRGDPSSITG

XP_023005783.1 ESF1 homolog [Cucurbita maxima]3.2e-20585.15Show/hide
Query:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE
        ME+ SLGG  GGGGGGGGGG    GGGMF GM+SAMLGL+LPLH +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDE
Subjt:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE

Query:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
        EQGFA +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
Subjt:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV

Query:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHL-----PQQQQQRCFHATETAA
        NDILGKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAE PSHL      QQQQQRCFHATETAA
Subjt:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHL-----PQQQQQRCFHATETAA

Query:  AATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGA--AAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQL
           AAA     GDD+DE+D+EDESE+DE+++++EEIEGTSRGHEEE+ETESRKRARKGG A A  AAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQL
Subjt:  AATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGA--AAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQL

Query:  EEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
        EEQQV YQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLM+KQKELD MDMHHY  QQQHSSNKRGDPSSITG
Subjt:  EEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG

XP_023539309.1 ESF1 homolog [Cucurbita pepo subsp. pepo]1.1e-20585.42Show/hide
Query:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE
        ME+ SLGG GGGGGGG        GGGMF GM+SAMLGL+LPLH +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDE
Subjt:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE

Query:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
        EQGFA +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
Subjt:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV

Query:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHLPQ--QQQQRCFHATETAAAAT
        NDILGKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAE PSHL Q  QQQQRCFHATETAA   
Subjt:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHLPQ--QQQQRCFHATETAAAAT

Query:  AAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQV
         AA     GDDEDE+D+EDESE+DE+++++EEIEGTSRGHEEE+ETESRKRARKGG A  AAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQLEEQQV
Subjt:  AAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQV

Query:  NYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
        NYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLM+KQKELD MDMHHY  QQQHSSNKRGDPSSITG
Subjt:  NYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein2.9e-19681.95Show/hide
Query:  MENSSLGGGGGG--GGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSD
        ME +SLGGGGGG  GG GGGGGGG  GGGMF GM+S+MLGLELPLHQN   NPTNPHQLHHP MVSYV H+PHHHQQPP   VKYP+P K KPQQSNLSD
Subjt:  MENSSLGGGGGG--GGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSD

Query:  DEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        DEEQGFAADD N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  DEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQR--CFHATETAAAAT
        RVNDILGKGTAC+VVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH T       HPSPD A EPSHLPQQQQQ+  CFHAT+T  +A+
Subjt:  RVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQR--CFHATETAAAAT

Query:  -AAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQ
         AA E SKSGD+E+EE++E+ES   EEE++DEE EG     EEEEETESRKRARKGG    A MQQLSAEV+GV+ DGGRS WEKKQWMKSRLIQLEEQQ
Subjt:  -AAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQ

Query:  VNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHY-QQQQQHSSNKRGDPSSITG
        V++Q+QAFELEKQRLKW+KFRSKKERDMERAKLENEKR LENERM+LM+K+ ELDLM M HY QQQQQHSSNKRGDPSSITG
Subjt:  VNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHY-QQQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog1.3e-19682.16Show/hide
Query:  MENSSLGGGG-GGGGGGGGGGGGGA--GGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLS
        ME +SLGGGG GG GGGGGGGG GA  GGGMF GM+S+MLGLELPLHQN   NPTNPHQLHHP MVSYV H+PHHHQQPP   VKYP+P K KPQQSNLS
Subjt:  MENSSLGGGG-GGGGGGGGGGGGGA--GGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLS

Query:  DDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRY
        DDEEQGFAADD N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRY
Subjt:  DDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRY

Query:  KRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQR--CFHATE-TAAA
        KRVNDILGKGTAC+VVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH T       HPSPD A EPSHLPQQQQQ+  CFHAT+ T +A
Subjt:  KRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQR--CFHATE-TAAA

Query:  ATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQ
        + AA E SKSGD+EDEE++EDES   EEE++DEE EG     EEEEETESRKRARKGG    A MQQLSAEV+GV+ DGGRS WEKKQWMKSRLIQLEEQ
Subjt:  ATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQ

Query:  QVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
        +V++Q+QAFELEKQRLKW+KFRSKKERDMERAKLENEKR LENERM+L++K+ ELDLM M HYQQQQQHSSNKRGDPSSITG
Subjt:  QVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411301.6e-20283.68Show/hide
Query:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE
        ME+ SLG  GGGGGGG        GGGMF G++SAMLGL+LPLH +   NP N HQLHH SMVSY P +P   QQPPP  V+YPYPAKPKPQQSNLSDDE
Subjt:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE

Query:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
        EQGFA +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
Subjt:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV

Query:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEP-------SHLPQQQQQRCFHATETA
        NDILGKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP        H  QQQQQRCFHATE+A
Subjt:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEP-------SHLPQQQQQRCFHATETA

Query:  AAATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLE
        A   AAA     GDDEDE+D+E+ESE+D E+D++EEIEGTSRGHEEE+ETESRKRARKGG A  AAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQLE
Subjt:  AAATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLE

Query:  EQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
        EQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLM+KQKELD MDMHHY QQQQHSSNKRGDPSSITG
Subjt:  EQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG

A0A6J1J5X4 ribosome quality control complex subunit 2-like1.1e-19282.11Show/hide
Query:  GGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKKKIS
        GGGGGA GGMFPG+SS+MLGLELPLH    Q P+NPHQLHHP MVSYVPHE HH QQPPPA VK PYPAKPKPQQSN+SD+EEQG  ADD N D KKKIS
Subjt:  GGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKKKIS

Query:  PWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTL
        PWQRMKWTDMMVRLLITAVFYIGDEGGSEP DHA KKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVEN TL
Subjt:  PWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTL

Query:  LDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHLP-QQQQQRCFHATETAAAATAAAEASKSGDDEDE---EDD
        L+SMELTPK+KEEVRKLLNSKHLFFREMCAYHNTCRHG GN+ GGGA+  P+T AEPSHLP QQQQQRCFHATETA  AT A E SKSGD+E+E   EDD
Subjt:  LDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHLP-QQQQQRCFHATETAAAATAAAEASKSGDDEDE---EDD

Query:  EDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWL
        E E +D+EEE+++EE+EG+SR  EEEEETES+KR RK G    A +QQ+SAEV+GV+QDGGRS WEKKQWMK RLIQLEEQQV YQSQ  E+EKQRLKWL
Subjt:  EDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWL

Query:  KFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHY-QQQQQHSSNKRGDPSSITG
        KFR KKERDMERAKLENEKRRLENERM+LM+KQKELDL D+H+Y QQQQQHSSN+RGDPSSITG
Subjt:  KFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHY-QQQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog1.5e-20585.15Show/hide
Query:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE
        ME+ SLGG  GGGGGGGGGG    GGGMF GM+SAMLGL+LPLH +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDE
Subjt:  MENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDE

Query:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
        EQGFA +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV
Subjt:  EQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRV

Query:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHL-----PQQQQQRCFHATETAA
        NDILGKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAE PSHL      QQQQQRCFHATETAA
Subjt:  NDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAE-PSHL-----PQQQQQRCFHATETAA

Query:  AATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGA--AAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQL
           AAA     GDD+DE+D+EDESE+DE+++++EEIEGTSRGHEEE+ETESRKRARKGG A A  AAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQL
Subjt:  AATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGA--AAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQL

Query:  EEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
        EEQQV YQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLM+KQKELD MDMHHY  QQQHSSNKRGDPSSITG
Subjt:  EEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors3.7e-5036.92Show/hide
Query:  GGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEP-HHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGN----
        GG    G +  G F    S  +  +  ++Q    NP N   LH     + V  +   HHQ    ++ +     K + +++++SDD+E  F  + G+    
Subjt:  GGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEP-HHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGN----

Query:  -CDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
          +   K SPWQR+KWTD MV+LLITAV YIGD+   +    +  ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+
Subjt:  -CDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQTLLDSM-ELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQRCFHATETAAAATAAAEASKSGDDE
        C+VVEN  LLDS+  L  K K++VRK+++SKHLF+ EMC+YH         NG   H   D A       Q+  Q    + +      +     +  DDE
Subjt:  CRVVENQTLLDSM-ELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQRCFHATETAAAATAAAEASKSGDDE

Query:  DEEDDEDESEDDEEED---DDEEIEGTSRG-----------HEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQ
        D + D DE ++ EE+     D  +     G             E+ +  S   + +        +    A+V     + GR+   +KQWM+SR +QLEEQ
Subjt:  DEEDDEDESEDDEEED---DDEEIEGTSRG-----------HEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQ

Query:  QVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKEL
        ++  Q +  ELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  QVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)4.1e-4937.78Show/hide
Query:  QSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND
        + ++S+D+E    + DG  +  K+ SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFND
Subjt:  QSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND

Query:  LNKRYKRVNDILGKGTACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQ-QRCFHATE
        LNKRYK++N++LG+GT+C VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN      GN                HLP     QR  H   
Subjt:  LNKRYKRVNDILGKGTACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQ-QRCFHATE

Query:  TAAAATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEV-IGVLQDGGRSAWEKKQWMKSRLI
          +      +      +ED +DD+D  ED +    D  ++   +    E+     K            + +  A+V  G+  D  ++A  ++Q ++S+ +
Subjt:  TAAAATAAAEASKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEV-IGVLQDGGRSAWEKKQWMKSRLI

Query:  QLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKEL
        +LE +++  Q++  ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  QLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors7.6e-9649.89Show/hide
Query:  MFPGMSSAMLGLELPLHQNQNQNPTNPH---QLHHPSMVSYVPHEPHHHQQ-PPPAVVKYPYPAKPKPQQSNLS----DDEEQGFAA-------DDGNCD
        MF G S  ML LE+P      QNP NP    Q  HP      P+     QQ  PP    YPY +KPK Q S +S    DDE++G  +       D    D
Subjt:  MFPGMSSAMLGLELPLHQNQNQNPTNPH---QLHHPSMVSYVPHEPHHHQQ-PPPAVVKYPYPAKPKPQQSNLS----DDEEQGFAA-------DDGNCD

Query:  GKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKT---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND
        GK+K+S W RMKWTD MVRLLI AVFYIGDE G ++P D   KKKT          G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVND
Subjt:  GKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKT---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND

Query:  ILGKGTACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQRCFHATETAAAATAAAEA
        ILGKG ACRVVENQ LL+SM+ LTPK+K+EV+KLLNSKHLFFREMCAYHN+C H  G++      +P +   PS     QQQ CFH          AAEA
Subjt:  ILGKGTACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQRCFHATETAAAATAAAEA

Query:  SKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQ
         K     +  + E+E E D  ED + E+       EE EE E+RK+ R      + A+++L  E   V++D G+S WEKK+W++ +++++EE+++ Y+ +
Subjt:  SKSGDDEDEEDDEDESEDDEEEDDDEEIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQ

Query:  AFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG
          E+EKQR+KW+++RSKKER+ME+AKL+N++RRLE ERM+LM+++ E++L ++         SS  R DPSS  G
Subjt:  AFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAACCAACAACATTTCACTACGCCATTCCATTCTCAAACTCCCCCGTTTTCCTTCCAATTCCTGTGGGTTTCCAGTTTCAATTCAACACCGCTGCTCCCAAAACC
ATGGGCGGTTTTCTGTCAACCCAATACCACAATCCGCCCTCCCAAAACCGGCTGCCCCTCCTCTTCCTCTTTCTATATACTCTCTCTCTCACTTCATCCCACAAACTTAT
CGATCTTATCTCTCTCTCTTTCTCCAGATTCTTCACATTCTAAGGGATTTCGCTTTCTGGGTTTGCTTCTGTTTCTCGAATTACTAACGGCAAATCGGACGAATTGGGGA
TTGGTTTTCATGGAGAATAGTAGTTTAGGCGGCGGCGGGGGAGGTGGAGGTGGAGGCGGCGGCGGCGGTGGTGGAGGTGCGGGTGGAGGGATGTTTCCCGGGATGAGTTC
GGCAATGTTGGGATTGGAATTACCGCTTCACCAAAACCAAAACCAAAACCCCACCAATCCTCACCAATTGCATCATCCTTCTATGGTGTCTTACGTCCCACATGAGCCCC
ACCACCACCAGCAACCGCCGCCTGCCGTCGTGAAATACCCTTATCCGGCGAAGCCCAAGCCGCAGCAGTCGAATCTGAGCGACGACGAGGAGCAGGGGTTCGCGGCGGAT
GACGGCAACTGCGACGGGAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGACGAGGG
TGGGTCGGAGCCGGCGGACCACGCCGGCAAGAAGAAAACAGTGGGGCTTCTGCAGAAGAAAGGGAAATGGAAATCGGTGTCGAGAGCGATGATGGAAAAAGGATTCTACG
TTTCGCCGCAGCAATGCGAAGATAAATTCAACGATTTGAACAAAAGATACAAACGAGTTAACGACATTTTGGGAAAGGGAACCGCCTGCAGAGTCGTCGAGAATCAGACC
TTATTGGATTCAATGGAATTAACGCCGAAAATGAAAGAAGAAGTCCGGAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGCGCTTACCACAACACTTGCCG
TCACGGCACCGGAAACAACGGCGGCGGCGCGCATCCTTCACCGGACACCGCGGCGGAACCATCCCACCTTCCACAACAACAGCAACAACGATGCTTCCACGCGACAGAGA
CCGCCGCCGCCGCTACAGCGGCGGCGGAGGCTTCAAAAAGCGGTGATGATGAAGACGAGGAGGACGACGAAGACGAATCGGAGGACGACGAGGAGGAGGACGACGATGAG
GAAATCGAAGGAACTTCGAGAGGGCACGAAGAGGAAGAGGAAACGGAATCGAGGAAGCGGGCGAGGAAAGGAGGATCTGCAGGGGCGGCGGCGATGCAGCAATTGAGTGC
GGAGGTGATCGGAGTGCTGCAGGACGGTGGGAGGAGCGCGTGGGAGAAGAAGCAATGGATGAAGAGCCGATTGATTCAGCTTGAAGAGCAGCAGGTGAACTACCAATCGC
AAGCTTTCGAGCTGGAGAAACAGAGGCTGAAATGGCTGAAGTTCAGGAGCAAAAAGGAGAGGGATATGGAGAGGGCGAAGCTGGAGAACGAGAAGAGGAGGCTGGAGAAC
GAGAGGATGGTGCTGATGATGAAGCAGAAGGAGTTGGATTTGATGGATATGCACCATTATCAGCAGCAGCAGCAGCATTCGTCGAATAAGCGAGGGGATCCATCGTCGAT
TACAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAACCAACAACATTTCACTACGCCATTCCATTCTCAAACTCCCCCGTTTTCCTTCCAATTCCTGTGGGTTTCCAGTTTCAATTCAACACCGCTGCTCCCAAAACC
ATGGGCGGTTTTCTGTCAACCCAATACCACAATCCGCCCTCCCAAAACCGGCTGCCCCTCCTCTTCCTCTTTCTATATACTCTCTCTCTCACTTCATCCCACAAACTTAT
CGATCTTATCTCTCTCTCTTTCTCCAGATTCTTCACATTCTAAGGGATTTCGCTTTCTGGGTTTGCTTCTGTTTCTCGAATTACTAACGGCAAATCGGACGAATTGGGGA
TTGGTTTTCATGGAGAATAGTAGTTTAGGCGGCGGCGGGGGAGGTGGAGGTGGAGGCGGCGGCGGCGGTGGTGGAGGTGCGGGTGGAGGGATGTTTCCCGGGATGAGTTC
GGCAATGTTGGGATTGGAATTACCGCTTCACCAAAACCAAAACCAAAACCCCACCAATCCTCACCAATTGCATCATCCTTCTATGGTGTCTTACGTCCCACATGAGCCCC
ACCACCACCAGCAACCGCCGCCTGCCGTCGTGAAATACCCTTATCCGGCGAAGCCCAAGCCGCAGCAGTCGAATCTGAGCGACGACGAGGAGCAGGGGTTCGCGGCGGAT
GACGGCAACTGCGACGGGAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGACGAGGG
TGGGTCGGAGCCGGCGGACCACGCCGGCAAGAAGAAAACAGTGGGGCTTCTGCAGAAGAAAGGGAAATGGAAATCGGTGTCGAGAGCGATGATGGAAAAAGGATTCTACG
TTTCGCCGCAGCAATGCGAAGATAAATTCAACGATTTGAACAAAAGATACAAACGAGTTAACGACATTTTGGGAAAGGGAACCGCCTGCAGAGTCGTCGAGAATCAGACC
TTATTGGATTCAATGGAATTAACGCCGAAAATGAAAGAAGAAGTCCGGAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGCGCTTACCACAACACTTGCCG
TCACGGCACCGGAAACAACGGCGGCGGCGCGCATCCTTCACCGGACACCGCGGCGGAACCATCCCACCTTCCACAACAACAGCAACAACGATGCTTCCACGCGACAGAGA
CCGCCGCCGCCGCTACAGCGGCGGCGGAGGCTTCAAAAAGCGGTGATGATGAAGACGAGGAGGACGACGAAGACGAATCGGAGGACGACGAGGAGGAGGACGACGATGAG
GAAATCGAAGGAACTTCGAGAGGGCACGAAGAGGAAGAGGAAACGGAATCGAGGAAGCGGGCGAGGAAAGGAGGATCTGCAGGGGCGGCGGCGATGCAGCAATTGAGTGC
GGAGGTGATCGGAGTGCTGCAGGACGGTGGGAGGAGCGCGTGGGAGAAGAAGCAATGGATGAAGAGCCGATTGATTCAGCTTGAAGAGCAGCAGGTGAACTACCAATCGC
AAGCTTTCGAGCTGGAGAAACAGAGGCTGAAATGGCTGAAGTTCAGGAGCAAAAAGGAGAGGGATATGGAGAGGGCGAAGCTGGAGAACGAGAAGAGGAGGCTGGAGAAC
GAGAGGATGGTGCTGATGATGAAGCAGAAGGAGTTGGATTTGATGGATATGCACCATTATCAGCAGCAGCAGCAGCATTCGTCGAATAAGCGAGGGGATCCATCGTCGAT
TACAGGTTGA
Protein sequenceShow/hide protein sequence
MPNQQHFTTPFHSQTPPFSFQFLWVSSFNSTPLLPKPWAVFCQPNTTIRPPKTGCPSSSSFYILSLSLHPTNLSILSLSLSPDSSHSKGFRFLGLLLFLELLTANRTNWG
LVFMENSSLGGGGGGGGGGGGGGGGGAGGGMFPGMSSAMLGLELPLHQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAAD
DGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQT
LLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHLPQQQQQRCFHATETAAAATAAAEASKSGDDEDEEDDEDESEDDEEEDDDE
EIEGTSRGHEEEEETESRKRARKGGSAGAAAMQQLSAEVIGVLQDGGRSAWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLEN
ERMVLMMKQKELDLMDMHHYQQQQQHSSNKRGDPSSITG