; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026975 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026975
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSequence-specific DNA binding transcription factors isoform 2
Genome locationchr10:43904768..43906228
RNA-Seq ExpressionLag0026975
SyntenyLag0026975
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596531.1 hypothetical protein SDJN03_09711, partial [Cucurbita argyrosperma subsp. sororia]1.6e-20283.79Show/hide
Query:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD
        ME+ SLG  GGGGGG GGGMFSGM+SAMLGL+LPL+ +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGFA +D
Subjt:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH-----PQQQQQRCFHATETAAAAAAAAEAS
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP  H      QQQQQRCFHATETAAAAA      
Subjt:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH-----PQQQQQRCFHATETAAAAAAAAEAS

Query:  KSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQA
          GDD+DEDDEEDESEEDE+++++E+IEGTSRG EEE+ETESRKRARKGG A AAAMQQL+AEVIGV+QDGGRS WEKKQWMK RLIQLEEQQV YQSQA
Subjt:  KSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQA

Query:  LELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
         ELEKQRLKW+KFRSKKERDMERAK+ENEKRRLE ERM LMVKQKELD MDMHHY     QQQHSSNKRGDPS +
Subjt:  LELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

KAG7028064.1 hypothetical protein SDJN02_09244, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-20183.54Show/hide
Query:  MENSSLGAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDG
        ME+ SLG  GGGG  GGGMFSGM+SAMLGL+LPL+ +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGFA +DG
Subjt:  MENSSLGAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDG

Query:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH-----PQQQQQRCFHATETAAAAAAAAEASK
        CRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP  H      QQQQQRCFHATETAAAAA       
Subjt:  CRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH-----PQQQQQRCFHATETAAAAAAAAEASK

Query:  SGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQAL
         GDD+DEDDEEDESEEDE+++++E+IEGTSRG EEE+ETESRKRARKGG A AAAMQQL+AEVIGV+QDGGRS WEKKQWMK RLIQLEEQQV YQSQA 
Subjt:  SGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQAL

Query:  ELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        ELEKQRLKW+KFRSKKERDMERAK+ENEKRRLE ERM LMVKQKELD MDMHHY     QQQHSSNKRGDPS +
Subjt:  ELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

XP_022933836.1 uncharacterized protein LOC111441130 [Cucurbita moschata]1.8e-19882.81Show/hide
Query:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD
        ME+ SLG+ GGGGGG GGGMFSG++SAMLGL+LPL+ +   NP N HQLHH SMVSY P +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGFA +D
Subjt:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEP-------SHHPQQQQQRCFHATETAAAAAAAAE
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP        H  QQQQQRCFHATE+AAAAA    
Subjt:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEP-------SHHPQQQQQRCFHATETAAAAAAAAE

Query:  ASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQS
            GDD+DEDDEE+ESEED E+D++E+IEGTSRG EEE+ETESRKRARKGG  A AAMQQL+AEVIGV+QDGGRS WEKKQWMK RLIQLEEQQV YQS
Subjt:  ASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQS

Query:  QALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        QA ELEKQRLKW+KFRSKKERDMERAK+ENEKRRLE ERMVLMVKQKELD MDMHHY    QQQQHSSNKRGDPS +
Subjt:  QALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

XP_023005783.1 ESF1 homolog [Cucurbita maxima]3.6e-20283.2Show/hide
Query:  MENSSL-----GAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGF
        ME+ SL     G GGGGGG GGGMFSGM+SAMLGL+LPL+ +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGF
Subjt:  MENSSL-----GAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGF

Query:  AADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
        A +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
Subjt:  AADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH------PQQQQQRCFHATETAAAAAA
        GKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP  H       QQQQQRCFHATETAAAAA 
Subjt:  GKGTACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH------PQQQQQRCFHATETAAAAAA

Query:  AAEASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGG--CAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQ
               GDDDDEDDEEDESEEDE+++++E+IEGTSRG EEE+ETESRKRARKGG   AA AAMQQL+AEVIGV+QDGGRS WEKKQWMK RLIQLEEQQ
Subjt:  AAEASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGG--CAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQ

Query:  VKYQSQALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        VKYQSQA ELEKQRLKW+KFRSKKERDMERAK+ENEKRRLE ERMVLMVKQKELD MDMHHY     QQQHSSNKRGDPS +
Subjt:  VKYQSQALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

XP_023539309.1 ESF1 homolog [Cucurbita pepo subsp. pepo]8.8e-20183.93Show/hide
Query:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD
        ME+ SLG  GGGGGG GGGMFSGM+SAMLGL+LPL+ +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGFA +D
Subjt:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHHPQ---QQQQRCFHATETAAAAAAAAEASKS
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP  H Q   QQQQRCFHATETAA AA        
Subjt:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHHPQ---QQQQRCFHATETAAAAAAAAEASKS

Query:  GDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALE
        GDD+DEDDEEDESEEDE+++++E+IEGTSRG EEE+ETESRKRARKGG  A AAMQQL+AEVIGV+QDGGRS WEKKQWMK RLIQLEEQQV YQSQA E
Subjt:  GDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALE

Query:  LEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        LEKQRLKW+KFRSKKERDMERAK+ENEKRRLE ERMVLMVKQKELD MDMHHY     QQQHSSNKRGDPS +
Subjt:  LEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein1.3e-18981.08Show/hide
Query:  GAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKK
        G GGGGG  GGGMFSGM+S+MLGLELPL+QN   NPTNPHQLHHP MVSYV H+PHHHQQPP   VKYP+P K KPQQSNLSDDEEQGFAADD N DGKK
Subjt:  GAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGKK

Query:  KISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVEN
        KISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVEN
Subjt:  KISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVEN

Query:  QTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQR--CFHATETAAAAA-AAAEASKSGDDDDEDD
        QTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH T       HPSPD A EPSH PQQQQQ+  CFHAT+T  +A+ AA E SKSGD+++E++
Subjt:  QTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQR--CFHATETAAAAA-AAAEASKSGDDDDEDD

Query:  EEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALELEKQRLKW
        EE+ESEE+EE   DE+ EG    QEEEEETESRKRARKGG    A MQQLSAEV+GV+ DGGRS WEKKQWMK RLIQLEEQQV +Q+QA ELEKQRLKW
Subjt:  EEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALELEKQRLKW

Query:  VKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        VKFRSKKERDMERAK+ENEKR LENERM+LMVK+ ELDLM M HY  QQQQQQHSSNKRGDPS +
Subjt:  VKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

A0A6J1F055 uncharacterized protein LOC1114411308.8e-19982.81Show/hide
Query:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD
        ME+ SLG+ GGGGGG GGGMFSG++SAMLGL+LPL+ +   NP N HQLHH SMVSY P +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGFA +D
Subjt:  MENSSLGA-GGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADD

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEP-------SHHPQQQQQRCFHATETAAAAAAAAE
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP        H  QQQQQRCFHATE+AAAAA    
Subjt:  ACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEP-------SHHPQQQQQRCFHATETAAAAAAAAE

Query:  ASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQS
            GDD+DEDDEE+ESEED E+D++E+IEGTSRG EEE+ETESRKRARKGG  A AAMQQL+AEVIGV+QDGGRS WEKKQWMK RLIQLEEQQV YQS
Subjt:  ASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQS

Query:  QALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        QA ELEKQRLKW+KFRSKKERDMERAK+ENEKRRLE ERMVLMVKQKELD MDMHHY    QQQQHSSNKRGDPS +
Subjt:  QALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

A0A6J1F2H4 transcription initiation factor TFIID subunit 7-like1.4e-19180.47Show/hide
Query:  MENSSLGAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDG
        ME +SL   GGGGG  GGMF G+SS+MLGLELPL    +QNP+NPHQLHHP MVSYVPHE HH QQPPPA VK PYPAKPKPQQSN+SD+EEQG  ADD 
Subjt:  MENSSLGAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDG

Query:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DHA KKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGN-NGGGAHPSPDTAAEPSHHP-QQQQQRCFHATETAAAAAAAAEASKSGDD
        CRVVEN TLL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHNTCRHG GN +GGGA+ SP+   EPSH P QQQQQRCFHATET A AA A E SKSGD+
Subjt:  CRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGN-NGGGAHPSPDTAAEPSHHP-QQQQQRCFHATETAAAAAAAAEASKSGDD

Query:  DDED-DEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALELE
        ++ED DE+DE+EED+EE ++E+I+G+S+ QEEEEETES+KR RK G    + +QQ+ AEV+GVVQDGGRS WEKKQWMKRRLIQLEEQQV+YQSQ LE+E
Subjt:  DDED-DEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALELE

Query:  KQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        KQRLKW+KFR KKERDMERAK+ENEKRRLENERM+LMVKQKELDL D+H+YQQQQQQQQHSSN+RGDPS +
Subjt:  KQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

A0A6J1J5X4 ribosome quality control complex subunit 2-like7.5e-19080.76Show/hide
Query:  MENSSLGAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDG
        ME +SL   GGGGG  GGMF G+SS+MLGLELPL    +Q P+NPHQLHHP MVSYVPHE HH QQPPPA VK PYPAKPKPQQSN+SD+EEQG  ADD 
Subjt:  MENSSLGAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDG

Query:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DHA KKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHHP-QQQQQRCFHATETAAAAAAAAEASKSGDD
        CRVVEN TLL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHNTCRHG GN+ GGGA+  P+T AEPSH P QQQQQRCFHATET A  A A E SKSGD+
Subjt:  CRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHHP-QQQQQRCFHATETAAAAAAAAEASKSGDD

Query:  DDED-DEEDESEED--EEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALE
        ++ED DE+DESEED  EEE+++E++EG+SR QEEEEETES+KR RK G    A +QQ+SAEV+GVVQDGGRS WEKKQWMKRRLIQLEEQQVKYQSQ LE
Subjt:  DDED-DEEDESEED--EEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALE

Query:  LEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        +EKQRLKW+KFR KKERDMERAK+ENEKRRLENERM+LMVKQKELDL D+H+Y  QQQQQQHSSN+RGDPS +
Subjt:  LEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

A0A6J1L352 ESF1 homolog1.7e-20283.2Show/hide
Query:  MENSSL-----GAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGF
        ME+ SL     G GGGGGG GGGMFSGM+SAMLGL+LPL+ +   NP N HQLHHPSMVSYVP +P   QQPPP  V+YPYPAKPKPQQSNLSDDEEQGF
Subjt:  MENSSL-----GAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGF

Query:  AADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
        A +DGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
Subjt:  AADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH------PQQQQQRCFHATETAAAAAA
        GKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNN GGGAHPSPDTAAEP  H       QQQQQRCFHATETAAAAA 
Subjt:  GKGTACRVVENQTLLDSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNN-GGGAHPSPDTAAEPSHH------PQQQQQRCFHATETAAAAAA

Query:  AAEASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGG--CAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQ
               GDDDDEDDEEDESEEDE+++++E+IEGTSRG EEE+ETESRKRARKGG   AA AAMQQL+AEVIGV+QDGGRS WEKKQWMK RLIQLEEQQ
Subjt:  AAEASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGG--CAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQ

Query:  VKYQSQALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL
        VKYQSQA ELEKQRLKW+KFRSKKERDMERAK+ENEKRRLE ERMVLMVKQKELD MDMHHY     QQQHSSNKRGDPS +
Subjt:  VKYQSQALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors5.0e-5337.42Show/hide
Query:  GGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEP-HHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGN-----CDGKK
        G +  G F    S  +  +  +NQ    NP N   LH     + V  +   HHQ    ++ +     K + +++++SDD+E  F  + G+      +   
Subjt:  GGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEP-HHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGN-----CDGKK

Query:  KISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVEN
        K SPWQR+KWTD MV+LLITAV YIGD+   +    +  ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+C+VVEN
Subjt:  KISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVEN

Query:  QTLLDSM-ELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQRCFHATETAAAAAAAAEASKSGDDDDEDDEE
          LLDS+  L  K K++VRK+++SKHLF+ EMC+YH         NG   H   D A       Q+  Q    + +      +     +  DD+D D + 
Subjt:  QTLLDSM-ELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQRCFHATETAAAAAAAAEASKSGDDDDEDDEE

Query:  DESEEDEEE-----DDDEDIEGTSRG---------QEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQS
        DE +E EE+     D   +  G   G           E+ +  S   + +    +   +    A+V     + GR+   +KQWM+ R +QLEEQ+++ Q 
Subjt:  DESEEDEEE-----DDDEDIEGTSRG---------QEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQS

Query:  QALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKEL
        + LELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  QALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)2.6e-4937.33Show/hide
Query:  QSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND
        + ++S+D+E    + DG  +  K+ SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFND
Subjt:  QSNLSDDEEQGFAADDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND

Query:  LNKRYKRVNDILGKGTACRVVENQTLLDSME-LTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQRCFHATET
        LNKRYK++N++LG+GT+C VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YH         NG   H   D A + S H              
Subjt:  LNKRYKRVNDILGKGTACRVVENQTLLDSME-LTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQRCFHATET

Query:  AAAAAAAAEASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQ---------EEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQ
                     G  DD D++E    ++E+ DDD+D E    G           + +  E      KG         Q      G+  D  ++A  ++Q
Subjt:  AAAAAAAAEASKSGDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQ---------EEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQ

Query:  WMKRRLIQLEEQQVKYQSQALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKEL
         ++ + ++LE ++++ Q++ +ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  WMKRRLIQLEEQQVKYQSQALELEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors1.3e-9650.32Show/hide
Query:  MFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQ-PPPAVVKYPYPAKPKPQQSNLS----DDEEQGFAA-------DDGNCDGKK
        MFSG S  ML LE+P N     NP N  Q  HP      P+     QQ  PP    YPY +KPK Q S +S    DDE++G  +       D    DGK+
Subjt:  MFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQ-PPPAVVKYPYPAKPKPQQSNLS----DDEEQGFAA-------DDGNCDGKK

Query:  KISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKT---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG
        K+S W RMKWTD MVRLLI AVFYIGDE G ++P D   KKKT          G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILG
Subjt:  KISPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKT---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG

Query:  KGTACRVVENQTLLDSME-LTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQRCFHATETAAAAAAAAEASKS
        KG ACRVVENQ LL+SM+ LTPK+K+EV+KLLNSKHLFFREMCAYHN+C H  G++      +P +   PS     QQQ CFHA E    A  A      
Subjt:  KGTACRVVENQTLLDSME-LTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQRCFHATETAAAAAAAAEASKS

Query:  GDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALE
                E  E EE+ E D  ED E      EE EE E+RK+ R      + A+++L  E   VV+D G+S WEKK+W++R+++++EE+++ Y+ + +E
Subjt:  GDDDDEDDEEDESEEDEEEDDDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALE

Query:  LEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPS
        +EKQR+KW+++RSKKER+ME+AK++N++RRLE ERM+LM+++ E++L ++            SS  R DPS
Subjt:  LEKQRLKWVKFRSKKERDMERAKVENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATAGTAGTTTAGGCGCCGGCGGGGGAGGTGGAGGTACGGGTGGAGGGATGTTTTCCGGGATGAGTTCGGCAATGTTGGGATTGGAATTACCGCTTAAC
CAAAACCAAAACCAAAACCCCACGAATCCTCACCAATTGCATCATCCTTCTATGGTGTCTTACGTCCCACATGAGCCCCACCACCACCAGCAACCGCCGCCTGCC
GTCGTGAAATACCCTTATCCGGCGAAGCCCAAGCCGCAGCAGTCGAATCTGAGCGACGACGAGGAACAGGGGTTTGCGGCGGATGACGGCAACTGCGACGGGAAG
AAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGATGAAGGTGGGTCGGAGCCGGCG
GACCACGCCGGCAAGAAGAAAACGGTGGGGCTTCTGCAGAAGAAAGGGAAATGGAAATCGGTGTCGAGAGCGATGATGGAAAAAGGATTCTACGTTTCGCCGCAA
CAATGCGAAGATAAATTCAACGATTTGAACAAAAGATACAAACGAGTTAACGACATTTTGGGAAAGGGAACCGCCTGCAGAGTCGTCGAGAATCAGACCTTATTG
GATTCAATGGAATTAACGCCGAAAATCAAAGAAGAAGTCCGGAAATTACTCAATTCTAAACATCTATTCTTCAGAGAAATGTGCGCTTACCACAACACCTGCCGT
CACGGCACGGGAAACAACGGCGGCGGCGCGCATCCTTCACCAGACACCGCGGCGGAACCATCTCACCATCCACAACAACAGCAGCAACGATGCTTCCACGCAACA
GAGACCGCCGCCGCTGCGGCGGCGGCGGCGGAGGCTTCAAAAAGCGGAGATGACGACGACGAGGACGACGAAGAAGACGAATCGGAGGAGGATGAGGAGGAGGAC
GACGATGAGGATATCGAAGGAACTTCGAGAGGGCAGGAAGAGGAAGAGGAAACGGAATCGAGGAAGCGGGCGAGGAAAGGGGGATGTGCAGCGGCGGCGGCGATG
CAGCAATTGAGTGCGGAGGTGATCGGAGTGGTGCAGGACGGTGGGAGGAGCGCGTGGGAGAAGAAGCAATGGATGAAGAGGCGATTGATTCAGCTTGAAGAGCAG
CAGGTGAAGTACCAATCGCAAGCTTTGGAGCTGGAGAAACAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGGGCGAAGGTGGAG
AACGAGAAGAGGAGGCTGGAGAACGAGAGGATGGTGCTGATGGTGAAGCAGAAGGAGTTGGATTTGATGGATATGCACCATTATCAGCAGCAGCAGCAGCAGCAG
CAGCATTCCTCGAATAAGCGAGGGGATCCATCTCGATTACAGGTTGAAGATGAAGAAGAAGAAGACGAAGAAAGGGGGATTAGGGTTAGGGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATAGTAGTTTAGGCGCCGGCGGGGGAGGTGGAGGTACGGGTGGAGGGATGTTTTCCGGGATGAGTTCGGCAATGTTGGGATTGGAATTACCGCTTAAC
CAAAACCAAAACCAAAACCCCACGAATCCTCACCAATTGCATCATCCTTCTATGGTGTCTTACGTCCCACATGAGCCCCACCACCACCAGCAACCGCCGCCTGCC
GTCGTGAAATACCCTTATCCGGCGAAGCCCAAGCCGCAGCAGTCGAATCTGAGCGACGACGAGGAACAGGGGTTTGCGGCGGATGACGGCAACTGCGACGGGAAG
AAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTCTACATCGGCGATGAAGGTGGGTCGGAGCCGGCG
GACCACGCCGGCAAGAAGAAAACGGTGGGGCTTCTGCAGAAGAAAGGGAAATGGAAATCGGTGTCGAGAGCGATGATGGAAAAAGGATTCTACGTTTCGCCGCAA
CAATGCGAAGATAAATTCAACGATTTGAACAAAAGATACAAACGAGTTAACGACATTTTGGGAAAGGGAACCGCCTGCAGAGTCGTCGAGAATCAGACCTTATTG
GATTCAATGGAATTAACGCCGAAAATCAAAGAAGAAGTCCGGAAATTACTCAATTCTAAACATCTATTCTTCAGAGAAATGTGCGCTTACCACAACACCTGCCGT
CACGGCACGGGAAACAACGGCGGCGGCGCGCATCCTTCACCAGACACCGCGGCGGAACCATCTCACCATCCACAACAACAGCAGCAACGATGCTTCCACGCAACA
GAGACCGCCGCCGCTGCGGCGGCGGCGGCGGAGGCTTCAAAAAGCGGAGATGACGACGACGAGGACGACGAAGAAGACGAATCGGAGGAGGATGAGGAGGAGGAC
GACGATGAGGATATCGAAGGAACTTCGAGAGGGCAGGAAGAGGAAGAGGAAACGGAATCGAGGAAGCGGGCGAGGAAAGGGGGATGTGCAGCGGCGGCGGCGATG
CAGCAATTGAGTGCGGAGGTGATCGGAGTGGTGCAGGACGGTGGGAGGAGCGCGTGGGAGAAGAAGCAATGGATGAAGAGGCGATTGATTCAGCTTGAAGAGCAG
CAGGTGAAGTACCAATCGCAAGCTTTGGAGCTGGAGAAACAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGGGCGAAGGTGGAG
AACGAGAAGAGGAGGCTGGAGAACGAGAGGATGGTGCTGATGGTGAAGCAGAAGGAGTTGGATTTGATGGATATGCACCATTATCAGCAGCAGCAGCAGCAGCAG
CAGCATTCCTCGAATAAGCGAGGGGATCCATCTCGATTACAGGTTGAAGATGAAGAAGAAGAAGACGAAGAAAGGGGGATTAGGGTTAGGGTTTAG
Protein sequenceShow/hide protein sequence
MENSSLGAGGGGGGTGGGMFSGMSSAMLGLELPLNQNQNQNPTNPHQLHHPSMVSYVPHEPHHHQQPPPAVVKYPYPAKPKPQQSNLSDDEEQGFAADDGNCDGK
KKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTLL
DSMELTPKIKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGAHPSPDTAAEPSHHPQQQQQRCFHATETAAAAAAAAEASKSGDDDDEDDEEDESEEDEEED
DDEDIEGTSRGQEEEEETESRKRARKGGCAAAAAMQQLSAEVIGVVQDGGRSAWEKKQWMKRRLIQLEEQQVKYQSQALELEKQRLKWVKFRSKKERDMERAKVE
NEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQQQHSSNKRGDPSRLQVEDEEEEDEERGIRVRV