; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G204840 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G204840
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionsequence-specific DNA binding transcription factors
Genome locationCicolChr10:32114667..32121610
RNA-Seq ExpressionCcUC10G204840
SyntenyCcUC10G204840
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043934.1 ESF1-like protein [Cucumis melo var. makuwa]5.4e-20889.59Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+P HHQQPP V+VKYP+PTK KPQQSNLSDD+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEI
        RKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A+VAAG+  KSGDE+DEEEEEDES   EE+EEDEE 
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEI

Query:  EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRR
        EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQ+VS+QTQAFELEKQRLKWVKFRSKKERDMERAKLENEKR 
Subjt:  EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRR

Query:  LENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG
        LENERMML+VK+ ELDLMGM HYQQQQQHSSNKRGDPSSITG
Subjt:  LENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG

KAE8651071.1 hypothetical protein Csa_001092 [Cucumis sativus]2.4e-20889.29Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDM
        MFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+P HHQQPP V+VKYP+PTK KPQQSNLSDD+EQGFAADDSNGDGKKKISPWQRMKWTDM
Subjt:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDM

Query:  MVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
        MVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
Subjt:  MVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT

Query:  KEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEE
        KEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A++AAG+G KSGDE++EEEEE+ES   EE+EE
Subjt:  KEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEE

Query:  DEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLE
        DEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQQVS+QTQAFELEKQRLKWVKFRSKKERDMERAKLE
Subjt:  DEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLE

Query:  NEKRRLENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG
        NEKR LENERMMLMVK+ ELDLMGM HY QQQQQHSSNKRGDPSSITG
Subjt:  NEKRRLENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG

XP_004138003.2 uncharacterized protein DDB_G0290301 [Cucumis sativus]3.7e-23383.45Show/hide
Query:  MGGFLSTQYHNPLSQNQLPP--SSSIY------------------------IEFLFL--GLVLFLDLETANRKNWGVVLMESSSLGGGGG---GGGGGGG
        MGGFLSTQYHNPLSQN LPP    SIY                        I+FL+L  GL+LFL L+TANRKNWGV+LME++SLGGGGG   GG GGGG
Subjt:  MGGFLSTQYHNPLSQNQLPP--SSSIY------------------------IEFLFL--GLVLFLDLETANRKNWGVVLMESSSLGGGGG---GGGGGGG

Query:  GGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPW
        GGG GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+P HHQQPP V+VKYP+PTK KPQQSNLSDD+EQGFAADDSNGDGKKKISPW
Subjt:  GGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPW

Query:  QRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD
        QRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD
Subjt:  QRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD

Query:  SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDES
        SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A++AAG+G KSGDE++EEEEE+ES
Subjt:  SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDES

Query:  DEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKER
           EE+EEDEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQQVS+QTQAFELEKQRLKWVKFRSKKER
Subjt:  DEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKER

Query:  DMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG
        DMERAKLENEKR LENERMMLMVK+ ELDLMGM HY QQQQQHSSNKRGDPSSITG
Subjt:  DMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG

XP_008442785.1 PREDICTED: ESF1 homolog [Cucumis melo]1.0e-21488.66Show/hide
Query:  MESSSLGGGGGGGGGGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSD
        ME++SLGGGG GG GGGGGGGG    GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+P HHQQPP V+VKYP+PTK KPQQSNLSD
Subjt:  MESSSLGGGGGGGGGGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSD

Query:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        D+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA
        RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A+
Subjt:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA

Query:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQT
        VAAG+  KSGDE+DEEEEEDES   EE+EEDEE EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQ+VS+QT
Subjt:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQT

Query:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG
        QAFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMML+VK+ ELDLMGM HYQQQQQHSSNKRGDPSSITG
Subjt:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]1.9e-21693.44Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNPPNPHQLHHP IVSYV H+P HHQQPPPV++KYPYPTKPKPQQSN+SDD+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEPADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIE
        RKLLNSKHLFFREMCAYHNTCRH T+    HPSPD AAEPSHLP QQQQQRCFHATETTT SAAAVAAG+  KSGDE++EEEE+DES EEEE+EEDEEIE
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIE

Query:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL
        GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRL+QLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL
Subjt:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL

Query:  ENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG
        ENERMMLMVKQKELDLM MHHY QQQQQHSSNKRGDPSSITG
Subjt:  ENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein1.2e-21388.47Show/hide
Query:  MESSSLGGGGG---GGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDD
        ME++SLGGGGG   GG GGGGGGG GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+P HHQQPP V+VKYP+PTK KPQQSNLSDD
Subjt:  MESSSLGGGGG---GGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDD

Query:  DEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        +EQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  DEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAV
        VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A++
Subjt:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAV

Query:  AAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQT
        AAG+G KSGDE++EEEEE+ES   EE+EEDEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQQVS+QT
Subjt:  AAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQT

Query:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG
        QAFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMMLMVK+ ELDLMGM HY QQQQQHSSNKRGDPSSITG
Subjt:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QQQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog5.0e-21588.66Show/hide
Query:  MESSSLGGGGGGGGGGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSD
        ME++SLGGGG GG GGGGGGGG    GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+P HHQQPP V+VKYP+PTK KPQQSNLSD
Subjt:  MESSSLGGGGGGGGGGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSD

Query:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        D+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA
        RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A+
Subjt:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA

Query:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQT
        VAAG+  KSGDE+DEEEEEDES   EE+EEDEE EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQ+VS+QT
Subjt:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQT

Query:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG
        QAFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMML+VK+ ELDLMGM HYQQQQQHSSNKRGDPSSITG
Subjt:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG

A0A5A7TKS6 ESF1-like protein2.6e-20889.59Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+P HHQQPP V+VKYP+PTK KPQQSNLSDD+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEP-HHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEP DH GKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEI
        RKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A+VAAG+  KSGDE+DEEEEEDES   EE+EEDEE 
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEI

Query:  EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRR
        EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQ+VS+QTQAFELEKQRLKWVKFRSKKERDMERAKLENEKR 
Subjt:  EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRR

Query:  LENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG
        LENERMML+VK+ ELDLMGM HYQQQQQHSSNKRGDPSSITG
Subjt:  LENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411301.2e-19280.37Show/hide
Query:  MESSSLGGGGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPH-HQQPPPVAVKYPYPTKPKPQQSNLSDDDEQ
        MES SL       G  GGGGGGGGGGMFSG+NS+MLGL+LPLH +PTNPPN HQLHH  +VSY P +P   QQPPP AV+YPYP KPKPQQSNLSDD+EQ
Subjt:  MESSSLGGGGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPH-HQQPPPVAVKYPYPTKPKPQQSNLSDDDEQ

Query:  GFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND
        GFA +D N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND
Subjt:  GFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND

Query:  ILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAEP-------SHLPQQQQQRCFHATETTTS
        ILGKGTAC+VVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST N+   GAHPSPD AAEP        H  QQQQQRCFHATE   S
Subjt:  ILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAEP-------SHLPQQQQQRCFHATETTTS

Query:  SAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI-TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLE
        +AAA   G     GD++DE++EE+ES EE+ED+E+EEIEG     EEE+ETESRKRARKGGI  A MQQL+AEV+GV+QDGGRSPWEKKQWMKSRL+QLE
Subjt:  SAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI-TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLE

Query:  EQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG
        EQQV+YQ+QAFELEKQRLKW+KFRSKKERDMERAKLENEKRRLE ERM+LMVKQKELD M MHHY QQQQHSSNKRGDPSSITG
Subjt:  EQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog8.8e-19681.89Show/hide
Query:  MESSSLGGGGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPH-HQQPPPVAVKYPYPTKPKPQQSNLSDDDEQ
        MES SL   GG  GGGGGGGGGGGGGMFSGMNS+MLGL+LPLH +PTNPPN HQLHHP +VSYVP +P   QQPPP AV+YPYP KPKPQQSNLSDD+EQ
Subjt:  MESSSLGGGGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPH-HQQPPPVAVKYPYPTKPKPQQSNLSDDDEQ

Query:  GFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND
        GFA +D N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKK VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND
Subjt:  GFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVND

Query:  ILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAE-PSHL-----PQQQQQRCFHATETTTSS
        ILGKGTAC+VVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST N+   GAHPSPD AAE PSHL      QQQQQRCFHATE  T++
Subjt:  ILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAE-PSHL-----PQQQQQRCFHATETTTSS

Query:  AAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI----TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQ
        AAA    D    GD+DDE++EEDES+E+E+DEE+EEIEG     EEE+ETESRKRARKGGI     A MQQL+AEV+GV+QDGGRS WEKKQWMKSRL+Q
Subjt:  AAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI----TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQ

Query:  LEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG
        LEEQQV YQ+QAFELEKQRLKW+KFRSKKERDMERAKLENEKRRLE ERM+LMVKQKELD M MHHY  QQQHSSNKRGDPSSITG
Subjt:  LEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors4.2e-4936.7Show/hide
Query:  GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHE--PHHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNG----
        GG    G    G F    S  +  +  ++Q   + PN   LH     + V  +   HHQ      +      K + +++++SDDDE  F  +  +G    
Subjt:  GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHE--PHHQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNG----

Query:  -DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
         +   K SPWQR+KWTD MV+LLITAV YIGD+   +    +  ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+C
Subjt:  -DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC

Query:  KVVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQRCFHATETTTSSAAAVAAGDGLKSGDED
        +VVEN  LLDS+  L  K K++VRK+++SKHLF+ EMC+YHN  R                   HLP        H      S   A+ + D     D+ 
Subjt:  KVVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQRCFHATETTTSSAAAVAAGDGLKSGDED

Query:  DEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG---------------------------ITAGMQQLSAEVMGVVQDG---GRSPWEKKQW
         + + ED  DE+ + + DE  E  ++     + R     GG                               + Q+      V Q G   GR+   +KQW
Subjt:  DEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG---------------------------ITAGMQQLSAEVMGVVQDG---GRSPWEKKQW

Query:  MKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL
        M+SR LQLEEQ++  Q +  ELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  MKSRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)4.2e-4937.36Show/hide
Query:  QSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND
        + ++S+DDE    + D     K+  SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFND
Subjt:  QSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND

Query:  LNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQ-QRCFHATETT
        LNKRYK++N++LG+GT+C+VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN  R                   HLP     QR  H     
Subjt:  LNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQ-QRCFHATETT

Query:  TSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEG--------RQEEEEETESRKRARKGGITAGMQQLSAEV-MGVVQDGGRSPWEKKQWMK
                   G +   ++DE  +    D +++D+ +E+ +G        R  + +  E      KG     + +  A+V  G+  D  ++   ++Q ++
Subjt:  TSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEG--------RQEEEEETESRKRARKGGITAGMQQLSAEV-MGVVQDGGRSPWEKKQWMK

Query:  SRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL
        S+ L+LE +++  Q +  ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  SRLLQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors3.8e-9850.86Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQPPPVAVKYPYPTKPKPQQSNLS----DDDEQGFAA-------DDSNGDGKKKIS
        MFSG +  ML LE+P  QNP NP N  Q  HP   +        Q  PP+   YPY +KPK Q S +S    DD+++G  +       D +  DGK+K+S
Subjt:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQPPPVAVKYPYPTKPKPQQSNLS----DDDEQGFAA-------DDSNGDGKKKIS

Query:  PWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKT---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
         W RMKWTD MVRLLI AVFYIGDE G ++P D   KKKT          G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG 
Subjt:  PWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKT---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKSG
        AC+VVENQ LL+SM+ LTPK K+EV+KLLNSKHLFFREMCAYHN+C H     G H        P  +P   QQQ CFHA E    +  A          
Subjt:  ACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKSG

Query:  DEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRL
         E  E EEE ESD  E+ E +       EE EE E+RK+ R   I+  +++L  E   VV+D G+S WEKK+W++ ++L++EE+++ Y+ +  E+EKQR+
Subjt:  DEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQTQAFELEKQRL

Query:  KWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG
        KW+++RSKKER+ME+AKL+N++RRLE ERM+LM+++ E++L         +  SS  R DPSS  G
Subjt:  KWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGAATTAGCAATGCATTTATGTGGGTTGTTTATAATTAGAAAAGAAAATAAAATCAATTTCCCCATATTCACGGTTAAATTCTCAAAATACTCACATATT
ACACGCTATAATGTATTTAAAGTGTACATTAGAAATATAAGACACAGTACGATTTCTTTGGTGAAGCCTGCTATGACAAAAAACAAGGACAAACTCCCCGTAAAG
TTTTCTTCTAAATCCTGTGGGTTTCCAGTTTTTCCTCTAACACTGTTTCCTCCCAAAACCATGGGCGGTTTTCTGTCAACCCAATACCACAATCCGCTCTCCCAA
AACCAGCTGCCCCCTTCTTCTTCTATCTATATAGAGTTTCTGTTTCTGGGTTTGGTTCTATTTCTAGATTTAGAAACGGCTAATCGAAAGAATTGGGGGGTGGTT
TTAATGGAATCCAGTAGTTTAGGCGGCGGGGGTGGGGGTGGGGGTGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGGATGTTTTCAGGCATGAATTCATCA
ATGTTGGGATTGGAATTACCACTTCACCAAAACCCAACAAATCCACCAAATCCTCACCAACTACATCACCCTCAAATCGTGTCTTATGTCCCACACGAGCCCCAC
CACCAACAACCGCCGCCGGTCGCCGTGAAATACCCCTATCCGACGAAGCCTAAACCGCAGCAGTCGAATCTCAGTGATGACGACGAGCAGGGGTTCGCGGCGGAT
GACAGCAATGGAGATGGCAAGAAGAAAATATCGCCGTGGCAGAGGATGAAATGGACGGATATGATGGTTAGGTTGCTGATTACGGCGGTGTTCTACATCGGTGAT
GAAGGTGGATCGGAGCCGGCAGATCACGCCGGCAAGAAAAAAACAGTGGGGTTACTGCAAAAGAAGGGAAAATGGAAATCGGTATCCAGAGCAATGATGGAAAAA
GGATTCTACGTATCACCACAGCAATGCGAAGATAAATTCAACGATTTAAACAAAAGATATAAACGAGTTAACGATATCTTGGGGAAAGGCACCGCCTGCAAAGTC
GTCGAGAATCAAACATTACTGGATTCAATGGAATTAACACCGAAAACGAAAGAAGAAGTACGAAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGC
GCTTACCACAACACTTGCCGTCACAGCACCAGCAACAGCGGCGCCCACCCCTCGCCGGACGCGGCAGCGGAACCATCACACCTTCCACAACAACAACAGCAACGA
TGCTTCCACGCAACAGAGACCACAACCTCCTCCGCCGCCGCCGTAGCGGCCGGCGATGGTTTGAAAAGCGGAGATGAGGATGATGAAGAGGAGGAAGAAGACGAA
TCGGATGAGGAGGAGGAAGATGAGGAGGATGAGGAAATTGAAGGACGGCAAGAGGAGGAGGAGGAAACGGAATCGAGGAAGAGAGCGAGGAAAGGGGGGATTACG
GCGGGAATGCAGCAGTTGAGCGCGGAGGTGATGGGAGTAGTGCAGGACGGCGGGAGGAGTCCATGGGAGAAGAAGCAATGGATGAAGAGCCGGTTGCTCCAGCTT
GAAGAACAGCAAGTGAGCTACCAAACGCAAGCTTTCGAGCTGGAGAAGCAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGGGCG
AAGCTGGAGAATGAGAAGAGAAGGCTGGAAAATGAGAGGATGATGCTGATGGTGAAGCAGAAGGAATTGGATTTAATGGGTATGCACCATTATCAGCAGCAGCAG
CAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGATTACAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGATCGAATTAGCAATGCATTTATGTGGGTTGTTTATAATTAGAAAAGAAAATAAAATCAATTTCCCCATATTCACGGTTAAATTCTCAAAATACTCACATATT
ACACGCTATAATGTATTTAAAGTGTACATTAGAAATATAAGACACAGTACGATTTCTTTGGTGAAGCCTGCTATGACAAAAAACAAGGACAAACTCCCCGTAAAG
TTTTCTTCTAAATCCTGTGGGTTTCCAGTTTTTCCTCTAACACTGTTTCCTCCCAAAACCATGGGCGGTTTTCTGTCAACCCAATACCACAATCCGCTCTCCCAA
AACCAGCTGCCCCCTTCTTCTTCTATCTATATAGAGTTTCTGTTTCTGGGTTTGGTTCTATTTCTAGATTTAGAAACGGCTAATCGAAAGAATTGGGGGGTGGTT
TTAATGGAATCCAGTAGTTTAGGCGGCGGGGGTGGGGGTGGGGGTGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGGATGTTTTCAGGCATGAATTCATCA
ATGTTGGGATTGGAATTACCACTTCACCAAAACCCAACAAATCCACCAAATCCTCACCAACTACATCACCCTCAAATCGTGTCTTATGTCCCACACGAGCCCCAC
CACCAACAACCGCCGCCGGTCGCCGTGAAATACCCCTATCCGACGAAGCCTAAACCGCAGCAGTCGAATCTCAGTGATGACGACGAGCAGGGGTTCGCGGCGGAT
GACAGCAATGGAGATGGCAAGAAGAAAATATCGCCGTGGCAGAGGATGAAATGGACGGATATGATGGTTAGGTTGCTGATTACGGCGGTGTTCTACATCGGTGAT
GAAGGTGGATCGGAGCCGGCAGATCACGCCGGCAAGAAAAAAACAGTGGGGTTACTGCAAAAGAAGGGAAAATGGAAATCGGTATCCAGAGCAATGATGGAAAAA
GGATTCTACGTATCACCACAGCAATGCGAAGATAAATTCAACGATTTAAACAAAAGATATAAACGAGTTAACGATATCTTGGGGAAAGGCACCGCCTGCAAAGTC
GTCGAGAATCAAACATTACTGGATTCAATGGAATTAACACCGAAAACGAAAGAAGAAGTACGAAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGC
GCTTACCACAACACTTGCCGTCACAGCACCAGCAACAGCGGCGCCCACCCCTCGCCGGACGCGGCAGCGGAACCATCACACCTTCCACAACAACAACAGCAACGA
TGCTTCCACGCAACAGAGACCACAACCTCCTCCGCCGCCGCCGTAGCGGCCGGCGATGGTTTGAAAAGCGGAGATGAGGATGATGAAGAGGAGGAAGAAGACGAA
TCGGATGAGGAGGAGGAAGATGAGGAGGATGAGGAAATTGAAGGACGGCAAGAGGAGGAGGAGGAAACGGAATCGAGGAAGAGAGCGAGGAAAGGGGGGATTACG
GCGGGAATGCAGCAGTTGAGCGCGGAGGTGATGGGAGTAGTGCAGGACGGCGGGAGGAGTCCATGGGAGAAGAAGCAATGGATGAAGAGCCGGTTGCTCCAGCTT
GAAGAACAGCAAGTGAGCTACCAAACGCAAGCTTTCGAGCTGGAGAAGCAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGGGCG
AAGCTGGAGAATGAGAAGAGAAGGCTGGAAAATGAGAGGATGATGCTGATGGTGAAGCAGAAGGAATTGGATTTAATGGGTATGCACCATTATCAGCAGCAGCAG
CAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGATTACAGGATGA
Protein sequenceShow/hide protein sequence
MIELAMHLCGLFIIRKENKINFPIFTVKFSKYSHITRYNVFKVYIRNIRHSTISLVKPAMTKNKDKLPVKFSSKSCGFPVFPLTLFPPKTMGGFLSTQYHNPLSQ
NQLPPSSSIYIEFLFLGLVLFLDLETANRKNWGVVLMESSSLGGGGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPH
HQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKTVGLLQKKGKWKSVSRAMMEK
GFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR
CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQL
EEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQHSSNKRGDPSSITG