; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G031840 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G031840
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionsequence-specific DNA binding transcription factors
Genome locationGy14Chr3:31265656..31268168
RNA-Seq ExpressionCsGy3G031840
SyntenyCsGy3G031840
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043934.1 ESF1-like protein [Cucumis melo var. makuwa]5.22e-29198.16Show/hide
Query:  MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEE
        RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSAS+AAGE SKSGDEE+EEEEE+ESEEEEEDEETEGRQEEEEE
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEE

Query:  TESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML
        TESRKRARKGG ITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQ+VSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML
Subjt:  TESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML

Query:  MVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        +VKKNELDLMGMQHYQQQQQ HSSNKRGDPSSITG
Subjt:  MVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

KAE8651071.1 hypothetical protein Csa_001092 [Cucumis sativus]1.23e-302100Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDM
        MFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDM
Subjt:  MFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDM

Query:  MVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
        MVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
Subjt:  MVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT

Query:  KEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQE
        KEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQE
Subjt:  KEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQE

Query:  EEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENE
        EEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENE
Subjt:  EEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENE

Query:  RMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        RMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
Subjt:  RMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

XP_004138003.2 uncharacterized protein DDB_G0290301 [Cucumis sativus]0.0100Show/hide
Query:  MGGFLSTQYHNPLSQNHLPPLLLLSIYYLLSLSLPLTPSNEPAILLFLFNDIQFLYLGFGLLLFLHLDTANRKNWGVILMETNSLGGGGGGGVGGSGGGG
        MGGFLSTQYHNPLSQNHLPPLLLLSIYYLLSLSLPLTPSNEPAILLFLFNDIQFLYLGFGLLLFLHLDTANRKNWGVILMETNSLGGGGGGGVGGSGGGG
Subjt:  MGGFLSTQYHNPLSQNHLPPLLLLSIYYLLSLSLPLTPSNEPAILLFLFNDIQFLYLGFGLLLFLHLDTANRKNWGVILMETNSLGGGGGGGVGGSGGGG

Query:  GGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPW
        GGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPW
Subjt:  GGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPW

Query:  QRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD
        QRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD
Subjt:  QRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD

Query:  SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEED
        SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEED
Subjt:  SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEED

Query:  EETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLEN
        EETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLEN
Subjt:  EETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLEN

Query:  EKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        EKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
Subjt:  EKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

XP_008442785.1 PREDICTED: ESF1 homolog [Cucumis melo]7.17e-30296.82Show/hide
Query:  METNSLGGGGGGGVGGSGGGGGGG----AGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSN
        METNSLGGGG   VGGSGGGGGGG    AGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSN
Subjt:  METNSLGGGGGGGVGGSGGGGGGG----AGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSN

Query:  LSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNK
        LSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNK
Subjt:  LSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNK

Query:  RYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAA
        RYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSAS+AA
Subjt:  RYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAA

Query:  GEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFEL
        GE SKSGDEE+EEEEE+ESEEEEEDEETEGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQ+VSWQTQAFEL
Subjt:  GEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFEL

Query:  EKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        EKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML+VKKNELDLMGMQHYQQQQQ HSSNKRGDPSSITG
Subjt:  EKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]3.59e-27192.24Show/hide
Query:  MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNP NPHQLHHPP+VSYV HDPHHHQQPP VS+KYP+PTK KPQQSN+SDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSAS-IAAGEGSKSGDEEEEEEEEEESEEEEE--DEETEGRQEE
        RKLLNSKHLFFREMCAYHNTCRH TTHPSPD A EPSHLPQQQQQQ+ CFHAT+TTTSA+ +AAGE SKSGDEEEEEEE++ESEEEEE  DEE EGRQEE
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSAS-IAAGEGSKSGDEEEEEEEEEESEEEEE--DEETEGRQEE

Query:  EEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENER
        EEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRLIQLEEQQVS+QTQAFELEKQRLKWVKFRSKKERDMERAKLENEKR LENER
Subjt:  EEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENER

Query:  MMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        MMLMVK+ ELDLM M HYQQQQQQHSSNKRGDPSSITG
Subjt:  MMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein0.0100Show/hide
Query:  METNSLGGGGGGGVGGSGGGGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDD
        METNSLGGGGGGGVGGSGGGGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDD
Subjt:  METNSLGGGGGGGVGGSGGGGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDD

Query:  EEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGS
        VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGS
Subjt:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGS

Query:  KSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQR
        KSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQR
Subjt:  KSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQR

Query:  LKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        LKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
Subjt:  LKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog3.47e-30296.82Show/hide
Query:  METNSLGGGGGGGVGGSGGGGGGG----AGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSN
        METNSLGGGG   VGGSGGGGGGG    AGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSN
Subjt:  METNSLGGGGGGGVGGSGGGGGGG----AGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSN

Query:  LSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNK
        LSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNK
Subjt:  LSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNK

Query:  RYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAA
        RYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSAS+AA
Subjt:  RYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAA

Query:  GEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFEL
        GE SKSGDEE+EEEEE+ESEEEEEDEETEGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQ+VSWQTQAFEL
Subjt:  GEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFEL

Query:  EKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        EKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML+VKKNELDLMGMQHYQQQQQ HSSNKRGDPSSITG
Subjt:  EKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

A0A5A7TKS6 ESF1-like protein2.53e-29198.16Show/hide
Query:  MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEE
        RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSAS+AAGE SKSGDEE+EEEEE+ESEEEEEDEETEGRQEEEEE
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEE

Query:  TESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML
        TESRKRARKGG ITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQ+VSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML
Subjt:  TESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMML

Query:  MVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        +VKKNELDLMGMQHYQQQQQ HSSNKRGDPSSITG
Subjt:  MVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411301.93e-22977.9Show/hide
Query:  GGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKIS
        GGGGG GGGGMFSG+NS+MLGL+LPLH +PTNP N HQLHH  MVSY    P   QQPP  +V+YP+P K KPQQSNLSDDEEQGFA +D N DGKKKIS
Subjt:  GGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKIS

Query:  PWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTL
        PWQRMKWTDMMVRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQTL
Subjt:  PWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTL

Query:  LDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST-------THPSPDAATEP-SHLPQ----QQQQQQLCFHATDTTTSASIAAGEGSKSGDEEE
        LDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST        HPSPD A EP SHL Q    QQQQQQ CFHAT++  +A+    +G    D+E+
Subjt:  LDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST-------THPSPDAATEP-SHLPQ----QQQQQQLCFHATDTTTSASIAAGEGSKSGDEEE

Query:  EEEEEEESEEEEEDEE-----TEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLK
        E++EEEESEE+E+DEE     T    EEE+ETESRKRARKGG+  A MQQL+AEV+GV+ DGGRSPWEKKQWMKSRLIQLEEQQV++Q+QAFELEKQRLK
Subjt:  EEEEEEESEEEEEDEE-----TEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLK

Query:  WVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        W+KFRSKKERDMERAKLENEKR LE ERM+LMVK+ ELD M M HYQQQQ  HSSNKRGDPSSITG
Subjt:  WVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog6.85e-22976.91Show/hide
Query:  METNSLGGGGGGGVGGSGGGGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDD
        ME++SLGG  GGG      GGGGG GGGGMFSGMNS+MLGL+LPLH +PTNP N HQLHHP MVSYV   P   QQPP  +V+YP+P K KPQQSNLSDD
Subjt:  METNSLGGGGGGGVGGSGGGGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDD

Query:  EEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQGFA +D N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST-------THPSPDAATEP-SHLPQ---QQQQQQLCFHATDTT
        VNDILGKGTAC+VVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST        HPSPD A EP SHL Q   QQQQQQ CFHAT+T 
Subjt:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST-------THPSPDAATEP-SHLPQ---QQQQQQLCFHATDTT

Query:  TSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQ---EEEEETESRKRARKGGMITAG---MQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLE
         +A+    +G    DE++EE+E EE E++EE+EE EG     EEE+ETESRKRARKGG+  A    MQQL+AEV+GV+ DGGRS WEKKQWMKSRLIQLE
Subjt:  TSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQ---EEEEETESRKRARKGGMITAG---MQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLE

Query:  EQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        EQQV +Q+QAFELEKQRLKW+KFRSKKERDMERAKLENEKR LE ERM+LMVK+ ELD M M HYQQQ   HSSNKRGDPSSITG
Subjt:  EQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors5.3e-5137.18Show/hide
Query:  GGGGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHH--PPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNG---
        GG    GA   G F    S  +  +  ++Q   +  N   LH   P  +   Q   HH  Q  S+S +     KA+ +++++SDD+E  F  +  +G   
Subjt:  GGGGGGGAGGGGMFSGMNSSMLGLELPLHQNPTNPTNPHQLHH--PPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNG---

Query:  --DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
          +   K SPWQR+KWTD MV+LLITAV YIGD+     +D   ++K   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+
Subjt:  --DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CKVVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEE
        C+VVEN  LLDS+  L  K K++VRK+++SKHLF+ EMC+YHN  R               HLP     Q+          S  +A     +S D+ + +
Subjt:  CKVVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEE

Query:  EEEEESEEEEEDEETEGRQEEEEETESRKRA----------RKGGMITAGMQQLS---------------------------AEVMGVISDGGRSPWEKK
        +  +   E+ +DE+ +G  +E +E E +  A            GG +      LS                           A+V    ++ GR+   +K
Subjt:  EEEEESEEEEEDEETEGRQEEEEETESRKRA----------RKGGMITAGMQQLS---------------------------AEVMGVISDGGRSPWEKK

Query:  QWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNEL
        QWM+SR +QLEEQ++  Q +  ELEKQR +W +F  K+++++ER ++ENE+  LEN+RM L +K+ EL
Subjt:  QWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)3.9e-4635.45Show/hide
Query:  QHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGK
        QH P+  Q     +            + ++S+D+E    + D     K+  SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGK
Subjt:  QHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPVDHVGKKKPVGLLQKKGK

Query:  WKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAA
        W+SVS+ M E+G++VSPQQCEDKFNDLNKRYK++N++LG+GT+C+VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN  R           
Subjt:  WKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAA

Query:  TEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEED----EETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVIS
            HLP     Q+    +    T  S    +  + G  + E+ ++++  EE+ D    +    R  + +  E      KG  +    +  +    G+  
Subjt:  TEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEED----EETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVIS

Query:  DGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNEL
        D  ++   ++Q ++S+ ++LE +++  Q +  ELE+Q+ KW  F  ++E+ + + ++ENE+  LENERM L +K+ EL
Subjt:  DGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNEL

AT3G10040.1 sequence-specific DNA binding transcription factors1.1e-9649.67Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLS----DDEEQGFAA-------DDSNGDGKKKI
        MFSG +  ML LE+P  QNP NP N  Q  HP    +        Q  P +   YP+ +K K Q S +S    DDE++G  +       D +  DGK+K+
Subjt:  MFSGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLS----DDEEQGFAA-------DDSNGDGKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPVDHVGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG
        S W RMKWTD MVRLLI AVFYIGDE G ++PVD   KKK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPVDHVGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG

Query:  TACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEE
         AC+VVENQ LL+SM+ LTPK K+EV+KLLNSKHLFFREMCAYHN+C H   H        P  +P   QQQ  CFHA +    A IA          E 
Subjt:  TACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEE

Query:  EEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFR
         E EEE   +  ED E+E  + EEEET  ++R      I+  +++L  E   V+ D G+S WEKK+W++ +++++EE+++ ++ +  E+EKQR+KW+++R
Subjt:  EEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVISDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFR

Query:  SKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG
        SKKER+ME+AKL+N++R LE ERM+LM++++E++L  +Q         SS  R DPSS  G
Subjt:  SKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGGTTTTCTGTCAACCCAATACCACAATCCACTCTCCCAAAACCACCTCCCCCCTCTTCTTCTTCTATCTATATACTATCTTCTCTCTCTCTCTCTCCCCCTCAC
TCCATCCAACGAACCCGCAATTCTTCTCTTTCTCTTCAACGACATTCAATTTCTCTATCTGGGTTTTGGGTTGCTTCTATTTCTTCATTTAGATACGGCTAATCGAAAGA
ATTGGGGGGTAATTTTAATGGAAACCAATAGTTTAGGCGGCGGCGGCGGCGGCGGTGTGGGTGGGAGTGGTGGAGGAGGAGGAGGAGGAGCTGGAGGAGGAGGGATGTTT
TCAGGCATGAATTCGTCAATGCTGGGATTGGAATTACCACTTCATCAAAACCCAACAAATCCCACTAATCCTCACCAATTACATCACCCTCCAATGGTATCTTATGTCCA
ACATGACCCCCATCATCACCAACAACCGCCGTCGGTCTCCGTGAAATACCCTTTTCCGACAAAGGCTAAGCCACAGCAGTCGAATCTCAGTGACGACGAGGAGCAGGGGT
TCGCGGCGGATGACAGTAATGGAGATGGCAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGATATGATGGTTCGGCTGTTGATTACGGCGGTGTTCTATATC
GGCGATGAAGGTGGGTCGGAGCCGGTGGACCATGTCGGGAAGAAAAAACCAGTGGGGTTACTGCAAAAGAAGGGGAAATGGAAATCGGTATCCAGAGCAATGATGGAAAA
AGGATTCTACGTTTCACCACAGCAATGTGAAGACAAATTCAACGATTTAAACAAAAGATACAAACGAGTTAACGATATTTTAGGGAAGGGAACAGCCTGCAAAGTAGTAG
AGAATCAAACATTACTCGATTCAATGGAATTAACACCAAAAACAAAAGAAGAAGTACGAAAATTACTCAATTCTAAACATCTTTTCTTCAGAGAAATGTGCGCTTACCAC
AACACTTGCCGTCACAGCACCACCCACCCCTCGCCGGACGCAGCAACAGAACCATCCCACCTTCCACAACAACAACAGCAACAGCAACTATGCTTCCACGCCACAGACAC
GACAACCTCCGCCAGTATAGCAGCCGGCGAGGGTTCAAAAAGCGGAGATGAGGAGGAAGAGGAGGAGGAAGAGGAGGAATCGGAAGAGGAGGAAGAGGATGAGGAAACAG
AAGGGAGACAGGAAGAAGAGGAGGAAACGGAATCGAGGAAGAGGGCGAGGAAAGGGGGGATGATAACGGCTGGGATGCAGCAGTTGAGTGCGGAGGTGATGGGAGTTATA
TCGGACGGGGGGAGGAGTCCATGGGAGAAGAAGCAATGGATGAAGAGCCGACTGATTCAGCTTGAAGAGCAGCAAGTGAGCTGGCAAACGCAGGCATTCGAGCTGGAGAA
ACAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGAGCGAAGCTGGAGAATGAGAAGAGAATGCTGGAAAACGAGAGGATGATGCTGATGG
TAAAGAAGAACGAGTTGGATTTGATGGGTATGCAACATTATCAGCAGCAACAGCAGCAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGATTACAGGATGA
mRNA sequenceShow/hide mRNA sequence
TTAATTAAATAGGATTGTTTTTTCAAAATTTAAGATTTATTTAAGTAGAAAGTTTTAAAATTAATTGTAATGATAGAATAAGCCAATATTTTGAATTTTTGTATTTACCT
CTATAAGATTGTTTTATTAAAGATCTGTTACTATTTAGAAAGACATAGTTAAATGTGATTTGAAACTCCAATCTCAAACTTGAAATTTACTTTGTATTTTTATATAAAAA
AGCAAGACCGATTAGGTAAAACTTTCTATAAGTTCTGAATTTTGTTTAAATTTAGTCAAATTTCAGTTATCAAAATACTCCCTGTTTTGTACCCTCCACCACTGACACCT
AATTAAAACTTGTACCCAAAATAATTTATTAATGTATACAGAAATCTAAAAAGACAAAGATATTTTAGGAAAGGAAAATATACTTAAATCAAAACCCCCACTAAACTTTA
TAAACCCTATAAAAGGAAAAAGAGTTGACTGAGTAAGAAAGAAGCAAAAAGAAAAAAGAAAAGAGTATGTGGGTTTCGAGAAACGTACGTGGAATAGCAATGTGTCTAAT
GGCGAAGCGATTTCTGAAAAACCAACAACAAATCACTAAAACCCATTTTCCCTTCTTCACTCCCCAGTTTTCGTCTTAATCCTGTGGGTTTCCAGTTTTTCCCTCCCAAA
ACCATGGGCGGTTTTCTGTCAACCCAATACCACAATCCACTCTCCCAAAACCACCTCCCCCCTCTTCTTCTTCTATCTATATACTATCTTCTCTCTCTCTCTCTCCCCCT
CACTCCATCCAACGAACCCGCAATTCTTCTCTTTCTCTTCAACGACATTCAATTTCTCTATCTGGGTTTTGGGTTGCTTCTATTTCTTCATTTAGATACGGCTAATCGAA
AGAATTGGGGGGTAATTTTAATGGAAACCAATAGTTTAGGCGGCGGCGGCGGCGGCGGTGTGGGTGGGAGTGGTGGAGGAGGAGGAGGAGGAGCTGGAGGAGGAGGGATG
TTTTCAGGCATGAATTCGTCAATGCTGGGATTGGAATTACCACTTCATCAAAACCCAACAAATCCCACTAATCCTCACCAATTACATCACCCTCCAATGGTATCTTATGT
CCAACATGACCCCCATCATCACCAACAACCGCCGTCGGTCTCCGTGAAATACCCTTTTCCGACAAAGGCTAAGCCACAGCAGTCGAATCTCAGTGACGACGAGGAGCAGG
GGTTCGCGGCGGATGACAGTAATGGAGATGGCAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGATATGATGGTTCGGCTGTTGATTACGGCGGTGTTCTAT
ATCGGCGATGAAGGTGGGTCGGAGCCGGTGGACCATGTCGGGAAGAAAAAACCAGTGGGGTTACTGCAAAAGAAGGGGAAATGGAAATCGGTATCCAGAGCAATGATGGA
AAAAGGATTCTACGTTTCACCACAGCAATGTGAAGACAAATTCAACGATTTAAACAAAAGATACAAACGAGTTAACGATATTTTAGGGAAGGGAACAGCCTGCAAAGTAG
TAGAGAATCAAACATTACTCGATTCAATGGAATTAACACCAAAAACAAAAGAAGAAGTACGAAAATTACTCAATTCTAAACATCTTTTCTTCAGAGAAATGTGCGCTTAC
CACAACACTTGCCGTCACAGCACCACCCACCCCTCGCCGGACGCAGCAACAGAACCATCCCACCTTCCACAACAACAACAGCAACAGCAACTATGCTTCCACGCCACAGA
CACGACAACCTCCGCCAGTATAGCAGCCGGCGAGGGTTCAAAAAGCGGAGATGAGGAGGAAGAGGAGGAGGAAGAGGAGGAATCGGAAGAGGAGGAAGAGGATGAGGAAA
CAGAAGGGAGACAGGAAGAAGAGGAGGAAACGGAATCGAGGAAGAGGGCGAGGAAAGGGGGGATGATAACGGCTGGGATGCAGCAGTTGAGTGCGGAGGTGATGGGAGTT
ATATCGGACGGGGGGAGGAGTCCATGGGAGAAGAAGCAATGGATGAAGAGCCGACTGATTCAGCTTGAAGAGCAGCAAGTGAGCTGGCAAACGCAGGCATTCGAGCTGGA
GAAACAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGAGCGAAGCTGGAGAATGAGAAGAGAATGCTGGAAAACGAGAGGATGATGCTGA
TGGTAAAGAAGAACGAGTTGGATTTGATGGGTATGCAACATTATCAGCAGCAACAGCAGCAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGATTACAGGATGAAGA
AGAAAGGGATTGGGTTTAGGGTTTAGAAACTGTGGGGATTATAATATAATTTAGAAGAAGTGAAACTCCCGTTTGGTTAAATTTAATTTTTATTATTGACTAACATTTTT
GTCCTATACATGTCGGATGCATTTTGAATAACCCTCACCCTTCCTTCCCCATCCTATAATCTTCTTTTCATTCACTTATCATATTCTAAAACC
Protein sequenceShow/hide protein sequence
MGGFLSTQYHNPLSQNHLPPLLLLSIYYLLSLSLPLTPSNEPAILLFLFNDIQFLYLGFGLLLFLHLDTANRKNWGVILMETNSLGGGGGGGVGGSGGGGGGGAGGGGMF
SGMNSSMLGLELPLHQNPTNPTNPHQLHHPPMVSYVQHDPHHHQQPPSVSVKYPFPTKAKPQQSNLSDDEEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYI
GDEGGSEPVDHVGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYH
NTCRHSTTHPSPDAATEPSHLPQQQQQQQLCFHATDTTTSASIAAGEGSKSGDEEEEEEEEEESEEEEEDEETEGRQEEEEETESRKRARKGGMITAGMQQLSAEVMGVI
SDGGRSPWEKKQWMKSRLIQLEEQQVSWQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRMLENERMMLMVKKNELDLMGMQHYQQQQQQHSSNKRGDPSSITG