; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020169 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020169
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSequence-specific DNA binding transcription factor
Genome locationChr04:29479247..29480668
RNA-Seq ExpressionHG10020169
SyntenyHG10020169
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043934.1 ESF1-like protein [Cucumis melo var. makuwa]6.5e-20989.59Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNP NPHQLHHPPMVSYV H+PHHHQQPP V+VKYP+PTK KPQQSNLSDDEEQGFA DDSNGDGKKK+SPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIE
        RKLLNSKHLFFREMCAYHNTCRH T+    HPSPDA  EPSHLPQQQQQ+  CFH T+TTTS A+VAAG+  KSGDEE+EEEE+DESEEE   EEDEE E
Subjt:  RKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIE

Query:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL
        GRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQW+KSRLIQLEEQ+VS+QTQAFELEKQRLKWVKFRSKKERDMERAKLENEKR L
Subjt:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL

Query:  ENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        ENERMML+VK+ ELDLMGM HY QQQQQHSSNKRGDPSSITG
Subjt:  ENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

KAE8651071.1 hypothetical protein Csa_001092 [Cucumis sativus]2.0e-21390.16Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDM
        MFSGMNSSMLGLELPLHQNPTNP NPHQLHHPPMVSYV H+PHHHQQPP V+VKYP+PTK KPQQSNLSDDEEQGFA DDSNGDGKKK+SPWQRMKWTDM
Subjt:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDM

Query:  MVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
        MVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
Subjt:  MVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT

Query:  KEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEED
        KEEVRKLLNSKHLFFREMCAYHNTCRH T+    HPSPDA  EPSHLPQQQQQ+  CFH T+TTTS A++AAG+G KSGDEEEEEEE++ESEEE   EED
Subjt:  KEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEED

Query:  EEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLEN
        EE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQW+KSRLIQLEEQQVS+QTQAFELEKQRLKWVKFRSKKERDMERAKLEN
Subjt:  EEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLEN

Query:  EKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        EKR LENERMMLMVK+ ELDLMGM HYQQQQQQHSSNKRGDPSSITG
Subjt:  EKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

XP_004138003.2 uncharacterized protein DDB_G0290301 [Cucumis sativus]1.4e-21689.5Show/hide
Query:  MEPNGLGGGGGVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDD
        ME N LGGGGG G GG GGGGGGG GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHPPMVSYV H+PHHHQQPP V+VKYP+PTK KPQQSNLSDD
Subjt:  MEPNGLGGGGGVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDD

Query:  EEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQGFA DDSNGDGKKK+SPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVA
        VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRH T+    HPSPDA  EPSHLPQQQQQ+  CFH T+TTTS A++A
Subjt:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVA

Query:  AGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ
        AG+G KSGDEEEEEEE++ESEEE   EEDEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQW+KSRLIQLEEQQVS+QTQ
Subjt:  AGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ

Query:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        AFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMMLMVK+ ELDLMGM HYQQQQQQHSSNKRGDPSSITG
Subjt:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

XP_008442785.1 PREDICTED: ESF1 homolog [Cucumis melo]1.3e-21288.66Show/hide
Query:  MEPNGLGGGG-GVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSD
        ME N LGGGG G  GGGGGGGG G  GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHPPMVSYV H+PHHHQQPP V+VKYP+PTK KPQQSNLSD
Subjt:  MEPNGLGGGG-GVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSD

Query:  DEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        DEEQGFA DDSNGDGKKK+SPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  DEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAV
        RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRH T+    HPSPDA  EPSHLPQQQQQ+  CFH T+TTTS A+V
Subjt:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAV

Query:  AAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ
        AAG+  KSGDEE+EEEE+DESEEE   EEDEE EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQW+KSRLIQLEEQ+VS+QTQ
Subjt:  AAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ

Query:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        AFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMML+VK+ ELDLMGM HY QQQQQHSSNKRGDPSSITG
Subjt:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]7.2e-22494.78Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNPPNPHQLHHPP+VSYV H+PHHHQQPPPV++KYPYPTKPKPQQSN+SDDEEQGFA DDSNGDGKKK+SPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLP-QQQQQRCFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEG
        RKLLNSKHLFFREMCAYHNTCRHGT+    HPSPD  AEPSHLP QQQQQRCFH TETTTSAAAVAAG+  KSGDEEEEEEEDDESEEEEE EEDEEIEG
Subjt:  RKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLP-QQQQQRCFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEG

Query:  RQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLE
        RQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQW+KSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLE
Subjt:  RQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLE

Query:  NERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        NERMMLMVKQKELDLM MHHYQQQQQQHSSNKRGDPSSITG
Subjt:  NERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein7.0e-21789.5Show/hide
Query:  MEPNGLGGGGGVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDD
        ME N LGGGGG G GG GGGGGGG GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHPPMVSYV H+PHHHQQPP V+VKYP+PTK KPQQSNLSDD
Subjt:  MEPNGLGGGGGVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDD

Query:  EEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQGFA DDSNGDGKKK+SPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVA
        VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRH T+    HPSPDA  EPSHLPQQQQQ+  CFH T+TTTS A++A
Subjt:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVA

Query:  AGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ
        AG+G KSGDEEEEEEE++ESEEE   EEDEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQW+KSRLIQLEEQQVS+QTQ
Subjt:  AGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ

Query:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        AFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMMLMVK+ ELDLMGM HYQQQQQQHSSNKRGDPSSITG
Subjt:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog6.1e-21388.66Show/hide
Query:  MEPNGLGGGG-GVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSD
        ME N LGGGG G  GGGGGGGG G  GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHPPMVSYV H+PHHHQQPP V+VKYP+PTK KPQQSNLSD
Subjt:  MEPNGLGGGG-GVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSD

Query:  DEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        DEEQGFA DDSNGDGKKK+SPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  DEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAV
        RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRH T+    HPSPDA  EPSHLPQQQQQ+  CFH T+TTTS A+V
Subjt:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAV

Query:  AAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ
        AAG+  KSGDEE+EEEE+DESEEE   EEDEE EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQW+KSRLIQLEEQ+VS+QTQ
Subjt:  AAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQ

Query:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        AFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMML+VK+ ELDLMGM HY QQQQQHSSNKRGDPSSITG
Subjt:  AFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

A0A5A7TKS6 ESF1-like protein3.2e-20989.59Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNP NPHQLHHPPMVSYV H+PHHHQQPP V+VKYP+PTK KPQQSNLSDDEEQGFA DDSNGDGKKK+SPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIE
        RKLLNSKHLFFREMCAYHNTCRH T+    HPSPDA  EPSHLPQQQQQ+  CFH T+TTTS A+VAAG+  KSGDEE+EEEE+DESEEE   EEDEE E
Subjt:  RKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQR--CFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEIE

Query:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL
        GRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQW+KSRLIQLEEQ+VS+QTQAFELEKQRLKWVKFRSKKERDMERAKLENEKR L
Subjt:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL

Query:  ENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        ENERMML+VK+ ELDLMGM HY QQQQQHSSNKRGDPSSITG
Subjt:  ENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

A0A6J1J5X4 ribosome quality control complex subunit 2-like2.1e-19280.12Show/hide
Query:  MEPNGLGGGGGVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDD
        MEPN L              GGGGG  GGMF G++SSMLGLELPLHQ P+   NPHQLHHPPMVSYVPHE HH QQPPP AVK PYP KPKPQQSN+SD+
Subjt:  MEPNGLGGGGGVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDD

Query:  EEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        EEQG A DDSN D KKK+SPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  EEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNS---GAHPSPDATAEPSHLP-QQQQQRCFHVTETTTSAAA
        VNDILGKGTAC+VVEN TLL+SMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRHG  NS   GA+  P+  AEPSHLP QQQQQRCFH TET   A  
Subjt:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNS---GAHPSPDATAEPSHLP-QQQQQRCFHVTETTTSAAA

Query:  VAAGDGLKSGDEEEE-EEEDDESEE--EEEDEEDEEIEGR---QEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQ
          AG+G KSGDEEEE E+EDDESEE  EEE+EE+EE+EG    QEEEEETES+KR RK G TAG+QQ+SAEVMGVVQDGGRSPWEKKQW+K RLIQLEEQ
Subjt:  VAAGDGLKSGDEEEE-EEEDDESEE--EEEDEEDEEIEGR---QEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQ

Query:  QVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        QV YQ+Q  E+EKQRLKW+KFR KKERDMERAKLENEKRRLENERMMLMVKQKELDL  +H+YQQQQQQHSSN+RGDPSSITG
Subjt:  QVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog6.4e-19481.59Show/hide
Query:  VGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSN
        +GG  GGGGGGGGGGGGGMFSGMNS+MLGL+LPLH +PTNPPN HQLHHP MVSYVP +P   QQPPP AV+YPYP KPKPQQSNLSDDEEQGFA +D N
Subjt:  VGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSN

Query:  GDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
         DGKKK+SPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC
Subjt:  GDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC

Query:  KVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNS---GAHPSPDATAE-PSHL-----PQQQQQRCFHVTETTTSAAAVAAGDG
        +VVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRH T N+   GAHPSPD  AE PSHL      QQQQQRCFH TET  +AA V A   
Subjt:  KVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNS---GAHPSPDATAE-PSHL-----PQQQQQRCFHVTETTTSAAAVAAGDG

Query:  LKSGDEEEEEEEDDESEEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI----TAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQ
           GD+++E++E+DESEE+E+DEE+EEIEG     EEE+ETESRKRARKGGI     A MQQL+AEV+GV+QDGGRS WEKKQW+KSRLIQLEEQQV YQ
Subjt:  LKSGDEEEEEEEDDESEEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI----TAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQ

Query:  TQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        +QAFELEKQRLKW+KFRSKKERDMERAKLENEKRRLE ERM+LMVKQKELD M MHHY   QQQHSSNKRGDPSSITG
Subjt:  TQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.2e-4835.52Show/hide
Query:  GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNG-----
        GG    G    G F    S  +  +  ++Q   + PN   LH     + V  +   H Q   +++      K + +++++SDD+E  F  +  +G     
Subjt:  GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDSNG-----

Query:  DGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACK
        +   K SPWQR+KWTD MV+LLITAV YIGD+   +    +  ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+C+
Subjt:  DGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACK

Query:  VVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQ-QQQRCFHVTETTTSAAAVAAGDGLKSGDEEE
        VVEN  LLDS+  L  K K++VRK+++SKHLF+ EMC+YHN  R                   HLP     QR   +               L+S D   
Subjt:  VVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQ-QQQRCFHVTETTTSAAAVAAGDGLKSGDEEE

Query:  EEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRA-------RKGGITAGMQQL-------------------------------SAEVMGVVQDGGRS
        + + DD  + + ED +DE+ +G  +E +E E +  A         GG    ++++                                A+V     + GR+
Subjt:  EEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRA-------RKGGITAGMQQL-------------------------------SAEVMGVVQDGGRS

Query:  PWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL
           +KQW++SR +QLEEQ++  Q +  ELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  PWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)3.3e-4938.38Show/hide
Query:  QSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND
        + ++S+D+E      D     K+  SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFND
Subjt:  QSNLSDDEEQGFAGDDSNGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND

Query:  LNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQ-QRCFHVTETT
        LNKRYK++N++LG+GT+C+VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN  R                   HLP     QR  H+    
Subjt:  LNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQ-QRCFHVTETT

Query:  TSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDE--EDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEV-MGVVQDGGRSPWEKKQWIKSRLIQLE
         +  +    D  + G  + E+ +DD+  EE+ D    D  ++ R  + +  E      KG     + +  A+V  G+  D  ++   ++Q I+S+ ++LE
Subjt:  TSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDE--EDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEV-MGVVQDGGRSPWEKKQWIKSRLIQLE

Query:  EQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL
         +++  Q +  ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  EQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors2.5e-9750.43Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQ-PPPVAVKYPYPTKPKPQQSNLS----DDEEQGFAG-------DDSNGDGKKK
        MFSG +  ML LE+P  QNP NP N  Q  HP      P+     QQ  PP+   YPY +KPK Q S +S    DDE++G          D +  DGK+K
Subjt:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQ-PPPVAVKYPYPTKPKPQQSNLS----DDEEQGFAG-------DDSNGDGKKK

Query:  MSPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGK
        +S W RMKWTD MVRLLI AVFYIGDE G ++P D   KKK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGK
Subjt:  MSPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGK

Query:  GTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLP-QQQQQRCFHVTETTTSAAAVAAGDGLKS
        G AC+VVENQ LL+SM+ LTPK K+EV+KLLNSKHLFFREMCAYHN+C H     G H        P  +P   QQQ CFH  E    A           
Subjt:  GTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLP-QQQQQRCFHVTETTTSAAAVAAGDGLKS

Query:  GDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQR
                E  E EEE E +  E+ E   EE EE E+RK+ R   I+  +++L  E   VV+D G+S WEKK+WI+ +++++EE+++ Y+ +  E+EKQR
Subjt:  GDEEEEEEEDDESEEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQR

Query:  LKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG
        +KW+++RSKKER+ME+AKL+N++RRLE ERM+LM+++ E++L          +  SS  R DPSS  G
Subjt:  LKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTAATGGTTTAGGCGGCGGCGGCGGGGTTGGGGGTGGAGGTGGAGGTGGGGGCGGTGGTGGAGGAGGAGGAGGAGGAGGGATGTTTTCAGGCATGAATTCGTC
AATGTTGGGATTGGAATTACCACTTCATCAAAACCCAACAAATCCCCCGAATCCTCATCAATTACATCACCCCCCAATGGTGTCTTATGTCCCACACGAGCCCCACCACC
ACCAACAACCGCCGCCAGTCGCCGTGAAATACCCCTATCCGACGAAGCCTAAGCCGCAGCAGTCGAATCTCAGTGACGACGAGGAGCAGGGATTCGCGGGGGATGATAGT
AACGGAGATGGCAAGAAGAAAATGTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCAGGTTGCTGATTACGGCGGTGTTCTACATCGGCGACGAAGGTGGGTC
GGAGCCGGCGGACCACGCCGGCAAGAAAAAGCCAGTGGGGTTACTGCAAAAGAAGGGGAAATGGAAATCGGTATCCAGAGCAATGATGGAAAAAGGATTCTACGTTTCAC
CACAGCAATGCGAAGACAAATTCAACGATTTAAACAAAAGATATAAACGAGTTAACGACATTTTGGGGAAGGGCACAGCCTGCAAAGTTGTCGAGAATCAAACGTTACTG
GATTCAATGGAATTAACACCGAAAACGAAAGAAGAAGTACGAAAGTTACTCAATTCTAAACATCTGTTCTTCAGAGAAATGTGTGCTTACCATAACACTTGCCGTCACGG
CACCAGCAACAGCGGCGCTCATCCCTCACCGGATGCGACAGCGGAACCATCTCACCTTCCACAACAACAACAGCAACGATGCTTCCACGTGACAGAGACCACAACCTCCG
CCGCCGCTGTAGCGGCCGGCGATGGTTTGAAAAGTGGAGATGAAGAGGAGGAGGAGGAGGAAGACGATGAATCGGAGGAGGAGGAGGAAGATGAGGAGGATGAGGAAATT
GAAGGAAGGCAAGAAGAGGAGGAGGAAACGGAATCGAGGAAAAGAGCGAGGAAAGGGGGGATTACGGCGGGGATGCAGCAGTTGAGTGCGGAGGTGATGGGAGTTGTGCA
GGACGGCGGGAGAAGTCCATGGGAGAAGAAACAGTGGATAAAAAGCCGGTTAATTCAGCTTGAAGAGCAGCAAGTGAGCTATCAAACGCAAGCTTTTGAGCTGGAGAAAC
AGAGGCTGAAATGGGTGAAGTTCAGAAGCAAGAAGGAGAGGGATATGGAGAGGGCGAAGCTGGAGAATGAGAAGAGAAGGCTGGAAAATGAGAGGATGATGCTGATGGTG
AAGCAGAAGGAGTTGGATTTGATGGGTATGCACCATTACCAGCAACAGCAGCAGCAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGATTACAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACCTAATGGTTTAGGCGGCGGCGGCGGGGTTGGGGGTGGAGGTGGAGGTGGGGGCGGTGGTGGAGGAGGAGGAGGAGGAGGGATGTTTTCAGGCATGAATTCGTC
AATGTTGGGATTGGAATTACCACTTCATCAAAACCCAACAAATCCCCCGAATCCTCATCAATTACATCACCCCCCAATGGTGTCTTATGTCCCACACGAGCCCCACCACC
ACCAACAACCGCCGCCAGTCGCCGTGAAATACCCCTATCCGACGAAGCCTAAGCCGCAGCAGTCGAATCTCAGTGACGACGAGGAGCAGGGATTCGCGGGGGATGATAGT
AACGGAGATGGCAAGAAGAAAATGTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCAGGTTGCTGATTACGGCGGTGTTCTACATCGGCGACGAAGGTGGGTC
GGAGCCGGCGGACCACGCCGGCAAGAAAAAGCCAGTGGGGTTACTGCAAAAGAAGGGGAAATGGAAATCGGTATCCAGAGCAATGATGGAAAAAGGATTCTACGTTTCAC
CACAGCAATGCGAAGACAAATTCAACGATTTAAACAAAAGATATAAACGAGTTAACGACATTTTGGGGAAGGGCACAGCCTGCAAAGTTGTCGAGAATCAAACGTTACTG
GATTCAATGGAATTAACACCGAAAACGAAAGAAGAAGTACGAAAGTTACTCAATTCTAAACATCTGTTCTTCAGAGAAATGTGTGCTTACCATAACACTTGCCGTCACGG
CACCAGCAACAGCGGCGCTCATCCCTCACCGGATGCGACAGCGGAACCATCTCACCTTCCACAACAACAACAGCAACGATGCTTCCACGTGACAGAGACCACAACCTCCG
CCGCCGCTGTAGCGGCCGGCGATGGTTTGAAAAGTGGAGATGAAGAGGAGGAGGAGGAGGAAGACGATGAATCGGAGGAGGAGGAGGAAGATGAGGAGGATGAGGAAATT
GAAGGAAGGCAAGAAGAGGAGGAGGAAACGGAATCGAGGAAAAGAGCGAGGAAAGGGGGGATTACGGCGGGGATGCAGCAGTTGAGTGCGGAGGTGATGGGAGTTGTGCA
GGACGGCGGGAGAAGTCCATGGGAGAAGAAACAGTGGATAAAAAGCCGGTTAATTCAGCTTGAAGAGCAGCAAGTGAGCTATCAAACGCAAGCTTTTGAGCTGGAGAAAC
AGAGGCTGAAATGGGTGAAGTTCAGAAGCAAGAAGGAGAGGGATATGGAGAGGGCGAAGCTGGAGAATGAGAAGAGAAGGCTGGAAAATGAGAGGATGATGCTGATGGTG
AAGCAGAAGGAGTTGGATTTGATGGGTATGCACCATTACCAGCAACAGCAGCAGCAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGATTACAGGATGA
Protein sequenceShow/hide protein sequence
MEPNGLGGGGGVGGGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPPMVSYVPHEPHHHQQPPPVAVKYPYPTKPKPQQSNLSDDEEQGFAGDDS
NGDGKKKMSPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLL
DSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHGTSNSGAHPSPDATAEPSHLPQQQQQRCFHVTETTTSAAAVAAGDGLKSGDEEEEEEEDDESEEEEEDEEDEEI
EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWIKSRLIQLEEQQVSYQTQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMV
KQKELDLMGMHHYQQQQQQHSSNKRGDPSSITG