; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G19010 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G19010
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionsequence-specific DNA binding transcription factors
Genome locationClcChr10:32808071..32809812
RNA-Seq ExpressionClc10G19010
SyntenyClc10G19010
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651071.1 hypothetical protein Csa_001092 [Cucumis sativus]1.4e-20989.06Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDM
        MFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+PHH QQPP V+VKYP+PTK KPQQSNLSDD+EQGFAADDSNGDGKKKISPWQRMKWTDM
Subjt:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDM

Query:  MVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
        MVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT
Subjt:  MVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKT

Query:  KEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEE
        KEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A++AAG+G KSGDE++EEEEE+ES   EE+EE
Subjt:  KEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEE

Query:  DEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLE
        DEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQQVS+Q QAFELEKQRLKWVKFRSKKERDMERAKLE
Subjt:  DEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLE

Query:  NEKRRLENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG
        NEKR LENERMMLMVK+ ELDLMGM HY Q QQQHSSNKRGDPSSITG
Subjt:  NEKRRLENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG

KAG7028064.1 hypothetical protein SDJN02_09244, partial [Cucurbita argyrosperma subsp. argyrosperma]9.9e-21180.85Show/hide
Query:  LSQNQLPPSSSIYIEFLFLGLVLFLDLETANRKNWGVVLMESSSLGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPH
        +S N L    SI   FLFLGLVLFL+  TANR+N GVVLMES SL     GG GGGGGGGMFSGMNS+MLGL+LPLH +PTNPPN HQLHHP +VSYVP 
Subjt:  LSQNQLPPSSSIYIEFLFLGLVLFLDLETANRKNWGVVLMESSSLGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPH

Query:  EPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWK
        +P  QQQPPP AV+YPYP KPKPQQSNLSDD+EQGFA +D N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWK
Subjt:  EPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWK

Query:  SVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPS
        SVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST N+   GAHPS
Subjt:  SVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPS

Query:  PDAAAE-PSHLP----QQQQQRCFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI--TAGM
        PD AAE PSHL     QQQQQRCFHATE  T++AAA    D    GD++DE++EEDES+E+E+DEE+EEIEG     EEE+ETESRKRARKGGI   A M
Subjt:  PDAAAE-PSHLP----QQQQQRCFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI--TAGM

Query:  QQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQH
        QQL+AEV+GV+QDGGRSPWEKKQWMKSRL+QLEEQQV+YQ+QAFELEKQRLKW+KFRSKKERDMERAKLENEKRRLE ERM LMVKQKELD M MHHY  
Subjt:  QQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQH

Query:  QQQHSSNKRGDPSSITG
        QQQHSSNKRGDPSSITG
Subjt:  QQQHSSNKRGDPSSITG

XP_004138003.2 uncharacterized protein DDB_G0290301 [Cucumis sativus]1.7e-23482.55Show/hide
Query:  MGGFLSTQYHNPLSQNQLPP--SSSIY------------------------IEFLFL--GLVLFLDLETANRKNWGVVLMESSSLGGG--------GGGG
        MGGFLSTQYHNPLSQN LPP    SIY                        I+FL+L  GL+LFL L+TANRKNWGV+LME++SLGGG        GGGG
Subjt:  MGGFLSTQYHNPLSQNQLPP--SSSIY------------------------IEFLFL--GLVLFLDLETANRKNWGVVLMESSSLGGG--------GGGG

Query:  GGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPW
        GGG GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+PHH QQPP V+VKYP+PTK KPQQSNLSDD+EQGFAADDSNGDGKKKISPW
Subjt:  GGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPW

Query:  QRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD
        QRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD
Subjt:  QRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLD

Query:  SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDES
        SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A++AAG+G KSGDE++EEEEE+ES
Subjt:  SMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDES

Query:  DEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKER
           EE+EEDEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQQVS+Q QAFELEKQRLKWVKFRSKKER
Subjt:  DEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKER

Query:  DMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG
        DMERAKLENEKR LENERMMLMVK+ ELDLMGM HY Q QQQHSSNKRGDPSSITG
Subjt:  DMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG

XP_008442785.1 PREDICTED: ESF1 homolog [Cucumis melo]6.0e-21687.82Show/hide
Query:  MESSSLGGG-----GGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSD
        ME++SLGGG     GGGGGGGG    GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+PHH QQPP V+VKYP+PTK KPQQSNLSD
Subjt:  MESSSLGGG-----GGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSD

Query:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        D+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA
        RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A+
Subjt:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA

Query:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQA
        VAAG+  KSGDE+DEEEEEDES   EE+EEDEE EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQ+VS+Q 
Subjt:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQA

Query:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG
        QAFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMML+VK+ ELDLMGM HYQ QQQHSSNKRGDPSSITG
Subjt:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG

XP_038903237.1 transcription factor SPT20 homolog [Benincasa hispida]1.4e-21793.21Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNPPNPHQLHHP IVSYV H+PHH QQPPPV++KYPYPTKPKPQQSN+SDD+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIE
        RKLLNSKHLFFREMCAYHNTCRH T+    HPSPD AAEPSHLP QQQQQRCFHATETTT SAAAVAAG+  KSGDE++EEEE+DES EEEE+EEDEEIE
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIE

Query:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL
        GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRL+QLEEQQVSYQ QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL
Subjt:  GRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRL

Query:  ENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG
        ENERMMLMVKQKELDLM MHHY Q QQQHSSNKRGDPSSITG
Subjt:  ENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein4.2e-21587.42Show/hide
Query:  MESSSLGGG--------GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDD
        ME++SLGGG        GGGGGGG GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+PHH QQPP V+VKYP+PTK KPQQSNLSDD
Subjt:  MESSSLGGG--------GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDD

Query:  DEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
        +EQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR
Subjt:  DEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKR

Query:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAV
        VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A++
Subjt:  VNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAV

Query:  AAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQA
        AAG+G KSGDE++EEEEE+ES   EE+EEDEE EGRQEEEEETESRKRARKGG ITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQQVS+Q 
Subjt:  AAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG-ITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQA

Query:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG
        QAFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMMLMVK+ ELDLMGM HY Q QQQHSSNKRGDPSSITG
Subjt:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHY-QHQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog2.9e-21687.82Show/hide
Query:  MESSSLGGG-----GGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSD
        ME++SLGGG     GGGGGGGG    GGGGMFSGMNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+PHH QQPP V+VKYP+PTK KPQQSNLSD
Subjt:  MESSSLGGG-----GGGGGGGG----GGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSD

Query:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
        D+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK
Subjt:  DDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYK

Query:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA
        RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A+
Subjt:  RVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAA

Query:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQA
        VAAG+  KSGDE+DEEEEEDES   EE+EEDEE EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQ+VS+Q 
Subjt:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQA

Query:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG
        QAFELEKQRLKWVKFRSKKERDMERAKLENEKR LENERMML+VK+ ELDLMGM HYQ QQQHSSNKRGDPSSITG
Subjt:  QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG

A0A5A7TKS6 ESF1-like protein2.0e-20989.37Show/hide
Query:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
        MNSSMLGLELPLHQNPTNP NPHQLHHP +VSYV H+PHH QQPP V+VKYP+PTK KPQQSNLSDD+EQGFAADDSNGDGKKKISPWQRMKWTDMMVRL
Subjt:  MNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRL

Query:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
        LITAVFYIGDEGGSEP DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV
Subjt:  LITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEV

Query:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEI
        RKLLNSKHLFFREMCAYHNTCRHST+    HPSPDAA EPSHLPQQQQQ+  CFHAT+TTTS  A+VAAG+  KSGDE+DEEEEEDES   EE+EEDEE 
Subjt:  RKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQR--CFHATETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEI

Query:  EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRR
        EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGV+ DGGRSPWEKKQWMKSRL+QLEEQ+VS+Q QAFELEKQRLKWVKFRSKKERDMERAKLENEKR 
Subjt:  EGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRR

Query:  LENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG
        LENERMML+VK+ ELDLMGM HYQ QQQHSSNKRGDPSSITG
Subjt:  LENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411305.2e-19781.63Show/hide
Query:  MESSSLGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAAD
        MES SL  G  GGGGGGGGGGMFSG+NS+MLGL+LPLH +PTNPPN HQLHH  +VSY P +P  QQQPPP AV+YPYP KPKPQQSNLSDD+EQGFA +
Subjt:  MESSSLGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAAD

Query:  DSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG
        D N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG
Subjt:  DSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG

Query:  TACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAEP-------SHLPQQQQQRCFHATETTTSSAAAV
        TAC+VVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST N+   GAHPSPD AAEP        H  QQQQQRCFHATE   S+AAA 
Subjt:  TACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAEP-------SHLPQQQQQRCFHATETTTSSAAAV

Query:  AAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI-TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVS
          G     GD++DE++EE+ES EE+ED+E+EEIEG     EEE+ETESRKRARKGGI  A MQQL+AEV+GV+QDGGRSPWEKKQWMKSRL+QLEEQQV+
Subjt:  AAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI-TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVS

Query:  YQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG
        YQ+QAFELEKQRLKW+KFRSKKERDMERAKLENEKRRLE ERM+LMVKQKELD M MHHYQ QQQHSSNKRGDPSSITG
Subjt:  YQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog2.6e-20182.82Show/hide
Query:  MESSSLGG--GGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFA
        MES SLGG  GGGGGGGGGGGGGMFSGMNS+MLGL+LPLH +PTNPPN HQLHHP +VSYVP +P  QQQPPP AV+YPYP KPKPQQSNLSDD+EQGFA
Subjt:  MESSSLGG--GGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFA

Query:  ADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG
         +D N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG
Subjt:  ADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILG

Query:  KGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAE-PSHL-----PQQQQQRCFHATETTTSSAAA
        KGTAC+VVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHST N+   GAHPSPD AAE PSHL      QQQQQRCFHATE  T++AAA
Subjt:  KGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNS---GAHPSPDAAAE-PSHL-----PQQQQQRCFHATETTTSSAAA

Query:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI----TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEE
            D    GD+DDE++EEDES+E+E+DEE+EEIEG     EEE+ETESRKRARKGGI     A MQQL+AEV+GV+QDGGRS WEKKQWMKSRL+QLEE
Subjt:  VAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGR---QEEEEETESRKRARKGGI----TAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEE

Query:  QQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG
        QQV YQ+QAFELEKQRLKW+KFRSKKERDMERAKLENEKRRLE ERM+LMVKQKELD M MHHY  QQQHSSNKRGDPSSITG
Subjt:  QQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors3.8e-5136.83Show/hide
Query:  GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHE--PHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNG---
        GG    G    G F    S  +  +  ++Q   + PN   LH     + V  +   HHQ Q   ++ +     K + +++++SDDDE  F  +  +G   
Subjt:  GGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHE--PHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNG---

Query:  --DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
          +   K SPWQR+KWTD MV+LLITAV YIGD+   +    +  ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+
Subjt:  --DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CKVVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQRCFHATETTTSSAAAVAAGDGLKSGDE
        C+VVEN  LLDS+  L  K K++VRK+++SKHLF+ EMC+YHN  R                   HLP        H      S   A+ + D     D+
Subjt:  CKVVENQTLLDSM-ELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQRCFHATETTTSSAAAVAAGDGLKSGDE

Query:  DDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG---------------------------ITAGMQQLSAEVMGVVQDG---GRSPWEKKQ
          + + ED  DE+ + + DE  E  ++     + R     GG                               + Q+      V Q G   GR+   +KQ
Subjt:  DDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGG---------------------------ITAGMQQLSAEVMGVVQDG---GRSPWEKKQ

Query:  WMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL
        WM+SR LQLEEQ++  Q +  ELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  WMKSRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)9.4e-5037.64Show/hide
Query:  QSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND
        + ++S+DDE    + D     K+  SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFND
Subjt:  QSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND

Query:  LNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQ-QRCFHATETT
        LNKRYK++N++LG+GT+C+VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN  R                   HLP     QR  H     
Subjt:  LNKRYKRVNDILGKGTACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQ-QRCFHATETT

Query:  TSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEG--------RQEEEEETESRKRARKGGITAGMQQLSAEV-MGVVQDGGRSPWEKKQWMK
                   G +   ++DE  +    D +++D+ +E+ +G        R  + +  E      KG     + +  A+V  G+  D  ++   ++Q ++
Subjt:  TSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEG--------RQEEEEETESRKRARKGGITAGMQQLSAEV-MGVVQDGGRSPWEKKQWMK

Query:  SRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL
        S+ L+LE +++  QA+  ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  SRLLQLEEQQVSYQAQAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors7.1e-9850.75Show/hide
Query:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLS----DDDEQGFAA-------DDSNGDGKKKI
        MFSG +  ML LE+P  QNP NP N  Q  HP   +        QQ  PP+   YPY +KPK Q S +S    DD+++G  +       D +  DGK+K+
Subjt:  MFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYVPHEPHHQQQPPPVAVKYPYPTKPKPQQSNLS----DDDEQGFAA-------DDSNGDGKKKI

Query:  SPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG
        S W RMKWTD MVRLLI AVFYIGDE G ++P D   KKK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG
Subjt:  SPWQRMKWTDMMVRLLITAVFYIGDEGG-SEPADHAGKKKP---------VGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKG

Query:  TACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKS
         AC+VVENQ LL+SM+ LTPK K+EV+KLLNSKHLFFREMCAYHN+C H     G H        P  +P   QQQ CFHA E    +  A         
Subjt:  TACKVVENQTLLDSME-LTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLP-QQQQQRCFHATETTTSSAAAVAAGDGLKS

Query:  GDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQR
          E  E EEE ESD  E+ E +       EE EE E+RK+ R   I+  +++L  E   VV+D G+S WEKK+W++ ++L++EE+++ Y+ +  E+EKQR
Subjt:  GDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQAQAFELEKQR

Query:  LKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG
        +KW+++RSKKER+ME+AKL+N++RRLE ERM+LM+++ E++L         +  SS  R DPSS  G
Subjt:  LKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGGTTTTCTGTCAACCCAATACCACAATCCGCTCTCCCAAAACCAGCTGCCCCCTTCTTCTTCTATCTATATAGAGTTTCTGTTTCTGGGTTTGGTTCTATTTCT
AGATTTAGAAACGGCTAATCGAAAGAATTGGGGGGTGGTTTTAATGGAATCCAGTAGTTTAGGCGGCGGGGGTGGGGGTGGAGGAGGAGGAGGAGGAGGAGGAGGGATGT
TTTCAGGCATGAATTCATCAATGTTGGGATTGGAATTACCACTTCACCAAAACCCAACAAATCCCCCAAATCCTCACCAATTACATCACCCTCAAATCGTGTCTTATGTC
CCACACGAGCCCCACCACCAACAACAACCGCCGCCGGTCGCCGTGAAATACCCCTATCCGACGAAGCCTAAACCGCAGCAGTCGAATCTCAGTGATGACGACGAGCAGGG
GTTCGCGGCGGATGACAGCAATGGAGATGGCAAGAAGAAAATATCACCGTGGCAGAGGATGAAATGGACGGATATGATGGTTAGGTTGCTGATTACGGCGGTGTTCTACA
TCGGTGATGAAGGTGGATCGGAGCCGGCAGATCACGCCGGCAAGAAAAAACCAGTGGGGTTACTGCAAAAGAAGGGAAAATGGAAATCGGTATCCAGAGCAATGATGGAA
AAAGGATTCTACGTTTCACCACAGCAATGCGAAGATAAATTCAACGATTTAAACAAAAGATATAAACGAGTTAACGATATCTTGGGGAAAGGCACCGCCTGCAAAGTCGT
CGAGAATCAAACATTACTGGATTCAATGGAATTAACACCGAAAACGAAAGAAGAAGTACGAAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGCGCTTACC
ACAACACTTGCCGTCACAGCACCAGCAACAGCGGCGCCCACCCCTCGCCGGACGCGGCAGCGGAACCATCACACCTTCCACAACAACAACAGCAACGATGCTTCCACGCA
ACAGAGACCACAACCTCCTCCGCCGCCGCCGTAGCGGCCGGCGATGGTTTGAAAAGCGGAGATGAGGATGATGAAGAGGAGGAAGAAGACGAATCGGATGAGGAGGAGGA
AGATGAGGAGGATGAGGAAATTGAAGGACGGCAAGAAGAGGAGGAGGAAACGGAATCGAGGAAGAGGGCGAGGAAAGGGGGGATTACGGCGGGAATGCAGCAGTTGAGCG
CGGAGGTGATGGGAGTAGTGCAGGACGGCGGGAGGAGTCCATGGGAGAAGAAGCAGTGGATGAAGAGTCGGTTGCTCCAGCTTGAAGAACAGCAAGTGAGCTACCAAGCG
CAAGCTTTCGAGCTGGAGAAGCAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGGGCGAAGCTGGAGAATGAGAAGAGAAGGCTGGAAAA
TGAGAGGATGATGCTGATGGTGAAGCAGAAGGAATTGGATTTGATGGGTATGCACCATTATCAGCACCAGCAGCAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGA
TTACAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGGTTTTCTGTCAACCCAATACCACAATCCGCTCTCCCAAAACCAGCTGCCCCCTTCTTCTTCTATCTATATAGAGTTTCTGTTTCTGGGTTTGGTTCTATTTCT
AGATTTAGAAACGGCTAATCGAAAGAATTGGGGGGTGGTTTTAATGGAATCCAGTAGTTTAGGCGGCGGGGGTGGGGGTGGAGGAGGAGGAGGAGGAGGAGGAGGGATGT
TTTCAGGCATGAATTCATCAATGTTGGGATTGGAATTACCACTTCACCAAAACCCAACAAATCCCCCAAATCCTCACCAATTACATCACCCTCAAATCGTGTCTTATGTC
CCACACGAGCCCCACCACCAACAACAACCGCCGCCGGTCGCCGTGAAATACCCCTATCCGACGAAGCCTAAACCGCAGCAGTCGAATCTCAGTGATGACGACGAGCAGGG
GTTCGCGGCGGATGACAGCAATGGAGATGGCAAGAAGAAAATATCACCGTGGCAGAGGATGAAATGGACGGATATGATGGTTAGGTTGCTGATTACGGCGGTGTTCTACA
TCGGTGATGAAGGTGGATCGGAGCCGGCAGATCACGCCGGCAAGAAAAAACCAGTGGGGTTACTGCAAAAGAAGGGAAAATGGAAATCGGTATCCAGAGCAATGATGGAA
AAAGGATTCTACGTTTCACCACAGCAATGCGAAGATAAATTCAACGATTTAAACAAAAGATATAAACGAGTTAACGATATCTTGGGGAAAGGCACCGCCTGCAAAGTCGT
CGAGAATCAAACATTACTGGATTCAATGGAATTAACACCGAAAACGAAAGAAGAAGTACGAAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGCGCTTACC
ACAACACTTGCCGTCACAGCACCAGCAACAGCGGCGCCCACCCCTCGCCGGACGCGGCAGCGGAACCATCACACCTTCCACAACAACAACAGCAACGATGCTTCCACGCA
ACAGAGACCACAACCTCCTCCGCCGCCGCCGTAGCGGCCGGCGATGGTTTGAAAAGCGGAGATGAGGATGATGAAGAGGAGGAAGAAGACGAATCGGATGAGGAGGAGGA
AGATGAGGAGGATGAGGAAATTGAAGGACGGCAAGAAGAGGAGGAGGAAACGGAATCGAGGAAGAGGGCGAGGAAAGGGGGGATTACGGCGGGAATGCAGCAGTTGAGCG
CGGAGGTGATGGGAGTAGTGCAGGACGGCGGGAGGAGTCCATGGGAGAAGAAGCAGTGGATGAAGAGTCGGTTGCTCCAGCTTGAAGAACAGCAAGTGAGCTACCAAGCG
CAAGCTTTCGAGCTGGAGAAGCAGAGGCTGAAATGGGTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAGAGGGCGAAGCTGGAGAATGAGAAGAGAAGGCTGGAAAA
TGAGAGGATGATGCTGATGGTGAAGCAGAAGGAATTGGATTTGATGGGTATGCACCATTATCAGCACCAGCAGCAGCATTCGTCGAACAAGCGAGGGGATCCATCGTCGA
TTACAGGATGAAGAAGAAGAAGGGGATTAGGGTTAGGGTTTAGAATTTGGTGGTTTTGAATTTGGGGGGTTGTAATGTAATTTAGAAGAAATGTAAATCCCG
Protein sequenceShow/hide protein sequence
MGGFLSTQYHNPLSQNQLPPSSSIYIEFLFLGLVLFLDLETANRKNWGVVLMESSSLGGGGGGGGGGGGGGGMFSGMNSSMLGLELPLHQNPTNPPNPHQLHHPQIVSYV
PHEPHHQQQPPPVAVKYPYPTKPKPQQSNLSDDDEQGFAADDSNGDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSEPADHAGKKKPVGLLQKKGKWKSVSRAMME
KGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACKVVENQTLLDSMELTPKTKEEVRKLLNSKHLFFREMCAYHNTCRHSTSNSGAHPSPDAAAEPSHLPQQQQQRCFHA
TETTTSSAAAVAAGDGLKSGDEDDEEEEEDESDEEEEDEEDEEIEGRQEEEEETESRKRARKGGITAGMQQLSAEVMGVVQDGGRSPWEKKQWMKSRLLQLEEQQVSYQA
QAFELEKQRLKWVKFRSKKERDMERAKLENEKRRLENERMMLMVKQKELDLMGMHHYQHQQQHSSNKRGDPSSITG