; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009400 (gene) of Snake gourd v1 genome

Gene IDTan0009400
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionsequence-specific DNA binding transcription factors
Genome locationLG08:69896031..69901877
RNA-Seq ExpressionTan0009400
SyntenyTan0009400
Gene Ontology termsGO:0010629 - negative regulation of gene expression (biological process)
GO:1900037 - regulation of cellular response to hypoxia (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596531.1 hypothetical protein SDJN03_09711, partial [Cucurbita argyrosperma subsp. sororia]2.5e-21788.58Show/hide
Query:  MDNNSL-GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED
        M+++SL G GGGGGGGGGGMFSGMNSAMLGLDLPLH +PTN  NSHQLHHPS+VSYVP +P    QQPPP AV+YPYP KPKPQQSNLSDDEEQGFAVED
Subjt:  MDNNSL-GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHFP--QQQQQQQRCFHATETAAAAA---ADG
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNNGGGGAHPSPDTAAE PSH    Q QQQQQRCFHATETAAAAA   ADG
Subjt:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHFP--QQQQQQQRCFHATETAAAAA---ADG

Query:  DDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFE
        DDEDE+DEE++ EE+ +++EEEEIEGTSRGHE+E+ETESRKR RKGGI A AAAMQQL+AEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFE
Subjt:  DDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFE

Query:  LEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        LEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERM LMVKQKELD MDMHHY   QQQHSSNKRGDPSSITG
Subjt:  LEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

KAG7028064.1 hypothetical protein SDJN02_09244, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-22087.76Show/hide
Query:  LNLPSVTANRRILGRVVLMDNNSLGTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKP
        L L  VTANRR LG VVLM+++SL  GG GGGGGGGMFSGMNSAMLGLDLPLH +PTN  NSHQLHHPS+VSYVP +P    QQPPP AV+YPYP KPKP
Subjt:  LNLPSVTANRRILGRVVLMDNNSLGTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKP

Query:  QQSNLSDDEEQGFAVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFN
        QQSNLSDDEEQGFAVEDGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFN
Subjt:  QQSNLSDDEEQGFAVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFN

Query:  DLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHFP--QQQQQQQRC
        DLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNNGGGGAHPSPDTAAE PSH    Q QQQQQRC
Subjt:  DLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHFP--QQQQQQQRC

Query:  FHATETAAAAA---ADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSR
        FHATETAAAAA   ADGDDEDE+DEE++ EE+ +++EEEEIEGTSRGHE+E+ETESRKR RKGGI A AAAMQQL+AEVIGVLQDGGRSPWEKKQWMKSR
Subjt:  FHATETAAAAA---ADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSR

Query:  LIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        LIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERM LMVKQKELD MDMHHY   QQQHSSNKRGDPSSITG
Subjt:  LIQLEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

XP_022933836.1 uncharacterized protein LOC111441130 [Cucurbita moschata]7.1e-21788.42Show/hide
Query:  MDNNSLGT-GGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED
        M+++SLG+ GGGGGGGGGGMFSG+NSAMLGLDLPLH +PTN  NSHQLHH S+VSY P +P    QQPPP AV+YPYP KPKPQQSNLSDDEEQGFAVED
Subjt:  MDNNSLGT-GGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEP-----SHFPQQQQQQQRCFHATETAAAAA---A
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNNGGGGAHPSPDTAAEP      H  QQQQQQQRCFHATE+AAAAA   A
Subjt:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEP-----SHFPQQQQQQQRCFHATETAAAAA---A

Query:  DGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQA
        DGDDEDE+DEEE E EE E+DEEEEIEGTSRGHE+E+ETESRKR RKGGI  A AAMQQL+AEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQA
Subjt:  DGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQA

Query:  FELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        FELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLMVKQKELD MDMHHY  QQQQHSSNKRGDPSSITG
Subjt:  FELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

XP_023005783.1 ESF1 homolog [Cucurbita maxima]1.8e-21587.27Show/hide
Query:  MDNNSL-----GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGF
        M+++SL     G GGGGGGGGGGMFSGMNSAMLGLDLPLH +PTN  NSHQLHHPS+VSYVP +P    QQPPP AV+YPYP KPKPQQSNLSDDEEQGF
Subjt:  MDNNSL-----GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGF

Query:  AVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
        AVEDGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
Subjt:  AVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHF---PQQQQQQQRCFHATETAAAAA-
        GKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNNGGGGAHPSPDTAAE PSH     QQQQQQQRCFHATETAAAAA 
Subjt:  GKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHF---PQQQQQQQRCFHATETAAAAA-

Query:  --ADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIG-AAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNY
          ADGDD+DE+DEE++ EE+ +++EEEEIEGTSRGHE+E+ETESRKR RKGGI  AA AAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQLEEQQV Y
Subjt:  --ADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIG-AAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNY

Query:  QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLMVKQKELD MDMHHY   QQQHSSNKRGDPSSITG
Subjt:  QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

XP_023539309.1 ESF1 homolog [Cucurbita pepo subsp. pepo]8.4e-21888.54Show/hide
Query:  MDNNSL-GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED
        M+++SL G GGGGGGGGGGMFSGMNSAMLGLDLPLH +PTN  NSHQLHHPS+VSYVP +P    QQPPP AV+YPYP KPKPQQSNLSDDEEQGFAVED
Subjt:  MDNNSL-GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHFPQQQQQQQRCFHATETAAAAA---ADGDD
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNNGGGGAHPSPDTAAE PSH  Q  QQQQRCFHATETAA AA   ADGDD
Subjt:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHFPQQQQQQQRCFHATETAAAAA---ADGDD

Query:  EDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFELE
        EDE+DEE++ EE+ +++EEEEIEGTSRGHE+E+ETESRKR RKGGI  A AAMQQL+AEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFELE
Subjt:  EDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFELE

Query:  KQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        KQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLMVKQKELD MDMHHY   QQQHSSNKRGDPSSITG
Subjt:  KQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

TrEMBL top hitse value%identityAlignment
A0A0A0LDU6 Uncharacterized protein4.0e-19782Show/hide
Query:  GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVEDGNCDGKK
        G GGGGG GGGGMFSGMNS+MLGL+LPLHQNPTN +N HQLHHP +VSYV H+P HHHQQPP  +VKYP+PTK KPQQSNLSDDEEQGFA +D N DGKK
Subjt:  GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVEDGNCDGKK

Query:  KISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVEN
        KISPWQRMKWTDMMVRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTAC+VVEN
Subjt:  KISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVEN

Query:  QTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAA-ADGDDEDEEDEEEDE
        QTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH T        HPSPD A EPSH PQQQQQQQ CFHAT+T  +A+ A G+     DEEE+E
Subjt:  QTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAA-ADGDDEDEEDEEEDE

Query:  EEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFR
        EEE E +EEEE E T    E+EEETESRKR RKGG+    A MQQLSAEV+GV+ DGGRSPWEKKQWMKSRLIQLEEQQV++Q+QAFELEKQRLKW+KFR
Subjt:  EEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQRLKWLKFR

Query:  SKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        SKKERDMERAKLENEKR LENERM+LMVK+ ELDLM M HYQQQQQQHSSNKRGDPSSITG
Subjt:  SKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

A0A1S3B6J6 ESF1 homolog9.1e-19479.33Show/hide
Query:  MDNNSL------GTGGGGGG------GGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLS
        M+ NSL      G+GGGGGG      GGGGMFSGMNS+MLGL+LPLHQNPTN +N HQLHHP +VSYV H+P HHHQQPP  +VKYP+PTK KPQQSNLS
Subjt:  MDNNSL------GTGGGGGG------GGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLS

Query:  DDEEQGFAVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRY
        DDEEQGFA +D N DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSE  DH GKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRY
Subjt:  DDEEQGFAVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRY

Query:  KRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATE-TAA
        KRVNDILGKGTAC+VVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH T        HPSPD A EPSH PQQQQQQQ CFHAT+ T +
Subjt:  KRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATE-TAA

Query:  AAAADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNY
        A+ A G+     DEE++EEEE E +EEEE E T    E+EEETESRKR RKGGI    A MQQLSAEV+GV+ DGGRSPWEKKQWMKSRLIQLEEQ+V++
Subjt:  AAAADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNY

Query:  QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        Q+QAFELEKQRLKW+KFRSKKERDMERAKLENEKR LENERM+L+VK+ ELDLM M HY QQQQQHSSNKRGDPSSITG
Subjt:  QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

A0A6J1F055 uncharacterized protein LOC1114411303.5e-21788.42Show/hide
Query:  MDNNSLGT-GGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED
        M+++SLG+ GGGGGGGGGGMFSG+NSAMLGLDLPLH +PTN  NSHQLHH S+VSY P +P    QQPPP AV+YPYP KPKPQQSNLSDDEEQGFAVED
Subjt:  MDNNSLGT-GGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVED

Query:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        GN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
Subjt:  GNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEP-----SHFPQQQQQQQRCFHATETAAAAA---A
        ACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNNGGGGAHPSPDTAAEP      H  QQQQQQQRCFHATE+AAAAA   A
Subjt:  ACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEP-----SHFPQQQQQQQRCFHATETAAAAA---A

Query:  DGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQA
        DGDDEDE+DEEE E EE E+DEEEEIEGTSRGHE+E+ETESRKR RKGGI  A AAMQQL+AEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQA
Subjt:  DGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQA

Query:  FELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        FELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLMVKQKELD MDMHHY  QQQQHSSNKRGDPSSITG
Subjt:  FELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

A0A6J1J5X4 ribosome quality control complex subunit 2-like3.8e-19279.54Show/hide
Query:  MDNNSLGTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVEDG
        M+ NSL   GGGGG  GGMF G++S+MLGL+LPLHQ P   SN HQLHHP +VSYVPHE  HH QQPPPAAVK PYP KPKPQQSN+SD+EEQG A +D 
Subjt:  MDNNSLGTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVEDG

Query:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
        N D KKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSE  DHA KKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA
Subjt:  NCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTA

Query:  CRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAAA-----DGDD
        CRVVEN TLL+SMELTPK+KEEVRKLLNSKHLFFREMCAYHNTCRHG GN+GGGGA+  P+T AEPSH P QQQQQQRCFHATETA  A A      GD+
Subjt:  CRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAAA-----DGDD

Query:  EDE---EDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAF
        E+E   ED+E +E++E EE+EEEE+EG+SR  E+EEETES+KR RK G     A +QQ+SAEV+GV+QDGGRSPWEKKQWMK RLIQLEEQQV YQSQ  
Subjt:  EDE---EDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAF

Query:  ELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        E+EKQRLKWLKFR KKERDMERAKLENEKRRLENERM+LMVKQKELDL D+H+YQQQQQQHSSN+RGDPSSITG
Subjt:  ELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

A0A6J1L352 ESF1 homolog8.5e-21687.27Show/hide
Query:  MDNNSL-----GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGF
        M+++SL     G GGGGGGGGGGMFSGMNSAMLGLDLPLH +PTN  NSHQLHHPS+VSYVP +P    QQPPP AV+YPYP KPKPQQSNLSDDEEQGF
Subjt:  MDNNSL-----GTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGF

Query:  AVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
        AVEDGN DGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGG+E ADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL
Subjt:  AVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDIL

Query:  GKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHF---PQQQQQQQRCFHATETAAAAA-
        GKGTACRVVENQTLLDSMELTPK KEEVRKLLNSKHLFFREMCAYHNTCRH TGNNGGGGAHPSPDTAAE PSH     QQQQQQQRCFHATETAAAAA 
Subjt:  GKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAE-PSHF---PQQQQQQQRCFHATETAAAAA-

Query:  --ADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIG-AAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNY
          ADGDD+DE+DEE++ EE+ +++EEEEIEGTSRGHE+E+ETESRKR RKGGI  AA AAMQQL+AEVIGVLQDGGRS WEKKQWMKSRLIQLEEQQV Y
Subjt:  --ADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIG-AAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNY

Query:  QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLE ERMVLMVKQKELD MDMHHY   QQQHSSNKRGDPSSITG
Subjt:  QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.4e-5637.56Show/hide
Query:  GGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVEDGN-----CDGKKK
        G    G F    S  +     ++Q   ++ NS  LH     + V  +   HHQ      +      K + +++++SDD+E  F  E G+      +   K
Subjt:  GGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVEDGN-----CDGKKK

Query:  ISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ
         SPWQR+KWTD MV+LLITAV YIGD+   +S+     ++   +LQKKGKWKSVS+ M E+G++VSPQQCEDKFNDLNKRYK++ND+LG+GT+C+VVEN 
Subjt:  ISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQ

Query:  TLLDSM-ELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAAADGDDEDEEDEEEDEE
         LLDS+  L  K K++VRK+++SKHLF+ EMC+YHN          G   H   D A + S    Q   + R  H  + +       +D D+ED + D +
Subjt:  TLLDSM-ELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAAADGDDEDEEDEEEDEE

Query:  EESEEDEEEEIEGTSR------------------GHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQ
        E  E +E+    G  R                   HED +            +        Q      G   + GR+   +KQWM+SR +QLEEQ++  Q
Subjt:  EESEEDEEEEIEGTSR------------------GHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQ

Query:  SQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKEL
         +  ELEKQR +W +F  K+++++ER ++ENE+ +LEN+RM L +KQ+EL
Subjt:  SQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)1.1e-5037.88Show/hide
Query:  QSNLSDDEEQGFAVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND
        + ++S+D+E      DG  +  K+ SPWQR+KW D MV+L+ITA+ YIG++ GS+        K   +LQKKGKW+SVS+ M E+G++VSPQQCEDKFND
Subjt:  QSNLSDDEEQGFAVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFND

Query:  LNKRYKRVNDILGKGTACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHA
        LNKRYK++N++LG+GT+C VVEN +LLD ++ L  K K+EVR++++SKHLF+ EMC+YHN          G   H   D A             QR  H 
Subjt:  LNKRYKRVNDILGKGTACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHA

Query:  TETAAAAAADGDDEDEEDEEEDEEEESEEDEEEEIEG--TSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEV-IGVLQDGGRSPWEKKQWMKSRLIQ
               + D  D DE  + ++E+ + ++D EE+ +G  + R  +   +++S +       G     + +  A+V  G+  D  ++   ++Q ++S+ ++
Subjt:  TETAAAAAADGDDEDEEDEEEDEEEESEEDEEEEIEG--TSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEV-IGVLQDGGRSPWEKKQWMKSRLIQ

Query:  LEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKEL
        LE +++  Q++  ELE+Q+ KW  F  ++E+ + + ++ENE+ +LENERM L +K+ EL
Subjt:  LEEQQVNYQSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKEL

AT3G10040.1 sequence-specific DNA binding transcription factors1.8e-9649.36Show/hide
Query:  MFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLS----DDEEQGFAVEDG-------NCDGKKK
        MFSG +  ML L++P  QNP N  NS Q  HP      P+      Q  PP    YPY +KPK Q S +S    DDE++G     G         DGK+K
Subjt:  MFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVPHEPHHHHQQPPPAAVKYPYPTKPKPQQSNLS----DDEEQGFAVEDG-------NCDGKKK

Query:  ISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKK--------PVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT
        +S W RMKWTD MVRLLI AVFYIGDE G      A KK           G+LQKKGKWKSVSRAM+EKGF VSPQQCEDKFNDLNKRYKRVNDILGKG 
Subjt:  ISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKK--------PVGLLQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGT

Query:  ACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAAADGDDEDE
        ACRVVENQ LL+SM+ LTPK+K+EV+KLLNSKHLFFREMCAYHN+C H        G H        P   P    QQQ CFHA E    A      E E
Subjt:  ACRVVENQTLLDSME-LTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQRCFHATETAAAAAADGDDEDE

Query:  EDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQR
        E+ E D  E+SE + EE          +EEET  ++R         + A+++L  E   V++D G+S WEKK+W++ +++++EE+++ Y+ +  E+EKQR
Subjt:  EDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNYQSQAFELEKQR

Query:  LKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG
        +KW+++RSKKER+ME+AKL+N++RRLE ERM+LM+++ E++L ++          SS  R DPSS  G
Subjt:  LKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGTAGTATTTAGTTCATGGGTTACCGTTCTTCGACGATCGGAAGAAGGCGGCGGCAGCGGCGGGCTGAAGAATGCGACGGTGACGGGCGGAAGAAGGCGACTCAA
TCTTCCCTCAGTAACTGCGAATCGGAGAATTTTGGGGAGGGTGGTTTTAATGGATAACAATAGTTTAGGCACCGGCGGAGGAGGTGGAGGCGGCGGCGGAGGGATGTTTT
CCGGGATGAATTCGGCAATGCTGGGATTGGATTTACCGCTTCATCAAAACCCCACGAATTCCTCGAATTCACACCAATTACATCATCCTTCAATTGTGTCTTATGTCCCA
CACGAGCCCCACCACCACCACCAGCAACCGCCGCCGGCTGCCGTGAAATACCCTTATCCGACGAAGCCCAAGCCGCAGCAGTCGAATCTCAGCGACGACGAGGAGCAGGG
GTTTGCGGTGGAGGACGGGAACTGCGACGGGAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTTTACA
TCGGCGATGAAGGTGGGTCGGAGTCGGCGGACCACGCCGGCAAGAAAAAACCAGTGGGGCTGCTGCAGAAAAAGGGGAAATGGAAATCGGTCTCCAGAGCGATGATGGAG
AAAGGATTCTACGTTTCGCCACAGCAATGCGAAGACAAATTCAATGATTTAAACAAAAGATATAAACGAGTTAACGACATTTTGGGAAAGGGCACCGCCTGCAGAGTCGT
CGAGAATCAGACGTTATTGGATTCGATGGAATTAACACCAAAAATGAAAGAAGAAGTTCGAAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGCGCTTACC
ACAACACTTGCCGTCACGGCACCGGCAACAACGGCGGTGGCGGCGCCCATCCCTCGCCGGACACAGCGGCGGAACCCTCCCACTTCCCACAACAACAGCAACAGCAACAA
CGATGCTTCCACGCGACGGAGACCGCCGCTGCCGCAGCGGCGGACGGAGATGATGAAGACGAGGAGGATGAAGAAGAGGATGAGGAAGAGGAATCGGAGGAAGATGAGGA
GGAGGAAATTGAAGGAACTTCAAGAGGGCACGAAGACGAGGAGGAAACGGAATCGAGGAAGCGGCCGAGGAAAGGGGGGATTGGAGCGGCGGCGGCGGCGATGCAGCAGT
TGAGCGCGGAGGTGATTGGAGTGCTGCAGGACGGCGGGAGGAGTCCGTGGGAGAAGAAGCAATGGATGAAGAGCCGATTGATTCAGCTTGAAGAGCAGCAAGTGAATTAT
CAATCGCAAGCTTTCGAGCTGGAGAAACAGAGGCTGAAATGGCTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAAAGGGCGAAGCTGGAGAATGAGAAGAGAAGGCT
GGAGAACGAGAGGATGGTGCTGATGGTGAAGCAGAAGGAGTTGGATTTGATGGATATGCATCACTATCAACAGCAGCAGCAGCAGCATTCGTCGAACAAGCGAGGGGATC
CATCGTCGATTACAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGTAGTATTTAGTTCATGGGTTACCGTTCTTCGACGATCGGAAGAAGGCGGCGGCAGCGGCGGGCTGAAGAATGCGACGGTGACGGGCGGAAGAAGGCGACTCAA
TCTTCCCTCAGTAACTGCGAATCGGAGAATTTTGGGGAGGGTGGTTTTAATGGATAACAATAGTTTAGGCACCGGCGGAGGAGGTGGAGGCGGCGGCGGAGGGATGTTTT
CCGGGATGAATTCGGCAATGCTGGGATTGGATTTACCGCTTCATCAAAACCCCACGAATTCCTCGAATTCACACCAATTACATCATCCTTCAATTGTGTCTTATGTCCCA
CACGAGCCCCACCACCACCACCAGCAACCGCCGCCGGCTGCCGTGAAATACCCTTATCCGACGAAGCCCAAGCCGCAGCAGTCGAATCTCAGCGACGACGAGGAGCAGGG
GTTTGCGGTGGAGGACGGGAACTGCGACGGGAAGAAGAAAATCTCGCCGTGGCAGAGGATGAAATGGACGGACATGATGGTCCGGCTGCTGATCACGGCGGTGTTTTACA
TCGGCGATGAAGGTGGGTCGGAGTCGGCGGACCACGCCGGCAAGAAAAAACCAGTGGGGCTGCTGCAGAAAAAGGGGAAATGGAAATCGGTCTCCAGAGCGATGATGGAG
AAAGGATTCTACGTTTCGCCACAGCAATGCGAAGACAAATTCAATGATTTAAACAAAAGATATAAACGAGTTAACGACATTTTGGGAAAGGGCACCGCCTGCAGAGTCGT
CGAGAATCAGACGTTATTGGATTCGATGGAATTAACACCAAAAATGAAAGAAGAAGTTCGAAAATTACTCAATTCTAAACATCTCTTCTTCAGAGAAATGTGCGCTTACC
ACAACACTTGCCGTCACGGCACCGGCAACAACGGCGGTGGCGGCGCCCATCCCTCGCCGGACACAGCGGCGGAACCCTCCCACTTCCCACAACAACAGCAACAGCAACAA
CGATGCTTCCACGCGACGGAGACCGCCGCTGCCGCAGCGGCGGACGGAGATGATGAAGACGAGGAGGATGAAGAAGAGGATGAGGAAGAGGAATCGGAGGAAGATGAGGA
GGAGGAAATTGAAGGAACTTCAAGAGGGCACGAAGACGAGGAGGAAACGGAATCGAGGAAGCGGCCGAGGAAAGGGGGGATTGGAGCGGCGGCGGCGGCGATGCAGCAGT
TGAGCGCGGAGGTGATTGGAGTGCTGCAGGACGGCGGGAGGAGTCCGTGGGAGAAGAAGCAATGGATGAAGAGCCGATTGATTCAGCTTGAAGAGCAGCAAGTGAATTAT
CAATCGCAAGCTTTCGAGCTGGAGAAACAGAGGCTGAAATGGCTGAAGTTCAGGAGCAAGAAGGAGAGGGATATGGAAAGGGCGAAGCTGGAGAATGAGAAGAGAAGGCT
GGAGAACGAGAGGATGGTGCTGATGGTGAAGCAGAAGGAGTTGGATTTGATGGATATGCATCACTATCAACAGCAGCAGCAGCAGCATTCGTCGAACAAGCGAGGGGATC
CATCGTCGATTACAGGTTGA
Protein sequenceShow/hide protein sequence
MVVVFSSWVTVLRRSEEGGGSGGLKNATVTGGRRRLNLPSVTANRRILGRVVLMDNNSLGTGGGGGGGGGGMFSGMNSAMLGLDLPLHQNPTNSSNSHQLHHPSIVSYVP
HEPHHHHQQPPPAAVKYPYPTKPKPQQSNLSDDEEQGFAVEDGNCDGKKKISPWQRMKWTDMMVRLLITAVFYIGDEGGSESADHAGKKKPVGLLQKKGKWKSVSRAMME
KGFYVSPQQCEDKFNDLNKRYKRVNDILGKGTACRVVENQTLLDSMELTPKMKEEVRKLLNSKHLFFREMCAYHNTCRHGTGNNGGGGAHPSPDTAAEPSHFPQQQQQQQ
RCFHATETAAAAAADGDDEDEEDEEEDEEEESEEDEEEEIEGTSRGHEDEEETESRKRPRKGGIGAAAAAMQQLSAEVIGVLQDGGRSPWEKKQWMKSRLIQLEEQQVNY
QSQAFELEKQRLKWLKFRSKKERDMERAKLENEKRRLENERMVLMVKQKELDLMDMHHYQQQQQQHSSNKRGDPSSITG