; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006572 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006572
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein ALWAYS EARLY 2-like
Genome locationchr6:43703057..43705845
RNA-Seq ExpressionLag0006572
SyntenyLag0006572
Gene Ontology termsGO:0006351 - transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsIPR010561 - Protein LIN-9/Protein ALWAYS EARLY
IPR028306 - Protein ALWAYS EARLY, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022937461.1 protein ALWAYS EARLY 2-like isoform X1 [Cucurbita moschata]2.9e-5762.69Show/hide
Query:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND
        G    LW  A      R  SS+    N +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+ND
Subjt:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND

Query:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT
        SLG F+Q CS EHLST DL S R R SDKD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+CIGRI + L +
Subjt:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT

Query:  I
        I
Subjt:  I

XP_022937463.1 protein ALWAYS EARLY 2-like isoform X2 [Cucurbita moschata]2.9e-5762.69Show/hide
Query:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND
        G    LW  A      R  SS+    N +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+ND
Subjt:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND

Query:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT
        SLG F+Q CS EHLST DL S R R SDKD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+CIGRI + L +
Subjt:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT

Query:  I
        I
Subjt:  I

XP_022937464.1 protein ALWAYS EARLY 2-like isoform X3 [Cucurbita moschata]2.9e-5762.69Show/hide
Query:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND
        G    LW  A      R  SS+    N +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+ND
Subjt:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND

Query:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT
        SLG F+Q CS EHLST DL S R R SDKD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+CIGRI + L +
Subjt:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT

Query:  I
        I
Subjt:  I

XP_038889457.1 protein ALWAYS EARLY 2-like isoform X1 [Benincasa hispida]1.3e-6079.87Show/hide
Query:  VLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLG-HFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKR
        V LS N+G+DPLT +CGALHSFDNQ SSFE QKPLSM Q+ MNDSLG HF+QF  S+H+STGDLSS R RHSD+D+GGIPSNLITSCVATLLMIQACI+R
Subjt:  VLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLG-HFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKR

Query:  PYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        PYPPGDVAQILGLA+KSLHP CSQNLHFY+EIE+C+ RIKTQL +IVPT
Subjt:  PYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

XP_038889458.1 protein ALWAYS EARLY 2-like isoform X2 [Benincasa hispida]2.6e-6177.92Show/hide
Query:  ERKAVVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLG-HFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQ
        E + +V LS N+G+DPLT +CGALHSFDNQ SSFE QKPLSM Q+ MNDSLG HF+QF  S+H+STGDLSS R RHSD+D+GGIPSNLITSCVATLLMIQ
Subjt:  ERKAVVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLG-HFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQ

Query:  ACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        ACI+RPYPPGDVAQILGLA+KSLHP CSQNLHFY+EIE+C+ RIKTQL +IVPT
Subjt:  ACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

TrEMBL top hitse value%identityAlignment
A0A6J1FB90 protein ALWAYS EARLY 2-like isoform X31.4e-5762.69Show/hide
Query:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND
        G    LW  A      R  SS+    N +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+ND
Subjt:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND

Query:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT
        SLG F+Q CS EHLST DL S R R SDKD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+CIGRI + L +
Subjt:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT

Query:  I
        I
Subjt:  I

A0A6J1FB98 protein ALWAYS EARLY 2-like isoform X11.4e-5762.69Show/hide
Query:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND
        G    LW  A      R  SS+    N +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+ND
Subjt:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND

Query:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT
        SLG F+Q CS EHLST DL S R R SDKD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+CIGRI + L +
Subjt:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT

Query:  I
        I
Subjt:  I

A0A6J1FG45 protein ALWAYS EARLY 2-like isoform X21.4e-5762.69Show/hide
Query:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND
        G    LW  A      R  SS+    N +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+ND
Subjt:  GEEEQLWLSAIERNGWR--SSLRRGTN-THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMND

Query:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT
        SLG F+Q CS EHLST DL S R R SDKD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+CIGRI + L +
Subjt:  SLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLT

Query:  I
        I
Subjt:  I

A0A6J1HKN4 protein ALWAYS EARLY 2 isoform X17.1e-5767.05Show/hide
Query:  THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHFSQFCSSEHLSTGDLSSLRLRHSD
        +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+NDSLG F+Q CS EHL T DL S R R SD
Subjt:  THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHFSQFCSSEHLSTGDLSSLRLRHSD

Query:  KDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTI
        KD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+C+GRI + L +I
Subjt:  KDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTI

A0A6J1HRC2 protein ALWAYS EARLY 2 isoform X37.1e-5767.05Show/hide
Query:  THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHFSQFCSSEHLSTGDLSSLRLRHSD
        +H+ G G  D+ R +             V+LST++GDDPLTIICGALHSF+    SFE+QKPLS  QEY+NDSLG F+Q CS EHL T DL S R R SD
Subjt:  THQEGNGFCDLERKA------------VVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHFSQFCSSEHLSTGDLSSLRLRHSD

Query:  KDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTI
        KD+GGIPSNLITSCVATLLMIQAC++ PYPPGDVAQILGLAVKSLHPRCSQNLHFY+EIE+C+GRI + L +I
Subjt:  KDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTI

SwissProt top hitse value%identityAlignment
Q6A331 Protein ALWAYS EARLY 16.6e-2040Show/hide
Query:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPP
        S    +D   ++  AL S    +       P    QEY N SL H S   ++E +S G +S      S K+   +PS LITSCVA+ LM+Q   K+ YPP
Subjt:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHFSQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPP

Query:  GDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
         DVAQ++   V  L PRC QN+  Y+EI++C+G IKTQ++ +V T
Subjt:  GDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

Q6A332 Protein ALWAYS EARLY 36.2e-1850.62Show/hide
Query:  LRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        L   D++   +PS+L++ C+ATLLMIQ C +R +PP +VAQ+L  AV SL P CSQNL  Y EI+ C+G I+ Q+L +VP+
Subjt:  LRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

Q6A333 Protein ALWAYS EARLY 27.6e-2443.24Show/hide
Query:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP
        S  EG+D  T+I  AL      +     +  +    E++N S+ H    S    SE ++  DL+S   +   +    +PS LITSCVAT LMIQ C +R 
Subjt:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP

Query:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        YPP DVAQ++  AV SL PRC QNL  Y+EI++C+GRIKTQ++++VPT
Subjt:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

Arabidopsis top hitse value%identityAlignment
AT3G05380.1 DIRP ;Myb-like DNA-binding domain5.4e-2543.24Show/hide
Query:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP
        S  EG+D  T+I  AL      +     +  +    E++N S+ H    S    SE ++  DL+S   +   +    +PS LITSCVAT LMIQ C +R 
Subjt:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP

Query:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        YPP DVAQ++  AV SL PRC QNL  Y+EI++C+GRIKTQ++++VPT
Subjt:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

AT3G05380.2 DIRP ;Myb-like DNA-binding domain5.4e-2543.24Show/hide
Query:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP
        S  EG+D  T+I  AL      +     +  +    E++N S+ H    S    SE ++  DL+S   +   +    +PS LITSCVAT LMIQ C +R 
Subjt:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP

Query:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        YPP DVAQ++  AV SL PRC QNL  Y+EI++C+GRIKTQ++++VPT
Subjt:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

AT3G05380.3 DIRP ;Myb-like DNA-binding domain5.4e-2543.24Show/hide
Query:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP
        S  EG+D  T+I  AL      +     +  +    E++N S+ H    S    SE ++  DL+S   +   +    +PS LITSCVAT LMIQ C +R 
Subjt:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP

Query:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        YPP DVAQ++  AV SL PRC QNL  Y+EI++C+GRIKTQ++++VPT
Subjt:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

AT3G05380.4 DIRP ;Myb-like DNA-binding domain5.4e-2543.24Show/hide
Query:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP
        S  EG+D  T+I  AL      +     +  +    E++N S+ H    S    SE ++  DL+S   +   +    +PS LITSCVAT LMIQ C +R 
Subjt:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP

Query:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        YPP DVAQ++  AV SL PRC QNL  Y+EI++C+GRIKTQ++++VPT
Subjt:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT

AT3G05380.5 DIRP ;Myb-like DNA-binding domain5.4e-2543.24Show/hide
Query:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP
        S  EG+D  T+I  AL      +     +  +    E++N S+ H    S    SE ++  DL+S   +   +    +PS LITSCVAT LMIQ C +R 
Subjt:  STNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHF---SQFCSSEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRP

Query:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT
        YPP DVAQ++  AV SL PRC QNL  Y+EI++C+GRIKTQ++++VPT
Subjt:  YPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGAAAAAGACTGAAAAATGTTTTGTAACTTACCGGCGAGGAGAAGAAGAGCAACTGTGGTTGTCGGCGATTGAGAGAAACGGTTGGCGAAGCTCCCTAAGGCG
AGGCACGAACACACATCAAGAGGGAAATGGGTTTTGCGATTTGGAGAGGAAGGCTGTCGTCTTGTTGAGCACAAACGAAGGTGATGACCCTCTTACAATTATTTGTGGTG
CCTTGCATTCTTTTGATAATCAAAGGTCGTCGTTTGAGTTTCAGAAACCTTTAAGCATGCCTCAAGAGTATATGAATGATAGCTTAGGTCACTTTAGTCAATTCTGCTCA
TCAGAACACCTTTCTACTGGTGATCTATCTAGTCTGAGATTGAGACATTCCGACAAAGATTTTGGAGGAATTCCTTCAAATCTAATCACTTCATGTGTCGCAACTTTGCT
CATGATACAGGCGTGTATCAAGCGTCCGTATCCACCAGGCGACGTGGCTCAGATTTTAGGTCTAGCAGTTAAAAGTTTACATCCTAGATGTTCTCAGAATCTGCATTTTT
ATCAAGAGATTGAAAGTTGCATAGGAAGAATCAAAACTCAGTTGTTAACCATTGTTCCAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGAAAAAGACTGAAAAATGTTTTGTAACTTACCGGCGAGGAGAAGAAGAGCAACTGTGGTTGTCGGCGATTGAGAGAAACGGTTGGCGAAGCTCCCTAAGGCG
AGGCACGAACACACATCAAGAGGGAAATGGGTTTTGCGATTTGGAGAGGAAGGCTGTCGTCTTGTTGAGCACAAACGAAGGTGATGACCCTCTTACAATTATTTGTGGTG
CCTTGCATTCTTTTGATAATCAAAGGTCGTCGTTTGAGTTTCAGAAACCTTTAAGCATGCCTCAAGAGTATATGAATGATAGCTTAGGTCACTTTAGTCAATTCTGCTCA
TCAGAACACCTTTCTACTGGTGATCTATCTAGTCTGAGATTGAGACATTCCGACAAAGATTTTGGAGGAATTCCTTCAAATCTAATCACTTCATGTGTCGCAACTTTGCT
CATGATACAGGCGTGTATCAAGCGTCCGTATCCACCAGGCGACGTGGCTCAGATTTTAGGTCTAGCAGTTAAAAGTTTACATCCTAGATGTTCTCAGAATCTGCATTTTT
ATCAAGAGATTGAAAGTTGCATAGGAAGAATCAAAACTCAGTTGTTAACCATTGTTCCAACTTGA
Protein sequenceShow/hide protein sequence
MKKKKTEKCFVTYRRGEEEQLWLSAIERNGWRSSLRRGTNTHQEGNGFCDLERKAVVLLSTNEGDDPLTIICGALHSFDNQRSSFEFQKPLSMPQEYMNDSLGHFSQFCS
SEHLSTGDLSSLRLRHSDKDFGGIPSNLITSCVATLLMIQACIKRPYPPGDVAQILGLAVKSLHPRCSQNLHFYQEIESCIGRIKTQLLTIVPT