; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G012130 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G012130
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGATA transcription factor
Genome locationCG_Chr08:25013061..25014877
RNA-Seq ExpressionClCG08G012130
SyntenyClCG08G012130
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
IPR016679 - Transcription factor, GATA, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445001.1 PREDICTED: GATA transcription factor 12-like [Cucumis melo]1.0e-15381.77Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------GDGGGLFYNNNN------NGNNNSTECSAVTVIESC
        MEAPEYFQIN Y SQF   SS D+  A+ TA   AA PEHFIVEELLDFS N+DDAV       G GGGLFYNNNN      N NNNS E SA+TV+ESC
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------GDGGGLFYNNNN------NGNNNSTECSAVTVIESC

Query:  N-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR
        N SSSF EDISGSNL DAHFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLEL+SGVKVK    PAQSPQPT +  R AAAIFKP+IVSVPAKARSKR
Subjt:  N-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR

Query:  SRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
        SRA+PSNWNNS LLPLSPT EPEIT   G P+ IKKP PK AATAKKKD+P+VG SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Subjt:  SRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE

Query:  YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        YRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQHLLLDH QDMIFDASNGDDYLIHQHVGPDFRQ+I
Subjt:  YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

XP_022132107.1 GATA transcription factor 12 [Momordica charantia]7.2e-12872.3Show/hide
Query:  MEAPEYFQIN--GYC-SQFAT---HSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS----
        ME P+YFQIN   YC SQF     HSSSDND      T    G EHFIVEELLDFSN DD V  D          NGN+N+    +V+VIESCNSS    
Subjt:  MEAPEYFQIN--GYC-SQFAT---HSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS----

Query:  -----SFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR
             SFL+DI+ SNL DA FS+ELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK  E  Q  QP+ +    AA IFKPDIVSVPAKARSKR
Subjt:  -----SFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR

Query:  SR-AVPSNWNNSRLLPLSPTTEPE----ITTTAGPPHPIKKPPPKA-ATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
        SR AVP+NWNNSRLLPLSPTT       +   A PPHP KK   KA  TAKKKD P+ G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Subjt:  SR-AVPSNWNNSRLLPLSPTTEPE----ITTTAGPPHPIKKPPPKA-ATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG

Query:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQH-LLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL R QQQQ QH L+LDHHQ+MIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQH-LLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

XP_023002390.1 GATA transcription factor 12-like [Cucurbita maxima]2.4e-12366.93Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS--------SF
        MEAPEYF  N YCSQF    +SD D+A +TATATA   +HFIVEELLDFSNDDD+ I D GG F N     N NS+E SA T +ES NSS        SF
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS--------SF

Query:  LEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPT-----VSHGRKAAA-IFKPDIVSVPAKARSKRS
         +D+SGS+L D  FS ++ +PY++L ELEWL+ F EE FSSEDMQKLELI+GVKVK  EP QS  PT      SHGR AAA IFKPDIV+VPAKARSKRS
Subjt:  LEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPT-----VSHGRKAAA-IFKPDIVSVPAKARSKRS

Query:  RAVPSNWNNSRLLPLSPT---TEPEITTTAGPPHPIKKPPPKAATAKKK----DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
        R +PSNWNNSRLLPLSPT   +E +I  T  PPHP+KK PPK A A KK     S E G+S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Subjt:  RAVPSNWNNSRLLPLSPT---TEPEITTTAGPPHPIKKPPPKAATAKKK----DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG

Query:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---QQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +AQ+QQ     H    HHQ+M+FD+SNG+DYL+ Q+V  D+  LI
Subjt:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---QQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

XP_031736569.1 GATA transcription factor 12 [Cucumis sativus]6.1e-15180.53Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------------GDGGGLFYNNNN------NGNNNSTECSAV
        MEAPEYFQIN Y SQF   SS D+  AT TA A AA P+HFIVEELLDFS N+DDAV+            G GGGLFYNNNN      N NNNSTE SAV
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------------GDGGGLFYNNNN------NGNNNSTECSAV

Query:  TVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPA
        TV+ESCN SSSF EDISGSNL DAHFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK    P QSPQPT +  R AAAIFKP+IVSVPA
Subjt:  TVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPA

Query:  KARSKRSRAVPSNWNNSRLLPL-SPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
        KARSKRSRA+PSNWNNS LLPL SPT E E T     PHPIKK  PK AATAKKKDSP++G SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
Subjt:  KARSKRSRAVPSNWNNSRLLPL-SPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK

Query:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ QHLLLDH QDMIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

XP_038886306.1 GATA transcription factor 12-like [Benincasa hispida]9.3e-17689.67Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFY--NNNNNGNNNSTECSAVTVIESCNS---------
        MEAPEYFQINGYCSQF+THSSSD D+ TATAT   AGPEHFIVEELLDFSNDDD V+GDGGGLFY  NN NN NNNSTE SAVTVIESCNS         
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFY--NNNNNGNNNSTECSAVTVIESCNS---------

Query:  SSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAV
        SSFLEDISGSNL DAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKV+  EP  S QPT +  R AAAIFKPDIVSVPAKARSKRSRAV
Subjt:  SSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAV

Query:  PSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAA
        PSNWNNSRLLPLSPTTEPEIT TAGPPHPIKK PPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAA
Subjt:  PSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAA

Query:  SPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        SPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQH+GPDFRQLI
Subjt:  SPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

TrEMBL top hitse value%identityAlignment
A0A0A0LPR5 GATA transcription factor2.9e-15180.53Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------------GDGGGLFYNNNN------NGNNNSTECSAV
        MEAPEYFQIN Y SQF   SS D+  AT TA A AA P+HFIVEELLDFS N+DDAV+            G GGGLFYNNNN      N NNNSTE SAV
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------------GDGGGLFYNNNN------NGNNNSTECSAV

Query:  TVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPA
        TV+ESCN SSSF EDISGSNL DAHFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK    P QSPQPT +  R AAAIFKP+IVSVPA
Subjt:  TVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPA

Query:  KARSKRSRAVPSNWNNSRLLPL-SPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
        KARSKRSRA+PSNWNNS LLPL SPT E E T     PHPIKK  PK AATAKKKDSP++G SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
Subjt:  KARSKRSRAVPSNWNNSRLLPL-SPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK

Query:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ QHLLLDH QDMIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

A0A1S3BBN7 GATA transcription factor4.8e-15481.77Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------GDGGGLFYNNNN------NGNNNSTECSAVTVIESC
        MEAPEYFQIN Y SQF   SS D+  A+ TA   AA PEHFIVEELLDFS N+DDAV       G GGGLFYNNNN      N NNNS E SA+TV+ESC
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------GDGGGLFYNNNN------NGNNNSTECSAVTVIESC

Query:  N-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR
        N SSSF EDISGSNL DAHFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLEL+SGVKVK    PAQSPQPT +  R AAAIFKP+IVSVPAKARSKR
Subjt:  N-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR

Query:  SRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
        SRA+PSNWNNS LLPLSPT EPEIT   G P+ IKKP PK AATAKKKD+P+VG SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Subjt:  SRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE

Query:  YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        YRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQHLLLDH QDMIFDASNGDDYLIHQHVGPDFRQ+I
Subjt:  YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

A0A5A7VCX1 GATA transcription factor4.8e-15481.77Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------GDGGGLFYNNNN------NGNNNSTECSAVTVIESC
        MEAPEYFQIN Y SQF   SS D+  A+ TA   AA PEHFIVEELLDFS N+DDAV       G GGGLFYNNNN      N NNNS E SA+TV+ESC
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI------GDGGGLFYNNNN------NGNNNSTECSAVTVIESC

Query:  N-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR
        N SSSF EDISGSNL DAHFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLEL+SGVKVK    PAQSPQPT +  R AAAIFKP+IVSVPAKARSKR
Subjt:  N-SSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR

Query:  SRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
        SRA+PSNWNNS LLPLSPT EPEIT   G P+ IKKP PK AATAKKKD+P+VG SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Subjt:  SRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE

Query:  YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        YRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQHLLLDH QDMIFDASNGDDYLIHQHVGPDFRQ+I
Subjt:  YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

A0A6J1BSX6 GATA transcription factor3.5e-12872.3Show/hide
Query:  MEAPEYFQIN--GYC-SQFAT---HSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS----
        ME P+YFQIN   YC SQF     HSSSDND      T    G EHFIVEELLDFSN DD V  D          NGN+N+    +V+VIESCNSS    
Subjt:  MEAPEYFQIN--GYC-SQFAT---HSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS----

Query:  -----SFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR
             SFL+DI+ SNL DA FS+ELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK  E  Q  QP+ +    AA IFKPDIVSVPAKARSKR
Subjt:  -----SFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKR

Query:  SR-AVPSNWNNSRLLPLSPTTEPE----ITTTAGPPHPIKKPPPKA-ATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
        SR AVP+NWNNSRLLPLSPTT       +   A PPHP KK   KA  TAKKKD P+ G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Subjt:  SR-AVPSNWNNSRLLPLSPTTEPE----ITTTAGPPHPIKKPPPKA-ATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG

Query:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQH-LLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL R QQQQ QH L+LDHHQ+MIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQH-LLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

A0A6J1KNT0 GATA transcription factor1.2e-12366.93Show/hide
Query:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS--------SF
        MEAPEYF  N YCSQF    +SD D+A +TATATA   +HFIVEELLDFSNDDD+ I D GG F N     N NS+E SA T +ES NSS        SF
Subjt:  MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS--------SF

Query:  LEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPT-----VSHGRKAAA-IFKPDIVSVPAKARSKRS
         +D+SGS+L D  FS ++ +PY++L ELEWL+ F EE FSSEDMQKLELI+GVKVK  EP QS  PT      SHGR AAA IFKPDIV+VPAKARSKRS
Subjt:  LEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPT-----VSHGRKAAA-IFKPDIVSVPAKARSKRS

Query:  RAVPSNWNNSRLLPLSPT---TEPEITTTAGPPHPIKKPPPKAATAKKK----DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
        R +PSNWNNSRLLPLSPT   +E +I  T  PPHP+KK PPK A A KK     S E G+S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Subjt:  RAVPSNWNNSRLLPLSPT---TEPEITTTAGPPHPIKKPPPKAATAKKK----DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG

Query:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---QQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
        RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +AQ+QQ     H    HHQ+M+FD+SNG+DYL+ Q+V  D+  LI
Subjt:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---QQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI

SwissProt top hitse value%identityAlignment
O49741 GATA transcription factor 28.2e-4239.46Show/hide
Query:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED
        + P+   +++LLDFSN+D       GG            ST  ++ +      + SF      S+     F  ++CVP DD A LEWLS FV++SF+   
Subjt:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED

Query:  MQKL-ELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKDSPE
           L   ++ VK +                         S P K RSKRSRA          +PL    +   +     P   +         + + S  
Subjt:  MQKL-ELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKDSPE

Query:  VGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHH
             G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R  QQ Q H    HH
Subjt:  VGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHH

O49743 GATA transcription factor 49.7e-4343.21Show/hide
Query:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED
        + P+   +++LLDFSND+         +F     + ++  T  +A +   S N  SF      S      F+ +LCVP DD A LEWLS FV++SFS   
Subjt:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED

Query:  MQKLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRA----VPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKD
                      PA     TV          +P+I S   K RS+RSRA    V   W        +P +E E+       H + KP P      KK 
Subjt:  MQKLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRA----VPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKD

Query:  SPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
             V++   R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Subjt:  SPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE

O82632 GATA transcription factor 93.4e-6447.77Show/hide
Query:  AAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSE
        A  P+ F+V++LLDFSNDD  V         ++  N   +S+  S  T+ +S NSSS   D +G         S+L +P DD+AELEWLS+FVEESF+ E
Subjt:  AAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSE

Query:  DMQKLELISGVKVKEPAQS-------PQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATA
        D  KL L SG+K  +   S       P+P + H            V+VPAKARSKRSR+  S W  SRLL L+ + E               P  K    
Subjt:  DMQKLELISGVKVKEPAQS-------PQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATA

Query:  KKKD-SPEVGVSSGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHL
        K++D + ++ V  GE   GR+C+HCAT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+      + +HL
Subjt:  KKKD-SPEVGVSSGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHL

Query:  LLD-HHQDMIFD-ASNGDDYLIH---QHVGPDFRQLI
        L     ++++ D  SNG+D+L+H    HV PDFR LI
Subjt:  LLD-HHQDMIFD-ASNGDDYLIH---QHVGPDFRQLI

P69781 GATA transcription factor 121.3e-6852.06Show/hide
Query:  FIVEELL-DFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLA-ELEWLSHFVEESFSSEDMQ
        F V++LL DFSNDDD              N+   +ST  +  T+ +S N S++ L    G       FS +LC+P DDLA ELEWLS+ V+ES S ED+ 
Subjt:  FIVEELL-DFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLA-ELEWLSHFVEESFSSEDMQ

Query:  KLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL----SPTTEPEITTTAGPPHPIKKPPPKAATAKKK---
        KLELISG K +   +S   +  +   ++ IF  D VSVPAKARSKRSRA   NW +  LL      SP T   I ++     P   PP   A   KK   
Subjt:  KLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL----SPTTEPEITTTAGPPHPIKKPPPKAATAKKK---

Query:  ----------DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQ
                   SPE G    E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ RA     
Subjt:  ----------DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQ

Query:  QHLLLDHHQD----MIFD-ASNGDDYLIHQHVGPDFRQLI
         H  + HH      MIFD +S+GDDYLIH +VGPDFRQLI
Subjt:  QHLLLDHHQD----MIFD-ASNGDDYLIHQHVGPDFRQLI

Q9FH57 GATA transcription factor 51.5e-3540.07Show/hide
Query:  EHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQK
        + F V++LLD SNDD  V  D        +    +     S+    +  ++     D SG +   +  +SEL +P DDLA LEWLSHFVE+SF+      
Subjt:  EHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQK

Query:  LELISGVKVKEPA------QSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI-TTTAGPPHP-----------IKKP
        L   +G   ++PA      + P   V+        FK     VPAKARSKR+R     W+        P++     ++++GP  P           +   
Subjt:  LELISGVKVKEPA------QSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI-TTTAGPPHP-----------IKKP

Query:  PPKAATAKKKDSPEVGVSSGE------GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
         P      KK S E  V SGE       RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE
Subjt:  PPKAATAKKKDSPEVGVSSGE------GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE

Arabidopsis top hitse value%identityAlignment
AT2G45050.1 GATA transcription factor 25.8e-4339.46Show/hide
Query:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED
        + P+   +++LLDFSN+D       GG            ST  ++ +      + SF      S+     F  ++CVP DD A LEWLS FV++SF+   
Subjt:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED

Query:  MQKL-ELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKDSPE
           L   ++ VK +                         S P K RSKRSRA          +PL    +   +     P   +         + + S  
Subjt:  MQKL-ELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKDSPE

Query:  VGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHH
             G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R  QQ Q H    HH
Subjt:  VGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHH

AT3G60530.1 GATA transcription factor 46.9e-4443.21Show/hide
Query:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED
        + P+   +++LLDFSND+         +F     + ++  T  +A +   S N  SF      S      F+ +LCVP DD A LEWLS FV++SFS   
Subjt:  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSED

Query:  MQKLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRA----VPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKD
                      PA     TV          +P+I S   K RS+RSRA    V   W        +P +E E+       H + KP P      KK 
Subjt:  MQKLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRA----VPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKD

Query:  SPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
             V++   R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Subjt:  SPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE

AT4G32890.1 GATA transcription factor 92.4e-6547.77Show/hide
Query:  AAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSE
        A  P+ F+V++LLDFSNDD  V         ++  N   +S+  S  T+ +S NSSS   D +G         S+L +P DD+AELEWLS+FVEESF+ E
Subjt:  AAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSE

Query:  DMQKLELISGVKVKEPAQS-------PQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATA
        D  KL L SG+K  +   S       P+P + H            V+VPAKARSKRSR+  S W  SRLL L+ + E               P  K    
Subjt:  DMQKLELISGVKVKEPAQS-------PQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATA

Query:  KKKD-SPEVGVSSGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHL
        K++D + ++ V  GE   GR+C+HCAT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+      + +HL
Subjt:  KKKD-SPEVGVSSGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHL

Query:  LLD-HHQDMIFD-ASNGDDYLIH---QHVGPDFRQLI
        L     ++++ D  SNG+D+L+H    HV PDFR LI
Subjt:  LLD-HHQDMIFD-ASNGDDYLIH---QHVGPDFRQLI

AT5G25830.1 GATA transcription factor 129.6e-7052.06Show/hide
Query:  FIVEELL-DFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLA-ELEWLSHFVEESFSSEDMQ
        F V++LL DFSNDDD              N+   +ST  +  T+ +S N S++ L    G       FS +LC+P DDLA ELEWLS+ V+ES S ED+ 
Subjt:  FIVEELL-DFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCN-SSSFLEDISGSNLTDAHFSSELCVPYDDLA-ELEWLSHFVEESFSSEDMQ

Query:  KLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL----SPTTEPEITTTAGPPHPIKKPPPKAATAKKK---
        KLELISG K +   +S   +  +   ++ IF  D VSVPAKARSKRSRA   NW +  LL      SP T   I ++     P   PP   A   KK   
Subjt:  KLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL----SPTTEPEITTTAGPPHPIKKPPPKAATAKKK---

Query:  ----------DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQ
                   SPE G    E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ RA     
Subjt:  ----------DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQ

Query:  QHLLLDHHQD----MIFD-ASNGDDYLIHQHVGPDFRQLI
         H  + HH      MIFD +S+GDDYLIH +VGPDFRQLI
Subjt:  QHLLLDHHQD----MIFD-ASNGDDYLIHQHVGPDFRQLI

AT5G66320.1 GATA transcription factor 51.1e-3640.07Show/hide
Query:  EHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQK
        + F V++LLD SNDD  V  D        +    +     S+    +  ++     D SG +   +  +SEL +P DDLA LEWLSHFVE+SF+      
Subjt:  EHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQK

Query:  LELISGVKVKEPA------QSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI-TTTAGPPHP-----------IKKP
        L   +G   ++PA      + P   V+        FK     VPAKARSKR+R     W+        P++     ++++GP  P           +   
Subjt:  LELISGVKVKEPA------QSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI-TTTAGPPHP-----------IKKP

Query:  PPKAATAKKKDSPEVGVSSGE------GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
         P      KK S E  V SGE       RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE
Subjt:  PPKAATAKKKDSPEVGVSSGE------GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTCCTGAATATTTCCAGATCAATGGCTACTGTTCTCAATTCGCCACCCACTCCTCCTCCGACAACGACTCCGCCACCGCCACCGCCACAGCCACTGCCGCCGG
ACCGGAGCATTTCATCGTGGAGGAGCTTCTCGATTTCTCCAACGATGACGACGCCGTTATTGGTGACGGTGGAGGATTGTTTTACAATAATAATAACAATGGGAATAATA
ATTCAACGGAATGTTCCGCCGTTACGGTGATTGAGAGTTGCAATTCGTCGTCGTTTTTGGAAGATATTAGTGGCTCTAATTTAACCGACGCCCATTTCTCCAGCGAACTC
TGCGTTCCGTACGACGATTTAGCTGAGTTGGAATGGCTTTCACATTTCGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTGGAATTAATCTCCGGAGTCAAAGT
CAAAGAACCCGCCCAATCCCCACAACCCACCGTCTCTCACGGCCGAAAAGCCGCCGCAATTTTCAAACCGGACATCGTTTCCGTTCCGGCCAAAGCCCGTAGCAAACGCT
CACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTTCCTCTTTCTCCCACCACCGAACCCGAAATTACCACCACCGCGGGACCACCGCACCCCATCAAAAAACCC
CCTCCCAAGGCGGCGACAGCCAAGAAGAAGGACAGCCCGGAGGTCGGAGTGTCCTCCGGCGAGGGGCGAAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCG
GACCGGCCCAATGGGCCCGAAAACGCTGTGTAATGCTTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAGTACCGCCCCGCCGCTAGCCCCACCTTCGTTTTAA
CCAAACACTCCAATTCTCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAAGAGCTTCTTAGAGCCCAACAACAGCAACAACAACATTTGCTTTTGGATCATCATCAGGAT
ATGATCTTTGATGCATCCAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCCGATTTCCGGCAGCTGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTCCTGAATATTTCCAGATCAATGGCTACTGTTCTCAATTCGCCACCCACTCCTCCTCCGACAACGACTCCGCCACCGCCACCGCCACAGCCACTGCCGCCGG
ACCGGAGCATTTCATCGTGGAGGAGCTTCTCGATTTCTCCAACGATGACGACGCCGTTATTGGTGACGGTGGAGGATTGTTTTACAATAATAATAACAATGGGAATAATA
ATTCAACGGAATGTTCCGCCGTTACGGTGATTGAGAGTTGCAATTCGTCGTCGTTTTTGGAAGATATTAGTGGCTCTAATTTAACCGACGCCCATTTCTCCAGCGAACTC
TGCGTTCCGTACGACGATTTAGCTGAGTTGGAATGGCTTTCACATTTCGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTGGAATTAATCTCCGGAGTCAAAGT
CAAAGAACCCGCCCAATCCCCACAACCCACCGTCTCTCACGGCCGAAAAGCCGCCGCAATTTTCAAACCGGACATCGTTTCCGTTCCGGCCAAAGCCCGTAGCAAACGCT
CACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTTCCTCTTTCTCCCACCACCGAACCCGAAATTACCACCACCGCGGGACCACCGCACCCCATCAAAAAACCC
CCTCCCAAGGCGGCGACAGCCAAGAAGAAGGACAGCCCGGAGGTCGGAGTGTCCTCCGGCGAGGGGCGAAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCG
GACCGGCCCAATGGGCCCGAAAACGCTGTGTAATGCTTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAGTACCGCCCCGCCGCTAGCCCCACCTTCGTTTTAA
CCAAACACTCCAATTCTCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAAGAGCTTCTTAGAGCCCAACAACAGCAACAACAACATTTGCTTTTGGATCATCATCAGGAT
ATGATCTTTGATGCATCCAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCCGATTTCCGGCAGCTGATCTGA
Protein sequenceShow/hide protein sequence
MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSEL
CVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKP
PPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQD
MIFDASNGDDYLIHQHVGPDFRQLI