; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022967 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022967
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGATA transcription factor
Genome locationtig00000729:1444891..1446973
RNA-Seq ExpressionSgr022967
SyntenySgr022967
Gene Ontology termsGO:0030154 - cell differentiation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
IPR016679 - Transcription factor, GATA, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445001.1 PREDICTED: GATA transcription factor 12-like [Cucumis melo]1.5e-12570.34Show/hide
Query:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------ADAALF---------NASFNGNSAESSAVTVIDSCNSSS
        MEAPEYFQ N Y SQF++     +D  T      EHFIVEELLDFSN  DDAV T          LF         + + N NSAESSA+TV++SCNSS 
Subjt:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------ADAALF---------NASFNGNSAESSAVTVIDSCNSSS

Query:  FSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIVSVPAKA
               S F +D+S SN  D HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVK+DE  +++ QPTAT  R AAAIFKP+IVSVPAKA
Subjt:  FSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIVSVPAKA

Query:  RSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKK--APVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
        RSKRSRA+  NWN S LLPLSPT   +EP+  A +G P+  KK    V ATAKKKD P+  G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
Subjt:  RSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKK--APVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY

Query:  KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ   LLLDH QDM+FDASNGDDYLIHQH+GPDFRQ+I
Subjt:  KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

XP_021596902.1 GATA transcription factor 12-like [Manihot esculenta]1.2e-12569.07Show/hide
Query:  MEAPEYF-QNGYCSQFAAETRHSSDN-DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSR
        MEAPE++ Q+G CSQFA E  HS D+  + GGGG +HFIVE+LLDFSN+DAV+T D + F+ +  GNS +SS VTV+DSCNSSSFSGCEP   F  D+  
Subjt:  MEAPEYF-QNGYCSQFAAETRHSSDN-DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSR

Query:  SNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENR--QPTATHG------RNAAA------IFKPDIVSVPAKARSK
         NFAD  FSS+LCVPYDDLAELEWLSNFVEESFSSED+QKL+L+SG+K + DE+SE R  QP   +G       NAAA      IF P+ VSVPAKARSK
Subjt:  SNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENR--QPTATHG------RNAAA------IFKPDIVSVPAKARSK

Query:  RSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGK-KAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL
        RSRA  CNW  SRLL LSPTTSSS+P+ VA    HP   K  VKA   K+      G   G+GRKC+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL
Subjt:  RSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGK-KAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL

Query:  VPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRA--QQQQLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        VPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRA  QQQQ  L HHQ+M+FD SNGDDYLIHQH+GPDFRQLI
Subjt:  VPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRA--QQQQLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

XP_022132107.1 GATA transcription factor 12 [Momordica charantia]3.6e-14378.17Show/hide
Query:  MEAPEYFQ---NGYC-SQFAAETRH-SSDNDTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSS-SFSGCEPNSLFLD
        ME P+YFQ     YC SQF AETRH SSDNDT GG G EHFIVEELLDFSNDD  V AD + FN   N N+  S +V+VI+SCNSS SFS CEPNS FLD
Subjt:  MEAPEYFQ---NGYC-SQFAAETRH-SSDNDTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSS-SFSGCEPNSLFLD

Query:  DMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSR-AVACNW
        D++ SN  D  FS+ELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVK+DE  + RQP+      AA IFKPDIVSVPAKARSKRSR AV  NW
Subjt:  DMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSR-AVACNW

Query:  NKSRLLPLSPTTSSSEPD--AVAVGPPHPGKKAPVKA--TAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP
        N SRLLPLSPTTSSSE D  AVA  PPHPGKKA +KA  TAKKKDCP+ AG SPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP
Subjt:  NKSRLLPLSPTTSSSEPD--AVAVGPPHPGKKAPVKA--TAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP

Query:  AASPTFVLTKHSNSHRKVLELRRQKELLRAQQQ----QLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        AASPTFVLTKHSNSHRKVLELRRQKEL R QQQ    QL+LDHHQ+M+FDASNGDDYLIHQH+GPDFRQLI
Subjt:  AASPTFVLTKHSNSHRKVLELRRQKELLRAQQQ----QLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

XP_031736569.1 GATA transcription factor 12 [Cucumis sativus]1.3e-12468.65Show/hide
Query:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------------ADAALF---------NASFNGNSAESSAVTVID
        MEAPEYFQ N Y SQF++     +    A     +HFIVEELLDFSN  DDAV+T                LF         + + N NS ESSAVTV++
Subjt:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------------ADAALF---------NASFNGNSAESSAVTVID

Query:  SCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIV
        SCNSS        S F +D+S SN  D HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVK+DE  +++ QPTAT  R+AAAIFKP+IV
Subjt:  SCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIV

Query:  SVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGKKAPVK--ATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNA
        SVPAKARSKRSRA+  NWN S LLPLS  T+ SE     +  PHP KK   K  ATAKKKD P+  G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNA
Subjt:  SVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGKKAPVK--ATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNA

Query:  CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ   LLLDH QDM+FDASNGDDYLIHQH+GPDFRQLI
Subjt:  CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

XP_038886306.1 GATA transcription factor 12-like [Benincasa hispida]1.3e-14880.59Show/hide
Query:  MEAPEYFQ-NGYCSQFAAETRHSSDNDT---AGGGGAEHFIVEELLDFSNDDAVVTAD-AALFNASFNG-----NSAESSAVTVIDSCNSSSFSGCEPNS
        MEAPEYFQ NGYCSQF+  T  SSD DT       G EHFIVEELLDFSNDD  V  D   LF  + NG     NS ESSAVTVI+SCNSSSFSGCEPNS
Subjt:  MEAPEYFQ-NGYCSQFAAETRHSSDNDT---AGGGGAEHFIVEELLDFSNDDAVVTAD-AALFNASFNG-----NSAESSAVTVIDSCNSSSFSGCEPNS

Query:  LFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRAVA
         FL+D+S SN AD HFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLEL+SGVKV++DE + +RQPTAT  RNAAAIFKPDIVSVPAKARSKRSRAV 
Subjt:  LFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRAVA

Query:  CNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKKAPVK-ATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYR
         NWN SRLLPLSPTT   EP+  A  GPPHP KK P K ATAKKKD PE  GVS GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYR
Subjt:  CNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKKAPVK-ATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYR

Query:  PAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        PAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ   LLLDHHQDM+FDASNGDDYLIHQHMGPDFRQLI
Subjt:  PAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

TrEMBL top hitse value%identityAlignment
A0A0A0LPR5 GATA transcription factor6.2e-12568.65Show/hide
Query:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------------ADAALF---------NASFNGNSAESSAVTVID
        MEAPEYFQ N Y SQF++     +    A     +HFIVEELLDFSN  DDAV+T                LF         + + N NS ESSAVTV++
Subjt:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------------ADAALF---------NASFNGNSAESSAVTVID

Query:  SCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIV
        SCNSS        S F +D+S SN  D HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVK+DE  +++ QPTAT  R+AAAIFKP+IV
Subjt:  SCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIV

Query:  SVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGKKAPVK--ATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNA
        SVPAKARSKRSRA+  NWN S LLPLS  T+ SE     +  PHP KK   K  ATAKKKD P+  G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNA
Subjt:  SVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGKKAPVK--ATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNA

Query:  CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ   LLLDH QDM+FDASNGDDYLIHQH+GPDFRQLI
Subjt:  CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

A0A1S3BBN7 GATA transcription factor7.3e-12670.34Show/hide
Query:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------ADAALF---------NASFNGNSAESSAVTVIDSCNSSS
        MEAPEYFQ N Y SQF++     +D  T      EHFIVEELLDFSN  DDAV T          LF         + + N NSAESSA+TV++SCNSS 
Subjt:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------ADAALF---------NASFNGNSAESSAVTVIDSCNSSS

Query:  FSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIVSVPAKA
               S F +D+S SN  D HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVK+DE  +++ QPTAT  R AAAIFKP+IVSVPAKA
Subjt:  FSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIVSVPAKA

Query:  RSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKK--APVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
        RSKRSRA+  NWN S LLPLSPT   +EP+  A +G P+  KK    V ATAKKKD P+  G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
Subjt:  RSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKK--APVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY

Query:  KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ   LLLDH QDM+FDASNGDDYLIHQH+GPDFRQ+I
Subjt:  KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

A0A2C9UBF9 GATA transcription factor5.6e-12669.07Show/hide
Query:  MEAPEYF-QNGYCSQFAAETRHSSDN-DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSR
        MEAPE++ Q+G CSQFA E  HS D+  + GGGG +HFIVE+LLDFSN+DAV+T D + F+ +  GNS +SS VTV+DSCNSSSFSGCEP   F  D+  
Subjt:  MEAPEYF-QNGYCSQFAAETRHSSDN-DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSR

Query:  SNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENR--QPTATHG------RNAAA------IFKPDIVSVPAKARSK
         NFAD  FSS+LCVPYDDLAELEWLSNFVEESFSSED+QKL+L+SG+K + DE+SE R  QP   +G       NAAA      IF P+ VSVPAKARSK
Subjt:  SNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENR--QPTATHG------RNAAA------IFKPDIVSVPAKARSK

Query:  RSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGK-KAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL
        RSRA  CNW  SRLL LSPTTSSS+P+ VA    HP   K  VKA   K+      G   G+GRKC+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL
Subjt:  RSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPGK-KAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL

Query:  VPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRA--QQQQLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        VPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRA  QQQQ  L HHQ+M+FD SNGDDYLIHQH+GPDFRQLI
Subjt:  VPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRA--QQQQLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

A0A5A7VCX1 GATA transcription factor7.3e-12670.34Show/hide
Query:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------ADAALF---------NASFNGNSAESSAVTVIDSCNSSS
        MEAPEYFQ N Y SQF++     +D  T      EHFIVEELLDFSN  DDAV T          LF         + + N NSAESSA+TV++SCNSS 
Subjt:  MEAPEYFQ-NGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSN--DDAVVT------ADAALF---------NASFNGNSAESSAVTVIDSCNSSS

Query:  FSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIVSVPAKA
               S F +D+S SN  D HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVK+DE  +++ QPTAT  R AAAIFKP+IVSVPAKA
Subjt:  FSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADE-ASENRQPTATHGRNAAAIFKPDIVSVPAKA

Query:  RSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKK--APVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
        RSKRSRA+  NWN S LLPLSPT   +EP+  A +G P+  KK    V ATAKKKD P+  G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
Subjt:  RSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVA-VGPPHPGKK--APVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY

Query:  KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ   LLLDH QDM+FDASNGDDYLIHQH+GPDFRQ+I
Subjt:  KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---LLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

A0A6J1BSX6 GATA transcription factor1.7e-14378.17Show/hide
Query:  MEAPEYFQ---NGYC-SQFAAETRH-SSDNDTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSS-SFSGCEPNSLFLD
        ME P+YFQ     YC SQF AETRH SSDNDT GG G EHFIVEELLDFSNDD  V AD + FN   N N+  S +V+VI+SCNSS SFS CEPNS FLD
Subjt:  MEAPEYFQ---NGYC-SQFAAETRH-SSDNDTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSS-SFSGCEPNSLFLD

Query:  DMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSR-AVACNW
        D++ SN  D  FS+ELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVK+DE  + RQP+      AA IFKPDIVSVPAKARSKRSR AV  NW
Subjt:  DMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSR-AVACNW

Query:  NKSRLLPLSPTTSSSEPD--AVAVGPPHPGKKAPVKA--TAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP
        N SRLLPLSPTTSSSE D  AVA  PPHPGKKA +KA  TAKKKDCP+ AG SPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP
Subjt:  NKSRLLPLSPTTSSSEPD--AVAVGPPHPGKKAPVKA--TAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP

Query:  AASPTFVLTKHSNSHRKVLELRRQKELLRAQQQ----QLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI
        AASPTFVLTKHSNSHRKVLELRRQKEL R QQQ    QL+LDHHQ+M+FDASNGDDYLIHQH+GPDFRQLI
Subjt:  AASPTFVLTKHSNSHRKVLELRRQKELLRAQQQ----QLLLDHHQDMMFDASNGDDYLIHQHMGPDFRQLI

SwissProt top hitse value%identityAlignment
O49741 GATA transcription factor 21.3e-4240.45Show/hide
Query:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCH-FSSELCVPYDDLAELEWLS
        D  G    +   +++LLDFSN+D        +F+AS +G S  ++        +SSSF   +  S     +  S  AD H F  ++CVP DD A LEWLS
Subjt:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCH-FSSELCVPYDDLAELEWLS

Query:  NFVEESFSSEDMQKL-ELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA---VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHP
         FV++SF+      L   ++ VK +                           S P K RSKRSRA    A  W+     P+   +   +  + A   P  
Subjt:  NFVEESFSSEDMQKL-ELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA---VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHP

Query:  GKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQ
         +         +     +     G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R Q
Subjt:  GKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQ

Query:  QQQLLLDHH
         QQ+ L HH
Subjt:  QQQLLLDHH

O49743 GATA transcription factor 42.4e-4141.5Show/hide
Query:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSN
        D  G    +   +++LLDFSND+                    SS+ TV  S  SS+ S   P S F      S      F+ +LCVP DD A LEWLS 
Subjt:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSN

Query:  FVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA----VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPG
        FV++SFS      L +                             +P+I S   K RS+RSRA    VA  W        +P + S    +V        
Subjt:  FVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA----VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPG

Query:  KKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
              A  K K    A  V+    R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Subjt:  KKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE

O82632 GATA transcription factor 91.6e-6147.25Show/hide
Query:  GGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEES
        G  + F+V++LLDFSNDD  V  D  L       +S+  S  T+ DS NSSS                  F D    S+L +P DD+AELEWLSNFVEES
Subjt:  GGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEES

Query:  FSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPD-------------IVSVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGP
        F+ ED  KL L SG+K        N Q T   G     + KP+              V+VPAKARSKRSR+ A  W  SRLL L+ +  +          
Subjt:  FSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPD-------------IVSVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGP

Query:  PHPGKKAPVKATAKKKDCPEAAGVSPGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQK
         +P KK   +   K++D      V  GE   GR+C+HCAT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQK
Subjt:  PHPGKKAPVKATAKKKDCPEAAGVSPGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQK

Query:  ELLRAQQQQLLLDHHQDMMFDASNGDDYLIH---QHMGPDFRQLI
        E +R +     L     +M   SNG+D+L+H    H+ PDFR LI
Subjt:  ELLRAQQQQLLLDHHQDMMFDASNGDDYLIH---QHMGPDFRQLI

P69781 GATA transcription factor 121.1e-7052.34Show/hide
Query:  FIVEELL-DFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLA-ELEWLSNFVEESFSS
        F V++LL DFSNDD              N   A+S+  T I   +SS+FS  +  S   D    ++     FS +LC+P DDLA ELEWLSN V+ES S 
Subjt:  FIVEELL-DFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLA-ELEWLSNFVEESFSS

Query:  EDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRAVACNWNKSRLL-------PLSPTT--SSSEPDAVAVGPP----HP
        ED+ KLEL+SG K + D  S+   P   +  +++ IF  D VSVPAKARSKRSRA ACNW    LL       P +  T  SS +  +    PP      
Subjt:  EDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRAVACNWNKSRLL-------PLSPTT--SSSEPDAVAVGPP----HP

Query:  GKKAPVKATAKKK---DCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELL
        GKK  V    ++K     PE+ G    E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ 
Subjt:  GKKAPVKATAKKK---DCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELL

Query:  RAQQQQLLLDHHQD--MMFD-ASNGDDYLIHQHMGPDFRQLI
        RA  + +   H  D  M+FD +S+GDDYLIH ++GPDFRQLI
Subjt:  RAQQQQLLLDHHQD--MMFD-ASNGDDYLIHQHMGPDFRQLI

Q9FH57 GATA transcription factor 53.6e-3738.34Show/hide
Query:  GGGAEHFIVEELLDFSNDDAV------VTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWL
        G   + F V++LLD SNDD        + A   +   S    + +  A+       SS FSGC       DD           +SEL +P DDLA LEWL
Subjt:  GGGAEHFIVEELLDFSNDDAV------VTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWL

Query:  SNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVS--VPAKARSKRSRAVACNWNKSRLLPLSPTTSSS-------------
        S+FVE+SF+          SG  +      +    T        A+ +       VPAKARSKR+R     W+        P++S S             
Subjt:  SNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVS--VPAKARSKRSRAVACNWNKSRLLPLSPTTSSS-------------

Query:  ------EPDAVAVGPPHPGKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN
              EP   +  PP P K    K +A+     E   + P   RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN
Subjt:  ------EPDAVAVGPPHPGKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN

Query:  SHRKVLELRRQKE
         HRKV+E+RR+KE
Subjt:  SHRKVLELRRQKE

Arabidopsis top hitse value%identityAlignment
AT2G45050.1 GATA transcription factor 29.0e-4440.45Show/hide
Query:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCH-FSSELCVPYDDLAELEWLS
        D  G    +   +++LLDFSN+D        +F+AS +G S  ++        +SSSF   +  S     +  S  AD H F  ++CVP DD A LEWLS
Subjt:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCH-FSSELCVPYDDLAELEWLS

Query:  NFVEESFSSEDMQKL-ELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA---VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHP
         FV++SF+      L   ++ VK +                           S P K RSKRSRA    A  W+     P+   +   +  + A   P  
Subjt:  NFVEESFSSEDMQKL-ELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA---VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHP

Query:  GKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQ
         +         +     +     G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R Q
Subjt:  GKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQ

Query:  QQQLLLDHH
         QQ+ L HH
Subjt:  QQQLLLDHH

AT3G60530.1 GATA transcription factor 41.7e-4241.5Show/hide
Query:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSN
        D  G    +   +++LLDFSND+                    SS+ TV  S  SS+ S   P S F      S      F+ +LCVP DD A LEWLS 
Subjt:  DTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSN

Query:  FVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA----VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPG
        FV++SFS      L +                             +P+I S   K RS+RSRA    VA  W        +P + S    +V        
Subjt:  FVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRA----VACNWNKSRLLPLSPTTSSSEPDAVAVGPPHPG

Query:  KKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
              A  K K    A  V+    R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Subjt:  KKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE

AT4G32890.1 GATA transcription factor 91.1e-6247.25Show/hide
Query:  GGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEES
        G  + F+V++LLDFSNDD  V  D  L       +S+  S  T+ DS NSSS                  F D    S+L +P DD+AELEWLSNFVEES
Subjt:  GGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWLSNFVEES

Query:  FSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPD-------------IVSVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGP
        F+ ED  KL L SG+K        N Q T   G     + KP+              V+VPAKARSKRSR+ A  W  SRLL L+ +  +          
Subjt:  FSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPD-------------IVSVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGP

Query:  PHPGKKAPVKATAKKKDCPEAAGVSPGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQK
         +P KK   +   K++D      V  GE   GR+C+HCAT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQK
Subjt:  PHPGKKAPVKATAKKKDCPEAAGVSPGE---GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQK

Query:  ELLRAQQQQLLLDHHQDMMFDASNGDDYLIH---QHMGPDFRQLI
        E +R +     L     +M   SNG+D+L+H    H+ PDFR LI
Subjt:  ELLRAQQQQLLLDHHQDMMFDASNGDDYLIH---QHMGPDFRQLI

AT5G25830.1 GATA transcription factor 127.8e-7252.34Show/hide
Query:  FIVEELL-DFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLA-ELEWLSNFVEESFSS
        F V++LL DFSNDD              N   A+S+  T I   +SS+FS  +  S   D    ++     FS +LC+P DDLA ELEWLSN V+ES S 
Subjt:  FIVEELL-DFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLA-ELEWLSNFVEESFSS

Query:  EDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRAVACNWNKSRLL-------PLSPTT--SSSEPDAVAVGPP----HP
        ED+ KLEL+SG K + D  S+   P   +  +++ IF  D VSVPAKARSKRSRA ACNW    LL       P +  T  SS +  +    PP      
Subjt:  EDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRAVACNWNKSRLL-------PLSPTT--SSSEPDAVAVGPP----HP

Query:  GKKAPVKATAKKK---DCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELL
        GKK  V    ++K     PE+ G    E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ 
Subjt:  GKKAPVKATAKKK---DCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELL

Query:  RAQQQQLLLDHHQD--MMFD-ASNGDDYLIHQHMGPDFRQLI
        RA  + +   H  D  M+FD +S+GDDYLIH ++GPDFRQLI
Subjt:  RAQQQQLLLDHHQD--MMFD-ASNGDDYLIHQHMGPDFRQLI

AT5G66320.1 GATA transcription factor 52.5e-3838.34Show/hide
Query:  GGGAEHFIVEELLDFSNDDAV------VTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWL
        G   + F V++LLD SNDD        + A   +   S    + +  A+       SS FSGC       DD           +SEL +P DDLA LEWL
Subjt:  GGGAEHFIVEELLDFSNDDAV------VTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSELCVPYDDLAELEWL

Query:  SNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVS--VPAKARSKRSRAVACNWNKSRLLPLSPTTSSS-------------
        S+FVE+SF+          SG  +      +    T        A+ +       VPAKARSKR+R     W+        P++S S             
Subjt:  SNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVS--VPAKARSKRSRAVACNWNKSRLLPLSPTTSSS-------------

Query:  ------EPDAVAVGPPHPGKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN
              EP   +  PP P K    K +A+     E   + P   RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN
Subjt:  ------EPDAVAVGPPHPGKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN

Query:  SHRKVLELRRQKE
         HRKV+E+RR+KE
Subjt:  SHRKVLELRRQKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTCCCGAGTATTTCCAGAATGGCTATTGCTCGCAATTCGCCGCCGAAACTCGCCACTCCTCCGATAATGACACGGCAGGCGGCGGCGGCGCGGAGCATTTCAT
CGTTGAGGAACTCCTCGACTTCTCCAACGACGACGCCGTCGTCACTGCCGACGCTGCGTTGTTCAATGCGAGCTTCAATGGCAATTCCGCCGAATCCTCCGCCGTTACCG
TCATTGATAGTTGCAATTCGTCTTCGTTTTCCGGCTGCGAACCAAATTCGTTATTTCTGGACGACATGAGTCGCTCTAATTTTGCCGACTGCCATTTCTCCAGCGAACTC
TGCGTTCCGTACGACGATTTAGCTGAGCTGGAATGGCTTTCGAACTTCGTAGAGGAATCGTTTTCCAGCGAGGACATGCAGAAGCTGGAACTGCTCTCCGGCGTCAAAGT
CAAAGCCGACGAAGCCTCCGAAAACCGACAACCCACCGCCACCCACGGCCGAAACGCCGCCGCAATCTTCAAACCCGACATCGTTTCGGTGCCGGCCAAGGCCCGCAGCA
AACGCTCCCGTGCCGTCGCATGCAATTGGAATAAATCCCGCCTACTCCCCCTCTCACCGACCACCTCCTCCTCCGAACCCGACGCTGTCGCCGTTGGACCACCGCATCCC
GGAAAGAAAGCCCCCGTGAAGGCCACCGCAAAGAAGAAGGACTGCCCAGAGGCCGCTGGCGTATCCCCCGGAGAAGGTCGCAAGTGCATGCACTGCGCCACCGACAAGAC
GCCGCAGTGGCGGACGGGGCCGATGGGCCCAAAGACGCTCTGTAACGCCTGTGGCGTCCGGTACAAGTCCGGTAGGCTGGTGCCGGAGTACCGCCCAGCTGCGAGCCCCA
CCTTCGTCCTCACAAAACACTCCAACTCTCACCGGAAGGTGTTGGAGCTCCGGCGGCAGAAGGAGCTGCTGAGGGCGCAGCAGCAACAGCTGCTTCTGGATCATCATCAG
GATATGATGTTCGACGCATCCAACGGCGACGATTATCTGATCCACCAGCACATGGGGCCCGATTTCCGGCAGCTGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTCCCGAGTATTTCCAGAATGGCTATTGCTCGCAATTCGCCGCCGAAACTCGCCACTCCTCCGATAATGACACGGCAGGCGGCGGCGGCGCGGAGCATTTCAT
CGTTGAGGAACTCCTCGACTTCTCCAACGACGACGCCGTCGTCACTGCCGACGCTGCGTTGTTCAATGCGAGCTTCAATGGCAATTCCGCCGAATCCTCCGCCGTTACCG
TCATTGATAGTTGCAATTCGTCTTCGTTTTCCGGCTGCGAACCAAATTCGTTATTTCTGGACGACATGAGTCGCTCTAATTTTGCCGACTGCCATTTCTCCAGCGAACTC
TGCGTTCCGTACGACGATTTAGCTGAGCTGGAATGGCTTTCGAACTTCGTAGAGGAATCGTTTTCCAGCGAGGACATGCAGAAGCTGGAACTGCTCTCCGGCGTCAAAGT
CAAAGCCGACGAAGCCTCCGAAAACCGACAACCCACCGCCACCCACGGCCGAAACGCCGCCGCAATCTTCAAACCCGACATCGTTTCGGTGCCGGCCAAGGCCCGCAGCA
AACGCTCCCGTGCCGTCGCATGCAATTGGAATAAATCCCGCCTACTCCCCCTCTCACCGACCACCTCCTCCTCCGAACCCGACGCTGTCGCCGTTGGACCACCGCATCCC
GGAAAGAAAGCCCCCGTGAAGGCCACCGCAAAGAAGAAGGACTGCCCAGAGGCCGCTGGCGTATCCCCCGGAGAAGGTCGCAAGTGCATGCACTGCGCCACCGACAAGAC
GCCGCAGTGGCGGACGGGGCCGATGGGCCCAAAGACGCTCTGTAACGCCTGTGGCGTCCGGTACAAGTCCGGTAGGCTGGTGCCGGAGTACCGCCCAGCTGCGAGCCCCA
CCTTCGTCCTCACAAAACACTCCAACTCTCACCGGAAGGTGTTGGAGCTCCGGCGGCAGAAGGAGCTGCTGAGGGCGCAGCAGCAACAGCTGCTTCTGGATCATCATCAG
GATATGATGTTCGACGCATCCAACGGCGACGATTATCTGATCCACCAGCACATGGGGCCCGATTTCCGGCAGCTGATCTGA
Protein sequenceShow/hide protein sequence
MEAPEYFQNGYCSQFAAETRHSSDNDTAGGGGAEHFIVEELLDFSNDDAVVTADAALFNASFNGNSAESSAVTVIDSCNSSSFSGCEPNSLFLDDMSRSNFADCHFSSEL
CVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKADEASENRQPTATHGRNAAAIFKPDIVSVPAKARSKRSRAVACNWNKSRLLPLSPTTSSSEPDAVAVGPPHP
GKKAPVKATAKKKDCPEAAGVSPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQLLLDHHQ
DMMFDASNGDDYLIHQHMGPDFRQLI