; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004225 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004225
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionGATA transcription factor
Genome locationscaffold92:961470..963436
RNA-Seq ExpressionMS004225
SyntenyMS004225
Gene Ontology termsGO:0030154 - cell differentiation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
IPR016679 - Transcription factor, GATA, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142426.1 GATA transcription factor 8 [Cucumis sativus]3.5e-15985.71Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG
        MIGN+F DEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP SDSVFSANSNSDLSAELSVPYEDIV L+WL+NFVEDSFCG 
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG

Query:  SLTMNKEEQPKDLSH--NQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPP-IQLISPASSVTE-TTHDQALPLPP
         LTMNKEE  KDL+H  NQFQTSSPVSVLESSSSCSSDK+ +PRSPEPTVATPGQQRGRARSKRPRPATFSPR P IQ ISPASSVTE TT DQAL L P
Subjt:  SLTMNKEEQPKDLSH--NQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPP-IQLISPASSVTE-TTHDQALPLPP

Query:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPL---DQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFP
        KA+SD+DNFAESRPL+K+PKH   +   + KNKK+KLSFSLAP       NQN PS  QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFP
Subjt:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPL---DQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFP

Query:  EYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM
        EYRPAASPTFIPSLHSNSHKKVLEMR+KTD+NTA ITISVQPELIPN NSAISMDYM
Subjt:  EYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM

XP_008446884.1 PREDICTED: GATA transcription factor 8 [Cucumis melo]1.4e-16387.04Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG
        MIGN+F DEIDCGSFFD IDDLLDFPVEDVD GLPPAKGGDS NSFPTIWPTHSESLP SDSVFSANSNSDLSAELSVPYEDIV L+WL+NFVEDSFCGG
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG

Query:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTE-TTHDQALPLPPKAS
        SLTMNKEE PKDL+HNQFQTSSPVSVLESSSSCSSDK+ +PRSPEPTVATPGQQRGRARSKRPRPATF+PRPPIQLISPASSVTE TT DQ L L PKA 
Subjt:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTE-TTHDQALPLPPKAS

Query:  SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLAPLDQA---NQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
        SD++NFAESRP +K+PKH   AS  QK KNKK+KLSFSLAP  +A   NQN PS  QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
Subjt:  SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLAPLDQA---NQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY

Query:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM
        RPAASPTFIPSLHSNSHKKVLEMR+KTD+NTA ITISVQPELIPN NSAISMDYM
Subjt:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM

XP_022966932.1 GATA transcription factor 8-like [Cucurbita maxima]1.3e-15883.71Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFC-G
        M+GNSF D++DCGSFFDHIDDLLDFPVEDVDAGLPPA GGDS NSFPTIW T SE+LP SDSVFSAN NSDLSA+LSVPYEDIV LEWLSNFVEDSFC G
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFC-G

Query:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHD---QALP-LPP
        GSL M KEE PK L+H QFQTSSPVSVLESSSSCSSDKS PRSPEPT+ATP QQRGRARSKRPRPATF PRPPIQLISPASSV+ETTHD   Q L  + P
Subjt:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHD---QALP-LPP

Query:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR
        K +SDSDNFAESRPL+KMPKH      +QKKNKK+KLSFSLAPLD ++QNSPS  QS+RKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR
Subjt:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR

Query:  PAASPTFIPSLHSNSHKKVLEMRSKTDDNTA--IITISVQPELIPNTNSAISMDYM
        PAASPTF+PSLHSNSHKKVLEMR+KTD+ TA  +ITI+VQPELIPNTNSAISMDYM
Subjt:  PAASPTFIPSLHSNSHKKVLEMRSKTDDNTA--IITISVQPELIPNTNSAISMDYM

XP_023542369.1 GATA transcription factor 8-like [Cucurbita pepo subsp. pepo]1.3e-15883.75Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFC-G
        M+GNSF D++DCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDS NSFPTIW T SE+LP SDSVFSAN NSDLSA+LSVPYEDIV LEWLSNFVEDSFC G
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFC-G

Query:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHD---QALP-LPP
        GSL    +E+PKDL+H QFQTSSPVSVLESSSSCSSDKS PRSPEPT+ATP QQRGRARSKRPRPATF PRPPIQLISPASSV+ETTHD   Q L  + P
Subjt:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHD---QALP-LPP

Query:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLD-QANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
        K +SDSDNFAESRPL KMPKH   A  +QKKNKK+KLSFSLAPLD  A+QNSPS  QS+RKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
Subjt:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLD-QANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY

Query:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTA--IITISVQPELIPNTNSAISMDYM
        RPAASPTF+PSLHSNSHKKVLEMR+KTD+ TA  +ITI+VQPELIPNTNSAISMDYM
Subjt:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTA--IITISVQPELIPNTNSAISMDYM

XP_038892635.1 GATA transcription factor 8-like [Benincasa hispida]2.6e-16288.14Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG
        MIGN+F DEIDCGSFFD IDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESL  SDSVFS+NSNSDLSAELSVPYEDIV LEWLSNFVEDSFCGG
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG

Query:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTET-THDQALPLPPKAS
        SLTMNKEE+ KDL+HNQFQTSSPVSVLESSSSCSSDKS +PRSPEPTVATPG QRGRARSKRPRPATFSPRPPIQLISPASSV+ET T DQAL L PKA+
Subjt:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTET-THDQALPLPPKAS

Query:  -SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLA-PLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR
         SD++NFAESRPLIK+PKH   AS +QK KNKK+KLSFSLA P +  NQNSPS  QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR
Subjt:  -SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLA-PLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR

Query:  PAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM
        PAASPTFIPSLHSNSHKKVLEMR+K D+NTA ITISVQPELIPNTNSAISMDY+
Subjt:  PAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM

TrEMBL top hitse value%identityAlignment
A0A0A0KRL5 GATA transcription factor1.7e-15985.71Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG
        MIGN+F DEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP SDSVFSANSNSDLSAELSVPYEDIV L+WL+NFVEDSFCG 
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG

Query:  SLTMNKEEQPKDLSH--NQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPP-IQLISPASSVTE-TTHDQALPLPP
         LTMNKEE  KDL+H  NQFQTSSPVSVLESSSSCSSDK+ +PRSPEPTVATPGQQRGRARSKRPRPATFSPR P IQ ISPASSVTE TT DQAL L P
Subjt:  SLTMNKEEQPKDLSH--NQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPP-IQLISPASSVTE-TTHDQALPLPP

Query:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPL---DQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFP
        KA+SD+DNFAESRPL+K+PKH   +   + KNKK+KLSFSLAP       NQN PS  QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFP
Subjt:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPL---DQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFP

Query:  EYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM
        EYRPAASPTFIPSLHSNSHKKVLEMR+KTD+NTA ITISVQPELIPN NSAISMDYM
Subjt:  EYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM

A0A1S3BGY0 GATA transcription factor6.6e-16487.04Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG
        MIGN+F DEIDCGSFFD IDDLLDFPVEDVD GLPPAKGGDS NSFPTIWPTHSESLP SDSVFSANSNSDLSAELSVPYEDIV L+WL+NFVEDSFCGG
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG

Query:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTE-TTHDQALPLPPKAS
        SLTMNKEE PKDL+HNQFQTSSPVSVLESSSSCSSDK+ +PRSPEPTVATPGQQRGRARSKRPRPATF+PRPPIQLISPASSVTE TT DQ L L PKA 
Subjt:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTE-TTHDQALPLPPKAS

Query:  SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLAPLDQA---NQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
        SD++NFAESRP +K+PKH   AS  QK KNKK+KLSFSLAP  +A   NQN PS  QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
Subjt:  SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLAPLDQA---NQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY

Query:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM
        RPAASPTFIPSLHSNSHKKVLEMR+KTD+NTA ITISVQPELIPN NSAISMDYM
Subjt:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM

A0A5D3CBQ4 GATA transcription factor6.6e-16487.04Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG
        MIGN+F DEIDCGSFFD IDDLLDFPVEDVD GLPPAKGGDS NSFPTIWPTHSESLP SDSVFSANSNSDLSAELSVPYEDIV L+WL+NFVEDSFCGG
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG

Query:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTE-TTHDQALPLPPKAS
        SLTMNKEE PKDL+HNQFQTSSPVSVLESSSSCSSDK+ +PRSPEPTVATPGQQRGRARSKRPRPATF+PRPPIQLISPASSVTE TT DQ L L PKA 
Subjt:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKS-RPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTE-TTHDQALPLPPKAS

Query:  SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLAPLDQA---NQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
        SD++NFAESRP +K+PKH   AS  QK KNKK+KLSFSLAP  +A   NQN PS  QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
Subjt:  SDSDNFAESRPLIKMPKHSPPASALQK-KNKKVKLSFSLAPLDQA---NQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY

Query:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM
        RPAASPTFIPSLHSNSHKKVLEMR+KTD+NTA ITISVQPELIPN NSAISMDYM
Subjt:  RPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM

A0A6J1GZT2 GATA transcription factor1.6e-15784.66Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG
        MIGNS  DEIDCGSFFDHIDDLLDFPVEDVD GLPP   GDSANSFPTIWPTHSESLP S SVFSAN N+DLSA+LSVPYEDIV LEWLSNFVEDSF GG
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGG

Query:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTET-THDQALPLPPKA
        SL+MNKEE PKDL++NQFQTSSPVSVLESSSSCSSDKS P   SPE TVATP QQRGRARSKRPRPATFSPRPPIQLISPASSVTET T DQ L L PKA
Subjt:  SLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTET-THDQALPLPPKA

Query:  SSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPA
         SD+DNFAES+PLIKMPKH   +     KNKK+KLSFSLAPLD A  N  S P SVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPA
Subjt:  SSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPA

Query:  ASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM
        ASPTFIPSLHSNSHKKVLEMR+K D+NT  ITISVQPELIPNTNSAI+MDYM
Subjt:  ASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPELIPNTNSAISMDYM

A0A6J1HPB8 GATA transcription factor6.4e-15983.71Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFC-G
        M+GNSF D++DCGSFFDHIDDLLDFPVEDVDAGLPPA GGDS NSFPTIW T SE+LP SDSVFSAN NSDLSA+LSVPYEDIV LEWLSNFVEDSFC G
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFC-G

Query:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHD---QALP-LPP
        GSL M KEE PK L+H QFQTSSPVSVLESSSSCSSDKS PRSPEPT+ATP QQRGRARSKRPRPATF PRPPIQLISPASSV+ETTHD   Q L  + P
Subjt:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHD---QALP-LPP

Query:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR
        K +SDSDNFAESRPL+KMPKH      +QKKNKK+KLSFSLAPLD ++QNSPS  QS+RKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR
Subjt:  KASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYR

Query:  PAASPTFIPSLHSNSHKKVLEMRSKTDDNTA--IITISVQPELIPNTNSAISMDYM
        PAASPTF+PSLHSNSHKKVLEMR+KTD+ TA  +ITI+VQPELIPNTNSAISMDYM
Subjt:  PAASPTFIPSLHSNSHKKVLEMRSKTDDNTA--IITISVQPELIPNTNSAISMDYM

SwissProt top hitse value%identityAlignment
O82632 GATA transcription factor 93.8e-3135.86Show/hide
Query:  IDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKDLSHNQF
        +DDLLDF  +D +         D  N+ P      + +L  S +  S  ++    ++L +P +DI  LEWLSNFVE+SF G     +K      L + Q 
Subjt:  IDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKDLSHNQF

Query:  QTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSP
          S+   +++       D       E  VA P     +ARSKR R                 S   T   + L L     +DSD   E+ P         
Subjt:  QTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSP

Query:  PASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMR
             +KK ++VK       +D     S       R+C+HC   KTPQWR GPMGPKTLCNACGVRYKSGRL PEYRPA+SPTF+ + HSNSH+KV+E+R
Subjt:  PASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMR

Query:  SKTD
         + +
Subjt:  SKTD

Q6DBP8 GATA transcription factor 114.4e-3234.62Show/hide
Query:  GSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKD
        G FFD + + LD P++D+D        GD  + F  + P   +  P   S  ++  +    A       +I  L+   ++  ++    S T+++   P +
Subjt:  GSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKD

Query:  LSHNQ-FQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLI
        +  ++ FQ+ SPVSVLE+S    S  +   +    +A P +     RSKR RP T      +  + P+              P K    +    ES    
Subjt:  LSHNQ-FQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLI

Query:  KMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSH
           +H+       KK +K+ L+           NS      VRKC HCE TKTPQWR GP GPKTLCNACGVR++SGRL PEYRPA+SPTFIP++HSNSH
Subjt:  KMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSH

Query:  KKVLEMRSKTDD
        +K++EMR K D+
Subjt:  KKVLEMRSKTDD

Q8VZP4 GATA transcription factor 102.0e-3234.67Show/hide
Query:  GSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKD
        G FFD + + LD P+ED+D+       GD    F  + P   +  P   S  ++      +A + +P   I  L+   +   ++  G + T ++   P D
Subjt:  GSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKD

Query:  LSHNQ-FQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLI
        +  +  FQ+ +PVSVLE+S    S ++   S    +A P +     RSKR RP T      ++L                P  P+ S+  ++  E     
Subjt:  LSHNQ-FQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLI

Query:  KMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSH
           +H+       KK +K+ L   +   + +   S      VR C HCE   TPQWR GP GPKTLCNACGVR+KSGRL PEYRPA+SPTFIPS+HSNSH
Subjt:  KMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSH

Query:  KKVLEMRSKTDD-NTAIITISVQ
        +K++EMR K D+ +T++I   +Q
Subjt:  KKVLEMRSKTDD-NTAIITISVQ

Q9FH57 GATA transcription factor 59.0e-3337.59Show/hide
Query:  SAELSVPYEDIVHLEWLSNFVEDSF---CGGSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSP
        ++ELS+P +D+ +LEWLS+FVEDSF    G +LT    E+P  L+ ++     PV+ +       ++++  +SP P          +ARSKR R      
Subjt:  SAELSVPYEDIVHLEWLSNFVEDSF---CGGSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSP

Query:  RPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRA
           +++ S  SS +             +   S  F+ +  L  +     P    + K +  +  FS   L Q         Q  RKC HC + KTPQWRA
Subjt:  RPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRA

Query:  GPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK----TDDNTAIITISVQPELIPN
        GPMG KTLCNACGVRYKSGRL PEYRPA SPTF   LHSN H+KV+EMR K    +D+ T +  +   P+ +P+
Subjt:  GPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK----TDDNTAIITISVQPELIPN

Q9SV30 GATA transcription factor 86.6e-6846.24Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP-VSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCG
        MIG SF +++DCG+FFD++DDL+DFP  D+D G     G   ++SFPTIW TH ++ P  SD +FS+N+NSD S EL VP+EDIV +E   +FVE++   
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP-VSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCG

Query:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASS
                      SH+QF++SSPVSVLESSSS S   +       ++  PG + GR R+KR       PRPP+Q                         
Subjt:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASS

Query:  DSDNFAESRPLIKMPK-----HSPPASALQKKNKKVKLSFSLAPLD-QANQNS----PSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGR
        D+    +SR +I++PK     H+   +  +KK  K+  S S + +D + N N+     S    +RKCMHCE+TKTPQWR GPMGPKTLCNACGVRYKSGR
Subjt:  DSDNFAESRPLIKMPK-----HSPPASALQKKNKKVKLSFSLAPLD-QANQNS----PSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGR

Query:  LFPEYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPE-LIPNTNSAISMD
        LFPEYRPAASPTF P+LHSNSHKKV EMR+K   + + IT     + LIPN N+ I +D
Subjt:  LFPEYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPE-LIPNTNSAISMD

Arabidopsis top hitse value%identityAlignment
AT1G08000.1 GATA transcription factor 101.4e-3334.67Show/hide
Query:  GSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKD
        G FFD + + LD P+ED+D+       GD    F  + P   +  P   S  ++      +A + +P   I  L+   +   ++  G + T ++   P D
Subjt:  GSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQPKD

Query:  LSHNQ-FQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLI
        +  +  FQ+ +PVSVLE+S    S ++   S    +A P +     RSKR RP T      ++L                P  P+ S+  ++  E     
Subjt:  LSHNQ-FQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLI

Query:  KMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSH
           +H+       KK +K+ L   +   + +   S      VR C HCE   TPQWR GP GPKTLCNACGVR+KSGRL PEYRPA+SPTFIPS+HSNSH
Subjt:  KMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSH

Query:  KKVLEMRSKTDD-NTAIITISVQ
        +K++EMR K D+ +T++I   +Q
Subjt:  KKVLEMRSKTDD-NTAIITISVQ

AT3G54810.1 Plant-specific GATA-type zinc finger transcription factor family protein4.7e-6946.24Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP-VSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCG
        MIG SF +++DCG+FFD++DDL+DFP  D+D G     G   ++SFPTIW TH ++ P  SD +FS+N+NSD S EL VP+EDIV +E   +FVE++   
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP-VSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCG

Query:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASS
                      SH+QF++SSPVSVLESSSS S   +       ++  PG + GR R+KR       PRPP+Q                         
Subjt:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASS

Query:  DSDNFAESRPLIKMPK-----HSPPASALQKKNKKVKLSFSLAPLD-QANQNS----PSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGR
        D+    +SR +I++PK     H+   +  +KK  K+  S S + +D + N N+     S    +RKCMHCE+TKTPQWR GPMGPKTLCNACGVRYKSGR
Subjt:  DSDNFAESRPLIKMPK-----HSPPASALQKKNKKVKLSFSLAPLD-QANQNS----PSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGR

Query:  LFPEYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPE-LIPNTNSAISMD
        LFPEYRPAASPTF P+LHSNSHKKV EMR+K   + + IT     + LIPN N+ I +D
Subjt:  LFPEYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPE-LIPNTNSAISMD

AT3G54810.2 Plant-specific GATA-type zinc finger transcription factor family protein4.7e-6946.24Show/hide
Query:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP-VSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCG
        MIG SF +++DCG+FFD++DDL+DFP  D+D G     G   ++SFPTIW TH ++ P  SD +FS+N+NSD S EL VP+EDIV +E   +FVE++   
Subjt:  MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLP-VSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCG

Query:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASS
                      SH+QF++SSPVSVLESSSS S   +       ++  PG + GR R+KR       PRPP+Q                         
Subjt:  GSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASS

Query:  DSDNFAESRPLIKMPK-----HSPPASALQKKNKKVKLSFSLAPLD-QANQNS----PSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGR
        D+    +SR +I++PK     H+   +  +KK  K+  S S + +D + N N+     S    +RKCMHCE+TKTPQWR GPMGPKTLCNACGVRYKSGR
Subjt:  DSDNFAESRPLIKMPK-----HSPPASALQKKNKKVKLSFSLAPLD-QANQNS----PSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGR

Query:  LFPEYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPE-LIPNTNSAISMD
        LFPEYRPAASPTF P+LHSNSHKKV EMR+K   + + IT     + LIPN N+ I +D
Subjt:  LFPEYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITISVQPE-LIPNTNSAISMD

AT5G66320.1 GATA transcription factor 56.4e-3437.59Show/hide
Query:  SAELSVPYEDIVHLEWLSNFVEDSF---CGGSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSP
        ++ELS+P +D+ +LEWLS+FVEDSF    G +LT    E+P  L+ ++     PV+ +       ++++  +SP P          +ARSKR R      
Subjt:  SAELSVPYEDIVHLEWLSNFVEDSF---CGGSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSP

Query:  RPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRA
           +++ S  SS +             +   S  F+ +  L  +     P    + K +  +  FS   L Q         Q  RKC HC + KTPQWRA
Subjt:  RPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRA

Query:  GPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK----TDDNTAIITISVQPELIPN
        GPMG KTLCNACGVRYKSGRL PEYRPA SPTF   LHSN H+KV+EMR K    +D+ T +  +   P+ +P+
Subjt:  GPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK----TDDNTAIITISVQPELIPN

AT5G66320.2 GATA transcription factor 56.4e-3437.59Show/hide
Query:  SAELSVPYEDIVHLEWLSNFVEDSF---CGGSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSP
        ++ELS+P +D+ +LEWLS+FVEDSF    G +LT    E+P  L+ ++     PV+ +       ++++  +SP P          +ARSKR R      
Subjt:  SAELSVPYEDIVHLEWLSNFVEDSF---CGGSLTMNKEEQPKDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSP

Query:  RPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRA
           +++ S  SS +             +   S  F+ +  L  +     P    + K +  +  FS   L Q         Q  RKC HC + KTPQWRA
Subjt:  RPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSPPASALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRA

Query:  GPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK----TDDNTAIITISVQPELIPN
        GPMG KTLCNACGVRYKSGRL PEYRPA SPTF   LHSN H+KV+EMR K    +D+ T +  +   P+ +P+
Subjt:  GPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK----TDDNTAIITISVQPELIPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGGAAATAGCTTCGCCGACGAGATAGACTGCGGCAGCTTCTTCGACCACATCGACGACCTCCTCGATTTTCCGGTCGAGGATGTCGACGCCGGCTTGCCGCCGGC
GAAGGGCGGCGACTCGGCCAACTCGTTCCCCACCATTTGGCCGACTCACTCCGAGTCACTCCCCGTTTCCGACTCGGTCTTCTCCGCCAATAGCAACTCCGACTTGTCGG
CTGAGCTCTCTGTGCCGTATGAAGACATTGTTCATTTGGAGTGGCTTTCAAACTTTGTTGAGGATTCCTTCTGTGGAGGAAGCCTTACAATGAACAAAGAGGAGCAGCCT
AAGGATTTGAGTCATAACCAATTCCAAACTTCTAGCCCTGTTTCTGTTCTTGAAAGCAGCAGCTCTTGCTCTAGTGACAAGAGCCGGCCCCGTAGCCCCGAACCGACTGT
CGCCACTCCGGGGCAGCAGCGTGGCCGTGCTCGCAGCAAGCGCCCTCGCCCTGCAACCTTCAGCCCTCGCCCCCCAATTCAGCTCATTTCCCCTGCCTCTTCGGTCACCG
AAACAACCCATGATCAGGCATTGCCCCTTCCCCCAAAGGCCTCCTCGGACTCCGACAACTTTGCCGAGTCCCGGCCCTTGATCAAAATGCCAAAGCACAGCCCCCCCGCC
TCGGCGTTGCAGAAGAAGAACAAGAAAGTCAAGTTGTCGTTTTCGCTCGCACCTTTGGATCAGGCGAATCAAAACTCGCCATCGCCACCACAATCAGTCAGGAAATGTAT
GCACTGTGAGATAACCAAGACTCCACAATGGAGGGCAGGACCAATGGGGCCGAAAACTCTCTGCAACGCCTGCGGCGTTCGGTACAAATCCGGGAGACTCTTCCCCGAGT
ACCGGCCCGCAGCCAGCCCGACTTTCATCCCATCATTGCACTCAAACTCCCACAAGAAGGTGCTCGAAATGAGAAGCAAGACCGACGACAATACTGCAATTATCACCATA
AGCGTCCAGCCCGAGCTCATTCCAAACACAAACAGTGCGATTTCGATGGACTACATG
mRNA sequenceShow/hide mRNA sequence
ATGATCGGAAATAGCTTCGCCGACGAGATAGACTGCGGCAGCTTCTTCGACCACATCGACGACCTCCTCGATTTTCCGGTCGAGGATGTCGACGCCGGCTTGCCGCCGGC
GAAGGGCGGCGACTCGGCCAACTCGTTCCCCACCATTTGGCCGACTCACTCCGAGTCACTCCCCGTTTCCGACTCGGTCTTCTCCGCCAATAGCAACTCCGACTTGTCGG
CTGAGCTCTCTGTGCCGTATGAAGACATTGTTCATTTGGAGTGGCTTTCAAACTTTGTTGAGGATTCCTTCTGTGGAGGAAGCCTTACAATGAACAAAGAGGAGCAGCCT
AAGGATTTGAGTCATAACCAATTCCAAACTTCTAGCCCTGTTTCTGTTCTTGAAAGCAGCAGCTCTTGCTCTAGTGACAAGAGCCGGCCCCGTAGCCCCGAACCGACTGT
CGCCACTCCGGGGCAGCAGCGTGGCCGTGCTCGCAGCAAGCGCCCTCGCCCTGCAACCTTCAGCCCTCGCCCCCCAATTCAGCTCATTTCCCCTGCCTCTTCGGTCACCG
AAACAACCCATGATCAGGCATTGCCCCTTCCCCCAAAGGCCTCCTCGGACTCCGACAACTTTGCCGAGTCCCGGCCCTTGATCAAAATGCCAAAGCACAGCCCCCCCGCC
TCGGCGTTGCAGAAGAAGAACAAGAAAGTCAAGTTGTCGTTTTCGCTCGCACCTTTGGATCAGGCGAATCAAAACTCGCCATCGCCACCACAATCAGTCAGGAAATGTAT
GCACTGTGAGATAACCAAGACTCCACAATGGAGGGCAGGACCAATGGGGCCGAAAACTCTCTGCAACGCCTGCGGCGTTCGGTACAAATCCGGGAGACTCTTCCCCGAGT
ACCGGCCCGCAGCCAGCCCGACTTTCATCCCATCATTGCACTCAAACTCCCACAAGAAGGTGCTCGAAATGAGAAGCAAGACCGACGACAATACTGCAATTATCACCATA
AGCGTCCAGCCCGAGCTCATTCCAAACACAAACAGTGCGATTTCGATGGACTACATG
Protein sequenceShow/hide protein sequence
MIGNSFADEIDCGSFFDHIDDLLDFPVEDVDAGLPPAKGGDSANSFPTIWPTHSESLPVSDSVFSANSNSDLSAELSVPYEDIVHLEWLSNFVEDSFCGGSLTMNKEEQP
KDLSHNQFQTSSPVSVLESSSSCSSDKSRPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTHDQALPLPPKASSDSDNFAESRPLIKMPKHSPPA
SALQKKNKKVKLSFSLAPLDQANQNSPSPPQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSKTDDNTAIITI
SVQPELIPNTNSAISMDYM