; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr001229 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr001229
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGATA transcription factor
Genome locationtig00000800:38675..40664
RNA-Seq ExpressionSgr001229
SyntenySgr001229
Gene Ontology termsGO:0030154 - cell differentiation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
IPR016679 - Transcription factor, GATA, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446884.1 PREDICTED: GATA transcription factor 8 [Cucumis melo]4.6e-16488.67Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGNNF+DEIDCGSFFD IDDLLDFPVEDVD+GLP   GGDS NSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQL+WL+NFVEDSFCGG
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAPS
        SL MNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDK+L PRSPEPTVATPGQQRGRARSKRPRPATF+PRPPIQLISPASSVTETT   + LQL PKAPS
Subjt:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAPS

Query:  DSENFAESRPLLKMPKHG-SDSAMQK-KNKKIKLSFSLA-PLEA---NQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP
        D+ENFAESRP +K+PKHG + S  QK KNKKIKLSFSLA PLEA   NQN P SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP
Subjt:  DSENFAESRPLLKMPKHG-SDSAMQK-KNKKIKLSFSLA-PLEA---NQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP

Query:  AASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        AASPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPN NSAISMDYM
Subjt:  AASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

XP_022957293.1 GATA transcription factor 8 [Cucurbita moschata]3.1e-16086.89Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGN+ +DEIDCGSFFDHIDDLLDFPVEDVD GLP    GDSANSFPTIWPTHSESLPGS SVFSAN N+DLSA+LSVPYEDIVQLEWLSNFVEDSF GG
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAP
        SL+MNKEEPKDLT+NQFQTSSPVSVLESSSSCSSDKSLP   SPE TVATP QQRGRARSKRPRPATFSPRPPIQLISPASSVTET    + LQLAPKAP
Subjt:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAP

Query:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLAPLEA---NQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
        SD++NFAES+PL+KMPKHGS S +Q  KNKKIKLSFSLAPL+A   NQ+SPS SVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
Subjt:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLAPLEA---NQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA

Query:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        SPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPNTNSAI+MDYM
Subjt:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

XP_022966932.1 GATA transcription factor 8-like [Cucurbita maxima]1.4e-16085.23Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-G
        M+GN+F+D++DCGSFFDHIDDLLDFPVEDVD+GLP   GGDS NSFPTIW T SE+LPGSDSVFSAN NSDLSA+LSVPYEDIVQLEWLSNFVEDSFC G
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-G

Query:  GSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT----QEALQ-LAPK
        GSLAM KEEPK LTH QFQTSSPVSVLESSSSCSSDKSLPRSPEPT+ATP QQRGRARSKRPRPATF PRPPIQLISPASSV+ETT     + LQ +APK
Subjt:  GSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT----QEALQ-LAPK

Query:  APSDSENFAESRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLE-ANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAAS
          SDS+NFAESRPL+KMPKHG    MQKKNKKIKLSFSLAPL+ ++QNSPSQS+RKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAAS
Subjt:  APSDSENFAESRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLE-ANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAAS

Query:  PTFIPSLHSNSHKKVLEMRSKADEKTA---ITISVQPELIPNTNSAISMDYM
        PTF+PSLHSNSHKKVLEMR+K DEKTA   ITI+VQPELIPNTNSAISMDYM
Subjt:  PTFIPSLHSNSHKKVLEMRSKADEKTA---ITISVQPELIPNTNSAISMDYM

XP_023552240.1 GATA transcription factor 8 [Cucurbita pepo subsp. pepo]2.4e-16087.46Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGN+  DEIDCGSFFDHIDDLLDFPVEDVD GLP    GDSANSFPTIWPTHSESLPGS SVFSAN N+DLSA+LSVPYEDIVQLEWLSNFVEDSF GG
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAP
        SL MNKEEPKDLT+NQFQTSSPVSVLESSSSCSSDKSLP   SPE TVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTET    + LQLAPKAP
Subjt:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAP

Query:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLAPLEA---NQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
        SD++NFAES+PL+KMPKHGS S +Q  KNKKIKLSFSLAPL+A   NQ+SPS SVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
Subjt:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLAPLEA---NQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA

Query:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        SPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPNTNSAISMDYM
Subjt:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

XP_038892635.1 GATA transcription factor 8-like [Benincasa hispida]4.6e-16489.74Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGNNF+DEIDCGSFFD IDDLLDFPVEDVD+GLP   GGDSANSFPTIWPTHSESL GSDSVFS+NSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPK-AP
        SL MNKEE KDLTHNQFQTSSPVSVLESSSSCSSDKSL PRSPEPTVATPG QRGRARSKRPRPATFSPRPPIQLISPASSV+ET    +ALQL PK AP
Subjt:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPK-AP

Query:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLA-PLEA-NQNSPS-QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
        SD+ENFAESRPL+K+PKHG+ S MQK KNKKIKLSFSLA P EA NQNSPS QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
Subjt:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLA-PLEA-NQNSPS-QSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA

Query:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        SPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPNTNSAISMDY+
Subjt:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

TrEMBL top hitse value%identityAlignment
A0A0A0KRL5 GATA transcription factor4.4e-16087.32Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGNNF+DEIDCGSFFDHIDDLLDFPVEDVD+GLP   GGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQL+WL+NFVEDSFCG 
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTH--NQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPP-IQLISPASSVTETT--QEALQLAPK
         L MNKEE KDLTH  NQFQTSSPVSVLESSSSCSSDK+L PRSPEPTVATPGQQRGRARSKRPRPATFSPR P IQ ISPASSVTETT   +ALQL PK
Subjt:  SLAMNKEEPKDLTH--NQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPP-IQLISPASSVTETT--QEALQLAPK

Query:  APSDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLA-PLE---ANQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
        A SD++NFAESRPL+K+PKHG+ S  QK KNKKIKLSFSLA PLE    NQN P SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY
Subjt:  APSDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLA-PLE---ANQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEY

Query:  RPAASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        RPAASPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPN NSAISMDYM
Subjt:  RPAASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

A0A1S3BGY0 GATA transcription factor2.2e-16488.67Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGNNF+DEIDCGSFFD IDDLLDFPVEDVD+GLP   GGDS NSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQL+WL+NFVEDSFCGG
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAPS
        SL MNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDK+L PRSPEPTVATPGQQRGRARSKRPRPATF+PRPPIQLISPASSVTETT   + LQL PKAPS
Subjt:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAPS

Query:  DSENFAESRPLLKMPKHG-SDSAMQK-KNKKIKLSFSLA-PLEA---NQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP
        D+ENFAESRP +K+PKHG + S  QK KNKKIKLSFSLA PLEA   NQN P SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP
Subjt:  DSENFAESRPLLKMPKHG-SDSAMQK-KNKKIKLSFSLA-PLEA---NQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP

Query:  AASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        AASPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPN NSAISMDYM
Subjt:  AASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

A0A5D3CBQ4 GATA transcription factor2.2e-16488.67Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGNNF+DEIDCGSFFD IDDLLDFPVEDVD+GLP   GGDS NSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQL+WL+NFVEDSFCGG
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAPS
        SL MNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDK+L PRSPEPTVATPGQQRGRARSKRPRPATF+PRPPIQLISPASSVTETT   + LQL PKAPS
Subjt:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSL-PRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAPS

Query:  DSENFAESRPLLKMPKHG-SDSAMQK-KNKKIKLSFSLA-PLEA---NQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP
        D+ENFAESRP +K+PKHG + S  QK KNKKIKLSFSLA PLEA   NQN P SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP
Subjt:  DSENFAESRPLLKMPKHG-SDSAMQK-KNKKIKLSFSLA-PLEA---NQNSP-SQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRP

Query:  AASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        AASPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPN NSAISMDYM
Subjt:  AASPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

A0A6J1GZT2 GATA transcription factor1.5e-16086.89Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG
        MIGN+ +DEIDCGSFFDHIDDLLDFPVEDVD GLP    GDSANSFPTIWPTHSESLPGS SVFSAN N+DLSA+LSVPYEDIVQLEWLSNFVEDSF GG
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGG

Query:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAP
        SL+MNKEEPKDLT+NQFQTSSPVSVLESSSSCSSDKSLP   SPE TVATP QQRGRARSKRPRPATFSPRPPIQLISPASSVTET    + LQLAPKAP
Subjt:  SLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLP--RSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT--QEALQLAPKAP

Query:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLAPLEA---NQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
        SD++NFAES+PL+KMPKHGS S +Q  KNKKIKLSFSLAPL+A   NQ+SPS SVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA
Subjt:  SDSENFAESRPLLKMPKHGSDSAMQK-KNKKIKLSFSLAPLEA---NQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAA

Query:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM
        SPTFIPSLHSNSHKKVLEMR+K DE T AITISVQPELIPNTNSAI+MDYM
Subjt:  SPTFIPSLHSNSHKKVLEMRSKADEKT-AITISVQPELIPNTNSAISMDYM

A0A6J1HPB8 GATA transcription factor6.7e-16185.23Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-G
        M+GN+F+D++DCGSFFDHIDDLLDFPVEDVD+GLP   GGDS NSFPTIW T SE+LPGSDSVFSAN NSDLSA+LSVPYEDIVQLEWLSNFVEDSFC G
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-G

Query:  GSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT----QEALQ-LAPK
        GSLAM KEEPK LTH QFQTSSPVSVLESSSSCSSDKSLPRSPEPT+ATP QQRGRARSKRPRPATF PRPPIQLISPASSV+ETT     + LQ +APK
Subjt:  GSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETT----QEALQ-LAPK

Query:  APSDSENFAESRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLE-ANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAAS
          SDS+NFAESRPL+KMPKHG    MQKKNKKIKLSFSLAPL+ ++QNSPSQS+RKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAAS
Subjt:  APSDSENFAESRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLE-ANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAAS

Query:  PTFIPSLHSNSHKKVLEMRSKADEKTA---ITISVQPELIPNTNSAISMDYM
        PTF+PSLHSNSHKKVLEMR+K DEKTA   ITI+VQPELIPNTNSAISMDYM
Subjt:  PTFIPSLHSNSHKKVLEMRSKADEKTA---ITISVQPELIPNTNSAISMDYM

SwissProt top hitse value%identityAlignment
O82632 GATA transcription factor 96.7e-3336.93Show/hide
Query:  IDDLLDFPVED--VDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGGSLAMNKEEPKDLTH--
        +DDLLDF  +D  VD GL         N+ P      + +L  S +  S  ++    ++L +P +DI +LEWLSNFVE+SF G        E +D  H  
Subjt:  IDDLLDFPVED--VDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGGSLAMNKEEPKDLTH--

Query:  ----NQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLK
            N   T S ++ L        D       E  VA P     +ARSKR R A                    +  A +L   A SD  N         
Subjt:  ----NQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLK

Query:  MPKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLE
         PK        KK +++K       ++ +    S   R+C+HC   KTPQWR GPMGPKTLCNACGVRYKSGRL PEYRPA+SPTF+ + HSNSH+KV+E
Subjt:  MPKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLE

Query:  MRSKAD
        +R + +
Subjt:  MRSKAD

Q6DBP8 GATA transcription factor 118.5e-3636.16Show/hide
Query:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQ-LEWLSNFVEDSFCGGSLAMNKEEPKD
        G FFD + + LD P++D+D+     G GD  + F  + P   +  P       ++  S  S     P  DI + +  L           +L  +   P+ 
Subjt:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQ-LEWLSNFVEDSFCGGSLAMNKEEPKD

Query:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM
             FQ+ SPVSVLE+S    S  +   +    +A P +     RSKR RP                    TT     L P  P   E     +P  + 
Subjt:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM

Query:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM
          + S     KK +KI L+          ++    VRKC HCE TKTPQWR GP GPKTLCNACGVR++SGRL PEYRPA+SPTFIP++HSNSH+K++EM
Subjt:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM

Query:  RSKADEK
        R K DE+
Subjt:  RSKADEK

Q8VZP4 GATA transcription factor 105.1e-3336.42Show/hide
Query:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGGSLAMNKEEPKDL
        G FFD + + LD P+ED+DS     G GD    F  + P   +  P   S  ++      +A + +P   I  L+   +    S    +   +   P   
Subjt:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGGSLAMNKEEPKDL

Query:  THNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPAT--------FSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAE
            FQ+ +PVSVLE+S    S ++   S    +A P +     RSKR RP T        F PR      +P  SVTE    + Q A            
Subjt:  THNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPAT--------FSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAE

Query:  SRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNS
                         KK +KI L           +     VR C HCE   TPQWR GP GPKTLCNACGVR+KSGRL PEYRPA+SPTFIPS+HSNS
Subjt:  SRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNS

Query:  HKKVLEMRSKADE
        H+K++EMR K DE
Subjt:  HKKVLEMRSKADE

Q9FH57 GATA transcription factor 51.5e-3235.89Show/hide
Query:  IDDLLDFPVEDV--DSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDL-SAELSVPYEDIVQLEWLSNFVEDSFCGGS----LAMNKEEPKD
        +DDLLD   +DV  D     K   +             ++L  S      +    L ++ELS+P +D+  LEWLS+FVEDSF   S         E+P  
Subjt:  IDDLLDFPVEDV--DSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDL-SAELSVPYEDIVQLEWLSNFVEDSFCGGS----LAMNKEEPKD

Query:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM
        LT ++     PV+ +       ++++  +SP P          +ARSKR R         +++ S  SS +     +   +  +   S  +     LL+ 
Subjt:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM

Query:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM
                  KK+KK + + S+   E  Q  P    RKC HC + KTPQWRAGPMG KTLCNACGVRYKSGRL PEYRPA SPTF   LHSN H+KV+EM
Subjt:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM

Query:  RSK----ADEKTAITISVQ-PELIPN
        R K    +D +T +   VQ P+ +P+
Subjt:  RSK----ADEKTAITISVQ-PELIPN

Q9SV30 GATA transcription factor 86.0e-6646.67Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLP-GSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-
        MIG +F +++DCG+FFD++DDL+DFP  D+D G    G GDS +SFPTIW TH ++ P  SD +FS+N+NSD S EL VP+EDIV++E   +FVE++   
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLP-GSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-

Query:  --GGSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAP
            S + N +     +H+QF++SSPVSVLESSSS S   +       ++  PG + GR R+KR       PRPP+Q                       
Subjt:  --GGSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAP

Query:  SDSENFAESRPLLKMPKH----GSDSAMQKKNKKIKLSFSLA----PLEANQN------SPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSG
         D+    +SR ++++PK      +    +KK KK K++ S +     LE N N      S    +RKCMHCE+TKTPQWR GPMGPKTLCNACGVRYKSG
Subjt:  SDSENFAESRPLLKMPKH----GSDSAMQKKNKKIKLSFSLA----PLEANQN------SPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSG

Query:  RLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK--ADEKTAITISVQPELIPNTNSAISMD
        RLFPEYRPAASPTF P+LHSNSHKKV EMR+K  +D       +    LIPN N+ I +D
Subjt:  RLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK--ADEKTAITISVQPELIPNTNSAISMD

Arabidopsis top hitse value%identityAlignment
AT1G08000.1 GATA transcription factor 103.7e-3436.42Show/hide
Query:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGGSLAMNKEEPKDL
        G FFD + + LD P+ED+DS     G GD    F  + P   +  P   S  ++      +A + +P   I  L+   +    S    +   +   P   
Subjt:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGGSLAMNKEEPKDL

Query:  THNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPAT--------FSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAE
            FQ+ +PVSVLE+S    S ++   S    +A P +     RSKR RP T        F PR      +P  SVTE    + Q A            
Subjt:  THNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPAT--------FSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAE

Query:  SRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNS
                         KK +KI L           +     VR C HCE   TPQWR GP GPKTLCNACGVR+KSGRL PEYRPA+SPTFIPS+HSNS
Subjt:  SRPLLKMPKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNS

Query:  HKKVLEMRSKADE
        H+K++EMR K DE
Subjt:  HKKVLEMRSKADE

AT1G08010.1 GATA transcription factor 116.0e-3736.16Show/hide
Query:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQ-LEWLSNFVEDSFCGGSLAMNKEEPKD
        G FFD + + LD P++D+D+     G GD  + F  + P   +  P       ++  S  S     P  DI + +  L           +L  +   P+ 
Subjt:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQ-LEWLSNFVEDSFCGGSLAMNKEEPKD

Query:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM
             FQ+ SPVSVLE+S    S  +   +    +A P +     RSKR RP                    TT     L P  P   E     +P  + 
Subjt:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM

Query:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM
          + S     KK +KI L+          ++    VRKC HCE TKTPQWR GP GPKTLCNACGVR++SGRL PEYRPA+SPTFIP++HSNSH+K++EM
Subjt:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM

Query:  RSKADEK
        R K DE+
Subjt:  RSKADEK

AT1G08010.2 GATA transcription factor 116.0e-3736.16Show/hide
Query:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQ-LEWLSNFVEDSFCGGSLAMNKEEPKD
        G FFD + + LD P++D+D+     G GD  + F  + P   +  P       ++  S  S     P  DI + +  L           +L  +   P+ 
Subjt:  GSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQ-LEWLSNFVEDSFCGGSLAMNKEEPKD

Query:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM
             FQ+ SPVSVLE+S    S  +   +    +A P +     RSKR RP                    TT     L P  P   E     +P  + 
Subjt:  LTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKM

Query:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM
          + S     KK +KI L+          ++    VRKC HCE TKTPQWR GP GPKTLCNACGVR++SGRL PEYRPA+SPTFIP++HSNSH+K++EM
Subjt:  PKHGSDSAMQKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEM

Query:  RSKADEK
        R K DE+
Subjt:  RSKADEK

AT3G54810.1 Plant-specific GATA-type zinc finger transcription factor family protein4.3e-6746.67Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLP-GSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-
        MIG +F +++DCG+FFD++DDL+DFP  D+D G    G GDS +SFPTIW TH ++ P  SD +FS+N+NSD S EL VP+EDIV++E   +FVE++   
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLP-GSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-

Query:  --GGSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAP
            S + N +     +H+QF++SSPVSVLESSSS S   +       ++  PG + GR R+KR       PRPP+Q                       
Subjt:  --GGSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAP

Query:  SDSENFAESRPLLKMPKH----GSDSAMQKKNKKIKLSFSLA----PLEANQN------SPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSG
         D+    +SR ++++PK      +    +KK KK K++ S +     LE N N      S    +RKCMHCE+TKTPQWR GPMGPKTLCNACGVRYKSG
Subjt:  SDSENFAESRPLLKMPKH----GSDSAMQKKNKKIKLSFSLA----PLEANQN------SPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSG

Query:  RLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK--ADEKTAITISVQPELIPNTNSAISMD
        RLFPEYRPAASPTF P+LHSNSHKKV EMR+K  +D       +    LIPN N+ I +D
Subjt:  RLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK--ADEKTAITISVQPELIPNTNSAISMD

AT3G54810.2 Plant-specific GATA-type zinc finger transcription factor family protein4.3e-6746.67Show/hide
Query:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLP-GSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-
        MIG +F +++DCG+FFD++DDL+DFP  D+D G    G GDS +SFPTIW TH ++ P  SD +FS+N+NSD S EL VP+EDIV++E   +FVE++   
Subjt:  MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLP-GSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFC-

Query:  --GGSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAP
            S + N +     +H+QF++SSPVSVLESSSS S   +       ++  PG + GR R+KR       PRPP+Q                       
Subjt:  --GGSLAMNKEEPKDLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAP

Query:  SDSENFAESRPLLKMPKH----GSDSAMQKKNKKIKLSFSLA----PLEANQN------SPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSG
         D+    +SR ++++PK      +    +KK KK K++ S +     LE N N      S    +RKCMHCE+TKTPQWR GPMGPKTLCNACGVRYKSG
Subjt:  SDSENFAESRPLLKMPKH----GSDSAMQKKNKKIKLSFSLA----PLEANQN------SPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSG

Query:  RLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK--ADEKTAITISVQPELIPNTNSAISMD
        RLFPEYRPAASPTF P+LHSNSHKKV EMR+K  +D       +    LIPN N+ I +D
Subjt:  RLFPEYRPAASPTFIPSLHSNSHKKVLEMRSK--ADEKTAITISVQPELIPNTNSAISMD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGGAAATAACTTCATCGACGAGATAGACTGCGGCAGCTTCTTCGACCACATCGACGACCTGCTCGACTTTCCGGTGGAGGACGTCGACTCCGGTTTGCCGGCGAA
GGGAGGAGGAGACTCTGCTAACTCGTTCCCCACCATTTGGCCGACTCATTCCGAGTCGCTCCCCGGTTCCGACTCGGTCTTCTCCGCCAATAGCAACTCCGACCTGTCGG
CCGAGCTCTCCGTGCCGTATGAAGACATTGTTCAGCTGGAGTGGCTTTCAAACTTTGTTGAGGATTCCTTCTGTGGTGGAAGCCTTGCAATGAACAAAGAGGAGCCCAAG
GACTTGACTCATAACCAATTCCAGACCTCTAGCCCAGTTTCTGTTCTTGAAAGCAGCAGCTCTTGCTCTAGCGACAAAAGCCTGCCCCGCAGTCCTGAACCCACCGTCGC
CACCCCGGGCCAGCAGCGTGGCCGTGCCCGCAGCAAGCGTCCTCGCCCTGCAACCTTCAGTCCTCGCCCCCCAATTCAGCTTATTTCTCCTGCCTCGTCTGTTACTGAAA
CAACCCAGGAGGCATTGCAGCTTGCTCCCAAGGCTCCCTCAGACTCCGAGAACTTTGCCGAGTCCCGCCCTTTGCTCAAAATGCCAAAGCATGGCAGTGACTCTGCAATG
CAGAAGAAGAACAAGAAAATCAAGTTGTCGTTTTCGCTCGCTCCGTTGGAGGCAAATCAAAACTCACCATCACAGTCGGTGAGAAAATGTATGCACTGTGAGATAACCAA
GACTCCACAGTGGAGGGCAGGACCAATGGGGCCAAAGACTCTTTGCAATGCCTGCGGTGTTCGGTATAAGTCCGGTAGACTCTTCCCCGAGTATCGGCCAGCAGCGAGTC
CAACTTTCATCCCATCGTTGCATTCGAATTCTCACAAGAAGGTGCTCGAAATGAGAAGCAAGGCTGATGAGAAGACTGCAATCACCATAAGCGTCCAGCCTGAGCTCATT
CCAAACACAAACAGCGCGATTTCGATGGATTACATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCGGAAATAACTTCATCGACGAGATAGACTGCGGCAGCTTCTTCGACCACATCGACGACCTGCTCGACTTTCCGGTGGAGGACGTCGACTCCGGTTTGCCGGCGAA
GGGAGGAGGAGACTCTGCTAACTCGTTCCCCACCATTTGGCCGACTCATTCCGAGTCGCTCCCCGGTTCCGACTCGGTCTTCTCCGCCAATAGCAACTCCGACCTGTCGG
CCGAGCTCTCCGTGCCGTATGAAGACATTGTTCAGCTGGAGTGGCTTTCAAACTTTGTTGAGGATTCCTTCTGTGGTGGAAGCCTTGCAATGAACAAAGAGGAGCCCAAG
GACTTGACTCATAACCAATTCCAGACCTCTAGCCCAGTTTCTGTTCTTGAAAGCAGCAGCTCTTGCTCTAGCGACAAAAGCCTGCCCCGCAGTCCTGAACCCACCGTCGC
CACCCCGGGCCAGCAGCGTGGCCGTGCCCGCAGCAAGCGTCCTCGCCCTGCAACCTTCAGTCCTCGCCCCCCAATTCAGCTTATTTCTCCTGCCTCGTCTGTTACTGAAA
CAACCCAGGAGGCATTGCAGCTTGCTCCCAAGGCTCCCTCAGACTCCGAGAACTTTGCCGAGTCCCGCCCTTTGCTCAAAATGCCAAAGCATGGCAGTGACTCTGCAATG
CAGAAGAAGAACAAGAAAATCAAGTTGTCGTTTTCGCTCGCTCCGTTGGAGGCAAATCAAAACTCACCATCACAGTCGGTGAGAAAATGTATGCACTGTGAGATAACCAA
GACTCCACAGTGGAGGGCAGGACCAATGGGGCCAAAGACTCTTTGCAATGCCTGCGGTGTTCGGTATAAGTCCGGTAGACTCTTCCCCGAGTATCGGCCAGCAGCGAGTC
CAACTTTCATCCCATCGTTGCATTCGAATTCTCACAAGAAGGTGCTCGAAATGAGAAGCAAGGCTGATGAGAAGACTGCAATCACCATAAGCGTCCAGCCTGAGCTCATT
CCAAACACAAACAGCGCGATTTCGATGGATTACATGTGA
Protein sequenceShow/hide protein sequence
MIGNNFIDEIDCGSFFDHIDDLLDFPVEDVDSGLPAKGGGDSANSFPTIWPTHSESLPGSDSVFSANSNSDLSAELSVPYEDIVQLEWLSNFVEDSFCGGSLAMNKEEPK
DLTHNQFQTSSPVSVLESSSSCSSDKSLPRSPEPTVATPGQQRGRARSKRPRPATFSPRPPIQLISPASSVTETTQEALQLAPKAPSDSENFAESRPLLKMPKHGSDSAM
QKKNKKIKLSFSLAPLEANQNSPSQSVRKCMHCEITKTPQWRAGPMGPKTLCNACGVRYKSGRLFPEYRPAASPTFIPSLHSNSHKKVLEMRSKADEKTAITISVQPELI
PNTNSAISMDYM