CuGenDBv2

Moc08g03150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g03150
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: GATA transcription factor
Genome location: chr8:2280409..2288856
RNA-Seq Expression: Moc08g03150
Synteny: Moc08g03150
Gene Ontology terms:
GO:0030154 - cell differentiation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR012876 - Protein of unknown function DUF1677, plant
IPR013088 - Zinc finger, NHR/GATA-type
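
The genome location string above can be turned into a locus length with a few lines of Python. This is only a sketch: the function name `parse_location` and the `Locus` type are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points count.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string such as 'chr8:2280409..2288856'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(chrom, start, end)


locus = parse_location("chr8:2280409..2288856")
print(locus.chrom, locus.length)  # chr8 8448
```

Under the inclusive-coordinate assumption the locus spans 8448 bp on chr8; with half-open coordinates the figure would be one base shorter.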


Homology
GenBank top hits (e value, % identity, alignment)
XP_008445001.1 PREDICTED: GATA transcription factor 12-like [Cucumis melo] | e value: 1.7e-120 | % identity: 66.32
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSN--DDGVAADV------------------SSFNGNDNNNPSVSVSVIESC
        ME P+YFQIN  AY SSQF   +    +D  T      EHFIVEELLDFSN  DD V  D                   +  N N+N+  S +++V+ESC
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSN--DDGVAADV------------------SSFNGNDNNNPSVSVSVIESC

Query:  NSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVP
        NSS        +SF +DI+ SNLGDA FS+ELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSDE    + P P     AA IFKP+IVSVP
Subjt:  NSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVP

Query:  AKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNAC
        AKARSKRSR A+P+NWNNS LLPLSPT        +      P+  KK   K   TAKKKD PD G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNAC
Subjt:  AKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNAC

Query:  GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ R QQQQ QH L+LDH Q+MIFDASNGDDYLIHQHVGPDFRQ+I
Subjt:  GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

XP_021596902.1 GATA transcription factor 12-like [Manihot esculenta] | e value: 7.5e-113 | % identity: 64.57
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNG-NDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDIT
        ME P+++    +  C SQF  E  HS     + GG GG+HFIVE+LLDFSN+D V  D S+F+    N+  S +V+V++SCNSS SFS CEP  F  DI 
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNG-NDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDIT

Query:  HSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIR--QPSPAVAV-------AAA-----EIFKPDIVSVPAKARS
          N  D +FS++LCVPYDDLAELEWLSNFVEESFSSED+QKL+LISG+K + DE+ + R  QP+    V       AAA      IF P+ VSVPAKARS
Subjt:  HSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIR--QPSPAVAV-------AAA-----EIFKPDIVSVPAKARS

Query:  KRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
        KRSRAA P NW  SRLL LSPTTSSS+ +++A  A  P+ GKK T+KA  T K+++  D G   G+GRKC+HCATDKTPQWRTGPMGPKTLCNACGVRYK
Subjt:  KRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK

Query:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL R QQQQ Q    L HHQNM+FD SNGDDYLIHQHVGPDFRQLI
Subjt:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

XP_022132107.1 GATA transcription factor 12 [Momordica charantia] | e value: 1.3e-205 | % identity: 100
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITH
        MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITH
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITH

Query:  SNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSR
        SNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSR
Subjt:  SNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSR

Query:  LLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPT
        LLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPT
Subjt:  LLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPT

Query:  FVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        FVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  FVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

XP_031736569.1 GATA transcription factor 12 [Cucumis sativus] | e value: 1.5e-121 | % identity: 66.33
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTD------GGCGGEHFIVEELLDFSN--DDGVAAD------------------------VSSFNGNDNN
        ME P+YFQIN  AY SSQF       SS +D D           +HFIVEELLDFSN  DD V  D                         +  N N+N+
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTD------GGCGGEHFIVEELLDFSN--DDGVAAD------------------------VSSFNGNDNN

Query:  NPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAA
          S +V+V+ESCNSS        +SF +DI+ SNLGDA FS+ELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDE    + P P    +A
Subjt:  NPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAA

Query:  AEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRT
        A IFKP+IVSVPAKARSKRSR A+P+NWNNS LLPLS  T+ SE          PHP KK   KA  TAKKKD PD G S GEGRKCMHCATDKTPQWRT
Subjt:  AEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRT

Query:  GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ R QQQQ QH L+LDH Q+MIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

XP_038886306.1 GATA transcription factor 12-like [Benincasa hispida] | e value: 3.5e-134 | % identity: 74.93
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDT---DGGCGGEHFIVEELLDFSN-DDGVAAD-----VSSFNGNDNNNPSV---SVSVIESCNSSNSFSC
        ME P+YFQIN   YC SQF   + HSSSD DT       G EHFIVEELLDFSN DDGV  D      ++ NGN+NNN S    +V+VIESCNSS SFS 
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDT---DGGCGGEHFIVEELLDFSN-DDGVAAD-----VSSFNGNDNNNPSV---SVSVIESCNSSNSFSC

Query:  CEPN-SFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKR
        CEPN SFL+DI+ SNL DA FS+ELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKV+SDE    RQP+      AA IFKPDIVSVPAKARSKR
Subjt:  CEPN-SFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKR

Query:  SRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
        SR AVP+NWNNSRLLPLSPTT       +   A PPHP KK   KA  TAKKKD P+ G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Subjt:  SRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG

Query:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL R QQQQ QH L+LDHHQ+MIFDASNGDDYLIHQH+GPDFRQLI
Subjt:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
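
The % identity figures can be related to the alignment blocks through the middle line of each block, which carries a letter where query and subject agree, `+` for a conservative substitution, and a space otherwise. The sketch below assumes the usual BLAST convention that % identity is identical columns divided by alignment length, gaps included; the page does not say how its values were computed, and `percent_identity` is a hypothetical helper name. The caller is expected to strip the `Query:  ` label and the matching indentation of the middle line first.

```python
from typing import Sequence


def percent_identity(query_rows: Sequence[str], midline_rows: Sequence[str]) -> float:
    """Estimate % identity from the wrapped rows of one alignment.

    query_rows   -- aligned query segments (may contain '-' gap characters)
    midline_rows -- the match lines printed beneath each query segment
    """
    aligned_columns = 0
    identical_columns = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces on the match line are often lost in copy/paste,
        # so pad it back to the width of the query segment.
        midline = midline.ljust(len(query))[: len(query)]
        aligned_columns += len(query)
        # Only letters mark identical residues; '+' and ' ' do not count.
        identical_columns += sum(1 for char in midline if char.isalpha())
    if aligned_columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical_columns / aligned_columns


# Toy example (not taken from the hits above): 8 identical columns out of 12.
print(round(percent_identity(["MELPDYFQ--IN"], ["ME P+YFQ  IN"]), 2))  # 66.67
```

Values obtained this way may differ slightly from the listed figures if the original tool trimmed or rounded the alignment differently.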

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LPR5 GATA transcription factor | e value: 7.3e-122 | % identity: 66.33
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTD------GGCGGEHFIVEELLDFSN--DDGVAAD------------------------VSSFNGNDNN
        ME P+YFQIN  AY SSQF       SS +D D           +HFIVEELLDFSN  DD V  D                         +  N N+N+
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTD------GGCGGEHFIVEELLDFSN--DDGVAAD------------------------VSSFNGNDNN

Query:  NPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAA
          S +V+V+ESCNSS        +SF +DI+ SNLGDA FS+ELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDE    + P P    +A
Subjt:  NPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAA

Query:  AEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRT
        A IFKP+IVSVPAKARSKRSR A+P+NWNNS LLPLS  T+ SE          PHP KK   KA  TAKKKD PD G S GEGRKCMHCATDKTPQWRT
Subjt:  AEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRT

Query:  GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ R QQQQ QH L+LDH Q+MIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  GPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

A0A1S3BBN7 GATA transcription factor | e value: 8.1e-121 | % identity: 66.32
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSN--DDGVAADV------------------SSFNGNDNNNPSVSVSVIESC
        ME P+YFQIN  AY SSQF   +    +D  T      EHFIVEELLDFSN  DD V  D                   +  N N+N+  S +++V+ESC
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSN--DDGVAADV------------------SSFNGNDNNNPSVSVSVIESC

Query:  NSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVP
        NSS        +SF +DI+ SNLGDA FS+ELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSDE    + P P     AA IFKP+IVSVP
Subjt:  NSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVP

Query:  AKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNAC
        AKARSKRSR A+P+NWNNS LLPLSPT        +      P+  KK   K   TAKKKD PD G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNAC
Subjt:  AKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNAC

Query:  GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ R QQQQ QH L+LDH Q+MIFDASNGDDYLIHQHVGPDFRQ+I
Subjt:  GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

A0A2C9UBF9 GATA transcription factor | e value: 3.6e-113 | % identity: 64.57
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNG-NDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDIT
        ME P+++    +  C SQF  E  HS     + GG GG+HFIVE+LLDFSN+D V  D S+F+    N+  S +V+V++SCNSS SFS CEP  F  DI 
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNG-NDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDIT

Query:  HSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIR--QPSPAVAV-------AAA-----EIFKPDIVSVPAKARS
          N  D +FS++LCVPYDDLAELEWLSNFVEESFSSED+QKL+LISG+K + DE+ + R  QP+    V       AAA      IF P+ VSVPAKARS
Subjt:  HSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIR--QPSPAVAV-------AAA-----EIFKPDIVSVPAKARS

Query:  KRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
        KRSRAA P NW  SRLL LSPTTSSS+ +++A  A  P+ GKK T+KA  T K+++  D G   G+GRKC+HCATDKTPQWRTGPMGPKTLCNACGVRYK
Subjt:  KRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK

Query:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL R QQQQ Q    L HHQNM+FD SNGDDYLIHQHVGPDFRQLI
Subjt:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

A0A5A7VCX1 GATA transcription factor | e value: 8.1e-121 | % identity: 66.32
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSN--DDGVAADV------------------SSFNGNDNNNPSVSVSVIESC
        ME P+YFQIN  AY SSQF   +    +D  T      EHFIVEELLDFSN  DD V  D                   +  N N+N+  S +++V+ESC
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSN--DDGVAADV------------------SSFNGNDNNNPSVSVSVIESC

Query:  NSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVP
        NSS        +SF +DI+ SNLGDA FS+ELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSDE    + P P     AA IFKP+IVSVP
Subjt:  NSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVP

Query:  AKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNAC
        AKARSKRSR A+P+NWNNS LLPLSPT        +      P+  KK   K   TAKKKD PD G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNAC
Subjt:  AKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNAC

Query:  GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ R QQQQ QH L+LDH Q+MIFDASNGDDYLIHQHVGPDFRQ+I
Subjt:  GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

A0A6J1BSX6 GATA transcription factor | e value: 6.3e-206 | % identity: 100
Query:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITH
        MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITH
Subjt:  MELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITH

Query:  SNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSR
        SNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSR
Subjt:  SNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSR

Query:  LLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPT
        LLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPT
Subjt:  LLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPT

Query:  FVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
        FVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
Subjt:  FVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI

SwissProt top hits (e value, % identity, alignment)
O49741 GATA transcription factor 2 | e value: 2.3e-40 | % identity: 41.86
Query:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKL-
        +++LLDFSN+D  +A  SS  G+             +  SS+SF   +  SF      S+     F  ++CVP DD A LEWLS FV++SF+      L 
Subjt:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKL-

Query:  ELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVP--TNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKK
          ++ VK ++                          S P K RSKRSRA  P    W        SP    SE   L  AA    P K+ +        +
Subjt:  ELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVP--TNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKK

Query:  KDCPDAGASPGEG-RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDH
             +  + G G R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE+ R  QQ   H     H
Subjt:  KDCPDAGASPGEG-RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDH

Query:  H
        H
Subjt:  H

O49743 GATA transcription factor 4 | e value: 1.0e-40 | % identity: 42.05
Query:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLE
        +++LLDFSND+  +                S S + S  +S++ S   P SF      S      F+ +LCVP DD A LEWLS FV++SFS      L 
Subjt:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLE

Query:  LISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDC
        +                             +P+I S   K RS+RSRA  P+         ++ T +      L  +   P P KK     +VTA     
Subjt:  LISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDC

Query:  PDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRT
           GA     R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE Q +
Subjt:  PDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRT

O82632 GATA transcription factor 9 | e value: 2.0e-60 | % identity: 46.57
Query:  EHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDM
        + F+V++LLDFSNDDG   D    N   +++   + ++ +S NSS+ F+                 D    ++L +P DD+AELEWLSNFVEESF+ ED 
Subjt:  EHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDM

Query:  QKLELISGVK---VKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATV
         KL L SG+K           + +P P +     +I + + V+VPAKARSKRSR+A  T W  SRLL L+ +  ++               K+  +K   
Subjt:  QKLELISGVK---VKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATV

Query:  TAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLI
         A   D  D G S G GR+C+HCAT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+      + +H L 
Subjt:  TAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLI

Query:  LDHHQNMIFD-ASNGDDYLIH---QHVGPDFRQLI
            +N++ D  SNG+D+L+H    HV PDFR LI
Subjt:  LDHHQNMIFD-ASNGDDYLIH---QHVGPDFRQLI

P69781 GATA transcription factor 12 | e value: 5.3e-69 | % identity: 51.31
Query:  FIVEELL-DFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLA-ELEWLSNFVEESFSSEDM
        F V++LL DFSNDD           ++ N+     +   +   S++FS  +  SF  D+         FS +LC+P DDLA ELEWLSN V+ES S ED+
Subjt:  FIVEELL-DFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLA-ELEWLSNFVEESFSSEDM

Query:  QKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLL-------PLSPTTSSSELDVLAVAATPP----HPGK
         KLELISG K + D   +    SP    +++ IF  D VSVPAKARSKRSRAA   NW +  LL       P +  T  S    L+   +PP      GK
Subjt:  QKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLL-------PLSPTTSSSELDVLAVAATPP----HPGK

Query:  KATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQ
        K  +      +KKD     +   E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ R   
Subjt:  KATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQ

Query:  QQHQHQLILDHHQN---MIFD-ASNGDDYLIHQHVGPDFRQLI
            H+ I  HH     MIFD +S+GDDYLIH +VGPDFRQLI
Subjt:  QQHQHQLILDHHQN---MIFD-ASNGDDYLIHQHVGPDFRQLI

Q9FH57 GATA transcription factor 5 | e value: 2.7e-36 | % identity: 37.85
Query:  GCGGEHFIVEELLDFSNDDGVAAD------------VSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAEL
        G   + F V++LLD SNDD  A +            VSS   ND+          ++   S+ FS C+    L             ++EL +P DDLA L
Subjt:  GCGGEHFIVEELLDFSNDDGVAAD------------VSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAEL

Query:  EWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPT---------------NWNNSRLLPLSPT
        EWLS+FVE+SF+      L   +G   +        +  P  AV     FK     VPAKARSKR+R  +                 + ++S   P SP 
Subjt:  EWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPT---------------NWNNSRLLPLSPT

Query:  TSSSE-LDVLAVAATPPHP--GKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLT
         S +E L+ +  +  PP P   KK + ++  + + +            RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF   
Subjt:  TSSSE-LDVLAVAATPPHP--GKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLT

Query:  KHSNSHRKVLELRRQKE
         HSN HRKV+E+RR+KE
Subjt:  KHSNSHRKVLELRRQKE

Arabidopsis top hits (e value, % identity, alignment)
AT2G45050.1 GATA transcription factor 2 | e value: 1.7e-41 | % identity: 41.86
Query:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKL-
        +++LLDFSN+D  +A  SS  G+             +  SS+SF   +  SF      S+     F  ++CVP DD A LEWLS FV++SF+      L 
Subjt:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKL-

Query:  ELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVP--TNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKK
          ++ VK ++                          S P K RSKRSRA  P    W        SP    SE   L  AA    P K+ +        +
Subjt:  ELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVP--TNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKK

Query:  KDCPDAGASPGEG-RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDH
             +  + G G R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE+ R  QQ   H     H
Subjt:  KDCPDAGASPGEG-RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDH

Query:  H
        H
Subjt:  H

AT3G60530.1 GATA transcription factor 4 | e value: 7.4e-42 | % identity: 42.05
Query:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLE
        +++LLDFSND+  +                S S + S  +S++ S   P SF      S      F+ +LCVP DD A LEWLS FV++SFS      L 
Subjt:  VEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLE

Query:  LISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDC
        +                             +P+I S   K RS+RSRA  P+         ++ T +      L  +   P P KK     +VTA     
Subjt:  LISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDC

Query:  PDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRT
           GA     R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE Q +
Subjt:  PDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRT

AT4G32890.1 GATA transcription factor 9 | e value: 1.4e-61 | % identity: 46.57
Query:  EHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDM
        + F+V++LLDFSNDDG   D    N   +++   + ++ +S NSS+ F+                 D    ++L +P DD+AELEWLSNFVEESF+ ED 
Subjt:  EHFIVEELLDFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDM

Query:  QKLELISGVK---VKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATV
         KL L SG+K           + +P P +     +I + + V+VPAKARSKRSR+A  T W  SRLL L+ +  ++               K+  +K   
Subjt:  QKLELISGVK---VKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATV

Query:  TAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLI
         A   D  D G S G GR+C+HCAT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+      + +H L 
Subjt:  TAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLI

Query:  LDHHQNMIFD-ASNGDDYLIH---QHVGPDFRQLI
            +N++ D  SNG+D+L+H    HV PDFR LI
Subjt:  LDHHQNMIFD-ASNGDDYLIH---QHVGPDFRQLI

AT5G25830.1 GATA transcription factor 12 | e value: 3.8e-70 | % identity: 51.31
Query:  FIVEELL-DFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLA-ELEWLSNFVEESFSSEDM
        F V++LL DFSNDD           ++ N+     +   +   S++FS  +  SF  D+         FS +LC+P DDLA ELEWLSN V+ES S ED+
Subjt:  FIVEELL-DFSNDDGVAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLA-ELEWLSNFVEESFSSEDM

Query:  QKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLL-------PLSPTTSSSELDVLAVAATPP----HPGK
         KLELISG K + D   +    SP    +++ IF  D VSVPAKARSKRSRAA   NW +  LL       P +  T  S    L+   +PP      GK
Subjt:  QKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLL-------PLSPTTSSSELDVLAVAATPP----HPGK

Query:  KATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQ
        K  +      +KKD     +   E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ R   
Subjt:  KATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQ

Query:  QQHQHQLILDHHQN---MIFD-ASNGDDYLIHQHVGPDFRQLI
            H+ I  HH     MIFD +S+GDDYLIH +VGPDFRQLI
Subjt:  QQHQHQLILDHHQN---MIFD-ASNGDDYLIHQHVGPDFRQLI

AT5G66320.1 GATA transcription factor 5 | e value: 1.9e-37 | % identity: 37.85
Query:  GCGGEHFIVEELLDFSNDDGVAAD------------VSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAEL
        G   + F V++LLD SNDD  A +            VSS   ND+          ++   S+ FS C+    L             ++EL +P DDLA L
Subjt:  GCGGEHFIVEELLDFSNDDGVAAD------------VSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAEL

Query:  EWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPT---------------NWNNSRLLPLSPT
        EWLS+FVE+SF+      L   +G   +        +  P  AV     FK     VPAKARSKR+R  +                 + ++S   P SP 
Subjt:  EWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVAAAEIFKPDIVSVPAKARSKRSRAAVPT---------------NWNNSRLLPLSPT

Query:  TSSSE-LDVLAVAATPPHP--GKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLT
         S +E L+ +  +  PP P   KK + ++  + + +            RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF   
Subjt:  TSSSE-LDVLAVAATPPHP--GKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLT

Query:  KHSNSHRKVLELRRQKE
         HSN HRKV+E+RR+KE
Subjt:  KHSNSHRKVLELRRQKE


Sequences
CDS sequence
ATGCTATCAATGGAGGCCGGACTCCAACTCCAACGAACCATATCGGACATCTCGTCGGAGTTAACCAAAGAGGGCGTCGGAGGGAAGCTTCCGACGATCTCGGAGGTGGA
GGCCGCTGCCTGCGACTGCTGTGGCCTGTCGGAGGAGTGCACGCCGGAGTACATCGCCCGGCTCCGACACAAGTTCATGGGAAACCTCATATGCGGACTCTGCGCCGCCG
CGGTCAACGAGGAAATGGAAAAGATTAGAAAAATGCAAAAGATATCCAAAAGTGGAGCAATTAATGTAGACATGAAAACCTTATCCAATGTCTCTAAGAATACTCCTTTC
CTAGGTGTCCTCTTGTCCCCTTCAAACGCAAACCGACCCTCTCTCAGAGCCATAATTTTTCAGTCTTCGCTCTTAGCCACTTCCCTCTCTCCCTCCCTCTCACTATTTTT
TTTTTCTTCTCCCTCTCCCTCTCTTCTCTTGCGTTCTATCGCGCTGCGCCACACTTCCATGGAACTTCCCGACTATTTTCAGATCAATAATGCAGCATACTGCTCATCCC
AATTCGTCGCCGAAACTCGCCACTCCTCCTCCGATAACGACACCGACGGCGGCTGCGGCGGCGAGCATTTCATCGTCGAGGAACTTCTCGACTTCTCCAACGACGACGGC
GTCGCGGCCGACGTTAGCAGCTTCAACGGAAACGATAATAATAATCCCTCCGTCTCCGTCTCCGTGATCGAGAGTTGCAATTCGTCGAATTCCTTCTCCTGCTGCGAACC
CAATTCGTTCCTCGACGACATTACTCACTCCAATTTAGGCGACGCAAAATTCTCCACCGAACTCTGCGTTCCGTACGACGATTTAGCTGAGCTGGAATGGCTTTCAAACT
TCGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGCTGGAGCTAATCTCCGGCGTCAAAGTCAAATCCGACGAAACCTTCCAAATCCGGCAGCCCTCCCCCGCCGTC
GCCGTCGCCGCCGCCGAAATCTTCAAACCCGACATCGTCTCCGTTCCGGCCAAGGCCCGCAGCAAACGCTCACGCGCCGCCGTTCCCACCAACTGGAACAACTCCCGCCT
CCTCCCCCTCTCTCCGACCACCTCCTCCTCCGAACTCGACGTCTTAGCCGTCGCCGCCACGCCGCCGCACCCCGGAAAGAAAGCCACCATTAAGGCCACGGTCACAGCAA
AGAAGAAGGATTGTCCGGACGCCGGCGCGTCGCCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCTCAGTGGCGGACGGGCCCAATGGGCCCAAAG
ACGCTGTGCAATGCGTGCGGCGTCCGGTACAAGTCCGGCCGGCTGGTGCCGGAGTACCGGCCCGCTGCCAGCCCCACGTTTGTTCTCACTAAGCATTCAAATTCTCACCG
GAAGGTGCTGGAGCTCCGGCGGCAGAAGGAGCTTCAAAGAACACAGCAGCAGCAGCATCAGCATCAGCTGATTCTTGATCATCATCAGAATATGATATTTGATGCATCCA
ACGGTGACGATTATCTCATCCACCAACATGTGGGCCCCGATTTTCGGCAGCTGATCTAA
mRNA sequence
ATGCTATCAATGGAGGCCGGACTCCAACTCCAACGAACCATATCGGACATCTCGTCGGAGTTAACCAAAGAGGGCGTCGGAGGGAAGCTTCCGACGATCTCGGAGGTGGA
GGCCGCTGCCTGCGACTGCTGTGGCCTGTCGGAGGAGTGCACGCCGGAGTACATCGCCCGGCTCCGACACAAGTTCATGGGAAACCTCATATGCGGACTCTGCGCCGCCG
CGGTCAACGAGGAAATGGAAAAGATTAGAAAAATGCAAAAGATATCCAAAAGTGGAGCAATTAATGTAGACATGAAAACCTTATCCAATGTCTCTAAGAATACTCCTTTC
CTAGGTGTCCTCTTGTCCCCTTCAAACGCAAACCGACCCTCTCTCAGAGCCATAATTTTTCAGTCTTCGCTCTTAGCCACTTCCCTCTCTCCCTCCCTCTCACTATTTTT
TTTTTCTTCTCCCTCTCCCTCTCTTCTCTTGCGTTCTATCGCGCTGCGCCACACTTCCATGGAACTTCCCGACTATTTTCAGATCAATAATGCAGCATACTGCTCATCCC
AATTCGTCGCCGAAACTCGCCACTCCTCCTCCGATAACGACACCGACGGCGGCTGCGGCGGCGAGCATTTCATCGTCGAGGAACTTCTCGACTTCTCCAACGACGACGGC
GTCGCGGCCGACGTTAGCAGCTTCAACGGAAACGATAATAATAATCCCTCCGTCTCCGTCTCCGTGATCGAGAGTTGCAATTCGTCGAATTCCTTCTCCTGCTGCGAACC
CAATTCGTTCCTCGACGACATTACTCACTCCAATTTAGGCGACGCAAAATTCTCCACCGAACTCTGCGTTCCGTACGACGATTTAGCTGAGCTGGAATGGCTTTCAAACT
TCGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGCTGGAGCTAATCTCCGGCGTCAAAGTCAAATCCGACGAAACCTTCCAAATCCGGCAGCCCTCCCCCGCCGTC
GCCGTCGCCGCCGCCGAAATCTTCAAACCCGACATCGTCTCCGTTCCGGCCAAGGCCCGCAGCAAACGCTCACGCGCCGCCGTTCCCACCAACTGGAACAACTCCCGCCT
CCTCCCCCTCTCTCCGACCACCTCCTCCTCCGAACTCGACGTCTTAGCCGTCGCCGCCACGCCGCCGCACCCCGGAAAGAAAGCCACCATTAAGGCCACGGTCACAGCAA
AGAAGAAGGATTGTCCGGACGCCGGCGCGTCGCCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCTCAGTGGCGGACGGGCCCAATGGGCCCAAAG
ACGCTGTGCAATGCGTGCGGCGTCCGGTACAAGTCCGGCCGGCTGGTGCCGGAGTACCGGCCCGCTGCCAGCCCCACGTTTGTTCTCACTAAGCATTCAAATTCTCACCG
GAAGGTGCTGGAGCTCCGGCGGCAGAAGGAGCTTCAAAGAACACAGCAGCAGCAGCATCAGCATCAGCTGATTCTTGATCATCATCAGAATATGATATTTGATGCATCCA
ACGGTGACGATTATCTCATCCACCAACATGTGGGCCCCGATTTTCGGCAGCTGATCTAA
Protein sequence
MLSMEAGLQLQRTISDISSELTKEGVGGKLPTISEVEAAACDCCGLSEECTPEYIARLRHKFMGNLICGLCAAAVNEEMEKIRKMQKISKSGAINVDMKTLSNVSKNTPF
LGVLLSPSNANRPSLRAIIFQSSLLATSLSPSLSLFFFSSPSPSLLLRSIALRHTSMELPDYFQINNAAYCSSQFVAETRHSSSDNDTDGGCGGEHFIVEELLDFSNDDG
VAADVSSFNGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAKFSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAV
AVAAAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGKKATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPK
TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDASNGDDYLIHQHVGPDFRQLI
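
One way to cross-check the CDS against the protein sequence is to translate the CDS and compare the result. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated on the page; `translate` is a hypothetical helper, and the sequences must be joined into single strings with line breaks removed before use.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGCTATCAATGGAGGCC"))  # MLSMEA
print(translate("CAGCTGATCTAA"))        # QLI
```

Both fragments agree with the start (`MLSMEA`) and end (`QLI`) of the protein sequence shown above; comparing `translate(full_cds) == full_protein` would extend the check to the whole record.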