CuGenDBv2

Sed0007929 (gene) of Chayote v1 genome

Gene ID: Sed0007929
Organism: Sechium edule (Chayote v1)
Description: GATA transcription factor
Genome location: LG03:9361522..9363681
RNA-Seq Expression: Sed0007929
Synteny: Sed0007929
Gene Ontology terms:
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
IPR016679 - Transcription factor, GATA, plant
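
The genome location above is given as a sequence name plus a start..end range. As a minimal sketch, the span it covers can be computed as follows, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'LG03:9361522..9363681' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


seq_name, start, end = parse_location("LG03:9361522..9363681")
print(seq_name, span_length(start, end))
```

Under that assumption the gene spans 2160 bp on LG03.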


Homology
GenBank top hits (e value, % identity, alignment)
KAG6585379.1 GATA transcription factor 12, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-108, identity: 61.87%)
Query:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF
        MEAP+ FH   +NAY  S F SD   A          DHF VEEL DFSNDDD    D+G GFF         +SA+SSA TAVES NSSSFSG E  SF
Subjt:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF

Query:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPAT------------THAAAIFKPEIVSVPAKARS
         +D   S L D +FSD + IP ++L ELEWL++  EEPFSSED+QKLELITGVKVKPDEP Q   P T              AAAIFKP+IV+VPAKARS
Subjt:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPAT------------THAAAIFKPEIVSVPAKARS

Query:  KRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP-PLK---------IRATPKVAAAKKRD---SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPK
        KRSR  PSNWNNS LL  SP +SSSE +IPA  PPP P+K           AT   AA KK++   SSE G+SA  EGR+CMHC+TDKTPQWRTGPMGPK
Subjt:  KRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP-PLK---------IRATPKVAAAKKRD---SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPK

Query:  TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ ++Q+   Q L++D  HHHHHQ+M+FDSSNG+DYL++Q++AHDY  L+
Subjt:  TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

XP_022951637.1 GATA transcription factor 12-like [Cucurbita moschata] (e value: 3.9e-110, identity: 63.97%)
Query:  MEAPQVFHKKQSNAYSTSPFASDDPPA----GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSFLED--
        MEAP+ FH   +NAY  S F SD   A     DHF VEEL DFSNDDD    D+G GFF         +SA+SSA TAVES NSSSFSG E  SF +D  
Subjt:  MEAPQVFHKKQSNAYSTSPFASDDPPA----GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSFLED--

Query:  -SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQP-----ATTH----AAAIFKPEIVSVPAKARSKRSRAAPS
         S L D +FSD + IP ++L ELEWL++  EEPFSSED+QKLELITGVKVKPDEP Q   P     A +H    AAAIFKP+IV+VPAKARSKRSR  PS
Subjt:  -SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQP-----ATTH----AAAIFKPEIVSVPAKARSKRSRAAPS

Query:  NWNNSLLLSPSPATSSSEPEIPAGPPPP------PLKIRATPKVAAAKKR--DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSG
        NWNNS LL  SP +SSSE +IPA  PPP      P K+ AT   A  KK    SSE G+SA  EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYKSG
Subjt:  NWNNSLLLSPSPATSSSEPEIPAGPPPP------PLKIRATPKVAAAKKR--DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSG

Query:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ ++Q+   Q L++D  HHHHHQ+M+FDSSNG+DYL++Q++AHDY  L+
Subjt:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

XP_023002390.1 GATA transcription factor 12-like [Cucurbita maxima] (e value: 3.5e-111, identity: 63.28%)
Query:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF
        MEAP+ FH   +NAY  S F SD   A          DHF VEEL DFSNDDD    D+G GFF         +S++SSA TAVES NSSSFSGCE  SF
Subjt:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF

Query:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAA---------IFKPEIVSVPAKARSKRS
         +D   S L D +FSD + IP ++L ELEWL++  EEPFSSED+QKLELITGVKVKPDEP Q + P T  +AA         IFKP+IV+VPAKARSKRS
Subjt:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAA---------IFKPEIVSVPAKARSKRS

Query:  RAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKR----DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKS
        R  PSNWNNS LL  SP +SSSE +IPA  PPP    +  PKVAAA K+     SSE G+SA  EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYKS
Subjt:  RAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKR----DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKS

Query:  GRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        GRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ ++Q+   Q L++D  HHHHHQ+M+FDSSNG+DYL++Q++AHDY  L+
Subjt:  GRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

XP_023538437.1 GATA transcription factor 12-like [Cucurbita pepo subsp. pepo] (e value: 8.7e-110, identity: 62.34%)
Query:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF
        MEAP+ FHK   NAY  S F SD   A          DHF VEEL DFSNDDD    D+G GFF         +SA+SSA TAVES NSSSFSG E  SF
Subjt:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF

Query:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPAT------------THAAAIFKPEIVSVPAKARS
         +D   S L D +FSD + IP ++L ELEWL++  EEPFSSED+QKLELITGVKVKPDEP Q + P T              AAAIFKP+IV+VPAKARS
Subjt:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPAT------------THAAAIFKPEIVSVPAKARS

Query:  KRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP------PLKIRATPKVAAAKKR----DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLC
        KRSR  PSNWNNS LL  SP +SSSE +IPA  PPP      P K+ AT    AA K+     SSE G+SA  EGR+CMHC+TDKTPQWRTGPMGPKTLC
Subjt:  KRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP------PLKIRATPKVAAAKKR----DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLC

Query:  NACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        NACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ ++Q+   Q L++D  HHHHHQ+M+FDSSNG+DYL++Q++AHDY  L+
Subjt:  NACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

XP_038886306.1 GATA transcription factor 12-like [Benincasa hispida] (e value: 6.4e-113, identity: 66.84%)
Query:  MEAPQVFHKKQSNAYST--SPFASDD-------PPAG-DHFSVEELFDFSNDDDDDDDDAGVGFF----------HSADSSAVTAVESCNSSSFSGCEP-
        MEAP+ F   Q N Y +  S  +S D         AG +HF VEEL DFSNDDD    D G  F+          +S +SSAVT +ESCNSSSFSGCEP 
Subjt:  MEAPQVFHKKQSNAYST--SPFASDD-------PPAG-DHFSVEELFDFSNDDDDDDDDAGVGFF----------HSADSSAVTAVESCNSSSFSGCEP-

Query:  VSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQP-ATTHAAAIFKPEIVSVPAKARSKRSRAAPS
         SFLED   S+L DA FS +LC+P DDLAELEWLS+ VEE FSSED+QKLELI+GVKV+ DEP+  RQP AT +AAAIFKP+IVSVPAKARSKRSRA PS
Subjt:  VSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQP-ATTHAAAIFKPEIVSVPAKARSKRSRAAPS

Query:  NWNNSLLLSPSPATSSSEPEIPA-GPPPPPLKIRATPKVAAAKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYR
        NWNNS LL  SP T   EPEI A   PP P+K +  PK A AKK+DS E G+S+  EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYR
Subjt:  NWNNSLLLSPSPATSSSEPEIPA-GPPPPPLKIRATPKVAAAKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYR

Query:  PAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        PAASPTFVLTKHSNSHRKVLELRRQKE++R+QQ   Q L+LD   HHQDMIFD+SNGDDYLI Q M  D+RQL+
Subjt:  PAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BBN7 GATA transcription factor (e value: 2.5e-102, identity: 62.4%)
Query:  MEAPQVFHKKQSNAYSTSPFASDD--------PPAGDHFSVEELFDFSNDDDD------DDDDAGVGFFH---------------SADSSAVTAVESCNS
        MEAP+ F   Q NAYS S F+S D          A +HF VEEL DFSN++DD           G G F+               SA+SSA+T +ESCNS
Subjt:  MEAPQVFHKKQSNAYSTSPFASDD--------PPAGDHFSVEELFDFSNDDDD------DDDDAGVGFFH---------------SADSSAVTAVESCNS

Query:  SSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDE-PSQIRQP-ATTHAAAIFKPEIVSVPAKAR
        SS       SF ED   S+L DA FS +LC+P DDLAELEWLSN VEE FSSED+QKLEL++GVKVK DE P+Q  QP AT  AAAIFKPEIVSVPAKAR
Subjt:  SSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDE-PSQIRQP-ATTHAAAIFKPEIVSVPAKAR

Query:  SKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAA-AKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYK
        SKRSRA PSNWNNS LL  SP   ++EPEI A    P    +  PKVAA AKK+D+ + G S+  EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYK
Subjt:  SKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAA-AKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYK

Query:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE++R+QQ   Q L+LD   H QDMIFD+SNGDDYLI Q +  D+RQ++
Subjt:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

A0A5A7VCX1 GATA transcription factor (e value: 2.5e-102, identity: 62.4%)
Query:  MEAPQVFHKKQSNAYSTSPFASDD--------PPAGDHFSVEELFDFSNDDDD------DDDDAGVGFFH---------------SADSSAVTAVESCNS
        MEAP+ F   Q NAYS S F+S D          A +HF VEEL DFSN++DD           G G F+               SA+SSA+T +ESCNS
Subjt:  MEAPQVFHKKQSNAYSTSPFASDD--------PPAGDHFSVEELFDFSNDDDD------DDDDAGVGFFH---------------SADSSAVTAVESCNS

Query:  SSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDE-PSQIRQP-ATTHAAAIFKPEIVSVPAKAR
        SS       SF ED   S+L DA FS +LC+P DDLAELEWLSN VEE FSSED+QKLEL++GVKVK DE P+Q  QP AT  AAAIFKPEIVSVPAKAR
Subjt:  SSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDE-PSQIRQP-ATTHAAAIFKPEIVSVPAKAR

Query:  SKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAA-AKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYK
        SKRSRA PSNWNNS LL  SP   ++EPEI A    P    +  PKVAA AKK+D+ + G S+  EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYK
Subjt:  SKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAA-AKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYK

Query:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE++R+QQ   Q L+LD   H QDMIFD+SNGDDYLI Q +  D+RQ++
Subjt:  SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

A0A6J1BSX6 GATA transcription factor (e value: 2.9e-103, identity: 66.18%)
Query:  YSTSPFASDDPPAGDHFSVEELFDFSNDDDDDDDDAGV-GFFHSADSSAVTAVESCNSS-SFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEW
        +S+S   +D    G+HF VEEL DFSNDD    D +   G  ++  S +V+ +ESCNSS SFS CEP SFL+D   S+L DA+FS +LC+P DDLAELEW
Subjt:  YSTSPFASDDPPAGDHFSVEELFDFSNDDDDDDDDAGV-GFFHSADSSAVTAVESCNSS-SFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEW

Query:  LSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPA---TTHAAAIFKPEIVSVPAKARSKRSRAA-PSNWNNSLLLSPSPATSSSEPEI--PAGPPPP
        LSN VEE FSSED+QKLELI+GVKVK DE  QIRQP+      AA IFKP+IVSVPAKARSKRSRAA P+NWNNS LL  SP TSSSE ++   A  PP 
Subjt:  LSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPA---TTHAAAIFKPEIVSVPAKARSKRSRAA-PSNWNNSLLLSPSPATSSSEPEI--PAGPPPP

Query:  PLKIRATPKVAAAKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMV
        P K         AKK+D  +AG S   EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ 
Subjt:  PLKIRATPKVAAAKKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMV

Query:  RSQQPPHQ-PLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        R+QQ  HQ  LILD   HHQ+MIFD+SNGDDYLI Q +  D+RQL+
Subjt:  RSQQPPHQ-PLILDHHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

A0A6J1GI87 GATA transcription factor (e value: 1.9e-110, identity: 63.97%)
Query:  MEAPQVFHKKQSNAYSTSPFASDDPPA----GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSFLED--
        MEAP+ FH   +NAY  S F SD   A     DHF VEEL DFSNDDD    D+G GFF         +SA+SSA TAVES NSSSFSG E  SF +D  
Subjt:  MEAPQVFHKKQSNAYSTSPFASDDPPA----GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSFLED--

Query:  -SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQP-----ATTH----AAAIFKPEIVSVPAKARSKRSRAAPS
         S L D +FSD + IP ++L ELEWL++  EEPFSSED+QKLELITGVKVKPDEP Q   P     A +H    AAAIFKP+IV+VPAKARSKRSR  PS
Subjt:  -SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQP-----ATTH----AAAIFKPEIVSVPAKARSKRSRAAPS

Query:  NWNNSLLLSPSPATSSSEPEIPAGPPPP------PLKIRATPKVAAAKKR--DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSG
        NWNNS LL  SP +SSSE +IPA  PPP      P K+ AT   A  KK    SSE G+SA  EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYKSG
Subjt:  NWNNSLLLSPSPATSSSEPEIPAGPPPP------PLKIRATPKVAAAKKR--DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSG

Query:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ ++Q+   Q L++D  HHHHHQ+M+FDSSNG+DYL++Q++AHDY  L+
Subjt:  RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

A0A6J1KNT0 GATA transcription factor (e value: 1.7e-111, identity: 63.28%)
Query:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF
        MEAP+ FH   +NAY  S F SD   A          DHF VEEL DFSNDDD    D+G GFF         +S++SSA TAVES NSSSFSGCE  SF
Subjt:  MEAPQVFHKKQSNAYSTSPFASDDPPA---------GDHFSVEELFDFSNDDDDDDDDAGVGFF---------HSADSSAVTAVESCNSSSFSGCEPVSF

Query:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAA---------IFKPEIVSVPAKARSKRS
         +D   S L D +FSD + IP ++L ELEWL++  EEPFSSED+QKLELITGVKVKPDEP Q + P T  +AA         IFKP+IV+VPAKARSKRS
Subjt:  LED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAA---------IFKPEIVSVPAKARSKRS

Query:  RAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKR----DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKS
        R  PSNWNNS LL  SP +SSSE +IPA  PPP    +  PKVAAA K+     SSE G+SA  EGR+CMHC+TDKTPQWRTGPMGPKTLCNACGVRYKS
Subjt:  RAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKR----DSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKS

Query:  GRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL
        GRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ ++Q+   Q L++D  HHHHHQ+M+FDSSNG+DYL++Q++AHDY  L+
Subjt:  GRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILD--HHHHHQDMIFDSSNGDDYLIQQSMAHDYRQLL

SwissProt top hits (e value, % identity, alignment)
O49741 GATA transcription factor 2 (e value: 3.1e-41, identity: 42.19%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDA----QFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQ
        D   +++L DFSN+D           F ++ S   TA  S  SSSF   +  SF     LP +     F   +C+P DD A LEWLS  V++ F+     
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDA----QFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQ

Query:  KL-ELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRA-AP--SNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRD
         L   +T VK +                        S P K RSKRSRA AP    W      SP P  S  + ++ +     P K ++        +  
Subjt:  KL-ELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRA-AP--SNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRD

Query:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH
        SS +  +     RRC HC+++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R  QP    L   HHHH
Subjt:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH

Query:  H
        H
Subjt:  H

O49743 GATA transcription factor 4 (e value: 5.8e-40, identity: 41.52%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQK
        D   +++L DFSND+                SS+ T   S  SS+ S   P SF      S      F+  LC+P DD A LEWLS  V++ FS      
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQK

Query:  LELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRDSSEAG
        L +     V+P+                     +S   K RS+RSRA            P+P+         AG   P  +      VA  K +    A 
Subjt:  LELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRDSSEAG

Query:  LSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRS--QQPPHQP
           A   RRC HC+++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE   S  + PP QP
Subjt:  LSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRS--QQPPHQP

O82632 GATA transcription factor 9 (e value: 6.0e-53, identity: 45.62%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLEL
        D F V++L DFSNDD + DD        S+  S  T  +S NSSS        F + +      FSD L IP DD+AELEWLSN VEE F+ ED  KL L
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLEL

Query:  ITGVKVKPDEPSQIR-----QPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLL-LSPSPATSSSEPEIPAGPPPPPLKIRATPK--VAAAKKRD
         +G+K      S +      +P   H         V+VPAKARSKRSR+A S W + LL L+ S  T+            P  K R   +   A     D
Subjt:  ITGVKVKPDEPSQIR-----QPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLL-LSPSPATSSSEPEIPAGPPPPPLKIRATPK--VAAAKKRD

Query:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH
          E+G      GRRC+HC+T+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKEM           +L     
Subjt:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH

Query:  HQDMIFDSSNGDDYLIQQSMAH---DYRQLL
           ++   SNG+D+L+  +  H   D+R L+
Subjt:  HQDMIFDSSNGDDYLIQQSMAH---DYRQLL

P69781 GATA transcription factor 12 (e value: 6.3e-71, identity: 51.65%)
Query:  EAPQVFHKKQSNAYSTSPFASDDPPAGDHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPD-AQFSDQLCIP
        EA + FH        TS FA DD           L DFSNDDD+++D         ADS+  T +   +SS+FS  +  SF    D+ D   FS  LCIP
Subjt:  EAPQVFHKKQSNAYSTSPFASDDPPAGDHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPD-AQFSDQLCIP

Query:  EDDLA-ELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLS----PSPATSS---S
         DDLA ELEWLSN+V+E  S ED+ KLELI+G K +PD  S    P   ++++      VSVPAKARSKRSRAA  NW +  LL      SP T     S
Subjt:  EDDLA-ELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLS----PSPATSS---S

Query:  EPEIPAGPPPPPLKIRATPKVAAA-----KKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHS
          +  + P  PPL +    K  A      +K+D S    S   E RRC+HC+TDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHS
Subjt:  EPEIPAGPPPPPLKIRATPKVAAA-----KKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHS

Query:  NSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQD--MIFD-SSNGDDYLIQQSMAHDYRQLL
        NSHRKV+ELRRQKEM R+    H   I  HHHH  D  MIFD SS+GDDYLI  ++  D+RQL+
Subjt:  NSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQD--MIFD-SSNGDDYLIQQSMAHDYRQLL

Q9FH57 GATA transcription factor 5 (e value: 5.4e-38, identity: 39.46%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFH-----SADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPF-----
        D FSV++L D SNDD   D++  +   H     S++           SS FSGC+    L  S         +L +P DDLA LEWLS+ VE+ F     
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFH-----SADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPF-----

Query:  ---SSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP---------PLK
           +    +K   +TG +  P         A T       P    VPAKARSKR+R     W+     S  P++S S     +GP  P         P+ 
Subjt:  ---SSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP---------PLK

Query:  IRATPKVAAAKKRDSSEAGLSAAVE----GRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
            P      K+ S+E+  S  ++     R+C HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE
Subjt:  IRATPKVAAAKKRDSSEAGLSAAVE----GRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE

Arabidopsis top hits (e value, % identity, alignment)
AT2G45050.1 GATA transcription factor 2 (e value: 2.2e-42, identity: 42.19%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDA----QFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQ
        D   +++L DFSN+D           F ++ S   TA  S  SSSF   +  SF     LP +     F   +C+P DD A LEWLS  V++ F+     
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDA----QFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQ

Query:  KL-ELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRA-AP--SNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRD
         L   +T VK +                        S P K RSKRSRA AP    W      SP P  S  + ++ +     P K ++        +  
Subjt:  KL-ELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRA-AP--SNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRD

Query:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH
        SS +  +     RRC HC+++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R  QP    L   HHHH
Subjt:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH

Query:  H
        H
Subjt:  H

AT3G60530.1 GATA transcription factor 4 (e value: 4.1e-41, identity: 41.52%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQK
        D   +++L DFSND+                SS+ T   S  SS+ S   P SF      S      F+  LC+P DD A LEWLS  V++ FS      
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLED---SDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQK

Query:  LELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRDSSEAG
        L +     V+P+                     +S   K RS+RSRA            P+P+         AG   P  +      VA  K +    A 
Subjt:  LELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRDSSEAG

Query:  LSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRS--QQPPHQP
           A   RRC HC+++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE   S  + PP QP
Subjt:  LSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRS--QQPPHQP

AT4G32890.1 GATA transcription factor 9 (e value: 4.2e-54, identity: 45.62%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLEL
        D F V++L DFSNDD + DD        S+  S  T  +S NSSS        F + +      FSD L IP DD+AELEWLSN VEE F+ ED  KL L
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPFSSEDIQKLEL

Query:  ITGVKVKPDEPSQIR-----QPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLL-LSPSPATSSSEPEIPAGPPPPPLKIRATPK--VAAAKKRD
         +G+K      S +      +P   H         V+VPAKARSKRSR+A S W + LL L+ S  T+            P  K R   +   A     D
Subjt:  ITGVKVKPDEPSQIR-----QPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLL-LSPSPATSSSEPEIPAGPPPPPLKIRATPK--VAAAKKRD

Query:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH
          E+G      GRRC+HC+T+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKEM           +L     
Subjt:  SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHH

Query:  HQDMIFDSSNGDDYLIQQSMAH---DYRQLL
           ++   SNG+D+L+  +  H   D+R L+
Subjt:  HQDMIFDSSNGDDYLIQQSMAH---DYRQLL

AT5G25830.1 GATA transcription factor 12 (e value: 4.5e-72, identity: 51.65%)
Query:  EAPQVFHKKQSNAYSTSPFASDDPPAGDHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPD-AQFSDQLCIP
        EA + FH        TS FA DD           L DFSNDDD+++D         ADS+  T +   +SS+FS  +  SF    D+ D   FS  LCIP
Subjt:  EAPQVFHKKQSNAYSTSPFASDDPPAGDHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPD-AQFSDQLCIP

Query:  EDDLA-ELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLS----PSPATSS---S
         DDLA ELEWLSN+V+E  S ED+ KLELI+G K +PD  S    P   ++++      VSVPAKARSKRSRAA  NW +  LL      SP T     S
Subjt:  EDDLA-ELEWLSNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLS----PSPATSS---S

Query:  EPEIPAGPPPPPLKIRATPKVAAA-----KKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHS
          +  + P  PPL +    K  A      +K+D S    S   E RRC+HC+TDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHS
Subjt:  EPEIPAGPPPPPLKIRATPKVAAA-----KKRDSSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHS

Query:  NSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQD--MIFD-SSNGDDYLIQQSMAHDYRQLL
        NSHRKV+ELRRQKEM R+    H   I  HHHH  D  MIFD SS+GDDYLI  ++  D+RQL+
Subjt:  NSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQD--MIFD-SSNGDDYLIQQSMAHDYRQLL

AT5G66320.1 GATA transcription factor 5 (e value: 3.8e-39, identity: 39.46%)
Query:  DHFSVEELFDFSNDDDDDDDDAGVGFFH-----SADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPF-----
        D FSV++L D SNDD   D++  +   H     S++           SS FSGC+    L  S         +L +P DDLA LEWLS+ VE+ F     
Subjt:  DHFSVEELFDFSNDDDDDDDDAGVGFFH-----SADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWLSNLVEEPF-----

Query:  ---SSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP---------PLK
           +    +K   +TG +  P         A T       P    VPAKARSKR+R     W+     S  P++S S     +GP  P         P+ 
Subjt:  ---SSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPP---------PLK

Query:  IRATPKVAAAKKRDSSEAGLSAAVE----GRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE
            P      K+ S+E+  S  ++     R+C HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE
Subjt:  IRATPKVAAAKKRDSSEAGLSAAVE----GRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE


Sequences
CDS sequence
ATGGAAGCTCCTCAAGTTTTCCACAAGAAACAATCCAATGCCTACTCTACTTCCCCCTTCGCCTCCGACGACCCCCCTGCTGGAGACCATTTCAGTGTCGAGGAGCTTTT
CGACTTCTCCAACGACGACGACGACGACGACGACGACGCTGGAGTTGGTTTCTTCCATTCCGCCGACTCCTCCGCCGTCACCGCCGTCGAGAGTTGTAATTCTTCGTCGT
TTTCGGGTTGCGAACCCGTTTCGTTCTTGGAGGATTCTGATTTACCCGACGCCCAATTCTCCGACCAACTCTGCATTCCGGAAGACGATTTAGCTGAGCTGGAATGGCTT
TCAAATTTGGTGGAGGAACCATTTTCCAGCGAAGATATACAAAAATTAGAACTCATCACCGGAGTCAAAGTCAAACCCGACGAGCCCTCACAAATCCGGCAACCCGCCAC
CACCCACGCCGCCGCAATTTTCAAACCGGAGATCGTTTCGGTTCCTGCGAAGGCGCGTAGCAAACGCTCACGCGCCGCCCCATCCAATTGGAACAACTCCCTCCTCCTCT
CCCCCTCTCCGGCCACCTCCTCGTCGGAACCCGAAATCCCCGCCGGACCGCCGCCGCCGCCGCTAAAAATCAGAGCAACCCCGAAAGTGGCGGCGGCGAAAAAGAGAGAC
TCGTCGGAGGCGGGATTGTCCGCCGCCGTGGAGGGGCGGCGGTGCATGCACTGCTCCACCGACAAGACGCCGCAGTGGCGGACAGGCCCAATGGGCCCAAAGACGCTGTG
TAACGCTTGTGGGGTCCGGTACAAGTCGGGTCGGTTGGTACCCGAGTACCGACCCGCCGCCAGCCCCACCTTCGTCCTGACCAAACACTCCAATTCTCACCGGAAAGTTT
TGGAGCTCCGGCGACAGAAGGAGATGGTCAGATCTCAACAGCCGCCACACCAGCCGTTGATTTTGGATCATCATCATCATCATCAGGATATGATCTTTGATTCATCCAAC
GGTGACGATTATCTCATCCAGCAAAGCATGGCCCACGATTACCGGCAGCTTTTATGA
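
The CDS above can be checked against the protein sequence listed in this record by translating it. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and reading frame 0; `translate` is a hypothetical helper, not a CuGenDB function. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order so they line up with the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon: end of the open reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGAAGCTCCTCAAGTTTTCCACAAGAAA"))
print(translate("CGGCAGCTTTTATGA"))
```

The two fragments translate to MEAPQVFHKK and RQLL, which agree with the start and end of the protein sequence in this record. Passing the full CDS (with line breaks removed) to `translate` should give the complete protein under the same assumptions.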
mRNA sequence
TAGCCACACTCTGTTTCTCTCTCTAACTTTCTAACTGTTTTTTTGTTTTCCCCCAATTTTGGTTCATAATTGTTCTTCATTTTGCTCTTTTCTTGATCTTCCTCATCAGT
TCATGGAAGCTCCTCAAGTTTTCCACAAGAAACAATCCAATGCCTACTCTACTTCCCCCTTCGCCTCCGACGACCCCCCTGCTGGAGACCATTTCAGTGTCGAGGAGCTT
TTCGACTTCTCCAACGACGACGACGACGACGACGACGACGCTGGAGTTGGTTTCTTCCATTCCGCCGACTCCTCCGCCGTCACCGCCGTCGAGAGTTGTAATTCTTCGTC
GTTTTCGGGTTGCGAACCCGTTTCGTTCTTGGAGGATTCTGATTTACCCGACGCCCAATTCTCCGACCAACTCTGCATTCCGGAAGACGATTTAGCTGAGCTGGAATGGC
TTTCAAATTTGGTGGAGGAACCATTTTCCAGCGAAGATATACAAAAATTAGAACTCATCACCGGAGTCAAAGTCAAACCCGACGAGCCCTCACAAATCCGGCAACCCGCC
ACCACCCACGCCGCCGCAATTTTCAAACCGGAGATCGTTTCGGTTCCTGCGAAGGCGCGTAGCAAACGCTCACGCGCCGCCCCATCCAATTGGAACAACTCCCTCCTCCT
CTCCCCCTCTCCGGCCACCTCCTCGTCGGAACCCGAAATCCCCGCCGGACCGCCGCCGCCGCCGCTAAAAATCAGAGCAACCCCGAAAGTGGCGGCGGCGAAAAAGAGAG
ACTCGTCGGAGGCGGGATTGTCCGCCGCCGTGGAGGGGCGGCGGTGCATGCACTGCTCCACCGACAAGACGCCGCAGTGGCGGACAGGCCCAATGGGCCCAAAGACGCTG
TGTAACGCTTGTGGGGTCCGGTACAAGTCGGGTCGGTTGGTACCCGAGTACCGACCCGCCGCCAGCCCCACCTTCGTCCTGACCAAACACTCCAATTCTCACCGGAAAGT
TTTGGAGCTCCGGCGACAGAAGGAGATGGTCAGATCTCAACAGCCGCCACACCAGCCGTTGATTTTGGATCATCATCATCATCATCAGGATATGATCTTTGATTCATCCA
ACGGTGACGATTATCTCATCCAGCAAAGCATGGCCCACGATTACCGGCAGCTTTTATGATCGAATTCCG
Protein sequence
MEAPQVFHKKQSNAYSTSPFASDDPPAGDHFSVEELFDFSNDDDDDDDDAGVGFFHSADSSAVTAVESCNSSSFSGCEPVSFLEDSDLPDAQFSDQLCIPEDDLAELEWL
SNLVEEPFSSEDIQKLELITGVKVKPDEPSQIRQPATTHAAAIFKPEIVSVPAKARSKRSRAAPSNWNNSLLLSPSPATSSSEPEIPAGPPPPPLKIRATPKVAAAKKRD
SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRSQQPPHQPLILDHHHHHQDMIFDSSN
GDDYLIQQSMAHDYRQLL
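
The InterPro annotations in this record point to a GATA-type zinc finger, which is commonly described as four cysteines spaced C-X2-C-X17-20-C-X2-C. The sketch below searches a fragment of the protein sequence above for that spacing with a regular expression. The pattern is a simplified illustration and not the InterPro signature itself, and `find_gata_finger` is a hypothetical helper.

```python
import re

# Simplified GATA-type zinc finger spacing: C-X2-C-X17-20-C-X2-C.
# The spacer between the two cysteine pairs is captured so its length can be read.
GATA_FINGER = re.compile(r"C..C(.{17,20})C..C")


def find_gata_finger(protein: str):
    """Return (matched motif, spacer length) for the first hit, or None."""
    match = GATA_FINGER.search(protein)
    if match is None:
        return None
    return match.group(0), len(match.group(1))


# Fragment copied from the protein sequence above.
fragment = "SSEAGLSAAVEGRRCMHCSTDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKV"
print(find_gata_finger(fragment))
```

On this fragment the pattern matches CMHCSTDKTPQWRTGPMGPKTLCNAC with an 18-residue spacer, the same stretch that is conserved across the alignments listed in the Homology section.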