CuGenDBv2

Chy4G076540 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy4G076540
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: GATA transcription factor
Genome location: chrH04:13453297..13454374
RNA-Seq Expression: Chy4G076540
Synteny: Chy4G076540
Gene Ontology terms:
GO:0030154 - cell differentiation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
IPR016679 - Transcription factor, GATA, plant
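
The genome location string in this record can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` layout seen above and assuming the coordinates are 1-based and inclusive (the record itself does not state this). The names `Interval` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Interval(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


# Matches strings such as "chrH04:13453297..13454374".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Interval:
    """Parse a 'seqid:start..end' location string into an Interval."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Interval(match["seqid"], start, end)


loc = parse_location("chrH04:13453297..13454374")
print(loc.seqid, loc.start, loc.end, loc.length)  # chrH04 13453297 13454374 1078
```

Under the inclusive-coordinate assumption the gene spans 1078 bp on chrH04.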


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004150603.3 GATA transcription factor 4 [Cucumis sativus] (e value: 5.00e-231; % identity: 97.2)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL
        MELPGYLVGGYYGTGAPQFSPDNKKS+AEHFP+DEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDN  L KFES SFCEAQFSSEL
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL

Query:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
        CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISS ATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
Subjt:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA

Query:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
        P+K EGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
Subjt:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS

Query:  RSNGCDEYLIHRHNGGDFSHMM
        RSNGCDEYLIHRHNGGDFSHMM
Subjt:  RSNGCDEYLIHRHNGGDFSHMM

XP_008447537.1 PREDICTED: GATA transcription factor 4-like [Cucumis melo] (e value: 2.65e-224; % identity: 95.05)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSD-SSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSE
        MELP YLVGGYYGTGA QFSP NKKS++EHFPVDEYLLDFSNEDVAMH GFFDNVAGNCSD SSTLTAIDSCNSSVSGGDN  LGKFES SFCEAQFSSE
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSD-SSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSE

Query:  LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT
        LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPF+SGGISS ATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT
Subjt:  LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT

Query:  APDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF
        AP+K EG M KPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF
Subjt:  APDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF

Query:  SRSNGCDEYLIHRHNGGDFSHMM
        SRSNGCDEYLIHRHNGGDFSHMM
Subjt:  SRSNGCDEYLIHRHNGGDFSHMM

XP_022997862.1 GATA transcription factor 4-like [Cucurbita maxima] (e value: 3.33e-174; % identity: 77.64)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL
        ME+P YL+GG+YG GA QFSP+   SSA+HF VDEYLLDFSN+DVA++SGFFD+VA NCSDSST+TAI+SCNSS+S GDN  LG F SASF EAQFS+EL
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL

Query:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
        CIP DDLAELEWLSNFVE+SFSTEEI+KDFP IPFL+G   + A PET SSSG TAFGYG+ KTT+FF  EAL  PGKARSKRSR +PCDWSTRLLQA  
Subjt:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA

Query:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
        P K+E T     + SGRKCLHCAAEKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPA+SPT+VSTKHSNSHRKVMELRRQKE+Q QEQF+SQ SIF 
Subjt:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS

Query:  RSNGCDEYLIHRHNGGDFSHMM
        RSNGCDEYLIHR NGGDF HM+
Subjt:  RSNGCDEYLIHRHNGGDFSHMM

XP_023524877.1 GATA transcription factor 4-like [Cucurbita pepo subsp. pepo] (e value: 7.04e-176; % identity: 78.26)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL
        ME+PGYL+GG+YG GA QFSP+   S+ +HF VDEYLLDFSN+DVAM SGFFDNVA NCSDSST+TAIDSCNSS+S GDN  LG F SASF EAQFS+EL
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL

Query:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
        CIP DDLAELEWLSNFVE+SFSTEEI+KDFP IPFL+G   + A PET SSSG TAFGYG+AKTT+FF  EAL  PGKARSKRSR +PCDWSTRLLQA  
Subjt:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA

Query:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
        P K+E T     + SGRKCLHCAAEKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVSTKHSNSHRKVMELRRQKE+Q QEQF+SQ SI  
Subjt:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS

Query:  RSNGCDEYLIHRHNGGDFSHMM
        RSNGCDEYL+HR NGGDF HM+
Subjt:  RSNGCDEYLIHRHNGGDFSHMM

XP_038901882.1 GATA transcription factor 9-like [Benincasa hispida] (e value: 3.98e-212; % identity: 89.75)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL
        ME+P YLVGGYYGTGA QFSP+ +KS+AEHF VDEYLLD SNEDVAMH+GFFDNVAGNCSDSST+TAI+SCNSSVSGGDN  LGKFES  FCE QFSSEL
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL

Query:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
        CIPCDDLAELEWLSNFVEESFSTEEI+KDF AIPFLSGGIS+  TPET SSSGATAFGYG+AKTT+F HSEALTLPGKARSKRSRATPCDWSTRL +ATA
Subjt:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA

Query:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
        P+K EG M KPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
Subjt:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS

Query:  RSNGCDEYLIHRHNGGDFSHMM
        RSNGCDEYLIHRHNGGDFS MM
Subjt:  RSNGCDEYLIHRHNGGDFSHMM

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L802 GATA transcription factor (e value: 6.1e-172; % identity: 97.11)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL
        MELPGYLVGGYYGTGAPQFSPDNKKS+AEHFP+DEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDN  L KFES SFCEAQFSSEL
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL

Query:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
        CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISS ATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
Subjt:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA

Query:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
        P+K EGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
Subjt:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS

Query:  RSNGCDEYLIH
        RSNGCDEYLIH
Subjt:  RSNGCDEYLIH

A0A1S3BI99 GATA transcription factor (e value: 2.2e-174; % identity: 95.05)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSD-SSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSE
        MELP YLVGGYYGTGA QFSP NKKS++EHFPVDEYLLDFSNEDVAMH GFFDNVAGNCSD SSTLTAIDSCNSSVSGGDN  LGKFES SFCEAQFSSE
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSD-SSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSE

Query:  LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT
        LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPF+SGGISS ATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT
Subjt:  LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT

Query:  APDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF
        AP+K EG M KPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF
Subjt:  APDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF

Query:  SRSNGCDEYLIHRHNGGDFSHMM
        SRSNGCDEYLIHRHNGGDFSHMM
Subjt:  SRSNGCDEYLIHRHNGGDFSHMM

A0A5A7U6E0 GATA transcription factor (e value: 2.2e-174; % identity: 95.05)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSD-SSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSE
        MELP YLVGGYYGTGA QFSP NKKS++EHFPVDEYLLDFSNEDVAMH GFFDNVAGNCSD SSTLTAIDSCNSSVSGGDN  LGKFES SFCEAQFSSE
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSD-SSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSE

Query:  LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT
        LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPF+SGGISS ATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT
Subjt:  LCIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQAT

Query:  APDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF
        AP+K EG M KPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF
Subjt:  APDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF

Query:  SRSNGCDEYLIHRHNGGDFSHMM
        SRSNGCDEYLIHRHNGGDFSHMM
Subjt:  SRSNGCDEYLIHRHNGGDFSHMM

A0A6J1GB38 GATA transcription factor (e value: 5.4e-136; % identity: 77.95)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL
        ME+P YL+GG+Y  GA QFSP+   S+ +HF VDEYLLDFSN+DVAM SGFFDNVA NCSDSST+TAI+SCNSS+S GDN  LG F SASF EAQFS+EL
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL

Query:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
        CIP DDLAELEWLSNFVE+SFSTEEI+KDFP IPFL+G   + A PET SSSG TAFGYG+AKTT+FF  EA  LPGKARSKRSR +PCDWSTRLLQA  
Subjt:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA

Query:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
        P K+E T     + SGRKCLHCAAEKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVSTKHSNSHRKVMELRRQKE+Q QEQF+SQ SIF 
Subjt:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS

Query:  RSNGCDEYLIHRHNGGDFSHMM
        RSNGCDEYLIHR NGGDF HM+
Subjt:  RSNGCDEYLIHRHNGGDFSHMM

A0A6J1K681 GATA transcription factor (e value: 5.4e-136; % identity: 77.64)
Query:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL
        ME+P YL+GG+YG GA QFSP+   SSA+HF VDEYLLDFSN+DVA++SGFFD+VA NCSDSST+TAI+SCNSS+S GDN  LG F SASF EAQFS+EL
Subjt:  MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSEL

Query:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA
        CIP DDLAELEWLSNFVE+SFSTEEI+KDFP IPFL+G   + A PET SSSG TAFGYG+ KTT+FF  EA  LPGKARSKRSR +PCDWSTRLLQA  
Subjt:  CIPCDDLAELEWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATA

Query:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS
        P K+E T     + SGRKCLHCAAEKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPA+SPT+VSTKHSNSHRKVMELRRQKE+Q QEQF+SQ SIF 
Subjt:  PDKAEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS

Query:  RSNGCDEYLIHRHNGGDFSHMM
        RSNGCDEYLIHR NGGDF HM+
Subjt:  RSNGCDEYLIHRHNGGDFSHMM

SwissProt top hits (columns: e value, % identity, alignment)
O49741 GATA transcription factor 2 (e value: 5.5e-45; % identity: 43.54)
Query:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEE
        SS +   +D+ LLDFSNED+            + S S   TA  S +S     +  F      +S     F  ++C+P DD A LEWLS FV++SF+   
Subjt:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEE

Query:  IDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRAT---PCDWSTRLLQATAPDKAEGTMAKP-----------
           DFPA P   GG  ++   ETS                          PGK RSKRSRA       WS   L++           KP           
Subjt:  IDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRAT---PCDWSTRLLQATAPDKAEGTMAKP-----------

Query:  ---------ETTSG---RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFV
                 ETT G   R+C HCA+EKTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPASSPTFV T+HSNSHRKVMELRRQKE+  Q Q V
Subjt:  ---------ETTSG---RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFV

O49743 GATA transcription factor 4 (e value: 1.5e-45; % identity: 44.93)
Query:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE----AQFSSELCIPCDDLAELEWLSNFVEESF
        SS +   +D+ LLDFSN+++              S SST+T+  S  SS +  +N F   F S+++        F+ +LC+P DD A LEWLS FV++SF
Subjt:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE----AQFSSELCIPCDDLAELEWLSNFVEESF

Query:  STEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRA---------TPCDWSTRLLQATAPDKAEGTMAKPE
        S      DFPA P     ++ T  PE                         ++  GK RS+RSRA          P   S        P   +   A+  
Subjt:  STEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRA---------TPCDWSTRLLQATAPDKAEGTMAKPE

Query:  TTSG-RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ
        T  G R+C HCA+EKTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPASSPTFV T+HSNSHRKVMELRRQKE Q
Subjt:  TTSG-RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ

O82632 GATA transcription factor 9 (e value: 4.2e-53; % identity: 45.94)
Query:  EHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEEIDK
        + F VD+ LLDFSN+D  +  G   N   + S  ST T  DS NS              S+ F +    S+L IP DD+AELEWLSNFVEESF+ E+ DK
Subjt:  EHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEEIDK

Query:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEA--LTLPGKARSKRSRATPCDWSTRLL-----QATAPDKAEGTMAKP----------
            +   SG       P+T+ S+              F   +   + +P KARSKRSR+    W++RLL       T P K +  + +           
Subjt:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEA--LTLPGKARSKRSRATPCDWSTRLL-----QATAPDKAEGTMAKP----------

Query:  -ETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS-----RSNGC
         E+  GR+CLHCA EKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFV  +HSNSHRKVMELRRQKEM+  E  +SQ    +     RSNG 
Subjt:  -ETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS-----RSNGC

Query:  DEYLIH---RHNGGDFSHMM
        +++L+H    H   DF H++
Subjt:  DEYLIH---RHNGGDFSHMM

P69781 GATA transcription factor 12 (e value: 5.0e-54; % identity: 42.11)
Query:  FPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAI-DSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLA-ELEWLSNFVEESFSTEEIDK
        F VD+ L+DFSN+D        D      +DS+T T I DS N S +      L  F         FS +LCIP DDLA ELEWLSN V+ES S E++ K
Subjt:  FPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAI-DSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLA-ELEWLSNFVEESFSTEEIDK

Query:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTR-----------------------LLQATAPDKAE
            +  +SG     + P+  S +G+      N  +++   +  +++P KARSKRSRA  C+W++R                       L   T+P    
Subjt:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTR-----------------------LLQATAPDKAE

Query:  GTMAKPETTSG------------------RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ
          + K +   G                  R+CLHCA +KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV  KHSNSHRKVMELRRQKEM 
Subjt:  GTMAKPETTSG------------------RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ

Query:  -------HQEQFVSQSSIFSRSNGCDEYLIHRHNGGDFSHMM
               H       + IF  S+  D+YLIH + G DF  ++
Subjt:  -------HQEQFVSQSSIFSRSNGCDEYLIHRHNGGDFSHMM

Q9FH57 GATA transcription factor 5 (e value: 2.9e-33; % identity: 36.39)
Query:  SAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE---AQFSSELCIPCDDLAELEWLSNFVEESFST
        S + F VD+ LLD SN+DV          A   +D      +   +S     D   L +    S C+   +  +SEL +P DDLA LEWLS+FVE+SF+ 
Subjt:  SAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE---AQFSSELCIPCDDLAELEWLSNFVEESFST

Query:  EEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATAPDKAEGT----------------
                   +    ++ T T + +  +G            T F S    +P KARSKR+R     WS     ++ P  +  T                
Subjt:  EEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATAPDKAEGT----------------

Query:  MAKPETTS---------------------------GRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMEL
        + +P  TS                            RKC HC  +KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E+
Subjt:  MAKPETTS---------------------------GRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMEL

Query:  RRQKE
        RR+KE
Subjt:  RRQKE

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G45050.1 GATA transcription factor 2 (e value: 3.9e-46; % identity: 43.54)
Query:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEE
        SS +   +D+ LLDFSNED+            + S S   TA  S +S     +  F      +S     F  ++C+P DD A LEWLS FV++SF+   
Subjt:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEE

Query:  IDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRAT---PCDWSTRLLQATAPDKAEGTMAKP-----------
           DFPA P   GG  ++   ETS                          PGK RSKRSRA       WS   L++           KP           
Subjt:  IDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRAT---PCDWSTRLLQATAPDKAEGTMAKP-----------

Query:  ---------ETTSG---RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFV
                 ETT G   R+C HCA+EKTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRPASSPTFV T+HSNSHRKVMELRRQKE+  Q Q V
Subjt:  ---------ETTSG---RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFV

AT3G60530.1 GATA transcription factor 4 (e value: 1.0e-46; % identity: 44.93)
Query:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE----AQFSSELCIPCDDLAELEWLSNFVEESF
        SS +   +D+ LLDFSN+++              S SST+T+  S  SS +  +N F   F S+++        F+ +LC+P DD A LEWLS FV++SF
Subjt:  SSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE----AQFSSELCIPCDDLAELEWLSNFVEESF

Query:  STEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRA---------TPCDWSTRLLQATAPDKAEGTMAKPE
        S      DFPA P     ++ T  PE                         ++  GK RS+RSRA          P   S        P   +   A+  
Subjt:  STEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRA---------TPCDWSTRLLQATAPDKAEGTMAKPE

Query:  TTSG-RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ
        T  G R+C HCA+EKTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPASSPTFV T+HSNSHRKVMELRRQKE Q
Subjt:  TTSG-RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ

AT4G32890.1 GATA transcription factor 9 (e value: 3.0e-54; % identity: 45.94)
Query:  EHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEEIDK
        + F VD+ LLDFSN+D  +  G   N   + S  ST T  DS NS              S+ F +    S+L IP DD+AELEWLSNFVEESF+ E+ DK
Subjt:  EHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAELEWLSNFVEESFSTEEIDK

Query:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEA--LTLPGKARSKRSRATPCDWSTRLL-----QATAPDKAEGTMAKP----------
            +   SG       P+T+ S+              F   +   + +P KARSKRSR+    W++RLL       T P K +  + +           
Subjt:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEA--LTLPGKARSKRSRATPCDWSTRLL-----QATAPDKAEGTMAKP----------

Query:  -ETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS-----RSNGC
         E+  GR+CLHCA EKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFV  +HSNSHRKVMELRRQKEM+  E  +SQ    +     RSNG 
Subjt:  -ETTSGRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS-----RSNGC

Query:  DEYLIH---RHNGGDFSHMM
        +++L+H    H   DF H++
Subjt:  DEYLIH---RHNGGDFSHMM

AT5G25830.1 GATA transcription factor 12 (e value: 3.6e-55; % identity: 42.11)
Query:  FPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAI-DSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLA-ELEWLSNFVEESFSTEEIDK
        F VD+ L+DFSN+D        D      +DS+T T I DS N S +      L  F         FS +LCIP DDLA ELEWLSN V+ES S E++ K
Subjt:  FPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAI-DSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLA-ELEWLSNFVEESFSTEEIDK

Query:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTR-----------------------LLQATAPDKAE
            +  +SG     + P+  S +G+      N  +++   +  +++P KARSKRSRA  C+W++R                       L   T+P    
Subjt:  DFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTR-----------------------LLQATAPDKAE

Query:  GTMAKPETTSG------------------RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ
          + K +   G                  R+CLHCA +KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV  KHSNSHRKVMELRRQKEM 
Subjt:  GTMAKPETTSG------------------RKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQ

Query:  -------HQEQFVSQSSIFSRSNGCDEYLIHRHNGGDFSHMM
               H       + IF  S+  D+YLIH + G DF  ++
Subjt:  -------HQEQFVSQSSIFSRSNGCDEYLIHRHNGGDFSHMM

AT5G66320.1 GATA transcription factor 5 (e value: 2.0e-34; % identity: 36.39)
Query:  SAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE---AQFSSELCIPCDDLAELEWLSNFVEESFST
        S + F VD+ LLD SN+DV          A   +D      +   +S     D   L +    S C+   +  +SEL +P DDLA LEWLS+FVE+SF+ 
Subjt:  SAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCE---AQFSSELCIPCDDLAELEWLSNFVEESFST

Query:  EEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATAPDKAEGT----------------
                   +    ++ T T + +  +G            T F S    +P KARSKR+R     WS     ++ P  +  T                
Subjt:  EEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATAPDKAEGT----------------

Query:  MAKPETTS---------------------------GRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMEL
        + +P  TS                            RKC HC  +KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E+
Subjt:  MAKPETTS---------------------------GRKCLHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMEL

Query:  RRQKE
        RR+KE
Subjt:  RRQKE
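
The % identity values listed for the hits above can be related to the alignment blocks. The sketch below assumes the usual BLAST-style middle line, in which a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. It counts identical columns over one or more (query, middle line) segments. The function name `identity_counts` is a hypothetical helper, and the single segment used here is the final block of the Benincasa hispida alignment, so the printed figure covers that block only, not the whole hit.

```python
def identity_counts(segments):
    """Return (identical columns, total columns) over (query, midline) pairs."""
    identical = 0
    total = 0
    for query, midline in segments:
        # Scraped middle lines can lose trailing blanks, so pad to the query width.
        padded = midline.ljust(len(query))[:len(query)]
        # Only letters on the middle line count as identities ('+' and ' ' do not).
        identical += sum(ch.isalpha() for ch in padded)
        total += len(query)
    return identical, total


# Final alignment block of the Benincasa hispida hit, as shown in this record.
segments = [("RSNGCDEYLIHRHNGGDFSHMM", "RSNGCDEYLIHRHNGGDFS MM")]
identical, total = identity_counts(segments)
print(f"{identical}/{total} identical = {100 * identical / total:.2f}%")
# prints: 21/22 identical = 95.45%
```

Passing all blocks of a hit as segments gives the identity over the full alignment length, gap columns included.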


Sequences
CDS sequence
ATGGAACTCCCCGGGTATCTTGTCGGTGGCTACTACGGCACCGGAGCCCCTCAATTTTCCCCGGACAACAAAAAATCCTCCGCCGAACATTTCCCTGTCGATGAATATTT
ATTGGACTTCTCCAATGAAGATGTGGCAATGCATAGCGGTTTCTTCGATAATGTCGCCGGAAATTGCAGTGATTCCTCCACTCTTACTGCCATTGACAGCTGTAATTCCT
CTGTCTCTGGCGGCGATAACCATTTCTTAGGAAAATTTGAGTCCGCAAGTTTCTGTGAAGCTCAATTCTCGAGCGAACTCTGCATTCCGTGCGATGATTTGGCGGAACTC
GAATGGCTGTCGAATTTCGTTGAAGAATCATTTTCGACGGAGGAGATTGATAAGGATTTTCCAGCAATTCCATTCCTCTCCGGAGGAATAAGTTCGACGGCGACTCCAGA
AACATCATCGTCCTCAGGAGCGACAGCGTTTGGTTACGGAAATGCAAAAACGACAACCTTTTTTCACAGCGAAGCTCTCACGCTCCCCGGCAAAGCCAGAAGCAAACGTT
CACGCGCTACTCCATGCGATTGGTCGACGAGGCTCCTCCAAGCGACGGCGCCGGATAAAGCGGAAGGGACGATGGCGAAGCCGGAAACGACGTCGGGTCGGAAATGCCTA
CATTGCGCGGCGGAGAAGACGCCGCAGTGGCGGACTGGGCCGATGGGCCCAAAAACGCTTTGTAATGCCTGTGGGGTACGGTACAAATCGGGTCGGCTCGTACCTGAATA
CCGACCCGCTTCGAGCCCGACATTTGTGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAGCTCCGACGGCAAAAGGAGATGCAACATCAAGAGCAGTTTGTAA
GTCAGAGTTCGATATTCAGCAGATCCAACGGCTGTGATGAGTATTTAATCCACCGTCACAACGGCGGTGATTTTAGTCACATGATGTAG
mRNA sequence
ATGGAACTCCCCGGGTATCTTGTCGGTGGCTACTACGGCACCGGAGCCCCTCAATTTTCCCCGGACAACAAAAAATCCTCCGCCGAACATTTCCCTGTCGATGAATATTT
ATTGGACTTCTCCAATGAAGATGTGGCAATGCATAGCGGTTTCTTCGATAATGTCGCCGGAAATTGCAGTGATTCCTCCACTCTTACTGCCATTGACAGCTGTAATTCCT
CTGTCTCTGGCGGCGATAACCATTTCTTAGGAAAATTTGAGTCCGCAAGTTTCTGTGAAGCTCAATTCTCGAGCGAACTCTGCATTCCGTGCGATGATTTGGCGGAACTC
GAATGGCTGTCGAATTTCGTTGAAGAATCATTTTCGACGGAGGAGATTGATAAGGATTTTCCAGCAATTCCATTCCTCTCCGGAGGAATAAGTTCGACGGCGACTCCAGA
AACATCATCGTCCTCAGGAGCGACAGCGTTTGGTTACGGAAATGCAAAAACGACAACCTTTTTTCACAGCGAAGCTCTCACGCTCCCCGGCAAAGCCAGAAGCAAACGTT
CACGCGCTACTCCATGCGATTGGTCGACGAGGCTCCTCCAAGCGACGGCGCCGGATAAAGCGGAAGGGACGATGGCGAAGCCGGAAACGACGTCGGGTCGGAAATGCCTA
CATTGCGCGGCGGAGAAGACGCCGCAGTGGCGGACTGGGCCGATGGGCCCAAAAACGCTTTGTAATGCCTGTGGGGTACGGTACAAATCGGGTCGGCTCGTACCTGAATA
CCGACCCGCTTCGAGCCCGACATTTGTGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAGCTCCGACGGCAAAAGGAGATGCAACATCAAGAGCAGTTTGTAA
GTCAGAGTTCGATATTCAGCAGATCCAACGGCTGTGATGAGTATTTAATCCACCGTCACAACGGCGGTGATTTTAGTCACATGATGTAG
Protein sequence
MELPGYLVGGYYGTGAPQFSPDNKKSSAEHFPVDEYLLDFSNEDVAMHSGFFDNVAGNCSDSSTLTAIDSCNSSVSGGDNHFLGKFESASFCEAQFSSELCIPCDDLAEL
EWLSNFVEESFSTEEIDKDFPAIPFLSGGISSTATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKARSKRSRATPCDWSTRLLQATAPDKAEGTMAKPETTSGRKCL
HCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFSRSNGCDEYLIHRHNGGDFSHMM
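
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch using the standard genetic code (an assumption, though a usual one for plant nuclear genes) and no external libraries. The function name `translate` is a hypothetical helper, and the two inputs are the first 30 and last 12 nucleotides of the CDS shown in this record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in _BASES for y in _BASES for z in _BASES), _AMINO
    )
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = _CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAACTCCCCGGGTATCTTGTCGGTGGC"))  # MELPGYLVGG
print(translate("CACATGATGTAG"))                    # HMM
```

Both fragments agree with the start (MELPGYLVGG) and end (HMM) of the protein sequence listed above. Passing the full wrapped CDS text works as well, since whitespace is removed before translation.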