CuGenDBv2

Sed0017617 (gene) of Chayote v1 genome

Gene ID: Sed0017617
Organism: Sechium edule (Chayote v1)
Description: GATA transcription factor
Genome location: LG01:2016842..2018326
RNA-Seq Expression: Sed0017617
Synteny: Sed0017617
Gene Ontology terms:
GO:0030154 - cell differentiation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains: NA
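The genome location in this record follows a `linkage group:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say), the span of the gene on LG01 could be computed as below. `parse_location` is a hypothetical helper written for illustration and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("LG01:2016842..2018326")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the gene model covers 1485 bp of LG01.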


Homology
GenBank top hits (columns: hit | e value | % identity)
KAG6605045.1 GATA transcription factor 5, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.1e-119 | % identity: 71.99
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+ LEAKALKSSFHWELAMESA++DALVEE  C NG NLVAGEEF+VDEF NFSNGDFEHGS       D+  EFEK+  SVSS+ NQS E PAAG+ED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA
        K+ LAVEL  PGDA+AELEWVS FVDDS   FSS+ VA +RSEPEK LAG VISCLP F PV+PRTKRSR SRQT          +SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG
         P FIFSDAG+NVDS N +G+ PKKQRK+    SP++L +    GQ PRRCSHCLVQKTPQWR+GPHGAKTLCNACGVR+KSGRLFPEYRPALSPTFCS 
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG

Query:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        VHSNSHRKV+EMRKMKE  +PATEL+ MV SY
Subjt:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

XP_004150140.1 GATA transcription factor 5 [Cucumis sativus] | e value: 9.4e-124 | % identity: 72.16
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+FLEAKALKSSFHWELAM+SA++DALVEE  C NGPNLV+GE+F+++EFLNF NGD EHGS       D+ EEFEKNR SVSSN NQSD  P  G+ED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA
        K+ LAVEL FPGD+L +LEWVSQFVDDS SEFS + VA NRSEPEKKL G VISCLP F+PVRPRTKRSR SRQ  +         SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC
         P FIFSDAGENVD  N +G+ PKKQRK+    SPSS       + GQ PRRCSHCLVQKTPQWR+GP+GAKTLCNACGVR+KSGRLFPEYRPALSPTFC
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC

Query:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        SGVHSNSHRKV+EMRK KE P+PATEL+ MVPSY
Subjt:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

XP_008461014.1 PREDICTED: GATA transcription factor 5 [Cucumis melo] | e value: 6.7e-122 | % identity: 71.56
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+FLEAKALKSSFHWELAM+SA++DALVEE  C NG NLV+GE+F+++EFLNFSNGD EHGS       D+ EEFEKNR S+SSN NQ+   P  GDED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA
        K+ LAVEL FPGD+L +LEWVSQFVDDS SEFS   VA NRSEPEKKL G VISCLP F+PVRPRTKRSR SRQ  +         SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC
         P FIFSDAGENVDS N +G+ PKKQRK+    SPSS       + GQ PRRCSHCLVQKTPQWR+GP+GAKTLCNACGVR+KSGRLFPEYRPALSPTFC
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC

Query:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        SGVHSNSHRKV+EMRK KE  +PATEL+ MVPSY
Subjt:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

XP_022948191.1 GATA transcription factor 5-like [Cucurbita moschata] | e value: 1.4e-119 | % identity: 72.29
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+ LEAKALKSSFHWELAMESA++DALVEE  C NG NLVAGEEF+VDEF NFSNGDFEHGS       D+  EFEK+  SVSS+ NQS E PAAG+ED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA
        K+ LAVEL  PGDA+AELEWVS FVDDS   FSS+ VA +RSEPEK LAG VISCLP F PV+PRTKRSR SRQT          +SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG
         P FIFSDAGENVDS N +G+ PKKQRK+    SP++L +    GQ PRRCSHCLVQKTPQWR+GPHGAKTLCNACGVR+KSGRLFPEYRPALSPTFCS 
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG

Query:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        VHSNSHRKV+EMRKMKE  +PATEL+ MV SY
Subjt:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

XP_038902880.1 GATA transcription factor 5-like [Benincasa hispida] | e value: 2.9e-125 | % identity: 75.23
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEF-QVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDED
        M+FLEAKALKSSFHWELAM+SA++DALVEE  C NG NLVAGE+F +VDEFLNFSNGD EHGS       DE EEFEKNR+SVS N NQS  FP  G+ED
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEF-QVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDED

Query:  FK-TFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDS
         K + LAVEL  PGD+LA+LEWVSQFVDDSCSEFS + VA NRSEPEKKLAG VISCLP F+PVRPRTKRSR SRQ  +         SSSSS+TSSG S
Subjt:  FK-TFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDS

Query:  SAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAA-----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGV
        SA P FIFSDAGENVDS N S + PKKQRK+S S   A     GQ PRRCSHCLVQKTPQWR+GP+GAKTLCNACGVR+KSGRLFPEYRPALSPTFCSGV
Subjt:  SAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAA-----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGV

Query:  HSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        HSNSHRKV+EMRK KE PEPATELSQMVPSY
Subjt:  HSNSHRKVVEMRKMKEDPEPATELSQMVPSY

TrEMBL top hits (columns: hit | e value | % identity)
A0A0A0LLB3 GATA transcription factor | e value: 4.5e-124 | % identity: 72.16
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+FLEAKALKSSFHWELAM+SA++DALVEE  C NGPNLV+GE+F+++EFLNF NGD EHGS       D+ EEFEKNR SVSSN NQSD  P  G+ED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA
        K+ LAVEL FPGD+L +LEWVSQFVDDS SEFS + VA NRSEPEKKL G VISCLP F+PVRPRTKRSR SRQ  +         SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC
         P FIFSDAGENVD  N +G+ PKKQRK+    SPSS       + GQ PRRCSHCLVQKTPQWR+GP+GAKTLCNACGVR+KSGRLFPEYRPALSPTFC
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC

Query:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        SGVHSNSHRKV+EMRK KE P+PATEL+ MVPSY
Subjt:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

A0A1S3CE81 GATA transcription factor | e value: 3.3e-122 | % identity: 71.56
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+FLEAKALKSSFHWELAM+SA++DALVEE  C NG NLV+GE+F+++EFLNFSNGD EHGS       D+ EEFEKNR S+SSN NQ+   P  GDED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA
        K+ LAVEL FPGD+L +LEWVSQFVDDS SEFS   VA NRSEPEKKL G VISCLP F+PVRPRTKRSR SRQ  +         SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC
         P FIFSDAGENVDS N +G+ PKKQRK+    SPSS       + GQ PRRCSHCLVQKTPQWR+GP+GAKTLCNACGVR+KSGRLFPEYRPALSPTFC
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC

Query:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        SGVHSNSHRKV+EMRK KE  +PATEL+ MVPSY
Subjt:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

A0A5A7TQJ0 GATA transcription factor | e value: 3.3e-122 | % identity: 71.56
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+FLEAKALKSSFHWELAM+SA++DALVEE  C NG NLV+GE+F+++EFLNFSNGD EHGS       D+ EEFEKNR S+SSN NQ+   P  GDED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA
        K+ LAVEL FPGD+L +LEWVSQFVDDS SEFS   VA NRSEPEKKL G VISCLP F+PVRPRTKRSR SRQ  +         SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMT---------SSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC
         P FIFSDAGENVDS N +G+ PKKQRK+    SPSS       + GQ PRRCSHCLVQKTPQWR+GP+GAKTLCNACGVR+KSGRLFPEYRPALSPTFC
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSL------AAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFC

Query:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        SGVHSNSHRKV+EMRK KE  +PATEL+ MVPSY
Subjt:  SGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

A0A6J1G8K4 GATA transcription factor | e value: 6.8e-120 | % identity: 72.29
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+ LEAKALKSSFHWELAMESA++DALVEE  C NG NLVAGEEF+VDEF NFSNGDFEHGS       D+  EFEK+  SVSS+ NQS E PAAG+ED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA
        K+ LAVEL  PGDA+AELEWVS FVDDS   FSS+ VA +RSEPEK LAG VISCLP F PV+PRTKRSR SRQT          +SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG
         P FIFSDAGENVDS N +G+ PKKQRK+    SP++L +    GQ PRRCSHCLVQKTPQWR+GPHGAKTLCNACGVR+KSGRLFPEYRPALSPTFCS 
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG

Query:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        VHSNSHRKV+EMRKMKE  +PATEL+ MV SY
Subjt:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

A0A6J1L063 GATA transcription factor | e value: 2.0e-119 | % identity: 72.29
Query:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF
        M+ LEAKALKSSFHWELAMESA++DALVEE  C NG NLVAGEEF+VDEF NFSNGDFEHGS       D+ +EFEK+  SVSS+ NQS E PAAG+ED 
Subjt:  MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGS-------DEDEEFEKNRHSVSSNLNQSDEFPAAGDEDF

Query:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA
        K+ LAVEL  PGDALAELEWVS FVDDS   FSS+ VA +RSEPEK LAG VISCLP F PV+PRTKRSR SRQT          +SSSSS+TSSG SSA
Subjt:  KTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQT---------MTSSSSSATSSGDSSA

Query:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG
           FIFSDAGENVDS N +G+ PKKQRK+    SP++L +    GQ PRRCSHCLVQKTPQWR+GPHGAKTLCNACGVR+KSGRLFPEYRPALSPTFCS 
Subjt:  VPLFIFSDAGENVDSYNTSGKAPKKQRKR----SPSSLAA----GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSG

Query:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
        VHSNSHRKV+EMRKMKE  +PATEL+ MV SY
Subjt:  VHSNSHRKVVEMRKMKEDPEPATELSQMVPSY

SwissProt top hits (columns: hit | e value | % identity)
O49741 GATA transcription factor 2 | e value: 7.2e-34 | % identity: 38.01
Query:  QVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSNLNQSDEFP-----------AAGDEDFKTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNR
        ++D+ L+FSN D          F  +    S+    S  FP                D  +FL  ++  P D  A LEW+SQFVDDS ++F         
Subjt:  QVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSNLNQSDEFP-----------AAGDEDFKTFLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNR

Query:  SEPEKKLAGAVISC-LPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQR-----------KRSPSSLAAGQ
          P   L G + S      +P +PR+KRSR          + A  +G  S +PL       E+   ++ +   PKK++           + S S    G 
Subjt:  SEPEKKLAGAVISC-LPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQR-----------KRSPSSLAAGQ

Query:  FPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKE
          RRC+HC  +KTPQWR+GP G KTLCNACGVRFKSGRL PEYRPA SPTF    HSNSHRKV+E+R+ KE
Subjt:  FPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKE

O65515 GATA transcription factor 7 | e value: 4.5e-36 | % identity: 41.98
Query:  EFQVDEFLNFSNGD--FEHGSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFLAVELDFPGDA----LAELEWVSQFVDDSCSE-FSSSDVALNRSE
        +F VD+ L+ SN D   E  S + +E E+ R    S  +QS       D          L FPGDA    L +LEW+S FV+DS SE + SSD  +N   
Subjt:  EFQVDEFLNFSNGD--FEHGSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFLAVELDFPGDA----LAELEWVSQFVDDSCSE-FSSSDVALNRSE

Query:  PEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG------QFPRRCSHC
               A +       PV+PR+KR R + +  +  S S          PL              ++  A +K+R R     + G      Q  R CSHC
Subjt:  PEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG------QFPRRCSHC

Query:  LVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMK
         VQKTPQWR GP GAKTLCNACGVRFKSGRL PEYRPA SPTF + +HSNSHRKV+E+R MK
Subjt:  LVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMK

Q8L4M6 GATA transcription factor 3 | e value: 9.3e-34 | % identity: 38.11
Query:  EAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEH-------GSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFL
        EA+ALK+S   E  +       +V E+           E+F V+ FL+FS G  E         S +++E +++    SS     D+ P+  DED     
Subjt:  EAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEH-------GSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFL

Query:  AVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAF--YPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGE
                  + ELEWVS+ VDD     SS +V+L  ++  K          P+F   PV+PRTKRSR              S   S   PL        
Subjt:  AVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAF--YPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGE

Query:  NVDSYNTSGKAPKKQRKRSPSSLAAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDP
           S N    A ++ RK+   ++    F RRCSHC    TPQWR+GP G KTLCNACGVRFKSGRL PEYRPA SPTF + +HSN HRKV+E+RK KE  
Subjt:  NVDSYNTSGKAPKKQRKRSPSSLAAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDP

Query:  EPATELS
        E   E S
Subjt:  EPATELS

Q9FH57 GATA transcription factor 5 | e value: 1.8e-45 | % identity: 40.6
Query:  LEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSN--------LNQSDEFPAAGDEDFKT
        +E  ALKSS   E+A+++       E  +     N  + ++F VD+ L+ SN D     + D + +     VSS         L +S +F  +G +DF +
Subjt:  LEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSN--------LNQSDEFPAAGDEDFKT

Query:  FLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAG-------AVI--SCLPAFYPVRPRTKRSR--LSRQTMTSSSSSATSS----G
            EL  P D LA LEW+S FV+DS +E+S  ++    +E    L G       AV   +C  +  P + R+KR+R  L   ++ SSSSS  SS     
Subjt:  FLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAG-------AVI--SCLPAFYPVRPRTKRSR--LSRQTMTSSSSSATSS----G

Query:  DSSAVPLFIFSDAGENVDSYNTSGKA--PKKQRKRSPSSLAAGQF-----PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTF
         SS+ P   +    E ++   TS +   PKK +KRS  S+ +G+       R+CSHC VQKTPQWR+GP GAKTLCNACGVR+KSGRL PEYRPA SPTF
Subjt:  DSSAVPLFIFSDAGENVDSYNTSGKA--PKKQRKRSPSSLAAGQF-----PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTF

Query:  CSGVHSNSHRKVVEMRKMKE-DPEPATELSQMVPS
         S +HSN HRKV+EMR+ KE   +  T L+Q+V S
Subjt:  CSGVHSNSHRKVVEMRKMKE-DPEPATELSQMVPS

Q9SD38 GATA transcription factor 6 | e value: 1.1e-37 | % identity: 42.14
Query:  GEEFQVDEFLNFSNGDFEHG---SDEDEEFEKNRHSVS--SNLNQSDEFPAAGDEDFKTFLAVELDFPGDALAELEWVSQFVDDSC----SEFSSSDVAL
        G++F VD+ L+FS  + +      DE E   + +  VS  + L++S++F  A   DF T     L  P D +AELEW+S FVDDS     S  ++  V L
Subjt:  GEEFQVDEFLNFSNGDFEHG---SDEDEEFEKNRHSVS--SNLNQSDEFPAAGDEDFKTFLAVELDFPGDALAELEWVSQFVDDSC----SEFSSSDVAL

Query:  NRSEPEKKLAGAVISCLPAFYP-VRPRTKRSRL-------SRQTMTSSSSSATSSGDSS---AVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG
          +           +C  + +P V+ R KR+R          Q++T SSSS+T+S  SS   + PL++ S  G+ +D   T  +  KK  K +  +    
Subjt:  NRSEPEKKLAGAVISCLPAFYP-VRPRTKRSRL-------SRQTMTSSSSSATSSGDSS---AVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG

Query:  QF-PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDPEPATE
        Q   R+C HC VQKTPQWR+GP GAKTLCNACGVR+KSGRL PEYRPA SPTF S +HSN H KV+EMR+ KE  + A E
Subjt:  QF-PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDPEPATE

Arabidopsis top hits (columns: hit | e value | % identity)
AT3G51080.1 GATA transcription factor 6 | e value: 7.6e-39 | % identity: 42.14
Query:  GEEFQVDEFLNFSNGDFEHG---SDEDEEFEKNRHSVS--SNLNQSDEFPAAGDEDFKTFLAVELDFPGDALAELEWVSQFVDDSC----SEFSSSDVAL
        G++F VD+ L+FS  + +      DE E   + +  VS  + L++S++F  A   DF T     L  P D +AELEW+S FVDDS     S  ++  V L
Subjt:  GEEFQVDEFLNFSNGDFEHG---SDEDEEFEKNRHSVS--SNLNQSDEFPAAGDEDFKTFLAVELDFPGDALAELEWVSQFVDDSC----SEFSSSDVAL

Query:  NRSEPEKKLAGAVISCLPAFYP-VRPRTKRSRL-------SRQTMTSSSSSATSSGDSS---AVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG
          +           +C  + +P V+ R KR+R          Q++T SSSS+T+S  SS   + PL++ S  G+ +D   T  +  KK  K +  +    
Subjt:  NRSEPEKKLAGAVISCLPAFYP-VRPRTKRSRL-------SRQTMTSSSSSATSSGDSS---AVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG

Query:  QF-PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDPEPATE
        Q   R+C HC VQKTPQWR+GP GAKTLCNACGVR+KSGRL PEYRPA SPTF S +HSN H KV+EMR+ KE  + A E
Subjt:  QF-PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDPEPATE

AT4G34680.1 GATA transcription factor 3 | e value: 6.6e-35 | % identity: 38.11
Query:  EAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEH-------GSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFL
        EA+ALK+S   E  +       +V E+           E+F V+ FL+FS G  E         S +++E +++    SS     D+ P+  DED     
Subjt:  EAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEH-------GSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFL

Query:  AVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAF--YPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGE
                  + ELEWVS+ VDD     SS +V+L  ++  K          P+F   PV+PRTKRSR              S   S   PL        
Subjt:  AVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAF--YPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGE

Query:  NVDSYNTSGKAPKKQRKRSPSSLAAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDP
           S N    A ++ RK+   ++    F RRCSHC    TPQWR+GP G KTLCNACGVRFKSGRL PEYRPA SPTF + +HSN HRKV+E+RK KE  
Subjt:  NVDSYNTSGKAPKKQRKRSPSSLAAGQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDP

Query:  EPATELS
        E   E S
Subjt:  EPATELS

AT4G36240.1 GATA transcription factor 7 | e value: 3.2e-37 | % identity: 41.98
Query:  EFQVDEFLNFSNGD--FEHGSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFLAVELDFPGDA----LAELEWVSQFVDDSCSE-FSSSDVALNRSE
        +F VD+ L+ SN D   E  S + +E E+ R    S  +QS       D          L FPGDA    L +LEW+S FV+DS SE + SSD  +N   
Subjt:  EFQVDEFLNFSNGD--FEHGSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFLAVELDFPGDA----LAELEWVSQFVDDSCSE-FSSSDVALNRSE

Query:  PEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG------QFPRRCSHC
               A +       PV+PR+KR R + +  +  S S          PL              ++  A +K+R R     + G      Q  R CSHC
Subjt:  PEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAAG------QFPRRCSHC

Query:  LVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMK
         VQKTPQWR GP GAKTLCNACGVRFKSGRL PEYRPA SPTF + +HSNSHRKV+E+R MK
Subjt:  LVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMK

AT5G66320.1 GATA transcription factor 5 | e value: 1.3e-46 | % identity: 40.6
Query:  LEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSN--------LNQSDEFPAAGDEDFKT
        +E  ALKSS   E+A+++       E  +     N  + ++F VD+ L+ SN D     + D + +     VSS         L +S +F  +G +DF +
Subjt:  LEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSN--------LNQSDEFPAAGDEDFKT

Query:  FLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAG-------AVI--SCLPAFYPVRPRTKRSR--LSRQTMTSSSSSATSS----G
            EL  P D LA LEW+S FV+DS +E+S  ++    +E    L G       AV   +C  +  P + R+KR+R  L   ++ SSSSS  SS     
Subjt:  FLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAG-------AVI--SCLPAFYPVRPRTKRSR--LSRQTMTSSSSSATSS----G

Query:  DSSAVPLFIFSDAGENVDSYNTSGKA--PKKQRKRSPSSLAAGQF-----PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTF
         SS+ P   +    E ++   TS +   PKK +KRS  S+ +G+       R+CSHC VQKTPQWR+GP GAKTLCNACGVR+KSGRL PEYRPA SPTF
Subjt:  DSSAVPLFIFSDAGENVDSYNTSGKA--PKKQRKRSPSSLAAGQF-----PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTF

Query:  CSGVHSNSHRKVVEMRKMKE-DPEPATELSQMVPS
         S +HSN HRKV+EMR+ KE   +  T L+Q+V S
Subjt:  CSGVHSNSHRKVVEMRKMKE-DPEPATELSQMVPS

AT5G66320.2 GATA transcription factor 5 | e value: 1.3e-46 | % identity: 40.6
Query:  LEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSN--------LNQSDEFPAAGDEDFKT
        +E  ALKSS   E+A+++       E  +     N  + ++F VD+ L+ SN D     + D + +     VSS         L +S +F  +G +DF +
Subjt:  LEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSN--------LNQSDEFPAAGDEDFKT

Query:  FLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAG-------AVI--SCLPAFYPVRPRTKRSR--LSRQTMTSSSSSATSS----G
            EL  P D LA LEW+S FV+DS +E+S  ++    +E    L G       AV   +C  +  P + R+KR+R  L   ++ SSSSS  SS     
Subjt:  FLAVELDFPGDALAELEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAG-------AVI--SCLPAFYPVRPRTKRSR--LSRQTMTSSSSSATSS----G

Query:  DSSAVPLFIFSDAGENVDSYNTSGKA--PKKQRKRSPSSLAAGQF-----PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTF
         SS+ P   +    E ++   TS +   PKK +KRS  S+ +G+       R+CSHC VQKTPQWR+GP GAKTLCNACGVR+KSGRL PEYRPA SPTF
Subjt:  DSSAVPLFIFSDAGENVDSYNTSGKA--PKKQRKRSPSSLAAGQF-----PRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTF

Query:  CSGVHSNSHRKVVEMRKMKE-DPEPATELSQMVPS
         S +HSN HRKV+EMR+ KE   +  T L+Q+V S
Subjt:  CSGVHSNSHRKVVEMRKMKE-DPEPATELSQMVPS


Sequences
CDS sequence
ATGAAGTTTTTGGAGGCTAAGGCTTTGAAATCAAGTTTCCACTGGGAATTAGCGATGGAATCTGCTAAAAAAGACGCTTTGGTGGAGGAGAATTCGTGTTTCAACGGACC
TAATCTCGTCGCCGGCGAGGAATTTCAGGTCGACGAGTTTTTGAACTTTTCTAACGGCGATTTTGAACATGGGTCCGACGAGGATGAAGAGTTTGAAAAAAATCGCCACT
CTGTTTCGTCGAATTTGAATCAGTCCGACGAGTTTCCGGCCGCCGGAGATGAGGATTTTAAGACGTTTCTCGCCGTTGAGCTTGATTTTCCGGGCGATGCTTTGGCGGAG
CTTGAATGGGTTTCTCAATTCGTCGACGATTCTTGCTCGGAATTTTCCTCCTCCGACGTGGCTCTGAACCGTTCCGAGCCGGAAAAGAAACTCGCCGGAGCTGTAATTTC
GTGTCTGCCGGCGTTTTATCCGGTCAGACCGAGGACGAAAAGGTCAAGACTAAGTCGTCAAACGATGACGTCGTCGTCTTCGTCGGCGACGTCCTCCGGCGATTCCTCCG
CCGTGCCGTTGTTCATCTTCTCCGACGCCGGCGAGAACGTGGACTCCTATAACACTTCCGGCAAGGCTCCAAAGAAGCAAAGGAAAAGGTCGCCGTCGTCGTTGGCCGCC
GGCCAGTTTCCTCGGCGGTGCAGTCATTGTCTGGTTCAGAAGACTCCACAGTGGCGGTCCGGTCCACACGGGGCGAAAACTCTCTGCAACGCTTGTGGGGTTCGGTTTAA
ATCCGGTCGGCTCTTTCCGGAGTACAGACCGGCGTTGAGCCCCACTTTTTGCAGCGGCGTTCACTCGAACAGTCATCGGAAAGTGGTCGAAATGAGGAAGATGAAGGAGG
ACCCTGAACCGGCAACCGAGCTGAGTCAAATGGTCCCAAGTTATTAA
mRNA sequence
CTTCAACTGCCGCTCTGACTCTTCCTTTGCATTTCTCCCTGCAATTATTCCCCCTTTATCAAACAGAAAAACAAGATTATGAAGTTTTTGGAGGCTAAGGCTTTGAAATC
AAGTTTCCACTGGGAATTAGCGATGGAATCTGCTAAAAAAGACGCTTTGGTGGAGGAGAATTCGTGTTTCAACGGACCTAATCTCGTCGCCGGCGAGGAATTTCAGGTCG
ACGAGTTTTTGAACTTTTCTAACGGCGATTTTGAACATGGGTCCGACGAGGATGAAGAGTTTGAAAAAAATCGCCACTCTGTTTCGTCGAATTTGAATCAGTCCGACGAG
TTTCCGGCCGCCGGAGATGAGGATTTTAAGACGTTTCTCGCCGTTGAGCTTGATTTTCCGGGCGATGCTTTGGCGGAGCTTGAATGGGTTTCTCAATTCGTCGACGATTC
TTGCTCGGAATTTTCCTCCTCCGACGTGGCTCTGAACCGTTCCGAGCCGGAAAAGAAACTCGCCGGAGCTGTAATTTCGTGTCTGCCGGCGTTTTATCCGGTCAGACCGA
GGACGAAAAGGTCAAGACTAAGTCGTCAAACGATGACGTCGTCGTCTTCGTCGGCGACGTCCTCCGGCGATTCCTCCGCCGTGCCGTTGTTCATCTTCTCCGACGCCGGC
GAGAACGTGGACTCCTATAACACTTCCGGCAAGGCTCCAAAGAAGCAAAGGAAAAGGTCGCCGTCGTCGTTGGCCGCCGGCCAGTTTCCTCGGCGGTGCAGTCATTGTCT
GGTTCAGAAGACTCCACAGTGGCGGTCCGGTCCACACGGGGCGAAAACTCTCTGCAACGCTTGTGGGGTTCGGTTTAAATCCGGTCGGCTCTTTCCGGAGTACAGACCGG
CGTTGAGCCCCACTTTTTGCAGCGGCGTTCACTCGAACAGTCATCGGAAAGTGGTCGAAATGAGGAAGATGAAGGAGGACCCTGAACCGGCAACCGAGCTGAGTCAAATG
GTCCCAAGTTATTAAACCGACCAAACCAACCGGTTTGGATGGACCGGGTATCCATTTTGTCCTAGTACGGTTAGTGGAGAAATTAGGCCAGGTCATTACTTGTAGTGTAG
TATTTTACTATTTTTTGTGAAATTGATTTGATTAATTAGATTTTTGTTTATTAGGTAAAGATTAGATATGCTTAGGTATGAAAAAAAACAGAGATTTTTGTATTATTATA
TTAAATTAGCTAGTTTGCAAATATTAAATTTCATACTTTTTCGAGCTTTTTTATTAATACAACAAATGAGATAAAA
Protein sequence
MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFLAVELDFPGDALAE
LEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAA
GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY
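The record describes this gene as a GATA transcription factor with zinc ion binding, but lists no InterPro domains. As a rough illustration only, the protein sequence above can be scanned for the commonly cited GATA-type zinc finger consensus C-X2-C-X17-20-C-X2-C. The regular expression and the variable names below are assumptions made for this sketch; a pattern match is not a substitute for a proper domain search.

```python
import re

# Protein sequence copied from the record above (three lines joined).
PROTEIN = (
    "MKFLEAKALKSSFHWELAMESAKKDALVEENSCFNGPNLVAGEEFQVDEFLNFSNGDFEHGSDEDEEFEKNRHSVSSNLNQSDEFPAAGDEDFKTFLAVELDFPGDALAE"
    "LEWVSQFVDDSCSEFSSSDVALNRSEPEKKLAGAVISCLPAFYPVRPRTKRSRLSRQTMTSSSSSATSSGDSSAVPLFIFSDAGENVDSYNTSGKAPKKQRKRSPSSLAA"
    "GQFPRRCSHCLVQKTPQWRSGPHGAKTLCNACGVRFKSGRLFPEYRPALSPTFCSGVHSNSHRKVVEMRKMKEDPEPATELSQMVPSY"
)

# Assumed consensus for a GATA-type zinc finger: C-X2-C-X17-20-C-X2-C.
GATA_FINGER = re.compile(r"C.{2}C.{17,20}C.{2}C")

match = GATA_FINGER.search(PROTEIN)
if match:
    finger = match.group()
    # Subtract the two flanking C-X2-C units (4 residues each) to get the loop length.
    loop_length = len(finger) - 8
    print(finger, loop_length)
else:
    print("no GATA-type zinc finger pattern found")
```

For this sequence the scan reports the segment CSHCLVQKTPQWRSGPHGAKTLCNAC, which has an 18-residue loop between the two cysteine pairs.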