CuGenDBv2

Carg13882 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg13882
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: GATA transcription factor 24-like
Genome location: Carg_Chr08:4721919..4728027
RNA-Seq Expression: Carg13882
Synteny: Carg13882
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR010399 - Tify domain
IPR010402 - CCT domain
IPR013088 - Zinc finger, NHR/GATA-type
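
The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span can be parsed and its length computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str):
    """Split a 'seqid:start..end' string into its parts.

    Assumes 1-based, inclusive coordinates, so the span length
    is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("Carg_Chr08:4721919..4728027")
print(seqid, start, end, length)
```

Under the inclusive-coordinate assumption, the gene spans 6109 bp on Carg_Chr08.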


Homology
GenBank top hits | e value | % identity | Alignment
KAG7025946.1 GATA transcription factor 28, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.0e-255 | % identity: 100
Query:  EMDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSV
        EMDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSV
Subjt:  EMDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSV

Query:  SPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI
        SPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI
Subjt:  SPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI

Query:  QEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQ
        QEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQ
Subjt:  QEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQ

Query:  STAYSFFPSTRTLDSPPSVWAPTIVKVEHKLLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEIL
        STAYSFFPSTRTLDSPPSVWAPTIVKVEHKLLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEIL
Subjt:  STAYSFFPSTRTLDSPPSVWAPTIVKVEHKLLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEIL

Query:  SPIQKVHRNCDRIVSSCRRRRRGRVSGQPESDLACQTRSLCHQTSQV
        SPIQKVHRNCDRIVSSCRRRRRGRVSGQPESDLACQTRSLCHQTSQV
Subjt:  SPIQKVHRNCDRIVSSCRRRRRGRVSGQPESDLACQTRSLCHQTSQV

TYK03986.1 GATA transcription factor 24-like isoform X1 [Cucumis melo var. makuwa] | e value: 7.2e-149 | % identity: 73.28
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MD IH SDGRMH+SHA HSMHTQSVQEQEHHDLHYMSNGNG+ADEHENEGHGIMVVERE  SDHGDLAENRGVMVDRGG+NCDQLTLSYQGQVYVFDSVS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALA-IPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALA IPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKP 
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALA-IPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI

Query:  QEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQ
         EDS  A  SWESNE+W SDGNGS QQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK        K++  +   D + +D ++S       +
Subjt:  QEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQ

Query:  STAYSFFPSTRTLDSPPSVWAPTIVKVEHKLLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEIL
        S  + FF            W P  +             L    YD ENDM+HLA L+  L  LFLK VREL GTSQK Q RVDKLQ S  M    RMEI 
Subjt:  STAYSFFPSTRTLDSPPSVWAPTIVKVEHKLLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEIL

Query:  SPIQKVHR
        SP +KV R
Subjt:  SPIQKVHR
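
In alignment blocks like the one above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of the arithmetic behind a percent-identity figure, the snippet below counts these symbols for a single segment. The helper name `midline_stats` is hypothetical. The % identity values listed on this page are reported for each hit as a whole, so a single segment will not reproduce them.

```python
def midline_stats(query_segment: str, midline: str):
    """Count identities and positives for one alignment segment.

    The midline is padded to the query length because trailing
    spaces (mismatches at the end) are easily lost in copied text.
    """
    length = len(query_segment)
    midline = midline.ljust(length)[:length]
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, length


# Last segment of the TYK03986.1 alignment shown above.
identical, positives, length = midline_stats("SPIQKVHR", "SP +KV R")
print(f"identity: {100 * identical / length:.1f}% over {length} columns")
```

Summing the identical and total column counts across all segments of a hit, then dividing, gives the per-hit figure under this convention.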

XP_022964234.1 GATA transcription factor 24-like isoform X1 [Cucurbita moschata] | e value: 6.1e-148 | % identity: 100
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

XP_022964236.1 GATA transcription factor 24-like isoform X2 [Cucurbita moschata] | e value: 6.1e-148 | % identity: 100
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

XP_023513996.1 GATA transcription factor 28-like isoform X3 [Cucurbita pepo subsp. pepo] | e value: 4.6e-148 | % identity: 99.62
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKL
        EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK+
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKL

TrEMBL top hits | e value | % identity | Alignment
A0A5D3C0L3 GATA transcription factor 24-like isoform X1 | e value: 3.5e-149 | % identity: 73.28
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MD IH SDGRMH+SHA HSMHTQSVQEQEHHDLHYMSNGNG+ADEHENEGHGIMVVERE  SDHGDLAENRGVMVDRGG+NCDQLTLSYQGQVYVFDSVS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALA-IPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALA IPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKP 
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALA-IPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI

Query:  QEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQ
         EDS  A  SWESNE+W SDGNGS QQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK        K++  +   D + +D ++S       +
Subjt:  QEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQ

Query:  STAYSFFPSTRTLDSPPSVWAPTIVKVEHKLLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEIL
        S  + FF            W P  +             L    YD ENDM+HLA L+  L  LFLK VREL GTSQK Q RVDKLQ S  M    RMEI 
Subjt:  STAYSFFPSTRTLDSPPSVWAPTIVKVEHKLLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEIL

Query:  SPIQKVHR
        SP +KV R
Subjt:  SPIQKVHR

A0A6J1HIC6 GATA transcription factor 24-like isoform X2 | e value: 2.9e-148 | % identity: 100
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

A0A6J1HK95 GATA transcription factor 24-like isoform X1 | e value: 2.9e-148 | % identity: 100
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

A0A6J1KC72 GATA transcription factor 28-like isoform X1 | e value: 5.5e-147 | % identity: 99.24
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSV 
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI 
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

A0A6J1KIK6 GATA transcription factor 28-like isoform X3 | e value: 4.2e-147 | % identity: 98.87
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSV 
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPI 
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKL
        EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK+
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKL

SwissProt top hits | e value | % identity | Alignment
A2XKR7 GATA transcription factor 18 | e value: 1.3e-44 | % identity: 50
Query:  NGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDE
        +G+   +E E E       E E P      A      +  G  N  QLTL +QG+VYVF+SV+PEKVQAVLLLLG  E+P  + ++ + NQ  +R   D 
Subjt:  NGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDE

Query:  AFNQALAIPPRLSVP-QRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGIS
               +  R  +P +R+ASLIRFREKRKERNFDKKIRY VRKEVALRMQR KGQF     ++ +SLS      S       G     +E  C++CG S
Subjt:  AFNQALAIPPRLSVP-QRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGIS

Query:  EKSTPMMRRGPDGPRTLCNACGLMWANK
        EK TP MRRGP GPRTLCNACGLMWANK
Subjt:  EKSTPMMRRGPDGPRTLCNACGLMWANK

Q5Z4U5 GATA transcription factor 20 | e value: 6.5e-52 | % identity: 53.54
Query:  ADEHENEGHGIMV-VEREAPSD---HGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEA
        A  HE  G  + V ++ EA +    HG +    G +        +QLTLS+QG+VYVFDSVSP+KVQAVLLLLGGRE+   + S   ++ P  +      
Subjt:  ADEHENEGHGIMV-VEREAPSD---HGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEA

Query:  FNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEK
                 RL+ P R+ASL+RFREKRKERNFDKKIRY+VRKEVALRMQRN+GQFTSSKP  +++ S  T+ + +  W S   G       C HCGI+ K
Subjt:  FNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEK

Query:  STPMMRRGPDGPRTLCNACGLMWANK
        +TPMMRRGPDGPRTLCNACGLMWANK
Subjt:  STPMMRRGPDGPRTLCNACGLMWANK

Q6Z433 GATA transcription factor 17 | e value: 1.3e-49 | % identity: 50
Query:  EQEHHDLHYMSNGNGMADEHENEGHGI-MVVEREAPS-----DHGDLAENRGVMVDRGGDN--CDQLTLSYQGQVYVFDSVSPEKVQAVLLLLGGREVPL
        E+   +  Y     G  +E+E  G G+ M  +  A +      HG++    G     G  +   + LTLS+QG+VYVF+SVS E+VQAVLLLLGGRE+  
Subjt:  EQEHHDLHYMSNGNGMADEHENEGHGI-MVVEREAPS-----DHGDLAENRGVMVDRGGDN--CDQLTLSYQGQVYVFDSVSPEKVQAVLLLLGGREVPL

Query:  RVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQEDSLSATTSWESNETWCS
           S+P               + + A   +++ P R+ASL+RFREKRKERNFDKKIRYTVRKEVALRMQRN+GQFTSSK   E++ S  TS E +  W  
Subjt:  RVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQEDSLSATTSWESNETWCS

Query:  DGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
           G       C HCGIS  STPMMRRGPDGPRTLCNACGLMWANK
Subjt:  DGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

Q8GXL7 GATA transcription factor 24 | e value: 4.8e-79 | % identity: 61.74
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MD +H  +GRMH+  AQ+ MH Q     E H LH++ N N M D+H + G     VE + PS  G+ A+NRG +VDRG +N DQLTLS+QGQVYVFD VS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVP  +P+   +   N+R L           P RLSVPQRLASL+RFREKRK RNFDK IRYTVRKEVALRMQR KGQFTS+K   
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        +DS S  + W SN++W  +G  + + E+LCRHCG SEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

Q8H1G0 GATA transcription factor 28 | e value: 9.6e-80 | % identity: 60.67
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGH-GIMV--VEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFD
        MD +H S+ RMH+  AQ  MH Q     EHH LH++ NG+GM D+  ++G+ G M   VE + PS  G++ +NRG +VDRG +  DQLTLS+QGQVYVFD
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGH-GIMV--VEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFD

Query:  SVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSK
        SV PEKVQAVLLLLGGRE+P   P  P    P+     +   +     P R S+PQRLASL+RFREKRK RNFDKKIRYTVRKEVALRMQRNKGQFTS+K
Subjt:  SVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSK

Query:  PIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
           +++ SA +SW SN+TW  + + +  QEI CRHCGI EKSTPMMRRGP GPRTLCNACGLMWANK
Subjt:  PIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

Arabidopsis top hits | e value | % identity | Alignment
AT1G51600.1 ZIM-LIKE 2 | e value: 6.8e-81 | % identity: 60.67
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGH-GIMV--VEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFD
        MD +H S+ RMH+  AQ  MH Q     EHH LH++ NG+GM D+  ++G+ G M   VE + PS  G++ +NRG +VDRG +  DQLTLS+QGQVYVFD
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGH-GIMV--VEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFD

Query:  SVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSK
        SV PEKVQAVLLLLGGRE+P   P  P    P+     +   +     P R S+PQRLASL+RFREKRK RNFDKKIRYTVRKEVALRMQRNKGQFTS+K
Subjt:  SVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSK

Query:  PIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
           +++ SA +SW SN+TW  + + +  QEI CRHCGI EKSTPMMRRGP GPRTLCNACGLMWANK
Subjt:  PIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

AT1G51600.2 ZIM-LIKE 2 | e value: 6.8e-81 | % identity: 60.67
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGH-GIMV--VEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFD
        MD +H S+ RMH+  AQ  MH Q     EHH LH++ NG+GM D+  ++G+ G M   VE + PS  G++ +NRG +VDRG +  DQLTLS+QGQVYVFD
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGH-GIMV--VEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFD

Query:  SVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSK
        SV PEKVQAVLLLLGGRE+P   P  P    P+     +   +     P R S+PQRLASL+RFREKRK RNFDKKIRYTVRKEVALRMQRNKGQFTS+K
Subjt:  SVSPEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSK

Query:  PIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
           +++ SA +SW SN+TW  + + +  QEI CRHCGI EKSTPMMRRGP GPRTLCNACGLMWANK
Subjt:  PIQEDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

AT3G21175.1 ZIM-like 1 | e value: 3.4e-80 | % identity: 61.74
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MD +H  +GRMH+  AQ+ MH Q     E H LH++ N N M D+H + G     VE + PS  G+ A+NRG +VDRG +N DQLTLS+QGQVYVFD VS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVP  +P+   +   N+R L           P RLSVPQRLASL+RFREKRK RNFDK IRYTVRKEVALRMQR KGQFTS+K   
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        +DS S  + W SN++W  +G  + + E+LCRHCG SEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

AT3G21175.2 ZIM-like 1 | e value: 8.9e-81 | % identity: 61.74
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MD +H  +GRMH+  AQ+ MH Q     E H LH++ N N M D+H + G     VE + PS  G+ A+NRG +VDRG +N DQLTLS+QGQVYVFD VS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVP  +P+   +   N+R L+          P RLSVPQRLASL+RFREKRK RNFDK IRYTVRKEVALRMQR KGQFTS+K   
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK
        +DS S  + W SN++W  +G  + + E+LCRHCG SEKSTPMMRRGPDGPRTLCNACGLMWANK
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANK

AT3G21175.3 ZIM-like 1 | e value: 4.0e-57 | % identity: 56.33
Query:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS
        MD +H  +GRMH+  AQ+ MH Q     E H LH++ N N M D+H + G     VE + PS  G+ A+NRG +VDRG +N DQLTLS+QGQVYVFD VS
Subjt:  MDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVS

Query:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ
        PEKVQAVLLLLGGREVP  +P+   +   N+R L+          P RLSVPQRLASL+RFREKRK RNFDK IRYTVRKEVALRMQR KGQFTS+K   
Subjt:  PEKVQAVLLLLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQ

Query:  EDSLSATTSWESNETWCSDGNGSHQQEIL
        +DS S  + W SN++W  +G  + + E+L
Subjt:  EDSLSATTSWESNETWCSDGNGSHQQEIL


Sequences
CDS sequence
GAAATGGACGGTATTCATGCGAGCGATGGACGGATGCATATGAGTCATGCACAGCATTCGATGCACACACAATCTGTGCAAGAACAGGAACATCATGACTTGCATTACAT
GAGTAATGGGAATGGAATGGCCGATGAGCATGAAAATGAAGGTCACGGTATCATGGTTGTGGAAAGGGAAGCTCCATCCGACCATGGTGATCTTGCCGAGAATCGTGGTG
TAATGGTGGATCGAGGAGGCGATAACTGCGATCAGCTTACCCTGTCATATCAGGGTCAAGTTTATGTTTTTGATTCTGTCTCGCCCGAAAAGGTTCAAGCCGTGTTATTA
TTATTGGGAGGTCGCGAAGTACCTTTGCGTGTTCCCTCAATCCCAATAACTAATCAACCAAATGATAGGCATCTCACTGATGAAGCATTTAACCAGGCACTTGCCATCCC
CCCACGTCTAAGTGTCCCTCAGAGATTAGCTTCTTTGATTAGATTTCGTGAAAAGCGAAAGGAACGTAATTTTGACAAGAAAATTCGGTATACAGTTCGTAAAGAGGTAG
CACTTAGGATGCAGCGTAATAAAGGTCAGTTCACATCTTCCAAACCCATTCAGGAAGATTCATTATCGGCCACAACAAGTTGGGAATCAAATGAGACTTGGTGTTCTGAT
GGAAATGGATCCCATCAACAAGAAATTCTCTGTCGACATTGTGGTATTAGTGAGAAGTCTACACCAATGATGCGACGTGGACCTGATGGGCCAAGAACCCTTTGCAATGC
CTGCGGACTCATGTGGGCAAACAAGCTTCAAGGATCTTATCAATATAGTAAATCTGAGACCCTGGACATAAATGTAGATGAATCATTTATTGATCTCTCGGCCTCTAACA
CTTATTGCAAGCATGGCCAATCCACTGCTTATTCATTTTTCCCCTCAACGAGAACCCTAGATTCTCCTCCTTCTGTCTGGGCTCCTACCATCGTGAAAGTTGAGCATAAG
CTCCTCAAGTATATTTCTTCTGTATTGGTCAGAGTTCAGTATGACTCTGAAAATGATATGCTGCATTTAGCTGCTCTGATTTCCGTTCTAGAAATTCTTTTCCTGAAGAC
TGTTAGGGAACTCTGCGGGACCTCTCAAAAGCACCAAACCAGGGTGGACAAGCTTCAAGCTTCAACAGGAATGAGAATGTTGTGCAGAATGGAGATTCTGAGTCCAATCC
AAAAAGTGCATAGGAATTGTGACAGGATTGTGTCTTCCTGCAGAAGACGAAGGCGTGGACGAGTCTCTGGGCAGCCCGAATCTGATCTGGCGTGCCAGACACGATCGCTT
TGCCATCAAACATCCCAGGTTTAG
mRNA sequence
GAAATGGACGGTATTCATGCGAGCGATGGACGGATGCATATGAGTCATGCACAGCATTCGATGCACACACAATCTGTGCAAGAACAGGAACATCATGACTTGCATTACAT
GAGTAATGGGAATGGAATGGCCGATGAGCATGAAAATGAAGGTCACGGTATCATGGTTGTGGAAAGGGAAGCTCCATCCGACCATGGTGATCTTGCCGAGAATCGTGGTG
TAATGGTGGATCGAGGAGGCGATAACTGCGATCAGCTTACCCTGTCATATCAGGGTCAAGTTTATGTTTTTGATTCTGTCTCGCCCGAAAAGGTTCAAGCCGTGTTATTA
TTATTGGGAGGTCGCGAAGTACCTTTGCGTGTTCCCTCAATCCCAATAACTAATCAACCAAATGATAGGCATCTCACTGATGAAGCATTTAACCAGGCACTTGCCATCCC
CCCACGTCTAAGTGTCCCTCAGAGATTAGCTTCTTTGATTAGATTTCGTGAAAAGCGAAAGGAACGTAATTTTGACAAGAAAATTCGGTATACAGTTCGTAAAGAGGTAG
CACTTAGGATGCAGCGTAATAAAGGTCAGTTCACATCTTCCAAACCCATTCAGGAAGATTCATTATCGGCCACAACAAGTTGGGAATCAAATGAGACTTGGTGTTCTGAT
GGAAATGGATCCCATCAACAAGAAATTCTCTGTCGACATTGTGGTATTAGTGAGAAGTCTACACCAATGATGCGACGTGGACCTGATGGGCCAAGAACCCTTTGCAATGC
CTGCGGACTCATGTGGGCAAACAAGCTTCAAGGATCTTATCAATATAGTAAATCTGAGACCCTGGACATAAATGTAGATGAATCATTTATTGATCTCTCGGCCTCTAACA
CTTATTGCAAGCATGGCCAATCCACTGCTTATTCATTTTTCCCCTCAACGAGAACCCTAGATTCTCCTCCTTCTGTCTGGGCTCCTACCATCGTGAAAGTTGAGCATAAG
CTCCTCAAGTATATTTCTTCTGTATTGGTCAGAGTTCAGTATGACTCTGAAAATGATATGCTGCATTTAGCTGCTCTGATTTCCGTTCTAGAAATTCTTTTCCTGAAGAC
TGTTAGGGAACTCTGCGGGACCTCTCAAAAGCACCAAACCAGGGTGGACAAGCTTCAAGCTTCAACAGGAATGAGAATGTTGTGCAGAATGGAGATTCTGAGTCCAATCC
AAAAAGTGCATAGGAATTGTGACAGGATTGTGTCTTCCTGCAGAAGACGAAGGCGTGGACGAGTCTCTGGGCAGCCCGAATCTGATCTGGCGTGCCAGACACGATCGCTT
TGCCATCAAACATCCCAGGTTTAGCATGATTAACCACCACCATTGCCCCCTTATATATATCTGCCACAGTAACATACAAATTACAAACTCATTGAAGGAAAAACACAATA
TGTGCCTAACTGTCGTATTCGACTAACCTGCTGGACATGAACCATCTCCATAGACGTGCCTCCCAGAAAATGACCTGGACCCCCCCTGTTCTAAGAAAATTTATACAAAG
ACGATGATTCAAACCAACTAATAAACACTTAAAACACAATACTACATACACTTCTGAAAGGTTTCGAAATGCCACCATCTTCCAATCTCGACGAGCAGCTCAAATTTGAA
TAGAAAATTGGAGTATTAGTGAATTTATACATTTAGAATGATTGTTCTGTACAACAAGCTTGGGCAGGTAACGGTTAGTACTGAGTAGAGAAGCAGAGTGTTGTTAGAAG
ATGTGAATTATCTGCATGCTAGTATCGGCCCTTTTCATTTGATTAGTAATATGGTCTCCTCTCTACCCACCAAATAGCCTTATCTGTTGTTCATGTACAAGTAGTCTAGC
AACAACTGCAGCACCTGGTTCATATCCAATTTCTGCAATTCGACATTGTGCACTTATAACAGCTTCTTGTGCAGGAGAATATGTTTACACTAATCTCAGAGATGGAATGA
AGGAAAAAAGCAAATGTGCAAATAGGTAATTTGAACGGTTCTACGAT
Protein sequence
EMDGIHASDGRMHMSHAQHSMHTQSVQEQEHHDLHYMSNGNGMADEHENEGHGIMVVEREAPSDHGDLAENRGVMVDRGGDNCDQLTLSYQGQVYVFDSVSPEKVQAVLL
LLGGREVPLRVPSIPITNQPNDRHLTDEAFNQALAIPPRLSVPQRLASLIRFREKRKERNFDKKIRYTVRKEVALRMQRNKGQFTSSKPIQEDSLSATTSWESNETWCSD
GNGSHQQEILCRHCGISEKSTPMMRRGPDGPRTLCNACGLMWANKLQGSYQYSKSETLDINVDESFIDLSASNTYCKHGQSTAYSFFPSTRTLDSPPSVWAPTIVKVEHK
LLKYISSVLVRVQYDSENDMLHLAALISVLEILFLKTVRELCGTSQKHQTRVDKLQASTGMRMLCRMEILSPIQKVHRNCDRIVSSCRRRRRGRVSGQPESDLACQTRSL
CHQTSQV
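
The protein sequence above corresponds to a frame-1 translation of the CDS. Note that the CDS as listed starts with `GAA` ahead of the first `ATG`, which is why the protein begins with `EMD` rather than `M`. The sketch below shows one way to check this correspondence, assuming the standard genetic code (the record does not name a translation table). The function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than A, C, G, T are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases and last 9 bases of the CDS listed above.
print(translate("GAAATGGACGGTATTCATGCGAGCGATGGACGG"))
print(translate("CAGGTTTAG"))
```

The two fragments translate to the first eleven residues of the protein and to its final two residues followed by the stop codon, which is consistent with the sequences in this record.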