; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007910 (gene) of Chayote v1 genome

Gene IDSed0007910
OrganismSechium edule (Chayote v1)
Descriptionmyb family transcription factor PHL5-like
Genome locationLG03:47368207..47372111
RNA-Seq ExpressionSed0007910
SyntenySed0007910
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR025756 - MYB-CC type transcription factor, LHEQLE-containing domain
IPR044848 - PHR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022924882.1 uncharacterized protein LOC111432301 [Cucurbita moschata]5.6e-15877.11Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---
        MNDYGID  Q   + QQ+HG++ D C ++F A QQPWRMG  V + AMDEVES EQ++F SS SSSTIINLFESPASAFFATEQCMGIPPIEFR G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---

Query:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN
           SDS+SAIFQSSGEN S D  E+SGADSEF NTLQSVVKS LCKR FDGFPKS  SDHKVFD+CSHS+ K YSVPFKDQG+CYN    PSFC +QEK 
Subjt:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN

Query:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM
        SPRFSCL +S+G G      NG+GFTTKTRIRWTQ LHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRR  M
Subjt:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM

Query:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK
         E+AQLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGKQLKMMFDQQQETNKCFF+TN FN         N DD P PT ESIRNAQFPSK
Subjt:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK

Query:  IS
        IS
Subjt:  IS

XP_022966364.1 myb family transcription factor PHL5-like [Cucurbita maxima]1.0e-15977.86Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---
        MNDYGID  Q+ R   QNHG++ D C ++FRA QQPWRMG  V + AMDEVES EQ++F SS SSSTIINLFESPASAFFATEQCMGIPPIEFR G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---

Query:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN
           SDS+SAIFQSSGEN S D  E+SGADSEF NTLQSVVKS LCKR FDGFPKS  SDHK+FD+CSHS+ K YSVPFKDQGECYN    PSFC +QEK 
Subjt:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN

Query:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM
        SPRFSCL +S+GSG      NG+GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRR  M
Subjt:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM

Query:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK
         E+AQLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGKQLKMMFDQQQETNKCFF+TN F     NN   NLD+   PT ESI+NAQFPSK
Subjt:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK

Query:  IS
        IS
Subjt:  IS

XP_023517343.1 myb family transcription factor PHL4-like [Cucurbita pepo subsp. pepo]2.5e-15877.61Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPG----
        MND+GID  Q   + QQNHG++ D C ++FRA QQPWRMG  V + AM+EVES EQ++F SS SSSTIINLFESPASAFFATEQCMGIPPIEFR G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPG----

Query:  --GSDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN
           SDS+SAIFQSSGEN S D  E+SGADSEF NTLQSVVKS LCKR FDGFPKS  SDHKVFD+CSHSI K YSVPFKDQGECYN    PSFC +QEK 
Subjt:  --GSDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN

Query:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM
        SPRFS L +S+G G      NG+GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRR  M
Subjt:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM

Query:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK
         E+AQLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGKQLKMMFDQQQETNKCFF+TN FN         N DD P PT ESIRNAQFPSK
Subjt:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK

Query:  IS
        IS
Subjt:  IS

XP_023544661.1 myb family transcription factor PHL5 [Cucurbita pepo subsp. pepo]3.4e-15575.25Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGSDS
        MNDYGID KQ   +  QNHGV+ D  +++ RA QQPWRMG  VH+SAMDEVES EQ+N   S SSSTIINLFESPASAFFATEQCMGIPPIEFR G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGSDS

Query:  LSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYN----SIATPSFCYAQEKNSP
              SS  + + DSAE SGADSEF+NTLQSVV+S LCKRSF+GFPK++F+D+KVFD    SIGK +SVPFKDQG CY+    SIA P+FC +QEKNSP
Subjt:  LSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYN----SIATPSFCYAQEKNSP

Query:  RFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDE
        RFSCLSSS+GSG      NG+GF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA+RKSDRR DM+E
Subjt:  RFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDE

Query:  IAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSKIS
        +A+LDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGK+LK+MFDQQQETNKCFF  N FNK  PN+P   LDD P PT+E+IRNAQFP+ IS
Subjt:  IAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSKIS

XP_038881143.1 myb family transcription factor PHL5-like isoform X1 [Benincasa hispida]2.0e-15576.25Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPG--GS
        MNDYGID KQ   + QQNHG+I DF +++FRA QQPWRMG  VH+S MDEVES EQ N   S S+STIINLFESP SAFFATEQCMGIPPI+F+ G   S
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPG--GS

Query:  DSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKNSPRF
        DSLS IFQSSGENFS D AE SG DSE +NTLQSVVKS LCKRSF+GFPK+ F+DHKVFDE S +  K YSVPFKDQ  CYNSIA PSFC     NSPRF
Subjt:  DSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKNSPRF

Query:  SCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDEIA
        S LS S+GSG      +G+GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAER+SDRR  M+E+ 
Subjt:  SCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDEIA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTS--ESIRNAQFPSKIS
        +LD KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGKQLKMMFDQQQETNKCFF+TN FNK  PNN    LD+ P P++  ++I+NAQFPSKIS
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTS--ESIRNAQFPSKIS

TrEMBL top hitse value%identityAlignment
A0A1S3B500 uncharacterized protein LOC103486080 isoform X13.3e-14874.26Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGS--
        MN YGID KQ   + QQNHG+I D+ +++FRA QQP RMG  VH+SAMDEVES E+ N   S  +STIINLFESP SAFFATEQCMGIPPI+F+ G S  
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGS--

Query:  DSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKNSPRF
        +SLS IFQSSGENFS DSAEQSG DSEF+NTLQSVVKS LCKRSF+G PK+ F +HKVFD  S++I K YSVPFKDQ  CYNSIA PSFC     NSPRF
Subjt:  DSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKNSPRF

Query:  SCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDEIA
        SCLS SIGSG      NG+GFT KTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAER+ DRR  M+E+ 
Subjt:  SCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDEIA

Query:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNS----FNKSIPNNPLEN--LDDHPSPTSESIRNAQFP
        +LD KTAMQIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGKQLKMMFDQQQETNKCFF+T +    FNK  P+N   +  LD+ P PT     NAQFP
Subjt:  QLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNS----FNKSIPNNPLEN--LDDHPSPTSESIRNAQFP

Query:  SKIS
        SKIS
Subjt:  SKIS

A0A6J1EA93 uncharacterized protein LOC1114323012.7e-15877.11Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---
        MNDYGID  Q   + QQ+HG++ D C ++F A QQPWRMG  V + AMDEVES EQ++F SS SSSTIINLFESPASAFFATEQCMGIPPIEFR G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---

Query:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN
           SDS+SAIFQSSGEN S D  E+SGADSEF NTLQSVVKS LCKR FDGFPKS  SDHKVFD+CSHS+ K YSVPFKDQG+CYN    PSFC +QEK 
Subjt:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN

Query:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM
        SPRFSCL +S+G G      NG+GFTTKTRIRWTQ LHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRR  M
Subjt:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM

Query:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK
         E+AQLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGKQLKMMFDQQQETNKCFF+TN FN         N DD P PT ESIRNAQFPSK
Subjt:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK

Query:  IS
        IS
Subjt:  IS

A0A6J1ESD3 myb family transcription factor PHL51.1e-15175.13Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGSDS
        MNDYGID KQ   +  QNHGVI D C+++ RA QQPWRMG  VH+SAMDEVES EQ+N   S SSSTIINLFESPASAFF+TEQCMG+PPIEFR G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGSDS

Query:  LSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYN----SIATPSFCYAQEKNSP
              SS  + + DSAE SGADSEF+NTLQSVV+S LCKRSF+G PK++F+D+KVFD    SIGK +SVPFKDQG CY+    SIA PSFC +QEKNSP
Subjt:  LSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYN----SIATPSFCYAQEKNSP

Query:  RFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDE
        RFSCLSSS+GSG      NG+GF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA+RKSDRR DM+E
Subjt:  RFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDE

Query:  IAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESI
        +A+LDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGK+LK+MFDQQQETNKCFF  N FNK  PN+P   LDD P PT+E+I
Subjt:  IAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESI

A0A6J1HTK3 myb family transcription factor PHL5-like5.0e-16077.86Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---
        MNDYGID  Q+ R   QNHG++ D C ++FRA QQPWRMG  V + AMDEVES EQ++F SS SSSTIINLFESPASAFFATEQCMGIPPIEFR G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGG---

Query:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN
           SDS+SAIFQSSGEN S D  E+SGADSEF NTLQSVVKS LCKR FDGFPKS  SDHK+FD+CSHS+ K YSVPFKDQGECYN    PSFC +QEK 
Subjt:  ---SDSLSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKN

Query:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM
        SPRFSCL +S+GSG      NG+GFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRR  M
Subjt:  SPRFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDM

Query:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK
         E+AQLD+KTA+QIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGKQLKMMFDQQQETNKCFF+TN F     NN   NLD+   PT ESI+NAQFPSK
Subjt:  DEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSK

Query:  IS
        IS
Subjt:  IS

A0A6J1HW70 myb family transcription factor PHL52.5e-15174Show/hide
Query:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGSDS
        MNDYGID KQ   +  QNHGVI D  +++ RA QQPWRMG  VH+SAMDEVES EQ+N   S SSSTIINLFESPASAFFATEQCMGIPPIEF  G    
Subjt:  MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGSDS

Query:  LSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYN----SIATPSFCYAQEKNSP
              SS  + + DSAE SGADSEF+NTL SVV+S LCKRSF+GFPK++F+D+KVFD    SI K +S+PFKDQG CY+    SIA PSFC +QEKNSP
Subjt:  LSAIFQSSGENFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYN----SIATPSFCYAQEKNSP

Query:  RFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDE
        RFSC SSS GSG      NG+GF TKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA+RKSDRR DM+E
Subjt:  RFSCLSSSIGSG------NGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDE

Query:  IAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSKIS
        +A+LDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQ+QIEEQGK+LK+MFDQQQETNKCFF  N FNK  PN+P   LDD P P +E+IRNAQF + IS
Subjt:  IAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSKIS

SwissProt top hitse value%identityAlignment
B8ANX9 Protein PHOSPHATE STARVATION RESPONSE 11.6e-3846.5Show/hide
Query:  KDQGECYNSIAT-PSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIA
        K   +  NS A+ P+F  +   +S    C  +S    N +   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A
Subjt:  KDQGECYNSIAT-PSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIA

Query:  KYMPESAERKSDRRKDMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ-QETNKCFFKTNSFNKSIPNNPLENLD
        +Y P+ +E K+   K  DE++ LD+K +M + +AL+LQ++VQ+RLH+QLEIQRKLQ++IEEQGK L+ MF++Q + + +     +S + + P+ P  ++D
Subjt:  KYMPESAERKSDRRKDMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ-QETNKCFFKTNSFNKSIPNNPLENLD

Q0WVU3 Myb family transcription factor PHL57.4e-5268.29Show/hide
Query:  EKNSPRFSCLSS-SIGSGN-GSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMD
        +++ PRFS   S SI  G+       KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GLTIFHVKSHLQKYRIAKYMPES E K ++R    
Subjt:  EKNSPRFSCLSS-SIGSGN-GSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMD

Query:  EIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFK
        E++QLD +T +QIK+ALQLQLDVQR LH+QLEIQR LQ++IEEQGKQLKMM +QQQ+  +   K
Subjt:  EIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFK

Q10LZ1 Protein PHOSPHATE STARVATION RESPONSE 11.6e-3846.5Show/hide
Query:  KDQGECYNSIAT-PSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIA
        K   +  NS A+ P+F  +   +S    C  +S    N +   +K R+RWT +LHE FV  VN+LGG+EKATPK +LKLM  +GLTI+HVKSHLQKYR A
Subjt:  KDQGECYNSIAT-PSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIA

Query:  KYMPESAERKSDRRKDMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ-QETNKCFFKTNSFNKSIPNNPLENLD
        +Y P+ +E K+   K  DE++ LD+K +M + +AL+LQ++VQ+RLH+QLEIQRKLQ++IEEQGK L+ MF++Q + + +     +S + + P+ P  ++D
Subjt:  KYMPESAERKSDRRKDMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ-QETNKCFFKTNSFNKSIPNNPLENLD

Q8GUN5 Protein PHR1-LIKE 11.8e-3754.86Show/hide
Query:  SGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERKSDRRKDMDEIAQLDVKTAMQI
        + + S  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   +++I  LD+KT+++I
Subjt:  SGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERKSDRRKDMDEIAQLDVKTAMQI

Query:  KDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQE
          AL+LQ++VQ+RLH+QLEIQR LQ+QIE+QG+ L+MMF++QQ+
Subjt:  KDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQE

Q94CL7 Protein PHOSPHATE STARVATION RESPONSE 13.6e-3851.15Show/hide
Query:  KDQGECYNSIATPSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAK
        KDQ     ++  P     Q++ SP       S  S N +  T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+
Subjt:  KDQGECYNSIATPSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAK

Query:  YMPESAERKSDRRK--DMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ
        Y PE +E  S  RK   ++ I  LD+K  + I +AL+LQ++VQ++LH+QLEIQR LQ++IEEQGK L+MMF++Q
Subjt:  YMPESAERKSDRRK--DMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ

Arabidopsis top hitse value%identityAlignment
AT4G28610.1 phosphate starvation response 12.5e-3951.15Show/hide
Query:  KDQGECYNSIATPSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAK
        KDQ     ++  P     Q++ SP       S  S N +  T K R+RWT +LHE FV+ VN LGG+E+ATPK +LK+M  EGLTI+HVKSHLQKYR A+
Subjt:  KDQGECYNSIATPSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAK

Query:  YMPESAERKSDRRK--DMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ
        Y PE +E  S  RK   ++ I  LD+K  + I +AL+LQ++VQ++LH+QLEIQR LQ++IEEQGK L+MMF++Q
Subjt:  YMPESAERKSDRRK--DMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQ

AT5G06800.1 myb-like HTH transcriptional regulator family protein5.2e-5368.29Show/hide
Query:  EKNSPRFSCLSS-SIGSGN-GSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMD
        +++ PRFS   S SI  G+       KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GLTIFHVKSHLQKYRIAKYMPES E K ++R    
Subjt:  EKNSPRFSCLSS-SIGSGN-GSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMD

Query:  EIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFK
        E++QLD +T +QIK+ALQLQLDVQR LH+QLEIQR LQ++IEEQGKQLKMM +QQQ+  +   K
Subjt:  EIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQETNKCFFK

AT5G06800.2 myb-like HTH transcriptional regulator family protein2.2e-4367.88Show/hide
Query:  EKNSPRFSCLSS-SIGSGN-GSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMD
        +++ PRFS   S SI  G+       KTRIRWTQDLHEKFV+CVNRLGGA+KATPKAILK MDS+GLTIFHVKSHLQKYRIAKYMPES E K ++R    
Subjt:  EKNSPRFSCLSS-SIGSGN-GSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMD

Query:  EIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKL
        E++QLD +T +QIK+ALQLQLDVQR LH+QLE+  K+
Subjt:  EIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKL

AT5G29000.1 Homeodomain-like superfamily protein1.3e-3854.86Show/hide
Query:  SGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERKSDRRKDMDEIAQLDVKTAMQI
        + + S  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   +++I  LD+KT+++I
Subjt:  SGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERKSDRRKDMDEIAQLDVKTAMQI

Query:  KDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQE
          AL+LQ++VQ+RLH+QLEIQR LQ+QIE+QG+ L+MMF++QQ+
Subjt:  KDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQE

AT5G29000.2 Homeodomain-like superfamily protein1.3e-3854.86Show/hide
Query:  SGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERKSDRRKDMDEIAQLDVKTAMQI
        + + S  T+K R+RWT +LHE FV+ VN+LGG+E+ATPKA+LKL+++ GLTI+HVKSHLQKYR A+Y PE++    E +  +   +++I  LD+KT+++I
Subjt:  SGNGSGFTTKTRIRWTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESA----ERKSDRRKDMDEIAQLDVKTAMQI

Query:  KDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQE
          AL+LQ++VQ+RLH+QLEIQR LQ+QIE+QG+ L+MMF++QQ+
Subjt:  KDALQLQLDVQRRLHDQLEIQRKLQMQIEEQGKQLKMMFDQQQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATTACGGAATCGATTTGAAGCAACAAGATCGACAAAATCAACAAAATCATGGAGTGATTGGTGATTTCTGTAATCGGAGTTTTAGGGCACAGCAGCAGCCATG
GCGGATGGGAGGTTCTGTTCATGTATCCGCCATGGATGAAGTTGAATCGTTTGAACAGCGAAATTTCGATTCGTCTACTTCGAGTTCGACTATCATCAATCTGTTTGAAT
CTCCTGCTTCGGCGTTCTTCGCGACGGAGCAATGCATGGGGATACCTCCGATCGAGTTTCGGCCTGGTGGTTCCGATTCGCTTTCAGCGATTTTTCAATCCTCCGGCGAG
AATTTCTCTCGCGATTCGGCGGAGCAAAGCGGCGCGGATTCCGAGTTCACGAACACTTTGCAATCGGTTGTGAAATCTCATCTCTGTAAGCGGAGTTTCGACGGATTTCC
GAAGAGTGTTTTTAGTGACCACAAGGTGTTCGATGAATGTTCTCATTCAATCGGGAAGTGTTATTCAGTTCCTTTCAAAGATCAAGGAGAGTGTTATAATTCAATTGCAA
CGCCAAGTTTCTGTTATGCACAAGAGAAGAACTCTCCAAGATTCTCTTGCTTGAGTTCTTCTATTGGATCTGGAAATGGCTCTGGATTCACCACCAAAACAAGAATCAGA
TGGACGCAAGATCTTCATGAGAAGTTTGTTGACTGTGTGAATCGTCTTGGTGGTGCTGAGAAGGCGACGCCGAAAGCGATTTTGAAGCTGATGGATTCAGAGGGATTGAC
CATATTCCATGTGAAGAGTCATTTGCAGAAATATCGGATAGCGAAATACATGCCAGAATCTGCAGAAAGGAAATCTGATAGAAGGAAGGATATGGATGAAATTGCCCAAC
TTGATGTCAAAACTGCCATGCAAATTAAGGATGCTCTGCAACTACAGCTAGATGTTCAGAGGCGCCTTCATGATCAACTGGAGATTCAGAGGAAGTTACAGATGCAAATT
GAAGAACAAGGGAAACAACTCAAGATGATGTTTGACCAACAACAAGAAACTAACAAATGCTTCTTCAAGACCAATAGCTTCAACAAATCAATCCCTAATAACCCGTTGGA
AAATCTCGATGACCACCCGTCTCCGACCTCCGAGAGCATCCGAAACGCCCAATTCCCATCCAAGATAAGTTAG
mRNA sequenceShow/hide mRNA sequence
CAACTTTCCATGATGAGATTGCAACCAAACAACATGAATGGGTTTCGTATTTAAGCGAAAGATCCATTTCAAATTTGTTCTTCAATCTTGTTTTCATACTCCATCGTTAT
TTATTTGTCGATTCTTGGAATAATTTGTTCCTATTTGCTGTTTATTCATCTTGTCTTACGCATCAAGATTTCAGGAATAAGGAATTTTATGCTACGCGTTGATGTTATTT
ATGCTATCTTTCTTGTTATGCTTTCAATTTGATCGAAAAACGATTGATTCTCTGTTTGGAAACCTAGAAATTTGAGTCGAAACGGAAGATTTTGTGATTGTTTGTTTGTA
AAATCGCTAAATGAACGATTACGGAATCGATTTGAAGCAACAAGATCGACAAAATCAACAAAATCATGGAGTGATTGGTGATTTCTGTAATCGGAGTTTTAGGGCACAGC
AGCAGCCATGGCGGATGGGAGGTTCTGTTCATGTATCCGCCATGGATGAAGTTGAATCGTTTGAACAGCGAAATTTCGATTCGTCTACTTCGAGTTCGACTATCATCAAT
CTGTTTGAATCTCCTGCTTCGGCGTTCTTCGCGACGGAGCAATGCATGGGGATACCTCCGATCGAGTTTCGGCCTGGTGGTTCCGATTCGCTTTCAGCGATTTTTCAATC
CTCCGGCGAGAATTTCTCTCGCGATTCGGCGGAGCAAAGCGGCGCGGATTCCGAGTTCACGAACACTTTGCAATCGGTTGTGAAATCTCATCTCTGTAAGCGGAGTTTCG
ACGGATTTCCGAAGAGTGTTTTTAGTGACCACAAGGTGTTCGATGAATGTTCTCATTCAATCGGGAAGTGTTATTCAGTTCCTTTCAAAGATCAAGGAGAGTGTTATAAT
TCAATTGCAACGCCAAGTTTCTGTTATGCACAAGAGAAGAACTCTCCAAGATTCTCTTGCTTGAGTTCTTCTATTGGATCTGGAAATGGCTCTGGATTCACCACCAAAAC
AAGAATCAGATGGACGCAAGATCTTCATGAGAAGTTTGTTGACTGTGTGAATCGTCTTGGTGGTGCTGAGAAGGCGACGCCGAAAGCGATTTTGAAGCTGATGGATTCAG
AGGGATTGACCATATTCCATGTGAAGAGTCATTTGCAGAAATATCGGATAGCGAAATACATGCCAGAATCTGCAGAAAGGAAATCTGATAGAAGGAAGGATATGGATGAA
ATTGCCCAACTTGATGTCAAAACTGCCATGCAAATTAAGGATGCTCTGCAACTACAGCTAGATGTTCAGAGGCGCCTTCATGATCAACTGGAGATTCAGAGGAAGTTACA
GATGCAAATTGAAGAACAAGGGAAACAACTCAAGATGATGTTTGACCAACAACAAGAAACTAACAAATGCTTCTTCAAGACCAATAGCTTCAACAAATCAATCCCTAATA
ACCCGTTGGAAAATCTCGATGACCACCCGTCTCCGACCTCCGAGAGCATCCGAAACGCCCAATTCCCATCCAAGATAAGTTAGCCCCCTAAGAACCACCAACACTTGGTG
TATAAAAACTCATTTTGGATATTCACCATCTTGTTTCAAACCCACAATGGACATTGCTGACCGACCCTGCTGAAACAAAAACAAAAAATTGTAAAGATTACAGGGTCCTC
TGTTGAATATAAAAAAAAAACTTTTGAAAGTTGAATTGAAGAAAGTAGCAATGGGTTTTGTTGTGGAAATTGTCTGTTCTAATGCTTTGAAAAGTATTTATCAACACAAA
TGATGCTTTGTATCATGTACTATGTCCTTTCAGTTATTTCATGTTTTAATTATTCTTATTTTGTCTGGGTCTTAAATGTCAATATCAATAAAATGTTTGGTCTCAACTAA
AATGTTGAGGTCTCAATTTAA
Protein sequenceShow/hide protein sequence
MNDYGIDLKQQDRQNQQNHGVIGDFCNRSFRAQQQPWRMGGSVHVSAMDEVESFEQRNFDSSTSSSTIINLFESPASAFFATEQCMGIPPIEFRPGGSDSLSAIFQSSGE
NFSRDSAEQSGADSEFTNTLQSVVKSHLCKRSFDGFPKSVFSDHKVFDECSHSIGKCYSVPFKDQGECYNSIATPSFCYAQEKNSPRFSCLSSSIGSGNGSGFTTKTRIR
WTQDLHEKFVDCVNRLGGAEKATPKAILKLMDSEGLTIFHVKSHLQKYRIAKYMPESAERKSDRRKDMDEIAQLDVKTAMQIKDALQLQLDVQRRLHDQLEIQRKLQMQI
EEQGKQLKMMFDQQQETNKCFFKTNSFNKSIPNNPLENLDDHPSPTSESIRNAQFPSKIS