; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g29800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g29800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyb family transcription factor PHL7
Genome locationchr1:21107406..21118444
RNA-Seq ExpressionMoc01g29800
SyntenyMoc01g29800
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR025756 - MYB-CC type transcription factor, LHEQLE-containing domain
IPR044848 - PHR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603850.1 Myb family transcription factor PHL7, partial [Cucurbita argyrosperma subsp. sororia]7.4e-28677.6Show/hide
Query:  MFPRLVNPDSDIQIHGGRGSATSDLT--HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQS
        MFPRLVNPD DIQIHGGRGS  S+LT  HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQK+RLGKQS
Subjt:  MFPRLVNPDSDIQIHGGRGSATSDLT--HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQS

Query:  GKDMGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGA
        GKDMGE SKDGAYLLESPSTNNFSPDLPISEMA+GYEVKEALRAQMEVQSKLHLQVE         AEKHLRIRQDAERRYLAMLE ACKMLADQFIGGA
Subjt:  GKDMGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGA

Query:  VSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKD
        VSDSDSKKSQG +RKSPRS S+DPLGFYASQSQEMER+NGTEEVQ NL  QRADCSTESCLTSNESPGGL MEKSPV S+K M NLDS  ASLIW EAK+
Subjt:  VSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKD

Query:  RIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIW-GDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLP
        RIQDAN+IQ                          IW  D+F   ++   S ++ + L            R ++ P+                       
Subjt:  RIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIW-GDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLP

Query:  DASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDIL
        DAS MEP+NGSN P + PNLASKQRLRWTHDLH RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD SSDGKKADKKDS D+L
Subjt:  DASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDIL

Query:  SNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG-APVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQE
        SNI+ SSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKL+GVLSG APV SSFSAPASG+NCPETDKNDPPTPAPTSE PRQE
Subjt:  SNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG-APVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQE

Query:  KASKERAQAKSISVDDSSSSHHEPLTPDSGCHSSPSESPRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRL
        KASKERAQAKS+S+DDS SS HEPLTPDSGCHSSP+ESPRPVKK RQ +DGAF+KSEMILAHQILESSLNST+K S SVFPA E  DPSSG+SIGD  + 
Subjt:  KASKERAQAKSISVDDSSSSHHEPLTPDSGCHSSPSESPRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRL

Query:  D
        D
Subjt:  D

KAG7034030.1 Myb family transcription factor PHL7 [Cucurbita argyrosperma subsp. argyrosperma]1.6e-27573.96Show/hide
Query:  MFPRLVNPDSDIQIHGGRGSATSDLT--HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQS
        MFPRLVNPD DIQIHGGRGS  S+LT  HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQK+RLGKQS
Subjt:  MFPRLVNPDSDIQIHGGRGSATSDLT--HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQS

Query:  GKDMGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGA
        GKDMGE SKDGAYLLESPSTNNFSPDLPISEMA+GYEVKEALRAQMEVQSKLHLQVE         AEKHLRIRQDAERRYLAMLERACKMLADQFIGGA
Subjt:  GKDMGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGA

Query:  VSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKD
        VSDSDSKKSQG +RKSPRS S+DPLGFYASQSQEMER+NGTEEVQ NL  QRADCSTESCLTSNESPGGL MEKSPV S+K M NLDS  ASLIW EAK+
Subjt:  VSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKD

Query:  RIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPD
        RIQDAN+IQ+N                           D+F   ++   S ++ + L            R ++ P+                       D
Subjt:  RIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPD

Query:  ASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSD--------------
        AS MEP+NGSN P + PNLASKQRLRWTHDLH RFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD SSD              
Subjt:  ASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSD--------------

Query:  --------GKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG-APVPSSFSAPASGDNC
                GKKADKKDS D+LSNI+ SSGMQITEALKLQM          EVQRQLQLRIEAQGKYLKKIIEEQQKL+GVLSG APV SSFSAPASG+NC
Subjt:  --------GKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSG-APVPSSFSAPASGDNC

Query:  PETDKNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCHSSPSESPRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSV
        PETDKNDPPTPAPTSE PRQEKASKERAQAKS+S+DDS SS HEPLTPDSGCHSSP+ESPRPVKK RQ +DGAF+KSEMILAHQILESSLNST+K S SV
Subjt:  PETDKNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCHSSPSESPRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSV

Query:  FPAGEKFDPSSGISIGDQHRLD
        FPA E  DPSSG+SIGD  + D
Subjt:  FPAGEKFDPSSGISIGDQHRLD

XP_023903823.1 uncharacterized protein LOC112015623 [Quercus suber]4.0e-22363.42Show/hide
Query:  MFPRLVNP--------DSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKF
        M+PR++ P          +I I  G GS      HRGDPCLVLTSDPKPRLRWTA+LHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQK+
Subjt:  MFPRLVNP--------DSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKF

Query:  RLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKM
        RLGKQSGK+MGE S+DG   AYLLESP T N SP+LP S+  EGYEVKEALRAQMEVQSKLHLQVE         AEKHL+IRQDAERRY+AMLERACKM
Subjt:  RLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKM

Query:  LADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGT-EEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVT
        LADQFI G+V D+DS   QGL  K+PR+SS+DPLGFY   S E+  ++G+ EEV  +L  QRADCSTESCLTS+ESPGGLT+E SP G +++M+++DS T
Subjt:  LADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGT-EEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVT

Query:  ASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQ----ISLE
        ASL+W EAK + QD N+ Q+N H+ +G+ +    Q     Q  +S+        +S   S++  + L   +   +++  R  S+   ++M Q     S  
Subjt:  ASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQ----ISLE

Query:  LIKPNKLGNTNLPD--ASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCS
        L+  N L +    D  AS M+P NG N  +SN +LASKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD S
Subjt:  LIKPNKLGNTNLPD--ASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCS

Query:  SDGKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKND
        SDGKKADKK++ D++SN+D SSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVLS     S    PASGDNC ++DK D
Subjt:  SDGKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKND

Query:  PPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCHS-SPSESP---RPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPA
        P TPAPTSE+P  +KA+KE A A S+S+D+S SSHHEPLTPDS CH+ SP+ESP   R +KK R    G ++KSEM+L HQILESSL+S+Y++ HSVF  
Subjt:  PPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCHS-SPSESP---RPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPA

Query:  GEKFDPSSGISIGDQHRLD
         E+F+ SSGISI ++  ++
Subjt:  GEKFDPSSGISIGDQHRLD

XP_031266907.1 uncharacterized protein LOC116125306 [Pistacia vera]2.1e-21963.35Show/hide
Query:  PRLVNPDSDIQ-IHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKD
        P ++    D+Q  H          T   DPCLVLT+DPKPRLRWTADLH+RFVDAVTQLGG +KATPKAIMRTMNVKGLTLFHLKSHLQK+RLGKQ GKD
Subjt:  PRLVNPDSDIQ-IHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKD

Query:  MGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSD
        M +  KD +YLLESP T N SP LP S+M EGYEVKEALRAQMEVQSKLHLQVE         AEKHL+IRQDAERRYLAMLERACKMLADQFIGGAV D
Subjt:  MGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSD

Query:  SDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQ
        SD KK QGL  K PR+S++DP GFY+SQ+ ++  M+G EEV  +L  QRADCSTESCLTS++S GGLT+E S VG++KKM++LDS TASL+W E K    
Subjt:  SDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQ

Query:  DANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPD--A
        + ++ Q+N H  +G+                                             D     +PKS P        S  L + N L +    D  +
Subjt:  DANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPD--A

Query:  SLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSN
        S M+P NG N   +NP+LASKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD SSDGKKADKK++ D+LSN
Subjt:  SLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSN

Query:  IDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAP-VPSSFSAPASGDNCPETD-KNDPPTPAPTSESPRQEK
        +D SSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVLS  P   +  SAP SGDN PE+D K DP TPAPTSESP Q+K
Subjt:  IDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAP-VPSSFSAPASGDNCPETD-KNDPPTPAPTSESPRQEK

Query:  ASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSESP---RPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQ
        ASKE A +KS+SVD+S SS HEPLTPDSGC+  SPS+SP   R +KK R     A+SK+EM+L H ILESSL+S+Y++ HS+F   E+FDPSSGISIG +
Subjt:  ASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSESP---RPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQ

Query:  HRLD
          L+
Subjt:  HRLD

XP_042944529.1 uncharacterized protein LOC122278404 [Carya illinoinensis]1.3e-21662.78Show/hide
Query:  MFPRLVN--------PDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKF
        M+PR++            DI I GG G        RGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQK+
Subjt:  MFPRLVN--------PDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKF

Query:  RLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKM
        RLGKQSGKDMGE S DG   + LLESPST   SP+LP S+  +GYEV EALR QMEVQSKLHLQVE         AEKHL+IRQDAERRY+AMLE ACKM
Subjt:  RLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKM

Query:  LADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGT-EEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVT
        LADQFI GAV +++ +K QG   K+PR+SS+D LGFY+ QS  +  ++G+ EEV  +L  QRADCSTESCLTS+ESPGGLT+E SP G QK+M+ LDS T
Subjt:  LADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGT-EEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVT

Query:  ASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKP
        ASLIW E K + QD N+ Q+N  S +G+ +    +A+         +    +  +S  FS++    + +P S++ +   R  S  +G+ +  ++      
Subjt:  ASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKP

Query:  NKLGNTNLPDASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKA
                   S M+P NG+N  ++ P+LASKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD SSDGKKA
Subjt:  NKLGNTNLPDASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKA

Query:  DKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETD-KNDPPTPA
        DKK+S D LSN+DSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVLS     S    P SGDNCP+++ K DP TPA
Subjt:  DKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETD-KNDPPTPA

Query:  PTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFD
        PTSE+P Q+KA+ ERA A S+S+D+S SS HEP TPDSGCH SSPSES    R +KK R    G+++KSEM+L HQILESSL+S+Y +  S+F + E+F+
Subjt:  PTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFD

Query:  PSSGISI-GDQH
        PSSGISI  D H
Subjt:  PSSGISI-GDQH

TrEMBL top hitse value%identityAlignment
A0A4D6KL56 Two-component response regulator ARR-B family3.0e-17659.22Show/hide
Query:  ASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKD---GAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADL
        +++ATPKAIMRTMNVKGLTLFHLKSHLQK+RLGKQSGKD+GEG KD   G+YLLESP T N SP LP S+  EGYE+KEALRAQMEVQSKLHLQVE    
Subjt:  ASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKD---GAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADL

Query:  TIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMN-GTEEVQANLPYQRADCSTESC
             AEKHL+IRQDAERRY+AMLERACKMLADQFIG  V D+DS+K Q +  K+PR + VDPLGFY+  S E+  +N   EE+  +LP QRADCSTESC
Subjt:  TIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMN-GTEEVQANLPYQRADCSTESC

Query:  LTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREP
        LTS+ES GGLT+E SP G +++M+ +DS+ A LIWSEAK R Q  N+ Q NH                                                
Subjt:  LTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKISVLREP

Query:  TSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPDASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQ
                      P G                        S M+P NG N   +N NL SKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMGVQ
Subjt:  TSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPDASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQ

Query:  GLTIYHVKSHLQKYRLAKYLPDCSSD-GKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVL
        GLTIYHVKSHLQKYRLAKYLPD SSD GKKADKK++ D+LSN+D SSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVL
Subjt:  GLTIYHVKSHLQKYRLAKYLPDCSSD-GKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVL

Query:  SGAPVPSSFSAPASGDNCPETD-KNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSESP---RPVKKPRQCRDGAFSKS
        S AP  S   A   GD C E D K DP TP P       EKA+K+R  AKS+S+ +S SSHHEPLTPDSGCH  SP++SP   R  KK R   DG++SK 
Subjt:  SGAPVPSSFSAPASGDNCPETD-KNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSESP---RPVKKPRQCRDGAFSKS

Query:  EMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD
        +M+L  QILESS+ S+Y+   +VF   E+FDPS G+S      LD
Subjt:  EMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD

A0A5N5JIN0 Uncharacterized protein4.7e-16950.65Show/hide
Query:  LVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKD------------------------
        +VLTSDPKPRLRWTADLH+RFVDAV+QLGG +KATPKAI+RTMNVKGLTLFHLKSHLQK+RLGKQSGKDM +  KD                        
Subjt:  LVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKD------------------------

Query:  -GAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKS
         G+YLLE+P T N S ++  S++ EGYEVKEALRAQMEVQSKLHLQVE         AEKHL IRQDAERRYLAMLERACKMLADQFIG AV D+D++K 
Subjt:  -GAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKS

Query:  QGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQDANIIQ
         G+  K+ R +S+DPLGFY+ Q+ E+  + G E+V  +L +Q ADCSTESCLTSNESPGGL +E SP G +K M++L+S T SLIW E K    + N  Q
Subjt:  QGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQDANIIQ

Query:  INHH--SASG----------------------------------FFLILKL--QAFTDPQKWRSIWGDVFLDLD----------SSLFSLVKISVLREPT
        +N H  S SG                                    +++KL  + F   + WR +      ++D          S +F   K +    PT
Subjt:  INHH--SASG----------------------------------FFLILKL--QAFTDPQKWRSIWGDVFLDLD----------SSLFSLVKISVLREPT

Query:  SLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPDASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQG
         L      +    P    + +I   L + +        +A  M+P NG N   +NPNLASKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMG   
Subjt:  SLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPDASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQG

Query:  LTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSSS----GMQITEALKLQMEVQKR-------------------------------LHE
                   KYRLAKYLPD +SDGKK DKK++ D+LSN+D  S    G ++     LQ +   R                               L  
Subjt:  LTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSSS----GMQITEALKLQMEVQKR-------------------------------LHE

Query:  QLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETD-KNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTP
           VQRQLQLRIEAQGKYLKKIIEEQQ+LSGVL   P  S  SAP SGDNCPE+D K DP TPAPTSES  Q+K +KE A  KS+S+D S SS HEP TP
Subjt:  QLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETD-KNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTP

Query:  DSGCHS-SPSESP---RPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQ
        DS C++ SP+ESP   R +KK R     AF K EM+L HQILESSLNS + + HS F   E+FDPSSG+S+G++
Subjt:  DSGCHS-SPSESP---RPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQ

A0A6A4LZN9 HTH myb-type domain-containing protein (Fragment)2.5e-17051.3Show/hide
Query:  HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASK-------------------------------------------------------------
        H+GDPCLVLTSDPKPRLRWT+DLHERF+DAVTQLGG S+                                                             
Subjt:  HRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASK-------------------------------------------------------------

Query:  --------ATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQV
                ATPKAIMRTM VKGLTLFHLKSHLQK+RLGKQS KD+GE SKDG   ++L+E P T+N SP LP S+M EGYEVKEALRAQMEVQSKLHLQV
Subjt:  --------ATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQV

Query:  ELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQ--RAD
        E         AEKHL+IR +AE+RYLAMLER CKMLAD  IG A+++ + + ++ L   +P S S  PLGFY             EEV   +  Q  RAD
Subjt:  ELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQ--RAD

Query:  CSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKI
        CSTESCLTS+ES  GL ++ SP   +++M+N+DS T S IW E+   +Q    +Q++   + G F          P           L  + +L  L K 
Subjt:  CSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQKWRSIWGDVFLDLDSSLFSLVKI

Query:  SVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPDASLMEPSNGSNGPI-SNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGV
             P  LD                                    AS M+  NG N  + +NP+LASK RLRWTH+LH RFV+AVAQLGGPDRATPKGV
Subjt:  SVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPDASLMEPSNGSNGPI-SNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGV

Query:  LRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSS---------DGKKADKKDSSDILSNIDSS---------SGMQITEALKLQMEVQKRLHEQLEVQRQLQ
        LRVMGVQGLTIYHVKSHLQKYRLAKYLPD             GK ADKK+S D+LS++D S         SGMQITEALKLQMEVQKRLHEQLEVQRQLQ
Subjt:  LRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSS---------DGKKADKKDSSDILSNIDSS---------SGMQITEALKLQMEVQKRLHEQLEVQRQLQ

Query:  LRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDK--NDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCHSSP
        LRIEAQGKYLKKIIEEQQ+LSGVLS  P  S  SAPAS DNC E+D   + P TPAPTSESP  +K++KE A  KS+S+D+S +SHHEPLTPDSG  +  
Subjt:  LRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDK--NDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCHSSP

Query:  SESPRPVKKPRQCRDGA-FSKSEMILAHQILESSLNSTYKESHSVFPAGEKFD-PSSGISIGDQHRLD
         +  RPVKK R     A ++K EM+L  QI ESSL+  Y++++SVF AGE+ D  SSG+SIG++ +L+
Subjt:  SESPRPVKKPRQCRDGA-FSKSEMILAHQILESSLNSTYKESHSVFPAGEKFD-PSSGISIGDQHRLD

A0A6J1BTB6 protein PHR1-LIKE 2-like isoform X13.7e-174100Show/hide
Query:  MFPRLVNPDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGK
        MFPRLVNPDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGK
Subjt:  MFPRLVNPDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGK

Query:  DMGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVS
        DMGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVS
Subjt:  DMGEGSKDGAYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVS

Query:  DSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRI
        DSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRI
Subjt:  DSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRI

Query:  QDANIIQINHHSASG
        QDANIIQINHHSASG
Subjt:  QDANIIQINHHSASG

A0A6P9EMM2 uncharacterized protein LOC1089791882.1e-20956.38Show/hide
Query:  MFPRLVN--------PDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKF
        M+PR++            DI I GG G        RGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQK+
Subjt:  MFPRLVN--------PDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKF

Query:  RLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKM
        RLGKQSGKDMGE S DG   + LLESPST   SP+LP S+  +GYEV EALRA MEVQSKLHLQVE         AEKHL+IRQDAERRY+AMLE ACKM
Subjt:  RLGKQSGKDMGEGSKDG---AYLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKM

Query:  LADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNG-TEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVT
        LADQFIGGAV +++ +K QG   K+PRSSS+D LGFY+ QS  +  ++G  EEV  +L  QR DCSTESCLTS+ESPGGLT+E SP G QK+M+ LDS T
Subjt:  LADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLGFYASQSQEMERMNG-TEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVT

Query:  ASLIWSEAKDRIQDANIIQINHHSASGF-------------------------FLILKLQAFTDP----------------------QKWRSIWGDVFLD
        ASLIW E K + QD N+ Q+N  S +G+                         + IL +  F+ P                         R   G   L+
Subjt:  ASLIWSEAKDRIQDANIIQINHHSASGF-------------------------FLILKLQAFTDP----------------------QKWRSIWGDVFLD

Query:  L-------------------------DSSLFSLVKISVLREPTSLD---AMLFARPKS-------TPYGEQMVQISLELIKPNKLGNTNLPD--------
        L                           ++F+ + ++++     L     +++ R +S        P   ++  I L + +P  + N +L          
Subjt:  L-------------------------DSSLFSLVKISVLREPTSLD---AMLFARPKS-------TPYGEQMVQISLELIKPNKLGNTNLPD--------

Query:  -----ASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDS
              S M+P NG+N  ++  +LASKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD SSDGKKADKK+S
Subjt:  -----ASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDS

Query:  SDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETD-KNDPPTPAPTSES
         D+LSN+DSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LSGV S     S    P SGDNCP+++ K DP TPAPTSE+
Subjt:  SDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETD-KNDPPTPAPTSES

Query:  PRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGI
        P QEKA+ E A A S+S+D+S SS HEP TPDSGCH SSPSES    R +KK R    G+++KSEM+L HQILESSL+S+Y +  S+F + E+F+PSSGI
Subjt:  PRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGI

Query:  SI-GDQH
        SI  D H
Subjt:  SI-GDQH

SwissProt top hitse value%identityAlignment
B8ANX9 Protein PHOSPHATE STARVATION RESPONSE 19.8e-3953.63Show/hide
Query:  PSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSS
        P N SN        ASKQR+RWT +LH  FV+AV +LGG ++ATPKGVL++M V GLTIYHVKSHLQKYR A+Y PD  S+GK  + K + ++  ++D  
Subjt:  PSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSS

Query:  SGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKND
        + M +TEAL+LQMEVQKRLHEQLE+QR+LQLRIE QGKYL+K+ E+Q K S          S   P+SGD    ++ ++
Subjt:  SGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKND

Q10LZ1 Protein PHOSPHATE STARVATION RESPONSE 19.8e-3953.63Show/hide
Query:  PSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSS
        P N SN        ASKQR+RWT +LH  FV+AV +LGG ++ATPKGVL++M V GLTIYHVKSHLQKYR A+Y PD  S+GK  + K + ++  ++D  
Subjt:  PSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSS

Query:  SGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKND
        + M +TEAL+LQMEVQKRLHEQLE+QR+LQLRIE QGKYL+K+ E+Q K S          S   P+SGD    ++ ++
Subjt:  SGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKND

Q8LAJ7 Protein PHR1-LIKE 31.0e-4343.58Show/hide
Query:  SDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDG
        S + + G  G  +       D CLVLT+DPKPRLRWT++LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSHLQKFRLG+QS K+  + SKD 
Subjt:  SDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDG

Query:  AYLLESPSTNNFSPD---LPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSK
        + + ES  T + S     L   E  E Y+V EALRAQMEVQ +LH Q+E+         ++ L++R +A+ +YL ++LE+ACK + +Q +  A  ++  +
Subjt:  AYLLESPSTNNFSPD---LPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSK

Query:  KSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSN
        +   L  K+  ++     G  ++       +    E+   + ++  +CS ES LTS+
Subjt:  KSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSN

Q94A57 Protein PHR1-LIKE 21.4e-4547.9Show/hide
Query:  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDGAYLLESPSTNNFSPD---L
        GD CLVLT+DPKPRLRWT +LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSHLQKFRLG+Q+GK+  E SKD + + ES  T + S     +
Subjt:  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDGAYLLESPSTNNFSPD---L

Query:  PISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLG
           E  EGY+V EALRAQMEVQ +LH Q+E+         ++ L++R +A+ +YL ++LE+ACK   +Q    A  ++  ++   L  K   SS    + 
Subjt:  PISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLG

Query:  FY-ASQSQEMERMNGTEEVQANLPYQRADCSTESCLTS
        ++ A++   M  ++       N      +CS ES LTS
Subjt:  FY-ASQSQEMERMNGTEEVQANLPYQRADCSTESCLTS

Q9SJW0 Myb family transcription factor PHL73.9e-8866.33Show/hide
Query:  MEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNID
        ME  NG  GP  N + ASKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD SS+GKK DKK+S D+LS +D
Subjt:  MEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNID

Query:  SSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQEKASKE
         SSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVL G P     SAP +GD       +DP TPAPTSESP Q+K+ K+
Subjt:  SSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQEKASKE

Query:  RAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD
            KS+SVD+S SS+ EPLTPDSGC+  SP ES    R  KKPR  R  A    ++++ H ILES LN++Y +S  V      FD  S   +G + +LD
Subjt:  RAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD

Arabidopsis top hitse value%identityAlignment
AT2G01060.1 myb-like HTH transcriptional regulator family protein2.8e-8966.33Show/hide
Query:  MEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNID
        ME  NG  GP  N + ASKQRLRWTH+LH RFV+AVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPD SS+GKK DKK+S D+LS +D
Subjt:  MEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQLGGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNID

Query:  SSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQEKASKE
         SSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LSGVL G P     SAP +GD       +DP TPAPTSESP Q+K+ K+
Subjt:  SSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLSGVLSGAPVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQEKASKE

Query:  RAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD
            KS+SVD+S SS+ EPLTPDSGC+  SP ES    R  KKPR  R  A    ++++ H ILES LN++Y +S  V      FD  S   +G + +LD
Subjt:  RAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFSKSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD

AT2G01060.2 myb-like HTH transcriptional regulator family protein8.7e-6763.97Show/hide
Query:  MGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLS
        MGVQGLTIYHVKSHLQKYRLAKYLPD SS+GKK DKK+S D+LS +D SSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQ+LS
Subjt:  MGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQQKLS

Query:  GVLSGAPVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFS
        GVL G P     SAP +GD       +DP TPAPTSESP Q+K+ K+    KS+SVD+S SS+ EPLTPDSGC+  SP ES    R  KKPR  R  A  
Subjt:  GVLSGAPVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCH-SSPSES---PRPVKKPRQCRDGAFS

Query:  KSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD
          ++++ H ILES LN++Y +S  V      FD  S   +G + +LD
Subjt:  KSEMILAHQILESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD

AT3G24120.1 Homeodomain-like superfamily protein1.0e-4647.9Show/hide
Query:  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDGAYLLESPSTNNFSPD---L
        GD CLVLT+DPKPRLRWT +LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSHLQKFRLG+Q+GK+  E SKD + + ES  T + S     +
Subjt:  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDGAYLLESPSTNNFSPD---L

Query:  PISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLG
           E  EGY+V EALRAQMEVQ +LH Q+E+         ++ L++R +A+ +YL ++LE+ACK   +Q    A  ++  ++   L  K   SS    + 
Subjt:  PISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLG

Query:  FY-ASQSQEMERMNGTEEVQANLPYQRADCSTESCLTS
        ++ A++   M  ++       N      +CS ES LTS
Subjt:  FY-ASQSQEMERMNGTEEVQANLPYQRADCSTESCLTS

AT3G24120.2 Homeodomain-like superfamily protein3.1e-4848.74Show/hide
Query:  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDGAYLLESPSTNNFSPD---L
        GD CLVLT+DPKPRLRWT +LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSHLQKFRLG+Q+GK+  E SKD + + ES  T + S     +
Subjt:  GDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDGAYLLESPSTNNFSPD---L

Query:  PISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLG
           E  EGY+V EALRAQMEVQ +LH Q+E       GQ ++ L++R +A+ +YL ++LE+ACK   +Q    A  ++  ++   L  K   SS    + 
Subjt:  PISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSVDPLG

Query:  FY-ASQSQEMERMNGTEEVQANLPYQRADCSTESCLTS
        ++ A++   M  ++       N      +CS ES LTS
Subjt:  FY-ASQSQEMERMNGTEEVQANLPYQRADCSTESCLTS

AT4G13640.2 Homeodomain-like superfamily protein1.4e-4543.97Show/hide
Query:  SDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDG
        S + + G  G  +       D CLVLT+DPKPRLRWT++LHERFVDAVTQLGG  KATPK IMRTM VKGLTL+HLKSHLQKFRLG+QS K+  + SKD 
Subjt:  SDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDG

Query:  AYLLESPSTNNFSPD---LPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSK
        + + ES  T + S     L   E  E Y+V EALRAQMEVQ +LH Q+E        Q ++ L++R +A+ +YL ++LE+ACK + +Q +  A  ++  +
Subjt:  AYLLESPSTNNFSPD---LPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYL-AMLERACKMLADQFIGGAVSDSDSK

Query:  KSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSN
        +   L  K+  ++     G  ++       +    E+   + ++  +CS ES LTS+
Subjt:  KSQGLDRKSPRSSSVDPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCCAAGGCTTGTTAATCCCGACAGCGATATCCAGATCCATGGCGGCCGTGGTTCTGCTACTTCCGACCTCACTCACAGGGGCGACCCATGTCTTGTTTTGACTTC
AGATCCCAAACCTCGCCTCCGATGGACGGCTGATCTTCACGAGCGATTCGTCGACGCCGTCACTCAGCTCGGTGGTGCTAGCAAAGCTACACCAAAAGCAATAATGCGAA
CGATGAATGTGAAAGGACTGACTCTCTTCCATTTGAAGAGCCACCTCCAGAAATTCAGATTAGGTAAGCAATCAGGGAAGGATATGGGTGAGGGATCTAAAGATGGTGCA
TATCTTTTAGAAAGTCCCAGTACCAATAATTTCTCTCCTGACTTGCCAATTTCTGAAATGGCCGAGGGTTATGAAGTCAAGGAGGCATTGAGAGCGCAGATGGAAGTGCA
AAGTAAGTTACATCTGCAAGTTGAGCTAGCTGACCTTACCATCGTTGGGCAGGCAGAGAAGCACCTACGAATTCGTCAGGATGCCGAGCGAAGATATTTGGCCATGCTCG
AGAGAGCTTGTAAAATGCTTGCAGATCAATTTATTGGAGGTGCAGTGTCGGACTCAGACAGCAAGAAGTCCCAAGGACTGGATCGTAAGAGCCCAAGAAGTTCCTCCGTC
GACCCACTTGGTTTCTATGCTTCACAATCACAAGAGATGGAAAGAATGAATGGCACAGAAGAAGTGCAGGCTAATCTTCCTTACCAAAGAGCTGATTGTTCGACTGAGAG
CTGCCTAACCTCAAACGAGAGTCCCGGGGGATTGACCATGGAAAAATCTCCCGTTGGAAGCCAGAAAAAAATGGTTAACTTGGATTCAGTAACTGCATCTTTGATTTGGA
GTGAAGCCAAGGACAGAATACAAGATGCTAATATCATCCAAATTAACCATCACAGTGCATCCGGATTCTTCTTGATTTTGAAACTCCAGGCCTTCACAGACCCACAAAAA
TGGCGCTCAATATGGGGAGATGTGTTCTTAGATCTGGATTCCAGCTTATTTTCTTTGGTGAAGATAAGTGTTCTGAGGGAGCCCACCTCTCTTGATGCTATGCTCTTTGC
TCGGCCGAAGAGTACTCCTTATGGAGAGCAAATGGTCCAAATATCTTTAGAACTCATTAAACCCAATAAACTTGGAAACACAAACCTTCCTGATGCAAGCTTGATGGAAC
CCTCAAATGGAAGTAATGGTCCCATTAGCAACCCGAATTTGGCCTCGAAGCAGAGACTGCGATGGACGCATGATCTTCACAACCGATTTGTTAATGCAGTGGCACAACTT
GGTGGTCCAGATCGTGCTACACCCAAAGGTGTTCTTAGAGTAATGGGTGTTCAAGGTTTAACAATATACCACGTCAAAAGCCATCTGCAGAAATATCGACTTGCAAAATA
CCTTCCCGACTGCTCATCTGATGGAAAAAAGGCTGACAAGAAGGATTCTAGTGACATTCTATCAAATATCGACAGCTCGTCTGGAATGCAAATTACCGAAGCACTTAAGC
TGCAGATGGAAGTCCAGAAGCGACTGCATGAACAATTAGAGGTGCAGAGACAGCTTCAGTTACGGATTGAGGCCCAAGGCAAGTACCTAAAGAAGATAATTGAAGAGCAA
CAAAAACTCAGTGGGGTTCTTTCAGGAGCTCCAGTTCCTTCTTCCTTTTCAGCTCCTGCTTCTGGTGACAACTGCCCAGAAACTGATAAAAACGACCCGCCAACACCTGC
CCCAACATCTGAGTCCCCTCGCCAAGAGAAAGCGTCGAAGGAACGTGCCCAAGCCAAGAGTATATCTGTTGATGATTCTTCCTCGTCTCATCACGAACCATTGACGCCAG
ATTCCGGTTGCCATAGCTCCCCAAGTGAGAGCCCAAGGCCAGTGAAGAAACCAAGGCAGTGCAGGGATGGTGCATTTTCCAAATCAGAAATGATACTTGCACATCAGATA
CTGGAGTCAAGTTTAAACTCCACTTACAAGGAATCACACTCAGTTTTTCCAGCCGGGGAAAAGTTCGATCCTTCGTCTGGAATATCGATAGGGGATCAACACCGATTAGA
CTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCCAAGGCTTGTTAATCCCGACAGCGATATCCAGATCCATGGCGGCCGTGGTTCTGCTACTTCCGACCTCACTCACAGGGGCGACCCATGTCTTGTTTTGACTTC
AGATCCCAAACCTCGCCTCCGATGGACGGCTGATCTTCACGAGCGATTCGTCGACGCCGTCACTCAGCTCGGTGGTGCTAGCAAAGCTACACCAAAAGCAATAATGCGAA
CGATGAATGTGAAAGGACTGACTCTCTTCCATTTGAAGAGCCACCTCCAGAAATTCAGATTAGGTAAGCAATCAGGGAAGGATATGGGTGAGGGATCTAAAGATGGTGCA
TATCTTTTAGAAAGTCCCAGTACCAATAATTTCTCTCCTGACTTGCCAATTTCTGAAATGGCCGAGGGTTATGAAGTCAAGGAGGCATTGAGAGCGCAGATGGAAGTGCA
AAGTAAGTTACATCTGCAAGTTGAGCTAGCTGACCTTACCATCGTTGGGCAGGCAGAGAAGCACCTACGAATTCGTCAGGATGCCGAGCGAAGATATTTGGCCATGCTCG
AGAGAGCTTGTAAAATGCTTGCAGATCAATTTATTGGAGGTGCAGTGTCGGACTCAGACAGCAAGAAGTCCCAAGGACTGGATCGTAAGAGCCCAAGAAGTTCCTCCGTC
GACCCACTTGGTTTCTATGCTTCACAATCACAAGAGATGGAAAGAATGAATGGCACAGAAGAAGTGCAGGCTAATCTTCCTTACCAAAGAGCTGATTGTTCGACTGAGAG
CTGCCTAACCTCAAACGAGAGTCCCGGGGGATTGACCATGGAAAAATCTCCCGTTGGAAGCCAGAAAAAAATGGTTAACTTGGATTCAGTAACTGCATCTTTGATTTGGA
GTGAAGCCAAGGACAGAATACAAGATGCTAATATCATCCAAATTAACCATCACAGTGCATCCGGATTCTTCTTGATTTTGAAACTCCAGGCCTTCACAGACCCACAAAAA
TGGCGCTCAATATGGGGAGATGTGTTCTTAGATCTGGATTCCAGCTTATTTTCTTTGGTGAAGATAAGTGTTCTGAGGGAGCCCACCTCTCTTGATGCTATGCTCTTTGC
TCGGCCGAAGAGTACTCCTTATGGAGAGCAAATGGTCCAAATATCTTTAGAACTCATTAAACCCAATAAACTTGGAAACACAAACCTTCCTGATGCAAGCTTGATGGAAC
CCTCAAATGGAAGTAATGGTCCCATTAGCAACCCGAATTTGGCCTCGAAGCAGAGACTGCGATGGACGCATGATCTTCACAACCGATTTGTTAATGCAGTGGCACAACTT
GGTGGTCCAGATCGTGCTACACCCAAAGGTGTTCTTAGAGTAATGGGTGTTCAAGGTTTAACAATATACCACGTCAAAAGCCATCTGCAGAAATATCGACTTGCAAAATA
CCTTCCCGACTGCTCATCTGATGGAAAAAAGGCTGACAAGAAGGATTCTAGTGACATTCTATCAAATATCGACAGCTCGTCTGGAATGCAAATTACCGAAGCACTTAAGC
TGCAGATGGAAGTCCAGAAGCGACTGCATGAACAATTAGAGGTGCAGAGACAGCTTCAGTTACGGATTGAGGCCCAAGGCAAGTACCTAAAGAAGATAATTGAAGAGCAA
CAAAAACTCAGTGGGGTTCTTTCAGGAGCTCCAGTTCCTTCTTCCTTTTCAGCTCCTGCTTCTGGTGACAACTGCCCAGAAACTGATAAAAACGACCCGCCAACACCTGC
CCCAACATCTGAGTCCCCTCGCCAAGAGAAAGCGTCGAAGGAACGTGCCCAAGCCAAGAGTATATCTGTTGATGATTCTTCCTCGTCTCATCACGAACCATTGACGCCAG
ATTCCGGTTGCCATAGCTCCCCAAGTGAGAGCCCAAGGCCAGTGAAGAAACCAAGGCAGTGCAGGGATGGTGCATTTTCCAAATCAGAAATGATACTTGCACATCAGATA
CTGGAGTCAAGTTTAAACTCCACTTACAAGGAATCACACTCAGTTTTTCCAGCCGGGGAAAAGTTCGATCCTTCGTCTGGAATATCGATAGGGGATCAACACCGATTAGA
CTAA
Protein sequenceShow/hide protein sequence
MFPRLVNPDSDIQIHGGRGSATSDLTHRGDPCLVLTSDPKPRLRWTADLHERFVDAVTQLGGASKATPKAIMRTMNVKGLTLFHLKSHLQKFRLGKQSGKDMGEGSKDGA
YLLESPSTNNFSPDLPISEMAEGYEVKEALRAQMEVQSKLHLQVELADLTIVGQAEKHLRIRQDAERRYLAMLERACKMLADQFIGGAVSDSDSKKSQGLDRKSPRSSSV
DPLGFYASQSQEMERMNGTEEVQANLPYQRADCSTESCLTSNESPGGLTMEKSPVGSQKKMVNLDSVTASLIWSEAKDRIQDANIIQINHHSASGFFLILKLQAFTDPQK
WRSIWGDVFLDLDSSLFSLVKISVLREPTSLDAMLFARPKSTPYGEQMVQISLELIKPNKLGNTNLPDASLMEPSNGSNGPISNPNLASKQRLRWTHDLHNRFVNAVAQL
GGPDRATPKGVLRVMGVQGLTIYHVKSHLQKYRLAKYLPDCSSDGKKADKKDSSDILSNIDSSSGMQITEALKLQMEVQKRLHEQLEVQRQLQLRIEAQGKYLKKIIEEQ
QKLSGVLSGAPVPSSFSAPASGDNCPETDKNDPPTPAPTSESPRQEKASKERAQAKSISVDDSSSSHHEPLTPDSGCHSSPSESPRPVKKPRQCRDGAFSKSEMILAHQI
LESSLNSTYKESHSVFPAGEKFDPSSGISIGDQHRLD