; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G015900 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G015900
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptiontranscription factor bHLH143-like
Genome locationCG_Chr09:30984576..30988883
RNA-Seq ExpressionClCG09G015900
SyntenyClCG09G015900
Gene Ontology termsNA
InterPro domainsIPR037546 - Transcription factor SAC51-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571857.1 Transcription factor basic helix-loop-helix 143, partial [Cucurbita argyrosperma subsp. sororia]1.2e-19080.51Show/hide
Query:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP-RGNQRHIHLVVIVYCWMVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRL
        MVCQ+ASQTRFRALK+ENGIAGKPTI+VKVIACFQP QNCQAEYFR LLKP  GNQRHI+LVVIV+CWMVKTGDS PQ GHSA NLP  +CMN+LLN RL
Subjt:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP-RGNQRHIHLVVIVYCWMVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRL

Query:  QCPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSA
        +C N DT ISSAQ EFWGSSIQ  GLNSE +NGL +  PSY +TMR NVLPCL++K YDSSL  AQ+ LPGSNT+FSKR+FIIFDQSGNQTSVMYSSGSA
Subjt:  QCPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSA

Query:  QIPISISAKNCSHGLNDD-DEEEAAGDIDLKNYLFHKGPVKNGI-AGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNE
        QIP+SISAKNCSHGLNDD +EEEAA D D+K+YL+HK P  NGI AGEESEMHEDTEEINALLYSDDDNH SSDDEVTSTGHSPPLIKELYD+QIEEMNE
Subjt:  QIPISISAKNCSHGLNDD-DEEEAAGDIDLKNYLFHKGPVKNGI-AGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNE

Query:  EVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLK
        EVASSDGPRKRQR++ G HK LSDAP S KVD FNNY VD KSSY+G DSQGHLMD   G FSSK+DKLRE LKLLE+MVPGAEGK P LVID+AIDYLK
Subjt:  EVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLK

Query:  SLKFKAKAMGLAAATLPHGDVGQGFQDGRKK
        SLKFKAKAMGL AATLP  DVGQG QDG+ K
Subjt:  SLKFKAKAMGLAAATLPHGDVGQGFQDGRKK

KAG7011543.1 Transcription factor bHLH, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-18579.07Show/hide
Query:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKPRGNQRHIHLVVIVYCWMVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQ
        MVCQ+ASQTRFRALK+ENGIAGKPTI+VKVIACFQP QNCQAEYFR LLKP          VIV+CWMVKTGDS PQ GHSA NLP  +CMN+LLN RL+
Subjt:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKPRGNQRHIHLVVIVYCWMVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQ

Query:  CPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQ
        C N DT ISSAQ EFWGSSIQ  GLNSE +NGL +  PSY +TMR NVLPCL++K YDSSL  AQ+ LPGSNT+FSKR+FIIFDQSGNQTSVMYSSGSAQ
Subjt:  CPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQ

Query:  IPISISAKNCSHGLNDDD-EEEAAGDIDLKNYLFHKGPVKNGI-AGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEE
        IP+SISAKNCSHGLNDDD EEEAA D D+K+YL+HK P  NGI AGEESEMHEDTEEINALLYSDDDNH SSDDEVTSTGHSPPLIKELYD+QIEEMNEE
Subjt:  IPISISAKNCSHGLNDDD-EEEAAGDIDLKNYLFHKGPVKNGI-AGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEE

Query:  VASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKS
        VASSDGPRKRQR++ G HK LSDAP S KVD FNNY VD KSSY+G DSQGHLMD   G FSSK+DKLRE LKLLE+MVPGAEGK P+LVID+AIDYLKS
Subjt:  VASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKS

Query:  LKFKAKAMGLAAATLPHGDVGQGFQDGRKK
        LKFKAKAMGL AATLP  DVGQG QDG+ K
Subjt:  LKFKAKAMGLAAATLPHGDVGQGFQDGRKK

KAG7022534.1 Transcription factor bHLH, partial [Cucurbita argyrosperma subsp. argyrosperma]3.3e-18883.58Show/hide
Query:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP-RGNQRHIHLVVIVYCWMVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRL
        MVCQAASQTRFRALKHENGI GKPTIIVKVIACFQPLQNCQAEYFRHLLKP  GNQRHIHLV         TGDS PQPGH+AGNLP+LNCMN+LLNFRL
Subjt:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP-RGNQRHIHLVVIVYCWMVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRL

Query:  QCPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSA
         C  PDTYISSA+ +F GSS  H GLNSEQKNGLLHSFPSY+ T+R +VLP LVEKH +SSLG A+MTLPGSNT+FSK+E+IIFDQS NQTSVMYSSG A
Subjt:  QCPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSA

Query:  QIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEV
        QIPISISAKNCSHGLN DDEEEAAGDID KNYLFHK P+ NG+AGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYD+Q EEMNEEV
Subjt:  QIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEV

Query:  ASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSL
        ASSDGPRKRQR+L GGH KLSDAPVSVKVD FNNY VD KSSYTGGDSQGH MDS  GNFSSKK KLRE LKLLE+MVPGAEGKHP+ +ID+AIDYLKSL
Subjt:  ASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSL

Query:  KFKAKAMG
        KFKAK+ G
Subjt:  KFKAKAMG

XP_008455136.1 PREDICTED: transcription factor bHLH143-like isoform X1 [Cucumis melo]1.9e-17587.09Show/hide
Query:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ
        MVKTGDS PQPGHSAGNLPNLNCMN+LL FRLQC NPDT +SSAQTEFWGSSI H GLN EQK  NGLLHSFPSY  TM SN LP LVEK +DSSLGF +
Subjt:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ

Query:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD
        MT+P SNT+FSKREFIIFDQ+GNQTSVMYSS +AQIPISISAKNCSHGLN DDEE+AAGDIDLKNYLFHK P+KNGIAGEESEMHEDT+EINALLYSDDD
Subjt:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD

Query:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK
        NHY SDDEVTSTGHSPPLIKELYD+QIEEMNEEVASSDGPRKRQRM+ GGHKKLS+APVSVKVD  NNYRVDMKSSY+GGDSQGHLMDS   NFSSKKDK
Subjt:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK

Query:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW
        LRE LKLLETMVPGAEGKHP+LVIDEAIDYLKSLKFKAKAMGLAAATLPH DVGQG+QDGRK+W
Subjt:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW

XP_038887815.1 transcription factor bHLH143-like [Benincasa hispida]1.6e-17989.53Show/hide
Query:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ
        MV+TGDS PQPGHSAGNLPNLNCMN+LL FRLQC NPDTYIS AQTE  GSSI H GLN EQK  NGLLHSFPSY+ TMRSNVLPCL+EKHYDSSLGFA+
Subjt:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ

Query:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD
        MTLP SNT+F K+EFIIFD SGNQTSVMYSSG AQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGP+KNGIAGEESEMHEDTEEINALLYSDDD
Subjt:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD

Query:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK
        NHYSSDDEVTSTGHSP LIKELYD+QIEEMNEEVASSDGPRKRQRML  GHKKLSDAPVSVKVD FNNYRVDMKSSYTGGDSQGHLM+SSLG FSSKKDK
Subjt:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK

Query:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKK
        LRE LKLLETMVPGAEGKHP+LVIDEAIDYLKSLKFKAKAMGLAA+TLP  DVGQ  QDGRKK
Subjt:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKK

TrEMBL top hitse value%identityAlignment
A0A0A0K268 Uncharacterized protein8.5e-17485.99Show/hide
Query:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQHG-LNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ
        MVKTGDS PQPGHSAGNLPNLNC N+LL FRLQC NPDT +SSAQTEFWGSSI HG LN EQK  N LLHSFPSY  TM SN LPCLVEK +DSSLGF +
Subjt:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQHG-LNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ

Query:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD
        MT+P SNT+F KREFIIFDQ+GNQTSVMYSS +AQIPISIS KNCSHGLN DDEE+AAGDIDLKNYLFHK P+K+GIAGEESEMHEDT+EINALLYSDDD
Subjt:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD

Query:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK
        NHY SDDEVTSTGHSPPLIKELYD+QIEEMNEEVASSDGPRKRQRM+ GGHKKLS+APVSVKVD  NNYRVDMKSSYTGG+SQGHLMDS   NFSSKKDK
Subjt:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK

Query:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW
        LRE LKLLETMVPGAEGKHP+LVIDEAIDYLKSLKFKAKAMGLAAATLPH DVGQG+QDGRK+W
Subjt:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW

A0A1S3BZT1 transcription factor bHLH143-like isoform X19.1e-17687.09Show/hide
Query:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ
        MVKTGDS PQPGHSAGNLPNLNCMN+LL FRLQC NPDT +SSAQTEFWGSSI H GLN EQK  NGLLHSFPSY  TM SN LP LVEK +DSSLGF +
Subjt:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ

Query:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD
        MT+P SNT+FSKREFIIFDQ+GNQTSVMYSS +AQIPISISAKNCSHGLN DDEE+AAGDIDLKNYLFHK P+KNGIAGEESEMHEDT+EINALLYSDDD
Subjt:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD

Query:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK
        NHY SDDEVTSTGHSPPLIKELYD+QIEEMNEEVASSDGPRKRQRM+ GGHKKLS+APVSVKVD  NNYRVDMKSSY+GGDSQGHLMDS   NFSSKKDK
Subjt:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK

Query:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW
        LRE LKLLETMVPGAEGKHP+LVIDEAIDYLKSLKFKAKAMGLAAATLPH DVGQG+QDGRK+W
Subjt:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW

A0A5D3C989 Transcription factor bHLH143-like isoform X19.1e-17687.09Show/hide
Query:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ
        MVKTGDS PQPGHSAGNLPNLNCMN+LL FRLQC NPDT +SSAQTEFWGSSI H GLN EQK  NGLLHSFPSY  TM SN LP LVEK +DSSLGF +
Subjt:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQK--NGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQ

Query:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD
        MT+P SNT+FSKREFIIFDQ+GNQTSVMYSS +AQIPISISAKNCSHGLN DDEE+AAGDIDLKNYLFHK P+KNGIAGEESEMHEDT+EINALLYSDDD
Subjt:  MTLPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDD

Query:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK
        NHY SDDEVTSTGHSPPLIKELYD+QIEEMNEEVASSDGPRKRQRM+ GGHKKLS+APVSVKVD  NNYRVDMKSSY+GGDSQGHLMDS   NFSSKKDK
Subjt:  NHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDK

Query:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW
        LRE LKLLETMVPGAEGKHP+LVIDEAIDYLKSLKFKAKAMGLAAATLPH DVGQG+QDGRK+W
Subjt:  LREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW

A0A6J1CJJ9 transcription factor bHLH143-like1.8e-16081.42Show/hide
Query:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH--GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQM
        MVKTG   PQ GHSAGNLPNLNCMN+L+NFRL+C NPDTYISSA TEF GSSI H  GLNSEQKNGLL++ PSY  TMR + LP LVEKH++SSLG A+M
Subjt:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH--GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQM

Query:  TLPGSNTDFSKREFIIFDQSGNQTSVMYSSG--SAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDD
        TLP SN + SKREFIIFDQSGNQTSVMYSSG   A+IPISISAKNCSHGLND+++E+ AGDIDLKNYLFHK P+ NGIAGEESEMHEDTEEINALLYSDD
Subjt:  TLPGSNTDFSKREFIIFDQSGNQTSVMYSSG--SAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDD

Query:  DNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVD-PFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKK
        DNHYSSDDEVTSTGHSPPLIKELYD+Q EEMNEEVAS DGPRKRQR+L GG KKLSDAPVSVKVD   +NY VD KSSYT G+SQGH MDS LG FSSKK
Subjt:  DNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVD-PFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKK

Query:  DKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW
        DK+RE L+LLETMVPGAEGKHP+LVIDEAIDYLKSLKFKAKAMGL A TLPH DVGQG Q GRKKW
Subjt:  DKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMGLAAATLPHGDVGQGFQDGRKKW

A0A6J1EIH7 transcription factor bHLH143-like7.3e-15783.82Show/hide
Query:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMT
        MVKTGDS PQPGH+AGNLP+LNC N+LLNFRL C NPDTYISSA+ +F GSS  H  LNSEQKNGLLHSFPSY+ T+R +VLP LVEKH +SSLG A+MT
Subjt:  MVKTGDSLPQPGHSAGNLPNLNCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQH-GLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMT

Query:  LPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNH
        LPGSNT+FSK+E+IIFDQS NQTSVMYSSG AQIPISISAKNCSHGLN DDEEEAAGDID KNYLFHK P+ NG+AGEESEMHEDTEEINALLYSDDDNH
Subjt:  LPGSNTDFSKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNH

Query:  YSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLR
        YSSDDEVTSTGHSPPLIKELYD+Q EEMNEEVASSDGPRKRQR+L GGHKKLSDAPVSVKVD FNNY VD KSSYTGGDSQGH MDS  GNFSSKK KLR
Subjt:  YSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLR

Query:  EILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMG
        E LKLLE+MVPGAEGKHP+ +ID+AIDYLKSLKFKAK+ G
Subjt:  EILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAKAMG

SwissProt top hitse value%identityAlignment
Q9ASX9 Transcription factor bHLH1444.3e-0525.97Show/hide
Query:  REFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNHYSSDDEVTST
        + F+IFDQ+ +++ VMY        ++  + N    L    + E  G     NY  ++  V        S   ED  EI+ALL +D+D   + D+E    
Subjt:  REFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNHYSSDDEVTST

Query:  GHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMV
        G                 +EEV+++     R      G+        S   +  NN     K S +G         SS  N    + K+++++ +L  +V
Subjt:  GHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMV

Query:  PGAEGKHPILVIDEAIDYLKSLKFKAKAMGL
        PG E  +   V+DEA+ YLKSLK +A+ +G+
Subjt:  PGAEGKHPILVIDEAIDYLKSLKFKAKAMGL

Q9FGB0 Transcription factor bHLH1452.6e-1833.91Show/hide
Query:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAG-EESEMHEDTEEINALLYSDDDNHY-SSDDE
        S++ F++FDQSG+QT+++ +S   +   ++      H   D  EE    + DL         V +G+ G  E ++ ED+EE+NALLYS+D++ Y S +DE
Subjt:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAG-EESEMHEDTEEINALLYSDDDNHY-SSDDE

Query:  VTSTGHSPPLIKELYDEQIEEMNEEVASSDGP--RKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILK
        VTS  HSP ++    ++Q       + S   P   K++++L   ++ + DA  S      +N R+              L  S L +    ++K+ E + 
Subjt:  VTSTGHSPPLIKELYDEQIEEMNEEVASSDGP--RKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILK

Query:  LLETMVPGAEGKHPILVIDEAIDYLKSLKFKAK
        LL ++VPG E   PILVID AIDYLKSLK +AK
Subjt:  LLETMVPGAEGKHPILVIDEAIDYLKSLKFKAK

Q9FMF4 Transcription factor SAC518.9e-2738.96Show/hide
Query:  TLPGSNTDFSKREFIIFDQSGNQTSVM-------YSSGSAQIPISIS-AKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINA
        T P    + S++  +IFDQSG+QT ++       + S +A  P+ +S  +       +D EE            FHK        G ESEMHEDTEEINA
Subjt:  TLPGSNTDFSKREFIIFDQSGNQTSVM-------YSSGSAQIPISIS-AKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINA

Query:  LLYSDD--DNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPF-----NNYRVDMKSSYTGGDSQGHL
        LLYSDD  D+   SDDEV STGHSP      Y  +      E+   DGP KRQ++L   +  +SD    V  +       +++  D K   +   S    
Subjt:  LLYSDD--DNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPF-----NNYRVDMKSSYTGGDSQGHL

Query:  MDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSLK
          S L N  SKKDK+R  LK+LE++VPGA+G   +L++DEAIDYLK LK
Subjt:  MDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSLK

Q9FY69 Transcription factor bHLH1433.6e-2839.13Show/hide
Query:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNH--YSSDDE
        S++ FI+FDQSG QT ++      + P S+ A+  +       E+  + D  ++  +      +NG   E+SEMHEDTEEINALLYSDDD++  + SDDE
Subjt:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNH--YSSDDE

Query:  VTSTGHSPPLI-KELYDEQIEEMNEEVASSDGP-RKRQRMLHGGHKKLSDAPV-SVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREIL
        V STGHSP  + ++  +   EE++E  ++ DGP  KRQ++L   ++  S + V + KV   ++  +  +S+ +     G    S L +  S+KDK+   L
Subjt:  VTSTGHSPPLI-KELYDEQIEEMNEEVASSDGP-RKRQRMLHGGHKKLSDAPV-SVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREIL

Query:  KLLETMVPGAEGKHPILVIDEAIDYLKSLK
        ++LE++VPGA+GK  +L++DEAIDYLK LK
Subjt:  KLLETMVPGAEGKHPILVIDEAIDYLKSLK

Arabidopsis top hitse value%identityAlignment
AT5G09460.1 sequence-specific DNA binding transcription factors;transcription regulators2.6e-2939.13Show/hide
Query:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNH--YSSDDE
        S++ FI+FDQSG QT ++      + P S+ A+  +       E+  + D  ++  +      +NG   E+SEMHEDTEEINALLYSDDD++  + SDDE
Subjt:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNH--YSSDDE

Query:  VTSTGHSPPLI-KELYDEQIEEMNEEVASSDGP-RKRQRMLHGGHKKLSDAPV-SVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREIL
        V STGHSP  + ++  +   EE++E  ++ DGP  KRQ++L   ++  S + V + KV   ++  +  +S+ +     G    S L +  S+KDK+   L
Subjt:  VTSTGHSPPLI-KELYDEQIEEMNEEVASSDGP-RKRQRMLHGGHKKLSDAPV-SVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREIL

Query:  KLLETMVPGAEGKHPILVIDEAIDYLKSLK
        ++LE++VPGA+GK  +L++DEAIDYLK LK
Subjt:  KLLETMVPGAEGKHPILVIDEAIDYLKSLK

AT5G09461.1 conserved peptide upstream open reading frame 432.7e-1569.23Show/hide
Query:  MVCQAASQTRFRALKHE-NGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP
        MV Q+A QTRFR  K+E NG + +PTI+V+VIACFQP+ NCQAEYFRH+LKP
Subjt:  MVCQAASQTRFRALKHE-NGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP

AT5G50010.1 sequence-specific DNA binding transcription factors;transcription regulators1.8e-1933.91Show/hide
Query:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAG-EESEMHEDTEEINALLYSDDDNHY-SSDDE
        S++ F++FDQSG+QT+++ +S   +   ++      H   D  EE    + DL         V +G+ G  E ++ ED+EE+NALLYS+D++ Y S +DE
Subjt:  SKREFIIFDQSGNQTSVMYSSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAG-EESEMHEDTEEINALLYSDDDNHY-SSDDE

Query:  VTSTGHSPPLIKELYDEQIEEMNEEVASSDGP--RKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILK
        VTS  HSP ++    ++Q       + S   P   K++++L   ++ + DA  S      +N R+              L  S L +    ++K+ E + 
Subjt:  VTSTGHSPPLIKELYDEQIEEMNEEVASSDGP--RKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILK

Query:  LLETMVPGAEGKHPILVIDEAIDYLKSLKFKAK
        LL ++VPG E   PILVID AIDYLKSLK +AK
Subjt:  LLETMVPGAEGKHPILVIDEAIDYLKSLKFKAK

AT5G50011.1 conserved peptide upstream open reading frame 373.2e-1676.47Show/hide
Query:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP
        MVCQ+A QTRFR LKHE+GI G   I+V+VIACFQPLQ+CQAEYFR LLKP
Subjt:  MVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKP

AT5G64340.1 sequence-specific DNA binding transcription factors;transcription regulators6.3e-2838.96Show/hide
Query:  TLPGSNTDFSKREFIIFDQSGNQTSVM-------YSSGSAQIPISIS-AKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINA
        T P    + S++  +IFDQSG+QT ++       + S +A  P+ +S  +       +D EE            FHK        G ESEMHEDTEEINA
Subjt:  TLPGSNTDFSKREFIIFDQSGNQTSVM-------YSSGSAQIPISIS-AKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINA

Query:  LLYSDD--DNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPF-----NNYRVDMKSSYTGGDSQGHL
        LLYSDD  D+   SDDEV STGHSP      Y  +      E+   DGP KRQ++L   +  +SD    V  +       +++  D K   +   S    
Subjt:  LLYSDD--DNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEVASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPF-----NNYRVDMKSSYTGGDSQGHL

Query:  MDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSLK
          S L N  SKKDK+R  LK+LE++VPGA+G   +L++DEAIDYLK LK
Subjt:  MDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGAGTAGGAAGGGAAGTGTGAGATTTGATCCTCATCCTCAAAATTGTTGTACTCAAAGGAGCGTGCATTTCCTCTTCCCATTATTCACTGTTCCTCTA
CAGTTCCCTTACACTTCCCTCTCTCCTCTGGCTTTCCCTTCCATTTCCTTTGTCTCTCAAACTTTCCATTCCCCTCTCGATCTTCACACGATTCGGCTAAAAGGC
GAGAGCCTGCTTAATTCTTCGGTAAATTACGGAAGTGACTGTAAACAAGACTCCAAGCTATTGTTCCTTTTAGTTTGCTTAAGATTTTCGAATTCTCTACTCTTG
TTCAGTTCCCCTGTGTTATGTTTGCTGCATAGTAAAGGGTTAAAAGTAGTTATGGTGTGCCAAGCAGCAAGTCAAACACGATTTCGGGCTTTGAAACATGAAAAT
GGGATTGCAGGGAAGCCAACAATTATTGTTAAAGTGATTGCATGTTTTCAACCTTTGCAGAATTGCCAGGCTGAGTACTTCCGTCATTTGCTCAAACCTAGAGGA
AATCAGAGACATATTCATCTTGTGGTAATTGTTTATTGTTGGATGGTTAAGACTGGAGATTCTTTGCCTCAACCTGGGCATTCTGCTGGGAACCTGCCCAATTTA
AATTGCATGAATAAGTTGCTAAATTTCAGATTGCAATGTCCAAACCCAGACACTTATATCTCTTCTGCACAAACGGAATTTTGGGGATCTTCTATTCAACATGGT
TTGAATTCAGAACAGAAGAATGGGTTGCTTCATAGCTTTCCATCTTACATCAAGACCATGCGTTCCAATGTACTTCCATGCCTTGTGGAGAAACACTATGATTCA
TCTTTGGGATTTGCACAAATGACCTTACCTGGTTCAAATACTGACTTTTCTAAGAGGGAATTCATTATCTTTGATCAGTCTGGAAATCAAACAAGTGTAATGTAT
AGTTCTGGTTCTGCTCAGATCCCTATATCAATTAGTGCAAAGAATTGTAGTCATGGCTTAAATGATGATGATGAAGAAGAAGCTGCTGGGGATATTGATTTAAAA
AATTATTTATTTCACAAAGGTCCTGTGAAAAATGGAATTGCTGGTGAGGAGAGTGAGATGCATGAAGACACAGAAGAAATAAACGCCTTGCTTTATTCAGATGAT
GACAATCACTATAGCAGCGATGATGAAGTAACTAGCACTGGTCATTCTCCACCGCTAATTAAGGAACTTTATGACGAACAAATTGAGGAAATGAATGAGGAAGTT
GCTAGTTCTGATGGCCCCAGAAAAAGGCAGAGAATGCTACATGGTGGGCACAAGAAATTGTCAGATGCTCCTGTTTCAGTGAAAGTAGATCCATTTAACAACTAC
AGAGTTGATATGAAATCAAGCTACACTGGTGGTGATAGCCAAGGACATCTAATGGATTCCAGTTTGGGAAATTTTTCATCAAAAAAAGACAAGCTAAGAGAGATT
TTGAAACTTCTTGAAACCATGGTTCCTGGTGCTGAGGGCAAGCACCCAATTTTGGTCATTGATGAAGCTATAGACTACTTGAAGTCTCTAAAGTTTAAAGCAAAA
GCTATGGGGCTGGCTGCTGCCACACTCCCTCACGGTGATGTCGGTCAAGGATTCCAGGATGGGAGAAAAAAGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGAGTAGGAAGGGAAGTGTGAGATTTGATCCTCATCCTCAAAATTGTTGTACTCAAAGGAGCGTGCATTTCCTCTTCCCATTATTCACTGTTCCTCTA
CAGTTCCCTTACACTTCCCTCTCTCCTCTGGCTTTCCCTTCCATTTCCTTTGTCTCTCAAACTTTCCATTCCCCTCTCGATCTTCACACGATTCGGCTAAAAGGC
GAGAGCCTGCTTAATTCTTCGGTAAATTACGGAAGTGACTGTAAACAAGACTCCAAGCTATTGTTCCTTTTAGTTTGCTTAAGATTTTCGAATTCTCTACTCTTG
TTCAGTTCCCCTGTGTTATGTTTGCTGCATAGTAAAGGGTTAAAAGTAGTTATGGTGTGCCAAGCAGCAAGTCAAACACGATTTCGGGCTTTGAAACATGAAAAT
GGGATTGCAGGGAAGCCAACAATTATTGTTAAAGTGATTGCATGTTTTCAACCTTTGCAGAATTGCCAGGCTGAGTACTTCCGTCATTTGCTCAAACCTAGAGGA
AATCAGAGACATATTCATCTTGTGGTAATTGTTTATTGTTGGATGGTTAAGACTGGAGATTCTTTGCCTCAACCTGGGCATTCTGCTGGGAACCTGCCCAATTTA
AATTGCATGAATAAGTTGCTAAATTTCAGATTGCAATGTCCAAACCCAGACACTTATATCTCTTCTGCACAAACGGAATTTTGGGGATCTTCTATTCAACATGGT
TTGAATTCAGAACAGAAGAATGGGTTGCTTCATAGCTTTCCATCTTACATCAAGACCATGCGTTCCAATGTACTTCCATGCCTTGTGGAGAAACACTATGATTCA
TCTTTGGGATTTGCACAAATGACCTTACCTGGTTCAAATACTGACTTTTCTAAGAGGGAATTCATTATCTTTGATCAGTCTGGAAATCAAACAAGTGTAATGTAT
AGTTCTGGTTCTGCTCAGATCCCTATATCAATTAGTGCAAAGAATTGTAGTCATGGCTTAAATGATGATGATGAAGAAGAAGCTGCTGGGGATATTGATTTAAAA
AATTATTTATTTCACAAAGGTCCTGTGAAAAATGGAATTGCTGGTGAGGAGAGTGAGATGCATGAAGACACAGAAGAAATAAACGCCTTGCTTTATTCAGATGAT
GACAATCACTATAGCAGCGATGATGAAGTAACTAGCACTGGTCATTCTCCACCGCTAATTAAGGAACTTTATGACGAACAAATTGAGGAAATGAATGAGGAAGTT
GCTAGTTCTGATGGCCCCAGAAAAAGGCAGAGAATGCTACATGGTGGGCACAAGAAATTGTCAGATGCTCCTGTTTCAGTGAAAGTAGATCCATTTAACAACTAC
AGAGTTGATATGAAATCAAGCTACACTGGTGGTGATAGCCAAGGACATCTAATGGATTCCAGTTTGGGAAATTTTTCATCAAAAAAAGACAAGCTAAGAGAGATT
TTGAAACTTCTTGAAACCATGGTTCCTGGTGCTGAGGGCAAGCACCCAATTTTGGTCATTGATGAAGCTATAGACTACTTGAAGTCTCTAAAGTTTAAAGCAAAA
GCTATGGGGCTGGCTGCTGCCACACTCCCTCACGGTGATGTCGGTCAAGGATTCCAGGATGGGAGAAAAAAGTGGTGA
Protein sequenceShow/hide protein sequence
MEKSRKGSVRFDPHPQNCCTQRSVHFLFPLFTVPLQFPYTSLSPLAFPSISFVSQTFHSPLDLHTIRLKGESLLNSSVNYGSDCKQDSKLLFLLVCLRFSNSLLL
FSSPVLCLLHSKGLKVVMVCQAASQTRFRALKHENGIAGKPTIIVKVIACFQPLQNCQAEYFRHLLKPRGNQRHIHLVVIVYCWMVKTGDSLPQPGHSAGNLPNL
NCMNKLLNFRLQCPNPDTYISSAQTEFWGSSIQHGLNSEQKNGLLHSFPSYIKTMRSNVLPCLVEKHYDSSLGFAQMTLPGSNTDFSKREFIIFDQSGNQTSVMY
SSGSAQIPISISAKNCSHGLNDDDEEEAAGDIDLKNYLFHKGPVKNGIAGEESEMHEDTEEINALLYSDDDNHYSSDDEVTSTGHSPPLIKELYDEQIEEMNEEV
ASSDGPRKRQRMLHGGHKKLSDAPVSVKVDPFNNYRVDMKSSYTGGDSQGHLMDSSLGNFSSKKDKLREILKLLETMVPGAEGKHPILVIDEAIDYLKSLKFKAK
AMGLAAATLPHGDVGQGFQDGRKKW