; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002083 (gene) of Snake gourd v1 genome

Gene IDTan0002083
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription factor bHLH140
Genome locationLG08:73480803..73492725
RNA-Seq ExpressionTan0002083
SyntenyTan0002083
Gene Ontology termsGO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0000012 - single strand break repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR036236 - Zinc finger C2H2 superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR002589 - Macro domain
IPR003656 - Zinc finger, BED-type
IPR011146 - HIT-like domain
IPR019808 - Histidine triad, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596332.1 Transcription factor basic helix-loop-helix 140, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0087.03Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMDIDENSTAKG E + KL+MV+LVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLE EQR++FVKLG
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELP LNEGF RITFCH+ETDVQSAID YK LGLHD L DGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIM+FLKKAENP+KTCS+ANT KD P SQTTQE       K+ES+ T+LS+VNKES KGEN  PGV SLE+NIS SD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVE VEEFM KLGNARLV+VDLSHGSKILSL KAKAAKKNI S KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE AT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQA SLRPGN VAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKSAKGI ERLGSAPSE ++HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        +TH KFKR +LQ PERSKKWKG+Q+SAEALNQN+NK   KMSKHWGSWAQALYNTAM+PERH + VLETSDDVVVLNDIYPKAR+HLLIVARYEGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DVHK HLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKNKKHWNSFNTDFFRDSVDV+DEVSSHGK  I DDESLMS
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        ME RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

XP_022949470.1 transcription factor bHLH140 [Cucurbita moschata]0.0e+0087.03Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMDIDENSTAKG E + KL+MV+LVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLE EQR++FVKLG
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELP LNEGF RITFCH+ETDVQSAID YK LGLHD L DGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIM+FLKKAENP+KTCS+ANT KD P SQTTQE       K+ES+ T+LS+VNKES KGEN  PGV SLE+NIS SD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVE VEEFM KLGNARLV+VDLSHGSKILSL KAKAAKKNI S KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE AT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQA SLRPGN VAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKSAKGI ERLGSAPSE ++HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        +TH KFKR +LQ PERSKKWKG+Q+SAEALNQN+NK   KMSKHWGSWAQALYNTAM+PERH + VLETSDDVVVLNDIYPKAR+HLLIVARYEGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DVHK HLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKNKKHWNSFNTDFFRDSVDV+DEVSSHGK  I DDESLMS
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        ME RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

XP_023540723.1 transcription factor bHLH140 [Cucurbita pepo subsp. pepo]0.0e+0087.03Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMDIDENSTAKG E + KL+MV+LVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLE EQR++FVKLG
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELP LNEGF RITFCH+ET+VQSAID YK LGLHD L DGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIM+FLKKAENP+KTCS+ANT KD P SQTTQE       K+ES+ T+LS+VNKES KGE   PGVRSLE+NIS+SD+PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVE VEEFM KLGNARLV+VDLSHGSKILSL KAKAAKKNI S KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE AT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQA SLRPGN VAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKSAKGI ERLGSAPSE ++HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        +TH KFKR +LQ PERSKKWKG+Q+SAEALNQN+NK   KMSKHWGSWAQALYNTAM+PERH +TVLETSDDVVVLNDIYPKAR+HLLIVARYEGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DVHK HLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKNKKHWNSFNTDFFRDSVDV+DEVSSHGK  I DDESLMS
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        ME RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0090.41Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMD DENSTAKGKEGQ KL+MV+LVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLE EQRADFVKL 
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
        GPRVDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELP+LNEGF+RITFCHNETDVQSAIDMYK L LHD+L  GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIMKFLKKAE PS+T SSANTVKDSPI QTTQEK +SCDKKEESA T+  +V+ ES KGE+  PGVRSLEDNISQSD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVEKVEEFM KLGNARLVLVDLSHGSKILSL KAKAAKKNISS KFFTFVGDITKL+SEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVAT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQANSL+PGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR+AYSSLFQ FIS+VED+FKS KGIH RLG  PSEP++HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        N+H KFKRE+LQNPERSKKWKGSQDS EALNQN+NKTV KMSKHWGSWAQALYNTAMHPE+H DTVLETSDDVVVLNDIYPKAR+HLL+VAR+EGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DV   HLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKN+KHWNSFNTDFFRDSV VMDEVSSHGK S+ DDESLMS
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        MELRCNRCRSAHPNLPKLKAHI KCQAPFPSTLLEGGRLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0090.54Show/hide
Query:  MVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLGGPRVDVHAVVLDLPAQLCISR
        MV+LVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLE EQRADFVKL GPRVDV AVVLDLPAQLCISR
Subjt:  MVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLGGPRVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQ+KELP+LNEGF+RITFCHNETDVQSAIDMYK L LHD+L  GCFGQKNPDKKVQLGIMKFLKKAE PS+T SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSAN

Query:  TVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMSKLGNARLVL
        TVKDSPI QTTQEK +SCDKKEESA T+  +V+ ES KGE+  PGVRSLEDNISQSD PTLAFPSISTSDFKFSHEKAAEIIVEKVEEFM KLGNARLVL
Subjt:  TVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMSKLGNARLVL

Query:  VDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNAVAVQLPSTS
        VDLSHGSKILSL KAKAAKKNISS KFFTFVGDITKL+SEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSL+PGNAVAVQLPSTS
Subjt:  VDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNAVAVQLPSTS

Query:  PLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSENTHRKFKREDLQNPERSKKWK
        PLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR+AYSSLFQ FIS+VED+FKS KGIH RLG  PSEP++HSEN+H KFKRE+LQNPERSKKWK
Subjt:  PLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSENTHRKFKREDLQNPERSKKWK

Query:  GSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLADVHKNHLPLLRTMHAVGLKWI
        GSQDS EALNQN+NKTV KMSKHWGSWAQALYNTAMHPE+H DTVLETSDDVVVLNDIYPKAR+HLL+VAR+EGLDQLADV   HLPLLRTMHAVGLKWI
Subjt:  GSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLADVHKNHLPLLRTMHAVGLKWI

Query:  DKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMSMELRCNRCRSAHPNLPKLKAH
        DKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKN+KHWNSFNTDFFRDSV VMDEVSSHGK S+ DDESLMSMELRCNRCRSAHPNLPKLKAH
Subjt:  DKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMSMELRCNRCRSAHPNLPKLKAH

Query:  ISKCQAPFPSTLLEGGRLV
        I KCQAPFPSTLLEGGRLV
Subjt:  ISKCQAPFPSTLLEGGRLV

TrEMBL top hitse value%identityAlignment
A0A1S3B6C4 transcription factor bHLH1400.0e+0086.49Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMD DENS AKGKEGQ KL+MV+LVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLE EQRADFVKLG
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
        GP+VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELPKLNEGF+RITFCHNE+DV SAIDMYK L LH++L  GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIMKFLKKAENPSKTCSSAN  K+SP  Q TQEK  SCDKKEES+  +  +V  ES KGE+  PGVRSLE  ISQSD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVEKVEEFM KLGNARLVLVDLSHGSKILS+ KAKA +KNISS KFFTFVGDITKL+SEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQANSL PGNAVAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL +AYSSLFQ FISIV+D++KS KGI+E LGS P EPQ+HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        N+H KFKRE+LQN E SKKWKGS +S E LNQN+NKTV K SKHWGSWAQALY+TAMHPERH ++VLETSDDVVVL DIYPKAR+HLL+VAR+EGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DV   HLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDF+S+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGK  I DDE L+S
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

A0A5A7TLV2 Transcription factor bHLH1400.0e+0086.49Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMD DENS AKGKEGQ KL+MV+LVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLE EQRADFVKLG
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
        GP+VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELPKLNEGF+RITFCHNE+DV SAIDMYK L LH++L  GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIMKFLKKAENPSKTCSSAN  K+SP  Q TQEK  SCDKKEES+  +  +V  ES KGE+  PGVRSLE  ISQSD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVEKVEEFM KLGNARLVLVDLSHGSKILS+ KAKA +KNISS KFFTFVGDITKL+SEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQANSL PGNAVAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL +AYSSLFQ FISIV+D++KS KGI+E LGS P EPQ+HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        N+H KFKRE+LQN E SKKWKGS +S E LNQN+NKTV K SKHWGSWAQALY+TAMHPERH ++VLETSDDVVVL DIYPKAR+HLL+VAR+EGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DV   HLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDF+S+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGK  I DDE L+S
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

A0A6J1CV45 transcription factor bHLH1400.0e+0085.7Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMDID+NSTAKGKEGQ+KL+MV+LVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLE EQRADFVK+G
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQ+KELPKLNEGF+RITFCHNETDVQSAID YK LGLHD L  GCFGQ   D KVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIMKFLKKA+NP+KTCSSAN +KDSP  QT++E   SCDKKEE A T+  +V+KES KGEN  PGVRSL D+IS+SD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVE VEEFM KLGNARLVLVDL++GSK+LSL KAKAAKK I+ +KFFTFVGDITKL+SEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVAT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQANSLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQGFISIVE+QFKS KGI + LGSAPSE ++HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHR--------KFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVAR
        ++ R        KFKRED+QNPERSKKWKGSQDSAEA NQN+N  V KMSKHWGSWAQALYNTAMHPERHGDTVLE SDDV VLNDIY KA +HLL+VAR
Subjt:  NTHR--------KFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVAR

Query:  YEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSI
        YEGLDQLADV + HLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKNKKHWNSFNTDFFRDSVDVMDEV SHGK SI
Subjt:  YEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSI

Query:  NDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
         DDESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLV
Subjt:  NDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

A0A6J1GCV0 transcription factor bHLH1400.0e+0087.03Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MDMDIDENSTAKG E + KL+MV+LVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLE EQR++FVKLG
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELP LNEGF RITFCH+ETDVQSAID YK LGLHD L DGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIM+FLKKAENP+KTCS+ANT KD P SQTTQE       K+ES+ T+LS+VNKES KGEN  PGV SLE+NIS SD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVE VEEFM KLGNARLV+VDLSHGSKILSL KAKAAKKNI S KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE AT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQA SLRPGN VAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKSAKGI ERLGSAPSE ++HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        +TH KFKR +LQ PERSKKWKG+Q+SAEALNQN+NK   KMSKHWGSWAQALYNTAM+PERH + VLETSDDVVVLNDIYPKAR+HLLIVARYEGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DVHK HLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKNKKHWNSFNTDFFRDSVDV+DEVSSHGK  I DDESLMS
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        ME RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

A0A6J1I5W0 transcription factor bHLH1400.0e+0085.54Show/hide
Query:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG
        MD+DIDENSTA+G E + KL+MV+LVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLE EQR++FVKLG
Subjt:  MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQ+KELP LNEGF RI FCH+ETDVQSAID YK LGLHD L DGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA
        LGIM FLKKAENP+KTCS+ANT+KD P SQTTQE       K+ES+ T+LS+VNKES KGEN  PG+RSLE+NIS SD PTLAFPSISTSDFKFSHEKAA
Subjt:  LGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAA

Query:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT
        EIIVE VEEFM KLGNARLV+VD+SHGSKILSL KAKAAKKNI S KFFTFVGDITKL S+GGL C+VIANAANWRLKPGGGGVNAAIFSAAGP LE AT
Subjt:  EIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVAT

Query:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE
        KQQA SLRPG+ VAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKSAKGI +RLGSAPSE ++HSE
Subjt:  KQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSE

Query:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA
        +TH KFKR + Q PERSKKWKG+Q+SAEALNQN+NK   KMSKHWGSWAQALYNTAM+PERH +TVLETSDDVVVLNDIYPKAR+HLLIVARYEGLDQLA
Subjt:  NTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLA

Query:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS
        DVH  HLPLL+TMHAVGLKWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDF+SSHLKNKKHWNSFNTDFFRDSVDV+DEVS+HGK  I DDESLMS
Subjt:  DVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMS

Query:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        ME RCNRCRSAHPNLPKLK H+SKCQ+PFPSTLLEG RLV
Subjt:  MELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

SwissProt top hitse value%identityAlignment
P61797 Aprataxin3.5e-3339.32Show/hide
Query:  THRKFK---------REDLQNPERSKKWKGSQDSAE-ALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVA
        THRK K         R+  Q  E S   +   DS++ ++  N  K      +  G W+Q L  +   P+      +   + VVV+ D YPKAR H L++ 
Subjt:  THRKFK---------REDLQNPERSKKWKGSQDSAE-ALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVA

Query:  RYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVS
         +  +  L  V   HL LL+ MH VG K I   F   + L FRLGYH+ PSM  +HLHVISQDF+S  LKNKKHWNSFNT++F +S  V++ V   G+VS
Subjt:  RYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVS

Query:  INDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK
        + D    L+ + LRC+ C+   P++P+LK H+ +
Subjt:  INDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK

P61798 Aprataxin (Fragment)2.9e-3534.23Show/hide
Query:  MNPQRPNYLNNDYDEGCKL----LRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSENTHRKFKREDLQN-PERSKKWK--GSQDSAEALNQ
        +NP   + ++   DE  K+    +    + L+   +   E+  +S     E++ +     +   EN       +D++N P+++KK +   +Q S+  L  
Subjt:  MNPQRPNYLNNDYDEGCKL----LRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSENTHRKFKREDLQN-PERSKKWK--GSQDSAEALNQ

Query:  NSNKT-----VQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHE
        + +            +H G W+Q L ++   P+      +   +  VV+ D YPKAR H L++  ++ +  L  V + HL LL  MHAVG K I +   +
Subjt:  NSNKT-----VQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHE

Query:  DASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDES-LMSMELRCNRCRSAHPNLPKLKAHISK
        + SL FRLGYH+ PSM QLHLHVISQDF+S  LK KKHWNSF T++F +S +V++ V S GKV++ND  S L+ + LRC+ C+     +P+LK H+ K
Subjt:  DASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDES-LMSMELRCNRCRSAHPNLPKLKAHISK

Q7TQC5 Aprataxin5.4e-3440.17Show/hide
Query:  THRKFKREDLQNPERSKK-----WKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGL
        T RK KR D  + E   +       GS  S  +++   +K      +  G W+Q L  +   P+      +   D VVV+ D YPKAR H L++  +  +
Subjt:  THRKFKREDLQNPERSKK-----WKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGL

Query:  DQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDE
          L  V   HL LL+ MHAVG K I   F   + L FRLGYH+ PSM  +HLHVISQDF+S  LKNKKHWNSFNT++F +S  V+  V   G+V++ D  
Subjt:  DQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDE

Query:  -SLMSMELRCNRCRSAHPNLPKLKAHISK
          L+ + LRC+ C+   P++P+LK H+ K
Subjt:  -SLMSMELRCNRCRSAHPNLPKLKAHISK

Q8K4H4 Aprataxin7.0e-3440.62Show/hide
Query:  THRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLAD
        T RK KR D  + E   +  G+  S  +++  + K      +  G W+Q L  +   P+      +   D VVV+ D YPKAR H L++  +  +  L  
Subjt:  THRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLAD

Query:  VHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDE-SLMS
        V   HL LL+ MHAVG K I   F   + L FRLGYH+ PSM  +HLHVISQDF+S  LKNKKHWNSFNT++F +S  V+  V   G+V++ D    L+ 
Subjt:  VHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDE-SLMS

Query:  MELRCNRCRSAHPNLPKLKAHISK
        + LRC+ C+   P++P+LK H+ K
Subjt:  MELRCNRCRSAHPNLPKLKAHISK

Q9M041 Transcription factor bHLH1408.3e-24559.65Show/hide
Query:  MDMDIDEN-----STAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRAD
        M++ I+E+     S    +  + K ++V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQR++
Subjt:  MDMDIDEN-----STAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRAD

Query:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNP
        F+KLGGP  +VHAVVL+LPAQ+CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGFSRI FC+++ DV +A++MY  LG  D L  GCFG+K  
Subjt:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNP

Query:  DKKVQLGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFS
        D K Q GIMKF KK    +   SS+N            E  N+  K +E    V  S  K                   S    PTLAFPSIST+DF+F 
Subjt:  DKKVQLGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFS

Query:  HEKAAEIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPG
         EKA++IIVEK EEF+SKLG ARLVLVDLS GSKILSL KAKA++KNI SAKFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP 
Subjt:  HEKAAEIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPG

Query:  LEVATKQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEP
        LE AT+ +AN+L PG AV V LPST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+GF+S+V+DQ K               P
Subjt:  LEVATKQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEP

Query:  QRHSENTHRKFKREDLQNPERSKKWKGSQDSAEALN------QNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLI
        +R S+        +  ++ ER+KK+KGSQD A   N      +++  + +KMSK W +WA AL++ AMHPERH + VLE  D++VV+ND YPKAR+H+L+
Subjt:  QRHSENTHRKFKREDLQNPERSKKWKGSQDSAEALN------QNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLI

Query:  VARYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGK
        +AR E LD L DV K +L LL+ MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDFNS  LKNKKHWNSF T FFRDSVDV++EV+S GK
Subjt:  VARYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGK

Query:  VSINDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
         ++   E L+  ELRCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  VSINDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT2G40600.1 appr-1-p processing enzyme family protein9.2e-0533.6Show/hide
Query:  NISSAKFFTFV-GDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGP--
        N+S +     + GDITK S +     + I N AN R+  GGGG + AI  AAGP L  A   +   +RPG          +P FN    + VIH +GP  
Subjt:  NISSAKFFTFV-GDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGP--

Query:  --NMNPQRPNYLNNDYDEGCKLLRD
          ++NPQ    L N Y    ++ ++
Subjt:  --NMNPQRPNYLNNDYDEGCKLLRD

AT5G01310.1 APRATAXIN-like5.9e-24659.65Show/hide
Query:  MDMDIDEN-----STAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRAD
        M++ I+E+     S    +  + K ++V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQR++
Subjt:  MDMDIDEN-----STAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRAD

Query:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNP
        F+KLGGP  +VHAVVL+LPAQ+CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGFSRI FC+++ DV +A++MY  LG  D L  GCFG+K  
Subjt:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNP

Query:  DKKVQLGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFS
        D K Q GIMKF KK    +   SS+N            E  N+  K +E    V  S  K                   S    PTLAFPSIST+DF+F 
Subjt:  DKKVQLGIMKFLKKAENPSKTCSSANTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFS

Query:  HEKAAEIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPG
         EKA++IIVEK EEF+SKLG ARLVLVDLS GSKILSL KAKA++KNI SAKFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP 
Subjt:  HEKAAEIIVEKVEEFMSKLGNARLVLVDLSHGSKILSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPG

Query:  LEVATKQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEP
        LE AT+ +AN+L PG AV V LPST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+GF+S+V+DQ K               P
Subjt:  LEVATKQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEP

Query:  QRHSENTHRKFKREDLQNPERSKKWKGSQDSAEALN------QNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLI
        +R S+        +  ++ ER+KK+KGSQD A   N      +++  + +KMSK W +WA AL++ AMHPERH + VLE  D++VV+ND YPKAR+H+L+
Subjt:  QRHSENTHRKFKREDLQNPERSKKWKGSQDSAEALN------QNSNKTVQKMSKHWGSWAQALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLI

Query:  VARYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGK
        +AR E LD L DV K +L LL+ MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDFNS  LKNKKHWNSF T FFRDSVDV++EV+S GK
Subjt:  VARYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHLKNKKHWNSFNTDFFRDSVDVMDEVSSHGK

Query:  VSINDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
         ++   E L+  ELRCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  VSINDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATATCGACGAAAATTCCACAGCTAAAGGAAAGGAAGGGCAACAGAAGCTCTTAATGGTGATGTTAGTGGGCGCACCAGGAAGCGGCAAATCCACCTTCTG
CGAACTCGTAATGGGTTCTTCTTCTCGCCCGTGGGTTCGAATCTGCCAGGATACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCCAGTG
CATTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAACAGAGCAGCGTGCAGATTTTGTGAAATTAGGCGGCCCTCGAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCTCAGCTTTGTATTTCCCGTTCGGTTAAGCGGACTGGTCATGAAGGGAATTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAG
GAAAGAATTGCCCAAGCTAAATGAAGGGTTCTCTCGTATAACGTTTTGTCACAATGAGACTGACGTTCAATCCGCTATAGATATGTACAAATTGCTTGGTTTACATGATG
TACTTTCGGATGGATGTTTTGGACAGAAGAATCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAAATCCTTCTAAGACGTGCTCTAGTGCC
AATACCGTTAAGGATTCTCCAATTTCTCAAACTACCCAGGAAAAGATCAACTCTTGTGATAAAAAGGAAGAGTCTGCCTTCACAGTGTTGAGCAGTGTCAATAAAGAGTC
AACGAAAGGTGAGAATCCAATTCCAGGCGTTAGATCCTTAGAAGACAATATTTCTCAAAGTGATACTCCAACTCTTGCATTTCCATCTATTTCGACATCGGATTTCAAGT
TTAGCCATGAAAAGGCTGCAGAAATTATCGTTGAGAAGGTTGAGGAATTCATGAGTAAGCTTGGAAATGCCAGGCTTGTGTTGGTAGACTTGAGTCATGGATCAAAGATT
TTGTCTTTGGCTAAAGCTAAAGCAGCCAAGAAAAATATTAGCTCCGCCAAGTTTTTTACATTTGTAGGTGATATAACTAAACTCAGTTCAGAAGGTGGATTGCGCTGCAA
TGTAATAGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAG
CAAACTCCCTTCGACCTGGCAACGCCGTGGCCGTTCAATTGCCTTCAACTTCTCCTCTGTTTAATAGAGAAGGAGTAACTCATGTCATACATGTTCTAGGACCCAACATG
AATCCACAAAGGCCAAATTATCTCAACAATGACTACGATGAAGGTTGCAAGCTTCTTCGTGATGCTTACTCTTCCCTATTTCAAGGCTTTATTTCGATAGTAGAAGACCA
ATTTAAGTCGGCAAAGGGAATTCACGAACGCCTTGGCTCGGCACCTTCAGAACCACAAAGGCACTCTGAGAACACGCATCGCAAGTTTAAGAGAGAGGATTTGCAAAATC
CTGAAAGAAGCAAAAAGTGGAAAGGATCTCAGGATTCAGCTGAAGCATTAAACCAAAACAGCAATAAGACTGTCCAGAAAATGAGTAAGCACTGGGGCTCGTGGGCACAA
GCACTTTACAACACTGCAATGCATCCCGAGAGACATGGTGATACTGTACTGGAGACATCAGATGATGTTGTAGTATTGAATGATATTTATCCAAAAGCACGCCAGCATCT
TCTGATAGTGGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACATAAAAATCACCTTCCATTGTTGAGGACGATGCACGCCGTGGGTTTGAAGTGGATTGATAAGT
TCTTTCATGAAGATGCATCATTGGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAACTGCATCTACATGTTATAAGCCAGGATTTCAACTCCAGTCATCTA
AAGAACAAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCCGAGACTCTGTCGACGTTATGGACGAAGTCAGTAGCCATGGAAAGGTGAGCATCAATGACGATGAGAG
CTTGATGTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCACATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACGCTGC
TTGAGGGCGGTCGTTTAGTAACCAACGTAAATATGGATTCCAATGTAGAGGGAGGGACAACTAGCTCGAAGGAGACTATAAAAAATAAAGATGTCGATCTTGATCTAAAA
GCTTCTGAGAATCCAAGTAAGAAGCAACGTAAGAACCCCAAACAATCAATGGTGTGGGATCATTTTGAGCGGTTAGAAGGTGATCCTAATGATCCTTATGCTAAGTGTAA
ATATTGTGGAGCTACTTATGCATGTCATTCTAAACGTAATGGTACTGGGAGTATGAAAAATCACTTAGACAATTGTAAGAAATATCCTTATCAAAAGAAGAGAGATCCAA
CCCAAAAGACATTAGCTTTCAAACCTATAGAAACTCAAGCTACTGAGGGTGGGGAATGCCAAGTTGTATGTGAGTCATTTGATTTGCAGAATTGTCGGGAAGTTTTAGCT
TAA
mRNA sequenceShow/hide mRNA sequence
AGCAAATTTGGGGAAGGAGCTGACCGTGGCGTATTGGACAAGCTTCGCGCATCGGAGGCCGATCGTCGGCAATGGCGATTGATAGGGATTAAAGGAAGACCTAAAATCTG
ACAAAGAAATGGACATGGATATCGACGAAAATTCCACAGCTAAAGGAAAGGAAGGGCAACAGAAGCTCTTAATGGTGATGTTAGTGGGCGCACCAGGAAGCGGCAAATCC
ACCTTCTGCGAACTCGTAATGGGTTCTTCTTCTCGCCCGTGGGTTCGAATCTGCCAGGATACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGC
AGCCAGTGCATTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAACAGAGCAGCGTGCAGATTTTGTGAAATTAGGCGGCCCTCGAGTGGATGTACATG
CTGTTGTATTAGATCTTCCTGCTCAGCTTTGTATTTCCCGTTCGGTTAAGCGGACTGGTCATGAAGGGAATTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATG
CTGCAAAGGAAAGAATTGCCCAAGCTAAATGAAGGGTTCTCTCGTATAACGTTTTGTCACAATGAGACTGACGTTCAATCCGCTATAGATATGTACAAATTGCTTGGTTT
ACATGATGTACTTTCGGATGGATGTTTTGGACAGAAGAATCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAAATCCTTCTAAGACGTGCT
CTAGTGCCAATACCGTTAAGGATTCTCCAATTTCTCAAACTACCCAGGAAAAGATCAACTCTTGTGATAAAAAGGAAGAGTCTGCCTTCACAGTGTTGAGCAGTGTCAAT
AAAGAGTCAACGAAAGGTGAGAATCCAATTCCAGGCGTTAGATCCTTAGAAGACAATATTTCTCAAAGTGATACTCCAACTCTTGCATTTCCATCTATTTCGACATCGGA
TTTCAAGTTTAGCCATGAAAAGGCTGCAGAAATTATCGTTGAGAAGGTTGAGGAATTCATGAGTAAGCTTGGAAATGCCAGGCTTGTGTTGGTAGACTTGAGTCATGGAT
CAAAGATTTTGTCTTTGGCTAAAGCTAAAGCAGCCAAGAAAAATATTAGCTCCGCCAAGTTTTTTACATTTGTAGGTGATATAACTAAACTCAGTTCAGAAGGTGGATTG
CGCTGCAATGTAATAGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAA
ACAACAAGCAAACTCCCTTCGACCTGGCAACGCCGTGGCCGTTCAATTGCCTTCAACTTCTCCTCTGTTTAATAGAGAAGGAGTAACTCATGTCATACATGTTCTAGGAC
CCAACATGAATCCACAAAGGCCAAATTATCTCAACAATGACTACGATGAAGGTTGCAAGCTTCTTCGTGATGCTTACTCTTCCCTATTTCAAGGCTTTATTTCGATAGTA
GAAGACCAATTTAAGTCGGCAAAGGGAATTCACGAACGCCTTGGCTCGGCACCTTCAGAACCACAAAGGCACTCTGAGAACACGCATCGCAAGTTTAAGAGAGAGGATTT
GCAAAATCCTGAAAGAAGCAAAAAGTGGAAAGGATCTCAGGATTCAGCTGAAGCATTAAACCAAAACAGCAATAAGACTGTCCAGAAAATGAGTAAGCACTGGGGCTCGT
GGGCACAAGCACTTTACAACACTGCAATGCATCCCGAGAGACATGGTGATACTGTACTGGAGACATCAGATGATGTTGTAGTATTGAATGATATTTATCCAAAAGCACGC
CAGCATCTTCTGATAGTGGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACATAAAAATCACCTTCCATTGTTGAGGACGATGCACGCCGTGGGTTTGAAGTGGAT
TGATAAGTTCTTTCATGAAGATGCATCATTGGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAACTGCATCTACATGTTATAAGCCAGGATTTCAACTCCA
GTCATCTAAAGAACAAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCCGAGACTCTGTCGACGTTATGGACGAAGTCAGTAGCCATGGAAAGGTGAGCATCAATGAC
GATGAGAGCTTGATGTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCACATATTTCCAAATGCCAAGCGCCTTTCCCTTC
CACGCTGCTTGAGGGCGGTCGTTTAGTAACCAACGTAAATATGGATTCCAATGTAGAGGGAGGGACAACTAGCTCGAAGGAGACTATAAAAAATAAAGATGTCGATCTTG
ATCTAAAAGCTTCTGAGAATCCAAGTAAGAAGCAACGTAAGAACCCCAAACAATCAATGGTGTGGGATCATTTTGAGCGGTTAGAAGGTGATCCTAATGATCCTTATGCT
AAGTGTAAATATTGTGGAGCTACTTATGCATGTCATTCTAAACGTAATGGTACTGGGAGTATGAAAAATCACTTAGACAATTGTAAGAAATATCCTTATCAAAAGAAGAG
AGATCCAACCCAAAAGACATTAGCTTTCAAACCTATAGAAACTCAAGCTACTGAGGGTGGGGAATGCCAAGTTGTATGTGAGTCATTTGATTTGCAGAATTGTCGGGAAG
TTTTAGCTTAAATGATAATTCTAGATGAAGTGTCTTTTAAGTTTGTAGAACGTAAAGGTTTTCGGAAATTTGTCAATAAGTTAACATGTGGAAGTCCAATTGAAC
Protein sequenceShow/hide protein sequence
MDMDIDENSTAKGKEGQQKLLMVMLVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLETEQRADFVKLGGPRVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQRKELPKLNEGFSRITFCHNETDVQSAIDMYKLLGLHDVLSDGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSA
NTVKDSPISQTTQEKINSCDKKEESAFTVLSSVNKESTKGENPIPGVRSLEDNISQSDTPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMSKLGNARLVLVDLSHGSKI
LSLAKAKAAKKNISSAKFFTFVGDITKLSSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNAVAVQLPSTSPLFNREGVTHVIHVLGPNM
NPQRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSAKGIHERLGSAPSEPQRHSENTHRKFKREDLQNPERSKKWKGSQDSAEALNQNSNKTVQKMSKHWGSWAQ
ALYNTAMHPERHGDTVLETSDDVVVLNDIYPKARQHLLIVARYEGLDQLADVHKNHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFNSSHL
KNKKHWNSFNTDFFRDSVDVMDEVSSHGKVSINDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVTNVNMDSNVEGGTTSSKETIKNKDVDLDLK
ASENPSKKQRKNPKQSMVWDHFERLEGDPNDPYAKCKYCGATYACHSKRNGTGSMKNHLDNCKKYPYQKKRDPTQKTLAFKPIETQATEGGECQVVCESFDLQNCREVLA