; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033589 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033589
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptiontranscription factor bHLH140
Genome locationscaffold5:872961..876643
RNA-Seq ExpressionSpg033589
SyntenySpg033589
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596332.1 Transcription factor basic helix-loop-helix 140, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0087.87Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMDIDENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENPAKTCS+ANT KD P SQ+TQE       K+ES+CTM SNV+KE EKGENPGV SLE NIS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GPSLE ATKQQA SLRPGNVVAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+D+FKS KGI ERL SAP
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
        SES+KHSE++HHKFKR +LQ PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H ++VLETSDDVVVLND+YPKARKHLL+VAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        YEGLDQLADV KEHLPLLKTMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVSSHGKA I
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA
        KDDES MSME RCNRCRSAHPNLPKLKTHISKCQ+PFPSTLLEGGRLV A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0086.26Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMDID+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLEIEQRADFVK+G
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHD LP GCFGQ   D KVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NPAKTCSSAN IKDSP  Q+++E SYSCDKKEE ACT+  NVDKE EKGENPGVRSL  +IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKLNSEGGL CNVIANAANW          RLKPG GGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GP LEVATKQQANSLRPGN V  QLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVE+QFKSVKGI + L SAP
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEES--------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKAR
        SES+KHSE+S         HKFKRED+QNPERSKKWKGSQDSAEA NQNNN  VHKMSKHWGSWAQALYNTAMHPE+HGD VLE SDDV VLND+Y KA 
Subjt:  SESQKHSEES--------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKAR

Query:  KHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEV
        KHLLVVARYEGLDQLADVR+EHLPLL+TMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDV+DEV
Subjt:  KHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEV

Query:  SSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYAHE
         SHGKASIKDDES MSMELRCNRCRSAHPNL KLK HISKC+APFPSTLLEG RLVIAPS  H+
Subjt:  SSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYAHE

XP_022949470.1 transcription factor bHLH140 [Cucurbita moschata]0.0e+0087.87Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMDIDENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENPAKTCS+ANT KD P SQ+TQE       K+ES+CTM SNV+KE EKGENPGV SLE NIS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GPSLE ATKQQA SLRPGNVVAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+D+FKS KGI ERL SAP
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
        SES+KHSE++HHKFKR +LQ PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H ++VLETSDDVVVLND+YPKARKHLL+VAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        YEGLDQLADV KEHLPLLKTMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVSSHGKA I
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA
        KDDES MSME RCNRCRSAHPNLPKLKTHISKCQ+PFPSTLLEGGRLV A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA

XP_023540723.1 transcription factor bHLH140 [Cucurbita pepo subsp. pepo]0.0e+0087.6Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMDIDENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ET+VQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENPAKTCS+ANT KD P SQ+TQE       K+ES+CTM SNV+KE EKGE PGVRSLE NIS+SD PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GPSLE ATKQQA SLRPGNVVAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+D+FKS KGI ERL SAP
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
        SES+KHSE++HHKFKR +LQ PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H + VLETSDDVVVLND+YPKARKHLL+VAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        YEGLDQLADV KEHLPLLKTMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVSSHGKA I
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA
        KDDES MSME RCNRCRSAHPNLPKLKTHISKCQ+PFPSTLLEGGRLV A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0089.52Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKL 
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        GPRVDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAID YKSL LHD LP GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE P++T SSANT+KDSPI Q+TQEKS SCDKKEESACT+  NVD E +KGE+PGVRSLE NISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLV+VDLSHGSKILSLVKAKAAKKNI+STKFFTFVGDITKLNSEGGL CNVIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        G  LEVATKQQANSL+PGN VAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FIS+VED+FKSVKGIH RL   P
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
        SE +KHSE SHHKFKRE+LQNPERSKKWKGSQDS EALNQNNNKTV KMSKHWGSWAQALYNTAMHPE+H D VLETSDDVVVLND+YPKARKHLLVVAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        +EGLDQLADV  EHLPLL+TMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKN+KHWNSFNTDFFRDSV V+DEVSSHGKAS+
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYA
         DDES MSMELRCNRCRSAHPNLPKLK HI KCQAPFPSTLLEGGRLVIAPS A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYA

TrEMBL top hitse value%identityAlignment
A0A1S3B6C4 transcription factor bHLH1400.0e+0086.34Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLEIEQRADFVKLG
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        GP+VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LP GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENP+KTCSSAN  K+SP  Q TQEK  SCDKKEES+C M  NV  E EKGE+PGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI+STKFFTFVGDITKLNSEGGL CNVIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GP LEVATKQQANSL PGN VAVQLPSTSPLL REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV+D++KSVKGI+E L S P
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
         E QKHSE SHHKFKRE+LQN E SKKWKGS +S E LNQNNNKTV K SKHWGSWAQALY+TAMHPE+H + VLETSDDVVVL D+YPKARKHLLVVAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        +EGLDQLADV  EHLPLL+TMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHV+SQDFDS++LKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYA
         DDE  +SMELRCNRCRSAHPNLPKLK HISKCQAPFPSTLLE GRLV+ PS A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYA

A0A5A7TLV2 Transcription factor bHLH1400.0e+0086.34Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLEIEQRADFVKLG
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        GP+VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LP GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENP+KTCSSAN  K+SP  Q TQEK  SCDKKEES+C M  NV  E EKGE+PGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI+STKFFTFVGDITKLNSEGGL CNVIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GP LEVATKQQANSL PGN VAVQLPSTSPLL REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV+D++KSVKGI+E L S P
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
         E QKHSE SHHKFKRE+LQN E SKKWKGS +S E LNQNNNKTV K SKHWGSWAQALY+TAMHPE+H + VLETSDDVVVL D+YPKARKHLLVVAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        +EGLDQLADV  EHLPLL+TMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHV+SQDFDS++LKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYA
         DDE  +SMELRCNRCRSAHPNLPKLK HISKCQAPFPSTLLE GRLV+ PS A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYA

A0A6J1CV45 transcription factor bHLH1400.0e+0086.26Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMDID+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLEIEQRADFVK+G
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHD LP GCFGQ   D KVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NPAKTCSSAN IKDSP  Q+++E SYSCDKKEE ACT+  NVDKE EKGENPGVRSL  +IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKLNSEGGL CNVIANAANW          RLKPG GGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GP LEVATKQQANSLRPGN V  QLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVE+QFKSVKGI + L SAP
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEES--------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKAR
        SES+KHSE+S         HKFKRED+QNPERSKKWKGSQDSAEA NQNNN  VHKMSKHWGSWAQALYNTAMHPE+HGD VLE SDDV VLND+Y KA 
Subjt:  SESQKHSEES--------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKAR

Query:  KHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEV
        KHLLVVARYEGLDQLADVR+EHLPLL+TMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDV+DEV
Subjt:  KHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEV

Query:  SSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYAHE
         SHGKASIKDDES MSMELRCNRCRSAHPNL KLK HISKC+APFPSTLLEG RLVIAPS  H+
Subjt:  SSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYAHE

A0A6J1GCV0 transcription factor bHLH1400.0e+0087.87Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MDMDIDENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENPAKTCS+ANT KD P SQ+TQE       K+ES+CTM SNV+KE EKGENPGV SLE NIS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GPSLE ATKQQA SLRPGNVVAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+D+FKS KGI ERL SAP
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
        SES+KHSE++HHKFKR +LQ PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H ++VLETSDDVVVLND+YPKARKHLL+VAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        YEGLDQLADV KEHLPLLKTMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVSSHGKA I
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA
        KDDES MSME RCNRCRSAHPNLPKLKTHISKCQ+PFPSTLLEGGRLV A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA

A0A6J1I5W0 transcription factor bHLH1400.0e+0086.67Show/hide
Query:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG
        MD+DIDENS A+G E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RI FCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM FLKKAENPAKTCS+ANTIKD P SQ+TQE       K+ES+CTM SNV+KE EKGENPG+RSLE NIS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLVMVD+SHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL C+VIANAANW          RLKPGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAA

Query:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP
        GPSLE ATKQQA SLRPG+VVAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+D+FKS KGI +RL SAP
Subjt:  GPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAP

Query:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR
        SES+KHSE++HHKFKR + Q PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H + VLETSDDVVVLND+YPKARKHLL+VAR
Subjt:  SESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVAR

Query:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI
        YEGLDQLADV  EHLPLLKTMHAVGLKWIDKF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVS+HGKA I
Subjt:  YEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASI

Query:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA
        KDDES MSME RCNRCRSAHPNLPKLKTH+SKCQ+PFPSTLLEG RLV A
Subjt:  KDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIA

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)2.5e-3538.31Show/hide
Query:  ERLSSAPSESQKHSEESHHKFKREDLQN-PERSKKWK---GSQDSAEALNQNNNKTVHK----MSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLN
        E +  A  + Q          + +D++N P+++KK +       SA+     ++ + H+      +H G W+Q L ++   P+    V +   +  VV+ 
Subjt:  ERLSSAPSESQKHSEESHHKFKREDLQN-PERSKKWK---GSQDSAEALNQNNNKTVHK----MSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLN

Query:  DVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDS
        D YPKAR H LV+  ++ +  L  V +EHL LL+ MHAVG K I +   ++ SL FRLGYH+ PSM QLHLHV+SQDFDS  LK KKHWNSF T++F +S
Subjt:  DVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDS

Query:  VDVIDEVSSHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK
         +VI+ V S GK ++ D  S  + + LRC+ C+     +P+LK H+ K
Subjt:  VDVIDEVSSHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK

Q7TQC5 Aprataxin4.6e-3438.75Show/hide
Query:  ERLSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARK
        E ++ +P+ +Q+  + S      E+++    +    GS  S  +++   +K      +  G W+Q L  +   P+    + +   D VVV+ D YPKAR 
Subjt:  ERLSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARK

Query:  HLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVS
        H LV+  +  +  L  V  EHL LLK MHAVG K I   F   + L FRLGYH+ PSM  +HLHV+SQDFDS  LKNKKHWNSFNT++F +S  VI  V 
Subjt:  HLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVS

Query:  SHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK
          G+ ++KD     + + LRC+ C+   P++P+LK H+ K
Subjt:  SHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK

Q7Z2E3 Aprataxin1.6e-3445Show/hide
Query:  GSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ
        G W+Q L  +   P+    + +   + VVV+ D YPKAR H LV+  +  +  L  V +EHL LLK MH VG K I   F   + L FRLGYH+ PSM  
Subjt:  GSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ

Query:  LHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDD-ESSMSMELRCNRCRSAHPNLPKLKTHISK
        +HLHV+SQDFDS  LKNKKHWNSFNT++F +S  VI+ V   G+ +++D     + + LRC+ C+   P++P+LK H+ K
Subjt:  LHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDD-ESSMSMELRCNRCRSAHPNLPKLKTHISK

Q8K4H4 Aprataxin2.7e-3439.58Show/hide
Query:  ERLSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARK
        E ++ +P+ +Q+       K KR D  + E   +  G+  S  +++    K      +  G W+Q L  +   P+    + +   D VVV+ D YPKAR 
Subjt:  ERLSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARK

Query:  HLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVS
        H LV+  +  +  L  V  EHL LLK MHAVG K I   F   + L FRLGYH+ PSM  +HLHV+SQDFDS  LKNKKHWNSFNT++F +S  VI  V 
Subjt:  HLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVS

Query:  SHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK
          G+ ++KD     + + LRC+ C+   P++P+LK H+ K
Subjt:  SHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK

Q9M041 Transcription factor bHLH1407.4e-24258.63Show/hide
Query:  MDMDIDEN-----SAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRAD
        M++ I+E+     S    +  + K I+V+L+G PGSGKSTFC+  MRSS RPW RICQD + NGK+GTKAQCLK A  +L +GK+VF+DRCNL+ EQR++
Subjt:  MDMDIDEN-----SAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRAD

Query:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP
        F+KLGGP  +VHAVVL+LPAQ+CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  D LP GCFG+K  
Subjt:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP

Query:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHE
        D K Q GIMKF KK             +   P S S +  + +    E +A    S V                K  S    PTLAFPSIST+DF+F  E
Subjt:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHE

Query:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAA
        KA++IIVEK EEF+ KLG ARLV+VDLS GSKILSLVKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NW          RLKPGGGGVNAA
Subjt:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAA

Query:  IFSAAGPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHER
        IF AAGP LE AT+ +AN+L PG  V V LPST PL   EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY+SLF+GF+S+V+DQ K  K   + 
Subjt:  IFSAAGPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHER

Query:  LSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYP
          S   E  K              ++ ER+KK+KGSQD A   N      ++   +  KMSK W +WA AL++ AMHPE+H +VVLE  D++VV+ND YP
Subjt:  LSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYP

Query:  KARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVI
        KARKH+LV+AR E LD L DVRKE+L LL+ MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHV+SQDF+S  LKNKKHWNSF T FFRDSVDV+
Subjt:  KARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVI

Query:  DEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV
        +EV+S GKA++   E  +  ELRCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  DEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like5.3e-24358.63Show/hide
Query:  MDMDIDEN-----SAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRAD
        M++ I+E+     S    +  + K I+V+L+G PGSGKSTFC+  MRSS RPW RICQD + NGK+GTKAQCLK A  +L +GK+VF+DRCNL+ EQR++
Subjt:  MDMDIDEN-----SAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRAD

Query:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP
        F+KLGGP  +VHAVVL+LPAQ+CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  D LP GCFG+K  
Subjt:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP

Query:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHE
        D K Q GIMKF KK             +   P S S +  + +    E +A    S V                K  S    PTLAFPSIST+DF+F  E
Subjt:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHE

Query:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAA
        KA++IIVEK EEF+ KLG ARLV+VDLS GSKILSLVKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NW          RLKPGGGGVNAA
Subjt:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAA

Query:  IFSAAGPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHER
        IF AAGP LE AT+ +AN+L PG  V V LPST PL   EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY+SLF+GF+S+V+DQ K  K   + 
Subjt:  IFSAAGPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHER

Query:  LSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYP
          S   E  K              ++ ER+KK+KGSQD A   N      ++   +  KMSK W +WA AL++ AMHPE+H +VVLE  D++VV+ND YP
Subjt:  LSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYP

Query:  KARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVI
        KARKH+LV+AR E LD L DVRKE+L LL+ MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHV+SQDF+S  LKNKKHWNSF T FFRDSVDV+
Subjt:  KARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVI

Query:  DEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV
        +EV+S GKA++   E  +  ELRCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  DEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATGGATATCGACGAAAATTCGGCAGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTG
CGAACTCGTAATGCGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGACACGATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCCAGTG
CTTTGAATGATGGAAAGAATGTATTTGTGGATAGGTGCAATCTTGAAATAGAGCAGCGTGCAGACTTTGTGAAACTTGGTGGCCCTCGAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATTTCTCGTTCGGTTAAGCGGACTGGTCATGAAGGGAATTTGTCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAGCTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGACTGACGTTCAATCCGCTATAGACACGTACAAATCACTTGGTTTACATGATG
CACTTCCAGATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAAATCCTGCTAAGACGTGTTCTAGTGCC
AATACCATTAAGGATTCTCCAATTTCTCAAAGTACCCAGGAAAAGAGCTACTCTTGTGATAAAAAGGAAGAGTCTGCCTGCACAATGTTTAGCAATGTCGATAAAGAGTT
AGAGAAAGGTGAAAATCCAGGTGTTAGATCCTTAGAAAAAAATATTTCTCAAAGTGATCCTCCAACTCTTGCATTTCCATCTATTTCGACATCAGATTTCAAGTTCAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAGGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTGATGGTAGACTTGAGTCATGGATCAAAGATTTTGTCT
TTGGTTAAAGCTAAAGCAGCCAAGAAGAATATTACTTCGACCAAGTTTTTTACATTCGTAGGTGATATAACCAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGAAGATACTGAATAAAGATATTTCGTGCAGGCGACTGAAACCGGGGGGTGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCCAGTC
TGGAAGTTGCAACTAAGCAACAAGCAAACTCACTTCGACCTGGCAATGTCGTGGCTGTTCAATTGCCTTCAACTTCTCCTTTGCTTATCAGGGAAGGAGTAACCCATGTC
ATACATGTTCTTGGACCAAACATGAATCCACAGAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAACTTCTTCGTGCTGCTTACTCTTCTCTATTTCAAGG
CTTTATTTCTATAGTCGAAGACCAATTTAAGTCAGTGAAGGGAATTCACGAACGCCTCAGCTCAGCACCTTCAGAATCACAAAAGCACTCTGAGGAAAGCCATCACAAGT
TCAAGAGAGAGGATTTGCAAAATCCTGAAAGAAGCAAAAAGTGGAAAGGATCTCAAGATTCAGCTGAAGCATTAAACCAAAACAACAATAAGACTGTCCACAAAATGAGC
AAGCACTGGGGCTCATGGGCACAAGCACTTTACAACACTGCAATGCATCCCGAGAAACATGGCGATGTTGTACTGGAGACATCAGATGATGTTGTCGTACTGAATGATGT
TTATCCAAAAGCACGCAAGCATCTTCTTGTAGTGGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACGTAAAGAACACCTTCCATTGTTGAAGACGATGCACGCCG
TGGGTTTGAAGTGGATCGATAAGTTCTTTCATGAAGATGCATCATTGGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAACTGCACCTACATGTTGTAAGC
CAGGATTTCGACTCCAGTTATCTGAAGAACAAGAAGCATTGGAATTCTTTCAACACTGATTTCTTCCGAGACTCAGTCGACGTTATTGACGAAGTCAGTAGCCATGGAAA
GGCGAGCATCAAGGACGATGAGAGCTCGATGTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTAAAAACACATATTTCCAAATGCC
AAGCGCCTTTCCCTTCCACGCTACTTGAGGGCGGGCGTTTAGTGATTGCACCAAGTTATGCTCATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATGGATATCGACGAAAATTCGGCAGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTG
CGAACTCGTAATGCGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGACACGATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCCAGTG
CTTTGAATGATGGAAAGAATGTATTTGTGGATAGGTGCAATCTTGAAATAGAGCAGCGTGCAGACTTTGTGAAACTTGGTGGCCCTCGAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATTTCTCGTTCGGTTAAGCGGACTGGTCATGAAGGGAATTTGTCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAGCTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGACTGACGTTCAATCCGCTATAGACACGTACAAATCACTTGGTTTACATGATG
CACTTCCAGATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAAATCCTGCTAAGACGTGTTCTAGTGCC
AATACCATTAAGGATTCTCCAATTTCTCAAAGTACCCAGGAAAAGAGCTACTCTTGTGATAAAAAGGAAGAGTCTGCCTGCACAATGTTTAGCAATGTCGATAAAGAGTT
AGAGAAAGGTGAAAATCCAGGTGTTAGATCCTTAGAAAAAAATATTTCTCAAAGTGATCCTCCAACTCTTGCATTTCCATCTATTTCGACATCAGATTTCAAGTTCAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAGGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTGATGGTAGACTTGAGTCATGGATCAAAGATTTTGTCT
TTGGTTAAAGCTAAAGCAGCCAAGAAGAATATTACTTCGACCAAGTTTTTTACATTCGTAGGTGATATAACCAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGAAGATACTGAATAAAGATATTTCGTGCAGGCGACTGAAACCGGGGGGTGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCCAGTC
TGGAAGTTGCAACTAAGCAACAAGCAAACTCACTTCGACCTGGCAATGTCGTGGCTGTTCAATTGCCTTCAACTTCTCCTTTGCTTATCAGGGAAGGAGTAACCCATGTC
ATACATGTTCTTGGACCAAACATGAATCCACAGAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAACTTCTTCGTGCTGCTTACTCTTCTCTATTTCAAGG
CTTTATTTCTATAGTCGAAGACCAATTTAAGTCAGTGAAGGGAATTCACGAACGCCTCAGCTCAGCACCTTCAGAATCACAAAAGCACTCTGAGGAAAGCCATCACAAGT
TCAAGAGAGAGGATTTGCAAAATCCTGAAAGAAGCAAAAAGTGGAAAGGATCTCAAGATTCAGCTGAAGCATTAAACCAAAACAACAATAAGACTGTCCACAAAATGAGC
AAGCACTGGGGCTCATGGGCACAAGCACTTTACAACACTGCAATGCATCCCGAGAAACATGGCGATGTTGTACTGGAGACATCAGATGATGTTGTCGTACTGAATGATGT
TTATCCAAAAGCACGCAAGCATCTTCTTGTAGTGGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACGTAAAGAACACCTTCCATTGTTGAAGACGATGCACGCCG
TGGGTTTGAAGTGGATCGATAAGTTCTTTCATGAAGATGCATCATTGGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAACTGCACCTACATGTTGTAAGC
CAGGATTTCGACTCCAGTTATCTGAAGAACAAGAAGCATTGGAATTCTTTCAACACTGATTTCTTCCGAGACTCAGTCGACGTTATTGACGAAGTCAGTAGCCATGGAAA
GGCGAGCATCAAGGACGATGAGAGCTCGATGTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTAAAAACACATATTTCCAAATGCC
AAGCGCCTTTCCCTTCCACGCTACTTGAGGGCGGGCGTTTAGTGATTGCACCAAGTTATGCTCATGAATGA
Protein sequenceShow/hide protein sequence
MDMDIDENSAAKGKEGQGKLIMVILVGAPGSGKSTFCELVMRSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEIEQRADFVKLGGPRVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQLGIMKFLKKAENPAKTCSSA
NTIKDSPISQSTQEKSYSCDKKEESACTMFSNVDKELEKGENPGVRSLEKNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILS
LVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDISCRRLKPGGGGVNAAIFSAAGPSLEVATKQQANSLRPGNVVAVQLPSTSPLLIREGVTHV
IHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVEDQFKSVKGIHERLSSAPSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMS
KHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVVS
QDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSYAHE