; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg01119 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg01119
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptiontranscription factor bHLH140
Genome locationCarg_Chr06:1134287..1138103
RNA-Seq ExpressionCarg01119
SyntenyCarg01119
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596332.1 Transcription factor basic helix-loop-helix 140, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0099.87Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
        SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
        LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE

Query:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
        FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
Subjt:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP

Query:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
        GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
Subjt:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA

Query:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
        ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
Subjt:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL

Query:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
        LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
Subjt:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR

Query:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
        SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRN LS
Subjt:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS

XP_022949470.1 transcription factor bHLH140 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
        SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
        LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE

Query:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
        FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
Subjt:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP

Query:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
        GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
Subjt:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA

Query:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
        ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
Subjt:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL

Query:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
        LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
Subjt:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR

Query:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
        SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
Subjt:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS

XP_022971485.1 transcription factor bHLH140 [Cucurbita maxima]0.0e+0097.3Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MD+DIDENSTA+GNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQRSEFVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
        SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRI FCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
        LGIM FLKKAENPAKTCSNANT KDFPSSQTTQEKKESSCTMLSNVNKESEKGENPG+ SLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE

Query:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
        FMDKLGNARLVMVD+SHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHC+VIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
Subjt:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP

Query:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
        G+VVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGID+RLGSAPSESRKHSEDTHHKFKRA
Subjt:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA

Query:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
        E QIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHAN VLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVH EHLPL
Subjt:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL

Query:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
        LKTMHAVG+KWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVS+HGKAIIKDDESLMSMEFRCNRCR
Subjt:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR

Query:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
        SAHPNLPKLKTH+SKCQSPFPSTLLEG RLVNASGNSRN LS
Subjt:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS

XP_023540723.1 transcription factor bHLH140 [Cucurbita pepo subsp. pepo]0.0e+0099.06Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
        SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSET+VQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
        LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGE PGV SLENNIS SD PTLAFPSISTSDFKFSHEKAAEIIVEMVEE
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE

Query:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
        FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
Subjt:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP

Query:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
        GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
Subjt:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA

Query:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
        ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHAN VLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
Subjt:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL

Query:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
        LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
Subjt:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR

Query:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
        SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRN LS
Subjt:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0085.08Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMD DENSTAKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQR++FVKL 
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
           VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAID YKSL LHD LP GCFGQKN DKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQE-------KKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAE P++T S+ANT KD P  QTTQE       K+ES+CT+  NV+ ES+KGE+PGV SLE+NIS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQE-------KKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ
        IVE VEEFMDKLGNARLV+VDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAG  LE ATKQ
Subjt:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ

Query:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT
        QA SL+PGN VAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR+AYSSLFQAFIS+V+D+FKS KGI  RLG  PSE RKHSE++
Subjt:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT

Query:  HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADV
        HHKFKR  LQ PERSKKWKG+Q+S EALNQNNNK   KMSKHWGSWAQALYNTAM+PE+H++ VLETSDDVVVLNDIYPKARKHLL+VAR+EGLDQLADV
Subjt:  HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADV

Query:  HKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSME
          EHLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV V+DEVSSHGKA + DDESLMSME
Subjt:  HKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSME

Query:  FRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNS
         RCNRCRSAHPNLPKLK HI KCQ+PFPSTLLEGGRLV A  N+
Subjt:  FRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNS

TrEMBL top hitse value%identityAlignment
A0A1S3B6C4 transcription factor bHLH1400.0e+0082.93Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMD DENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E+DV SAID YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKK-------ESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENP+KTCS+AN +K+ P+ Q TQEK+       ESSC M  NV  ESEKGE+PGV SLE  IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKK-------ESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ
        IVE VEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ

Query:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT
        QA SL PGN VAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL +AYSSLFQAFISIV+D++KS KGI+E LGS P E +KHSE++
Subjt:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT

Query:  HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADV
        HHKFKR  LQ  E SKKWKG+  S E LNQNNNK   K SKHWGSWAQALY+TAM+PERH N VLETSDDVVVL DIYPKARKHLL+VAR+EGLDQLADV
Subjt:  HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADV

Query:  HKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSME
          EHLPLL+TMHA+G+KWI KF H+DA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I DDE L+SME
Subjt:  HKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSME

Query:  FRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNS
         RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLE GRLV    N+
Subjt:  FRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0082.93Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMD DENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E+DV SAID YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKK-------ESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENP+KTCS+AN +K+ P+ Q TQEK+       ESSC M  NV  ESEKGE+PGV SLE  IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKK-------ESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ
        IVE VEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ

Query:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT
        QA SL PGN VAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL +AYSSLFQAFISIV+D++KS KGI+E LGS P E +KHSE++
Subjt:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT

Query:  HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADV
        HHKFKR  LQ  E SKKWKG+  S E LNQNNNK   K SKHWGSWAQALY+TAM+PERH N VLETSDDVVVL DIYPKARKHLL+VAR+EGLDQLADV
Subjt:  HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADV

Query:  HKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSME
          EHLPLL+TMHA+G+KWI KF H+DA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I DDE L+SME
Subjt:  HKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSME

Query:  FRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNS
         RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLE GRLV    N+
Subjt:  FRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNS

A0A6J1CV45 transcription factor bHLH1400.0e+0082.3Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMDID+NSTAKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
         S VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQE-------KKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKA+NPAKTCS+AN  KD P+ QT++E       K+E +CT+  NV+KESEKGENPGV SL ++IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQE-------KKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPG GGVNAAIFSAAGP LE ATKQ
Subjt:  IVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQ

Query:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT
        QA SLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAPSES KHSED+
Subjt:  QAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDT

Query:  --------HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYE
                 HKFKR ++Q PERSKKWKG+Q+SAEA NQNNN I HKMSKHWGSWAQALYNTAM+PERH + VLE SDDV VLNDIY KA KHLL+VARYE
Subjt:  --------HHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYE

Query:  GLDQLADVHKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKD
        GLDQLADV +EHLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKD
Subjt:  GLDQLADVHKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKD

Query:  DESLMSMEFRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
        DESLMSME RCNRCRSAHPNL KLK HISKC++PFPSTLLEG RLV A  N+ + LS
Subjt:  DESLMSMEFRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS

A0A6J1GCV0 transcription factor bHLH1400.0e+00100Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
        SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
        LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE

Query:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
        FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
Subjt:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP

Query:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
        GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
Subjt:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA

Query:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
        ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
Subjt:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL

Query:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
        LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
Subjt:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR

Query:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
        SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
Subjt:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS

A0A6J1I5W0 transcription factor bHLH1400.0e+0097.3Show/hide
Query:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG
        MD+DIDENSTA+GNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQRSEFVKLG
Subjt:  MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLG

Query:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
        SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRI FCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ
Subjt:  SSLVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQ

Query:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
        LGIM FLKKAENPAKTCSNANT KDFPSSQTTQEKKESSCTMLSNVNKESEKGENPG+ SLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE
Subjt:  LGIMRFLKKAENPAKTCSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEE

Query:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
        FMDKLGNARLVMVD+SHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHC+VIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP
Subjt:  FMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRP

Query:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA
        G+VVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGID+RLGSAPSESRKHSEDTHHKFKRA
Subjt:  GNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRA

Query:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL
        E QIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHAN VLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVH EHLPL
Subjt:  ELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPL

Query:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR
        LKTMHAVG+KWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVS+HGKAIIKDDESLMSMEFRCNRCR
Subjt:  LKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCR

Query:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS
        SAHPNLPKLKTH+SKCQSPFPSTLLEG RLVNASGNSRN LS
Subjt:  SAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)2.6e-3741.36Show/hide
Query:  IPERSKKWK--GTQESAEALNQNNNKIS-----HKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKE
        +P+++KK +   TQ S+  L  + + +S         +H G W+Q L ++   P+    + +   +  VV+ D YPKAR H L++  ++ +  L  V +E
Subjt:  IPERSKKWK--GTQESAEALNQNNNKIS-----HKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKE

Query:  HLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDES-LMSMEFR
        HL LL+ MHAVG K I +    + SL FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S +VI+ V S GK  + D  S L+ +  R
Subjt:  HLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDES-LMSMEFR

Query:  CNRCRSAHPNLPKLKTHISK
        C+ C+     +P+LK H+ K
Subjt:  CNRCRSAHPNLPKLKTHISK

Q7TQC5 Aprataxin1.8e-3542.65Show/hide
Query:  GTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMKWI
        G+  S  +++   +K      +  G W+Q L  +   P+    + +   D VVV+ D YPKAR H L++  +  +  L  V  EHL LLK MHAVG K I
Subjt:  GTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMKWI

Query:  DKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDE-SLMSMEFRCNRCRSAHPNLPKLKT
          F    + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  +KD    L+ +  RC+ C+   P++P+LK 
Subjt:  DKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDE-SLMSMEFRCNRCRSAHPNLPKLKT

Query:  HISK
        H+ K
Subjt:  HISK

Q7Z2E3 Aprataxin1.2e-3445Show/hide
Query:  GSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQ
        G W+Q L  +   P+    + +   + VVV+ D YPKAR H L++  +  +  L  V +EHL LLK MH VG K I  F    + L FRLGYH+ PSM  
Subjt:  GSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQ

Query:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDD-ESLMSMEFRCNRCRSAHPNLPKLKTHISK
        +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI+ V   G+  ++D    L+ +  RC+ C+   P++P+LK H+ K
Subjt:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDD-ESLMSMEFRCNRCRSAHPNLPKLKTHISK

Q8K4H4 Aprataxin9.1e-3541.07Show/hide
Query:  THHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLAD
        T  K KR +    E   +  G   S  +++    K      +  G W+Q L  +   P+    + +   D VVV+ D YPKAR H L++  +  +  L  
Subjt:  THHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLAD

Query:  VHKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDE-SLMS
        V  EHL LLK MHAVG K I  F    + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  +KD    L+ 
Subjt:  VHKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDE-SLMS

Query:  MEFRCNRCRSAHPNLPKLKTHISK
        +  RC+ C+   P++P+LK H+ K
Subjt:  MEFRCNRCRSAHPNLPKLKTHISK

Q9M041 Transcription factor bHLH1402.3e-24360.89Show/hide
Query:  RTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLGSSLVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQRSEF+KLG    +VHAVVL+LPAQ
Subjt:  RTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLGSSLVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQLGIMRFLKKAENPAKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELP +NEGF RI FC+S+ DV +A++ Y  LG  D LP GCFG+K  D K Q GIM+F KK      +
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQLGIMRFLKKAENPAKT

Query:  CSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEEFMDKLGNARLVMVDLS
         SN  TN       TT++  E    M +NV     K     +GS +        PTLAFPSIST+DF+F  EKA++IIVE  EEF+ KLG ARLV+VDLS
Subjt:  CSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEEFMDKLGNARLVMVDLS

Query:  HGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRPGNVVAVQLPSTSPLFN
         GSKILSLVKAKA++KNI S KFFTFVGDITKL+S+GGLHCNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +A +L PG  V V LPST PL N
Subjt:  HGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRPGNVVAVQLPSTSPLFN

Query:  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRAELQIPERSKKWKGTQE
         EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+ F+S+V+D+ K  K   +   S   E  K  ED+            ER+KK+KG+Q+
Subjt:  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRAELQIPERSKKWKGTQE

Query:  SAEALNQNNNKI------SHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMK
         A   N  +  +        KMSK W +WA AL++ AM+PERH N+VLE  D++VV+ND YPKARKH+L++AR E LD L DV KE+L LL+ MH VG+K
Subjt:  SAEALNQNNNKI------SHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMK

Query:  WIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCRSAHPNLPKLK
        W+D+F ++DASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSVDV++EV+S GKA +   E L+  E RCNRCRSAHPN+PKLK
Subjt:  WIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCRSAHPNLPKLK

Query:  THISKCQSPFPSTLLEGGRLV
        +H+  C S FP  LL+  RLV
Subjt:  THISKCQSPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT2G40600.1 appr-1-p processing enzyme family protein3.5e-0535.4Show/hide
Query:  GDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGP----NMNPQRPNYL
        GDITK   D     + I N AN R+  GGGG + AI  AAGP L  A   +   +RPG          +P FN    + VIH +GP    ++NPQ    L
Subjt:  GDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGP----NMNPQRPNYL

Query:  NNDYDEGCKLLRD
         N Y    ++ ++
Subjt:  NNDYDEGCKLLRD

AT5G01310.1 APRATAXIN-like1.6e-24460.89Show/hide
Query:  RTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLGSSLVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQRSEF+KLG    +VHAVVL+LPAQ
Subjt:  RTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLGSSLVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQLGIMRFLKKAENPAKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELP +NEGF RI FC+S+ DV +A++ Y  LG  D LP GCFG+K  D K Q GIM+F KK      +
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQLGIMRFLKKAENPAKT

Query:  CSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEEFMDKLGNARLVMVDLS
         SN  TN       TT++  E    M +NV     K     +GS +        PTLAFPSIST+DF+F  EKA++IIVE  EEF+ KLG ARLV+VDLS
Subjt:  CSNANTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEEFMDKLGNARLVMVDLS

Query:  HGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRPGNVVAVQLPSTSPLFN
         GSKILSLVKAKA++KNI S KFFTFVGDITKL+S+GGLHCNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +A +L PG  V V LPST PL N
Subjt:  HGSKILSLVKAKAAKKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRPGNVVAVQLPSTSPLFN

Query:  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRAELQIPERSKKWKGTQE
         EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+ F+S+V+D+ K  K   +   S   E  K  ED+            ER+KK+KG+Q+
Subjt:  REGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRAELQIPERSKKWKGTQE

Query:  SAEALNQNNNKI------SHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMK
         A   N  +  +        KMSK W +WA AL++ AM+PERH N+VLE  D++VV+ND YPKARKH+L++AR E LD L DV KE+L LL+ MH VG+K
Subjt:  SAEALNQNNNKI------SHKMSKHWGSWAQALYNTAMYPERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMK

Query:  WIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCRSAHPNLPKLK
        W+D+F ++DASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSVDV++EV+S GKA +   E L+  E RCNRCRSAHPN+PKLK
Subjt:  WIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCRSAHPNLPKLK

Query:  THISKCQSPFPSTLLEGGRLV
        +H+  C S FP  LL+  RLV
Subjt:  THISKCQSPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATGGATATCGACGAAAATTCGACAGCCAAAGGAAATGAACGGCGAACGAAGCTCATAATGGTGATATTAGTGGGCGCACCTGGAAGCGGAAAGTCCACCTTCTG
CGAACTCGTAATGGCTTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGATACCATTGGAAATGGCAAGTCCGGAACCAAAGCACAGTGTCTGAAGAGTGCAGCCAGTG
CATTGAATGATGGAAAGAGTGTATTTGTGGACAGATGCAATCTTGAAATAGAGCAGCGTTCTGAGTTTGTGAAACTAGGTAGCTCTCTAGTGGATGTACATGCTGTTGTA
TTGGATCTTCCTGCACAGCTCTGTATCTCCCGTTCGGTTAAGCGGACCGGTCATGAAGGGAACCTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCGAACCTGAATGAAGGGTTCAAGCGTATAACATTTTGTCACAGTGAAACGGACGTTCAATCCGCCATAGATACGTATAAATCACTTGGTTTACATGATG
CACTTCCAGATGGATGTTTTGGACAGAAGAATGCAGACAAGAAAGTACAATTAGGCATAATGAGGTTCTTAAAGAAAGCAGAAAATCCTGCTAAAACATGTTCTAATGCC
AATACCAATAAGGATTTTCCAAGTTCTCAAACTACCCAGGAAAAGAAAGAGTCTTCCTGCACGATGTTGAGCAATGTCAATAAAGAGTCGGAGAAAGGCGAAAATCCAGG
CGTTGGATCCTTAGAAAACAATATTTCTGGAAGTGATCCTCCAACTCTTGCATTTCCTTCTATTTCAACATCGGATTTCAAGTTCAGCCATGAAAAGGCTGCTGAAATTA
TTGTTGAGATGGTGGAGGAATTCATGGATAAGCTTGGGAATGCCAGACTTGTTATGGTAGACTTAAGTCATGGATCAAAGATTTTGTCTTTGGTTAAAGCTAAAGCGGCC
AAGAAAAATATTTGTTCCACCAAGTTTTTTACATTTGTAGGTGATATAACTAAACTCAAGTCAGATGGTGGATTGCACTGCAATGTAATAGCCAATGCTGCAAACTGGCG
ATTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCCAGTCTGGAAGAGGCAACAAAACAACAAGCCAAGTCCCTTCGACCTGGCAATGTTG
TGGCCGTTCAATTGCCTTCAACTTCTCCTTTGTTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCTCAAAGACCAAATTATCTCAAC
AATGATTATGATGAAGGTTGCAAGCTTCTTCGTGATGCTTATTCTTCTCTATTTCAAGCCTTTATTTCAATAGTAAAAGATGAGTTTAAGTCGGCGAAGGGAATTGACGA
ACGCCTCGGCTCAGCACCTTCAGAATCACGAAAACACTCTGAGGACACCCATCACAAGTTTAAGAGAGCGGAGTTGCAAATTCCTGAAAGAAGCAAAAAGTGGAAAGGAA
CTCAGGAATCAGCTGAAGCATTAAACCAAAACAACAATAAGATTTCCCACAAGATGAGTAAGCACTGGGGCTCATGGGCACAAGCACTTTACAACACTGCAATGTATCCC
GAGAGACATGCCAATATTGTACTGGAGACATCAGATGATGTTGTAGTACTGAATGATATTTATCCAAAAGCACGCAAGCATCTTCTGATAGTGGCTCGATATGAAGGCCT
CGATCAACTAGCTGATGTACATAAAGAACACCTTCCACTGTTGAAAACGATGCACGCCGTGGGTATGAAGTGGATTGATAAGTTCATTCATGACGATGCATCGTTGGTTT
TTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAACTGCACCTACATGTTATAAGCCAAGATTTTGACTCCAGTCATCTGAAGAACAAGAAGCATTGGAATTCTTTC
AACACCGATTTCTTCCGAGACTCGGTCGACGTTATAGACGAAGTCAGTAGCCATGGAAAGGCAATCATCAAGGACGACGAGAGCTTGATGTCTATGGAGTTTCGTTGCAA
CAGATGCAGAAGTGCCCATCCCAACTTACCCAAGCTGAAAACACATATTTCCAAATGCCAATCGCCTTTCCCTTCCACGCTACTCGAGGGCGGTCGTTTAGTGAATGCTT
CAGGTAATTCTCGTAACTTTCTTTCGTAG
mRNA sequenceShow/hide mRNA sequence
CAGCGAATTTTGGGGGAGAGCTGATCGTGGAGTATTTTACAAGCTTCACTCATCGGAGGAAAAACGTCGGCGATGGCGATTGATAGGGTTTGAAGGAAGACGCAAAACCT
AATTTAGAAATGGATATGGATATCGACGAAAATTCGACAGCCAAAGGAAATGAACGGCGAACGAAGCTCATAATGGTGATATTAGTGGGCGCACCTGGAAGCGGAAAGTC
CACCTTCTGCGAACTCGTAATGGCTTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGATACCATTGGAAATGGCAAGTCCGGAACCAAAGCACAGTGTCTGAAGAGTG
CAGCCAGTGCATTGAATGATGGAAAGAGTGTATTTGTGGACAGATGCAATCTTGAAATAGAGCAGCGTTCTGAGTTTGTGAAACTAGGTAGCTCTCTAGTGGATGTACAT
GCTGTTGTATTGGATCTTCCTGCACAGCTCTGTATCTCCCGTTCGGTTAAGCGGACCGGTCATGAAGGGAACCTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAAT
GCTGCAAAAGAAAGAATTGCCGAACCTGAATGAAGGGTTCAAGCGTATAACATTTTGTCACAGTGAAACGGACGTTCAATCCGCCATAGATACGTATAAATCACTTGGTT
TACATGATGCACTTCCAGATGGATGTTTTGGACAGAAGAATGCAGACAAGAAAGTACAATTAGGCATAATGAGGTTCTTAAAGAAAGCAGAAAATCCTGCTAAAACATGT
TCTAATGCCAATACCAATAAGGATTTTCCAAGTTCTCAAACTACCCAGGAAAAGAAAGAGTCTTCCTGCACGATGTTGAGCAATGTCAATAAAGAGTCGGAGAAAGGCGA
AAATCCAGGCGTTGGATCCTTAGAAAACAATATTTCTGGAAGTGATCCTCCAACTCTTGCATTTCCTTCTATTTCAACATCGGATTTCAAGTTCAGCCATGAAAAGGCTG
CTGAAATTATTGTTGAGATGGTGGAGGAATTCATGGATAAGCTTGGGAATGCCAGACTTGTTATGGTAGACTTAAGTCATGGATCAAAGATTTTGTCTTTGGTTAAAGCT
AAAGCGGCCAAGAAAAATATTTGTTCCACCAAGTTTTTTACATTTGTAGGTGATATAACTAAACTCAAGTCAGATGGTGGATTGCACTGCAATGTAATAGCCAATGCTGC
AAACTGGCGATTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCCAGTCTGGAAGAGGCAACAAAACAACAAGCCAAGTCCCTTCGACCTG
GCAATGTTGTGGCCGTTCAATTGCCTTCAACTTCTCCTTTGTTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCTCAAAGACCAAAT
TATCTCAACAATGATTATGATGAAGGTTGCAAGCTTCTTCGTGATGCTTATTCTTCTCTATTTCAAGCCTTTATTTCAATAGTAAAAGATGAGTTTAAGTCGGCGAAGGG
AATTGACGAACGCCTCGGCTCAGCACCTTCAGAATCACGAAAACACTCTGAGGACACCCATCACAAGTTTAAGAGAGCGGAGTTGCAAATTCCTGAAAGAAGCAAAAAGT
GGAAAGGAACTCAGGAATCAGCTGAAGCATTAAACCAAAACAACAATAAGATTTCCCACAAGATGAGTAAGCACTGGGGCTCATGGGCACAAGCACTTTACAACACTGCA
ATGTATCCCGAGAGACATGCCAATATTGTACTGGAGACATCAGATGATGTTGTAGTACTGAATGATATTTATCCAAAAGCACGCAAGCATCTTCTGATAGTGGCTCGATA
TGAAGGCCTCGATCAACTAGCTGATGTACATAAAGAACACCTTCCACTGTTGAAAACGATGCACGCCGTGGGTATGAAGTGGATTGATAAGTTCATTCATGACGATGCAT
CGTTGGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAACTGCACCTACATGTTATAAGCCAAGATTTTGACTCCAGTCATCTGAAGAACAAGAAGCATTGG
AATTCTTTCAACACCGATTTCTTCCGAGACTCGGTCGACGTTATAGACGAAGTCAGTAGCCATGGAAAGGCAATCATCAAGGACGACGAGAGCTTGATGTCTATGGAGTT
TCGTTGCAACAGATGCAGAAGTGCCCATCCCAACTTACCCAAGCTGAAAACACATATTTCCAAATGCCAATCGCCTTTCCCTTCCACGCTACTCGAGGGCGGTCGTTTAG
TGAATGCTTCAGGTAATTCTCGTAACTTTCTTTCGTAGTTTTCATTTCATAATGTCCTCCATACAAAATTTTCTGGTTGTATATGTTTTGAGCTCAATATCTTTGGGGGG
TTTATGTAGTTTTGATGGCCTCAGGAAGCCTTAGTTAATAGTGTTCTTAATCACCTTTTAGCTCTCAACCAGTTTAGTTTTTCGGAGAATTCTGATAGATCTTAATATAT
ATTTTTTTAATATATGTTTTCAAATTTTAT
Protein sequenceShow/hide protein sequence
MDMDIDENSTAKGNERRTKLIMVILVGAPGSGKSTFCELVMASSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRSEFVKLGSSLVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPNLNEGFKRITFCHSETDVQSAIDTYKSLGLHDALPDGCFGQKNADKKVQLGIMRFLKKAENPAKTCSNA
NTNKDFPSSQTTQEKKESSCTMLSNVNKESEKGENPGVGSLENNISGSDPPTLAFPSISTSDFKFSHEKAAEIIVEMVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAA
KKNICSTKFFTFVGDITKLKSDGGLHCNVIANAANWRLKPGGGGVNAAIFSAAGPSLEEATKQQAKSLRPGNVVAVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN
NDYDEGCKLLRDAYSSLFQAFISIVKDEFKSAKGIDERLGSAPSESRKHSEDTHHKFKRAELQIPERSKKWKGTQESAEALNQNNNKISHKMSKHWGSWAQALYNTAMYP
ERHANIVLETSDDVVVLNDIYPKARKHLLIVARYEGLDQLADVHKEHLPLLKTMHAVGMKWIDKFIHDDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSF
NTDFFRDSVDVIDEVSSHGKAIIKDDESLMSMEFRCNRCRSAHPNLPKLKTHISKCQSPFPSTLLEGGRLVNASGNSRNFLS