; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g33390 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g33390
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptiontranscription factor bHLH140
Genome locationchr5:24936508..24939794
RNA-Seq ExpressionMoc05g33390
SyntenyMoc05g33390
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596332.1 Transcription factor basic helix-loop-helix 140, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0082.43Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMDID+NSTAKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
         S VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKA+NPAKTCS+AN  KD P+ QT++E       K+E +CT+  NV+KESEKGENPGV SL ++IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPG GGVNAAIFSAAGP LE ATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QA SLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAPSES KHSED+
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKR ++Q PERSKKWKG+Q+SAEA NQNNN I HKMSKHWGSWAQALYNTAM+PERH + VLE SDDV VLNDIY KA KHLL+VARYE
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV +EHLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKD
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
        DESLMSME RCNRCRSAHPNL KLK HISKC++PFPSTLLEG RLV A  N+ +SLS
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+00100Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
        GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
        PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
        DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS

XP_022949470.1 transcription factor bHLH140 [Cucurbita moschata]0.0e+0082.3Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMDID+NSTAKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
         S VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKA+NPAKTCS+AN  KD P+ QT++E       K+E +CT+  NV+KESEKGENPGV SL ++IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPG GGVNAAIFSAAGP LE ATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QA SLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAPSES KHSED+
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKR ++Q PERSKKWKG+Q+SAEA NQNNN I HKMSKHWGSWAQALYNTAM+PERH + VLE SDDV VLNDIY KA KHLL+VARYE
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV +EHLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKD
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
        DESLMSME RCNRCRSAHPNL KLK HISKC++PFPSTLLEG RLV A  N+ + LS
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS

XP_023540723.1 transcription factor bHLH140 [Cucurbita pepo subsp. pepo]0.0e+0082.3Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMDID+NSTAKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
         S VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ET+VQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKA+NPAKTCS+AN  KD P+ QT++E       K+E +CT+  NV+KESEKGE PGVRSL ++IS SD PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPG GGVNAAIFSAAGP LE ATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QA SLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAPSES KHSED+
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKR ++Q PERSKKWKG+Q+SAEA NQNNN I HKMSKHWGSWAQALYNTAM+PERH +TVLE SDDV VLNDIY KA KHLL+VARYE
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV +EHLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKD
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
        DESLMSME RCNRCRSAHPNL KLK HISKC++PFPSTLLEG RLV A  N+ +SLS
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0085.62Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMD D+NSTAKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLEIEQRADFVK+ 
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
        G  VDV AVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAID YKSL LHD LP GCFGQ   D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP  QT++E S SCDKKEE ACTI  NVD ES+KGE+PGVRSL D+IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LSLVKAKAAKK I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAG GLEVATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QANSL+PGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FIS+VE++FKSVKGI   LG  PSE  KHSE+S
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKRE++QNPERSKKWKGSQDS EA NQNNN  V KMSKHWGSWAQALYNTAMHPE+H DTVLE SDDV VLNDIY KA KHLLVVAR+E
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV  EHLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV VMDEV SHGKAS+ D
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSN
        DESLMSMELRCNRCRSAHPNL KLKAHI KC+APFPSTLLEG RLVIAPSN
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSN

TrEMBL top hitse value%identityAlignment
A0A1S3B6C4 transcription factor bHLH1400.0e+0082.42Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NP+KTCSSANA K+SP  Q ++E   SCDKKEE +C +  NV  ESEKGE+PGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QANSL PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSE+S
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKRE++QN E SKKWKGS +S E  NQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV  EHLPLLRTMH +GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEV SHGKA I D
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSN
        DE L+SMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLE  RLV+ PSN
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSN

A0A5A7TLV2 Transcription factor bHLH1400.0e+0082.42Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NP+KTCSSANA K+SP  Q ++E   SCDKKEE +C +  NV  ESEKGE+PGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QANSL PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSE+S
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKRE++QN E SKKWKGS +S E  NQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV  EHLPLLRTMH +GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEV SHGKA I D
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSN
        DE L+SMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLE  RLV+ PSN
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSN

A0A6J1CV45 transcription factor bHLH1400.0e+00100Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
        GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
        PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
        DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS

A0A6J1GCV0 transcription factor bHLH1400.0e+0082.3Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MDMDID+NSTAKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
         S VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKA+NPAKTCS+AN  KD P+ QT++E       K+E +CT+  NV+KESEKGENPGV SL ++IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPG GGVNAAIFSAAGP LE ATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QA SLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAPSES KHSED+
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKR ++Q PERSKKWKG+Q+SAEA NQNNN I HKMSKHWGSWAQALYNTAM+PERH + VLE SDDV VLNDIY KA KHLL+VARYE
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV +EHLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKD
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
        DESLMSME RCNRCRSAHPNL KLK HISKC++PFPSTLLEG RLV A  N+ + LS
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS

A0A6J1I5W0 transcription factor bHLH1400.0e+0081.9Show/hide
Query:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG
        MD+DID+NSTA+G E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLEIEQR++FVK+G
Subjt:  MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIG

Query:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ
         S VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RI FCH+ETDVQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  GSGVDVHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQ

Query:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM FLKKA+NPAKTCS+AN IKD P+ QT++E       K+E +CT+  NV+KESEKGENPG+RSL ++IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKADNPAKTCSSANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLV+VD+++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL C+VIANAANWRLKPG GGVNAAIFSAAGP LE ATKQ
Subjt:  IVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS
        QA SLRPG+ V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI K LGSAPSES KHSED+
Subjt:  QANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDS

Query:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE
                 HKFKR + Q PERSKKWKG+Q+SAEA NQNNN I HKMSKHWGSWAQALYNTAM+PERH +TVLE SDDV VLNDIY KA KHLL+VARYE
Subjt:  PRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYE

Query:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD
        GLDQLADV  EHLPLL+TMH VGLKWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV +HGKA IKD
Subjt:  GLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKD

Query:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS
        DESLMSME RCNRCRSAHPNL KLK H+SKC++PFPSTLLEGSRLV A  N+ +SLS
Subjt:  DESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)1.3e-3336.36Show/hide
Query:  GSAPSESEKHSEDSPRSTFPSCDHKFKREDVQN-PERSKKWK--GSQDSAEAFNQNNNTI-----VHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDD
        G    ES   +E+  ++    C+   + +D++N P+++KK +   +Q S+     + +++          +H G W+Q L ++   P+      +   + 
Subjt:  GSAPSESEKHSEDSPRSTFPSCDHKFKREDVQN-PERSKKWK--GSQDSAEAFNQNNNTI-----VHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDD

Query:  VAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTD
          V+ D Y KA  H LV+  ++ +  L  V REHL LL  MH VG K I +   ++ SL FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T+
Subjt:  VAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTD

Query:  FFRDSVDVMDEVGSHGKASIKDDES-LMSMELRCNRCRSAHPNLLKLKAHISK
        +F +S +V++ V S GK ++ D  S L+ + LRC+ C+     + +LK H+ K
Subjt:  FFRDSVDVMDEVGSHGKASIKDDES-LMSMELRCNRCRSAHPNLLKLKAHISK

Q7YRZ1 Aprataxin2.1e-3144.38Show/hide
Query:  WAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLH
        W+Q L  +   P+      +   D V V+ D Y KA  H LV+  +  +  L  V REHL LLR MH VG K I   F   + L FRLGYH+ PSM  +H
Subjt:  WAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLH

Query:  LHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLLKLKAHISK
        LHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ +++D    L+ + LRC+ C+   P++ +LK H+ K
Subjt:  LHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLLKLKAHISK

Q7Z2E3 Aprataxin2.8e-3143.33Show/hide
Query:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ
        G W+Q L  +   P+      +   + V V+ D Y KA  H LV+  +  +  L  V REHL LL+ MH VG K I   F   + L FRLGYH+ PSM  
Subjt:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ

Query:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLLKLKAHISK
        +HLHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ +++D    L+ + LRC+ C+   P++ +LK H+ K
Subjt:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLLKLKAHISK

Q9BGQ0 Aprataxin2.8e-3143.33Show/hide
Query:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ
        G W+Q L  +   P+      +   + V V+ D Y KA  H LV+  +  +  L  V REHL LL+ MH VG K I   F   + L FRLGYH+ PSM  
Subjt:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ

Query:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLLKLKAHISK
        +HLHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ +++D    L+ + LRC+ C+   P++ +LK H+ K
Subjt:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLLKLKAHISK

Q9M041 Transcription factor bHLH1404.5e-23959.67Show/hide
Query:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC
        K I+V+L+G PGSGKSTFC+  M SS RPW+RICQD + NGK+GTKAQCLK A+ +L +GKS+F+DRCNL+ EQR++F+K+GG   +VHAVVL+LPAQ+C
Subjt:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC

Query:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQLGIMKFLKKADNPAKTCS
        ISRSVKRTGHEGNLQGG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  DTLP GCFG+ K D K Q GIMKF KK    A   S
Subjt:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQLGIMKFLKKADNPAKTCS

Query:  SANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV
        S+N            E + +  K +E    +R +  K            LG   S    PTLAFPSIST+DF+F  EKA++IIVE  EEF+ KLG ARLV
Subjt:  SANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV

Query:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST
        LVDL+ GSK+LSLVKAKA++K I+ +KFFTFVGDITKL SEGGL CNVIANA NWRLKPG GGVNAAIF AAGP LE AT+ +AN+L PG  VV  LPST
Subjt:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST

Query:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDSPRSTFPSCDHKFKREDVQ
         PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY+SLF+GF+S+V++Q K  K   +   S   E  K  EDS                  
Subjt:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDSPRSTFPSCDHKFKREDVQ

Query:  NPERSKKWKGSQDSAEAFNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH
          ER+KK+KGSQD A   N  + ++        KMSK W +WA AL++ AMHPERH + VLE  D++ V+ND Y KA KH+LV+AR E LD L DVR+E+
Subjt:  NPERSKKWKGSQDSAEAFNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH

Query:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN
        L LL+ MH+VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSVDV++EV S GKA++   E L+  ELRCN
Subjt:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN

Query:  RCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLV
        RCRSAHPN+ KLK+H+  C + FP  LL+ +RLV
Subjt:  RCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like3.2e-24059.67Show/hide
Query:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC
        K I+V+L+G PGSGKSTFC+  M SS RPW+RICQD + NGK+GTKAQCLK A+ +L +GKS+F+DRCNL+ EQR++F+K+GG   +VHAVVL+LPAQ+C
Subjt:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC

Query:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQLGIMKFLKKADNPAKTCS
        ISRSVKRTGHEGNLQGG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  DTLP GCFG+ K D K Q GIMKF KK    A   S
Subjt:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQLGIMKFLKKADNPAKTCS

Query:  SANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV
        S+N            E + +  K +E    +R +  K            LG   S    PTLAFPSIST+DF+F  EKA++IIVE  EEF+ KLG ARLV
Subjt:  SANAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV

Query:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST
        LVDL+ GSK+LSLVKAKA++K I+ +KFFTFVGDITKL SEGGL CNVIANA NWRLKPG GGVNAAIF AAGP LE AT+ +AN+L PG  VV  LPST
Subjt:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST

Query:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDSPRSTFPSCDHKFKREDVQ
         PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY+SLF+GF+S+V++Q K  K   +   S   E  K  EDS                  
Subjt:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDSPRSTFPSCDHKFKREDVQ

Query:  NPERSKKWKGSQDSAEAFNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH
          ER+KK+KGSQD A   N  + ++        KMSK W +WA AL++ AMHPERH + VLE  D++ V+ND Y KA KH+LV+AR E LD L DVR+E+
Subjt:  NPERSKKWKGSQDSAEAFNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH

Query:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN
        L LL+ MH+VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSVDV++EV S GKA++   E L+  ELRCN
Subjt:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN

Query:  RCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLV
        RCRSAHPN+ KLK+H+  C + FP  LL+ +RLV
Subjt:  RCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATATCGACGATAATTCGACCGCCAAAGGAAAGGAAGGGCAAGAGAAGCTCATAATGGTGATATTAGTTGGTGCACCAGGAAGCGGCAAGTCCACCTTCTG
CGAACTCGTAATGGGTTCCTCTTCTCGCCCGTGGGCTCGAATTTGCCAGGACACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGCGCATCAAGTG
CATTGAGTGATGGAAAGAGTATATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTAAAAATTGGCGGCTCTGGAGTGGATGTACATGCTGTTGTA
CTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCAGTTAAGCGGACCGGTCATGAAGGGAATTTACAAGGAGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAGGAATTACCCAAGTTAAATGAAGGGTTTACTCGCATAACGTTTTGTCACAATGAGACCGATGTTCAATCTGCTATAGATACGTACAAATCACTTGGTTTACATGATA
CGCTTCCACATGGATGTTTTGGACAAAATAAATCAGACAATAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGACAATCCTGCTAAGACATGTTCTAGTGCC
AATGCCATTAAGGATTCTCCAAATCTGCAAACTTCCCGGGAAAATAGCTACTCTTGTGATAAAAAGGAAGAGCCTGCCTGCACAATAAGGGGCAATGTCGATAAAGAGTC
AGAGAAAGGCGAAAATCCAGGTGTAAGATCGTTAGGAGACAGTATTTCTCGAAGTGATCCTCCAACCCTTGCATTTCCGTCAATTTCGACATCGGATTTCAAGTTCAGCC
ATGAAAAGGCTGCTGAAATTATTGTTGAGACGGTTGAGGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTTTTAGTAGACTTGACTTATGGGTCAAAGATGTTGTCT
TTGGTTAAAGCTAAAGCAGCCAAGAAAAAGATTAATTGCTCCAAGTTTTTTACATTCGTTGGAGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTGAAACCGGGGGCTGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCGGGTCTGGAAGTGGCAACTAAACAACAAGCGAACT
CCCTTCGACCCGGCAATACCGTCGTTGCTCAATTGCCCTCAACTTCTCCTTTGTTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCA
CAGAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAACTTCTCCGTGCCGCTTACTCTTCCCTATTTCAAGGTTTTATTTCAATAGTAGAAAACCAATTTAA
GTCGGTGAAGGGAATTCAGAAACACCTTGGCTCGGCGCCTTCAGAATCAGAAAAGCACTCCGAGGACAGCCCAAGAAGTACTTTCCCGAGTTGCGATCACAAGTTTAAGA
GAGAGGATGTGCAGAATCCTGAAAGAAGCAAAAAGTGGAAAGGATCTCAGGATTCAGCTGAAGCATTCAACCAAAATAACAATACGATTGTCCACAAAATGAGTAAGCAC
TGGGGATCATGGGCACAAGCACTCTACAATACTGCAATGCATCCTGAGAGACATGGCGATACAGTACTGGAGTTATCAGATGATGTTGCAGTACTGAATGATATCTATGC
AAAGGCACATAAGCATCTTCTGGTAGTGGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACGGAGAGAACACCTTCCATTGTTGAGAACGATGCACGATGTGGGTT
TGAAGTGGATAGATAAGTTCTTTCATGAAGATGCATCATTAGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAGCTGCACCTACATGTTATAAGCCAGGAT
TTCGACTCAAGTCATCTGAAGAACAAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTAGACGTTATGGACGAAGTCGGTAGCCATGGAAAGGCAAG
CATCAAGGACGACGAGAGCTTGATGTCTATGGAGTTGCGTTGCAACCGATGCAGAAGTGCACATCCCAACTTACTCAAATTGAAAGCACATATTTCAAAATGTCGAGCGC
CTTTCCCTTCCACACTACTCGAGGGCAGTCGTTTAGTGATTGCACCAAGTAATACTCATGACTCTCTATCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACATGGATATCGACGATAATTCGACCGCCAAAGGAAAGGAAGGGCAAGAGAAGCTCATAATGGTGATATTAGTTGGTGCACCAGGAAGCGGCAAGTCCACCTTCTG
CGAACTCGTAATGGGTTCCTCTTCTCGCCCGTGGGCTCGAATTTGCCAGGACACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGCGCATCAAGTG
CATTGAGTGATGGAAAGAGTATATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTAAAAATTGGCGGCTCTGGAGTGGATGTACATGCTGTTGTA
CTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCAGTTAAGCGGACCGGTCATGAAGGGAATTTACAAGGAGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAGGAATTACCCAAGTTAAATGAAGGGTTTACTCGCATAACGTTTTGTCACAATGAGACCGATGTTCAATCTGCTATAGATACGTACAAATCACTTGGTTTACATGATA
CGCTTCCACATGGATGTTTTGGACAAAATAAATCAGACAATAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGACAATCCTGCTAAGACATGTTCTAGTGCC
AATGCCATTAAGGATTCTCCAAATCTGCAAACTTCCCGGGAAAATAGCTACTCTTGTGATAAAAAGGAAGAGCCTGCCTGCACAATAAGGGGCAATGTCGATAAAGAGTC
AGAGAAAGGCGAAAATCCAGGTGTAAGATCGTTAGGAGACAGTATTTCTCGAAGTGATCCTCCAACCCTTGCATTTCCGTCAATTTCGACATCGGATTTCAAGTTCAGCC
ATGAAAAGGCTGCTGAAATTATTGTTGAGACGGTTGAGGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTTTTAGTAGACTTGACTTATGGGTCAAAGATGTTGTCT
TTGGTTAAAGCTAAAGCAGCCAAGAAAAAGATTAATTGCTCCAAGTTTTTTACATTCGTTGGAGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTGAAACCGGGGGCTGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCGGGTCTGGAAGTGGCAACTAAACAACAAGCGAACT
CCCTTCGACCCGGCAATACCGTCGTTGCTCAATTGCCCTCAACTTCTCCTTTGTTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCA
CAGAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAACTTCTCCGTGCCGCTTACTCTTCCCTATTTCAAGGTTTTATTTCAATAGTAGAAAACCAATTTAA
GTCGGTGAAGGGAATTCAGAAACACCTTGGCTCGGCGCCTTCAGAATCAGAAAAGCACTCCGAGGACAGCCCAAGAAGTACTTTCCCGAGTTGCGATCACAAGTTTAAGA
GAGAGGATGTGCAGAATCCTGAAAGAAGCAAAAAGTGGAAAGGATCTCAGGATTCAGCTGAAGCATTCAACCAAAATAACAATACGATTGTCCACAAAATGAGTAAGCAC
TGGGGATCATGGGCACAAGCACTCTACAATACTGCAATGCATCCTGAGAGACATGGCGATACAGTACTGGAGTTATCAGATGATGTTGCAGTACTGAATGATATCTATGC
AAAGGCACATAAGCATCTTCTGGTAGTGGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACGGAGAGAACACCTTCCATTGTTGAGAACGATGCACGATGTGGGTT
TGAAGTGGATAGATAAGTTCTTTCATGAAGATGCATCATTAGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAGCTGCACCTACATGTTATAAGCCAGGAT
TTCGACTCAAGTCATCTGAAGAACAAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTAGACGTTATGGACGAAGTCGGTAGCCATGGAAAGGCAAG
CATCAAGGACGACGAGAGCTTGATGTCTATGGAGTTGCGTTGCAACCGATGCAGAAGTGCACATCCCAACTTACTCAAATTGAAAGCACATATTTCAAAATGTCGAGCGC
CTTTCCCTTCCACACTACTCGAGGGCAGTCGTTTAGTGATTGCACCAAGTAATACTCATGACTCTCTATCGTAG
Protein sequenceShow/hide protein sequence
MDMDIDDNSTAKGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVV
LDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQNKSDNKVQLGIMKFLKKADNPAKTCSSA
NAIKDSPNLQTSRENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLVLVDLTYGSKMLS
LVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGAGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNP
QRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPSESEKHSEDSPRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEAFNQNNNTIVHKMSKH
WGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQD
FDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCNRCRSAHPNLLKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDSLS