; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016750 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016750
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptiontranscription factor bHLH140
Genome locationscaffold563:624822..627960
RNA-Seq ExpressionMS016750
SyntenyMS016750
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596332.1 Transcription factor basic helix-loop-helix 140, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0082.8Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G S VDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHD LP GCFGQK +D KVQLGIM+FLKKA+
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NPAKTCS+AN  KD P+ QT+QE       K+E +CT+  NV+KESEKGENPGV SL ++IS SDPPTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQQA SLRPGN V
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
          QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAP ES KHSED+         HK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKR ++Q PERSKKWKG+Q+SAEALNQNNN I HKMSKHWGSWAQALYNTAM+PERH + VLE SDDV VLNDIY KA KHLL+VARYEGLDQLADV +E
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKDDESLMSME RC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS
        NRCRSAHPNLPKLK HISKC++PFPSTLLEG RLV A  N+ +S
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0099.19Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQ KSDNKVQLGIMKFLKKAD
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NPAKTCSSANAIKDSPNLQTS+ENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
        VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAP ESEKHSEDSPRSTFPSCDHK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKREDVQNPERSKKWKGSQDSAEA NQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS
        NRCRSAHPNL KLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS

XP_023540723.1 transcription factor bHLH140 [Cucurbita pepo subsp. pepo]0.0e+0082.66Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G S VDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ET+VQSAIDTYKSLGLHD LP GCFGQK +D KVQLGIM+FLKKA+
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NPAKTCS+AN  KD P+ QT+QE       K+E +CT+  NV+KESEKGE PGVRSL ++IS SD PTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQQA SLRPGN V
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
          QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAP ES KHSED+         HK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKR ++Q PERSKKWKG+Q+SAEALNQNNN I HKMSKHWGSWAQALYNTAM+PERH +TVLE SDDV VLNDIY KA KHLL+VARYEGLDQLADV +E
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKDDESLMSME RC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS
        NRCRSAHPNLPKLK HISKC++PFPSTLLEG RLV A  N+ +S
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0086.22Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLEIEQRADFVK+ G  VDV AVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAID YKSL LHD LP GCFGQK  D KVQLGIMKFLKKA+
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
         P++T SSAN +KDSP  QT+QE S SCDKKEE ACTI  NVD ES+KGE+PGVRSL D+IS+SDPPTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLVLVDL++GSK+LSLVKAKAAKK I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSL+PGN V
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
          QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FIS+VE++FKSVKGI   LG  P E  KHSE+S         HK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKRE++QNPERSKKWKGSQDS EALNQNNN  V KMSKHWGSWAQALYNTAMHPE+H DTVLE SDDV VLNDIY KA KHLLVVAR+EGLDQLADV  E
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV VMDEV SHGKAS+ DDESLMSMELRC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSN
        NRCRSAHPNLPKLKAHI KC+APFPSTLLEG RLVIAPSN
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSN

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0086.16Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLEIEQRADFVK+ G  VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKADNPAKTCSSAN
        SVKRTGHEGNL GGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAID YKSL LHD LP GCFGQK  D KVQLGIMKFLKKA+ P++T SSAN
Subjt:  SVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKADNPAKTCSSAN

Query:  AIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLVLVD
         +KDSP  QT+QE S SCDKKEE ACTI  NVD ES+KGE+PGVRSL D+IS+SDPPTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDKLGNARLVLVD
Subjt:  AIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLVLVD

Query:  LTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPSTSPL
        L++GSK+LSLVKAKAAKK I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSL+PGN V  QLPSTSPL
Subjt:  LTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPSTSPL

Query:  FNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHKFKREDVQNPE
        FNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FIS+VE++FKSVKGI   LG  P E  KHSE+S         HKFKRE++QNPE
Subjt:  FNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHKFKREDVQNPE

Query:  RSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHD
        RSKKWKGSQDS EALNQNNN  V KMSKHWGSWAQALYNTAMHPE+H DTVLE SDDV VLNDIY KA KHLLVVAR+EGLDQLADV  EHLPLLRTMH 
Subjt:  RSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHD

Query:  VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCNRCRSAHPNL
        VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV VMDEV SHGKAS+ DDESLMSMELRCNRCRSAHPNL
Subjt:  VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCNRCRSAHPNL

Query:  PKLKAHISKCRAPFPSTLLEGSRLVIAPSN
        PKLKAHI KC+APFPSTLLEG RLVIAPSN
Subjt:  PKLKAHISKCRAPFPSTLLEGSRLVIAPSN

TrEMBL top hitse value%identityAlignment
A0A1S3B6C4 transcription factor bHLH1400.0e+0083.24Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+GG  VDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LPHGCFGQK  D KVQLGIMKFLKKA+
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NP+KTCSSANA K+SP  Q +QE   SCDKKEE +C +  NV  ESEKGE+PGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSL PGN V
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
          QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSE+S         HK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKRE++QN E SKKWKGS +S E LNQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+EGLDQLADV  E
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLLRTMH +GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEV SHGKA I DDE L+SMELRC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSN
        NRCRSAHPNLPKLKAHISKC+APFPSTLLE  RLV+ PSN
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSN

A0A5A7TLV2 Transcription factor bHLH1400.0e+0083.24Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+GG  VDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LPHGCFGQK  D KVQLGIMKFLKKA+
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NP+KTCSSANA K+SP  Q +QE   SCDKKEE +C +  NV  ESEKGE+PGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSL PGN V
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
          QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSE+S         HK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKRE++QN E SKKWKGS +S E LNQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+EGLDQLADV  E
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLLRTMH +GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEV SHGKA I DDE L+SMELRC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSN
        NRCRSAHPNLPKLKAHISKC+APFPSTLLE  RLV+ PSN
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSN

A0A6J1CV45 transcription factor bHLH1400.0e+0099.19Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQ KSDNKVQLGIMKFLKKAD
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NPAKTCSSANAIKDSPNLQTS+ENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
        VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAP ESEKHSEDSPRSTFPSCDHK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKREDVQNPERSKKWKGSQDSAEA NQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS
        NRCRSAHPNL KLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS

A0A6J1GCV0 transcription factor bHLH1400.0e+0082.77Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        KG E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQR++FVK+G S VDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHD LP GCFGQK +D KVQLGIM+FLKKA+
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NPAKTCS+AN  KD P+ QT+QE       K+E +CT+  NV+KESEKGENPGV SL ++IS SDPPTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQQA SLRPGN V
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
          QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI + LGSAP ES KHSED+         HK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKR ++Q PERSKKWKG+Q+SAEALNQNNN I HKMSKHWGSWAQALYNTAM+PERH + VLE SDDV VLNDIY KA KHLL+VARYEGLDQLADV +E
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLL+TMH VG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV SHGKA IKDDESLMSME RC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHD
        NRCRSAHPNLPKLK HISKC++PFPSTLLEG RLV A  N+ +
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHD

A0A6J1I5W0 transcription factor bHLH1400.0e+0082.39Show/hide
Query:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL
        +G E + KLIMVILVGAPGSGKSTFCELVM SSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLEIEQR++FVK+G S VDVHAVVL
Subjt:  KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVL

Query:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD
        DLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP LNEGF RI FCH+ETDVQSAIDTYKSLGLHD LP GCFGQK +D KVQLGIM FLKKA+
Subjt:  DLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKAD

Query:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK
        NPAKTCS+AN IKD P+ QT+QE       K+E +CT+  NV+KESEKGENPG+RSL ++IS SDPPTLAFPSISTSDFKFSHEKAAEIIVE VEEFMDK
Subjt:  NPAKTCSSANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDK

Query:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV
        LGNARLV+VD+++GSK+LSLVKAKAAKK I  +KFFTFVGDITKL S+GGL C+VIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQQA SLRPG+ V
Subjt:  LGNARLVLVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTV

Query:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK
          QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AYSSLFQ FISIV+++FKS KGI K LGSAP ES KHSED+         HK
Subjt:  VAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHK

Query:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE
        FKR + Q PERSKKWKG+Q+SAEALNQNNN I HKMSKHWGSWAQALYNTAM+PERH +TVLE SDDV VLNDIY KA KHLL+VARYEGLDQLADV  E
Subjt:  FKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRRE

Query:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC
        HLPLL+TMH VGLKWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDV+DEV +HGKA IKDDESLMSME RC
Subjt:  HLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRC

Query:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS
        NRCRSAHPNLPKLK H+SKC++PFPSTLLEGSRLV A  N+ +S
Subjt:  NRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)6.3e-3637.55Show/hide
Query:  GSAPEESEKHSEDSPRSTFPSCDHKFKREDVQN-PERSKKWK--GSQDSAEALNQNNNTI-----VHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDD
        G   EES   +E+  ++    C+   + +D++N P+++KK +   +Q S+  L  + +++          +H G W+Q L ++   P+      +   + 
Subjt:  GSAPEESEKHSEDSPRSTFPSCDHKFKREDVQN-PERSKKWK--GSQDSAEALNQNNNTI-----VHKMSKHWGSWAQALYNTAMHPERHGDTVLELSDD

Query:  VAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTD
          V+ D Y KA  H LV+  ++ +  L  V REHL LL  MH VG K I +   ++ SL FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T+
Subjt:  VAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTD

Query:  FFRDSVDVMDEVGSHGKASIKDDES-LMSMELRCNRCRSAHPNLPKLKAHISK
        +F +S +V++ V S GK ++ D  S L+ + LRC+ C+     +P+LK H+ K
Subjt:  FFRDSVDVMDEVGSHGKASIKDDES-LMSMELRCNRCRSAHPNLPKLKAHISK

Q7YRZ1 Aprataxin1.1e-3244.94Show/hide
Query:  WAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLH
        W+Q L  +   P+      +   D V V+ D Y KA  H LV+  +  +  L  V REHL LLR MH VG K I   F   + L FRLGYH+ PSM  +H
Subjt:  WAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLH

Query:  LHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK
        LHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ +++D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  LHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK

Q7Z2E3 Aprataxin1.5e-3243.89Show/hide
Query:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ
        G W+Q L  +   P+      +   + V V+ D Y KA  H LV+  +  +  L  V REHL LL+ MH VG K I   F   + L FRLGYH+ PSM  
Subjt:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ

Query:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK
        +HLHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ +++D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK

Q9BGQ0 Aprataxin1.5e-3243.89Show/hide
Query:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ
        G W+Q L  +   P+      +   + V V+ D Y KA  H LV+  +  +  L  V REHL LL+ MH VG K I   F   + L FRLGYH+ PSM  
Subjt:  GSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQ

Query:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK
        +HLHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ +++D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  LHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK

Q9M041 Transcription factor bHLH1402.1e-24159.81Show/hide
Query:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC
        K I+V+L+G PGSGKSTFC+  M SS RPW+RICQD + NGK+GTKAQCLK A+ +L +GKS+F+DRCNL+ EQR++F+K+GG   +VHAVVL+LPAQ+C
Subjt:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC

Query:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKADNPAKTCS
        ISRSVKRTGHEGNLQGG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  DTLP GCFG+KK D K Q GIMKF KK    A   S
Subjt:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKADNPAKTCS

Query:  SANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV
        S+N            E + +  K +E    +R +  K            LG   S    PTLAFPSIST+DF+F  EKA++IIVE  EEF+ KLG ARLV
Subjt:  SANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV

Query:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST
        LVDL+ GSK+LSLVKAKA++K I+ +KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG  VV  LPST
Subjt:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST

Query:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHKFKREDVQ
         PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY+SLF+GF+S+V++Q K  K  +    +  +  E   EDS                  
Subjt:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHKFKREDVQ

Query:  NPERSKKWKGSQDSAEALNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH
          ER+KK+KGSQD A   N  + ++        KMSK W +WA AL++ AMHPERH + VLE  D++ V+ND Y KA KH+LV+AR E LD L DVR+E+
Subjt:  NPERSKKWKGSQDSAEALNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH

Query:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN
        L LL+ MH+VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSVDV++EV S GKA++   E L+  ELRCN
Subjt:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN

Query:  RCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLV
        RCRSAHPN+PKLK+H+  C + FP  LL+ +RLV
Subjt:  RCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLV

Arabidopsis top hitse value%identityAlignment
AT2G40600.1 appr-1-p processing enzyme family protein2.3e-0434.82Show/hide
Query:  GDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGP----NMNPQRPNYL
        GDITK + +     + I N AN R+  GGGG + AI  AAGP L  A   +   +RPG          +P FN    + VIH +GP    ++NPQ    L
Subjt:  GDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGP----NMNPQRPNYL

Query:  NNDYDEGCKLLR
         N Y    ++ +
Subjt:  NNDYDEGCKLLR

AT5G01310.1 APRATAXIN-like1.5e-24259.81Show/hide
Query:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC
        K I+V+L+G PGSGKSTFC+  M SS RPW+RICQD + NGK+GTKAQCLK A+ +L +GKS+F+DRCNL+ EQR++F+K+GG   +VHAVVL+LPAQ+C
Subjt:  KLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLC

Query:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKADNPAKTCS
        ISRSVKRTGHEGNLQGG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  DTLP GCFG+KK D K Q GIMKF KK    A   S
Subjt:  ISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKADNPAKTCS

Query:  SANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV
        S+N            E + +  K +E    +R +  K            LG   S    PTLAFPSIST+DF+F  EKA++IIVE  EEF+ KLG ARLV
Subjt:  SANAIKDSPNLQTSQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLV

Query:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST
        LVDL+ GSK+LSLVKAKA++K I+ +KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG  VV  LPST
Subjt:  LVDLTYGSKMLSLVKAKAAKKKINCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPST

Query:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHKFKREDVQ
         PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY+SLF+GF+S+V++Q K  K  +    +  +  E   EDS                  
Subjt:  SPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHKFKREDVQ

Query:  NPERSKKWKGSQDSAEALNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH
          ER+KK+KGSQD A   N  + ++        KMSK W +WA AL++ AMHPERH + VLE  D++ V+ND Y KA KH+LV+AR E LD L DVR+E+
Subjt:  NPERSKKWKGSQDSAEALNQNNNTIV------HKMSKHWGSWAQALYNTAMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREH

Query:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN
        L LL+ MH+VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSVDV++EV S GKA++   E L+  ELRCN
Subjt:  LPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCN

Query:  RCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLV
        RCRSAHPN+PKLK+H+  C + FP  LL+ +RLV
Subjt:  RCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AAAGGAAAGGAAGGGCAAGAGAAGCTCATAATGGTGATATTAGTTGGTGCACCAGGAAGCGGCAAGTCCACCTTCTGCGAACTCGTAATGGGTTCCTCTTCTCGCCCGTG
GGCTCGAATTTGCCAGGACACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGCGCATCAAGTGCATTGAGTGATGGAAAGAGTATATTTGTGGACA
GGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTAAAAATTGGCGGCTCTGGAGTGGATGTACATGCTGTTGTACTAGATCTTCCTGCACAGCTCTGTATCTCTCGT
TCAGTTAAGCGGACCGGTCATGAAGGGAATTTACAAGGAGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAAGAAGGAATTACCCAAGTTAAATGAAGGGTTTAC
TCGCATAACGTTTTGTCACAATGAGACCGATGTTCAATCTGCTATAGATACGTACAAATCACTTGGTTTACATGATACGCTTCCACATGGATGTTTTGGACAAAAGAAAT
CAGACAATAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGACAATCCTGCTAAGACATGTTCTAGTGCCAATGCCATTAAGGATTCTCCAAATCTGCAAACT
TCCCAGGAAAATAGCTACTCTTGTGATAAAAAGGAAGAGCCTGCCTGCACAATAAGGGGCAATGTCGATAAAGAGTCAGAGAAAGGCGAAAATCCAGGTGTAAGATCGTT
AGGAGACAGTATTTCTCGAAGTGATCCTCCAACCCTTGCATTTCCGTCAATTTCAACATCGGATTTCAAGTTCAGCCATGAAAAGGCTGCTGAAATTATTGTTGAGACGG
TTGAGGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTTTTAGTAGACTTGACTTATGGGTCAAAGATGTTGTCTTTGGTTAAAGCTAAAGCAGCCAAGAAAAAGATT
AATTGCTCCAAGTTTTTTACATTCGTTGGAGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTTATAGCCAATGCTGCAAACTGGCGACTGAAACCGGG
GGGTGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCGGGTCTGGAAGTGGCAACTAAACAACAAGCGAACTCCCTTCGACCCGGCAATACCGTCGTTGCTCAAT
TGCCCTCGACTTCTCCTTTATTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCACAGAGGCCAAATTATCTCAACAATGACTATGAT
GAAGGTTGCAAACTTCTCCGTGCCGCTTACTCTTCCCTATTTCAAGGTTTTATTTCAATAGTAGAAAACCAATTTAAGTCGGTGAAGGGAATTCAGAAACACCTTGGCTC
GGCGCCTGAAGAATCAGAAAAGCACTCCGAGGACAGCCCAAGAAGTACTTTCCCGAGTTGCGATCACAAGTTTAAGAGAGAGGATGTGCAGAATCCTGAAAGAAGCAAAA
AGTGGAAAGGATCTCAGGATTCAGCTGAAGCATTAAACCAAAACAACAATACGATTGTCCACAAAATGAGTAAGCACTGGGGATCATGGGCACAAGCACTCTACAATACT
GCAATGCATCCCGAGAGACATGGCGATACAGTACTGGAGTTATCAGATGATGTTGCAGTACTGAATGATATCTATGCAAAGGCACATAAGCATCTTCTGGTAGTGGCTCG
GTATGAAGGCCTCGATCAACTAGCCGATGTACGGAGAGAACACCTTCCATTGTTGAGAACGATGCACGATGTGGGTTTGAAGTGGATAGATAAGTTCTTTCATGAAGATG
CATCATTAGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGACTCAAGTCATCTGAAGAACAAGAAGCAT
TGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTAGACGTTATGGACGAAGTCGGTAGCCATGGAAAGGCAAGCATCAAGGACGACGAGAGCTTGATGTCTATGGA
GTTGCGTTGCAACCGATGCAGAAGTGCACATCCCAACTTACCCAAATTGAAAGCACATATTTCAAAATGTCGAGCGCCTTTCCCTTCCACACTACTCGAGGGCAGTCGTT
TAGTGATTGCACCAAGTAATACTCATGACTCT
mRNA sequenceShow/hide mRNA sequence
AAAGGAAAGGAAGGGCAAGAGAAGCTCATAATGGTGATATTAGTTGGTGCACCAGGAAGCGGCAAGTCCACCTTCTGCGAACTCGTAATGGGTTCCTCTTCTCGCCCGTG
GGCTCGAATTTGCCAGGACACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGCGCATCAAGTGCATTGAGTGATGGAAAGAGTATATTTGTGGACA
GGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTAAAAATTGGCGGCTCTGGAGTGGATGTACATGCTGTTGTACTAGATCTTCCTGCACAGCTCTGTATCTCTCGT
TCAGTTAAGCGGACCGGTCATGAAGGGAATTTACAAGGAGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAAGAAGGAATTACCCAAGTTAAATGAAGGGTTTAC
TCGCATAACGTTTTGTCACAATGAGACCGATGTTCAATCTGCTATAGATACGTACAAATCACTTGGTTTACATGATACGCTTCCACATGGATGTTTTGGACAAAAGAAAT
CAGACAATAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGACAATCCTGCTAAGACATGTTCTAGTGCCAATGCCATTAAGGATTCTCCAAATCTGCAAACT
TCCCAGGAAAATAGCTACTCTTGTGATAAAAAGGAAGAGCCTGCCTGCACAATAAGGGGCAATGTCGATAAAGAGTCAGAGAAAGGCGAAAATCCAGGTGTAAGATCGTT
AGGAGACAGTATTTCTCGAAGTGATCCTCCAACCCTTGCATTTCCGTCAATTTCAACATCGGATTTCAAGTTCAGCCATGAAAAGGCTGCTGAAATTATTGTTGAGACGG
TTGAGGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTTTTAGTAGACTTGACTTATGGGTCAAAGATGTTGTCTTTGGTTAAAGCTAAAGCAGCCAAGAAAAAGATT
AATTGCTCCAAGTTTTTTACATTCGTTGGAGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTTATAGCCAATGCTGCAAACTGGCGACTGAAACCGGG
GGGTGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCGGGTCTGGAAGTGGCAACTAAACAACAAGCGAACTCCCTTCGACCCGGCAATACCGTCGTTGCTCAAT
TGCCCTCGACTTCTCCTTTATTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCACAGAGGCCAAATTATCTCAACAATGACTATGAT
GAAGGTTGCAAACTTCTCCGTGCCGCTTACTCTTCCCTATTTCAAGGTTTTATTTCAATAGTAGAAAACCAATTTAAGTCGGTGAAGGGAATTCAGAAACACCTTGGCTC
GGCGCCTGAAGAATCAGAAAAGCACTCCGAGGACAGCCCAAGAAGTACTTTCCCGAGTTGCGATCACAAGTTTAAGAGAGAGGATGTGCAGAATCCTGAAAGAAGCAAAA
AGTGGAAAGGATCTCAGGATTCAGCTGAAGCATTAAACCAAAACAACAATACGATTGTCCACAAAATGAGTAAGCACTGGGGATCATGGGCACAAGCACTCTACAATACT
GCAATGCATCCCGAGAGACATGGCGATACAGTACTGGAGTTATCAGATGATGTTGCAGTACTGAATGATATCTATGCAAAGGCACATAAGCATCTTCTGGTAGTGGCTCG
GTATGAAGGCCTCGATCAACTAGCCGATGTACGGAGAGAACACCTTCCATTGTTGAGAACGATGCACGATGTGGGTTTGAAGTGGATAGATAAGTTCTTTCATGAAGATG
CATCATTAGTTTTTCGCCTCGGATACCACTCGGCTCCATCCATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGACTCAAGTCATCTGAAGAACAAGAAGCAT
TGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTAGACGTTATGGACGAAGTCGGTAGCCATGGAAAGGCAAGCATCAAGGACGACGAGAGCTTGATGTCTATGGA
GTTGCGTTGCAACCGATGCAGAAGTGCACATCCCAACTTACCCAAATTGAAAGCACATATTTCAAAATGTCGAGCGCCTTTCCCTTCCACACTACTCGAGGGCAGTCGTT
TAGTGATTGCACCAAGTAATACTCATGACTCT
Protein sequenceShow/hide protein sequence
KGKEGQEKLIMVILVGAPGSGKSTFCELVMGSSSRPWARICQDTIGNGKSGTKAQCLKSASSALSDGKSIFVDRCNLEIEQRADFVKIGGSGVDVHAVVLDLPAQLCISR
SVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDTLPHGCFGQKKSDNKVQLGIMKFLKKADNPAKTCSSANAIKDSPNLQT
SQENSYSCDKKEEPACTIRGNVDKESEKGENPGVRSLGDSISRSDPPTLAFPSISTSDFKFSHEKAAEIIVETVEEFMDKLGNARLVLVDLTYGSKMLSLVKAKAAKKKI
NCSKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLRPGNTVVAQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYD
EGCKLLRAAYSSLFQGFISIVENQFKSVKGIQKHLGSAPEESEKHSEDSPRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAEALNQNNNTIVHKMSKHWGSWAQALYNT
AMHPERHGDTVLELSDDVAVLNDIYAKAHKHLLVVARYEGLDQLADVRREHLPLLRTMHDVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKH
WNSFNTDFFRDSVDVMDEVGSHGKASIKDDESLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLEGSRLVIAPSNTHDS