CuGenDBv2

Sed0022379 (gene) of Chayote v1 genome

Gene ID: Sed0022379
Organism: Sechium edule (Chayote v1)
Description: transcription factor bHLH140
Genome location: LG02:33642769..33649001
RNA-Seq Expression: Sed0022379
Synteny: Sed0022379
Gene Ontology terms:
  GO:0000012 - single strand break repair (biological process)
  GO:0006302 - double-strand break repair (biological process)
  GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:1990165 - single-strand break-containing DNA binding (molecular function)
  GO:0047627 - adenylylsulfatase activity (molecular function)
  GO:0046872 - metal ion binding (molecular function)
  GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
  GO:0030983 - mismatched DNA binding (molecular function)
  GO:0003725 - double-stranded RNA binding (molecular function)
  GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domains:
  IPR043472 - Macro domain-like
  IPR036265 - HIT-like superfamily
  IPR027417 - P-loop containing nucleoside triphosphate hydrolase
  IPR026963 - Aprataxin-like
  IPR019808 - Histidine triad, conserved site
  IPR011146 - HIT-like domain
  IPR002589 - Macro domain


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAG6596332.1 Transcription factor basic helix-loop-helix 140, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 0.0e+00; identity: 81.98%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMDID+N  AKG E R KL+MVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNG+SGTKAQCLKSAA+ALN+G +VFVDRCNLEIEQR++F+KLG
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
           VD HAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIM+FLKK ENP+K CS+ANT KD P SQ TQE       K+E +CT+LSNV+KES K EN    SLE+NIS SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVE VEEFMD+LGNARL MVDLSHGSKILSLVK +A +KNI  TKF TFVGDITKL S+GGL CNVI N  NWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QA SLRPGN V V++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKS KGI ERLGSA SES+KHSED+
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKR +LQ PE+SKK+KG+Q S EA+ +NNNK  HKMSKHWGSWAQALYNTAM+PE H + VLETSDDV+VLNDIYPKARKHLL+VAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
         +EHLPLL+TMH VG+KWI+KF+H+DASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKNKKHWNSFNT+FFRDSV ++DEV SHGKA IKDDESLMSME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
         RCNRCRS HPNLPKLKTHISKCQ+PFPS LLEGGRLV
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

XP_022145665.1 transcription factor bHLH140 [Momordica charantia] (e-value: 0.0e+00; identity: 82.71%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMDID N  AKGKE +EKL+MVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNG+SGTKAQCLKSA++AL++G ++FVDRCNLEIEQRADF+K+G
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
        G  VD HAVVLDLPAQLCISRSVKR+GHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIMKFLKK +NP+K CSSAN +KDSP  Q ++E S SCDKKEEPACT+  NVDKES K EN   RSL D+IS SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVE VEEFMD+LGNARL +VDL++GSK+LSLVK +A +K I+ +KF TFVGDITKLNSEGGLRCNVI N  NWRLKPG GGVNAAIFSAAGP LEVATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QANSLRPGN VV ++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLR AYSSLFQGFISIVE+QFKSVKGIQ+ LGSA SES+KHSEDS
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RR--------WFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCE
         R         FKRED+QNPE+SKK+KGSQ+S EA  +NNN  VHKMSKHWGSWAQALYNTAMHPE HGD VLE SDDV VLNDIY KA KHLLVVAR E
Subjt:  RR--------WFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCE

Query:  GLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKD
        GLDQL DVRREHLPLLRTMHDVGLKWI+KF HEDASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKNKKHWNSFNT+FFRDSV +MDEVGSHGKA+IKD
Subjt:  GLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKD

Query:  DESLMSMELRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
        DESLMSMELRCNRCRS HPNL KLK HISKC+ PFPS LLEG RLV
Subjt:  DESLMSMELRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

XP_022949470.1 transcription factor bHLH140 [Cucurbita moschata] (e-value: 0.0e+00; identity: 81.98%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMDID+N  AKG E R KL+MVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNG+SGTKAQCLKSAA+ALN+G +VFVDRCNLEIEQR++F+KLG
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
           VD HAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIM+FLKK ENP+K CS+ANT KD P SQ TQE       K+E +CT+LSNV+KES K EN    SLE+NIS SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVE VEEFMD+LGNARL MVDLSHGSKILSLVK +A +KNI  TKF TFVGDITKL S+GGL CNVI N  NWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QA SLRPGN V V++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKS KGI ERLGSA SES+KHSED+
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKR +LQ PE+SKK+KG+Q S EA+ +NNNK  HKMSKHWGSWAQALYNTAM+PE H + VLETSDDV+VLNDIYPKARKHLL+VAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
         +EHLPLL+TMH VG+KWI+KF+H+DASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKNKKHWNSFNT+FFRDSV ++DEV SHGKA IKDDESLMSME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
         RCNRCRS HPNLPKLKTHISKCQ+PFPS LLEGGRLV
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

XP_023540723.1 transcription factor bHLH140 [Cucurbita pepo subsp. pepo] (e-value: 0.0e+00; identity: 81.98%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMDID+N  AKG E R KL+MVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNG+SGTKAQCLKSAA+ALN+G +VFVDRCNLEIEQR++F+KLG
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
           VD HAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ET+VQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIM+FLKK ENP+K CS+ANT KD P SQ TQE       K+E +CT+LSNV+KES K E    RSLE+NISESD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVE VEEFMD+LGNARL MVDLSHGSKILSLVK +A +KNI  TKF TFVGDITKL S+GGL CNVI N  NWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QA SLRPGN V V++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKS KGI ERLGSA SES+KHSED+
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKR +LQ PE+SKK+KG+Q S EA+ +NNNK  HKMSKHWGSWAQALYNTAM+PE H + VLETSDDV+VLNDIYPKARKHLL+VAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
         +EHLPLL+TMH VG+KWI+KF+H+DASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKNKKHWNSFNT+FFRDSV ++DEV SHGKA IKDDESLMSME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
         RCNRCRS HPNLPKLKTHISKCQ+PFPS LLEGGRLV
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida] (e-value: 0.0e+00; identity: 84.28%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMD D+N  AKGKE + KL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNG+SGTKAQCLKSAA+ALN+G NVFVDRCNLEIEQRADF+KL 
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
        G +VD  AVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAID YKSL LHD LP GCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIMKFLKK E PS+  SSANTVKDSPI Q TQEKS+SCDKKEE ACT+  NVD ES K E+   RSLEDNIS+SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVEKVEEFMD+LGNARL +VDLSHGSKILSLVK +A +KNIS TKF TFVGDITKLNSEGGLRCNVI N  NWRLKPGGGGVNAAIFSAAG  LEVATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QANSL+PGNAV V++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLR+AYSSLFQ FIS+VED+FKSVKGI  RLG   SE +KHSE+S
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKRE+LQNPE+SKK+KGSQ+STEA+ +NNNKTV KMSKHWGSWAQALYNTAMHPE H D VLETSDDV+VLNDIYPKARKHLLVVAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
          EHLPLLRTMH VGLKWI+KF HEDASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKN+KHWNSFNT+FFRDSV +MDEV SHGKA++ DDESLMSME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
        LRCNRCRS HPNLPKLK HI KCQ PFPS LLEGGRLV
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3B6C4 transcription factor bHLH140 (e-value: 0.0e+00; identity: 81.62%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMD D+N  AKGKE + KL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNG+SGT+AQCLKSA +ALN+G +VFVDRCNLEIEQRADF+KLG
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
        G QVD HAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIMKFLKK ENPSK CSSAN  K+SP  Q TQEK  SCDKKEE +C +  NV  ES K E+   RSLE  IS+SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVEKVEEFMD+LGNARL +VDLSHGSKILS+VK +ATEKNIS TKF TFVGDITKLNSEGGLRCNVI N  NWRLKPGGGGVNAAIFSAAGP LEVATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QANSL PGNAV V++PSTSPL NREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLL +AYSSLFQ FISIV+D++KSVKGI E LGS   E QKHSE+S
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKRE+LQN E SKK+KGS NSTE + +NNNKTV K SKHWGSWAQALY+TAMHPE H ++VLETSDDV+VL DIYPKARKHLLVVAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
          EHLPLLRTMH +GLKWI+KF HEDA LVFRLGYHS PSMRQLHLHVISQDFDS+HLKNKKHWNSFNT+FFRDSV ++DEV SHGKA I DDE L+SME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLVTK
        LRCNRCRS HPNLPKLK HISKCQ PFPS LLE GRLV +
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLVTK

A0A5A7TLV2 Transcription factor bHLH140 (e-value: 0.0e+00; identity: 81.62%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMD D+N  AKGKE + KL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNG+SGT+AQCLKSA +ALN+G +VFVDRCNLEIEQRADF+KLG
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
        G QVD HAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIMKFLKK ENPSK CSSAN  K+SP  Q TQEK  SCDKKEE +C +  NV  ES K E+   RSLE  IS+SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVEKVEEFMD+LGNARL +VDLSHGSKILS+VK +ATEKNIS TKF TFVGDITKLNSEGGLRCNVI N  NWRLKPGGGGVNAAIFSAAGP LEVATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QANSL PGNAV V++PSTSPL NREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLL +AYSSLFQ FISIV+D++KSVKGI E LGS   E QKHSE+S
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKRE+LQN E SKK+KGS NSTE + +NNNKTV K SKHWGSWAQALY+TAMHPE H ++VLETSDDV+VL DIYPKARKHLLVVAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
          EHLPLLRTMH +GLKWI+KF HEDA LVFRLGYHS PSMRQLHLHVISQDFDS+HLKNKKHWNSFNT+FFRDSV ++DEV SHGKA I DDE L+SME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLVTK
        LRCNRCRS HPNLPKLK HISKCQ PFPS LLE GRLV +
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLVTK

A0A6J1CV45 transcription factor bHLH140 (e-value: 0.0e+00; identity: 82.71%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMDID N  AKGKE +EKL+MVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNG+SGTKAQCLKSA++AL++G ++FVDRCNLEIEQRADF+K+G
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
        G  VD HAVVLDLPAQLCISRSVKR+GHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHD LP GCFGQ  +D KVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIMKFLKK +NP+K CSSAN +KDSP  Q ++E S SCDKKEEPACT+  NVDKES K EN   RSL D+IS SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVE VEEFMD+LGNARL +VDL++GSK+LSLVK +A +K I+ +KF TFVGDITKLNSEGGLRCNVI N  NWRLKPG GGVNAAIFSAAGP LEVATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QANSLRPGN VV ++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLR AYSSLFQGFISIVE+QFKSVKGIQ+ LGSA SES+KHSEDS
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RR--------WFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCE
         R         FKRED+QNPE+SKK+KGSQ+S EA  +NNN  VHKMSKHWGSWAQALYNTAMHPE HGD VLE SDDV VLNDIY KA KHLLVVAR E
Subjt:  RR--------WFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCE

Query:  GLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKD
        GLDQL DVRREHLPLLRTMHDVGLKWI+KF HEDASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKNKKHWNSFNT+FFRDSV +MDEVGSHGKA+IKD
Subjt:  GLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKD

Query:  DESLMSMELRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
        DESLMSMELRCNRCRS HPNL KLK HISKC+ PFPS LLEG RLV
Subjt:  DESLMSMELRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

A0A6J1GCV0 transcription factor bHLH140 (e-value: 0.0e+00; identity: 81.98%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MDMDID+N  AKG E R KL+MVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNG+SGTKAQCLKSAA+ALN+G +VFVDRCNLEIEQR++F+KLG
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
           VD HAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIM+FLKK ENP+K CS+ANT KD P SQ TQE       K+E +CT+LSNV+KES K EN    SLE+NIS SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVE VEEFMD+LGNARL MVDLSHGSKILSLVK +A +KNI  TKF TFVGDITKL S+GGL CNVI N  NWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QA SLRPGN V V++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKS KGI ERLGSA SES+KHSED+
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKR +LQ PE+SKK+KG+Q S EA+ +NNNK  HKMSKHWGSWAQALYNTAM+PE H + VLETSDDV+VLNDIYPKARKHLL+VAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
         +EHLPLL+TMH VG+KWI+KF+H+DASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKNKKHWNSFNT+FFRDSV ++DEV SHGKA IKDDESLMSME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
         RCNRCRS HPNLPKLKTHISKCQ+PFPS LLEGGRLV
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

A0A6J1I5W0 transcription factor bHLH140 (e-value: 0.0e+00; identity: 80.89%)
Query:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG
        MD+DID+N  A+G E R KL+MVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNG+SGTKAQCLKSAA+ALN+G NVFVDRCNLEIEQR++F+KLG
Subjt:  MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLG

Query:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ
           VD HAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP LNEGF RI FCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GLQVDRHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQ

Query:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI
        LGIM FLKK ENP+K CS+ANT+KD P SQ TQE       K+E +CT+LSNV+KES K EN   RSLE+NIS SD PTLAFPSISTSDF+FS+EKAAEI
Subjt:  LGIMKFLKKTENPSKACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEI

Query:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ
        IVE VEEFMD+LGNARL MVD+SHGSKILSLVK +A +KNI  TKF TFVGDITKL S+GGL C+VI N  NWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDRLGNARLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQ

Query:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS
        QA SLRPG+ V V++PSTSPLFNREGVTHVIHVLGPNMNP+RPNYLNNDYDEGCKLLRDAYSSLFQ FISIV+D+FKS KGI +RLGSA SES+KHSED+
Subjt:  QANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDS

Query:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV
           FKR + Q PE+SKK+KG+Q S EA+ +NNNK  HKMSKHWGSWAQALYNTAM+PE H + VLETSDDV+VLNDIYPKARKHLL+VAR EGLDQL DV
Subjt:  RRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDV

Query:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME
          EHLPLL+TMH VGLKWI+KF+H+DASLVFRLGYHS PSMRQLHLHVISQDFDSSHLKNKKHWNSFNT+FFRDSV ++DEV +HGKA IKDDESLMSME
Subjt:  RREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSME

Query:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV
         RCNRCRS HPNLPKLKTH+SKCQ+PFPS LLEG RLV
Subjt:  LRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLV

SwissProt top hits (columns: e-value, % identity, alignment)
P61797 Aprataxin (e-value: 5.9e-34; identity: 39%)
Query:  ERLGSATSESQKHSEDSRRWFKREDLQNPEKSKKFK-GSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKAR
        E  G  T   +K S +S    +R+  Q  E S   + GS +S  ++  N  K      +  G W+Q L  +   P+      +   + V+V+ D YPKAR
Subjt:  ERLGSATSESQKHSEDSRRWFKREDLQNPEKSKKFK-GSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKAR

Query:  KHLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEV
         H LV+     +  L  V  EHL LL+ MH VG K I  F    + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNTE+F +S  +++ V
Subjt:  KHLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEV

Query:  GSHGKATIKDD-ESLMSMELRCNRCRSVHPNLPKLKTHISK
           G+ +++D    L+ + LRC+ C+ + P++P+LK H+ +
Subjt:  GSHGKATIKDD-ESLMSMELRCNRCRSVHPNLPKLKTHISK

P61798 Aprataxin (Fragment) (e-value: 4.8e-36; identity: 35.23%)
Query:  MNPKRPNYLNNDYDEGCKL----LRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDSRRWFKREDLQN-PEKSKKFK--GSQNSTEAIKK
        +NP   + ++   DE  K+    +    + L+   +   E+  +SV   +E++       ++  EDS    + +D++N P+K+KK +   +Q+S+  ++ 
Subjt:  MNPKRPNYLNNDYDEGCKL----LRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDSRRWFKREDLQN-PEKSKKFK--GSQNSTEAIKK

Query:  NNNKT-----VHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHE
        + +            +H G W+Q L ++   P+      +   +  +V+ D YPKAR H LV+   + +  L  V REHL LL  MH VG K I +   +
Subjt:  NNNKT-----VHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHE

Query:  DASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDES-LMSMELRCNRCRSVHPNLPKLKTHISK
        + SL FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF TE+F +S  +++ V S GK T+ D  S L+ + LRC+ C+     +P+LK H+ K
Subjt:  DASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDES-LMSMELRCNRCRSVHPNLPKLKTHISK

Q7YRZ1 Aprataxin (e-value: 1.6e-34; identity: 39.83%)
Query:  QERLGSATSESQKHSEDSRRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKAR
        ++R G++ S  +  S++++     E   NP        SQ S    K+ +  T  +   H   W+Q L  +   P+      +   D V+V+ D YPKAR
Subjt:  QERLGSATSESQKHSEDSRRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKAR

Query:  KHLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEV
         H LV+     +  L  V REHL LLR MH VG K I  F    + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNTE+F +S  +++ V
Subjt:  KHLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEV

Query:  GSHGKATIKDD-ESLMSMELRCNRCRSVHPNLPKLKTHISK
           G+ T++D    L+ + LRC+ C+ + P++P+LK H+ K
Subjt:  GSHGKATIKDD-ESLMSMELRCNRCRSVHPNLPKLKTHISK

Q7Z2E3 Aprataxin (e-value: 1.6e-34; identity: 40%)
Query:  GSATSESQKHSEDSRRWFKREDLQNPEKSKKFKGSQNSTEA---IKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARK
        G  T   +K S +S    +R+  Q  E     +   NS +    +KK  +  + K S   G W+Q L  +   P+      +   + V+V+ D YPKAR 
Subjt:  GSATSESQKHSEDSRRWFKREDLQNPEKSKKFKGSQNSTEA---IKKNNNKTVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARK

Query:  HLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVG
        H LV+     +  L  V REHL LL+ MH VG K I  F    + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNTE+F +S  +++ V 
Subjt:  HLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVG

Query:  SHGKATIKDD-ESLMSMELRCNRCRSVHPNLPKLKTHISK
          G+ T++D    L+ + LRC+ C+ + P++P+LK H+ K
Subjt:  SHGKATIKDD-ESLMSMELRCNRCRSVHPNLPKLKTHISK

Q9M041 Transcription factor bHLH140 (e-value: 3.7e-238; identity: 58.69%)
Query:  EREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLGGLQVDRHAVVLDLPA
        ++ K ++V+L+G PGSGKSTFC+  M SS RPW RICQD + NG++GTKAQCLK A ++L EG +VF+DRCNL+ EQR++F+KLGG + + HAVVL+LPA
Subjt:  EREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLGGLQVDRHAVVLDLPA

Query:  QLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQLGIMKFLKKTENPSK
        Q+CISRSVKR+GHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  D LP GCFG+K  D K Q GIMKF KK    + 
Subjt:  QLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQLGIMKFLKKTENPSK

Query:  ACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEIIVEKVEEFMDRLGNA
          SS+N            E +N+  K +E    + +NV     K  ++             +PTLAFPSIST+DFQF  EKA++IIVEK EEF+ +LG A
Subjt:  ACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEIIVEKVEEFMDRLGNA

Query:  RLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQQANSLRPGNAVVVRV
        RL +VDLS GSKILSLVK +A++KNI   KF TFVGDITKL SEGGL CNVI N TNWRLKPGGGGVNAAIF AAGPDLE AT+ +AN+L PG AVVV +
Subjt:  RLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQQANSLRPGNAVVVRV

Query:  PSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDSRRWFKREDLQNPEKS
        PST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+GF+S+V+DQ K  K   +   S + E  K  EDS            E++
Subjt:  PSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDSRRWFKREDLQNPEKS

Query:  KKFKGSQ-----NSTEAIKKNNNK-TVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDVRREHLPLLR
        KK+KGSQ     N+ E+    + + +  KMSK W +WA AL++ AMHPE H + VLE  D+++V+ND YPKARKH+LV+AR E LD L DVR+E+L LL+
Subjt:  KKFKGSQ-----NSTEAIKKNNNK-TVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDVRREHLPLLR

Query:  TMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSMELRCNRCRSV
         MH+VGLKW+++F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV +++EV S GKA +   E L+  ELRCNRCRS 
Subjt:  TMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSMELRCNRCRSV

Query:  HPNLPKLKTHISKCQTPFPSMLLEGGRLVTK
        HPN+PKLK+H+  C + FP  LL+  RLV +
Subjt:  HPNLPKLKTHISKCQTPFPSMLLEGGRLVTK

Arabidopsis top hits (e value, % identity, alignment)
AT2G40600.1 appr-1-p processing enzyme family protein (e value 7.0e-06, 32.8% identity)
Query:  NISPTKFLTFV-GDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQQANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGP--
        N+S +  L  + GDITK + +     + I NP N R+  GGGG + AI  AAGP L  A   +   +RPG          +P FN    + VIH +GP  
Subjt:  NISPTKFLTFV-GDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQQANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGP--

Query:  --NMNPKRPNYLNNDYDEGCKLLRD
          ++NP+    L N Y    ++ ++
Subjt:  --NMNPKRPNYLNNDYDEGCKLLRD

AT5G01310.1 APRATAXIN-like (e value 2.7e-239, 58.69% identity)
Query:  EREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLGGLQVDRHAVVLDLPA
        ++ K ++V+L+G PGSGKSTFC+  M SS RPW RICQD + NG++GTKAQCLK A ++L EG +VF+DRCNL+ EQR++F+KLGG + + HAVVL+LPA
Subjt:  EREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLGGLQVDRHAVVLDLPA

Query:  QLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQLGIMKFLKKTENPSK
        Q+CISRSVKR+GHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  D LP GCFG+K  D K Q GIMKF KK    + 
Subjt:  QLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQLGIMKFLKKTENPSK

Query:  ACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEIIVEKVEEFMDRLGNA
          SS+N            E +N+  K +E    + +NV     K  ++             +PTLAFPSIST+DFQF  EKA++IIVEK EEF+ +LG A
Subjt:  ACSSANTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEIIVEKVEEFMDRLGNA

Query:  RLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQQANSLRPGNAVVVRV
        RL +VDLS GSKILSLVK +A++KNI   KF TFVGDITKL SEGGL CNVI N TNWRLKPGGGGVNAAIF AAGPDLE AT+ +AN+L PG AVVV +
Subjt:  RLAMVDLSHGSKILSLVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQQANSLRPGNAVVVRV

Query:  PSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDSRRWFKREDLQNPEKS
        PST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+GF+S+V+DQ K  K   +   S + E  K  EDS            E++
Subjt:  PSTSPLFNREGVTHVIHVLGPNMNPKRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDSRRWFKREDLQNPEKS

Query:  KKFKGSQ-----NSTEAIKKNNNK-TVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDVRREHLPLLR
        KK+KGSQ     N+ E+    + + +  KMSK W +WA AL++ AMHPE H + VLE  D+++V+ND YPKARKH+LV+AR E LD L DVR+E+L LL+
Subjt:  KKFKGSQ-----NSTEAIKKNNNK-TVHKMSKHWGSWAQALYNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDVRREHLPLLR

Query:  TMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSMELRCNRCRSV
         MH+VGLKW+++F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV +++EV S GKA +   E L+  ELRCNRCRS 
Subjt:  TMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSMELRCNRCRSV

Query:  HPNLPKLKTHISKCQTPFPSMLLEGGRLVTK
        HPN+PKLK+H+  C + FP  LL+  RLV +
Subjt:  HPNLPKLKTHISKCQTPFPSMLLEGGRLVTK


Sequences
CDS sequence
ATGGATATGGATATCGACCAAAATTTGATAGCCAAAGGGAAGGAAGAGCGAGAGAAGCTTTTAATGGTGATATTAGTTGGCGCACCAGGAAGCGGCAAATCCACCTTCTG
CGAACTCGTAATGGGTTCTTCTTCTCGCCCCTGGGTTCGAATCTGCCAGGACACCATTGGAAATGGCAGGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCTGCCAATG
CATTGAATGAGGGAAATAATGTATTTGTGGATAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTTTGAAACTAGGCGGTCTTCAAGTGGATAGACATGCTGTTGTA
TTGGATCTTCCTGCACAGCTTTGTATTTCTCGTTCCGTTAAGCGGTCTGGTCATGAAGGAAATTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAGCTAAATGAAGGGTTCACTCGTATAACATTTTGTCACAATGAGACCGACGTTCAATCAGCTATAGATACGTATAAATCACTTGGTTTACATGATG
CTCTTCCAGATGGATGTTTTGGACAGAAGAACACAGACAAGAAAGTACAGCTTGGCATAATGAAGTTTTTAAAGAAAACAGAAAATCCTTCTAAGGCGTGTTCTAGTGCC
AATACCGTTAAGGATTCTCCAATTTCTCAAAATACCCAGGAAAAGAGCAACTCTTGTGACAAAAAGGAAGAGCCTGCCTGCACAGTGTTAAGCAATGTCGATAAAGAGTC
TGCTAAAAGAGAAAATTCAAGCACTAGATCCTTAGAAGACAATATTTCAGAAAGTGATCTTCCAACTCTTGCATTTCCTTCTATTTCAACATCAGATTTCCAGTTTAGCA
ATGAAAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAGGAATTCATGGATAGGTTGGGAAATGCTAGACTTGCGATGGTAGACTTGAGTCATGGATCAAAGATTTTGTCT
TTGGTTAAAACTAGAGCGACTGAGAAAAATATTAGTCCTACCAAGTTCCTTACATTCGTAGGTGATATAACTAAACTTAATTCAGAAGGAGGATTGCGTTGCAATGTAAT
AACTAATCCTACAAACTGGCGACTGAAACCGGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGATCTGGAAGTGGCAACTAAACAACAAGCTAACT
CTCTTCGGCCTGGCAATGCCGTGGTCGTTCGAGTGCCTTCAACTTCTCCTTTATTTAATAGGGAAGGAGTAACTCATGTCATACATGTTCTTGGACCCAACATGAATCCA
AAAAGGCCAAATTATCTCAACAATGACTATGACGAGGGTTGCAAGCTTCTCCGTGATGCTTACTCTTCTCTATTTCAAGGCTTTATTTCAATAGTAGAAGACCAATTTAA
GTCGGTGAAGGGAATTCAAGAACGCCTCGGATCAGCAACTTCAGAATCACAAAAGCACTCTGAGGACAGTCGTCGCTGGTTTAAGAGAGAGGATTTACAAAATCCTGAAA
AAAGCAAAAAGTTTAAAGGATCTCAGAATTCAACTGAAGCTATAAAGAAAAACAACAATAAGACTGTCCACAAAATGAGTAAACACTGGGGCTCATGGGCACAAGCACTT
TACAACACTGCAATGCATCCCGAGAGTCACGGCGATGCTGTACTGGAGACATCAGATGATGTTATAGTACTTAATGATATTTATCCAAAAGCACGCAAGCATCTTTTAGT
AGTTGCTCGGTGTGAAGGCCTCGATCAACTAGGCGATGTACGTAGAGAGCACCTTCCATTGTTGAGGACGATGCACGATGTGGGTTTGAAGTGGATCAATAAGTTCGTCC
ATGAAGATGCATCATTGGTTTTTCGTCTCGGGTACCACTCGGATCCATCCATGAGGCAACTGCACCTGCATGTTATAAGCCAGGATTTTGACTCCAGTCATCTGAAAAAC
AAGAAGCATTGGAATTCTTTCAACACCGAATTTTTCAGAGACTCGGTCTACATTATGGATGAAGTCGGTAGCCATGGAAAGGCAACCATCAAGGATGATGAGAGCTTGAT
GTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGTTCATCCCAACTTACCCAAATTGAAAACACATATTTCCAAATGCCAAACGCCTTTCCCTTCCATGCTACTTGAGG
GCGGTCGTTTAGTGACCAAGTAA
mRNA sequence
AAAAAACCCAGTTGCAAAACCCTTATAATTTTATTCGTTACAAGGATATGCTAAACGTGTAACCAAACAATTCCAGAGCAACAAATATGGGGAGGGAGCTGGCCGTGGCG
TATCGGTCAACCTTCACTCATCGGAGGACGAACGTCGACAATGGCTATTGATAGGGTTTAAAGGCTGACCTAAAAATCTAATCAAGAAATGGATATGGATATCGACCAAA
ATTTGATAGCCAAAGGGAAGGAAGAGCGAGAGAAGCTTTTAATGGTGATATTAGTTGGCGCACCAGGAAGCGGCAAATCCACCTTCTGCGAACTCGTAATGGGTTCTTCT
TCTCGCCCCTGGGTTCGAATCTGCCAGGACACCATTGGAAATGGCAGGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCTGCCAATGCATTGAATGAGGGAAATAATGT
ATTTGTGGATAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTTTGAAACTAGGCGGTCTTCAAGTGGATAGACATGCTGTTGTATTGGATCTTCCTGCACAGCTTT
GTATTTCTCGTTCCGTTAAGCGGTCTGGTCATGAAGGAAATTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAAGAAAGAATTGCCCAAGCTAAAT
GAAGGGTTCACTCGTATAACATTTTGTCACAATGAGACCGACGTTCAATCAGCTATAGATACGTATAAATCACTTGGTTTACATGATGCTCTTCCAGATGGATGTTTTGG
ACAGAAGAACACAGACAAGAAAGTACAGCTTGGCATAATGAAGTTTTTAAAGAAAACAGAAAATCCTTCTAAGGCGTGTTCTAGTGCCAATACCGTTAAGGATTCTCCAA
TTTCTCAAAATACCCAGGAAAAGAGCAACTCTTGTGACAAAAAGGAAGAGCCTGCCTGCACAGTGTTAAGCAATGTCGATAAAGAGTCTGCTAAAAGAGAAAATTCAAGC
ACTAGATCCTTAGAAGACAATATTTCAGAAAGTGATCTTCCAACTCTTGCATTTCCTTCTATTTCAACATCAGATTTCCAGTTTAGCAATGAAAAGGCTGCTGAAATTAT
TGTTGAGAAGGTTGAGGAATTCATGGATAGGTTGGGAAATGCTAGACTTGCGATGGTAGACTTGAGTCATGGATCAAAGATTTTGTCTTTGGTTAAAACTAGAGCGACTG
AGAAAAATATTAGTCCTACCAAGTTCCTTACATTCGTAGGTGATATAACTAAACTTAATTCAGAAGGAGGATTGCGTTGCAATGTAATAACTAATCCTACAAACTGGCGA
CTGAAACCGGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGATCTGGAAGTGGCAACTAAACAACAAGCTAACTCTCTTCGGCCTGGCAATGCCGT
GGTCGTTCGAGTGCCTTCAACTTCTCCTTTATTTAATAGGGAAGGAGTAACTCATGTCATACATGTTCTTGGACCCAACATGAATCCAAAAAGGCCAAATTATCTCAACA
ATGACTATGACGAGGGTTGCAAGCTTCTCCGTGATGCTTACTCTTCTCTATTTCAAGGCTTTATTTCAATAGTAGAAGACCAATTTAAGTCGGTGAAGGGAATTCAAGAA
CGCCTCGGATCAGCAACTTCAGAATCACAAAAGCACTCTGAGGACAGTCGTCGCTGGTTTAAGAGAGAGGATTTACAAAATCCTGAAAAAAGCAAAAAGTTTAAAGGATC
TCAGAATTCAACTGAAGCTATAAAGAAAAACAACAATAAGACTGTCCACAAAATGAGTAAACACTGGGGCTCATGGGCACAAGCACTTTACAACACTGCAATGCATCCCG
AGAGTCACGGCGATGCTGTACTGGAGACATCAGATGATGTTATAGTACTTAATGATATTTATCCAAAAGCACGCAAGCATCTTTTAGTAGTTGCTCGGTGTGAAGGCCTC
GATCAACTAGGCGATGTACGTAGAGAGCACCTTCCATTGTTGAGGACGATGCACGATGTGGGTTTGAAGTGGATCAATAAGTTCGTCCATGAAGATGCATCATTGGTTTT
TCGTCTCGGGTACCACTCGGATCCATCCATGAGGCAACTGCACCTGCATGTTATAAGCCAGGATTTTGACTCCAGTCATCTGAAAAACAAGAAGCATTGGAATTCTTTCA
ACACCGAATTTTTCAGAGACTCGGTCTACATTATGGATGAAGTCGGTAGCCATGGAAAGGCAACCATCAAGGATGATGAGAGCTTGATGTCTATGGAGTTGCGTTGCAAC
AGATGCAGAAGTGTTCATCCCAACTTACCCAAATTGAAAACACATATTTCCAAATGCCAAACGCCTTTCCCTTCCATGCTACTTGAGGGCGGTCGTTTAGTGACCAAGTA
AATGCTCATAAACTCTGTAATCGTAGTTTTCATTTTCATAATTAGTAGTAAGATCATCTTTTGCTCTCTAGCAGAGTAATTTTCTGAGAAATTCGAAGTTTCGAGTTCCA
TCTATTCTTTTAAATCTTTTGT
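
The mRNA sequence above appears to contain the CDS as one contiguous stretch, flanked by untranslated regions. A minimal Python sketch of how the two could be related, assuming both sequences are pasted from this page as plain strings; `split_utrs` is a hypothetical helper name, not part of any CuGenDB tool:

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str, str]:
    """Split an mRNA into (5' UTR, CDS, 3' UTR) by locating the CDS inside it.

    Assumes the CDS occurs as one contiguous substring of the mRNA.
    Whitespace and line breaks from copy-pasting are removed first.
    """
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Toy example with made-up sequences (not taken from this record).
five_utr, coding, three_utr = split_utrs("AAACCATGGATTAAGG", "ATGGATTAA")
print(len(five_utr), len(coding), len(three_utr))
```

Applied to the sequences on this page, the lengths of the first and last elements would give the 5' and 3' UTR sizes of the transcript.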
Protein sequence
MDMDIDQNLIAKGKEEREKLLMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGRSGTKAQCLKSAANALNEGNNVFVDRCNLEIEQRADFLKLGGLQVDRHAVV
LDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNTDKKVQLGIMKFLKKTENPSKACSSA
NTVKDSPISQNTQEKSNSCDKKEEPACTVLSNVDKESAKRENSSTRSLEDNISESDLPTLAFPSISTSDFQFSNEKAAEIIVEKVEEFMDRLGNARLAMVDLSHGSKILS
LVKTRATEKNISPTKFLTFVGDITKLNSEGGLRCNVITNPTNWRLKPGGGGVNAAIFSAAGPDLEVATKQQANSLRPGNAVVVRVPSTSPLFNREGVTHVIHVLGPNMNP
KRPNYLNNDYDEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIQERLGSATSESQKHSEDSRRWFKREDLQNPEKSKKFKGSQNSTEAIKKNNNKTVHKMSKHWGSWAQAL
YNTAMHPESHGDAVLETSDDVIVLNDIYPKARKHLLVVARCEGLDQLGDVRREHLPLLRTMHDVGLKWINKFVHEDASLVFRLGYHSDPSMRQLHLHVISQDFDSSHLKN
KKHWNSFNTEFFRDSVYIMDEVGSHGKATIKDDESLMSMELRCNRCRSVHPNLPKLKTHISKCQTPFPSMLLEGGRLVTK
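
The protein sequence above should correspond to a frame-0 translation of the CDS. A minimal Python sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper written for illustration:

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is stripped first; codons with ambiguous bases become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed on this page.
print(translate("ATGGATATGGATATCGACCAAAATTTGATAGCC"))  # MDMDIDQNLIA
```

The printed residues match the start of the protein sequence listed above; running the full CDS through the same function and comparing it with the full protein would extend the check to the whole record.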