; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027290 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027290
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontranscription factor bHLH140
Genome locationchr10:46522002..46525305
RNA-Seq ExpressionLag0027290
SyntenyLag0027290
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596332.1 Transcription factor basic helix-loop-helix 140, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0087.9Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMDIDENS AKG E + K IMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLE+EQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENPAKTCS+ANT KD P S +TQE       K+ES+CTMLSNV+KES+KGEN GVGSLENNI+ SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVE VEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIF+AAGPSLE ATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QA SLRPGN VAVQLPSTSPLF REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLRDAYSSLFQ FISIV+D+FKS KGI ERL SA SES+KHSE++
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV
        HHKFKR +LQ PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H ++VLETSDDVVVLND+YPKARKHLL+VARYEGLDQLADV
Subjt:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV

Query:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME
         KEHLPLLKTMHA+G+KWI+KF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVSSHGKA IKDDES MSME
Subjt:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME

Query:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA
         RCNRCRSAHPNLPKLKTHISKCQ+PFPSTLLEGGRLV A  N+
Subjt:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0086.21Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMDID+NS AKGKEGQ K IMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLE+EQRADFVK+G
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHD LP GCFGQ   D KVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NPAKTCSSAN IKDSP   +++E SYSCDKKEE ACT+  NVDKES+KGEN GV SL ++I++SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKLNSEGGL CNVIANAANWRLKPG GGVNAAIF+AAGP LEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QANSLRPGN V  QLPSTSPLF REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLR AYSSLFQGFISIVE+QFKSVKGI + L SA SES+KHSE+S
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  --------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYE
                 HKFKRED+QNPERSKKWKGSQDSAEA NQNNN  VHKMSKHWGSWAQALYNTAMHPE+HGD VLE SDDV VLND+Y KA KHLLVVARYE
Subjt:  --------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYE

Query:  GLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKD
        GLDQLADVR+EHLPLL+TMH +GLKWI+KFFHEDASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDV+DEV SHGKASIKD
Subjt:  GLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKD

Query:  DESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNAHE
        DES MSMELRCNRCRSAHPNL KLK HISKC+APFPSTLLEG RLVIAPSN H+
Subjt:  DESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNAHE

XP_022949470.1 transcription factor bHLH140 [Cucurbita moschata]0.0e+0087.9Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMDIDENS AKG E + K IMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLE+EQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENPAKTCS+ANT KD P S +TQE       K+ES+CTMLSNV+KES+KGEN GVGSLENNI+ SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVE VEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIF+AAGPSLE ATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QA SLRPGN VAVQLPSTSPLF REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLRDAYSSLFQ FISIV+D+FKS KGI ERL SA SES+KHSE++
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV
        HHKFKR +LQ PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H ++VLETSDDVVVLND+YPKARKHLL+VARYEGLDQLADV
Subjt:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV

Query:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME
         KEHLPLLKTMHA+G+KWI+KF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVSSHGKA IKDDES MSME
Subjt:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME

Query:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA
         RCNRCRSAHPNLPKLKTHISKCQ+PFPSTLLEGGRLV A  N+
Subjt:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0089.92Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMD DENS AKGKEGQGK IMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLE+EQRADFVKL 
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        GPRVDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAID YKSL LHD LP GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE P++T SSANT+KDSPI  +TQEKS SCDKKEESACT+  NVD ES KGE+ GV SLE+NI+QSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVEKVEEFMDKLGNARLV+VDLSHGSKILSLVKAKAAKKNI+STKFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIF+AAG  LEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QANSL+PGNAVAVQLPSTSPLF REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLR+AYSSLFQ FIS+VED+FKSVKGIH RL    SE +KHSE S
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV
        HHKFKRE+LQNPERSKKWKGSQDS EALNQNNNKTV KMSKHWGSWAQALYNTAMHPE+H D VLETSDDVVVLND+YPKARKHLLVVAR+EGLDQLADV
Subjt:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV

Query:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME
          EHLPLL+TMHA+GLKWI+KFFHEDASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKN+KHWNSFNTDFFRDSV V+DEVSSHGKAS+ DDES MSME
Subjt:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME

Query:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA
        LRCNRCRSAHPNLPKLK HI KCQAPFPSTLLEGGRLVIAPSNA
Subjt:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0090.04Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLGGPRVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLE+EQRADFVKL GPRVDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLGGPRVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQLGIMKFLKKAENPAKTCSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAID YKSL LHD LP GCFGQKNPDKKVQLGIMKFLKKAE P++T SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQLGIMKFLKKAENPAKTCSSAN

Query:  TIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVMVD
        T+KDSPI  +TQEKS SCDKKEESACT+  NVD ES KGE+ GV SLE+NI+QSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLV+VD
Subjt:  TIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVMVD

Query:  LSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQQANSLRPGNAVAVQLPSTSPL
        LSHGSKILSLVKAKAAKKNI+STKFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIF+AAG  LEVATKQQANSL+PGNAVAVQLPSTSPL
Subjt:  LSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQQANSLRPGNAVAVQLPSTSPL

Query:  FIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEESHHKFKREDLQNPERSKKWKGS
        F REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLR+AYSSLFQ FIS+VED+FKSVKGIH RL    SE +KHSE SHHKFKRE+LQNPERSKKWKGS
Subjt:  FIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEESHHKFKREDLQNPERSKKWKGS

Query:  QDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAMGLKWINK
        QDS EALNQNNNKTV KMSKHWGSWAQALYNTAMHPE+H D VLETSDDVVVLND+YPKARKHLLVVAR+EGLDQLADV  EHLPLL+TMHA+GLKWI+K
Subjt:  QDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAMGLKWINK

Query:  FFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHIS
        FFHEDASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKN+KHWNSFNTDFFRDSV V+DEVSSHGKAS+ DDES MSMELRCNRCRSAHPNLPKLK HI 
Subjt:  FFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHIS

Query:  KCQAPFPSTLLEGGRLVIAPSNA
        KCQAPFPSTLLEGGRLVIAPSNA
Subjt:  KCQAPFPSTLLEGGRLVIAPSNA

TrEMBL top hitse value%identityAlignment
A0A1S3B6C4 transcription factor bHLH1400.0e+0086.69Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMD DENS AKGKEGQGK IMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLE+EQRADFVKLG
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        GP+VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LP GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENP+KTCSSAN  K+SP    TQEK  SCDKKEES+C M  NV  ES+KGE+ GV SLE  I+QSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVEKVEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI+STKFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIF+AAGP LEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QANSL PGNAVAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLL +AYSSLFQ FISIV+D++KSVKGI+E L S   E QKHSE S
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV
        HHKFKRE+LQN E SKKWKGS +S E LNQNNNKTV K SKHWGSWAQALY+TAMHPE+H + VLETSDDVVVL D+YPKARKHLLVVAR+EGLDQLADV
Subjt:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV

Query:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME
          EHLPLL+TMHAMGLKWI+KFFHEDA LVFRLGYHSAPSMRQLHLHV+SQDFDS++LKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I DDE  +SME
Subjt:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME

Query:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA
        LRCNRCRSAHPNLPKLK HISKCQAPFPSTLLE GRLV+ PSNA
Subjt:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA

A0A5A7TLV2 Transcription factor bHLH1400.0e+0086.69Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMD DENS AKGKEGQGK IMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLE+EQRADFVKLG
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        GP+VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LP GCFGQKNPDKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENP+KTCSSAN  K+SP    TQEK  SCDKKEES+C M  NV  ES+KGE+ GV SLE  I+QSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVEKVEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI+STKFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIF+AAGP LEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QANSL PGNAVAVQLPSTSPL  REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLL +AYSSLFQ FISIV+D++KSVKGI+E L S   E QKHSE S
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV
        HHKFKRE+LQN E SKKWKGS +S E LNQNNNKTV K SKHWGSWAQALY+TAMHPE+H + VLETSDDVVVL D+YPKARKHLLVVAR+EGLDQLADV
Subjt:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV

Query:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME
          EHLPLL+TMHAMGLKWI+KFFHEDA LVFRLGYHSAPSMRQLHLHV+SQDFDS++LKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I DDE  +SME
Subjt:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME

Query:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA
        LRCNRCRSAHPNLPKLK HISKCQAPFPSTLLE GRLV+ PSNA
Subjt:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA

A0A6J1CV45 transcription factor bHLH1400.0e+0086.21Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMDID+NS AKGKEGQ K IMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGK++FVDRCNLE+EQRADFVK+G
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHD LP GCFGQ   D KVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NPAKTCSSAN IKDSP   +++E SYSCDKKEE ACT+  NVDKES+KGEN GV SL ++I++SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVE VEEFMDKLGNARLV+VDL++GSK+LSLVKAKAAKK I  +KFFTFVGDITKLNSEGGL CNVIANAANWRLKPG GGVNAAIF+AAGP LEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QANSLRPGN V  QLPSTSPLF REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLR AYSSLFQGFISIVE+QFKSVKGI + L SA SES+KHSE+S
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  --------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYE
                 HKFKRED+QNPERSKKWKGSQDSAEA NQNNN  VHKMSKHWGSWAQALYNTAMHPE+HGD VLE SDDV VLND+Y KA KHLLVVARYE
Subjt:  --------HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYE

Query:  GLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKD
        GLDQLADVR+EHLPLL+TMH +GLKWI+KFFHEDASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDV+DEV SHGKASIKD
Subjt:  GLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKD

Query:  DESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNAHE
        DES MSMELRCNRCRSAHPNL KLK HISKC+APFPSTLLEG RLVIAPSN H+
Subjt:  DESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNAHE

A0A6J1GCV0 transcription factor bHLH1400.0e+0087.9Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MDMDIDENS AKG E + K IMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLE+EQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENPAKTCS+ANT KD P S +TQE       K+ES+CTMLSNV+KES+KGEN GVGSLENNI+ SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVE VEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIF+AAGPSLE ATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QA SLRPGN VAVQLPSTSPLF REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLRDAYSSLFQ FISIV+D+FKS KGI ERL SA SES+KHSE++
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV
        HHKFKR +LQ PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H ++VLETSDDVVVLND+YPKARKHLL+VARYEGLDQLADV
Subjt:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV

Query:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME
         KEHLPLLKTMHA+G+KWI+KF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVSSHGKA IKDDES MSME
Subjt:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME

Query:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA
         RCNRCRSAHPNLPKLKTHISKCQ+PFPSTLLEGGRLV A  N+
Subjt:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA

A0A6J1I5W0 transcription factor bHLH1400.0e+0086.42Show/hide
Query:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG
        MD+DIDENS A+G E + K IMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLE+EQR++FVKLG
Subjt:  MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLG

Query:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RI FCH+ETDVQSAIDTYKSLGLHDALPDGCFGQKN DKKVQ
Subjt:  GPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM FLKKAENPAKTCS+ANTIKD P S +TQE       K+ES+CTMLSNV+KES+KGEN G+ SLENNI+ SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ
        IVE VEEFMDKLGNARLVMVD+SHGSKILSLVKAKAAKKNI STKFFTFVGDITKL S+GGL C+VIANAANWRLKPGGGGVNAAIF+AAGPSLE ATKQ
Subjt:  IVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQ

Query:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES
        QA SLRPG+ VAVQLPSTSPLF REGVTHVIHVLGPNMNPQRPNYLNNDY+EGCKLLRDAYSSLFQ FISIV+D+FKS KGI +RL SA SES+KHSE++
Subjt:  QANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEES

Query:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV
        HHKFKR + Q PERSKKWKG+Q+SAEALNQNNNK  HKMSKHWGSWAQALYNTAM+PE+H + VLETSDDVVVLND+YPKARKHLL+VARYEGLDQLADV
Subjt:  HHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADV

Query:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME
          EHLPLLKTMHA+GLKWI+KF H+DASLVFRLGYHSAPSMRQLHLHV+SQDFDSS+LKNKKHWNSFNTDFFRDSVDVIDEVS+HGKA IKDDES MSME
Subjt:  RKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSME

Query:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA
         RCNRCRSAHPNLPKLKTH+SKCQ+PFPSTLLEG RLV A  N+
Subjt:  LRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNA

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)1.2e-3437.9Show/hide
Query:  ERLSSALSESQKHSEESHHKFKREDLQN-PERSKKWK---GSQDSAEALNQNNNKTVHK----MSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLN
        E +  A  + Q          + +D++N P+++KK +       SA+     ++ + H+      +H G W+Q L ++   P+    V +   +  VV+ 
Subjt:  ERLSSALSESQKHSEESHHKFKREDLQN-PERSKKWK---GSQDSAEALNQNNNKTVHK----MSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLN

Query:  DVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDS
        D YPKAR H LV+  ++ +  L  V +EHL LL+ MHA+G K I +   ++ SL FRLGYH+ PSM QLHLHV+SQDFDS  LK KKHWNSF T++F +S
Subjt:  DVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDS

Query:  VDVIDEVSSHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK
         +VI+ V S GK ++ D  S  + + LRC+ C+     +P+LK H+ K
Subjt:  VDVIDEVSSHGKASIKDDESS-MSMELRCNRCRSAHPNLPKLKTHISK

Q7TQC5 Aprataxin7.8e-3440.17Show/hide
Query:  SHHKFKREDLQNPERSKK-----WKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGL
        +  K KR D  + E   +       GS  S  +++   +K      +  G W+Q L  +   P+    + +   D VVV+ D YPKAR H LV+  +  +
Subjt:  SHHKFKREDLQNPERSKK-----WKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGL

Query:  DQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDE
          L  V  EHL LLK MHA+G K I   F   + L FRLGYH+ PSM  +HLHV+SQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+ ++KD  
Subjt:  DQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDE

Query:  SS-MSMELRCNRCRSAHPNLPKLKTHISK
           + + LRC+ C+   P++P+LK H+ K
Subjt:  SS-MSMELRCNRCRSAHPNLPKLKTHISK

Q7Z2E3 Aprataxin4.6e-3444.44Show/hide
Query:  GSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQ
        G W+Q L  +   P+    + +   + VVV+ D YPKAR H LV+  +  +  L  V +EHL LLK MH +G K I   F   + L FRLGYH+ PSM  
Subjt:  GSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQ

Query:  LHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDD-ESSMSMELRCNRCRSAHPNLPKLKTHISK
        +HLHV+SQDFDS  LKNKKHWNSFNT++F +S  VI+ V   G+ +++D     + + LRC+ C+   P++P+LK H+ K
Subjt:  LHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDD-ESSMSMELRCNRCRSAHPNLPKLKTHISK

Q8K4H4 Aprataxin1.3e-3340.62Show/hide
Query:  SHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLAD
        +  K KR D  + E   +  G+  S  +++    K      +  G W+Q L  +   P+    + +   D VVV+ D YPKAR H LV+  +  +  L  
Subjt:  SHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLAD

Query:  VRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESS-MS
        V  EHL LLK MHA+G K I   F   + L FRLGYH+ PSM  +HLHV+SQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+ ++KD     + 
Subjt:  VRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESS-MS

Query:  MELRCNRCRSAHPNLPKLKTHISK
        + LRC+ C+   P++P+LK H+ K
Subjt:  MELRCNRCRSAHPNLPKLKTHISK

Q9M041 Transcription factor bHLH1406.0e-24459.81Show/hide
Query:  MDMDIDEN-----SAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRAD
        M++ I+E+     S    +  + K+I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GK+VF+DRCNL+ EQR++
Subjt:  MDMDIDEN-----SAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRAD

Query:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP
        F+KLGGP  +VHAVVL+LPAQ+CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  D LP GCFG+K  
Subjt:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP

Query:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHE
        D K Q GIMKF KK    A   SS+N            E + +  K +E    M +NV     K   LG   +         PTLAFPSIST+DF+F  E
Subjt:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHE

Query:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLE
        KA++IIVEK EEF+ KLG ARLV+VDLS GSKILSLVKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE
Subjt:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLE

Query:  VATKQQANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQK
         AT+ +AN+L PG AV V LPST PL   EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+GF+S+V+DQ K  K       +A+S+S +
Subjt:  VATKQQANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQK

Query:  HSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVA
          +E           + ER+KK+KGSQD A   N      ++   +  KMSK W +WA AL++ AMHPE+H +VVLE  D++VV+ND YPKARKH+LV+A
Subjt:  HSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVA

Query:  RYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAS
        R E LD L DVRKE+L LL+ MH +GLKW+++F +EDASL+FRLGYHS PSMRQLHLHV+SQDF+S  LKNKKHWNSF T FFRDSVDV++EV+S GKA+
Subjt:  RYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAS

Query:  IKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV
        +   E  +  ELRCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  IKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT2G40600.1 appr-1-p processing enzyme family protein3.0e-0433.63Show/hide
Query:  GDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQQANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGP----NMNPQRPNYL
        GDITK + +     + I N AN R+  GGGG + AI  AAGP L  A   +   +RPG          +P F     + VIH +GP    ++NPQ    L
Subjt:  GDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQQANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGP----NMNPQRPNYL

Query:  NNDYNEGCKLLRD
         N Y    ++ ++
Subjt:  NNDYNEGCKLLRD

AT5G01310.1 APRATAXIN-like4.3e-24559.81Show/hide
Query:  MDMDIDEN-----SAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRAD
        M++ I+E+     S    +  + K+I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GK+VF+DRCNL+ EQR++
Subjt:  MDMDIDEN-----SAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRAD

Query:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP
        F+KLGGP  +VHAVVL+LPAQ+CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++ Y  LG  D LP GCFG+K  
Subjt:  FVKLGGPRVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNP

Query:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHE
        D K Q GIMKF KK    A   SS+N            E + +  K +E    M +NV     K   LG   +         PTLAFPSIST+DF+F  E
Subjt:  DKKVQLGIMKFLKKAENPAKTCSSANTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHE

Query:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLE
        KA++IIVEK EEF+ KLG ARLV+VDLS GSKILSLVKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE
Subjt:  KAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILSLVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLE

Query:  VATKQQANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQK
         AT+ +AN+L PG AV V LPST PL   EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR+AY+SLF+GF+S+V+DQ K  K       +A+S+S +
Subjt:  VATKQQANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNPQRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQK

Query:  HSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVA
          +E           + ER+KK+KGSQD A   N      ++   +  KMSK W +WA AL++ AMHPE+H +VVLE  D++VV+ND YPKARKH+LV+A
Subjt:  HSEESHHKFKREDLQNPERSKKWKGSQDSAEALN------QNNNKTVHKMSKHWGSWAQALYNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVA

Query:  RYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAS
        R E LD L DVRKE+L LL+ MH +GLKW+++F +EDASL+FRLGYHS PSMRQLHLHV+SQDF+S  LKNKKHWNSF T FFRDSVDV++EV+S GKA+
Subjt:  RYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKNKKHWNSFNTDFFRDSVDVIDEVSSHGKAS

Query:  IKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV
        +   E  +  ELRCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  IKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATGGATATCGACGAAAATTCGGCAGCCAAAGGAAAGGAAGGGCAAGGGAAGCGCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTTTG
CGAACTCGTAATGGGTTCCTCCTCTCGCCCTTGGGTTCGAATCTGCCAGGACACGATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCCAGTG
CTTTGAATGATGGAAAGAATGTATTTGTGGATAGGTGCAATCTTGAAATGGAGCAGCGTGCAGATTTTGTGAAACTTGGTGGCCCTCGAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATTTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAATTTGTCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAGATGCTGCAAAA
GAAAGAATTGCCCAAGCTAAATGAAGGCTTCACTCGCATAACCTTTTGCCACAATGAGACTGACGTTCAATCCGCTATAGACACATACAAATCACTTGGTTTACATGATG
CACTTCCAGATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAGCTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAAATCCTGCTAAGACGTGTTCTAGTGCC
AATACCATTAAGGATTCTCCAATTTCTCCAAGTACCCAGGAAAAGAGCTACTCTTGTGATAAGAAGGAAGAGTCTGCTTGCACAATGTTGAGCAATGTCGATAAAGAGTC
AAAGAAAGGTGAAAATCTAGGCGTAGGATCCTTAGAAAACAATATTGCTCAAAGTGATCCTCCAACTCTTGCATTTCCATCTATTTCAACATCAGATTTCAAGTTCAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTGATGGTAGACTTGAGTCATGGATCAAAGATTTTGTCT
TTGGTTAAAGCTAAAGCAGCCAAGAAGAATATTACTTCGACCAAGTTTTTTACATTCGTAGGTGATATAACCAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTTAAACCGGGTGGTGGTGGTGTCAATGCTGCAATTTTTACTGCTGCAGGTCCCAGTCTGGAAGTCGCAACTAAGCAACAAGCAAACT
CACTTCGACCTGGCAATGCCGTGGCTGTTCAATTGCCTTCAACTTCTCCATTGTTTATCAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCA
CAGAGGCCAAATTATCTCAACAATGACTATAATGAAGGTTGCAAACTTCTTCGTGATGCTTACTCTTCCCTATTTCAAGGCTTTATTTCTATAGTAGAAGACCAATTTAA
GTCAGTGAAGGGAATCCACGAACGCCTCAGCTCAGCACTTTCAGAATCACAAAAGCACTCTGAGGAAAGCCATCACAAGTTCAAGAGAGAGGATTTGCAAAATCCTGAAA
GAAGCAAAAAGTGGAAAGGATCTCAAGATTCAGCTGAAGCATTAAACCAAAACAACAATAAGACTGTCCACAAAATGAGCAAGCACTGGGGCTCATGGGCACAAGCACTT
TACAACACTGCAATGCATCCCGAGAAACATGGCGATGTTGTACTGGAGACATCAGATGATGTTGTCGTACTGAATGATGTTTATCCAAAAGCACGCAAGCATCTTCTTGT
AGTTGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACGTAAAGAACACCTTCCATTGTTGAAGACGATGCACGCCATGGGTTTGAAGTGGATCAATAAGTTCTTTC
ATGAAGATGCATCATTGGTTTTTCGCCTTGGATACCACTCGGCTCCATCCATGAGGCAACTGCACCTACATGTTGTAAGCCAGGATTTCGACTCCAGTTATCTGAAGAAC
AAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCCGAGACTCAGTCGACGTTATTGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCAAGGACGATGAGAGCTCGAT
GTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAACACATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACGCTACTTGAGG
GCGGGCGTTTAGTGATTGCACCAAGTAATGCTCATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATGGATATCGACGAAAATTCGGCAGCCAAAGGAAAGGAAGGGCAAGGGAAGCGCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTTTG
CGAACTCGTAATGGGTTCCTCCTCTCGCCCTTGGGTTCGAATCTGCCAGGACACGATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCCAGTG
CTTTGAATGATGGAAAGAATGTATTTGTGGATAGGTGCAATCTTGAAATGGAGCAGCGTGCAGATTTTGTGAAACTTGGTGGCCCTCGAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATTTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAATTTGTCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAGATGCTGCAAAA
GAAAGAATTGCCCAAGCTAAATGAAGGCTTCACTCGCATAACCTTTTGCCACAATGAGACTGACGTTCAATCCGCTATAGACACATACAAATCACTTGGTTTACATGATG
CACTTCCAGATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAGCTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAAATCCTGCTAAGACGTGTTCTAGTGCC
AATACCATTAAGGATTCTCCAATTTCTCCAAGTACCCAGGAAAAGAGCTACTCTTGTGATAAGAAGGAAGAGTCTGCTTGCACAATGTTGAGCAATGTCGATAAAGAGTC
AAAGAAAGGTGAAAATCTAGGCGTAGGATCCTTAGAAAACAATATTGCTCAAAGTGATCCTCCAACTCTTGCATTTCCATCTATTTCAACATCAGATTTCAAGTTCAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTGATGGTAGACTTGAGTCATGGATCAAAGATTTTGTCT
TTGGTTAAAGCTAAAGCAGCCAAGAAGAATATTACTTCGACCAAGTTTTTTACATTCGTAGGTGATATAACCAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTTAAACCGGGTGGTGGTGGTGTCAATGCTGCAATTTTTACTGCTGCAGGTCCCAGTCTGGAAGTCGCAACTAAGCAACAAGCAAACT
CACTTCGACCTGGCAATGCCGTGGCTGTTCAATTGCCTTCAACTTCTCCATTGTTTATCAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCA
CAGAGGCCAAATTATCTCAACAATGACTATAATGAAGGTTGCAAACTTCTTCGTGATGCTTACTCTTCCCTATTTCAAGGCTTTATTTCTATAGTAGAAGACCAATTTAA
GTCAGTGAAGGGAATCCACGAACGCCTCAGCTCAGCACTTTCAGAATCACAAAAGCACTCTGAGGAAAGCCATCACAAGTTCAAGAGAGAGGATTTGCAAAATCCTGAAA
GAAGCAAAAAGTGGAAAGGATCTCAAGATTCAGCTGAAGCATTAAACCAAAACAACAATAAGACTGTCCACAAAATGAGCAAGCACTGGGGCTCATGGGCACAAGCACTT
TACAACACTGCAATGCATCCCGAGAAACATGGCGATGTTGTACTGGAGACATCAGATGATGTTGTCGTACTGAATGATGTTTATCCAAAAGCACGCAAGCATCTTCTTGT
AGTTGCTCGGTATGAAGGCCTCGATCAACTAGCCGATGTACGTAAAGAACACCTTCCATTGTTGAAGACGATGCACGCCATGGGTTTGAAGTGGATCAATAAGTTCTTTC
ATGAAGATGCATCATTGGTTTTTCGCCTTGGATACCACTCGGCTCCATCCATGAGGCAACTGCACCTACATGTTGTAAGCCAGGATTTCGACTCCAGTTATCTGAAGAAC
AAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCCGAGACTCAGTCGACGTTATTGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCAAGGACGATGAGAGCTCGAT
GTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAACACATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACGCTACTTGAGG
GCGGGCGTTTAGTGATTGCACCAAGTAATGCTCATGAATGA
Protein sequenceShow/hide protein sequence
MDMDIDENSAAKGKEGQGKRIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKNVFVDRCNLEMEQRADFVKLGGPRVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDTYKSLGLHDALPDGCFGQKNPDKKVQLGIMKFLKKAENPAKTCSSA
NTIKDSPISPSTQEKSYSCDKKEESACTMLSNVDKESKKGENLGVGSLENNIAQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVMVDLSHGSKILS
LVKAKAAKKNITSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFTAAGPSLEVATKQQANSLRPGNAVAVQLPSTSPLFIREGVTHVIHVLGPNMNP
QRPNYLNNDYNEGCKLLRDAYSSLFQGFISIVEDQFKSVKGIHERLSSALSESQKHSEESHHKFKREDLQNPERSKKWKGSQDSAEALNQNNNKTVHKMSKHWGSWAQAL
YNTAMHPEKHGDVVLETSDDVVVLNDVYPKARKHLLVVARYEGLDQLADVRKEHLPLLKTMHAMGLKWINKFFHEDASLVFRLGYHSAPSMRQLHLHVVSQDFDSSYLKN
KKHWNSFNTDFFRDSVDVIDEVSSHGKASIKDDESSMSMELRCNRCRSAHPNLPKLKTHISKCQAPFPSTLLEGGRLVIAPSNAHE