; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G022350 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G022350
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptiontranscription factor bHLH140
Genome locationchr03:33373290..33376940
RNA-Seq ExpressionLsi03G022350
SyntenyLsi03G022350
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.0e+0089.17Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPT Q TQEK +SCDKKE+ +C MSRNV MESEKGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANW          RL PGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        GPGLEVATKQQANSL PGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGI+E LGS P
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR
         EPQKHSE+SHHKFKRENLQNLE SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR
Subjt:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR

Query:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI
         EGLDQLADVC EHL LLRTMHA+GLKWI KFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA I
Subjt:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI

Query:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        MD+E L+SMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.0e+0089.17Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPT Q TQEK +SC KKE+ +CTMSRNV MESEKGESPG+RSL+D ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANW          RL PGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        G GLEVATKQQANSLQPGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGIHE LGS P
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR
         E QKHSED HHKFKRENLQNLERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR
Subjt:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR

Query:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI
         EGLDQLADVC EHL LLRTMHA+GLKWI+KFF ED  LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EVSSHGKA+I
Subjt:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI

Query:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        MD+ESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0084.1Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NST KG EGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE DVQS ID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP LQT +E S SCDKKE+PACT+  NVD ESEKGE+PGVRSL D+IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLVLVDL++GS +LS+VKAKAAKK I+ +KFFTFVGDITKLNSEGGL CNVIANAANW          RL PG GGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        GPGLEVATKQQANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLR AY SLFQ FISIVE++FKSVKGI + LGSAP
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDS--------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKAR
        SE +KHSEDS         HKFKRE++QN ERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALY+TAMHPERH DTVLE SDDV VLNDIY KA 
Subjt:  SEPQKHSEDS--------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKAR

Query:  KHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEV
        KHLLVVAR EGLDQLADV REHL LLRTMH VGLKWIDKFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VMDEV
Subjt:  KHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEV

Query:  SSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN
         SHGKASI D+ESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLVIAPSN
Subjt:  SSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0092.07Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENST KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQRADFVKL 
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         P+VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE DVQS IDMYKSLDLH+MLP GCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAEKPSET SSANTVKDSP  Q TQEKSDSCDKKE+ ACT+SRNVD+ES+KGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLVLVDLSHGS ILS+VKAKAAKKNISSTKFFTFVGDITKLNSEGGL CNVIANAANW          RL PGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        G GLEVATKQQANSLQPGN VAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLRNAY SLFQAFIS+VEDKFKSVKGIH RLG  P
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR
        SEP+KHSE+SHHKFKRENLQN ERSKKWKGSQD TEALNQNNNKTVPKMSKHWGSWAQALY+TAMHPE+HSDTVLETSDDVVVLNDIYPKARKHLLVVAR
Subjt:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR

Query:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI
         EGLDQLADVC EHL LLRTMHAVGLKWIDKFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVAVMDEVSSHGKAS+
Subjt:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI

Query:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        MD+ESLMSMELRCNRCRSAHPNLPKLKAHI KCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0092.12Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQRADFVKL  P+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSETRSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE DVQS IDMYKSLDLH+MLP GCFGQKNPDKKVQLGIMKFLKKAEKPSET SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSETRSSAN

Query:  TVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
        TVKDSP  Q TQEKSDSCDKKE+ ACT+SRNVD+ES+KGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
Subjt:  TVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD

Query:  LSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVV
        LSHGS ILS+VKAKAAKKNISSTKFFTFVGDITKLNSEGGL CNVIANAANW          RL PGGGGVNAAIFSAAG GLEVATKQQANSLQPGN V
Subjt:  LSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVV

Query:  AVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQN
        AV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLRNAY SLFQAFIS+VEDKFKSVKGIH RLG  PSEP+KHSE+SHHKFKRENLQN
Subjt:  AVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQN

Query:  LERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTM
         ERSKKWKGSQD TEALNQNNNKTVPKMSKHWGSWAQALY+TAMHPE+HSDTVLETSDDVVVLNDIYPKARKHLLVVAR EGLDQLADVC EHL LLRTM
Subjt:  LERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTM

Query:  HAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHP
        HAVGLKWIDKFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVAVMDEVSSHGKAS+MD+ESLMSMELRCNRCRSAHP
Subjt:  HAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHP

Query:  NLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        NLPKLKAHI KCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  NLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0089.17Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPT Q TQEK +SC KKE+ +CTMSRNV MESEKGESPG+RSL+D ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANW          RL PGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        G GLEVATKQQANSLQPGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGIHE LGS P
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR
         E QKHSED HHKFKRENLQNLERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR
Subjt:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR

Query:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI
         EGLDQLADVC EHL LLRTMHA+GLKWI+KFF ED  LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EVSSHGKA+I
Subjt:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI

Query:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        MD+ESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A1S3B6C4 transcription factor bHLH1400.0e+0089.17Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPT Q TQEK +SCDKKE+ +C MSRNV MESEKGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANW          RL PGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        GPGLEVATKQQANSL PGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGI+E LGS P
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR
         EPQKHSE+SHHKFKRENLQNLE SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR
Subjt:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR

Query:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI
         EGLDQLADVC EHL LLRTMHA+GLKWI KFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA I
Subjt:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI

Query:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        MD+E L+SMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0089.17Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPT Q TQEK +SCDKKE+ +C MSRNV MESEKGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANW          RL PGGGGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        GPGLEVATKQQANSL PGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGI+E LGS P
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR
         EPQKHSE+SHHKFKRENLQNLE SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR
Subjt:  SEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVAR

Query:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI
         EGLDQLADVC EHL LLRTMHA+GLKWI KFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA I
Subjt:  REGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASI

Query:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        MD+E L+SMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  MDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A6J1CV45 transcription factor bHLH1400.0e+0084.1Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NST KG EGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE DVQS ID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP LQT +E S SCDKKE+PACT+  NVD ESEKGE+PGVRSL D+IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA
        IVE VEEFMDKLGNARLVLVDL++GS +LS+VKAKAAKK I+ +KFFTFVGDITKLNSEGGL CNVIANAANW          RL PG GGVNAAIFSAA
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAA

Query:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP
        GPGLEVATKQQANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLR AY SLFQ FISIVE++FKSVKGI + LGSAP
Subjt:  GPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAP

Query:  SEPQKHSEDS--------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKAR
        SE +KHSEDS         HKFKRE++QN ERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALY+TAMHPERH DTVLE SDDV VLNDIY KA 
Subjt:  SEPQKHSEDS--------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKAR

Query:  KHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEV
        KHLLVVAR EGLDQLADV REHL LLRTMH VGLKWIDKFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VMDEV
Subjt:  KHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEV

Query:  SSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN
         SHGKASI D+ESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLVIAPSN
Subjt:  SSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN

A0A6J1GCV0 transcription factor bHLH1400.0e+0083.27Show/hide
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD DENST KG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        S  VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E DVQS ID YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEII
        LGIM+FLKKAE P++T S+ANT KD P+ QT +      +K++ +CTM  NV+ ESEKGE+PGV SLE+NIS SDPPTLAFPSISTSDFKFSHEKAAEII
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEII

Query:  VEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAG
        VE VEEFMDKLGNARLV+VDLSHGS ILS+VKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANW          RL PGGGGVNAAIFSAAG
Subjt:  VEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAG

Query:  PGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPS
        P LE ATKQQA SL+PGNVVAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLR+AY SLFQAFISIV+D+FKS KGI ERLGSAPS
Subjt:  PGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPS

Query:  EPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARR
        E +KHSED+HHKFKR  LQ  ERSKKWKG+Q+  EALNQNNNK   KMSKHWGSWAQALY+TAM+PERH++ VLETSDDVVVLNDIYPKARKHLL+VAR 
Subjt:  EPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARR

Query:  EGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIM
        EGLDQLADV +EHL LL+TMHAVG+KWIDKF H+DA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA I 
Subjt:  EGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIM

Query:  DNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNA
        D+ESLMSME RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV A  N+
Subjt:  DNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNA

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)2.7e-3434.24Show/hide
Query:  MNPQRPNYLNHDYDEGCKL----LRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNN
        +NP   + ++   DE  K+    + +    L+   +   E+  +SV    E++       ++  EDS      EN+    +  +   +Q  +  L  + +
Subjt:  MNPQRPNYLNHDYDEGCKL----LRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNN

Query:  KTVP-----KMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDAL
           P        +H G W+Q L  +   P+      +   +  VV+ D YPKAR H LV+   + +  L  V REHL LL  MHAVG K I +   +++ 
Subjt:  KTVP-----KMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDAL

Query:  LVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNES-LMSMELRCNRCRSAHPNLPKLKAHISK
        L FR+GYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S  V++ V S GK ++ D  S L+ + LRC+ C+     +P+LK H+ K
Subjt:  LVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNES-LMSMELRCNRCRSAHPNLPKLKAHISK

Q7YRZ2 Aprataxin9.3e-3540.95Show/hide
Query:  RSKKWKGSQDPTE---------------ALNQNNNKTVPKMSK-------HWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARR
        R +K  GS DPTE                 N +     PK  K         G W+Q L  +   P+      +   + VVV+ D YPKAR H LV+   
Subjt:  RSKKWKGSQDPTE---------------ALNQNNNKTVPKMSK-------HWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARR

Query:  EGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIM
          +  L  V REHL LLR MHAVG K I  F        FR+GYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S AV++ V   G+ ++ 
Subjt:  EGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIM

Query:  DN-ESLMSMELRCNRCRSAHPNLPKLKAHISK
        D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  DN-ESLMSMELRCNRCRSAHPNLPKLKAHISK

Q8K4H4 Aprataxin1.2e-3444.85Show/hide
Query:  PKMSKH-------WGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALL
        PK  KH        G W+Q L  +   P+      +   D VVV+ D YPKAR H LV+     +  L  V  EHL LL+ MHAVG K I  F     L 
Subjt:  PKMSKH-------WGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALL

Query:  VFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHISK
         FR+GYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S AV+  V   G+ ++ D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  VFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHISK

Q9BGQ0 Aprataxin1.2e-3442.16Show/hide
Query:  GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWI
        GS     ++     K  P   +  G W+Q L  +   P+      +   + VVV+ D YPKAR H LV+     +  L  V REHL LL+ MH VG K I
Subjt:  GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWI

Query:  DKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDN-ESLMSMELRCNRCRSAHPNLPKLKA
          F     L  FR+GYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S AV++ V   G+ ++ D    L+ + LRC+ C+   P++P+LK 
Subjt:  DKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDN-ESLMSMELRCNRCRSAHPNLPKLKA

Query:  HISK
        H+ K
Subjt:  HISK

Q9M041 Transcription factor bHLH1405.5e-23759.02Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQR++F+KLG P+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ADV + ++MY  L   + LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL
              V   P   + E +++  K ++    M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG ARL
Subjt:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL

Query:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQP
        VLVDLS GS ILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NW          RL PGGGGVNAAIF AAGP LE AT+ +AN+L P
Subjt:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQP

Query:  GNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRE
        G  V VPLPST PL N EG+THVIHVLGPNMNP RP+ LN+DY +GCK LR AY SLF+ F+S+V+D+ K  K   +   S   E  K  EDS       
Subjt:  GNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRE

Query:  NLQNLERSKKWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVC
             ER+KK+KGSQD     N      ++   +  KMSK W +WA AL+  AMHPERH + VLE  D++VV+ND YPKARKH+LV+AR+E LD L DV 
Subjt:  NLQNLERSKKWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVC

Query:  REHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMEL
        +E+L LL+ MH VGLKW+D+F +EDA L+FR+GYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA++  +E L+  EL
Subjt:  REHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        RCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like3.9e-23859.02Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQR++F+KLG P+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ADV + ++MY  L   + LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL
              V   P   + E +++  K ++    M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG ARL
Subjt:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL

Query:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQP
        VLVDLS GS ILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NW          RL PGGGGVNAAIF AAGP LE AT+ +AN+L P
Subjt:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQP

Query:  GNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRE
        G  V VPLPST PL N EG+THVIHVLGPNMNP RP+ LN+DY +GCK LR AY SLF+ F+S+V+D+ K  K   +   S   E  K  EDS       
Subjt:  GNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRE

Query:  NLQNLERSKKWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVC
             ER+KK+KGSQD     N      ++   +  KMSK W +WA AL+  AMHPERH + VLE  D++VV+ND YPKARKH+LV+AR+E LD L DV 
Subjt:  NLQNLERSKKWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVC

Query:  REHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMEL
        +E+L LL+ MH VGLKW+D+F +EDA L+FR+GYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA++  +E L+  EL
Subjt:  REHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV
        RCNRCRSAHPN+PKLK+H+  C + FP  LL+  RLV
Subjt:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATACCGACGAGAATTCGACAGTCAAAGGAACGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACC
TTCTGCGAACTTGTAATGGGCTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGATACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGT
GCAGCGAGTGCACTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGTAGCCCTCAAGTGGAT
GTACATGCTGTTGTATTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTT
GTGAATAAGATGCTGCAAAAGAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGGCCGACGTTCAATCCACTATAGATATG
TACAAATCACTTGATTTACATAATATGCTTCCACATGGATGTTTTGGACAGAAGAATCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCA
GAAAAACCTTCTGAGACGCGTTCTAGTGCTAATACTGTTAAGGATTCTCCAACTCTGCAAACCCAGGAAAAGAGCGACTCTTGTGATAAAAAGGAAGATCCTGCC
TGCACAATGTCGAGAAATGTAGATATGGAGTCGGAGAAAGGTGAAAGTCCAGGCGTTAGATCCTTAGAAGACAATATTTCCCAAAGTGATCCCCCAACTCTTGCA
TTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCCATGAAAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGA
CTTGTACTGGTAGACTTGAGTCATGGATCAACGATATTGTCTATGGTTAAAGCTAAAGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGT
GATATAACTAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTATAGCCAATGCTGCAAACTGGAAGATACTTAATAAAGATATTCCGCACAGGCGACTGAAT
CCGGGAGGCGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAACTCCCTTCAACCTGGCAATGTAGTG
GCCGTTCCGTTGCCTTCAACTTCTCCGTTGTTCAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCCAACATGAATCCACAAAGGCCAAATTATCTC
AACCATGACTATGATGAAGGTTGCAAGCTTCTGCGCAATGCTTACTTTTCCCTATTTCAAGCCTTTATTTCAATCGTAGAAGACAAATTTAAGTCGGTTAAGGGA
ATTCACGAACGCCTCGGCTCGGCACCTTCAGAGCCACAAAAGCATTCTGAGGACAGCCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAAGAAGCAAA
AAGTGGAAAGGATCTCAGGACCCAACTGAAGCATTAAACCAAAACAACAATAAGACTGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTAC
GACACTGCAATGCATCCGGAGCGACATAGTGATACTGTACTGGAGACATCAGATGATGTTGTAGTACTCAATGATATTTATCCAAAGGCACGCAAGCATCTTCTA
GTAGTGGCTCGGCGTGAAGGCCTCGACCAACTAGCCGATGTATGTAGAGAACACCTTTCATTGTTGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAG
TTCTTTCATGAAGATGCATTGTTGGTTTTTCGCATCGGATACCACTCGGCTCCATCAATGAGGCAACTGCACCTACATGTTATAAGCCAGGATTTCGACTCCAGT
CATCTGAAAAACAAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGATTCGGTTGCCGTTATGGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATG
GACAATGAGAGCTTGATGTCTATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCGCATATTTCCAAATGCCAAGCGCCT
TTCCCTTCCACACTGCTCGAGGGCGGTCGTTTAGTGATTGCGCCAAGTAATGCTCCTCTATCTTAG
mRNA sequenceShow/hide mRNA sequence
ACGTGCTAAATGCGTAGCCAAACACTCCCGTAGCAGCAAATTTGGGGAGGGAACTGACCGTGGCGTATTCTTCAAGCTTCACTCCTCGGCAGTGCGAACGTCGGT
AATGGCGATTGATAGGGTTTAAAAGGAAGAACTAAAATCTAATCAAGAAATGGACATGGATACCGACGAGAATTCGACAGTCAAAGGAACGGAAGGGCAAGGGAA
GCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTGCGAACTTGTAATGGGCTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGA
TACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCGAGTGCACTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGA
AATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGTAGCCCTCAAGTGGATGTACATGCTGTTGTATTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTTAA
GCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAGATGCTGCAAAAGAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCG
CATAACCTTTTGCCACAATGAGGCCGACGTTCAATCCACTATAGATATGTACAAATCACTTGATTTACATAATATGCTTCCACATGGATGTTTTGGACAGAAGAA
TCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCTGAGACGCGTTCTAGTGCTAATACTGTTAAGGATTCTCCAACTCT
GCAAACCCAGGAAAAGAGCGACTCTTGTGATAAAAAGGAAGATCCTGCCTGCACAATGTCGAGAAATGTAGATATGGAGTCGGAGAAAGGTGAAAGTCCAGGCGT
TAGATCCTTAGAAGACAATATTTCCCAAAGTGATCCCCCAACTCTTGCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCCATGAAAAGGCTGCTGAAAT
TATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTACTGGTAGACTTGAGTCATGGATCAACGATATTGTCTATGGTTAAAGCTAA
AGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGTGATATAACTAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTATAGCCAATGC
TGCAAACTGGAAGATACTTAATAAAGATATTCCGCACAGGCGACTGAATCCGGGAGGCGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGA
AGTGGCAACTAAACAACAAGCAAACTCCCTTCAACCTGGCAATGTAGTGGCCGTTCCGTTGCCTTCAACTTCTCCGTTGTTCAATAGGGAAGGAGTAACCCATGT
CATACATGTTCTTGGACCCAACATGAATCCACAAAGGCCAAATTATCTCAACCATGACTATGATGAAGGTTGCAAGCTTCTGCGCAATGCTTACTTTTCCCTATT
TCAAGCCTTTATTTCAATCGTAGAAGACAAATTTAAGTCGGTTAAGGGAATTCACGAACGCCTCGGCTCGGCACCTTCAGAGCCACAAAAGCATTCTGAGGACAG
CCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAAGAAGCAAAAAGTGGAAAGGATCTCAGGACCCAACTGAAGCATTAAACCAAAACAACAATAAGAC
TGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTACGACACTGCAATGCATCCGGAGCGACATAGTGATACTGTACTGGAGACATCAGATGA
TGTTGTAGTACTCAATGATATTTATCCAAAGGCACGCAAGCATCTTCTAGTAGTGGCTCGGCGTGAAGGCCTCGACCAACTAGCCGATGTATGTAGAGAACACCT
TTCATTGTTGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAGTTCTTTCATGAAGATGCATTGTTGGTTTTTCGCATCGGATACCACTCGGCTCCATC
AATGAGGCAACTGCACCTACATGTTATAAGCCAGGATTTCGACTCCAGTCATCTGAAAAACAAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGATTC
GGTTGCCGTTATGGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACAATGAGAGCTTGATGTCTATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCA
TCCCAACTTACCCAAATTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTCGAGGGCGGTCGTTTAGTGATTGCGCCAAGTAATGCTCC
TCTATCTTAGTTTTCATAATGTCTTTCACACACACAAATTTTGTTGTAAATATTTTGAGCTCAATATCTTTGGGGTTTATGTAGTTTGATGTTTTCGGGAAGCAT
TGGGGTGTGTTTAGTTGAACTTTTCAAGTGTTTAATTTTGAAAATAAGTCATTTT
Protein sequenceShow/hide protein sequence
MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVD
VHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKA
EKPSETRSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR
LVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWKILNKDIPHRRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVV
AVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSK
KWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDK
FFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAP
FPSTLLEGGRLVIAPSNAPLS