CuGenDBv2

HG10020427 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020427
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: transcription factor bHLH140
Genome location: Chr04:31767927..31771273
RNA-Seq Expression: HG10020427
Synteny: HG10020427
Gene Ontology terms:
GO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domains:
IPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]; e value: 0.0e+00; % identity: 90.36
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPT Q TQEK +SCDKKE+ +C MSRNV MESEKGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANWRL PGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSL PGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGI+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV
        HHKFKRENLQNLE SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV

Query:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME
        C EHL LLRTMHA+GLKWI KFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA IMD+E L+SME
Subjt:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]; e value: 0.0e+00; % identity: 90.36
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPT Q TQEK +SC KKE+ +CTMSRNV MESEKGESPG+RSL+D ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANWRL PGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSLQPGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGIHE LGS P E QKHSED 
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV
        HHKFKRENLQNLERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV

Query:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME
        C EHL LLRTMHA+GLKWI+KFF ED  LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EVSSHGKA+IMD+ESLMSME
Subjt:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]; e value: 0.0e+00; % identity: 85.22
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NST KG EGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE DVQS ID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP LQT +E S SCDKKE+PACT+  NVD ESEKGE+PGVRSL D+IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GS +LS+VKAKAAKK I+ +KFFTFVGDITKLNSEGGL CNVIANAANWRL PG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLR AY SLFQ FISIVE++FKSVKGI + LGSAPSE +KHSEDS
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  --------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARRE
                 HKFKRE++QN ERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALY+TAMHPERH DTVLE SDDV VLNDIY KA KHLLVVAR E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARRE

Query:  GLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD
        GLDQLADV REHL LLRTMH VGLKWIDKFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VMDEV SHGKASI D
Subjt:  GLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD

Query:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN
        +ESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLVIAPSN
Subjt:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]; e value: 0.0e+00; % identity: 93.31
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENST KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQRADFVKL 
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         P+VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE DVQS IDMYKSLDLH+MLP GCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAEKPSET SSANTVKDSP  Q TQEKSDSCDKKE+ ACT+SRNVD+ES+KGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGS ILS+VKAKAAKKNISSTKFFTFVGDITKLNSEGGL CNVIANAANWRL PGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSLQPGN VAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLRNAY SLFQAFIS+VEDKFKSVKGIH RLG  PSEP+KHSE+S
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV
        HHKFKRENLQN ERSKKWKGSQD TEALNQNNNKTVPKMSKHWGSWAQALY+TAMHPE+HSDTVLETSDDVVVLNDIYPKARKHLLVVAR EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV

Query:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME
        C EHL LLRTMHAVGLKWIDKFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVAVMDEVSSHGKAS+MD+ESLMSME
Subjt:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAHI KCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]; e value: 0.0e+00; % identity: 93.39
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGK+VFVDRCNLEIEQRADFVKL  P+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSETRSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE DVQS IDMYKSLDLH+MLP GCFGQKNPDKKVQLGIMKFLKKAEKPSET SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSETRSSAN

Query:  TVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
        TVKDSP  Q TQEKSDSCDKKE+ ACT+SRNVD+ES+KGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
Subjt:  TVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD

Query:  LSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVVAVPLPSTSPL
        LSHGS ILS+VKAKAAKKNISSTKFFTFVGDITKLNSEGGL CNVIANAANWRL PGGGGVNAAIFSAAG GLEVATKQQANSLQPGN VAV LPSTSPL
Subjt:  LSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVVAVPLPSTSPL

Query:  FNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKKWKGS
        FNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLRNAY SLFQAFIS+VEDKFKSVKGIH RLG  PSEP+KHSE+SHHKFKRENLQN ERSKKWKGS
Subjt:  FNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKKWKGS

Query:  QDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDK
        QD TEALNQNNNKTVPKMSKHWGSWAQALY+TAMHPE+HSDTVLETSDDVVVLNDIYPKARKHLLVVAR EGLDQLADVC EHL LLRTMHAVGLKWIDK
Subjt:  QDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDK

Query:  FFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHIS
        FFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVAVMDEVSSHGKAS+MD+ESLMSMELRCNRCRSAHPNLPKLKAHI 
Subjt:  FFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHIS

Query:  KCQAPFPSTLLEGGRLVIAPSNAPLS
        KCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  KCQAPFPSTLLEGGRLVIAPSNAPLS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L9U1 Uncharacterized protein; e value: 0.0e+00; % identity: 90.36
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPT Q TQEK +SC KKE+ +CTMSRNV MESEKGESPG+RSL+D ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANWRL PGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSLQPGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGIHE LGS P E QKHSED 
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV
        HHKFKRENLQNLERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV

Query:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME
        C EHL LLRTMHA+GLKWI+KFF ED  LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EVSSHGKA+IMD+ESLMSME
Subjt:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A1S3B6C4 transcription factor bHLH140; e value: 0.0e+00; % identity: 90.36
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPT Q TQEK +SCDKKE+ +C MSRNV MESEKGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANWRL PGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSL PGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGI+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV
        HHKFKRENLQNLE SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV

Query:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME
        C EHL LLRTMHA+GLKWI KFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA IMD+E L+SME
Subjt:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A5A7TLV2 Transcription factor bHLH140; e value: 0.0e+00; % identity: 90.36
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS  KG EGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
         PQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV S IDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPT Q TQEK +SCDKKE+ +C MSRNV MESEKGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQ-TQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGS ILSMVKAKA +KNISSTKFFTFVGDITKLNSEGGL CNVIANAANWRL PGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSL PGN VAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLL NAY SLFQAFISIV+DK+KSVKGI+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV
        HHKFKRENLQNLE SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALYDTAMHPERH+++VLETSDDVVVL DIYPKARKHLLVVAR EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADV

Query:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME
        C EHL LLRTMHA+GLKWI KFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA IMD+E L+SME
Subjt:  CREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A6J1CV45 transcription factor bHLH140; e value: 0.0e+00; % identity: 85.22
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NST KG EGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE DVQS ID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP LQT +E S SCDKKE+PACT+  NVD ESEKGE+PGVRSL D+IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQT-QEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GS +LS+VKAKAAKK I+ +KFFTFVGDITKLNSEGGL CNVIANAANWRL PG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS
        QANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLR AY SLFQ FISIVE++FKSVKGI + LGSAPSE +KHSEDS
Subjt:  QANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDS

Query:  --------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARRE
                 HKFKRE++QN ERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALY+TAMHPERH DTVLE SDDV VLNDIY KA KHLLVVAR E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARRE

Query:  GLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD
        GLDQLADV REHL LLRTMH VGLKWIDKFFHEDA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VMDEV SHGKASI D
Subjt:  GLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD

Query:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN
        +ESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLVIAPSN
Subjt:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSN

A0A6J1GCV0 transcription factor bHLH140; e value: 0.0e+00; % identity: 84.39
Query:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD DENST KG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        S  VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E DVQS ID YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  SPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEII
        LGIM+FLKKAE P++T S+ANT KD P+ QT +      +K++ +CTM  NV+ ESEKGE+PGV SLE+NIS SDPPTLAFPSISTSDFKFSHEKAAEII
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEII

Query:  VEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQ
        VE VEEFMDKLGNARLV+VDLSHGS ILS+VKAKAAKKNI STKFFTFVGDITKL S+GGL CNVIANAANWRL PGGGGVNAAIFSAAGP LE ATKQQ
Subjt:  VEKVEEFMDKLGNARLVLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQ

Query:  ANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSH
        A SL+PGNVVAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLN+DYDEGCKLLR+AY SLFQAFISIV+D+FKS KGI ERLGSAPSE +KHSED+H
Subjt:  ANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSH

Query:  HKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVC
        HKFKR  LQ  ERSKKWKG+Q+  EALNQNNNK   KMSKHWGSWAQALY+TAM+PERH++ VLETSDDVVVLNDIYPKARKHLL+VAR EGLDQLADV 
Subjt:  HKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVC

Query:  REHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMEL
        +EHL LL+TMHAVG+KWIDKF H+DA LVFR+GYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA I D+ESLMSME 
Subjt:  REHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNA
        RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV A  N+
Subjt:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNA

SwissProt top hits (columns: e value, % identity, alignment)
P61798 Aprataxin (Fragment); e value: 2.7e-34; % identity: 34.24
Query:  MNPQRPNYLNHDYDEGCKL----LRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNN
        +NP   + ++   DE  K+    + +    L+   +   E+  +SV    E++       ++  EDS      EN+    +  +   +Q  +  L  + +
Subjt:  MNPQRPNYLNHDYDEGCKL----LRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNN

Query:  KTVP-----KMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDAL
           P        +H G W+Q L  +   P+      +   +  VV+ D YPKAR H LV+   + +  L  V REHL LL  MHAVG K I +   +++ 
Subjt:  KTVP-----KMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDAL

Query:  LVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNES-LMSMELRCNRCRSAHPNLPKLKAHISK
        L FR+GYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S  V++ V S GK ++ D  S L+ + LRC+ C+     +P+LK H+ K
Subjt:  LVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNES-LMSMELRCNRCRSAHPNLPKLKAHISK

Q7YRZ2 Aprataxin; e value: 9.2e-35; % identity: 40.95
Query:  RSKKWKGSQDPTE---------------ALNQNNNKTVPKMSK-------HWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARR
        R +K  GS DPTE                 N +     PK  K         G W+Q L  +   P+      +   + VVV+ D YPKAR H LV+   
Subjt:  RSKKWKGSQDPTE---------------ALNQNNNKTVPKMSK-------HWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARR

Query:  EGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIM
          +  L  V REHL LLR MHAVG K I  F        FR+GYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S AV++ V   G+ ++ 
Subjt:  EGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIM

Query:  DN-ESLMSMELRCNRCRSAHPNLPKLKAHISK
        D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  DN-ESLMSMELRCNRCRSAHPNLPKLKAHISK

Q8K4H4 Aprataxin; e value: 1.2e-34; % identity: 44.85
Query:  PKMSKH-------WGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALL
        PK  KH        G W+Q L  +   P+      +   D VVV+ D YPKAR H LV+     +  L  V  EHL LL+ MHAVG K I  F     L 
Subjt:  PKMSKH-------WGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALL

Query:  VFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHISK
         FR+GYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S AV+  V   G+ ++ D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  VFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHISK

Q9BGQ0 Aprataxin; e value: 1.2e-34; % identity: 42.16
Query:  GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWI
        GS     ++     K  P   +  G W+Q L  +   P+      +   + VVV+ D YPKAR H LV+     +  L  V REHL LL+ MH VG K I
Subjt:  GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWI

Query:  DKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDN-ESLMSMELRCNRCRSAHPNLPKLKA
          F     L  FR+GYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S AV++ V   G+ ++ D    L+ + LRC+ C+   P++P+LK 
Subjt:  DKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDN-ESLMSMELRCNRCRSAHPNLPKLKA

Query:  HISK
        H+ K
Subjt:  HISK

Q9M041 Transcription factor bHLH140; e value: 9.9e-239; % identity: 59.83
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQR++F+KLG P+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ADV + ++MY  L   + LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL
              V   P   + E +++  K ++    M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG ARL
Subjt:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL

Query:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVVAVPLPS
        VLVDLS GS ILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRL PGGGGVNAAIF AAGP LE AT+ +AN+L PG  V VPLPS
Subjt:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVVAVPLPS

Query:  TSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKK
        T PL N EG+THVIHVLGPNMNP RP+ LN+DY +GCK LR AY SLF+ F+S+V+D+ K  K   +   S   E  K  EDS            ER+KK
Subjt:  TSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKK

Query:  WKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTM
        +KGSQD     N      ++   +  KMSK W +WA AL+  AMHPERH + VLE  D++VV+ND YPKARKH+LV+AR+E LD L DV +E+L LL+ M
Subjt:  WKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTM

Query:  HAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHP
        H VGLKW+D+F +EDA L+FR+GYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA++  +E L+  ELRCNRCRSAHP
Subjt:  HAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHP

Query:  NLPKLKAHISKCQAPFPSTLLEGGRLV
        N+PKLK+H+  C + FP  LL+  RLV
Subjt:  NLPKLKAHISKCQAPFPSTLLEGGRLV

Arabidopsis top hits    e value    %identity    Alignment
AT5G01310.1 APRATAXIN-like    7.0e-240    59.83
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L +GKSVF+DRCNL+ EQR++F+KLG P+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ADV + ++MY  L   + LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL
              V   P   + E +++  K ++    M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG ARL
Subjt:  RSSANTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARL

Query:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVVAVPLPS
        VLVDLS GS ILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRL PGGGGVNAAIF AAGP LE AT+ +AN+L PG  V VPLPS
Subjt:  VLVDLSHGSTILSMVKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVVAVPLPS

Query:  TSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKK
        T PL N EG+THVIHVLGPNMNP RP+ LN+DY +GCK LR AY SLF+ F+S+V+D+ K  K   +   S   E  K  EDS            ER+KK
Subjt:  TSPLFNREGVTHVIHVLGPNMNPQRPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKK

Query:  WKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTM
        +KGSQD     N      ++   +  KMSK W +WA AL+  AMHPERH + VLE  D++VV+ND YPKARKH+LV+AR+E LD L DV +E+L LL+ M
Subjt:  WKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYDTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTM

Query:  HAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHP
        H VGLKW+D+F +EDA L+FR+GYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA++  +E L+  ELRCNRCRSAHP
Subjt:  HAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHP

Query:  NLPKLKAHISKCQAPFPSTLLEGGRLV
        N+PKLK+H+  C + FP  LL+  RLV
Subjt:  NLPKLKAHISKCQAPFPSTLLEGGRLV


Sequences
CDS sequence
ATGGACATGGATACCGACGAGAATTCGACAGTCAAAGGAACGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTG
CGAACTTGTAATGGGCTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGATACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCGAGTG
CACTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGTAGCCCTCAAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAGATGCTGCAAAA
GAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGGCCGACGTTCAATCCACTATAGATATGTACAAATCACTTGATTTACATAATA
TGCTTCCACATGGATGTTTTGGACAGAAGAATCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCTGAGACGCGTTCTAGTGCT
AATACTGTTAAGGATTCTCCAACTCTGCAAACCCAGGAAAAGAGCGACTCTTGTGATAAAAAGGAAGATCCTGCCTGCACAATGTCGAGAAATGTAGATATGGAGTCGGA
GAAAGGTGAAAGTCCAGGCGTTAGATCCTTAGAAGACAATATTTCCCAAAGTGATCCCCCAACTCTTGCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCCATG
AAAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTACTGGTAGACTTGAGTCATGGATCAACGATATTGTCTATG
GTTAAAGCTAAAGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGTGATATAACTAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTATAGC
CAATGCTGCAAACTGGCGACTGAATCCGGGAGGCGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAACTCCC
TTCAACCTGGCAATGTAGTGGCCGTTCCGTTGCCTTCAACTTCTCCGTTGTTCAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCCAACATGAATCCACAA
AGGCCAAATTATCTCAACCATGACTATGATGAAGGTTGCAAGCTTCTGCGCAATGCTTACTTTTCCCTATTTCAAGCCTTTATTTCAATCGTAGAAGACAAATTTAAGTC
GGTTAAGGGAATTCACGAACGCCTCGGCTCGGCACCTTCAGAGCCACAAAAGCATTCTGAGGACAGCCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAAGAA
GCAAAAAGTGGAAAGGATCTCAGGACCCAACTGAAGCATTAAACCAAAACAACAATAAGACTGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTAC
GACACTGCAATGCATCCGGAGCGACATAGTGATACTGTACTGGAGACATCAGATGATGTTGTAGTACTCAATGATATTTATCCAAAGGCACGCAAGCATCTTCTAGTAGT
GGCTCGGCGTGAAGGCCTCGACCAACTAGCCGATGTATGTAGAGAACACCTTTCATTGTTGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAGTTCTTTCATG
AAGATGCATTGTTGGTTTTTCGCATCGGATACCACTCGGCTCCATCAATGAGGCAACTGCACCTACATGTTATAAGCCAGGATTTCGACTCCAGTCATCTGAAAAACAAG
AAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGATTCGGTTGCCGTTATGGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACAATGAGAGCTTGATGTC
TATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTCGAGGGCG
GTCGTTTAGTGATTGCGCCAAGTAATGCTCCTCTATCTTAG
mRNA sequence
ATGGACATGGATACCGACGAGAATTCGACAGTCAAAGGAACGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTG
CGAACTTGTAATGGGCTCCTCTTCTCGCCCTTGGGTTCGAATCTGCCAGGATACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGTGCAGCGAGTG
CACTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGTAGCCCTCAAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAGATGCTGCAAAA
GAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGGCCGACGTTCAATCCACTATAGATATGTACAAATCACTTGATTTACATAATA
TGCTTCCACATGGATGTTTTGGACAGAAGAATCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCTGAGACGCGTTCTAGTGCT
AATACTGTTAAGGATTCTCCAACTCTGCAAACCCAGGAAAAGAGCGACTCTTGTGATAAAAAGGAAGATCCTGCCTGCACAATGTCGAGAAATGTAGATATGGAGTCGGA
GAAAGGTGAAAGTCCAGGCGTTAGATCCTTAGAAGACAATATTTCCCAAAGTGATCCCCCAACTCTTGCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCCATG
AAAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTACTGGTAGACTTGAGTCATGGATCAACGATATTGTCTATG
GTTAAAGCTAAAGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGTGATATAACTAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTTATAGC
CAATGCTGCAAACTGGCGACTGAATCCGGGAGGCGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAACTCCC
TTCAACCTGGCAATGTAGTGGCCGTTCCGTTGCCTTCAACTTCTCCGTTGTTCAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCCAACATGAATCCACAA
AGGCCAAATTATCTCAACCATGACTATGATGAAGGTTGCAAGCTTCTGCGCAATGCTTACTTTTCCCTATTTCAAGCCTTTATTTCAATCGTAGAAGACAAATTTAAGTC
GGTTAAGGGAATTCACGAACGCCTCGGCTCGGCACCTTCAGAGCCACAAAAGCATTCTGAGGACAGCCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAAGAA
GCAAAAAGTGGAAAGGATCTCAGGACCCAACTGAAGCATTAAACCAAAACAACAATAAGACTGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTAC
GACACTGCAATGCATCCGGAGCGACATAGTGATACTGTACTGGAGACATCAGATGATGTTGTAGTACTCAATGATATTTATCCAAAGGCACGCAAGCATCTTCTAGTAGT
GGCTCGGCGTGAAGGCCTCGACCAACTAGCCGATGTATGTAGAGAACACCTTTCATTGTTGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAGTTCTTTCATG
AAGATGCATTGTTGGTTTTTCGCATCGGATACCACTCGGCTCCATCAATGAGGCAACTGCACCTACATGTTATAAGCCAGGATTTCGACTCCAGTCATCTGAAAAACAAG
AAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGATTCGGTTGCCGTTATGGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACAATGAGAGCTTGATGTC
TATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTCGAGGGCG
GTCGTTTAGTGATTGCGCCAAGTAATGCTCCTCTATCTTAG
Protein sequence
MDMDTDENSTVKGTEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNDGKSVFVDRCNLEIEQRADFVKLGSPQVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNEADVQSTIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSETRSSA
NTVKDSPTLQTQEKSDSCDKKEDPACTMSRNVDMESEKGESPGVRSLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVDLSHGSTILSM
VKAKAAKKNISSTKFFTFVGDITKLNSEGGLCCNVIANAANWRLNPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNVVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQ
RPNYLNHDYDEGCKLLRNAYFSLFQAFISIVEDKFKSVKGIHERLGSAPSEPQKHSEDSHHKFKRENLQNLERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALY
DTAMHPERHSDTVLETSDDVVVLNDIYPKARKHLLVVARREGLDQLADVCREHLSLLRTMHAVGLKWIDKFFHEDALLVFRIGYHSAPSMRQLHLHVISQDFDSSHLKNK
KHWNSFNTDFFRDSVAVMDEVSSHGKASIMDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVIAPSNAPLS