; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0015664 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0015664
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptiontranscription factor bHLH140
Genome locationchr08:23039975..23043346
RNA-Seq ExpressionPI0015664
SyntenyPI0015664
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.0e+0094.38Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMD DENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADF+KLG
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDV SA+DMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PSKTCS AN D NSPTPQPTQEKRESC KKEES+C MSRNVA E EKGESPGVR LE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVL+DLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAY+SLFQAFISIVQDKYKSVKGI+ECLGSTPPEPQKHSE+S
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERH N+VLETSDDVVVL DIYPKARKHLLV+ARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME
        CTEHLPLLRTMHAMGLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKA IMDD+ L+SME
Subjt:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME

Query:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS
        LRCN+CRSAHPNLPKLKAH+SKCQAPFPSTLLE GRLV  PSNAPLS
Subjt:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.0e+0094.51Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMD DENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADF+KLG
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDV SA+DMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPSKTCS ANTD NSPTPQPTQEKRESCGKKEES+CTMSRNVA E EKGESPG+R L+D ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVL+DLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAY+SLFQAFISIVQDKYKSVKGIHECLGSTPPE QKHSED 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV
        HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERH N+VLETSDDVVVL DIYPKARKHLLV+ARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME
        CTEHLPLLRTMHAMGLKWI KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA+IMDD+SLMSME
Subjt:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME

Query:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS
        LRCN+CRSAHPNLPKLKAH+SKCQAPFPSTLLEGGRLV  PSNAPLS
Subjt:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0082.42Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMDID+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADF+K+G
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKR+GHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DVQSA+D YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P+KTCS AN   +SP  Q ++E   SC KKEE ACT+  NV  E EKGE+PGVR L D+IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEF+DKLGNARLVL+DL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSL+PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AY+SLFQ FISIV++++KSVKGI + LGS P E +KHSEDS
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHE
                 HKFKRE++QN ERSKKWKGSQ+S E  NQNNN  V K SKHWGSWAQALY+TAMHPERH +TVLE SDDV VLNDIY KA KHLLV+AR+E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMD
        GLDQLADV  EHLPLLRTMH +GLKWI+KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V+DEV SHGKASI D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMD

Query:  DDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSN
        D+SLMSMELRCN+CRSAHPNL KLKAH+SKC+APFPSTLLEG RLV APSN
Subjt:  DDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0090.36Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMD DENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLEIEQRADF+KL 
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GP+VDV AVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DVQSA+DMYKSLDLH+MLP GCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAEKPS+T S ANT  +SP PQ TQEK +SC KKEESACT+SRNV  E +KGESPGVR LEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVL+DLSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSLQPGNAVAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAY+SLFQAFIS+V+DK+KSVKGIH  LG TP EP+KHSE+S
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV
        HHKFKRENLQN ERSKKWKGSQ+STE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H++TVLETSDDVVVLNDIYPKARKHLLV+ARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME
        C EHLPLLRTMHA+GLKWI+KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV V+DEVSSHGKAS+MDD+SLMSME
Subjt:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME

Query:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS
        LRCN+CRSAHPNLPKLKAH+ KCQAPFPSTLLEGGRLV APSNAPLS
Subjt:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0090.36Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLGGPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLEIEQRADF+KL GP+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLGGPQVDVHAVVLDLPAQLCISR

Query:  SVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKTCSGAN
        SVKR+GHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DVQSA+DMYKSLDLH+MLP GCFGQKNPDKKVQLGIMKFLKKAEKPS+T S AN
Subjt:  SVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKTCSGAN

Query:  TDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFVDKLGNARLVLLD
        T  +SP PQ TQEK +SC KKEESACT+SRNV  E +KGESPGVR LEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEF+DKLGNARLVL+D
Subjt:  TDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFVDKLGNARLVLLD

Query:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVQLPSTSPL
        LSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSLQPGNAVAVQLPSTSPL
Subjt:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVQLPSTSPL

Query:  LNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDSHHKFKRENLQNLERSKKWKGS
         NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAY+SLFQAFIS+V+DK+KSVKGIH  LG TP EP+KHSE+SHHKFKRENLQN ERSKKWKGS
Subjt:  LNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDSHHKFKRENLQNLERSKKWKGS

Query:  QNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADVCTEHLPLLRTMHAMGLKWIEK
        Q+STE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H++TVLETSDDVVVLNDIYPKARKHLLV+ARHEGLDQLADVC EHLPLLRTMHA+GLKWI+K
Subjt:  QNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADVCTEHLPLLRTMHAMGLKWIEK

Query:  FFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSMELRCNKCRSAHPNLPKLKAHVS
        FFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV V+DEVSSHGKAS+MDD+SLMSMELRCN+CRSAHPNLPKLKAH+ 
Subjt:  FFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSMELRCNKCRSAHPNLPKLKAHVS

Query:  KCQAPFPSTLLEGGRLVFAPSNAPLS
        KCQAPFPSTLLEGGRLV APSNAPLS
Subjt:  KCQAPFPSTLLEGGRLVFAPSNAPLS

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0094.51Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMD DENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADF+KLG
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDV SA+DMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPSKTCS ANTD NSPTPQPTQEKRESCGKKEES+CTMSRNVA E EKGESPG+R L+D ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVL+DLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAY+SLFQAFISIVQDKYKSVKGIHECLGSTPPE QKHSED 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV
        HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERH N+VLETSDDVVVL DIYPKARKHLLV+ARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME
        CTEHLPLLRTMHAMGLKWI KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA+IMDD+SLMSME
Subjt:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME

Query:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS
        LRCN+CRSAHPNLPKLKAH+SKCQAPFPSTLLEGGRLV  PSNAPLS
Subjt:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS

A0A1S3B6C4 transcription factor bHLH1400.0e+0094.38Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMD DENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADF+KLG
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDV SA+DMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PSKTCS AN D NSPTPQPTQEKRESC KKEES+C MSRNVA E EKGESPGVR LE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVL+DLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAY+SLFQAFISIVQDKYKSVKGI+ECLGSTPPEPQKHSE+S
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERH N+VLETSDDVVVL DIYPKARKHLLV+ARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME
        CTEHLPLLRTMHAMGLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKA IMDD+ L+SME
Subjt:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME

Query:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS
        LRCN+CRSAHPNLPKLKAH+SKCQAPFPSTLLE GRLV  PSNAPLS
Subjt:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0094.38Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMD DENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADF+KLG
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDV SA+DMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PSKTCS AN D NSPTPQPTQEKRESC KKEES+C MSRNVA E EKGESPGVR LE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVL+DLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAY+SLFQAFISIVQDKYKSVKGI+ECLGSTPPEPQKHSE+S
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERH N+VLETSDDVVVL DIYPKARKHLLV+ARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME
        CTEHLPLLRTMHAMGLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKA IMDD+ L+SME
Subjt:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME

Query:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS
        LRCN+CRSAHPNLPKLKAH+SKCQAPFPSTLLE GRLV  PSNAPLS
Subjt:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS

A0A6J1CV45 transcription factor bHLH1400.0e+0082.42Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMDID+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADF+K+G
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKR+GHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DVQSA+D YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P+KTCS AN   +SP  Q ++E   SC KKEE ACT+  NV  E EKGE+PGVR L D+IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEF+DKLGNARLVL+DL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QANSL+PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR AY+SLFQ FISIV++++KSVKGI + LGS P E +KHSEDS
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHE
                 HKFKRE++QN ERSKKWKGSQ+S E  NQNNN  V K SKHWGSWAQALY+TAMHPERH +TVLE SDDV VLNDIY KA KHLLV+AR+E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMD
        GLDQLADV  EHLPLLRTMH +GLKWI+KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V+DEV SHGKASI D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMD

Query:  DDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSN
        D+SLMSMELRCN+CRSAHPNL KLKAH+SKC+APFPSTLLEG RLV APSN
Subjt:  DDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSN

A0A6J1GCV0 transcription factor bHLH1400.0e+0082.93Show/hide
Query:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG
        MDMDIDENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQR++F+KLG
Subjt:  MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKR+GHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E+DVQSA+D YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAE P+KTCS ANT+ + P+ Q TQEK+       ES+CTM  NV  E EKGE+PGV  LE+NIS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEF+DKLGNARLV++DLSHGSKILS+VKAKA +KNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS
        QA SL+PGN VAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR+AY+SLFQAFISIV+D++KS KGI E LGS P E +KHSED+
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDS

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV
        HHKFKR  LQ  ERSKKWKG+Q S E LNQNNNK   K SKHWGSWAQALY+TAM+PERH N VLETSDDVVVLNDIYPKARKHLL++AR+EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME
          EHLPLL+TMHA+G+KWI+KF H+DA LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I DD+SLMSME
Subjt:  CTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSME

Query:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNA
         RCN+CRSAHPNLPKLK H+SKCQ+PFPSTLLEGGRLV A  N+
Subjt:  LRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNA

SwissProt top hitse value%identityAlignment
P61797 Aprataxin7.0e-3540.08Show/hide
Query:  EPQKHSEDSHHKFKRE-NLQNLERSKKWK---------GSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKA
        E +    ++H K KR  N  ++ER    +         GS +S   +  N  K  P K +  G W+Q L  +   P+      +   + VVV+ D YPKA
Subjt:  EPQKHSEDSHHKFKRE-NLQNLERSKKWK---------GSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKA

Query:  RKHLLVLARHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDE
        R H LVL     +  L  V  EHL LL+ MH +G K I  F     L  FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI+ 
Subjt:  RKHLLVLARHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDE

Query:  VSSHGKASIMDD-DSLMSMELRCNKCRSAHPNLPKLKAHVSK
        V   G+ S+ D    L+ + LRC++C+   P++P+LK H+ +
Subjt:  VSSHGKASIMDD-DSLMSMELRCNKCRSAHPNLPKLKAHVSK

P61798 Aprataxin (Fragment)2.0e-3440.26Show/hide
Query:  EDSHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVP-----KKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHE
        EDS      EN+    +  +   +Q+S+  L  + +   P      + +H G W+Q L  +   P+      +   +  VV+ D YPKAR H LVL   +
Subjt:  EDSHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVP-----KKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMD
         +  L  V  EHL LL  MHA+G K I++   +++ L FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S  VI+ V S GK ++ D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMD

Query:  DDS-LMSMELRCNKCRSAHPNLPKLKAHVSK
          S L+ + LRC+ C+     +P+LK H+ K
Subjt:  DDS-LMSMELRCNKCRSAHPNLPKLKAHVSK

Q7TQC5 Aprataxin5.4e-3543.63Show/hide
Query:  GSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADVCTEHLPLLRTMHAMGLKWI
        GS  S   ++   +K    K +  G W+Q L  +   P+      +   D VVV+ D YPKAR H LVL     +  L  V +EHL LL+ MHA+G K I
Subjt:  GSQNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADVCTEHLPLLRTMHAMGLKWI

Query:  EKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDD-SLMSMELRCNKCRSAHPNLPKLKA
          F     L  FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+ ++ D    L+ + LRC++C+   P++P+LK 
Subjt:  EKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDD-SLMSMELRCNKCRSAHPNLPKLKA

Query:  HVSK
        H+ K
Subjt:  HVSK

Q7YRZ1 Aprataxin9.2e-3542.31Show/hide
Query:  SHHKFKRE-NLQNLER--SKKWKGSQNSTEGLNQNNNKTVPKKSKHWGS-------WAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLA
        +H K KR  N  ++ER  S++ K S  +  G N +     PKK K   +       W+Q L  +   P+      +   D VVV+ D YPKAR H LVL 
Subjt:  SHHKFKRE-NLQNLER--SKKWKGSQNSTEGLNQNNNKTVPKKSKHWGS-------WAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLA

Query:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAS
            +  L  V  EHL LLR MH +G K I  F     L  FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI+ V   G+ +
Subjt:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAS

Query:  IMDD-DSLMSMELRCNKCRSAHPNLPKLKAHVSK
        + D    L+ + LRC++C+   P++P+LK H+ K
Subjt:  IMDD-DSLMSMELRCNKCRSAHPNLPKLKAHVSK

Q9M041 Transcription factor bHLH1409.9e-23959.28Show/hide
Query:  MDMDIDEN-SNAKGKEG----QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRAD
        M++ I+E+  N    E     + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++
Subjt:  MDMDIDEN-SNAKGKEG----QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRAD

Query:  FMKLGGPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNP
        F+KLGGP+ +VHAVVL+LPAQ+CISRSVKR+GHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +AV+MY  L   + LP GCFG+K  
Subjt:  FMKLGGPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNP

Query:  DKKVQLGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHE
        D K Q GIMKF KK                S  P  +  +  +  +K +    M+ NV     K  S  +            PTLAFPSIST+DF+F  E
Subjt:  DKKVQLGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHE

Query:  KAAEIIVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLE
        KA++IIVEK EEF+ KLG ARLVL+DLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE
Subjt:  KAAEIIVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLE

Query:  VATKQQANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQK
         AT+ +AN+L PG AV V LPST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY SLF+ F+S+VQD+ K  K   +   S   E  K
Subjt:  VATKQQANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQK

Query:  HSEDSHHKFKRENLQNLERSKKWKGSQ-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLA
          EDS            ER+KK+KGSQ     N+ E  +  + +   KK SK W +WA AL+  AMHPERH N VLE  D++VV+ND YPKARKH+LVLA
Subjt:  HSEDSHHKFKRENLQNLERSKKWKGSQ-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLA

Query:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAS
        R E LD L DV  E+L LL+ MH +GLKW+++F +EDA L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA+
Subjt:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAS

Query:  IMDDDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLV
        +  +D L+  ELRCN+CRSAHPN+PKLK+HV  C + FP  LL+  RLV
Subjt:  IMDDDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like7.0e-24059.28Show/hide
Query:  MDMDIDEN-SNAKGKEG----QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRAD
        M++ I+E+  N    E     + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++
Subjt:  MDMDIDEN-SNAKGKEG----QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRAD

Query:  FMKLGGPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNP
        F+KLGGP+ +VHAVVL+LPAQ+CISRSVKR+GHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +AV+MY  L   + LP GCFG+K  
Subjt:  FMKLGGPQVDVHAVVLDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNP

Query:  DKKVQLGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHE
        D K Q GIMKF KK                S  P  +  +  +  +K +    M+ NV     K  S  +            PTLAFPSIST+DF+F  E
Subjt:  DKKVQLGIMKFLKKAEKPSKTCSGANTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHE

Query:  KAAEIIVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLE
        KA++IIVEK EEF+ KLG ARLVL+DLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE
Subjt:  KAAEIIVEKVEEFVDKLGNARLVLLDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLE

Query:  VATKQQANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQK
         AT+ +AN+L PG AV V LPST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR AY SLF+ F+S+VQD+ K  K   +   S   E  K
Subjt:  VATKQQANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQK

Query:  HSEDSHHKFKRENLQNLERSKKWKGSQ-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLA
          EDS            ER+KK+KGSQ     N+ E  +  + +   KK SK W +WA AL+  AMHPERH N VLE  D++VV+ND YPKARKH+LVLA
Subjt:  HSEDSHHKFKRENLQNLERSKKWKGSQ-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLA

Query:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAS
        R E LD L DV  E+L LL+ MH +GLKW+++F +EDA L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA+
Subjt:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAS

Query:  IMDDDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLV
        +  +D L+  ELRCN+CRSAHPN+PKLK+HV  C + FP  LL+  RLV
Subjt:  IMDDDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATATCGACGAAAATTCGAATGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGTGCACCTGGAAGCGGCAAGTCCACCTTTTG
CGAACTTGTTATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGAGTGCAACGAGTG
CACTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTATGAAGCTCGGTGGCCCTCAAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTTAAGCGGAGTGGCCACGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCAAAACTTAATGAAGGGTTCACTCGCATAACCTTTTGCCATAATGAGTCCGACGTTCAATCCGCTGTAGATATGTACAAATCGCTTGATTTACATAATA
TGCTTCCGCATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCAAAAACGTGTTCTGGTGCC
AATACCGACAATAATTCTCCAACTCCTCAACCTACCCAGGAAAAGAGGGAGTCTTGTGGTAAAAAGGAAGAGTCTGCTTGCACAATGTCGAGAAATGTAGCTACGGAGTG
GGAGAAAGGTGAAAGTCCAGGCGTTAGATTCTTAGAAGACAATATTTCTCAAAGTGATCCACCAACTCTTGCATTCCCATCTATTTCGACTTCAGATTTCAAGTTTAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCGTGGATAAGCTTGGAAATGCCAGACTTGTACTGCTAGACTTGAGTCATGGATCAAAGATATTGTCT
ATGGTTAAAGCTAAAGCAACTGAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGTGGTGGTGTGAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAATT
CCCTTCAACCTGGCAATGCCGTGGCTGTTCAGTTGCCTTCAACTTCTCCTTTGTTAAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCCAACATGAATCCA
CAAAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAGCTTCTTCGCAATGCTTACGCTTCCCTATTTCAGGCCTTTATTTCAATCGTACAAGACAAATATAA
GTCGGTTAAGGGAATTCACGAATGCCTCGGCTCAACACCTCCAGAACCACAAAAGCACTCTGAGGACAGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAA
GAAGCAAAAAATGGAAAGGATCTCAGAACTCAACCGAAGGATTAAACCAAAACAACAATAAGACTGTCCCCAAAAAGAGTAAGCATTGGGGTTCATGGGCACAAGCACTT
TATGACACTGCAATGCATCCCGAGCGACATAACAATACTGTACTAGAGACATCAGACGATGTTGTAGTACTGAATGATATTTATCCGAAGGCACGCAAGCATCTTTTAGT
ACTGGCTCGGCATGAAGGTCTCGATCAACTAGCCGACGTATGTACAGAACACCTTCCATTGTTGAGAACAATGCATGCTATGGGTTTGAAGTGGATCGAGAAGTTCTTTC
ATGAAGATGCATTATTGGTCTTTCGCCTCGGATACCACTCGGCTCCATCAATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGATTCCAGTCATCTGAAAAAC
AAGAAGCACTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTCACCGTTATAGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACGATGACAGCTTGAT
GTCTATGGAGTTACGTTGCAACAAATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCGCATGTTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAGG
GCGGTCGTTTAGTCTTTGCGCCAAGTAATGCTCCTCTTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACATGGATATCGACGAAAATTCGAATGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGTGCACCTGGAAGCGGCAAGTCCACCTTTTG
CGAACTTGTTATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGAGTGCAACGAGTG
CACTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTATGAAGCTCGGTGGCCCTCAAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTTAAGCGGAGTGGCCACGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCAAAACTTAATGAAGGGTTCACTCGCATAACCTTTTGCCATAATGAGTCCGACGTTCAATCCGCTGTAGATATGTACAAATCGCTTGATTTACATAATA
TGCTTCCGCATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCAAAAACGTGTTCTGGTGCC
AATACCGACAATAATTCTCCAACTCCTCAACCTACCCAGGAAAAGAGGGAGTCTTGTGGTAAAAAGGAAGAGTCTGCTTGCACAATGTCGAGAAATGTAGCTACGGAGTG
GGAGAAAGGTGAAAGTCCAGGCGTTAGATTCTTAGAAGACAATATTTCTCAAAGTGATCCACCAACTCTTGCATTCCCATCTATTTCGACTTCAGATTTCAAGTTTAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCGTGGATAAGCTTGGAAATGCCAGACTTGTACTGCTAGACTTGAGTCATGGATCAAAGATATTGTCT
ATGGTTAAAGCTAAAGCAACTGAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGTGGTGGTGTGAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAATT
CCCTTCAACCTGGCAATGCCGTGGCTGTTCAGTTGCCTTCAACTTCTCCTTTGTTAAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCCAACATGAATCCA
CAAAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAGCTTCTTCGCAATGCTTACGCTTCCCTATTTCAGGCCTTTATTTCAATCGTACAAGACAAATATAA
GTCGGTTAAGGGAATTCACGAATGCCTCGGCTCAACACCTCCAGAACCACAAAAGCACTCTGAGGACAGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAA
GAAGCAAAAAATGGAAAGGATCTCAGAACTCAACCGAAGGATTAAACCAAAACAACAATAAGACTGTCCCCAAAAAGAGTAAGCATTGGGGTTCATGGGCACAAGCACTT
TATGACACTGCAATGCATCCCGAGCGACATAACAATACTGTACTAGAGACATCAGACGATGTTGTAGTACTGAATGATATTTATCCGAAGGCACGCAAGCATCTTTTAGT
ACTGGCTCGGCATGAAGGTCTCGATCAACTAGCCGACGTATGTACAGAACACCTTCCATTGTTGAGAACAATGCATGCTATGGGTTTGAAGTGGATCGAGAAGTTCTTTC
ATGAAGATGCATTATTGGTCTTTCGCCTCGGATACCACTCGGCTCCATCAATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGATTCCAGTCATCTGAAAAAC
AAGAAGCACTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTCACCGTTATAGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACGATGACAGCTTGAT
GTCTATGGAGTTACGTTGCAACAAATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCGCATGTTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAGG
GCGGTCGTTTAGTCTTTGCGCCAAGTAATGCTCCTCTTTCTTAG
Protein sequenceShow/hide protein sequence
MDMDIDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFMKLGGPQVDVHAVV
LDLPAQLCISRSVKRSGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVQSAVDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKTCSGA
NTDNNSPTPQPTQEKRESCGKKEESACTMSRNVATEWEKGESPGVRFLEDNISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFVDKLGNARLVLLDLSHGSKILS
MVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNP
QRPNYLNNDYDEGCKLLRNAYASLFQAFISIVQDKYKSVKGIHECLGSTPPEPQKHSEDSHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNKTVPKKSKHWGSWAQAL
YDTAMHPERHNNTVLETSDDVVVLNDIYPKARKHLLVLARHEGLDQLADVCTEHLPLLRTMHAMGLKWIEKFFHEDALLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN
KKHWNSFNTDFFRDSVTVIDEVSSHGKASIMDDDSLMSMELRCNKCRSAHPNLPKLKAHVSKCQAPFPSTLLEGGRLVFAPSNAPLS