; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy4G089930 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy4G089930
OrganismCucumis hystrix (Cucumber (hystrix) v1)
Descriptiontranscription factor bHLH140
Genome locationchrH04:27353268..27356708
RNA-Seq ExpressionChy4G089930
SyntenyChy4G089930
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.095.85Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENPSKTCSSAN DKNSP+PQPTQEKRESC KK ESSC MSRNVAMESEKGESPGVR LE KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRC VIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSL PG+AVAVQLPSTSPLLNREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGI+ECLGSTP EPQKHSE+ 
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLAD+
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI

Query:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSVTVI+EVSSHGKA IMD+E L+SME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.097.72Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAE PSKTCSSANTDKNSP+PQPTQEKRESCGKK ESSCTMSRNVAMESEKGESPG+R L+DKISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRC VIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSLQPG+AVAVQLPSTSPLLNREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTP E QKHSEDG
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADIC
        HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLAD+C
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADIC

Query:  TEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMEL
        TEHLPLLRTMHAMGLKWINKFF EDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VINEVSSHGKANIMD+ESLMSMEL
Subjt:  TEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
Subjt:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.081.89Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLK+A+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH  LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NP+KTCSSAN  K+SP+ Q ++E   SC KK E +CT+  NV  ESEKGE+PGVR L D IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRC VIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSL+PG+ V  QLPSTSPL NREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSED 
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE
                 HKFKRE++QN ERSKKWKGSQ+S E  NQNNNT V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE

Query:  GLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMD
        GLDQLAD+  EHLPLLRTMH +GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EV SHGKA+I D
Subjt:  GLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMD

Query:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN
        +ESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLV+ PSN
Subjt:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.089.42Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGK+VFVDRCNLEIEQRADFVKL 
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GP+VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DV SAIDMYKSLDLH MLP GCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSANT K+SP PQ TQEK +SC KK ES+CT+SRNV +ES+KGESPGVR LED ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRC VIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSLQPG+AVAVQLPSTSPL NREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLL NAYSSLFQAFIS+V+DK+KSVKGIH  LG TP EP+KHSE+ 
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI
        HHKFKRENLQN ERSKKWKGSQ+STE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVVVL DIYPKARKHLLVVARHEGLDQLAD+
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI

Query:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME
        C EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV V++EVSSHGKA++MD+ESLMSME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHI KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.089.26Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGK+VFVDRCNLEIEQRADFVKL GP+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DV SAIDMYKSLDLH MLP GCFGQKNPDKKVQLGIMKFLKKAE PS+T SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSAN

Query:  TDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
        T K+SP PQ TQEK +SC KK ES+CT+SRNV +ES+KGESPGVR LED ISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
Subjt:  TDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD

Query:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGSAVAVQLPSTSPL
        LSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRC VIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSLQPG+AVAVQLPSTSPL
Subjt:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGSAVAVQLPSTSPL

Query:  LNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSKKWKGS
         NREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLL NAYSSLFQAFIS+V+DK+KSVKGIH  LG TP EP+KHSE+ HHKFKRENLQN ERSKKWKGS
Subjt:  LNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSKKWKGS

Query:  QNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINK
        Q+STE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVVVL DIYPKARKHLLVVARHEGLDQLAD+C EHLPLLRTMHA+GLKWI+K
Subjt:  QNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINK

Query:  FFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMELRCNRCRSAHPNLPKLKAHIS
        FF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV V++EVSSHGKA++MD+ESLMSMELRCNRCRSAHPNLPKLKAHI 
Subjt:  FFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMELRCNRCRSAHPNLPKLKAHIS

Query:  KCQAPFPSTLLEGGRLVVEPSNAPLS
        KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  KCQAPFPSTLLEGGRLVVEPSNAPLS

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0097.72Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAE PSKTCSSANTDKNSP+PQPTQEKRESCGKK ESSCTMSRNVAMESEKGESPG+R L+DKISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRC VIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSLQPG+AVAVQLPSTSPLLNREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTP E QKHSEDG
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADIC
        HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLAD+C
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADIC

Query:  TEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMEL
        TEHLPLLRTMHAMGLKWINKFF EDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VINEVSSHGKANIMD+ESLMSMEL
Subjt:  TEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
Subjt:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

A0A1S3B6C4 transcription factor bHLH1400.0e+0095.85Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENPSKTCSSAN DKNSP+PQPTQEKRESC KK ESSC MSRNVAMESEKGESPGVR LE KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRC VIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSL PG+AVAVQLPSTSPLLNREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGI+ECLGSTP EPQKHSE+ 
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLAD+
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI

Query:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSVTVI+EVSSHGKA IMD+E L+SME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0095.85Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENPSKTCSSAN DKNSP+PQPTQEKRESC KK ESSC MSRNVAMESEKGESPGVR LE KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRC VIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSL PG+AVAVQLPSTSPLLNREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGI+ECLGSTP EPQKHSE+ 
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLAD+
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI

Query:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSVTVI+EVSSHGKA IMD+E L+SME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

A0A6J1CV45 transcription factor bHLH1400.0e+0081.89Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLK+A+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH  LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NP+KTCSSAN  K+SP+ Q ++E   SC KK E +CT+  NV  ESEKGE+PGVR L D IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRC VIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QANSL+PG+ V  QLPSTSPL NREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSED 
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE
                 HKFKRE++QN ERSKKWKGSQ+S E  NQNNNT V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE

Query:  GLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMD
        GLDQLAD+  EHLPLLRTMH +GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EV SHGKA+I D
Subjt:  GLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMD

Query:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN
        +ESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLV+ PSN
Subjt:  NESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN

A0A6J1GCV0 transcription factor bHLH1400.0e+0082.8Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E+DV SAID YKSL LH  LP GCFGQKN DKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENP+KTCS+ANT+K+ PS Q TQEK+       ESSCTM  NV  ESEKGE+PGV  LE+ IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI STKFFTFVGDITKL S+GGL C VIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG
        QA SL+PG+ VAVQLPSTSPL NREGVTHV+HVLGPNMNPQRPNYLNNDYDEGCKLL +AYSSLFQAFISIV+D++KS KGI E LGS P E +KHSED 
Subjt:  QANSLQPGSAVAVQLPSTSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVP-KKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI
        HHKFKR  LQ  ERSKKWKG+Q S E LNQNNN +  K SKHWGSWAQALY+TAM+PERH N VLETSDDVVVL DIYPKARKHLL+VAR+EGLDQLAD+
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVP-KKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADI

Query:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME
          EHLPLL+TMHA+G+KWI+KF  +D  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA I D+ESLMSME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNA
         RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV    N+
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNA

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)6.0e-3434.59Show/hide
Query:  MNPQRPNYLNNDYDEGCKLL-GNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTV-
        +NP   + ++   DE  K+  G     + + +  +VQ   +S + + E       E ++  ED       EN+    +  +   +Q+S+  L  + ++V 
Subjt:  MNPQRPNYLNNDYDEGCKLL-GNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTV-

Query:  -----PKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPLVF
               + +H G W+Q L  +   P+      +   +  VV+ D YPKAR H LV+   + +  L  +  EHL LL  MHA+G K I +   ++  L F
Subjt:  -----PKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPLVF

Query:  RLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNES-LMSMELRCNRCRSAHPNLPKLKAHISK
        RLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S  VI  V S GK  + D  S L+ + LRC+ C+     +P+LK H+ K
Subjt:  RLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNES-LMSMELRCNRCRSAHPNLPKLKAHISK

P61801 Aprataxin2.7e-3443.69Show/hide
Query:  SQNSTEGLNQNNNTVPK----KSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLK
        S +S+ G N + +T  K    + K  G W+Q L  +   P   T  V +  D VVV+ D YPKAR H LV+   + +  L  +  EHL L++ MHA+G K
Subjt:  SQNSTEGLNQNNNTVPK----KSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLK

Query:  WINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNES-LMSMELRCNRCRSAHPNLPKL
         I K   +     F+LGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSF TD+F +S  +I  + +HGK N+ D  S L+   L C+ CR    N+P+L
Subjt:  WINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNES-LMSMELRCNRCRSAHPNLPKL

Query:  KAHISK
        K H+ K
Subjt:  KAHISK

Q7TQC5 Aprataxin1.2e-3443.56Show/hide
Query:  SQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINK
        SQ S       N    K+S   G W+Q L  +   P+      +   D VVV+ D YPKAR H LV+     +  L  + +EHL LL+ MHA+G K I  
Subjt:  SQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINK

Query:  FFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHI
         F     L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  + D    L+ + LRC+ C+   P++P+LK H+
Subjt:  FFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHI

Query:  SK
         K
Subjt:  SK

Q8K4H4 Aprataxin9.2e-3544.33Show/hide
Query:  PKKSKH-------WGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPL
        PK  KH        G W+Q L  +   P+      +   D VVV+ D YPKAR H LV+     +  L  + +EHL LL+ MHA+G K I   F     L
Subjt:  PKKSKH-------WGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPL

Query:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHISK
         FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  + D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNE-SLMSMELRCNRCRSAHPNLPKLKAHISK

Q9M041 Transcription factor bHLH1404.2e-23759.62Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +A++MY  L     LP GCFG+K  D K Q GIMKF KK    +  
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT

Query:  CSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR
         SS+N   N      T  K +           M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG AR
Subjt:  CSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR

Query:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGSAVAVQLP
        LVLVDLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL C VIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV V LP
Subjt:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGSAVAVQLP

Query:  STSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSK
        ST PL N EG+THV+HVLGPNMNP RP+ LNNDY +GCK L  AY+SLF+ F+S+VQD+ K  K          R  Q    D     K ++    ER+K
Subjt:  STSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSK

Query:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRT
        K+KGSQ+        +E L     +  K SK W +WA AL+  AMHPERH N VLE  D++VV+ D YPKARKH+LV+AR E LD L D+  E+L LL+ 
Subjt:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRT

Query:  MHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMELRCNRCRSAH
        MH +GLKW+++F  ED  L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V+ EV+S GKAN+  +E L+  ELRCNRCRSAH
Subjt:  MHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMELRCNRCRSAH

Query:  PNLPKLKAHISKCQAPFPSTLLEGGRLV
        PN+PKLK+H+  C + FP  LL+  RLV
Subjt:  PNLPKLKAHISKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like3.0e-23859.62Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +A++MY  L     LP GCFG+K  D K Q GIMKF KK    +  
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT

Query:  CSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR
         SS+N   N      T  K +           M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG AR
Subjt:  CSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR

Query:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGSAVAVQLP
        LVLVDLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL C VIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV V LP
Subjt:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGSAVAVQLP

Query:  STSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSK
        ST PL N EG+THV+HVLGPNMNP RP+ LNNDY +GCK L  AY+SLF+ F+S+VQD+ K  K          R  Q    D     K ++    ER+K
Subjt:  STSPLLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSK

Query:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRT
        K+KGSQ+        +E L     +  K SK W +WA AL+  AMHPERH N VLE  D++VV+ D YPKARKH+LV+AR E LD L D+  E+L LL+ 
Subjt:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRT

Query:  MHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMELRCNRCRSAH
        MH +GLKW+++F  ED  L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V+ EV+S GKAN+  +E L+  ELRCNRCRSAH
Subjt:  MHAMGLKWINKFFCEDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMELRCNRCRSAH

Query:  PNLPKLKAHISKCQAPFPSTLLEGGRLV
        PN+PKLK+H+  C + FP  LL+  RLV
Subjt:  PNLPKLKAHISKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATACCGACGAAAATTCGAATGCTAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTCATATTAGTGGGTGCACCTGGAAGCGGCAAGTCCACC
TTTTGCGAACTTGTTATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGACT
GCAACGAGTGCACTGAATGATGGAAAGAGTGTATTCGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGGGGCCCTCAAGTGGAT
GTACATGCTGTTGTATTAGATCTTCCTGCTCAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTT
GTGAATAAAATGCTGCAAAAGAAAGAATTGCCCAAACTTAATGAAGGGTTCACTCGCATAACCTTTTGCCATAACGAGTCCGACGTTCTATCTGCTATAGATATG
TACAAATCGCTTGATTTACATACTATGCTTCCACATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCA
GAAAACCCTTCAAAAACGTGTTCTAGTGCCAATACCGACAAGAATTCTCCATCTCCTCAACCTACCCAGGAAAAGAGGGAGTCTTGTGGTAAAAAGGTAGAGTCT
TCCTGCACAATGTCGAGAAATGTAGCTATGGAGTCAGAGAAAGGTGAAAGTCCAGGCGTTAGATGCTTAGAAGACAAGATTTCTCAAAGTGATCCACCAACTCTT
GCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCCATGAGAAGGCTGCTGAAATTATTGTCGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCC
AGACTTGTGCTGGTAGACTTGAGTCATGGATCAAAGATATTGTCTATGGTTAAAGCTAAAGCAACCGAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTA
GGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCACTGTTATAGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGTGGTGGTGTGAATGCTGCA
ATTTTTAGTGCTGCAGGTCCCGGTCTGGAAGTGGCAACTAAACAACAAGCTAACTCTCTTCAACCTGGCAGTGCGGTGGCTGTTCAGTTGCCTTCAACTTCTCCT
TTGTTAAATAGGGAAGGAGTAACCCATGTCGTACATGTTCTTGGACCCAACATGAATCCACAAAGGCCAAATTATCTCAACAATGACTATGATGAAGGGTGCAAG
CTTCTTGGCAATGCTTACTCTTCCCTATTTCAGGCCTTTATTTCAATCGTACAAGACAAATATAAGTCGGTGAAGGGAATTCATGAATGCCTCGGCTCAACACCT
CGAGAACCACAAAAGCACTCTGAGGACGGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAAGAAGCAAAAAATGGAAAGGATCTCAGAACTCAACC
GAAGGATTAAACCAAAACAACAATACTGTCCCCAAAAAGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTATGACACTGCAATGCATCCCGAGCGACATACC
AATTCTGTACTAGAGACATCAGATGATGTTGTAGTACTGTATGATATTTATCCAAAGGCACGCAAGCATCTTTTAGTAGTGGCTCGGCATGAAGGCCTCGATCAA
CTAGCCGACATATGTACAGAACACCTTCCATTGTTGAGGACAATGCACGCTATGGGTTTGAAGTGGATCAATAAGTTCTTTTGTGAAGATGGCCCATTGGTCTTT
CGCCTCGGATACCACTCGGCTCCATCAATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGATTCCAGTCATCTGAAAAACAAGAAGCACTGGAATTCT
TTCAACACCGATTTCTTCAGAGACTCAGTCACCGTTATAAACGAAGTCAGTAGCCATGGAAAGGCGAACATCATGGACAATGAGAGCTTGATGTCTATGGAGTTA
CGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAGTTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAAGGCGGCCGT
TTAGTTGTTGAGCCAAGTAATGCTCCTCTTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACATGGATACCGACGAAAATTCGAATGCTAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTCATATTAGTGGGTGCACCTGGAAGCGGCAAGTCCACC
TTTTGCGAACTTGTTATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGACT
GCAACGAGTGCACTGAATGATGGAAAGAGTGTATTCGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGGGGCCCTCAAGTGGAT
GTACATGCTGTTGTATTAGATCTTCCTGCTCAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTT
GTGAATAAAATGCTGCAAAAGAAAGAATTGCCCAAACTTAATGAAGGGTTCACTCGCATAACCTTTTGCCATAACGAGTCCGACGTTCTATCTGCTATAGATATG
TACAAATCGCTTGATTTACATACTATGCTTCCACATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCA
GAAAACCCTTCAAAAACGTGTTCTAGTGCCAATACCGACAAGAATTCTCCATCTCCTCAACCTACCCAGGAAAAGAGGGAGTCTTGTGGTAAAAAGGTAGAGTCT
TCCTGCACAATGTCGAGAAATGTAGCTATGGAGTCAGAGAAAGGTGAAAGTCCAGGCGTTAGATGCTTAGAAGACAAGATTTCTCAAAGTGATCCACCAACTCTT
GCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCCATGAGAAGGCTGCTGAAATTATTGTCGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCC
AGACTTGTGCTGGTAGACTTGAGTCATGGATCAAAGATATTGTCTATGGTTAAAGCTAAAGCAACCGAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTA
GGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCACTGTTATAGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGTGGTGGTGTGAATGCTGCA
ATTTTTAGTGCTGCAGGTCCCGGTCTGGAAGTGGCAACTAAACAACAAGCTAACTCTCTTCAACCTGGCAGTGCGGTGGCTGTTCAGTTGCCTTCAACTTCTCCT
TTGTTAAATAGGGAAGGAGTAACCCATGTCGTACATGTTCTTGGACCCAACATGAATCCACAAAGGCCAAATTATCTCAACAATGACTATGATGAAGGGTGCAAG
CTTCTTGGCAATGCTTACTCTTCCCTATTTCAGGCCTTTATTTCAATCGTACAAGACAAATATAAGTCGGTGAAGGGAATTCATGAATGCCTCGGCTCAACACCT
CGAGAACCACAAAAGCACTCTGAGGACGGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAAGAAGCAAAAAATGGAAAGGATCTCAGAACTCAACC
GAAGGATTAAACCAAAACAACAATACTGTCCCCAAAAAGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTATGACACTGCAATGCATCCCGAGCGACATACC
AATTCTGTACTAGAGACATCAGATGATGTTGTAGTACTGTATGATATTTATCCAAAGGCACGCAAGCATCTTTTAGTAGTGGCTCGGCATGAAGGCCTCGATCAA
CTAGCCGACATATGTACAGAACACCTTCCATTGTTGAGGACAATGCACGCTATGGGTTTGAAGTGGATCAATAAGTTCTTTTGTGAAGATGGCCCATTGGTCTTT
CGCCTCGGATACCACTCGGCTCCATCAATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGATTCCAGTCATCTGAAAAACAAGAAGCACTGGAATTCT
TTCAACACCGATTTCTTCAGAGACTCAGTCACCGTTATAAACGAAGTCAGTAGCCATGGAAAGGCGAACATCATGGACAATGAGAGCTTGATGTCTATGGAGTTA
CGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAGTTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAAGGCGGCCGT
TTAGTTGTTGAGCCAAGTAATGCTCCTCTTTCTTAG
Protein sequenceShow/hide protein sequence
MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVD
VHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKA
ENPSKTCSSANTDKNSPSPQPTQEKRESCGKKVESSCTMSRNVAMESEKGESPGVRCLEDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNA
RLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCTVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGSAVAVQLPSTSP
LLNREGVTHVVHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPREPQKHSEDGHHKFKRENLQNLERSKKWKGSQNST
EGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADICTEHLPLLRTMHAMGLKWINKFFCEDGPLVF
RLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVTVINEVSSHGKANIMDNESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGR
LVVEPSNAPLS