; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G31060 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G31060
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptiontranscription factor bHLH140
Genome locationChr3:28220185..28224001
RNA-Seq ExpressionCSPI03G31060
SyntenyCSPI03G31060
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.0e+0096.12Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PSKTCSS N DKNSPTPQPTQEKRESC KKEESSC MSRNVAMESEKGESPG+RSL+ KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGI+ECLGSTPPE QKHSE+ 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA IMDDE L+SME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.0e+0099.73Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPSKTCSS NTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVC
        HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVC
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVC

Query:  TEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMEL
        TEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMEL
Subjt:  TEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
Subjt:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0082.29Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLK+A+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH  LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P+KTCSS N  K+SP  Q ++E   SC KKEE +CT+  NV  ESEKGE+PG+RSL D IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSL+PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSED 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE
                 HKFKRE++QN ERSKKWKGSQ+S E  NQNNNT V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMD
        GLDQLADV  EHLPLLRTMH +GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EV SHGKA+I D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMD

Query:  DESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN
        DESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLV+ PSN
Subjt:  DESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0090.09Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGK+VFVDRCNLEIEQRADFVKL 
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GP+VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DV SAIDMYKSLDLH MLP GCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAEKPS+T SS NT K+SP PQ TQEK +SC KKEES+CT+SRNV +ES+KGESPG+RSL+D ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSLQPGNAVAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAYSSLFQAFIS+V+DK+KSVKGIH  LG TP E +KHSE+ 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQN ERSKKWKGSQ+STE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVVVL DIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME
        C EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV V++EVSSHGKA++MDDESLMSME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHI KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0089.94Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGK+VFVDRCNLEIEQRADFVKL GP+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKTCSSVN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DV SAIDMYKSLDLH MLP GCFGQKNPDKKVQLGIMKFLKKAEKPS+T SS N
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKTCSSVN

Query:  TDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
        T K+SP PQ TQEK +SC KKEES+CT+SRNV +ES+KGESPG+RSL+D ISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
Subjt:  TDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD

Query:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLPSTSPL
        LSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLPSTSPL
Subjt:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLPSTSPL

Query:  LNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSKKWKGS
         NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAYSSLFQAFIS+V+DK+KSVKGIH  LG TP E +KHSE+ HHKFKRENLQN ERSKKWKGS
Subjt:  LNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSKKWKGS

Query:  QNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINK
        Q+STE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVVVL DIYPKARKHLLVVARHEGLDQLADVC EHLPLLRTMHA+GLKWI+K
Subjt:  QNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINK

Query:  FFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMELRCNRCRSAHPNLPKLKAHIS
        FF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSV V++EVSSHGKA++MDDESLMSMELRCNRCRSAHPNLPKLKAHI 
Subjt:  FFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMELRCNRCRSAHPNLPKLKAHIS

Query:  KCQAPFPSTLLEGGRLVVEPSNAPLS
        KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  KCQAPFPSTLLEGGRLVVEPSNAPLS

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0099.73Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPSKTCSS NTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVC
        HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVC
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVC

Query:  TEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMEL
        TEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMEL
Subjt:  TEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
Subjt:  RCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

A0A1S3B6C4 transcription factor bHLH1400.0e+0096.12Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PSKTCSS N DKNSPTPQPTQEKRESC KKEESSC MSRNVAMESEKGESPG+RSL+ KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGI+ECLGSTPPE QKHSE+ 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA IMDDE L+SME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0096.12Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PSKTCSS N DKNSPTPQPTQEKRESC KKEESSC MSRNVAMESEKGESPG+RSL+ KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGI+ECLGSTPPE QKHSE+ 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNN-TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA IMDDE L+SME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS

A0A6J1CV45 transcription factor bHLH1400.0e+0082.29Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLK+A+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH  LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P+KTCSS N  K+SP  Q ++E   SC KKEE +CT+  NV  ESEKGE+PG+RSL D IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QANSL+PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSLFQ FISIV++++KSVKGI + LGS P E +KHSED 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE
                 HKFKRE++QN ERSKKWKGSQ+S E  NQNNNT V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNT-VPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMD
        GLDQLADV  EHLPLLRTMH +GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EV SHGKA+I D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMD

Query:  DESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN
        DESLMSMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLEG RLV+ PSN
Subjt:  DESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSN

A0A6J1GCV0 transcription factor bHLH1400.0e+0082.8Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLK+A SALNDGKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E+DV SAID YKSL LH  LP GCFGQKN DKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAE P+KTCS+ NT+K+ P+ Q TQEK+       ESSCTM  NV  ESEKGE+PG+ SL++ IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSKTCSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ
        IVE VEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAG  LE ATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQ

Query:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG
        QA SL+PGN VAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL +AYSSLFQAFISIV+D++KS KGI E LGS P E +KHSED 
Subjt:  QANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDG

Query:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVP-KKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKR  LQ  ERSKKWKG+Q S E LNQNNN +  K SKHWGSWAQALY+TAM+PERH N VLETSDDVVVL DIYPKARKHLL+VAR+EGLDQLADV
Subjt:  HHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVP-KKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME
          EHLPLL+TMHA+G+KWI+KF  +D  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA I DDESLMSME
Subjt:  CTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNA
         RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLEGGRLV    N+
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNA

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)6.0e-3434.93Show/hide
Query:  MNPQRPNYLNNDYDEGCKLL-GNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTV-
        +NP   + ++   DE  K+  G     + + +  +VQ   +S + + E       E ++  ED       EN+    +  +   +Q+S+  L  + ++V 
Subjt:  MNPQRPNYLNNDYDEGCKLL-GNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTV-

Query:  -----PKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVF
               + +H G W+Q L  +   P+      +   +  VV+ D YPKAR H LV+   + +  L  V  EHL LL  MHA+G K I +   ++  L F
Subjt:  -----PKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVF

Query:  RLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDES-LMSMELRCNRCRSAHPNLPKLKAHISK
        RLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S  VI  V S GK  + D  S L+ + LRC+ C+     +P+LK H+ K
Subjt:  RLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDES-LMSMELRCNRCRSAHPNLPKLKAHISK

Q7TQC5 Aprataxin2.0e-3444.06Show/hide
Query:  SQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINK
        SQ S       N    K+S   G W+Q L  +   P+      +   D VVV+ D YPKAR H LV+     +  L  V +EHL LL+ MHA+G K I  
Subjt:  SQNSTEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINK

Query:  FFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDE-SLMSMELRCNRCRSAHPNLPKLKAHI
         F     L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  + D    L+ + LRC+ C+   P++P+LK H+
Subjt:  FFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDE-SLMSMELRCNRCRSAHPNLPKLKAHI

Query:  SK
         K
Subjt:  SK

Q7YRZ1 Aprataxin7.8e-3441.63Show/hide
Query:  HHKFKRE-NLQNLER--SKKWKGSQNSTEGLNQNNNTVP--------KKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVAR
        H K KR  N  ++ER  S++ K S  +  G N +  +VP         K +    W+Q L  +   P+      +   D VVV+ D YPKAR H LV+  
Subjt:  HHKFKRE-NLQNLER--SKKWKGSQNSTEGLNQNNNTVP--------KKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVAR

Query:  HEGLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANI
           +  L  V  EHL LLR MH +G K I   F     L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  +
Subjt:  HEGLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANI

Query:  MDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK
         D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  MDD-ESLMSMELRCNRCRSAHPNLPKLKAHISK

Q8K4H4 Aprataxin1.2e-3444.85Show/hide
Query:  PKKSKH-------WGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPL
        PK  KH        G W+Q L  +   P+      +   D VVV+ D YPKAR H LV+     +  L  V +EHL LL+ MHA+G K I   F     L
Subjt:  PKKSKH-------WGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPL

Query:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDE-SLMSMELRCNRCRSAHPNLPKLKAHISK
         FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  + D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDE-SLMSMELRCNRCRSAHPNLPKLKAHISK

Q9M041 Transcription factor bHLH1401.2e-23659.48Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +A++MY  L     LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKT

Query:  CSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR
                 S  P  +  +  +  +K +    M+ NV +   K  S  I            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG AR
Subjt:  CSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR

Query:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLP
        LVLVDLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAG  LE AT+ +AN+L PG AV V LP
Subjt:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLP

Query:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSK
        ST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK L  AY+SLF+ F+S+VQD+ K  K   +   S   E  K              ++ ER+K
Subjt:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSK

Query:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT
        K+KGSQ+        +E L     +  K SK W +WA AL+  AMHPERH N VLE  D++VV+ D YPKARKH+LV+AR E LD L DV  E+L LL+ 
Subjt:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT

Query:  MHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMELRCNRCRSAH
        MH +GLKW+++F  ED  L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V+ EV+S GKAN+   E L+  ELRCNRCRSAH
Subjt:  MHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMELRCNRCRSAH

Query:  PNLPKLKAHISKCQAPFPSTLLEGGRLV
        PN+PKLK+H+  C + FP  LL+  RLV
Subjt:  PNLPKLKAHISKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like8.6e-23859.48Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +A++MY  L     LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKT

Query:  CSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR
                 S  P  +  +  +  +K +    M+ NV +   K  S  I            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG AR
Subjt:  CSSVNTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR

Query:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLP
        LVLVDLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAG  LE AT+ +AN+L PG AV V LP
Subjt:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLP

Query:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSK
        ST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK L  AY+SLF+ F+S+VQD+ K  K   +   S   E  K              ++ ER+K
Subjt:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSK

Query:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT
        K+KGSQ+        +E L     +  K SK W +WA AL+  AMHPERH N VLE  D++VV+ D YPKARKH+LV+AR E LD L DV  E+L LL+ 
Subjt:  KWKGSQN-------STEGLNQNNNTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT

Query:  MHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMELRCNRCRSAH
        MH +GLKW+++F  ED  L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V+ EV+S GKAN+   E L+  ELRCNRCRSAH
Subjt:  MHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMELRCNRCRSAH

Query:  PNLPKLKAHISKCQAPFPSTLLEGGRLV
        PN+PKLK+H+  C + FP  LL+  RLV
Subjt:  PNLPKLKAHISKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATACAGACGAAAATTCGAATGCTAAAGGAAAGGAAGGGCAAGGAAAGCTCATAATGGTAATATTAGTGGGTGCACCTGGAAGTGGCAAGTCCACCTTTTG
CGAACTTGTTATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGACTGCAACGAGTG
CACTGAATGATGGAAAGAGTGTATTCGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGGGGCCCTCAAGTGGATGTACATGCTGTTGTA
TTAGATCTTCCTGCTCAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAACTTAATGAAGGGTTCACTCGCATAACCTTTTGCCATAATGAGTCCGACGTTCTATCCGCTATAGATATGTACAAATCGCTTGATTTACATACTA
TGCTTCCACATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCAAAAACGTGTTCTAGTGTC
AATACCGACAAGAATTCTCCAACTCCTCAACCTACCCAGGAAAAGAGGGAGTCTTGTGGTAAAAAGGAAGAGTCTTCCTGCACAATGTCGAGAAATGTAGCTATGGAGTC
AGAGAAAGGTGAAAGTCCAGGCATTAGATCCTTAAAAGACAAGATTTCTCAAAGTGATCCACCAACTCTTGCATTCCCATCTATTTCGACTTCAGATTTCAAGTTTAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTACTGGTAGACTTGAGTCATGGATCAAAGATATTGTCT
ATGGTTAAAGCTAAAGCAACCGAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGTGGTGGTGTGAATGCTGCAATTTTTAGTGCTGCAGGTTCCGGTCTGGAAGTGGCAACTAAACAACAAGCAAACT
CCCTTCAACCTGGAAATGCCGTGGCTGTTCAATTGCCTTCAACTTCTCCTTTATTAAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGGCCCAACATGAATCCA
CAAAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAGCTTCTTGGCAATGCTTACTCTTCCCTATTTCAGGCCTTTATTTCAATCGTACAAGACAAATACAA
GTCGGTTAAGGGAATTCATGAATGTCTCGGCTCAACACCTCCAGAACTACAAAAGCACTCTGAGGACGGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAA
GAAGCAAAAAATGGAAAGGATCTCAGAACTCAACCGAAGGATTAAACCAAAACAACAATACTGTTCCCAAAAAGAGTAAGCACTGGGGTTCGTGGGCACAAGCACTTTAT
GACACTGCAATGCATCCCGAGCGACATACCAATTCTGTACTAGAGACATCAGATGATGTTGTAGTACTGTATGATATTTATCCGAAGGCACGCAAGCATCTTTTAGTAGT
GGCTCGGCATGAAGGCCTCGATCAACTAGCCGACGTATGTACAGAACACCTTCCATTATTGAGGACAATGCACGCTATGGGTTTGAAGTGGATCAATAAGTTCTTTCGTG
AAGATGGCCCATTGGTCTTTCGCCTCGGATACCACTCGGCTCCATCAATGAGGCAGCTGCACCTCCATGTTATAAGCCAGGATTTCGATTCCAGTCATCTGAAAAACAAG
AAGCACTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCAGTCATCGTTATAAACGAAGTCAGTAGCCATGGAAAGGCGAACATCATGGACGATGAGAGCTTGATGTC
TATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAGTTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAGGGCG
GTCGTTTAGTCGTTGAGCCAAGTAATGCTCCTCTCTCTTAG
mRNA sequenceShow/hide mRNA sequence
CTAATTGCTCTATTTGAAATCATTCTTCATTCGGTACAAGGACATGCTATATGCGTAACCAAACAATCCCGTTGCAGCAAAATTTGGGGGAATGAGCTGACCGTGACCGT
GGCGTATTCTTTAAGCTTCACTCATCGGCAGCGGGAACGTCGGTAATGGCGATTGAAAGGGTTTAAAACCAAGAGTGAAGACCCAATTTCTAATTAAGAAATGGACATGG
ATACAGACGAAAATTCGAATGCTAAAGGAAAGGAAGGGCAAGGAAAGCTCATAATGGTAATATTAGTGGGTGCACCTGGAAGTGGCAAGTCCACCTTTTGCGAACTTGTT
ATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGACTGCAACGAGTGCACTGAATGA
TGGAAAGAGTGTATTCGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAGCTCGGGGGCCCTCAAGTGGATGTACATGCTGTTGTATTAGATCTTC
CTGCTCAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAAGAAAGAATTG
CCCAAACTTAATGAAGGGTTCACTCGCATAACCTTTTGCCATAATGAGTCCGACGTTCTATCCGCTATAGATATGTACAAATCGCTTGATTTACATACTATGCTTCCACA
TGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCAAAAACGTGTTCTAGTGTCAATACCGACA
AGAATTCTCCAACTCCTCAACCTACCCAGGAAAAGAGGGAGTCTTGTGGTAAAAAGGAAGAGTCTTCCTGCACAATGTCGAGAAATGTAGCTATGGAGTCAGAGAAAGGT
GAAAGTCCAGGCATTAGATCCTTAAAAGACAAGATTTCTCAAAGTGATCCACCAACTCTTGCATTCCCATCTATTTCGACTTCAGATTTCAAGTTTAGCCATGAGAAGGC
TGCTGAAATTATTGTTGAGAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTTGTACTGGTAGACTTGAGTCATGGATCAAAGATATTGTCTATGGTTAAAG
CTAAAGCAACCGAGAAAAATATTAGTTCCACCAAGTTTTTTACATTCGTAGGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCAATGTTATAGCCAATGCT
GCAAACTGGCGACTGAAACCAGGAGGTGGTGGTGTGAATGCTGCAATTTTTAGTGCTGCAGGTTCCGGTCTGGAAGTGGCAACTAAACAACAAGCAAACTCCCTTCAACC
TGGAAATGCCGTGGCTGTTCAATTGCCTTCAACTTCTCCTTTATTAAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGGCCCAACATGAATCCACAAAGGCCAA
ATTATCTCAACAATGACTATGATGAAGGTTGCAAGCTTCTTGGCAATGCTTACTCTTCCCTATTTCAGGCCTTTATTTCAATCGTACAAGACAAATACAAGTCGGTTAAG
GGAATTCATGAATGTCTCGGCTCAACACCTCCAGAACTACAAAAGCACTCTGAGGACGGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAAGAAGCAAAAA
ATGGAAAGGATCTCAGAACTCAACCGAAGGATTAAACCAAAACAACAATACTGTTCCCAAAAAGAGTAAGCACTGGGGTTCGTGGGCACAAGCACTTTATGACACTGCAA
TGCATCCCGAGCGACATACCAATTCTGTACTAGAGACATCAGATGATGTTGTAGTACTGTATGATATTTATCCGAAGGCACGCAAGCATCTTTTAGTAGTGGCTCGGCAT
GAAGGCCTCGATCAACTAGCCGACGTATGTACAGAACACCTTCCATTATTGAGGACAATGCACGCTATGGGTTTGAAGTGGATCAATAAGTTCTTTCGTGAAGATGGCCC
ATTGGTCTTTCGCCTCGGATACCACTCGGCTCCATCAATGAGGCAGCTGCACCTCCATGTTATAAGCCAGGATTTCGATTCCAGTCATCTGAAAAACAAGAAGCACTGGA
ATTCTTTCAACACCGATTTCTTCAGAGACTCAGTCATCGTTATAAACGAAGTCAGTAGCCATGGAAAGGCGAACATCATGGACGATGAGAGCTTGATGTCTATGGAGTTA
CGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAGTTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAGGGCGGTCGTTTAGT
CGTTGAGCCAAGTAATGCTCCTCTCTCTTAGCTTTCATAATGTCTTCCACACACAAAATTTTTGTTGTAAATATTTTGAGCTCAATATCTTTGGGGTTTATGTAGTTCTG
ATGACCTCGAGAAGCATTAGTTAATAGTTATAAGCAATATCACATTTTGTCTCTATCGGATTTCGAATATTCAAGTACCATAATTCCATTTAATTGCACGG
Protein sequenceShow/hide protein sequence
MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKTATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHTMLPHGCFGQKNPDKKVQLGIMKFLKKAEKPSKTCSSV
NTDKNSPTPQPTQEKRESCGKKEESSCTMSRNVAMESEKGESPGIRSLKDKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVDLSHGSKILS
MVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGSGLEVATKQQANSLQPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNP
QRPNYLNNDYDEGCKLLGNAYSSLFQAFISIVQDKYKSVKGIHECLGSTPPELQKHSEDGHHKFKRENLQNLERSKKWKGSQNSTEGLNQNNNTVPKKSKHWGSWAQALY
DTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWINKFFREDGPLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNK
KHWNSFNTDFFRDSVIVINEVSSHGKANIMDDESLMSMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEGGRLVVEPSNAPLS