; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010860 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010860
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptiontranscription factor bHLH140
Genome locationchr04:33023729..33027535
RNA-Seq ExpressionPay0010860
SyntenyPay0010860
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.0e+0099.87Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSL QAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
        CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
Subjt:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.0e+0095.98Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAE PSKTCSSAN DKNSPTPQPTQEKRESC KKEESSC MSRNVAMESEKGESPG+RSL+ KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSL QAFISIVQDKYKSVKGI+ECLGSTPPE QKHSE+ 
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA IMDDE L+SME
Subjt:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0082.29Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NP+KTCSSANA K+SP  Q ++E   SCDKKEE +C +  NV  ESEKGE+PGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSL PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSL Q FISIV++++KSVKGI + LGS P E +KHSE+S
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  --------HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE
                 HKFKRE++QN E SKKWKGS +S E  NQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMD
        GLDQLADV  EHLPLLRTMH +GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEV SHGKA I D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMD

Query:  DERLISMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSN
        DE L+SMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLE  RLV+ PSN
Subjt:  DERLISMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0089.83Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLEIEQRADFVKL 
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GP+VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DV SAIDMYKSLDLH+MLP GCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SP PQ TQEK +SCDKKEES+C +SRNV +ES+KGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSL PGNAVAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAYSSL QAFIS+V+DK+KSVKGI+  LG TP EP+KHSENS
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQN E SKKWKGS +STE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVVVL DIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
        C EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKN+KHWNSFNTDFFRDSV V+DEVSSHGKA +MDDE L+SME
Subjt:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHI KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0089.67Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGK+VFVDRCNLEIEQRADFVKL GP+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNE+DV SAIDMYKSLDLH+MLP GCFGQKNPDKKVQLGIMKFLKKAE PS+T SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSAN

Query:  ADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
          K+SP PQ TQEK +SCDKKEES+C +SRNV +ES+KGESPGVRSLE  ISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD
Subjt:  ADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVD

Query:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLHPGNAVAVQLPSTSPL
        LSHGSKILS+VKAKA +KNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSL PGNAVAVQLPSTSPL
Subjt:  LSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLHPGNAVAVQLPSTSPL

Query:  LNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENSHHKFKRENLQNLEISKKWKGS
         NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL NAYSSL QAFIS+V+DK+KSVKGI+  LG TP EP+KHSENSHHKFKRENLQN E SKKWKGS
Subjt:  LNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENSHHKFKRENLQNLEISKKWKGS

Query:  HNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWIHK
         +STE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVVVL DIYPKARKHLLVVARHEGLDQLADVC EHLPLLRTMHA+GLKWI K
Subjt:  HNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWIHK

Query:  FFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISMELRCNRCRSAHPNLPKLKAHIS
        FFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKN+KHWNSFNTDFFRDSV V+DEVSSHGKA +MDDE L+SMELRCNRCRSAHPNLPKLKAHI 
Subjt:  FFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISMELRCNRCRSAHPNLPKLKAHIS

Query:  KCQAPFPSTLLEDGRLVVEPSNAPLS
        KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  KCQAPFPSTLLEDGRLVVEPSNAPLS

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0095.98Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLK+ATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLH MLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAE PSKTCSSAN DKNSPTPQPTQEKRESC KKEESSC MSRNVAMESEKGESPG+RSL+ KISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSL PGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSL QAFISIVQDKYKSVKGI+ECLGSTPPE QKHSE+ 
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLE SKKWKGS NSTEGLNQNNN TVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
        CTEHLPLLRTMHAMGLKWI+KFF ED PLVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VI+EVSSHGKA IMDDE L+SME
Subjt:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLE GRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS

A0A1S3B6C4 transcription factor bHLH1400.0e+0099.87Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSL QAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
        CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
Subjt:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0099.87Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSL QAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
Subjt:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
        CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
Subjt:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
        LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS

A0A6J1CV45 transcription factor bHLH1400.0e+0082.29Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGT+AQCLKSA+SAL+DGKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAID YKSL LH+ LPHGCFGQ   D KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+NP+KTCSSANA K+SP  Q ++E   SCDKKEE +C +  NV  ESEKGE+PGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLVLVDL++GSK+LS+VKAKA +K I+ +KFFTFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QANSL PGN V  QLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL  AYSSL Q FISIV++++KSVKGI + LGS P E +KHSE+S
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  --------HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE
                 HKFKRE++QN E SKKWKGS +S E  NQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV VL DIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMD
        GLDQLADV  EHLPLLRTMH +GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEV SHGKA I D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMD

Query:  DERLISMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSN
        DE L+SMELRCNRCRSAHPNL KLKAHISKC+APFPSTLLE  RLV+ PSN
Subjt:  DERLISMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSN

A0A6J1GCV0 transcription factor bHLH1400.0e+0082.8Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGT+AQCLKSA SALNDGKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+E+DV SAID YKSL LH+ LP GCFGQKN DKKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQ

Query:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAENP+KTCS+AN +K+ P+ Q TQEK+       ESSC M  NV  ESEKGE+PGV SLE  IS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAENPSKTCSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNARLV+VDLSHGSKILS+VKAKA +KNI STKFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDKLGNARLVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS
        QA SL PGN VAVQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL +AYSSL QAFISIV+D++KS KGI+E LGS P E +KHSE++
Subjt:  QANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENS

Query:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV
        HHKFKR  LQ  E SKKWKG+  S E LNQNNNK   K SKHWGSWAQALY+TAM+PERH N VLETSDDVVVL DIYPKARKHLL+VAR+EGLDQLADV
Subjt:  HHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADV

Query:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME
          EHLPLL+TMHA+G+KWI KF H+DA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV VIDEVSSHGKA I DDE L+SME
Subjt:  CTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISME

Query:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNA
         RCNRCRSAHPNLPKLK HISKCQ+PFPSTLLE GRLV    N+
Subjt:  LRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNA

SwissProt top hitse value%identityAlignment
P61797 Aprataxin1.2e-3441.13Show/hide
Query:  QKHSENSHHKFKRENLQNLEISKKWK-GSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE
        +K S NS    +R+  Q  E S   + GS +S   +  N  K  P K +  G W+Q L  +   P+      +   + VVV+ D YPKAR H LV+    
Subjt:  QKHSENSHHKFKRENLQNLEISKKWK-GSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHE

Query:  GLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMD
         +  L  V  EHL LL+ MH +G K I   F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI+ V   G+  + D
Subjt:  GLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMD

Query:  D-ERLISMELRCNRCRSAHPNLPKLKAHISK
            L+ + LRC+ C+   P++P+LK H+ +
Subjt:  D-ERLISMELRCNRCRSAHPNLPKLKAHISK

Q7TQC5 Aprataxin1.2e-3443.14Show/hide
Query:  GSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWI
        GS  S   ++   +K    K +  G W+Q L  +   P+      +   D VVV+ D YPKAR H LV+     +  L  V +EHL LL+ MHA+G K I
Subjt:  GSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWI

Query:  HKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDE-RLISMELRCNRCRSAHPNLPKLKA
           F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI  V   G+  + D    L+ + LRC+ C+   P++P+LK 
Subjt:  HKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDE-RLISMELRCNRCRSAHPNLPKLKA

Query:  HISK
        H+ K
Subjt:  HISK

Q7YRZ1 Aprataxin3.5e-3441.45Show/hide
Query:  SHHKFKRE-NLQNLE--ISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGS-------WAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVA
        +H K KR  N  ++E   S++ K S  +  G N +     PKK K   +       W+Q L  +   P+      +   D VVV+ D YPKAR H LV+ 
Subjt:  SHHKFKRE-NLQNLE--ISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGS-------WAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVA

Query:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAR
            +  L  V  EHL LLR MH +G K I   F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI+ V   G+  
Subjt:  RHEGLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKAR

Query:  IMDD-ERLISMELRCNRCRSAHPNLPKLKAHISK
        + D    L+ + LRC+ C+   P++P+LK H+ K
Subjt:  IMDD-ERLISMELRCNRCRSAHPNLPKLKAHISK

Q9BGQ0 Aprataxin1.6e-3442.16Show/hide
Query:  GSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWI
        GS+++   +     K  P K +  G W+Q L  +   P+      +   + VVV+ D YPKAR H LV+     +  L  V  EHL LL+ MH +G K I
Subjt:  GSHNSTEGLNQNNNKTVPKKSKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWI

Query:  HKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDD-ERLISMELRCNRCRSAHPNLPKLKA
           F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  VI+ V   G+  + D    L+ + LRC+ C+   P++P+LK 
Subjt:  HKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDD-ERLISMELRCNRCRSAHPNLPKLKA

Query:  HISK
        H+ K
Subjt:  HISK

Q9M041 Transcription factor bHLH1401.9e-23458.79Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +A++MY  L   + LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT

Query:  CSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR
                 S  P  +  +  +  +K +    M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG AR
Subjt:  CSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR

Query:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLHPGNAVAVQLP
        LVLVDLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV V LP
Subjt:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLHPGNAVAVQLP

Query:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENSHHKFKRENLQNLEISK
        ST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK L  AY+SL + F+S+VQD+ K               P++ S+ +      +  ++ E +K
Subjt:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENSHHKFKRENLQNLEISK

Query:  KWKGSH-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT
        K+KGS      N+ E  +  + +   KK SK W +WA AL+  AMHPERH N VLE  D++VV+ D YPKARKH+LV+AR E LD L DV  E+L LL+ 
Subjt:  KWKGSH-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT

Query:  MHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISMELRCNRCRSAH
        MH +GLKW+ +F +EDA L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA +  ++ L+  ELRCNRCRSAH
Subjt:  MHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISMELRCNRCRSAH

Query:  PNLPKLKAHISKCQAPFPSTLLEDGRLV
        PN+PKLK+H+  C + FP  LL++ RLV
Subjt:  PNLPKLKAHISKCQAPFPSTLLEDGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like1.4e-23558.79Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GT+AQCLK AT +L +GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC++++DV +A++MY  L   + LP GCFG+K  D K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKT

Query:  CSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR
                 S  P  +  +  +  +K +    M+ NV +   K  S  +            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG AR
Subjt:  CSSANADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAR

Query:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLHPGNAVAVQLP
        LVLVDLS GSKILS+VKAKA++KNI S KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV V LP
Subjt:  LVLVDLSHGSKILSMVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLHPGNAVAVQLP

Query:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENSHHKFKRENLQNLEISK
        ST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK L  AY+SL + F+S+VQD+ K               P++ S+ +      +  ++ E +K
Subjt:  STSPLLNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENSHHKFKRENLQNLEISK

Query:  KWKGSH-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT
        K+KGS      N+ E  +  + +   KK SK W +WA AL+  AMHPERH N VLE  D++VV+ D YPKARKH+LV+AR E LD L DV  E+L LL+ 
Subjt:  KWKGSH-----NSTEGLNQNNNKTVPKK-SKHWGSWAQALYDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRT

Query:  MHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISMELRCNRCRSAH
        MH +GLKW+ +F +EDA L+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA +  ++ L+  ELRCNRCRSAH
Subjt:  MHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKNKKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISMELRCNRCRSAH

Query:  PNLPKLKAHISKCQAPFPSTLLEDGRLV
        PN+PKLK+H+  C + FP  LL++ RLV
Subjt:  PNLPKLKAHISKCQAPFPSTLLEDGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATACCGACGAAAATTCGAATGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTTATATTAGTGGGTGCACCTGGAAGCGGCAAGTCCACCTTTTG
CGAACTTGTTATGGGTTCCTCTTCTCGCCCTTGGGTTCGGATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGAGTGCAACAAGTG
CACTGAATGATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGTGCTGATTTTGTGAAGCTCGGGGGCCCTCAAGTGGATGTACATGCTGTTGTA
TTAGATCTCCCTGCTCAGCTCTGTATCTCTCGTTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAACTTAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGAGCGACGTTCTATCCGCTATAGATATGTACAAATCGCTTGATTTACATAATA
TGCTTCCACATGGATGTTTTGGACAGAAGAACCCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAACCCTTCAAAAACGTGTTCTAGTGCC
AATGCCGACAAGAATTCTCCAACTCCTCAACCTACCCAGGAAAAGAGGGAGTCTTGTGATAAAAAGGAAGAGTCTTCCTGCAGAATGTCAAGGAATGTAGCTATGGAGTC
GGAGAAAGGTGAAAGTCCAGGCGTTAGATCCTTAGAAGGCAAGATTTCTCAAAGTGATCCACCAACTCTTGCATTCCCATCTATTTCGACTTCAGATTTCAAGTTTAGCC
ATGAGAAGGCTGCTGAAATTATTGTTGAAAAGGTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTCGTACTGGTAGACTTGAGTCATGGATCAAAGATATTGTCG
ATGGTTAAAGCTAAAGCAACCGAGAAAAATATTAGTTCCACCAAGTTTTTCACATTCGTAGGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTGAAACCGGGAGGTGGTGGTGTGAATGCTGCGATTTTTAGTGCTGCAGGCCCTGGTCTGGAAGTGGCGACTAAACAACAAGCAAACT
CCCTTCATCCTGGCAATGCCGTGGCTGTTCAGTTGCCTTCAACTTCTCCTTTGTTAAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCCAACATGAATCCA
CAAAGGCCAAATTATCTCAACAATGACTATGATGAAGGTTGCAAGCTTCTTGGCAATGCTTACTCTTCCCTACTTCAGGCCTTTATTTCAATCGTACAAGACAAATATAA
GTCGGTTAAGGGAATTAATGAATGCCTCGGCTCAACACCTCCAGAACCACAAAAGCACTCCGAGAACAGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAA
TAAGCAAAAAATGGAAAGGATCTCATAACTCAACCGAAGGATTAAACCAAAACAACAATAAGACTGTCCCCAAAAAGAGTAAGCACTGGGGTTCATGGGCACAAGCACTT
TATGACACTGCAATGCATCCCGAGCGACATACCAATTCTGTACTAGAGACATCAGATGATGTTGTAGTACTGTATGATATTTATCCGAAGGCACGCAAGCATCTTTTAGT
CGTGGCTCGGCATGAAGGCCTCGATCAACTAGCCGACGTATGTACAGAACACCTTCCATTGTTGAGGACAATGCACGCTATGGGTTTGAAGTGGATCCATAAGTTCTTTC
ATGAAGATGCACCATTGGTCTTTCGCCTTGGATACCACTCGGCTCCATCAATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGATTCCACTCATCTGAAAAAC
AAGAAGCACTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTGACCGTTATAGACGAAGTCAGTAGCCATGGAAAGGCGAGGATCATGGACGACGAGAGATTGAT
ATCTATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTACCCAAATTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAGG
ACGGTCGTTTAGTCGTTGAGCCAAGCAATGCTCCTCTTTCTTAG
mRNA sequenceShow/hide mRNA sequence
GGACATGCTATCTGCCTAACCAAACACTCCCGTTGCAGCAAATTTGGGGGAGTGAGCTGACCTTGACCGTGGCCGTGGCCGTGGCCGTGGCGTATTCTCGAAGCTTCACT
CATCGGTAGCGCGAACGTCGGTTATGGTGACTGAAAGTGTTTAAAAGCAAGAGTGAAGACCCAACATCTAATCAAGAAATGGACATGGATACCGACGAAAATTCGAATGC
CAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTTATATTAGTGGGTGCACCTGGAAGCGGCAAGTCCACCTTTTGCGAACTTGTTATGGGTTCCTCTTCTCGCCCTT
GGGTTCGGATCTGTCAGGATACCATTGGAAATGGCAAGTCTGGAACCAGAGCACAGTGCTTGAAGAGTGCAACAAGTGCACTGAATGATGGAAAGAGTGTATTTGTGGAC
AGGTGCAATCTTGAAATAGAGCAGCGTGCTGATTTTGTGAAGCTCGGGGGCCCTCAAGTGGATGTACATGCTGTTGTATTAGATCTCCCTGCTCAGCTCTGTATCTCTCG
TTCTGTTAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAAGAAAGAATTGCCCAAACTTAATGAAGGGTTCA
CTCGCATAACCTTTTGCCACAATGAGAGCGACGTTCTATCCGCTATAGATATGTACAAATCGCTTGATTTACATAATATGCTTCCACATGGATGTTTTGGACAGAAGAAC
CCAGACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAACCCTTCAAAAACGTGTTCTAGTGCCAATGCCGACAAGAATTCTCCAACTCCTCAACC
TACCCAGGAAAAGAGGGAGTCTTGTGATAAAAAGGAAGAGTCTTCCTGCAGAATGTCAAGGAATGTAGCTATGGAGTCGGAGAAAGGTGAAAGTCCAGGCGTTAGATCCT
TAGAAGGCAAGATTTCTCAAAGTGATCCACCAACTCTTGCATTCCCATCTATTTCGACTTCAGATTTCAAGTTTAGCCATGAGAAGGCTGCTGAAATTATTGTTGAAAAG
GTTGAAGAATTCATGGATAAGCTTGGAAATGCCAGACTCGTACTGGTAGACTTGAGTCATGGATCAAAGATATTGTCGATGGTTAAAGCTAAAGCAACCGAGAAAAATAT
TAGTTCCACCAAGTTTTTCACATTCGTAGGTGATATAACTAAACTCAATTCCGAAGGTGGATTGCGCTGCAATGTTATAGCCAATGCTGCAAACTGGCGACTGAAACCGG
GAGGTGGTGGTGTGAATGCTGCGATTTTTAGTGCTGCAGGCCCTGGTCTGGAAGTGGCGACTAAACAACAAGCAAACTCCCTTCATCCTGGCAATGCCGTGGCTGTTCAG
TTGCCTTCAACTTCTCCTTTGTTAAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCCAACATGAATCCACAAAGGCCAAATTATCTCAACAATGACTATGA
TGAAGGTTGCAAGCTTCTTGGCAATGCTTACTCTTCCCTACTTCAGGCCTTTATTTCAATCGTACAAGACAAATATAAGTCGGTTAAGGGAATTAATGAATGCCTCGGCT
CAACACCTCCAGAACCACAAAAGCACTCCGAGAACAGTCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCTTGAAATAAGCAAAAAATGGAAAGGATCTCATAACTCA
ACCGAAGGATTAAACCAAAACAACAATAAGACTGTCCCCAAAAAGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTATGACACTGCAATGCATCCCGAGCGACATAC
CAATTCTGTACTAGAGACATCAGATGATGTTGTAGTACTGTATGATATTTATCCGAAGGCACGCAAGCATCTTTTAGTCGTGGCTCGGCATGAAGGCCTCGATCAACTAG
CCGACGTATGTACAGAACACCTTCCATTGTTGAGGACAATGCACGCTATGGGTTTGAAGTGGATCCATAAGTTCTTTCATGAAGATGCACCATTGGTCTTTCGCCTTGGA
TACCACTCGGCTCCATCAATGAGGCAGCTGCACCTACATGTTATAAGCCAGGATTTCGATTCCACTCATCTGAAAAACAAGAAGCACTGGAATTCTTTCAACACCGATTT
CTTCAGAGACTCGGTGACCGTTATAGACGAAGTCAGTAGCCATGGAAAGGCGAGGATCATGGACGACGAGAGATTGATATCTATGGAGTTACGTTGCAACAGATGCAGAA
GTGCTCATCCCAACTTACCCAAATTGAAAGCGCATATTTCCAAATGCCAAGCGCCTTTCCCTTCCACACTGCTTGAGGACGGTCGTTTAGTCGTTGAGCCAAGCAATGCT
CCTCTTTCTTAGCTTTCATAATGTCTTCCACACACAAAAATTTTGTTGTGAATATTTTGAGCTCTATCTTTGGGGTTTATGCAGTTCTGATGATGGCCTCGAGAAGCATA
CATTTTGTCTCTCTCTCGGATTTCGAAGATTCAAGTTCCATGAATCCATTTAATTGCACGGAATTTTCAAACTACATTTTTGTTGATTTTTCCTTCTCCTAGTGTTTTGT
GGGAAAGTCTGGTGTCACTCTCCTGATTATTATGTTCTCC
Protein sequenceShow/hide protein sequence
MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTRAQCLKSATSALNDGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNESDVLSAIDMYKSLDLHNMLPHGCFGQKNPDKKVQLGIMKFLKKAENPSKTCSSA
NADKNSPTPQPTQEKRESCDKKEESSCRMSRNVAMESEKGESPGVRSLEGKISQSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNARLVLVDLSHGSKILS
MVKAKATEKNISSTKFFTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLHPGNAVAVQLPSTSPLLNREGVTHVIHVLGPNMNP
QRPNYLNNDYDEGCKLLGNAYSSLLQAFISIVQDKYKSVKGINECLGSTPPEPQKHSENSHHKFKRENLQNLEISKKWKGSHNSTEGLNQNNNKTVPKKSKHWGSWAQAL
YDTAMHPERHTNSVLETSDDVVVLYDIYPKARKHLLVVARHEGLDQLADVCTEHLPLLRTMHAMGLKWIHKFFHEDAPLVFRLGYHSAPSMRQLHLHVISQDFDSTHLKN
KKHWNSFNTDFFRDSVTVIDEVSSHGKARIMDDERLISMELRCNRCRSAHPNLPKLKAHISKCQAPFPSTLLEDGRLVVEPSNAPLS