; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014449 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014449
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptiontranscription factor bHLH140
Genome locationtig00000589:471368..480649
RNA-Seq ExpressionSgr014449
SyntenySgr014449
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.0e+0083.02Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMD D+N   KGKEGQGKL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKS+FVDRCNLEIEQRADFVKLG
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
         P+VD+HAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKL+EGFTRITFCHNE+DV SAID YK L LH+MLP+GCFGQKNPD KVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE P+KTCSSAN  K+SP  Q T+EK  SC+KKEES+C MS  V  ESEKGESPGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVLVDLSHGSKILS+VKAKA +KNI S KFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS
        Q NSL PGN V VQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL   YSSLFQ FISI++D++KSVKGI+          +KHSE+S
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
                 HKFKRE++QN E SKKWKGS +S   LNQNNNK V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV+VL DIYPKA KHLLVVAR+E
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+  EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDS+ V++EVSSHGKA I D
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD
        DERL+SMELRCNRCRSAHPNLPKLKAHISKC+APFPSTLLE  RLV +
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.0e+0083.02Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMD D+N   KGKEGQGKL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALN+GKS+FVDRCNLEIEQRADFVKLG
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
         P+VD+HAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKL+EGFTRITFCHNE+DV SAID YK L LH MLP+GCFGQKNPD KVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAE P+KTCSSANT K+SP  Q T+EK  SC KKEES+CTMS  V  ESEKGESPG+RSL D IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVLVDLSHGSKILS+VKAKA +KNI S KFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS
        Q NSL+PGN V VQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL   YSSLFQ FISI++D++KSVKGIH          +KHSED 
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
                 HKFKRE++QN ERSKKWKGSQ+S   LNQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV+VL DIYPKA KHLLVVAR+E
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+  EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDS+ V+ EVSSHGKA+I D
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD
        DE LMSMELRCNRCRSAHPNLPKLKAHISKC+APFPSTLLE  RLV +
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0088.74Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMDIDDN T KGKEGQ KL+MVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL++GKSIFVDRCNLEIEQRADFVK+G
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
           VD+HAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKL+EGFTRITFCHNETDVQSAIDTYK LGLHD LP+GCFGQ   DNKVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ PAKTCSSAN IKDSP  Q +RE SYSC+KKEE ACT+   VDKESEKGE+PGVRSLGD+ISRSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEF+DKLGNARLVLVDL++GSK+LSLVKAKA KK I+ +KFFTFVGDITKLNSEGGL CNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHK----------KHSEDS
        Q NSLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRA YSSLFQGFISI+E+QFKSVKGI K          KHSEDS
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHK----------KHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
         RSTFPSCDHKFKREDVQNPERSKKWKGSQDSA A NQNNN IVHKMSKHWGSWAQALYNTAMHPERH D VLE+SDDV VLNDIY KAHKHLLVVARYE
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+RREHLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDS+DVM+EV SHGKASIKD
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV
        DE LMSMELRCNRCRSAHPNL KLKAHISKCRAPFPSTLLE  RLV
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0086.19Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMD D+N T KGKEGQGKL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GK++FVDRCNLEIEQRADFVKL 
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
         PRVD+ AVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP+L+EGF RITFCHNETDVQSAID YK L LHDMLP GCFGQKNPD KVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE P++T SSANT+KDSP+ Q T+EKS SC+KKEESACT+S  VD ES+KGESPGVRSL DNIS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVLVDLSHGSKILSLVKAKA KKNI S KFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS
        Q NSL+PGN V VQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR  YSSLFQ FIS++ED+FKSVKGIH          +KHSE+S
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
                 HKFKRE++QNPERSKKWKGSQDS  ALNQNNNK V KMSKHWGSWAQALYNTAMHPE+H D VLE SDDV+VLNDIYPKA KHLLVVAR+E
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+  EHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDS+ VM+EVSSHGKAS+ D
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV
        DE LMSMELRCNRCRSAHPNLPKLKAHI KC+APFPSTLLE  RLV
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0086.48Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLGCPRVDLHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GK++FVDRCNLEIEQRADFVKL  PRVD+ AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLGCPRVDLHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQLGIMKFLKKAEIPAKTCSSAN
        SVKRTGHEGNL GGKAAAVVNKMLQKKELP+L+EGF RITFCHNETDVQSAID YK L LHDMLP GCFGQKNPD KVQLGIMKFLKKAE P++T SSAN
Subjt:  SVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQLGIMKFLKKAEIPAKTCSSAN

Query:  TIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFLDKLGNARLVLVD
        T+KDSP+ Q T+EKS SC+KKEESACT+S  VD ES+KGESPGVRSL DNIS+SDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEF+DKLGNARLVLVD
Subjt:  TIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFLDKLGNARLVLVD

Query:  LSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLPSTSPL
        LSHGSKILSLVKAKA KKNI S KFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQ NSL+PGN V VQLPSTSPL
Subjt:  LSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLPSTSPL

Query:  FNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDSSRSTFPSCDHKFKREDVQNPE
        FNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR  YSSLFQ FIS++ED+FKSVKGIH          +KHSE+S         HKFKRE++QNPE
Subjt:  FNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDSSRSTFPSCDHKFKREDVQNPE

Query:  RSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTMHA
        RSKKWKGSQDS  ALNQNNNK V KMSKHWGSWAQALYNTAMHPE+H D VLE SDDV+VLNDIYPKA KHLLVVAR+EGLDQLAD+  EHLPLLRTMHA
Subjt:  RSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTMHA

Query:  VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSMELRCNRCRSAHPNL
        VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDS+ VM+EVSSHGKAS+ DDE LMSMELRCNRCRSAHPNL
Subjt:  VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSMELRCNRCRSAHPNL

Query:  PKLKAHISKCRAPFPSTLLERDRLV
        PKLKAHI KC+APFPSTLLE  RLV
Subjt:  PKLKAHISKCRAPFPSTLLERDRLV

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0083.02Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMD D+N   KGKEGQGKL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALN+GKS+FVDRCNLEIEQRADFVKLG
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
         P+VD+HAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKL+EGFTRITFCHNE+DV SAID YK L LH MLP+GCFGQKNPD KVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAE P+KTCSSANT K+SP  Q T+EK  SC KKEES+CTMS  V  ESEKGESPG+RSL D IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVLVDLSHGSKILS+VKAKA +KNI S KFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS
        Q NSL+PGN V VQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL   YSSLFQ FISI++D++KSVKGIH          +KHSED 
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
                 HKFKRE++QN ERSKKWKGSQ+S   LNQNNN  V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV+VL DIYPKA KHLLVVAR+E
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+  EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDS+ V+ EVSSHGKA+I D
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD
        DE LMSMELRCNRCRSAHPNLPKLKAHISKC+APFPSTLLE  RLV +
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD

A0A1S3B6C4 transcription factor bHLH1400.0e+0083.02Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMD D+N   KGKEGQGKL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKS+FVDRCNLEIEQRADFVKLG
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
         P+VD+HAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKL+EGFTRITFCHNE+DV SAID YK L LH+MLP+GCFGQKNPD KVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE P+KTCSSAN  K+SP  Q T+EK  SC+KKEES+C MS  V  ESEKGESPGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVLVDLSHGSKILS+VKAKA +KNI S KFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS
        Q NSL PGN V VQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL   YSSLFQ FISI++D++KSVKGI+          +KHSE+S
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
                 HKFKRE++QN E SKKWKGS +S   LNQNNNK V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV+VL DIYPKA KHLLVVAR+E
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+  EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDS+ V++EVSSHGKA I D
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD
        DERL+SMELRCNRCRSAHPNLPKLKAHISKC+APFPSTLLE  RLV +
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD

A0A5A7TLV2 Transcription factor bHLH1400.0e+0083.02Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMD D+N   KGKEGQGKL+MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKS+FVDRCNLEIEQRADFVKLG
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
         P+VD+HAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKL+EGFTRITFCHNE+DV SAID YK L LH+MLP+GCFGQKNPD KVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE P+KTCSSAN  K+SP  Q T+EK  SC+KKEES+C MS  V  ESEKGESPGVRSL   IS+SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEF+DKLGNARLVLVDLSHGSKILS+VKAKA +KNI S KFFTFVGDITKLNSEGGL CNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS
        Q NSL PGN V VQLPSTSPL NREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLL   YSSLFQ FISI++D++KSVKGI+          +KHSE+S
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIH----------KKHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
                 HKFKRE++QN E SKKWKGS +S   LNQNNNK V K SKHWGSWAQALY+TAMHPERH ++VLE SDDV+VL DIYPKA KHLLVVAR+E
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+  EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDS+ V++EVSSHGKA I D
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD
        DERL+SMELRCNRCRSAHPNLPKLKAHISKC+APFPSTLLE  RLV +
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLVTD

A0A6J1CV45 transcription factor bHLH1400.0e+0088.74Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMDIDDN T KGKEGQ KL+MVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL++GKSIFVDRCNLEIEQRADFVK+G
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
           VD+HAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKL+EGFTRITFCHNETDVQSAIDTYK LGLHD LP+GCFGQ   DNKVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ PAKTCSSAN IKDSP  Q +RE SYSC+KKEE ACT+   VDKESEKGE+PGVRSLGD+ISRSDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEF+DKLGNARLVLVDL++GSK+LSLVKAKA KK I+ +KFFTFVGDITKLNSEGGL CNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHK----------KHSEDS
        Q NSLRPGN V  QLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRA YSSLFQGFISI+E+QFKSVKGI K          KHSEDS
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHK----------KHSEDS

Query:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE
         RSTFPSCDHKFKREDVQNPERSKKWKGSQDSA A NQNNN IVHKMSKHWGSWAQALYNTAMHPERH D VLE+SDDV VLNDIY KAHKHLLVVARYE
Subjt:  SRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYE

Query:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD
        GLDQLAD+RREHLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDS+DVM+EV SHGKASIKD
Subjt:  GLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKD

Query:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV
        DE LMSMELRCNRCRSAHPNL KLKAHISKCRAPFPSTLLE  RLV
Subjt:  DERLMSMELRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV

A0A6J1GCV0 transcription factor bHLH1400.0e+0083.33Show/hide
Query:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG
        MDMDID+N T KG E + KL+MVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GKS+FVDRCNLEIEQR++FVKLG
Subjt:  MDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLG

Query:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ
           VD+HAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELP L+EGF RITFCH+ETDVQSAIDTYK LGLHD LP+GCFGQKN D KVQ
Subjt:  CPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQ

Query:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAE PAKTCS+ANT KD P SQ T+E       K+ES+CTM   V+KESEKGE+PGV SL +NIS SDPPTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEF+DKLGNARLV+VDLSHGSKILSLVKAKA KKNI S KFFTFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHKKHSEDSSRSTFPSCD-
        Q  SLRPGNVV VQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLR  YSSLFQ FISI++D+FKS KGI ++     S S   S D 
Subjt:  QVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHKKHSEDSSRSTFPSCD-

Query:  -HKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADI
         HKFKR ++Q PERSKKWKG+Q+SA ALNQNNNKI HKMSKHWGSWAQALYNTAM+PERH + VLE SDDV+VLNDIYPKA KHLL+VARYEGLDQLAD+
Subjt:  -HKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADI

Query:  RREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSME
         +EHLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDS+DV++EVSSHGKA IKDDE LMSME
Subjt:  RREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSME

Query:  LRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV
         RCNRCRSAHPNLPKLK HISKC++PFPSTLLE  RLV
Subjt:  LRCNRCRSAHPNLPKLKAHISKCRAPFPSTLLERDRLV

SwissProt top hitse value%identityAlignment
P61798 Aprataxin (Fragment)7.8e-3738.27Show/hide
Query:  SEDSSRSTFPSCDHKFKREDVQN-PERSKKWK--GSQDSAAALNQNNNKI-----VHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPK
        +E+  ++    C+   + +D++N P+++KK +   +Q S+A L  + + +          +H G W+Q L ++   P+      +   +  +V+ D YPK
Subjt:  SEDSSRSTFPSCDHKFKREDVQN-PERSKKWK--GSQDSAAALNQNNNKI-----VHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPK

Query:  AHKHLLVVARYEGLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVME
        A  H LV+  ++ +  L  + REHL LL  MHAVG K I +   ++ SL FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S +V+E
Subjt:  AHKHLLVVARYEGLDQLADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVME

Query:  EVSSHGKASIKDD-ERLMSMELRCNRCRSAHPNLPKLKAHISK
         V S GK ++ D    L+ + LRC+ C+     +P+LK H+ K
Subjt:  EVSSHGKASIKDD-ERLMSMELRCNRCRSAHPNLPKLKAHISK

Q7TQC5 Aprataxin6.2e-3439.82Show/hide
Query:  KFKREDVQNPERSKK-----WKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQL
        K KR D  + E   +       GS  S  +++   +K      +  G W+Q L  +   P+      +   D V+V+ D YPKA  H LV+  +  +  L
Subjt:  KFKREDVQNPERSKK-----WKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQL

Query:  ADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDE-RL
          +  EHL LL+ MHAVG K I   F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ ++KD    L
Subjt:  ADIRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDE-RL

Query:  MSMELRCNRCRSAHPNLPKLKAHISK
        + + LRC+ C+   P++P+LK H+ K
Subjt:  MSMELRCNRCRSAHPNLPKLKAHISK

Q7YRZ1 Aprataxin2.8e-3441.67Show/hide
Query:  GSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTMHAVGLKWI
        GS  S  ++     K      +    W+Q L  +   P+      +   D V+V+ D YPKA  H LV+  +  +  L  + REHL LLR MH VG K I
Subjt:  GSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTMHAVGLKWI

Query:  DKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDD-ERLMSMELRCNRCRSAHPNLPKLKA
           F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  V+E V   G+ +++D    L+ + LRC+ C+   P++P+LK 
Subjt:  DKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDD-ERLMSMELRCNRCRSAHPNLPKLKA

Query:  HISK
        H+ K
Subjt:  HISK

Q8K4H4 Aprataxin6.2e-3440.27Show/hide
Query:  KFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRR
        K KR D  + E   +  G+  S  +++    K      +  G W+Q L  +   P+      +   D V+V+ D YPKA  H LV+  +  +  L  +  
Subjt:  KFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRR

Query:  EHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDE-RLMSMEL
        EHL LL+ MHAVG K I   F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S  V++ V   G+ ++KD    L+ + L
Subjt:  EHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDE-RLMSMEL

Query:  RCNRCRSAHPNLPKLKAHISK
        RC+ C+   P++P+LK H+ K
Subjt:  RCNRCRSAHPNLPKLKAHISK

Q9M041 Transcription factor bHLH1403.1e-24360.11Show/hide
Query:  QGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLGCPRVDLHAVVLDLPAQ
        + K ++V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L  GKS+F+DRCNL+ EQR++F+KLG P  ++HAVVL+LPAQ
Subjt:  QGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLGCPRVDLHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQLGIMKFLKKAEIPAKT
        +CISRSVKRTGHEGNLQGG+AAAVVNKMLQ KELPK++EGF+RI FC+++ DV +A++ Y  LG  D LP+GCFG+K  D K Q GIMKF KK  + A  
Subjt:  LCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQLGIMKFLKKAEIPAKT

Query:  CSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFLDKLGNAR
         SS+N            E + +  K +E    +            SP      D +     PTLAFPSIST+DF+F  EKA++IIVEK EEFL KLG AR
Subjt:  CSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFLDKLGNAR

Query:  LVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLP
        LVLVDLS GSKILSLVKAKA +KNIDS KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ + N+L PG  V V LP
Subjt:  LVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLP

Query:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHKKHSEDSSRSTFPSCDHKFKREDV-QNPERSKK
        ST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR  Y+SLF+GF+S+++DQ K  K   +    DS              ED+ ++ ER+KK
Subjt:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHKKHSEDSSRSTFPSCDHKFKREDV-QNPERSKK

Query:  WKGSQDSAAALNQNNNKIV------HKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTM
        +KGSQD A   N  +  +        KMSK W +WA AL++ AMHPERH + VLE  D+++V+ND YPKA KH+LV+AR E LD L D+R+E+L LL+ M
Subjt:  WKGSQDSAAALNQNNNKIV------HKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTM

Query:  HAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSMELRCNRCRSAHP
        H VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDS+DV+EEV+S GKA++  ++ L+  ELRCNRCRSAHP
Subjt:  HAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSMELRCNRCRSAHP

Query:  NLPKLKAHISKCRAPFPSTLLERDRLV
        N+PKLK+H+  C + FP  LL+ +RLV
Subjt:  NLPKLKAHISKCRAPFPSTLLERDRLV

Arabidopsis top hitse value%identityAlignment
AT2G40600.1 appr-1-p processing enzyme family protein8.1e-0534.71Show/hide
Query:  DSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGP----N
        DS+      GDITK + +     + I N AN R+  GGGG + AI  AAGP L  A   +V  +RPG          +P FN    + VIH +GP    +
Subjt:  DSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGP----N

Query:  MNPQRPNYLNNDYDEGCKLLR
        +NPQ    L N Y    ++ +
Subjt:  MNPQRPNYLNNDYDEGCKLLR

AT5G01310.1 APRATAXIN-like2.2e-24460.11Show/hide
Query:  QGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLGCPRVDLHAVVLDLPAQ
        + K ++V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L  GKS+F+DRCNL+ EQR++F+KLG P  ++HAVVL+LPAQ
Subjt:  QGKLVMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLGCPRVDLHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQLGIMKFLKKAEIPAKT
        +CISRSVKRTGHEGNLQGG+AAAVVNKMLQ KELPK++EGF+RI FC+++ DV +A++ Y  LG  D LP+GCFG+K  D K Q GIMKF KK  + A  
Subjt:  LCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGFTRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQLGIMKFLKKAEIPAKT

Query:  CSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFLDKLGNAR
         SS+N            E + +  K +E    +            SP      D +     PTLAFPSIST+DF+F  EKA++IIVEK EEFL KLG AR
Subjt:  CSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRSLGDNISRSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFLDKLGNAR

Query:  LVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLP
        LVLVDLS GSKILSLVKAKA +KNIDS KFFTFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ + N+L PG  V V LP
Subjt:  LVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLP

Query:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHKKHSEDSSRSTFPSCDHKFKREDV-QNPERSKK
        ST PL N EG+THVIHVLGPNMNP RP+ LNNDY +GCK LR  Y+SLF+GF+S+++DQ K  K   +    DS              ED+ ++ ER+KK
Subjt:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHKKHSEDSSRSTFPSCDHKFKREDV-QNPERSKK

Query:  WKGSQDSAAALNQNNNKIV------HKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTM
        +KGSQD A   N  +  +        KMSK W +WA AL++ AMHPERH + VLE  D+++V+ND YPKA KH+LV+AR E LD L D+R+E+L LL+ M
Subjt:  WKGSQDSAAALNQNNNKIV------HKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLADIRREHLPLLRTM

Query:  HAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSMELRCNRCRSAHP
        H VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDS+DV+EEV+S GKA++  ++ L+  ELRCNRCRSAHP
Subjt:  HAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSMELRCNRCRSAHP

Query:  NLPKLKAHISKCRAPFPSTLLERDRLV
        N+PKLK+H+  C + FP  LL+ +RLV
Subjt:  NLPKLKAHISKCRAPFPSTLLERDRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTGATAGAGTGCTCACTTCTGCTAACTCTTCCTTCTACAGAATCTATAACGAAATTCTTCAAGCGATAAAGTTCCCTTTCACTCTGTCGCGAACTCTCTCACG
ATCGCGAGAGAGAGGCGTAGACAGCGACTGCGCTTTTCTGCAGCAGGGTTTAAAGGAAGACCTTAAAGCTAATCAGGAAATGGACATGGATATCGACGATAATCCCACAG
TCAAAGGAAAGGAAGGACAAGGGAAGCTCGTCATGGTAATATTAGTGGGTGCACCAGGAAGCGGCAAGTCCACCTTCTGCGAACTCGTAATGGGTTCCTCTTCTCGCCCG
TGGGTTCGCATCTGCCAGGACACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGCGCAGCCAGTGCATTGAATAATGGAAAGAGTATATTTGTTGA
TAGGTGTAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAACTCGGCTGCCCTCGAGTGGATCTACATGCTGTTGTACTAGATCTTCCTGCACAGCTCTGTATTTCTC
GTTCGGTAAAGCGGACTGGTCATGAAGGGAATTTACAAGGTGGAAAAGCTGCTGCTGTGGTGAATAAAATGCTGCAAAAGAAAGAATTGCCCAAGCTAAGTGAAGGGTTT
ACTCGAATAACCTTTTGCCACAATGAGACCGACGTTCAATCTGCTATCGATACGTACAAGTTGCTTGGTTTACATGATATGCTTCCAAATGGATGTTTTGGACAAAAGAA
CCCAGACAATAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAATTCCTGCTAAGACATGTTCTAGTGCCAATACCATTAAGGATTCTCCAGTTTCTCAAG
CTACCCGGGAAAAGAGCTACTCTTGTAATAAAAAGGAAGAGTCTGCCTGTACAATGTCCTGCATTGTTGATAAAGAGTCAGAGAAAGGTGAAAGTCCAGGTGTAAGATCC
TTAGGAGACAATATTTCTCGAAGTGATCCTCCAACTCTTGCATTTCCATCTATTTCGACATCGGATTTCAAGTTCAGCCATGAAAAGGCTGCTGAAATTATTGTTGAGAA
GGTTGAGGAGTTCTTGGATAAGCTTGGGAATGCCAGACTTGTTTTAGTAGACTTGAGTCATGGATCAAAGATTTTGTCTCTGGTTAAAGCTAAAGCAGTCAAGAAAAACA
TTGATTCCAACAAATTTTTTACATTTGTCGGAGATATAACTAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTAATAGCCAATGCTGCAAATTGGCGACTGAAACCG
GGAGGTGGTGGTGTCAATGCCGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGTGAACTCCCTTCGACCTGGCAATGTCGTGACTGTTCA
ATTGCCTTCAACTTCTCCTTTGTTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCACAAAGGCCAAATTATCTCAATAATGACTATG
ATGAAGGTTGCAAACTTCTTCGTGCCACTTACTCTTCCCTGTTTCAAGGTTTTATTTCAATAATAGAAGACCAATTTAAGTCGGTGAAGGGAATTCACAAAAAGCACTCC
GAGGACAGCTCAAGAAGTACCTTCCCAAGTTGCGATCACAAGTTTAAGAGAGAGGATGTGCAAAATCCTGAAAGAAGCAAAAAGTGGAAAGGATCTCAGGATTCAGCTGC
AGCATTAAACCAAAACAACAATAAGATTGTCCACAAAATGAGTAAGCACTGGGGCTCATGGGCACAAGCACTTTACAACACTGCAATGCATCCCGAGAGACATTGCGATA
ATGTACTGGAGATATCAGATGATGTCATAGTACTGAATGATATTTATCCAAAGGCACACAAGCATCTTCTGGTAGTGGCTCGGTATGAAGGCCTCGATCAACTGGCTGAT
ATACGTCGAGAACACCTTCCATTGTTGAGGACAATGCATGCTGTGGGTTTGAAGTGGATCGATAAATTCTTTCATGAAGATGCATCATTGGTTTTTCGCCTTGGATACCA
CTCGGCTCCATCCATGAGGCAGCTGCACCTACACGTTATAAGCCAGGATTTCGACTCAAGTCACCTGAAGAACAAGAAGCACTGGAATTCTTTCAACACCGATTTCTTCA
GAGATTCGATAGATGTTATGGAGGAAGTCAGTAGCCATGGAAAGGCAAGCATCAAGGATGATGAGAGGTTGATGTCTATGGAGTTGCGTTGCAACCGATGCAGAAGTGCG
CATCCCAACTTACCCAAATTGAAAGCACATATTTCCAAATGTAGAGCGCCTTTCCCTTCCACACTACTCGAGCGCGATCGTTTAGTGACTGACCAAAAGATGGTAAATGC
TGTAGAGTTGATGCATTGCATCAAAGTATTACTGAGGATTGTGAGGGCGGCTGTTTGCTGCAGAGACTCGAGAGCCCCATTCCAACAGCTCTTTGCTTGTATCATCCTTA
TGAGCAAATACCTCAATGAAGCACATGCAGTCCTTTTTGGCTCCTGTAGCTGTATCAATGGCCTCCACAAGCTCTTCCTCGCAACGAACCTGCAACAAGAAATTTGTTCA
CAAAATGGAGAATTCCTTCCCAGAGATCTCAAACCAGCAAGTTCGATATCAACTGACCTTTGTCGTCCAACACTTGCCTTCCCCGTTATGGATCGCATCGACCAATCCTG
TGTAGTTCCAGTTTTTGATCACGTTGTAAGGCCCGTCGTGGATCTCGACTTCAATGGTGTAACCACCATTGTTTATCAAGAATATGATGTTAAACCAAGAGTCCCCTGTC
TCAGCAATCACAGCCGTTTGATCACTGAGCATCTTCTGAATGTGTTGGAACAAAACATTGACTCTCAACGCCTCTTTTGGCTCAGATTTCAAAGGATGGCCTTCAGGGAC
GAAGATCCTGTGATAGTTTTCATAGGCAGTGGTGTTGTTCTTCACCCGCTTAGCAAAGCTATAGTCATTGAAAATCGGCCCGGTGAACAAATACGCATCGGCGGACTCAA
CGATCTCGGCGCAGAAGGCAGTGCTGACAGCGCCCCAGTAAGTTCCAATGAAGTGAGGATGGTGCTCGGGCACGAGCCCTTTGGCCGACGGCATCACTGCCAAGGCGTAG
CCACAAGCATCAGCAAGCTCTACAAAGGCATCGCACGCCTTGGCAACTCGCATCTTAGGCCCACCAACAAGCACTGGCTTCACAGCCTTGTTCAAGAAATGTGCCGCCGC
CTCCACCGCAGCCTCCAGCCCAGATGGATTGCTCAATCTGTTGCCACCAGAGAAATCCATTCAATAATTTCAAAGTTAAGTTCCTCTATAAAATGGCATGATAAATTTCT
TGACAGAACTTACTTGGGAGACAGTGAAAATGGAACTGGTTCGCGGCTGAAGCTCTTGGCTGAAGTCGGGGAGGCCGATGGTGTGGTGGAGAATTCTGTTGGTTCCATAA
TCGTTGGAGTTGGGGCCTCCGACCACACAAATGAGTGGCAGTTCTCACTGTACGCGCCGGCAATCGCATTCAGAACGCTGAGTCCTCCGACCGTGAACGTGACGACGCAA
GCGCCGACGCCGCGAGACCTCGCATACCCATCGGCGGCGTACCCAGCATTGAGCTCGTTGCAGCAGCCGATGTTGTTGAGGCCGGGCTCTGCGATGAGATGGTCGAGAAG
CGTCAGGTTGAAGTCGCCGGGGACGGTGAAAACATCGGTCACGCCAATCTGGACGAGGCGGCGAGCGAGGTGGCGGCCGAGCGTGGCCTCTGACGAGTTAACGACGGCGG
AAGGGACCGAATCCTGGATGGTACAAACAGAGCCATTTGCTGGGCAGCAGACGACGCTGTTGGCGGGCTTGCAAGTGTCGAGCGATCCGATTTTTGTGTCCATGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTGATAGAGTGCTCACTTCTGCTAACTCTTCCTTCTACAGAATCTATAACGAAATTCTTCAAGCGATAAAGTTCCCTTTCACTCTGTCGCGAACTCTCTCACG
ATCGCGAGAGAGAGGCGTAGACAGCGACTGCGCTTTTCTGCAGCAGGGTTTAAAGGAAGACCTTAAAGCTAATCAGGAAATGGACATGGATATCGACGATAATCCCACAG
TCAAAGGAAAGGAAGGACAAGGGAAGCTCGTCATGGTAATATTAGTGGGTGCACCAGGAAGCGGCAAGTCCACCTTCTGCGAACTCGTAATGGGTTCCTCTTCTCGCCCG
TGGGTTCGCATCTGCCAGGACACCATTGGAAATGGCAAGTCTGGAACCAAAGCACAGTGCTTGAAGAGCGCAGCCAGTGCATTGAATAATGGAAAGAGTATATTTGTTGA
TAGGTGTAATCTTGAAATAGAGCAGCGTGCAGATTTTGTGAAACTCGGCTGCCCTCGAGTGGATCTACATGCTGTTGTACTAGATCTTCCTGCACAGCTCTGTATTTCTC
GTTCGGTAAAGCGGACTGGTCATGAAGGGAATTTACAAGGTGGAAAAGCTGCTGCTGTGGTGAATAAAATGCTGCAAAAGAAAGAATTGCCCAAGCTAAGTGAAGGGTTT
ACTCGAATAACCTTTTGCCACAATGAGACCGACGTTCAATCTGCTATCGATACGTACAAGTTGCTTGGTTTACATGATATGCTTCCAAATGGATGTTTTGGACAAAAGAA
CCCAGACAATAAAGTACAACTTGGCATAATGAAGTTCTTAAAGAAAGCAGAAATTCCTGCTAAGACATGTTCTAGTGCCAATACCATTAAGGATTCTCCAGTTTCTCAAG
CTACCCGGGAAAAGAGCTACTCTTGTAATAAAAAGGAAGAGTCTGCCTGTACAATGTCCTGCATTGTTGATAAAGAGTCAGAGAAAGGTGAAAGTCCAGGTGTAAGATCC
TTAGGAGACAATATTTCTCGAAGTGATCCTCCAACTCTTGCATTTCCATCTATTTCGACATCGGATTTCAAGTTCAGCCATGAAAAGGCTGCTGAAATTATTGTTGAGAA
GGTTGAGGAGTTCTTGGATAAGCTTGGGAATGCCAGACTTGTTTTAGTAGACTTGAGTCATGGATCAAAGATTTTGTCTCTGGTTAAAGCTAAAGCAGTCAAGAAAAACA
TTGATTCCAACAAATTTTTTACATTTGTCGGAGATATAACTAAACTCAATTCAGAAGGTGGATTGTGCTGCAATGTAATAGCCAATGCTGCAAATTGGCGACTGAAACCG
GGAGGTGGTGGTGTCAATGCCGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGTGAACTCCCTTCGACCTGGCAATGTCGTGACTGTTCA
ATTGCCTTCAACTTCTCCTTTGTTTAATAGGGAAGGAGTAACCCATGTCATACATGTTCTTGGACCAAACATGAATCCACAAAGGCCAAATTATCTCAATAATGACTATG
ATGAAGGTTGCAAACTTCTTCGTGCCACTTACTCTTCCCTGTTTCAAGGTTTTATTTCAATAATAGAAGACCAATTTAAGTCGGTGAAGGGAATTCACAAAAAGCACTCC
GAGGACAGCTCAAGAAGTACCTTCCCAAGTTGCGATCACAAGTTTAAGAGAGAGGATGTGCAAAATCCTGAAAGAAGCAAAAAGTGGAAAGGATCTCAGGATTCAGCTGC
AGCATTAAACCAAAACAACAATAAGATTGTCCACAAAATGAGTAAGCACTGGGGCTCATGGGCACAAGCACTTTACAACACTGCAATGCATCCCGAGAGACATTGCGATA
ATGTACTGGAGATATCAGATGATGTCATAGTACTGAATGATATTTATCCAAAGGCACACAAGCATCTTCTGGTAGTGGCTCGGTATGAAGGCCTCGATCAACTGGCTGAT
ATACGTCGAGAACACCTTCCATTGTTGAGGACAATGCATGCTGTGGGTTTGAAGTGGATCGATAAATTCTTTCATGAAGATGCATCATTGGTTTTTCGCCTTGGATACCA
CTCGGCTCCATCCATGAGGCAGCTGCACCTACACGTTATAAGCCAGGATTTCGACTCAAGTCACCTGAAGAACAAGAAGCACTGGAATTCTTTCAACACCGATTTCTTCA
GAGATTCGATAGATGTTATGGAGGAAGTCAGTAGCCATGGAAAGGCAAGCATCAAGGATGATGAGAGGTTGATGTCTATGGAGTTGCGTTGCAACCGATGCAGAAGTGCG
CATCCCAACTTACCCAAATTGAAAGCACATATTTCCAAATGTAGAGCGCCTTTCCCTTCCACACTACTCGAGCGCGATCGTTTAGTGACTGACCAAAAGATGGTAAATGC
TGTAGAGTTGATGCATTGCATCAAAGTATTACTGAGGATTGTGAGGGCGGCTGTTTGCTGCAGAGACTCGAGAGCCCCATTCCAACAGCTCTTTGCTTGTATCATCCTTA
TGAGCAAATACCTCAATGAAGCACATGCAGTCCTTTTTGGCTCCTGTAGCTGTATCAATGGCCTCCACAAGCTCTTCCTCGCAACGAACCTGCAACAAGAAATTTGTTCA
CAAAATGGAGAATTCCTTCCCAGAGATCTCAAACCAGCAAGTTCGATATCAACTGACCTTTGTCGTCCAACACTTGCCTTCCCCGTTATGGATCGCATCGACCAATCCTG
TGTAGTTCCAGTTTTTGATCACGTTGTAAGGCCCGTCGTGGATCTCGACTTCAATGGTGTAACCACCATTGTTTATCAAGAATATGATGTTAAACCAAGAGTCCCCTGTC
TCAGCAATCACAGCCGTTTGATCACTGAGCATCTTCTGAATGTGTTGGAACAAAACATTGACTCTCAACGCCTCTTTTGGCTCAGATTTCAAAGGATGGCCTTCAGGGAC
GAAGATCCTGTGATAGTTTTCATAGGCAGTGGTGTTGTTCTTCACCCGCTTAGCAAAGCTATAGTCATTGAAAATCGGCCCGGTGAACAAATACGCATCGGCGGACTCAA
CGATCTCGGCGCAGAAGGCAGTGCTGACAGCGCCCCAGTAAGTTCCAATGAAGTGAGGATGGTGCTCGGGCACGAGCCCTTTGGCCGACGGCATCACTGCCAAGGCGTAG
CCACAAGCATCAGCAAGCTCTACAAAGGCATCGCACGCCTTGGCAACTCGCATCTTAGGCCCACCAACAAGCACTGGCTTCACAGCCTTGTTCAAGAAATGTGCCGCCGC
CTCCACCGCAGCCTCCAGCCCAGATGGATTGCTCAATCTGTTGCCACCAGAGAAATCCATTCAATAATTTCAAAGTTAAGTTCCTCTATAAAATGGCATGATAAATTTCT
TGACAGAACTTACTTGGGAGACAGTGAAAATGGAACTGGTTCGCGGCTGAAGCTCTTGGCTGAAGTCGGGGAGGCCGATGGTGTGGTGGAGAATTCTGTTGGTTCCATAA
TCGTTGGAGTTGGGGCCTCCGACCACACAAATGAGTGGCAGTTCTCACTGTACGCGCCGGCAATCGCATTCAGAACGCTGAGTCCTCCGACCGTGAACGTGACGACGCAA
GCGCCGACGCCGCGAGACCTCGCATACCCATCGGCGGCGTACCCAGCATTGAGCTCGTTGCAGCAGCCGATGTTGTTGAGGCCGGGCTCTGCGATGAGATGGTCGAGAAG
CGTCAGGTTGAAGTCGCCGGGGACGGTGAAAACATCGGTCACGCCAATCTGGACGAGGCGGCGAGCGAGGTGGCGGCCGAGCGTGGCCTCTGACGAGTTAACGACGGCGG
AAGGGACCGAATCCTGGATGGTACAAACAGAGCCATTTGCTGGGCAGCAGACGACGCTGTTGGCGGGCTTGCAAGTGTCGAGCGATCCGATTTTTGTGTCCATGGATTGA
Protein sequenceShow/hide protein sequence
MDSDRVLTSANSSFYRIYNEILQAIKFPFTLSRTLSRSRERGVDSDCAFLQQGLKEDLKANQEMDMDIDDNPTVKGKEGQGKLVMVILVGAPGSGKSTFCELVMGSSSRP
WVRICQDTIGNGKSGTKAQCLKSAASALNNGKSIFVDRCNLEIEQRADFVKLGCPRVDLHAVVLDLPAQLCISRSVKRTGHEGNLQGGKAAAVVNKMLQKKELPKLSEGF
TRITFCHNETDVQSAIDTYKLLGLHDMLPNGCFGQKNPDNKVQLGIMKFLKKAEIPAKTCSSANTIKDSPVSQATREKSYSCNKKEESACTMSCIVDKESEKGESPGVRS
LGDNISRSDPPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFLDKLGNARLVLVDLSHGSKILSLVKAKAVKKNIDSNKFFTFVGDITKLNSEGGLCCNVIANAANWRLKP
GGGGVNAAIFSAAGPGLEVATKQQVNSLRPGNVVTVQLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLNNDYDEGCKLLRATYSSLFQGFISIIEDQFKSVKGIHKKHS
EDSSRSTFPSCDHKFKREDVQNPERSKKWKGSQDSAAALNQNNNKIVHKMSKHWGSWAQALYNTAMHPERHCDNVLEISDDVIVLNDIYPKAHKHLLVVARYEGLDQLAD
IRREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSIDVMEEVSSHGKASIKDDERLMSMELRCNRCRSA
HPNLPKLKAHISKCRAPFPSTLLERDRLVTDQKMVNAVELMHCIKVLLRIVRAAVCCRDSRAPFQQLFACIILMSKYLNEAHAVLFGSCSCINGLHKLFLATNLQQEICS
QNGEFLPRDLKPASSISTDLCRPTLAFPVMDRIDQSCVVPVFDHVVRPVVDLDFNGVTTIVYQEYDVKPRVPCLSNHSRLITEHLLNVLEQNIDSQRLFWLRFQRMAFRD
EDPVIVFIGSGVVLHPLSKAIVIENRPGEQIRIGGLNDLGAEGSADSAPVSSNEVRMVLGHEPFGRRHHCQGVATSISKLYKGIARLGNSHLRPTNKHWLHSLVQEMCRR
LHRSLQPRWIAQSVATREIHSIISKLSSSIKWHDKFLDRTYLGDSENGTGSRLKLLAEVGEADGVVENSVGSIIVGVGASDHTNEWQFSLYAPAIAFRTLSPPTVNVTTQ
APTPRDLAYPSAAYPALSSLQQPMLLRPGSAMRWSRSVRLKSPGTVKTSVTPIWTRRRARWRPSVASDELTTAEGTESWMVQTEPFAGQQTTLLAGLQVSSDPIFVSMD