; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G21840 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G21840
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptiontranscription factor bHLH140
Genome locationClcChr10:34987743..34991535
RNA-Seq ExpressionClc10G21840
SyntenyClc10G21840
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.0e+0089.29Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLHN+LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPTPQ TQE  +SCDKKEES+C MS++V +E+EKGE PG+RSLE  ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSL PGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK I+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN E SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV ++DEVSSHGKA IMDDE L+SME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.0e+0089.56Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLH +LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPTPQ TQE  +SC KKEES+CTMS++V +E+EKGE PGIRSL+D ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSLQPGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK IHE LGS P E QKHSED 
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN ERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV +++EVSSHGKA+IMDDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0084.95Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL++GKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAID YKSL LH+ LPHGCFGQ   + KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP  QT++ENS SCDKKEE ACT+  +VD E+EKGE PG+RSL D IS+S+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNA+LVLVDL+ GSK+LSLVKAKAAKK I+ +KF TFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLR AYSSLFQ FISIVE++FKSVK I + LGS PSE +KHSEDS
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE
                 HKFKRE++QNPERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALYNTAMHPE+H DTVLE SDDV +LNDIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE

Query:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMD
        GLDQL DV REHLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV +MDEV SHGKASI D
Subjt:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMD

Query:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN
        DESLMSMELRCNRCRSAHPNL KLKAH  KC+APFPSTLLEG RLVIAPSN
Subjt:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0093.84Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GK+VFVDRCNLEIEQRADFVKL 
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GP+VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAIDMYKSLDLH++LP GCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAEKPSET SSANTVKDSP PQTTQE SDSCDKKEESACT+S++VDIE++KGE PG+RSLED+ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILSLVKAKAAKKNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSLQPGNAVAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLRNAYSSLFQAFIS+VEDKFKSVK IH RLG  PSEP+KHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQNPERSKKWKGSQD TEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVV+LNDIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVA+MDEVSSHGKAS+MDDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH FKCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0093.8Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GK+VFVDRCNLEIEQRADFVKL GP+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSETRSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAIDMYKSLDLH++LP GCFGQKNP+KKVQLGIMKFLKKAEKPSET SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSETRSSAN

Query:  TVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAKLVLVD
        TVKDSP PQTTQE SDSCDKKEESACT+S++VDIE++KGE PG+RSLED+ISQS+ PTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNA+LVLVD
Subjt:  TVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAKLVLVD

Query:  LSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLPSTSPL
        LS GSKILSLVKAKAAKKNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSLQPGNAVAV LPSTSPL
Subjt:  LSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLPSTSPL

Query:  FNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGS
        FNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLRNAYSSLFQAFIS+VEDKFKSVK IH RLG  PSEP+KHSE+SHHKFKRENLQNPERSKKWKGS
Subjt:  FNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGS

Query:  QDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDK
        QD TEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVV+LNDIYPKARKHLLVVARHEGLDQL DVC EHLPLLRTMHAVGLKWIDK
Subjt:  QDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDK

Query:  FFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSMELRCNRCRSAHPNLPKLKAHSF
        FFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVA+MDEVSSHGKAS+MDDESLMSMELRCNRCRSAHPNLPKLKAH F
Subjt:  FFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSMELRCNRCRSAHPNLPKLKAHSF

Query:  KCQAPFPSTLLEGGRLVIAPSNAPLS
        KCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  KCQAPFPSTLLEGGRLVIAPSNAPLS

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0089.56Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLH +LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPTPQ TQE  +SC KKEES+CTMS++V +E+EKGE PGIRSL+D ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSLQPGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK IHE LGS P E QKHSED 
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN ERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV +++EVSSHGKA+IMDDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A1S3B6C4 transcription factor bHLH1400.0e+0089.29Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLHN+LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPTPQ TQE  +SCDKKEES+C MS++V +E+EKGE PG+RSLE  ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSL PGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK I+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN E SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV ++DEVSSHGKA IMDDE L+SME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0089.29Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLHN+LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPTPQ TQE  +SCDKKEES+C MS++V +E+EKGE PG+RSLE  ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSL PGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK I+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN E SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV ++DEVSSHGKA IMDDE L+SME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A6J1CV45 transcription factor bHLH1400.0e+0084.95Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL++GKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAID YKSL LH+ LPHGCFGQ   + KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP  QT++ENS SCDKKEE ACT+  +VD E+EKGE PG+RSL D IS+S+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNA+LVLVDL+ GSK+LSLVKAKAAKK I+ +KF TFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLR AYSSLFQ FISIVE++FKSVK I + LGS PSE +KHSEDS
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE
                 HKFKRE++QNPERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALYNTAMHPE+H DTVLE SDDV +LNDIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE

Query:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMD
        GLDQL DV REHLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV +MDEV SHGKASI D
Subjt:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMD

Query:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN
        DESLMSMELRCNRCRSAHPNL KLKAH  KC+APFPSTLLEG RLVIAPSN
Subjt:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN

A0A6J1GCV0 transcription factor bHLH1400.0e+0083.74Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAID YKSL LH+ LP GCFGQKN +KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAE P++T S+ANT KD P+ QTTQE       K+ES+CTM  +V+ E+EKGE PG+ SLE++IS S+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNA+LV+VDLS GSKILSLVKAKAAKKNI STKF TFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS
        QA SL+PGN VAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLR+AYSSLFQAFISIV+D+FKS K I ERLGS PSE +KHSED+
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKR  LQ PERSKKWKG+Q+  EALNQNNNK   KMSKHWGSWAQALYNTAM+PE+H++ VLETSDDVV+LNDIYPKARKHLL+VAR+EGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME
         +EHLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV ++DEVSSHGKA I DDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNA
         RCNRCRSAHPNLPKLK H  KCQ+PFPSTLLEGGRLV A  N+
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNA

SwissProt top hitse value%identityAlignment
P61797 Aprataxin6.0e-3437.7Show/hide
Query:  VKEIHERLGSVPSEPQKHSEDSHHKFKRE-NLQNPERSKKWK---------GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETS
        V E++  +     E +    ++H K KR  N  + ER    +         GS     ++  N  K  P   +  G W+Q L  +   P+      +   
Subjt:  VKEIHERLGSVPSEPQKHSEDSHHKFKRE-NLQNPERSKKWK---------GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETS

Query:  DDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFN
        + VV++ D YPKAR H LV+     +  L  V  EHL LL+ MH VG K I   F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFN
Subjt:  DDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFN

Query:  TDFFRDSVAIMDEVSSHGKASIMDD-ESLMSMELRCNRCRSAHPNLPKLKAH
        T++F +S A+++ V   G+ S+ D    L+ + LRC+ C+   P++P+LK H
Subjt:  TDFFRDSVAIMDEVSSHGKASIMDD-ESLMSMELRCNRCRSAHPNLPKLKAH

P61798 Aprataxin (Fragment)3.2e-3535.37Show/hide
Query:  MNPQRPNYLSNDYDEGCKL----LRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWK--GSQDPTEALNQN
        +NP   + +    DE  K+    + +  + L+   +   E+  +SV E  E++       ++  EDS      EN+  P+++KK +   +Q  +  L  +
Subjt:  MNPQRPNYLSNDYDEGCKL----LRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWK--GSQDPTEALNQN

Query:  NNKTVP-----KMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHED
         +   P        +H G W+Q L ++   P+      +   +  V++ D YPKAR H LV+   + +  L  V REHL LL  MHAVG K I +   ++
Subjt:  NNKTVP-----KMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHED

Query:  ASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDES-LMSMELRCNRCRSAHPNLPKLKAH
         SL FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S  +++ V S GK ++ D  S L+ + LRC+ C+     +P+LK H
Subjt:  ASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDES-LMSMELRCNRCRSAHPNLPKLKAH

Q7TQC5 Aprataxin6.0e-3441.79Show/hide
Query:  GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWI
        GS     +++   +K      +  G W+Q L  +   P+      +   D VV++ D YPKAR H LV+     +  L  V  EHL LL+ MHAVG K I
Subjt:  GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWI

Query:  DKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDE-SLMSMELRCNRCRSAHPNLPKLKA
           F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S A++  V   G+ ++ D    L+ + LRC+ C+   P++P+LK 
Subjt:  DKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDE-SLMSMELRCNRCRSAHPNLPKLKA

Query:  H
        H
Subjt:  H

Q8K4H4 Aprataxin2.7e-3444.5Show/hide
Query:  PKMSKH-------WGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASL
        PK  KH        G W+Q L  +   P+      +   D VV++ D YPKAR H LV+     +  L  V  EHL LL+ MHAVG K I   F   + L
Subjt:  PKMSKH-------WGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASL

Query:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDE-SLMSMELRCNRCRSAHPNLPKLKAH
         FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S A++  V   G+ ++ D    L+ + LRC+ C+   P++P+LK H
Subjt:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDE-SLMSMELRCNRCRSAHPNLPKLKAH

Q9M041 Transcription factor bHLH1401.7e-23859.75Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L  GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++MY  L   + LP GCFG+K  + K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK
                 S  P ++   + +  +K +    M+ +V +   K     I            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG A+
Subjt:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK

Query:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP
        LVLVDLSRGSKILSLVKAKA++KNI S KF TFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV VPLP
Subjt:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP

Query:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSK
        ST PL N EG+THVIHVLGPNMNP RP+ L+NDY +GCK LR AY+SLF+ F+S+V+D+ K  K   +   S   E  K  EDS            ER+K
Subjt:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSK

Query:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT
        K+KGSQD     N      ++   +  KMSK W +WA AL++ AMHPE+H + VLE  D++V++ND YPKARKH+LV+AR E LD L DV +E+L LL+ 
Subjt:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT

Query:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSMELRCNRCRSAH
        MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV +++EV+S GKA++   E L+  ELRCNRCRSAH
Subjt:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSMELRCNRCRSAH

Query:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV
        PN+PKLK+H   C + FP  LL+  RLV
Subjt:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like1.2e-23959.75Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L  GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++MY  L   + LP GCFG+K  + K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK
                 S  P ++   + +  +K +    M+ +V +   K     I            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG A+
Subjt:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK

Query:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP
        LVLVDLSRGSKILSLVKAKA++KNI S KF TFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV VPLP
Subjt:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP

Query:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSK
        ST PL N EG+THVIHVLGPNMNP RP+ L+NDY +GCK LR AY+SLF+ F+S+V+D+ K  K   +   S   E  K  EDS            ER+K
Subjt:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSK

Query:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT
        K+KGSQD     N      ++   +  KMSK W +WA AL++ AMHPE+H + VLE  D++V++ND YPKARKH+LV+AR E LD L DV +E+L LL+ 
Subjt:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT

Query:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSMELRCNRCRSAH
        MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV +++EV+S GKA++   E L+  ELRCNRCRSAH
Subjt:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSMELRCNRCRSAH

Query:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV
        PN+PKLK+H   C + FP  LL+  RLV
Subjt:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATACCGACGAAAATTCAAATGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTG
CGAGCTTGTAATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGAAAGTCTGGAACCAAAGCCCAGTGCTTAAAGAGTGCAGCCAGTG
CACTGAATAATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGCGCTGATTTTGTGAAGCTCGGTGGCCCTCAAGTAGATGTACATGCCGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTGAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGACCGATGTTCAATCCGCTATAGATATGTACAAATCGCTTGATTTACATAATT
TGCTTCCACATGGATGTTTTGGACAGAAGAATCCAAACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCTGAGACGCGTTCTAGTGCC
AATACTGTTAAGGATTCTCCAACTCCACAAACTACCCAGGAAAATAGCGACTCTTGTGATAAAAAGGAAGAGTCTGCCTGCACAATGTCAAAAGATGTAGATATAGAGGC
AGAGAAAGGTGAATGTCCAGGCATTAGATCCTTAGAAGACGATATTTCTCAAAGCAATTCTCCAACTCTTGCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCC
ATGAAAAAGCTGCTGAAATTATTGTTGAGAAGGTTGAGGAATTCATGGATAAGCTTGGAAATGCCAAACTTGTACTGGTAGACTTGAGTCGTGGATCAAAGATCTTGTCT
TTGGTTAAAGCTAAAGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTCTTACATTTGTAGGTGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTTAT
AGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAACT
CCCTTCAACCTGGCAATGCCGTGGCCGTTCCGTTGCCTTCAACTTCTCCTTTGTTCAATAGGGAAGGAGTAACCCATGTTATACATGTTCTTGGACCCAACATGAATCCA
CAGAGGCCAAATTATCTCAGCAATGACTATGATGAAGGCTGCAAACTTCTTCGCAATGCTTACTCTTCCTTATTTCAAGCCTTTATTTCAATTGTAGAAGACAAATTTAA
GTCGGTTAAGGAAATTCACGAACGCCTCGGCTCAGTACCTTCAGAACCACAAAAGCACTCTGAGGACAGCCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCCTGAAA
GAAGCAAAAAATGGAAAGGATCTCAAGACCCAACTGAAGCATTAAACCAAAACAACAATAAGACTGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTT
TACAACACTGCAATGCATCCCGAGCAACATAGCGATACTGTACTGGAGACATCAGATGATGTTGTAATACTGAATGATATTTATCCAAAGGCACGCAAGCATCTTCTAGT
AGTGGCTCGCCATGAAGGCCTCGATCAACTAACTGATGTATGTAGAGAACACCTTCCATTGTTGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAGTTCTTTC
ATGAAGATGCATCATTAGTTTTTCGCCTTGGATACCACTCGGCTCCATCAATGAGGCAACTGCACCTACATGTTATAAGCCAGGATTTCGACTCCAGTCATCTGAAAAAC
AAGAAGCATTGGAATTCTTTCAACACTGATTTCTTCAGAGACTCGGTTGCCATTATGGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACGATGAGAGCTTGAT
GTCTATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTGCCCAAATTGAAAGCGCATAGTTTCAAATGCCAAGCGCCTTTCCCTTCCACACTACTTGAGG
GCGGTCGTTTAGTGATTGCGCCAAGTAATGCTCCTCTTTCTTAG
mRNA sequenceShow/hide mRNA sequence
CAAGGACGTGCTAAATGCGTAACCAAACACTCTCGTAGCAGCAAATTTGGGGAGGGAACCGACCGTGGCGTATTCTTCAAGCTTCACTCCTTGGCAGCGGGAACGTCTGA
AATGACGATTGATAGGGTCTAAAAGGAAGACCTACATCTAATCAAGAAATGGACATGGATACCGACGAAAATTCAAATGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCAT
AATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTGCGAGCTTGTAATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAA
ATGGAAAGTCTGGAACCAAAGCCCAGTGCTTAAAGAGTGCAGCCAGTGCACTGAATAATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGCGCT
GATTTTGTGAAGCTCGGTGGCCCTCAAGTAGATGTACATGCCGTTGTATTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTGAAGCGGACTGGTCATGAAGGGAA
CTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAAGAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGACCG
ATGTTCAATCCGCTATAGATATGTACAAATCGCTTGATTTACATAATTTGCTTCCACATGGATGTTTTGGACAGAAGAATCCAAACAAGAAAGTACAACTTGGCATAATG
AAGTTCTTGAAGAAAGCAGAAAAACCTTCTGAGACGCGTTCTAGTGCCAATACTGTTAAGGATTCTCCAACTCCACAAACTACCCAGGAAAATAGCGACTCTTGTGATAA
AAAGGAAGAGTCTGCCTGCACAATGTCAAAAGATGTAGATATAGAGGCAGAGAAAGGTGAATGTCCAGGCATTAGATCCTTAGAAGACGATATTTCTCAAAGCAATTCTC
CAACTCTTGCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCCATGAAAAAGCTGCTGAAATTATTGTTGAGAAGGTTGAGGAATTCATGGATAAGCTTGGAAAT
GCCAAACTTGTACTGGTAGACTTGAGTCGTGGATCAAAGATCTTGTCTTTGGTTAAAGCTAAAGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTCTTACATTTGTAGG
TGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTTATAGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTA
GTGCTGCAGGTCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAACTCCCTTCAACCTGGCAATGCCGTGGCCGTTCCGTTGCCTTCAACTTCTCCTTTGTTCAATAGG
GAAGGAGTAACCCATGTTATACATGTTCTTGGACCCAACATGAATCCACAGAGGCCAAATTATCTCAGCAATGACTATGATGAAGGCTGCAAACTTCTTCGCAATGCTTA
CTCTTCCTTATTTCAAGCCTTTATTTCAATTGTAGAAGACAAATTTAAGTCGGTTAAGGAAATTCACGAACGCCTCGGCTCAGTACCTTCAGAACCACAAAAGCACTCTG
AGGACAGCCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCCTGAAAGAAGCAAAAAATGGAAAGGATCTCAAGACCCAACTGAAGCATTAAACCAAAACAACAATAAG
ACTGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTTTACAACACTGCAATGCATCCCGAGCAACATAGCGATACTGTACTGGAGACATCAGATGATGT
TGTAATACTGAATGATATTTATCCAAAGGCACGCAAGCATCTTCTAGTAGTGGCTCGCCATGAAGGCCTCGATCAACTAACTGATGTATGTAGAGAACACCTTCCATTGT
TGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAGTTCTTTCATGAAGATGCATCATTAGTTTTTCGCCTTGGATACCACTCGGCTCCATCAATGAGGCAACTG
CACCTACATGTTATAAGCCAGGATTTCGACTCCAGTCATCTGAAAAACAAGAAGCATTGGAATTCTTTCAACACTGATTTCTTCAGAGACTCGGTTGCCATTATGGACGA
AGTCAGTAGCCATGGAAAGGCGAGCATCATGGACGATGAGAGCTTGATGTCTATGGAGTTACGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTGCCCAAATTGAAAG
CGCATAGTTTCAAATGCCAAGCGCCTTTCCCTTCCACACTACTTGAGGGCGGTCGTTTAGTGATTGCGCCAAGTAATGCTCCTCTTTCTTAGTTCATAACGTCTTTCACA
CTTTCTTTTGTAAATATTTTGAGCTCAATATCTTTGTAGTTTTGATGGCCTAGGGAAGCATTAATTGAATGGTTATATCACGTTTTGGTCTCTCTACCAGACTGACTACC
TCTGTACTTTTCCGAGAATTTTGAAGATTCGAGTTCCATCTAATTCGCACGGAATCTTCAAACTATATTTTTGTTCAATTCTAAATTTTGATTAATTTTTT
Protein sequenceShow/hide protein sequence
MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSETRSSA
NTVKDSPTPQTTQENSDSCDKKEESACTMSKDVDIEAEKGECPGIRSLEDDISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAKLVLVDLSRGSKILS
LVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNP
QRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHERLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQAL
YNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN
KKHWNSFNTDFFRDSVAIMDEVSSHGKASIMDDESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS