; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G199690 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G199690
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptiontranscription factor bHLH140
Genome locationCiama_Chr10:35039297..35042702
RNA-Seq ExpressionCaUC10G199690
SyntenyCaUC10G199690
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:1990165 - single-strand break-containing DNA binding (molecular function)
GO:0047627 - adenylylsulfatase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0033699 - DNA 5'-adenosine monophosphate hydrolase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0003725 - double-stranded RNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
InterPro domainsIPR043472 - Macro domain-like
IPR036265 - HIT-like superfamily
IPR032566 - Aprataxin, C2HE/C2H2/C2HC zinc finger
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR026963 - Aprataxin-like
IPR019808 - Histidine triad, conserved site
IPR011146 - HIT-like domain
IPR002589 - Macro domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442389.1 PREDICTED: transcription factor bHLH140 [Cucumis melo]0.0e+0089.56Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLHN+LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPTPQ TQE  +SCDKKEES+C MS+NV +E+EKGE PG+RSLE  ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSL PGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK I+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN E SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA IMDDE L+SME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_011651853.1 transcription factor bHLH140 [Cucumis sativus]0.0e+0089.83Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLH +LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPTPQ TQE  +SC KKEES+CTMS+NV +E+EKGE PGIRSL+D ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSLQPGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK IHE LGS P E QKHSED 
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN ERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EVSSHGKA+IMDDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_022145665.1 transcription factor bHLH140 [Momordica charantia]0.0e+0085.35Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL++GKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAID YKSL LH+ LPHGCFGQ   + KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP  QT++ENS SCDKKEE ACT+  NVD E+EKGE PG+RSL D+IS+S+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNA+LVLVDL+ GSK+LSLVKAKAAKK I+ +KF TFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLR AYSSLFQ FISIVE++FKSVK I +HLGS PSE +KHSEDS
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE
                 HKFKRE++QNPERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALYNTAMHPE+H DTVLE SDDV +LNDIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE

Query:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD
        GLDQL DV REHLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VMDEV SHGKASI D
Subjt:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD

Query:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN
        DESLMSMELRCNRCRSAHPNL KLKAH  KC+APFPSTLLEG RLVIAPSN
Subjt:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN

XP_038906052.1 transcription factor bHLH140 isoform X1 [Benincasa hispida]0.0e+0094.11Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENS AKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GK+VFVDRCNLEIEQRADFVKL 
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GP+VDV AVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAIDMYKSLDLH++LP GCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAEKPSET SSANTVKDSP PQTTQE SDSCDKKEESACT+S+NVDIE++KGE PG+RSLEDNISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILSLVKAKAAKKNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSLQPGNAVAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLRNAYSSLFQAFIS+VEDKFKSVK IH  LG  PSEP+KHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQNPERSKKWKGSQD TEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVV+LNDIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVAVMDEVSSHGKAS+MDDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH FKCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

XP_038906054.1 transcription factor bHLH140 isoform X2 [Benincasa hispida]0.0e+0094.08Show/hide
Query:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR
        MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GK+VFVDRCNLEIEQRADFVKL GP+VDV AVVLDLPAQLCISR
Subjt:  MVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQLCISR

Query:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSETRSSAN
        SVKRTGHEGNLSGGKAAAVVNKMLQKKELP+LNEGF RITFCHNETDVQSAIDMYKSLDLH++LP GCFGQKNP+KKVQLGIMKFLKKAEKPSET SSAN
Subjt:  SVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSETRSSAN

Query:  TVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAKLVLVD
        TVKDSP PQTTQE SDSCDKKEESACT+S+NVDIE++KGE PG+RSLEDNISQS+ PTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNA+LVLVD
Subjt:  TVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAKLVLVD

Query:  LSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLPSTSPL
        LS GSKILSLVKAKAAKKNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQQANSLQPGNAVAV LPSTSPL
Subjt:  LSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLPSTSPL

Query:  FNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGS
        FNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLRNAYSSLFQAFIS+VEDKFKSVK IH  LG  PSEP+KHSE+SHHKFKRENLQNPERSKKWKGS
Subjt:  FNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGS

Query:  QDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDK
        QD TEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVV+LNDIYPKARKHLLVVARHEGLDQL DVC EHLPLLRTMHAVGLKWIDK
Subjt:  QDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDK

Query:  FFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSMELRCNRCRSAHPNLPKLKAHSF
        FFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN+KHWNSFNTDFFRDSVAVMDEVSSHGKAS+MDDESLMSMELRCNRCRSAHPNLPKLKAH F
Subjt:  FFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSMELRCNRCRSAHPNLPKLKAHSF

Query:  KCQAPFPSTLLEGGRLVIAPSNAPLS
        KCQAPFPSTLLEGGRLVIAPSNAPLS
Subjt:  KCQAPFPSTLLEGGRLVIAPSNAPLS

TrEMBL top hitse value%identityAlignment
A0A0A0L9U1 Uncharacterized protein0.0e+0089.83Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLK+A SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLH +LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGI KFLKKAEKPS+T SSANT K+SPTPQ TQE  +SC KKEES+CTMS+NV +E+EKGE PGIRSL+D ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAG GLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSLQPGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK IHE LGS P E QKHSED 
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN ERSKKWKGSQ+ TE LNQNNN TVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI+KFF ED  LVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V++EVSSHGKA+IMDDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLEGGRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A1S3B6C4 transcription factor bHLH1400.0e+0089.56Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLHN+LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPTPQ TQE  +SCDKKEES+C MS+NV +E+EKGE PG+RSLE  ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSL PGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK I+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN E SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA IMDDE L+SME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A5A7TLV2 Transcription factor bHLH1400.0e+0089.56Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGT+AQCLKSA SALN+GKSVFVDRCNLEIEQRADFVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNE+DV SAIDMYKSLDLHN+LPHGCFGQKNP+KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKAE PS+T SSAN  K+SPTPQ TQE  +SCDKKEES+C MS+NV +E+EKGE PG+RSLE  ISQS+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVEKVEEFMDKLGNA+LVLVDLS GSKILS+VKAKA +KNISSTKF TFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSL PGNAVAV LPSTSPL NREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLL NAYSSLFQAFISIV+DK+KSVK I+E LGS P EPQKHSE+S
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKRENLQN E SKKWKGS + TE LNQNNNKTVPK SKHWGSWAQALY+TAMHPE+H+++VLETSDDVV+L DIYPKARKHLLVVARHEGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME
        C EHLPLLRTMHA+GLKWI KFFHEDA LVFRLGYHSAPSMRQLHLHVISQDFDS+HLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA IMDDE L+SME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS
        LRCNRCRSAHPNLPKLKAH  KCQAPFPSTLLE GRLV+ PSNAPLS
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS

A0A6J1CV45 transcription factor bHLH1400.0e+0085.35Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMD D+NS AKGKEGQ KLIMVILVGAPGSGKSTFCELVMGSSSRPW RICQDTIGNGKSGTKAQCLKSA+SAL++GKS+FVDRCNLEIEQRADFVK+G
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
        G  VDVHAVVLDLPAQLCISRSVKRTGHEGNL GGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAID YKSL LH+ LPHGCFGQ   + KVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIMKFLKKA+ P++T SSAN +KDSP  QT++ENS SCDKKEE ACT+  NVD E+EKGE PG+RSL D+IS+S+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNA+LVLVDL+ GSK+LSLVKAKAAKK I+ +KF TFVGDITKLNSEGGLRCNVIANAANWRLKPG GGVNAAIFSAAGPGLEVATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QANSL+PGN V   LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLR AYSSLFQ FISIVE++FKSVK I +HLGS PSE +KHSEDS
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE
                 HKFKRE++QNPERSKKWKGSQD  EA NQNNN  V KMSKHWGSWAQALYNTAMHPE+H DTVLE SDDV +LNDIY KA KHLLVVAR+E
Subjt:  --------HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHE

Query:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD
        GLDQL DV REHLPLLRTMH VGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV VMDEV SHGKASI D
Subjt:  GLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMD

Query:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN
        DESLMSMELRCNRCRSAHPNL KLKAH  KC+APFPSTLLEG RLVIAPSN
Subjt:  DESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSN

A0A6J1GCV0 transcription factor bHLH1400.0e+0084.01Show/hide
Query:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG
        MDMD DENS AKG E + KLIMVILVGAPGSGKSTFCELVM SSSRPWVRICQDTIGNGKSGTKAQCLKSAASALN+GKSVFVDRCNLEIEQR++FVKLG
Subjt:  MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLG

Query:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ
           VDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELP LNEGF RITFCH+ETDVQSAID YKSL LH+ LP GCFGQKN +KKVQ
Subjt:  GPQVDVHAVVLDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQ

Query:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI
        LGIM+FLKKAE P++T S+ANT KD P+ QTTQE       K+ES+CTM  NV+ E+EKGE PG+ SLE+NIS S+ PTLAFPSISTSDFKFSHEKAAEI
Subjt:  LGIMKFLKKAEKPSETRSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEI

Query:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ
        IVE VEEFMDKLGNA+LV+VDLS GSKILSLVKAKAAKKNI STKF TFVGDITKL S+GGL CNVIANAANWRLKPGGGGVNAAIFSAAGP LE ATKQ
Subjt:  IVEKVEEFMDKLGNAKLVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQ

Query:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS
        QA SL+PGN VAV LPSTSPLFNREGVTHVIHVLGPNMNPQRPNYL+NDYDEGCKLLR+AYSSLFQAFISIV+D+FKS K I E LGS PSE +KHSED+
Subjt:  QANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDS

Query:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV
        HHKFKR  LQ PERSKKWKG+Q+  EALNQNNNK   KMSKHWGSWAQALYNTAM+PE+H++ VLETSDDVV+LNDIYPKARKHLL+VAR+EGLDQL DV
Subjt:  HHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDV

Query:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME
         +EHLPLL+TMHAVG+KWIDKF H+DASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSV V+DEVSSHGKA I DDESLMSME
Subjt:  CREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSME

Query:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNA
         RCNRCRSAHPNLPKLK H  KCQ+PFPSTLLEGGRLV A  N+
Subjt:  LRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNA

SwissProt top hitse value%identityAlignment
P61797 Aprataxin2.0e-3438.1Show/hide
Query:  VKEIHEHLGSVPSEPQKHSEDSHHKFKRE-NLQNPERSKKWK---------GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETS
        V E++ ++     E +    ++H K KR  N  + ER    +         GS     ++  N  K  P   +  G W+Q L  +   P+      +   
Subjt:  VKEIHEHLGSVPSEPQKHSEDSHHKFKRE-NLQNPERSKKWK---------GSQDPTEALNQNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETS

Query:  DDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFN
        + VV++ D YPKAR H LV+     +  L  V  EHL LL+ MH VG K I   F   + L FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFN
Subjt:  DDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFN

Query:  TDFFRDSVAVMDEVSSHGKASIMDD-ESLMSMELRCNRCRSAHPNLPKLKAH
        T++F +S AV++ V   G+ S+ D    L+ + LRC+ C+   P++P+LK H
Subjt:  TDFFRDSVAVMDEVSSHGKASIMDD-ESLMSMELRCNRCRSAHPNLPKLKAH

P61798 Aprataxin (Fragment)7.0e-3535.71Show/hide
Query:  MNPQRPNYLSNDYDEGCKL----LRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWK--GSQDPTEALNQN
        +NP   + +    DE  K+    + +  + L+   +   E+  +SV E  E +       ++  EDS      EN+  P+++KK +   +Q  +  L  +
Subjt:  MNPQRPNYLSNDYDEGCKL----LRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWK--GSQDPTEALNQN

Query:  NNKTVP-----KMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHED
         +   P        +H G W+Q L ++   P+      +   +  V++ D YPKAR H LV+   + +  L  V REHL LL  MHAVG K I +   ++
Subjt:  NNKTVP-----KMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHED

Query:  ASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDES-LMSMELRCNRCRSAHPNLPKLKAH
         SL FRLGYH+ PSM QLHLHVISQDFDS  LK KKHWNSF T++F +S  V++ V S GK ++ D  S L+ + LRC+ C+     +P+LK H
Subjt:  ASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDES-LMSMELRCNRCRSAHPNLPKLKAH

Q7YRZ2 Aprataxin4.6e-3438.34Show/hide
Query:  VKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGSQDPTEALNQNNN----KTVPKMSK-------HWGSWAQALYNTAMHPEQHSDTVLET
        V E++ ++     E +    +SH K KR    +P   +      +P+  L   +N       PK  K         G W+Q L  +   P+      +  
Subjt:  VKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGSQDPTEALNQNNN----KTVPKMSK-------HWGSWAQALYNTAMHPEQHSDTVLET

Query:  SDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSF
         + VV++ D YPKAR H LV+     +  L  V REHL LLR MHAVG K I   F   +   FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSF
Subjt:  SDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSF

Query:  NTDFFRDSVAVMDEVSSHGKASIMDD-ESLMSMELRCNRCRSAHPNLPKLKAH
        NT++F +S AV++ V   G+ ++ D    L+ + LRC+ C+   P++P+LK H
Subjt:  NTDFFRDSVAVMDEVSSHGKASIMDD-ESLMSMELRCNRCRSAHPNLPKLKAH

Q8K4H4 Aprataxin2.0e-3445.03Show/hide
Query:  PKMSKH-------WGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASL
        PK  KH        G W+Q L  +   P+      +   D VV++ D YPKAR H LV+     +  L  V  EHL LL+ MHAVG K I   F   + L
Subjt:  PKMSKH-------WGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASL

Query:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDE-SLMSMELRCNRCRSAHPNLPKLKAH
         FRLGYH+ PSM  +HLHVISQDFDS  LKNKKHWNSFNT++F +S AV+  V   G+ ++ D    L+ + LRC+ C+   P++P+LK H
Subjt:  VFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDE-SLMSMELRCNRCRSAHPNLPKLKAH

Q9M041 Transcription factor bHLH1402.2e-23859.75Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L  GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++MY  L   + LP GCFG+K  + K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK
                 S  P ++   + +  +K +    M+ NV +   K     I            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG A+
Subjt:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK

Query:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP
        LVLVDLSRGSKILSLVKAKA++KNI S KF TFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV VPLP
Subjt:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP

Query:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSK
        ST PL N EG+THVIHVLGPNMNP RP+ L+NDY +GCK LR AY+SLF+ F+S+V+D+ K  K             Q    DS    K ++    ER+K
Subjt:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSK

Query:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT
        K+KGSQD     N      ++   +  KMSK W +WA AL++ AMHPE+H + VLE  D++V++ND YPKARKH+LV+AR E LD L DV +E+L LL+ 
Subjt:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT

Query:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSMELRCNRCRSAH
        MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA++   E L+  ELRCNRCRSAH
Subjt:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSMELRCNRCRSAH

Query:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV
        PN+PKLK+H   C + FP  LL+  RLV
Subjt:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV

Arabidopsis top hitse value%identityAlignment
AT5G01310.1 APRATAXIN-like1.6e-23959.75Show/hide
Query:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ
        + K I+V+L+G PGSGKSTFC+  M SS RPW RICQD + NGK+GTKAQCLK A  +L  GKSVF+DRCNL+ EQR++F+KLGGP+ +VHAVVL+LPAQ
Subjt:  QGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVVLDLPAQ

Query:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET
        +CISRSVKRTGHEGNL GG+AAAVVNKMLQ KELPK+NEGF+RI FC+++ DV +A++MY  L   + LP GCFG+K  + K Q GIMKF KK       
Subjt:  LCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSET

Query:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK
                 S  P ++   + +  +K +    M+ NV +   K     I            PTLAFPSIST+DF+F  EKA++IIVEK EEF+ KLG A+
Subjt:  RSSANTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAK

Query:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP
        LVLVDLSRGSKILSLVKAKA++KNI S KF TFVGDITKL SEGGL CNVIANA NWRLKPGGGGVNAAIF AAGP LE AT+ +AN+L PG AV VPLP
Subjt:  LVLVDLSRGSKILSLVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLP

Query:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSK
        ST PL N EG+THVIHVLGPNMNP RP+ L+NDY +GCK LR AY+SLF+ F+S+V+D+ K  K             Q    DS    K ++    ER+K
Subjt:  STSPLFNREGVTHVIHVLGPNMNPQRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSK

Query:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT
        K+KGSQD     N      ++   +  KMSK W +WA AL++ AMHPE+H + VLE  D++V++ND YPKARKH+LV+AR E LD L DV +E+L LL+ 
Subjt:  KWKGSQDPTEALN------QNNNKTVPKMSKHWGSWAQALYNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRT

Query:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSMELRCNRCRSAH
        MH VGLKW+D+F +EDASL+FRLGYHS PSMRQLHLHVISQDF+S  LKNKKHWNSF T FFRDSV V++EV+S GKA++   E L+  ELRCNRCRSAH
Subjt:  MHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKNKKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSMELRCNRCRSAH

Query:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV
        PN+PKLK+H   C + FP  LL+  RLV
Subjt:  PNLPKLKAHSFKCQAPFPSTLLEGGRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATGGATACCGACGAAAATTCAAATGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTG
CGAGCTTGTAATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGAAAGTCTGGAACCAAAGCCCAGTGCTTAAAGAGTGCAGCCAGTG
CACTGAATAATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGCGCTGATTTTGTGAAGCTCGGTGGCCCTCAAGTAGATGTACATGCCGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTGAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGACCGATGTTCAATCCGCTATAGATATGTACAAATCGCTTGATTTACATAATT
TGCTTCCACATGGATGTTTTGGACAGAAGAATCCAAACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCTGAGACGCGTTCTAGTGCC
AATACTGTTAAGGATTCTCCAACTCCACAAACTACCCAGGAAAATAGCGACTCTTGTGATAAAAAGGAAGAGTCTGCCTGCACAATGTCGAAAAATGTAGATATAGAGGC
AGAGAAAGGTGAATGTCCAGGCATTAGATCCTTAGAAGACAATATTTCTCAAAGCAATTCTCCAACTCTTGCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCC
ATGAAAAAGCTGCTGAAATTATTGTTGAGAAGGTTGAGGAATTCATGGATAAGCTTGGAAATGCCAAACTTGTACTGGTAGACTTGAGTCGTGGATCAAAGATCTTGTCT
TTGGTTAAAGCTAAAGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTCTTACATTTGTAGGTGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTGAT
AGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGGCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAACT
CCCTTCAACCTGGCAATGCCGTGGCCGTTCCGTTGCCTTCAACTTCTCCTTTGTTCAATAGGGAAGGAGTAACCCATGTTATACATGTTCTTGGACCCAACATGAATCCA
CAGAGGCCAAATTATCTCAGCAATGACTATGATGAAGGCTGCAAACTTCTTCGCAATGCTTACTCTTCCTTATTTCAAGCCTTTATTTCAATTGTAGAAGACAAATTTAA
GTCGGTTAAGGAAATTCACGAACACCTCGGCTCAGTACCTTCAGAACCACAAAAGCACTCTGAGGACAGCCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCCTGAAA
GAAGCAAAAAATGGAAAGGATCTCAAGACCCAACTGAAGCATTAAACCAAAACAACAATAAGACTGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTT
TACAACACTGCAATGCATCCCGAGCAACATAGCGATACTGTACTGGAGACATCAGATGATGTTGTAATACTGAATGATATTTATCCAAAGGCACGCAAGCATCTTCTAGT
AGTGGCTCGCCATGAAGGCCTCGATCAACTAACTGATGTATGTAGAGAACACCTTCCATTGTTGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAGTTCTTTC
ATGAAGATGCATCATTAGTTTTTCGCCTTGGATACCACTCGGCTCCATCAATGAGGCAACTGCACCTACATGTTATAAGCCAGGATTTCGACTCCAGTCATCTGAAAAAC
AAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTTGCCGTTATGGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACGATGAGAGCTTGAT
GTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTGCCCAAATTGAAAGCGCATAGTTTCAAATGCCAAGCGCCTTTCCCTTCCACACTACTTGAGG
GCGGTCGTTTAGTGATTGCGCCAAGTAATGCTCCTCTTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACATGGATACCGACGAAAATTCAAATGCCAAAGGAAAGGAAGGGCAAGGGAAGCTCATAATGGTAATATTAGTGGGCGCACCTGGAAGCGGCAAATCCACCTTCTG
CGAGCTTGTAATGGGTTCCTCTTCTCGCCCTTGGGTTCGAATCTGTCAGGATACCATTGGAAATGGAAAGTCTGGAACCAAAGCCCAGTGCTTAAAGAGTGCAGCCAGTG
CACTGAATAATGGAAAGAGTGTATTTGTGGACAGGTGCAATCTTGAAATAGAGCAGCGCGCTGATTTTGTGAAGCTCGGTGGCCCTCAAGTAGATGTACATGCCGTTGTA
TTAGATCTTCCTGCACAGCTCTGTATCTCTCGTTCTGTGAAGCGGACTGGTCATGAAGGGAACTTATCAGGTGGAAAAGCTGCTGCTGTTGTGAATAAAATGCTGCAAAA
GAAAGAATTGCCCAAACTAAATGAAGGGTTCACTCGCATAACCTTTTGCCACAATGAGACCGATGTTCAATCCGCTATAGATATGTACAAATCGCTTGATTTACATAATT
TGCTTCCACATGGATGTTTTGGACAGAAGAATCCAAACAAGAAAGTACAACTTGGCATAATGAAGTTCTTGAAGAAAGCAGAAAAACCTTCTGAGACGCGTTCTAGTGCC
AATACTGTTAAGGATTCTCCAACTCCACAAACTACCCAGGAAAATAGCGACTCTTGTGATAAAAAGGAAGAGTCTGCCTGCACAATGTCGAAAAATGTAGATATAGAGGC
AGAGAAAGGTGAATGTCCAGGCATTAGATCCTTAGAAGACAATATTTCTCAAAGCAATTCTCCAACTCTTGCATTTCCATCTATTTCAACTTCAGATTTCAAGTTTAGCC
ATGAAAAAGCTGCTGAAATTATTGTTGAGAAGGTTGAGGAATTCATGGATAAGCTTGGAAATGCCAAACTTGTACTGGTAGACTTGAGTCGTGGATCAAAGATCTTGTCT
TTGGTTAAAGCTAAAGCAGCTAAGAAAAATATTAGTTCCACCAAGTTTCTTACATTTGTAGGTGATATAACTAAACTCAATTCAGAAGGTGGATTGCGCTGCAATGTGAT
AGCCAATGCTGCAAACTGGCGACTGAAACCAGGAGGGGGTGGTGTCAATGCTGCAATTTTTAGTGCTGCAGGGCCTGGTCTGGAAGTGGCAACTAAACAACAAGCAAACT
CCCTTCAACCTGGCAATGCCGTGGCCGTTCCGTTGCCTTCAACTTCTCCTTTGTTCAATAGGGAAGGAGTAACCCATGTTATACATGTTCTTGGACCCAACATGAATCCA
CAGAGGCCAAATTATCTCAGCAATGACTATGATGAAGGCTGCAAACTTCTTCGCAATGCTTACTCTTCCTTATTTCAAGCCTTTATTTCAATTGTAGAAGACAAATTTAA
GTCGGTTAAGGAAATTCACGAACACCTCGGCTCAGTACCTTCAGAACCACAAAAGCACTCTGAGGACAGCCATCACAAGTTTAAGAGAGAGAATTTGCAAAATCCTGAAA
GAAGCAAAAAATGGAAAGGATCTCAAGACCCAACTGAAGCATTAAACCAAAACAACAATAAGACTGTCCCCAAAATGAGTAAGCACTGGGGTTCATGGGCACAAGCACTT
TACAACACTGCAATGCATCCCGAGCAACATAGCGATACTGTACTGGAGACATCAGATGATGTTGTAATACTGAATGATATTTATCCAAAGGCACGCAAGCATCTTCTAGT
AGTGGCTCGCCATGAAGGCCTCGATCAACTAACTGATGTATGTAGAGAACACCTTCCATTGTTGAGGACAATGCACGCTGTGGGTTTGAAGTGGATCGATAAGTTCTTTC
ATGAAGATGCATCATTAGTTTTTCGCCTTGGATACCACTCGGCTCCATCAATGAGGCAACTGCACCTACATGTTATAAGCCAGGATTTCGACTCCAGTCATCTGAAAAAC
AAGAAGCATTGGAATTCTTTCAACACCGATTTCTTCAGAGACTCGGTTGCCGTTATGGACGAAGTCAGTAGCCATGGAAAGGCGAGCATCATGGACGATGAGAGCTTGAT
GTCTATGGAGTTGCGTTGCAACAGATGCAGAAGTGCTCATCCCAACTTGCCCAAATTGAAAGCGCATAGTTTCAAATGCCAAGCGCCTTTCCCTTCCACACTACTTGAGG
GCGGTCGTTTAGTGATTGCGCCAAGTAATGCTCCTCTTTCTTAG
Protein sequenceShow/hide protein sequence
MDMDTDENSNAKGKEGQGKLIMVILVGAPGSGKSTFCELVMGSSSRPWVRICQDTIGNGKSGTKAQCLKSAASALNNGKSVFVDRCNLEIEQRADFVKLGGPQVDVHAVV
LDLPAQLCISRSVKRTGHEGNLSGGKAAAVVNKMLQKKELPKLNEGFTRITFCHNETDVQSAIDMYKSLDLHNLLPHGCFGQKNPNKKVQLGIMKFLKKAEKPSETRSSA
NTVKDSPTPQTTQENSDSCDKKEESACTMSKNVDIEAEKGECPGIRSLEDNISQSNSPTLAFPSISTSDFKFSHEKAAEIIVEKVEEFMDKLGNAKLVLVDLSRGSKILS
LVKAKAAKKNISSTKFLTFVGDITKLNSEGGLRCNVIANAANWRLKPGGGGVNAAIFSAAGPGLEVATKQQANSLQPGNAVAVPLPSTSPLFNREGVTHVIHVLGPNMNP
QRPNYLSNDYDEGCKLLRNAYSSLFQAFISIVEDKFKSVKEIHEHLGSVPSEPQKHSEDSHHKFKRENLQNPERSKKWKGSQDPTEALNQNNNKTVPKMSKHWGSWAQAL
YNTAMHPEQHSDTVLETSDDVVILNDIYPKARKHLLVVARHEGLDQLTDVCREHLPLLRTMHAVGLKWIDKFFHEDASLVFRLGYHSAPSMRQLHLHVISQDFDSSHLKN
KKHWNSFNTDFFRDSVAVMDEVSSHGKASIMDDESLMSMELRCNRCRSAHPNLPKLKAHSFKCQAPFPSTLLEGGRLVIAPSNAPLS