; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G07110 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G07110
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptiontranscription factor bHLH92
Genome locationClcChr06:7497816..7499099
RNA-Seq ExpressionClc06G07110
SyntenyClc06G07110
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR044658 - Transcription factor bHLH92/bHLH041-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026190.1 Transcription factor bHLH92, partial [Cucurbita argyrosperma subsp. argyrosperma]6.8e-7663.18Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM
        MD  F  EFWH D +WLD PI VP   P  QISAFVPY P Q   G  Q+NNP T T  T+  S NV+KR++EYWRK W EKK+ + +G+LERE+ HRHM
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM

Query:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML
        LNERMRREK ++SY  LHSMLP  TK         NDKNSIVQ AAR I+E+KA E +LK+RN+ELEMA++ KK+++EK TTP I VA++NPS GINSML
Subjt:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML

Query:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQ
         VLN+LKTVGVN+KAIHATF++S+FS  +AID+HM AAEVER LQ+T++EAERKFQRQ
Subjt:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQ

XP_008465193.1 PREDICTED: transcription factor bHLH92 [Cucumis melo]8.3e-9874.3Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATGLGQENN---PTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETS-PLGEL
        MD TF  EFWHND FWLDAPISVP +      PA QISAF+PY  +     LGQ+NN    TTTT  TSY+SRNVNKRM+EYW KHWHEKKE     G+L
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATGLGQENN---PTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETS-PLGEL

Query:  EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRN--LELEMAIARKKKEKEKETTPIISVAL
        EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTK         NDKNSIVQSAAR IQEMK LE+ LKRRN  LE+E+AIA KKKEKEK    II+VAL
Subjt:  EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRN--LELEMAIARKKKEKEKETTPIISVAL

Query:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNF
         N SCGINSML VLNVLKTVGVNSKAIHATF NSQFSAQLAIDTHMGAAEVERALQVTLNEAERKF+RQ  EGSK+IK+++F F
Subjt:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNF

XP_011649165.2 transcription factor bHLH92 [Cucumis sativus]2.9e-9571.97Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATG-LGQENNPTTTT-----------VTTSYNSRNVNKRMVEYWRKHWHEKK
        MD TF  EFWHND FWLDAPISVP +      PA+QISAF+PYD S   TG LGQ+NN  TTT            +TSY+SRNVNKRM+EYWRKHWHEKK
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATG-LGQENNPTTTT-----------VTTSYNSRNVNKRMVEYWRKHWHEKK

Query:  ETS-PLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETT
        E     G+LEREKCHRHMLNERMRREKQKQ YLALHSMLP+NTK         NDKNSIVQSAAR IQEMK LE+ LKRRN ELEM IA   K+KEK+  
Subjt:  ETS-PLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETT

Query:  PIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHF
         II+VAL+N SCGINSML VLNVLKTVGVNS AIHATF NSQFSAQLAIDTHM  AEVERALQ+TLNEAERKFQRQ  EGSK+IKES+F
Subjt:  PIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHF

XP_023000539.1 transcription factor bHLH92-like [Cucurbita maxima]3.3e-7862.69Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM
        MD  F  EFWH D +WLD PISV    P  QISAFVPY P Q A G GQ+NNP   T      S N++KR+++YWRK W EKK+ + +G+LEREK H+HM
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM

Query:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML
        LNERMRREK ++SY  LHSMLP  TK         NDKNSIVQ AAR I+E+KA E +LK+RN+ELEMA++ KK+++EK TTP I VA++NPS GINSML
Subjt:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML

Query:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKE
         VLN+LKTVGVN+KAIHATF++SQFS  +AI++HM AAEVERALQ+TL+EAERKFQRQC EGS  +K+
Subjt:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKE

XP_038875776.1 transcription factor bHLH92 [Benincasa hispida]9.1e-12179.87Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVP----AAGPARQISAFVPYDPSQVATGLGQENNPTT-------TTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLG
        MD +F+ E+WHNDFFWLDAPISVP     + PARQISAFVPY  ++ ATGLGQENNPTT       TT ++SYNSRNVNKRMVEYWRKHWHEKKE + LG
Subjt:  MDHTFTAEFWHNDFFWLDAPISVP----AAGPARQISAFVPYDPSQVATGLGQENNPTT-------TTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLG

Query:  ELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVAL
        + EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTK         NDKNSI+QSA R IQEMKALE+ LKRRNLELEMAIARKKKEKEK TTPII+VAL
Subjt:  ELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVAL

Query:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNFNNNKGAYEVGPEFQ
        SNPSCGINSMLAVLN LKTVGVNSKAIHATF NSQFSAQLAIDTHMGAAEVERALQVTL+EAERKFQRQC EGSK+IKE+HFNFNN++G + VGPEF+
Subjt:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNFNNNKGAYEVGPEFQ

TrEMBL top hitse value%identityAlignment
A0A0A0LIK3 BHLH domain-containing protein1.4e-9571.97Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATG-LGQENNPTTTT-----------VTTSYNSRNVNKRMVEYWRKHWHEKK
        MD TF  EFWHND FWLDAPISVP +      PA+QISAF+PYD S   TG LGQ+NN  TTT            +TSY+SRNVNKRM+EYWRKHWHEKK
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATG-LGQENNPTTTT-----------VTTSYNSRNVNKRMVEYWRKHWHEKK

Query:  ETS-PLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETT
        E     G+LEREKCHRHMLNERMRREKQKQ YLALHSMLP+NTK         NDKNSIVQSAAR IQEMK LE+ LKRRN ELEM IA   K+KEK+  
Subjt:  ETS-PLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETT

Query:  PIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHF
         II+VAL+N SCGINSML VLNVLKTVGVNS AIHATF NSQFSAQLAIDTHM  AEVERALQ+TLNEAERKFQRQ  EGSK+IKES+F
Subjt:  PIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHF

A0A1S3CPU0 transcription factor bHLH924.0e-9874.3Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATGLGQENN---PTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETS-PLGEL
        MD TF  EFWHND FWLDAPISVP +      PA QISAF+PY  +     LGQ+NN    TTTT  TSY+SRNVNKRM+EYW KHWHEKKE     G+L
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATGLGQENN---PTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETS-PLGEL

Query:  EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRN--LELEMAIARKKKEKEKETTPIISVAL
        EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTK         NDKNSIVQSAAR IQEMK LE+ LKRRN  LE+E+AIA KKKEKEK    II+VAL
Subjt:  EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRN--LELEMAIARKKKEKEKETTPIISVAL

Query:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNF
         N SCGINSML VLNVLKTVGVNSKAIHATF NSQFSAQLAIDTHMGAAEVERALQVTLNEAERKF+RQ  EGSK+IK+++F F
Subjt:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNF

A0A5A7UQX2 Transcription factor bHLH924.0e-9874.3Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATGLGQENN---PTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETS-PLGEL
        MD TF  EFWHND FWLDAPISVP +      PA QISAF+PY  +     LGQ+NN    TTTT  TSY+SRNVNKRM+EYW KHWHEKKE     G+L
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAG-----PARQISAFVPYDPSQVATGLGQENN---PTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETS-PLGEL

Query:  EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRN--LELEMAIARKKKEKEKETTPIISVAL
        EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTK         NDKNSIVQSAAR IQEMK LE+ LKRRN  LE+E+AIA KKKEKEK    II+VAL
Subjt:  EREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRN--LELEMAIARKKKEKEKETTPIISVAL

Query:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNF
         N SCGINSML VLNVLKTVGVNSKAIHATF NSQFSAQLAIDTHMGAAEVERALQVTLNEAERKF+RQ  EGSK+IK+++F F
Subjt:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNF

A0A6J1EQ49 transcription factor bHLH92-like2.4e-7462.79Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM
        MD  F  EFWH D  WLD PI V    P  QISAFVPY P Q   G  Q+NNP T T   +  S NV+KR++EYWRK W EKK+ + +G+LERE+ HRHM
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM

Query:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML
        LNERMRREK ++SY  LHSMLP  TK         NDKNSIVQ AAR I+E+KA E +LK+RN+ELEMA++ KK+++EK TTP I VA++NPS GINSML
Subjt:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML

Query:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQ
         VLN+LKTVGVN+KAIHATF++S+FS  +AID+HM AAEVERALQ+TL+EAERK QRQ
Subjt:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQ

A0A6J1KK97 transcription factor bHLH92-like1.6e-7862.69Show/hide
Query:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM
        MD  F  EFWH D +WLD PISV    P  QISAFVPY P Q A G GQ+NNP   T      S N++KR+++YWRK W EKK+ + +G+LEREK H+HM
Subjt:  MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHM

Query:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML
        LNERMRREK ++SY  LHSMLP  TK         NDKNSIVQ AAR I+E+KA E +LK+RN+ELEMA++ KK+++EK TTP I VA++NPS GINSML
Subjt:  LNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSML

Query:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKE
         VLN+LKTVGVN+KAIHATF++SQFS  +AI++HM AAEVERALQ+TL+EAERKFQRQC EGS  +K+
Subjt:  AVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKE

SwissProt top hitse value%identityAlignment
Q75KV9 Transcription factor BHLH1481.2e-1435.37Show/hide
Query:  ELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVAL
        ++E  +  RHM+ ER RREK  QSY  L++M+   +K          DKNSIVQSAA  I E+K   + L+RRN EL+   A+     E++    +   +
Subjt:  ELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVAL

Query:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAER
          PS  I+SM+A L  LK + V ++ I ++   ++   ++ ++T + A EVE+A++  L E ER
Subjt:  SNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAER

Q9FIX5 Transcription factor bHLH925.1e-2644.21Show/hide
Query:  NVNKRMVEYWRKHWHEKKET-SPLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNL
        NV KRMV   RK+W EKK T +P    E+E+  RHML ER RREKQKQSYLALHS+LP  TK         NDKNSIV+ A   I +++ L++ L RR  
Subjt:  NVNKRMVEYWRKHWHEKKET-SPLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNL

Query:  ELEMAIARKKKEKEKETTPIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERK
         +E   A+   ++  ET   + V L  P  G++SML  L+ LK++G   K +HA F   +FSA + I+T +   EVE+ ++  L E E K
Subjt:  ELEMAIARKKKEKEKETTPIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERK

Arabidopsis top hitse value%identityAlignment
AT5G43650.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.6e-2744.21Show/hide
Query:  NVNKRMVEYWRKHWHEKKET-SPLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNL
        NV KRMV   RK+W EKK T +P    E+E+  RHML ER RREKQKQSYLALHS+LP  TK         NDKNSIV+ A   I +++ L++ L RR  
Subjt:  NVNKRMVEYWRKHWHEKKET-SPLGELEREKCHRHMLNERMRREKQKQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNL

Query:  ELEMAIARKKKEKEKETTPIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERK
         +E   A+   ++  ET   + V L  P  G++SML  L+ LK++G   K +HA F   +FSA + I+T +   EVE+ ++  L E E K
Subjt:  ELEMAIARKKKEKEKETTPIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATFVNSQFSAQLAIDTHMGAAEVERALQVTLNEAERK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCATACTTTTACTGCGGAATTCTGGCATAACGATTTCTTTTGGCTCGACGCTCCCATTTCCGTCCCCGCCGCCGGTCCTGCGAGACAGATAAGTGCTTTCGTACC
ATATGATCCGTCTCAAGTGGCTACCGGATTGGGACAAGAAAACAATCCGACAACTACAACCGTCACCACGAGCTATAATTCTAGGAATGTCAACAAGAGGATGGTTGAGT
ATTGGAGGAAGCATTGGCATGAAAAGAAAGAAACAAGTCCCCTAGGGGAATTAGAGAGAGAAAAATGTCACCGTCACATGTTGAACGAGAGGATGAGAAGAGAGAAACAG
AAACAGAGTTATTTGGCACTCCACTCCATGCTCCCCAAAAATACTAAGCGTCCAAATTTAGTAGTGTTTGTCATTAACGATAAGAATTCGATCGTTCAAAGTGCGGCGAG
AGCAATACAAGAAATGAAAGCCTTAGAGGAGATTTTAAAGAGGAGAAACTTAGAGTTGGAGATGGCAATAGCAAGGAAGAAGAAAGAAAAGGAAAAAGAGACAACACCAA
TAATAAGTGTAGCATTGTCCAATCCTTCTTGTGGGATCAACTCAATGCTGGCTGTTCTCAATGTTCTCAAAACTGTTGGAGTAAATTCCAAAGCCATTCATGCCACTTTC
GTCAACTCTCAGTTTTCAGCCCAATTAGCCATTGATACCCATATGGGAGCTGCCGAAGTAGAAAGAGCATTGCAGGTGACGCTAAACGAAGCTGAGAGGAAATTTCAAAG
GCAATGCATGGAAGGGTCCAAACAAATAAAAGAGAGCCATTTTAATTTTAATAATAATAAAGGGGCCTACGAGGTGGGTCCAGAATTTCAATTTTAA
mRNA sequenceShow/hide mRNA sequence
GTTATCCCCTCTCTTCTCCCCTCATTTCCTCCCACGAGACCGTCGCAAGATCTGAACTGAATTTTGAAAAGAAGAAAAACTTGATCACCCAACTGATATATGGATCATAC
TTTTACTGCGGAATTCTGGCATAACGATTTCTTTTGGCTCGACGCTCCCATTTCCGTCCCCGCCGCCGGTCCTGCGAGACAGATAAGTGCTTTCGTACCATATGATCCGT
CTCAAGTGGCTACCGGATTGGGACAAGAAAACAATCCGACAACTACAACCGTCACCACGAGCTATAATTCTAGGAATGTCAACAAGAGGATGGTTGAGTATTGGAGGAAG
CATTGGCATGAAAAGAAAGAAACAAGTCCCCTAGGGGAATTAGAGAGAGAAAAATGTCACCGTCACATGTTGAACGAGAGGATGAGAAGAGAGAAACAGAAACAGAGTTA
TTTGGCACTCCACTCCATGCTCCCCAAAAATACTAAGCGTCCAAATTTAGTAGTGTTTGTCATTAACGATAAGAATTCGATCGTTCAAAGTGCGGCGAGAGCAATACAAG
AAATGAAAGCCTTAGAGGAGATTTTAAAGAGGAGAAACTTAGAGTTGGAGATGGCAATAGCAAGGAAGAAGAAAGAAAAGGAAAAAGAGACAACACCAATAATAAGTGTA
GCATTGTCCAATCCTTCTTGTGGGATCAACTCAATGCTGGCTGTTCTCAATGTTCTCAAAACTGTTGGAGTAAATTCCAAAGCCATTCATGCCACTTTCGTCAACTCTCA
GTTTTCAGCCCAATTAGCCATTGATACCCATATGGGAGCTGCCGAAGTAGAAAGAGCATTGCAGGTGACGCTAAACGAAGCTGAGAGGAAATTTCAAAGGCAATGCATGG
AAGGGTCCAAACAAATAAAAGAGAGCCATTTTAATTTTAATAATAATAAAGGGGCCTACGAGGTGGGTCCAGAATTTCAATTTTAAATAGCCACGTGGGCACGTGGGAAA
TAAGGT
Protein sequenceShow/hide protein sequence
MDHTFTAEFWHNDFFWLDAPISVPAAGPARQISAFVPYDPSQVATGLGQENNPTTTTVTTSYNSRNVNKRMVEYWRKHWHEKKETSPLGELEREKCHRHMLNERMRREKQ
KQSYLALHSMLPKNTKRPNLVVFVINDKNSIVQSAARAIQEMKALEEILKRRNLELEMAIARKKKEKEKETTPIISVALSNPSCGINSMLAVLNVLKTVGVNSKAIHATF
VNSQFSAQLAIDTHMGAAEVERALQVTLNEAERKFQRQCMEGSKQIKESHFNFNNNKGAYEVGPEFQF