; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0186 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0186
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiontranscription initiation factor TFIID subunit 9
Genome locationMC09:1654872..1662067
RNA-Seq ExpressionMC09g0186
SyntenyMC09g0186
Gene Ontology termsGO:0043966 - histone H3 acetylation (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0051123 - RNA polymerase II preinitiation complex assembly (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0005669 - transcription factor TFIID complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
GO:0016251 - RNA polymerase II general transcription initiation factor activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR003162 - Transcription initiation factor TAFII31
IPR009072 - Histone-fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575445.1 Transcription initiation factor TFIID subunit 9, partial [Cucurbita argyrosperma subsp. sororia]1.47e-12286.05Show/hide
Query:  MSRGTGGG------YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAI
        MSRG+GGG      YDNQNTNFSPEARPSQLG+RG KE++EGDEDLPRDA+IVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAI
Subjt:  MSRGTGGG------YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAI

Query:  DCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HT
        DCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR+IGGPGIALPPD D L+SPNYQLAIP+K+ VE MEETEE+E VD   SQE STEVPQ HT
Subjt:  DCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HT

Query:  PQRVSFPLAKRPKIT
        PQRVSFPL+KRPKIT
Subjt:  PQRVSFPLAKRPKIT

KAG7013987.1 Transcription initiation factor TFIID subunit 9 [Cucurbita argyrosperma subsp. argyrosperma]3.13e-12387.68Show/hide
Query:  MSRGTGGG--YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDD
        MSRG+GGG  YDNQNTNFSPEARPSQLG+RG KE++EGDEDLPRDA+IVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAIDCDD
Subjt:  MSRGTGGG--YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDD

Query:  VKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQRV
        VKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR+IGGPGIALPPD D L+SPNYQLAIP+K+ VE MEETEE+E VD   SQE STEVPQ HTPQRV
Subjt:  VKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQRV

Query:  SFPLAKRPKIT
        SFPL+KRPKIT
Subjt:  SFPLAKRPKIT

XP_022144675.1 transcription initiation factor TFIID subunit 9 [Momordica charantia]1.28e-144100Show/hide
Query:  MSRGTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVK
        MSRGTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVK
Subjt:  MSRGTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVK

Query:  LAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVSFP
        LAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVSFP
Subjt:  LAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVSFP

Query:  LAKRPKIT
        LAKRPKIT
Subjt:  LAKRPKIT

XP_022953465.1 transcription initiation factor TFIID subunit 9 [Cucurbita moschata]6.70e-12488.83Show/hide
Query:  GTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAI
        G GGGYDNQNTNFSPEARPSQLG+RG KE++EGDEDLPRDA+IVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAIDCDDVKLAI
Subjt:  GTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAI

Query:  QSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQRVSFPLA
        QSKVNFSFSQPPPREVLLELARNRNKIPLPR+IGGPGIALPPD D L+SPNYQLAIP+K+ VE MEETEE+E VD   SQEPSTEVPQ HTPQRVSFPL+
Subjt:  QSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQRVSFPLA

Query:  KRPKIT
        KRPKIT
Subjt:  KRPKIT

XP_022992393.1 transcription initiation factor TFIID subunit 9 [Cucurbita maxima]5.82e-12487.79Show/hide
Query:  MSRGTGGG----YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDC
        MSRG GGG    YDNQNTNFSPEAR SQLG+RG KE++EGDEDLPRDARIVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAIDC
Subjt:  MSRGTGGG----YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDC

Query:  DDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQ
        DDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR+IGGPGIALPPD+D L+SPNYQLAIPKK+PVE MEETEE+E VD   SQE STEVPQ HTPQ
Subjt:  DDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQ

Query:  RVSFPLAKRPKIT
        RVSFPL+KRPKIT
Subjt:  RVSFPLAKRPKIT

TrEMBL top hitse value%identityAlignment
A0A1S3CGN5 transcription initiation factor TFIID subunit 9 isoform X21.67e-9986.59Show/hide
Query:  VTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPL
        +++GDE+LPRDA+IVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPL
Subjt:  VTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPL

Query:  PRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVD-FASSQEPST-EVPQ-HTPQRVSFPLAKRPKIT
        PR+IGGPGIALPPD D L+SPNYQLAIPKKQ VETMEETEE+E  D    SQEPS+ EVPQ H PQRVSFPLAKRPK+T
Subjt:  PRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVD-FASSQEPST-EVPQ-HTPQRVSFPLAKRPKIT

A0A1S4E4D8 transcription initiation factor TFIID subunit 9 isoform X13.38e-10186.81Show/hide
Query:  SKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNK
        SKE+++GDE+LPRDA+IVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNK
Subjt:  SKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNK

Query:  IPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVD-FASSQEPST-EVPQ-HTPQRVSFPLAKRPKIT
        IPLPR+IGGPGIALPPD D L+SPNYQLAIPKKQ VETMEETEE+E  D    SQEPS+ EVPQ H PQRVSFPLAKRPK+T
Subjt:  IPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVD-FASSQEPST-EVPQ-HTPQRVSFPLAKRPKIT

A0A6J1CSB1 transcription initiation factor TFIID subunit 96.21e-145100Show/hide
Query:  MSRGTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVK
        MSRGTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVK
Subjt:  MSRGTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVK

Query:  LAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVSFP
        LAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVSFP
Subjt:  LAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVSFP

Query:  LAKRPKIT
        LAKRPKIT
Subjt:  LAKRPKIT

A0A6J1GPP8 transcription initiation factor TFIID subunit 93.24e-12488.83Show/hide
Query:  GTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAI
        G GGGYDNQNTNFSPEARPSQLG+RG KE++EGDEDLPRDA+IVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAIDCDDVKLAI
Subjt:  GTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAI

Query:  QSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQRVSFPLA
        QSKVNFSFSQPPPREVLLELARNRNKIPLPR+IGGPGIALPPD D L+SPNYQLAIP+K+ VE MEETEE+E VD   SQEPSTEVPQ HTPQRVSFPL+
Subjt:  QSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQRVSFPLA

Query:  KRPKIT
        KRPKIT
Subjt:  KRPKIT

A0A6J1JZ27 transcription initiation factor TFIID subunit 92.82e-12487.79Show/hide
Query:  MSRGTGGG----YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDC
        MSRG GGG    YDNQNTNFSPEAR SQLG+RG KE++EGDEDLPRDARIVK LLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHA KAAIDC
Subjt:  MSRGTGGG----YDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDC

Query:  DDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQ
        DDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR+IGGPGIALPPD+D L+SPNYQLAIPKK+PVE MEETEE+E VD   SQE STEVPQ HTPQ
Subjt:  DDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQ-HTPQ

Query:  RVSFPLAKRPKIT
        RVSFPL+KRPKIT
Subjt:  RVSFPLAKRPKIT

SwissProt top hitse value%identityAlignment
Q16594 Transcription initiation factor TFIID subunit 91.2e-2942.77Show/hide
Query:  TEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLP
        T   + +P+DA+++  +LK MG+ +YEPRVI+Q LE  +RYV  +L DA++YS HA KA +D DDV+LAIQ + + SF+ PPPR+ LL++AR RN+ PLP
Subjt:  TEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLP

Query:  RTIGGPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS
              G  LPPD   L +PNY+L ++ KK        T    +V   +S+  +  +   TPQ +S
Subjt:  RTIGGPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS

Q17QQ4 Transcription initiation factor TFIID subunit 91.0e-2842.59Show/hide
Query:  EDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIG
        + +P+DA+++  +LK MG+ +YEPRVI+Q LE  +RYV  +L DA++YS HA KA +D DDV+LAIQ + + SF+ PPPR+ LL++AR RN+ PLP    
Subjt:  EDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIG

Query:  GPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS
          G  LPPD   L +PNY+L ++ KK        T    +V   +S+  +  +   TP  +S
Subjt:  GPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS

Q5BKE0 Transcription initiation factor TFIID subunit 95.5e-3043.83Show/hide
Query:  EDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIG
        + +P+DA+++  +LK MG+ +YEPRVI+Q LE  +RYV  +L DA++YS HA K  +D DDV+LAIQ + + SF+ PPPR+ LL++AR RN+ PLP    
Subjt:  EDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIG

Query:  GPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS
          G  LPPD   L +PNY+L ++ KK P      T    +V   SS+  +  +   TPQ +S
Subjt:  GPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS

Q8VI33 Transcription initiation factor TFIID subunit 91.5e-3044.44Show/hide
Query:  EDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIG
        + +P+DA+++  +LK MG+ +YEPRVI+Q LE  +RYV  +L DA++YS HA KA +D DDV+LAIQ + + SF+ PPPR+ LL++AR RN+ PLP    
Subjt:  EDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPRTIG

Query:  GPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS
          G  LPPD   L +PNY+L ++ KK P      T    +V   SS+  +  +   TPQ +S
Subjt:  GPGIALPPDLDALVSPNYQL-AIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVS

Q9SYH2 Transcription initiation factor TFIID subunit 93.7e-6670.56Show/hide
Query:  EGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR
        EG+ED+PRDA+IVK+LLKSMGVEDYEPRVIHQFLELWYRYVV+VLTDAQVYSEHA K  IDCDDVKLAIQSKVNFSFSQPPPREVLLELA +RNKIPLP+
Subjt:  EGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR

Query:  TIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENV--------DFASSQEPSTEVPQHTPQRVSFPLAKRPK
        +I GPG+ LPP+ D L+SPNYQL IPKK      EETE++E +        +    Q+ ++++P  TPQRVSFPL++RPK
Subjt:  TIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENV--------DFASSQEPSTEVPQHTPQRVSFPLAKRPK

Arabidopsis top hitse value%identityAlignment
AT1G54140.1 TATA binding protein associated factor 21kDa subunit2.6e-6770.56Show/hide
Query:  EGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR
        EG+ED+PRDA+IVK+LLKSMGVEDYEPRVIHQFLELWYRYVV+VLTDAQVYSEHA K  IDCDDVKLAIQSKVNFSFSQPPPREVLLELA +RNKIPLP+
Subjt:  EGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFSFSQPPPREVLLELARNRNKIPLPR

Query:  TIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENV--------DFASSQEPSTEVPQHTPQRVSFPLAKRPK
        +I GPG+ LPP+ D L+SPNYQL IPKK      EETE++E +        +    Q+ ++++P  TPQRVSFPL++RPK
Subjt:  TIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENV--------DFASSQEPSTEVPQHTPQRVSFPLAKRPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCGTGGAACTGGAGGTGGATATGACAACCAAAACACGAATTTCTCTCCCGAAGCTCGACCTTCACAACTGGGTGAAAGGGGCAGTAAGGAAGTGACTGAGGGGGA
CGAGGACCTGCCAAGGGATGCAAGGATTGTGAAAGCACTTCTGAAATCAATGGGAGTAGAAGATTATGAACCGCGTGTTATACACCAGTTTCTTGAGCTGTGGTACCGTT
ATGTCGTTGATGTATTGACGGATGCACAAGTTTACTCGGAGCACGCATGCAAGGCTGCTATTGACTGTGATGATGTGAAGCTTGCCATTCAGTCAAAAGTTAATTTCAGC
TTCTCACAACCTCCTCCAAGAGAGGTTCTTCTAGAGCTGGCCAGGAACAGGAACAAAATTCCATTGCCAAGGACAATTGGCGGGCCTGGTATCGCCCTTCCGCCCGACCT
AGACGCATTGGTCAGTCCCAACTACCAACTGGCAATCCCAAAGAAGCAGCCAGTTGAAACCATGGAGGAAACGGAAGAGGAGGAAAACGTCGATTTCGCTTCGTCGCAAG
AACCAAGCACAGAAGTACCACAACATACCCCACAGAGAGTATCCTTCCCATTGGCAAAACGTCCAAAAATAACATAG
mRNA sequenceShow/hide mRNA sequence
GATGATGTGATTGACAATAAGATATGTTTTGTACTAAGTTGTAAATATATTATATTTTAATTTAAAAATTGACAATAAGATATGTTTTGTACTAAGTTGTAAATATATTA
TATTTTAATTTAAAAATTGACAATAAGATATGTTTTGTACTAAGTTGTAAATATATTATATTTTAATTTAAAAATAACGTTGAATCTCTCACCCTTCAATTATTGAACTA
AAAAAATGATAAAATTATTAAGACAAATATATATTAACCTAAATTAAATCCTACCCATGTCTAACTATCTTGTGATCAAGTTTAAAATAAAACATCGTCAACTACGTTAT
TCTAACTTTTTATTTATGATAATCAAATAATTTTTCATGAAATATAGATACTTAGTTTTTCTTTTTCTTTTCATTGAGTAATGTGAGGGTGAGATCGAACTCACAACCTT
TTTCGAAGATATACGTTAATTATCACTGCTGTAGCAATCTAACTACTTAAAAATTGTGGCTTCATATTCCTCCTCCTTGCAAGGGCATATATTAATATGACCTTATTGAA
TTACAATATGAACATTGTCCAAAAAAAAACATTATGACCATTCTAACCCTCCGTCGCACTTTTCAAGTTCCTTCACCACTCTCGATTTGTCTCTCCCTCATTTTGTAGTT
CAGTTCGGTCGGCGCTGCTTTAGACTATTATCGTTTCGACGCAAATTTTGTTTGTTTTTAAGTTTATTTTCGGTCGGTTCCTTCAGTTTCGATTTTACTGATAAACGGGA
TCGTTAACACCCCGTATTTGTTAGGGCTACGAACTTTGTGGTTTGAATTCTCCAAACGATGAGTCGTGGAACTGGAGGTGGATATGACAACCAAAACACGAATTTCTCTC
CCGAAGCTCGACCTTCACAACTGGGTGAAAGGGGCAGTAAGGAAGTGACTGAGGGGGACGAGGACCTGCCAAGGGATGCAAGGATTGTGAAAGCACTTCTGAAATCAATG
GGAGTAGAAGATTATGAACCGCGTGTTATACACCAGTTTCTTGAGCTGTGGTACCGTTATGTCGTTGATGTATTGACGGATGCACAAGTTTACTCGGAGCACGCATGCAA
GGCTGCTATTGACTGTGATGATGTGAAGCTTGCCATTCAGTCAAAAGTTAATTTCAGCTTCTCACAACCTCCTCCAAGAGAGGTTCTTCTAGAGCTGGCCAGGAACAGGA
ACAAAATTCCATTGCCAAGGACAATTGGCGGGCCTGGTATCGCCCTTCCGCCCGACCTAGACGCATTGGTCAGTCCCAACTACCAACTGGCAATCCCAAAGAAGCAGCCA
GTTGAAACCATGGAGGAAACGGAAGAGGAGGAAAACGTCGATTTCGCTTCGTCGCAAGAACCAAGCACAGAAGTACCACAACATACCCCACAGAGAGTATCCTTCCCATT
GGCAAAACGTCCAAAAATAACATAGTTGTTTTTCTTCTTGGTCGCCTTCCTCCCAACTTCCCCTATGAAAAAGTTCTTCACACACGTTCATCTGAAGTTGGCTGGAGCTT
TGTTGGATCTCATCATACCGTGTTCGAACCGGATCTATTGTATAATATAACGTTGTTGCTGTTAATTAAAAGACTTCAGTGATTCTCTCTTTCTAGCATTTTGTCAATTC
CAAAGGAATCAATGAGAATGGAATATGGACGTTTCTTTCCCACTTTTGTGGAGGCATGGTATCACGGGTCTAAAAGTTGTCTGTCAAAATCAAAATTAAATGTAAGCCAA
CACATGAAGTTTTCTTTTTTAAGGACATCATATTAAAGAAAGATTAGGTTCTGAAAATGAAATTCTACAAGAATGAGCTCCATGATTTTGAACTTTGTTGAATTTCAGTT
AGTCATCTATTTCTTGGTACAGAACTATTGGGAGTTGTTCTGTACAAAGACCACTTATATACAAGAATAATTTGATCATCACTAAACGTATATCATAGGCAAACTAAGGT
ATGGTCAGCTGCAGAAACCAACAGTGTTGGGAGCAGCCGGGGATGGTCCCGTGTTTGAAACCGTAATCCCTCGGTCGTACATTCGCTGGATTTCTTCCCTGTACTCCGTC
AATGACTTGTCGGGGATGGCTGGAGTTAACTCTACCACATCACCCATCTTCAGCTTGCACTTTGGATCGTTCACTCGCTCATGGTTTAGCCTCGGCCTCAAATCTTCTTT
CATGGGGAAGCCACCATGGGAGGCCCATCTTGAACTTCCTCGCCCACATCTTTCCATCAAGTCCATGATAGTTGAATTTGCAGGAAACTCTTGCACTGACATCTGCAGTA
TTCAAACATCGTCATTGTTAGACTGCATCACACAACCGAAATCAAGAACCTAGCACTAATGTAACAGGAAAAAGGGGTAACGAACCTTGTCGTTCTCAATCGTTATGACG
AAAATAGGTCCATCCTGGCCATCACATTGAGTTTTGTATGAATATGGACAGCCTTCGGAATGAGAAGGGAATTTGCAGGGTGGTTTGATTGAATCAGCAGAGCCAATGGA
CGATCCATCTTTGCTCATAGCCAGACACTGCCAAGTGACAACCCATCGGGCCCATTCAACCATCTGAACGACGAAAGGAGAGTATTCGCTGTCGCCCTCCTTATATCTCC
AATGAGCTGCAATCCCAAACTCAGCTTGCAAATGCATCTCTTTCGTCCTAATTTGAACTTCAAGGGGCGCCATGTCTTCACCCATCACGACCGTGTGTAGAGATCGATAC
CCATTAAACTTTGGATGGCTAATATAATCTTTGCACCGCCCGGGCACTTCAGACCACAACTGGTGAACTATTCTTAAAGCTCTCTGGCAGTCTTCTTCATTTTTTACAAT
AAGCCTTAAGCCGTGAATGTCGTGGATTTCATCCATGGTCAACTTCTTTCTGTAAATAACAAGGAAAAGTGCACACTTTTAGTGAGCCATCACTATGCAAGATTCATGTC
AGAATTCATAACCATCAGATCATGGAAAAACAAGCTCTTTAACTTGAAGTTTTGATAGACTTATTTAAGATCTGGGTTCTTTTCACCATAAAAATGTTTATTTCCAAAGC
TGAAAACTTAAGGATTTTGGCTGTTAAATAGAATATGCAACAGACTAGAGGTGAAATGGAGAGTAGAGAGAGAACGTACCTCAACATTTTTAAGTAAATGCTGTATAAGC
TCTTATTGCGTCCAGATAAGAGATGGTAAGAAATCCCTTCATCCTTAAGAGCCTGATCTAATTTCTCAATTGCAGAAGTTATCATTGCAGAATCAAATGAATCCACAAGT
TTCGACGACAGTTCTTTATGCTCTTCAGGATGGAGATGCTTAAAGCATAAATTTTCCAACTGCTCCTTCCAACTCAATATTCCTAAGCGATTAGCCAATGGTACAAAAAT
TTCCAAAGTCTCCTTGGCAAATCTTAATCGTTTTGTCGATGGCAATGCATCTAATGTCATCATATTATGTAGACGATCAGCCAATTTAACAAGAACAGCCCGAGTATCTG
CCATGGCAAGGAACATGGTATGCAAGCGATCTGCCTCGACGGTTTTGTTGGCAGTATTGTTCTCACGTGCAAGTTTGCTCAAATGACTTAGCTGAGACACCTAAACCCAT
TACAAAGTGAAAATTTAGAATACAAAAACTTTGGAGTAATGTCGAGAGTATAAAACGCATAGGACAGAACTGTATGAGAAAATAAAAACTTGAGAACCAAAAGATTGCAT
CCAACAGGTAAAAGCAAACTAAGATCAGTTCTAATACAGATCCACAATTTAACCACAATGATTTCATTTCACCACACGCTCTAATAATAGTTAAATCACAAAACACACTC
CCCATGTGGTCACATATCCAGAGCTCAGAGCCATTAAATCCAATCAACCTTCATAGATTATGAAAGAAAATTTTGCTCACCTCTTCAACTAAATCAGCAACCCCAGCTCC
GGCTGTCCCTAAAATGTAGTCATAACACATGAAAGAATCATCCAGCACATCGTGCAGAAGCCCAGCAGCCACCACCGTGGAATTTGCACCGATCAATGCCAGCAACATCG
CCGTTTCCACACAATGTTGTAAATATGGATCTCCACTGGCTCGCATCTGCAAAAGAAACCATACAAATTCAATCATCTTCGCTCATGCCACAGATTGAAACAGGAGAAAA
AAGTTCATCAAAATGAATTCAATTACTTGTCCTCTATGCGCCTTCTCGGCTTCGTAAAACGCCTTGATCACAAATTCATCGAGAAAGATCTTATGCCTCATTTGTGCTCC
AAGAAGCATATCCCTTGCGTATGGCTCCGAAGTGGACTCCGCAAACCCATCTTCCAAATTAAAAGTCAACTCATCCATGAGCACAGCCGATGAGCTCACATCGAGAGCAT
TCCTGTGAACATCCACATACGATCCCGAAGCATTCCTCAAGAACCCATTGAAGAACCCGTTGGACCCAACTCCAATCGAGCTCTCGAAATTAACTTCCCCACTCTTCTCT
CGCGAAATCCACATCGGAGGACTTTTCGCCGACGAACCAACTCCACTACTACAACACGAAACCGGTCCCTGAAACACCGACACCGGGCTCGAATCCCTAGTCAATGAAGA
CCCCAAATATTTGCTCGAAGAATATCGAAATGAGCTGCTCAAGTCCTCCCCTCTATCATGCCATAACGATCCCAATTCCTCCCCGCAACCGGAAAAGCCCGCCGTCGATG
AGACATGCCTTACCGGGGACGCGGAGAATAGGCACGACAGCCCACCGGCCGCCGGCTTCTGGGATGCGGATGCCGTTGACGGCGCCGACGAAGATCGAGAACCGATTTCG
AAATCGAATGATGCATGAGCATTGATCTGGCAAGGATGCGTAGAGCAAATACTGCTCGCTGGGCCCGCGTATAGAGCTATAGTTGGAACTCCCATGATCTAGGGCAAAAA
GTCACTTTCGTCTCCCGCTCAGATCGCAGTCGCACACCCAAATAAGTTCTTGAGATCAGAAAACAATTCTGTTCGAATCAAATTTCCTCAACGACCGAACTAATTTCCTC
GCTAATCCGTTACCAATTCCACGGAAAAATATACGTGCGTATGATTACAGAACGAGCAACTTTCTGAAATGACCGAAAACTGTTCTTTCCCGGAAAGTATAAATTGGTAT
CAACCTTAACAGAAGTACCACGAATCGAACAATTTTATACAGAGAAGAGATGAAAGAAATAAACAATAGACTACAAAAAATTTCCTCAATGACCCACTAATCTAACCTAA
TCGATAAACCCAGATCAGTACTACAATCGAATCAAAGAGCATCAACTTTCTGAGAGACCTGAAATTTTGCCCTCGGAAAGCAGAGAATCGGTACCGAAGTAATTAACGAA
GGAAGAACAAAACAACCAAAAATTTCTAGATCGTATAGAACAATATGTACAGATTTATAGAAGTTGAGCTGAAATTGAGAGATGAAGGAGAAAACGCCCCGCAATCGAAG
TGGAATTACGACAAAAAAATCGAAGAAGAAGTGTCAAAGAAGAACAGGGAATTGGTGAGGAGAGATTGAAAGGAACAGGGAGAGCTTCCTACGATATTCACAGAATCCGT
GTCCGTGACCGAGTTCTATTTACGTTACCATTACAGCGAAGATTAAATAAACAAAAAAAGATTTTAAAAGAAAACATATCCGAAATTATTAAATGACTGAAATAACCTCA
ACTTGTA
Protein sequenceShow/hide protein sequence
MSRGTGGGYDNQNTNFSPEARPSQLGERGSKEVTEGDEDLPRDARIVKALLKSMGVEDYEPRVIHQFLELWYRYVVDVLTDAQVYSEHACKAAIDCDDVKLAIQSKVNFS
FSQPPPREVLLELARNRNKIPLPRTIGGPGIALPPDLDALVSPNYQLAIPKKQPVETMEETEEEENVDFASSQEPSTEVPQHTPQRVSFPLAKRPKIT