; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029154 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029154
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptiongeneral transcription and DNA repair factor IIH subunit TFB1-1-like
Genome locationscaffold11:23803089..23811642
RNA-Seq ExpressionSpg029154
SyntenySpg029154
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0006351 - transcription, DNA-templated (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR027079 - TFIIH subunit Tfb1/GTF2H1
IPR035925 - BSD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597731.1 General transcription and DNA repair factor IIH subunit TFB1-1, partial [Cucurbita argyrosperma subsp. sororia]5.7e-8593.18Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TERKFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGE  Q PSEK VATFPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQLVGFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

KAG7029183.1 putative RNA polymerase II transcription factor B subunit 1-1 [Cucurbita argyrosperma subsp. argyrosperma]5.7e-8593.18Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TERKFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGE  Q PSEK VATFPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQLVGFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

XP_008457278.1 PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucumis melo]9.7e-8591.48Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TE KFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALAK GE AQ PSE+PVA FPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD+SKKSKQL+GFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

XP_022932684.1 probable RNA polymerase II transcription factor B subunit 1-1 [Cucurbita moschata]5.7e-8593.18Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TERKFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGE  Q PSEK VATFPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQLVGFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

XP_022972269.1 probable RNA polymerase II transcription factor B subunit 1-1 [Cucurbita maxima]5.7e-8593.18Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TERKFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGE  Q PSEK VATFPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQLVGFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

TrEMBL top hitse value%identityAlignment
A0A1S3C6E8 probable RNA polymerase II transcription factor B subunit 1-1 isoform X14.7e-8591.48Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TE KFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALAK GE AQ PSE+PVA FPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD+SKKSKQL+GFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

A0A5A7V248 Putative RNA polymerase II transcription factor B subunit 1-1 isoform X14.7e-8591.48Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TE KFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALAK GE AQ PSE+PVA FPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD+SKKSKQL+GFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

A0A6J1DNY2 probable RNA polymerase II transcription factor B subunit 1-1 isoform X21.8e-8491.48Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TERKFVF+PSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGE AQ  SE+ VATFPHEQLSK EM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRM+CLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFK+SMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

A0A6J1EXP7 probable RNA polymerase II transcription factor B subunit 1-12.8e-8593.18Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TERKFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGE  Q PSEK VATFPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQLVGFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

A0A6J1IB04 probable RNA polymerase II transcription factor B subunit 1-12.8e-8593.18Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        TERKFVFRPSDPTSASKLDVEFR IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGE  Q PSEK VATFPHEQLSKSEM
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQLVGFKSSMVLDTKPMSDGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

SwissProt top hitse value%identityAlignment
P32780 General transcription factor IIH subunit 11.4e-0437.84Show/hide
Query:  SKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKS--KQLVGFKSSMVLDTKPMSDG
        +  E+E + R LQED  L +L+K  V+  V++  EFWA R  +   D+S  S  KQ VG  ++ + D +P +DG
Subjt:  SKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKS--KQLVGFKSSMVLDTKPMSDG

Q3ECP0 General transcription and DNA repair factor IIH subunit TFB1-16.6e-4454.86Show/hide
Query:  ERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEME
        E   +F P+DP S SKL V  + IK  K TKEGSNKPPWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK     +    K V +   EQLS  E+E
Subjt:  ERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEME

Query:  LRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        LR + L+E+SELQ+LHKQFV   VLTE EFWA RKKLL +D+ +KSKQ +G KS MV   KP +DGR + +TF+L
Subjt:  LRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

Q55FP1 General transcription factor IIH subunit 14.2e-0632.29Show/hide
Query:  SGEPAQTPSE---KPVATFPHEQLSKSEMELRMRCLQEDSELQKLHKQFV-IGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSD
        S +PAQ   +   K ++      LS+ +++ R+  LQ + EL++L++Q V    V++ES+FW +RK +L+ D+++  KQ  G  S+++ D +P S+
Subjt:  SGEPAQTPSE---KPVATFPHEQLSKSEMELRMRCLQEDSELQKLHKQFV-IGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSD

Q9DBA9 General transcription factor IIH subunit 11.0e-0438.36Show/hide
Query:  SKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKS-KQLVGFKSSMVLDTKPMSDG
        +  E+E + R LQED  L +L+K  V+  V++  EFWA R  +   D+S  S KQ VG  ++ + D +P +DG
Subjt:  SKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKS-KQLVGFKSSMVLDTKPMSDG

Q9M322 General transcription and DNA repair factor IIH subunit TFB1-37.8e-4555.68Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        +E   +F P+DP S  KL V+   IK  K TKEGSNKPPWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E       K V   P EQLS +E 
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELR + L+E+SELQKLHKQFV   VLTE EFW+ RKKLL +D+ +KSKQ +G KS MV   KP +DGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

Arabidopsis top hitse value%identityAlignment
AT1G55750.1 BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins)4.7e-4554.86Show/hide
Query:  ERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEME
        E   +F P+DP S SKL V  + IK  K TKEGSNKPPWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK     +    K V +   EQLS  E+E
Subjt:  ERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEME

Query:  LRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        LR + L+E+SELQ+LHKQFV   VLTE EFWA RKKLL +D+ +KSKQ +G KS MV   KP +DGR + +TF+L
Subjt:  LRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL

AT3G61420.1 BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins)5.6e-4655.68Show/hide
Query:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM
        +E   +F P+DP S  KL V+   IK  K TKEGSNKPPWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E       K V   P EQLS +E 
Subjt:  TERKFVFRPSDPTSASKLDVEFRLIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEM

Query:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL
        ELR + L+E+SELQKLHKQFV   VLTE EFW+ RKKLL +D+ +KSKQ +G KS MV   KP +DGR + +TF+L
Subjt:  ELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCTCCTCAGCCCACTCTTTCAAGATCGAAAGAAAAACCTTCTCCATCACTACTGACCAGAAGAACCCTAGTCTCTTCCGACTCACTGAAGCTAGCAAG
GAAAAAAGCTTTACTCTGACACTCACAAATGAGACTCTATCTTGGCTCCAATCTTGTTTTGACAGACTTTGCAACCTTCCCCAGACTCAGAAGGTTTTTAACGAG
ATCAGAGTAGATGAAGCTTTTGGTCTTACTGCCTGCATTAACTCCCGCAACACCATTTGGGTCAGCAAGGAACCACGGCTGTTTGGTAAAGGGACGGAAAATACC
TCATACAAGGATGCATTAAAGACGTCTCAAAGGAAATGTGATCCACCCAACCACCAACAACCTCTGGATGTAGTCCCCACTCCTTCTAACCCACAGTTCGCTGCT
TTATACCTGTCATCATCGCCCCATCCAGCCAGACAAAGCCCTCCTGCTTGTAAAGATGAAGAACAAGCTCGGGTTATAGCCAATATTAAGGGGTGGTATAAGGTT
GGAAACTTCCAAGTTCGCTTCTTCCCATGGAGTGCCGAAGCAATGCTGAGTGAACCAAAAGTACCCTCTTACGGTGGCTGGATAAAAGTCAGGAACTTACCATTA
GGCAAATGGTCGATCGACATCTTCAAGAAAATAGGAGATGAATGTGGGGGCTATTTGGAAACAGCAAACAAAACCCTATCTCGTTTGGATATGATGGAGATAGGA
ATCAAGGTCAAGGAAAATCTTTTGGGCTTTCTTCCAGCGATAGTTCACCTCCCCTCAACCTCCAACAGTCCCATATCAGTATCCATCGACCCATTCTTCATGGAG
GAATACAACATAGGTTATGTCATCGGTATTCATGGCAAGATACCATCATTTTCGATGGCTTCCAGTGCTACACGCGCCGATGAGAGAATATATGTTGCGGATAAC
GAACGGGTTTATAACCCACGCGCCACCACTCTGAATGATGAAACAAAAGGGGAAAAGAAACAGGCCACATACAGAAGCGACTTTCCACAGGCCCCAAATGATGTA
TTGGACACATCCACTGCTCTGATGTCTGCGTCTTTATTTGACAGGGACCCACTGGCGCCGTCTATCACCACAGAGCCCCAATCCCCAAATTCCTCTCTTGATAAT
CACTCGGTTAAAGTACCCAATAAAACCCTACCTTTTGACGGTAACCAGAGAGGCCCACTTGGCCCACCCAATACCAACCCATCTCACCCACCTGGCCCACCAAAA
AGCCCTATCCAATGCTTAGCTGCTCATAGAAAGCCCATTGTTATCAACAATAAAAAAACCTATCTCATCATCGGAAACAAACACTCCACGAATACAGAGCTCCCT
GTTTCAGACTCTGAAGGATTTTTATCCTCCTCATGTTCTACCGCCATGATTAGATCCTCAACCTCAACACCTGAAAGAGCGACACAGAACCAGACCTCCCCACCC
ACGATTAGCAGACTTTTCAAACACAATCAAGAACAAGAGCCTTTTATCGAAGATCCTATTCCCTTGCAAATAGAAGAGCCTCAATCCGATCAGCAGGGAATAGGT
TTACAAGACACAGATCTCATTGAGGTTTTTGTGGAAGAAGACGTCTCGGTAGAGTTGTATCCCGAGGACACCAAAATTGACCCAGCTGTATATCTTCCCATGATC
TTCCCCTGGCTGACTGAGCACGGAATGTGCATTATGCCCATGCCTAGTAAACAGAAGACATCTCTTATTGCCAAAAAAAAGGCAAAATGGGCCAAGGAACTTCAA
AACTTACACTCCACGACAGAGCGCAAGTTTGTATTTAGACCCAGTGATCCCACTTCAGCTTCTAAGCTTGACGTGGAGTTCAGATTGATTAAAGGCCATAAAAAC
ACCAAGGAAGGATCAAATAAACCACCATGGCTTAATCTCACCAGGGACCAGGGTGGAAGTTACATTTTTGAGTTTAAAAACTTCTCCGATCTTCATGTTTGCAGG
GAGTTTGTGGGAAGTGCTCTAGCAAAGTCGGGAGAGCCTGCACAAACTCCCTCTGAGAAGCCTGTAGCCACATTTCCTCATGAACAACTCAGTAAATCAGAAATG
GAACTCCGAATGAGATGTTTGCAAGAGGATAGTGAACTGCAGAAACTTCATAAACAATTTGTGATTGGTGGTGTGTTGACCGAATCTGAGTTCTGGGCAGCAAGG
AAGAAATTACTGGAACGAGACAACTCCAAAAAGTCAAAACAGTTGGTTGGTTTCAAGAGTTCAATGGTTCTGGACACCAAACCAATGTCTGATGGTCGGGTCAGC
TTCATTACCTTTTCCCTACCCTATCAAACGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCACCTCCTCAGCCCACTCTTTCAAGATCGAAAGAAAAACCTTCTCCATCACTACTGACCAGAAGAACCCTAGTCTCTTCCGACTCACTGAAGCTAGCAAG
GAAAAAAGCTTTACTCTGACACTCACAAATGAGACTCTATCTTGGCTCCAATCTTGTTTTGACAGACTTTGCAACCTTCCCCAGACTCAGAAGGTTTTTAACGAG
ATCAGAGTAGATGAAGCTTTTGGTCTTACTGCCTGCATTAACTCCCGCAACACCATTTGGGTCAGCAAGGAACCACGGCTGTTTGGTAAAGGGACGGAAAATACC
TCATACAAGGATGCATTAAAGACGTCTCAAAGGAAATGTGATCCACCCAACCACCAACAACCTCTGGATGTAGTCCCCACTCCTTCTAACCCACAGTTCGCTGCT
TTATACCTGTCATCATCGCCCCATCCAGCCAGACAAAGCCCTCCTGCTTGTAAAGATGAAGAACAAGCTCGGGTTATAGCCAATATTAAGGGGTGGTATAAGGTT
GGAAACTTCCAAGTTCGCTTCTTCCCATGGAGTGCCGAAGCAATGCTGAGTGAACCAAAAGTACCCTCTTACGGTGGCTGGATAAAAGTCAGGAACTTACCATTA
GGCAAATGGTCGATCGACATCTTCAAGAAAATAGGAGATGAATGTGGGGGCTATTTGGAAACAGCAAACAAAACCCTATCTCGTTTGGATATGATGGAGATAGGA
ATCAAGGTCAAGGAAAATCTTTTGGGCTTTCTTCCAGCGATAGTTCACCTCCCCTCAACCTCCAACAGTCCCATATCAGTATCCATCGACCCATTCTTCATGGAG
GAATACAACATAGGTTATGTCATCGGTATTCATGGCAAGATACCATCATTTTCGATGGCTTCCAGTGCTACACGCGCCGATGAGAGAATATATGTTGCGGATAAC
GAACGGGTTTATAACCCACGCGCCACCACTCTGAATGATGAAACAAAAGGGGAAAAGAAACAGGCCACATACAGAAGCGACTTTCCACAGGCCCCAAATGATGTA
TTGGACACATCCACTGCTCTGATGTCTGCGTCTTTATTTGACAGGGACCCACTGGCGCCGTCTATCACCACAGAGCCCCAATCCCCAAATTCCTCTCTTGATAAT
CACTCGGTTAAAGTACCCAATAAAACCCTACCTTTTGACGGTAACCAGAGAGGCCCACTTGGCCCACCCAATACCAACCCATCTCACCCACCTGGCCCACCAAAA
AGCCCTATCCAATGCTTAGCTGCTCATAGAAAGCCCATTGTTATCAACAATAAAAAAACCTATCTCATCATCGGAAACAAACACTCCACGAATACAGAGCTCCCT
GTTTCAGACTCTGAAGGATTTTTATCCTCCTCATGTTCTACCGCCATGATTAGATCCTCAACCTCAACACCTGAAAGAGCGACACAGAACCAGACCTCCCCACCC
ACGATTAGCAGACTTTTCAAACACAATCAAGAACAAGAGCCTTTTATCGAAGATCCTATTCCCTTGCAAATAGAAGAGCCTCAATCCGATCAGCAGGGAATAGGT
TTACAAGACACAGATCTCATTGAGGTTTTTGTGGAAGAAGACGTCTCGGTAGAGTTGTATCCCGAGGACACCAAAATTGACCCAGCTGTATATCTTCCCATGATC
TTCCCCTGGCTGACTGAGCACGGAATGTGCATTATGCCCATGCCTAGTAAACAGAAGACATCTCTTATTGCCAAAAAAAAGGCAAAATGGGCCAAGGAACTTCAA
AACTTACACTCCACGACAGAGCGCAAGTTTGTATTTAGACCCAGTGATCCCACTTCAGCTTCTAAGCTTGACGTGGAGTTCAGATTGATTAAAGGCCATAAAAAC
ACCAAGGAAGGATCAAATAAACCACCATGGCTTAATCTCACCAGGGACCAGGGTGGAAGTTACATTTTTGAGTTTAAAAACTTCTCCGATCTTCATGTTTGCAGG
GAGTTTGTGGGAAGTGCTCTAGCAAAGTCGGGAGAGCCTGCACAAACTCCCTCTGAGAAGCCTGTAGCCACATTTCCTCATGAACAACTCAGTAAATCAGAAATG
GAACTCCGAATGAGATGTTTGCAAGAGGATAGTGAACTGCAGAAACTTCATAAACAATTTGTGATTGGTGGTGTGTTGACCGAATCTGAGTTCTGGGCAGCAAGG
AAGAAATTACTGGAACGAGACAACTCCAAAAAGTCAAAACAGTTGGTTGGTTTCAAGAGTTCAATGGTTCTGGACACCAAACCAATGTCTGATGGTCGGGTCAGC
TTCATTACCTTTTCCCTACCCTATCAAACGTAG
Protein sequenceShow/hide protein sequence
MATSSAHSFKIERKTFSITTDQKNPSLFRLTEASKEKSFTLTLTNETLSWLQSCFDRLCNLPQTQKVFNEIRVDEAFGLTACINSRNTIWVSKEPRLFGKGTENT
SYKDALKTSQRKCDPPNHQQPLDVVPTPSNPQFAALYLSSSPHPARQSPPACKDEEQARVIANIKGWYKVGNFQVRFFPWSAEAMLSEPKVPSYGGWIKVRNLPL
GKWSIDIFKKIGDECGGYLETANKTLSRLDMMEIGIKVKENLLGFLPAIVHLPSTSNSPISVSIDPFFMEEYNIGYVIGIHGKIPSFSMASSATRADERIYVADN
ERVYNPRATTLNDETKGEKKQATYRSDFPQAPNDVLDTSTALMSASLFDRDPLAPSITTEPQSPNSSLDNHSVKVPNKTLPFDGNQRGPLGPPNTNPSHPPGPPK
SPIQCLAAHRKPIVINNKKTYLIIGNKHSTNTELPVSDSEGFLSSSCSTAMIRSSTSTPERATQNQTSPPTISRLFKHNQEQEPFIEDPIPLQIEEPQSDQQGIG
LQDTDLIEVFVEEDVSVELYPEDTKIDPAVYLPMIFPWLTEHGMCIMPMPSKQKTSLIAKKKAKWAKELQNLHSTTERKFVFRPSDPTSASKLDVEFRLIKGHKN
TKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEPAQTPSEKPVATFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAAR
KKLLERDNSKKSKQLVGFKSSMVLDTKPMSDGRVSFITFSLPYQT