; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000796 (gene) of Snake gourd v1 genome

Gene IDTan0000796
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBZIP domain-containing protein
Genome locationLG01:12179256..12180014
RNA-Seq ExpressionTan0000796
SyntenyTan0000796
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AOZ56991.1 bZIP2 [Citrullus lanatus]4.5e-6488.74Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASPVGSSSGSPSSDEDLR IVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFT QLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSSTRNL+DSE+H E  GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

KAA0044785.1 bZIP transcription factor 53 [Cucumis melo var. makuwa]1.0e-6389.47Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFTTQLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSSTR+LYD SE +DE  GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

KAG7015319.1 bZIP transcription factor 44, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-7576.04Show/hide
Query:  MSPIVSEILLSGFIINSALRRRTHLVQSFSVVFLYWFYKNPHRIPSPFSKENEKLCFPLFETRLVSMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNR
        MSP+V EILLSGF INSALRR THLVQS SVVFLYWFY                       TR VSMAS VG+ S S SSDEDLR IVD RKRKRMISNR
Subjt:  MSPIVSEILLSGFIINSALRRRTHLVQSFSVVFLYWFYKNPHRIPSPFSKENEKLCFPLFETRLVSMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNR

Query:  ESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWC
        ESARRSRMRKQKQLDDLT+Q S +++ENEQIAVN NFTTQLY+NLEAENSVLRAQMAELRHRLDSLNEII FI SSTRNLYD E HDEVS IDG VDSW 
Subjt:  ESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWC

Query:  FPFLNQPIMAAGDMFMC
         PFLNQPIMAAGDMFMC
Subjt:  FPFLNQPIMAAGDMFMC

XP_011653153.1 bZIP transcription factor 44 [Cucumis sativus]2.0e-6488.74Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASP+GSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFTTQLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSS+R++YDSE +DEV GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

XP_038897679.1 bZIP transcription factor 44-like [Benincasa hispida]6.3e-6690.73Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASPVGSSSGSPSSDEDLR IVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFTTQLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSSTRNL+DSE+H EVSGIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

TrEMBL top hitse value%identityAlignment
A0A0A0KZH6 BZIP domain-containing protein9.8e-6588.74Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASP+GSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFTTQLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSS+R++YDSE +DEV GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

A0A1I9RYK7 BZIP22.2e-6488.74Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASPVGSSSGSPSSDEDLR IVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFT QLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSSTRNL+DSE+H E  GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

A0A1S3BT51 bZIP transcription factor 536.3e-6489.47Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFTTQLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSSTR LYD SE +DE  GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

A0A5A7TTR6 BZIP transcription factor 534.9e-6489.47Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFTTQLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSSTR+LYD SE +DE  GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

A0A5D3CZ93 BZIP transcription factor 536.3e-6489.47Show/hide
Query:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL
        MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLT+QV QIR+ENEQIAVN NFTTQLY+NLEAENSVLRAQM ELRHRLDSL
Subjt:  MASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSL

Query:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
        NEII F+NSSTR LYD SE +DE  GIDGFVDSW FPFLNQPIMAAGD+FMC
Subjt:  NEIIGFINSSTRNLYD-SENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

SwissProt top hitse value%identityAlignment
C0Z2L5 bZIP transcription factor 448.3e-2950.63Show/hide
Query:  SMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDS
        S  S  G ++    SD   R ++D+RKRKR  SNRESARRSRMRKQK LDDLTAQV+ +R EN QI      TTQ Y+ +EAEN +LRAQ+ EL HRL S
Subjt:  SMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDS

Query:  LNEIIGFINSSTRNLYDSENHDEVSG--IDGFVDSWCFPFLNQPIMA----AGDMFMC
        LNEI+ F+ SS+             G   DG ++     F NQPIMA    AGD+F C
Subjt:  LNEIIGFINSSTRNLYDSENHDEVSG--IDGFVDSWCFPFLNQPIMA----AGDMFMC

O65683 bZIP transcription factor 114.6e-2749.68Show/hide
Query:  ASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLN
        +S + +SSGS  S      +++QRKRKRM+SNRESARRSRM+KQK LDDLTAQV+ ++ EN +I  + + TTQ Y+ +EAENSVLRAQ+ EL HRL SLN
Subjt:  ASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLN

Query:  EIIGFINSSTRNLYDSEN--HDEVSGI---DGFVDSWCFPF-LNQPIMAAGDMFM
        +II F++SS  N  ++     + + G+   D FV+     + +NQP+MA+ D  M
Subjt:  EIIGFINSSTRNLYDSEN--HDEVSGI---DGFVDSWCFPF-LNQPIMAAGDMFM

P24068 Ocs element-binding factor 11.4e-1233.82Show/hide
Query:  ASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLN
        +S +  ++G  S  +        R+ KR +SNRESARRSR+RKQ+ LD+L  +V++++++N ++A         Y  +E EN+VLRA+ AEL  RL S+N
Subjt:  ASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLN

Query:  EIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPF
        E++  +   +    D +  +E+   D  +  W  P+
Subjt:  EIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPF

Q9LZP8 bZIP transcription factor 532.4e-2044.06Show/hide
Query:  SPSSDEDLR--LIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGFIN
        SP SD D R   + D+RKRKRMISNRESARRSRMRKQKQL DL  +V+ ++++N +I    +  ++ Y+ +E++N+VLRAQ +EL  RL SLN ++  + 
Subjt:  SPSSDEDLR--LIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGFIN

Query:  SSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
          +    D     E        + W  P   QPI A+ DMF C
Subjt:  SSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

Q9SI15 bZIP transcription factor 29.5e-2547.71Show/hide
Query:  SSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGF
        SS G  ++  D  + VD+RKRKRM+SNRESARRSRMRKQK +DDLTAQ++Q+ ++N QI  +   T+QLYM ++AENSVL AQM EL  RL SLNEI+  
Subjt:  SSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGF

Query:  INSS-----TRNLYDSENHDEVSGIDGFVDSWCF----------PFLNQPIMA
        + S+        +      D   GIDG+ D               + NQPIMA
Subjt:  INSS-----TRNLYDSENHDEVSGIDGFVDSWCF----------PFLNQPIMA

Arabidopsis top hitse value%identityAlignment
AT1G75390.1 basic leucine-zipper 445.9e-3050.63Show/hide
Query:  SMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDS
        S  S  G ++    SD   R ++D+RKRKR  SNRESARRSRMRKQK LDDLTAQV+ +R EN QI      TTQ Y+ +EAEN +LRAQ+ EL HRL S
Subjt:  SMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDS

Query:  LNEIIGFINSSTRNLYDSENHDEVSG--IDGFVDSWCFPFLNQPIMA----AGDMFMC
        LNEI+ F+ SS+             G   DG ++     F NQPIMA    AGD+F C
Subjt:  LNEIIGFINSSTRNLYDSENHDEVSG--IDGFVDSWCFPFLNQPIMA----AGDMFMC

AT1G75390.2 basic leucine-zipper 446.8e-1857.78Show/hide
Query:  SMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQ
        S  S  G ++    SD   R ++D+RKRKR  SNRESARRSRMRKQK LDDLTAQV+ +R EN QI      TTQ Y+ +EAEN +LRAQ
Subjt:  SMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQ

AT2G18160.1 basic leucine-zipper 26.8e-2647.71Show/hide
Query:  SSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGF
        SS G  ++  D  + VD+RKRKRM+SNRESARRSRMRKQK +DDLTAQ++Q+ ++N QI  +   T+QLYM ++AENSVL AQM EL  RL SLNEI+  
Subjt:  SSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGF

Query:  INSS-----TRNLYDSENHDEVSGIDGFVDSWCF----------PFLNQPIMA
        + S+        +      D   GIDG+ D               + NQPIMA
Subjt:  INSS-----TRNLYDSENHDEVSGIDGFVDSWCF----------PFLNQPIMA

AT3G62420.1 basic region/leucine zipper motif 531.7e-2144.06Show/hide
Query:  SPSSDEDLR--LIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGFIN
        SP SD D R   + D+RKRKRMISNRESARRSRMRKQKQL DL  +V+ ++++N +I    +  ++ Y+ +E++N+VLRAQ +EL  RL SLN ++  + 
Subjt:  SPSSDEDLR--LIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGFIN

Query:  SSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC
          +    D     E        + W  P   QPI A+ DMF C
Subjt:  SSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC

AT4G34590.1 G-box binding factor 63.2e-2849.68Show/hide
Query:  ASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLN
        +S + +SSGS  S      +++QRKRKRM+SNRESARRSRM+KQK LDDLTAQV+ ++ EN +I  + + TTQ Y+ +EAENSVLRAQ+ EL HRL SLN
Subjt:  ASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRKQKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLN

Query:  EIIGFINSSTRNLYDSEN--HDEVSGI---DGFVDSWCFPF-LNQPIMAAGDMFM
        +II F++SS  N  ++     + + G+   D FV+     + +NQP+MA+ D  M
Subjt:  EIIGFINSSTRNLYDSEN--HDEVSGI---DGFVDSWCFPF-LNQPIMAAGDMFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCTATAGTCAGTGAGATCCTTCTCTCTGGTTTCATCATCAACTCCGCTCTCCGCCGCCGCACCCATCTCGTTCAATCCTTCTCTGTTGTATTCCTCTACTGGTT
CTACAAAAATCCACATCGAATTCCATCCCCTTTTTCCAAAGAAAACGAAAAGCTCTGTTTTCCTCTGTTTGAGACACGGCTAGTTTCAATGGCGTCTCCGGTAGGAAGTT
CATCCGGATCTCCGAGCTCCGACGAAGATCTGCGGCTAATCGTGGATCAGAGGAAAAGGAAGAGAATGATATCGAATCGAGAATCCGCTCGCCGATCTAGGATGCGAAAA
CAGAAGCAGCTCGACGATCTGACGGCTCAGGTGAGCCAGATCAGATCGGAGAATGAGCAAATCGCCGTCAATACCAATTTCACCACCCAACTTTACATGAATCTGGAGGC
GGAGAACTCGGTGCTCCGGGCACAGATGGCGGAACTCCGCCACAGATTGGACTCGCTCAACGAAATCATAGGGTTCATAAACTCGAGTACTAGAAATCTGTATGATTCTG
AGAATCATGATGAAGTTTCTGGCATTGATGGGTTTGTTGATTCTTGGTGTTTCCCCTTTCTTAACCAGCCAATCATGGCGGCTGGTGACATGTTCATGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCTATAGTCAGTGAGATCCTTCTCTCTGGTTTCATCATCAACTCCGCTCTCCGCCGCCGCACCCATCTCGTTCAATCCTTCTCTGTTGTATTCCTCTACTGGTT
CTACAAAAATCCACATCGAATTCCATCCCCTTTTTCCAAAGAAAACGAAAAGCTCTGTTTTCCTCTGTTTGAGACACGGCTAGTTTCAATGGCGTCTCCGGTAGGAAGTT
CATCCGGATCTCCGAGCTCCGACGAAGATCTGCGGCTAATCGTGGATCAGAGGAAAAGGAAGAGAATGATATCGAATCGAGAATCCGCTCGCCGATCTAGGATGCGAAAA
CAGAAGCAGCTCGACGATCTGACGGCTCAGGTGAGCCAGATCAGATCGGAGAATGAGCAAATCGCCGTCAATACCAATTTCACCACCCAACTTTACATGAATCTGGAGGC
GGAGAACTCGGTGCTCCGGGCACAGATGGCGGAACTCCGCCACAGATTGGACTCGCTCAACGAAATCATAGGGTTCATAAACTCGAGTACTAGAAATCTGTATGATTCTG
AGAATCATGATGAAGTTTCTGGCATTGATGGGTTTGTTGATTCTTGGTGTTTCCCCTTTCTTAACCAGCCAATCATGGCGGCTGGTGACATGTTCATGTGCTGA
Protein sequenceShow/hide protein sequence
MSPIVSEILLSGFIINSALRRRTHLVQSFSVVFLYWFYKNPHRIPSPFSKENEKLCFPLFETRLVSMASPVGSSSGSPSSDEDLRLIVDQRKRKRMISNRESARRSRMRK
QKQLDDLTAQVSQIRSENEQIAVNTNFTTQLYMNLEAENSVLRAQMAELRHRLDSLNEIIGFINSSTRNLYDSENHDEVSGIDGFVDSWCFPFLNQPIMAAGDMFMC