; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010900 (gene) of Snake gourd v1 genome

Gene IDTan0010900
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionbZIP transcription factor 53-like
Genome locationLG04:81072561..81074974
RNA-Seq ExpressionTan0010900
SyntenyTan0010900
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022949400.1 bZIP transcription factor 53-like [Cucurbita moschata]6.1e-6294.37Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP+Q+S GSNGGSP AIPDERKRKRMQSNRESARRSRMRKQKQ+EDLTGEVSRLQIANNQLLQSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVEEVSGLAMDIPEIPDPLLKPW+FSRPVLP ADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

XP_022998313.1 bZIP transcription factor 53-like [Cucurbita maxima]2.8e-6295.07Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP+Q+S GSNGGSP AIPDERKRKRMQSNRESARRSRMRKQKQ+EDLTGEVSRLQIANNQLLQSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLP ADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

XP_023525883.1 bZIP transcription factor 53-like [Cucurbita pepo subsp. pepo]8.0e-6294.37Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP+Q+S GSNGGSP AIPDERKRKRMQSNRESARRSRMRKQKQ+EDLTGEVSRLQIANNQLLQSI AKEQAFVQVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLP ADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

XP_023529126.1 bZIP transcription factor 53-like [Cucurbita pepo subsp. pepo]1.2e-6092.96Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP++SS GSNGGS  AIPDERKRKRMQSNRESARRSRM+KQKQ+EDLTGE+SRLQIANNQLLQSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRP LPVADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

XP_038905019.1 bZIP transcription factor 53 [Benincasa hispida]6.1e-6294.37Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP+Q+SSGSNGGSPSA+PDERKRKRMQSNRESARRSRMRKQKQLEDL GEVSRLQIANNQL+QSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVL IVEEVSGLAMDIPEIPDPLLKPWE SRPVLPVADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

TrEMBL top hitse value%identityAlignment
A0A0A0KWP0 BZIP domain-containing protein1.4e-5992.31Show/hide
Query:  MASIPKQSSSGSNGGS-PSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSL
        MASIP+Q+SSGSNG S PSA+PDERKRKRMQSNRESARRSRMRKQKQLEDL GEVSRLQ ANNQL+QSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSL
Subjt:  MASIPKQSSSGSNGGS-PSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSL

Query:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        NSVL IVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVAD FLC
Subjt:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

A0A6J1F1P1 bZIP transcription factor 53-like6.2e-6091.55Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP++SS GSNGG   AIPDERKRKRMQSNRESARRSRM+KQKQ+EDLTGE+SRLQIANNQLLQSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVE+VSGLAMDIPEIPDPLLKPWEFSRP LPVADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

A0A6J1GCN2 bZIP transcription factor 53-like3.0e-6294.37Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP+Q+S GSNGGSP AIPDERKRKRMQSNRESARRSRMRKQKQ+EDLTGEVSRLQIANNQLLQSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVEEVSGLAMDIPEIPDPLLKPW+FSRPVLP ADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

A0A6J1J475 bZIP transcription factor 53-like1.4e-5990.85Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP++SS GSNGGS  AIPDERKRKRMQSNRESARRSRM+KQKQ+EDLTGE+SRLQ+ANNQLLQSI AKEQAFVQVDNMNNVLRAQA+EL DRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRP LPVADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

A0A6J1KGE0 bZIP transcription factor 53-like1.3e-6295.07Show/hide
Query:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
        MASIP+Q+S GSNGGSP AIPDERKRKRMQSNRESARRSRMRKQKQ+EDLTGEVSRLQIANNQLLQSI AKEQAFVQVDNMNNVLRAQAMELTDRLRSLN
Subjt:  MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC
        SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLP ADMFLC
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC

SwissProt top hitse value%identityAlignment
C0Z2L5 bZIP transcription factor 442.4e-1642Show/hide
Query:  SNGGSPSAIP-----DERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIV
        +N GS S +      DERKRKR QSNRESARRSRMRKQK L+DLT +V+ L+  N Q++  IA   Q +V ++  N++LRAQ +EL  RL+SLN ++  V
Subjt:  SNGGSPSAIP-----DERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIV

Query:  E-EVSGLAMDIPE------IPDPLLKPWE---FSRPVLPVA----DMFLC
        E   SG  M+  +      + D ++ P     +++P++  A    D+F C
Subjt:  E-EVSGLAMDIPE------IPDPLLKPWE---FSRPVLPVA----DMFLC

O65683 bZIP transcription factor 116.5e-1446Show/hide
Query:  SSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIVE
        S+  ++ GS  ++ ++RKRKRM SNRESARRSRM+KQK L+DLT +V+ L+  N +++ S++   Q ++ V+  N+VLRAQ  EL  RL+SLN +++ ++
Subjt:  SSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIVE

P24068 Ocs element-binding factor 11.0e-1950.39Show/hide
Query:  SSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIVEE
        +SGS+G   SA    R+ KR  SNRESARRSR+RKQ+ L++L  EV+RLQ  N ++          + +V+  N VLRA+A EL DRLRS+N VL++VEE
Subjt:  SSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIVEE

Query:  VSGLAMDIPE---IPDPLLKPWEFSRP
         SG+AMDI E     DPLL+PW+   P
Subjt:  VSGLAMDIPE---IPDPLLKPWEFSRP

Q9LZP8 bZIP transcription factor 536.4e-3054.11Show/hide
Query:  MASIPKQSSSGS-NGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSL
        M S+  Q+S  S N    + + DERKRKRM SNRESARRSRMRKQKQL DL  EV+ L+  N ++ + +    + ++++++ NNVLRAQA ELTDRLRSL
Subjt:  MASIPKQSSSGS-NGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSL

Query:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPV---ADMFLC
        NSVL++VEE+SG A+DIPEIP+ +  PW+   P+ P+   ADMF C
Subjt:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPV---ADMFLC

Q9SI15 bZIP transcription factor 22.6e-1546.61Show/hide
Query:  QSSSGSNGG--SPS---AIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNS
        +SSS S+GG  +PS      DERKRKRM SNRESARRSRMRKQK ++DLT ++++L   N Q+L S+    Q ++++   N+VL AQ  EL+ RL+SLN 
Subjt:  QSSSGSNGG--SPS---AIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNS

Query:  VLQIVEEVSGLAMDIPEI
        ++ +V+  +G    + +I
Subjt:  VLQIVEEVSGLAMDIPEI

Arabidopsis top hitse value%identityAlignment
AT1G75390.1 basic leucine-zipper 441.7e-1742Show/hide
Query:  SNGGSPSAIP-----DERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIV
        +N GS S +      DERKRKR QSNRESARRSRMRKQK L+DLT +V+ L+  N Q++  IA   Q +V ++  N++LRAQ +EL  RL+SLN ++  V
Subjt:  SNGGSPSAIP-----DERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIV

Query:  E-EVSGLAMDIPE------IPDPLLKPWE---FSRPVLPVA----DMFLC
        E   SG  M+  +      + D ++ P     +++P++  A    D+F C
Subjt:  E-EVSGLAMDIPE------IPDPLLKPWE---FSRPVLPVA----DMFLC

AT1G75390.2 basic leucine-zipper 448.1e-1253.66Show/hide
Query:  SNGGSPSAIP-----DERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQ
        +N GS S +      DERKRKR QSNRESARRSRMRKQK L+DLT +V+ L+  N Q++  IA   Q +V ++  N++LRAQ
Subjt:  SNGGSPSAIP-----DERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQ

AT2G18160.1 basic leucine-zipper 21.9e-1646.61Show/hide
Query:  QSSSGSNGG--SPS---AIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNS
        +SSS S+GG  +PS      DERKRKRM SNRESARRSRMRKQK ++DLT ++++L   N Q+L S+    Q ++++   N+VL AQ  EL+ RL+SLN 
Subjt:  QSSSGSNGG--SPS---AIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNS

Query:  VLQIVEEVSGLAMDIPEI
        ++ +V+  +G    + +I
Subjt:  VLQIVEEVSGLAMDIPEI

AT3G62420.1 basic region/leucine zipper motif 534.6e-3154.11Show/hide
Query:  MASIPKQSSSGS-NGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSL
        M S+  Q+S  S N    + + DERKRKRM SNRESARRSRMRKQKQL DL  EV+ L+  N ++ + +    + ++++++ NNVLRAQA ELTDRLRSL
Subjt:  MASIPKQSSSGS-NGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSL

Query:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPV---ADMFLC
        NSVL++VEE+SG A+DIPEIP+ +  PW+   P+ P+   ADMF C
Subjt:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWEFSRPVLPV---ADMFLC

AT4G34590.1 G-box binding factor 64.6e-1546Show/hide
Query:  SSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIVE
        S+  ++ GS  ++ ++RKRKRM SNRESARRSRM+KQK L+DLT +V+ L+  N +++ S++   Q ++ V+  N+VLRAQ  EL  RL+SLN +++ ++
Subjt:  SSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTATTCCAAAGCAAAGCAGCTCGGGATCTAACGGAGGTTCGCCATCTGCCATTCCTGATGAGAGGAAGAGGAAGCGGATGCAATCCAACAGAGAATCGGCTCG
GAGATCTCGGATGAGAAAGCAGAAGCAATTGGAGGATCTCACGGGCGAGGTAAGCCGATTACAAATTGCGAATAATCAGCTTCTGCAGAGCATTGCTGCCAAGGAGCAAG
CGTTTGTCCAGGTCGACAACATGAACAATGTTCTCAGGGCGCAAGCTATGGAGCTGACCGATCGCCTGCGGTCCCTAAACTCGGTCCTTCAGATTGTGGAGGAGGTTAGC
GGACTTGCCATGGATATCCCTGAAATTCCCGATCCTCTCTTGAAGCCGTGGGAGTTCTCTCGCCCAGTTCTGCCCGTCGCTGATATGTTTCTGTGTTAG
mRNA sequenceShow/hide mRNA sequence
TCGCAATTCGCTCGCAGCCTTCAACCAATCAGCTAAGTCTCCTTTTTCCTTTTCCTTTCTCTCTCTTTTTTCAGTCGTTGTTTTTATCTTCCTTCTTTCTTTCTTTCTTT
CCTTGTTCTTGTGTTTCTTTTCCAATTCTCTTCTTGAATTGGAATCTTTCAGATTTCTGTGTTGAAATCTTCGTCGGAGTTCTTGATTCGCTCTTGATTTTGTGGTTTTT
CTCGATCTGTTTGGTTTCGCCGTTCTTTGATTCATTTCCCTTGCTTCGTGTTTGTTTCTCTTGCTCTGGTTCTAAAATTCAGGGTTAAGTTCTTCTTTCCTAGGGTTTTG
TTGATTACCTCGATGATTTTAGGGTTTCTATGACGTTCTCTTTACTAAGGCGACGTATTCGTATCCTCCACTCGTTTTCGGTTGTGTTTCTGTACTGGTTCTACGTCTTC
TCGTGAACTAATCGCCTTATTCGTCGTGGTGAAATTCGATTGATCTCTTCTCGCGGTTAGCCTGAGTTGTAGATCGGAAGATTCATTTGATTCTTATCAGATTAGAGACT
AAATTAACTCTTGACTCGAGATTGCTTGGTAATATTTTGGCGGGAAGTATTTGAAAGAGAGAATTGTATGGCTTCTATTCCAAAGCAAAGCAGCTCGGGATCTAACGGAG
GTTCGCCATCTGCCATTCCTGATGAGAGGAAGAGGAAGCGGATGCAATCCAACAGAGAATCGGCTCGGAGATCTCGGATGAGAAAGCAGAAGCAATTGGAGGATCTCACG
GGCGAGGTAAGCCGATTACAAATTGCGAATAATCAGCTTCTGCAGAGCATTGCTGCCAAGGAGCAAGCGTTTGTCCAGGTCGACAACATGAACAATGTTCTCAGGGCGCA
AGCTATGGAGCTGACCGATCGCCTGCGGTCCCTAAACTCGGTCCTTCAGATTGTGGAGGAGGTTAGCGGACTTGCCATGGATATCCCTGAAATTCCCGATCCTCTCTTGA
AGCCGTGGGAGTTCTCTCGCCCAGTTCTGCCCGTCGCTGATATGTTTCTGTGTTAGTCAGTCAGGTCCATCTTAAAGGGCTTAGGGACGCCCGATTTCCGAGTATTTTGT
CTAATAGAACCATGGTCGTCTGTGAATTTATCGCAGAACTTAGGATTCTGCTGTGTGTTGGTGTTTGCGTGTCATGTAGTAGTGGTTTTTTGTATCTAGAACGTTGTACT
TTATGTGTTTTTGATTTATGGTAATGATATACTGATATTATCTGTATTTGGGGGAACTTCTATATTTGCTCCAGCGATTTCTCACTGGGTAATTGCAGAACCAATGACAT
AGAATTTGGTCTTTATGTATTGAATATTGCCTTTTATCTCTTGCTATTAGAATTTCGGCTGTTATTTTGAACTGAGTCAAATTCTCTATCTCTCTACTACTAATTTGATT
TAGATTGTCAGGAATTTGCCGCCTCACCTTTTGAATTGGTTCATTACAGTTGAATCAAAAAGTCTCGGGGTTGATTCTGATTCATGGTTATCCCAGATGGGGAAAGGAAT
ATGCCTTGATCATATGCTGCTGGAGATTGGAGAACCCCAGATGAGTACTGTGGCTTATACTAATCCATTATATAATCATTCATGGGGTTGATTTTTCTTTATTGAATTTG
TTTGATTTTATCCTCTGGAATCTTCTTTCTGATTTCTTTTTTTGGAGCTCCTTTCTGGACATTTGTAGGTGGAGTTGGTAGGAAATTATTTAATGGGAAAAAGTAGAAAT
ACTGTAAACATGGTCTCCCTTTAATTAATACCTTATTAAATTGTAAAGTAAAAGATACAAAAAGGAAGTGTTTGGATATTATTATGGATAGAAGTCAAAGTAGTCCTAAA
ACTTATGCTGATTCTTTTGTTGCTTATCTTTGAACAAATTTGACCTGCTTGCCCTTTTGGTTCCGTGAGCAGTTTAGCTTATCCAATGTGTTAAATCGTCTTATTTCTCT
TCCAGAATATGCCGATTCTTTTCTTTATAAAATAATCTTGTTCTCTAAAATAGTCTTCTCGCGACAAATGGCGTAGATGTAAAAAATTACGTGTTTGCTTTGAACATAAT
TTTCCATTTTCTATGGCTTCTCTTCCTTTGGGAAAAGAAAACTAGCAACTGCCACTTGTTTCTCGTATTTACCTTTATTTTTAAATAAACTAAAGCTATTTTCAAATATT
GTTTTCGGTTTTTAGAACAAAAATCTAGTGTGTATTGTTAGTTTCTTTTTTAAAGCTGTCATCCCAGGTTCTTTAGATATTTTTATAACAGTTTATCATTTGGAATGTCA
GTCAACGTGACTAGGTTAAATCTATCGACTTTCAAGTTTGACTCTATTTAGTCCTTAAATTTTATCAATCAAGCCTCTCTAATATGCAAACTTATCAATAATTG
Protein sequenceShow/hide protein sequence
MASIPKQSSSGSNGGSPSAIPDERKRKRMQSNRESARRSRMRKQKQLEDLTGEVSRLQIANNQLLQSIAAKEQAFVQVDNMNNVLRAQAMELTDRLRSLNSVLQIVEEVS
GLAMDIPEIPDPLLKPWEFSRPVLPVADMFLC