CuGenDBv2

Tan0018163 (gene) of Snake gourd v1 genome

Gene ID: Tan0018163
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Histone H3.2
Genome location: LG06:32589314..32602633
RNA-Seq Expression: Tan0018163
Synteny: Tan0018163
Gene Ontology terms:
  GO:0006996 - organelle organization (biological process)
  GO:0051321 - meiotic cell cycle (biological process)
  GO:0000786 - nucleosome (cellular component)
  GO:0005634 - nucleus (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
  IPR000164 - Histone H3/CENP-A
  IPR007125 - Histone H2A/H2B/H3
  IPR009072 - Histone-fold
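
The genome location field above packs the linkage group and the coordinate range into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end

seqid, start, end = parse_location("LG06:32589314..32602633")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```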


Homology
GenBank top hits | e value | % identity
XP_011659152.1 histone H3-1 isoform X1 [Cucumis sativus] | 8.2e-52 | 84.56
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARARHP +R SNR PSG+GA+ SS  APSTPL GRTQNVRQAQ+  SRT +KK+RFRPGTVAL+EIR LQKSWNLLIPAS FIRAVKEVS QLAPQITR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTI
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI

XP_011659153.1 histone H3-like centromeric protein HTR12 isoform X2 [Cucumis sativus] | 2.1e-63 | 86.36
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARARHP +R SNR PSG+GA+ SS  APSTPL GRTQNVRQAQ+  SRT +KK+RFRPGTVAL+EIR LQKSWNLLIPAS FIRAVKEVS QLAPQITR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

XP_022156549.1 histone H3-like centromeric protein HTR12 [Momordica charantia] | 4.3e-69 | 90.91
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARARHPAQRN+NRKP G+G +PSS A PSTPLGGRTQNVRQAQSPP+RT+ KKRRFRPGTVALREIRQ QK+WNLLIPAS FIRAVKEVSYQLAP+ITR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

XP_022959605.1 histone H3-like centromeric protein HTR12 [Cucurbita moschata] | 6.2e-68 | 91.56
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARA+HP QRNSNRKPS  GASPSS+AAPSTPL GRTQ+ RQ QSP SRTT KKRRFRPGTVALREIRQLQKSWNLLIPAS FIRAVKEVSYQLAPQ+TR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

XP_038896752.1 histone H3-like centromeric protein HTR12 [Benincasa hispida] | 6.9e-67 | 90.26
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARARHP QR SNR PSGTGA+ SS AAPSTPL GRTQNV QAQS P RTT+KK+RFRPGTVALREIR LQKSWNLLIPAS FIRAVKEVSYQLAPQITR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
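
The middle line of each alignment block can be tallied programmatically. The sketch below assumes the common BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the helper name `midline_stats` is hypothetical. It counts one block only, so its ratio is not expected to reproduce the whole-alignment % identity reported in the table.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, columns) for one alignment midline."""
    identical = sum(ch.isalpha() for ch in midline)   # letters mark identities
    positives = identical + midline.count("+")        # '+' marks similar residues
    return identical, positives, len(midline)

# Second block of the first GenBank hit (XP_011659152.1) above.
block = "WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTI"
identical, positives, columns = midline_stats(block)
print(identical, positives, columns)
```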

TrEMBL top hits | e value | % identity
A0A0A0K4F4 Histone H3.2 | 3.9e-52 | 84.56
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARARHP +R SNR PSG+GA+ SS  APSTPL GRTQNVRQAQ+  SRT +KK+RFRPGTVAL+EIR LQKSWNLLIPAS FIRAVKEVS QLAPQITR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTI
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI

A0A6J1DTT5 Histone H3.2 | 2.1e-69 | 90.91
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARARHPAQRN+NRKP G+G +PSS A PSTPLGGRTQNVRQAQSPP+RT+ KKRRFRPGTVALREIRQ QK+WNLLIPAS FIRAVKEVSYQLAP+ITR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

A0A6J1H6R9 Histone H3.2 | 3.0e-68 | 91.56
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARA+HP QRNSNRKPS  GASPSS+AAPSTPL GRTQ+ RQ QSP SRTT KKRRFRPGTVALREIRQLQKSWNLLIPAS FIRAVKEVSYQLAPQ+TR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

A0A6J1KTZ4 Histone H3.2 | 3.0e-68 | 91.56
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MARA+HP QRNSNRKPS  GASPSS+AAPSTPL GRTQ+ RQ QSP SRTT KKRRFRPGTVALREIRQLQKSWNLLIPAS FIRAVKEVSYQLAPQ+TR
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

B7XEI8 Centromere specific histone H3 variant | 4.2e-46 | 68.15
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRT---QNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQ
        MAR +H A R  +R PS   A+ S+AAA S+     T    + R A S P RT +KK R+RPGTVALREIR+ QK+WNLLIPA+ FIR VKE+SY  AP+
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRT---QNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQ

Query:  ITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        +TRWQAEALIALQEAAEDFLVHLF+D+MLCAIHAKRVT+MKKDFELARRLGGK RPW
Subjt:  ITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

SwissProt top hits | e value | % identity
Q0MXD1 Histone H3-like centromeric protein CSE4 | 9.8e-24 | 60.18
Query:  QAQSPP---SRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEV--SYQLAPQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI
        Q   PP   S  T  KRR+RPGT ALREIR+ Q+S  LLI    F R VKEV  +Y  A    RWQ+ A++ALQEA E FLVHL EDT LCAIHAKRVTI
Subjt:  QAQSPP---SRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEV--SYQLAPQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI

Query:  MKKDFELARRLGG
        M+KD +LARR+ G
Subjt:  MKKDFELARRLGG

Q59LN9 Histone H3-like centromeric protein CSE4 | 8.3e-23 | 46.75
Query:  AQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQ---------AQSPPSRTT-----------RKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAV
        A+R   R+   T  SP   AA S+    +    R          A S P RTT           R K+R+RPGT ALREIRQ QKS +LLI    F R V
Subjt:  AQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQ---------AQSPPSRTT-----------RKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAV

Query:  KEVSYQ-LAPQI-TRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        +E+S   + P    RWQ+ A++ALQEA+E FL+HL EDT LCAIHAKRVTIM+KD +LARR+  +G+ W
Subjt:  KEVSYQ-LAPQI-TRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

Q7RXR3 Histone H3-like centromeric protein hH3v | 3.6e-26 | 52.63
Query:  SPSSAAAP---STPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAP--QITRWQAEALIALQEAAEDF
        S  +AA P   +TP G R       Q        KKRR+RPGT+AL+EIR  Q++ +LL+    F R V+E++ Q  P  +  RWQ++A++ALQEAAE F
Subjt:  SPSSAAAP---STPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAP--QITRWQAEALIALQEAAEDF

Query:  LVHLFEDTMLCAIHAKRVTIMKKDFELARRLGG
        LVHLFEDT LCAIHAKRVTIM+KD +LARR+ G
Subjt:  LVHLFEDTMLCAIHAKRVTIMKKDFELARRLGG

Q8RVQ9 Histone H3-like centromeric protein HTR12 | 5.9e-37 | 53.93
Query:  MARARHPAQRNSNR-KPSGTGASPSSAAAPSTPL-------GGRTQNVRQAQSPPSRTTR---------------KKRRFRPGTVALREIRQLQKSWNLL
        MAR +H   R+  R +    GAS S AA P+T         G  TQ      SP + T R               K  R+RPGTVAL+EIR  QK  NLL
Subjt:  MARARHPAQRNSNR-KPSGTGASPSSAAAPSTPL-------GGRTQNVRQAQSPPSRTTR---------------KKRRFRPGTVALREIRQLQKSWNLL

Query:  IPASRFIRAVKEVSYQLA-PQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        IPA+ FIR V+ +++ LA PQI RW AEAL+ALQEAAED+LV LF D+MLCAIHA+RVT+M+KDFELARRLGGKGRPW
Subjt:  IPASRFIRAVKEVSYQLA-PQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

Q9Y812 Histone H3-like centromeric protein cnp1 | 9.8e-24 | 58.82
Query:  KKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQIT-----RWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRL
        +K+R+RPGT ALREIR+ Q+S +LLI    F R V+E+S +     +     RWQ+ AL  LQEAAE FLVHLFEDT LCAIHAKRVTIM++D +LARR+
Subjt:  KKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQIT-----RWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRL

Query:  GG
         G
Subjt:  GG

Arabidopsis top hits | e value | % identity
AT1G01370.1 Histone superfamily protein | 4.2e-38 | 53.93
Query:  MARARHPAQRNSNR-KPSGTGASPSSAAAPSTPL-------GGRTQNVRQAQSPPSRTTR---------------KKRRFRPGTVALREIRQLQKSWNLL
        MAR +H   R+  R +    GAS S AA P+T         G  TQ      SP + T R               K  R+RPGTVAL+EIR  QK  NLL
Subjt:  MARARHPAQRNSNR-KPSGTGASPSSAAAPSTPL-------GGRTQNVRQAQSPPSRTTR---------------KKRRFRPGTVALREIRQLQKSWNLL

Query:  IPASRFIRAVKEVSYQLA-PQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        IPA+ FIR V+ +++ LA PQI RW AEAL+ALQEAAED+LV LF D+MLCAIHA+RVT+M+KDFELARRLGGKGRPW
Subjt:  IPASRFIRAVKEVSYQLA-PQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

AT1G01370.2 Histone superfamily protein | 4.2e-38 | 53.93
Query:  MARARHPAQRNSNR-KPSGTGASPSSAAAPSTPL-------GGRTQNVRQAQSPPSRTTR---------------KKRRFRPGTVALREIRQLQKSWNLL
        MAR +H   R+  R +    GAS S AA P+T         G  TQ      SP + T R               K  R+RPGTVAL+EIR  QK  NLL
Subjt:  MARARHPAQRNSNR-KPSGTGASPSSAAAPSTPL-------GGRTQNVRQAQSPPSRTTR---------------KKRRFRPGTVALREIRQLQKSWNLL

Query:  IPASRFIRAVKEVSYQLA-PQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        IPA+ FIR V+ +++ LA PQI RW AEAL+ALQEAAED+LV LF D+MLCAIHA+RVT+M+KDFELARRLGGKGRPW
Subjt:  IPASRFIRAVKEVSYQLA-PQITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

AT1G09200.1 Histone superfamily protein | 4.2e-22 | 47.33
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MAR +  A++++  K            AP   L   T+  R++ +P +   +K  RFRPGTVALREIR+ QKS  LLI    F R V+E++      + R
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGK
        +Q+ A+ ALQEAAE +LV LFEDT LCAIHAKRVTIM KD +LARR+ G+
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGK

AT1G19890.1 male-gamete-specific histone H3 | 2.5e-22 | 48
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MAR +  A     RK +G G  P    A        T+  R+ + P     ++  RFRPGTVALREIR+ QKS +LLI    F R V+E++      + R
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGK
        +Q+ A++ALQEAAE +LV LFEDT LCAIHAKRVTIM KD +LARR+ G+
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGK

AT1G75600.1 Histone superfamily protein | 4.2e-22 | 47.33
Query:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR
        MAR +  A+++   K            AP T L   T+  R++ +P +   +K  R+RPGTVALREIR+ QKS  LLI    F R V+E++      + R
Subjt:  MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGK
        +Q+ A++ALQEAAE +LV LFEDT LCAIHAKRVTIM KD +LARR+ G+
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGK


Sequences
CDS sequence
ATGGCGCGGGCAAGGCATCCAGCCCAAAGGAACTCAAATCGCAAGCCATCAGGTACTGGAGCTTCACCGTCTTCCGCAGCTGCGCCGTCGACGCCACTTGGTGGAAGAAC
ACAAAATGTGAGGCAAGCTCAAAGTCCACCATCAAGGACAACGCGAAAAAAAAGACGCTTCAGACCAGGGACAGTGGCATTAAGGGAAATTCGGCAACTCCAGAAATCAT
GGAATTTGCTAATTCCAGCTAGCCGTTTCATCCGAGCAGTGAAAGAAGTAAGCTACCAGTTGGCTCCACAGATTACGCGTTGGCAAGCTGAAGCTTTAATAGCACTTCAG
GAAGCAGCAGAAGATTTTTTGGTTCATCTATTTGAAGATACAATGCTATGTGCTATTCATGCCAAGCGTGTAACAATCATGAAAAAGGATTTTGAACTGGCACGTCGGTT
AGGAGGGAAAGGGAGGCCATGGTAA
mRNA sequence
GCCACTTTTGCCCAAAATTGAAACAGCACAGAGCTCCCTCCAATTCTTTTCTGCCTGCTCGTCACTCTCACGCTACTGCTTCTAATTTCAAATCAATGGCGCGGGCAAGG
CATCCAGCCCAAAGGAACTCAAATCGCAAGCCATCAGGTACTGGAGCTTCACCGTCTTCCGCAGCTGCGCCGTCGACGCCACTTGGTGGAAGAACACAAAATGTGAGGCA
AGCTCAAAGTCCACCATCAAGGACAACGCGAAAAAAAAGACGCTTCAGACCAGGGACAGTGGCATTAAGGGAAATTCGGCAACTCCAGAAATCATGGAATTTGCTAATTC
CAGCTAGCCGTTTCATCCGAGCAGTGAAAGAAGTAAGCTACCAGTTGGCTCCACAGATTACGCGTTGGCAAGCTGAAGCTTTAATAGCACTTCAGGAAGCAGCAGAAGAT
TTTTTGGTTCATCTATTTGAAGATACAATGCTATGTGCTATTCATGCCAAGCGTGTAACAATCATGAAAAAGGATTTTGAACTGGCACGTCGGTTAGGAGGGAAAGGGAG
GCCATGGTAAGAAGATATGTTCATTAGAAAGAGTAGCCATCGTAGCGACACCGACATCACCACATTAACTGATGATGGCGTAGGGGACAACTTCTGTAGTTGATTTCTGG
TTTCAAGTTCAACCATGGCAGAATGGAGGATTGAACTATCAATTTTTAAGATGGTAATTGGTATCTTATCCTACTGAGCTATCTTTAGATTGATTGTAGTTAATTTGTGA
TTTGTAACTTTGTTTCATTATGTTATCAAAATACAACATTTTGCTGGCAGCTATGCAAAAAAACTAGACACCATTTTTTGGTTAGCTCAAACTATGAAGTGGTAACGAAA
A
Protein sequence
MARARHPAQRNSNRKPSGTGASPSSAAAPSTPLGGRTQNVRQAQSPPSRTTRKKRRFRPGTVALREIRQLQKSWNLLIPASRFIRAVKEVSYQLAPQITRWQAEALIALQ
EAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
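
The CDS and protein sequences listed above can be cross-checked by translating the CDS. This is a minimal sketch assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 nucleotides and the last 9 nucleotides of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()   # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":                    # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGCGGGCAAGGCATCCAGCCCAAAGG"))  # first 30 nt of the CDS: MARARHPAQR
print(translate("CCATGGTAA"))                       # last 9 nt of the CDS: PW
```

Pasting the full CDS (with its line breaks) into `translate` should allow a direct comparison against the protein sequence, since whitespace is stripped before translation.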