; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g05440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g05440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionHistone H3.2
Genome locationchr7:4548374..4553873
RNA-Seq ExpressionMoc07g05440
SyntenyMoc07g05440
Gene Ontology termsGO:0006996 - organelle organization (biological process)
GO:0051321 - meiotic cell cycle (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR000164 - Histone H3/CENP-A
IPR007125 - Histone H2A/H2B/H3
IPR009072 - Histone-fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011659152.1 histone H3-1 isoform X1 [Cucumis sativus]6.9e-5180.88Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARARHP +R +NR P GSG A SSP  PSTPL GRTQNVRQAQ+  +RT  KK+RFRPGTVAL+EIR  QK+WNLLIPAS FIRAVKEVS QLAP+ITR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTI
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI

XP_011659153.1 histone H3-like centromeric protein HTR12 isoform X2 [Cucumis sativus]1.8e-6283.12Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARARHP +R +NR P GSG A SSP  PSTPL GRTQNVRQAQ+  +RT  KK+RFRPGTVAL+EIR  QK+WNLLIPAS FIRAVKEVS QLAP+ITR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

XP_022156549.1 histone H3-like centromeric protein HTR12 [Momordica charantia]6.6e-78100Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

XP_022959605.1 histone H3-like centromeric protein HTR12 [Cucurbita moschata]7.6e-6685.71Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARA+HP QRN+NRKP  +G +PSS A PSTPL GRTQ+ RQ QSP +RT+GKKRRFRPGTVALREIRQ QK+WNLLIPAS+FIRAVKEVSYQLAP++TR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

XP_038896752.1 histone H3-like centromeric protein HTR12 [Benincasa hispida]2.9e-6585.71Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARARHP QR +NR P G+G A SSPA PSTPL GRTQNV QAQS P RT+ KK+RFRPGTVALREIR  QK+WNLLIPAS FIRAVKEVSYQLAP+ITR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

TrEMBL top hitse value%identityAlignment
A0A0A0K4F4 Histone H3.23.3e-5180.88Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARARHP +R +NR P GSG A SSP  PSTPL GRTQNVRQAQ+  +RT  KK+RFRPGTVAL+EIR  QK+WNLLIPAS FIRAVKEVS QLAP+ITR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI
        WQAEAL+ALQEAAEDFLVHLFEDTMLCAIHAKRVTI
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTI

A0A6J1DTT5 Histone H3.23.2e-78100Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

A0A6J1H6R9 Histone H3.23.7e-6685.71Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARA+HP QRN+NRKP  +G +PSS A PSTPL GRTQ+ RQ QSP +RT+GKKRRFRPGTVALREIRQ QK+WNLLIPAS+FIRAVKEVSYQLAP++TR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

A0A6J1KTZ4 Histone H3.23.7e-6685.71Show/hide
Query:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR
        MARA+HP QRN+NRKP  +G +PSS A PSTPL GRTQ+ RQ QSP +RT+GKKRRFRPGTVALREIRQ QK+WNLLIPAS+FIRAVKEVSYQLAP++TR
Subjt:  MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITR

Query:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
Subjt:  WQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

B7XEI8 Centromere specific histone H3 variant2.9e-4769.38Show/hide
Query:  MARARHPAQRNANRKP------LGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQL
        MAR +H A R  +R P        +  A SS A  STP   RT   R A S P RT  KK R+RPGTVALREIR+FQKTWNLLIPA+ FIR VKE+SY  
Subjt:  MARARHPAQRNANRKP------LGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQL

Query:  APEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        APE+TRWQAEALIALQEAAEDFLVHLF+D+MLCAIHAKRVT+MKKDFELARRLGGK RPW
Subjt:  APEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

SwissProt top hitse value%identityAlignment
P15512 Histone H3.31.3e-2354.24Show/hide
Query:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK
        R Q   +A    A  SG   K  +FRPGTVALREIR++QKT +LLI    F R V++++ ++  +I R+Q++A++ALQEAAE +LV LFEDT LCAIHA+
Subjt:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK

Query:  RVTIMKKDFELARRLGGK
        RVTIM KD +LARR+ G+
Subjt:  RVTIMKKDFELARRLGGK

Q59LN9 Histone H3-like centromeric protein CSE46.8e-2559.22Show/hide
Query:  KRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQ-LAPEI-TRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKG
        K+R+RPGT ALREIRQ+QK+ +LLI    F R V+E+S   + P    RWQ+ A++ALQEA+E FL+HL EDT LCAIHAKRVTIM+KD +LARR+  +G
Subjt:  KRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQ-LAPEI-TRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKG

Query:  RPW
        + W
Subjt:  RPW

Q7RXR3 Histone H3-like centromeric protein hH3v3.5e-2950.34Show/hide
Query:  PAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAP--EITRWQAE
        P +    +    S  A + P   +TP G R       Q       GKKRR+RPGT+AL+EIR +Q+T +LL+    F R V+E++ Q  P  E  RWQ++
Subjt:  PAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAP--EITRWQAE

Query:  ALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGG
        A++ALQEAAE FLVHLFEDT LCAIHAKRVTIM+KD +LARR+ G
Subjt:  ALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGG

Q8RVQ9 Histone H3-like centromeric protein HTR121.2e-3752.81Show/hide
Query:  MARARHPAQRNANRKPL-GSGVAPSSPAGPSTPL-------GGRTQNVRQAQSP---------------PARTSGKKRRFRPGTVALREIRQFQKTWNLL
        MAR +H   R+  R     +G + S  AGP+T         G  TQ      SP               P  +  K  R+RPGTVAL+EIR FQK  NLL
Subjt:  MARARHPAQRNANRKPL-GSGVAPSSPAGPSTPL-------GGRTQNVRQAQSP---------------PARTSGKKRRFRPGTVALREIRQFQKTWNLL

Query:  IPASSFIRAVKEVSYQLA-PEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        IPA+SFIR V+ +++ LA P+I RW AEAL+ALQEAAED+LV LF D+MLCAIHA+RVT+M+KDFELARRLGGKGRPW
Subjt:  IPASSFIRAVKEVSYQLA-PEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

Q9Y812 Histone H3-like centromeric protein cnp12.6e-2457.84Show/hide
Query:  KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEIT-----RWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRL
        +K+R+RPGT ALREIR++Q++ +LLI    F R V+E+S +     +     RWQ+ AL  LQEAAE FLVHLFEDT LCAIHAKRVTIM++D +LARR+
Subjt:  KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEIT-----RWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRL

Query:  GG
         G
Subjt:  GG

Arabidopsis top hitse value%identityAlignment
AT1G01370.1 Histone superfamily protein8.4e-3952.81Show/hide
Query:  MARARHPAQRNANRKPL-GSGVAPSSPAGPSTPL-------GGRTQNVRQAQSP---------------PARTSGKKRRFRPGTVALREIRQFQKTWNLL
        MAR +H   R+  R     +G + S  AGP+T         G  TQ      SP               P  +  K  R+RPGTVAL+EIR FQK  NLL
Subjt:  MARARHPAQRNANRKPL-GSGVAPSSPAGPSTPL-------GGRTQNVRQAQSP---------------PARTSGKKRRFRPGTVALREIRQFQKTWNLL

Query:  IPASSFIRAVKEVSYQLA-PEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        IPA+SFIR V+ +++ LA P+I RW AEAL+ALQEAAED+LV LF D+MLCAIHA+RVT+M+KDFELARRLGGKGRPW
Subjt:  IPASSFIRAVKEVSYQLA-PEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

AT1G01370.2 Histone superfamily protein8.4e-3952.81Show/hide
Query:  MARARHPAQRNANRKPL-GSGVAPSSPAGPSTPL-------GGRTQNVRQAQSP---------------PARTSGKKRRFRPGTVALREIRQFQKTWNLL
        MAR +H   R+  R     +G + S  AGP+T         G  TQ      SP               P  +  K  R+RPGTVAL+EIR FQK  NLL
Subjt:  MARARHPAQRNANRKPL-GSGVAPSSPAGPSTPL-------GGRTQNVRQAQSP---------------PARTSGKKRRFRPGTVALREIRQFQKTWNLL

Query:  IPASSFIRAVKEVSYQLA-PEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW
        IPA+SFIR V+ +++ LA P+I RW AEAL+ALQEAAED+LV LF D+MLCAIHA+RVT+M+KDFELARRLGGKGRPW
Subjt:  IPASSFIRAVKEVSYQLA-PEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW

AT4G40030.1 Histone superfamily protein3.4e-2454.24Show/hide
Query:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK
        R Q   +A    A T+G   K  R+RPGTVALREIR++QK+  LLI    F R V+E++     ++ R+Q+ A++ALQEAAE +LV LFEDT LCAIHAK
Subjt:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK

Query:  RVTIMKKDFELARRLGGK
        RVTIM KD +LARR+ G+
Subjt:  RVTIMKKDFELARRLGGK

AT4G40040.1 Histone superfamily protein3.4e-2454.24Show/hide
Query:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK
        R Q   +A    A T+G   K  R+RPGTVALREIR++QK+  LLI    F R V+E++     ++ R+Q+ A++ALQEAAE +LV LFEDT LCAIHAK
Subjt:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK

Query:  RVTIMKKDFELARRLGGK
        RVTIM KD +LARR+ G+
Subjt:  RVTIMKKDFELARRLGGK

AT4G40040.2 Histone superfamily protein3.4e-2454.24Show/hide
Query:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK
        R Q   +A    A T+G   K  R+RPGTVALREIR++QK+  LLI    F R V+E++     ++ R+Q+ A++ALQEAAE +LV LFEDT LCAIHAK
Subjt:  RTQNVRQAQSPPARTSG---KKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQEAAEDFLVHLFEDTMLCAIHAK

Query:  RVTIMKKDFELARRLGGK
        RVTIM KD +LARR+ G+
Subjt:  RVTIMKKDFELARRLGGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCGAGCCAGGCATCCAGCTCAAAGGAACGCAAATCGCAAGCCACTAGGTTCTGGAGTTGCACCGTCTTCCCCAGCTGGGCCATCGACGCCACTTGGTGGAAGAAC
ACAAAATGTGAGGCAAGCTCAAAGTCCACCAGCAAGGACATCTGGGAAGAAAAGACGGTTCAGGCCAGGGACAGTTGCATTGAGGGAAATTCGGCAATTCCAAAAGACAT
GGAATCTGCTAATTCCAGCTAGCAGTTTCATCCGAGCAGTAAAAGAAGTAAGCTACCAGTTAGCTCCAGAGATTACGCGGTGGCAAGCTGAAGCTTTAATAGCTCTTCAG
GAAGCAGCAGAAGATTTTTTGGTTCATCTATTTGAAGATACAATGCTATGTGCTATTCATGCCAAACGTGTAACTATCATGAAAAAGGATTTTGAACTGGCACGCCGGTT
AGGAGGAAAAGGGAGGCCATGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCGAGCCAGGCATCCAGCTCAAAGGAACGCAAATCGCAAGCCACTAGGTTCTGGAGTTGCACCGTCTTCCCCAGCTGGGCCATCGACGCCACTTGGTGGAAGAAC
ACAAAATGTGAGGCAAGCTCAAAGTCCACCAGCAAGGACATCTGGGAAGAAAAGACGGTTCAGGCCAGGGACAGTTGCATTGAGGGAAATTCGGCAATTCCAAAAGACAT
GGAATCTGCTAATTCCAGCTAGCAGTTTCATCCGAGCAGTAAAAGAAGTAAGCTACCAGTTAGCTCCAGAGATTACGCGGTGGCAAGCTGAAGCTTTAATAGCTCTTCAG
GAAGCAGCAGAAGATTTTTTGGTTCATCTATTTGAAGATACAATGCTATGTGCTATTCATGCCAAACGTGTAACTATCATGAAAAAGGATTTTGAACTGGCACGCCGGTT
AGGAGGAAAAGGGAGGCCATGGTGA
Protein sequenceShow/hide protein sequence
MARARHPAQRNANRKPLGSGVAPSSPAGPSTPLGGRTQNVRQAQSPPARTSGKKRRFRPGTVALREIRQFQKTWNLLIPASSFIRAVKEVSYQLAPEITRWQAEALIALQ
EAAEDFLVHLFEDTMLCAIHAKRVTIMKKDFELARRLGGKGRPW