; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr002174 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr002174
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHistone H2A
Genome locationtig00001384:21957..25846
RNA-Seq ExpressionSgr002174
SyntenySgr002174
Gene Ontology termsGO:0000724 - double-strand break repair via homologous recombination (biological process)
GO:0010212 - response to ionizing radiation (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0070876 - SOSS complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR002119 - Histone H2A
IPR007125 - Histone H2A/H2B/H3
IPR009072 - Histone-fold
IPR012340 - Nucleic acid-binding, OB-fold
IPR032454 - Histone H2A, C-terminal domain
IPR032458 - Histone H2A conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAY40811.1 hypothetical protein CUMW_054740 [Citrus unshiu]2.2e-6869.96Show/hide
Query:  MSSTDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELS
        MSS  A  KGGRG+    K VSRSHKAGLQFPVGRVAR+LKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKK RIIPRHIQLAV+NDEE S
Subjt:  MSSTDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELS

Query:  KLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEI---------------------------GSTKGGRGKPKAAKSVSRSQKAGLQFPV------------
        KLLGSVTIA+GGV+P IHQ+LLPKKA   KGEI                           GSTKGGRGKPK+ KS+SRS KAGLQFPV            
Subjt:  KLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEI---------------------------GSTKGGRGKPKAAKSVSRSQKAGLQFPV------------

Query:  -ERVGAGAPVYLSAVLEYLAAEL
         +RVGAGAPVYLSAVLEYLAAE+
Subjt:  -ERVGAGAPVYLSAVLEYLAAEL

KAF4368296.1 hypothetical protein G4B88_008600 [Cannabis sativa]1.5e-6969.36Show/hide
Query:  LPSLRSHFQSHHRKMSSTDAPA--KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRI
        +P+LR +       M ST A    KGGRGK  A KSVSRS KAGLQFPVGR+AR+LKKGRYAQRVGSGSPVYLSAVLEYLAAE+LELAGNAARDNKK+RI
Subjt:  LPSLRSHFQSHHRKMSSTDAPA--KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRI

Query:  IPRHIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS-----------------------TKGGRGKPKAAKSVSRSQKAGLQFPV
        IPRHIQLAVRNDEELSKLLG+VTIA+GGV+P IHQ+LLPKK GKGK EIGS                       TKGGRGKPK++KSVSRS KAGLQFPV
Subjt:  IPRHIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS-----------------------TKGGRGKPKAAKSVSRSQKAGLQFPV

Query:  -------------ERVGAGAPVYLSAVLEYLAAEL
                     ERVGAGAPVYLSAVLEYLAAE+
Subjt:  -------------ERVGAGAPVYLSAVLEYLAAEL

KAG6751671.1 hypothetical protein POTOM_043868 [Populus tomentosa]4.2e-6765.02Show/hide
Query:  RSHFQSHHRKMSSTDAPA---KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPR
        + HF     KMSS  A A   +GGRGK  A+K+VSRS KAGLQFPVGR+AR+LK G+YA+R+G+GSPVYLSAVLEYLAAEVLELAGNAARDNKK RIIPR
Subjt:  RSHFQSHHRKMSSTDAPA---KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPR

Query:  HIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGK-GEIG---------------------------------STKGGRGKPKAAKSVSRSQ
        HIQLAVRNDEEL KLLGSVTIA+GGV+P I+Q+LLPKKAGKGK G++G                                 STKGGRGKPKA+KSVSRSQ
Subjt:  HIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGK-GEIG---------------------------------STKGGRGKPKAAKSVSRSQ

Query:  KAGLQFPV-------------ERVGAGAPVYLSAVLEYLAAEL
        KAGLQFPV             ERVGAGAPVYL+AVLEYLAAE+
Subjt:  KAGLQFPV-------------ERVGAGAPVYLSAVLEYLAAEL

KAG8389410.1 hypothetical protein BUALT_Bualt02G0226300 [Buddleja alternifolia]2.8e-12771.88Show/hide
Query:  STDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKL
        +T      GRGK   +K+VSRS KAGLQFPVGRVAR+LKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKK RI+PRHIQLAVRND+E SKL
Subjt:  STDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKL

Query:  LGSVTIASGGVMPKIHQSLLPKK--AGKGKGEIGS-------------TKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVYLSA
        LGSVTIA+GGV+P IHQ+LLPKK  AGKGK EIGS             TKGGRGKPKA+KSV RS KAGLQFPV             ERVGAGAPVYLSA
Subjt:  LGSVTIASGGVMPKIHQSLLPKK--AGKGKGEIGS-------------TKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVYLSA

Query:  VLEYLAAE-LRSSQMMISLKDLVPAAQNNVNAQFIVLEKGKATVEGQNKTCLSLVADETAAVHFQLWGDECDAVEPSDIIRLTNGIFSYSRNNLVLRAGK
        VLEYLAAE +    M+  LK +VPAAQNN+N +FI+L+KGK T+EGQ KTCL+LVADETAAVHFQ+WGDEC+A EP DII L NGIFSYSRNNLVLRAGK
Subjt:  VLEYLAAE-LRSSQMMISLKDLVPAAQNNVNAQFIVLEKGKATVEGQNKTCLSLVADETAAVHFQLWGDECDAVEPSDIIRLTNGIFSYSRNNLVLRAGK

Query:  RGKIGKVGEFNMVFVEMPNMSEIHWVPDATDSKKYVKESVVSPYSRIFPPNH
        RGK  KVGEF M FVE PN+SEI W PD  DS+KYV+++V+SPYSR+F   H
Subjt:  RGKIGKVGEFNMVFVEMPNMSEIHWVPDATDSKKYVKESVVSPYSRIFPPNH

KDP21535.1 hypothetical protein JCGZ_22006 [Jatropha curcas]1.8e-7076.42Show/hide
Query:  MSSTDAP------AKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVR
        MSST  P      A+GGRG  S AKSVSRSHKAGLQFPVGRVAR+LK+GRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKK RIIPRHIQLAVR
Subjt:  MSSTDAP------AKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVR

Query:  NDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKG----------EIGSTKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVY
        NDEELSKLLGSVTIA+GGV+P IHQ+LLPKK  KGKG            GSTKGGRGKPK++KSVSRSQKAGLQFPV             +RVGAGAPVY
Subjt:  NDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKG----------EIGSTKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVY

Query:  LSAVLEYLAAEL
        LSAVLEYLAAE+
Subjt:  LSAVLEYLAAEL

TrEMBL top hitse value%identityAlignment
A0A067JNE6 Histone H2A8.8e-7176.42Show/hide
Query:  MSSTDAP------AKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVR
        MSST  P      A+GGRG  S AKSVSRSHKAGLQFPVGRVAR+LK+GRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKK RIIPRHIQLAVR
Subjt:  MSSTDAP------AKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVR

Query:  NDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKG----------EIGSTKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVY
        NDEELSKLLGSVTIA+GGV+P IHQ+LLPKK  KGKG            GSTKGGRGKPK++KSVSRSQKAGLQFPV             +RVGAGAPVY
Subjt:  NDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKG----------EIGSTKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVY

Query:  LSAVLEYLAAEL
        LSAVLEYLAAE+
Subjt:  LSAVLEYLAAEL

A0A2H5NLL6 Histone H2A1.1e-6869.96Show/hide
Query:  MSSTDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELS
        MSS  A  KGGRG+    K VSRSHKAGLQFPVGRVAR+LKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKK RIIPRHIQLAV+NDEE S
Subjt:  MSSTDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELS

Query:  KLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEI---------------------------GSTKGGRGKPKAAKSVSRSQKAGLQFPV------------
        KLLGSVTIA+GGV+P IHQ+LLPKKA   KGEI                           GSTKGGRGKPK+ KS+SRS KAGLQFPV            
Subjt:  KLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEI---------------------------GSTKGGRGKPKAAKSVSRSQKAGLQFPV------------

Query:  -ERVGAGAPVYLSAVLEYLAAEL
         +RVGAGAPVYLSAVLEYLAAE+
Subjt:  -ERVGAGAPVYLSAVLEYLAAEL

A0A6J1D803 SOSS complex subunit B homolog7.2e-6591.97Show/hide
Query:  MISLKDLVPAAQNNVNAQFIVLEKGKATVEGQNKTCLSLVADETAAVHFQLWGDECDAVEPSDIIRLTNGIFSYSRNNLVLRAGKRGKIGKVGEFNMVFV
        MISLKDLVPAAQNNVNAQFIVL+KGK T EGQNK CLSLVADETAAVHFQLWGDECDAVEPSDIIRLTNGIFSYSRNNLVLRAGKRGKI KVGEFNMVFV
Subjt:  MISLKDLVPAAQNNVNAQFIVLEKGKATVEGQNKTCLSLVADETAAVHFQLWGDECDAVEPSDIIRLTNGIFSYSRNNLVLRAGKRGKIGKVGEFNMVFV

Query:  EMPNMSEIHWVPDATDSKKYVKESVVSPYSRIFPPNH
        E PNMSEI WVPD  DSKKYVKESV+SPYSRIFPPN+
Subjt:  EMPNMSEIHWVPDATDSKKYVKESVVSPYSRIFPPNH

A0A7J6EH34 Uncharacterized protein6.5e-6660.59Show/hide
Query:  LPSLRSHFQSHHRKMSSTDAPA--KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRI
        +P+LR +       M ST A    KGGRGK  A KSVSRS KAGLQFPVGR+AR+LKKGRYAQRVGSGSPVYLSAVLEYLAAE+LELAGNAARDNKK+RI
Subjt:  LPSLRSHFQSHHRKMSSTDAPA--KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRI

Query:  IPRHIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS-------------------------------------------------
        IPRHIQLAVRNDEELSKLLG+VTIA+GGV+P IHQ+LLPKK GKGK EIGS                                                 
Subjt:  IPRHIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS-------------------------------------------------

Query:  --------TKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVYLSAVLEYLAAEL
                TKGGRGKPK++KSVSRS KAGLQFPV             ERVGAGAPVYLSAVLEYLAAE+
Subjt:  --------TKGGRGKPKAAKSVSRSQKAGLQFPV-------------ERVGAGAPVYLSAVLEYLAAEL

A0A7J6FC78 Uncharacterized protein7.5e-7069.36Show/hide
Query:  LPSLRSHFQSHHRKMSSTDAPA--KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRI
        +P+LR +       M ST A    KGGRGK  A KSVSRS KAGLQFPVGR+AR+LKKGRYAQRVGSGSPVYLSAVLEYLAAE+LELAGNAARDNKK+RI
Subjt:  LPSLRSHFQSHHRKMSSTDAPA--KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRI

Query:  IPRHIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS-----------------------TKGGRGKPKAAKSVSRSQKAGLQFPV
        IPRHIQLAVRNDEELSKLLG+VTIA+GGV+P IHQ+LLPKK GKGK EIGS                       TKGGRGKPK++KSVSRS KAGLQFPV
Subjt:  IPRHIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS-----------------------TKGGRGKPKAAKSVSRSQKAGLQFPV

Query:  -------------ERVGAGAPVYLSAVLEYLAAEL
                     ERVGAGAPVYLSAVLEYLAAE+
Subjt:  -------------ERVGAGAPVYLSAVLEYLAAEL

SwissProt top hitse value%identityAlignment
A2ZL69 Probable histone H2AXb1.5e-5178.79Show/hide
Query:  TDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLL
        + A   GGRGK   +KSVSRS KAGLQFPVGR+ARYLK G+YA+RVG+G+PVYLSAVLEYLAAEVLELAGNAARDNKK RI+PRHIQLAVRNDEELS+LL
Subjt:  TDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLL

Query:  GSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS
        G+VTIA+GGV+P IHQ+LLPKK GK K +IGS
Subjt:  GSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS

O04848 Probable histone H2AXa1.6e-5384.25Show/hide
Query:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI
        KGGRGK  A KSVSRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYLSAVLEYLAAEVLELAGNAARDNKKTRI+PRHIQLAVRNDEELSKLLGSVTI
Subjt:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI

Query:  ASGGVMPKIHQSLLPKKAGKGKGEIGS
        A+GGV+P IHQ+LLP K GK KG+IGS
Subjt:  ASGGVMPKIHQSLLPKKAGKGKGEIGS

O65759 Histone H2AX8.5e-5583.7Show/hide
Query:  MSSTDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELS
        MSST A  KGGRGK  A+KSVSRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYLSAVLEYLAAEVLELAGNAARDNK  RI+PRHIQLAVRNDEELS
Subjt:  MSSTDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELS

Query:  KLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS
        KLLGSVTIA+GGV+P IHQ+LLPKK GKGKGEIGS
Subjt:  KLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGS

P35063 Histone H2AX8.0e-5382.81Show/hide
Query:  AKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVT
        AKGGRGK  + KSVSRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYL+AVLEYLAAEVLELAGNAARDNKK RI+PRHIQLAVRNDEELSKLLG+VT
Subjt:  AKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVT

Query:  IASGGVMPKIHQSLLPKKAGKGKGEIGS
        IA+GGV+P IHQ LLPKK+GK KGEIGS
Subjt:  IASGGVMPKIHQSLLPKKAGKGKGEIGS

Q9S9K7 Probable histone H2AXb2.1e-5384.25Show/hide
Query:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI
        KGGRGK  A KSVSRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYLSAVLEYLAAEVLELAGNAARDNKKTRI+PRHIQLAVRNDEELSKLLGSVTI
Subjt:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI

Query:  ASGGVMPKIHQSLLPKKAGKGKGEIGS
        A+GGV+P IHQ+LLP K GK KG+IGS
Subjt:  ASGGVMPKIHQSLLPKKAGKGKGEIGS

Arabidopsis top hitse value%identityAlignment
AT1G08880.1 Histone superfamily protein1.1e-5484.25Show/hide
Query:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI
        KGGRGK  A KSVSRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYLSAVLEYLAAEVLELAGNAARDNKKTRI+PRHIQLAVRNDEELSKLLGSVTI
Subjt:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI

Query:  ASGGVMPKIHQSLLPKKAGKGKGEIGS
        A+GGV+P IHQ+LLP K GK KG+IGS
Subjt:  ASGGVMPKIHQSLLPKKAGKGKGEIGS

AT1G51060.1 histone H2A 101.4e-4779.03Show/hide
Query:  GRGKK----SAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSV
        GRGK     SA K+ +RS KAGLQFPVGR+AR+LKKG+YA+RVG+G+PVYL+AVLEYLAAEVLELAGNAARDNKKTRI+PRHIQLAVRNDEELSKLLG V
Subjt:  GRGKK----SAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSV

Query:  TIASGGVMPKIHQSLLPKKAGKGK
        TIA+GGVMP IH  LLPKK G  K
Subjt:  TIASGGVMPKIHQSLLPKKAGKGK

AT1G54690.1 gamma histone variant H2AX1.5e-5484.25Show/hide
Query:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI
        KGGRGK  A KSVSRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYLSAVLEYLAAEVLELAGNAARDNKKTRI+PRHIQLAVRNDEELSKLLGSVTI
Subjt:  KGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSVTI

Query:  ASGGVMPKIHQSLLPKKAGKGKGEIGS
        A+GGV+P IHQ+LLP K GK KG+IGS
Subjt:  ASGGVMPKIHQSLLPKKAGKGKGEIGS

AT4G27230.1 histone H2A 24.6e-4879.84Show/hide
Query:  GRGKK----SAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSV
        GRGK+    +A KS SRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYL+AVLEYLAAEVLELAGNAARDNKKTRI+PRHIQLAVRNDEELSKLLG V
Subjt:  GRGKK----SAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSV

Query:  TIASGGVMPKIHQSLLPKKAGKGK
        TIA+GGVMP IH  LLPKKAG  K
Subjt:  TIASGGVMPKIHQSLLPKKAGKGK

AT4G27230.2 histone H2A 24.6e-4879.84Show/hide
Query:  GRGKK----SAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSV
        GRGK+    +A KS SRS KAGLQFPVGR+AR+LK G+YA+RVG+G+PVYL+AVLEYLAAEVLELAGNAARDNKKTRI+PRHIQLAVRNDEELSKLLG V
Subjt:  GRGKK----SAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARDNKKTRIIPRHIQLAVRNDEELSKLLGSV

Query:  TIASGGVMPKIHQSLLPKKAGKGK
        TIA+GGVMP IH  LLPKKAG  K
Subjt:  TIASGGVMPKIHQSLLPKKAGKGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCAGCCCATCTTCTTCTCCGTCTCGACAATTCTTCTCTGCTCTTAATCCGCTTCCTTCCCTCCGATCGCATTTTCAATCTCACCACCGCAAAATGAGTTCCACCGA
CGCGCCGGCTAAGGGCGGCAGAGGCAAGAAATCCGCCGCCAAGTCCGTCTCCAGGTCTCACAAGGCCGGGCTTCAATTTCCCGTCGGCCGAGTCGCTAGGTACCTCAAGA
AGGGCCGTTATGCTCAGCGTGTCGGCTCTGGCTCTCCGGTCTACCTCTCCGCCGTCCTTGAATATCTCGCCGCTGAGGTTTTGGAGTTGGCTGGTAATGCTGCGAGGGAC
AACAAGAAGACCAGAATCATCCCGAGGCACATTCAACTTGCGGTGAGGAACGACGAGGAATTGAGCAAGCTTTTGGGATCGGTTACGATTGCCAGTGGAGGAGTGATGCC
GAAAATTCATCAGTCTCTTTTGCCAAAGAAGGCCGGAAAGGGGAAAGGCGAAATCGGATCGACCAAGGGCGGCAGAGGAAAACCCAAGGCCGCGAAGTCTGTTTCGCGGT
CACAGAAGGCCGGTCTTCAGTTCCCCGTCGAGCGTGTCGGCGCCGGTGCTCCCGTCTACCTTTCCGCCGTCCTTGAATATCTCGCCGCTGAGTTAAGAAGTTCACAAATG
ATGATATCTCTCAAGGACTTGGTGCCAGCAGCCCAGAACAATGTGAATGCACAGTTCATAGTTTTGGAGAAAGGGAAGGCAACAGTTGAAGGGCAGAACAAGACATGCCT
ATCACTGGTGGCCGACGAAACAGCAGCAGTTCACTTCCAGCTATGGGGAGATGAGTGCGACGCGGTCGAGCCGAGCGACATTATCCGCTTGACGAATGGGATATTCTCTT
ACAGCCGAAACAACCTGGTGCTGAGGGCAGGGAAGAGGGGGAAGATAGGGAAGGTGGGAGAGTTCAACATGGTGTTTGTGGAGATGCCCAATATGAGCGAGATCCATTGG
GTTCCCGACGCGACTGACTCGAAGAAGTATGTGAAGGAATCTGTTGTATCTCCCTATTCTCGTATCTTTCCACCAAACCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCAGCCCATCTTCTTCTCCGTCTCGACAATTCTTCTCTGCTCTTAATCCGCTTCCTTCCCTCCGATCGCATTTTCAATCTCACCACCGCAAAATGAGTTCCACCGA
CGCGCCGGCTAAGGGCGGCAGAGGCAAGAAATCCGCCGCCAAGTCCGTCTCCAGGTCTCACAAGGCCGGGCTTCAATTTCCCGTCGGCCGAGTCGCTAGGTACCTCAAGA
AGGGCCGTTATGCTCAGCGTGTCGGCTCTGGCTCTCCGGTCTACCTCTCCGCCGTCCTTGAATATCTCGCCGCTGAGGTTTTGGAGTTGGCTGGTAATGCTGCGAGGGAC
AACAAGAAGACCAGAATCATCCCGAGGCACATTCAACTTGCGGTGAGGAACGACGAGGAATTGAGCAAGCTTTTGGGATCGGTTACGATTGCCAGTGGAGGAGTGATGCC
GAAAATTCATCAGTCTCTTTTGCCAAAGAAGGCCGGAAAGGGGAAAGGCGAAATCGGATCGACCAAGGGCGGCAGAGGAAAACCCAAGGCCGCGAAGTCTGTTTCGCGGT
CACAGAAGGCCGGTCTTCAGTTCCCCGTCGAGCGTGTCGGCGCCGGTGCTCCCGTCTACCTTTCCGCCGTCCTTGAATATCTCGCCGCTGAGTTAAGAAGTTCACAAATG
ATGATATCTCTCAAGGACTTGGTGCCAGCAGCCCAGAACAATGTGAATGCACAGTTCATAGTTTTGGAGAAAGGGAAGGCAACAGTTGAAGGGCAGAACAAGACATGCCT
ATCACTGGTGGCCGACGAAACAGCAGCAGTTCACTTCCAGCTATGGGGAGATGAGTGCGACGCGGTCGAGCCGAGCGACATTATCCGCTTGACGAATGGGATATTCTCTT
ACAGCCGAAACAACCTGGTGCTGAGGGCAGGGAAGAGGGGGAAGATAGGGAAGGTGGGAGAGTTCAACATGGTGTTTGTGGAGATGCCCAATATGAGCGAGATCCATTGG
GTTCCCGACGCGACTGACTCGAAGAAGTATGTGAAGGAATCTGTTGTATCTCCCTATTCTCGTATCTTTCCACCAAACCATTGA
Protein sequenceShow/hide protein sequence
MCSPSSSPSRQFFSALNPLPSLRSHFQSHHRKMSSTDAPAKGGRGKKSAAKSVSRSHKAGLQFPVGRVARYLKKGRYAQRVGSGSPVYLSAVLEYLAAEVLELAGNAARD
NKKTRIIPRHIQLAVRNDEELSKLLGSVTIASGGVMPKIHQSLLPKKAGKGKGEIGSTKGGRGKPKAAKSVSRSQKAGLQFPVERVGAGAPVYLSAVLEYLAAELRSSQM
MISLKDLVPAAQNNVNAQFIVLEKGKATVEGQNKTCLSLVADETAAVHFQLWGDECDAVEPSDIIRLTNGIFSYSRNNLVLRAGKRGKIGKVGEFNMVFVEMPNMSEIHW
VPDATDSKKYVKESVVSPYSRIFPPNH