; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023408 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023408
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHistone H2A
Genome locationtig00000892:3058622..3062994
RNA-Seq ExpressionSgr023408
SyntenySgr023408
Gene Ontology termsGO:0070828 - heterochromatin organization (biological process)
GO:0000775 - chromosome, centromeric region (cellular component)
GO:0000786 - nucleosome (cellular component)
GO:0000792 - heterochromatin (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR002119 - Histone H2A
IPR002885 - Pentatricopeptide repeat
IPR007125 - Histone H2A/H2B/H3
IPR009072 - Histone-fold
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032454 - Histone H2A, C-terminal domain
IPR032458 - Histone H2A conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570725.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]7.1e-8267.23Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFG WDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAAKLLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT+              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

KAG7010569.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]7.1e-8267.23Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFG WDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAAKLLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT+              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

XP_022944482.1 putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 [Cucurbita moschata]7.1e-8267.23Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFG WDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAAKLLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT+              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

XP_022944483.1 putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 [Cucurbita moschata]7.1e-8267.23Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFG WDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAAKLLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT+              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

XP_023511929.1 putative pentatricopeptide repeat-containing protein At5g59900 [Cucurbita pepo subsp. pepo]9.3e-8267.23Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFGFWDIMIGEGCIPNTVTYTALVNGL KAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAAKLLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT+              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

TrEMBL top hitse value%identityAlignment
A0A5D3D034 Putative pentatricopeptide repeat-containing protein1.4e-7561.67Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MH +GM+PD+VIYTTLIDG++KAG ++KAFGFW+IMI EGC+PNTVTYTALVNGLFKAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKG+LANPVTYNILIRGYCQ+GKF EAAKLLD MIG G+ PDCITYSTFIYEYC+RG+V+AA++MWECMLQRGLKPD +              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYFQY
         +F    C    EL R +   N M+ +  L P+ S Y  +
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYFQY

A0A6J1FVS2 putative pentatricopeptide repeat-containing protein At5g59900 isoform X13.4e-8267.23Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFG WDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAAKLLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT+              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

A0A6J1FY36 putative pentatricopeptide repeat-containing protein At5g59900 isoform X23.4e-8267.23Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFG WDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAAKLLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT+              +
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

A0A6J1J8G9 putative pentatricopeptide repeat-containing protein At5g59900 isoform X25.0e-8166.39Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFGFWDIMIGEGCIPN+VTYTALVNGL KAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAA+LLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT++              
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

A0A6J1JC67 putative pentatricopeptide repeat-containing protein At5g59900 isoform X15.0e-8166.39Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL
        MHSQGM+PD VIYTTLIDG IKAG +RKAFGFWDIMIGEGCIPN+VTYTALVNGL KAGYVNEAKLLF+RMLV                    GNME+AL
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLV--------------------GNMESAL

Query:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS
        QLHNAMLKGTLANPVTYNILIRGYCQ+GKFHEAA+LLDGMIGNGI PDCITYSTFIYEYC+RGNV AA+EMWECML+RGLKPDT++              
Subjt:  QLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFS

Query:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF
         +F    C    EL + +   N M+ +  L P+ S Y+
Subjt:  TDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYF

SwissProt top hitse value%identityAlignment
A2WQG7 Probable histone H2A.54.4e-5886.09Show/hide
Query:  GGKAKKGAGGRR-GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEELG
        GGK KKGA GR+ GG +KKAVSRSVKAGLQFPVGRI RYLKKGRYAQR+GTGAPVYLAAV+EYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEELG
Subjt:  GGKAKKGAGGRR-GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEELG

Query:  KLLAGVTIASGGVLPNINPVLLPKKTDK----ALKEPKSPSKAA-KSPSKA
        KLLAGVTIA GGVLPNINPVLLPKKT +    A KE KSP KAA KSP KA
Subjt:  KLLAGVTIASGGVLPNINPVLLPKKTDK----ALKEPKSPSKAA-KSPSKA

P19177 Histone H2A7.7e-6387.25Show/hide
Query:  METGGKAKKGAGGRRGGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEE
        MET GKAKKG GGR+GG +KK+V+RSVKAGLQFPVGRI RYLKKGRYAQRVGTGAPVYLAAV+EYLAAEVLELAGNAARDNKK RIIPRH+LLA+RNDEE
Subjt:  METGGKAKKGAGGRRGGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEE

Query:  LGKLLAGVTIASGGVLPNINPVLLPKKT-DKALKEPKSPSKAAKSPSKA
        LGKLLAGVT A GGVLPNINPVLLPKKT +KA KEPKSPSKA KSP KA
Subjt:  LGKLLAGVTIASGGVLPNINPVLLPKKT-DKALKEPKSPSKAAKSPSKA

Q2HU68 Probable histone H2A.12.7e-6083.11Show/hide
Query:  METGGKAKKGAGGRRGGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEE
        M+   K KKGAGGR+GG +KK+V+RS +AGLQFPVGRI RYLKKGRYAQRVGTGAPVYLAAV+EYLAAEVLELAGNAARDNKKNRIIPRHVLLA+RNDEE
Subjt:  METGGKAKKGAGGRRGGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEE

Query:  LGKLLAGVTIASGGVLPNINPVLLPKKTDKALKEPKSPSKAAKSPSKA
        LGKLLAGVTIA GGVLPNINP+LLPKK +KA    KSPSKA KSP KA
Subjt:  LGKLLAGVTIASGGVLPNINPVLLPKKTDKALKEPKSPSKAAKSPSKA

Q94E96 Probable histone H2A.54.4e-5886.09Show/hide
Query:  GGKAKKGAGGRR-GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEELG
        GGK KKGA GR+ GG +KKAVSRSVKAGLQFPVGRI RYLKKGRYAQR+GTGAPVYLAAV+EYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEELG
Subjt:  GGKAKKGAGGRR-GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEELG

Query:  KLLAGVTIASGGVLPNINPVLLPKKTDK----ALKEPKSPSKAA-KSPSKA
        KLLAGVTIA GGVLPNINPVLLPKKT +    A KE KSP KAA KSP KA
Subjt:  KLLAGVTIASGGVLPNINPVLLPKKTDK----ALKEPKSPSKAA-KSPSKA

Q9M531 Histone H2A4.7e-6084.31Show/hide
Query:  METGGKAKKGAGGRR-GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDE
        M+TG K KKGAG R+ GG KKK VSRSVKAGLQFPVGRI R+LKKGRYAQRVG+GAPVYLAAV+EYLAAEVLELAGNAARDNKKNRIIPRHVLLA+RNDE
Subjt:  METGGKAKKGAGGRR-GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDE

Query:  ELGKLLAGVTIASGGVLPNINPVLLPKKTDK----ALKEPKSPSKAAKSPSKA
        ELGKLLAGVTIA GGVLPNINPVLLPKK +K    A KEPKSP+KA KSP KA
Subjt:  ELGKLLAGVTIASGGVLPNINPVLLPKKTDK----ALKEPKSPSKAAKSPSKA

Arabidopsis top hitse value%identityAlignment
AT5G02560.1 histone H2A 125.3e-5981.05Show/hide
Query:  METGGKAKKGAGGRR--GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRND
        M++G K KKGA GRR  GG KKK VSRSVK+GLQFPVGRI RYLKKGRY++RVGTGAPVYLAAV+EYLAAEVLELAGNAARDNKKNRIIPRHVLLA+RND
Subjt:  METGGKAKKGAGGRR--GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRND

Query:  EELGKLLAGVTIASGGVLPNINPVLLPKKTDKA---LKEPKSPSKAAKSPSKA
        EELG LL GVTIA GGVLPNINP+LLPKK++KA    K PKSPSKA KSP K+
Subjt:  EELGKLLAGVTIASGGVLPNINPVLLPKKTDKA---LKEPKSPSKAAKSPSKA

AT5G02560.2 histone H2A 126.1e-5570.06Show/hide
Query:  METGGKAKKGAGGRR--GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAE------------------------VLELA
        M++G K KKGA GRR  GG KKK VSRSVK+GLQFPVGRI RYLKKGRY++RVGTGAPVYLAAV+EYLAAE                        VLELA
Subjt:  METGGKAKKGAGGRR--GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAE------------------------VLELA

Query:  GNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNINPVLLPKKTDKA---LKEPKSPSKAAKSPSKA
        GNAARDNKKNRIIPRHVLLA+RNDEELG LL GVTIA GGVLPNINP+LLPKK++KA    K PKSPSKA KSP K+
Subjt:  GNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNINPVLLPKKTDKA---LKEPKSPSKAAKSPSKA

AT5G27670.1 histone H2A 75.7e-5375.84Show/hide
Query:  SMETGGKAKKGAGGRRGGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDE
        S +   K  +GAGGR+GG++KK+VS+SVKAGLQFPVGRIARYLKKGRYA R G+GAPVYLAAV+EYLAAEVLELAGNAARDNKKNRI PRH+ LAIRNDE
Subjt:  SMETGGKAKKGAGGRRGGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDE

Query:  ELGKLLAGVTIASGGVLPNINPVLLPKKTDKALKEPKSPSKAAKSPSKA
        ELG+LL GVTIASGGVLPNINPVLLPKK+  +  + +  S A KSP KA
Subjt:  ELGKLLAGVTIASGGVLPNINPVLLPKKTDKALKEPKSPSKAAKSPSKA

AT5G59870.1 histone H2A 63.1e-5174Show/hide
Query:  METGGKAKKGAGGRR--GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRND
        ME+ GK KK  GGR+  G  K K+VS+S+KAGLQFPVGRI R+LKKGRYAQR+G GAPVY+AAV+EYLAAEVLELAGNAARDNKK+RIIPRH+LLAIRND
Subjt:  METGGKAKKGAGGRR--GGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAPVYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRND

Query:  EELGKLLAGVTIASGGVLPNINPVLLPKKTDKALKEPKSPSKAAKSPSKA
        EELGKLL+GVTIA GGVLPNIN VLLPKK+     E K+     KSP KA
Subjt:  EELGKLLAGVTIASGGVLPNINPVLLPKKTDKALKEPKSPSKAAKSPSKA

AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein8.7e-5444.26Show/hide
Query:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERM----LVGN-----------------MESA
        MH +G++PD VIYT++ID   K G  ++AFG WD+MI EGC+PN VTYTA++NGL KAG+VNEA++L  +M     V N                 M+ A
Subjt:  MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERM----LVGN-----------------MESA

Query:  LQLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIF
        ++LHNA+LKG LAN  TYN+LIRG+C+ G+  EA++L+  MIG+G+ PDCITY+T I E CRR +V  AIE+W  M ++G++PD +             +
Subjt:  LQLHNAMLKGTLANPVTYNILIRGYCQMGKFHEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIF

Query:  STDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSN
        +T   G   AG  E+ +  +  N ML++  L+P+N
Subjt:  STDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAGTCAAGGAATGAGACCTGATAGTGTAATATACACCACTTTGATTGATGGATACATCAAAGCAGGAAAGGTCAGAAAGGCATTTGGATTTTGGGACATTATGAT
TGGTGAAGGATGCATTCCCAACACTGTGACATACACGGCATTGGTGAATGGATTATTCAAGGCTGGATATGTCAATGAAGCCAAGCTACTTTTCGAGCGTATGCTGGTTG
GAAATATGGAGAGCGCTCTGCAACTACACAATGCAATGCTCAAAGGAACTTTAGCAAATCCTGTCACATATAATATACTTATCCGGGGTTATTGCCAGATGGGAAAATTT
CACGAGGCAGCCAAGCTTCTTGATGGAATGATTGGAAATGGTATCTTCCCAGATTGTATCACATACTCGACATTTATCTATGAATATTGTAGGAGGGGTAATGTTAACGC
AGCTATTGAGATGTGGGAGTGTATGTTACAGAGGGGCTTGAAGCCTGATACACTCCTCCCTTTCCCCTTGCCTCCCTATCCATGGCCAGACATTTTCTCGACTGATTTTT
GGGGAATGAACTGCGCTGGGCAGTATGAGTTATCTAGACCTGTTGATTCCATGAATCATATGCTGAAGAAGCTATCTCTATTGCCATCTAATTCAAAATATTTTCAGTAT
AGTCAATCTGTTCTCTTTCAACCCGCCATTCGGGTTTCTTCCCACGATGGACGAGCTTACAAACAACTGATTGAAAAAACCAGATCAACTCCAGGAGCAAAGGTGGAGAA
CAATGAGTTATTTCGCTCAATGCACTTTCTCCACGTTGAAGAAAAGCGTTCAATGGAGACCGGCGGGAAGGCAAAGAAGGGCGCAGGAGGTAGAAGAGGAGGCGAGAAGA
AGAAGGCGGTTTCCCGCTCCGTCAAGGCCGGTTTGCAGTTCCCCGTCGGCCGTATCGCTCGTTATCTGAAAAAAGGAAGGTACGCTCAGCGAGTCGGCACCGGCGCTCCG
GTTTACTTGGCCGCAGTTATGGAGTACTTAGCCGCTGAGGTTCTAGAGTTGGCCGGAAATGCAGCACGAGACAACAAGAAGAACAGGATCATACCAAGGCACGTACTATT
GGCGATCAGGAACGACGAAGAACTTGGAAAGTTGCTGGCCGGCGTAACCATCGCCAGTGGCGGCGTTCTTCCGAACATCAACCCGGTTCTGTTGCCGAAGAAGACGGATA
AAGCTTTGAAAGAACCGAAATCTCCATCGAAGGCAGCAAAGTCTCCGAGCAAGGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATAGTCAAGGAATGAGACCTGATAGTGTAATATACACCACTTTGATTGATGGATACATCAAAGCAGGAAAGGTCAGAAAGGCATTTGGATTTTGGGACATTATGAT
TGGTGAAGGATGCATTCCCAACACTGTGACATACACGGCATTGGTGAATGGATTATTCAAGGCTGGATATGTCAATGAAGCCAAGCTACTTTTCGAGCGTATGCTGGTTG
GAAATATGGAGAGCGCTCTGCAACTACACAATGCAATGCTCAAAGGAACTTTAGCAAATCCTGTCACATATAATATACTTATCCGGGGTTATTGCCAGATGGGAAAATTT
CACGAGGCAGCCAAGCTTCTTGATGGAATGATTGGAAATGGTATCTTCCCAGATTGTATCACATACTCGACATTTATCTATGAATATTGTAGGAGGGGTAATGTTAACGC
AGCTATTGAGATGTGGGAGTGTATGTTACAGAGGGGCTTGAAGCCTGATACACTCCTCCCTTTCCCCTTGCCTCCCTATCCATGGCCAGACATTTTCTCGACTGATTTTT
GGGGAATGAACTGCGCTGGGCAGTATGAGTTATCTAGACCTGTTGATTCCATGAATCATATGCTGAAGAAGCTATCTCTATTGCCATCTAATTCAAAATATTTTCAGTAT
AGTCAATCTGTTCTCTTTCAACCCGCCATTCGGGTTTCTTCCCACGATGGACGAGCTTACAAACAACTGATTGAAAAAACCAGATCAACTCCAGGAGCAAAGGTGGAGAA
CAATGAGTTATTTCGCTCAATGCACTTTCTCCACGTTGAAGAAAAGCGTTCAATGGAGACCGGCGGGAAGGCAAAGAAGGGCGCAGGAGGTAGAAGAGGAGGCGAGAAGA
AGAAGGCGGTTTCCCGCTCCGTCAAGGCCGGTTTGCAGTTCCCCGTCGGCCGTATCGCTCGTTATCTGAAAAAAGGAAGGTACGCTCAGCGAGTCGGCACCGGCGCTCCG
GTTTACTTGGCCGCAGTTATGGAGTACTTAGCCGCTGAGGTTCTAGAGTTGGCCGGAAATGCAGCACGAGACAACAAGAAGAACAGGATCATACCAAGGCACGTACTATT
GGCGATCAGGAACGACGAAGAACTTGGAAAGTTGCTGGCCGGCGTAACCATCGCCAGTGGCGGCGTTCTTCCGAACATCAACCCGGTTCTGTTGCCGAAGAAGACGGATA
AAGCTTTGAAAGAACCGAAATCTCCATCGAAGGCAGCAAAGTCTCCGAGCAAGGCTTAA
Protein sequenceShow/hide protein sequence
MHSQGMRPDSVIYTTLIDGYIKAGKVRKAFGFWDIMIGEGCIPNTVTYTALVNGLFKAGYVNEAKLLFERMLVGNMESALQLHNAMLKGTLANPVTYNILIRGYCQMGKF
HEAAKLLDGMIGNGIFPDCITYSTFIYEYCRRGNVNAAIEMWECMLQRGLKPDTLLPFPLPPYPWPDIFSTDFWGMNCAGQYELSRPVDSMNHMLKKLSLLPSNSKYFQY
SQSVLFQPAIRVSSHDGRAYKQLIEKTRSTPGAKVENNELFRSMHFLHVEEKRSMETGGKAKKGAGGRRGGEKKKAVSRSVKAGLQFPVGRIARYLKKGRYAQRVGTGAP
VYLAAVMEYLAAEVLELAGNAARDNKKNRIIPRHVLLAIRNDEELGKLLAGVTIASGGVLPNINPVLLPKKTDKALKEPKSPSKAAKSPSKA