; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr013729 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr013729
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPDZ domain-containing protein
Genome locationtig00153968:23402..29244
RNA-Seq ExpressionSgr013729
SyntenySgr013729
Gene Ontology termsGO:0000786 - nucleosome (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR001478 - PDZ domain
IPR002119 - Histone H2A
IPR007125 - Histone H2A/H2B/H3
IPR009003 - Peptidase S1, PA clan
IPR009072 - Histone-fold
IPR032454 - Histone H2A, C-terminal domain
IPR032458 - Histone H2A conserved site
IPR036034 - PDZ superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAE6123816.1 unnamed protein product [Arabidopsis arenosa]2.4e-13252.53Show/hide
Query:  KGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGV
        +GAGGRKGGDR K V+KS+KAGLQFPVGRI RYLKKGRYA R  AGAP+YLAAVLEYLAAEVLELAGNAARDNKKNRINPRH+ LA+RNDEELGKLL GV
Subjt:  KGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGV

Query:  TIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKTQKGFELPEHTQTDRRSCCCWFLFIVCQNR
        TIASGGVLPNINP     K  + +  +   K       + M   LR     S   SEL R   V      T     +  ++  D R+            R
Subjt:  TIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKTQKGFELPEHTQTDRRSCCCWFLFIVCQNR

Query:  IGFQTHGSVVNSCCLERVTVSSMADH---TRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR-----------
        I      SV  S  L    +S    H     L G    SSRVSP S  P+  EK  P  + D  KP    LGRDTIANAAA VGPAVV            
Subjt:  IGFQTHGSVVNSCCLERVTVSSMADH---TRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR-----------

Query:  ----------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQN
                                                             NAD  SDIA+VKI SK+PLP AKLG SSKLRPGDWV+A+GCPLSLQN
Subjt:  ----------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQN

Query:  TVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGKRY--------------
        T+TAGIVSCVDRKSSDLGLGG RREYLQTDCAIN GNSGGPLVN+DGEV+GVNIMKV  A GL F+VPIDSVSKI E FKK G+                
Subjt:  TVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGKRY--------------

Query:  ----------SFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
                   FPDV +G+LV  V PGSPA RAGF PGDVV+  D  PV +IKEIIEI+ DR+G  ++ VV+RS    +TL V+PEE+NPDM
Subjt:  ----------SFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

KAF3976576.1 hypothetical protein CMV_000255 [Castanea mollissima]2.4e-14855.5Show/hide
Query:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        TKGAGGR+GGDR K VSKS KAGLQFPVGRIGR+LKKGRYA+RT  GAP+YLAAVLEYLAAEVLELAGNAARDNKK RINPRHVLLAVRNDEELGKLLQG
Subjt:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASEL----------HRNRRVQAGIWKTQKGFELPEHTQTDRRSCC
        VTIASGGVLPNINPVLLPKKT S+    A EK PK+   A +  V+       +  SE           +RN  ++     +     L  ++ ++ R+  
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASEL----------HRNRRVQAGIWKTQKGFELPEHTQTDRRSCC

Query:  CWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR---
           +    Q  +      +     C   ++    +D  +   L   SSRV   SAP S  +K+  GV     KPC  CLGRDTIANAAA VGPAVV    
Subjt:  CWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR---

Query:  ------------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAI
                                                                     NAD HSDIAIVKI+SK+PLP A LGSSSKLRPGDWVVA+
Subjt:  ------------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAI

Query:  GCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG---------
        GCPLSLQNT+TAGIVSCVDRKSSDLGLGG+RREYLQTDCAIN GNSGGPLVNVDGE++GVNIMKV  A GLSFAVP+DSVSKI + FKK G         
Subjt:  GCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG---------

Query:  ---------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
                       +  +FP+V KGVLV MVTPGSPA RAGF PGDVVIE D  PV SIKEIIEIMGDRVGVP+K  VKR+ D+ +TLTV+PEES  DM
Subjt:  ---------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

PQQ06470.1 putative protease Do-like 14 isoform X2 [Prunus yedoensis var. nudiflora]3.5e-12849.68Show/hide
Query:  MEGTKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKL
        ME  KG  GR GG R K VSKS +AGLQFPVGRIGR++K GRYA R   GAPIY+AAVLEYLAAEVLELAGNAARDNKK RI+PRH+LLAV+NDEEL  L
Subjt:  MEGTKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKL

Query:  LQGVTIASGGVLPNINPVLLPKKTSS-----------------NSTPAAAEKAP-----KSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKTQKG
        L+GVTIASGGVLP INPVLLPK+T+S                 +S PAA E AP      +   + +        PP   AS         +G+  + + 
Subjt:  LQGVTIASGGVLPNINPVLLPKKTSS-----------------NSTPAAAEKAP-----KSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKTQKG

Query:  FELPEHTQTDRRSCCCWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGR--KPCPKCLGRDT
           P        +     L+    NR    T    + +   E + +   ++      LS     +  +S   S   K+  GVS  G   K C  CLGRD+
Subjt:  FELPEHTQTDRRSCCCWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGR--KPCPKCLGRDT

Query:  IANAAANVGPAVVR--------------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAK
         A AAA VGPAVV                                                                NAD  SD+AIVKINSK+PLP AK
Subjt:  IANAAANVGPAVVR--------------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAK

Query:  LGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKIT
        LGSSSKL+PGD V+A+GCPLSLQNTVT+GIVSCVDRKS+DLGLGG+RREYLQTDCAIN GNSGGPLVN+DGEV+GVNIMKV  A GL FAVPIDSV+KI 
Subjt:  LGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKIT

Query:  EQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLD
        + FK+ G+                          +FP+V KG+LV MVTPGSPA RAGF PGDVVIE D   V SIKEI+EIMGDRVGVP+K +VKR+ D
Subjt:  EQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLD

Query:  STLTLTVLPEESNPDM
          LTLTV  EESN DM
Subjt:  STLTLTVLPEESNPDM

THF98620.1 hypothetical protein TEA_020688 [Camellia sinensis var. sinensis]9.3e-13752.4Show/hide
Query:  MEG-TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGK
        MEG +KGAGGR+GG+R K VSKS+K+GLQFPVGRI RYLKKGRYA+R   GAPIYLAA        VLELAGNAARDNKKNRINPRH+LLAVRND+ELGK
Subjt:  MEG-TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGK

Query:  LLQGVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKT-QKGFELPEHTQTDRRSCCCWFLF
        LLQGVTIASGGVLPNI+P+LLPKKT+++ +      +  SP +  +                        +  W+T Q+    P     D        LF
Subjt:  LLQGVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKT-QKGFELPEHTQTDRRSCCCWFLF

Query:  IVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR--------
            +RIG                                          P +    +  G  GDG KPC  CLG+DTIANAAA VGPAVV         
Subjt:  IVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR--------

Query:  --------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVS
                                                     NAD HSDIAIVKI SK+PLPMAKLG+SSKLRPGDWV+A+GCPLSLQNT+TAGIVS
Subjt:  --------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVS

Query:  CVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------------------K
        CVDRKSSDLGLGGMRREYLQTDCAIN GNSGGPLVN+DGEV+GVNIMKV  A GLSF+VPIDSVS I E FKK G                        K
Subjt:  CVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------------------K

Query:  RYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
           FP+V KGVLV MV+PGSPA RAGF PGDVV+E +  PV SIKEI EIMGD+VG PLK +VKR+ D+++TLTV+PEE+NPD+
Subjt:  RYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

XP_022965762.1 putative protease Do-like 14 isoform X1 [Cucurbita maxima]6.2e-11763.98Show/hide
Query:  LFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------
        LF+  Q   GF  H              S   DH +L GLSF SSRVSP  APPS  EKE P   GD +KPCP+CL RDTIANAAA+VGPAVV       
Subjt:  LFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------

Query:  ---------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP
                                                                  NADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP
Subjt:  ---------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP

Query:  LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------
        LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG            
Subjt:  LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------

Query:  ------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
                    +  SFPDVTKGVLVAMVTPGSPASRAGF PGDVVIE D+ PV SI+EIIEIMGDRVGVPLKAVVKRSL+ST+TLTVLPEESNPDM
Subjt:  ------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

TrEMBL top hitse value%identityAlignment
A0A2N9HEI6 PDZ domain-containing protein1.8e-12251.71Show/hide
Query:  IYLAAVLEYLAA-EVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRG
        ++L  V E +   EVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGVTIA+GGV+PNINPVLLPKK+SS     A+EK PKS            
Subjt:  IYLAAVLEYLAA-EVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRG

Query:  TGPPSLSASELHRNRRVQAGIWKTQKGFELPEHTQTDRRSCCCWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTR-------------LRGLSF
                                                C                   S   SC    +T++S A++TR             +  L  
Subjt:  TGPPSLSASELHRNRRVQAGIWKTQKGFELPEHTQTDRRSCCCWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTR-------------LRGLSF

Query:  LSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------------------------------------------------
         SSRV   S P S  +K+T GV G   KPC  CLGRDTIANAAA VGPAVV                                                 
Subjt:  LSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------------------------------------------------

Query:  ---------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGN
                        NAD HSDIAIVKI+SK+PLP A LGSSSKLRPGDWVVA+GCPLSLQNT+TAGIVSCVDRKSSDLGLGG+RREYLQTDCAINVGN
Subjt:  ---------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGN

Query:  SGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHP
        SGGPLVN+DGE+VGVNIMKV  A GLSFAVPIDSVSKI   FKK G                        +  +FP+V KGVLV MVTPGSP  RAGF P
Subjt:  SGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHP

Query:  GDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
        GDVVIE D  PV SIKEIIEIMGDRVGVP+KAVVKR+ D+ +TLTV+PEES PDM
Subjt:  GDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

A0A314YJY7 Putative protease Do-like 14 isoform X21.7e-12849.68Show/hide
Query:  MEGTKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKL
        ME  KG  GR GG R K VSKS +AGLQFPVGRIGR++K GRYA R   GAPIY+AAVLEYLAAEVLELAGNAARDNKK RI+PRH+LLAV+NDEEL  L
Subjt:  MEGTKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKL

Query:  LQGVTIASGGVLPNINPVLLPKKTSS-----------------NSTPAAAEKAP-----KSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKTQKG
        L+GVTIASGGVLP INPVLLPK+T+S                 +S PAA E AP      +   + +        PP   AS         +G+  + + 
Subjt:  LQGVTIASGGVLPNINPVLLPKKTSS-----------------NSTPAAAEKAP-----KSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKTQKG

Query:  FELPEHTQTDRRSCCCWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGR--KPCPKCLGRDT
           P        +     L+    NR    T    + +   E + +   ++      LS     +  +S   S   K+  GVS  G   K C  CLGRD+
Subjt:  FELPEHTQTDRRSCCCWFLFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGR--KPCPKCLGRDT

Query:  IANAAANVGPAVVR--------------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAK
         A AAA VGPAVV                                                                NAD  SD+AIVKINSK+PLP AK
Subjt:  IANAAANVGPAVVR--------------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAK

Query:  LGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKIT
        LGSSSKL+PGD V+A+GCPLSLQNTVT+GIVSCVDRKS+DLGLGG+RREYLQTDCAIN GNSGGPLVN+DGEV+GVNIMKV  A GL FAVPIDSV+KI 
Subjt:  LGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKIT

Query:  EQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLD
        + FK+ G+                          +FP+V KG+LV MVTPGSPA RAGF PGDVVIE D   V SIKEI+EIMGDRVGVP+K +VKR+ D
Subjt:  EQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLD

Query:  STLTLTVLPEESNPDM
          LTLTV  EESN DM
Subjt:  STLTLTVLPEESNPDM

A0A4S4D849 Uncharacterized protein4.5e-13752.4Show/hide
Query:  MEG-TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGK
        MEG +KGAGGR+GG+R K VSKS+K+GLQFPVGRI RYLKKGRYA+R   GAPIYLAA        VLELAGNAARDNKKNRINPRH+LLAVRND+ELGK
Subjt:  MEG-TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGK

Query:  LLQGVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKT-QKGFELPEHTQTDRRSCCCWFLF
        LLQGVTIASGGVLPNI+P+LLPKKT+++ +      +  SP +  +                        +  W+T Q+    P     D        LF
Subjt:  LLQGVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKT-QKGFELPEHTQTDRRSCCCWFLF

Query:  IVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR--------
            +RIG                                          P +    +  G  GDG KPC  CLG+DTIANAAA VGPAVV         
Subjt:  IVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR--------

Query:  --------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVS
                                                     NAD HSDIAIVKI SK+PLPMAKLG+SSKLRPGDWV+A+GCPLSLQNT+TAGIVS
Subjt:  --------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVS

Query:  CVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------------------K
        CVDRKSSDLGLGGMRREYLQTDCAIN GNSGGPLVN+DGEV+GVNIMKV  A GLSF+VPIDSVS I E FKK G                        K
Subjt:  CVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------------------K

Query:  RYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
           FP+V KGVLV MV+PGSPA RAGF PGDVV+E +  PV SIKEI EIMGD+VG PLK +VKR+ D+++TLTV+PEE+NPD+
Subjt:  RYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

A0A6J1HL69 putative protease Do-like 14 isoform X13.0e-11763.98Show/hide
Query:  LFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------
        LF+  Q   GF  H              S   DH +L GLSF SSRVSP  APPS  EKE P   GD +KPCP+CL RDTIANAAA+VGPAVV       
Subjt:  LFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------

Query:  ---------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP
                                                                  NADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP
Subjt:  ---------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP

Query:  LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------
        LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG            
Subjt:  LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------

Query:  ------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
                    +  SFPDVTKGVLVAMVTPGSPASRAGF PGDVVIE D+ PV SI+EIIEIMGDRVGVPLKAVVKRSL+ST+TLTVLPEESNPDM
Subjt:  ------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

A0A6J1HPX7 putative protease Do-like 14 isoform X23.0e-11763.98Show/hide
Query:  LFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------
        LF+  Q   GF  H              S   DH +L GLSF SSRVSP  APPS  EKE P   GD +KPCP+CL RDTIANAAA+VGPAVV       
Subjt:  LFIVCQNRIGFQTHGSVVNSCCLERVTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVR------

Query:  ---------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP
                                                                  NADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP
Subjt:  ---------------------------------------------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCP

Query:  LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------
        LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG            
Subjt:  LSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRG------------

Query:  ------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
                    +  SFPDVTKGVLVAMVTPGSPASRAGF PGDVVIE D+ PV SI+EIIEIMGDRVGVPLKAVVKRSL+ST+TLTVLPEESNPDM
Subjt:  ------------KRYSFPDVTKGVLVAMVTPGSPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

SwissProt top hitse value%identityAlignment
A2Y5G8 Probable histone H2A.42.8e-5179.43Show/hide
Query:  GAGGRKGGDRTK---VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        GAGGR+GG   K   VS+S+KAGLQFPVGRIGRYLK+GRY++R   GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRI PRHVLLA+RNDEELGKLL G
Subjt:  GAGGRKGGDRTK---VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAE-KAPKSPKKA
        VTIA GGVLPNINPVLLPKKT S +   A E K PKSPKKA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAE-KAPKSPKKA

P25469 Histone H2A.11.3e-5383.57Show/hide
Query:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        TKGAGGRKGG R K V+KSIKAGLQFPVGRIGRYLKKGRYA+R  +GAPIYLAAVLEYLAAEVLELAGNAARDNKK+RI PRHVLLAVRNDEELGKLL G
Subjt:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKA
        VTIASGGVLPNINPVLLPKK++     +   KA KSPKKA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKA

Q2HU65 Probable histone H2A.21.6e-5178.62Show/hide
Query:  KGAGGRKGGDRTK--VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        KGAGGRKGG   K  V++SI+AGLQFPVGRIGRYLKKGRYA+R   GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRI PRHVLLAVRNDEELGKLL G
Subjt:  KGAGGRKGGDRTK--VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAP-----KSPKKA
        VTIA GGVLPNINPVLLPKKT  ++T +   K+P     KSPKKA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAP-----KSPKKA

Q3E6S8 Putative protease Do-like 148.3e-8051.38Show/hide
Query:  LRGLSFLSSRVSPASAPPSAAEKETPGVSGD-GRKPCPKCLGRDTIANAAANVGPAVVR-----------------------------------------
        L G    SSRVSP S  P   EK     + D   KP    LGRDTIANAAA +GPAVV                                          
Subjt:  LRGLSFLSSRVSPASAPPSAAEKETPGVSGD-GRKPCPKCLGRDTIANAAANVGPAVVR-----------------------------------------

Query:  ----------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTD
                               NAD  SDIA+VKI SK+PLP AKLG SSKLRPGDWV+A+GCPLSLQNTVTAGIVSCVDRKSSDLGLGG  REYLQTD
Subjt:  ----------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTD

Query:  CAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPA
        C+IN GNSGGPLVN+DGEV+GVNIMKV  A GL F+VPIDSVSKI E FKK G+                           FPDV +GVLV  V PGSPA
Subjt:  CAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPA

Query:  SRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM
         RAGF PGDVV+  D  PV      IEIM DRVG  ++ VV+RS    +TL V+PEE+NPDM
Subjt:  SRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM

Q94F49 Probable histone H2A.51.6e-5481.43Show/hide
Query:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        T+GAGGRKGGDR K VSKS+KAGLQFPVGRI RYLKKGRYA R  +GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRINPRH+ LA+RNDEELG+LL G
Subjt:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKA
        VTIASGGVLPNINPVLLPKK++++S+ A    A KSPKKA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKA

Arabidopsis top hitse value%identityAlignment
AT5G02560.1 histone H2A 126.4e-5175.17Show/hide
Query:  KGAGGRKGGDRTK---VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ
        KGA GR+ G   K   VS+S+K+GLQFPVGRIGRYLKKGRY+KR   GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRI PRHVLLAVRNDEELG LL+
Subjt:  KGAGGRKGGDRTK---VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ

Query:  GVTIASGGVLPNINPVLLPKKT----SSNSTPAAAEKAPKSPKKA
        GVTIA GGVLPNINP+LLPKK+    S+  TP +  KA KSPKK+
Subjt:  GVTIASGGVLPNINPVLLPKKT----SSNSTPAAAEKAPKSPKKA

AT5G02560.2 histone H2A 127.3e-4764.5Show/hide
Query:  KGAGGRKGGDRTK---VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAE------------------------VLELAGNAARDNK
        KGA GR+ G   K   VS+S+K+GLQFPVGRIGRYLKKGRY+KR   GAP+YLAAVLEYLAAE                        VLELAGNAARDNK
Subjt:  KGAGGRKGGDRTK---VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAE------------------------VLELAGNAARDNK

Query:  KNRINPRHVLLAVRNDEELGKLLQGVTIASGGVLPNINPVLLPKKT----SSNSTPAAAEKAPKSPKKA
        KNRI PRHVLLAVRNDEELG LL+GVTIA GGVLPNINP+LLPKK+    S+  TP +  KA KSPKK+
Subjt:  KNRINPRHVLLAVRNDEELGKLLQGVTIASGGVLPNINPVLLPKKT----SSNSTPAAAEKAPKSPKKA

AT5G27660.1 Trypsin family protein with PDZ domain2.7e-7351.5Show/hide
Query:  LRGLSFLSSRVSPASAPPSAAEKETPGVSGD-GRKPCPKCLGRDTIANAAANVGPAVVR-----------------------------------------
        L G    SSRVSP S  P   EK     + D   KP    LGRDTIANAAA +GPAVV                                          
Subjt:  LRGLSFLSSRVSPASAPPSAAEKETPGVSGD-GRKPCPKCLGRDTIANAAANVGPAVVR-----------------------------------------

Query:  ----------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTD
                               NAD  SDIA+VKI SK+PLP AKLG SSKLRPGDWV+A+GCPLSLQNTVTAGIVSCVDRKSSDLGLGG  REYLQTD
Subjt:  ----------------------NNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTD

Query:  CAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPA
        C+IN GNSGGPLVN+DGEV+GVNIMKV  A GL F+VPIDSVSKI E FKK G+                           FPDV +GVLV  V PGSPA
Subjt:  CAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGKRY------------------------SFPDVTKGVLVAMVTPGSPA

Query:  SRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVG
         RAGF PGDVV+  D  PV      IEIM DRVG
Subjt:  SRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVG

AT5G27670.1 histone H2A 71.1e-5581.43Show/hide
Query:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        T+GAGGRKGGDR K VSKS+KAGLQFPVGRI RYLKKGRYA R  +GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRINPRH+ LA+RNDEELG+LL G
Subjt:  TKGAGGRKGGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKA
        VTIASGGVLPNINPVLLPKK++++S+ A    A KSPKKA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKA

AT5G59870.1 histone H2A 61.9e-4775.35Show/hide
Query:  KGAGGRK--GGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ
        K  GGRK  G  +TK VSKS+KAGLQFPVGRI R+LKKGRYA+R   GAP+Y+AAVLEYLAAEVLELAGNAARDNKK+RI PRH+LLA+RNDEELGKLL 
Subjt:  KGAGGRK--GGDRTK-VSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ

Query:  GVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAP
        GVTIA GGVLPNIN VLLPKK   ++T  A EKA KSP K+P
Subjt:  GVTIASGGVLPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAACGAAGGGCGCTGGAGGAAGAAAAGGAGGCGACAGAACTAAGGTCTCGAAGTCCATCAAGGCCGGACTCCAATTTCCAGTCGGAAGAATCGGACGGTATCT
CAAGAAAGGCCGCTACGCCAAGCGCACGGCCGCCGGTGCTCCAATCTACCTGGCTGCAGTTCTCGAGTACCTCGCCGCTGAGGTTCTGGAATTGGCGGGGAATGCAGCGC
GCGATAACAAGAAGAACAGAATAAACCCTAGGCACGTTCTTCTAGCTGTTCGGAACGATGAGGAGCTCGGGAAATTGCTTCAAGGAGTCACCATTGCTAGCGGCGGAGTT
CTTCCGAACATCAATCCGGTTTTGCTTCCGAAGAAGACATCGTCGAATTCCACTCCTGCTGCTGCTGAAAAGGCTCCGAAATCGCCAAAAAAGGCGCCCATGGGCCGAGT
GTTAAGAGGAACGGGGCCCCCGTCATTGTCCGCGTCGGAGTTGCACCGGAATCGAAGGGTTCAGGCGGGGATTTGGAAGACACAGAAAGGTTTCGAGCTCCCAGAACACA
CTCAAACGGATCGCCGCAGTTGCTGCTGCTGGTTCTTGTTTATTGTATGCCAGAACCGAATTGGATTCCAGACCCACGGTAGCGTTGTCAATTCCTGCTGCTTGGAGCGA
GTCACTGTTTCTTCCATGGCAGACCACACAAGGCTTCGCGGTCTTTCATTTCTTTCTTCAAGAGTCAGTCCTGCTTCTGCTCCACCATCTGCTGCGGAGAAGGAAACGCC
TGGAGTTTCTGGGGATGGCCGCAAGCCTTGTCCAAAATGTTTGGGTAGAGATACGATTGCTAATGCAGCAGCAAATGTTGGTCCTGCTGTTGTACGTAATAATGCTGATT
TTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCAATGGCAAAACTTGGTTCTTCAAGCAAGCTTCGACCTGGGGATTGGGTTGTAGCAATCGGG
TGTCCACTTTCACTTCAGAATACTGTCACGGCTGGTATAGTAAGTTGTGTTGACCGCAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGTAGGGAGTATCTACAAACAGA
TTGTGCGATTAATGTGGGAAATTCTGGGGGCCCTCTTGTTAATGTTGACGGAGAAGTTGTTGGTGTAAACATTATGAAAGTGGATGATGCTGTTGGATTAAGTTTTGCTG
TACCAATTGATTCAGTCTCCAAAATTACTGAGCAATTCAAGAAAAGAGGAAAGAGATACAGCTTTCCAGATGTCACTAAAGGGGTTCTTGTAGCTATGGTAACGCCTGGA
TCCCCTGCTAGTCGTGCTGGATTCCATCCTGGGGATGTCGTCATTGAGCTTGATAGGAATCCTGTTGCAAGTATCAAAGAGATCATTGAAATTATGGGAGATAGAGTTGG
GGTTCCATTGAAGGCCGTTGTGAAAAGATCACTTGATAGTACCCTCACTTTGACTGTTCTTCCTGAGGAGTCCAATCCAGATATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGAACGAAGGGCGCTGGAGGAAGAAAAGGAGGCGACAGAACTAAGGTCTCGAAGTCCATCAAGGCCGGACTCCAATTTCCAGTCGGAAGAATCGGACGGTATCT
CAAGAAAGGCCGCTACGCCAAGCGCACGGCCGCCGGTGCTCCAATCTACCTGGCTGCAGTTCTCGAGTACCTCGCCGCTGAGGTTCTGGAATTGGCGGGGAATGCAGCGC
GCGATAACAAGAAGAACAGAATAAACCCTAGGCACGTTCTTCTAGCTGTTCGGAACGATGAGGAGCTCGGGAAATTGCTTCAAGGAGTCACCATTGCTAGCGGCGGAGTT
CTTCCGAACATCAATCCGGTTTTGCTTCCGAAGAAGACATCGTCGAATTCCACTCCTGCTGCTGCTGAAAAGGCTCCGAAATCGCCAAAAAAGGCGCCCATGGGCCGAGT
GTTAAGAGGAACGGGGCCCCCGTCATTGTCCGCGTCGGAGTTGCACCGGAATCGAAGGGTTCAGGCGGGGATTTGGAAGACACAGAAAGGTTTCGAGCTCCCAGAACACA
CTCAAACGGATCGCCGCAGTTGCTGCTGCTGGTTCTTGTTTATTGTATGCCAGAACCGAATTGGATTCCAGACCCACGGTAGCGTTGTCAATTCCTGCTGCTTGGAGCGA
GTCACTGTTTCTTCCATGGCAGACCACACAAGGCTTCGCGGTCTTTCATTTCTTTCTTCAAGAGTCAGTCCTGCTTCTGCTCCACCATCTGCTGCGGAGAAGGAAACGCC
TGGAGTTTCTGGGGATGGCCGCAAGCCTTGTCCAAAATGTTTGGGTAGAGATACGATTGCTAATGCAGCAGCAAATGTTGGTCCTGCTGTTGTACGTAATAATGCTGATT
TTCACTCTGATATTGCCATTGTGAAAATCAATTCTAAAAGCCCTCTTCCAATGGCAAAACTTGGTTCTTCAAGCAAGCTTCGACCTGGGGATTGGGTTGTAGCAATCGGG
TGTCCACTTTCACTTCAGAATACTGTCACGGCTGGTATAGTAAGTTGTGTTGACCGCAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGTAGGGAGTATCTACAAACAGA
TTGTGCGATTAATGTGGGAAATTCTGGGGGCCCTCTTGTTAATGTTGACGGAGAAGTTGTTGGTGTAAACATTATGAAAGTGGATGATGCTGTTGGATTAAGTTTTGCTG
TACCAATTGATTCAGTCTCCAAAATTACTGAGCAATTCAAGAAAAGAGGAAAGAGATACAGCTTTCCAGATGTCACTAAAGGGGTTCTTGTAGCTATGGTAACGCCTGGA
TCCCCTGCTAGTCGTGCTGGATTCCATCCTGGGGATGTCGTCATTGAGCTTGATAGGAATCCTGTTGCAAGTATCAAAGAGATCATTGAAATTATGGGAGATAGAGTTGG
GGTTCCATTGAAGGCCGTTGTGAAAAGATCACTTGATAGTACCCTCACTTTGACTGTTCTTCCTGAGGAGTCCAATCCAGATATGTGA
Protein sequenceShow/hide protein sequence
MEGTKGAGGRKGGDRTKVSKSIKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGVTIASGGV
LPNINPVLLPKKTSSNSTPAAAEKAPKSPKKAPMGRVLRGTGPPSLSASELHRNRRVQAGIWKTQKGFELPEHTQTDRRSCCCWFLFIVCQNRIGFQTHGSVVNSCCLER
VTVSSMADHTRLRGLSFLSSRVSPASAPPSAAEKETPGVSGDGRKPCPKCLGRDTIANAAANVGPAVVRNNADFHSDIAIVKINSKSPLPMAKLGSSSKLRPGDWVVAIG
CPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGKRYSFPDVTKGVLVAMVTPG
SPASRAGFHPGDVVIELDRNPVASIKEIIEIMGDRVGVPLKAVVKRSLDSTLTLTVLPEESNPDM