CuGenDBv2

Tan0017896 (gene) of Snake gourd v1 genome

Gene ID: Tan0017896
Organism: Trichosanthes anguina (Snake gourd v1)
Description: PDZ domain-containing protein
Genome location: LG08:6087475..6097871
RNA-Seq Expression: Tan0017896
Synteny: Tan0017896
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
IPR001478 - PDZ domain
IPR002119 - Histone H2A
IPR007125 - Histone H2A/H2B/H3
IPR009003 - Peptidase S1, PA clan
IPR009072 - Histone-fold
IPR032454 - Histone H2A, C-terminal domain
IPR032458 - Histone H2A conserved site
IPR036034 - PDZ superfamily
IPR041489 - PDZ domain 6
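
As a rough illustration of how the Genome location field can be read, the sketch below splits the string into a sequence identifier and start/end coordinates and computes the span. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split 'SEQID:START..END' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG08:6087475..6097871")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene region covers roughly 10.4 kb on LG08, which is much longer than the coding sequence listed further down and would be consistent with the presence of introns.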


Homology
GenBank top hits | e value | %identity | Alignment
KAF3976576.1 hypothetical protein CMV_000255 [Castanea mollissima] | 1.8e-202 | 67.62
Query:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        TKGAGGR+GGDR K VSKS KAGLQFPVGRIGR+LKKGRYA+RT  GAP+YLAAVLEYLAAEVLELAGNAARDNKK RINPRHVLLAVRNDEELGKLLQG
Subjt:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA--------HGPSHQKDSSIVCFLVGRKAS-SSHNLLNRIAAVAAAGSCFFYAKCKLDSGS
        VTIASGGVLPNINPVLLPKKT S+      EK PK+   A        H    Q   +I   L  RK S S+ N L RI A+A+AGS   YA    +S +
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA--------HGPSHQKDSSIVCFLVGRKAS-SSHNLLNRIAAVAAAGSCFFYAKCKLDSGS

Query:  SVVLSIPAVWSEPPYLPWQ-TTHGFAVHWSGVFDHRLLG-ISLCSSRVSPSPSSGVEKETPGIAGDGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGI
        +V + +PA   E   LPW+ T + F        D    G + L SSRV  +PSS ++K+  G+     KPC  CLGRDTIANAAA VGPAVVN+SV  G 
Subjt:  SVVLSIPAVWSEPPYLPWQ-TTHGFAVHWSGVFDHRLLG-ISLCSSRVSPSPSSGVEKETPGIAGDGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGI

Query:  YGIATAKSMGSGTII------------------------GKDEVTLQDGRTFEGTVMNADFHSDIAIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPL
        YGI   KS+GSGTII                        GK +VTLQDGRTFEGTV+NAD HSDIAIVKI+SKTPLP A LGSSSKLRPGDWVVA+GCPL
Subjt:  YGIATAKSMGSGTII------------------------GKDEVTLQDGRTFEGTVMNADFHSDIAIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPL

Query:  SLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMID
        SLQNT+TAGIVSCVDRKSSDLGLGG+RREYLQTDCAIN GNSGGPLVNVDGE++GVNIMKV  A GLSFAVP+DSVSKI + FKK GRV+RPWLGLKM+D
Subjt:  SLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMID

Query:  LNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVGVPLKAVVKRSLNVTITLTVLPEESNPDM
        LN+MII QLKERD  FP+V KGVLV MVTPGSPA RAGF PGDVVIE D +PV SIKEIIEIMGDRVGVP+K  VKR+ +  +TLTV+PEES  DM
Subjt:  LNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVGVPLKAVVKRSLNVTITLTVLPEESNPDM

KAG6586251.1 putative protease Do-like 14, partial [Cucurbita argyrosperma subsp. sororia] | 9.8e-196 | 83.76
Query:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG
        L+ RK SSS N L RIAA+AAAGSCF+YA  KLD+GSSVVLSIPA  SEP +LPWQTTHGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   G
Subjt:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG

Query:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI
        D QKPC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDI
Subjt:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI

Query:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV
        AIVKINSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAV
Subjt:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV

Query:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD
        GLSFAVPIDS+SKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERDA+FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGD
Subjt:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD

Query:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM
        RVG+PLKAVVKRSLN  ITLTVLPEESNPDM
Subjt:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM

XP_022938300.1 putative protease Do-like 14 isoform X1 [Cucurbita moschata] | 3.7e-195 | 83.76
Query:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG
        L+ RK SSS N L RIAA+AAAGSCF+YA  KLD+GSSVVLSIPA  SEP +LPWQT HGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   G
Subjt:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG

Query:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI
        D QKPC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDI
Subjt:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI

Query:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV
        AIVKINSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAV
Subjt:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV

Query:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD
        GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERDA+FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGD
Subjt:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD

Query:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM
        RVG+PLKAVVKRSLN  ITLTVLPEESNPDM
Subjt:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM

XP_022965762.1 putative protease Do-like 14 isoform X1 [Cucurbita maxima] | 2.9e-195 | 83.99
Query:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG
        L+  K SSS N L RIAA+AAAGSCF+YA  KLDSGSSVVLSIPA  SEP +LPWQT HGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   G
Subjt:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG

Query:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI
        D QKPC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDI
Subjt:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI

Query:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV
        AIVKINSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAV
Subjt:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV

Query:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD
        GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERD +FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGD
Subjt:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD

Query:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM
        RVGVPLKAVVKRSLN TITLTVLPEESNPDM
Subjt:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM

XP_022965763.1 putative protease Do-like 14 isoform X2 [Cucurbita maxima] | 3.7e-195 | 84.54
Query:  KASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQK
        K SSS N L RIAA+AAAGSCF+YA  KLDSGSSVVLSIPA  SEP +LPWQT HGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   GD QK
Subjt:  KASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQK

Query:  PCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIVK
        PC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDIAIVK
Subjt:  PCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIVK

Query:  INSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSF
        INSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAVGLSF
Subjt:  INSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSF

Query:  AVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVGV
        AVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERD +FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGDRVGV
Subjt:  AVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVGV

Query:  PLKAVVKRSLNVTITLTVLPEESNPDM
        PLKAVVKRSLN TITLTVLPEESNPDM
Subjt:  PLKAVVKRSLNVTITLTVLPEESNPDM

TrEMBL top hits | e value | %identity | Alignment
A0A6J1D9G4 putative protease Do-like 14 | 6.4e-185 | 80.37
Query:  RKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQ
        +K S SHN L RIAA AAAGSCF Y + +LDS  +V LSIPA WSEP +LP QTT G  V     FDHR LG+ L +SRV P+P S  EKETPG+AGDGQ
Subjt:  RKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQ

Query:  KPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIV
        KPC +CLGRDTIANAAA+VGPAVVNISV HGIYGIA AKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDIAIV
Subjt:  KPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIV

Query:  KINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLS
        KINSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGG+RREYLQTDCAINVGNSGGPLVN+DGEVVGVNIMKVDDAVGLS
Subjt:  KINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLS

Query:  FAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVG
        FAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERD  FP VT+GVL+AMVTPGSPASRAGF  GDVVIELD   VASIKEIIEIMGDRVG
Subjt:  FAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVG

Query:  VPLKAVVKRSLNVTITLTVLPEESNPDM
        VPLKAVVKRS+N TITLTVLPEESNPDM
Subjt:  VPLKAVVKRSLNVTITLTVLPEESNPDM

A0A6J1FCS0 putative protease Do-like 14 isoform X2 | 3.1e-195 | 84.11
Query:  RKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQ
        RK SSS N L RIAA+AAAGSCF+YA  KLD+GSSVVLSIPA  SEP +LPWQT HGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   GD Q
Subjt:  RKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQ

Query:  KPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIV
        KPC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDIAIV
Subjt:  KPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIV

Query:  KINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLS
        KINSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAVGLS
Subjt:  KINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLS

Query:  FAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVG
        FAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERDA+FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGDRVG
Subjt:  FAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVG

Query:  VPLKAVVKRSLNVTITLTVLPEESNPDM
        +PLKAVVKRSLN  ITLTVLPEESNPDM
Subjt:  VPLKAVVKRSLNVTITLTVLPEESNPDM

A0A6J1FJD3 putative protease Do-like 14 isoform X1 | 1.8e-195 | 83.76
Query:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG
        L+ RK SSS N L RIAA+AAAGSCF+YA  KLD+GSSVVLSIPA  SEP +LPWQT HGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   G
Subjt:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG

Query:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI
        D QKPC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDI
Subjt:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI

Query:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV
        AIVKINSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAV
Subjt:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV

Query:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD
        GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERDA+FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGD
Subjt:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD

Query:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM
        RVG+PLKAVVKRSLN  ITLTVLPEESNPDM
Subjt:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM

A0A6J1HL69 putative protease Do-like 14 isoform X1 | 1.4e-195 | 83.99
Query:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG
        L+  K SSS N L RIAA+AAAGSCF+YA  KLDSGSSVVLSIPA  SEP +LPWQT HGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   G
Subjt:  LVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAG

Query:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI
        D QKPC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDI
Subjt:  DGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDI

Query:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV
        AIVKINSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAV
Subjt:  AIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAV

Query:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD
        GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERD +FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGD
Subjt:  GLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGD

Query:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM
        RVGVPLKAVVKRSLN TITLTVLPEESNPDM
Subjt:  RVGVPLKAVVKRSLNVTITLTVLPEESNPDM

A0A6J1HPX7 putative protease Do-like 14 isoform X2 | 1.8e-195 | 84.54
Query:  KASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQK
        K SSS N L RIAA+AAAGSCF+YA  KLDSGSSVVLSIPA  SEP +LPWQT HGF +H SG FDH+ LG+S CSSRVSP+P SGVEKE P   GD QK
Subjt:  KASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQK

Query:  PCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIVK
        PC +CL RDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII KD                        EVTLQDGRTFEGTVMNADFHSDIAIVK
Subjt:  PCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKD------------------------EVTLQDGRTFEGTVMNADFHSDIAIVK

Query:  INSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSF
        INSK+PLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEV+GVNIMKVDDAVGLSF
Subjt:  INSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSF

Query:  AVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVGV
        AVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLN+MIIEQLKERD +FPDVTKGVLVAMVTPGSPASRAGF PGDVVIE DKQPV SI+EIIEIMGDRVGV
Subjt:  AVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVGV

Query:  PLKAVVKRSLNVTITLTVLPEESNPDM
        PLKAVVKRSLN TITLTVLPEESNPDM
Subjt:  PLKAVVKRSLNVTITLTVLPEESNPDM

SwissProt top hits | e value | %identity | Alignment
P25469 Histone H2A.1 | 4.3e-53 | 82.14
Query:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        TKGAGGRKGG R K V+KS+KAGLQFPVGRIGRYLKKGRYA+R  +GAPIYLAAVLEYLAAEVLELAGNAARDNKK+RI PRHVLLAVRNDEELGKLL G
Subjt:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA
        VTIASGGVLPNINPVLLPKK++     +   KA KSP+KA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA

Q2HU65 Probable histone H2A.2 | 1.8e-51 | 76.87
Query:  KGAGGRKGGDRTK--VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        KGAGGRKGG   K  V++S++AGLQFPVGRIGRYLKKGRYA+R   GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRI PRHVLLAVRNDEELGKLL G
Subjt:  KGAGGRKGGDRTK--VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKAHGPSHQK
        VTIA GGVLPNINPVLLPKKT  ++   T  K PKSP+   G S +K
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKAHGPSHQK

Q2HU68 Probable histone H2A.1 | 1.8e-51 | 80
Query:  KGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGV
        KGAGGRKGG R K V++S +AGLQFPVGRIGRYLKKGRYA+R   GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRI PRHVLLAVRNDEELGKLL GV
Subjt:  KGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGV

Query:  TIASGGVLPNINPVLLPKKTSSNSTPT-TAEKAPKSPRKA
        TIA GGVLPNINP+LLPKK    +T T +  KA KSP+KA
Subjt:  TIASGGVLPNINPVLLPKKTSSNSTPT-TAEKAPKSPRKA

Q3E6S8 Putative protease Do-like 14 | 2.1e-124 | 58.26
Query:  FLVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIP-AVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKE---T
        FL    +SS  + L RI +VA A S   YA    D+ + V L+IP +V      LPWQ + G  +H     +  L G  + SSRVSP   + +  E   +
Subjt:  FLVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIP-AVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKE---T

Query:  PGIAGDGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII------------------------GKDEVTLQDGRTFEGTVMNAD
           +    KP    LGRDTIANAAA +GPAVVN+SV  G +GI+  KS+GSGTII                        G+ +VTLQDGRTFEG V+NAD
Subjt:  PGIAGDGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII------------------------GKDEVTLQDGRTFEGTVMNAD

Query:  FHSDIAIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMK
          SDIA+VKI SKTPLP AKLG SSKLRPGDWV+A+GCPLSLQNTVTAGIVSCVDRKSSDLGLGG  REYLQTDC+IN GNSGGPLVN+DGEV+GVNIMK
Subjt:  FHSDIAIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMK

Query:  VDDAVGLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEII
        V  A GL F+VPIDSVSKI E FKK GRVIRPW+GLKM++LN++I+ QLKERD  FPDV +GVLV  V PGSPA RAGF PGDVV+  D +PV      I
Subjt:  VDDAVGLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEII

Query:  EIMGDRVGVPLKAVVKRSLNVTITLTVLPEESNPDM
        EIM DRVG  ++ VV+RS    +TL V+PEE+NPDM
Subjt:  EIMGDRVGVPLKAVVKRSLNVTITLTVLPEESNPDM

Q94F49 Probable histone H2A.5 | 8.7e-54 | 80.71
Query:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        T+GAGGRKGGDR K VSKSVKAGLQFPVGRI RYLKKGRYA R  +GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRINPRH+ LA+RNDEELG+LL G
Subjt:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA
        VTIASGGVLPNINPVLLPKK++++S+      A KSP+KA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA

Arabidopsis top hits | e value | %identity | Alignment
AT5G02560.1 histone H2A 12 | 1.2e-50 | 75.17
Query:  KGAGGRKGGDRTK---VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ
        KGA GR+ G   K   VS+SVK+GLQFPVGRIGRYLKKGRY+KR   GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRI PRHVLLAVRNDEELG LL+
Subjt:  KGAGGRKGGDRTK---VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ

Query:  GVTIASGGVLPNINPVLLPKKT----SSNSTPTTAEKAPKSPRKA
        GVTIA GGVLPNINP+LLPKK+    S+  TP +  KA KSP+K+
Subjt:  GVTIASGGVLPNINPVLLPKKT----SSNSTPTTAEKAPKSPRKA

AT5G02560.2 histone H2A 12 | 1.4e-46 | 64.5
Query:  KGAGGRKGGDRTK---VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAE------------------------VLELAGNAARDNK
        KGA GR+ G   K   VS+SVK+GLQFPVGRIGRYLKKGRY+KR   GAP+YLAAVLEYLAAE                        VLELAGNAARDNK
Subjt:  KGAGGRKGGDRTK---VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAE------------------------VLELAGNAARDNK

Query:  KNRINPRHVLLAVRNDEELGKLLQGVTIASGGVLPNINPVLLPKKT----SSNSTPTTAEKAPKSPRKA
        KNRI PRHVLLAVRNDEELG LL+GVTIA GGVLPNINP+LLPKK+    S+  TP +  KA KSP+K+
Subjt:  KNRINPRHVLLAVRNDEELGKLLQGVTIASGGVLPNINPVLLPKKT----SSNSTPTTAEKAPKSPRKA

AT5G27660.1 Trypsin family protein with PDZ domain | 6.7e-118 | 58.82
Query:  FLVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIP-AVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKE---T
        FL    +SS  + L RI +VA A S   YA    D+ + V L+IP +V      LPWQ + G  +H     +  L G  + SSRVSP   + +  E   +
Subjt:  FLVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIP-AVWSEPPYLPWQTTHGFAVHWSGVFDHRLLGISLCSSRVSPSPSSGVEKE---T

Query:  PGIAGDGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII------------------------GKDEVTLQDGRTFEGTVMNAD
           +    KP    LGRDTIANAAA +GPAVVN+SV  G +GI+  KS+GSGTII                        G+ +VTLQDGRTFEG V+NAD
Subjt:  PGIAGDGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTII------------------------GKDEVTLQDGRTFEGTVMNAD

Query:  FHSDIAIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMK
          SDIA+VKI SKTPLP AKLG SSKLRPGDWV+A+GCPLSLQNTVTAGIVSCVDRKSSDLGLGG  REYLQTDC+IN GNSGGPLVN+DGEV+GVNIMK
Subjt:  FHSDIAIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMK

Query:  VDDAVGLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEII
        V  A GL F+VPIDSVSKI E FKK GRVIRPW+GLKM++LN++I+ QLKERD  FPDV +GVLV  V PGSPA RAGF PGDVV+  D +PV      I
Subjt:  VDDAVGLSFAVPIDSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEII

Query:  EIMGDRVG
        EIM DRVG
Subjt:  EIMGDRVG

AT5G27670.1 histone H2A 7 | 6.2e-55 | 80.71
Query:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG
        T+GAGGRKGGDR K VSKSVKAGLQFPVGRI RYLKKGRYA R  +GAP+YLAAVLEYLAAEVLELAGNAARDNKKNRINPRH+ LA+RNDEELG+LL G
Subjt:  TKGAGGRKGGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQG

Query:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA
        VTIASGGVLPNINPVLLPKK++++S+      A KSP+KA
Subjt:  VTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAPKSPRKA

AT5G59870.1 histone H2A 6 | 6.2e-47 | 73.94
Query:  KGAGGRK--GGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ
        K  GGRK  G  +TK VSKS+KAGLQFPVGRI R+LKKGRYA+R   GAP+Y+AAVLEYLAAEVLELAGNAARDNKK+RI PRH+LLA+RNDEELGKLL 
Subjt:  KGAGGRK--GGDRTK-VSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQ

Query:  GVTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAP-KSPRKA
        GVTIA GGVLPNIN VLLPKK+++      A K+P KSP+KA
Subjt:  GVTIASGGVLPNINPVLLPKKTSSNSTPTTAEKAP-KSPRKA
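
The unlabeled middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one short block (the final block of the P25469 alignment) to show how an identity percentage relates to the midline. The helper name `midline_stats` is hypothetical, and because it covers a single block only, the result is not expected to equal the %identity reported for the whole hit.

```python
def midline_stats(midline):
    """Return (identities, positives, aligned_length) for one BLAST midline string."""
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives include identical residues plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the last P25469 block, written with explicit space runs so that
# whitespace cannot be lost when copying.
midline = "VTIASGGVLPNINPVLLPKK++" + " " * 5 + "+" + " " * 3 + "KA KSP+KA"

identities, positives, length = midline_stats(midline)
percent_identity = 100.0 * identities / length
print(identities, positives, length, round(percent_identity, 2))
```

Applying the same count to every block of a hit and summing before dividing would give a figure comparable to the %identity column, assuming that column is computed over the full alignment length including gaps.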


Sequences
CDS sequence
ATGGAAGGAACTAAGGGCGCCGGAGGAAGAAAAGGAGGCGACAGAACTAAGGTCTCGAAGTCCGTCAAGGCCGGACTCCAATTCCCCGTTGGAAGAATTGGACGCTACCT
CAAGAAAGGCCGCTACGCTAAGCGCACCGCTGCCGGTGCTCCGATCTACCTGGCTGCCGTCCTCGAATACCTCGCTGCTGAGGTCCTGGAATTGGCTGGGAATGCAGCAC
GTGACAACAAGAAGAACAGGATAAACCCTAGGCATGTTCTTCTGGCTGTGAGGAACGACGAGGAGCTTGGAAAATTGCTCCAAGGAGTCACCATTGCTAGCGGCGGAGTT
CTCCCGAACATCAATCCGGTTCTTCTTCCGAAGAAGACTTCGTCGAATTCAACTCCTACTACTGCTGAAAAGGCTCCGAAATCGCCAAGAAAGGCCCATGGGCCGAGTCA
CCAAAAGGATTCAAGTATTGTTTGTTTCCTCGTCGGAAGGAAGGCTTCTAGTTCACACAATTTGCTAAATCGGATAGCCGCAGTTGCTGCTGCTGGTTCTTGTTTTTTTT
ACGCCAAGTGCAAATTAGATTCTGGATCCTCTGTAGTGCTGTCAATTCCTGCTGTTTGGAGTGAGCCACCATATCTTCCATGGCAGACTACGCACGGCTTTGCGGTTCAT
TGGTCGGGCGTCTTCGATCACCGGCTATTGGGTATTTCACTTTGTTCTTCCAGAGTCAGTCCTTCTCCATCATCAGGTGTGGAGAAGGAAACGCCTGGGATAGCTGGAGA
CGGCCAGAAGCCTTGTGCAAAATGTTTGGGTAGAGACACAATTGCAAATGCTGCAGCAGATGTCGGCCCTGCTGTTGTAAATATATCTGTTTCACATGGTATTTACGGAA
TTGCTACCGCTAAAAGCATGGGATCCGGAACAATTATTGGTAAAGATGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCT
GATATTGCCATTGTGAAAATCAACTCTAAAACCCCCCTTCCCATGGCAAAACTTGGTTCTTCTAGCAAGCTCCGACCTGGGGATTGGGTTGTGGCCATTGGGTGTCCACT
TTCACTTCAGAATACTGTCACAGCTGGTATAGTAAGTTGTGTTGACCGTAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGAAGGGAATATCTACAAACAGATTGTGCAA
TTAATGTGGGAAATTCTGGGGGCCCTCTTGTTAACGTTGATGGAGAAGTTGTTGGTGTAAATATTATGAAAGTGGATGATGCTGTTGGATTAAGTTTTGCTGTACCCATT
GATTCAGTCTCCAAAATTACAGAGCAATTCAAGAAAAGAGGGAGAGTTATTCGGCCTTGGCTTGGATTGAAAATGATTGATCTCAATGATATGATAATCGAACAACTTAA
AGAAAGAGATGCAGCTTTTCCAGATGTTACTAAAGGGGTTCTTGTCGCCATGGTAACTCCTGGATCCCCTGCTAGTCGTGCTGGATTCTGTCCCGGTGATGTCGTGATTG
AGCTCGATAAGCAACCTGTTGCTAGTATCAAAGAGATCATTGAAATTATGGGAGATAGAGTTGGTGTTCCATTGAAAGCAGTTGTGAAAAGATCACTGAATGTTACCATT
ACTTTGACTGTTCTTCCAGAGGAGTCCAATCCAGATATGTGA
mRNA sequence
ATGGAAGGAACTAAGGGCGCCGGAGGAAGAAAAGGAGGCGACAGAACTAAGGTCTCGAAGTCCGTCAAGGCCGGACTCCAATTCCCCGTTGGAAGAATTGGACGCTACCT
CAAGAAAGGCCGCTACGCTAAGCGCACCGCTGCCGGTGCTCCGATCTACCTGGCTGCCGTCCTCGAATACCTCGCTGCTGAGGTCCTGGAATTGGCTGGGAATGCAGCAC
GTGACAACAAGAAGAACAGGATAAACCCTAGGCATGTTCTTCTGGCTGTGAGGAACGACGAGGAGCTTGGAAAATTGCTCCAAGGAGTCACCATTGCTAGCGGCGGAGTT
CTCCCGAACATCAATCCGGTTCTTCTTCCGAAGAAGACTTCGTCGAATTCAACTCCTACTACTGCTGAAAAGGCTCCGAAATCGCCAAGAAAGGCCCATGGGCCGAGTCA
CCAAAAGGATTCAAGTATTGTTTGTTTCCTCGTCGGAAGGAAGGCTTCTAGTTCACACAATTTGCTAAATCGGATAGCCGCAGTTGCTGCTGCTGGTTCTTGTTTTTTTT
ACGCCAAGTGCAAATTAGATTCTGGATCCTCTGTAGTGCTGTCAATTCCTGCTGTTTGGAGTGAGCCACCATATCTTCCATGGCAGACTACGCACGGCTTTGCGGTTCAT
TGGTCGGGCGTCTTCGATCACCGGCTATTGGGTATTTCACTTTGTTCTTCCAGAGTCAGTCCTTCTCCATCATCAGGTGTGGAGAAGGAAACGCCTGGGATAGCTGGAGA
CGGCCAGAAGCCTTGTGCAAAATGTTTGGGTAGAGACACAATTGCAAATGCTGCAGCAGATGTCGGCCCTGCTGTTGTAAATATATCTGTTTCACATGGTATTTACGGAA
TTGCTACCGCTAAAAGCATGGGATCCGGAACAATTATTGGTAAAGATGAGGTTACTTTACAAGATGGTCGGACATTTGAAGGGACAGTAATGAATGCTGATTTTCACTCT
GATATTGCCATTGTGAAAATCAACTCTAAAACCCCCCTTCCCATGGCAAAACTTGGTTCTTCTAGCAAGCTCCGACCTGGGGATTGGGTTGTGGCCATTGGGTGTCCACT
TTCACTTCAGAATACTGTCACAGCTGGTATAGTAAGTTGTGTTGACCGTAAGAGTAGTGATTTGGGTCTTGGTGGAATGCGAAGGGAATATCTACAAACAGATTGTGCAA
TTAATGTGGGAAATTCTGGGGGCCCTCTTGTTAACGTTGATGGAGAAGTTGTTGGTGTAAATATTATGAAAGTGGATGATGCTGTTGGATTAAGTTTTGCTGTACCCATT
GATTCAGTCTCCAAAATTACAGAGCAATTCAAGAAAAGAGGGAGAGTTATTCGGCCTTGGCTTGGATTGAAAATGATTGATCTCAATGATATGATAATCGAACAACTTAA
AGAAAGAGATGCAGCTTTTCCAGATGTTACTAAAGGGGTTCTTGTCGCCATGGTAACTCCTGGATCCCCTGCTAGTCGTGCTGGATTCTGTCCCGGTGATGTCGTGATTG
AGCTCGATAAGCAACCTGTTGCTAGTATCAAAGAGATCATTGAAATTATGGGAGATAGAGTTGGTGTTCCATTGAAAGCAGTTGTGAAAAGATCACTGAATGTTACCATT
ACTTTGACTGTTCTTCCAGAGGAGTCCAATCCAGATATGTGA
Protein sequence
MEGTKGAGGRKGGDRTKVSKSVKAGLQFPVGRIGRYLKKGRYAKRTAAGAPIYLAAVLEYLAAEVLELAGNAARDNKKNRINPRHVLLAVRNDEELGKLLQGVTIASGGV
LPNINPVLLPKKTSSNSTPTTAEKAPKSPRKAHGPSHQKDSSIVCFLVGRKASSSHNLLNRIAAVAAAGSCFFYAKCKLDSGSSVVLSIPAVWSEPPYLPWQTTHGFAVH
WSGVFDHRLLGISLCSSRVSPSPSSGVEKETPGIAGDGQKPCAKCLGRDTIANAAADVGPAVVNISVSHGIYGIATAKSMGSGTIIGKDEVTLQDGRTFEGTVMNADFHS
DIAIVKINSKTPLPMAKLGSSSKLRPGDWVVAIGCPLSLQNTVTAGIVSCVDRKSSDLGLGGMRREYLQTDCAINVGNSGGPLVNVDGEVVGVNIMKVDDAVGLSFAVPI
DSVSKITEQFKKRGRVIRPWLGLKMIDLNDMIIEQLKERDAAFPDVTKGVLVAMVTPGSPASRAGFCPGDVVIELDKQPVASIKEIIEIMGDRVGVPLKAVVKRSLNVTI
TLTVLPEESNPDM
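
The CDS and protein fields above can be cross-checked by translation. The following is a minimal sketch that translates the first 30 and the last 42 nucleotides of the CDS and compares them with the start and end of the protein sequence. It assumes the standard nuclear genetic code, which this record does not state explicitly; the function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGAAGGAACTAAGGGCGCCGGAGGAAGA"            # first 30 nt of the CDS
cds_end = "ACTTTGACTGTTCTTCCAGAGGAGTCCAATCCAGATATGTGA"  # last 42 nt of the CDS

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Both fragments agree with the listed protein, which begins `MEGTKGAGGR` and ends `TLTVLPEESNPDM`; the final `TGA` codon is the stop and is not represented in the protein field.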