; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039614 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039614
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
Genome locationchr2:47330372..47347680
RNA-Seq ExpressionLag0039614
SyntenyLag0039614
Gene Ontology termsGO:0070940 - dephosphorylation of RNA polymerase II C-terminal domain (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008420 - RNA polymerase II CTD heptapeptide repeat phosphatase activity (molecular function)
GO:0106306 - protein serine/threonine phosphatase activity (molecular function)
GO:0106307 - protein serine/threonine phosphatase activity (molecular function)
InterPro domainsIPR004274 - FCP1 homology domain
IPR011947 - FCP1-like phosphatase, phosphatase domain
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily
IPR039189 - CTD phosphatase Fcp1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607512.1 RNA polymerase II C-terminal domain phosphatase-like 4, partial [Cucurbita argyrosperma subsp. sororia]3.7e-17888Show/hide
Query:  RLLGTISEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGN
        +L   +   MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGN
Subjt:  RLLGTISEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGN

Query:  MCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTK
        MCI+CGQRLDEE+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHL PEEEYLR+Q DSLEDVTKGSLFLL+SVHTMTK
Subjt:  MCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTK

Query:  LRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASS
        LRPFVHTFLKEASQLFEMYIYTMGERAYA+EMAKLLDPK+EYFSSKVISRDDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASS
Subjt:  LRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASS

Query:  CHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
        CHQFGFNCKSLSELKSDESETDGALA ILKVLKQVH+IFFNELS+DLVDRDVRQVL+       +G  VV SR +
Subjt:  CHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY

XP_022133134.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia]6.3e-17892.88Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH
        QRLDEE+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH
Subjt:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH

Query:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF
        TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPK+EYFS+KVISRDDGTQKH+KGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+
Subjt:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF

Query:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR
        NCKSLSELKSDESETDGALA ILKVLKQVH+IFFNEL DDLVDRDVRQVL+
Subjt:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR

XP_022949466.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita moschata]1.4e-17789.89Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNS AHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL
        DEE+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLEDVTKGSLFLL+SVHTMTKLRPFVHTFL
Subjt:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL

Query:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
        KEASQLFEMYIYTMGERAYA+EMAKLLDPK+EYFSSKVISRDDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
Subjt:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK

Query:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
        SLSELKSDESETDGALA ILKVLKQVH+IFFNELS+DLVDRDVRQVL+       +G  VV SR +
Subjt:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY

XP_023525838.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo]1.6e-17889.89Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL
        DEE+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLEDVTKGSLFLL+SVHTMTKLRPFVHTFL
Subjt:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL

Query:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
        KEASQLFEMYIYTMGERAYA+EMAKLLDPK+EYFSSKVISRDDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
Subjt:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK

Query:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
        SLSELKSDESETDGALA ILKVLKQVH+IFFNE+S+DLVDRDVRQVL+       +G  VV SR +
Subjt:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY

XP_038890381.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida]2.5e-17990.16Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+ TNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAEGDNN  ESERIKRRKVEKLEN EEDILYGVE QSSE +SKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL
        DEE+GVTFGYIHKGLRLNNDEINRLRNIDMKSLL HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSL+DVTKGSLFLLNSVHTMTKLRPFVH+FL
Subjt:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL

Query:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
        KEA+QLFEMYIYTMGERAYAFEMAKLLDPKKEYF+ KVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAW KHK+NLILMERYHFFASSCHQFGFNCK
Subjt:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK

Query:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
        SLSELKSDESETDGALA ILKVLKQVHS+FFNELSDDLVDRDVRQ+L+       +G  VV SR +
Subjt:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY

TrEMBL top hitse value%identityAlignment
A0A6J1BUF9 RNA polymerase II C-terminal domain phosphatase-like1.7e-17693.08Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH
        QRLDEE+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH
Subjt:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH

Query:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF
        TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPK+EYFS+KVISRDDGTQKH+KGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+
Subjt:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF

Query:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVR
        NCKSLSELKSDESETDGALA ILKVLKQVH+IFFNEL DDLVDRDVR
Subjt:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVR

A0A6J1BV42 RNA polymerase II C-terminal domain phosphatase-like3.0e-17892.88Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH
        QRLDEE+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH
Subjt:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH

Query:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF
        TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPK+EYFS+KVISRDDGTQKH+KGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+
Subjt:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF

Query:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR
        NCKSLSELKSDESETDGALA ILKVLKQVH+IFFNEL DDLVDRDVRQVL+
Subjt:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR

A0A6J1CJQ5 RNA polymerase II C-terminal domain phosphatase-like1.1e-17591.17Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VT+SPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESERIKRRKVEKL   E P+EDI+Y VE QSSEVLSKQQLC HPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH
        QRLD E+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSL+DVTKGSLFLLNS+HTMTKLRPF+H
Subjt:  QRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVH

Query:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF
        TFLKEASQLFEMYIYTMGERAYA EMAKLLDPK+ YFS++VISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSC QFG+
Subjt:  TFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGF

Query:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR
        NCKSLSELKSDESETDGALA+ILKVLKQVH+IFFNELSDDLVDRDVRQVL+
Subjt:  NCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR

A0A6J1GC38 RNA polymerase II C-terminal domain phosphatase-like6.8e-17889.89Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNS AHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL
        DEE+GVTFGYIHKGLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLEDVTKGSLFLL+SVHTMTKLRPFVHTFL
Subjt:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL

Query:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
        KEASQLFEMYIYTMGERAYA+EMAKLLDPK+EYFSSKVISRDDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
Subjt:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK

Query:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
        SLSELKSDESETDGALA ILKVLKQVH+IFFNELS+DLVDRDVRQVL+       +G  VV SR +
Subjt:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY

A0A6J1ID30 RNA polymerase II C-terminal domain phosphatase-like2.0e-17789.62Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDS P+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL
        DEE+GVTFGYIHKGLRLNNDEINRLRNIDMK LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLR+Q DSLEDVTKGSLFLL+SVHTMTKLRPFVHTFL
Subjt:  DEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFL

Query:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
        KEASQLFEMYIYTMGERAYA+EMAKLLDPK+EYFSSKVISRDDGTQKHQKGLDVVLG ESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK
Subjt:  KEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCK

Query:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
        SLSELKSDESE+DGALA ILKVLKQVH+IFFNELS+DLVDRDVRQVL+       +G  VV SR +
Subjt:  SLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY

SwissProt top hitse value%identityAlignment
F4JCB2 RNA polymerase II C-terminal domain phosphatase-like 58.3e-4838.59Show/hide
Query:  NVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLIL
        N     + KRRK+E   N           +SS  LS    C H      +CI C   + +  G  F YI  GL+L+++ +   +    K S L  KKL L
Subjt:  NVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLIL

Query:  VLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSK
        VLDLDHTLL++  +  L+  E+YL  +  S    T+  L+ + +V      +TKLRPF+  FLKEA++ F MY+YT G R YA ++ +L+DPKK YF  +
Subjt:  VLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSK

Query:  VISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDD
        VI++ +    H K LD VL +E  V+I+DDT N W  HK NL+ + +Y +F       G +    SE K+DESE++G LA +LK+LK+VH  FF  + ++
Subjt:  VISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDD

Query:  LVDRDVRQVLR
        L  +DVR +L+
Subjt:  LVDRDVRQVLR

Q00IB6 RNA polymerase II C-terminal domain phosphatase-like 41.7e-10960.38Show/hide
Query:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ
        MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E LE               E  S +  C HPGSFGNMC VCGQ
Subjt:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ

Query:  RLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPF
        +L EE GV+F YIHK +RLN DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+D   V+ GSLFLL  +  MTKLRPF
Subjt:  RLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPF

Query:  VHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQF
        VH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +VISRDDGT +H+K LDVVLGQESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF
Subjt:  VHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQF

Query:  GFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
            KSLSELKSDESE DGALA +LKVLKQ H++FF  + + + +RDVR +L+       KG  +V SR +
Subjt:  GFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY

Q8LL04 RNA polymerase II C-terminal domain phosphatase-like 31.0e-3741.25Show/hide
Query:  LNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMG
        +  + + RL   +   +   +KL LVLD+DHTLLNS +   + +  EE LR + +   +     LF    +   TKLRP +  FL++AS+L+E+++YTMG
Subjt:  LNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMG

Query:  ERAYAFEMAKLLDPKKEYFSSKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKS
         + YA EMAKLLDPK   F+ +VIS+ DDG            K L+ V+G ES+V+I+DD+   W +HK NLI +ERY +F  S  QFG    SL EL  
Subjt:  ERAYAFEMAKLLDPKKEYFSSKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKS

Query:  DESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVL
        DE   +G LA+ L V++++H  FF+  S D V  DVR +L
Subjt:  DESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVL

Q8SV03 RNA polymerase II subunit A C-terminal domain phosphatase6.0e-2229.62Show/hide
Query:  CSHPGSFGNMCIVCGQRLDEEAGVTFG-YIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF
        C+HP   G +C VCG  + EE+ +    Y    +++ ++E   +    M++L    KLILVLDLD T+L++T               T SLE   K   F
Subjt:  CSHPGSFGNMCIVCGQRLDEEAGVTFG-YIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF

Query:  LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESA-VLILDDTENAWTKHKENLI
        +++      KLRP +   L+  S+L+E+++YTMG RAYA  + +++DP  +YF  ++I+RD+      K L  +   +   ++ILDD  + W  + ENL+
Subjt:  LLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESA-VLILDDTENAWTKHKENLI

Query:  LMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVD
        L+  + +F           K   E ++ E++   AL   +   K++  I   E++  L D
Subjt:  LMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVD

Q9P376 RNA polymerase II subunit A C-terminal domain phosphatase4.6e-2230.68Show/hide
Query:  IKRRKVEKLENPEEDILY--------GVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEE----------AGVTFGYIHKGLRLNNDEINRLRNIDMK
        + R  VE+ E P E  L          +E  S  V    + C+H  ++G +C +CG+ +  +          A ++  +    L ++ +E +RL + ++K
Subjt:  IKRRKVEKLENPEEDILY--------GVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEE----------AGVTFGYIHKGLRLNNDEINRLRNIDMK

Query:  SLLQHKKLILVLDLDHTLLNST---QLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHT---MTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAK
         L Q K+L L++DLD T++++T    +G    +   +    D L DV   +L    S +T     K RP +  FL++ S+L+E++IYTMG +AYA E+AK
Subjt:  SLLQHKKLILVLDLDHTLLNST---QLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHT---MTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAK

Query:  LLDPKKEYFSSKVISRDDGTQKHQKGLDVVLG-QESAVLILDDTENAWTKHKENLILMERYHFF
        ++DP  + F  +V+SRDD     QK L  +     S V+++DD  + W     NLI +  Y FF
Subjt:  LLDPKKEYFSSKVISRDDGTQKHQKGLDVVLG-QESAVLILDDTENAWTKHKENLILMERYHFF

Arabidopsis top hitse value%identityAlignment
AT2G04930.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein3.4e-4940.86Show/hide
Query:  VLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDS--L
        V +    C H   F  +CI C  ++ +     F YI KGL+L+N+ +   +++  K S L  KKL LVLDLDHTLL+S  + +L+  E YL  +  S   
Subjt:  VLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDS--L

Query:  EDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAW
        ED+ K    + + +  + KLRPFV  FLKEA+++F M++YTMG R YA  + +++DPKK YF ++VI++D+  +   K L++VL +E  V+I+DDT + W
Subjt:  EDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAW

Query:  TKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFF-NELSDDLVDRDVRQVLRLG
          HK NLI + +Y +F  S    G +  S SE K+DE E DG LA +LK+L++VH  FF  E+ + L   DVR +L+ G
Subjt:  TKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFF-NELSDDLVDRDVRQVLRLG

AT3G17550.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein7.7e-4940.07Show/hide
Query:  LENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLG
        +EN   +    +   SS + S +  C H      +CI C   +++  G  F Y+ +GL+L+++     +    +   L  KKL LVLDLDHTLL+S ++ 
Subjt:  LENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLG

Query:  HLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVL
         L+  E+ L  +  S    T+  L+ L+S + +TKLRPFVH FLKEA++LF MY+YTMG R YA  + KL+DPK+ YF  +VI+RD+    + K LD+VL
Subjt:  HLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVL

Query:  GQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR
         +E  V+I+DDT + WT HK NL+ +  YHFF  +  +      S +E K DES+ +G LA +LK+LK+VH  FF  + ++L  +DVR +L+
Subjt:  GQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLR

AT3G19595.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein5.9e-4938.59Show/hide
Query:  NVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLIL
        N     + KRRK+E   N           +SS  LS    C H      +CI C   + +  G  F YI  GL+L+++ +   +    K S L  KKL L
Subjt:  NVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLIL

Query:  VLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSK
        VLDLDHTLL++  +  L+  E+YL  +  S    T+  L+ + +V      +TKLRPF+  FLKEA++ F MY+YT G R YA ++ +L+DPKK YF  +
Subjt:  VLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSK

Query:  VISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDD
        VI++ +    H K LD VL +E  V+I+DDT N W  HK NL+ + +Y +F       G +    SE K+DESE++G LA +LK+LK+VH  FF  + ++
Subjt:  VISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDD

Query:  LVDRDVRQVLR
        L  +DVR +L+
Subjt:  LVDRDVRQVLR

AT5G54210.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein1.4e-4740.44Show/hide
Query:  CSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF
        C H      +C  C   ++   G +F Y+  GL+L++  +   + +  + +    KKL LVLDLDHTLL++  + +LT EE YL  + DS ED+ +    
Subjt:  CSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMK-SLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF

Query:  LLN---SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKEN
         LN   S   + KLRPFVH FLKEA+++F MY+YTMG+R YA  +  L+DP+K YF  +VI+R++    + K LD+VL  E  V+I+DDT + W  HK N
Subjt:  LLN---SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKEN

Query:  LILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFN---ELSDDLVDRDVRQVL
        L+ + +Y++F+          KS +E K DES  DG+LA +LKV+KQV+  FF+   E   D+  +DVR +L
Subjt:  LILMERYHFFASSCHQFGFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFN---ELSDDLVDRDVRQVL

AT5G58003.1 C-terminal domain phosphatase-like 41.2e-11060.38Show/hide
Query:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ
        MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E LE               E  S +  C HPGSFGNMC VCGQ
Subjt:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ

Query:  RLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPF
        +L EE GV+F YIHK +RLN DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+D   V+ GSLFLL  +  MTKLRPF
Subjt:  RLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPF

Query:  VHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQF
        VH+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +VISRDDGT +H+K LDVVLGQESAVLILDDTENAW KHK+NLI++ERYHFF+SSC QF
Subjt:  VHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQF

Query:  GFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY
            KSLSELKSDESE DGALA +LKVLKQ H++FF  + + + +RDVR +L+       KG  +V SR +
Subjt:  GFNCKSLSELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGN----KGNLVVMSRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCCACTTTCCGTTCACTGCTGCCGCCGGCTCTCTCCCGTTCACTGCTGCCGCCGCCATCATCGTCTTTCTTCCAATCGTGCCTCCGCAGCCTCCCTCCCATCTCC
GGCGTCGTGTGCGTGCGTCCGACGTAGGCAGCAGCGACAGCAAGTTCTTCCAGTGGCGGTTTTGTTTCCGTGTGTTTCCGGCGATCCGAGCGGCTCCGACGTTATTTCCT
CTCAACCAACAGAGGGTTGTGAGCTCGCGCGGGGTACGTTTTCCGGCAGTATCAGAGCGCGAACAGTAGCGTATTGGCGCGTTTCTGGCAACTCCTTCGCAACAGCAGCT
TTACTTCGCGAGTTCCAGCGGTTTGAGCTGTGGGTCTTCCCCTCTGGCGAGTTTGGACGTTTATTAGGCACCATTAGCGAGTGGATGAGCATTGTGACTAATTCTCCAGC
TCACTCGTCGAGCAGTGATGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCGCCCGATGAAAAGGCTGAGGGTGACAATAATGTTGTTG
AAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGACATTCTGTATGGAGTTGAAGGGCAAAGTTCAGAAGTATTATCAAAGCAGCAACTA
TGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCGTCTGTGGGCAGAGGTTGGATGAGGAAGCTGGCGTGACATTTGGGTATATACATAAGGGACTCAGACTTAATAA
TGATGAAATTAATCGGCTACGTAACATAGACATGAAGAGCTTGTTGCAACATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAATTCAACGCAACTGG
GGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTTTAGAAGATGTCACGAAAGGCAGTCTTTTCTTATTGAACTCCGTGCATACAATGACAAAGTTG
AGGCCGTTTGTCCACACATTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAGCGAGCATATGCATTTGAAATGGCAAAGTTGTTGGACCC
CAAGAAGGAGTATTTTAGTTCTAAAGTTATTTCACGGGATGATGGCACTCAAAAACATCAAAAAGGTCTGGATGTGGTGCTGGGTCAGGAAAGTGCTGTTCTGATCCTTG
ATGATACTGAAAATGCATGGACAAAACATAAAGAAAACTTGATATTGATGGAGAGATATCATTTTTTTGCTTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCCTTA
TCTGAGTTGAAGAGCGACGAGAGTGAAACCGATGGGGCACTGGCGGCCATCCTCAAAGTTCTCAAGCAAGTCCATAGTATATTCTTTAATGAACTATCGGATGATTTAGT
TGACAGAGATGTGAGGCAGGTCTTGAGATTAGGGAACAAAGGGAACTTAGTGGTTATGTCTAGGAAATACCTCAAGGGATGGGAGGGATTCAAAAATCTTCTCATCAATT
TCTTGAATGGACGGAGTGATACTAAAGCAGAAAGCGTAGGTAAAGAAAAAGAAAAGAGGTCAGGATGTTCTTATGTCGAAGTTCTGAAGTTAGATGATTCTAATCCCCTC
CTGACTAGTGGTAGGGCCACTCCTCAAGACAAGGCCCTCCTTAAATGCCCAAATGTAGAAGTTGCTAGGTTGCTGGCTATGAATAGGGGGTGGGTTACTTTTGGTCCGAT
CACGTTGAAATTGGAAAGGTGGGATGTTAGGAAACATAGTCGGATTTCAGTAGTTCCTTTGACTGTTGATGACAACTCAAGCCTCATTGAATGCCTAGAGGTGGCTATTA
AAGTTAGGAGAAATTATTGCGGTATTGTTCCAATAGAGCTTAAATTGCGGGAGACTAATCTAGAATTTATGGTGTGGGTCGTGACTTTTCAAGATGCAAATTTGTTGATA
GATAGAGTCGTTGGCATCCATGATAGCTTCTCACCCGTGGAGGCTGAGAGATTTTTTCAGGGGCTTGAGGGCTTAGGATTTGAATTGGAACAACTAGATGAAGAAATTTT
GACAGATTATCAGAAAGTTTTCTTCGTAGGTGAAACAAGTTTGGATAATGAGGACAATAACAAGCAGCTGATGGCGATGCCGGGTCGAGAGGATTTCTTGTATTACGCCA
GATCACTTCCCTATTCTCCTAAAGTTCGGAGCTCATTTGTGGGGTCCAATGGCTTAACCACCAAGAACTTGGATGGGTGGTGGATGAGTCTTCACACTGGAGCATTTGAT
CACATGGAGGAAAATGGTACTTTATTTGAGGTCGAAAAGGAATTGAGAAATGACTATAAAGCTTCCCTGACACAACCACCATCAAAGGGCGCAGAGCATATGGATCCGAA
AAAGCAAAATTCAATGGCTCAAAGAAGCCCTAAAGGCGCAATGCGCCTCCTCAAGGCACTTACTGGAGTTGCAACAAGGCATGATGATGAAGAGAAAGAAGAAGAGGAAT
GGGGAAGACAGGGATCAATATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGCCACTTTCCGTTCACTGCTGCCGCCGGCTCTCTCCCGTTCACTGCTGCCGCCGCCATCATCGTCTTTCTTCCAATCGTGCCTCCGCAGCCTCCCTCCCATCTCC
GGCGTCGTGTGCGTGCGTCCGACGTAGGCAGCAGCGACAGCAAGTTCTTCCAGTGGCGGTTTTGTTTCCGTGTGTTTCCGGCGATCCGAGCGGCTCCGACGTTATTTCCT
CTCAACCAACAGAGGGTTGTGAGCTCGCGCGGGGTACGTTTTCCGGCAGTATCAGAGCGCGAACAGTAGCGTATTGGCGCGTTTCTGGCAACTCCTTCGCAACAGCAGCT
TTACTTCGCGAGTTCCAGCGGTTTGAGCTGTGGGTCTTCCCCTCTGGCGAGTTTGGACGTTTATTAGGCACCATTAGCGAGTGGATGAGCATTGTGACTAATTCTCCAGC
TCACTCGTCGAGCAGTGATGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCGCCCGATGAAAAGGCTGAGGGTGACAATAATGTTGTTG
AAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGACATTCTGTATGGAGTTGAAGGGCAAAGTTCAGAAGTATTATCAAAGCAGCAACTA
TGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCGTCTGTGGGCAGAGGTTGGATGAGGAAGCTGGCGTGACATTTGGGTATATACATAAGGGACTCAGACTTAATAA
TGATGAAATTAATCGGCTACGTAACATAGACATGAAGAGCTTGTTGCAACATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAATTCAACGCAACTGG
GGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTTTAGAAGATGTCACGAAAGGCAGTCTTTTCTTATTGAACTCCGTGCATACAATGACAAAGTTG
AGGCCGTTTGTCCACACATTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAGCGAGCATATGCATTTGAAATGGCAAAGTTGTTGGACCC
CAAGAAGGAGTATTTTAGTTCTAAAGTTATTTCACGGGATGATGGCACTCAAAAACATCAAAAAGGTCTGGATGTGGTGCTGGGTCAGGAAAGTGCTGTTCTGATCCTTG
ATGATACTGAAAATGCATGGACAAAACATAAAGAAAACTTGATATTGATGGAGAGATATCATTTTTTTGCTTCAAGTTGTCACCAATTTGGCTTCAACTGTAAATCCTTA
TCTGAGTTGAAGAGCGACGAGAGTGAAACCGATGGGGCACTGGCGGCCATCCTCAAAGTTCTCAAGCAAGTCCATAGTATATTCTTTAATGAACTATCGGATGATTTAGT
TGACAGAGATGTGAGGCAGGTCTTGAGATTAGGGAACAAAGGGAACTTAGTGGTTATGTCTAGGAAATACCTCAAGGGATGGGAGGGATTCAAAAATCTTCTCATCAATT
TCTTGAATGGACGGAGTGATACTAAAGCAGAAAGCGTAGGTAAAGAAAAAGAAAAGAGGTCAGGATGTTCTTATGTCGAAGTTCTGAAGTTAGATGATTCTAATCCCCTC
CTGACTAGTGGTAGGGCCACTCCTCAAGACAAGGCCCTCCTTAAATGCCCAAATGTAGAAGTTGCTAGGTTGCTGGCTATGAATAGGGGGTGGGTTACTTTTGGTCCGAT
CACGTTGAAATTGGAAAGGTGGGATGTTAGGAAACATAGTCGGATTTCAGTAGTTCCTTTGACTGTTGATGACAACTCAAGCCTCATTGAATGCCTAGAGGTGGCTATTA
AAGTTAGGAGAAATTATTGCGGTATTGTTCCAATAGAGCTTAAATTGCGGGAGACTAATCTAGAATTTATGGTGTGGGTCGTGACTTTTCAAGATGCAAATTTGTTGATA
GATAGAGTCGTTGGCATCCATGATAGCTTCTCACCCGTGGAGGCTGAGAGATTTTTTCAGGGGCTTGAGGGCTTAGGATTTGAATTGGAACAACTAGATGAAGAAATTTT
GACAGATTATCAGAAAGTTTTCTTCGTAGGTGAAACAAGTTTGGATAATGAGGACAATAACAAGCAGCTGATGGCGATGCCGGGTCGAGAGGATTTCTTGTATTACGCCA
GATCACTTCCCTATTCTCCTAAAGTTCGGAGCTCATTTGTGGGGTCCAATGGCTTAACCACCAAGAACTTGGATGGGTGGTGGATGAGTCTTCACACTGGAGCATTTGAT
CACATGGAGGAAAATGGTACTTTATTTGAGGTCGAAAAGGAATTGAGAAATGACTATAAAGCTTCCCTGACACAACCACCATCAAAGGGCGCAGAGCATATGGATCCGAA
AAAGCAAAATTCAATGGCTCAAAGAAGCCCTAAAGGCGCAATGCGCCTCCTCAAGGCACTTACTGGAGTTGCAACAAGGCATGATGATGAAGAGAAAGAAGAAGAGGAAT
GGGGAAGACAGGGATCAATATGA
Protein sequenceShow/hide protein sequence
MRPLSVHCCRRLSPVHCCRRHHRLSSNRASAASLPSPASCACVRRRQQRQQVLPVAVLFPCVSGDPSGSDVISSQPTEGCELARGTFSGSIRARTVAYWRVSGNSFATAA
LLREFQRFELWVFPSGEFGRLLGTISEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQL
CSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKGLRLNNDEINRLRNIDMKSLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKL
RPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENAWTKHKENLILMERYHFFASSCHQFGFNCKSL
SELKSDESETDGALAAILKVLKQVHSIFFNELSDDLVDRDVRQVLRLGNKGNLVVMSRKYLKGWEGFKNLLINFLNGRSDTKAESVGKEKEKRSGCSYVEVLKLDDSNPL
LTSGRATPQDKALLKCPNVEVARLLAMNRGWVTFGPITLKLERWDVRKHSRISVVPLTVDDNSSLIECLEVAIKVRRNYCGIVPIELKLRETNLEFMVWVVTFQDANLLI
DRVVGIHDSFSPVEAERFFQGLEGLGFELEQLDEEILTDYQKVFFVGETSLDNEDNNKQLMAMPGREDFLYYARSLPYSPKVRSSFVGSNGLTTKNLDGWWMSLHTGAFD
HMEENGTLFEVEKELRNDYKASLTQPPSKGAEHMDPKKQNSMAQRSPKGAMRLLKALTGVATRHDDEEKEEEEWGRQGSI