; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G000650 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G000650
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
Genome locationchr08:1386353..1410746
RNA-Seq ExpressionLsi08G000650
SyntenyLsi08G000650
Gene Ontology termsGO:0070940 - dephosphorylation of RNA polymerase II C-terminal domain (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008420 - RNA polymerase II CTD heptapeptide repeat phosphatase activity (molecular function)
InterPro domainsIPR001357 - BRCT domain
IPR004274 - FCP1 homology domain
IPR011947 - FCP1-like phosphatase, phosphatase domain
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily
IPR036420 - BRCT domain superfamily
IPR039189 - CTD phosphatase Fcp1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025178.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma]5.0e-19180.54Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP +N E  NN ESERIKRRKVEKL  SEED L GVEEQ+LEVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
        EESGVTFGYIH+GLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQ DSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
Subjt:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK

Query:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------
        EASQLFEMYIYTMGERAYA+EMAKLLDPKREYF++KVISRDDGTQKHQKGLD+VLGQESAVLILDDTE                                
Subjt:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------

Query:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS
                                      NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQA+NHHLWKMVEQLGGTCSTELD SVTHVVS
Subjt:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS

Query:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        TD GTEKSRWALKE KFLVHPRWIEASNYFWKRQAE+NFPVEQ+KKQ
Subjt:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

XP_022133134.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia]7.7e-19280.22Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE SE   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQ
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ

Query:  RLDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT
        RLDEESGVTFGYIH+GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT
Subjt:  RLDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT

Query:  FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE-----------------------------
        FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKH+KGLDVVLGQESAVLILDDTE                             
Subjt:  FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE-----------------------------

Query:  ---------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTH
                                         NEL DDLVDRDVRQVLKTVRSKVLEGCKVVF+RVFPTKF ADNHHLWKMVEQLGG+CST+LD SVTH
Subjt:  ---------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTH

Query:  VVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        VVSTDAGTEKSRWA+KE+KFLVHPRWIEASNYFWKRQ EENFPVEQTKKQ
Subjt:  VVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

XP_022925487.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Cucurbita moschata]7.7e-19280.76Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP +N E  NN ESERIKRRKVEKL  SEED L GVEEQ+LEVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
        EESGVTFGYIH+GLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQ DSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
Subjt:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK

Query:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------
        EASQLFEMYIYTMGERAYA+EMAKLLDPKREYF++KVISRDDGTQKHQKGLD+VLGQESAVLILDDTE                                
Subjt:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------

Query:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS
                                      NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQA+NHHLWKMVEQLGGTCSTELD SVTHVVS
Subjt:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS

Query:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        TD GTEKSRWALKEEKFLVHPRWIEASNYFWKRQAE+NFPVEQ+KKQ
Subjt:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

XP_023525838.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo]8.5e-19179.42Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
        EESGVTFGYIH+GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLEDVTKGSLFLL+SVHTMTKLRPFVHTFLK
Subjt:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK

Query:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------
        EASQLFEMYIYTMGERAYA+EMAKLLDPKREYFS+KVISRDDGTQKHQKGLDVVLG ESAVLILDDTE                                
Subjt:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------

Query:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS
                                      NE+S+DLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVE+LGGTCSTELD SVTH+VS
Subjt:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS

Query:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        TDAGTEKSRWA+KE+KFLVHP+WIEASNYFWKR+AEE FPVE TKKQ
Subjt:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

XP_038890381.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida]8.8e-19681.66Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNNAESERIKRRKVEKLENSEEDILYGVEEQ+ E +SKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
        EESGVTFGYIH+GLRLNNDEINRLRNIDMK+LL HKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSL+DVTKGSLFLLNSVHTMTKLRPFVH+FLK
Subjt:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK

Query:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------
        EA+QLFEMYIYTMGERAYAFEMAKLLDPK+EYF+ KVISRDDGTQKHQKGLDVVLGQESAVLILDDTE                                
Subjt:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------

Query:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS
                                      NELSDDLVDRDVRQ+LKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELD SVTHVVS
Subjt:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS

Query:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
         DAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQ+EENFPVEQTKKQ
Subjt:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

TrEMBL top hitse value%identityAlignment
A0A6J1BV42 RNA polymerase II C-terminal domain phosphatase-like3.7e-19280.22Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESER+KRRKVE+LE SE   EDI YGVEEQ+ EVLSKQQLCSHPGSFGNMCI+CGQ
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ

Query:  RLDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT
        RLDEESGVTFGYIH+GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT
Subjt:  RLDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT

Query:  FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE-----------------------------
        FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKH+KGLDVVLGQESAVLILDDTE                             
Subjt:  FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE-----------------------------

Query:  ---------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTH
                                         NEL DDLVDRDVRQVLKTVRSKVLEGCKVVF+RVFPTKF ADNHHLWKMVEQLGG+CST+LD SVTH
Subjt:  ---------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTH

Query:  VVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        VVSTDAGTEKSRWA+KE+KFLVHPRWIEASNYFWKRQ EENFPVEQTKKQ
Subjt:  VVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

A0A6J1CJQ5 RNA polymerase II C-terminal domain phosphatase-like5.9e-19079.33Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ
        MSL T+SPAHSSSSDDFAAFLDVALDSHSSDSSP E  EGDNN ESERIKRRKVEKLE SE   EDI+Y VEEQ+ EVLSKQQLC HPGSFGNMCIICGQ
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSE---EDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQ

Query:  RLDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT
        RLD ESGVTFGYIH+GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGH+TPEEEYLRSQTDSL+DVTKGSLFLLNS+HTMTKLRPF+HT
Subjt:  RLDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHT

Query:  FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE-----------------------------
        FLKEASQLFEMYIYTMGERAYA EMAKLLDPKR YFSA+VISRDDGTQKHQKGLDVVLGQESAVLILDDTE                             
Subjt:  FLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE-----------------------------

Query:  ---------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTH
                                         NELSDDLVDRDVRQVLKTVRSKVLEGCKVVF+RVFP KFQADNHHLWKMVEQLGG+CST+LD SVTH
Subjt:  ---------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTH

Query:  VVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        VVSTDAGTEKSRWA+KE+KFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
Subjt:  VVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

A0A6J1EFC1 RNA polymerase II C-terminal domain phosphatase-like3.7e-19280.76Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSLATNSPAHSSSSDDFAAFLDVAL+SHSSDSSP +N E  NN ESERIKRRKVEKL  SEED L GVEEQ+LEVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
        EESGVTFGYIH+GLRLNNDEINRLRNIDMK+LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQ DSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
Subjt:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK

Query:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------
        EASQLFEMYIYTMGERAYA+EMAKLLDPKREYF++KVISRDDGTQKHQKGLD+VLGQESAVLILDDTE                                
Subjt:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------

Query:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS
                                      NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQA+NHHLWKMVEQLGGTCSTELD SVTHVVS
Subjt:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS

Query:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        TD GTEKSRWALKEEKFLVHPRWIEASNYFWKRQAE+NFPVEQ+KKQ
Subjt:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

A0A6J1GC38 RNA polymerase II C-terminal domain phosphatase-like8.6e-18979.19Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSL TNS AHSSSSDDFAAFLDVALDSHSSDSSP E  EG NN E+ERIKR KVEKLENS EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
        EESGVTFGYIH+GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEE+YLR+QTDSLEDVTKGSLFLL+SVHTMTKLRPFVHTFLK
Subjt:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK

Query:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------
        EASQLFEMYIYTMGERAYA+EMAKLLDPKREYFS+KVISRDDGTQKHQKGLDVVLG ESAVLILDDTE                                
Subjt:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------

Query:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS
                                      NELS+DLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELD SVTH+VS
Subjt:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS

Query:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        TDAG EKSRWA+KE+KFLVHP+WIEASNYFWKR+AEE F VE TKKQ
Subjt:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

A0A6J1ID30 RNA polymerase II C-terminal domain phosphatase-like5.9e-19079.42Show/hide
Query:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD
        MSL TNSPAHSSSSDDFAAFLDVALDSHSSDS P E  EG NN E+ERIKR KVEKLENS EDILYGVEE + EVLSKQQLCSHPGSFGNMCIICGQRLD
Subjt:  MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLD

Query:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK
        EESGVTFGYIH+GLRLNNDEINRLRNIDMK LLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLR+Q DSLEDVTKGSLFLL+SVHTMTKLRPFVHTFLK
Subjt:  EESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLK

Query:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------
        EASQLFEMYIYTMGERAYA+EMAKLLDPKREYFS+KVISRDDGTQKHQKGLDVVLG ESAVLILDDTE                                
Subjt:  EASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTE--------------------------------

Query:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS
                                      NELS+DLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELD SVTH+VS
Subjt:  ------------------------------NELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVS

Query:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        TDAGTEKSRWA+KE+KFLVHP+WIEASNYFWKR+AEE FPVE TKKQ
Subjt:  TDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

SwissProt top hitse value%identityAlignment
F4JCB2 RNA polymerase II C-terminal domain phosphatase-like 53.0e-2936.33Show/hide
Query:  DNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLI
        +N +   + KRRK+E   N           ++   LS    C H      +CI C   + +  G  F YI  GL+L+++ +   +    K + L  KKL 
Subjt:  DNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLI

Query:  LVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSA
        LVLDLDHTLL++  +  L+  E+YL  +  S    T+  L+ + +V      +TKLRPF+  FLKEA++ F MY+YT G R YA ++ +L+DPK+ YF  
Subjt:  LVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSV----HTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSA

Query:  KVISRDDGTQKHQKGLDVVLGQESAVLILDDTENELSD---DLVD
        +VI++ +    H K LD VL +E  V+I+DDT N   D   +LVD
Subjt:  KVISRDDGTQKHQKGLDVVLGQESAVLILDDTENELSD---DLVD

Q00IB6 RNA polymerase II C-terminal domain phosphatase-like 41.0e-11453.1Show/hide
Query:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR
        MS+A++SP H SSSSDD AAFLD  LDS S  SS P E  E +++ ES  +KR+K+E LE               E  S +  C HPGSFGNMC +CGQ+
Subjt:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR

Query:  LDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPFV
        L EE+GV+F YIH+ +RLN DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+D   V+ GSLFLL  +  MTKLRPFV
Subjt:  LDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPFV

Query:  HTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEN--------------------------
        H+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +VISRDDGT +H+K LDVVLGQESAVLILDDTEN                          
Subjt:  HTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEN--------------------------

Query:  -------ELSDD-----------------------------LVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSV
               EL  D                             + +RDVR +LK VR ++L+GCK+VFSRVFPTK + ++H LWKM E+LG TC+TE+D SV
Subjt:  -------ELSDD-----------------------------LVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSV

Query:  THVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        THVV+ D GTEK+RWA++E+K++VH  WI+A+NY W +Q EENF +EQ KKQ
Subjt:  THVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ

Q8LL04 RNA polymerase II C-terminal domain phosphatase-like 32.6e-4133.84Show/hide
Query:  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAF
        R+R ++ +N +   +KL LVLD+DHTLLNS +   + +  EE LR + +   +     LF    +   TKLRP +  FL++AS+L+E+++YTMG + YA 
Subjt:  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAF

Query:  EMAKLLDPKREYFSAKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDT--------------------------------------------E
        EMAKLLDPK   F+ +VIS+ DDG            K L+ V+G ES+V+I+DD+                                            E
Subjt:  EMAKLLDPKREYFSAKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDT--------------------------------------------E

Query:  NELSDDLV----------------DRDVRQVLKTVRSKVLEGCKVVFSRVFPT-KFQADNHHLWKMVEQLGGTCSTELDGSVTHVVSTDAGTEKSRWALK
          L+  L                 + DVR +L + + K+L GC++VFSR+ P  + +   H LW+  EQ G  C+T++D  VTHVV+   GT+K  WAL 
Subjt:  NELSDDLV----------------DRDVRQVLKTVRSKVLEGCKVVFSRVFPT-KFQADNHHLWKMVEQLGGTCSTELDGSVTHVVSTDAGTEKSRWALK

Query:  EEKFLVHPRWIEASNYFWKRQAEENFPV
          +F+VHP W+EAS + ++R  E  + +
Subjt:  EEKFLVHPRWIEASNYFWKRQAEENFPV

Q95QG8 RNA polymerase II subunit A C-terminal domain phosphatase9.9e-2525.26Show/hide
Query:  EVLSKQQLCSHPGSFGNMCIICGQRLDEESG----------VTFGYIHR--GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEE
        +V++    C+H     +MC  CG+ L E+ G               IH    L +++     + + D  NL+ ++KL+L++DLD T+++++        +
Subjt:  EVLSKQQLCSHPGSFGNMCIICGQRLDEESG----------VTFGYIHR--GLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEE

Query:  EYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDD--GTQKHQKGLDVVLG-QE
        + +   T++ +D+TK +L   + V+T TKLRP    FL + S ++EM+I T G+R YA  +A++LDP    F  +++SRD+    Q     L  +    +
Subjt:  EYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDD--GTQKHQKGLDVVLG-QE

Query:  SAVLILDDTEN-----------------------------------ELSDDL--------VDR-----------------------DVRQVLKTVRSKVL
        + V+I+DD  +                                   ++ DD         ++R                       DV++V+K  R KVL
Subjt:  SAVLILDDTEN-----------------------------------ELSDDL--------VDR-----------------------DVRQVLKTVRSKVL

Query:  EGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEEN
        +GC +VFS + P   + +   ++++  Q G     ++   VTHVV    GT+K   A +  KF+V  +W+ A    W + A+EN
Subjt:  EGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEEN

Q9P376 RNA polymerase II subunit A C-terminal domain phosphatase2.1e-1929.65Show/hide
Query:  EEQNLEVLSK-----QQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRG----------LRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNST-
        +E+++E  SK      + C+H  ++G +C ICG+ +  +  + +  + R           L ++ +E +RL + ++K L Q K+L L++DLD T++++T 
Subjt:  EEQNLEVLSK-----QQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRG----------LRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNST-

Query:  --QLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHT---MTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKH
           +G    +   +    D L DV   +L    S +T     K RP +  FL++ S+L+E++IYTMG +AYA E+AK++DP  + F  +V+SRDD     
Subjt:  --QLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHT---MTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKH

Query:  QKGLDVVLGQESAVLILDDTENELSD
        QK L  +   +++++++ D   ++ D
Subjt:  QKGLDVVLGQESAVLILDDTENELSD

Arabidopsis top hitse value%identityAlignment
AT1G20320.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein1.2e-3038.91Show/hide
Query:  VEEQNLEVLSKQQ--LCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYL
        +E   LE  SK    +C H      +C  C   +D + G  F Y+  GL+L++  +   +++  +   L  +KL LVLDLDHTLL+S  +  L+  E+YL
Subjt:  VEEQNLEVLSKQQ--LCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYL

Query:  RSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLIL
          ++D  ED     L+ L+    + KLRPFVH FLKEA+++F MY+YTMG R YA  + K +DPK+ YF  +VI+RD+      K LD+VL  E  V+I+
Subjt:  RSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLIL

Query:  DDTENELSDDLVDRDVRQVLK
        DDT +   D   +R++ Q+ K
Subjt:  DDTENELSDDLVDRDVRQVLK

AT2G33540.1 C-terminal domain phosphatase-like 31.8e-4233.84Show/hide
Query:  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAF
        R+R ++ +N +   +KL LVLD+DHTLLNS +   + +  EE LR + +   +     LF    +   TKLRP +  FL++AS+L+E+++YTMG + YA 
Subjt:  RLRNIDMKN-LLQHKKLILVLDLDHTLLNSTQLGHL-TPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAF

Query:  EMAKLLDPKREYFSAKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDT--------------------------------------------E
        EMAKLLDPK   F+ +VIS+ DDG            K L+ V+G ES+V+I+DD+                                            E
Subjt:  EMAKLLDPKREYFSAKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDT--------------------------------------------E

Query:  NELSDDLV----------------DRDVRQVLKTVRSKVLEGCKVVFSRVFPT-KFQADNHHLWKMVEQLGGTCSTELDGSVTHVVSTDAGTEKSRWALK
          L+  L                 + DVR +L + + K+L GC++VFSR+ P  + +   H LW+  EQ G  C+T++D  VTHVV+   GT+K  WAL 
Subjt:  NELSDDLV----------------DRDVRQVLKTVRSKVLEGCKVVFSRVFPT-KFQADNHHLWKMVEQLGGTCSTELDGSVTHVVSTDAGTEKSRWALK

Query:  EEKFLVHPRWIEASNYFWKRQAEENFPV
          +F+VHP W+EAS + ++R  E  + +
Subjt:  EEKFLVHPRWIEASNYFWKRQAEENFPV

AT3G17550.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein3.0e-3239.72Show/hide
Query:  LENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLG
        +EN   +    + E +  + S +  C H      +CI C   +++  G  F Y+ +GL+L+++     +    +   L  KKL LVLDLDHTLL+S ++ 
Subjt:  LENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLG

Query:  HLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVL
         L+  E+ L  +  S    T+  L+ L+S + +TKLRPFVH FLKEA++LF MY+YTMG R YA  + KL+DPKR YF  +VI+RD+    + K LD+VL
Subjt:  HLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVL

Query:  GQESAVLILDDTEN
         +E  V+I+DDT +
Subjt:  GQESAVLILDDTEN

AT5G54210.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein1.2e-3039.8Show/hide
Query:  CSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF
        C H      +C  C   ++   G +F Y+  GL+L++  +   + +  +      KKL LVLDLDHTLL++  + +LT EE YL  + DS ED+ +    
Subjt:  CSHPGSFGNMCIICGQRLDEESGVTFGYIHRGLRLNNDEINRLRNIDMK-NLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLF

Query:  LLN---SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENELSD
         LN   S   + KLRPFVH FLKEA+++F MY+YTMG+R YA  +  L+DP++ YF  +VI+R++    + K LD+VL  E  V+I+DDT +   D
Subjt:  LLN---SVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENELSD

AT5G58003.1 C-terminal domain phosphatase-like 47.3e-11653.1Show/hide
Query:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR
        MS+A++SP H SSSSDD AAFLD  LDS S  SS P E  E +++ ES  +KR+K+E LE               E  S +  C HPGSFGNMC +CGQ+
Subjt:  MSLATNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQR

Query:  LDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPFV
        L EE+GV+F YIH+ +RLN DEI+RLR+ D + L + +KL LVLDLDHTLLN+T L  L PEEEYL+S T SL+D   V+ GSLFLL  +  MTKLRPFV
Subjt:  LDEESGVTFGYIHRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLED---VTKGSLFLLNSVHTMTKLRPFV

Query:  HTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEN--------------------------
        H+FLKEAS++F MYIYTMG+R YA +MAKLLDPK EYF  +VISRDDGT +H+K LDVVLGQESAVLILDDTEN                          
Subjt:  HTFLKEASQLFEMYIYTMGERAYAFEMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTEN--------------------------

Query:  -------ELSDD-----------------------------LVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSV
               EL  D                             + +RDVR +LK VR ++L+GCK+VFSRVFPTK + ++H LWKM E+LG TC+TE+D SV
Subjt:  -------ELSDD-----------------------------LVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELDGSV

Query:  THVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ
        THVV+ D GTEK+RWA++E+K++VH  WI+A+NY W +Q EENF +EQ KKQ
Subjt:  THVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAGTGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCACCCTGTGAAAA
TACCGAGGGTGACAATAATGCTGAAAGTGAGAGGATAAAGCGTCGTAAGGTGGAGAAACTGGAAAACTCAGAGGAGGATATTCTGTATGGAGTTGAAGAGCAAAATTTAG
AAGTATTATCAAAGCAACAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATA
CATCGGGGACTCAGACTTAATAATGATGAAATTAACCGGCTACGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACAC
ACTGTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAGAGGAGTATTTAAGGAGTCAAACAGATTCTCTAGAAGATGTCACGAAAGGCAGCCTTTTCCTATTGAACT
CCGTTCATACAATGACAAAGTTGAGGCCATTTGTCCATACGTTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAGCGAGCATATGCTTTT
GAAATGGCAAAGTTGTTGGACCCCAAGAGGGAGTATTTTAGTGCAAAAGTTATTTCTCGGGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATGTGGTGCTGGGTCA
GGAAAGTGCTGTTCTGATACTCGATGATACTGAAAATGAACTCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTATTGAAGACAGTTCGTAGCAAAGTTCTCGAGG
GATGCAAAGTCGTCTTCAGCCGGGTCTTCCCTACCAAATTTCAGGCTGACAACCATCATCTCTGGAAGATGGTAGAGCAGTTGGGAGGCACTTGCTCAACTGAACTTGAC
GGATCCGTGACACACGTGGTCTCAACGGATGCTGGAACGGAGAAGTCGCGTTGGGCTTTGAAAGAGGAGAAGTTTCTGGTCCATCCACGGTGGATAGAAGCATCGAACTA
CTTCTGGAAACGGCAAGCAGAAGAAAACTTTCCTGTTGAGCAAACCAAGAAACAATAA
mRNA sequenceShow/hide mRNA sequence
TTATCACCTTCAGTTTCACATATACAGCCGCACGGCCCCCTTCTCTCCCTCGCGAGTTGCGAGTTCCCTCACCCACAGCGCCGGCATTTTCATTCTCCTCAGCCGCACGA
CCGCCGTCGTCCACCGTTCACCCATCGGCCGTCGCGCCCCAGCCTCCGTTCGCGCCGCCGTACGCTGGTGAGAGTACCGTCTCCCTTTCGCCTCTTCCGGTGTGTTTCCC
CTTCTGCGACGAACGACGGCCGACAACCCATCATCTCCAGCGTCTTCATCTTCTCTTCAGCGTGTGCATCGACCCACGCTACGACGGATGGTTTGTTCGACGGCTGTGCA
ACCAGCCGACGTAACCCTACGGATTCGCGGCCAGATTTGCATTCGTTTGAGTTCTCAGCAGCGGGTTCCTCACTATCGGCAAGGTTAGGATTAAGTTTATTTGAACACCA
ACCAGCAAACCTTCGAGGCTGTTGGCGGCGTCCAGTAGTTCGCAACTGATTCGGAGCATTCGAATTAAGATGAGCCTTGCAACTAATTCTCCAGCTCACTCATCAAGCAG
TGACGATTTTGCTGCGTTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCACCCTGTGAAAATACCGAGGGTGACAATAATGCTGAAAGTGAGAGGATAAAGC
GTCGTAAGGTGGAGAAACTGGAAAACTCAGAGGAGGATATTCTGTATGGAGTTGAAGAGCAAAATTTAGAAGTATTATCAAAGCAACAACTATGCAGTCATCCTGGTTCA
TTTGGAAATATGTGTATCATATGTGGGCAGAGGTTGGATGAGGAATCTGGCGTGACATTTGGGTATATACATCGGGGACTCAGACTTAATAATGATGAAATTAACCGGCT
ACGTAACATAGACATGAAGAACTTGTTGCAGCATAAAAAGCTTATCCTGGTTCTTGATCTGGATCACACACTGTTAAACTCAACTCAGCTGGGGCATTTGACACCTGAAG
AGGAGTATTTAAGGAGTCAAACAGATTCTCTAGAAGATGTCACGAAAGGCAGCCTTTTCCTATTGAACTCCGTTCATACAATGACAAAGTTGAGGCCATTTGTCCATACG
TTTTTGAAAGAAGCTAGTCAATTATTCGAGATGTATATATACACTATGGGGGAGCGAGCATATGCTTTTGAAATGGCAAAGTTGTTGGACCCCAAGAGGGAGTATTTTAG
TGCAAAAGTTATTTCTCGGGATGATGGCACTCAAAAGCATCAAAAAGGTCTTGATGTGGTGCTGGGTCAGGAAAGTGCTGTTCTGATACTCGATGATACTGAAAATGAAC
TCTCGGATGATTTGGTTGACAGAGATGTGAGGCAGGTATTGAAGACAGTTCGTAGCAAAGTTCTCGAGGGATGCAAAGTCGTCTTCAGCCGGGTCTTCCCTACCAAATTT
CAGGCTGACAACCATCATCTCTGGAAGATGGTAGAGCAGTTGGGAGGCACTTGCTCAACTGAACTTGACGGATCCGTGACACACGTGGTCTCAACGGATGCTGGAACGGA
GAAGTCGCGTTGGGCTTTGAAAGAGGAGAAGTTTCTGGTCCATCCACGGTGGATAGAAGCATCGAACTACTTCTGGAAACGGCAAGCAGAAGAAAACTTTCCTGTTGAGC
AAACCAAGAAACAATAACACGGTTTCCCTCTTTACAGTAGTCCCACATTCCTCTTAAATGGAGCTTGCATTTGTTGGGGGGGTGCTGTTAGTGTTTCCCCATACATATCA
CTTATGGACTCTCACCTTCTTTGTAGGTCAGCATCTCATTTTTGAAGTATATCCCTAAAATCAACTTCAATGGTTGATTGAAAGTCAAGCTCTTTGGTTTCTGTAGATGT
GTAGATGCATTGTGTTGTAATTTTGGGTAACCATTTTGAGTTGGGTTATAATCATAGGGGTGTTATGTTTGGCTTGTGTTTCCTTTTTAAAAAAAAATTCAAGCCTTAAC
AACTATGGTTCTGTATTTGTAATTCCTCTAATGTATTTGTCTCAGATCAAAGATGGCATTAATTGAAAAGTTTCATGAAGGAATCTAGAAGAATTACATTTTCGAATAAT
GGGGAGTAAGAAGGAAAATTCTCAGGGAACTAATGAAGCTTGGGACGTGAGCTGAAATGCAGTTACATACATATGGGGACAACAAGGGAGGTTTAAGAAGGGTTTTGATA
GTTGGGAGGAAAATCATTATGGCTGAGGCTGAAGCTTTCTTGAATGTGCTGGATGGCACGGCATGAGTATGATTGGAGCCCTACAATATTTAACACATATATAAGACCGA
ACATTGCTTATATAGTGAATTACCTTTGTCAATTCCTTCGTCTTCCTATGGATGTTTGGATGTTTACTGGCAGGTAGTCAAATGACTTCTTTGGTACGCAAGTGGGACAA
CATTTTGGGATTACGCCATATTCAAACGCTGATTCGGCCTCTAATGTTGATAGCATTGTTGCAAGTTTGTCTAATGCTGATTGCACCTTCGTGGGTGAAAACTAGTTTCC
TAGTCATCCAAAATAATAGTGTGGTTGCTTGATCAAGTACGAGCCTAAATACAGGGCTCCAGCTCATGCCTCTGCAGAAGTTACCTGAATCCAGTAACACTCCGGTGTAA
GTTGTGTTTCATCCCATCCCTACTTCAATCTATGACCTAACACTTTTTTCTATTTTTCCTCTTTTTTCTTCCCCTCCTCTTCAATACCCTCTACTCCACTTCTTTGTTTC
TCCTACCCCTTGGTTGTCTTCTTGTTATTGATTATGTTGCTTCATGTTAACTATTTAACCGTTTATCCCAAAATTTAT
Protein sequenceShow/hide protein sequence
MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPCENTEGDNNAESERIKRRKVEKLENSEEDILYGVEEQNLEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYI
HRGLRLNNDEINRLRNIDMKNLLQHKKLILVLDLDHTLLNSTQLGHLTPEEEYLRSQTDSLEDVTKGSLFLLNSVHTMTKLRPFVHTFLKEASQLFEMYIYTMGERAYAF
EMAKLLDPKREYFSAKVISRDDGTQKHQKGLDVVLGQESAVLILDDTENELSDDLVDRDVRQVLKTVRSKVLEGCKVVFSRVFPTKFQADNHHLWKMVEQLGGTCSTELD
GSVTHVVSTDAGTEKSRWALKEEKFLVHPRWIEASNYFWKRQAEENFPVEQTKKQ