; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020952 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020952
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
Genome locationscaffold9:4158747..4167238
RNA-Seq ExpressionSpg020952
SyntenySpg020952
Gene Ontology termsGO:0070940 - dephosphorylation of RNA polymerase II C-terminal domain (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008420 - RNA polymerase II CTD heptapeptide repeat phosphatase activity (molecular function)
InterPro domainsIPR039189 - CTD phosphatase Fcp1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607512.1 RNA polymerase II C-terminal domain phosphatase-like 4, partial [Cucurbita argyrosperma subsp. sororia]2.1e-4482.79Show/hide
Query:  RLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGN
        +L  +    MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGN
Subjt:  RLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGN

Query:  MCIVCVQRLDEVSGVTFGYIHK
        MCI+C QRLDE SGVTFGYIHK
Subjt:  MCIVCVQRLDEVSGVTFGYIHK

KAG7037160.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma]2.1e-4482.79Show/hide
Query:  RLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGN
        +L  +    MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGN
Subjt:  RLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGN

Query:  MCIVCVQRLDEVSGVTFGYIHK
        MCI+C QRLDE SGVTFGYIHK
Subjt:  MCIVCVQRLDEVSGVTFGYIHK

XP_022133134.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia]4.6e-4487.07Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+C 
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV

Query:  QRLDEVSGVTFGYIHK
        QRLDE SGVTFGYIHK
Subjt:  QRLDEVSGVTFGYIHK

XP_023525838.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo]2.7e-4488.5Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL

Query:  DEVSGVTFGYIHK
        DE SGVTFGYIHK
Subjt:  DEVSGVTFGYIHK

XP_038890381.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida]2.4e-4589.38Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL
        MS+ TNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAEGDNN  ESERIKRRKVEKLEN EEDILYGVE QSSE +SKQQLCSHPGSFGNMCI+C QRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL

Query:  DEVSGVTFGYIHK
        DE SGVTFGYIHK
Subjt:  DEVSGVTFGYIHK

TrEMBL top hitse value%identityAlignment
A0A6J1BUF9 RNA polymerase II C-terminal domain phosphatase-like2.2e-4487.07Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+C 
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV

Query:  QRLDEVSGVTFGYIHK
        QRLDE SGVTFGYIHK
Subjt:  QRLDEVSGVTFGYIHK

A0A6J1BV42 RNA polymerase II C-terminal domain phosphatase-like2.2e-4487.07Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+C 
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV

Query:  QRLDEVSGVTFGYIHK
        QRLDE SGVTFGYIHK
Subjt:  QRLDEVSGVTFGYIHK

A0A6J1CJQ5 RNA polymerase II C-terminal domain phosphatase-like4.2e-4385.34Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV
        MS+VT+SPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESERIKRRKVEKL   E P+EDI+Y VE QSSEVLSKQQLC HPGSFGNMCI+C 
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCV

Query:  QRLDEVSGVTFGYIHK
        QRLD  SGVTFGYIHK
Subjt:  QRLDEVSGVTFGYIHK

A0A6J1GC38 RNA polymerase II C-terminal domain phosphatase-like1.9e-4387.61Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL
        MS+VTNS AHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL

Query:  DEVSGVTFGYIHK
        DE SGVTFGYIHK
Subjt:  DEVSGVTFGYIHK

A0A6J1ID30 RNA polymerase II C-terminal domain phosphatase-like6.5e-4487.61Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDS P+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRL

Query:  DEVSGVTFGYIHK
        DE SGVTFGYIHK
Subjt:  DEVSGVTFGYIHK

SwissProt top hitse value%identityAlignment
Q00IB6 RNA polymerase II C-terminal domain phosphatase-like 42.1e-1548.28Show/hide
Query:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQ
        MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E LE               E  S +  C HPGSFGNMC VC Q
Subjt:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQ

Query:  RLDEVSGVTFGYIHKQ
        +L+E +GV+F YIHK+
Subjt:  RLDEVSGVTFGYIHKQ

Arabidopsis top hitse value%identityAlignment
AT5G58003.1 C-terminal domain phosphatase-like 41.5e-1648.28Show/hide
Query:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQ
        MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E LE               E  S +  C HPGSFGNMC VC Q
Subjt:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQ

Query:  RLDEVSGVTFGYIHKQ
        +L+E +GV+F YIHK+
Subjt:  RLDEVSGVTFGYIHKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGCGTTTCTGGCAACTCCTTCGCAGACAGCAGCTTTACTTCGCGAGTTCCGGCGGTTTGAGCTGTGGGTCTTCCCCTCTGGCGAGTTTGGACGTTTATTAGGCTC
CACCAGCGAGTGGATGAGCATTGTGACTAATTCTCCGGCTCACTCGTCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCAT
CGCCTGATGAAAAGGCTGAGGGTGACAATAATGTTGTTGAAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGACATTCTGTATGGAGTT
GAAAGGCAAAGTTCAGAAGTATTGTCAAAGCAGCAACTATGCAGTCATCCTGGTTCATTTGGGAATATGTGTATCGTCTGTGTGCAGAGGTTGGATGAGGTATCTGGCGT
GACATTTGGGTATATACATAAGCAACCTGCCTATTGGTTAAGGCAAACAAAAGAGGGAAAGCAGTACATAATGGGGAATCACCTAGCCAACGATCGTGCCAGAAATCCGT
CTCTTCTCATCAATTTCTTGAATGGACAGAGTGATACTAAAGCAGAAAACGGTAAAGAAAAAGAAAAGAGGTCGGAATGTTCTTATGTCGAAGTTATGAAGCTAGATGAT
TCTAATCCCCTCCTGACTAGTGGTAGAGCCACTCTCAAGGTGAAAAACGAGCAATACATGAAGTTAGGTCAATCTGTTGGGAAGACACTCTTACTGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGCGTTTCTGGCAACTCCTTCGCAGACAGCAGCTTTACTTCGCGAGTTCCGGCGGTTTGAGCTGTGGGTCTTCCCCTCTGGCGAGTTTGGACGTTTATTAGGCTC
CACCAGCGAGTGGATGAGCATTGTGACTAATTCTCCGGCTCACTCGTCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCAT
CGCCTGATGAAAAGGCTGAGGGTGACAATAATGTTGTTGAAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGACATTCTGTATGGAGTT
GAAAGGCAAAGTTCAGAAGTATTGTCAAAGCAGCAACTATGCAGTCATCCTGGTTCATTTGGGAATATGTGTATCGTCTGTGTGCAGAGGTTGGATGAGGTATCTGGCGT
GACATTTGGGTATATACATAAGCAACCTGCCTATTGGTTAAGGCAAACAAAAGAGGGAAAGCAGTACATAATGGGGAATCACCTAGCCAACGATCGTGCCAGAAATCCGT
CTCTTCTCATCAATTTCTTGAATGGACAGAGTGATACTAAAGCAGAAAACGGTAAAGAAAAAGAAAAGAGGTCGGAATGTTCTTATGTCGAAGTTATGAAGCTAGATGAT
TCTAATCCCCTCCTGACTAGTGGTAGAGCCACTCTCAAGGTGAAAAACGAGCAATACATGAAGTTAGGTCAATCTGTTGGGAAGACACTCTTACTGAAATAG
Protein sequenceShow/hide protein sequence
MGAFLATPSQTAALLREFRRFELWVFPSGEFGRLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGV
ERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHKQPAYWLRQTKEGKQYIMGNHLANDRARNPSLLINFLNGQSDTKAENGKEKEKRSECSYVEVMKLDD
SNPLLTSGRATLKVKNEQYMKLGQSVGKTLLLK