; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021076 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021076
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
Genome locationscaffold9:3484577..3489068
RNA-Seq ExpressionSpg021076
SyntenySpg021076
Gene Ontology termsGO:0070940 - dephosphorylation of RNA polymerase II C-terminal domain (biological process)
GO:0005634 - nucleus (cellular component)
GO:0008420 - RNA polymerase II CTD heptapeptide repeat phosphatase activity (molecular function)
InterPro domainsIPR039189 - CTD phosphatase Fcp1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607512.1 RNA polymerase II C-terminal domain phosphatase-like 4, partial [Cucurbita argyrosperma subsp. sororia]1.2e-4589.38Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHK
        DEE+GVTFGYIHK
Subjt:  DEEAGVTFGYIHK

KAG7037160.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-4589.38Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHK
        DEE+GVTFGYIHK
Subjt:  DEEAGVTFGYIHK

XP_022133134.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia]2.1e-4587.93Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHK
        QRLDEE+GVTFGYIHK
Subjt:  QRLDEEAGVTFGYIHK

XP_023525838.1 RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo]1.2e-4589.38Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHK
        DEE+GVTFGYIHK
Subjt:  DEEAGVTFGYIHK

XP_038890381.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida]1.1e-4690.27Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+ TNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAEGDNN  ESERIKRRKVEKLEN EEDILYGVE QSSE +SKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHK
        DEE+GVTFGYIHK
Subjt:  DEEAGVTFGYIHK

TrEMBL top hitse value%identityAlignment
A0A6J1BUF9 RNA polymerase II C-terminal domain phosphatase-like9.9e-4687.93Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHK
        QRLDEE+GVTFGYIHK
Subjt:  QRLDEEAGVTFGYIHK

A0A6J1BV42 RNA polymerase II C-terminal domain phosphatase-like9.9e-4687.93Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L   E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHK
        QRLDEE+GVTFGYIHK
Subjt:  QRLDEEAGVTFGYIHK

A0A6J1CJQ5 RNA polymerase II C-terminal domain phosphatase-like1.9e-4486.21Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG
        MS+VT+SPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESERIKRRKVEKL   E P+EDI+Y VE QSSEVLSKQQLC HPGSFGNMCI+CG
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL---ENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCG

Query:  QRLDEEAGVTFGYIHK
        QRLD E+GVTFGYIHK
Subjt:  QRLDEEAGVTFGYIHK

A0A6J1GC38 RNA polymerase II C-terminal domain phosphatase-like8.4e-4588.5Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNS AHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHK
        DEE+GVTFGYIHK
Subjt:  DEEAGVTFGYIHK

A0A6J1ID30 RNA polymerase II C-terminal domain phosphatase-like2.9e-4588.5Show/hide
Query:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL
        MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDS P+EKAEG NN VE+ERIKR KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+CGQRL
Subjt:  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRL

Query:  DEEAGVTFGYIHK
        DEE+GVTFGYIHK
Subjt:  DEEAGVTFGYIHK

SwissProt top hitse value%identityAlignment
Q00IB6 RNA polymerase II C-terminal domain phosphatase-like 42.1e-1650.43Show/hide
Query:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ
        MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E LE               E  S +  C HPGSFGNMC VCGQ
Subjt:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ

Query:  RLDEEAGVTFGYIHK
        +L EE GV+F YIHK
Subjt:  RLDEEAGVTFGYIHK

Arabidopsis top hitse value%identityAlignment
AT5G58003.1 C-terminal domain phosphatase-like 41.5e-1750.43Show/hide
Query:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ
        MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E LE               E  S +  C HPGSFGNMC VCGQ
Subjt:  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQ

Query:  RLDEEAGVTFGYIHK
        +L EE GV+F YIHK
Subjt:  RLDEEAGVTFGYIHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAAGGAAAGAGAAAAGCCCCAACAGCTCCACGCCAGTTGTTTTCTTCTTCACGCCGTCTCATCCTCCGTTTCAGCCGTTCACTGGTGCCGCCGGCTCTCTCCCG
TTCACTGCTGCCGCCGCCATCATTCTTTCTTTCAATCGTGCCTCCGCAGCCTCCCTTTAGCAGTTTTACACTGATTTTGTACTCCCTTGTTGGTAGCGCCGCCACTGCCT
CCGATCAAACTTCGGCGTCCACAGAGAGCGTCGGGAGGACTTTTCCCTCCCATCTCCAGCGTCGTGTGCGTGCGTCCGACGTAGGCAGCAGCGATAGCAAGTTCTTCCGG
TGTGGGTTTTGTTTCCGTGTGTTTCCGGCGATCCGAGCGGCTCCGACGTTATTTCCTCTCAACCAACAGAGGGTTGTGAGCTCGCGCGGGGTGCTTTTTCCGGCAGCACC
ACCAGCGAGTGGGAAGCAATTAGCCGAGATGAGCATTGTGACTAATTCTCCAGCTCACTCGTCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCC
ATTCCTCTGACTCATCGCCCGATGAAAAGGCTGAGGGTGACAATAATGTTGTTGAAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGAC
ATTCTGTATGGAGTTGAAGGGCAAAGTTCAGAAGTATTATCAAAGCAGCAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCGTCTGTGGGCAGAGGTTGGA
TGAGGAAGCTGGCGTGACATTTGGGTATATACATAAGGTATATTTTCTTTACATGGATGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAAAGGAAAGAGAAAAGCCCCAACAGCTCCACGCCAGTTGTTTTCTTCTTCACGCCGTCTCATCCTCCGTTTCAGCCGTTCACTGGTGCCGCCGGCTCTCTCCCG
TTCACTGCTGCCGCCGCCATCATTCTTTCTTTCAATCGTGCCTCCGCAGCCTCCCTTTAGCAGTTTTACACTGATTTTGTACTCCCTTGTTGGTAGCGCCGCCACTGCCT
CCGATCAAACTTCGGCGTCCACAGAGAGCGTCGGGAGGACTTTTCCCTCCCATCTCCAGCGTCGTGTGCGTGCGTCCGACGTAGGCAGCAGCGATAGCAAGTTCTTCCGG
TGTGGGTTTTGTTTCCGTGTGTTTCCGGCGATCCGAGCGGCTCCGACGTTATTTCCTCTCAACCAACAGAGGGTTGTGAGCTCGCGCGGGGTGCTTTTTCCGGCAGCACC
ACCAGCGAGTGGGAAGCAATTAGCCGAGATGAGCATTGTGACTAATTCTCCAGCTCACTCGTCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCC
ATTCCTCTGACTCATCGCCCGATGAAAAGGCTGAGGGTGACAATAATGTTGTTGAAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGAC
ATTCTGTATGGAGTTGAAGGGCAAAGTTCAGAAGTATTATCAAAGCAGCAACTATGCAGTCATCCTGGTTCATTTGGAAATATGTGTATCGTCTGTGGGCAGAGGTTGGA
TGAGGAAGCTGGCGTGACATTTGGGTATATACATAAGGTATATTTTCTTTACATGGATGTGTGA
Protein sequenceShow/hide protein sequence
MNKGKRKAPTAPRQLFSSSRRLILRFSRSLVPPALSRSLLPPPSFFLSIVPPQPPFSSFTLILYSLVGSAATASDQTSASTESVGRTFPSHLQRRVRASDVGSSDSKFFR
CGFCFRVFPAIRAAPTLFPLNQQRVVSSRGVLFPAAPPASGKQLAEMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEED
ILYGVEGQSSEVLSKQQLCSHPGSFGNMCIVCGQRLDEEAGVTFGYIHKVYFLYMDV