; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022854 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022854
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionpolyadenylation and cleavage factor homolog 4-like isoform X2
Genome locationtig00000589:2434851..2436794
RNA-Seq ExpressionSgr022854
SyntenySgr022854
Gene Ontology termsGO:0006369 - termination of RNA polymerase II transcription (biological process)
GO:0006378 - mRNA polyadenylation (biological process)
GO:0006379 - mRNA cleavage (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0005849 - mRNA cleavage factor complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily
IPR045154 - Protein PCF11-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043917.1 polyadenylation and cleavage factor-like protein 4-like isoform X1 [Cucumis melo var. makuwa]6.8e-14073.02Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VKLSRTKVEE  LPSDPLPPSSP +S STETS+V        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSATTEINNLIGFEFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRTEANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMA
        GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GLATDIKMA
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMA

KAG7017425.1 Polyadenylation and cleavage factor-like 4 [Cucurbita argyrosperma subsp. argyrosperma]1.5e-14266.19Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QN S QDVGN+QP SS+NP LPS+SSPAHTQ TFSEPK VGESSLGPPS ES S LVKLS+ KVE+TPLPSDPLPPSS  NS STETSNV        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PENLKSGD VT SIPVPSIP+ SSS SS++P+ PS+ AA+SST PPPSATTEINNLIG+EFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLFDDIP+QCKICGLRLK EE+LDTH+ WH  RTE+ NS RAPRRWYPSS DWVSGNARLLLDA +S+DKS  MEEDNEPMVPADEDQFACVLC
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMASPKI------------VLNAMALDWNYTGGHLA
        GELFEDFY+QE+GKWMFKGA +ITIPS G EVGSTNE+ A GPIVH +C+TESS+++LGLATDIK  +  +             L  + L+  +   H A
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMASPKI------------VLNAMALDWNYTGGHLA

Query:  HAIGGGMRKGSLNMSSSVVVVLL
         AIG GMRKGS+++S+S+VV L+
Subjt:  HAIGGGMRKGSLNMSSSVVVVLL

XP_008442798.1 PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X1 [Cucumis melo]2.6e-13972.95Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VKLSRTKVEE  LPSDPLPPSSP +S STETS+V        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSATTEINNLIGFEFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRTEANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM
        GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GLATDIKM
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM

XP_008442799.1 PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X2 [Cucumis melo]2.6e-13972.95Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VKLSRTKVEE  LPSDPLPPSSP +S STETS+V        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSATTEINNLIGFEFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRTEANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM
        GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GLATDIKM
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM

XP_022144638.1 uncharacterized protein LOC111014280, partial [Momordica charantia]4.4e-14775.75Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QNISFQDVGN+QPHSSI P LPSRSSPAHTQ T SE K+VGESSLGPPSRESPSALVKLSRTKVEETP PSDP+PPSSP +S+STETSNV        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKS-GDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRK
                                       P+NLKS GDAVTSSIPVPSIP+SSSS+SS + +PPSEPA KSST  PPSATTEI+NLIGF+FSSHVIRK
Subjt:  -------------------------------PENLKS-GDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRK

Query:  FHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVL
        FHPSV+SGLFDDIPYQCK+CGLRLKLEEQL+TH+QWH LRTEAN SNRAPRRWYPSSDDWV   ARL LDA TS+D SD+MEEDNEPMVPADEDQFACVL
Subjt:  FHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVL

Query:  CGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM
        CGELFEDF++Q++G WMFKGATYIT PSAG E+GSTNEQGARGPIVHT+C+TESSVYDLGLATDIKM
Subjt:  CGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM

TrEMBL top hitse value%identityAlignment
A0A0A0LGI0 Uncharacterized protein1.4e-13872.95Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QNISFQDVGN++P SSI P LPSRSSPAH   TFSEPKI GESS+GPPS ESPS +VKLS+TKVEE  LPSDPLPPSSP +S STETSNV        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PE LKSGDAVTSS+PVPSIP+SSS  S  K + PS+ AAK ST+PPPSATTEINNLIGFEFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLF+DIPYQCKICGLRLK EE LD H +WH LRTEANNS+ APRRWYPSSDDW+SGNAR LLDAVTS+D+SD MEEDNEPMVPADEDQFACV+C
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM
        GELFED Y+QE+G WMFKGA YITIPS G EVGSTNEQ ARGPIVHT C+TESSVYD+GLATDIKM
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM

A0A1S3B6K6 polyadenylation and cleavage factor homolog 4-like isoform X11.2e-13972.95Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VKLSRTKVEE  LPSDPLPPSSP +S STETS+V        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSATTEINNLIGFEFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRTEANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM
        GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GLATDIKM
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM

A0A1S3B794 polyadenylation and cleavage factor homolog 4-like isoform X21.2e-13972.95Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VKLSRTKVEE  LPSDPLPPSSP +S STETS+V        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSATTEINNLIGFEFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRTEANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM
        GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GLATDIKM
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM

A0A5A7TQ23 Polyadenylation and cleavage factor-like protein 4-like isoform X13.3e-14073.02Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VKLSRTKVEE  LPSDPLPPSSP +S STETS+V        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF
                                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSATTEINNLIGFEFSSHVIRKF
Subjt:  -------------------------------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKF

Query:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC
        HPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRTEANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Subjt:  HPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC

Query:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMA
        GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GLATDIKMA
Subjt:  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMA

A0A6J1CTT8 uncharacterized protein LOC1110142802.1e-14775.75Show/hide
Query:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------
        M+QNISFQDVGN+QPHSSI P LPSRSSPAHTQ T SE K+VGESSLGPPSRESPSALVKLSRTKVEETP PSDP+PPSSP +S+STETSNV        
Subjt:  MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNV--------

Query:  -------------------------------PENLKS-GDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRK
                                       P+NLKS GDAVTSSIPVPSIP+SSSS+SS + +PPSEPA KSST  PPSATTEI+NLIGF+FSSHVIRK
Subjt:  -------------------------------PENLKS-GDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRK

Query:  FHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVL
        FHPSV+SGLFDDIPYQCK+CGLRLKLEEQL+TH+QWH LRTEAN SNRAPRRWYPSSDDWV   ARL LDA TS+D SD+MEEDNEPMVPADEDQFACVL
Subjt:  FHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVL

Query:  CGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM
        CGELFEDF++Q++G WMFKGATYIT PSAG E+GSTNEQGARGPIVHT+C+TESSVYDLGLATDIKM
Subjt:  CGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKM

SwissProt top hitse value%identityAlignment
Q0WPF2 Polyadenylation and cleavage factor homolog 44.1e-3138.64Show/hide
Query:  IGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHAL--RTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSM---DKSDKMEE
        +G EF + +++  + S IS L+ D+P QC  CGLR K +E+   HM WH    R   N+     R+W+ S+  W+SG   L  +AV      + + + ++
Subjt:  IGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHAL--RTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSM---DKSDKMEE

Query:  DNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDL
        D +  VPADEDQ +C LCGE FEDFY+ E  +WM+KGA Y+  P    E  +  ++   GPIVH  C  ES+  D+
Subjt:  DNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDL

Q9C710 Polyadenylation and cleavage factor homolog 14.1e-1527.95Show/hide
Query:  NPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSS
        N S   R++ ++T   + +P + G  +  P     P    KL      +  L  D LP   P  ++ T T N P  ++S + V ++    ++  P++ S+
Subjt:  NPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSS

Query:  LSSMKPKPPSEPAAKSS---------TNPPPSATTEINNL----IGFEFSS-HVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR---
        + S+  +   +P   S           N     T E +N     +G  F +   +   H SVI  L+ D+P QC  CGLR K +E+   HM WH  +   
Subjt:  LSSMKPKPPSEPAAKSS---------TNPPPSATTEINNL----IGFEFSS-HVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR---

Query:  ----TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTS-----MDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGG
            T      +  R W  S+  W+          V S       K  K EE  + MVPADEDQ  C LC E FE+F++ E   WM+K A Y+T      
Subjt:  ----TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTS-----MDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGG

Query:  EVGSTNEQGARGPIVHTNCVTE
                   G IVH  C+ E
Subjt:  EVGSTNEQGARGPIVHTNCVTE

Q9FIX8 Polyadenylation and cleavage factor homolog 51.6e-1428.1Show/hide
Query:  PLPSDPLPP-SSPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSSLSSMKPKPPSEPAAKSS---------TNPPPSATTEINN----LIGFEF
        PLP   L P  S        T N P  ++S + V ++    ++  P++ S++ S+  +   +P   S           N     T+E +N     +G  F
Subjt:  PLPSDPLPP-SSPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSSLSSMKPKPPSEPAAKSS---------TNPPPSATTEINN----LIGFEF

Query:  SS-HVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR-------TEANNSNRAPRRWYPSSDDWV---SGNARLLLDAVTSMDKSDKME
         +   +   H SVI  L+ D+P QC  CG+R K +E+   HM WH  +       T      +  R W  S+  W+   +G   + + +    +   K E
Subjt:  SS-HVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR-------TEANNSNRAPRRWYPSSDDWV---SGNARLLLDAVTSMDKSDKME

Query:  ED---NEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTE
        +D    + MVPADEDQ  C LC E FE+F++ E   WM+K A Y+T                 G IVH  C+ E
Subjt:  ED---NEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTE

Arabidopsis top hitse value%identityAlignment
AT1G66500.1 Pre-mRNA cleavage complex II2.9e-1627.95Show/hide
Query:  NPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSS
        N S   R++ ++T   + +P + G  +  P     P    KL      +  L  D LP   P  ++ T T N P  ++S + V ++    ++  P++ S+
Subjt:  NPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSS

Query:  LSSMKPKPPSEPAAKSS---------TNPPPSATTEINNL----IGFEFSS-HVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR---
        + S+  +   +P   S           N     T E +N     +G  F +   +   H SVI  L+ D+P QC  CGLR K +E+   HM WH  +   
Subjt:  LSSMKPKPPSEPAAKSS---------TNPPPSATTEINNL----IGFEFSS-HVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR---

Query:  ----TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTS-----MDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGG
            T      +  R W  S+  W+          V S       K  K EE  + MVPADEDQ  C LC E FE+F++ E   WM+K A Y+T      
Subjt:  ----TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTS-----MDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGG

Query:  EVGSTNEQGARGPIVHTNCVTE
                   G IVH  C+ E
Subjt:  EVGSTNEQGARGPIVHTNCVTE

AT2G36480.1 ENTH/VHS family protein8.8e-3733.33Show/hide
Query:  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE---ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--
        + H  +NP   +LP+ S P     + +   ++    +   S    S    L+  T V+   E    SDPL             +++ TE  + P   +  
Subjt:  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE---ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--

Query:  SGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWH
        S D  T+S    S+  + +  S +   P + P  K    P  ++ +E  +LIG +F +  IR+ HPSVIS LFDD+P+ C  C +RLK +E+LD HM+ H
Subjt:  SGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWH

Query:  -ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST
           + E + +N   R W+P  D+W++  A  L      +    +   ++   V ADE Q AC+LCGE+FED+++QEM +WMFKGA+Y+T P A  E    
Subjt:  -ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST

Query:  NEQGARGPIVHTNCVTESSVYDLGLATDIK
            A GPIVHT C+T SS+  L +   IK
Subjt:  NEQGARGPIVHTNCVTESSVYDLGLATDIK

AT2G36480.2 ENTH/VHS family protein5.1e-3732.84Show/hide
Query:  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE---ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--
        + H  +NP   +LP+ S P     + +   ++    +   S    S    L+  T V+   E    SDPL             +++ TE  + P   +  
Subjt:  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE---ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--

Query:  SGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWH
        S D  T+S    S+  + +  S +   P + P  K    P  ++ +E  +LIG +F +  IR+ HPSVIS LFDD+P+ C  C +RLK +E+LD HM+ H
Subjt:  SGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWH

Query:  -ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST
           + E + +N   R W+P  D+W++  A  L      +    +   ++   V ADE Q AC+LCGE+FED+++QEM +WMFKGA+Y+T P A  E    
Subjt:  -ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST

Query:  NEQGARGPIVHTNCVTESSVYDLGLATDIKMASPKIVL
            A GPIVHT C+T SS+  L +   IK    +  L
Subjt:  NEQGARGPIVHTNCVTESSVYDLGLATDIKMASPKIVL

AT2G36480.3 ENTH/VHS family protein8.8e-3733.33Show/hide
Query:  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE---ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--
        + H  +NP   +LP+ S P     + +   ++    +   S    S    L+  T V+   E    SDPL             +++ TE  + P   +  
Subjt:  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE---ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--

Query:  SGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWH
        S D  T+S    S+  + +  S +   P + P  K    P  ++ +E  +LIG +F +  IR+ HPSVIS LFDD+P+ C  C +RLK +E+LD HM+ H
Subjt:  SGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWH

Query:  -ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST
           + E + +N   R W+P  D+W++  A  L      +    +   ++   V ADE Q AC+LCGE+FED+++QEM +WMFKGA+Y+T P A  E    
Subjt:  -ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST

Query:  NEQGARGPIVHTNCVTESSVYDLGLATDIK
            A GPIVHT C+T SS+  L +   IK
Subjt:  NEQGARGPIVHTNCVTESSVYDLGLATDIK

AT4G04885.1 PCF11P-similar protein 42.9e-3238.64Show/hide
Query:  IGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHAL--RTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSM---DKSDKMEE
        +G EF + +++  + S IS L+ D+P QC  CGLR K +E+   HM WH    R   N+     R+W+ S+  W+SG   L  +AV      + + + ++
Subjt:  IGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHAL--RTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSM---DKSDKMEE

Query:  DNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDL
        D +  VPADEDQ +C LCGE FEDFY+ E  +WM+KGA Y+  P    E  +  ++   GPIVH  C  ES+  D+
Subjt:  DNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGCAGAATATCAGCTTCCAAGATGTGGGAAATATTCAACCCCACTCAAGCATCAACCCTTCTTTACCAAGCCGGTCTTCTCCTGCCCACACTCAGTGTACATTCTC
AGAGCCAAAGATTGTGGGAGAATCTTCATTAGGTCCTCCATCTCGTGAAAGCCCATCAGCTCTGGTTAAGCTATCTCGGACTAAGGTAGAAGAGACACCATTACCATCTG
ATCCACTGCCACCTTCATCTCCTACGAATAGTACATCCACTGAAACTTCAAATGTGCCTGAAAATTTGAAGTCAGGTGATGCTGTGACTAGTTCTATACCAGTTCCTTCC
ATCCCTGTTTCCTCTTCCAGTCTATCATCTATGAAACCTAAACCACCTTCAGAACCTGCTGCTAAGAGCTCCACTAATCCACCTCCATCAGCCACAACTGAGATAAACAA
CCTCATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCCTCTGTGATCAGTGGACTCTTTGATGATATTCCATACCAATGTAAGATCTGTGGTCTTCGAC
TGAAACTTGAAGAGCAGTTGGATACGCACATGCAGTGGCATGCATTAAGAACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGGTGGTATCCAAGTTCAGATGATTGG
GTTTCTGGAAATGCCAGACTTCTACTTGATGCTGTCACTTCTATGGACAAGTCCGACAAAATGGAAGAAGATAATGAGCCAATGGTTCCTGCAGACGAAGATCAATTTGC
TTGTGTTTTATGTGGTGAACTTTTTGAAGATTTTTATAATCAAGAGATGGGTAAGTGGATGTTCAAAGGAGCAACGTACATCACCATCCCATCAGCTGGTGGTGAGGTAG
GAAGCACAAATGAACAAGGTGCTAGAGGACCCATTGTGCACACAAATTGTGTAACTGAAAGTTCAGTATATGATTTGGGACTGGCAACTGATATTAAGATGGCCAGTCCA
AAGATAGTTTTGAATGCTATGGCATTGGACTGGAACTACACTGGTGGACATCTTGCTCATGCAATTGGCGGGGGGATGCGAAAAGGGAGTTTGAATATGAGTTCTTCGGT
TGTGGTAGTTCTGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGGCAGAATATCAGCTTCCAAGATGTGGGAAATATTCAACCCCACTCAAGCATCAACCCTTCTTTACCAAGCCGGTCTTCTCCTGCCCACACTCAGTGTACATTCTC
AGAGCCAAAGATTGTGGGAGAATCTTCATTAGGTCCTCCATCTCGTGAAAGCCCATCAGCTCTGGTTAAGCTATCTCGGACTAAGGTAGAAGAGACACCATTACCATCTG
ATCCACTGCCACCTTCATCTCCTACGAATAGTACATCCACTGAAACTTCAAATGTGCCTGAAAATTTGAAGTCAGGTGATGCTGTGACTAGTTCTATACCAGTTCCTTCC
ATCCCTGTTTCCTCTTCCAGTCTATCATCTATGAAACCTAAACCACCTTCAGAACCTGCTGCTAAGAGCTCCACTAATCCACCTCCATCAGCCACAACTGAGATAAACAA
CCTCATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCCTCTGTGATCAGTGGACTCTTTGATGATATTCCATACCAATGTAAGATCTGTGGTCTTCGAC
TGAAACTTGAAGAGCAGTTGGATACGCACATGCAGTGGCATGCATTAAGAACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGGTGGTATCCAAGTTCAGATGATTGG
GTTTCTGGAAATGCCAGACTTCTACTTGATGCTGTCACTTCTATGGACAAGTCCGACAAAATGGAAGAAGATAATGAGCCAATGGTTCCTGCAGACGAAGATCAATTTGC
TTGTGTTTTATGTGGTGAACTTTTTGAAGATTTTTATAATCAAGAGATGGGTAAGTGGATGTTCAAAGGAGCAACGTACATCACCATCCCATCAGCTGGTGGTGAGGTAG
GAAGCACAAATGAACAAGGTGCTAGAGGACCCATTGTGCACACAAATTGTGTAACTGAAAGTTCAGTATATGATTTGGGACTGGCAACTGATATTAAGATGGCCAGTCCA
AAGATAGTTTTGAATGCTATGGCATTGGACTGGAACTACACTGGTGGACATCTTGCTCATGCAATTGGCGGGGGGATGCGAAAAGGGAGTTTGAATATGAGTTCTTCGGT
TGTGGTAGTTCTGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAG
Protein sequenceShow/hide protein sequence
MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNVPENLKSGDAVTSSIPVPS
IPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDW
VSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMASP
KIVLNAMALDWNYTGGHLAHAIGGGMRKGSLNMSSSVVVVLLSAKRDNREW