; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022853 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022853
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionpolyadenylation and cleavage factor homolog 4-like isoform X2
Genome locationtig00000589:2429718..2434809
RNA-Seq ExpressionSgr022853
SyntenySgr022853
Gene Ontology termsGO:0006369 - termination of RNA polymerase II transcription (biological process)
GO:0006378 - mRNA polyadenylation (biological process)
GO:0006379 - mRNA cleavage (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0005849 - mRNA cleavage factor complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR006569 - CID domain
IPR008942 - ENTH/VHS
IPR045154 - Protein PCF11-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043917.1 polyadenylation and cleavage factor-like protein 4-like isoform X1 [Cucumis melo var. makuwa]5.4e-31081.75Show/hide
Query:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
        M+MESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG+NIV QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
Subjt:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI

Query:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
        ITNLTIIAGENLQAAKAIS T+ ANILE                   VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
Subjt:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH

Query:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA
        LFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQA RP PHSIHVNPKYIERQRLQQSGRVKGM  DAT  +TNV+QDV QAKISTGRPWADA
Subjt:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA

Query:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN
        PIKVLD QRP+RDAPND+A          DYEYGSDLSR   +GRRVVDEGRDKPW +AGSN+ EKLSGQRNGFNIKLGYENY APKS NTGARLLPVQN
Subjt:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN

Query:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ
        FSSSS  R LSTNWKNSEEEEFMWG+M+SMLTGHGAPAI +STGKDQWTPEDSDNSG  NK  S+RDTGASVDREASSDSQSSEQRE+GDSGQQRSS WQ
Subjt:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ

Query:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL
        +QESISLDGLR GVPRKN      YGA+LTALSG +SSVDQMGGRPQ+T SNIG SGHGFLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS  L
Subjt:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL

Query:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS
        VD HVPHQ+H+ KT+SFSN+DPRKR+ QDAAL L PSVRPDN QKP+  DL+A ASSIP S+P HQFSLSESLKPDVTQ E SSQ +VSIP  DFGP SS
Subjt:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS

Query:  DGNSVPD
         G +VPD
Subjt:  DGNSVPD

KAG6596545.1 Polyadenylation and cleavage factor-like 4, partial [Cucurbita argyrosperma subsp. sororia]5.4e-31081.38Show/hide
Query:  MESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITN
        MESSRRPFDRTREPGLKK RLADEAERGGNINGRPFPQRP+ SG+NIVQPRFRASDRDSGSSDSGRGGYQPQP QHQELVSQYRTALAELTFNSKPIITN
Subjt:  MESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITN

Query:  LTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFG
        LTIIAGENLQAAKAISATVCANILE                   V SEQKLPSLYLLDSIVKNIGRDYIKYFAA+LPEVFCKAYRQVD  VH SMRHLFG
Subjt:  LTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFG

Query:  TWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKV
        TWKGVFPPQTLQ+IEKELGF+ +S SSSGTI+SKP+LQ+ RPPHSIHVNPKYIERQRLQQSGRVKGMT+DAT+ATTNVTQDV QAKISTGRPWADA IKV
Subjt:  TWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKV

Query:  LDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS
         D QRP+RDAPNDI          ADYEYGSDLSR PGIGRR VDEGRDKPWS  GSN+ EKLSGQRNGFNIKLGYENYPAP+SANTGARLLP QNFSSS
Subjt:  LDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS

Query:  S--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQES
        S  RGLSTNWKNSEEEEFMWGEMNSMLTGHGA AI +S GKDQWTPEDSDNSG  NK  S+RDTG SVDREASSDSQSSEQRE+GDSGQQRSS+WQ+QE 
Subjt:  S--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQES

Query:  ISLDGLRGGVPRKNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVDHH
        +SLDGLRGG+P+KN A    YGA+LTALSG +SSVDQMGGRPQ+TSSNIG SGH FLNKGGSGSIG VG Q FPSR+VAF SGQPPLHQRPPSP  VD H
Subjt:  ISLDGLRGGVPRKNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVDHH

Query:  VPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDG
        +PHQM NHKTSSFSN+DPRKR+ QDA+L   P+V+ DNL+KP+PQD QA+ASSIP S+P   FSLSESLKPDV Q E S QH+VSIP  DFGPPSS G
Subjt:  VPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDG

XP_008442798.1 PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X1 [Cucumis melo]5.4e-31081.75Show/hide
Query:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
        M+MESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG+NIV QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
Subjt:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI

Query:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
        ITNLTIIAGENLQAAKAIS T+ ANILE                   VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
Subjt:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH

Query:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA
        LFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQA RP PHSIHVNPKYIERQRLQQSGRVKGM  DAT  +TNV+QDV QAKISTGRPWADA
Subjt:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA

Query:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN
        PIKVLD QRP+RDAPND+A          DYEYGSDLSR   +GRRVVDEGRDKPW +AGSN+ EKLSGQRNGFNIKLGYENY APKS NTGARLLPVQN
Subjt:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN

Query:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ
        FSSSS  R LSTNWKNSEEEEFMWG+M+SMLTGHGAPAI +STGKDQWTPEDSDNSG  NK  S+RDTGASVDREASSDSQSSEQRE+GDSGQQRSS WQ
Subjt:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ

Query:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL
        +QESISLDGLR GVPRKN      YGA+LTALSG +SSVDQMGGRPQ+T SNIG SGHGFLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS  L
Subjt:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL

Query:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS
        VD HVPHQ+H+ KT+SFSN+DPRKR+ QDAAL L PSVRPDN QKP+  DL+A ASSIP S+P HQFSLSESLKPDVTQ E SSQ +VSIP  DFGP SS
Subjt:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS

Query:  DGNSVPD
         G +VPD
Subjt:  DGNSVPD

XP_022144638.1 uncharacterized protein LOC111014280, partial [Momordica charantia]0.0e+0083.94Show/hide
Query:  EPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAA
        EPGLKKPRL DEAERGGNINGRPFPQRPVVSG+NIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAA
Subjt:  EPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAA

Query:  KAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQ
        KA++ATVCANI+E                   VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVF KAYRQVDPSVHPSMRHLFGTWKGVFPPQ LQ
Subjt:  KAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQ

Query:  IIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPN
        IIEKELGFMPSSSSSSGTI SKPDLQ  RPPHSIHVNPKYIERQRLQQSGRVKG+ +DAT  TTNVTQDV QAKISTGRPWADAPIKVLD QRP+RDAPN
Subjt:  IIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPN

Query:  DI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNF--SSSSRGLSTNWKN
        D+          ADYEYGSDLSR PGIGRRV+DEGRDKPWSAAGSNV EKLSGQRNGFN+K GYENYPAPKSANTGARLLP+QNF  SSSSRGLSTNWKN
Subjt:  DI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNF--SSSSRGLSTNWKN

Query:  SEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPR
        SEEEEFMWGEMNSMLTGHG P I +S GKDQW PEDSDNSG  NKP S+RD GASVDREASSDSQSSEQRE+GDSGQQRSS WQ+QESIS+DGLRGGVPR
Subjt:  SEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPR

Query:  KNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVDHHVPHQMHNHKTSS
        KNLA    YGA+LT LSGASSSVDQMGGR Q+TSSNIG SGHGFLNKGGSGS G +GHQRFPSR VAFP GQPPLHQRPPSPSLVDH VPHQMH+HKT S
Subjt:  KNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVDHHVPHQMHNHKTSS

Query:  FSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDGNSVPD
        FSN+DPRK++ QDAALNL P+VRPD+LQKP+PQDL A ASS+PAS+P HQFSLSESLKPDVTQPE SSQ +VS  V DFGP  S GNS+PD
Subjt:  FSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDGNSVPD

XP_038906013.1 polyadenylation and cleavage factor homolog 4 [Benincasa hispida]0.0e+0083.31Show/hide
Query:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
        M+MESSRRPFDRTREPGLKKPRLADEA+RGGNINGRPFPQRPVVSG+NIV QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
Subjt:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI

Query:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
        ITNLTIIAGENLQAAKAISATV ANILE                   VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
Subjt:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH

Query:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA
        LFGTWKGVFPPQ LQIIEKELGF+PS SSSSG ITSKPDLQA RP PHSIHVNPKYIERQRLQQSGRVKGMT+D T ATT  +QDV QAKISTGRPW DA
Subjt:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA

Query:  PIKVLDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN
        PIKVLD QRP+RDAPND+          ADYEYGSDLSR  G+GRRVVDEGRDKPWS+AGSN+ +KLSGQRNGFN+KLGYENYPAPKSANTGARLLP+QN
Subjt:  PIKVLDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN

Query:  FS--SSSRGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ
        FS  SS+R LSTNWKNSEEEEFMWGEMNSMLTGHGAPAI  STGKDQWTPEDSDNSG  NKP S+RDTGASVDREASSDSQSSEQRE+GDSGQQRSS WQ
Subjt:  FS--SSSRGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ

Query:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL
        +QESISLDGLR GVPRKN      YGA+LTALSGA+SSVDQMGGRPQ+TSSNIG SGHGFL+KGGSG +G VGHQRFPSRSVAFPSGQP LHQ PPSPSL
Subjt:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL

Query:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS
        VD H+PHQ+H+ K +SFSN+DPRKR+ QDAAL L  SVRPDNLQKP+P DLQASASSIPA +P HQFSLSESLKP+VTQ E SSQH+VSIP  DFGP SS
Subjt:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS

Query:  DGNSVPD
         G +VPD
Subjt:  DGNSVPD

TrEMBL top hitse value%identityAlignment
A0A1S3B6K6 polyadenylation and cleavage factor homolog 4-like isoform X12.6e-31081.75Show/hide
Query:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
        M+MESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG+NIV QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
Subjt:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI

Query:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
        ITNLTIIAGENLQAAKAIS T+ ANILE                   VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
Subjt:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH

Query:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA
        LFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQA RP PHSIHVNPKYIERQRLQQSGRVKGM  DAT  +TNV+QDV QAKISTGRPWADA
Subjt:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA

Query:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN
        PIKVLD QRP+RDAPND+A          DYEYGSDLSR   +GRRVVDEGRDKPW +AGSN+ EKLSGQRNGFNIKLGYENY APKS NTGARLLPVQN
Subjt:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN

Query:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ
        FSSSS  R LSTNWKNSEEEEFMWG+M+SMLTGHGAPAI +STGKDQWTPEDSDNSG  NK  S+RDTGASVDREASSDSQSSEQRE+GDSGQQRSS WQ
Subjt:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ

Query:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL
        +QESISLDGLR GVPRKN      YGA+LTALSG +SSVDQMGGRPQ+T SNIG SGHGFLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS  L
Subjt:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL

Query:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS
        VD HVPHQ+H+ KT+SFSN+DPRKR+ QDAAL L PSVRPDN QKP+  DL+A ASSIP S+P HQFSLSESLKPDVTQ E SSQ +VSIP  DFGP SS
Subjt:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS

Query:  DGNSVPD
         G +VPD
Subjt:  DGNSVPD

A0A5A7TQ23 Polyadenylation and cleavage factor-like protein 4-like isoform X12.6e-31081.75Show/hide
Query:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
        M+MESSRRPFDRTREPGLKKPRLADEA+RG NINGRPFPQRPVVSG+NIV QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI
Subjt:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIV-QPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPI

Query:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
        ITNLTIIAGENLQAAKAIS T+ ANILE                   VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH
Subjt:  ITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRH

Query:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA
        LFGTWKGVFP QTLQIIEKELGF+P+ SSSS  I SKPDLQA RP PHSIHVNPKYIERQRLQQSGRVKGM  DAT  +TNV+QDV QAKISTGRPWADA
Subjt:  LFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RP-PHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADA

Query:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN
        PIKVLD QRP+RDAPND+A          DYEYGSDLSR   +GRRVVDEGRDKPW +AGSN+ EKLSGQRNGFNIKLGYENY APKS NTGARLLPVQN
Subjt:  PIKVLDTQRPIRDAPNDIA----------DYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQN

Query:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ
        FSSSS  R LSTNWKNSEEEEFMWG+M+SMLTGHGAPAI +STGKDQWTPEDSDNSG  NK  S+RDTGASVDREASSDSQSSEQRE+GDSGQQRSS WQ
Subjt:  FSSSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQ

Query:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL
        +QESISLDGLR GVPRKN      YGA+LTALSG +SSVDQMGGRPQ+T SNIG SGHGFLNKGGSG +G VGHQRFPSRSVAFPSGQPPLHQR PS  L
Subjt:  MQESISLDGLRGGVPRKNL----AYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSL

Query:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS
        VD HVPHQ+H+ KT+SFSN+DPRKR+ QDAAL L PSVRPDN QKP+  DL+A ASSIP S+P HQFSLSESLKPDVTQ E SSQ +VSIP  DFGP SS
Subjt:  VDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSS

Query:  DGNSVPD
         G +VPD
Subjt:  DGNSVPD

A0A6J1CTT8 uncharacterized protein LOC1110142800.0e+0083.94Show/hide
Query:  EPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAA
        EPGLKKPRL DEAERGGNINGRPFPQRPVVSG+NIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAA
Subjt:  EPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAA

Query:  KAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQ
        KA++ATVCANI+E                   VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVF KAYRQVDPSVHPSMRHLFGTWKGVFPPQ LQ
Subjt:  KAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQ

Query:  IIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPN
        IIEKELGFMPSSSSSSGTI SKPDLQ  RPPHSIHVNPKYIERQRLQQSGRVKG+ +DAT  TTNVTQDV QAKISTGRPWADAPIKVLD QRP+RDAPN
Subjt:  IIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPN

Query:  DI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNF--SSSSRGLSTNWKN
        D+          ADYEYGSDLSR PGIGRRV+DEGRDKPWSAAGSNV EKLSGQRNGFN+K GYENYPAPKSANTGARLLP+QNF  SSSSRGLSTNWKN
Subjt:  DI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNF--SSSSRGLSTNWKN

Query:  SEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPR
        SEEEEFMWGEMNSMLTGHG P I +S GKDQW PEDSDNSG  NKP S+RD GASVDREASSDSQSSEQRE+GDSGQQRSS WQ+QESIS+DGLRGGVPR
Subjt:  SEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPR

Query:  KNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVDHHVPHQMHNHKTSS
        KNLA    YGA+LT LSGASSSVDQMGGR Q+TSSNIG SGHGFLNKGGSGS G +GHQRFPSR VAFP GQPPLHQRPPSPSLVDH VPHQMH+HKT S
Subjt:  KNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVDHHVPHQMHNHKTSS

Query:  FSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDGNSVPD
        FSN+DPRK++ QDAALNL P+VRPD+LQKP+PQDL A ASS+PAS+P HQFSLSESLKPDVTQPE SSQ +VS  V DFGP  S GNS+PD
Subjt:  FSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDGNSVPD

A0A6J1F7E8 uncharacterized protein LOC111442777 isoform X22.6e-30880.86Show/hide
Query:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPII
        M+MESSRRPFDRTREPGLKK RLADEAERGGNINGRPFPQRP+ SG+NIVQPRFRASDRDSGSSDSGRGGYQPQP QHQELVSQYRTALAELTFNSKPII
Subjt:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPII

Query:  TNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHL
        TNLTIIAGENLQAAKAISATVCANILE                   V SEQKLPSLYLLDSIVKNIGRDYIKYFAA+LPEVFCKAYRQVD  VH SMRHL
Subjt:  TNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHL

Query:  FGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPI
        FGTWKGVFPPQTLQ+IEKELGF+ +S SSSGTI+SKP+L + RPPHSIHVNPKYIERQRLQQSGRVKGMT+DAT+ATTNVTQDV QAKISTGRPWADA I
Subjt:  FGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPI

Query:  KVLDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFS
        K  D QRP+RDAPNDI          ADYEYGSDLSR PGIGRR VDEGRDKPWS  GSN+ EKLSGQRNGFNIKLGYENYPAP+SANTGARLLP QNFS
Subjt:  KVLDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFS

Query:  SSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQ
        SSS  RGLSTNWKNSEEEEFMWGEMNSMLTGHGA AI +S GKDQWTPEDSDNSG  NK  S+RDTG SVDREASSDSQSSEQRE+GDSGQQRSS+WQ+Q
Subjt:  SSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQ

Query:  ESISLDGLRGGVPRKNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVD
        E +SLDGLRGG+P+KN A    YGA+LTALSG +SSVDQMGGRPQ+TSSNIG SGH FLNKGGSGSIG VG Q FPSR+VAF SGQPPLHQRPPSP  VD
Subjt:  ESISLDGLRGGVPRKNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVD

Query:  HHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDG
         H+PHQM NHKTSSFSN+DPRKR+ QDA+L   P+V+ DNL+KP+PQD QA+AS IP S+P   FSLSESLKPDV Q E S QH+VSIP  DFGPPSS G
Subjt:  HHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDG

A0A6J1FCJ8 uncharacterized protein LOC111442777 isoform X15.3e-31081Show/hide
Query:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPII
        M+MESSRRPFDRTREPGLKK RLADEAERGGNINGRPFPQRP+ SG+NIVQPRFRASDRDSGSSDSGRGGYQPQP QHQELVSQYRTALAELTFNSKPII
Subjt:  MDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPII

Query:  TNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHL
        TNLTIIAGENLQAAKAISATVCANILE                   V SEQKLPSLYLLDSIVKNIGRDYIKYFAA+LPEVFCKAYRQVD  VH SMRHL
Subjt:  TNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHL

Query:  FGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPI
        FGTWKGVFPPQTLQ+IEKELGF+ +S SSSGTI+SKP+L + RPPHSIHVNPKYIERQRLQQSGRVKGMT+DAT+ATTNVTQDV QAKISTGRPWADA I
Subjt:  FGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQA-RPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPI

Query:  KVLDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFS
        KV D QRP+RDAPNDI          ADYEYGSDLSR PGIGRR VDEGRDKPWS  GSN+ EKLSGQRNGFNIKLGYENYPAP+SANTGARLLP QNFS
Subjt:  KVLDTQRPIRDAPNDI----------ADYEYGSDLSRNPGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFS

Query:  SSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQ
        SSS  RGLSTNWKNSEEEEFMWGEMNSMLTGHGA AI +S GKDQWTPEDSDNSG  NK  S+RDTG SVDREASSDSQSSEQRE+GDSGQQRSS+WQ+Q
Subjt:  SSS--RGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQ

Query:  ESISLDGLRGGVPRKNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVD
        E +SLDGLRGG+P+KN A    YGA+LTALSG +SSVDQMGGRPQ+TSSNIG SGH FLNKGGSGSIG VG Q FPSR+VAF SGQPPLHQRPPSP  VD
Subjt:  ESISLDGLRGGVPRKNLA----YGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSGSIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVD

Query:  HHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDG
         H+PHQM NHKTSSFSN+DPRKR+ QDA+L   P+V+ DNL+KP+PQD QA+AS IP S+P   FSLSESLKPDV Q E S QH+VSIP  DFGPPSS G
Subjt:  HHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDVTQPEPSSQHSVSIPVADFGPPSSDG

SwissProt top hitse value%identityAlignment
O94913 Pre-mRNA cleavage complex 2 protein Pcf117.4e-1828.71Show/hide
Query:  QELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARL
        ++    Y+++L +LTFNSKP I  LTI+A ENL  AK I                    +L++ ++   PS +KLP +YL+DSIVKN+GR+Y+  F   L
Subjt:  QELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARL

Query:  PEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQARPPH----SIHVNPKYIERQRLQQSGRVKGMTNDATV
           F   + +VD +   S+  L  TW  +FP + L  ++  +           ++     ++  PP+    SIHVNPK++ +   ++      + +  ++
Subjt:  PEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQARPPH----SIHVNPKYIERQRLQQSGRVKGMTNDATV

Query:  ATTNVTQDV
        +T  +  D+
Subjt:  ATTNVTQDV

P39081 Protein PCF112.6e-1029.15Show/hide
Query:  LVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPE
        +V  + + L ELTFNS+PIIT LT +A EN+  A+     +              E+ + KC  K     QKL + Y LDSI KN+G  Y  YF+  L  
Subjt:  LVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPE

Query:  VFCKAYRQVDPSVHPSMRHLFGTWKG-------VFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAR-PPHSIHVNPKYIERQRLQQSGRVKGMTND
        ++ + Y  VD +    + ++F  W         +F    L+ IE+   F+  +S+       + +LQA  P  ++ +  + I++     S R+K   ND
Subjt:  VFCKAYRQVDPSVHPSMRHLFGTWKG-------VFPPQTLQIIEKELGFMPSSSSSSGTITSKPDLQAR-PPHSIHVNPKYIERQRLQQSGRVKGMTND

Q0WPF2 Polyadenylation and cleavage factor homolog 44.0e-4835.88Show/hide
Query:  PQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENA
        PQ+P    S + + +   + R+    D   GG +  PP   E+V  Y   L ELTFNSKPIIT+LTIIAGE  +  + I+  +C  ILE           
Subjt:  PQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENA

Query:  LLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPD
                 P EQKLPSLYLLDSIVKNIGRDY +YF++RLPEVFC AYRQ  PS+HPSMRHLFGTW  VFPP  L+ I+ +L  + S+++ S    S+P 
Subjt:  LLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPD

Query:  LQARPPHSIHVNPKYIER-QRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPNDIADYEYGSDLSRNPGIGRRVVDE
          ++P   IHVNPKY+ R +       ++G+ + A V   N          +      ++P  +  T        ND A+    S+ + N G+GR    +
Subjt:  LQARPPHSIHVNPKYIER-QRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPNDIADYEYGSDLSRNPGIGRRVVDE

Query:  GRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSSSRGLSTNWKNSEEEEFMWGEMNSML
             W        E L    +    +   + Y    S +      P+++ +     + T W+N+EEEEF W +M+  L
Subjt:  GRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSSSRGLSTNWKNSEEEEFMWGEMNSML

Q10237 Uncharacterized protein C4G9.04c7.2e-1339.67Show/hide
Query:  YRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCK
        Y +AL +LTFNSKPII  LT IA EN   A +I   +  +I +C                   P   KLP+LYLLDSI KN+G  Y  +F   L   F  
Subjt:  YRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCK

Query:  AYRQVDPSVHPSMRHLFGTWK
        AY  V+P +   +  L  TWK
Subjt:  AYRQVDPSVHPSMRHLFGTWK

Arabidopsis top hitse value%identityAlignment
AT2G36480.1 ENTH/VHS family protein2.6e-6642.89Show/hide
Query:  VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQA-RPP
        VPS+QKLP+LYLLDSIVKNIGRDYIKYF ARLPEVF KAYRQVDP +H +MRHLFGTWKGVF PQTLQ+IEKELGF   S  S+  + T++ + Q+ RPP
Subjt:  VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQA-RPP

Query:  HSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQ----AKISTGRPWADAPIKVLDTQRPIRDAPND----------IADYEYGSDLSRN---
        HSIHVNPKY+ERQRLQQSGR KGM  D      N+T+D  +    + I++G  W   P KV + +RP RD  ++            +Y+Y SDL  N   
Subjt:  HSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQ----AKISTGRPWADAPIKVLDTQRPIRDAPND----------IADYEYGSDLSRN---

Query:  --PGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS--SRGLST---NWKNSEEEEFMWGEMNSMLTGHG
            +G R+ D+G +K W  A +   + +S QR+G + K    NY   +          V+N  SS  SR +     +WKNSEEEEFMW +M+S L+   
Subjt:  --PGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS--SRGLST---NWKNSEEEEFMWGEMNSMLTGHG

Query:  APAIGNST---GKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPRKNLAYGASLTALSGAS
           I         D+    +S+N       FS  D     D   S++S SSEQ++    G    S      + +  G++   P+  +A    L + SG+ 
Subjt:  APAIGNST---GKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPRKNLAYGASLTALSGAS

Query:  S
        S
Subjt:  S

AT2G36480.2 ENTH/VHS family protein2.6e-6642.89Show/hide
Query:  VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQA-RPP
        VPS+QKLP+LYLLDSIVKNIGRDYIKYF ARLPEVF KAYRQVDP +H +MRHLFGTWKGVF PQTLQ+IEKELGF   S  S+  + T++ + Q+ RPP
Subjt:  VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQA-RPP

Query:  HSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQ----AKISTGRPWADAPIKVLDTQRPIRDAPND----------IADYEYGSDLSRN---
        HSIHVNPKY+ERQRLQQSGR KGM  D      N+T+D  +    + I++G  W   P KV + +RP RD  ++            +Y+Y SDL  N   
Subjt:  HSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQ----AKISTGRPWADAPIKVLDTQRPIRDAPND----------IADYEYGSDLSRN---

Query:  --PGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS--SRGLST---NWKNSEEEEFMWGEMNSMLTGHG
            +G R+ D+G +K W  A +   + +S QR+G + K    NY   +          V+N  SS  SR +     +WKNSEEEEFMW +M+S L+   
Subjt:  --PGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS--SRGLST---NWKNSEEEEFMWGEMNSMLTGHG

Query:  APAIGNST---GKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPRKNLAYGASLTALSGAS
           I         D+    +S+N       FS  D     D   S++S SSEQ++    G    S      + +  G++   P+  +A    L + SG+ 
Subjt:  APAIGNST---GKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPRKNLAYGASLTALSGAS

Query:  S
        S
Subjt:  S

AT2G36480.3 ENTH/VHS family protein2.6e-6642.89Show/hide
Query:  VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQA-RPP
        VPS+QKLP+LYLLDSIVKNIGRDYIKYF ARLPEVF KAYRQVDP +H +MRHLFGTWKGVF PQTLQ+IEKELGF   S  S+  + T++ + Q+ RPP
Subjt:  VPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTI-TSKPDLQA-RPP

Query:  HSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQ----AKISTGRPWADAPIKVLDTQRPIRDAPND----------IADYEYGSDLSRN---
        HSIHVNPKY+ERQRLQQSGR KGM  D      N+T+D  +    + I++G  W   P KV + +RP RD  ++            +Y+Y SDL  N   
Subjt:  HSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQ----AKISTGRPWADAPIKVLDTQRPIRDAPND----------IADYEYGSDLSRN---

Query:  --PGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS--SRGLST---NWKNSEEEEFMWGEMNSMLTGHG
            +G R+ D+G +K W  A +   + +S QR+G + K    NY   +          V+N  SS  SR +     +WKNSEEEEFMW +M+S L+   
Subjt:  --PGIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSS--SRGLST---NWKNSEEEEFMWGEMNSMLTGHG

Query:  APAIGNST---GKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPRKNLAYGASLTALSGAS
           I         D+    +S+N       FS  D     D   S++S SSEQ++    G    S      + +  G++   P+  +A    L + SG+ 
Subjt:  APAIGNST---GKDQWTPEDSDNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPRKNLAYGASLTALSGAS

Query:  S
        S
Subjt:  S

AT2G36485.1 ENTH/VHS family protein4.3e-2957.04Show/hide
Query:  MESSRRPFDRTREPG-LKKPRLADEAERGGNINGRPF-PQRPVVSGSNIVQP----RFRASDRDSGS---SDSGRGGYQPQPPQ-HQELVSQYRTALAEL
        ME+ RRPFDR+R+PG +KKPRL++E+ R  N N R F  QR + + + +  P    RFR S R++ S   SD  R  YQPQP   H ELV+QY++ALAEL
Subjt:  MESSRRPFDRTREPG-LKKPRLADEAERGGNINGRPF-PQRPVVSGSNIVQP----RFRASDRDSGS---SDSGRGGYQPQPPQ-HQELVSQYRTALAEL

Query:  TFNSKPIITNLTIIAGENLQAAKAISATVCANILE
        TFNSKPIITNLTIIAGEN+ AAKA+   +C NILE
Subjt:  TFNSKPIITNLTIIAGENLQAAKAISATVCANILE

AT4G04885.1 PCF11P-similar protein 42.9e-4935.88Show/hide
Query:  PQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENA
        PQ+P    S + + +   + R+    D   GG +  PP   E+V  Y   L ELTFNSKPIIT+LTIIAGE  +  + I+  +C  ILE           
Subjt:  PQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAGENLQAAKAISATVCANILECRRKRGNTENA

Query:  LLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPD
                 P EQKLPSLYLLDSIVKNIGRDY +YF++RLPEVFC AYRQ  PS+HPSMRHLFGTW  VFPP  L+ I+ +L  + S+++ S    S+P 
Subjt:  LLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEKELGFMPSSSSSSGTITSKPD

Query:  LQARPPHSIHVNPKYIER-QRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPNDIADYEYGSDLSRNPGIGRRVVDE
          ++P   IHVNPKY+ R +       ++G+ + A V   N          +      ++P  +  T        ND A+    S+ + N G+GR    +
Subjt:  LQARPPHSIHVNPKYIER-QRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPNDIADYEYGSDLSRNPGIGRRVVDE

Query:  GRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSSSRGLSTNWKNSEEEEFMWGEMNSML
             W        E L    +    +   + Y    S +      P+++ +     + T W+N+EEEEF W +M+  L
Subjt:  GRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSSSRGLSTNWKNSEEEEFMWGEMNSML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAATGGACATGGAGAGCTCGCGGAGACCATTCGATCGAACAAGGGAACCGGGTTTGAAGAAGCCCCGACTGGCCGATGAGGCTGAACGCGGCGGGAACATCAATGG
CCGGCCATTTCCGCAGAGACCAGTTGTCTCGGGGAGCAATATTGTGCAACCCAGATTTCGAGCAAGTGATAGAGATTCGGGAAGCAGCGATTCTGGCCGAGGGGGTTATC
AGCCTCAGCCGCCGCAGCACCAGGAGCTTGTGAGCCAGTACAGGACAGCCCTTGCTGAGCTGACTTTCAATTCGAAACCCATCATCACTAATTTGACCATAATCGCGGGT
GAAAATCTCCAGGCTGCAAAAGCAATCTCCGCTACCGTTTGCGCCAACATTCTCGAGTGTAGAAGGAAAAGAGGAAATACAGAAAATGCTTTGTTGAAGTGCAGAAGTAA
AATTGTTCCGAGTGAGCAGAAGCTACCATCACTTTATCTATTGGACAGTATTGTAAAGAATATTGGAAGAGATTACATAAAATACTTTGCAGCAAGACTGCCTGAGGTAT
TCTGCAAAGCTTATAGGCAGGTTGACCCTTCTGTACATCCAAGTATGAGACATCTTTTTGGCACCTGGAAAGGAGTGTTTCCTCCTCAAACTCTGCAGATTATAGAGAAA
GAACTTGGCTTCATGCCCAGCAGTAGTTCTTCTTCTGGGACCATAACCTCGAAGCCTGATTTGCAGGCACGTCCACCCCATAGCATCCATGTAAATCCCAAGTATATAGA
GAGGCAACGGCTTCAGCAGTCTGGCAGGGTGAAAGGAATGACTAATGATGCCACAGTGGCAACAACAAATGTAACTCAGGATGTCGTCCAAGCCAAAATTAGTACTGGAC
GCCCATGGGCGGATGCTCCAATTAAAGTGCTTGACACTCAGCGTCCAATTAGAGATGCACCAAATGATATTGCAGACTATGAATATGGTTCTGATCTTTCAAGGAATCCA
GGTATCGGAAGAAGGGTCGTTGATGAAGGGCGAGACAAACCATGGTCTGCAGCTGGAAGCAATGTGGTTGAGAAGTTATCTGGCCAAAGAAATGGGTTCAACATCAAGCT
TGGGTATGAAAACTACCCTGCACCAAAGTCTGCAAACACTGGTGCACGTCTACTGCCCGTGCAAAATTTTTCAAGCAGCAGCAGAGGATTGTCTACTAACTGGAAGAACT
CAGAGGAAGAGGAGTTTATGTGGGGTGAAATGAACTCAATGTTGACAGGTCACGGTGCACCTGCCATCGGTAATAGCACCGGAAAAGATCAATGGACTCCTGAGGATTCG
GATAATTCGGGTACTGGAAATAAGCCATTCAGCATACGGGATACTGGGGCAAGTGTTGATAGAGAAGCTTCCAGTGATTCACAATCATCTGAACAGAGAGAAGTAGGGGA
TTCTGGACAGCAAAGGTCATCAATATGGCAAATGCAGGAGTCGATATCTTTGGATGGGCTGAGAGGTGGGGTTCCTAGAAAGAATTTAGCTTATGGTGCCAGTCTTACTG
CACTATCAGGTGCTAGCTCTTCTGTGGATCAAATGGGAGGTCGACCACAGGTCACATCATCTAATATTGGAACTTCAGGACATGGGTTTCTGAATAAAGGAGGTTCAGGA
TCCATTGGCGCTGTGGGCCATCAAAGATTTCCATCACGAAGTGTTGCATTTCCATCTGGACAGCCACCCTTACACCAACGTCCCCCTTCACCATCGTTAGTGGACCACCA
TGTTCCTCATCAAATGCACAACCATAAAACTTCTTCATTTTCTAATGTTGACCCACGGAAAAGGAATACTCAGGATGCTGCCCTTAACCTGCCTCCCAGTGTTCGGCCGG
ATAACCTTCAAAAACCAAAGCCTCAGGACCTGCAAGCTTCAGCTTCATCCATACCTGCTTCTAAACCCGGGCATCAGTTTTCTTTATCTGAGTCACTAAAACCTGATGTC
ACGCAGCCAGAACCTTCTAGTCAACATTCAGTGTCAATTCCAGTCGCCGATTTTGGACCACCCTCATCAGATGGGAATTCTGTTCCAGATGTTCACCTGCAGAAATTTTG
GCAGAGCCAAGCACTAGTAGTTTGTTGGCTGCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTAATGGACATGGAGAGCTCGCGGAGACCATTCGATCGAACAAGGGAACCGGGTTTGAAGAAGCCCCGACTGGCCGATGAGGCTGAACGCGGCGGGAACATCAATGG
CCGGCCATTTCCGCAGAGACCAGTTGTCTCGGGGAGCAATATTGTGCAACCCAGATTTCGAGCAAGTGATAGAGATTCGGGAAGCAGCGATTCTGGCCGAGGGGGTTATC
AGCCTCAGCCGCCGCAGCACCAGGAGCTTGTGAGCCAGTACAGGACAGCCCTTGCTGAGCTGACTTTCAATTCGAAACCCATCATCACTAATTTGACCATAATCGCGGGT
GAAAATCTCCAGGCTGCAAAAGCAATCTCCGCTACCGTTTGCGCCAACATTCTCGAGTGTAGAAGGAAAAGAGGAAATACAGAAAATGCTTTGTTGAAGTGCAGAAGTAA
AATTGTTCCGAGTGAGCAGAAGCTACCATCACTTTATCTATTGGACAGTATTGTAAAGAATATTGGAAGAGATTACATAAAATACTTTGCAGCAAGACTGCCTGAGGTAT
TCTGCAAAGCTTATAGGCAGGTTGACCCTTCTGTACATCCAAGTATGAGACATCTTTTTGGCACCTGGAAAGGAGTGTTTCCTCCTCAAACTCTGCAGATTATAGAGAAA
GAACTTGGCTTCATGCCCAGCAGTAGTTCTTCTTCTGGGACCATAACCTCGAAGCCTGATTTGCAGGCACGTCCACCCCATAGCATCCATGTAAATCCCAAGTATATAGA
GAGGCAACGGCTTCAGCAGTCTGGCAGGGTGAAAGGAATGACTAATGATGCCACAGTGGCAACAACAAATGTAACTCAGGATGTCGTCCAAGCCAAAATTAGTACTGGAC
GCCCATGGGCGGATGCTCCAATTAAAGTGCTTGACACTCAGCGTCCAATTAGAGATGCACCAAATGATATTGCAGACTATGAATATGGTTCTGATCTTTCAAGGAATCCA
GGTATCGGAAGAAGGGTCGTTGATGAAGGGCGAGACAAACCATGGTCTGCAGCTGGAAGCAATGTGGTTGAGAAGTTATCTGGCCAAAGAAATGGGTTCAACATCAAGCT
TGGGTATGAAAACTACCCTGCACCAAAGTCTGCAAACACTGGTGCACGTCTACTGCCCGTGCAAAATTTTTCAAGCAGCAGCAGAGGATTGTCTACTAACTGGAAGAACT
CAGAGGAAGAGGAGTTTATGTGGGGTGAAATGAACTCAATGTTGACAGGTCACGGTGCACCTGCCATCGGTAATAGCACCGGAAAAGATCAATGGACTCCTGAGGATTCG
GATAATTCGGGTACTGGAAATAAGCCATTCAGCATACGGGATACTGGGGCAAGTGTTGATAGAGAAGCTTCCAGTGATTCACAATCATCTGAACAGAGAGAAGTAGGGGA
TTCTGGACAGCAAAGGTCATCAATATGGCAAATGCAGGAGTCGATATCTTTGGATGGGCTGAGAGGTGGGGTTCCTAGAAAGAATTTAGCTTATGGTGCCAGTCTTACTG
CACTATCAGGTGCTAGCTCTTCTGTGGATCAAATGGGAGGTCGACCACAGGTCACATCATCTAATATTGGAACTTCAGGACATGGGTTTCTGAATAAAGGAGGTTCAGGA
TCCATTGGCGCTGTGGGCCATCAAAGATTTCCATCACGAAGTGTTGCATTTCCATCTGGACAGCCACCCTTACACCAACGTCCCCCTTCACCATCGTTAGTGGACCACCA
TGTTCCTCATCAAATGCACAACCATAAAACTTCTTCATTTTCTAATGTTGACCCACGGAAAAGGAATACTCAGGATGCTGCCCTTAACCTGCCTCCCAGTGTTCGGCCGG
ATAACCTTCAAAAACCAAAGCCTCAGGACCTGCAAGCTTCAGCTTCATCCATACCTGCTTCTAAACCCGGGCATCAGTTTTCTTTATCTGAGTCACTAAAACCTGATGTC
ACGCAGCCAGAACCTTCTAGTCAACATTCAGTGTCAATTCCAGTCGCCGATTTTGGACCACCCTCATCAGATGGGAATTCTGTTCCAGATGTTCACCTGCAGAAATTTTG
GCAGAGCCAAGCACTAGTAGTTTGTTGGCTGCTGTAA
Protein sequenceShow/hide protein sequence
MLMDMESSRRPFDRTREPGLKKPRLADEAERGGNINGRPFPQRPVVSGSNIVQPRFRASDRDSGSSDSGRGGYQPQPPQHQELVSQYRTALAELTFNSKPIITNLTIIAG
ENLQAAKAISATVCANILECRRKRGNTENALLKCRSKIVPSEQKLPSLYLLDSIVKNIGRDYIKYFAARLPEVFCKAYRQVDPSVHPSMRHLFGTWKGVFPPQTLQIIEK
ELGFMPSSSSSSGTITSKPDLQARPPHSIHVNPKYIERQRLQQSGRVKGMTNDATVATTNVTQDVVQAKISTGRPWADAPIKVLDTQRPIRDAPNDIADYEYGSDLSRNP
GIGRRVVDEGRDKPWSAAGSNVVEKLSGQRNGFNIKLGYENYPAPKSANTGARLLPVQNFSSSSRGLSTNWKNSEEEEFMWGEMNSMLTGHGAPAIGNSTGKDQWTPEDS
DNSGTGNKPFSIRDTGASVDREASSDSQSSEQREVGDSGQQRSSIWQMQESISLDGLRGGVPRKNLAYGASLTALSGASSSVDQMGGRPQVTSSNIGTSGHGFLNKGGSG
SIGAVGHQRFPSRSVAFPSGQPPLHQRPPSPSLVDHHVPHQMHNHKTSSFSNVDPRKRNTQDAALNLPPSVRPDNLQKPKPQDLQASASSIPASKPGHQFSLSESLKPDV
TQPEPSSQHSVSIPVADFGPPSSDGNSVPDVHLQKFWQSQALVVCWLL