; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0019451 (gene) of Chayote v1 genome

Gene IDSed0019451
OrganismSechium edule (Chayote v1)
Descriptionpolyadenylation and cleavage factor homolog 4-like isoform X2
Genome locationLG02:43295285..43303085
RNA-Seq ExpressionSed0019451
SyntenySed0019451
Gene Ontology termsGO:0006369 - termination of RNA polymerase II transcription (biological process)
GO:0006378 - mRNA polyadenylation (biological process)
GO:0006379 - mRNA cleavage (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0005849 - mRNA cleavage factor complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR006569 - CID domain
IPR008942 - ENTH/VHS
IPR013087 - Zinc finger C2H2-type
IPR045154 - Protein PCF11-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596545.1 Polyadenylation and cleavage factor-like 4, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.34Show/hide
Query:  MESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTFNS
        MESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQPQPLQHQELVSQYRTALAELTFNS
Subjt:  MESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTFNS

Query:  KPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQV
        KPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTLQV
Subjt:  KPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQV

Query:  IEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAPND
        IEKELGF+++  SSSGT +SKP+LQ+QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K+ DIQRPLRDAPND
Subjt:  IEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAPND

Query:  MAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWKNS
        +AQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEKLSGQRNGFN KLGYENYPAP+SANTGARLLP QNFSSSSSNR LS NWKNS
Subjt:  MAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWKNS

Query:  EEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRK
        EEEEFMWGE+N MLTGHGAS IASS GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +P+K
Subjt:  EEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRK

Query:  NSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSL
        NS QSGGY AT TALSG  +SV+Q+GGRPQI S N+G  GH  LNKGGSG IGTVG Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKTSS 
Subjt:  NSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSL

Query:  SNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGEPS
        SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILGE S
Subjt:  SNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGEPS

Query:  TSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPP
        TSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+K PLPS+SSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+  LPSDP  P
Subjt:  TSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPP

Query:  SSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSS
        SSPMNSASTETSNVVN +S+PISNLLSSLVAKGLISASKGELTNS TSQMP+QPENL                       K GDA+ CS PVPS+PVTSS
Subjt:  SSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSS

Query:  SQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWYPS
        SQSS  LES SK  AK STSP P  +TEI NLIGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSNRT  RWYPS
Subjt:  SQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWYPS

Query:  SDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSV
        SDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE+GST E+VARGPIVHT CITESS+
Subjt:  SDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSV

Query:  YDLGLATDIKMEMD
        +DLGLATDIKMEMD
Subjt:  YDLGLATDIKMEMD

KAG7028080.1 Polyadenylation and cleavage factor-like 4 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0080.39Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQPQPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+LQ+QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K+ DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK
        ND+AQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEKLSGQRNGFN KLGYENYPAP+SANTGARLLP QNFSSSSSNR LS NWK
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK

Query:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP
        NSEEEEFMWGE+N MLTGHGAS IASS GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +P
Subjt:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP

Query:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS
        +KNS QSGGY AT TALSG  +SV+Q+GGRPQI S N+G  GH  LNKGGSG IGTVG Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKTS
Subjt:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS

Query:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE
        S SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE   QH VSIPGTDF  PSSAG++P RLPA+ILGE
Subjt:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE

Query:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL
         STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+K PLPS+SSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP 
Subjt:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL

Query:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT
         PSSPMNSASTETSNVVN +S+PISNLLSSLVAKGLISASKGELTNS TSQMP+QPENL                       K GDA+ CS PVPS+PVT
Subjt:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT

Query:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY
        SSSQSS  LES SK  AK STSP P  +TEI NLIGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSNRT  RWY
Subjt:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY

Query:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES
        PSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE+GST E+VARGPIVHT CITES
Subjt:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES

Query:  SVYDLGLATDIKMEMDV
        S++DLGLATDIKMEMDV
Subjt:  SVYDLGLATDIKMEMDV

XP_022936065.1 uncharacterized protein LOC111442777 isoform X1 [Cucurbita moschata]0.0e+0080.39Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQPQPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+L +QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K+ DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK
        ND+AQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEKLSGQRNGFN KLGYENYPAP+SANTGARLLP QNFSSSSSNR LS NWK
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK

Query:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP
        NSEEEEFMWGE+N MLTGHGAS IASS GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +P
Subjt:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP

Query:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS
        +KNS QSGGY AT TALSG  +SV+Q+GGRPQI S N+G  GH  LNKGGSG IGTVG Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKTS
Subjt:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS

Query:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE
        S SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILGE
Subjt:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE

Query:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIKP-LPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL
         STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+KP LPSQSSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP 
Subjt:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIKP-LPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL

Query:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT
         PSSPMNSASTETSNVVN +S+PISNLLSSLVAKGLISASKGELTNS TSQM +QPENL                       K GDA+TCS PVPS+PVT
Subjt:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT

Query:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY
        SSSQSS  LES SK  AK STSP P  +TEI NLIGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSNR   RWY
Subjt:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY

Query:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES
        PSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE GST E+VARGPIVHT CITES
Subjt:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES

Query:  SVYDLGLATDIKMEMDV
        S++DLGLATDIKMEMDV
Subjt:  SVYDLGLATDIKMEMDV

XP_023539204.1 uncharacterized protein LOC111799917 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0080.39Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQPQPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+LQ+QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K+ DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK
        NDMAQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEKLSGQRNGFN KLGYENYPAP+SANTGARLLP QNFSSSSSNR LS NWK
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK

Query:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP
        NSEEEEFMWGE+N MLTGH AS IASS GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +P
Subjt:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP

Query:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS
        +KNS QSGGY  T TALSG  +SV+Q+GGRPQI S N+G  GH  LNKGGSG IGTVG Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKTS
Subjt:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS

Query:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE
        S SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILGE
Subjt:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE

Query:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL
         STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+K PLPS+SSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP 
Subjt:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL

Query:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT
         PSSPM SASTETSNVVN +S+PISNLLSSLVAKGLISASKGELTNS TSQMP+QPENL                       K GDA+TCS PVPS+P T
Subjt:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT

Query:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY
        SSSQSS  LES SK  AK STSP P  +TEI NLIGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSNRT  RWY
Subjt:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY

Query:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES
        PSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE+GSTNE+VARGPIVHT CITES
Subjt:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES

Query:  SVYDLGLATDIKMEMDV
        S++DLGLATDIKMEMDV
Subjt:  SVYDLGLATDIKMEMDV

XP_023539205.1 flocculation protein FLO11-like isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0080.39Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQPQPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+LQ+QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K  DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK
        NDMAQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEKLSGQRNGFN KLGYENYPAP+SANTGARLLP QNFSSSSSNR LS NWK
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK

Query:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP
        NSEEEEFMWGE+N MLTGH AS IASS GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +P
Subjt:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP

Query:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS
        +KNS QSGGY  T TALSG  +SV+Q+GGRPQI S N+G  GH  LNKGGSG IGTVG Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKTS
Subjt:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS

Query:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE
        S SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILGE
Subjt:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE

Query:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL
         STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+K PLPS+SSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP 
Subjt:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL

Query:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT
         PSSPM SASTETSNVVN +S+PISNLLSSLVAKGLISASKGELTNS TSQMP+QPENL                       K GDA+TCS PVPS+P T
Subjt:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT

Query:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY
        SSSQSS  LES SK  AK STSP P  +TEI NLIGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSNRT  RWY
Subjt:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY

Query:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES
        PSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE+GSTNE+VARGPIVHT CITES
Subjt:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES

Query:  SVYDLGLATDIKMEMDV
        S++DLGLATDIKMEMDV
Subjt:  SVYDLGLATDIKMEMDV

TrEMBL top hitse value%identityAlignment
A0A1S3B6K6 polyadenylation and cleavage factor homolog 4-like isoform X10.0e+0077.93Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNI-GQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELT
        MEMESSRRPFDR REPGLKKPRL DEA    G++INGRPFPQRPVVS  NI  QPRFR +DRDSGS DSGR    GGYQPQP QHQELVSQYRTALAELT
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNI-GQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELT

Query:  FNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQT
        FNSKPIITNLTIIAGEN QAAKAIS T+ ANI+EV S+QKLPSLYLLDSIVKNIGRDYIKYFA RLPEVFCKAYRQVDPSVH SMRHLFGTWKGVFP QT
Subjt:  FNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQT

Query:  LQVIEKELGFMSSGSSSSGTTTSKPDLQAQRP-PHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRD
        LQ+IEKELGF+ +GSSSS    SKPDLQAQRP PHSIHVNPKYIERQRLQQSGRVKGM  DAT  +T+V+QDVA+AKISTGRPWADAP+K+LDIQRPLRD
Subjt:  LQVIEKELGFMSSGSSSSGTTTSKPDLQAQRP-PHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRD

Query:  APNDMAQEKNITA-YADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSAN
        APNDMAQEKN+TA Y+DYEYGSDLSRT   GRRV DEGRD+ W  AG+NL+EKLSGQRNGFN KLGYENY APKS NTGARLLP+QNFSSSS+NRVLS N
Subjt:  APNDMAQEKNITA-YADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSAN

Query:  WKNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGR
        WKNSEEEEFMWG+++ MLTGHGA  I SSTGKDQWTPEDSDNSGI+ K LS+RDTG SVDRE+SSDSQSSEQREL DS QQRSS WQ+QE ISLDGLR  
Subjt:  WKNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGR

Query:  VPRKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHK
        VPRKNS QSGGY AT TALSG  +SV+Q+GGRPQI   N+G  GHG LNKGGSGP+G VGHQRFPSRSVA FPSGQP LHQR PS L VDHVPHQ+ + K
Subjt:  VPRKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHK

Query:  TSSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADIL
        T+S SNLDPRKRH+QDA++GLHPSVRPDN +K Q  DL+A AS +P SQPRHQFSLSES KPDV +SE SSQ  VSIPGTDF   SSAG++P+RLPA+IL
Subjt:  TSSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADIL

Query:  GEPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSD
        G PSTSSLLAAVMKSG+FSNHSIT++MQQN+SFQD  N  P S+IK PLP++SSPAH   TF EPK  GESS+GPPS+ES S +VKLS+TKVEE  LPSD
Subjt:  GEPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSD

Query:  PLPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVP
        PLPPSSPM+SASTETS+VVN ASSPISNLLSSLVAKGLISASKGE TNS+TSQMPSQPENL                       KSGDA+T S PVPS+ 
Subjt:  PLPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVP

Query:  VTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGR
        V+SS  SS +LESP K  AK STSP PS +TEINNLIGFEFSSHVIRKFHPSVI GLFDDIPYQCKICGLRLK EE+LDTH +WH++RTEANNS+    R
Subjt:  VTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGR

Query:  WYPSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCIT
        WYP SDDWISGN+R LLDA TSLD+ D+MEEDNEPMVPADEDQFACV+CGE FEDFYSQELG WM+KGA +ITIPS GSEVG TNE+VA+GPIVHT C+T
Subjt:  WYPSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCIT

Query:  ESSVYDLGLATDIKMEMDV
        ESSVYD+GLATDIKMEMDV
Subjt:  ESSVYDLGLATDIKMEMDV

A0A6J1F7E8 uncharacterized protein LOC111442777 isoform X20.0e+0080.39Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQPQPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+L +QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K  DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK
        ND+AQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEKLSGQRNGFN KLGYENYPAP+SANTGARLLP QNFSSSSSNR LS NWK
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK

Query:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP
        NSEEEEFMWGE+N MLTGHGAS IASS GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +P
Subjt:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP

Query:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS
        +KNS QSGGY AT TALSG  +SV+Q+GGRPQI S N+G  GH  LNKGGSG IGTVG Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKTS
Subjt:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS

Query:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE
        S SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILGE
Subjt:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE

Query:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIKP-LPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL
         STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+KP LPSQSSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP 
Subjt:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIKP-LPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL

Query:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT
         PSSPMNSASTETSNVVN +S+PISNLLSSLVAKGLISASKGELTNS TSQM +QPENL                       K GDA+TCS PVPS+PVT
Subjt:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT

Query:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY
        SSSQSS  LES SK  AK STSP P  +TEI NLIGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSNR   RWY
Subjt:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY

Query:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES
        PSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE GST E+VARGPIVHT CITES
Subjt:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES

Query:  SVYDLGLATDIKMEMDV
        S++DLGLATDIKMEMDV
Subjt:  SVYDLGLATDIKMEMDV

A0A6J1FCJ8 uncharacterized protein LOC111442777 isoform X10.0e+0080.39Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQPQPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+L +QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K+ DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK
        ND+AQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEKLSGQRNGFN KLGYENYPAP+SANTGARLLP QNFSSSSSNR LS NWK
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWK

Query:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP
        NSEEEEFMWGE+N MLTGHGAS IASS GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +P
Subjt:  NSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVP

Query:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS
        +KNS QSGGY AT TALSG  +SV+Q+GGRPQI S N+G  GH  LNKGGSG IGTVG Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKTS
Subjt:  RKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTS

Query:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE
        S SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILGE
Subjt:  SLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGE

Query:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIKP-LPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL
         STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+KP LPSQSSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP 
Subjt:  PSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIKP-LPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPL

Query:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT
         PSSPMNSASTETSNVVN +S+PISNLLSSLVAKGLISASKGELTNS TSQM +QPENL                       K GDA+TCS PVPS+PVT
Subjt:  PPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVT

Query:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY
        SSSQSS  LES SK  AK STSP P  +TEI NLIGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSNR   RWY
Subjt:  SSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWY

Query:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES
        PSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE GST E+VARGPIVHT CITES
Subjt:  PSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITES

Query:  SVYDLGLATDIKMEMDV
        S++DLGLATDIKMEMDV
Subjt:  SVYDLGLATDIKMEMDV

A0A6J1KTP6 flocculation protein FLO11-like isoform X10.0e+0079.61Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQ QPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+LQ+QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K+ DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNF-SSSSSNRVLSANW
        NDMAQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEK+SGQRNGFN KLGY+NYPAP+SANTGARLLP QNF SSSSSNR LS NW
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNF-SSSSSNRVLSANW

Query:  KNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRV
        KNSEEEEFMWGE+N MLTGHGAS IA+S GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +
Subjt:  KNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRV

Query:  PRKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKT
        P+KNS QSGGY AT TALSG  +SV+Q+GGR QI S N+G  GH  LNKGGSG IGT G Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKT
Subjt:  PRKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKT

Query:  SSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILG
        SS SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILG
Subjt:  SSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILG

Query:  EPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDP
        E STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+K PLPS+SSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP
Subjt:  EPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDP

Query:  LPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPV
         PPSSPMNSAST TSNVVN +S+PISNLLSSLVAKGLISASKGE+TNS TSQMP+QPENL                       K GDA+TCS PVPS+PV
Subjt:  LPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPV

Query:  TSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRW
        TSSSQSS  LES +K  AK STSP P  +TEI N+IGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSN+T  RW
Subjt:  TSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRW

Query:  YPSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE
        YPSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE+GSTNE+VARGPIVH  CITE
Subjt:  YPSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE

Query:  SSVYDLGLATDIKMEMDV
        S+++DLGLATDIKMEMDV
Subjt:  SSVYDLGLATDIKMEMDV

A0A6J1KZU2 flocculation protein FLO11-like isoform X20.0e+0079.61Show/hide
Query:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF
        MEMESSRRPFDR REPGLKK RL DE AERGG +INGRPFPQRP+ S TNI QPRFR +DRDSGS DSGR    GGYQ QPLQHQELVSQYRTALAELTF
Subjt:  MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTF

Query:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL
        NSKPIITNLTIIAGEN QAAKAISATVCANI+EV+S+QKLPSLYLLDSIVKNIGRDYIKYFA +LPEVFCKAYRQVD  VHTSMRHLFGTWKGVFPPQTL
Subjt:  NSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTL

Query:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP
        QVIEKELGF+++  SSSGT +SKP+LQ+QRPPHSIHVNPKYIERQRLQQSGRVKGMT DAT+ TT+VTQDVA+AKISTGRPWADA +K  DIQRPLRDAP
Subjt:  QVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAP

Query:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNF-SSSSSNRVLSANW
        NDMAQEKNIT AYADYEYGSDLSRT G GRR  DEGRD+ WS  G+NLAEK+SGQRNGFN KLGY+NYPAP+SANTGARLLP QNF SSSSSNR LS NW
Subjt:  NDMAQEKNIT-AYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNF-SSSSSNRVLSANW

Query:  KNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRV
        KNSEEEEFMWGE+N MLTGHGAS IA+S GKDQWTPEDSDNSGIE K LSLRDTGGSVDRE+SSDSQSSEQREL DS QQRSSMWQVQEP+SLDGLRG +
Subjt:  KNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRV

Query:  PRKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKT
        P+KNS QSGGY AT TALSG  +SV+Q+GGR QI S N+G  GH  LNKGGSG IGT G Q FPSR+VA F SGQP LHQRPPSPLSVDH+PHQMPNHKT
Subjt:  PRKNSVQSGGYSATFTALSGA-TSVNQIGGRPQIASPNMG--GHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKT

Query:  SSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILG
        SS SNLDPRKRH+QDAS+G HP+V+ DNLKK Q QD QA+AS +PTSQPR  FSLSES KPDVR+SE S QH VSIPGTDF  PSSAG++P RLPA+ILG
Subjt:  SSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILG

Query:  EPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDP
        E STSSLLAAVMKSGIFSNHSI +SMQQNISFQD  N  PHSN+K PLPS+SSPAHT++TF EPKT GESSLGP  LES SALVKLSQTKVE+ PLPSDP
Subjt:  EPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGEN--PHSNIK-PLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDP

Query:  LPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPV
         PPSSPMNSAST TSNVVN +S+PISNLLSSLVAKGLISASKGE+TNS TSQMP+QPENL                       K GDA+TCS PVPS+PV
Subjt:  LPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPV

Query:  TSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRW
        TSSSQSS  LES +K  AK STSP P  +TEI N+IGFEFSSHVIRKF PSVI GLFDDIPYQCKICGLRLKLEE+LDTH+QWH++RTEANNSN+T  RW
Subjt:  TSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRW

Query:  YPSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE
        YPSSDDWISGN  LL DAATS D+ D+MEE NEPMVPADED   CVLCGE FEDFYS +L KWMFKGA +ITIPSA SE+GSTNE+VARGPIVH  CITE
Subjt:  YPSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE

Query:  SSVYDLGLATDIKMEMDV
        S+++DLGLATDIKMEMDV
Subjt:  SSVYDLGLATDIKMEMDV

SwissProt top hitse value%identityAlignment
O94913 Pre-mRNA cleavage complex 2 protein Pcf112.1e-1936.25Show/hide
Query:  QELVSQYRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSM
        ++    Y+++L +LTFNSKP I  LTI+A EN   AK I + + A   +  S +KLP +YL+DSIVKN+GR+Y+  F   L   F   + +VD +   S+
Subjt:  QELVSQYRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSM

Query:  RHLFGTWKGVFPPQTLQVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIER
          L  TW  +FP + L  ++  +      +S       KP L       SIHVNPK++ +
Subjt:  RHLFGTWKGVFPPQTLQVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIER

Q0WPF2 Polyadenylation and cleavage factor homolog 45.5e-6026.31Show/hide
Query:  GGGYQPQPLQHQELVSQYRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAY
        GGG +  P    E+V  Y   L ELTFNSKPIIT+LTIIAGE ++  + I+  +C  I+E   +QKLPSLYLLDSIVKNIGRDY +YF+ RLPEVFC AY
Subjt:  GGGYQPQPLQHQELVSQYRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAY

Query:  RQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIER-QRLQQSGRVKGMTGDATVTTTHVTQDVA
        RQ  PS+H SMRHLFGTW  VFPP  L+ I+ +L  +SS ++ S    S+P     +P   IHVNPKY+ R +       ++G+   A V          
Subjt:  RQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIER-QRLQQSGRVKGMTGDATVTTTHVTQDVA

Query:  EAKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKS
        +  +     + D     L+    L   P+   +  N     D    S+ +   G GR    +     W    N       GQ N         +     +
Subjt:  EAKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKS

Query:  ANTGARLLPMQNFSSSSSNRVLSANWKNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQREL
        +       P+++ +   S  V    W+N+EEEEF W +++P L    A     S+      P       +  +P      G + D    SD ++    +L
Subjt:  ANTGARLLPMQNFSSSSSNRVLSANWKNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQREL

Query:  RDSEQQRSSMWQVQE--PISLDGLRGRVPRKNSVQSGGYSATFTALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQ
        R++       W + +  P + + +  R  +   V              A+SV  +    +  +P              P  ++  Q   SR   + P G 
Subjt:  RDSEQQRSSMWQVQE--PISLDGLRGRVPRKNSVQSGGYSATFTALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQ

Query:  -PTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGV
         P L  R P+ L V                            S  LH    P N   ++LQ         P  +P +Q S S      + +    +Q  V
Subjt:  -PTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGV

Query:  SIPGTDFVAPSSAGSIPNRLPADILGEPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSL
        +        PSS+   P  +           SLL  V  S  +  H  T  ++ ++S Q GE  H                     P + G         
Subjt:  SIPGTDFVAPSSAGSIPNRLPADILGEPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSL

Query:  ESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMP
                LSQ     +P                              S L+ SL+A+GLIS     L N    Q P                       
Subjt:  ESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMP

Query:  SQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEEL
                                                                  +G EF + +++  + S I  L+ D+P QC  CGLR K +EE 
Subjt:  SQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEEL

Query:  DTHVQWH--SIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSL---DKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFIT
          H+ WH    R   N+    S +W+ S+  W+SG   L  +A       +     ++D +  VPADEDQ +C LCGE FEDFYS E  +WM+KGA ++ 
Subjt:  DTHVQWH--SIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSL---DKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFIT

Query:  IPSAGSEVGSTNEEVARGPIVHTNCITESSVYDL
         P    E  +  ++   GPIVH  C  ES+  D+
Subjt:  IPSAGSEVGSTNEEVARGPIVHTNCITESSVYDL

Q10237 Uncharacterized protein C4G9.04c1.6e-1445.1Show/hide
Query:  YRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGT
        Y +AL +LTFNSKPII  LT IA EN+  A +I   +  +I +   + KLP+LYLLDSI KN+G  Y  +F   L   F  AY  V+P +   +  L  T
Subjt:  YRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGT

Query:  WK
        WK
Subjt:  WK

Q9C710 Polyadenylation and cleavage factor homolog 11.3e-1627.88Show/hide
Query:  TNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSS-HV
        + +LT   P +  N   +  V +++    Q P       S  + +   P+      +   S +  E   K +   ++  LP         +G  F +   
Subjt:  TNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSS-HV

Query:  IRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSLDKFDVM-------------EED
        +   H SVI  L+ D+P QC  CGLR K +EE   H+ WH  +  +  +    G+    S  W++  S L L AAT  +  +V              EE 
Subjt:  IRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSLDKFDVM-------------EED

Query:  NEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE
         + MVPADEDQ  C LC E FE+F+S E   WM+K A ++T                 G IVH  C+ E
Subjt:  NEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE

Q9FIX8 Polyadenylation and cleavage factor homolog 55.8e-1734.94Show/hide
Query:  HPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSLDKFDV-------MEEDNEP-------
        H SVI  L+ D+P QC  CG+R K +EE   H+ WH  +  +  +    G+    S  W++  S L L A T     +V       M++ NE        
Subjt:  HPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSLDKFDV-------MEEDNEP-------

Query:  MVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE
        MVPADEDQ  C LC E FE+F+S E   WM+K A ++T                 G IVH  C+ E
Subjt:  MVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITE

Arabidopsis top hitse value%identityAlignment
AT2G36480.1 ENTH/VHS family protein1.5e-11634.73Show/hide
Query:  IEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGF-MSSGSSSSGTTTSKPDLQAQR
        ++V SDQKLP+LYLLDSIVKNIGRDYIKYF  RLPEVF KAYRQVDP +H++MRHLFGTWKGVF PQTLQ+IEKELGF   S  S++  +T++ + Q+QR
Subjt:  IEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGF-MSSGSSSSGTTTSKPDLQAQR

Query:  PPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAE----AKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYA-DYEYGSDLSRTQ
        PPHSIHVNPKY+ERQRLQQSGR KGM  D   T  ++T+D       + I++G  W   P K+ +I+RP RD  ++   EK+I + A +Y+Y SDL    
Subjt:  PPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAE----AKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYA-DYEYGSDLSRTQ

Query:  GS-----GRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSA---NWKNSEEEEFMWGEVNPMLTG
         S     G R+ D+G ++ W  A N   + +S QR+G ++K    NY   +          ++N  SS  +R +     +WKNSEEEEFMW +++  L+ 
Subjt:  GS-----GRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSA---NWKNSEEEEFMWGEVNPMLTG

Query:  HGASTIASST---GKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRKNSVQSGGYSATFT
           +TI         D+    +S+N  ++    S  D     D  +S++S SSEQ+                +P S+                G+ A F+
Subjt:  HGASTIASST---GKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRKNSVQSGGYSATFT

Query:  ALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIG
        + +  ++  + G +PQ   P +   G+L   GSG                                        Q P H ++S  N+             
Subjt:  ALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIG

Query:  LHPSVRPDNLKKSQLQDLQASASFVPTSQ--PRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPN-RLPADILGEPSTSSLLAAVMKSGI
             + D  +   L      AS  P  Q  PR      +S +     S+  + +   +P   F + S+A + P   L ++  G+P+ S LL AVMKSGI
Subjt:  LHPSVRPDNLKKSQLQDLQASASFVPTSQ--PRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPN-RLPADILGEPSTSSLLAAVMKSGI

Query:  FSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVN
         SN+S   +++        E  H  + P       A T     +PKT       P SL + + L +L   KVE+   P      +S     S +TS   +
Subjt:  FSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVN

Query:  GASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAK
         AS P+S LLSSLV+KGLISASK EL ++ +      P++  +    +S +P  +Q                    PSV V   S +            K
Subjt:  GASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAK

Query:  CSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWH-SIRTEANNSNRTSGRWYPSSDDWISGNS-RLLLD
           +P  ++ +E  +LIG +F +  IR+ HPSVI  LFDD+P+ C  C +RLK +EELD H++ H   + E + +N     W+P  D+WI+  +  L  +
Subjt:  CSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWH-SIRTEANNSNRTSGRWYPSSDDWISGNS-RLLLD

Query:  AATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSVYDLGLATDIKMEM
            L + +   ED +  V ADE Q AC+LCGE FED++SQE+ +WMFKGA+++T P A SE        A GPIVHT C+T SS+  L +   IK E+
Subjt:  AATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSVYDLGLATDIKMEM

AT2G36480.2 ENTH/VHS family protein7.3e-11634.74Show/hide
Query:  IEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGF-MSSGSSSSGTTTSKPDLQAQR
        ++V SDQKLP+LYLLDSIVKNIGRDYIKYF  RLPEVF KAYRQVDP +H++MRHLFGTWKGVF PQTLQ+IEKELGF   S  S++  +T++ + Q+QR
Subjt:  IEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGF-MSSGSSSSGTTTSKPDLQAQR

Query:  PPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAE----AKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYA-DYEYGSDLSRTQ
        PPHSIHVNPKY+ERQRLQQSGR KGM  D   T  ++T+D       + I++G  W   P K+ +I+RP RD  ++   EK+I + A +Y+Y SDL    
Subjt:  PPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAE----AKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYA-DYEYGSDLSRTQ

Query:  GS-----GRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSA---NWKNSEEEEFMWGEVNPMLTG
         S     G R+ D+G ++ W  A N   + +S QR+G ++K    NY   +          ++N  SS  +R +     +WKNSEEEEFMW +++  L+ 
Subjt:  GS-----GRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSA---NWKNSEEEEFMWGEVNPMLTG

Query:  HGASTIASST---GKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRKNSVQSGGYSATFT
           +TI         D+    +S+N  ++    S  D     D  +S++S SSEQ+                +P S+                G+ A F+
Subjt:  HGASTIASST---GKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRKNSVQSGGYSATFT

Query:  ALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIG
        + +  ++  + G +PQ   P +   G+L   GSG                                        Q P H ++S  N+             
Subjt:  ALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIG

Query:  LHPSVRPDNLKKSQLQDLQASASFVPTSQ--PRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPN-RLPADILGEPSTSSLLAAVMKSGI
             + D  +   L      AS  P  Q  PR      +S +     S+  + +   +P   F + S+A + P   L ++  G+P+ S LL AVMKSGI
Subjt:  LHPSVRPDNLKKSQLQDLQASASFVPTSQ--PRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPN-RLPADILGEPSTSSLLAAVMKSGI

Query:  FSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVN
         SN+S   +++        E  H  + P       A T     +PKT       P SL + + L +L   KVE+   P      +S     S +TS   +
Subjt:  FSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVN

Query:  GASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAK
         AS P+S LLSSLV+KGLISASK EL ++ +      P++  +    +S +P  +Q                    PSV V   S +            K
Subjt:  GASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAK

Query:  CSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWH-SIRTEANNSNRTSGRWYPSSDDWISGNS-RLLLD
           +P  ++ +E  +LIG +F +  IR+ HPSVI  LFDD+P+ C  C +RLK +EELD H++ H   + E + +N     W+P  D+WI+  +  L  +
Subjt:  CSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWH-SIRTEANNSNRTSGRWYPSSDDWISGNS-RLLLD

Query:  AATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSVYDLGLATDIK
            L + +   ED +  V ADE Q AC+LCGE FED++SQE+ +WMFKGA+++T P A SE        A GPIVHT C+T SS+  L +   IK
Subjt:  AATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSVYDLGLATDIK

AT2G36480.3 ENTH/VHS family protein1.5e-11634.73Show/hide
Query:  IEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGF-MSSGSSSSGTTTSKPDLQAQR
        ++V SDQKLP+LYLLDSIVKNIGRDYIKYF  RLPEVF KAYRQVDP +H++MRHLFGTWKGVF PQTLQ+IEKELGF   S  S++  +T++ + Q+QR
Subjt:  IEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGF-MSSGSSSSGTTTSKPDLQAQR

Query:  PPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAE----AKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYA-DYEYGSDLSRTQ
        PPHSIHVNPKY+ERQRLQQSGR KGM  D   T  ++T+D       + I++G  W   P K+ +I+RP RD  ++   EK+I + A +Y+Y SDL    
Subjt:  PPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAE----AKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYA-DYEYGSDLSRTQ

Query:  GS-----GRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSA---NWKNSEEEEFMWGEVNPMLTG
         S     G R+ D+G ++ W  A N   + +S QR+G ++K    NY   +          ++N  SS  +R +     +WKNSEEEEFMW +++  L+ 
Subjt:  GS-----GRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSA---NWKNSEEEEFMWGEVNPMLTG

Query:  HGASTIASST---GKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRKNSVQSGGYSATFT
           +TI         D+    +S+N  ++    S  D     D  +S++S SSEQ+                +P S+                G+ A F+
Subjt:  HGASTIASST---GKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRKNSVQSGGYSATFT

Query:  ALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIG
        + +  ++  + G +PQ   P +   G+L   GSG                                        Q P H ++S  N+             
Subjt:  ALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIG

Query:  LHPSVRPDNLKKSQLQDLQASASFVPTSQ--PRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPN-RLPADILGEPSTSSLLAAVMKSGI
             + D  +   L      AS  P  Q  PR      +S +     S+  + +   +P   F + S+A + P   L ++  G+P+ S LL AVMKSGI
Subjt:  LHPSVRPDNLKKSQLQDLQASASFVPTSQ--PRHQFSLSESPKPDVRESEHSSQHGVSIPGTDFVAPSSAGSIPN-RLPADILGEPSTSSLLAAVMKSGI

Query:  FSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVN
         SN+S   +++        E  H  + P       A T     +PKT       P SL + + L +L   KVE+   P      +S     S +TS   +
Subjt:  FSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSLESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVN

Query:  GASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAK
         AS P+S LLSSLV+KGLISASK EL ++ +      P++  +    +S +P  +Q                    PSV V   S +            K
Subjt:  GASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAK

Query:  CSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWH-SIRTEANNSNRTSGRWYPSSDDWISGNS-RLLLD
           +P  ++ +E  +LIG +F +  IR+ HPSVI  LFDD+P+ C  C +RLK +EELD H++ H   + E + +N     W+P  D+WI+  +  L  +
Subjt:  CSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWH-SIRTEANNSNRTSGRWYPSSDDWISGNS-RLLLD

Query:  AATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSVYDLGLATDIKMEM
            L + +   ED +  V ADE Q AC+LCGE FED++SQE+ +WMFKGA+++T P A SE        A GPIVHT C+T SS+  L +   IK E+
Subjt:  AATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSVYDLGLATDIKMEM

AT2G36485.1 ENTH/VHS family protein7.7e-2551.68Show/hide
Query:  MESSRRPFDRAREPG-LKKPRLGDEAAERGGSSINGRPF-PQRPVVSATNIGQP----RFRPTDRDSGS---GDSGRGGGGGGYQPQPLQ-HQELVSQYR
        ME+ RRPFDR+R+PG +KKPRL +E+     S  N R F  QR + +AT +  P    RFR + R++ S    D  R      YQPQP+  H ELV+QY+
Subjt:  MESSRRPFDRAREPG-LKKPRLGDEAAERGGSSINGRPF-PQRPVVSATNIGQP----RFRPTDRDSGS---GDSGRGGGGGGYQPQPLQ-HQELVSQYR

Query:  TALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLP
        +ALAELTFNSKPIITNLTIIAGEN  AAKA+   +C NI+EV +    P
Subjt:  TALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLP

AT4G04885.1 PCF11P-similar protein 43.9e-6126.31Show/hide
Query:  GGGYQPQPLQHQELVSQYRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAY
        GGG +  P    E+V  Y   L ELTFNSKPIIT+LTIIAGE ++  + I+  +C  I+E   +QKLPSLYLLDSIVKNIGRDY +YF+ RLPEVFC AY
Subjt:  GGGYQPQPLQHQELVSQYRTALAELTFNSKPIITNLTIIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAY

Query:  RQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIER-QRLQQSGRVKGMTGDATVTTTHVTQDVA
        RQ  PS+H SMRHLFGTW  VFPP  L+ I+ +L  +SS ++ S    S+P     +P   IHVNPKY+ R +       ++G+   A V          
Subjt:  RQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGFMSSGSSSSGTTTSKPDLQAQRPPHSIHVNPKYIER-QRLQQSGRVKGMTGDATVTTTHVTQDVA

Query:  EAKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKS
        +  +     + D     L+    L   P+   +  N     D    S+ +   G GR    +     W    N       GQ N         +     +
Subjt:  EAKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYADYEYGSDLSRTQGSGRRVFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKS

Query:  ANTGARLLPMQNFSSSSSNRVLSANWKNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQREL
        +       P+++ +   S  V    W+N+EEEEF W +++P L    A     S+      P       +  +P      G + D    SD ++    +L
Subjt:  ANTGARLLPMQNFSSSSSNRVLSANWKNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNSGIETKPLSLRDTGGSVDRESSSDSQSSEQREL

Query:  RDSEQQRSSMWQVQE--PISLDGLRGRVPRKNSVQSGGYSATFTALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQ
        R++       W + +  P + + +  R  +   V              A+SV  +    +  +P              P  ++  Q   SR   + P G 
Subjt:  RDSEQQRSSMWQVQE--PISLDGLRGRVPRKNSVQSGGYSATFTALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPIGTVGHQRFPSRSVASFPSGQ

Query:  -PTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGV
         P L  R P+ L V                            S  LH    P N   ++LQ         P  +P +Q S S      + +    +Q  V
Subjt:  -PTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRESEHSSQHGV

Query:  SIPGTDFVAPSSAGSIPNRLPADILGEPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSL
        +        PSS+   P  +           SLL  V  S  +  H  T  ++ ++S Q GE  H                     P + G         
Subjt:  SIPGTDFVAPSSAGSIPNRLPADILGEPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSL

Query:  ESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMP
                LSQ     +P                              S L+ SL+A+GLIS     L N    Q P                       
Subjt:  ESLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMP

Query:  SQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEEL
                                                                  +G EF + +++  + S I  L+ D+P QC  CGLR K +EE 
Subjt:  SQAEKFKSGDAMTCSTPVPSVPVTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEEL

Query:  DTHVQWH--SIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSL---DKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFIT
          H+ WH    R   N+    S +W+ S+  W+SG   L  +A       +     ++D +  VPADEDQ +C LCGE FEDFYS E  +WM+KGA ++ 
Subjt:  DTHVQWH--SIRTEANNSNRTSGRWYPSSDDWISGNSRLLLDAATSL---DKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFIT

Query:  IPSAGSEVGSTNEEVARGPIVHTNCITESSVYDL
         P    E  +  ++   GPIVH  C  ES+  D+
Subjt:  IPSAGSEVGSTNEEVARGPIVHTNCITESSVYDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATGGAGAGCTCGCGGAGACCTTTCGATCGAGCCAGGGAACCCGGTTTGAAGAAACCCCGATTGGGCGACGAGGCAGCCGAGCGCGGTGGGAGTAGTATTAATGG
CCGGCCGTTTCCGCAGAGACCAGTTGTTTCTGCCACCAATATTGGGCAACCCAGATTTCGACCAACTGATAGAGATTCGGGAAGTGGCGATTCGGGTCGAGGAGGAGGAG
GAGGTGGGTATCAGCCTCAACCGCTGCAGCATCAGGAGCTTGTCAGCCAGTACAGGACAGCCCTGGCTGAGCTGACTTTCAATTCCAAACCAATCATCACCAATTTGACA
ATAATCGCGGGGGAAAATCAACAGGCTGCGAAAGCCATCTCCGCCACCGTTTGCGCCAACATTATCGAGGTTGCTAGTGATCAAAAGCTTCCATCACTTTATCTTCTGGA
CAGTATTGTGAAGAATATTGGAAGAGATTACATAAAATATTTTGCACCAAGACTGCCTGAGGTATTCTGCAAAGCTTATAGGCAAGTTGACCCTTCTGTACATACAAGTA
TGAGACATCTTTTTGGCACCTGGAAAGGAGTGTTCCCTCCTCAAACTCTGCAGGTCATTGAGAAAGAACTTGGCTTCATGTCCAGTGGCAGTTCTTCTTCTGGGACCACA
ACCTCGAAGCCGGATTTGCAGGCCCAACGTCCACCCCATAGTATCCATGTAAATCCCAAGTATATAGAAAGGCAACGGCTTCAGCAGTCAGGCCGGGTGAAGGGAATGAC
CGGGGATGCTACTGTCACAACAACACATGTAACTCAGGATGTTGCTGAAGCCAAAATTAGCACTGGACGTCCATGGGCAGATGCTCCAGTAAAAATGCTTGACATTCAGC
GTCCACTTAGAGATGCACCAAATGATATGGCACAAGAGAAGAACATCACAGCATATGCAGATTATGAATATGGTTCTGATCTTTCGAGGACTCAAGGTTCCGGAAGAAGG
GTTTTCGATGAAGGGCGAGACAGATCTTGGTCTTTGGCTGGAAACAATTTGGCAGAGAAGTTATCTGGCCAAAGAAATGGGTTCAACACTAAGCTTGGATATGAAAATTA
CCCTGCACCCAAGTCTGCGAACACTGGTGCACGTCTACTACCCATGCAAAATTTTTCAAGTAGCAGTAGCAACCGAGTACTTTCTGCTAATTGGAAGAACTCTGAGGAAG
AGGAGTTTATGTGGGGTGAAGTGAACCCTATGTTGACAGGTCATGGTGCATCTACCATTGCAAGTAGCACTGGAAAAGATCAATGGACTCCTGAAGATTCAGATAATTCG
GGTATTGAAACTAAGCCATTAAGCCTACGGGATACAGGGGGAAGTGTTGATAGAGAATCTTCCAGTGATTCACAATCATCTGAACAGAGAGAACTACGGGATTCTGAACA
GCAAAGGTCATCAATGTGGCAAGTGCAGGAGCCGATATCTCTTGATGGGCTCAGAGGCAGGGTTCCTAGAAAGAATTCAGTTCAGTCTGGAGGTTATAGTGCTACATTTA
CTGCACTGTCAGGTGCCACTTCTGTGAATCAAATAGGAGGTCGACCACAAATCGCATCACCTAATATGGGAGGTCATGGGCTTCTGAATAAAGGAGGTTCAGGGCCCATT
GGGACTGTGGGCCATCAAAGATTTCCATCACGAAGTGTTGCATCATTCCCATCTGGACAGCCAACCTTGCACCAACGTCCCCCCTCACCATTGTCAGTGGATCATGTTCC
TCATCAAATGCCCAACCATAAAACTTCTTCATTGTCTAATCTTGACCCACGTAAAAGGCACATACAGGATGCTTCCATTGGCCTACATCCCAGCGTCCGACCAGATAACC
TTAAAAAATCACAGCTTCAGGACCTTCAAGCTTCAGCTTCATTTGTACCAACTTCTCAACCCAGGCACCAGTTCTCTTTATCCGAGTCACCAAAACCTGACGTCAGGGAA
TCTGAACATTCTAGCCAGCATGGCGTATCAATACCGGGCACCGATTTTGTAGCTCCCTCATCAGCTGGGTCAATTCCAAATCGTTTACCTGCAGATATTTTGGGAGAGCC
AAGCACTAGTAGTTTGTTGGCTGCTGTAATGAAGAGTGGAATTTTCTCCAACCATTCAATAACCAATAGCATGCAGCAGAATATCAGCTTCCAAGATGGGGAAAATCCCC
ATTCAAACATCAAACCTCTACCAAGCCAGTCTTCTCCTGCCCATACTCGGTCTACATTCTTCGAGCCAAAGACTGGGGGAGAATCTTCATTAGGTCCTCCATCTCTTGAA
AGCTTATCAGCTCTTGTTAAGCTATCTCAGACTAAGGTTGAAGAGAAACCGTTGCCTTCTGATCCACTTCCACCTTCATCTCCTATGAATAGTGCATCCACTGAAACTTC
AAATGTGGTAAACGGTGCTTCTAGTCCAATTTCTAACCTTTTGAGCTCATTGGTTGCGAAGGGCCTCATATCTGCTTCAAAAGGGGAATTAACAAATAGCCTGACATCCC
AGATGCCTTCACAGCCCGAAAATTTGAAGTCAGATGATGCTGTGATTTCTTCAATTCCAGTGACATCTCAGATGCCTTCACAGGCTGAAAAATTTAAGTCAGGTGATGCT
ATGACTTGTTCTACACCAGTTCCTTCCGTCCCTGTTACTTCTTCCAGTCAATCATCTATTAGACTCGAATCACCTTCGAAAAATGTTGCTAAGTGCTCAACTAGTCCACT
TCCATCCACCTCAACTGAAATAAACAACCTCATAGGCTTTGAGTTTAGTTCACATGTTATTCGCAAATTTCATCCATCTGTTATCGGTGGACTCTTTGACGATATTCCAT
ATCAATGTAAGATCTGTGGTCTGAGACTGAAACTAGAAGAAGAGTTGGATACACACGTACAGTGGCACTCGATAAGAACTGAGGCAAACAATTCAAATAGGACATCAGGA
AGATGGTATCCAAGTTCAGATGATTGGATTTCTGGAAATTCAAGACTCTTACTTGATGCTGCCACTTCTCTGGACAAGTTCGACGTGATGGAAGAAGATAACGAGCCAAT
GGTTCCTGCAGATGAAGATCAATTTGCCTGTGTTTTGTGTGGTGAATTTTTTGAAGATTTTTATAGTCAAGAATTGGGCAAGTGGATGTTCAAAGGAGCAGCGTTTATCA
CCATTCCATCAGCGGGTAGTGAGGTAGGAAGCACAAATGAAGAAGTTGCTAGAGGACCCATTGTGCACACAAATTGTATAACTGAAAGTTCAGTATATGACTTGGGACTG
GCAACTGATATTAAGATGGAAATGGATGTATGA
mRNA sequenceShow/hide mRNA sequence
CAACGAAAAAAAAAAACAGACAGCTGAAACTGGGGCAAAGCATTTTCGCAATCTCTCTATATAACGAAGCCGAGTCTGAATCTCAACCAAAAACCTCATCCCGGTTTCTA
GCCTTCTTCTTCCCAATTTTAGCCGTTTTTCATTTGTATTCATTGTATAAATTCTTCTTCTTTCTCTGTAATTTCGTACGATTAGGGTTTTGTTTTTTCTTCTTTCTCGT
TCAATTTCAGTGTTTCAGTTGGGCTTCTTCTTGTAATTTCAATGTTTCTTTCGCGCATATTAGTCGTTGTTAGGATCAAATCGAAGGTCATTGAGATCTTCTCTGCGTGT
TTTCTTTGATTTATTTTCGCCTTTTGTACCCATGATGTAATTTTAGAGAATTTAGATTTATGATTAGGGTTTTGGTTGATTTCGTTTGCTGATGGAGATGGAGAGCTCGC
GGAGACCTTTCGATCGAGCCAGGGAACCCGGTTTGAAGAAACCCCGATTGGGCGACGAGGCAGCCGAGCGCGGTGGGAGTAGTATTAATGGCCGGCCGTTTCCGCAGAGA
CCAGTTGTTTCTGCCACCAATATTGGGCAACCCAGATTTCGACCAACTGATAGAGATTCGGGAAGTGGCGATTCGGGTCGAGGAGGAGGAGGAGGTGGGTATCAGCCTCA
ACCGCTGCAGCATCAGGAGCTTGTCAGCCAGTACAGGACAGCCCTGGCTGAGCTGACTTTCAATTCCAAACCAATCATCACCAATTTGACAATAATCGCGGGGGAAAATC
AACAGGCTGCGAAAGCCATCTCCGCCACCGTTTGCGCCAACATTATCGAGGTTGCTAGTGATCAAAAGCTTCCATCACTTTATCTTCTGGACAGTATTGTGAAGAATATT
GGAAGAGATTACATAAAATATTTTGCACCAAGACTGCCTGAGGTATTCTGCAAAGCTTATAGGCAAGTTGACCCTTCTGTACATACAAGTATGAGACATCTTTTTGGCAC
CTGGAAAGGAGTGTTCCCTCCTCAAACTCTGCAGGTCATTGAGAAAGAACTTGGCTTCATGTCCAGTGGCAGTTCTTCTTCTGGGACCACAACCTCGAAGCCGGATTTGC
AGGCCCAACGTCCACCCCATAGTATCCATGTAAATCCCAAGTATATAGAAAGGCAACGGCTTCAGCAGTCAGGCCGGGTGAAGGGAATGACCGGGGATGCTACTGTCACA
ACAACACATGTAACTCAGGATGTTGCTGAAGCCAAAATTAGCACTGGACGTCCATGGGCAGATGCTCCAGTAAAAATGCTTGACATTCAGCGTCCACTTAGAGATGCACC
AAATGATATGGCACAAGAGAAGAACATCACAGCATATGCAGATTATGAATATGGTTCTGATCTTTCGAGGACTCAAGGTTCCGGAAGAAGGGTTTTCGATGAAGGGCGAG
ACAGATCTTGGTCTTTGGCTGGAAACAATTTGGCAGAGAAGTTATCTGGCCAAAGAAATGGGTTCAACACTAAGCTTGGATATGAAAATTACCCTGCACCCAAGTCTGCG
AACACTGGTGCACGTCTACTACCCATGCAAAATTTTTCAAGTAGCAGTAGCAACCGAGTACTTTCTGCTAATTGGAAGAACTCTGAGGAAGAGGAGTTTATGTGGGGTGA
AGTGAACCCTATGTTGACAGGTCATGGTGCATCTACCATTGCAAGTAGCACTGGAAAAGATCAATGGACTCCTGAAGATTCAGATAATTCGGGTATTGAAACTAAGCCAT
TAAGCCTACGGGATACAGGGGGAAGTGTTGATAGAGAATCTTCCAGTGATTCACAATCATCTGAACAGAGAGAACTACGGGATTCTGAACAGCAAAGGTCATCAATGTGG
CAAGTGCAGGAGCCGATATCTCTTGATGGGCTCAGAGGCAGGGTTCCTAGAAAGAATTCAGTTCAGTCTGGAGGTTATAGTGCTACATTTACTGCACTGTCAGGTGCCAC
TTCTGTGAATCAAATAGGAGGTCGACCACAAATCGCATCACCTAATATGGGAGGTCATGGGCTTCTGAATAAAGGAGGTTCAGGGCCCATTGGGACTGTGGGCCATCAAA
GATTTCCATCACGAAGTGTTGCATCATTCCCATCTGGACAGCCAACCTTGCACCAACGTCCCCCCTCACCATTGTCAGTGGATCATGTTCCTCATCAAATGCCCAACCAT
AAAACTTCTTCATTGTCTAATCTTGACCCACGTAAAAGGCACATACAGGATGCTTCCATTGGCCTACATCCCAGCGTCCGACCAGATAACCTTAAAAAATCACAGCTTCA
GGACCTTCAAGCTTCAGCTTCATTTGTACCAACTTCTCAACCCAGGCACCAGTTCTCTTTATCCGAGTCACCAAAACCTGACGTCAGGGAATCTGAACATTCTAGCCAGC
ATGGCGTATCAATACCGGGCACCGATTTTGTAGCTCCCTCATCAGCTGGGTCAATTCCAAATCGTTTACCTGCAGATATTTTGGGAGAGCCAAGCACTAGTAGTTTGTTG
GCTGCTGTAATGAAGAGTGGAATTTTCTCCAACCATTCAATAACCAATAGCATGCAGCAGAATATCAGCTTCCAAGATGGGGAAAATCCCCATTCAAACATCAAACCTCT
ACCAAGCCAGTCTTCTCCTGCCCATACTCGGTCTACATTCTTCGAGCCAAAGACTGGGGGAGAATCTTCATTAGGTCCTCCATCTCTTGAAAGCTTATCAGCTCTTGTTA
AGCTATCTCAGACTAAGGTTGAAGAGAAACCGTTGCCTTCTGATCCACTTCCACCTTCATCTCCTATGAATAGTGCATCCACTGAAACTTCAAATGTGGTAAACGGTGCT
TCTAGTCCAATTTCTAACCTTTTGAGCTCATTGGTTGCGAAGGGCCTCATATCTGCTTCAAAAGGGGAATTAACAAATAGCCTGACATCCCAGATGCCTTCACAGCCCGA
AAATTTGAAGTCAGATGATGCTGTGATTTCTTCAATTCCAGTGACATCTCAGATGCCTTCACAGGCTGAAAAATTTAAGTCAGGTGATGCTATGACTTGTTCTACACCAG
TTCCTTCCGTCCCTGTTACTTCTTCCAGTCAATCATCTATTAGACTCGAATCACCTTCGAAAAATGTTGCTAAGTGCTCAACTAGTCCACTTCCATCCACCTCAACTGAA
ATAAACAACCTCATAGGCTTTGAGTTTAGTTCACATGTTATTCGCAAATTTCATCCATCTGTTATCGGTGGACTCTTTGACGATATTCCATATCAATGTAAGATCTGTGG
TCTGAGACTGAAACTAGAAGAAGAGTTGGATACACACGTACAGTGGCACTCGATAAGAACTGAGGCAAACAATTCAAATAGGACATCAGGAAGATGGTATCCAAGTTCAG
ATGATTGGATTTCTGGAAATTCAAGACTCTTACTTGATGCTGCCACTTCTCTGGACAAGTTCGACGTGATGGAAGAAGATAACGAGCCAATGGTTCCTGCAGATGAAGAT
CAATTTGCCTGTGTTTTGTGTGGTGAATTTTTTGAAGATTTTTATAGTCAAGAATTGGGCAAGTGGATGTTCAAAGGAGCAGCGTTTATCACCATTCCATCAGCGGGTAG
TGAGGTAGGAAGCACAAATGAAGAAGTTGCTAGAGGACCCATTGTGCACACAAATTGTATAACTGAAAGTTCAGTATATGACTTGGGACTGGCAACTGATATTAAGATGG
AAATGGATGTATGATGCTTCCCCTACGAAGCGTCACATAGGAACTACTCTGGTGGACGTCTTGCTCGTGCAATCGGTGGGGCAATACAAAAAGGGAGTATGAACGAGTAC
TTTGCTCGTGGTAGTTCTGTTGAGTGCCAAAAGAGATAGTAGGGGAGTGGTAGCTCTATACTGGTATTTTCTAGCTTCCAAATTCCTTTTTGTTTTCTTTCCGTTTTTTT
CCCTTATAAAAAAGATATATATAGTTGAGAATACATGAATGCAATGATCATTTTTATTCGGATCTTCACCGTCCTGTTTCTCAGTGAATTTTCACTTAACTTTCAATATC
TATTGATATAAATTTGAAGCAGTTGCAGTGCACTCCCAAGTCCAGGCTTGGCGTCTCTATCGACATCAAAGTCTATGTATTATTGAAATCATTTCAGCAAGATGGTTTTG
ATTCACCTCAACATCACCTTTTTATCTTGTAATTTTGCCTTCACTTAGTTTCTGCATATCATGTTAGACAGTAATGGGGAGTTATTTAAGATTCTCTCAGCTTTTTATAA
GGGTTTTTTTTTATTACTTTCACATGTCTTGCTGGTCTAGATATGACTATGATGATGTTGGATTTTGCTTCTTTTTGCCTCTTATCATTTGCCCCCACCTTGTTTGGACA
AGCTTGGCATGGCAGATATTGTGATATATTATGGCACATTTTGTTACAATAATTGTATTCAACAATCATGAACTGTGGAATTATGAAAACATTTCTGTTTGGATGGCGC
Protein sequenceShow/hide protein sequence
MEMESSRRPFDRAREPGLKKPRLGDEAAERGGSSINGRPFPQRPVVSATNIGQPRFRPTDRDSGSGDSGRGGGGGGYQPQPLQHQELVSQYRTALAELTFNSKPIITNLT
IIAGENQQAAKAISATVCANIIEVASDQKLPSLYLLDSIVKNIGRDYIKYFAPRLPEVFCKAYRQVDPSVHTSMRHLFGTWKGVFPPQTLQVIEKELGFMSSGSSSSGTT
TSKPDLQAQRPPHSIHVNPKYIERQRLQQSGRVKGMTGDATVTTTHVTQDVAEAKISTGRPWADAPVKMLDIQRPLRDAPNDMAQEKNITAYADYEYGSDLSRTQGSGRR
VFDEGRDRSWSLAGNNLAEKLSGQRNGFNTKLGYENYPAPKSANTGARLLPMQNFSSSSSNRVLSANWKNSEEEEFMWGEVNPMLTGHGASTIASSTGKDQWTPEDSDNS
GIETKPLSLRDTGGSVDRESSSDSQSSEQRELRDSEQQRSSMWQVQEPISLDGLRGRVPRKNSVQSGGYSATFTALSGATSVNQIGGRPQIASPNMGGHGLLNKGGSGPI
GTVGHQRFPSRSVASFPSGQPTLHQRPPSPLSVDHVPHQMPNHKTSSLSNLDPRKRHIQDASIGLHPSVRPDNLKKSQLQDLQASASFVPTSQPRHQFSLSESPKPDVRE
SEHSSQHGVSIPGTDFVAPSSAGSIPNRLPADILGEPSTSSLLAAVMKSGIFSNHSITNSMQQNISFQDGENPHSNIKPLPSQSSPAHTRSTFFEPKTGGESSLGPPSLE
SLSALVKLSQTKVEEKPLPSDPLPPSSPMNSASTETSNVVNGASSPISNLLSSLVAKGLISASKGELTNSLTSQMPSQPENLKSDDAVISSIPVTSQMPSQAEKFKSGDA
MTCSTPVPSVPVTSSSQSSIRLESPSKNVAKCSTSPLPSTSTEINNLIGFEFSSHVIRKFHPSVIGGLFDDIPYQCKICGLRLKLEEELDTHVQWHSIRTEANNSNRTSG
RWYPSSDDWISGNSRLLLDAATSLDKFDVMEEDNEPMVPADEDQFACVLCGEFFEDFYSQELGKWMFKGAAFITIPSAGSEVGSTNEEVARGPIVHTNCITESSVYDLGL
ATDIKMEMDV