CuGenDBv2

Cucsat.G3593 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G3593
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Protein PAF1-like protein
Genome location: ctg1047:1477733..1483729
RNA-Seq Expression: Cucsat.G3593
Synteny: Cucsat.G3593
Gene Ontology terms:
GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0016570 - histone modification (biological process)
GO:0016593 - Cdc73/Paf1 complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domains: IPR007133 - RNA polymerase II associated factor Paf1


Homology
GenBank top hits (e value, % identity, alignment)
KAA0051046.1 protein PAF1-like protein [Cucumis melo var. makuwa] (e value: 0.0; identity: 95.39%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
        MASYRPYPPQSSFGSAPAQNSIPPP +QSAS SSQQRGGATTQYNQNWG YAGDAS P APSSSYPQNYNNQLHQTSNYHHQQYG PRTQHPPPPPP  H
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH

Query:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP
        QSYPYA QPPPPPPPDSSYPPPPPPPA SQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEG NMGAHERDKG  
Subjt:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP

Query:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
        KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
Subjt:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP

Query:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
        FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+PLAP
Subjt:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP

Query:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN
        EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIE SFEACKSRPIHATN
Subjt:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN

Query:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
        KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
Subjt:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE

Query:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN
        YHWDVRGDNVDDPTTYLVSFDD+EARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGR HKHDR+
Subjt:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN

Query:  -QDMDQFSGAEDEMSD
         QDMDQ+SGAEDEMSD
Subjt:  -QDMDQFSGAEDEMSD

XP_004141783.2 LOW QUALITY PROTEIN: protein PAF1 homolog [Cucumis sativus] (e value: 0.0; identity: 97.76%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
        MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAP APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH

Query:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP
        QSYPYA QPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSS PPPPPPNSPPPPSASQQKAEGTNMGAHERDKG P
Subjt:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP

Query:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
        KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
Subjt:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP

Query:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
        FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
Subjt:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP

Query:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN
        EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIE SFEACKSRPIHATN
Subjt:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN

Query:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
        KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMAT SDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
Subjt:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE

Query:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN
        YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDR+
Subjt:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN

Query:  QDMDQFSGAEDEMSD
        QDMDQFSGAEDEMSD
Subjt:  QDMDQFSGAEDEMSD

XP_008462415.1 PREDICTED: LOW QUALITY PROTEIN: protein PAF1 homolog [Cucumis melo] (e value: 0.0; identity: 95.25%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
        MASYRPYPPQSSFGSAPAQNSIPPP +QSAS SSQQRGGATTQYNQNWG YAGDAS P APSSSYPQNYNNQLHQTSNYHHQQYG PRTQHPPPPPP  H
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH

Query:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP
        QSYPYA QPPPPPPPDSSYPPPPPPPA SQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEG NMGAHERDKG  
Subjt:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP

Query:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
        KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
Subjt:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP

Query:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
        FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+PLAP
Subjt:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP

Query:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN
        EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIE SFEACKSRPIHATN
Subjt:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN

Query:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
        KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
Subjt:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE

Query:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN
        YHWDVRGDNVDDPTTYLVSFDD+EARYVPLPTKLVL KKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGR HKHDR+
Subjt:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN

Query:  -QDMDQFSGAEDEMSD
         QDMDQ+SGAEDEMSD
Subjt:  -QDMDQFSGAEDEMSD

XP_022953373.1 protein PAF1 homolog [Cucurbita moschata] (e value: 0.0; identity: 89.55%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQ-SASVSSQQRGGATTQYNQNWGTYAGDASAPS-APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPP
        MASYRPYPPQSSFG +P QN IPPP A  +ASV +QQRGG  +QYNQNWG Y GD S P  A SSSYPQNYN Q+HQ+SNYH Q YGPPR+Q PPPPPPP
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQ-SASVSSQQRGGATTQYNQNWGTYAGDASAPS-APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPP

Query:  PHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPP-SSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDK
         HQSYPYA QPPPPPPPDSSYPPPPPPPA+SQP   Y+P SQY QGNQNQQS+QPPPPP SSPPPSSSIPPPPPPNSPPPPSA QQK EG+++GAHERDK
Subjt:  PHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPP-SSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDK

Query:  GAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERK
        G  KDPSYGRR+RENSNHDKHQ+HSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGERK
Subjt:  GAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERK

Query:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMP
        ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+P
Subjt:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMP

Query:  LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIH
        LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ SFEACKSRP+H
Subjt:  LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIH

Query:  ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
        ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
Subjt:  ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW

Query:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKH
        VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPG+YSN KRGSDIEDG+GRSHKH
Subjt:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKH

Query:  DRNQDMDQFSGAEDEMSD
        DR+QDMDQ+SGAED+MSD
Subjt:  DRNQDMDQFSGAEDEMSD

XP_038898523.1 protein PAF1 homolog [Benincasa hispida] (e value: 0.0; identity: 91.05%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
        MASYRPYPPQSSFG AP QN +PPP  QSASV +QQRGG + QYNQNWG Y GD S P A SSSYPQNYN Q HQ+SNYH Q YGPPR+QHPPPPPP  +
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH

Query:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP
        QSYPYA QPPPPP PDSSYPPPPPPPA SQPPNLYYP SQ         SMQPPPPPSSPPPSSSIPPPPPPNSPPP SA QQKAEGTNMGAHERDKG  
Subjt:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP

Query:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
        KDPSYGRRDRENSNHDKHQ+HSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGERKATP
Subjt:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP

Query:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
        FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+PLAP
Subjt:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP

Query:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN
        EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIE SFEACKSRP+HATN
Subjt:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN

Query:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
        KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD HESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
Subjt:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE

Query:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN
        YHWDVRGDNVDDPTTYLVSFDD EARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG+YSNSKRGSDIEDG+GRSHKHDR+
Subjt:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN

Query:  QDMDQFSGAEDEMSD
        QDMDQ+SGAEDEMSD
Subjt:  QDMDQFSGAEDEMSD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCT6 Uncharacterized protein (e value: 0.0; identity: 97.9%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
        MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAP APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH

Query:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP
        QSYPYA QPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKG P
Subjt:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP

Query:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
        KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
Subjt:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP

Query:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
        FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
Subjt:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP

Query:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN
        EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIE SFEACKSRPIHATN
Subjt:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN

Query:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
        KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMAT SDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
Subjt:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE

Query:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN
        YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDR+
Subjt:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN

Query:  QDMDQFSGAEDEMSD
        QDMDQFSGAEDEMSD
Subjt:  QDMDQFSGAEDEMSD

A0A1S3CHF3 LOW QUALITY PROTEIN: protein PAF1 homolog (e value: 0.0; identity: 95.25%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
        MASYRPYPPQSSFGSAPAQNSIPPP +QSAS SSQQRGGATTQYNQNWG YAGDAS P APSSSYPQNYNNQLHQTSNYHHQQYG PRTQHPPPPPP  H
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH

Query:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP
        QSYPYA QPPPPPPPDSSYPPPPPPPA SQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEG NMGAHERDKG  
Subjt:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP

Query:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
        KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
Subjt:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP

Query:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
        FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+PLAP
Subjt:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP

Query:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN
        EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIE SFEACKSRPIHATN
Subjt:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN

Query:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
        KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
Subjt:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE

Query:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN
        YHWDVRGDNVDDPTTYLVSFDD+EARYVPLPTKLVL KKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGR HKHDR+
Subjt:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN

Query:  -QDMDQFSGAEDEMSD
         QDMDQ+SGAEDEMSD
Subjt:  -QDMDQFSGAEDEMSD

A0A5A7UA23 Protein PAF1-like protein (e value: 0.0; identity: 95.39%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH
        MASYRPYPPQSSFGSAPAQNSIPPP +QSAS SSQQRGGATTQYNQNWG YAGDAS P APSSSYPQNYNNQLHQTSNYHHQQYG PRTQHPPPPPP  H
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPH

Query:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP
        QSYPYA QPPPPPPPDSSYPPPPPPPA SQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEG NMGAHERDKG  
Subjt:  QSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAP

Query:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
        KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP
Subjt:  KDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATP

Query:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP
        FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+PLAP
Subjt:  FLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAP

Query:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN
        EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIE SFEACKSRPIHATN
Subjt:  EDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATN

Query:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
        KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
Subjt:  KNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE

Query:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN
        YHWDVRGDNVDDPTTYLVSFDD+EARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGR HKHDR+
Subjt:  YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRN

Query:  -QDMDQFSGAEDEMSD
         QDMDQ+SGAEDEMSD
Subjt:  -QDMDQFSGAEDEMSD

A0A6J1GN64 protein PAF1 homolog (e value: 0.0; identity: 89.55%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQ-SASVSSQQRGGATTQYNQNWGTYAGDASAPS-APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPP
        MASYRPYPPQSSFG +P QN IPPP A  +ASV +QQRGG  +QYNQNWG Y GD S P  A SSSYPQNYN Q+HQ+SNYH Q YGPPR+Q PPPPPPP
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQ-SASVSSQQRGGATTQYNQNWGTYAGDASAPS-APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPP

Query:  PHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPP-SSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDK
         HQSYPYA QPPPPPPPDSSYPPPPPPPA+SQP   Y+P SQY QGNQNQQS+QPPPPP SSPPPSSSIPPPPPPNSPPPPSA QQK EG+++GAHERDK
Subjt:  PHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPP-SSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDK

Query:  GAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERK
        G  KDPSYGRR+RENSNHDKHQ+HSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGERK
Subjt:  GAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERK

Query:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMP
        ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+P
Subjt:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMP

Query:  LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIH
        LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ SFEACKSRP+H
Subjt:  LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIH

Query:  ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
        ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
Subjt:  ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW

Query:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKH
        VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPG+YSN KRGSDIEDG+GRSHKH
Subjt:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKH

Query:  DRNQDMDQFSGAEDEMSD
        DR+QDMDQ+SGAED+MSD
Subjt:  DRNQDMDQFSGAEDEMSD

A0A6J1JP14 protein PAF1 homolog (e value: 0.0; identity: 89.42%)
Query:  MASYRPYPPQSSFGSAPAQNSIPPPSAQ-SASVSSQQRGGATTQYNQNWGTYAGDASAPS-APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPP
        MASYRPYPPQSSFG +P QN IPPP A  +ASV +QQRG  ++QYNQNWG Y GD S P  A SSSYPQNYN Q+HQ+SNYH Q YGPPR+Q PPPPPPP
Subjt:  MASYRPYPPQSSFGSAPAQNSIPPPSAQ-SASVSSQQRGGATTQYNQNWGTYAGDASAPS-APSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPP

Query:  PHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPP-SSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDK
         HQSYPYA QPPPPPPPDSSYPPPPPPPA+SQP   Y+P SQY QGNQNQQS+QPPPPP SSPPPSSSIPPPPPPNSPPPPSA Q K EG+++GAHERDK
Subjt:  PHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPP-SSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDK

Query:  GAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERK
        G  KDPSYGRR+RENSNHDKHQ+HSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGERK
Subjt:  GAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERK

Query:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMP
        ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDH         YTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVR+P
Subjt:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMP

Query:  LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIH
        LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ SFEACKSRP+H
Subjt:  LAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIH

Query:  ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
        ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
Subjt:  ATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW

Query:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKH
        VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG+YSN KRGSDIEDG+GRSHKH
Subjt:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKH

Query:  DRNQDMDQFSGAEDEMSD
        DR+QDMDQ+SGAED+MSD
Subjt:  DRNQDMDQFSGAEDEMSD

SwissProt top hits (e value, % identity, alignment)
F4HQA1 Protein PAF1 homolog (e value: 2.4e-173; identity: 58.58%)
Query:  YGPPRTQHPPPP------PPPPHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPP
        Y PP    P PP      PPPP  S P    PPPP     SYPPPPPP     PP+ YY     Y Q NQ    +Q PPPP  PPPS+     PPP  P 
Subjt:  YGPPRTQHPPPP------PPPPHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPP

Query:  PPSASQQKAEGTNMGAHERDKGAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKT
        PP          + G ++ +KGA K    GRR+R   +  KH   S  P         S ++ET++E+RLRKKRE EKQRQDE+HR  +K S  + + K 
Subjt:  PPSASQQKAEGTNMGAHERDKGAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKT

Query:  QMLSTGKVHGSIVGSRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPD
                          E+K TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+++++KD          +T+YTITSLEK +KP+++VEPD
Subjt:  QMLSTGKVHGSIVGSRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPD

Query:  LGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNN
        LGIPLDLLDLSVYNP  V+ PLAPEDEELLRDD   TP+KKD GI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNN
Subjt:  LGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNN

Query:  RERQIKEIETSFEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMV
        RERQIK+IE SFEACKSRP+HATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD + PEKFLAYMV
Subjt:  RERQIKEIETSFEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMV

Query:  PSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG
        PS DELSKDI+DE E++SY+WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRSSDE+EHFP P+RVTVRRR TV+ +E KD G
Subjt:  PSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG

Query:  IY-------SNSKRGSDIEDGIGRSHKHDRNQDMDQFS-GAEDEMSD
        +Y       S+  R  + E G+GRS KH+  QD +Q+S G ED+ S+
Subjt:  IY-------SNSKRGSDIEDGIGRSHKHDRNQDMDQFS-GAEDEMSD

Q4U0S5 RNA polymerase II-associated factor 1 homolog (e value: 7.5e-26; identity: 30.26%)
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK
        +C++K+ N LPD    PK          I Y F    + +Y  TSLEK +K +L  EPDLG+ +DL++   Y      + L P DE+LL ++     ++ 
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK

Query:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP
            KR ++   K V W+ KT+YI   S E  +  ++ ++     E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEVLP
Subjt:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV--------SYSWVREYH
        + PDF  + +P   V FDS P     +          A     +M   M  G    +  +F+AY +P+ D + K   D +E++         Y   REY+
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV--------SYSWVREYH

Query:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSS
        W+V+   +      Y   F DA+  Y   L T++ L K+RAK G  S
Subjt:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSS

Q4V886 RNA polymerase II-associated factor 1 homolog (e value: 3.7e-25; identity: 28.16%)
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK
        +C++K+ N LPD    PK          I Y F    + +Y  TSLEK +K  L  EPDLG+ +DL++   Y      + L P DE+LL +++ + P   
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK

Query:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP
             ++ +   K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P
Subjt:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYH
        + PDF  + +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYH

Query:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSH
        W+V+   +      Y   F + +  Y   L T++ L K+RAK G  S       V+H     +    +    A LE  +P      +  ++ ++  G   
Subjt:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSH

Query:  KHDRNQDMDQFSGAEDEMS
        +H++    ++  G+EDE S
Subjt:  KHDRNQDMDQFSGAEDEMS

Q5RAX0 RNA polymerase II-associated factor 1 homolog (e value: 3.1e-24; identity: 27.92%)
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK
        +C++K+ N LPD    PK          I Y F    + +Y  TSLEK +K  L  EPDLG+ +DL++   Y      + L P DE+LL +++ + P   
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK

Query:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP
             ++ +   K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P
Subjt:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYH
        + PDF  + +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYH

Query:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSH
        W+V+   +      Y   F + +  Y   L T++ L K+RAK G  S       V+H     +    +    A LE  +P      +  ++ ++  G   
Subjt:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSH

Query:  KHDRNQDMDQFSGAEDEMS
        + ++    ++  G+EDE S
Subjt:  KHDRNQDMDQFSGAEDEMS

Q8N7H5 RNA polymerase II-associated factor 1 homolog (e value: 4.1e-24; identity: 27.92%)
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK
        +C++K+ N LPD    PK          I Y F    + +Y  TSLEK +K  L  EPDLG+ +DL++   Y      + L P DE+LL +++ + P   
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKK

Query:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP
             ++ +   K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P
Subjt:  DGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIETSFE-ACKSRPIHATNKNLYPVEVLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYH
        + PDF  + +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYH

Query:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSH
        W+V+   +      Y   F + +  Y   L T++ L K+RAK G  S       V+H     +    +    A LE  +P      +  ++ ++  G   
Subjt:  WDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSH

Query:  KHDRNQDMDQFSGAEDEMS
        + ++    ++  G+EDE S
Subjt:  KHDRNQDMDQFSGAEDEMS

Arabidopsis top hits | e value | % identity | Alignment
AT1G79730.1 hydroxyproline-rich glycoprotein family protein | 1.7e-174 | 58.58
Query:  YGPPRTQHPPPP------PPPPHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPP
        Y PP    P PP      PPPP  S P    PPPP     SYPPPPPP     PP+ YY     Y Q NQ    +Q PPPP  PPPS+     PPP  P 
Subjt:  YGPPRTQHPPPP------PPPPHQSYPYASQPPPPPPPDSSYPPPPPPPATSQPPNLYYPSS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPP

Query:  PPSASQQKAEGTNMGAHERDKGAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKT
        PP          + G ++ +KGA K    GRR+R   +  KH   S  P         S ++ET++E+RLRKKRE EKQRQDE+HR  +K S  + + K 
Subjt:  PPSASQQKAEGTNMGAHERDKGAPKDPSYGRRDRENSNHDKHQKHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKT

Query:  QMLSTGKVHGSIVGSRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPD
                          E+K TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+++++KD          +T+YTITSLEK +KP+++VEPD
Subjt:  QMLSTGKVHGSIVGSRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPD

Query:  LGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNN
        LGIPLDLLDLSVYNP  V+ PLAPEDEELLRDD   TP+KKD GI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNN
Subjt:  LGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNN

Query:  RERQIKEIETSFEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMV
        RERQIK+IE SFEACKSRP+HATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD + PEKFLAYMV
Subjt:  RERQIKEIETSFEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPSKPEKFLAYMV

Query:  PSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG
        PS DELSKDI+DE E++SY+WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRSSDE+EHFP P+RVTVRRR TV+ +E KD G
Subjt:  PSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG

Query:  IY-------SNSKRGSDIEDGIGRSHKHDRNQDMDQFS-GAEDEMSD
        +Y       S+  R  + E G+GRS KH+  QD +Q+S G ED+ S+
Subjt:  IY-------SNSKRGSDIEDGIGRSHKHDRNQDMDQFS-GAEDEMSD


Sequences
CDS sequence
ATGGCTTCTTACCGGCCATATCCTCCTCAATCGTCTTTCGGTTCTGCTCCAGCTCAAAACTCTATTCCACCTCCATCTGCCCAATCCGCTTCCGTCTCATCCCAACAGCG
AGGAGGAGCTACTACTCAATATAATCAGAATTGGGGTACTTACGCCGGTGATGCCTCTGCACCTTCTGCTCCTTCTTCATCCTATCCCCAAAATTACAACAACCAACTTC
ATCAAACTTCTAATTACCACCATCAACAGTATGGTCCTCCCAGAACCCAACACCCTCCACCTCCTCCTCCTCCTCCTCACCAGTCCTATCCTTATGCATCTCAACCACCG
CCCCCTCCTCCTCCTGATTCTTCCTATCCACCGCCTCCACCCCCACCAGCGACTTCACAGCCTCCTAATCTTTACTATCCTTCTTCACAGTATTCCCAGGGTAATCAAAA
CCAGCAGTCAATGCAGCCACCACCTCCGCCCTCATCTCCACCACCGAGTTCTTCAATCCCCCCACCTCCACCCCCAAATTCTCCCCCACCTCCATCAGCCTCTCAGCAAA
AAGCAGAGGGTACAAACATGGGAGCACACGAACGCGATAAAGGAGCTCCCAAGGATCCCTCATATGGCAGGCGTGATCGTGAAAATTCAAATCATGATAAACACCAGAAG
CATTCTGGTCCTCCAATGCCTCCCAAGAAAGCAAACGGTCCTTCAGGGAGAATGGAGACAGATGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCA
AGATGAGAGGCACAGACACCATCTAAAAGAATCCCAAAACACTATACTGCAAAAGACCCAGATGTTATCTACTGGGAAGGTGCATGGATCAATTGTAGGATCCCGAATGG
GGGAAAGGAAGGCTACTCCTTTTCTTAGTGGGGAGAGAATAGAAAATAGGTTGAAGAAGCCAACCACATTTTTGTGCAAGTTGAAATTCCGCAACGAGCTTCCAGATACA
AGTGCTCAGCCGAAGCTCATGTCACTACGGAAAGAGAAAGATCACATCTTCTATTTTTTTGGATGTTGCAGCTATACAAGATATACAATTACATCACTAGAGAAAACCTA
CAAACCTCAGCTTTATGTAGAGCCAGATCTTGGAATACCTCTCGATTTGCTTGACCTCAGTGTATACAACCCTTCTAGTGTTAGAATGCCCCTTGCTCCTGAAGATGAGG
AATTATTACGTGATGATGTATTGAAAACTCCAGTTAAAAAAGATGGTGGTATAAAAAGAAAAGAGCGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTAC
ATTTCTCCTCTCAGCATTGAATCTGCGAAGCAGTCTTTGACTGAAAAACAGGCAAAAGAACTGCGAGAAATGAAGGGAGGACGCAATATTCTTGAGAACCTCAATAATAG
GGAAAGACAAATTAAGGAAATTGAGACATCATTTGAGGCATGCAAGTCACGCCCTATTCACGCCACTAATAAGAATTTATATCCTGTTGAGGTTTTACCTCTTCTACCTG
ATTTTGATAGGTATGATGATCCATTTGTTGTGGTGGCGTTTGATAGTGCTCCCACAGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGTGACGCTCACGAA
TCACAGGCGATAATGAAAAGTTATATGGCAACAGGCTCAGATCCCTCAAAACCTGAGAAATTTCTTGCGTACATGGTTCCCTCTCCAGATGAGCTTTCGAAGGATATTTA
CGATGAACAAGAAGATGTCTCTTATTCCTGGGTTCGTGAGTACCATTGGGATGTACGGGGTGACAATGTGGACGACCCCACTACATATCTCGTTTCATTTGATGATGCAG
AAGCTCGTTATGTGCCACTTCCCACAAAGCTTGTTCTTAGAAAAAAGAGGGCTAAAGAAGGGAGATCCAGTGATGAGGTCGAACATTTTCCAGCACCTGCAAGAGTGACT
GTAAGGAGAAGACCAACCGTAGCTACTTTGGAAGTGAAGGATCCTGGGATTTACTCAAACTCAAAAAGAGGATCAGATATTGAAGATGGTATAGGAAGATCGCATAAACA
TGATAGAAATCAAGACATGGATCAATTCAGTGGAGCTGAAGACGAGATGTCTGATTGA
mRNA sequence
ATGGCTTCTTACCGGCCATATCCTCCTCAATCGTCTTTCGGTTCTGCTCCAGCTCAAAACTCTATTCCACCTCCATCTGCCCAATCCGCTTCCGTCTCATCCCAACAGCG
AGGAGGAGCTACTACTCAATATAATCAGAATTGGGGTACTTACGCCGGTGATGCCTCTGCACCTTCTGCTCCTTCTTCATCCTATCCCCAAAATTACAACAACCAACTTC
ATCAAACTTCTAATTACCACCATCAACAGTATGGTCCTCCCAGAACCCAACACCCTCCACCTCCTCCTCCTCCTCCTCACCAGTCCTATCCTTATGCATCTCAACCACCG
CCCCCTCCTCCTCCTGATTCTTCCTATCCACCGCCTCCACCCCCACCAGCGACTTCACAGCCTCCTAATCTTTACTATCCTTCTTCACAGTATTCCCAGGGTAATCAAAA
CCAGCAGTCAATGCAGCCACCACCTCCGCCCTCATCTCCACCACCGAGTTCTTCAATCCCCCCACCTCCACCCCCAAATTCTCCCCCACCTCCATCAGCCTCTCAGCAAA
AAGCAGAGGGTACAAACATGGGAGCACACGAACGCGATAAAGGAGCTCCCAAGGATCCCTCATATGGCAGGCGTGATCGTGAAAATTCAAATCATGATAAACACCAGAAG
CATTCTGGTCCTCCAATGCCTCCCAAGAAAGCAAACGGTCCTTCAGGGAGAATGGAGACAGATGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCA
AGATGAGAGGCACAGACACCATCTAAAAGAATCCCAAAACACTATACTGCAAAAGACCCAGATGTTATCTACTGGGAAGGTGCATGGATCAATTGTAGGATCCCGAATGG
GGGAAAGGAAGGCTACTCCTTTTCTTAGTGGGGAGAGAATAGAAAATAGGTTGAAGAAGCCAACCACATTTTTGTGCAAGTTGAAATTCCGCAACGAGCTTCCAGATACA
AGTGCTCAGCCGAAGCTCATGTCACTACGGAAAGAGAAAGATCACATCTTCTATTTTTTTGGATGTTGCAGCTATACAAGATATACAATTACATCACTAGAGAAAACCTA
CAAACCTCAGCTTTATGTAGAGCCAGATCTTGGAATACCTCTCGATTTGCTTGACCTCAGTGTATACAACCCTTCTAGTGTTAGAATGCCCCTTGCTCCTGAAGATGAGG
AATTATTACGTGATGATGTATTGAAAACTCCAGTTAAAAAAGATGGTGGTATAAAAAGAAAAGAGCGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTAC
ATTTCTCCTCTCAGCATTGAATCTGCGAAGCAGTCTTTGACTGAAAAACAGGCAAAAGAACTGCGAGAAATGAAGGGAGGACGCAATATTCTTGAGAACCTCAATAATAG
GGAAAGACAAATTAAGGAAATTGAGACATCATTTGAGGCATGCAAGTCACGCCCTATTCACGCCACTAATAAGAATTTATATCCTGTTGAGGTTTTACCTCTTCTACCTG
ATTTTGATAGGTATGATGATCCATTTGTTGTGGTGGCGTTTGATAGTGCTCCCACAGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGTGACGCTCACGAA
TCACAGGCGATAATGAAAAGTTATATGGCAACAGGCTCAGATCCCTCAAAACCTGAGAAATTTCTTGCGTACATGGTTCCCTCTCCAGATGAGCTTTCGAAGGATATTTA
CGATGAACAAGAAGATGTCTCTTATTCCTGGGTTCGTGAGTACCATTGGGATGTACGGGGTGACAATGTGGACGACCCCACTACATATCTCGTTTCATTTGATGATGCAG
AAGCTCGTTATGTGCCACTTCCCACAAAGCTTGTTCTTAGAAAAAAGAGGGCTAAAGAAGGGAGATCCAGTGATGAGGTCGAACATTTTCCAGCACCTGCAAGAGTGACT
GTAAGGAGAAGACCAACCGTAGCTACTTTGGAAGTGAAGGATCCTGGGATTTACTCAAACTCAAAAAGAGGATCAGATATTGAAGATGGTATAGGAAGATCGCATAAACA
TGATAGAAATCAAGACATGGATCAATTCAGTGGAGCTGAAGACGAGATGTCTGATTGA
Protein sequence
MASYRPYPPQSSFGSAPAQNSIPPPSAQSASVSSQQRGGATTQYNQNWGTYAGDASAPSAPSSSYPQNYNNQLHQTSNYHHQQYGPPRTQHPPPPPPPPHQSYPYASQPP
PPPPPDSSYPPPPPPPATSQPPNLYYPSSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASQQKAEGTNMGAHERDKGAPKDPSYGRRDRENSNHDKHQK
HSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKVHGSIVGSRMGERKATPFLSGERIENRLKKPTTFLCKLKFRNELPDT
SAQPKLMSLRKEKDHIFYFFGCCSYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPSSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQY
ISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIETSFEACKSRPIHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHE
SQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVT
VRRRPTVATLEVKDPGIYSNSKRGSDIEDGIGRSHKHDRNQDMDQFSGAEDEMSD