CuGenDBv2

MS009529 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS009529
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: protein PAF1 homolog
Genome location: scaffold813:1941479..1946121
RNA-Seq Expression: MS009529
Synteny: MS009529
Gene Ontology terms:
GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0016570 - histone modification (biological process)
GO:0016593 - Cdc73/Paf1 complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domains: IPR007133 - RNA polymerase II associated factor Paf1
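The genome location in this record is written as a scaffold name followed by `start..end`. The sketch below shows one way to split such a string and compute the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold813:1941479..1946121")
print(scaffold, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 4,643 bp; with half-open coordinates it would be one less.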


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022148278.1 protein PAF1 homolog [Momordica charantia] | e value: 0.0e+00 | % identity: 99.15
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
        MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA

Query:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERDKGISKD
        PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQ NQNQQSVQSMQPPPPPSSPPP+SSIPPPPPPNSPPP SAPQPRAEGANMGAHERDKGISKD
Subjt:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERDKGISKD

Query:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
        PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
Subjt:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL

Query:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVL
        SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVL
Subjt:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVL

Query:  TTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL
        TTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL
Subjt:  TTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL

Query:  PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDD
        PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDD
Subjt:  PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDD

Query:  PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE
        PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNS+RGSDIE GLGRSHKHDRHQDMDQYSGAEDE
Subjt:  PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE

Query:  LSD
        LSD
Subjt:  LSD
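In BLAST-style pairwise output, the unlabelled middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. The sketch below counts those categories for one alignment block. The function name `midline_stats` is a hypothetical helper. Per-block figures will not necessarily reproduce the % identity reported on this page, which the search tool computes over the full alignment.

```python
def midline_stats(query, midline):
    """Count identical and positive positions in one alignment block.

    query   -- the aligned query string (may contain '-' gap characters)
    midline -- the BLAST midline for the same block
    """
    # Trailing spaces on the midline are often stripped; pad to query length.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    positive = identical + midline[: len(query)].count("+")
    return {
        "length": len(query),
        "identical": identical,
        "positive": positive,
        "pct_identity": 100.0 * identical / len(query) if query else 0.0,
    }


# Example using a short ten-residue block of the kind shown on this page.
print(midline_stats("YSGAEDELSD", "YSGAED++SD"))
```

For the ten-residue example, eight positions are identical and two more are scored as positive.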

XP_022953373.1 protein PAF1 homolog [Cucurbita moschata] | e value: 0.0e+00 | % identity: 91.55
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRGG SQYNQNWGGYGGDGSV PPA SSSYPQNYNQ    +NYH QQHYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY QGNQNQQS+Q   PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ + EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPS R+ LAPEDEEL
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL

Query:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP
        LRDDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLYP
Subjt:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP

Query:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV
        VEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDV
Subjt:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV

Query:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ
        RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQ
Subjt:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ

Query:  YSGAEDELSD
        YSGAED++SD
Subjt:  YSGAEDELSD

XP_022992172.1 protein PAF1 homolog [Cucurbita maxima] | e value: 0.0e+00 | % identity: 91.55
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQR GSSQYNQNWGGYGGDGSV PPA SSSYPQNYNQ    +NYH QQHYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY QGNQNQQS+Q   PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQP+ EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG++KDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPS R+ LAPEDEEL
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL

Query:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP
        LRDDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLYP
Subjt:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP

Query:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV
        VEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDV
Subjt:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV

Query:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ
        RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQ
Subjt:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ

Query:  YSGAEDELSD
        YSGAED++SD
Subjt:  YSGAEDELSD

XP_023547399.1 protein PAF1 homolog [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 91.55
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRGG SQYNQNWGGYGGDGSV PPA SSSYPQNYNQ    +N+H QQHYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY QGNQNQQS+Q   PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQP+ EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPS R+ LAPEDEEL
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL

Query:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP
        LRDDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLYP
Subjt:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP

Query:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV
        VEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDV
Subjt:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV

Query:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ
        RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQ
Subjt:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ

Query:  YSGAEDELSD
        YSGAED++SD
Subjt:  YSGAEDELSD

XP_038898523.1 protein PAF1 homolog [Benincasa hispida] | e value: 0.0e+00 | % identity: 91.24
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQA----NYHQQQHYGPPRTQHPPPPPPPHQS
        MASYRPYPPQSSFGP+PG NP+PPPP Q+A VP QQRGG SQYNQNWGGYGGDGS+PPA SSSYPQNYNQA    NYH QQHYGPPR+QH PPPPPP+QS
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQA----NYHQQQHYGPPRTQHPPPPPPPHQS

Query:  YPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERDKG
        YPYAPQ PPPPPPDSSYPPPPPPPAPS P P+LYYPPS            QSMQPPPPPSSPPPSSSIPPPPPPNSPPP SAPQ +AEG NMGAHERDKG
Subjt:  YPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERDKG

Query:  ISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        +SKDPSYGRR+RENSNHDKHQRH GPPMPPKKANGPSGR+ET+DEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  ISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLR
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPS R+ LAPEDEELLR
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLR

Query:  DDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRPVHATNKNLYPVE
Subjt:  DDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD HESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDD EARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDELSD
        GAEDE+SD
Subjt:  GAEDELSD

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KCT6 Uncharacterized protein | e value: 0.0e+00 | % identity: 90.3
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGG-SSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQH-PPPPPPP
        MASYRPYPPQSSFG +P  N IPPP AQ+A V +QQRGG ++QYNQNWG Y GD S PPAPSSSYPQNYN      +NYH QQ YGPPRTQH PPPPPPP
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGG-SSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQH-PPPPPPP

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHER
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA S P P+LYYP SQYSQGNQNQ   QSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA Q +AEG NMGAHER
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHER

Query:  DKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGE
        DKG+ KDPSYGRR+RENSNHDKHQ+H GPPMPPKKANGPSGR+ET+DEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGE
Subjt:  DKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGE

Query:  RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEE
        R+ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNP S RM LAPEDEE
Subjt:  RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEE

Query:  LLRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY
        LLRDDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRP+HATNKNLY
Subjt:  LLRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY

Query:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD
        PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMAT SDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWD
Subjt:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD

Query:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD
        VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPG+YSNSKRGSDIEDG+GRSHKHDRHQDMD
Subjt:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD

Query:  QYSGAEDELSD
        Q+SGAEDE+SD
Subjt:  QYSGAEDELSD

A0A1S3CHF3 LOW QUALITY PROTEIN: protein PAF1 homolog | e value: 0.0e+00 | % identity: 89.87
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGG-SSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFG +P  N IPPPP+Q+A   +QQRGG ++QYNQNWG Y GD SVPPAPSSSYPQNYN      +NYH QQ YG PRTQH PPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGG-SSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPAPS P P+LYYP SQYSQGNQNQ   QSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA Q +AEG NMGAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRR+RENSNHDKHQ+H GPPMPPKKANGPSGR+ET+DEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPS R+ LAPEDEEL
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL

Query:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP
        LRDDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRP+HATNKNLYP
Subjt:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP

Query:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV
        VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDV
Subjt:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV

Query:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDR-HQDMD
        RGDNVDDPTTYLVSFDD+EARYVPLPTKLVL KKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPG+YSNSKRGSDIEDG+GR HKHDR HQDMD
Subjt:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDR-HQDMD

Query:  QYSGAEDELSD
        QYSGAEDE+SD
Subjt:  QYSGAEDELSD

A0A6J1D3N7 protein PAF1 homolog | e value: 0.0e+00 | % identity: 99.15
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
        MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA

Query:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERDKGISKD
        PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQ NQNQQSVQSMQPPPPPSSPPP+SSIPPPPPPNSPPP SAPQPRAEGANMGAHERDKGISKD
Subjt:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERDKGISKD

Query:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
        PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
Subjt:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL

Query:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVL
        SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVL
Subjt:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVL

Query:  TTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL
        TTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL
Subjt:  TTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL

Query:  PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDD
        PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDD
Subjt:  PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDD

Query:  PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE
        PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNS+RGSDIE GLGRSHKHDRHQDMDQYSGAEDE
Subjt:  PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE

Query:  LSD
        LSD
Subjt:  LSD

A0A6J1GN64 protein PAF1 homolog | e value: 0.0e+00 | % identity: 91.55
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRGG SQYNQNWGGYGGDGSV PPA SSSYPQNYNQ    +NYH QQHYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY QGNQNQQS+Q   PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ + EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPS R+ LAPEDEEL
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL

Query:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP
        LRDDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLYP
Subjt:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP

Query:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV
        VEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDV
Subjt:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV

Query:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ
        RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQ
Subjt:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ

Query:  YSGAEDELSD
        YSGAED++SD
Subjt:  YSGAEDELSD

A0A6J1JP14 protein PAF1 homolog | e value: 0.0e+00 | % identity: 91.55
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQR GSSQYNQNWGGYGGDGSV PPA SSSYPQNYNQ    +NYH QQHYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPA-QAAPVPTQQRGGSSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY QGNQNQQS+Q   PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQP+ EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG++KDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPS R+ LAPEDEEL
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEEL

Query:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP
        LRDDVL TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLYP
Subjt:  LRDDVLTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYP

Query:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV
        VEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDV
Subjt:  VEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDV

Query:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ
        RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQ
Subjt:  RGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ

Query:  YSGAEDELSD
        YSGAED++SD
Subjt:  YSGAEDELSD

SwissProt top hits (columns: hit, e value, % identity, alignment)
F4HQA1 Protein PAF1 homolog | e value: 3.5e-177 | % identity: 61.1
Query:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPP
        PP      PPPPP    P  P P PPPPP     SYPPPPPPP      PH YY     Y Q NQ       +Q P            PPPPPP++PPP 
Subjt:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPP

Query:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQM
            PR +G N    + +KG SK    GRRER   +  KH  H    +P  K      +IETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM
Subjt:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQM

Query:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS
            KGH         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM++++DKD +TKYTITSLEK++KP+++VEPDLGIPLDLLDLS
Subjt:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS

Query:  VYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASF
        VYNPP  +  LAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLN+RERQIK+IEASF
Subjt:  VYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASF

Query:  EACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYD
        EACKSRPVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKD++D
Subjt:  EACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYD

Query:  EQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SNS
        E E++S++WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRS+DE+EHFP PSRVTVRRR TV+ +E KD GVY       S+ 
Subjt:  EQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SNS

Query:  KRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDELSD
         R  + E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  KRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDELSD

Q4U0S5 RNA polymerase II-associated factor 1 homolog | e value: 5.6e-26 | % identity: 28.28
Query:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSL
        G R    R  P  SG             +C++K+ N LPD    PK ++   D+  + +Y  TSLEK +K +L  EPDLG+ +DL++   Y      + L
Subjt:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSL

Query:  APEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACK
         P DE+LL ++     ++     ++ +   K V W+ KT+YI   S E  +  ++ ++     E+K G ++ +          R+ QI  IE +FE A K
Subjt:  APEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACK

Query:  SRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQED
        S   H +   + PVEVLP+ PDF  + +P   V FDS P     +          A     +M   M  G    +  +F+AY +P+ D + K   D +E+
Subjt:  SRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQED

Query:  V--------SFSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG--RSTDEV---EHFPAPSRVTVRRRPTVATLEVKDPGV
        +         +   REY+W+V+   +      Y   F DA+  Y   L T++ L K+RAK G   ST+ V   +H     +    +    A LE  +P  
Subjt:  V--------SFSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG--RSTDEV---EHFPAPSRVTVRRRPTVATLEVKDPGV

Query:  YSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE
                D E+ L      D  +DM + SG E E
Subjt:  YSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE

Q4V886 RNA polymerase II-associated factor 1 homolog | e value: 1.9e-26 | % identity: 27.87
Query:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERP
        +C++K+ N LPD    PK ++   D++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y      + L P DE+LL +++      K     + + 
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERP

Query:  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYDD
          K V W+ KT+YI   S E  +  +    + E  E+K G ++ +          R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + +
Subjt:  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYDD

Query:  PFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NVD
        P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ ++          REY+W+V+   +  
Subjt:  PFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NVD

Query:  DPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ
            Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      +  ++ ++  G   +H++    ++
Subjt:  DPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQ

Query:  YSGAEDELS
          G+EDE S
Subjt:  YSGAEDELS

Q8K2T8 RNA polymerase II-associated factor 1 homolog | e value: 1.6e-25 | % identity: 28.3
Query:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERP
        +C++K+ N LPD    PK ++   D++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y      + L P DE+LL +++      K     + + 
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERP

Query:  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYDD
          K V W+ KT+YI   S E  +  +    + E  E+K G ++ +          R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + +
Subjt:  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYDD

Query:  PFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NVD
        P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ ++          REY+W+V+   +  
Subjt:  PFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NVD

Query:  DPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKR--------GSDIEDGLGRSHKH
            Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      +         GSD E   G S + 
Subjt:  DPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKR--------GSDIEDGLGRSHKH

Query:  DRHQD------MDQYSGAEDELSD
        +  +D       D+  G  DE SD
Subjt:  DRHQD------MDQYSGAEDELSD

Q8N7H5 RNA polymerase II-associated factor 1 homolog | e value: 9.6e-26 | % identity: 28.43
Query:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERP
        +C++K+ N LPD    PK ++   D++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y      + L P DE+LL +++      K     + + 
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERP

Query:  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYDD
          K V W+ KT+YI   S E  +  +    + E  E+K G ++ +          R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + +
Subjt:  TDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYDD

Query:  PFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NVD
        P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ ++          REY+W+V+   +  
Subjt:  PFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NVD

Query:  DPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKR--------GSDIEDGLGRSHKH
            Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      +         GSD E   G S + 
Subjt:  DPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSKR--------GSDIEDGLGRSHKH

Query:  DRHQDMDQYSGAEDE
        +  +  D++SG+E E
Subjt:  DRHQDMDQYSGAEDE

Arabidopsis top hits    e value    %identity    Alignment
AT1G79730.1 hydroxyproline-rich glycoprotein family protein    2.5e-178    61.1
Query:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPP
        PP      PPPPP    P  P P PPPPP     SYPPPPPPP      PH YY     Y Q NQ       +Q P            PPPPPP++PPP 
Subjt:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPP

Query:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQM
            PR +G N    + +KG SK    GRRER   +  KH  H    +P  K      +IETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM
Subjt:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQM

Query:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS
            KGH         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM++++DKD +TKYTITSLEK++KP+++VEPDLGIPLDLLDLS
Subjt:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS

Query:  VYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASF
        VYNPP  +  LAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLN+RERQIK+IEASF
Subjt:  VYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASF

Query:  EACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYD
        EACKSRPVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKD++D
Subjt:  EACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYD

Query:  EQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SNS
        E E++S++WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRS+DE+EHFP PSRVTVRRR TV+ +E KD GVY       S+ 
Subjt:  EQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SNS

Query:  KRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDELSD
         R  + E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  KRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDELSD


Sequences
CDS sequence
ATGGCTTCTTACAGGCCATATCCTCCACAATCGTCCTTCGGTCCTTCGCCAGGTCTAAATCCGATTCCGCCACCGCCAGCGCAAGCAGCTCCCGTTCCAACGCAGCAGCG
AGGAGGTAGTAGTCAATATAATCAGAATTGGGGTGGTTATGGCGGTGACGGGTCTGTGCCTCCAGCTCCATCTTCCTCGTATCCTCAAAATTACAACCAAGCTAATTACC
ACCAGCAGCAGCATTATGGTCCGCCGAGAACCCAACACCCTCCACCACCTCCTCCTCCTCACCAATCGTATCCTTATGCACCGCAGCCGCCGCCACCGCCGCCACCCGAT
TCTTCCTATCCTCCGCCTCCACCACCGCCAGCTCCTTCGGGTCCCCAACCTCATCTTTACTATCCTCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTCCGTACA
GTCAATGCAGCCACCACCTCCGCCCTCGTCTCCACCACCAAGCTCTTCAATTCCGCCGCCCCCACCCCCAAATTCTCCGCCACCTCCATCGGCGCCTCAACCAAGAGCGG
AGGGTGCAAACATGGGAGCACACGAGCGTGATAAAGGGATTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCACCCT
GGTCCCCCAATGCCTCCGAAGAAAGCAAACGGACCTTCAGGGAGAATTGAGACGGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAATTCGAAAAGCAAAGGCAAGATGA
GAGGCATAGACATCATCTAAAAGAATCCCAAAACACAATTCTGCAGAAGACCCAGATGTTATCTACCGGGAAGGGGCATGGATCAATTGTGGGGTCCCGAATGGGGGAAA
GGAGGGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAATGAGCTTCCAGATACAAGTGCA
CAGCCGAAACTCATGTCATTGCGGAAAGATAAAGATTACTATACGAAATATACAATCACATCGCTAGAGAAAATGTACAAACCTCAGCTTTACGTTGAGCCAGATCTTGG
AATACCTCTCGATTTGCTTGACCTCAGTGTATACAACCCTCCTAGTTCTAGAATGTCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATGTGTTGACAACTCCAG
TTAAAAAAGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGGGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTTAGCATTGAATCCGCAAAACAGTCT
TTGACTGAAAAACAAGCAAAAGAGCTTCGAGAAATGAAGGGAGGGCGTAATATTCTTGAGAACCTCAATAGTAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTTTGA
GGCATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTAGAGGTTTTACCTCTTCTGCCTGATTTTGATAGGTATGATGATCCATTTGTTGTGGTGG
CGTTTGATAGTGCTCCCACGGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGCGATGCTCATGAATCGCAGGCGATAATGAAAAGCTATATGGCAACAGGC
TCAGACCCAACAAAACCTGAGAAATTTCTGGCGTACATGGTTCCTTCTCCAGATGAGCTGTCCAAGGATATGTATGATGAACAAGAAGACGTTTCATTTTCCTGGGTTCG
TGAGTACCATTGGGATGTACGAGGAGATAATGTGGATGATCCCACGACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGTTC
TGAGAAAAAAGAGGGCTAAAGAAGGGAGATCAACCGATGAGGTTGAACATTTTCCTGCACCTTCGAGAGTGACTGTAAGGAGGAGACCAACAGTAGCTACTTTGGAAGTG
AAGGATCCAGGGGTTTACTCGAATTCGAAAAGGGGATCTGATATTGAAGATGGTCTTGGAAGATCACATAAACATGATAGACACCAAGATATGGATCAATACAGTGGAGC
TGAAGACGAGTTGTCTGAT
mRNA sequence
ATGGCTTCTTACAGGCCATATCCTCCACAATCGTCCTTCGGTCCTTCGCCAGGTCTAAATCCGATTCCGCCACCGCCAGCGCAAGCAGCTCCCGTTCCAACGCAGCAGCG
AGGAGGTAGTAGTCAATATAATCAGAATTGGGGTGGTTATGGCGGTGACGGGTCTGTGCCTCCAGCTCCATCTTCCTCGTATCCTCAAAATTACAACCAAGCTAATTACC
ACCAGCAGCAGCATTATGGTCCGCCGAGAACCCAACACCCTCCACCACCTCCTCCTCCTCACCAATCGTATCCTTATGCACCGCAGCCGCCGCCACCGCCGCCACCCGAT
TCTTCCTATCCTCCGCCTCCACCACCGCCAGCTCCTTCGGGTCCCCAACCTCATCTTTACTATCCTCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTCCGTACA
GTCAATGCAGCCACCACCTCCGCCCTCGTCTCCACCACCAAGCTCTTCAATTCCGCCGCCCCCACCCCCAAATTCTCCGCCACCTCCATCGGCGCCTCAACCAAGAGCGG
AGGGTGCAAACATGGGAGCACACGAGCGTGATAAAGGGATTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCACCCT
GGTCCCCCAATGCCTCCGAAGAAAGCAAACGGACCTTCAGGGAGAATTGAGACGGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAATTCGAAAAGCAAAGGCAAGATGA
GAGGCATAGACATCATCTAAAAGAATCCCAAAACACAATTCTGCAGAAGACCCAGATGTTATCTACCGGGAAGGGGCATGGATCAATTGTGGGGTCCCGAATGGGGGAAA
GGAGGGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAATGAGCTTCCAGATACAAGTGCA
CAGCCGAAACTCATGTCATTGCGGAAAGATAAAGATTACTATACGAAATATACAATCACATCGCTAGAGAAAATGTACAAACCTCAGCTTTACGTTGAGCCAGATCTTGG
AATACCTCTCGATTTGCTTGACCTCAGTGTATACAACCCTCCTAGTTCTAGAATGTCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATGTGTTGACAACTCCAG
TTAAAAAAGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGGGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTTAGCATTGAATCCGCAAAACAGTCT
TTGACTGAAAAACAAGCAAAAGAGCTTCGAGAAATGAAGGGAGGGCGTAATATTCTTGAGAACCTCAATAGTAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTTTGA
GGCATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTAGAGGTTTTACCTCTTCTGCCTGATTTTGATAGGTATGATGATCCATTTGTTGTGGTGG
CGTTTGATAGTGCTCCCACGGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGCGATGCTCATGAATCGCAGGCGATAATGAAAAGCTATATGGCAACAGGC
TCAGACCCAACAAAACCTGAGAAATTTCTGGCGTACATGGTTCCTTCTCCAGATGAGCTGTCCAAGGATATGTATGATGAACAAGAAGACGTTTCATTTTCCTGGGTTCG
TGAGTACCATTGGGATGTACGAGGAGATAATGTGGATGATCCCACGACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGTTC
TGAGAAAAAAGAGGGCTAAAGAAGGGAGATCAACCGATGAGGTTGAACATTTTCCTGCACCTTCGAGAGTGACTGTAAGGAGGAGACCAACAGTAGCTACTTTGGAAGTG
AAGGATCCAGGGGTTTACTCGAATTCGAAAAGGGGATCTGATATTGAAGATGGTCTTGGAAGATCACATAAACATGATAGACACCAAGATATGGATCAATACAGTGGAGC
TGAAGACGAGTTGTCTGAT
Protein sequence
MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYAPQPPPPPPPD
SSYPPPPPPPAPSGPQPHLYYPPSQYSQGNQNQQSVQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHP
GPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSA
QPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQS
LTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATG
SDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEV
KDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDELSD
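
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. The sketch below is one way to check that, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of the database; only the first 21 bases of the CDS are used as a demonstration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each codon position.
# Assumption: this table is the appropriate one for this nuclear-encoded gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; whitespace and line breaks are ignored.

    Codons containing characters other than T, C, A, G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First seven codons of the CDS sequence listed above.
first_codons = "ATGGCTTCTTACAGGCCATAT"
print(translate(first_codons))  # MASYRPY
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence would confirm that the two records agree. Note that the listed CDS ends at the codon for the final residue and does not include a stop codon.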