CuGenDBv2

MC09g0245 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g0245
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein PAF1 homolog
Genome location: MC09:2259716..2265444
RNA-Seq Expression: MC09g0245
Synteny: MC09g0245
Gene Ontology terms:
GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0016570 - histone modification (biological process)
GO:0016593 - Cdc73/Paf1 complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domains: IPR007133 - RNA polymerase II associated factor Paf1


Homology
GenBank top hits (e value, % identity, alignment)
XP_022148278.1 protein PAF1 homolog [Momordica charantia] (e value: 0.0; identity: 99.72%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
        MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA

Query:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKGISKD
        PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQ NQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKGISKD
Subjt:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKGISKD

Query:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
        PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
Subjt:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL

Query:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDV
        SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPP SSRMSLAPEDEELLRDDV
Subjt:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDV

Query:  LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
        LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
Subjt:  LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL

Query:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVD
        LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVD
Subjt:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVD

Query:  DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQYSGAED
        DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQYSGAED
Subjt:  DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQYSGAED

Query:  ELSD
        ELSD
Subjt:  ELSD

XP_022953373.1 protein PAF1 homolog [Cucurbita moschata] (e value: 0.0; identity: 90.58%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRGGS QYNQNWGGYGGDGSVPP A SSSYPQNYNQ    +NYHQQ HYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY Q NQNQQS+Q   PPPPPSSPPP+SSIPPPPPPNSPPP SAPQ + EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPP S R+ LAPEDEE
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE

Query:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY
        LLRDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLY
Subjt:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY

Query:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD
        PVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWD
Subjt:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD

Query:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD
        VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN +RGSDIE GLGRSHKHDRHQDMD
Subjt:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD

Query:  QYSGAEDELSD
        QYSGAED++SD
Subjt:  QYSGAEDELSD

XP_022992172.1 protein PAF1 homolog [Cucurbita maxima] (e value: 0.0; identity: 90.58%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRG SSQYNQNWGGYGGDGSVPP A SSSYPQNYNQ    +NYHQQ HYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY Q NQNQQS+Q   PPPPPSSPPP+SSIPPPPPPNSPPP SAPQP+ EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG++KDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPP S R+ LAPEDEE
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE

Query:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY
        LLRDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLY
Subjt:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY

Query:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD
        PVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWD
Subjt:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD

Query:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD
        VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN +RGSDIE GLGRSHKHDRHQDMD
Subjt:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD

Query:  QYSGAEDELSD
        QYSGAED++SD
Subjt:  QYSGAEDELSD

XP_023547399.1 protein PAF1 homolog [Cucurbita pepo subsp. pepo] (e value: 0.0; identity: 90.58%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRGGS QYNQNWGGYGGDGSVPP A SSSYPQNYNQ    +N+HQQ HYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY Q NQNQQS+Q   PPPPPSSPPP+SSIPPPPPPNSPPP SAPQP+ EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPP S R+ LAPEDEE
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE

Query:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY
        LLRDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLY
Subjt:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY

Query:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD
        PVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWD
Subjt:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD

Query:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD
        VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN +RGSDIE GLGRSHKHDRHQDMD
Subjt:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD

Query:  QYSGAEDELSD
        QYSGAED++SD
Subjt:  QYSGAEDELSD

XP_038898523.1 protein PAF1 homolog [Benincasa hispida] (e value: 0.0; identity: 90.55%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQA----NYHQQQHYGPPRTQHPPPPPPPHQS
        MASYRPYPPQSSFGP+PG NP+PPPP Q+A VP QQRGG SQYNQNWGGYGGDGS+PPA SSSYPQNYNQA    NYHQQ HYGPPR+QHPPPPPP +QS
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQA----NYHQQQHYGPPRTQHPPPPPPPHQS

Query:  YPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKG
        YPYAPQPPPPPP DSSYPPPPPPPAPS P P+LYYPPSQ            SMQPPPPPSSPPP+SSIPPPPPPNSPPP SAPQ +AEG NMGAHERDKG
Subjt:  YPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKG

Query:  ISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        +SKDPSYGRR+RENSNHDKHQRH GPPMPPKKANGPSGR+ET+DEKRLRKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  ISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELL
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPP S R+ LAPEDEELL
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELL

Query:  RDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPV
        RDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRPVHATNKNLYPV
Subjt:  RDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPV

Query:  EVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVR
        EVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD HESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDVR
Subjt:  EVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVR

Query:  GDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQY
        GDNVDDPTTYLVSFDD EARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSNS+RGSDIE GLGRSHKHDRHQDMDQY
Subjt:  GDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQY

Query:  SGAEDELSD
        SGAEDE+SD
Subjt:  SGAEDELSD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCT6 Uncharacterized protein (e value: 0.0; identity: 89.33%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSS-QYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQHPPPPPPP-
        MASYRPYPPQSSFG +P  N IPPP AQ+A V +QQRGG++ QYNQNWG Y GD S PPAPSSSYPQNYN      +NYH QQ YGPPRTQHPPPPPPP 
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSS-QYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQHPPPPPPP-

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHER
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA S P P+LYYP SQYSQ NQNQQS   MQPPPPPSSPPP+SSIPPPPPPNSPPP SA Q +AEG NMGAHER
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHER

Query:  DKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGE
        DKG+ KDPSYGRR+RENSNHDKHQ+H GPPMPPKKANGPSGR+ET+DEKRLRKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGK HGSIVGSRMGE
Subjt:  DKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGE

Query:  RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDE
        R+ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNP  S RM LAPEDE
Subjt:  RRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDE

Query:  ELLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNL
        ELLRDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRP+HATNKNL
Subjt:  ELLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNL

Query:  YPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHW
        YPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMAT SDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHW
Subjt:  YPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHW

Query:  DVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDM
        DVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPG+YSNS+RGSDIE G+GRSHKHDRHQDM
Subjt:  DVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDM

Query:  DQYSGAEDELSD
        DQ+SGAEDE+SD
Subjt:  DQYSGAEDELSD

A0A1S3CHF3 LOW QUALITY PROTEIN: protein PAF1 homolog (e value: 0.0; identity: 88.9%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSS-QYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFG +P  N IPPPP+Q+A   +QQRGG++ QYNQNWG Y GD SVPPAPSSSYPQNYN      +NYH QQ YG PRTQHPPPPPP H
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSS-QYNQNWGGYGGDGSVPPAPSSSYPQNYNQ-----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPAPS P P+LYYP SQYSQ NQNQQS   MQPPPPPSSPPP+SSIPPPPPPNSPPP SA Q +AEG NMGAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRR+RENSNHDKHQ+H GPPMPPKKANGPSGR+ET+DEK+LRKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGK HGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPP S R+ LAPEDEE
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE

Query:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY
        LLRDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRP+HATNKNLY
Subjt:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY

Query:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD
        PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWD
Subjt:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD

Query:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRH-QDM
        VRGDNVDDPTTYLVSFDD+EARYVPLPTKLVL KKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPG+YSNS+RGSDIE G+GR HKHDRH QDM
Subjt:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRH-QDM

Query:  DQYSGAEDELSD
        DQYSGAEDE+SD
Subjt:  DQYSGAEDELSD

A0A6J1D3N7 protein PAF1 homolog (e value: 0.0; identity: 99.72%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
        MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYA

Query:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKGISKD
        PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQ NQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKGISKD
Subjt:  PQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKGISKD

Query:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
        PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
Subjt:  PSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL

Query:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDV
        SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPP SSRMSLAPEDEELLRDDV
Subjt:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDV

Query:  LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
        LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
Subjt:  LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL

Query:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVD
        LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVD
Subjt:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVD

Query:  DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQYSGAED
        DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQYSGAED
Subjt:  DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQYSGAED

Query:  ELSD
        ELSD
Subjt:  ELSD

A0A6J1GN64 protein PAF1 homolog (e value: 0.0; identity: 90.58%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRGGS QYNQNWGGYGGDGSVPP A SSSYPQNYNQ    +NYHQQ HYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY Q NQNQQS+Q   PPPPPSSPPP+SSIPPPPPPNSPPP SAPQ + EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPP S R+ LAPEDEE
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE

Query:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY
        LLRDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLY
Subjt:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY

Query:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD
        PVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWD
Subjt:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD

Query:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD
        VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN +RGSDIE GLGRSHKHDRHQDMD
Subjt:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD

Query:  QYSGAEDELSD
        QYSGAED++SD
Subjt:  QYSGAEDELSD

A0A6J1JP14 protein PAF1 homolog (e value: 0.0; identity: 90.58%)
Query:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH
        MASYRPYPPQSSFGPSPG NPIPPPPA  AA VPTQQRG SSQYNQNWGGYGGDGSVPP A SSSYPQNYNQ    +NYHQQ HYGPPR+Q PPPPPPPH
Subjt:  MASYRPYPPQSSFGPSPGLNPIPPPPAQ-AAPVPTQQRGGSSQYNQNWGGYGGDGSVPP-APSSSYPQNYNQ----ANYHQQQHYGPPRTQHPPPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPA S P  H Y+PPSQY Q NQNQQS+Q   PPPPPSSPPP+SSIPPPPPPNSPPP SAPQP+ EG+++GAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERD

Query:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG++KDPSYGRRERENSNHDKHQRH GPPMPPKK+NGPSGRIET+DEKR RKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE
        +ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRK+KD+YT+YTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPP S R+ LAPEDEE
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEE

Query:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY
        LLRDDVL TPVKKDG IKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLN+RER+IKEI+ASFEACKSRPVHATNKNLY
Subjt:  LLRDDVLTTPVKKDG-IKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLY

Query:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD
        PVEVLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWD
Subjt:  PVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWD

Query:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD
        VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN +RGSDIE GLGRSHKHDRHQDMD
Subjt:  VRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMD

Query:  QYSGAEDELSD
        QYSGAED++SD
Subjt:  QYSGAEDELSD

SwissProt top hits (e value, % identity, alignment)
F4HQA1 Protein PAF1 homolog (e value: 3.9e-176; identity: 61.01%)
Query:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPS
        PP      PPPPP    P  P P PPPPP     SYPPPPPPP      PH YY     Y Q NQ       +Q P            PPPPPP++PPP 
Subjt:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPS

Query:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQM
            PR +G N    + +KG SK    GRRER   +  KH  H    +P  K      +IETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM
Subjt:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQM

Query:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS
            KGH         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM++++DKD +TKYTITSLEK++KP+++VEPDLGIPLDLLDLS
Subjt:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS

Query:  VYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEAS
        VYN PP  +  LAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLN+RERQIK+IEAS
Subjt:  VYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEAS

Query:  FEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMY
        FEACKSRPVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKD++
Subjt:  FEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMY

Query:  DEQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SN
        DE E++S++WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRS+DE+EHFP PSRVTVRRR TV+ +E KD GVY       S+
Subjt:  DEQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SN

Query:  SERGSDIEHGLGRSHKHDRHQDMDQYS-GAEDELSD
          R  + E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  SERGSDIEHGLGRSHKHDRHQDMDQYS-GAEDELSD

Q4U0S5 RNA polymerase II-associated factor 1 homolog (e value: 2.5e-26; identity: 27.92%)
Query:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMS
        G R    R  P  SG             +C++K+ N LPD    PK ++   D+  + +Y  TSLEK +K +L  EPDLG+ +DL++   Y   P+  + 
Subjt:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMS

Query:  LAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-AC
        L P DE+LL ++     ++     ++ +   K V W+ KT+YI   S E  +  ++ ++     E+K G ++ +          R+ QI  IE +FE A 
Subjt:  LAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-AC

Query:  KSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQE
        KS   H +   + PVEVLP+ PDF  + +P   V FDS P     +          A     +M   M  G    +  +F+AY +P+ D + K   D +E
Subjt:  KSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQE

Query:  DV--------SFSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG--RSTDEV---EHFPAPSRVTVRRRPTVATLEVKDPG
        ++         +   REY+W+V+   +      Y   F DA+  Y   L T++ L K+RAK G   ST+ V   +H     +    +    A LE  +P 
Subjt:  DV--------SFSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG--RSTDEV---EHFPAPSRVTVRRRPTVATLEVKDPG

Query:  VYSNSERGSDIEHGLGRSHKHDRHQDMD-QYSGAEDE
           + E   D+E  +      +R +  D + S +E E
Subjt:  VYSNSERGSDIEHGLGRSHKHDRHQDMD-QYSGAEDE

Q4V886 RNA polymerase II-associated factor 1 homolog (e value: 1.8e-27; identity: 28.94%)
Query:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   D++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+  + L P DE+LL +++      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +          R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ ++          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSER--------GSDIEHGLGRSHK
             Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      E         GSD EH  G S +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSER--------GSDIEHGLGRSHK

Query:  HDRHQD------MDQYSGAEDELSD
         +  +D       D+  G  DE SD
Subjt:  HDRHQD------MDQYSGAEDELSD

Q8K2T8 RNA polymerase II-associated factor 1 homolog (e value: 1.5e-26; identity: 28.71%)
Query:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   D++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+  + L P DE+LL +++      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +          R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ ++          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSER--------GSDIEHGLGRSHK
             Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      E         GSD E   G S +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSER--------GSDIEHGLGRSHK

Query:  HDRHQD------MDQYSGAEDELSD
         +  +D       D+  G  DE SD
Subjt:  HDRHQD------MDQYSGAEDELSD

Q8N7H5 RNA polymerase II-associated factor 1 homolog (e value: 1.1e-26; identity: 28.85%)
Query:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   D++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+  + L P DE+LL +++      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +          R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NSRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ ++          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSER--------GSDIEHGLGRSHK
             Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      E         GSD E   G S +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPSRVTVRRRPTVATLEVKDPGVYSNSER--------GSDIEHGLGRSHK

Query:  HDRHQDMDQYSGAEDE
         +  +  D++SG+E E
Subjt:  HDRHQDMDQYSGAEDE

Arabidopsis top hits
AT1G79730.1 hydroxyproline-rich glycoprotein family protein (e value: 2.7e-177; % identity: 61.01)
Query:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPS
        PP      PPPPP    P  P P PPPPP     SYPPPPPPP      PH YY     Y Q NQ       +Q P            PPPPPP++PPP 
Subjt:  PPRTQHPPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSGPQPHLYYPPS-QYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPS

Query:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQM
            PR +G N    + +KG SK    GRRER   +  KH  H    +P  K      +IETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM
Subjt:  SAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHPGPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQM

Query:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS
            KGH         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM++++DKD +TKYTITSLEK++KP+++VEPDLGIPLDLLDLS
Subjt:  LSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLS

Query:  VYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEAS
        VYN PP  +  LAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLN+RERQIK+IEAS
Subjt:  VYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNSRERQIKEIEAS

Query:  FEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMY
        FEACKSRPVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKD++
Subjt:  FEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDMY

Query:  DEQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SN
        DE E++S++WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRS+DE+EHFP PSRVTVRRR TV+ +E KD GVY       S+
Subjt:  DEQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLEVKDPGVY-------SN

Query:  SERGSDIEHGLGRSHKHDRHQDMDQYS-GAEDELSD
          R  + E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  SERGSDIEHGLGRSHKHDRHQDMDQYS-GAEDELSD


Sequences
CDS sequence
ATGGCTTCTTACAGGCCATATCCTCCACAATCGTCCTTCGGTCCTTCGCCAGGTCTAAATCCGATTCCGCCACCGCCAGCGCAAGCAGCTCCCGTTCCAACGCAGCAGCG
AGGAGGTAGTAGTCAATATAATCAGAATTGGGGTGGTTATGGCGGTGACGGGTCTGTGCCTCCAGCTCCATCTTCCTCGTATCCTCAAAATTACAACCAAGCTAATTACC
ACCAGCAGCAGCATTATGGTCCGCCGAGAACCCAACACCCTCCACCACCTCCTCCTCCTCACCAATCGTATCCTTATGCACCGCAGCCGCCGCCACCGCCGCCACCCGAT
TCTTCCTATCCTCCGCCTCCACCACCGCCAGCTCCTTCGGGTCCCCAACCTCATCTTTACTATCCTCCTTCACAGTATTCCCAGAGTAATCAAAATCAGCAGTCCGTACA
GTCAATGCAGCCACCACCTCCGCCCTCGTCTCCACCACCAAACTCTTCAATTCCGCCGCCCCCACCCCCAAATTCTCCGCCACCTTCATCGGCGCCTCAACCAAGAGCGG
AGGGTGCAAACATGGGAGCACACGAGCGTGATAAAGGGATTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCACCCT
GGTCCCCCAATGCCTCCGAAGAAAGCAAACGGACCTTCAGGGAGAATTGAGACGGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAATTCGAAAAGCAAAGGCAAGATGA
GAGGCATAGACATCATATAAAAGAATCCCAAAACACAATTCTGCAGAAGACCCAGATGTTATCTACCGGGAAGGGGCATGGATCAATTGTGGGGTCCCGAATGGGGGAAA
GGAGGGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAATGAGCTTCCAGATACAAGTGCA
CAGCCGAAACTCATGTCATTGCGGAAAGATAAAGATTACTATACAAAATATACAATCACATCGCTAGAGAAAATGTACAAACCTCAGCTTTACGTTGAGCCAGATCTTGG
AATACCTCTCGATTTGCTTGACCTCAGTGTATACAACCCCCCTCCTAGTTCTAGAATGTCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATGTGTTGACAACTC
CAGTTAAAAAAGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGGGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTTAGCATTGAATCCGCAAAACAG
TCTTTGACTGAAAAACAAGCAAAAGAGCTTCGAGAAATGAAGGGAGGGCGTAATATTCTTGAGAACCTCAATAGTAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTT
TGAGGCATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTAGAGGTTTTACCTCTTCTGCCTGATTTTGATAGGTATGATGATCCATTTGTTGTGG
TGGCGTTTGATAGTGCTCCCACGGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGCGATGCTCATGAATCGCAGGCGATAATGAAAAGCTATATGGCAACA
GGCTCAGACCCAACAAAACCTGAGAAATTTCTGGCGTACATGGTTCCTTCTCCAGATGAGCTGTCCAAGGATATGTATGATGAACAAGAAGACGTTTCATTTTCCTGGGT
TCGTGAGTACCATTGGGATGTACGAGGAGATAATGTGGATGATCCCACGACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTG
TTCTGAGAAAAAAGAGGGCTAAAGAAGGGAGATCAACCGATGAGGTTGAACATTTTCCTGCACCTTCGAGAGTGACTGTAAGGAGGAGACCAACAGTAGCTACTTTGGAA
GTGAAGGATCCAGGGGTTTACTCGAATTCGGAAAGGGGATCTGATATTGAACATGGTCTTGGAAGATCACATAAACATGATAGACACCAAGATATGGATCAATACAGTGG
AGCTGAAGACGAGTTGTCTGATTGA
mRNA sequence
AATTTTCTTTTGAAAAATTGACAATAAGATTATAACATAACTGAAAATTAGGAAAAAATGAAATATCCGACGAAAAAAGATTGAAAAAAAAAAAAAAAGAATCGACGAAA
TGCTGCGCTCAGAAAACGCCGTGACGTAACAGAAAAGTGGGAGATCCACGAAGGCCGCCCTCGCCCTCCCCTGCGCTGTGCCTGCGTTTGCTTGGGAACCAAAGTTCTTT
CCACTACTTCTTCCCCTCAATTCCCCCAACCACTTCCATTTCTCTCTCTCTTAAACCTTTCTTCTTTGATCTTCTATCAATTCCTGGTTCCCCCTCGTCTCTACTTTTTT
CTTCAGCTAGGGTTTTATTGCTCATCGATTCAGATTGCAAGCTTCGAGGTGGTGAATTCGCCTTCATCTTCTCGGGACTCGGGTTTCTTTTTCTTGGAAACTAGTATTTG
CATTGGCAGTATCACTACCTGGATTGATCCTTTTTGTTGTGTTCGTTCTGTTCTCCTCCTCTCTCTCTCTAACTTCTAGATGCGTTGATAGGGTTTTGGAGATTTCGCGT
TGGAGGGTTTTGATGATCGCCCGGGATATCTGAGTGAAATTGTACTTTGTGATTTCGCGATTTTGTTTCTTCCAGGGTTCTGTTTGATCGGCTTTTCGGGGAGATAGCCA
TGGCTTCTTACAGGCCATATCCTCCACAATCGTCCTTCGGTCCTTCGCCAGGTCTAAATCCGATTCCGCCACCGCCAGCGCAAGCAGCTCCCGTTCCAACGCAGCAGCGA
GGAGGTAGTAGTCAATATAATCAGAATTGGGGTGGTTATGGCGGTGACGGGTCTGTGCCTCCAGCTCCATCTTCCTCGTATCCTCAAAATTACAACCAAGCTAATTACCA
CCAGCAGCAGCATTATGGTCCGCCGAGAACCCAACACCCTCCACCACCTCCTCCTCCTCACCAATCGTATCCTTATGCACCGCAGCCGCCGCCACCGCCGCCACCCGATT
CTTCCTATCCTCCGCCTCCACCACCGCCAGCTCCTTCGGGTCCCCAACCTCATCTTTACTATCCTCCTTCACAGTATTCCCAGAGTAATCAAAATCAGCAGTCCGTACAG
TCAATGCAGCCACCACCTCCGCCCTCGTCTCCACCACCAAACTCTTCAATTCCGCCGCCCCCACCCCCAAATTCTCCGCCACCTTCATCGGCGCCTCAACCAAGAGCGGA
GGGTGCAAACATGGGAGCACACGAGCGTGATAAAGGGATTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCACCCTG
GTCCCCCAATGCCTCCGAAGAAAGCAAACGGACCTTCAGGGAGAATTGAGACGGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAATTCGAAAAGCAAAGGCAAGATGAG
AGGCATAGACATCATATAAAAGAATCCCAAAACACAATTCTGCAGAAGACCCAGATGTTATCTACCGGGAAGGGGCATGGATCAATTGTGGGGTCCCGAATGGGGGAAAG
GAGGGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAATGAGCTTCCAGATACAAGTGCAC
AGCCGAAACTCATGTCATTGCGGAAAGATAAAGATTACTATACAAAATATACAATCACATCGCTAGAGAAAATGTACAAACCTCAGCTTTACGTTGAGCCAGATCTTGGA
ATACCTCTCGATTTGCTTGACCTCAGTGTATACAACCCCCCTCCTAGTTCTAGAATGTCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATGTGTTGACAACTCC
AGTTAAAAAAGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGGGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTTAGCATTGAATCCGCAAAACAGT
CTTTGACTGAAAAACAAGCAAAAGAGCTTCGAGAAATGAAGGGAGGGCGTAATATTCTTGAGAACCTCAATAGTAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTTT
GAGGCATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTAGAGGTTTTACCTCTTCTGCCTGATTTTGATAGGTATGATGATCCATTTGTTGTGGT
GGCGTTTGATAGTGCTCCCACGGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGCGATGCTCATGAATCGCAGGCGATAATGAAAAGCTATATGGCAACAG
GCTCAGACCCAACAAAACCTGAGAAATTTCTGGCGTACATGGTTCCTTCTCCAGATGAGCTGTCCAAGGATATGTATGATGAACAAGAAGACGTTTCATTTTCCTGGGTT
CGTGAGTACCATTGGGATGTACGAGGAGATAATGTGGATGATCCCACGACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGT
TCTGAGAAAAAAGAGGGCTAAAGAAGGGAGATCAACCGATGAGGTTGAACATTTTCCTGCACCTTCGAGAGTGACTGTAAGGAGGAGACCAACAGTAGCTACTTTGGAAG
TGAAGGATCCAGGGGTTTACTCGAATTCGGAAAGGGGATCTGATATTGAACATGGTCTTGGAAGATCACATAAACATGATAGACACCAAGATATGGATCAATACAGTGGA
GCTGAAGACGAGTTGTCTGATTGATTAGTTCATGTTTTGCCTCAGCCAAAGATTATCTTCCTGATGCAAACGCAGCCATCTGGCAGAAATTCTCCCAAATTTTTTTAAAA
CCACCGGATGATTATGGTATTGTGTTGAGTGTGTACGTTACATGTCTATCACCACACAAACTAAAGTCTTTGTATGCAATCTTGAATATTTCTAATTTAATCTGTATAGT
TTCTCTCTTGGACATGGCAAAGAAAAAAGTGAAATCTGAAAAAGGGAATCGCATCAGATCCTCCCTCCCTCCCATTTTTCTAGTAAAATTGATGTAATGATCTCTAAGAT
GATTCTTTTAGATGGGCAGACAATTCTCTCGTGGATGGGCACTGTACACGTGTCTTGGATAATAAATTGGTGAAAGTTGAACTCTACTTTATAATAGACGAGTATTTTCA
ATTTTCT
Protein sequence
MASYRPYPPQSSFGPSPGLNPIPPPPAQAAPVPTQQRGGSSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQANYHQQQHYGPPRTQHPPPPPPPHQSYPYAPQPPPPPPPD
SSYPPPPPPPAPSGPQPHLYYPPSQYSQSNQNQQSVQSMQPPPPPSSPPPNSSIPPPPPPNSPPPSSAPQPRAEGANMGAHERDKGISKDPSYGRRERENSNHDKHQRHP
GPPMPPKKANGPSGRIETEDEKRLRKKREFEKQRQDERHRHHIKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSA
QPKLMSLRKDKDYYTKYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPPSSRMSLAPEDEELLRDDVLTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQ
SLTEKQAKELREMKGGRNILENLNSRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMAT
GSDPTKPEKFLAYMVPSPDELSKDMYDEQEDVSFSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPSRVTVRRRPTVATLE
VKDPGVYSNSERGSDIEHGLGRSHKHDRHQDMDQYSGAEDELSD
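
The CDS and protein sequences listed above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the variables `cds_start` and `cds_end` are illustrative, and only the first 30 and last 24 nucleotides of the CDS are used to keep the example short. It assumes the standard (nuclear) codon table applies.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for the first, second, and third base.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence wrapped
    over several lines (as on this page) can be pasted in directly.
    Codons containing characters outside A/C/G/T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 nucleotides of the MC09g0245 CDS shown above.
cds_start = "ATGGCTTCTTACAGGCCATATCCTCCACAA"
cds_end = "GCTGAAGACGAGTTGTCTGATTGA"

print(translate(cds_start))  # MASYRPYPPQ
print(translate(cds_end))    # AEDELSD
```

Both fragments agree with the start (MASYRPYPPQ...) and end (...AEDELSD) of the protein sequence listed above. Passing the full CDS to the same function would allow the whole protein to be compared in the same way.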