; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014017 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014017
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein PAF1 homolog
Genome locationscaffold3:48458668..48464312
RNA-Seq ExpressionSpg014017
SyntenySpg014017
Gene Ontology termsGO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0016570 - histone modification (biological process)
GO:0016593 - Cdc73/Paf1 complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR007133 - RNA polymerase II associated factor Paf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014045.1 Protein PAF1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0092.49Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR  GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPPH
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        QSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QG+QNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++G HERDKGV
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT
        SKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+AT
Subjt:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD

Query:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        D+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
        NVDDPTTYLVSFDDAEARYV            PLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
Subjt:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
        HDRHQDMDQYSGA+D+MSD
Subjt:  HDRHQDMDQYSGAEDEMSD

XP_022953373.1 protein PAF1 homolog [Cucurbita moschata]0.0e+0092.91Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR  GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPPH
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        QSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++GAHERDKGV
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT
        SKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+AT
Subjt:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD

Query:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        D+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
        NVDDPTTYLVSFDDAEARYV            PLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
Subjt:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
        HDRHQDMDQYSGAED+MSD
Subjt:  HDRHQDMDQYSGAEDEMSD

XP_022992172.1 protein PAF1 homolog [Cucurbita maxima]0.0e+0092.49Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR  G SQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPPH
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        QSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKGV
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT
        +KDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+AT
Subjt:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD

Query:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        D+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
        NVDDPTTYLVSFDDAEARYV            PLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
Subjt:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
        HDRHQDMDQYSGAED+MSD
Subjt:  HDRHQDMDQYSGAEDEMSD

XP_023547399.1 protein PAF1 homolog [Cucurbita pepo subsp. pepo]0.0e+0092.63Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR  GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSN+HQQHYGPPRSQ  PPPPPPH
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        QSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKGV
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT
        SKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+AT
Subjt:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD

Query:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        D+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
        NVDDPTTYLVSFDDAEARYV            PLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
Subjt:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
        HDRHQDMDQYSGAED+MSD
Subjt:  HDRHQDMDQYSGAEDEMSD

XP_038898523.1 protein PAF1 homolog [Benincasa hispida]0.0e+0092.73Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHPPPPPPHQSY
        MASYRPYP QSSFGP+PGQNP+PPPP Q ASVP QQR GGGSQYNQNWGGYGGDGS+PPA SSSYPQNYNQ HQSSNYHQQHYGPPRSQHPPPPPP+QSY
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHPPPPPPHQSY

Query:  PYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKDP
        PYAPQ PPPPPPDSSYPPPPPPPAPSQP +LYYPPS         QSMQPPPPPSSPPPSSSIPPPPPPNSPPP SAPQQKAEGTNMGAHERDKGVSKDP
Subjt:  PYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKDP

Query:  SYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLS
        SYGRR+RE SNHDKHQRHSGPPMPPKK NGPSGRMET+DEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+ATPFLS
Subjt:  SYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLS

Query:  GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDILT
        GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRDD+L 
Subjt:  GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDILT

Query:  TPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL
        TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL
Subjt:  TPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLL

Query:  PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDD
        PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD HESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDD
Subjt:  PDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDD

Query:  PTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRH
        PTTYLVSFDD EARYV            PLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRH
Subjt:  PTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRH

Query:  QDMDQYSGAEDEMSD
        QDMDQYSGAEDEMSD
Subjt:  QDMDQYSGAEDEMSD

TrEMBL top hitse value%identityAlignment
A0A1S3CHF3 LOW QUALITY PROTEIN: protein PAF1 homolog0.0e+0090.93Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQS
        MASYRPYP QSSFG +P QN IPPPP+Q AS  +QQRGG  +QYNQNWG Y GD SVPPAPSSSYPQNY NQ+HQ+SNYH Q YG PR+QHPPPPPPHQS
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQS

Query:  YPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKD
        YPYAPQPPPPPPPDSSYPPPPPPPAPSQP +LYYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA QQKAEG NMGAHERDKGVSKD
Subjt:  YPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKD

Query:  PSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
        PSYGRR+RE SNHDKHQ+HSGPPMPPKK NGPSGRMET+DEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER+ATPFL
Subjt:  PSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL

Query:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDIL
        SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRDD+L
Subjt:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDIL

Query:  TTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
         TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVEVLPL
Subjt:  TTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL

Query:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD
        LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD
Subjt:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD

Query:  DPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR
        DPTTYLVSFDD+EARYV            PLPTKLVL KKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPG+YSN KRGSDIEDG+GR HKHDR
Subjt:  DPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR

Query:  -HQDMDQYSGAEDEMSD
         HQDMDQYSGAEDEMSD
Subjt:  -HQDMDQYSGAEDEMSD

A0A5A7UA23 Protein PAF1-like protein0.0e+0090.93Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQS
        MASYRPYP QSSFG +P QN IPPPP+Q AS  +QQRGG  +QYNQNWG Y GD S PPAPSSSYPQNY NQ+HQ+SNYH Q YG PR+QHPPPPPPHQS
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQS

Query:  YPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKD
        YPYAPQPPPPPPPDSSYPPPPPPPAPSQP +LYYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA QQKAEG NMGAHERDKGVSKD
Subjt:  YPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKD

Query:  PSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
        PSYGRR+RE SNHDKHQ+HSGPPMPPKK NGPSGRMET+DEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER+ATPFL
Subjt:  PSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL

Query:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDIL
        SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRDD+L
Subjt:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDIL

Query:  TTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
         TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVEVLPL
Subjt:  TTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL

Query:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD
        LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD
Subjt:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD

Query:  DPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR
        DPTTYLVSFDD+EARYV            PLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPG+YSN KRGSDIEDG+GR HKHDR
Subjt:  DPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR

Query:  -HQDMDQYSGAEDEMSD
         HQDMDQYSGAEDEMSD
Subjt:  -HQDMDQYSGAEDEMSD

A0A6J1D3N7 protein PAF1 homolog0.0e+0091.67Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYH-QQHYGPPRSQH-PPPPPPHQ
        MASYRPYP QSSFGPSPG NPIPPPPAQ A VPTQQR GG SQYNQNWGGYGGDGSVPPAPSSSYPQNYNQ    +NYH QQHYGPPR+QH PPPPPPHQ
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYH-QQHYGPPRSQH-PPPPPPHQ

Query:  SYPYAPQPPPPPPPDSSYPPPPPPPAPSQPS-HLYYPPSQYSQGNQNQ---QSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDK
        SYPYAPQPPPPPPPDSSYPPPPPPPAPS P  HLYYPPSQYSQ NQNQ   QSMQPPPPPSSPPP+SSIPPPPPPNSPPP SAPQ +AEG NMGAHERDK
Subjt:  SYPYAPQPPPPPPPDSSYPPPPPPPAPSQPS-HLYYPPSQYSQGNQNQ---QSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDK

Query:  GVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERR
        G+SKDPSYGRRERE SNHDKHQRH GPPMPPKK NGPSGR+ETEDEKRLRKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGERR
Subjt:  GVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERR

Query:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELL
        ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RK+KD+YT+YTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS R  LAPEDEELL
Subjt:  ATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELL

Query:  RDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        RDD+LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRPVHATNKNLYPVE
Subjt:  RDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSH
        DNVDDPTTYLVSFDDAEARYV            PLPTKLVLRKKRAKEGRSTDEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN +RGSDIE GLGRSH
Subjt:  DNVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSH

Query:  KHDRHQDMDQYSGAEDEMSD
        KHDRHQDMDQYSGAEDE+SD
Subjt:  KHDRHQDMDQYSGAEDEMSD

A0A6J1GN64 protein PAF1 homolog0.0e+0092.91Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR  GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPPH
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        QSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++GAHERDKGV
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT
        SKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+AT
Subjt:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD

Query:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        D+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
        NVDDPTTYLVSFDDAEARYV            PLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
Subjt:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
        HDRHQDMDQYSGAED+MSD
Subjt:  HDRHQDMDQYSGAEDEMSD

A0A6J1JP14 protein PAF1 homolog0.0e+0092.49Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR  G SQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPPH
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        QSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKGV
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT
        +KDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+AT
Subjt:  SKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRD

Query:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        D+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
        NVDDPTTYLVSFDDAEARYV            PLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
Subjt:  NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
        HDRHQDMDQYSGAED+MSD
Subjt:  HDRHQDMDQYSGAEDEMSD

SwissProt top hitse value%identityAlignment
F4HQA1 Protein PAF1 homolog1.5e-17560.44Show/hide
Query:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK
        P ++   PPPPP    P  P P PPPPP     SYPPPPPPP    P   Y     Y Q NQ    +Q PPPP  PPPS+     PPP  P PP      
Subjt:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK

Query:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG
            + G ++ +KG SK    GRRER   +  KH   S  P         S ++ETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM    KG
Subjt:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG

Query:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS
        H         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+ +++KD +T+YTITSLEK++KP+++VEPDLGIPLDLLDLSVYNPP 
Subjt:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS

Query:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR
        V+ PLAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNNRERQIK+IEASFEACKSR
Subjt:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR

Query:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS
        PVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKDI+DE E++S
Subjt:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS

Query:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--
        Y+WVREY WDV+  N +DP TYLVSFD+  A Y+            PLP +L LRKKRA+EGRS+DE+EHFP P+RVTVRRR TV+ +E KD GVYS+  
Subjt:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--

Query:  ------LKRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDEMSD
              ++R  D E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  ------LKRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDEMSD

Q4U0S5 RNA polymerase II-associated factor 1 homolog1.3e-2528.41Show/hide
Query:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTP
        G R    R  P  SG             +C++K+ N LPD    PK ++   ++  + +Y  TSLEK +K +L  EPDLG+ +DL++   Y   P++   
Subjt:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTP

Query:  LAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-AC
        L P DE+LL ++I     +     ++ +   K V W+ KT+YI   S E  +  ++ ++     E+K G ++ +         +R+ QI  IE +FE A 
Subjt:  LAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-AC

Query:  KSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQE
        KS   H +   + PVEVLP+ PDF  + +P   V FDS P     +          A     +M   M  G    +  +F+AY +P+ D + K   D +E
Subjt:  KSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQE

Query:  DV--------SYSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEG--RSTDEV---EHFPAPARVTVRRRP
        ++         Y   REY+W+V+   +      Y   F DA+  Y         YN   L T++ L K+RAK G   ST+ V   +H     +    +  
Subjt:  DV--------SYSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEG--RSTDEV---EHFPAPARVTVRRRP

Query:  TVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE
          A LE  +P          D E+ L      D  +DM + SG E E
Subjt:  TVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE

Q4V886 RNA polymerase II-associated factor 1 homolog7.5e-2628.27Show/hide
Query:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL ++I      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGR
             Y   F + +  Y         YN   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      +  ++ ++  G 
Subjt:  DDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGR

Query:  SHKHDRHQDMDQYSGAEDEMS
          +H++    ++  G+EDE S
Subjt:  SHKHDRHQDMDQYSGAEDEMS

Q5RAX0 RNA polymerase II-associated factor 1 homolog6.3e-2528.03Show/hide
Query:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL ++I      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGR
             Y   F + +  Y         YN   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      +  ++ ++  G 
Subjt:  DDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGR

Query:  SHKHDRHQDMDQYSGAEDEMS
          + ++    ++  G+EDE S
Subjt:  SHKHDRHQDMDQYSGAEDEMS

Q8N7H5 RNA polymerase II-associated factor 1 homolog3.7e-2528.6Show/hide
Query:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL ++I      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEG-------------RSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGS
             Y   F + +  Y         YN   L T++ L K+RAK G             R  +E E     AR            E ++         GS
Subjt:  DDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEG-------------RSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGS

Query:  DIEDGLGRSHKHDRHQDMDQYSGAEDEMSD
        D E   G S + +  +  D++SG+E E  +
Subjt:  DIEDGLGRSHKHDRHQDMDQYSGAEDEMSD

Arabidopsis top hitse value%identityAlignment
AT1G79730.1 hydroxyproline-rich glycoprotein family protein1.1e-17660.44Show/hide
Query:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK
        P ++   PPPPP    P  P P PPPPP     SYPPPPPPP    P   Y     Y Q NQ    +Q PPPP  PPPS+     PPP  P PP      
Subjt:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK

Query:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG
            + G ++ +KG SK    GRRER   +  KH   S  P         S ++ETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM    KG
Subjt:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG

Query:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS
        H         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+ +++KD +T+YTITSLEK++KP+++VEPDLGIPLDLLDLSVYNPP 
Subjt:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS

Query:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR
        V+ PLAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNNRERQIK+IEASFEACKSR
Subjt:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR

Query:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS
        PVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKDI+DE E++S
Subjt:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS

Query:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--
        Y+WVREY WDV+  N +DP TYLVSFD+  A Y+            PLP +L LRKKRA+EGRS+DE+EHFP P+RVTVRRR TV+ +E KD GVYS+  
Subjt:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--

Query:  ------LKRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDEMSD
              ++R  D E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  ------LKRGSDIEDGLGRSHKHDRHQDMDQYS-GAEDEMSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTACAGGCCATATCCTACACAATCGTCTTTCGGTCCTTCGCCGGGTCAAAATCCGATTCCGCCCCCACCGGCGCAACCAGCTTCCGTTCCAACGCAGCAGCG
AGGAGGAGGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGCGGTGATGGGTCTGTGCCTCCTGCTCCATCTTCCTCGTATCCCCAAAATTACAATCAAGTTCATC
AAAGTTCTAATTACCACCAGCAACATTATGGTCCGCCGAGAAGCCAACACCCTCCACCTCCTCCTCCTCACCAGTCGTATCCTTATGCACCACAGCCGCCGCCGCCGCCT
CCTCCCGATTCTTCCTATCCTCCACCCCCACCCCCACCAGCGCCTTCGCAACCTTCTCATCTTTACTATCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTC
AATGCAGCCACCACCTCCACCCTCATCTCCACCACCGAGCTCTTCAATCCCGCCGCCTCCACCCCCAAATTCTCCACCACCTCCTTCGGCGCCTCAACAAAAAGCAGAGG
GTACAAACATGGGAGCACACGAGCGCGATAAAGGGGTTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAACTTCAAATCATGATAAACACCAGAGGCACTCTGGT
CCCCCAATGCCTCCGAAGAAAGTAAACGGACCTTCAGGGAGAATGGAGACAGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGACAGGATGAGAG
GCATAGACATCATCTAAAAGAATCTCAAAACACTATTCTGCAAAAGACCCAGATGTTATCTACTGGGAAGGGGCATGGATCAATTGTGGGGTCCCGGATGGGGGAAAGGA
GGGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAACGAGCTTCCAGATACAAGTGCTCAG
CCAAAGCTCATGTCGCAACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCACTGGAGAAAATGTATAAACCTCAGCTTTATGTCGAGCCAGATCTTGGAAT
ACCTCTCGATTTACTTGACCTCAGTGTGTACAACCCTCCTAGTGTTAGAACACCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATATATTGACAACTCCAGTTA
AAAAGGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTCAGCATTGAATCGGCGAAACAGTCTTTG
ACTGAGAAACAAGCGAAAGAACTGCGAGAAATGAAGGGAGGGCGCAATATTCTTGAGAACCTCAACAATAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTTTGAGGC
ATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTGGAGGTTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTCGTGGTGGCAT
TTGATAGTGCTCCCACAGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGTGATGCTCATGAATCACAGGCGATAATGAAAAGCTATATGGCAACAGGCTCA
GATCCAACAAAACCTGAGAAATTTCTTGCATACATGGTTCCTTCTCCAGATGAGCTGTCAAAGGATATCTATGATGAACAAGAAGATGTTTCATATTCCTGGGTTCGAGA
GTATCATTGGGATGTACGAGGTGATAATGTGGATGATCCCACTACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGGTATTCATCTGTCTTGAATCTCATT
ACAATTCATTTCCACTTCCTACAAAGCTTGTTCTGAGAAAAAAGAGGGCTAAAGAAGGTAGATCAACTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACAGTT
AGGAGAAGACCAACTGTAGCTACTTTGGAAGTGAAGGATCCAGGGGTATACTCAAATTTGAAAAGAGGATCAGATATTGAAGACGGTCTTGGAAGATCACATAAACATGA
TAGACACCAAGACATGGATCAATACAGCGGCGCTGAAGACGAGATGTCTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTACAGGCCATATCCTACACAATCGTCTTTCGGTCCTTCGCCGGGTCAAAATCCGATTCCGCCCCCACCGGCGCAACCAGCTTCCGTTCCAACGCAGCAGCG
AGGAGGAGGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGCGGTGATGGGTCTGTGCCTCCTGCTCCATCTTCCTCGTATCCCCAAAATTACAATCAAGTTCATC
AAAGTTCTAATTACCACCAGCAACATTATGGTCCGCCGAGAAGCCAACACCCTCCACCTCCTCCTCCTCACCAGTCGTATCCTTATGCACCACAGCCGCCGCCGCCGCCT
CCTCCCGATTCTTCCTATCCTCCACCCCCACCCCCACCAGCGCCTTCGCAACCTTCTCATCTTTACTATCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTC
AATGCAGCCACCACCTCCACCCTCATCTCCACCACCGAGCTCTTCAATCCCGCCGCCTCCACCCCCAAATTCTCCACCACCTCCTTCGGCGCCTCAACAAAAAGCAGAGG
GTACAAACATGGGAGCACACGAGCGCGATAAAGGGGTTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAACTTCAAATCATGATAAACACCAGAGGCACTCTGGT
CCCCCAATGCCTCCGAAGAAAGTAAACGGACCTTCAGGGAGAATGGAGACAGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGACAGGATGAGAG
GCATAGACATCATCTAAAAGAATCTCAAAACACTATTCTGCAAAAGACCCAGATGTTATCTACTGGGAAGGGGCATGGATCAATTGTGGGGTCCCGGATGGGGGAAAGGA
GGGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAACGAGCTTCCAGATACAAGTGCTCAG
CCAAAGCTCATGTCGCAACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCACTGGAGAAAATGTATAAACCTCAGCTTTATGTCGAGCCAGATCTTGGAAT
ACCTCTCGATTTACTTGACCTCAGTGTGTACAACCCTCCTAGTGTTAGAACACCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATATATTGACAACTCCAGTTA
AAAAGGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTCAGCATTGAATCGGCGAAACAGTCTTTG
ACTGAGAAACAAGCGAAAGAACTGCGAGAAATGAAGGGAGGGCGCAATATTCTTGAGAACCTCAACAATAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTTTGAGGC
ATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTGGAGGTTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTCGTGGTGGCAT
TTGATAGTGCTCCCACAGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGTGATGCTCATGAATCACAGGCGATAATGAAAAGCTATATGGCAACAGGCTCA
GATCCAACAAAACCTGAGAAATTTCTTGCATACATGGTTCCTTCTCCAGATGAGCTGTCAAAGGATATCTATGATGAACAAGAAGATGTTTCATATTCCTGGGTTCGAGA
GTATCATTGGGATGTACGAGGTGATAATGTGGATGATCCCACTACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGGTATTCATCTGTCTTGAATCTCATT
ACAATTCATTTCCACTTCCTACAAAGCTTGTTCTGAGAAAAAAGAGGGCTAAAGAAGGTAGATCAACTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACAGTT
AGGAGAAGACCAACTGTAGCTACTTTGGAAGTGAAGGATCCAGGGGTATACTCAAATTTGAAAAGAGGATCAGATATTGAAGACGGTCTTGGAAGATCACATAAACATGA
TAGACACCAAGACATGGATCAATACAGCGGCGCTGAAGACGAGATGTCTGATTGA
Protein sequenceShow/hide protein sequence
MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHPPPPPPHQSYPYAPQPPPPP
PPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSG
PPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQ
PKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSL
TEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGS
DPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVVFICLESHYNSFPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTV
RRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDEMSD