; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0003047 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0003047
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein PAF1 homolog
Genome locationchr4:47619564..47626685
RNA-Seq ExpressionLag0003047
SyntenyLag0003047
Gene Ontology termsGO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0016570 - histone modification (biological process)
GO:0016593 - Cdc73/Paf1 complex (cellular component)
GO:0035327 - transcriptionally active chromatin (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR007133 - RNA polymerase II associated factor Paf1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7014045.1 Protein PAF1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0093.93Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPP
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QG+QNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++G HERDKG
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLR
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR

Query:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DD+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GA+D+MSD
Subjt:  GAEDEMSD

XP_022953373.1 protein PAF1 homolog [Cucurbita moschata]0.0e+0094.35Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPP
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++GAHERDKG
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLR
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR

Query:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DD+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

XP_022992172.1 protein PAF1 homolog [Cucurbita maxima]0.0e+0093.93Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR   G SQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPP
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKG
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        V+KDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLR
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR

Query:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DD+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

XP_023547399.1 protein PAF1 homolog [Cucurbita pepo subsp. pepo]0.0e+0094.07Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSN+HQQHYGPPRSQ  PPPPPP
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKG
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLR
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR

Query:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DD+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

XP_038898523.1 protein PAF1 homolog [Benincasa hispida]0.0e+0094.18Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHPPPPPPHQS
        MASYRPYP QSSFGP+PGQNP+PPPP Q ASVP QQR  GGGSQYNQNWGGYGGDGS+PPA SSSYPQNYNQ HQSSNYHQQHYGPPRSQHPPPPPP+QS
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHPPPPPPHQS

Query:  YPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKD
        YPYAPQ PPPPPPDSSYPPPPPPPAPSQP +LYYPPS         QSMQPPPPPSSPPPSSSIPPPPPPNSPPP SAPQQKAEGTNMGAHERDKGVSKD
Subjt:  YPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKD

Query:  PSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL
        PSYGRR+RE SNHDKHQRHSGPPMPPKK NGPSGRMET+DEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+ATPFL
Subjt:  PSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFL

Query:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDIL
        SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRDD+L
Subjt:  SGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDIL

Query:  TTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
         TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL
Subjt:  TTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLPL

Query:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD
        LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRD HESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD
Subjt:  LPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVD

Query:  DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAED
        DPTTYLVSFDD EARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQYSGAED
Subjt:  DPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAED

Query:  EMSD
        EMSD
Subjt:  EMSD

TrEMBL top hitse value%identityAlignment
A0A1S3CHF3 LOW QUALITY PROTEIN: protein PAF1 homolog0.0e+0092.35Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQ
        MASYRPYP QSSFG +P QN IPPPP+Q AS  +QQR GG  +QYNQNWG Y GD SVPPAPSSSYPQNY NQ+HQ+SNYH Q YG PR+QHPPPPPPHQ
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQ

Query:  SYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSK
        SYPYAPQPPPPPPPDSSYPPPPPPPAPSQP +LYYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA QQKAEG NMGAHERDKGVSK
Subjt:  SYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSK

Query:  DPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF
        DPSYGRR+RE SNHDKHQ+HSGPPMPPKK NGPSGRMET+DEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER+ATPF
Subjt:  DPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF

Query:  LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDI
        LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRDD+
Subjt:  LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDI

Query:  LTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLP
        L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVEVLP
Subjt:  LTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
        LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV

Query:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR-HQDMDQYSGA
        DDPTTYLVSFDD+EARYVPLPTKLVL KKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPG+YSN KRGSDIEDG+GR HKHDR HQDMDQYSGA
Subjt:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR-HQDMDQYSGA

Query:  EDEMSD
        EDEMSD
Subjt:  EDEMSD

A0A5A7UA23 Protein PAF1-like protein0.0e+0092.35Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQ
        MASYRPYP QSSFG +P QN IPPPP+Q AS  +QQR GG  +QYNQNWG Y GD S PPAPSSSYPQNY NQ+HQ+SNYH Q YG PR+QHPPPPPPHQ
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHPPPPPPHQ

Query:  SYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSK
        SYPYAPQPPPPPPPDSSYPPPPPPPAPSQP +LYYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA QQKAEG NMGAHERDKGVSK
Subjt:  SYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSK

Query:  DPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF
        DPSYGRR+RE SNHDKHQ+HSGPPMPPKK NGPSGRMET+DEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER+ATPF
Subjt:  DPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF

Query:  LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDI
        LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLRDD+
Subjt:  LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLRDDI

Query:  LTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLP
        L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVEVLP
Subjt:  LTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
        LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV

Query:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR-HQDMDQYSGA
        DDPTTYLVSFDD+EARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPG+YSN KRGSDIEDG+GR HKHDR HQDMDQYSGA
Subjt:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDR-HQDMDQYSGA

Query:  EDEMSD
        EDEMSD
Subjt:  EDEMSD

A0A6J1D3N7 protein PAF1 homolog0.0e+0093.09Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYH-QQHYGPPRSQH-PPPPPPH
        MASYRPYP QSSFGPSPG NPIPPPPAQ A VPTQQR  GG SQYNQNWGGYGGDGSVPPAPSSSYPQNYNQ    +NYH QQHYGPPR+QH PPPPPPH
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYH-QQHYGPPRSQH-PPPPPPH

Query:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPS-HLYYPPSQYSQGNQNQ---QSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERD
        QSYPYAPQPPPPPPPDSSYPPPPPPPAPS P  HLYYPPSQYSQ NQNQ   QSMQPPPPPSSPPP+SSIPPPPPPNSPPP SAPQ +AEG NMGAHERD
Subjt:  QSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPS-HLYYPPSQYSQGNQNQ---QSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERD

Query:  KGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER
        KG+SKDPSYGRRERE SNHDKHQRH GPPMPPKK NGPSGR+ETEDEKRLRKKREFEKQRQDERHRHH+KESQNTILQKTQMLSTGKGHGSIVGSRMGER
Subjt:  KGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER

Query:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEEL
        RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RK+KD+YT+YTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS R  LAPEDEEL
Subjt:  RATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEEL

Query:  LRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPV
        LRDD+LTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLN+RERQIKEIEASFEACKSRPVHATNKNLYPV
Subjt:  LRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPV

Query:  EVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVR
        EVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKD+YDEQEDVS+SWVREYHWDVR
Subjt:  EVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVR

Query:  GDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQY
        GDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAP+RVTVRRRPTVATLEVKDPGVYSN +RGSDIE GLGRSHKHDRHQDMDQY
Subjt:  GDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQY

Query:  SGAEDEMSD
        SGAEDE+SD
Subjt:  SGAEDEMSD

A0A6J1GN64 protein PAF1 homolog0.0e+0094.35Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPP
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++GAHERDKG
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLR
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR

Query:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DD+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

A0A6J1JP14 protein PAF1 homolog0.0e+0093.93Show/hide
Query:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP
        MASYRPYP QSSFGPSPGQNPIPPPPA P ASVPTQQR   G SQYNQNWGGYGGDGSV PPA SSSYPQNYNQVHQSSNYHQQHYGPPRSQ  PPPPPP
Subjt:  MASYRPYPTQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-PPPPPP

Query:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        HQSYPYAPQPPPPPPPDSSYPPPPPPPA SQPS  Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKG
Subjt:  HQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        V+KDPSYGRRERE SNHDKHQRHSGPPMPPKK NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR
        TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMS RKEKDHYTRYTITSLEK YKPQLYVEPDLGIPLDLLDLSVYNPPSVR PLAPEDEELLR
Subjt:  TPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPSVRTPLAPEDEELLR

Query:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DD+L TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDILTTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

SwissProt top hitse value%identityAlignment
F4HQA1 Protein PAF1 homolog5.2e-17861.59Show/hide
Query:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK
        P ++   PPPPP    P  P P PPPPP     SYPPPPPPP    P   Y     Y Q NQ    +Q PPPP  PPPS+     PPP  P PP      
Subjt:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK

Query:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG
            + G ++ +KG SK    GRRER   +  KH   S  P         S ++ETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM    KG
Subjt:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG

Query:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS
        H         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+ +++KD +T+YTITSLEK++KP+++VEPDLGIPLDLLDLSVYNPP 
Subjt:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS

Query:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR
        V+ PLAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNNRERQIK+IEASFEACKSR
Subjt:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR

Query:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS
        PVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKDI+DE E++S
Subjt:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS

Query:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--------LKRGSD
        Y+WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRS+DE+EHFP P+RVTVRRR TV+ +E KD GVYS+        ++R  D
Subjt:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--------LKRGSD

Query:  IEDGLGRSHKHDRHQDMDQYS-GAEDEMSD
         E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  IEDGLGRSHKHDRHQDMDQYS-GAEDEMSD

Q4U0S5 RNA polymerase II-associated factor 1 homolog5.4e-2628.67Show/hide
Query:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTP
        G R    R  P  SG             +C++K+ N LPD    PK ++   ++  + +Y  TSLEK +K +L  EPDLG+ +DL++   Y   P++   
Subjt:  GSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTP

Query:  LAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-AC
        L P DE+LL ++I     +     ++ +   K V W+ KT+YI   S E  +  ++ ++     E+K G ++ +         +R+ QI  IE +FE A 
Subjt:  LAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-AC

Query:  KSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQE
        KS   H +   + PVEVLP+ PDF  + +P   V FDS P     +          A     +M   M  G    +  +F+AY +P+ D + K   D +E
Subjt:  KSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQE

Query:  DV--------SYSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG--RSTDEV---EHFPAPARVTVRRRPTVATLEVKDPG
        ++         Y   REY+W+V+   +      Y   F DA+  Y   L T++ L K+RAK G   ST+ V   +H     +    +    A LE  +P 
Subjt:  DV--------SYSWVREYHWDVRGD-NVDDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG--RSTDEV---EHFPAPARVTVRRRPTVATLEVKDPG

Query:  VYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE
                 D E+ L      D  +DM + SG E E
Subjt:  VYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE

Q4V886 RNA polymerase II-associated factor 1 homolog3.2e-2628.54Show/hide
Query:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL ++I      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMD
             Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      +  ++ ++  G   +H++    +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMD

Query:  QYSGAEDEMS
        +  G+EDE S
Subjt:  QYSGAEDEMS

Q5RAX0 RNA polymerase II-associated factor 1 homolog2.7e-2528.29Show/hide
Query:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL ++I      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMD
             Y   F + +  Y   L T++ L K+RAK G  +       V+H     +    +    A LE  +P      +  ++ ++  G   + ++    +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSTDE-----VEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMD

Query:  QYSGAEDEMS
        +  G+EDE S
Subjt:  QYSGAEDEMS

Q8N7H5 RNA polymerase II-associated factor 1 homolog1.6e-2528.88Show/hide
Query:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL ++I      K     + +
Subjt:  LCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVEV+P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEVLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P          D S   A E   +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG-------------RSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK
             Y   F + +  Y   L T++ L K+RAK G             R  +E E     AR            E ++         GSD E   G S +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEG-------------RSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
         +  +  D++SG+E E  +
Subjt:  HDRHQDMDQYSGAEDEMSD

Arabidopsis top hitse value%identityAlignment
AT1G79730.1 hydroxyproline-rich glycoprotein family protein3.7e-17961.59Show/hide
Query:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK
        P ++   PPPPP    P  P P PPPPP     SYPPPPPPP    P   Y     Y Q NQ    +Q PPPP  PPPS+     PPP  P PP      
Subjt:  PPRSQHPPPPPPHQSYPYAPQPPPPPPPDS---SYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK

Query:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG
            + G ++ +KG SK    GRRER   +  KH   S  P         S ++ETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM    KG
Subjt:  AEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKG

Query:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS
        H         E++ TP L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+ +++KD +T+YTITSLEK++KP+++VEPDLGIPLDLLDLSVYNPP 
Subjt:  HGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGIPLDLLDLSVYNPPS

Query:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR
        V+ PLAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNNRERQIK+IEASFEACKSR
Subjt:  VRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSR

Query:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS
        PVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKDI+DE E++S
Subjt:  PVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVS

Query:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--------LKRGSD
        Y+WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRS+DE+EHFP P+RVTVRRR TV+ +E KD GVYS+        ++R  D
Subjt:  YSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN--------LKRGSD

Query:  IEDGLGRSHKHDRHQDMDQYS-GAEDEMSD
         E GLGRS KH+  QD +QYS G ED+ S+
Subjt:  IEDGLGRSHKHDRHQDMDQYS-GAEDEMSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGGGGAGACGAAGAAGGAAACGAAGAGCGAGAATGAGAGGAACGAGAAAAGCAGCAGAAGGAGAGAGGTAGATGAGGAAGCTAGTTCCGGTGGCCGCGAGGATGA
GAAGCTGAAGGGGAGAGAGAAGAATGGAGCGGCGGCGGCGTCGTTCGGCGAGGAGGAGGCTCCGGCGAAGGGTGGGAAGAAGAAGATAGATGGTGAGGAGGGAGAGAAGG
GTCACGGGGGACTTGATTGTTCGATTTTGAAACCTAGGGACCTATTGGAAACAATCCCGAAATATGAGGGACCAAAAAGATTGCCGGGTCTCTGTCTCGGGCCCATTCAG
GCGGTCAATTGCCCTTCATCTTCTCGGGACTCGGTTTTTCTTTTCGGAAAAGAAGTATTTGCATTGGCAGAACCGCCTCCTGTGTTGCTGCTTTTTGTCTTTGTTCCGTT
TCTCTCTGTTTCTCCAACTTCTAGATGCGTAGTAGATAGGGTTTTGGGGATTTCTTGTTCGGGAGGAGGGGTTCTGTTTGATCGATCTTTTGGGGGAGATATAGCCATGG
CTTCTTACAGGCCATATCCTACACAATCGTCTTTCGGTCCTTCGCCGGGTCAAAATCCGATTCCGCCCCCACCGGCGCAACCAGCTTCCGTTCCAACGCAGCAGCGAGGA
GGAGGAGGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGCGGTGATGGGTCTGTGCCTCCTGCTCCATCTTCCTCGTATCCCCAAAATTACAATCAAGTTCATCA
AAGTTCTAATTACCACCAGCAACATTATGGTCCGCCGAGAAGCCAACACCCTCCACCTCCTCCTCCTCACCAGTCGTATCCTTATGCACCACAGCCGCCGCCGCCGCCTC
CTCCCGATTCTTCCTATCCTCCACCTCCACCCCCACCAGCGCCTTCGCAACCTTCTCATCTTTACTATCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTCA
ATGCAGCCACCACCTCCACCCTCATCTCCACCACCGAGCTCTTCAATCCCGCCGCCTCCACCCCCAAATTCTCCACCACCTCCTTCGGCGCCTCAACAAAAAGCAGAGGG
TACAAACATGGGAGCACACGAGCGCGATAAAGGGGTTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAACTTCAAATCATGATAAACACCAGAGGCACTCTGGTC
CCCCAATGCCTCCGAAGAAAGTAAACGGACCTTCAGGAAGAATGGAGACAGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCAGGATGAGAGG
CATAGACATCATCTAAAAGAATCTCAAAACACTATTCTGCAAAAGACCCAGATGTTATCTACTGGGAAGGGGCATGGATCAATTGTGGGGTCCCGGATGGGGGAAAGGAG
GGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAACGAGCTTCCAGATACAAGTGCTCAGC
CAAAGCTCATGTCGCAACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCGCTGGAGAAAATGTATAAACCTCAGCTTTATGTCGAGCCAGATCTTGGAATA
CCTCTCGATTTACTTGACCTCAGTGTGTACAACCCTCCTAGTGTTAGAACACCCCTTGCTCCTGAAGATGAGGAATTATTACGTGACGATATATTGACAACTCCAGTTAA
AAAGGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTCAGCATTGAATCCGCAAAACAGTCTTTGA
CTGAAAAACAAGCGAAAGAACTGCGAGAAATGAAGGGAGGGCGCAATATTCTTGAGAACCTCAACAATAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTTTGAGGCA
TGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTGGAGGTTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTCGTGGTGGCATT
TGATAGTGCTCCCACAGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGTGATGCTCATGAATCACAGGCGATAATGAAAAGCTATATGGCAACAGGCTCAG
ATCCAACAAAACCTGAAAAATTTCTTGCATACATGGTTCCTTCTCCAGATGAGCTGTCAAAGGATATCTATGATGAACAAGAAGATGTTTCATATTCCTGGGTTCGAGAG
TATCATTGGGATGTACGAGGTGATAATGTGGATGATCCCACTACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGTTCTGAG
AAAAAAGAGGGCTAAAGAAGGTAGATCAACTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACAGTTAGGAGAAGACCAACTGTAGCTACTTTGGAAGTGAAGG
ATCCAGGGGTATACTCAAATTTGAAAAGAGGATCAGATATTGAAGACGGTCTTGGAAGATCACATAAACATGATAGACACCAAGACATGGATCAATACAGCGGCGCTGAA
GACGAGATGTCTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGGGGAGACGAAGAAGGAAACGAAGAGCGAGAATGAGAGGAACGAGAAAAGCAGCAGAAGGAGAGAGGTAGATGAGGAAGCTAGTTCCGGTGGCCGCGAGGATGA
GAAGCTGAAGGGGAGAGAGAAGAATGGAGCGGCGGCGGCGTCGTTCGGCGAGGAGGAGGCTCCGGCGAAGGGTGGGAAGAAGAAGATAGATGGTGAGGAGGGAGAGAAGG
GTCACGGGGGACTTGATTGTTCGATTTTGAAACCTAGGGACCTATTGGAAACAATCCCGAAATATGAGGGACCAAAAAGATTGCCGGGTCTCTGTCTCGGGCCCATTCAG
GCGGTCAATTGCCCTTCATCTTCTCGGGACTCGGTTTTTCTTTTCGGAAAAGAAGTATTTGCATTGGCAGAACCGCCTCCTGTGTTGCTGCTTTTTGTCTTTGTTCCGTT
TCTCTCTGTTTCTCCAACTTCTAGATGCGTAGTAGATAGGGTTTTGGGGATTTCTTGTTCGGGAGGAGGGGTTCTGTTTGATCGATCTTTTGGGGGAGATATAGCCATGG
CTTCTTACAGGCCATATCCTACACAATCGTCTTTCGGTCCTTCGCCGGGTCAAAATCCGATTCCGCCCCCACCGGCGCAACCAGCTTCCGTTCCAACGCAGCAGCGAGGA
GGAGGAGGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGCGGTGATGGGTCTGTGCCTCCTGCTCCATCTTCCTCGTATCCCCAAAATTACAATCAAGTTCATCA
AAGTTCTAATTACCACCAGCAACATTATGGTCCGCCGAGAAGCCAACACCCTCCACCTCCTCCTCCTCACCAGTCGTATCCTTATGCACCACAGCCGCCGCCGCCGCCTC
CTCCCGATTCTTCCTATCCTCCACCTCCACCCCCACCAGCGCCTTCGCAACCTTCTCATCTTTACTATCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTCA
ATGCAGCCACCACCTCCACCCTCATCTCCACCACCGAGCTCTTCAATCCCGCCGCCTCCACCCCCAAATTCTCCACCACCTCCTTCGGCGCCTCAACAAAAAGCAGAGGG
TACAAACATGGGAGCACACGAGCGCGATAAAGGGGTTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAACTTCAAATCATGATAAACACCAGAGGCACTCTGGTC
CCCCAATGCCTCCGAAGAAAGTAAACGGACCTTCAGGAAGAATGGAGACAGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCAGGATGAGAGG
CATAGACATCATCTAAAAGAATCTCAAAACACTATTCTGCAAAAGACCCAGATGTTATCTACTGGGAAGGGGCATGGATCAATTGTGGGGTCCCGGATGGGGGAAAGGAG
GGCCACTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAACGAGCTTCCAGATACAAGTGCTCAGC
CAAAGCTCATGTCGCAACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCGCTGGAGAAAATGTATAAACCTCAGCTTTATGTCGAGCCAGATCTTGGAATA
CCTCTCGATTTACTTGACCTCAGTGTGTACAACCCTCCTAGTGTTAGAACACCCCTTGCTCCTGAAGATGAGGAATTATTACGTGACGATATATTGACAACTCCAGTTAA
AAAGGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTCAGCATTGAATCCGCAAAACAGTCTTTGA
CTGAAAAACAAGCGAAAGAACTGCGAGAAATGAAGGGAGGGCGCAATATTCTTGAGAACCTCAACAATAGGGAAAGGCAAATTAAGGAAATTGAGGCGTCGTTTGAGGCA
TGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTGGAGGTTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTCGTGGTGGCATT
TGATAGTGCTCCCACAGCTGATTCAGAGACTTTCAACAAGTTAGACCAATCCATCCGTGATGCTCATGAATCACAGGCGATAATGAAAAGCTATATGGCAACAGGCTCAG
ATCCAACAAAACCTGAAAAATTTCTTGCATACATGGTTCCTTCTCCAGATGAGCTGTCAAAGGATATCTATGATGAACAAGAAGATGTTTCATATTCCTGGGTTCGAGAG
TATCATTGGGATGTACGAGGTGATAATGTGGATGATCCCACTACATATCTCGTTTCGTTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGTTCTGAG
AAAAAAGAGGGCTAAAGAAGGTAGATCAACTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACAGTTAGGAGAAGACCAACTGTAGCTACTTTGGAAGTGAAGG
ATCCAGGGGTATACTCAAATTTGAAAAGAGGATCAGATATTGAAGACGGTCTTGGAAGATCACATAAACATGATAGACACCAAGACATGGATCAATACAGCGGCGCTGAA
GACGAGATGTCTGATTGA
Protein sequenceShow/hide protein sequence
MVGETKKETKSENERNEKSSRRREVDEEASSGGREDEKLKGREKNGAAAASFGEEEAPAKGGKKKIDGEEGEKGHGGLDCSILKPRDLLETIPKYEGPKRLPGLCLGPIQ
AVNCPSSSRDSVFLFGKEVFALAEPPPVLLLFVFVPFLSVSPTSRCVVDRVLGISCSGGGVLFDRSFGGDIAMASYRPYPTQSSFGPSPGQNPIPPPPAQPASVPTQQRG
GGGGSQYNQNWGGYGGDGSVPPAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHPPPPPPHQSYPYAPQPPPPPPPDSSYPPPPPPPAPSQPSHLYYPPSQYSQGNQNQQS
MQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKDPSYGRRERETSNHDKHQRHSGPPMPPKKVNGPSGRMETEDEKRLRKKREFEKQRQDER
HRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSQRKEKDHYTRYTITSLEKMYKPQLYVEPDLGI
PLDLLDLSVYNPPSVRTPLAPEDEELLRDDILTTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEA
CKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVRE
YHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSTDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNLKRGSDIEDGLGRSHKHDRHQDMDQYSGAE
DEMSD