CuGenDBv2

Tan0002537 (gene) of Snake gourd v1 genome

Gene ID: Tan0002537
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein PAF1 homolog
Genome location: LG06:1981484..1986799
RNA-Seq Expression: Tan0002537
Synteny: Tan0002537
Gene Ontology terms:
GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0016570 - histone modification (biological process)
GO:0016593 - Cdc73/Paf1 complex (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domains: IPR007133 - RNA polymerase II associated factor Paf1


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7014045.1 Protein PAF1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value: 0.0e+00; identity: 93.22%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP
        MASYRPYPPQSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV P A SSSYPQNYNQVHQSSNYHQQHYGPPRSQ   PPPPP
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP

Query:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG
        HQSYPYAPQ PPPPPPDSSYPPPPPPP  SQP   Y+PPSQY QG+QNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSA  QK EG+++G HERDKG
Subjt:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
        TPFL+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLR
Subjt:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
         LPLLPDFDRYDDPFVVVAFD+APTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRR TVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GA+D+MSD
Subjt:  GAEDEMSD

XP_022953373.1 protein PAF1 homolog [Cucurbita moschata]; e-value: 0.0e+00; identity: 93.64%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP
        MASYRPYPPQSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV P A SSSYPQNYNQVHQSSNYHQQHYGPPRSQ   PPPPP
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP

Query:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG
        HQSYPYAPQ PPPPPPDSSYPPPPPPP  SQP   Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSA  QK EG+++GAHERDKG
Subjt:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
        TPFL+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLR
Subjt:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
         LPLLPDFDRYDDPFVVVAFD+APTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRR TVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

XP_022992172.1 protein PAF1 homolog [Cucurbita maxima]; e-value: 0.0e+00; identity: 93.36%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP
        MASYRPYPPQSSFGPSPGQNPIPPPPA P ASVPTQQR   G SQYNQNWGGYGGDGSV P A SSSYPQNYNQVHQSSNYHQQHYGPPRSQ   PPPPP
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP

Query:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG
        HQSYPYAPQ PPPPPPDSSYPPPPPPP  SQP   Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSA   K EG+++GAHERDKG
Subjt:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        V+KDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
        TPFL+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLR
Subjt:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
         LPLLPDFDRYDDPFVVVAFD+APTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRR TVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

XP_023547399.1 protein PAF1 homolog [Cucurbita pepo subsp. pepo]; e-value: 0.0e+00; identity: 93.36%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP
        MASYRPYPPQSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV P A SSSYPQNYNQVHQSSN+HQQHYGPPRSQ   PPPPP
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP

Query:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG
        HQSYPYAPQ PPPPPPDSSYPPPPPPP  SQP   Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSA   K EG+++GAHERDKG
Subjt:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
        TPFL+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLR
Subjt:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
         LPLLPDFDRYDDPFVVVAFD+APTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRR TVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

XP_038898523.1 protein PAF1 homolog [Benincasa hispida]; e-value: 0.0e+00; identity: 94.31%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHAPPPPPHQS
        MASYRPYPPQSSFGP+PGQNP+PPPP Q ASVP QQR  GGGSQYNQNWGGYGGDGS+P A SSSYPQNYNQ HQSSNYHQQHYGPPRSQH PPPPP+QS
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHAPPPPPHQS

Query:  YPYAPQPPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGVSKDP
        YPYAPQPPPPPPDSSYPPPPPPP PSQPP+LYYPPS         QSMQPPPPPSSPPPSSSIPPPPPPNSPPP SA  QKAEGTNMGAHERDKGVSKDP
Subjt:  YPYAPQPPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGVSKDP

Query:  SYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLT
        SYGRR+RENSNHDKHQRHSGPPMPPKKANGPSGRMET+DEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+ATPFL+
Subjt:  SYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLT

Query:  GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDVLK
        GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRDDVLK
Subjt:  GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDVLK

Query:  TPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEGLPLL
        TPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE LPLL
Subjt:  TPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEGLPLL

Query:  PDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDD
        PDFDRYDDPFVVVAFDSAPTADSETFNKL+QSIRD HESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDD
Subjt:  PDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDD

Query:  PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE
        PTTYLVSFDD EARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRR TVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE
Subjt:  PTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDE

Query:  MSD
        MSD
Subjt:  MSD

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KCT6 Uncharacterized protein; e-value: 0.0e+00; identity: 92.79%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQH--APPPPP
        MASYRPYPPQSSFG +P QN IPPP AQ ASV +QQR GG  +QYNQNWG Y GD S P APSSSYPQNY NQ+HQ+SNYH Q YGPPR+QH   PPPPP
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQH--APPPPP

Query:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGV
        HQSYPYAPQ PPPPPPDSSYPPPPPPP  SQPP+LYYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAS QKAEGTNMGAHERDKGV
Subjt:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT
         KDPSYGRR+RENSNHDKHQ+HSGPPMPPKKANGPSGRMET+DEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER+AT
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRAT

Query:  PFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFL+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVRMPLAPEDEELLRD
Subjt:  PFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEG
        DVLKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVE 
Subjt:  DVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEG

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKL+QSIRDAHESQAIMKSYMAT SDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSG
        NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRR TVATLEVKDPG+YSNSKRGSDIEDG+GRSHKHDRHQDMDQ+SG
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSG

Query:  AEDEMSD
        AEDEMSD
Subjt:  AEDEMSD

A0A1S3CHF3 LOW QUALITY PROTEIN: protein PAF1 homolog; e-value: 0.0e+00; identity: 92.78%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHAPPPPPHQ
        MASYRPYPPQSSFG +P QN IPPPP+Q AS  +QQR GG  +QYNQNWG Y GD SVP APSSSYPQNY NQ+HQ+SNYH Q YG PR+QH PPPPPHQ
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHAPPPPPHQ

Query:  SYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGVSK
        SYPYAPQ PPPPPPDSSYPPPPPPP PSQPP+LYYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAS QKAEG NMGAHERDKGVSK
Subjt:  SYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGVSK

Query:  DPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF
        DPSYGRR+RENSNHDKHQ+HSGPPMPPKKANGPSGRMET+DEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER+ATPF
Subjt:  DPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF

Query:  LTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDV
        L+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRDDV
Subjt:  LTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDV

Query:  LKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEGLP
        LKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVE LP
Subjt:  LKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEGLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
        LLPDFDRYDDPFVVVAFDSAPTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV

Query:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDR-HQDMDQYSGA
        DDPTTYLVSFDD+EARYVPLPTKLVL KKRAKEGRSSDEVEHFPAPARVTVRRR TVATLEVKDPG+YSNSKRGSDIEDG+GR HKHDR HQDMDQYSGA
Subjt:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDR-HQDMDQYSGA

Query:  EDEMSD
        EDEMSD
Subjt:  EDEMSD

A0A5A7UA23 Protein PAF1-like protein; e-value: 0.0e+00; identity: 92.78%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHAPPPPPHQ
        MASYRPYPPQSSFG +P QN IPPPP+Q AS  +QQR GG  +QYNQNWG Y GD S P APSSSYPQNY NQ+HQ+SNYH Q YG PR+QH PPPPPHQ
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNY-NQVHQSSNYHQQHYGPPRSQHAPPPPPHQ

Query:  SYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGVSK
        SYPYAPQ PPPPPPDSSYPPPPPPP PSQPP+LYYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAS QKAEG NMGAHERDKGVSK
Subjt:  SYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGVSK

Query:  DPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF
        DPSYGRR+RENSNHDKHQ+HSGPPMPPKKANGPSGRMET+DEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGK HGSIVGSRMGER+ATPF
Subjt:  DPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPF

Query:  LTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDV
        L+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRDDV
Subjt:  LTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDV

Query:  LKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEGLP
        LKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVE LP
Subjt:  LKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEGLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
        LLPDFDRYDDPFVVVAFDSAPTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNV

Query:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDR-HQDMDQYSGA
        DDPTTYLVSFDD+EARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRR TVATLEVKDPG+YSNSKRGSDIEDG+GR HKHDR HQDMDQYSGA
Subjt:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDR-HQDMDQYSGA

Query:  EDEMSD
        EDEMSD
Subjt:  EDEMSD

A0A6J1GN64 protein PAF1 homolog; e-value: 0.0e+00; identity: 93.64%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP
        MASYRPYPPQSSFGPSPGQNPIPPPPA P ASVPTQQR   GGSQYNQNWGGYGGDGSV P A SSSYPQNYNQVHQSSNYHQQHYGPPRSQ   PPPPP
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP

Query:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG
        HQSYPYAPQ PPPPPPDSSYPPPPPPP  SQP   Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSA  QK EG+++GAHERDKG
Subjt:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        VSKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
        TPFL+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLR
Subjt:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
         LPLLPDFDRYDDPFVVVAFD+APTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRR TVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

A0A6J1JP14 protein PAF1 homolog; e-value: 0.0e+00; identity: 93.36%
Query:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP
        MASYRPYPPQSSFGPSPGQNPIPPPPA P ASVPTQQR   G SQYNQNWGGYGGDGSV P A SSSYPQNYNQVHQSSNYHQQHYGPPRSQ   PPPPP
Subjt:  MASYRPYPPQSSFGPSPGQNPIPPPPAQP-ASVPTQQRGGGGGSQYNQNWGGYGGDGSV-PTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQH-APPPPP

Query:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG
        HQSYPYAPQ PPPPPPDSSYPPPPPPP  SQP   Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSA   K EG+++GAHERDKG
Subjt:  HQSYPYAPQ-PPPPPPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA
        V+KDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ET+DEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGER+A
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRA

Query:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
        TPFL+GERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLR
Subjt:  TPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKD GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVE
Subjt:  DDVLKTPVKKD-GIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
         LPLLPDFDRYDDPFVVVAFD+APTADSETFNKL+QSIRDAHESQAIMKSYMATGSDP+KPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG
Subjt:  GLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRR TVATLEVKDPGVYSN KRGSDIEDGLGRSHKHDRHQDMDQYS
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYS

Query:  GAEDEMSD
        GAED+MSD
Subjt:  GAEDEMSD

SwissProt top hits (e-value, % identity, alignment)
F4HQA1 Protein PAF1 homolog; e-value: 4.4e-180; identity: 62.14%
Query:  PPRSQHAPPPPPHQSYPYAPQPPPPPPDSSYPPPPPPPGPSQPPHLYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEG
        P ++  APPPPP    P  P PPP     SYPPPPPP     PPH YY     Y Q NQ    +Q PPPP  PPPS+     PPP  P PP   HQ    
Subjt:  PPRSQHAPPPPPHQSYPYAPQPPPPPPDSSYPPPPPPPGPSQPPHLYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEG

Query:  TNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGS
           G ++ +KG SK    GRRER   +  KH   S  P         S ++ETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM    KGH  
Subjt:  TNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGS

Query:  IVGSRMGERRATPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRM
               E++ TP LT +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+++++KD +T+YTITSLEK +KP+++VEPDLGIPLDLLDLSVYNPP V+ 
Subjt:  IVGSRMGERRATPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRM

Query:  PLAPEDEELLRDDVLKTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVH
        PLAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ES +QSLTEKQAKELREMKGG NIL NLNNRERQIK+IEASFEACKSRPVH
Subjt:  PLAPEDEELLRDDVLKTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVH

Query:  ATNKNLYPVEGLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
        ATNKNL PVE LPLLP FDRYD+ FVV  FD AP ADSE F KL+ SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKDI+DE E++SY+W
Subjt:  ATNKNLYPVEGLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW

Query:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVY-------SNSKRGSDIEDG
        VREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRSSDE+EHFP P+RVTVRRRSTV+ +E KD GVY       S+  R  + E G
Subjt:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVY-------SNSKRGSDIEDG

Query:  LGRSHKHDRHQDMDQYS-GAEDEMSD
        LGRS KH+  QD +QYS G ED+ S+
Subjt:  LGRSHKHDRHQDMDQYS-GAEDEMSD

Q4U0S5 RNA polymerase II-associated factor 1 homolog; e-value: 1.3e-25; identity: 28.19%
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   ++  + +Y  TSLEK +K +L  EPDLG+ +DL++   Y   P++   L P DE+LL ++     ++     ++ +
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD
           K V W+ KT+YI   S E  +  ++ ++     E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVE LP+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDV--------SYSWVREYHWDVRGD-NV
        +P   V FDS P     +          A     +M   M  G    +  +F+AY +P+ D + K   D +E++         Y   REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDV--------SYSWVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD
             Y   F DA+  Y   L T++ L K+RAK G  S        +H     +    + +  A LE  +P          D E+ L      D  +DM 
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD

Query:  QYSGAEDE
        + SG E E
Subjt:  QYSGAEDE

Q4V886 RNA polymerase II-associated factor 1 homolog; e-value: 3.3e-26; identity: 27.8%
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL +++ + P       ++ +
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVE +P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P A  +T             +  +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD
             Y   F + +  Y   L T++ L K+RAK G  S       V+H     +    + +  A LE  +P      +  ++ ++  G   +H++    +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD

Query:  QYSGAEDEMS
        +  G+EDE S
Subjt:  QYSGAEDEMS

Q5RAX0 RNA polymerase II-associated factor 1 homolog; e-value: 2.8e-25; identity: 27.56%
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL +++ + P       ++ +
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVE +P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P A  +T             +  +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD
             Y   F + +  Y   L T++ L K+RAK G  S       V+H     +    + +  A LE  +P      +  ++ ++  G   + ++    +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKRGSDIEDGLGRSHKHDRHQDMD

Query:  QYSGAEDEMS
        +  G+EDE S
Subjt:  QYSGAEDEMS

Q8N7H5 RNA polymerase II-associated factor 1 homolog; e-value: 2.1e-25; identity: 28.16%
Query:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER
        +C++K+ N LPD    PK ++   +++ + +Y  TSLEK +K  L  EPDLG+ +DL++   Y   P+V   L P DE+LL +++ + P       ++ +
Subjt:  LCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYN-PPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKER

Query:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD
           K V W+ KT+YI   S E  +  +    + E  E+K G ++ +         +R+ QI  IE +FE A KS   H +   + PVE +P+ PDF  + 
Subjt:  PTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENL------NNRERQIKEIEASFE-ACKSRPVHATNKNLYPVEGLPLLPDFDRYD

Query:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV
        +P   V FDS P A  +T             +  +M   M  G    +  +F+AY +P  + L K   D++E++ Y+          REY+W+V+   + 
Subjt:  DPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYS--------WVREYHWDVRGD-NV

Query:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKR--------GSDIEDGLGRSHK
             Y   F + +  Y   L T++ L K+RAK G  S       V+H     +    + +  A LE  +P      +         GSD E   G S +
Subjt:  DDPTTYLVSFDDAEARYV-PLPTKLVLRKKRAKEGRSSDE-----VEHFPAPARVTVRRRSTVATLEVKDPGVYSNSKR--------GSDIEDGLGRSHK

Query:  HDRHQDMDQYSGAEDEMSD
         +  +  D++SG+E E  +
Subjt:  HDRHQDMDQYSGAEDEMSD

Arabidopsis top hits (e value, % identity, alignment)
AT1G79730.1 hydroxyproline-rich glycoprotein family protein (e value 3.1e-181, identity 62.14%)
Query:  PPRSQHAPPPPPHQSYPYAPQPPPPPPDSSYPPPPPPPGPSQPPHLYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEG
        P ++  APPPPP    P  P PPP     SYPPPPPP     PPH YY     Y Q NQ    +Q PPPP  PPPS+     PPP  P PP   HQ    
Subjt:  PPRSQHAPPPPPHQSYPYAPQPPPPPPDSSYPPPPPPPGPSQPPHLYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEG

Query:  TNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGS
           G ++ +KG SK    GRRER   +  KH   S  P         S ++ETE+E+RLRKKRE EKQRQDE+HR  +K S      K+QM    KGH  
Subjt:  TNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGS

Query:  IVGSRMGERRATPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRM
               E++ TP LT +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+++++KD +T+YTITSLEK +KP+++VEPDLGIPLDLLDLSVYNPP V+ 
Subjt:  IVGSRMGERRATPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRM

Query:  PLAPEDEELLRDDVLKTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVH
        PLAPEDEELLRDD   TP+KKDGI+RKERPTDKG++WLVKTQYIS ++ ES +QSLTEKQAKELREMKGG NIL NLNNRERQIK+IEASFEACKSRPVH
Subjt:  PLAPEDEELLRDDVLKTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESTKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVH

Query:  ATNKNLYPVEGLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW
        ATNKNL PVE LPLLP FDRYD+ FVV  FD AP ADSE F KL+ SIRDAHES+AI+KSY+  GSD   PEKFLAYMVPS DELSKDI+DE E++SY+W
Subjt:  ATNKNLYPVEGLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGSDPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSW

Query:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVY-------SNSKRGSDIEDG
        VREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRSSDE+EHFP P+RVTVRRRSTV+ +E KD GVY       S+  R  + E G
Subjt:  VREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVKDPGVY-------SNSKRGSDIEDG

Query:  LGRSHKHDRHQDMDQYS-GAEDEMSD
        LGRS KH+  QD +QYS G ED+ S+
Subjt:  LGRSHKHDRHQDMDQYS-GAEDEMSD


Sequences
CDS sequence
ATGGCTTCTTACCGGCCATATCCTCCACAATCGTCCTTCGGTCCTTCACCTGGTCAAAATCCGATTCCGCCTCCACCAGCACAACCGGCTTCGGTTCCAACGCAACAGCG
AGGAGGAGGAGGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGTGGTGATGGGTCTGTGCCTACCGCTCCTTCTTCTTCGTATCCCCAAAATTACAACCAAGTTC
ATCAAAGTTCTAATTACCACCAGCAACATTATGGTCCGCCGAGAAGCCAACACGCTCCGCCTCCTCCTCCTCATCAATCGTATCCTTATGCACCACAGCCGCCGCCGCCT
CCTCCCGATTCTTCCTATCCTCCGCCTCCACCCCCACCAGGGCCTTCGCAACCTCCTCATCTTTACTATCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTC
AATGCAGCCACCACCTCCGCCCTCATCTCCACCACCGAGCTCCTCAATACCGCCGCCTCCACCACCAAATTCTCCACCACCTCCTTCAGCTTCTCATCAAAAGGCAGAGG
GTACAAACATGGGAGCGCACGAGCGCGATAAAGGGGTTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCACTCTGGT
CCCCCAATGCCTCCAAAGAAAGCAAACGGACCTTCAGGGAGAATGGAGACGGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCAAGACGAGAG
GCATAGACATCATCTAAAAGAATCCCAAAACACTATTCTGCAAAAGACCCAGATGTTATCTACTGGGAAGGGGCATGGATCAATTGTGGGGTCTCGAATGGGGGAAAGGA
GGGCCACTCCATTTCTTACTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAACGAGCTTCCAGATACAAGTGCTCAG
CCGAAGCTCATGTCGCTACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCGCTAGAGAAAACGTACAAGCCTCAGCTTTATGTCGAGCCAGATCTTGGAAT
ACCTCTCGATTTGCTTGACCTCAGCGTATACAACCCTCCTAGTGTTAGAATGCCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATGTATTGAAAACTCCAGTTA
AAAAGGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTTAGCATTGAATCAACAAAACAGTCTTTG
ACTGAAAAACAAGCAAAAGAACTGCGAGAAATGAAGGGAGGGCGCAATATTCTCGAGAACCTCAACAATAGGGAAAGGCAAATTAAGGAAATTGAGGCATCATTTGAGGC
ATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTAGAGGGTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTTGTAGTGGCGT
TTGATAGTGCTCCCACTGCTGATTCAGAGACTTTCAACAAGTTAAACCAATCTATCCGTGACGCTCATGAATCACAGGCGATAATGAAAAGCTATATGGCAACAGGCTCA
GATCCTACAAAACCTGAGAAATTTCTAGCATACATGGTTCCTTCTCCAGATGAGCTTTCAAAGGATATCTACGATGAACAAGAAGATGTTTCATATTCCTGGGTTCGTGA
GTACCATTGGGATGTACGGGGTGATAATGTGGATGATCCCACCACGTATCTCGTTTCATTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGTTCTGA
GAAAAAAGAGGGCTAAAGAAGGTAGATCAAGTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACTGTAAGGAGAAGATCAACTGTAGCTACATTGGAAGTGAAG
GATCCAGGGGTTTACTCAAACTCGAAAAGAGGATCAGATATTGAAGATGGTCTTGGAAGATCACATAAACATGATAGACACCAAGACATGGATCAATACAGTGGAGCTGA
AGATGAGATGTCTGATTGA
mRNA sequence
ATGGCTTCTTACCGGCCATATCCTCCACAATCGTCCTTCGGTCCTTCACCTGGTCAAAATCCGATTCCGCCTCCACCAGCACAACCGGCTTCGGTTCCAACGCAACAGCG
AGGAGGAGGAGGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGTGGTGATGGGTCTGTGCCTACCGCTCCTTCTTCTTCGTATCCCCAAAATTACAACCAAGTTC
ATCAAAGTTCTAATTACCACCAGCAACATTATGGTCCGCCGAGAAGCCAACACGCTCCGCCTCCTCCTCCTCATCAATCGTATCCTTATGCACCACAGCCGCCGCCGCCT
CCTCCCGATTCTTCCTATCCTCCGCCTCCACCCCCACCAGGGCCTTCGCAACCTCCTCATCTTTACTATCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAGTC
AATGCAGCCACCACCTCCGCCCTCATCTCCACCACCGAGCTCCTCAATACCGCCGCCTCCACCACCAAATTCTCCACCACCTCCTTCAGCTTCTCATCAAAAGGCAGAGG
GTACAAACATGGGAGCGCACGAGCGCGATAAAGGGGTTTCAAAGGATCCGTCATATGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCACTCTGGT
CCCCCAATGCCTCCAAAGAAAGCAAACGGACCTTCAGGGAGAATGGAGACGGAGGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCAAGACGAGAG
GCATAGACATCATCTAAAAGAATCCCAAAACACTATTCTGCAAAAGACCCAGATGTTATCTACTGGGAAGGGGCATGGATCAATTGTGGGGTCTCGAATGGGGGAAAGGA
GGGCCACTCCATTTCTTACTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAATTCCGGAACGAGCTTCCAGATACAAGTGCTCAG
CCGAAGCTCATGTCGCTACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCGCTAGAGAAAACGTACAAGCCTCAGCTTTATGTCGAGCCAGATCTTGGAAT
ACCTCTCGATTTGCTTGACCTCAGCGTATACAACCCTCCTAGTGTTAGAATGCCCCTTGCTCCTGAAGATGAGGAATTATTACGTGATGATGTATTGAAAACTCCAGTTA
AAAAGGATGGTATAAAAAGAAAAGAACGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACACAGTACATCTCTCCTCTTAGCATTGAATCAACAAAACAGTCTTTG
ACTGAAAAACAAGCAAAAGAACTGCGAGAAATGAAGGGAGGGCGCAATATTCTCGAGAACCTCAACAATAGGGAAAGGCAAATTAAGGAAATTGAGGCATCATTTGAGGC
ATGCAAGTCACGCCCTGTTCATGCAACTAATAAGAATTTATATCCTGTAGAGGGTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTTGTAGTGGCGT
TTGATAGTGCTCCCACTGCTGATTCAGAGACTTTCAACAAGTTAAACCAATCTATCCGTGACGCTCATGAATCACAGGCGATAATGAAAAGCTATATGGCAACAGGCTCA
GATCCTACAAAACCTGAGAAATTTCTAGCATACATGGTTCCTTCTCCAGATGAGCTTTCAAAGGATATCTACGATGAACAAGAAGATGTTTCATATTCCTGGGTTCGTGA
GTACCATTGGGATGTACGGGGTGATAATGTGGATGATCCCACCACGTATCTCGTTTCATTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGTTCTGA
GAAAAAAGAGGGCTAAAGAAGGTAGATCAAGTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACTGTAAGGAGAAGATCAACTGTAGCTACATTGGAAGTGAAG
GATCCAGGGGTTTACTCAAACTCGAAAAGAGGATCAGATATTGAAGATGGTCTTGGAAGATCACATAAACATGATAGACACCAAGACATGGATCAATACAGTGGAGCTGA
AGATGAGATGTCTGATTGA
Protein sequence
MASYRPYPPQSSFGPSPGQNPIPPPPAQPASVPTQQRGGGGGSQYNQNWGGYGGDGSVPTAPSSSYPQNYNQVHQSSNYHQQHYGPPRSQHAPPPPPHQSYPYAPQPPPP
PPDSSYPPPPPPPGPSQPPHLYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSASHQKAEGTNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSG
PPMPPKKANGPSGRMETEDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMLSTGKGHGSIVGSRMGERRATPFLTGERIENRLKKPTTFLCKLKFRNELPDTSAQ
PKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDVLKTPVKKDGIKRKERPTDKGVAWLVKTQYISPLSIESTKQSL
TEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEGLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLNQSIRDAHESQAIMKSYMATGS
DPTKPEKFLAYMVPSPDELSKDIYDEQEDVSYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRSTVATLEVK
DPGVYSNSKRGSDIEDGLGRSHKHDRHQDMDQYSGAEDEMSD