CuGenDBv2

Clc09G00190 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G00190
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein PAF1 homolog
Genome location: ClcChr09:175098..184956
RNA-Seq Expression: Clc09G00190
Synteny: Clc09G00190
Gene Ontology terms:
  GO:0000160 - phosphorelay signal transduction system (biological process)
  GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
  GO:0016570 - histone modification (biological process)
  GO:0016593 - Cdc73/Paf1 complex (cellular component)
  GO:0000993 - RNA polymerase II complex binding (molecular function)
  GO:0003682 - chromatin binding (molecular function)
InterPro domains:
  IPR007133 - RNA polymerase II associated factor Paf1
  IPR008207 - Signal transduction histidine kinase, phosphotransfer (Hpt) domain
  IPR036641 - HPT domain superfamily
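
The genome location above is given as a sequence name followed by a start..end range. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the string can be split and the locus span computed as follows. The function name parse_location is hypothetical and is not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("ClcChr09:175098..184956")
# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus covers 9859 positions on ClcChr09, introns included.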


Homology
GenBank top hits (e value, % identity, alignment)
KAG7014045.1 Protein PAF1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 93.07
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS
        MASYRPYPPQSSFGP+P QNPIPPPPA  AA  P Q+ GGSQYNQNWGGYGGDGS  PPA SSSYPQNYNQ+HQSSNYHQQHYGPPRSQ PPPPPPPHQS
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS

Query:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        YPYAPQ  PPPPPPPDSSYPPPPPPPA SQ    Y+PPSQY QG+QNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++G HERDKGV
Subjt:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA
        SKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGKGHGSIVGSRMGERKA 
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIR+AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG
        NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN KRGSDIEDG+GRSHKHDR+QDMDQYSG
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG

Query:  AEDEMSD
        A+D+MSD
Subjt:  AEDEMSD

XP_004141783.2 LOW QUALITY PROTEIN: protein PAF1 homolog [Cucumis sativus] | e value: 0.0e+00 | % identity: 94.21
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQH-PPPPPPPH
        MASYRPYPPQSSFG AP+QN IPPP AQSA+V  QQRGG  +QYNQNWG Y GD SAPPAPSSSYPQNY NQLHQ+SNYH Q YGPPR+QH PPPPPPPH
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQH-PPPPPPPH

Query:  QSYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        QSYPYAPQ  PPPPPPPDSSYPPPPPPPA SQ PN YYP SQYSQGNQNQQSMQPPPPPSSPPPSSS PPPPPPNSPPPPSA QQKAEGTNMGAHERDKG
Subjt:  QSYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKA
        V KDPSYGRR+RENSNHDKHQ+HSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGK HGSIVGSRMGERKA
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKA

Query:  APFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
         PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVRMPLAPEDEELLR
Subjt:  APFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVE
Subjt:  DDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIR+AHESQAIMKSYMAT SDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG+YSNSKRGSDIEDGIGRSHKHDR+QDMDQ+S
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYS

Query:  GAEDEMSD
        GAEDEMSD
Subjt:  GAEDEMSD

XP_022953373.1 protein PAF1 homolog [Cucurbita moschata] | e value: 0.0e+00 | % identity: 93.49
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS
        MASYRPYPPQSSFGP+P QNPIPPPPA  AA  P Q+ GGSQYNQNWGGYGGDGS  PPA SSSYPQNYNQ+HQSSNYHQQHYGPPRSQ PPPPPPPHQS
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS

Query:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        YPYAPQ  PPPPPPPDSSYPPPPPPPA SQ    Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++GAHERDKGV
Subjt:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA
        SKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGKGHGSIVGSRMGERKA 
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIR+AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG
        NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN KRGSDIEDG+GRSHKHDR+QDMDQYSG
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG

Query:  AEDEMSD
        AED+MSD
Subjt:  AEDEMSD

XP_023547399.1 protein PAF1 homolog [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 93.21
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS
        MASYRPYPPQSSFGP+P QNPIPPPPA  AA  P Q+ GGSQYNQNWGGYGGDGS  PPA SSSYPQNYNQ+HQSSN+HQQHYGPPRSQ PPPPPPPHQS
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS

Query:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        YPYAPQ  PPPPPPPDSSYPPPPPPPA SQ    Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKGV
Subjt:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA
        SKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGKGHGSIVGSRMGERKA 
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIR+AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG
        NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN KRGSDIEDG+GRSHKHDR+QDMDQYSG
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG

Query:  AEDEMSD
        AED+MSD
Subjt:  AEDEMSD

XP_038898523.1 protein PAF1 homolog [Benincasa hispida] | e value: 0.0e+00 | % identity: 95.04
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSAPPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQSY
        MASYRPYPPQSSFGPAP QNP+PPPP QSA+VP QQRGGGSQYNQNWGGYGGDGS PPA SSSYPQNYNQ HQSSNYHQQHYGPPRSQH PPPPPP+QSY
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSAPPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQSY

Query:  PYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSK
        PYAPQ   PPPPPPDSSYPPPPPPPAPSQ PN YYPPS         QSMQPPPPPSSPPPSSSIPPPPPPNSPPP SAPQQKAEGTNMGAHERDKGVSK
Subjt:  PYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSK

Query:  DPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAAPF
        DPSYGRR+RENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGKGHGSIVGSRMGERKA PF
Subjt:  DPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAAPF

Query:  LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDV
        LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRDDV
Subjt:  LSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRDDV

Query:  LKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLP
        LKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLP
Subjt:  LKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEVLP

Query:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGDNV
        LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIR+ HESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGDNV
Subjt:  LLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGDNV

Query:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSGAE
        DDPTTYLVSFDD EARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDG+GRSHKHDR+QDMDQYSGAE
Subjt:  DDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSGAE

Query:  DEMSD
        DEMSD
Subjt:  DEMSD
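
Each alignment block above has a middle row between the Query and Subjt rows. Assuming the usual BLASTP midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), the identity of a block can be tallied from the query and midline rows alone. The sketch below uses the last block of the first GenBank hit (query AEDEMSD, midline A+D+MSD); the function name midline_stats is hypothetical.

```python
def midline_stats(query: str, midline: str) -> dict[str, int]:
    """Count identical and positive columns in one alignment block."""
    # Trailing spaces in the midline are often lost; pad back to query length.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return {
        "aligned": len(query),
        "identical": identical,
        "positive": identical + conservative,
    }


stats = midline_stats("AEDEMSD", "A+D+MSD")
percent_identity = 100 * stats["identical"] / stats["aligned"]
print(stats, round(percent_identity, 2))
```

Summing these counts over all blocks of a hit should approximate the % identity reported for it, although the listed figure may be computed over a different alignment length.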

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCT6 Uncharacterized protein | e value: 0.0e+00 | % identity: 94.35
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQH-PPPPPPPH
        MASYRPYPPQSSFG AP+QN IPPP AQSA+V  QQRGG  +QYNQNWG Y GD SAPPAPSSSYPQNY NQLHQ+SNYH Q YGPPR+QH PPPPPPPH
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQH-PPPPPPPH

Query:  QSYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG
        QSYPYAPQ  PPPPPPPDSSYPPPPPPPA SQ PN YYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA QQKAEGTNMGAHERDKG
Subjt:  QSYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKG

Query:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKA
        V KDPSYGRR+RENSNHDKHQ+HSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGK HGSIVGSRMGERKA
Subjt:  VSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKA

Query:  APFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR
         PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNP SVRMPLAPEDEELLR
Subjt:  APFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLR

Query:  DDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE
        DDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVE
Subjt:  DDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVE

Query:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRG
        VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIR+AHESQAIMKSYMAT SDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRG
Subjt:  VLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRG

Query:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYS
        DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG+YSNSKRGSDIEDGIGRSHKHDR+QDMDQ+S
Subjt:  DNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYS

Query:  GAEDEMSD
        GAEDEMSD
Subjt:  GAEDEMSD

A0A1S3CHF3 LOW QUALITY PROTEIN: protein PAF1 homolog | e value: 0.0e+00 | % identity: 93.5
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQHPPPPPPPHQ
        MASYRPYPPQSSFG AP+QN IPPPP+QSA+   QQRGG  +QYNQNWG Y GD S PPAPSSSYPQNY NQLHQ+SNYH Q YG PR+QH PPPPPPHQ
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQHPPPPPPPHQ

Query:  SYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        SYPYAPQ  PPPPPPPDSSYPPPPPPPAPSQ PN YYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA QQKAEG NMGAHERDKGV
Subjt:  SYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA
        SKDPSYGRR+RENSNHDKHQ+HSGPPMPPKKANGPSGRMETDDEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGK HGSIVGSRMGERKA 
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVEV
Subjt:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIR+AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDR-NQDMDQYS
        NVDDPTTYLVSFDD+EARYVPLPTKLVL KKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG+YSNSKRGSDIEDGIGR HKHDR +QDMDQYS
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDR-NQDMDQYS

Query:  GAEDEMSD
        GAEDEMSD
Subjt:  GAEDEMSD

A0A5A7UA23 Protein PAF1-like protein | e value: 0.0e+00 | % identity: 93.64
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQHPPPPPPPHQ
        MASYRPYPPQSSFG AP+QN IPPPP+QSA+   QQRGG  +QYNQNWG Y GD S PPAPSSSYPQNY NQLHQ+SNYH Q YG PR+QH PPPPPPHQ
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGG-SQYNQNWGGYGGDGSAPPAPSSSYPQNY-NQLHQSSNYHQQHYGPPRSQHPPPPPPPHQ

Query:  SYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        SYPYAPQ  PPPPPPPDSSYPPPPPPPAPSQ PN YYP SQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSA QQKAEG NMGAHERDKGV
Subjt:  SYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA
        SKDPSYGRR+RENSNHDKHQ+HSGPPMPPKKANGPSGRMETDDEK+LRKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGK HGSIVGSRMGERKA 
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRP+HATNKNLYPVEV
Subjt:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIR+AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDR-NQDMDQYS
        NVDDPTTYLVSFDD+EARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPG+YSNSKRGSDIEDGIGR HKHDR +QDMDQYS
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDR-NQDMDQYS

Query:  GAEDEMSD
        GAEDEMSD
Subjt:  GAEDEMSD

A0A6J1GN64 protein PAF1 homolog | e value: 0.0e+00 | % identity: 93.49
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS
        MASYRPYPPQSSFGP+P QNPIPPPPA  AA  P Q+ GGSQYNQNWGGYGGDGS  PPA SSSYPQNYNQ+HQSSNYHQQHYGPPRSQ PPPPPPPHQS
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS

Query:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        YPYAPQ  PPPPPPPDSSYPPPPPPPA SQ    Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQK EG+++GAHERDKGV
Subjt:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA
        SKDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGKGHGSIVGSRMGERKA 
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIR+AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG
        NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRS+DEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN KRGSDIEDG+GRSHKHDR+QDMDQYSG
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG

Query:  AEDEMSD
        AED+MSD
Subjt:  AEDEMSD

A0A6J1JP14 protein PAF1 homolog | e value: 0.0e+00 | % identity: 93.21
Query:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS
        MASYRPYPPQSSFGP+P QNPIPPPPA  AA  P Q+ G SQYNQNWGGYGGDGS  PPA SSSYPQNYNQ+HQSSNYHQQHYGPPRSQ PPPPPPPHQS
Subjt:  MASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRGGGSQYNQNWGGYGGDGSA-PPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQS

Query:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV
        YPYAPQ  PPPPPPPDSSYPPPPPPPA SQ    Y+PPSQY QGNQNQQS+Q PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQ K EG+++GAHERDKGV
Subjt:  YPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQSMQ-PPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGV

Query:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA
        +KDPSYGRRERENSNHDKHQRHSGPPMPPKK+NGPSGR+ETDDEKR RKKREFEKQRQDERHRHHLKESQNTILQKTQM+STGKGHGSIVGSRMGERKA 
Subjt:  SKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAA

Query:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD
        PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVR+PLAPEDEELLRD
Subjt:  PFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPLDLLDLSVYNPPSVRMPLAPEDEELLRD

Query:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV
        DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIES KQSLTEKQAKELREMKGGRNILENLNNRER+IKEI+ASFEACKSRPVHATNKNLYPVEV
Subjt:  DVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASFEACKSRPVHATNKNLYPVEV

Query:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD
        LPLLPDFDRYDDPFVVVAFD+APTADSETFNKLDQSIR+AHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDV+YSWVREYHWDVRGD
Subjt:  LPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWVREYHWDVRGD

Query:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG
        NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSN KRGSDIEDG+GRSHKHDR+QDMDQYSG
Subjt:  NVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG

Query:  AEDEMSD
        AED+MSD
Subjt:  AEDEMSD

SwissProt top hits (e value, % identity, alignment)
F4HQA1 Protein PAF1 homolog | e value: 5.9e-174 | % identity: 60.9
Query:  YGPPRSQHPPPPPPPHQSYPYAPQPPPP------PPPPPDS---SYPPPPPPPAPSQLPNQYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPP
        Y PP   +PP P PP Q+   AP PPPP      PPPPP     SYPPPPPPP     P+ YY     Y Q NQ    +Q PPPP  PPPS+     PPP
Subjt:  YGPPRSQHPPPPPPPHQSYPYAPQPPPP------PPPPPDS---SYPPPPPPPAPSQLPNQYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPP

Query:  NSPPPPSAPQQKAEGTNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTI
          P PP          + G ++ +KG SK    GRRER   +  KH   S  P         S ++ET++E+RLRKKRE EKQRQDE+HR  +K S    
Subjt:  NSPPPPSAPQQKAEGTNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTI

Query:  LQKTQMISTGKGHGSIVGSRMGERKAAPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPL
          K+QM    KGH         E+K  P L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+++++KD +T+YTITSLEK +KP+++VEPDLGIPL
Subjt:  LQKTQMISTGKGHGSIVGSRMGERKAAPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPL

Query:  DLLDLSVYNPPSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQI
        DLLDLSVYNPP V+ PLAPEDEELLRDD   TP+KKD GI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNNRERQI
Subjt:  DLLDLSVYNPPSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQI

Query:  KEIEASFEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDE
        K+IEASFEACKSRPVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIR+AHES+AI+KSY+  GSD + PEKFLAYMVPS DE
Subjt:  KEIEASFEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDE

Query:  LSKDIYDEQEDVAYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVY---
        LSKDI+DE E+++Y+WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRSSDE+EHFP P+RVTVRRR TV+ +E KD GVY   
Subjt:  LSKDIYDEQEDVAYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVY---

Query:  ----SNSKRGSDIEDGIGRSHKHDRNQDMDQYS-GAEDEMSD
            S+  R  + E G+GRS KH+  QD +QYS G ED+ S+
Subjt:  ----SNSKRGSDIEDGIGRSHKHDRNQDMDQYS-GAEDEMSD

Q0DK78 Pseudo histidine-containing phosphotransfer protein 2 | e value: 2.6e-36 | % identity: 54.81
Query:  MEKTQLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQL
        ME + LRRQ+  ++++LFDQG+LDEQF QLEELQD+++PNFVEE+  L+++DSSRL+ +IEQA+ K P DF +LD+L+ Q KGS SSIGA ++K EC+  
Subjt:  MEKTQLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQL

Query:  REYCKAGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ
        +  C   + EGC R+ Q++K+E+ TL++KLE+YFQ
Subjt:  REYCKAGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ

Q0JJE3 Pseudo histidine-containing phosphotransfer protein 1 | e value: 4.2e-34 | % identity: 58.62
Query:  QGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCKAGSGEGCLRTFQQL
        +G+LDEQF Q+E+LQD+A+PNFVEE+VTL+++DS RL+ +IEQAL+K P DFN+ D  M Q KGS SSIGA ++K EC   R+ C  G+ EGC+R+FQ++
Subjt:  QGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCKAGSGEGCLRTFQQL

Query:  KKEYTTLRKKLEAYFQ
        K+E+  LR+KLE+YFQ
Subjt:  KKEYTTLRKKLEAYFQ

Q6F303 Pseudo histidine-containing phosphotransfer protein 5 | e value: 1.2e-38 | % identity: 56.3
Query:  MEKTQLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQL
        ME   LRRQ A++++SLFDQG+LDEQF Q+E+LQD+ANPNF EE+V+L+++DS+R++L+ EQA++K P DF + D  M Q KGS SSIGA +VK ECT  
Subjt:  MEKTQLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQL

Query:  REYCKAGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ
        R +C   + EGC R+FQ++K+E+  LR+K E+YFQ
Subjt:  REYCKAGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ

Q9LU15 Histidine-containing phosphotransfer protein 4 | e value: 1.2e-44 | % identity: 74.14
Query:  QGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCKAGSGEGCLRTFQQL
        QG+LDEQF++LEELQDDANPNFVEE+  LY++DS+RLI +I+QAL++   DFN+LD+ MHQFKGSS+SIGA KVKAECT  REYC+AG+ EGCLRTFQQL
Subjt:  QGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCKAGSGEGCLRTFQQL

Query:  KKEYTTLRKKLEAYFQ
        KKE++TLRKKLE YFQ
Subjt:  KKEYTTLRKKLEAYFQ
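
The SwissProt hits above are not listed in order of significance. As a small sketch, the e values and % identities read from the hit lines can be held as tuples and ranked by e value, smallest (most significant) first. The tuple layout is an illustrative choice, not a CuGenDB format.

```python
# (accession, e value as printed, % identity), copied from the hit lines above.
swissprot_hits = [
    ("F4HQA1", "5.9e-174", 60.9),
    ("Q0DK78", "2.6e-36", 54.81),
    ("Q0JJE3", "4.2e-34", 58.62),
    ("Q6F303", "1.2e-38", 56.3),
    ("Q9LU15", "1.2e-44", 74.14),
]

# float() parses scientific notation directly; a lower e value ranks first.
ranked = sorted(swissprot_hits, key=lambda hit: float(hit[1]))

for accession, e_value, identity in ranked:
    print(f"{accession}\t{e_value}\t{identity}")
```

Ranked this way, the PAF1 homolog F4HQA1 comes first and the four phosphotransfer proteins follow. Q9LU15 has the highest % identity but a larger e value than F4HQA1, which is consistent with it aligning over a much shorter stretch.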

Arabidopsis top hits (e value, % identity, alignment)
AT1G79730.1 hydroxyproline-rich glycoprotein family protein | e value: 4.2e-175 | % identity: 60.9
Query:  YGPPRSQHPPPPPPPHQSYPYAPQPPPP------PPPPPDS---SYPPPPPPPAPSQLPNQYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPP
        Y PP   +PP P PP Q+   AP PPPP      PPPPP     SYPPPPPPP     P+ YY     Y Q NQ    +Q PPPP  PPPS+     PPP
Subjt:  YGPPRSQHPPPPPPPHQSYPYAPQPPPP------PPPPPDS---SYPPPPPPPAPSQLPNQYYPPS-QYSQGNQNQQSMQPPPPPSSPPPSSSIPPPPPP

Query:  NSPPPPSAPQQKAEGTNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTI
          P PP          + G ++ +KG SK    GRRER   +  KH   S  P         S ++ET++E+RLRKKRE EKQRQDE+HR  +K S    
Subjt:  NSPPPPSAPQQKAEGTNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDERHRHHLKESQNTI

Query:  LQKTQMISTGKGHGSIVGSRMGERKAAPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPL
          K+QM    KGH         E+K  P L+ +R+ENRLKKPTTF+CKLKFRNELPD SAQ KLM+++++KD +T+YTITSLEK +KP+++VEPDLGIPL
Subjt:  LQKTQMISTGKGHGSIVGSRMGERKAAPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLGIPL

Query:  DLLDLSVYNPPSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQI
        DLLDLSVYNPP V+ PLAPEDEELLRDD   TP+KKD GI+RKERPTDKG++WLVKTQYIS ++ ESA+QSLTEKQAKELREMKGG NIL NLNNRERQI
Subjt:  DLLDLSVYNPPSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQI

Query:  KEIEASFEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDE
        K+IEASFEACKSRPVHATNKNL PVEVLPLLP FDRYD+ FVV  FD AP ADSE F KLD SIR+AHES+AI+KSY+  GSD + PEKFLAYMVPS DE
Subjt:  KEIEASFEACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDE

Query:  LSKDIYDEQEDVAYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVY---
        LSKDI+DE E+++Y+WVREY WDV+  N +DP TYLVSFD+  A Y+PLP +L LRKKRA+EGRSSDE+EHFP P+RVTVRRR TV+ +E KD GVY   
Subjt:  LSKDIYDEQEDVAYSWVREYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVY---

Query:  ----SNSKRGSDIEDGIGRSHKHDRNQDMDQYS-GAEDEMSD
            S+  R  + E G+GRS KH+  QD +QYS G ED+ S+
Subjt:  ----SNSKRGSDIEDGIGRSHKHDRNQDMDQYS-GAEDEMSD

AT3G16360.1 HPT phosphotransmitter 4 | e value: 8.3e-46 | % identity: 74.14
Query:  QGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCKAGSGEGCLRTFQQL
        QG+LDEQF++LEELQDDANPNFVEE+  LY++DS+RLI +I+QAL++   DFN+LD+ MHQFKGSS+SIGA KVKAECT  REYC+AG+ EGCLRTFQQL
Subjt:  QGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCKAGSGEGCLRTFQQL

Query:  KKEYTTLRKKLEAYFQ
        KKE++TLRKKLE YFQ
Subjt:  KKEYTTLRKKLEAYFQ

AT3G16360.2  HPT phosphotransmitter 4  e value: 8.5e-51  %identity: 73.08
Query:  LRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCK
        ++RQ+A I+QSLFDQG+LDEQF++LEELQDDANPNFVEE+  LY++DS+RLI +I+QAL++   DFN+LD+ MHQFKGSS+SIGA KVKAECT  REYC+
Subjt:  LRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCK

Query:  AGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ
        AG+ EGCLRTFQQLKKE++TLRKKLE YFQ
Subjt:  AGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ

AT3G21510.1  histidine-containing phosphotransmitter 1  e value: 4.6e-28  %identity: 42.96
Query:  MEKTQLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQL
        M+  Q ++ L +  +SLF +G LD QF+QL++LQD++NP+FV ++VTL+++DS R++  +  +L +  +DF K+D  +HQ KGSSSSIGA++VK  C   
Subjt:  MEKTQLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQL

Query:  REYCKAGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ
        R +C+  + E C R  QQ+K+EY  ++ +LE  F+
Subjt:  REYCKAGSGEGCLRTFQQLKKEYTTLRKKLEAYFQ

AT3G29350.1  histidine-containing phosphotransmitter 2  e value: 3.9e-27  %identity: 45.8
Query:  QLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKS-PLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREY
        QL+RQ  +   SL+ QGFLD+QF +L++LQDD +P+FV E+++L++ D  +LI ++ +AL  +  +DF+++ A +HQ KGSSSS+GAK+VK  C   +E 
Subjt:  QLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKS-PLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREY

Query:  CKAGSGEGCLRTFQQLKKEYTTLRKKLEAYF
        C+A + EGC+R  QQ+  EY  L+ KL+  F
Subjt:  CKAGSGEGCLRTFQQLKKEYTTLRKKLEAYF


Sequences
CDS sequence
ATGGAGAAAACACAGTTGCGTAGGCAGCTTGCTAACATTAGGCAGTCCCTCTTTGATCAGGGATTTCTTGATGAACAGTTTGTGCAGTTGGAGGAACTGCAAGATGATGC
TAACCCAAACTTTGTGGAGGAAATTGTTACATTGTACTACAGAGACTCATCCAGACTCATCCTCAGCATAGAGCAAGCACTACAGAAGAGCCCTCTGGACTTCAATAAGT
TGGATGCCCTCATGCACCAGTTTAAAGGAAGTAGCTCAAGTATTGGAGCCAAAAAGGTGAAAGCTGAGTGCACACAGTTGAGGGAATATTGCAAGGCAGGAAGTGGAGAA
GGATGCTTGAGGACATTCCAACAACTGAAGAAAGAATACACAACTCTGAGAAAGAAGCTTGAAGCCTATTTTCAGGGTTTTGGAGATTTCTTGTTTGGGAGGAGGGTTTC
TGATGATTGGCCCCGAAATCTGACTCAACTTCAACGTACTGTTGATTTCGCCATTTTTCTTTCTTCCAGGGTTCTGCTTCGTCCAGTTTTTGGGGGAGATATTGCTATGG
CTTCTTACAGGCCATATCCTCCACAATCCTCCTTCGGTCCTGCACCTAGTCAAAACCCGATTCCGCCTCCACCAGCGCAATCGGCTGCCGTTCCACCGCAACAGCGAGGT
GGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGTGGTGATGGGTCTGCGCCTCCTGCTCCATCTTCATCGTATCCCCAAAATTACAACCAACTCCATCAAAGTTC
TAATTACCACCAGCAACATTATGGTCCCCCGCGAAGCCAACACCCTCCACCTCCTCCTCCTCCTCACCAGTCGTATCCTTATGCACCACAACCGCCCCCGCCCCCGCCCC
CGCCTCCCGATTCTTCCTATCCACCGCCTCCACCCCCACCAGCGCCCTCGCAACTTCCCAATCAATACTACCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAG
TCAATGCAGCCACCACCTCCGCCCTCATCTCCACCACCGAGCTCTTCAATCCCGCCACCTCCACCTCCAAATTCTCCACCACCTCCATCAGCGCCTCAGCAAAAAGCAGA
GGGTACAAACATGGGAGCACATGAACGTGATAAAGGGGTTTCAAAGGATCCCTCATACGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCATTCTG
GTCCCCCAATGCCTCCCAAGAAAGCAAACGGTCCTTCAGGAAGAATGGAAACAGATGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCAGGATGAG
AGGCACAGACACCATCTTAAAGAATCCCAAAACACTATACTGCAAAAGACCCAGATGATATCTACTGGGAAGGGGCATGGATCAATTGTAGGGTCCCGAATGGGGGAAAG
GAAGGCCGCTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAGTTCCGGAACGAGCTTCCAGATACAAGTGCTC
AGCCAAAGCTCATGTCACTACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCGCTAGAGAAAACGTACAAACCTCAGCTTTATGTAGAGCCAGATCTTGGA
ATACCTCTCGATTTGCTTGACCTCAGTGTTTACAACCCTCCTAGTGTTAGAATGCCCCTTGCTCCTGAAGATGAGGAATTATTACGGGATGATGTACTGAAAACTCCAGT
TAAAAAGGATGGTGGTATAAAAAGAAAAGAGCGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACGCAGTACATCTCTCCTCTTAGCATTGAATCGGCGAAACAGT
CTTTGACTGAAAAACAGGCTAAAGAACTGCGAGAAATGAAGGGAGGGCGGAATATTCTTGAGAACCTCAATAATAGGGAAAGGCAAATTAAGGAAATTGAGGCATCATTC
GAGGCATGCAAGTCACGCCCTGTTCATGCCACCAATAAGAATTTATATCCTGTAGAGGTTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTTGTGGT
GGCGTTTGATAGCGCTCCCACAGCTGATTCAGAGACTTTCAATAAGTTAGATCAATCCATCCGTAACGCTCATGAATCACAGGCGATAATGAAAAGCTACATGGCAACAG
GGTCAGACCCTTCAAAACCTGAGAAATTTCTTGCGTACATGGTTCCCTCTCCAGATGAGCTATCAAAGGATATTTATGATGAACAAGAAGATGTCGCATATTCCTGGGTT
CGTGAGTACCATTGGGATGTACGGGGTGACAATGTGGATGATCCCACTACATATCTTGTTTCATTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGT
TCTTAGAAAAAAGAGGGCTAAAGAAGGGAGATCTAGTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACTGTAAGGAGAAGACCAACTGTAGCCACTTTGGAAG
TGAAGGATCCAGGGGTTTACTCGAATTCGAAAAGAGGATCAGATATTGAAGATGGTATAGGGAGATCACATAAACATGATAGAAACCAAGACATGGATCAGTACAGTGGA
GCTGAAGACGAGATGTCTGATTGA
mRNA sequence
ATGGAGAAAACACAGTTGCGTAGGCAGCTTGCTAACATTAGGCAGTCCCTCTTTGATCAGGGATTTCTTGATGAACAGTTTGTGCAGTTGGAGGAACTGCAAGATGATGC
TAACCCAAACTTTGTGGAGGAAATTGTTACATTGTACTACAGAGACTCATCCAGACTCATCCTCAGCATAGAGCAAGCACTACAGAAGAGCCCTCTGGACTTCAATAAGT
TGGATGCCCTCATGCACCAGTTTAAAGGAAGTAGCTCAAGTATTGGAGCCAAAAAGGTGAAAGCTGAGTGCACACAGTTGAGGGAATATTGCAAGGCAGGAAGTGGAGAA
GGATGCTTGAGGACATTCCAACAACTGAAGAAAGAATACACAACTCTGAGAAAGAAGCTTGAAGCCTATTTTCAGGGTTTTGGAGATTTCTTGTTTGGGAGGAGGGTTTC
TGATGATTGGCCCCGAAATCTGACTCAACTTCAACGTACTGTTGATTTCGCCATTTTTCTTTCTTCCAGGGTTCTGCTTCGTCCAGTTTTTGGGGGAGATATTGCTATGG
CTTCTTACAGGCCATATCCTCCACAATCCTCCTTCGGTCCTGCACCTAGTCAAAACCCGATTCCGCCTCCACCAGCGCAATCGGCTGCCGTTCCACCGCAACAGCGAGGT
GGAGGTAGTCAGTATAATCAGAATTGGGGTGGTTATGGTGGTGATGGGTCTGCGCCTCCTGCTCCATCTTCATCGTATCCCCAAAATTACAACCAACTCCATCAAAGTTC
TAATTACCACCAGCAACATTATGGTCCCCCGCGAAGCCAACACCCTCCACCTCCTCCTCCTCCTCACCAGTCGTATCCTTATGCACCACAACCGCCCCCGCCCCCGCCCC
CGCCTCCCGATTCTTCCTATCCACCGCCTCCACCCCCACCAGCGCCCTCGCAACTTCCCAATCAATACTACCCCCCTTCACAGTATTCCCAGGGTAATCAAAATCAGCAG
TCAATGCAGCCACCACCTCCGCCCTCATCTCCACCACCGAGCTCTTCAATCCCGCCACCTCCACCTCCAAATTCTCCACCACCTCCATCAGCGCCTCAGCAAAAAGCAGA
GGGTACAAACATGGGAGCACATGAACGTGATAAAGGGGTTTCAAAGGATCCCTCATACGGCAGGCGTGAACGTGAAAATTCAAATCATGATAAACACCAGAGGCATTCTG
GTCCCCCAATGCCTCCCAAGAAAGCAAACGGTCCTTCAGGAAGAATGGAAACAGATGATGAGAAAAGACTGAGGAAGAAGAGAGAGTTCGAAAAACAAAGGCAGGATGAG
AGGCACAGACACCATCTTAAAGAATCCCAAAACACTATACTGCAAAAGACCCAGATGATATCTACTGGGAAGGGGCATGGATCAATTGTAGGGTCCCGAATGGGGGAAAG
GAAGGCCGCTCCATTTCTTAGTGGTGAGAGGATAGAAAATAGGTTGAAGAAGCCAACAACATTTTTGTGCAAGTTGAAGTTCCGGAACGAGCTTCCAGATACAAGTGCTC
AGCCAAAGCTCATGTCACTACGGAAAGAGAAAGATCACTATACAAGATATACAATCACATCGCTAGAGAAAACGTACAAACCTCAGCTTTATGTAGAGCCAGATCTTGGA
ATACCTCTCGATTTGCTTGACCTCAGTGTTTACAACCCTCCTAGTGTTAGAATGCCCCTTGCTCCTGAAGATGAGGAATTATTACGGGATGATGTACTGAAAACTCCAGT
TAAAAAGGATGGTGGTATAAAAAGAAAAGAGCGTCCTACTGATAAAGGTGTTGCCTGGCTTGTTAAGACGCAGTACATCTCTCCTCTTAGCATTGAATCGGCGAAACAGT
CTTTGACTGAAAAACAGGCTAAAGAACTGCGAGAAATGAAGGGAGGGCGGAATATTCTTGAGAACCTCAATAATAGGGAAAGGCAAATTAAGGAAATTGAGGCATCATTC
GAGGCATGCAAGTCACGCCCTGTTCATGCCACCAATAAGAATTTATATCCTGTAGAGGTTTTACCTCTTCTACCTGATTTTGATAGGTATGATGATCCATTTGTTGTGGT
GGCGTTTGATAGCGCTCCCACAGCTGATTCAGAGACTTTCAATAAGTTAGATCAATCCATCCGTAACGCTCATGAATCACAGGCGATAATGAAAAGCTACATGGCAACAG
GGTCAGACCCTTCAAAACCTGAGAAATTTCTTGCGTACATGGTTCCCTCTCCAGATGAGCTATCAAAGGATATTTATGATGAACAAGAAGATGTCGCATATTCCTGGGTT
CGTGAGTACCATTGGGATGTACGGGGTGACAATGTGGATGATCCCACTACATATCTTGTTTCATTTGATGATGCAGAAGCTCGTTATGTGCCACTTCCTACAAAGCTTGT
TCTTAGAAAAAAGAGGGCTAAAGAAGGGAGATCTAGTGATGAGGTTGAACATTTTCCTGCACCTGCAAGAGTGACTGTAAGGAGAAGACCAACTGTAGCCACTTTGGAAG
TGAAGGATCCAGGGGTTTACTCGAATTCGAAAAGAGGATCAGATATTGAAGATGGTATAGGGAGATCACATAAACATGATAGAAACCAAGACATGGATCAGTACAGTGGA
GCTGAAGACGAGATGTCTGATTGATTTACTTTGGTACAATCAATTAATTTGGCTTTCCCATGGATTGTCTGCCTGATTCAAACGCGGCTATCTGGCAGGGAATCCGCCCA
AAATTTTTTTGAACCACCAGATGTTTATGGTGCTGTGTTGAGTGTGTACGTTACATATCTATTGCTACACTTAACTTTTTTAGTACTTGTTTGCATTATTGAATTTCTTA
ATTTAATCTGTATAGTTTCTTCTCGACACGGCAGGAAAAATGAAATATGAAAGAAGGCTTTACATTAGATACTCCCAATTGTTGTAACAATCTTAACATGATTCTTTTAC
ATGGGCAAAGAATTATCTCGTGTGATGCATGTAAGGATTCTGTACGTGTGTTTGATAATAAAATTCCATTATTTTAGTGGATCATAAACTTCCATCCCCCATCTTTGTAT
TTTTT
Protein sequence
MEKTQLRRQLANIRQSLFDQGFLDEQFVQLEELQDDANPNFVEEIVTLYYRDSSRLILSIEQALQKSPLDFNKLDALMHQFKGSSSSIGAKKVKAECTQLREYCKAGSGE
GCLRTFQQLKKEYTTLRKKLEAYFQGFGDFLFGRRVSDDWPRNLTQLQRTVDFAIFLSSRVLLRPVFGGDIAMASYRPYPPQSSFGPAPSQNPIPPPPAQSAAVPPQQRG
GGSQYNQNWGGYGGDGSAPPAPSSSYPQNYNQLHQSSNYHQQHYGPPRSQHPPPPPPPHQSYPYAPQPPPPPPPPPDSSYPPPPPPPAPSQLPNQYYPPSQYSQGNQNQQ
SMQPPPPPSSPPPSSSIPPPPPPNSPPPPSAPQQKAEGTNMGAHERDKGVSKDPSYGRRERENSNHDKHQRHSGPPMPPKKANGPSGRMETDDEKRLRKKREFEKQRQDE
RHRHHLKESQNTILQKTQMISTGKGHGSIVGSRMGERKAAPFLSGERIENRLKKPTTFLCKLKFRNELPDTSAQPKLMSLRKEKDHYTRYTITSLEKTYKPQLYVEPDLG
IPLDLLDLSVYNPPSVRMPLAPEDEELLRDDVLKTPVKKDGGIKRKERPTDKGVAWLVKTQYISPLSIESAKQSLTEKQAKELREMKGGRNILENLNNRERQIKEIEASF
EACKSRPVHATNKNLYPVEVLPLLPDFDRYDDPFVVVAFDSAPTADSETFNKLDQSIRNAHESQAIMKSYMATGSDPSKPEKFLAYMVPSPDELSKDIYDEQEDVAYSWV
REYHWDVRGDNVDDPTTYLVSFDDAEARYVPLPTKLVLRKKRAKEGRSSDEVEHFPAPARVTVRRRPTVATLEVKDPGVYSNSKRGSDIEDGIGRSHKHDRNQDMDQYSG
AEDEMSD