; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007996 (gene) of Snake gourd v1 genome

Gene IDTan0007996
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlus3 domain-containing protein
Genome locationLG01:12973466..12977639
RNA-Seq ExpressionTan0007996
SyntenyTan0007996
Gene Ontology termsGO:0016593 - Cdc73/Paf1 complex (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:1990269 - RNA polymerase II C-terminal domain phosphoserine binding (molecular function)
InterPro domainsIPR004343 - Plus-3 domain
IPR036128 - Plus3-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149150.1 protein RTF1 homolog [Cucumis sativus]0.0e+0094.85Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDD GSQEEG+ EDVGS+REGDSSD
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGNAN RRFSPTKRKPFTAPSLSSSSQSESESRFQS+DEGSTGDGGMIDSDDERS+PGSDGPTFEDIKE+TIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE SAARWQMAMVSDS PLEDEYKQW+KEVERTGGRMLSKQDILEKKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQKVNNFVYSAATVKQMLQ+KKSAS+RPLNIAAEKDRLRREMDVA+SKNDE EVERIK RLQQLEASRRLQMKDAKAIRL EMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEA  NSDNV PA E  RT +G  ++AGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLAMLQKFGG LGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

XP_023522520.1 protein RTF1 homolog isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0094.85Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDDAGSQEEGD EDVGSDREGDSS+
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAK DKGK+APSRKET PLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGNAN RRFSPTKRKPFTAPSLSSS  SESESRFQSEDE STGDGGMIDSDDERSM G  GPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSG IYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQMAMVSDS PLEDEYKQWLKEVERT GRMLSKQD+LEKKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQK NNFVYSAATVKQML+EKKSASSRPLNIAAEKDRLRREMDVALSKNDE EVERIKARLQQLEASRRLQMKD KAIRLVEMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYV NAGEANGAAEAG NSDN +PASETNRTGSG  AEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLA+LQKFGGP+GAQAGFLARKQ+IEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

XP_023522521.1 protein RTF1 homolog isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0094.85Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDDAGSQEEGD EDVGSDREGDSS+
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAK DKGK+APSRKET PLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGNAN RRFSPTKRKPFTAPSLSSS  SESESRFQSEDE STGDGGMIDSDDERSM G  GPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSG IYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQMAMVSDS PLEDEYKQWLKEVERT GRMLSKQD+LEKKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQK NNFVYSAATVKQML+EKKSASSRPLNIAAEKDRLRREMDVALSKNDE EVERIKARLQQLEASRRLQMKD KAIRLVEMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYV NAGEANGAAEAG NSDN +PASETNRTGSG  AEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLA+LQKFGGP+GAQAGFLARKQ+IEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

XP_023534918.1 protein RTF1 homolog [Cucurbita pepo subsp. pepo]0.0e+0094.7Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDDAGSQEEGD EDVGSDREGDSS+
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAK DKGK+APSRKET PLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGNAN RRFSPTKRKPFTAPSLSSS  SESESRFQSEDE STGDGGMIDSDDERSM G  GPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSG IYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQMAM+SDS PLEDEYKQWLKEVERT GRMLSKQD+LEKKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQK NNFVYSAATVKQML+EKKSASSRPLNIAAEKDRLRREMDVALSKNDE EVERIKARLQQLEASRRLQMKD KAIRLVEMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYV NAGEANGAAEAG NSDN +PASETNRTGSG  AEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLA+LQKFGGP+GAQAGFLARKQ+IEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

XP_038894309.1 protein RTF1 homolog [Benincasa hispida]0.0e+0094.85Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDD GSQEEG+ EDVGS+REGDSSD
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGK+APSRK+TPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGNAN RRFSPTKRKP TAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQMAMVSDS PLEDEYKQW+KEVE+TGGRMLSKQDIL+KKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQKVNNFVYSAATVKQMLQ+KKSAS+RPLNIAAEKDRLRREMDVALSKNDE EVERIKARLQQL+ASR+LQMKDAKAIRL EMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEAN AAEA  NSDNV PA E+ RTGSG  ++AGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLAMLQKFGG LGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

TrEMBL top hitse value%identityAlignment
A0A0A0KXW1 Plus3 domain-containing protein0.0e+0094.85Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDD GSQEEG+ EDVGS+REGDSSD
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGNAN RRFSPTKRKPFTAPSLSSSSQSESESRFQS+DEGSTGDGGMIDSDDERS+PGSDGPTFEDIKE+TIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE SAARWQMAMVSDS PLEDEYKQW+KEVERTGGRMLSKQDILEKKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQKVNNFVYSAATVKQMLQ+KKSAS+RPLNIAAEKDRLRREMDVA+SKNDE EVERIK RLQQLEASRRLQMKDAKAIRL EMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEA  NSDNV PA E  RT +G  ++AGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLAMLQKFGG LGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

A0A1S3BXS8 protein RTF1 homolog0.0e+0093.64Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNA G NRHSHPPSRRQREGSYSD GSDSRDDDSDD+RGYASRKPSGSQVPLKKRLDP ERDDD GS EEG+ EDVGS+ EGDSSD
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGK+APSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGN+N RRFSPTKRKPFTAPSLSSSSQSESESRFQS+DEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE SAARWQMAMVSDS PLEDEYKQW+KEVERTGGRMLSKQD+LEKK+A
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQKVNNFVYSAATVKQMLQ+KKSAS+RPLNIAAEKDRLRREMDVA+SKNDE EVERIK RLQQLEASRRLQMKDAKAIRL EMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEA  NSD V PA E+ RTG+G  ++AGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLAMLQKFGG LGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

A0A5A7TNN4 Protein RTF1-like protein0.0e+0093.64Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNA G NRHSHPPSRRQREGSYSD GSDSRDDDSDD+RGYASRKPSGSQVPLKKRLDP ERDDD GS EEG+ EDVGS+ EGDSSD
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGK+APSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGN+N RRFSPTKRKPFTAPSLSSSSQSESESRFQS+DEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE SAARWQMAMVSDS PLEDEYKQW+KEVERTGGRMLSKQD+LEKK+A
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQKVNNFVYSAATVKQMLQ+KKSAS+RPLNIAAEKDRLRREMDVA+SKNDE EVERIK RLQQLEASRRLQMKDAKAIRL EMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEA  NSD V PA E+ RTG+G  ++AGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLAMLQKFGG LGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

A0A6J1FPT3 protein RTF1 homolog0.0e+0094.24Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDDAGSQEEGD EDVGSDREGDSS+
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAK DKGK+APSRKE  PLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGNAN RRFSPTKRKPFTAPSLSSS  SESESRFQSEDE STGDGGM+DSDDERSM G  GPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSG IYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQMAMVSDS PLEDEYKQWLKEVERT GRMLSKQD+LEKKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQK NNFVYSAATVKQML+EKKSASSRPLNIAAEKDRLRREMDVALSKNDE EVERIKARLQQLEASRRLQMKD KAIRLVEMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYV NAGEANGAAEAG NSDN +P SETNRTGSG  AEAGMAATAAALEAAAGAGKLVDT+APVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLA+LQKFGGP+GAQAGFLARKQ+IEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

A0A6J1ITJ0 protein RTF1 homolog0.0e+0094.24Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDP ERDDDAGSQEEGD EDVGSDREGDSS+
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
        ESDVGDDLYKD+DDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAK DKGK+APSRKET PLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP

Query:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI
        EAHRKL+DASRGN N RRFSPTKRKPFTAPSLSSS  SESESRFQSEDE STGDGGM+DSDDERSM G  GPTFEDIKEITIRRSKLAKWLMEPFFEELI
Subjt:  EAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI

Query:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA
        VGCFVRVGIGRSRSG IYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQMAMVSDS PLEDEYKQWLKEVERT GRMLS+QD+LEKKEA
Subjt:  VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEA

Query:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP
        IQK NNFVYSAATVKQML++KKSASSRPLNIAAEKDRLRREMDVALSKNDE EVERIKARLQQLEASRRLQMKD KAIRLVEMNRKNRVENFKNASELRP
Subjt:  IQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRP

Query:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
        +KDLKAGEAGYDPFSRRWTRSRNYYV NAGEANGAAEAG NSDN +PASETNRTGSG  AEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE
Subjt:  MKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE

Query:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        LPISLA+LQKFGGP+GAQAGFLARKQ+IEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Subjt:  LPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

SwissProt top hitse value%identityAlignment
A2AQ19 RNA polymerase-associated protein RTF1 homolog1.2e-2226.05Show/hide
Query:  AGSDSRDDDSDDDRGYAS--RKPSGSQVPLKKR------------LDPPERDDDAGSQ--EEGDGED-----VGSDREGDSSDESD-----VGDDLYKDE
        A SDS   DSDD+  + S   K  G    ++K+                +RD  A S   EEG+  D       S  + DSS E +      G+DL  DE
Subjt:  AGSDSRDDDSDDDRGYAS--RKPSGSQVPLKKR------------LDPPERDDDAGSQ--EEGDGED-----VGSDREGDSSDESD-----VGDDLYKDE

Query:  DDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKL
        +DR +L  M+E +RE  L +R  K    K    + + L+    K K    +K+       ++     S   +  K     E R+KR ++ D   +A  +L
Subjt:  DDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKL

Query:  KDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDER-SMPGSDGPTF--EDIKEITIRRSKLAKWLMEPFFEELIVGC
        K       N+      K++P     + S  + E +    SE    +      D ++E+  +P    P    E++  + + R KL +W   PFF + + GC
Subjt:  KDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDER-SMPGSDGPTF--EDIKEITIRRSKLAKWLMEPFFEELIVGC

Query:  FVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE-AIQ
        FVR+GIG   S P+YR+  +  V   E  + Y+L    T+K L +  GN+    R  +  VS+    E E+ +W KE   + G  L   D + KKE +I+
Subjt:  FVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE-AIQ

Query:  KVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQL----EASRRLQMKDAKAIRLVEMNRKNRVENFKNASEL
        +  N+ ++   ++++++EK+     P N A +K +L +E  +A    D+ + ++I+ +L +L    EA  R + K+  AI  +  N++NR  N   + + 
Subjt:  KVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQL----EASRRLQMKDAKAIRLVEMNRKNRVENFKNASEL

Query:  RPMKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPV--DGGTESNSL
           +         DPF+RR  + +   VSN+ +   A +A       I A    + GSG + +       A  E + G GK  D N+    D   +   +
Subjt:  RPMKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPV--DGGTESNSL

Query:  HNFELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        H+F++ I L +               + + +  A   +  P  DG     +L + DYK+RRGL+
Subjt:  HNFELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

G5EBY0 RNA polymerase-associated protein RTF1 homolog3.2e-1524.37Show/hide
Query:  DDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGE----DVGSDREGDSSDESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDK
        D DSD D G    KP  +        D    D DA   +    +         R   SSD+  V DDL+ D++D+ +   ++EL++E  + +R   + + 
Subjt:  DDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGE----DVGSDREGDSSDESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDK

Query:  HLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSA--DRAAAKDDALNELRAKRLKQQDPEAHRKLKDASRGNANKRRFSPTKRKP---------FTA
           E +  ++ K     S K        ++ S    A   +  A  D+ +E+ A   +  D     K K+A     NKR+    K            F A
Subjt:  HLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSA--DRAAAKDDALNELRAKRLKQQDPEAHRKLKDASRGNANKRRFSPTKRKP---------FTA

Query:  PSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGR-SRSGPIYRLCLVRNVDA
         S SSSS S SES   S    S+ +       ++  +   D     +++   + R KL+  +  PFF+  +VGC+VR+G G+ S SG  YR+  +  V+ 
Subjt:  PSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGR-SRSGPIYRLCLVRNVDA

Query:  TEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILE-KKEAIQKVNNFVYSAATVKQMLQEKKSASSR
         E ++ Y+LE K T+K +     N  S   ++M  VS++   + E+ +WL   +R G   L   DI++ KK+ I+K  N  YS   V  M++EK    + 
Subjt:  TEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILE-KKEAIQKVNNFVYSAATVKQMLQEKKSASSR

Query:  PLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEAS----RRLQMKDAKAIRLVEMNRKNRVENFKNASELRPMKDLKAGEAGYDPFSRRWTRSRN
        P N A  K    ++ ++A  + D  E E+I+ ++ ++E       + + K   AI  +    ++++++   + +L+  ++ +      DPF+R+    R 
Subjt:  PLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEAS----RRLQMKDAKAIRLVEMNRKNRVENFKNASELRPMKDLKAGEAGYDPFSRRWTRSRN

Query:  YYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFELPISLAMLQKFGGP
            +    +G   A  ++ N+   S+  +  S  +A+      +  ++                  T+ +SLH+F+L I L  L+ F  P
Subjt:  YYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFELPISLAMLQKFGGP

Q5RAD5 RNA polymerase-associated protein RTF1 homolog (Fragment)6.3e-1925.5Show/hide
Query:  LLLEAAGRTNAAGRNRHSHPPSRRQREGSYSD----------AGSDSRDDDSDDDRGYAS--RKPSGSQVPLKKR------------LDPPERDDDAGSQ
        +++++    + +  N     PS  +R+ S S+          A SDS   DSDD+  + S   K  G    ++K+                ++D  A S 
Subjt:  LLLEAAGRTNAAGRNRHSHPPSRRQREGSYSD----------AGSDSRDDDSDDDRGYAS--RKPSGSQVPLKKR------------LDPPERDDDAGSQ

Query:  --EEGDGEDVGSDREGDSSDESD----------VGDDLYKDEDDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRAKMDKGKSAPSRKETPPLP
          EEG+  D  S+    SSD              G+DL  DE+DR +L  M+E +RE  L +R  K    K    + + L+    K K    +K+     
Subjt:  --EEGDGEDVGSDREGDSSDESD----------VGDDLYKDEDDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRAKMDKGKSAPSRKETPPLP

Query:  SSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDER
          ++     S   +  K     E R+KR ++ D   +A  +LK       N+      K++P     + S  + E E    SE    +      D ++E+
Subjt:  SSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDER

Query:  -SMPGSDGPTF--EDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQM
          +P    P    E++  + + R KL +W   PFF + + GCFVR+GIG   S P+YR+  +  V   E  + Y+L    T+K L +  GN+    R  +
Subjt:  -SMPGSDGPTF--EDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQM

Query:  AMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE-AIQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKAR
          VS+    E E+ +W KE   + G  L   D + KKE +I++  N+ ++   ++++++EK+     P N A +K +L +E  +A    D+ + ++I+ +
Subjt:  AMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE-AIQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKAR

Query:  LQQL----EASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRPMKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGS
        L +L    EA  R + K+  AI  +  N++NR  N   + +    +         DPF+RR  + +   VSN+ +   A +A       I A    + GS
Subjt:  LQQL----EASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRPMKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGS

Query:  GPIAEAGMAATAAALEAAAGAGKLVDTN--APVDGGTESNSLHNFELPISL
        G + +       A  E + G GK  D N  +  D   +   +H+F++ I L
Subjt:  GPIAEAGMAATAAALEAAAGAGKLVDTN--APVDGGTESNSLHNFELPISL

Q92541 RNA polymerase-associated protein RTF1 homolog9.3e-2325.9Show/hide
Query:  AGSDSRDDDSDDDRGYAS--RKPSGSQVPLKKR------------LDPPERDDDAGSQ--EEGDGEDVGSDREGDSSDESD----------VGDDLYKDE
        A SDS   DSDD+  + S   K  G    ++K+                ++D  A S   EEG+  D  S+    SSD              G+DL  DE
Subjt:  AGSDSRDDDSDDDRGYAS--RKPSGSQVPLKKR------------LDPPERDDDAGSQ--EEGDGEDVGSDREGDSSDESD----------VGDDLYKDE

Query:  DDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKL
        +DR +L  M+E +RE  L +R  K    K    + + L+    K K    +K+       ++     S   +  K     E R+KR ++ D   +A  +L
Subjt:  DDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKL

Query:  KDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDER-SMPGSDGPTF--EDIKEITIRRSKLAKWLMEPFFEELIVGC
        K       N+      K++P     + S  + E E    SE    +      D ++E+  +P    P    E++  + + R KL +W   PFF + + GC
Subjt:  KDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDER-SMPGSDGPTF--EDIKEITIRRSKLAKWLMEPFFEELIVGC

Query:  FVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE-AIQ
        FVR+GIG   S P+YR+  +  V   E  + Y+L    T+K L +  GN+    R  +  VS+    E E+ +W KE   + G  L   D + KKE +I+
Subjt:  FVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE-AIQ

Query:  KVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQL----EASRRLQMKDAKAIRLVEMNRKNRVENFKNASEL
        +  N+ ++   ++++++EK+     P N A +K +L +E  +A    D+ + ++I+ +L +L    EA  R + K+  AI  +  N++NR  N   + + 
Subjt:  KVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQL----EASRRLQMKDAKAIRLVEMNRKNRVENFKNASEL

Query:  RPMKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTN--APVDGGTESNSL
           +         DPF+RR  + +   VSN+ +   A +A       I A    + GSG + +       A  E + G GK  D N  +  D   +   +
Subjt:  RPMKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTN--APVDGGTESNSL

Query:  HNFELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        H+F++ I L +               + + +  A   +  P  DG     +L + DYK+RRGL+
Subjt:  HNFELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

Q9C950 Protein RTF1 homolog5.3e-23670.54Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        M DLENLLLEAAGRTN+AGR+R  HPPS R+REGSYSD  SDSR DDSD+DRGYASRKPSGSQVPLKKRL+  ER+D A   E G G D  SDREGDSS+
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSR-IRSSARSADRAAAKDDALNELRAKRLKQQD
        ESD GDDLYK+E+DR+KLAGM+E QREMILS+RA KK DK+  E LR+K +  K+  S+KET PLP+SR +RSSARSADRAAAKDDALNELRAKR+KQQD
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSR-IRSSARSADRAAAKDDALNELRAKRLKQQD

Query:  PEAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEEL
        P A RKL+DAS+G +  R FS TKRKP  + +LSSSSQS+S+SR QS+DEGS  +GGM+DSDD+R    SD PTFED+KE+TIRRSKLAKWLMEPFFEEL
Subjt:  PEAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEEL

Query:  IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE
        IVGCFVRVGIGRS+SGPIYRLC V+NVDAT+PD+ YKLENK THKYLNV+WGNETSAARWQMAM+SD  PLE+EY+QW++EVERT GRM +KQDI EKKE
Subjt:  IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE

Query:  AIQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELR
        AIQ+ N+FVYSA TVKQMLQEKKSAS RP+N+AAEKDRLR+E+++A SKNDE  VERIK++++QL+ASR  +  D KA++L EMN+KNR ENFKNASE++
Subjt:  AIQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELR

Query:  PM-KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHN
         +   LKAGEAGYDPFSRRWTRS NYY       N   +  EN   V  A ETN    G  A AG+ AT AALEAAA AGKL+DT AP+  G E N LHN
Subjt:  PM-KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHN

Query:  FELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        FEL +SL  LQK+GGP G Q  F+ARKQ  EATVG +V ENDG+RH LTLTVSDYKRRRGLL
Subjt:  FELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL

Arabidopsis top hitse value%identityAlignment
AT1G61040.1 plus-3 domain-containing protein3.8e-23770.54Show/hide
Query:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD
        M DLENLLLEAAGRTN+AGR+R  HPPS R+REGSYSD  SDSR DDSD+DRGYASRKPSGSQVPLKKRL+  ER+D A   E G G D  SDREGDSS+
Subjt:  MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSD

Query:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSR-IRSSARSADRAAAKDDALNELRAKRLKQQD
        ESD GDDLYK+E+DR+KLAGM+E QREMILS+RA KK DK+  E LR+K +  K+  S+KET PLP+SR +RSSARSADRAAAKDDALNELRAKR+KQQD
Subjt:  ESDVGDDLYKDEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSR-IRSSARSADRAAAKDDALNELRAKRLKQQD

Query:  PEAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEEL
        P A RKL+DAS+G +  R FS TKRKP  + +LSSSSQS+S+SR QS+DEGS  +GGM+DSDD+R    SD PTFED+KE+TIRRSKLAKWLMEPFFEEL
Subjt:  PEAHRKLKDASRGNANKRRFSPTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEEL

Query:  IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE
        IVGCFVRVGIGRS+SGPIYRLC V+NVDAT+PD+ YKLENK THKYLNV+WGNETSAARWQMAM+SD  PLE+EY+QW++EVERT GRM +KQDI EKKE
Subjt:  IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKE

Query:  AIQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELR
        AIQ+ N+FVYSA TVKQMLQEKKSAS RP+N+AAEKDRLR+E+++A SKNDE  VERIK++++QL+ASR  +  D KA++L EMN+KNR ENFKNASE++
Subjt:  AIQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRREMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELR

Query:  PM-KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHN
         +   LKAGEAGYDPFSRRWTRS NYY       N   +  EN   V  A ETN    G  A AG+ AT AALEAAA AGKL+DT AP+  G E N LHN
Subjt:  PM-KDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASETNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHN

Query:  FELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
        FEL +SL  LQK+GGP G Q  F+ARKQ  EATVG +V ENDG+RH LTLTVSDYKRRRGLL
Subjt:  FELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGATCTAGAAAATTTGCTTCTGGAGGCTGCAGGAAGAACTAATGCAGCAGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAGGGTTCATATTC
TGATGCTGGAAGTGACTCTAGGGATGATGACTCAGATGACGATCGTGGTTATGCAAGCAGGAAGCCTTCTGGATCTCAGGTTCCTCTGAAAAAGAGGCTAGATCCTCCTG
AAAGAGATGATGATGCAGGAAGCCAAGAAGAAGGGGATGGAGAAGATGTTGGTTCAGATCGTGAGGGTGACAGCAGTGATGAATCTGATGTTGGGGATGATCTTTATAAA
GATGAAGATGACAGGCGGAAGCTTGCTGGTATGTCTGAACTTCAAAGGGAGATGATTCTGTCAGACAGAGCGTCGAAGAAGAATGATAAGCATTTATATGAAAGCTTAAG
AGCTAAGATGGATAAAGGGAAGAGTGCTCCATCTCGGAAAGAGACCCCACCTCTCCCATCATCTCGTATTAGATCGTCAGCCAGATCTGCTGACAGAGCAGCTGCGAAAG
ATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAGCAGCAAGACCCCGAGGCTCATCGCAAATTGAAAGATGCATCTAGAGGGAATGCAAATAAGCGAAGGTTCTCA
CCAACAAAACGAAAGCCCTTCACTGCTCCTAGTTTGAGTAGCTCAAGCCAAAGTGAAAGTGAGAGTAGGTTTCAAAGTGAAGATGAAGGTTCTACAGGAGACGGCGGAAT
GATTGACAGTGATGATGAAAGATCCATGCCTGGTTCAGATGGGCCAACATTTGAAGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAAC
CATTCTTTGAGGAGTTGATAGTTGGATGCTTTGTGAGAGTTGGAATCGGAAGATCAAGATCTGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAA
CCTGATCGTCAGTACAAACTAGAGAACAAAATCACACATAAATATCTTAATGTTATTTGGGGAAATGAAACTTCTGCTGCCAGGTGGCAGATGGCCATGGTGTCGGACTC
TGTACCACTCGAGGATGAATATAAACAGTGGCTTAAGGAGGTAGAGCGAACTGGTGGTCGGATGCTGAGCAAGCAGGATATATTGGAAAAGAAGGAAGCTATACAGAAAG
TCAACAACTTTGTCTACTCAGCAGCCACAGTGAAGCAGATGTTGCAGGAGAAAAAATCTGCCTCGTCAAGGCCACTAAATATTGCAGCTGAGAAGGACCGGCTGAGGAGA
GAGATGGATGTAGCACTAAGCAAAAACGATGAAGTTGAGGTGGAGAGGATCAAGGCAAGGCTGCAGCAATTAGAGGCATCCAGGAGGTTGCAGATGAAAGATGCCAAGGC
TATTAGGTTAGTTGAGATGAACAGGAAGAACAGGGTGGAGAACTTCAAAAATGCATCAGAACTGAGACCCATGAAAGATTTGAAAGCTGGAGAGGCTGGATACGATCCCT
TCTCAAGGAGATGGACCAGGTCGAGGAATTACTATGTTTCAAATGCTGGTGAAGCCAATGGGGCTGCAGAAGCAGGTGAGAACAGTGATAATGTAATTCCTGCATCAGAG
ACTAATAGAACAGGATCTGGTCCAATTGCAGAAGCTGGCATGGCAGCTACAGCAGCGGCTTTGGAAGCTGCTGCTGGGGCTGGAAAGTTGGTCGATACTAATGCTCCTGT
AGATGGAGGGACGGAATCAAACTCACTGCACAACTTTGAGCTGCCTATATCATTGGCTATGCTTCAGAAATTTGGTGGACCGTTGGGAGCTCAGGCTGGGTTCTTAGCCA
GGAAACAGCGGATAGAAGCCACAGTTGGACGTCAAGTCCCTGAGAATGATGGTAGGCGGCATGCACTGACACTGACGGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTT
TGA
mRNA sequenceShow/hide mRNA sequence
ATACGATAAAATCCTCCTCCTTCCTTCCATCTGCCTCCATTTGAAGCCCTACTTTTCTTCGTTTCTTCGATCTTTTTTTCGCAGATCTGGCTCCGTCGTCTTCTCCGTTG
CCGTTCCGGTGGTTGTAGATCTACACGTTGATAGTGGTTTCTTGCCGATTTGGTCTCTGGTTACGTATTTTGATTCTGACAGGTTCATGTCCAGATCTGATTTCTCCTCT
TCTCTGTGTATTTTCTGACGGAGAAGCTTGGCTGGTTCTATTGTTCTTTTGTTTTCGTGTGGTATTTTGCATATTTGCGACGATTGTGGGATTTCCGTTTGAATACGCGC
CAATCCGGGGTCTGATCCTTGATTTCGGATTGGAGGGCAAGACACAGATTTTGGTGCCAGAGGCTGCAAAGCATGGCAGATCTAGAAAATTTGCTTCTGGAGGCTGCAGG
AAGAACTAATGCAGCAGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAGGGTTCATATTCTGATGCTGGAAGTGACTCTAGGGATGATGACTCAGATG
ACGATCGTGGTTATGCAAGCAGGAAGCCTTCTGGATCTCAGGTTCCTCTGAAAAAGAGGCTAGATCCTCCTGAAAGAGATGATGATGCAGGAAGCCAAGAAGAAGGGGAT
GGAGAAGATGTTGGTTCAGATCGTGAGGGTGACAGCAGTGATGAATCTGATGTTGGGGATGATCTTTATAAAGATGAAGATGACAGGCGGAAGCTTGCTGGTATGTCTGA
ACTTCAAAGGGAGATGATTCTGTCAGACAGAGCGTCGAAGAAGAATGATAAGCATTTATATGAAAGCTTAAGAGCTAAGATGGATAAAGGGAAGAGTGCTCCATCTCGGA
AAGAGACCCCACCTCTCCCATCATCTCGTATTAGATCGTCAGCCAGATCTGCTGACAGAGCAGCTGCGAAAGATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAG
CAGCAAGACCCCGAGGCTCATCGCAAATTGAAAGATGCATCTAGAGGGAATGCAAATAAGCGAAGGTTCTCACCAACAAAACGAAAGCCCTTCACTGCTCCTAGTTTGAG
TAGCTCAAGCCAAAGTGAAAGTGAGAGTAGGTTTCAAAGTGAAGATGAAGGTTCTACAGGAGACGGCGGAATGATTGACAGTGATGATGAAAGATCCATGCCTGGTTCAG
ATGGGCCAACATTTGAAGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAACCATTCTTTGAGGAGTTGATAGTTGGATGCTTTGTGAGA
GTTGGAATCGGAAGATCAAGATCTGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAACCTGATCGTCAGTACAAACTAGAGAACAAAATCACACA
TAAATATCTTAATGTTATTTGGGGAAATGAAACTTCTGCTGCCAGGTGGCAGATGGCCATGGTGTCGGACTCTGTACCACTCGAGGATGAATATAAACAGTGGCTTAAGG
AGGTAGAGCGAACTGGTGGTCGGATGCTGAGCAAGCAGGATATATTGGAAAAGAAGGAAGCTATACAGAAAGTCAACAACTTTGTCTACTCAGCAGCCACAGTGAAGCAG
ATGTTGCAGGAGAAAAAATCTGCCTCGTCAAGGCCACTAAATATTGCAGCTGAGAAGGACCGGCTGAGGAGAGAGATGGATGTAGCACTAAGCAAAAACGATGAAGTTGA
GGTGGAGAGGATCAAGGCAAGGCTGCAGCAATTAGAGGCATCCAGGAGGTTGCAGATGAAAGATGCCAAGGCTATTAGGTTAGTTGAGATGAACAGGAAGAACAGGGTGG
AGAACTTCAAAAATGCATCAGAACTGAGACCCATGAAAGATTTGAAAGCTGGAGAGGCTGGATACGATCCCTTCTCAAGGAGATGGACCAGGTCGAGGAATTACTATGTT
TCAAATGCTGGTGAAGCCAATGGGGCTGCAGAAGCAGGTGAGAACAGTGATAATGTAATTCCTGCATCAGAGACTAATAGAACAGGATCTGGTCCAATTGCAGAAGCTGG
CATGGCAGCTACAGCAGCGGCTTTGGAAGCTGCTGCTGGGGCTGGAAAGTTGGTCGATACTAATGCTCCTGTAGATGGAGGGACGGAATCAAACTCACTGCACAACTTTG
AGCTGCCTATATCATTGGCTATGCTTCAGAAATTTGGTGGACCGTTGGGAGCTCAGGCTGGGTTCTTAGCCAGGAAACAGCGGATAGAAGCCACAGTTGGACGTCAAGTC
CCTGAGAATGATGGTAGGCGGCATGCACTGACACTGACGGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTTTGAAATTCGATGGCTGCAATTAAGCAGTCTGCTCCGAC
ATAGGTGTTCTGATCTTTGAGCATTGGATTTGGCATATAAATAATTATTGAGTTCGAAAAAAGTCCCTTAATTTTAGTAAAATTCTGCCTGATTGTGGGAAATATTGAGC
CTTGAAGAAATCTGTTTACTTGTTGTTGGTCTTCATGACCAATGCCGCTGCCCCCATTGATTTGCCAAGTCTGAAACAGTTGAGACTATCCAATCCAATAGAAGAAATGA
AAACATATATGGTTTGAATTTCCTCTCATGTCTCAGATTAAATCCAGTCACGGTAAAACACTCTTCTGTTACTTCTTAGGGTTATTTAATCATGATCTTAAGAATAGTTT
TACCTAACTTTCTACCACTAGAGCTCAGATCTTGTGTATGTAGCAGCAATGATAATTGTATAGGTTGACTTATGTAGCAGCAATCTATTTTTCATCTAATTTTTTAAAAA
TTTAGAGTGACTGATTGGATTTGATATTCTTGGTTACATTACAACTTTTACAACTGTGTACGAGTTTTGGGGATTGAAAAATATTTCTGATTTCTAGAAAACATTAAGGG
GGCGTTTGGATTGCAGAGTTGGGTTGGGTTGAGTTGTGCAGAGTTGTTTTGTCAGTGAGTTCTGGAAACTAATGTTTTGTAAAGTGAATAAGTCTGTTTTTTTGTGATTT
TTTTTTTACTTTTTTTGTTTCCAC
Protein sequenceShow/hide protein sequence
MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPSGSQVPLKKRLDPPERDDDAGSQEEGDGEDVGSDREGDSSDESDVGDDLYK
DEDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDPEAHRKLKDASRGNANKRRFS
PTKRKPFTAPSLSSSSQSESESRFQSEDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLCLVRNVDATE
PDRQYKLENKITHKYLNVIWGNETSAARWQMAMVSDSVPLEDEYKQWLKEVERTGGRMLSKQDILEKKEAIQKVNNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRR
EMDVALSKNDEVEVERIKARLQQLEASRRLQMKDAKAIRLVEMNRKNRVENFKNASELRPMKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAGENSDNVIPASE
TNRTGSGPIAEAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFELPISLAMLQKFGGPLGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL