CuGenDBv2

CSPI01G12920 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G12920
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Chr1:8416335..8419041
RNA-Seq Expression: CSPI01G12920
Synteny: CSPI01G12920
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily
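The genome location above is given as a chromosome name followed by a start..end coordinate pair. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page itself does not state the convention), the string can be split into its parts and the span length computed. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr1:8416335..8419041")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2707 positions on Chr1; with a 0-based, half-open convention the length would be one less.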


Homology
GenBank top hits | e value | % identity
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] | 0.0e+00 | 86.43
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYL+SPD
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
        VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS

Query:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD
        YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLV RPAGKKAIGCKWVFAVK+NPDGTVARLKARLVAKGYAQIYGTDYSD
Subjt:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD

Query:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY
                                                                                                            
Subjt:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY

Query:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
                  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
Subjt:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM

Query:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
        KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
Subjt:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM

Query:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
        PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
Subjt:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST

Query:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
        SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
Subjt:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV

Query:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus] | 0.0e+00 | 86.43
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYL+SPD
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
        VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS

Query:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD
        YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLV RPAGKKAIGCKWVFAVK+NPDGTVARLKARLVAKGYAQIYGTDYSD
Subjt:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD

Query:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY
                                                                                                            
Subjt:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY

Query:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
                  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
Subjt:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM

Query:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
        KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
Subjt:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM

Query:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
        PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
Subjt:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST

Query:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
        SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
Subjt:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV

Query:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus] | 0.0e+00 | 86.43
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYL+SPD
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
        VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS

Query:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD
        YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLV RPAGKKAIGCKWVFAVK+NPDGTVARLKARLVAKGYAQIYGTDYSD
Subjt:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD

Query:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY
                                                                                                            
Subjt:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY

Query:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
                  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
Subjt:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM

Query:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
        KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
Subjt:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM

Query:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
        PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
Subjt:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST

Query:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
        SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
Subjt:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV

Query:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus] | 0.0e+00 | 86.43
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYL+SPD
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
        VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS

Query:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD
        YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLV RPAGKKAIGCKWVFAVK+NPDGTVARLKARLVAKGYAQIYGTDYSD
Subjt:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD

Query:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY
                                                                                                            
Subjt:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY

Query:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
                  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
Subjt:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM

Query:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
        KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
Subjt:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM

Query:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
        PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
Subjt:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST

Query:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
        SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
Subjt:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV

Query:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus] | 0.0e+00 | 86.43
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYL+SPD
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
        VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFIS

Query:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD
        YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLV RPAGKKAIGCKWVFAVK+NPDGTVARLKARLVAKGYAQIYGTDYSD
Subjt:  YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSD

Query:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY
                                                                                                            
Subjt:  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGY

Query:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
                  TFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM
Subjt:  AQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGM

Query:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
        KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM
Subjt:  KKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMM

Query:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
        PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST
Subjt:  PNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRST

Query:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
        SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV
Subjt:  SGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLV

Query:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  STGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

TrEMBL top hits | e value | % identity
A0A438G5Y3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.6e-288 | 59.98
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        M V K FW DA+STACFLINRMPS+VLN +IPY +LFP K LFP+ P+IFG  C+VRDVRP  TKLDPK+LKC+FLGYSR+QKGYRC+ P L +Y++S D
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSS-PSSLCQGEDDNLFIYEVTSPTPSLSTD------------VPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSD-DLPIALRK
        VVF EDTPF SS P+S  +GE +N  IY+ T  TPS STD              P++P I Q YSR    +  D+CP     SS    PSD DLPI LRK
Subjt:  VVFFEDTPFTSS-PSSLCQGEDDNLFIYEVTSPTPSLSTD------------VPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSD-DLPIALRK

Query:  GKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKAR
        GKR+C   Y +++F+SY QLSPS+ AF+ SL+S SIP ++ EAL+HPGW NAM+EE+ AL+ N TW+LV  P GK  +GCKWVFA+K+NP+G+VARLK R
Subjt:  GKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKAR

Query:  LVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQA
        LVAKGYAQ YG DYSDTFSPVA+L S+RL +S+AA+  W LHQ+DIKNAFLHGDLQEEVYMEQPPGFVAQGE                            
Subjt:  LVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQA

Query:  NGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSP
                           YG                                                               KVC LRKSLYGLKQSP
Subjt:  NGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSP

Query:  RAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLL
        RAWFGKFS+ +  FGM KS  DHSVFYR+S  GI+LLVVY+DDIVITG+D  GISSLK F+  +F+TKDLG+LKYFLG+EV RSK+GI+LSQRKYVLDLL
Subjt:  RAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLL

Query:  SETGKLGAKPSGTPMMPNQQLVK-EGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVE
        +E GK+ AKP  T M+PN  L K +G+   +PERY+RLVGKLNYLTVTRPDIAY+VS+VSQFM +PTV HWAA+EQILCYLK APG GILY +HGH ++E
Subjt:  SETGKLGAKPSGTPMMPNQQLVK-EGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVE

Query:  CFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKH
        CF+DADWAGS+ DRRST+G+CVFVG NLVSWKSKKQNVVSRSSAES+YRAMAQ+ CEI+W++ LL EIG    +  KL CDNQAA+HIASNPV+HERTKH
Subjt:  CFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKH

Query:  IEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        IEVDCHFIREKIQ+ L+ST YVKTGEQLGDI TKALNGT++ Y CNKLGMI+I+APA
Subjt:  IEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 5.5e-297 | 60.44
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        M V K FW DAVSTACFLINRMP+ VL  +IPY+V+ P K LFP+AP+IFGC C+VRD RP   KLDPK+L+C+FLGYSR+QKGYRC+ P L +YL+S D
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP
        VVF EDT F SSP+S    ED+   +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLP
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP

Query:  IALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA
        I+LRKGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VA
Subjt:  IALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA

Query:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG
        RLKARLVA+GYAQ YG DYSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE                       
Subjt:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG

Query:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG
                                YG                                                               KVCRL+K+LYG
Subjt:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG

Query:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY
        LKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKY
Subjt:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY

Query:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG
        VLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   G
Subjt:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG

Query:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH
        HTR+ECFSDADWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+WIHQLL E+G   T+PAKLWCDNQAALHIA+NPV+H
Subjt:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH

Query:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        ERTKHIEVDCHFIREKI++ LVSTGYVKTGEQLGDI TKALNGTR+ Y CNKLGMI+I+APA
Subjt:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A438HEX0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.2e-288 | 60.28
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        M V K FW DAVSTACFLINRMP+ VL G+IPY+V+ P K LF +AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YL+S D
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP
        VVF EDT F SSP+S    ED+   +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLP
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP

Query:  IALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA
        I+LRKGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL DN TW LV  P GKK +GCKWVFAVK+NPDG+VA
Subjt:  IALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA

Query:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG
        RLKARLVA+GYAQ YG DYSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE                       
Subjt:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG

Query:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG
                                YG                                                               KVCRL+K+LYG
Subjt:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG

Query:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY
        LKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKY
Subjt:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY

Query:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG
        VLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   G
Subjt:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG

Query:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH
        HTR+ECFSDADWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+WIHQLL E+G   T+PAKLWCDNQAALHIA+NPV+H
Subjt:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH

Query:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRI
        ERTKHIEVDCHFIREKI++ LVSTGYVKTGEQLGDI TKALNGTR+
Subjt:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRI

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 4.2e-297 | 60.56
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        M V K FW DAVSTACFLINRMP+ VL G+IPY+V+ P K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YL+S D
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP
        VVF EDT F SSP+S    ED+   +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLP
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP

Query:  IALRKGKR--KCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA
        I+LRKGKR  K  Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+N DG+VA
Subjt:  IALRKGKR--KCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA

Query:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG
        RLKARLVA+GYAQ YG DYSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE                       
Subjt:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG

Query:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG
                                YG                                                               KVCRL+K+LYG
Subjt:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG

Query:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY
        LKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKY
Subjt:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY

Query:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG
        VLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   G
Subjt:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG

Query:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH
        HTR+ECFSDADWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAMAQ+ CEI+WIHQLL E+G   T+PAKLWCDNQAALHIA+NP++H
Subjt:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH

Query:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        ERTKHIEVDCHFIREKI++ LVSTGYVKTGEQLGDI TKALNGTR+ Y CNKLGMI+I+APA
Subjt:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

B0FBS2 Uncharacterized protein | 1.7e-298 | 60.67
Query:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        M V K FW DAVSTACFLINRMP+ VL G+IPY+V+ P K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YL+S D
Subjt:  MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP
        VVF EDT F SSP+S    ED+   +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLP
Subjt:  VVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLP

Query:  IALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA
        I+LRKGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VA
Subjt:  IALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVA

Query:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG
        RLKARLVA+GYAQ YG DYSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE                       
Subjt:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG

Query:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG
                                YG                                                               KVCRL+K+LYG
Subjt:  KFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYG

Query:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY
        LKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKY
Subjt:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKY

Query:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG
        VLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   G
Subjt:  VLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG

Query:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH
        HTR+ECFSDADWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+WIHQLL E+G   T+PAKLWCDNQAALHIA+NPV+H
Subjt:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH

Query:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        ERTKHIEVDCHFIREKI++ LVSTGYVKTGEQLGDI TKALNGTR+ Y CNKLGMI+I+APA
Subjt:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

SwissProt top hits | e value | % identity
P04146 Copia protein | 1.1e-89 | 27.73
Query:  VSKIFWVDAVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD
        + K FW +AV TA +LINR+PS  L  + + PY  ++  K  +    ++FG   +V  ++    K D KS K IF+GY     G++ +    ++++++ D
Subjt:  VSKIFWVDAVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPD

Query:  V-----------------VFFEDTP------FTSSPSSLCQGE--------------------------DDNLFIYEVTSPTPS--------LSTDVPPS
        V                 VF +D+       F +    + Q E                          +D+  I +   P  S        L      +
Subjt:  V-----------------VFFEDTP------FTSSPSSLCQGE--------------------------DDNLFIYEVTSPTPS--------LSTDVPPS

Query:  RPLISQVYSRRPPPQPSDS----CPPSMLPSSC----------DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSIPNSVHEA
        +  +++   R+     ++S     P     S            +P  +D + I  R+ +R  T P    ISY++   S    + +  +    +PNS  E 
Subjt:  RPLISQVYSRRPPPQPSDS----CPPSMLPSSC----------DPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSIPNSVHEA

Query:  L---SHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWS
                W+ A+  E+ A   N TW +  RP  K  +  +WVF+VK N  G   R KARLVA+G+ Q Y  DY +TF+PVA+++S R  LS+       
Subjt:  L---SHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWS

Query:  LHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLF
        +HQ+D+K AFL+G L+EE+YM  P G      SD VC+L K++YGLKQ+ R WF  F QA                  +   ++ N  S V +       
Subjt:  LHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLF

Query:  LSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVY
                  ++ LD       G++ E +Y                                                                  +++Y
Subjt:  LSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVY

Query:  VDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMP--NQQLVKEGELCKDPERYRRLV
        VDD+VI   D   +++ K +L  +F   DL ++K+F+GI +   +  IYLSQ  YV  +LS+          TP+    N +L+   E C  P   R L+
Subjt:  VDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMP--NQQLVKEGELCKDPERYRRLV

Query:  GKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDH--GHTRVECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKK
        G L Y+ + TRPD+  +V+++S++ S    + W  ++++L YLK      +++K +     ++  + D+DWAGS  DR+ST+GY       NL+ W +K+
Subjt:  GKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDH--GHTRVECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKK

Query:  QNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKA
        QN V+ SS E+EY A+ ++V E +W+  LL+ I   +  P K++ DNQ  + IA+NP  H+R KHI++  HF RE++Q+ ++   Y+ T  QL DI TK 
Subjt:  QNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKA

Query:  LNGTRISYLCNKLGMI
        L   R   L +KLG++
Subjt:  LNGTRISYLCNKLGMI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.5e-110; % identity: 32.36)
Query:  KIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPDVVFF
        K FW +AV TAC+LINR PS  L  EIP RV +  K +     K+FGC  F    +   TKLD KS+ CIF+GY   + GYR + P  K+ + S DVVF 
Subjt:  KIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPDVVFF

Query:  EDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQL
        E    T++  S  +   + +    VT P+ S +                   P  ++S    +      P    +    L +G  +  +P      +  L
Subjt:  EDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQL

Query:  SPSTYAFITSLESTSI----------PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYA
          S    + S    S           P S+ E LSHP       AM EEM +L  NGT+ LV  P GK+ + CKWVF +K + D  + R KARLV KG+ 
Subjt:  SPSTYAFITSLESTSI----------PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYA

Query:  QIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARL
        Q  G D+ + FSPV K+TSIR  LS+AA+    + QLD+K AFLHGDL+EE+YMEQP GF   G+   VC+L KSLYGLKQ+PR W+ KF          
Subjt:  QIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARL

Query:  KARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKF
                                                                 ++ + Y+                                    
Subjt:  KARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKF

Query:  SQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETG
                  K+ SD  V+++R SE   ++L++YVDD++I G D   I+ LK  L   F  KDLG  +  LG++++R +  + ++LSQ KY+  +L    
Subjt:  SQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETG

Query:  KLGAKPSGTPMMPNQQLVK---------EGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG
           AKP  TP+  + +L K         +G + K P  Y   VG L Y  V TRPDIA++V VVS+F+ +P  +HW AV+ IL YL+   G  + +    
Subjt:  KLGAKPSGTPMMPNQQLVK---------EGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHG

Query:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH
           ++ ++DAD AG  ++R+S++GY     G  +SW+SK Q  V+ S+ E+EY A  ++  E++W+ + L E+G        ++CD+Q+A+ ++ N ++H
Subjt:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH

Query:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKL
         RTKHI+V  H+IRE + D  +    + T E   D+LTK +   +   LC +L
Subjt:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKL

P92519 Uncharacterized mitochondrial protein AtMg00810 (e-value: 2.2e-45; % identity: 40.81)
Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        +VG L YLT+TRPDI+Y+V++V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q 
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAESEYRAMAQSVCEIVW
         VSRSS E+EYRA+A +  E+ W
Subjt:  VVSRSSAESEYRAMAQSVCEIVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 1.8e-135; % identity: 36.75)
Query:  VSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPDV
        + K +W  A + A +LINR+P+ +L  E P++ LF T   +    ++FGC C+   +RP++  KLD KS +C+FLGYS  Q  Y C      R  +S  V
Subjt:  VSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPDV

Query:  VFFEDT-PFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVP--PSRPLISQVYSRRPPPQPSDSCPPSMLPSS------CDPAPSDDLPIALRK-GKRKC
         F E+  PF++  ++L   ++       V SP  +L T  P  P+       ++  PP  PS     S + SS          PS   P A R+ G +  
Subjt:  VFFEDT-PFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVP--PSRPLISQVYSRRPPPQPSDSCPPSMLPSS------CDPAPSDDLPIALRK-GKRKC

Query:  TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTW-----DLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLV
        T P  +    H  S +T     + ES   P+ + ++LS P   ++     T    + +       ++  P    A          +N      R KA ++
Subjt:  TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTW-----DLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLV

Query:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAW-FGKFSQAN
                   YS   S  A+ +  R  +      +W        NA +     + V    PP  V             ++ G +     W F K   ++
Subjt:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAW-FGKFSQAN

Query:  GTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPR
        G++ R KARLVAKGY Q  G DY+ TFSPV K TSIR+ L +A    W + QLD+ NAFL G L ++VYM QPPGF+ +   + VC+LRK+LYGLKQ+PR
Subjt:  GTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPR

Query:  AWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLS
        AW+ +    L+  G   S SD S+F  +  K IV ++VYVDDI+ITGND   + +    L  +F  KD  +L YFLGIE  R   G++LSQR+Y+LDLL+
Subjt:  AWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLS

Query:  ETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVEC
         T  + AKP  TPM P+ +L +  G    DP  YR +VG L YL  TRPDI+Y+V+ +SQFM  PT +H  A+++IL YL   P  GI  K      +  
Subjt:  ETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVEC

Query:  FSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHI
        +SDADWAG ++D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ WI  LL+E+G  +T P  ++CDN  A ++ +NPVFH R KHI
Subjt:  FSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHI

Query:  EVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGM
         +D HFIR ++Q G +   +V T +QL D LTK L+ T      +K+G+
Subjt:  EVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 7.5e-134; % identity: 36.14)
Query:  VSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPDV
        V K +W  A S A +LINR+P+ +L  + P++ LF     +    K+FGC C+   +RP++  KL+ KS +C F+GYS  Q  Y C      R   S  V
Subjt:  VSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPDV

Query:  VF------FEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPS-RPLISQVYSRRPPPQPSDSC----PPSMLPSSCDPAPSDDLPIA-LRKGKR
         F      F  T F  S S   + +    +    T PT  L    PP   P +    S RPP  PS  C      S LPSS   +PS   P A    G +
Subjt:  VF------FEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPS-RPLISQVYSRRPPPQPSDSC----PPSMLPSSCDPAPSDDLPIA-LRKGKR

Query:  KCTYPVSSFIS------YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKA
            P  +  S       +  +P++ +  +  +++ +P S   +  H    +  I E  +   + T      P        +      +N      R K 
Subjt:  KCTYPVSSFIS------YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKA

Query:  RLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAW-FGKFS
         +        Y T  +    P       R  +     ++W        NA + G+   ++    PP     G     CR              W F K  
Subjt:  RLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAW-FGKFS

Query:  QANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQ
         ++G++ R KARLVAKGY Q  G DY+ TFSPV K TSIR+ L +A    W + QLD+ NAFL G L +EVYM QPPGFV +   D VCRLRK++YGLKQ
Subjt:  QANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQ

Query:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLD
        +PRAW+ +    L+  G   S SD S+F  +  + I+ ++VYVDDI+ITGND + +      L  +F  K+   L YFLGIE  R  +G++LSQR+Y LD
Subjt:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLD

Query:  LLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTR
        LL+ T  L AKP  TPM  + +L +  G    DP  YR +VG L YL  TRPD++Y+V+ +SQ+M  PT DHW A++++L YL   P  GI  K      
Subjt:  LLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTR

Query:  VECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERT
        +  +SDADWAG  +D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ WI  LL+E+G  ++ P  ++CDN  A ++ +NPVFH R 
Subjt:  VECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERT

Query:  KHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDI
        KHI +D HFIR ++Q G +   +V T +QL D LTK L+         K+G+I +
Subjt:  KHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDI

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 2.8e-128; % identity: 39.02)
Query:  YPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQ
        + +S F+SY ++SP  ++F+  +     P++ +EA     W  AM +E+ A++   TW++   P  KK IGCKWV+ +K N DGT+ R KARLVAKGY Q
Subjt:  YPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQ

Query:  IYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARL
          G D+ +TFSPV KLTS++L L+++A   ++LHQLDI NAFL+GDL EE+YM+ PPG+ A QG+S                                  
Subjt:  IYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARL

Query:  KARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKF
                                                                          PP        + VC L+KS+YGLKQ+ R WF KF
Subjt:  KARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKF

Query:  SQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG
        S  L+ FG  +S SDH+ F + +    + ++VYVDDI+I  N+   +  LK+ L+  F  +DLG LKYFLG+E+ RS  GI + QRKY LDLL ETG LG
Subjt:  SQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG

Query:  AKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADW
         KPS  PM P+       G    D + YRRL+G+L YL +TR DI+++V+ +SQF  +P + H  AV +IL Y+K   G+G+ Y      +++ FSDA +
Subjt:  AKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADW

Query:  AGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHF
           ++ RRST+GYC+F+G +L+SWKSKKQ VVS+SSAE+EYRA++ +  E++W+ Q   E+   ++ P  L+CDN AA+HIA+N VFHERTKHIE DCH 
Subjt:  AGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHF

Query:  IREK-IQDGLVSTGYVKTGEQLG--DILTKALNGTRISYLCNKLGMIDIFA
        +RE+ +    +S  +    EQ G  + L+  L GT I Y+ +  G+  + A
Subjt:  IREK-IQDGLVSTGYVKTGEQLG--DILTKALNGTRISYLCNKLGMIDIFA

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e-value: 1.1e-12; % identity: 38.27)
Query:  YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFV
        YLT+TRPD+ ++V+ +SQF S+       AV ++L Y+K   G+G+ Y      +++ F+D+DWA   + RRS +G+C  V
Subjt:  YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFV
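
The % identity reported for a hit can be checked against its alignment. The sketch below assumes the usual BLAST convention: letters on the middle line mark identical residues, `+` marks conservative substitutions, and identity is the number of identical positions divided by the alignment length (gap columns included). The function name `percent_identity` is illustrative and is not part of the database.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity = identical positions / alignment length * 100.

    Assumes `query` is the aligned query line (gaps included) and `midline`
    is the BLAST middle line, where letters mark identical residues.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and middle line copied from the ATMG00240.1 alignment above.
query = "YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFV"
midline = "YLT+TRPD+ ++V+ +SQF S+       AV ++L Y+K   G+G+ Y      +++ F+D+DWA   + RRS +G+C  V"

print(round(percent_identity(query, midline), 2))  # 31 identical of 81 aligned positions
```

For multi-block alignments the same count would be summed over all blocks before dividing by the total alignment length.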

ATMG00810.1 DNA/RNA polymerases superfamily protein (e-value: 1.6e-46; % identity: 40.81)
Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        +VG L YLT+TRPDI+Y+V++V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q 
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAESEYRAMAQSVCEIVW
         VSRSS E+EYRA+A +  E+ W
Subjt:  VVSRSSAESEYRAMAQSVCEIVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e-value: 3.0e-21; % identity: 45.3)
Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSDT
        ++L+P  Y+   +      P SV  AL  PGW  AM EE+ AL  N TW LV  P  +  +GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  + +T
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMA
        +SPV +  +IR  L++A
Subjt:  FSPVAKLTSIRLFLSMA


Sequences
CDS sequence
ATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTCTCTACAGCTTGTTTTTTGATTAATAGGATGCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGTGTTCTTTT
TCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGTGTA
TCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGCTTTCGCCTGATGTTGTCTTTTTTGAAGATACACCCTTTACT
TCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGCCCGTTGAT
TTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTC
TTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATT
CCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATTTCGCCCTGC
AGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATAAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCT
ATGGCACTGATTATTCAGATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATT
AAGAATGCTTTTCTTCACGGTGATCTCCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCT
GTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCAATGGAACAATGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATAT
ATGGCACTGATTATTCAAATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATT
AAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCT
GTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGAT
CTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCAAAACTTTCCTTCAGGGTCAGTTTTAT
ACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAAC
AGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGA
AGTTGAACTACTTAACAGTGACTCGACCAGACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATT
TTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAG
GAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTCGAGTGCTGAGTCAGAATATAGAGCTA
TGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATTACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTCAT
ATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGAC
CGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGCTGGGCATGATCGACATATTTGCTCCAGCTTGA
mRNA sequence
TCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGGAAAAATAGGCATTTACTTG
AAACTGCCCGTGCTTTATCGTTTCAAATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTCTCTACAGCTTGTTTTTTGATTAATAGGATGCCTTCCTCTGTTCTTAAT
GGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAA
GTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGCTTTCGCCTGATGTTG
TCTTTTTTGAAGATACACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACT
GATGTGCCTCCTTCCCGCCCGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGC
GCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTA
TTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGT
ACTTGGGATTTGGTATTTCGCCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATAAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCT
TGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATA
AATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTCCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGAT
AAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCAATGGAACAATGGCTCGTTTAAAGGCTCGCCT
TGTTGCCAAAGGTTATGCTCAAATATATGGCACTGATTATTCAAATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATA
AATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGAT
AAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATC
TGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCA
AAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAA
TATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCC
TGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAGACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGG
ATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCT
GATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTC
GAGTGCTGAGTCAGAATATAGAGCTATGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATTACCGTGCCAGCTAAATTAT
GGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGG
TTGGTGTCCACAGGATATGTGAAGACCGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGCTGGGCATGATCGA
CATATTTGCTCCAGCTTGAGGGGGAGTGTTATGATATATATATATATACATATGTCCTTTATTGTAA
Protein sequence
MHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLLSPDVVFFEDTPFT
SSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
PNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVFRPAGKKAIGCKWVFAVKINPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDI
KNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQANGTMARLKARLVAKGYAQIYGTDYSNTFSPVAKLTSIRLFLSMAATNKWSLHQLDI
KNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFY
TKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQI
LCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALH
IASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
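
The protein sequence above should correspond to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not taken from the database.

```python
# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Unknown or ambiguous codons are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 36 nucleotides of the CDS listed above.
print(translate("ATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTC"))  # MHVSKIFWVDAV
```

Joining the CDS lines into one string and passing it to `translate` should give the full protein, ending just before the terminal `TGA` codon.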