; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G16190 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G16190
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr1:11714631..11718719
RNA-Seq ExpressionCSPI01G16190
SyntenyCSPI01G16190
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0000325 - plant-type vacuole (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]0.0e+0099.19Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQK DWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
        RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQE

Query:  SLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTS
        SLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTS
Subjt:  SLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTS

Query:  QLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSS
        QLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSL SLNCDSCQF KFHRLSSS
Subjt:  QLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSS

Query:  PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGII
        PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGII
Subjt:  PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGII

Query:  HQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDP
        HQSSCADTPSQNGV ERKNRHLL+TARALSFQMHVSKIFWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDP
Subjt:  HQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDP

Query:  KSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSML
        KSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSML
Subjt:  KSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSML

Query:  PSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVF
        PSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVF
Subjt:  PSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVF

Query:  AVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLY
        AVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLY
Subjt:  AVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLY

Query:  GLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRK
        GLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRK
Subjt:  GLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRK

Query:  YVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHG
        YVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILY DHG
Subjt:  YVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHG

Query:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH
        HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH
Subjt:  HTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFH

Query:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP
        ERTKHIEVDCHFIREKIQDGLVSTGYVKT EQLGDILTK+LNGTRISYLCNKLGMIDIFAP
Subjt:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]0.0e+0098.83Show/hide
Query:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDI
        F+DRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSL SLNCDSCQF KFHRLSSSPRVDKRAIAPFELVHSDI
Subjt:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDI

Query:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERK
        WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGV ERK
Subjt:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERK

Query:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR
        NRHLL+TARALSFQMHVSKIFWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR
Subjt:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR

Query:  CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK
        CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK
Subjt:  CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK

Query:  GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV
        GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV
Subjt:  GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV

Query:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV
        AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV
Subjt:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV

Query:  CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG
        CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG
Subjt:  CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG

Query:  TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSRED
        TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILY DHGHTRVECFSDADWAGSRED
Subjt:  TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSRED

Query:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ
        RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ
Subjt:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ

Query:  DGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP
        DGLVSTGYVKT EQLGDILTK+LNGTRISYLCNKLGMIDIFAP
Subjt:  DGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]0.0e+0098.83Show/hide
Query:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDI
        F+DRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSL SLNCDSCQF KFHRLSSSPRVDKRAIAPFELVHSDI
Subjt:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDI

Query:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERK
        WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGV ERK
Subjt:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERK

Query:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR
        NRHLL+TARALSFQMHVSKIFWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR
Subjt:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR

Query:  CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK
        CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK
Subjt:  CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK

Query:  GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV
        GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV
Subjt:  GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV

Query:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV
        AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV
Subjt:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV

Query:  CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG
        CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG
Subjt:  CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG

Query:  TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSRED
        TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILY DHGHTRVECFSDADWAGSRED
Subjt:  TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSRED

Query:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ
        RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ
Subjt:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ

Query:  DGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP
        DGLVSTGYVKT EQLGDILTK+LNGTRISYLCNKLGMIDIFAP
Subjt:  DGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]0.0e+0098.83Show/hide
Query:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDI
        F+DRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSL SLNCDSCQF KFHRLSSSPRVDKRAIAPFELVHSDI
Subjt:  FQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDI

Query:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERK
        WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGV ERK
Subjt:  WGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERK

Query:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR
        NRHLL+TARALSFQMHVSKIFWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR
Subjt:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYR

Query:  CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK
        CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK
Subjt:  CYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRK

Query:  GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV
        GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV
Subjt:  GKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLV

Query:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV
        AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV
Subjt:  AKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV

Query:  CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG
        CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG
Subjt:  CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSG

Query:  TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSRED
        TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILY DHGHTRVECFSDADWAGSRED
Subjt:  TPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSRED

Query:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ
        RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ
Subjt:  RRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQ

Query:  DGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP
        DGLVSTGYVKT EQLGDILTK+LNGTRISYLCNKLGMIDIFAP
Subjt:  DGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]0.0e+0098.94Show/hide
Query:  QDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIW
        QDRVTKKIIGRGYESGGLYLFDHQVSQ VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSL SLNCDSCQF KFHRLSSSPRVDKRAIAPFELVHSDIW
Subjt:  QDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIW

Query:  GPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKN
        GPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGV ERKN
Subjt:  GPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKN

Query:  RHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRC
        RHLL+TARALSFQMHVSKIFWVD VSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRC
Subjt:  RHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRC

Query:  YCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKG
        YCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKG
Subjt:  YCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKG

Query:  KRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVA
        KRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVA
Subjt:  KRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVA

Query:  KGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVC
        KGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVC
Subjt:  KGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVC

Query:  FGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGT
        FGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGT
Subjt:  FGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGT

Query:  PMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDR
        PMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILY DHGHTRVECFSDADWAGSREDR
Subjt:  PMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDR

Query:  RSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQD
        RSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQD
Subjt:  RSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQD

Query:  GLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP
        GLVSTGYVKT EQLGDILTK+LNGTRISYLCNKLGMIDIFAP
Subjt:  GLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP

TrEMBL top hitse value%identityAlignment
A0A438DZQ8 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0061.33Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN  V++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V +M++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ Q E+M VM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS
        R E    +VS  Q ++ L +K  N    +R        + ++   +S  IVC YC + GH K++CRKL  +N +R Q   +A++      D     VT++
Subjt:  RIESSPTSVSIPQPSSALFSKNNNPRAPQR-------NSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH
        A EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + GSGT+  T S +LSSVL+
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLH

Query:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDS
        LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+S
Subjt:  LPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDS

Query:  CQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSH
        C F K HR S  PR++KR  + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+
Subjt:  CQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSH

Query:  SLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFV
        S  +Y+  NGI+HQ+SC DTPSQNGV ERKNRHLL+TARAL FQM V K FW D VSTACFLINRMP+ VL G+IPY+ + P K LFP+AP+IFGC C+V
Subjt:  SLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFV

Query:  RDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS-----------------TD
        RD RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YLVS D+VF EDT F SSP+S    ED+   +Y+V +  P++                   +
Subjt:  RDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS-----------------TD

Query:  VP--PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQN
        +P  P++P I Q+YSRR  P  +D+C P+  PSS DP+   DLPI+L KGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+N
Subjt:  VP--PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQN

Query:  AMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFL
        AM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK++PDG+VARLKARLVA+GYAQ YG DYSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFL
Subjt:  AMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFL

Query:  HGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFL
        HGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYV+DIVIT ND  GIS LKTF+
Subjt:  HGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFL

Query:  QGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQ
          +F+TKDLG+LKYFLGIEV RSKK ++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PER RR+VGKLNYLTVTRPDIAY+VSVVSQ
Subjt:  QGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQ

Query:  FMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWI
        F  +PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSD DWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAMAQ+ CEI+WI
Subjt:  FMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWI

Query:  HQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTK
        HQLL E+G   T+PAKLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKT EQLG  L K
Subjt:  HQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTK

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0063.13Show/hide
Query:  LRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKM
        ++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LDFLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+M
Subjt:  LRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKM

Query:  AVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCR
        AVM FL+GL  EF  AK+QILS S I SL + F+RVLR E    S  T+V + +  +A  ++  N R   R   +     +  IVC YC + GH K++CR
Subjt:  AVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCR

Query:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA
        KL  +N +R Q A +A++      D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS      
Subjt:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA

Query:  PFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP
          P VT+ADGST  + GSGT+  T S +LSSVL+LPNL+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    
Subjt:  PFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP

Query:  SPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSE
        SP E HCRLGHPSL VLKKL P+F +L SL+C+SC F K HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE
Subjt:  SPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSE

Query:  LLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRM
        + SHFCAF  EIK Q++VS+K LR+DN  EY S+S  +Y+  NGI+HQ+SC DTPSQNGV ERKNRHLL+TARAL FQM V K FW D VSTACFLINRM
Subjt:  LLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRM

Query:  PSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDD
        P+ VL  +IPY+V+ P K LFP+AP+IFGC C+VRD RP   KLDPK+L+C+FLGYSR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+
Subjt:  PSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDD

Query:  NLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISY
           +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LRKGKR C   Y +++F+SY
Subjt:  NLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISY

Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
          LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDT
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY
        FSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY
Subjt:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY

Query:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE
        ++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+
Subjt:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE

Query:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN
           +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGN
Subjt:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN

Query:  LVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQ
        LV+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+WIHQLL E+G   T+PAKLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKT EQ
Subjt:  LVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQ

Query:  LGDILTKSLNGTRISYLCNKLGMIDIFAP
        LGDI TK+LNGTR+ Y CNKLGMI+I+AP
Subjt:  LGDILTKSLNGTRISYLCNKLGMIDIFAP

A0A438H537 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0060.99Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN  V++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE
        R E    S  T+V + +  +A  ++  N R   R + ++R  +S  IVC Y  + GH K++CRKL  +N +R Q A +A++      D     VT++A+E
Subjt:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE

Query:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN
        F+K+  YQ++L+A   STP+ + V  G   CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + GSGT+  T S +LSSVL+LPN
Subjt:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN

Query:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQF
        L+FNLIS S+LT DLNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+SC F
Subjt:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQF

Query:  TKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLG
         K HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S  
Subjt:  TKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLG

Query:  SYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDV
        +Y+  NGI+HQ+SC DTPSQNGV ERKNRHLL+TARAL FQM V K FW D VS ACFLINRMP+ VL G+I Y+V+ P K LFP+AP+IFGC C+VRD 
Subjt:  SYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDV

Query:  RPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS-----------------TDVP-
        RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+V +  P++                   ++P 
Subjt:  RPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS-----------------TDVP-

Query:  -PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMI
         P++P I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LRKGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM+
Subjt:  -PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMI

Query:  EEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGD
        EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGD
Subjt:  EEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGD

Query:  LQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
        L+EEV                             SPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +
Subjt:  LQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ

Query:  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMS
        F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S
Subjt:  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMS

Query:  SPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL
        +PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAM Q+ CEI+WIHQL
Subjt:  SPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL

Query:  LSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP
        L E+G   T+PAKLWCDNQAALHIA+NPV+HERTKHI+VDCHFIREKI++ LVSTGYVKT EQLGDI TK+LNGTR+ Y CNKLGMI+I+AP
Subjt:  LSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0063.13Show/hide
Query:  LRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKM
        ++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LDFLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+M
Subjt:  LRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKM

Query:  AVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCR
        AVM FL+GL  EF  AK+QILS S I SL + F+RVLR E    S  T+V + +  +A  ++  N R   R   +     S  IVC YC + GH K++CR
Subjt:  AVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCR

Query:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA
        KL  +N +R Q A +A++      D     VT++A+EF+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS      
Subjt:  KLLYKNSQRSQHAQIAST-----CDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPA

Query:  PFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP
          P VT+ADGST  + GSGT+  T S +LSSVL+LPNL+FNLIS S+LT +LN  V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    
Subjt:  PFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP

Query:  SPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSE
        SP E HC+LGHPSL VLKKL P+F +L SL+C+SC F K HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE
Subjt:  SPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSE

Query:  LLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRM
        + SHFCAF  EIK Q++VS+K LR+DN  EY S+S  +Y+ +NGI+HQ+SC DTPSQNGV ERKNRHLL+TARAL FQM V K FW D VSTACFLINRM
Subjt:  LLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRM

Query:  PSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDD
        P+ VL G+IPY+V+ P K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+
Subjt:  PSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDD

Query:  NLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKR--KCTYPVSSFISY
           +Y+V +  P            SL+   P       P++P I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LRKGKR  K  Y +++F+SY
Subjt:  NLFIYEVTSPTP------------SLSTDVP-------PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKR--KCTYPVSSFISY

Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
          LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+N DG+VARLKARLVA+GYAQ YG DYSDT
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY
        FSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY
Subjt:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY

Query:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE
        ++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+
Subjt:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE

Query:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN
           +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGN
Subjt:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN

Query:  LVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQ
        LV+WKSKKQ+VVSRSSAESEYRAMAQ+ CEI+WIHQLL E+G   T+PAKLWCDNQAALHIA+NP++HERTKHIEVDCHFIREKI++ LVSTGYVKT EQ
Subjt:  LVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQ

Query:  LGDILTKSLNGTRISYLCNKLGMIDIFAP
        LGDI TK+LNGTR+ Y CNKLGMI+I+AP
Subjt:  LGDILTKSLNGTRISYLCNKLGMIDIFAP

B0FBS2 Uncharacterized protein0.0e+0063.07Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN  V++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D  +K  W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   E+ A+S+T+YFM  KK+  EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL

Query:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE
        R E    S  T+V I +  +A  ++  N R   R   +     S  IVC YC + GH K++CRKL  +N +R Q A +A++      D     VT++A+E
Subjt:  RIE----SSPTSVSIPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIAST-----CDIPEASVTISADE

Query:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN
        F+K+  YQ++L+A   STP+ S +A     CL++SS KW+IDSGAT HMTGN   FS        P VT+ADGST  + GSGT+  T S +LSSVL+LPN
Subjt:  FAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPN

Query:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQF
        L+FNLIS S+LT +LNC V FF  +C+FQD +TK+  G+G+ S GLY+ D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+SC F
Subjt:  LSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQF

Query:  TKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLG
         K HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S  
Subjt:  TKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLG

Query:  SYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDV
        +Y+  NGI+HQ+SC DTPSQNGV ERKNRHLL+TARAL FQM V K FW D VSTACFLINRMP+ VL G+IPY+V+ P K LFP+AP+IFGC C+VRD 
Subjt:  SYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDV

Query:  RPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP------
        RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+V +  P            SL+   P      
Subjt:  RPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP------

Query:  -PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMI
         P++P I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LRKGKR C   Y +++F+SY  LS S+   + S++S S+P +V EAL+HPGW+NAM+
Subjt:  -PSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMI

Query:  EEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGD
        EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDTFSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGD
Subjt:  EEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGD

Query:  LQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
        L+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +
Subjt:  LQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ

Query:  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMS
        F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S
Subjt:  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMS

Query:  SPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL
        +PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAM+Q+ CEI+WIHQL
Subjt:  SPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQL

Query:  LSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP
        L E+G   T+PAKLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKT EQLGDI TK+LNGTR+ Y CNKLGMI+I+AP
Subjt:  LSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAP

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.2e-12628.33Show/hide
Query:  SALFSKNNNPRAPQRNSTDHRKPESV-------EIVCNYCRKPGHMKRDC----RKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQA
        +A+   NNN            KP+ +       ++ C++C + GH+K+DC    R L  KN +  +  Q A                             
Subjt:  SALFSKNNNPRAPQRNSTDHRKPESV-------EIVCNYCRKPGHMKRDC----RKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQA

Query:  SSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLA-DGSTSSVLGSGTIHL--TPSFSLSSVLHLPNLSFNLISTSQ
          +S  IA  V   N   ++  +  +V+DSGA+ H+  +  L++  +   P   + +A  G        G + L      +L  VL     + NL+S  +
Subjt:  SSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLA-DGSTSSVLGSGTIHL--TPSFSLSSVLHLPNLSFNLISTSQ

Query:  LTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV-HCRLGHPSLFVL-----KKLYPEFRSLFSLN-----CDSCQ
        L      +    SG  + ++ +   ++        + + + Q   + A     + F + H R GH S   L     K ++ +   L +L      C+ C 
Subjt:  LTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV-HCRLGHPSLFVL-----KKLYPEFRSLFSLN-----CDSCQ

Query:  FTKFHRLSSSPRVDKRAI-APFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHS
          K  RL      DK  I  P  +VHSD+ GP   V+     YFV FVD  +     YL+K +S++ S F  F  + +  FN+ +  L  DN  EY S+ 
Subjt:  FTKFHRLSSSPRVDKRAI-APFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHS

Query:  LGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPIAPKIFGCVCF
        +  +  + GI +  +   TP  NGV ER  R + + AR +     + K FW + V TA +LINR+PS  L  + + PY  ++  K  +    ++FG   +
Subjt:  LGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPIAPKIFGCVCF

Query:  VRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDV-----------------VFFEDTP------FTSSPSSLCQGE------------
        V  ++    K D KS K IF+GY     G++ +    ++++V+ DV                 VF +D+       F +    + Q E            
Subjt:  VRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDV-----------------VFFEDTP------FTSSPSSLCQGE------------

Query:  --------------DDNLFIYEVTSPTPS--------LSTDVPPSRPLISQVYSRRPPPQPSDS----CPPSMLPSSC----------DPAPSDDLPIAL
                      +D+  I +   P  S        L      ++  +++   R+     ++S     P     S            +P  +D + I  
Subjt:  --------------DDNLFIYEVTSPTPS--------LSTDVPPSRPLISQVYSRRPPPQPSDS----CPPSMLPSSC----------DPAPSDDLPIAL

Query:  RKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSIPNSVHEAL---SHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVA
        R+ +R  T P    ISY++   S    + +  +    +PNS  E         W+ A+  E+ A   N TW +  RP  K  +  +WVF+VK N  G   
Subjt:  RKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSIPNSVHEAL---SHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVA

Query:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG
        R KARLVA+G+ Q Y  DY +TF+PVA+++S R  LS+       +HQ+D+K AFL+G L+EE+YM  P G      SD VC+L K++YGLKQ+ R WF 
Subjt:  RLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFG

Query:  KFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSET
         F QAL       S+ D  ++   + +    + +++YVDD+VI   D   +++ K +L  +F   DL ++K+F+GI +   +  IYLSQ  YV  +LS+ 
Subjt:  KFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSET

Query:  GKLGAKPSGTPMMP--NQQLVKEGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDH--GHTRV
                 TP+    N +L+   E C  P   R L+G L Y+ + TRPD+  +V+++S++ S    + W  ++++L YLK      +++  +     ++
Subjt:  GKLGAKPSGTPMMP--NQQLVKEGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDH--GHTRV

Query:  ECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERT
          + D+DWAGS  DR+ST+GY       NL+ W +K+QN V+ SS E+EY A+ ++V E +W+  LL+ I   +  P K++ DNQ  + IA+NP  H+R 
Subjt:  ECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERT

Query:  KHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMI
        KHI++  HF RE++Q+ ++   Y+ TE QL DI TK L   R   L +KLG++
Subjt:  KHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.6e-17231.67Show/hide
Query:  KLNGSN-YYDWRRTILFYLRSTDMDDHMTEDPPK-DAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFF
        K NG N +  W+R +   L    +   +  D  K D  +  DW   D R    I+  +  +++  +   ++ + +   L+ LY  K   ++++     + 
Subjt:  KLNGSN-YYDWRRTILFYLRSTDMDDHMTEDPPK-DAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFF

Query:  RAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL---RIESSPTS-----VSIP
            +  +  S+      +I +L  L      VK++    E+   ++ LN L   +    T IL       L D  + +L   ++   P +     ++  
Subjt:  RAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVL---RIESSPTS-----VSIP

Query:  QPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIA
        +  S   S NN  R+  R  + +R    V   C  C +PGH KRDC      N ++ +        D   A++  + D    F N +E     S      
Subjt:  QPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIA

Query:  STVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFS----LSSVLHLPNLSFNLISTSQLTHDLNC
                       ++WV+D+ A+ H T    LF R ++   F +V + + S S + G G I +  +      L  V H+P+L  NLIS   L  D   
Subjt:  STVAPGNIKCLLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFS----LSSVLHLPNLSFNLISTSQLTHDLNC

Query:  VVMFFSGYCLFQDRVTK--KIIGRGYESGGLYLFDHQVSQ--VVACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSS
           + S +   + R+TK   +I +G   G LY  + ++ Q  + A     S    H R+GH S     +   K L    +      CD C F K HR+S 
Subjt:  VVMFFSGYCLFQDRVTK--KIIGRGYESGGLYLFDHQVSQ--VVACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSS

Query:  SPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGI
             +R +   +LV+SD+ GP  + S  G +YFVTF+DD SR  W+Y++K + ++   F  FH  ++ +    +K LR+DN GEY S     Y   +GI
Subjt:  SPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGI

Query:  IHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLD
         H+ +   TP  NGV ER NR +++  R++     + K FW + V TAC+LINR PS  L  EIP RV +  K +     K+FGC  F    +   TKLD
Subjt:  IHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLD

Query:  PKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSM
         KS+ CIF+GY   + GYR + P  K+ + S DVVF E    T++  S  +   + +    VT P+ S +                   P  ++S    +
Subjt:  PKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRPLISQVYSRRPPPQPSDSCPPSM

Query:  LPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI----------PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVS
              P    +    L +G  +  +P      +  L  S    + S    S           P S+ E LSHP       AM EEM +L  NGT+ LV 
Subjt:  LPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI----------PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVS

Query:  RPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQ
         P GK+ + CKWVF +K + D  + R KARLV KG+ Q  G D+ + FSPV K+TSIR  LS+AA+    + QLD+K AFLHGDL+EE+YMEQP GF   
Subjt:  RPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQ

Query:  GESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGI
        G+   VC+L KSLYGLKQ+PR W+ KF   +      K+ SD  V+++R SE   ++L++YVDD++I G D   I+ LK  L   F  KDLG  +  LG+
Subjt:  GESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGI

Query:  EVMRSK--KGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVK---------EGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTV
        +++R +  + ++LSQ KY+  +L       AKP  TP+  + +L K         +G + K P  Y   VG L Y  V TRPDIA++V VVS+F+ +P  
Subjt:  EVMRSK--KGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVK---------EGELCKDPERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTV

Query:  DHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEI
        +HW AV+ IL YL+   G   L        ++ ++DAD AG  ++R+S++GY     G  +SW+SK Q  V+ S+ E+EY A  ++  E++W+ + L E+
Subjt:  DHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEI

Query:  GFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKL
        G        ++CD+Q+A+ ++ N ++H RTKHI+V  H+IRE + D  +    + T E   D+LTK +   +   LC +L
Subjt:  GFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKL

P92519 Uncharacterized mitochondrial protein AtMg008106.2e-4540.81Show/hide
Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        +VG L YLT+TRPDI+Y+V++V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q 
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAESEYRAMAQSVCEIVW
         VSRSS E+EYRA+A +  E+ W
Subjt:  VVSRSSAESEYRAMAQSVCEIVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.2e-19532.68Show/hide
Query:  ADIKNLVVSNVIPLASKITE-HKLNGSNYYDWRRTI--LF--YLRSTDMDDHMTEDP--------PKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDH
        A  + LV++N   L   ++   KL  +NY  W R +  LF  Y  +  +D   T  P        P+       W R D  +Y  +  +I   +   V  
Subjt:  ADIKNLVVSNVIPLASKITE-HKLNGSNYYDWRRTI--LF--YLRSTDMDDHMTEDP--------PKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDH

Query:  CESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDS
          +  ++ E L  +Y+     H + ++  Q  +  +  +++  Y   L     +L LL  P   D +V++V          L  L  E+     QI +  
Subjt:  CESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDS

Query:  KIPSLDDAFTRVLRIESSPTSVS----IPQPSSALFSK-----NNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTC
          P+L +   R+L  ES   +VS    IP  ++A+  +     NNN    + N  D+R         N   KP   ++        N+Q   +      C
Subjt:  KIPSLDDAFTRVLRIESSPTSVS----IPQPSSALFSK-----NNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTC

Query:  DIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKC-LLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL
         +       SA   ++ Q++  S+ +    +P        N+      SS  W++DSGAT H+T + +  S          V +ADGST  +  +G+  L
Subjt:  DIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKC-LLTSSTKWVIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL

Query:  TPS---FSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV----HCRLGHPSLFV
        +      +L ++L++PN+  NLIS  +L +     V FF      +D  T   + +G     LY +    SQ V+    PS        H RLGHP+  +
Subjt:  TPS---FSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVPSPFEV----HCRLGHPSLFV

Query:  LKKLYPEFRSLFSLN-------CDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFH
        L  +   + SL  LN       C  C   K +++  S +    +  P E ++SD+W   P++S   +RY+V FVD  +R TWLY +K +S++   F  F 
Subjt:  LKKLYPEFRSLFSLN-------CDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFH

Query:  TEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEI
          ++N+F   I T  +DN GE+   +L  Y  ++GI H +S   TP  NG+ ERK+RH+++T   L     + K +W    + A +LINR+P+ +L  E 
Subjt:  TEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEI

Query:  PYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDT-PFTSSPSSLCQGEDDNLFIYEV
        P++ LF T   +    ++FGC C+   +RP++  KLD KS +C+FLGYS  Q  Y C      R  +S  V F E+  PF++  ++L   ++       V
Subjt:  PYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDT-PFTSSPSSLCQGEDDNLFIYEV

Query:  TSPTPSLSTDVP--PSRPLISQVYSRRPPPQPS-------------DSCPPSMLPSSCDPA------------------------------PSDDLPIAL
         SP  +L T  P  P+       ++  PP  PS             DS   S  PSS +P                               P+++ P  L
Subjt:  TSPTPSLSTDVP--PSRPLISQVYSRRPPPQPS-------------DSCPPSMLPSSCDPA------------------------------PSDDLPIAL

Query:  RK------------------GKRKCTYPVSSFISYHQLSP------------------------------STYAFITSLESTSIPNSVHEALSHPGWQNA
         +                       T P    I  H   P                                Y+   SL + S P +  +AL    W+NA
Subjt:  RK------------------GKRKCTYPVSSFISYHQLSP------------------------------STYAFITSLESTSIPNSVHEALSHPGWQNA

Query:  MIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFL
        M  E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKGY Q  G DY++TFSPV K TSIR+ L +A    W + QLD+ NAFL
Subjt:  MIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFL

Query:  HGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFL
         G L ++VYM QPPGF+ +   + VC+LRK+LYGLKQ+PRAW+ +    L+  G   S SD S+F  +  K IV ++VYVDDI+ITGND   + +    L
Subjt:  HGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFL

Query:  QGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQ
          +F  KD  +L YFLGIE  R   G++LSQR+Y+LDLL+ T  + AKP  TPM P+ +L +  G    DP  YR +VG L YL  TRPDI+Y+V+ +SQ
Subjt:  QGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQ

Query:  FMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHT-RVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW
        FM  PT +H  A+++IL YL   P  GI ++  G+T  +  +SDADWAG ++D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ W
Subjt:  FMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHT-RVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVW

Query:  IHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGM
        I  LL+E+G  +T P  ++CDN  A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LTK L+ T      +K+G+
Subjt:  IHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.3e-19132.55Show/hide
Query:  LVVSNVIPL-ASKITEHKLNGSNYYDWRRTIL----------FYLRSTDMDDHM--TEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVK
        LV +N++ +  S +T  KL  +NY  W R +           F   ST M      T+  P+       W R D  +Y  I  +I   +   V    +  
Subjt:  LVVSNVIPL-ASKITEHKLNGSNYYDWRRTIL----------FYLRSTDMDDHM--TEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVK

Query:  ELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSL
        ++ E L  +Y+     H                  VT   +R      +L LL  P   D +V++V          L  L  ++     QI +    PSL
Subjt:  ELLEFLDFLYSGKEQVHRMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLL-LPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSL

Query:  DDAFTRVLRIESSPTSVS----IPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTIS
         +   R++  ES   +++    +P  ++ +  +N N    Q N  D+R   +     N  +      R        N Q   +      C +       S
Subjt:  DDAFTRVLRIESSPTSVS----IPQPSSALFSKNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTIS

Query:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLT-SSTKWVIDSGATAHMTG--NSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL---TPSFS
        A    +   +Q +     S++P        N+      ++  W++DSGAT H+T   N+  F +P +      V +ADGST  +  +G+  L   + S  
Subjt:  ADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLT-SSTKWVIDSGATAHMTG--NSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHL---TPSFS

Query:  LSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP----SPFEVHCRLGHPSLFVLKKLYPEF
        L+ VL++PN+  NLIS  +L +     V FF      +D  T   + +G     LY +    SQ V+    P    +    H RLGHPSL +L  +    
Subjt:  LSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQVVACPVVP----SPFEVHCRLGHPSLFVLKKLYPEF

Query:  R------SLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNV
               S   L+C  C   K H++  S      +  P E ++SD+W   P++S   +RY+V FVD  +R TWLY +K +S++   F  F + ++N+F  
Subjt:  R------SLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNV

Query:  SIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTK
         I TL +DN GE+    L  YL ++GI H +S   TP  NG+ ERK+RH+++    L     V K +W    S A +LINR+P+ +L  + P++ LF   
Subjt:  SIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTK

Query:  HLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVF------FEDTPFTSSPSSLCQGEDDNLFIYEVTSPT
          +    K+FGC C+   +RP++  KL+ KS +C F+GYS  Q  Y C      R   S  V F      F  T F  S S   + +    +    T PT
Subjt:  HLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVF------FEDTPFTSSPSSLCQGEDDNLFIYEVTSPT

Query:  PSLSTDVPPS-RPLISQVYSRRPPPQPSDSC----PPSMLPSSCDPAPSDDLPIA----------------------------------LRKGKRKCTYP
          L    PP   P +    S RPP  PS  C      S LPSS   +PS   P A                                       +    P
Subjt:  PSLSTDVPPS-RPLISQVYSRRPPPQPSDSC----PPSMLPSSCDPAPSDDLPIA----------------------------------LRKGKRKCTYP

Query:  VSSFISYHQLSPST-------------------------------------------------------YAFITSLESTSIPNSVHEALSHPGWQNAMIE
         S   S H  +PST                                                       Y++ TSL + S P +  +A+    W+ AM  
Subjt:  VSSFISYHQLSPST-------------------------------------------------------YAFITSLESTSIPNSVHEALSHPGWQNAMIE

Query:  EMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGD
        E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKGY Q  G DY++TFSPV K TSIR+ L +A    W + QLD+ NAFL G 
Subjt:  EMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGD

Query:  LQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ
        L +EVYM QPPGFV +   D VCRLRK++YGLKQ+PRAW+ +    L+  G   S SD S+F  +  + I+ ++VYVDDI+ITGND + +      L  +
Subjt:  LQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQ

Query:  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMS
        F  K+   L YFLGIE  R  +G++LSQR+Y LDLL+ T  L AKP  TPM  + +L +  G    DP  YR +VG L YL  TRPD++Y+V+ +SQ+M 
Subjt:  FYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMS

Query:  SPTVDHWAAVEQILCYLKAAPGRGILYIDHGHT-RVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQ
         PT DHW A++++L YL   P  GI ++  G+T  +  +SDADWAG  +D  ST+GY V++G + +SW SKKQ  V RSS E+EYR++A +  E+ WI  
Subjt:  SPTVDHWAAVEQILCYLKAAPGRGILYIDHGHT-RVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQ

Query:  LLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDI
        LL+E+G  ++ P  ++CDN  A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LTK L+         K+G+I +
Subjt:  LLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.0e-14046.32Show/hide
Query:  YPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQ
        + +S F+SY ++SP  ++F+  +     P++ +EA     W  AM +E+ A++   TW++ + P  KK IGCKWV+ +K N DGT+ R KARLVAKGY Q
Subjt:  YPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQ

Query:  IYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCF
          G D+ +TFSPV KLTS++L L+++A   ++LHQLDI NAFL+GDL EE+YM+ PPG+ A QG+S   + VC L+KS+YGLKQ+ R WF KFS  L+ F
Subjt:  IYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCF

Query:  GMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTP
        G  +S SDH+ F + +    + ++VYVDDI+I  N+   +  LK+ L+  F  +DLG LKYFLG+E+ RS  GI + QRKY LDLL ETG LG KPS  P
Subjt:  GMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTP

Query:  MMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDR
        M P+       G    D + YRRL+G+L YL +TR DI+++V+ +SQF  +P + H  AV +IL Y+K   G+G+ Y      +++ FSDA +   ++ R
Subjt:  MMPNQQL-VKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDR

Query:  RSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREK-IQ
        RST+GYC+F+G +L+SWKSKKQ VVS+SSAE+EYRA++ +  E++W+ Q   E+   ++ P  L+CDN AA+HIA+N VFHERTKHIE DCH +RE+ + 
Subjt:  RSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREK-IQ

Query:  DGLVSTGYVKTEEQLG--DILTKSLNGTRISYLCNKLGMIDIFA
           +S  +   +EQ G  + L+  L GT I Y+ +  G+  + A
Subjt:  DGLVSTGYVKTEEQLG--DILTKSLNGTRISYLCNKLGMIDIFA

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.2e-1238.27Show/hide
Query:  YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFV
        YLT+TRPD+ ++V+ +SQF S+       AV ++L Y+K   G+G+ Y      +++ F+D+DWA   + RRS +G+C  V
Subjt:  YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.1e-0427.91Show/hide
Query:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLK
        NR +++  R++  +  + K F  D  +TA  +IN+ PS+ +N  +P  V F +   +    + FGCV +   +     KL P++ K
Subjt:  NRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLK

ATMG00810.1 DNA/RNA polymerases superfamily protein4.4e-4640.81Show/hide
Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        +VG L YLT+TRPDI+Y+V++V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q 
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAESEYRAMAQSVCEIVW
         VSRSS E+EYRA+A +  E+ W
Subjt:  VVSRSSAESEYRAMAQSVCEIVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.7e-2145.3Show/hide
Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        ++L+P  Y+   +      P SV  AL  PGW  AM EE+ AL  N TW LV  P  +  +GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  + +T
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMA
        +SPV +  +IR  L++A
Subjt:  FSPVAKLTSIRLFLSMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCCAATTATTACGATTGGCGTCGGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAATGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTCCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGC
AAGAACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGA
TTGTCGGAAATTGCTATATAAGAATAGTCAACGGTCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTA
AGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACCAAATGG
GTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTACATTGGCCGATGGCTCCACATC
TTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCCTTTTCTCTCTCTTCTGTGTTACATTTGCCTAACTTATCCTTTAATTTAATTTCTACTAGTCAACTTACTC
ATGACCTAAATTGTGTTGTCATGTTCTTTTCTGGTTATTGCTTGTTTCAGGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCCTTTATCTCTTT
GATCATCAAGTATCACAAGTTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTGTTTGTGTTGAAGAAACTTTATCCAGA
ATTTAGGTCTTTGTTCTCTTTAAATTGTGATTCGTGTCAATTTACGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAG
TTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAACTTGGTTATATTTAATGAAA
AATCGTTCTGAGTTATTATCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATATTTTTC
TCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTGGAAGAGCGAAAAAATAGGCATTTACTTG
ACACTGCCCGTGCTTTATCGTTTCAAATGCATGTTTCAAAAATATTTTGGGTGGATGTTGTCTCTACAGCTTGCTTTTTGATTAATAGAATGCCTTCCTCTGTTCTTAAT
GGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAA
GTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGGTTTCGCCTGATGTTG
TCTTTTTTGAAGATACACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACT
GATGTGCCTCCTTCCCGCCCGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGC
GCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTA
TTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGT
ACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCT
TGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATA
AATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGAT
AAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATC
TGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCA
AAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAA
TATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCC
TGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAGACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGG
ATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACATAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCT
GATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTC
GAGTGCTGAGTCAGAATATAGAGCTATGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATTACCGTGCCAGCTAAATTAT
GGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAGCATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGG
TTGGTATCCACAGGATATGTGAAGACTGAAGAACAATTGGGAGATATTCTAACTAAATCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGTTGGGCATGATCGA
CATATTTGCTCCAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCCAATTATTACGATTGGCGTCGGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAATGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGGAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGATCATTGCCGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTCCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTGACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGAGTCCTTCGCATTGAAAGCTCTCCGACTAGTGTGTCTATTCCTCAACCCAGTAGTGCTCTCTTTAGC
AAGAACAATAACCCTCGGGCACCTCAGAGGAATAGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTAAGCCAGGCCATATGAAACGTGA
TTGTCGGAAATTGCTATATAAGAATAGTCAACGGTCTCAACATGCTCAGATAGCCTCCACATGCGATATACCAGAGGCGTCAGTTACTATTTCTGCAGATGAGTTTGCTA
AGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGTAATATAAAGTGTCTTCTTACATCATCTACCAAATGG
GTCATAGACTCTGGTGCCACAGCTCATATGACAGGTAATTCTCACCTATTTTCTAGACCGTTGTCCCCTGCCCCTTTCCCATCTGTTACATTGGCCGATGGCTCCACATC
TTCTGTTCTTGGCTCTGGCACTATTCACCTTACCCCATCCTTTTCTCTCTCTTCTGTGTTACATTTGCCTAACTTATCCTTTAATTTAATTTCTACTAGTCAACTTACTC
ATGACCTAAATTGTGTTGTCATGTTCTTTTCTGGTTATTGCTTGTTTCAGGATCGTGTGACGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCCTTTATCTCTTT
GATCATCAAGTATCACAAGTTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATCCATCTTTGTTTGTGTTGAAGAAACTTTATCCAGA
ATTTAGGTCTTTGTTCTCTTTAAATTGTGATTCGTGTCAATTTACGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAG
TTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAACTTGGTTATATTTAATGAAA
AATCGTTCTGAGTTATTATCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATATTTTTC
TCATTCTCTTGGCTCTTACTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTGGAAGAGCGAAAAAATAGGCATTTACTTG
ACACTGCCCGTGCTTTATCGTTTCAAATGCATGTTTCAAAAATATTTTGGGTGGATGTTGTCTCTACAGCTTGCTTTTTGATTAATAGAATGCCTTCCTCTGTTCTTAAT
GGTGAGATTCCCTATCGTGTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAA
GTTAGATCCCAAATCCTTGAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGGTTTCGCCTGATGTTG
TCTTTTTTGAAGATACACCCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACT
GATGTGCCTCCTTCCCGCCCGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGC
GCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTA
TTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGT
ACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCT
TGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATA
AATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGAT
AAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATC
TGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCA
AAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAA
TATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCC
TGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAGACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGG
ATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATACATAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCT
GATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGTAGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTC
GAGTGCTGAGTCAGAATATAGAGCTATGGCACAATCTGTGTGTGAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTTCAGTATTACCGTGCCAGCTAAATTAT
GGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCCAGTATTTCATGAACGAACTAAGCATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGG
TTGGTATCCACAGGATATGTGAAGACTGAAGAACAATTGGGAGATATTCTAACTAAATCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGTTGGGCATGATCGA
CATATTTGCTCCAACTTGA
Protein sequenceShow/hide protein sequence
MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRRTILFYLRSTDMDDHMTEDPPKDAKQKNDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVH
RMFEVCMQFFRAEQKAESVTSYFMRLKKIIAELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDDAFTRVLRIESSPTSVSIPQPSSALFS
KNNNPRAPQRNSTDHRKPESVEIVCNYCRKPGHMKRDCRKLLYKNSQRSQHAQIASTCDIPEASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKW
VIDSGATAHMTGNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQLTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLF
DHQVSQVVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLFSLNCDSCQFTKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMK
NRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVEERKNRHLLDTARALSFQMHVSKIFWVDVVSTACFLINRMPSSVLN
GEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLST
DVPPSRPLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNG
TWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESD
KVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRK
YVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYIDHGHTRVECFSDA
DWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCEIVWIHQLLSEIGFSITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDG
LVSTGYVKTEEQLGDILTKSLNGTRISYLCNKLGMIDIFAPT