; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G13040 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G13040
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:11486940..11490463
RNA-Seq ExpressionCSPI07G13040
SyntenyCSPI07G13040
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]0.0e+0077.75Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MADIKNLVVSNVIPLASKITEHKLNGSNYYDWR+TILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL
        FLYSGKEQVHRMFEVCMQFFRA+QKAESVTSYFMRLKK I ELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILS+SKIPSLDDAFTRVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL

Query:  RIESSQTGV-----------KNNNPRAPQRNSTDHRKPESVEIVCNYCR---------------------------------------------------
        RIESS T V           KNNNPRAPQRNSTDHRKPESVEIVCNYCR                                                   
Subjt:  RIESSQTGV-----------KNNNPRAPQRNSTDHRKPESVEIVCNYCR---------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSS
                                                                        HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSS
Subjt:  ----------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSS

Query:  PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGII
        PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGII
Subjt:  PRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGII

Query:  HQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDP
        HQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDP
Subjt:  HQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDP

Query:  KSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSML
        KSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDV PSR LISQVYSRRPPPQPSDSCPPSML
Subjt:  KSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSML

Query:  PSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVF
        PSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI NS H+ALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVF
Subjt:  PSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVF

Query:  AVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLY
        AVKMNPDGTVARLKARLVAKGYAQIYGTDYS TFSPVAKLT  RLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQG SDKVCRLR SLY
Subjt:  AVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLY

Query:  GLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRK
        GLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVD+IVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRK
Subjt:  GLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRK

Query:  YVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHG
        YVLDLLSETGKLG KPSGTP+MPNQQLVKEGELCKDPERYRRLV KLNYLTVTR DI+YS+SVVSQFMSSPTVDHW AVEQILCYLKAAPGRGILYKDHG
Subjt:  YVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHG

Query:  HTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT-------------------------------------------KLWCDNQAALHIASNPVFH
        HTR+ECF   DWAGSREDRRSTSGYCVF+GGN VSWK+                                           KLWCDNQAALHIASNPVFH
Subjt:  HTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT-------------------------------------------KLWCDNQAALHIASNPVFH

Query:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        ERTKHIEVDCHFIREKIQDGLVSTGYVKT EQLGDILTKALNGTRISYLCNKLGMIDIFAPA
Subjt:  ERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]0.0e+0091.32Show/hide
Query:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
        P   E+ C    HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
Subjt:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR

Query:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
        SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
Subjt:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN

Query:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
        RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
Subjt:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE

Query:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
        DDNLFIYEVTSPTPSLSTDV PSR LISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
Subjt:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI

Query:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT
         NS H+ALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYS TFSPVAKLT  RLFLSMAAT
Subjt:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT

Query:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI
        NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQG SDKVCRLR SLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVD+IVI
Subjt:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI

Query:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV
        TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG KPSGTP+MPNQQLVKEGELCKDPERYRRLV KLNYLTV
Subjt:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV

Query:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------
        TR DI+YS+SVVSQFMSSPTVDHW AVEQILCYLKAAPGRGILYKDHGHTR+ECF   DWAGSREDRRSTSGYCVF+GGN VSWK+              
Subjt:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------

Query:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK
                                     KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKT EQLGDILTKALNGTRISYLCNK
Subjt:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK

Query:  LGMIDIFAPA
        LGMIDIFAPA
Subjt:  LGMIDIFAPA

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]0.0e+0091.32Show/hide
Query:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
        P   E+ C    HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
Subjt:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR

Query:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
        SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
Subjt:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN

Query:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
        RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
Subjt:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE

Query:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
        DDNLFIYEVTSPTPSLSTDV PSR LISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
Subjt:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI

Query:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT
         NS H+ALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYS TFSPVAKLT  RLFLSMAAT
Subjt:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT

Query:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI
        NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQG SDKVCRLR SLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVD+IVI
Subjt:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI

Query:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV
        TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG KPSGTP+MPNQQLVKEGELCKDPERYRRLV KLNYLTV
Subjt:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV

Query:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------
        TR DI+YS+SVVSQFMSSPTVDHW AVEQILCYLKAAPGRGILYKDHGHTR+ECF   DWAGSREDRRSTSGYCVF+GGN VSWK+              
Subjt:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------

Query:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK
                                     KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKT EQLGDILTKALNGTRISYLCNK
Subjt:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK

Query:  LGMIDIFAPA
        LGMIDIFAPA
Subjt:  LGMIDIFAPA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]0.0e+0091.32Show/hide
Query:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
        P   E+ C    HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
Subjt:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR

Query:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
        SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
Subjt:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN

Query:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
        RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
Subjt:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE

Query:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
        DDNLFIYEVTSPTPSLSTDV PSR LISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
Subjt:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI

Query:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT
         NS H+ALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYS TFSPVAKLT  RLFLSMAAT
Subjt:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT

Query:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI
        NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQG SDKVCRLR SLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVD+IVI
Subjt:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI

Query:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV
        TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG KPSGTP+MPNQQLVKEGELCKDPERYRRLV KLNYLTV
Subjt:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV

Query:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------
        TR DI+YS+SVVSQFMSSPTVDHW AVEQILCYLKAAPGRGILYKDHGHTR+ECF   DWAGSREDRRSTSGYCVF+GGN VSWK+              
Subjt:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------

Query:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK
                                     KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKT EQLGDILTKALNGTRISYLCNK
Subjt:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK

Query:  LGMIDIFAPA
        LGMIDIFAPA
Subjt:  LGMIDIFAPA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]0.0e+0091.32Show/hide
Query:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
        P   E+ C    HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR
Subjt:  PESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNR

Query:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
        SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN
Subjt:  SELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLIN

Query:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
        RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE
Subjt:  RMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGE

Query:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
        DDNLFIYEVTSPTPSLSTDV PSR LISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI
Subjt:  DDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSI

Query:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT
         NS H+ALSHP WQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYS TFSPVAKLT  RLFLSMAAT
Subjt:  SNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAAT

Query:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI
        NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQG SDKVCRLR SLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVD+IVI
Subjt:  NKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVI

Query:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV
        TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLG KPSGTP+MPNQQLVKEGELCKDPERYRRLV KLNYLTV
Subjt:  TGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTV

Query:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------
        TR DI+YS+SVVSQFMSSPTVDHW AVEQILCYLKAAPGRGILYKDHGHTR+ECF   DWAGSREDRRSTSGYCVF+GGN VSWK+              
Subjt:  TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------

Query:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK
                                     KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKT EQLGDILTKALNGTRISYLCNK
Subjt:  -----------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNK

Query:  LGMIDIFAPA
        LGMIDIFAPA
Subjt:  LGMIDIFAPA

TrEMBL top hitse value%identityAlignment
A0A438DZQ8 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0050.04Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN  V++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D   +K W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL
        FLYSGK  V +M++V   F   ++ A+S+T+YFM  KK   EL  L+PFSPDV+VQQ Q E+M VM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL

Query:  RIESSQTGVKNNNPRAPQRNSTDHRK--------------PESVEIVCNYCR------------------------------------------------
        R E+  +    N   A   N+ + R+               +S  IVC YC                                                 
Subjt:  RIESSQTGVKNNNPRAPQRNSTDHRK--------------PESVEIVCNYCR------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLS
                                                                          HPSL VLKKL P+F +L SL+C+SC FAK HR S
Subjt:  ------------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLS

Query:  SSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENG
          PR++KR  + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S  +Y+  NG
Subjt:  SSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENG

Query:  IIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKL
        I+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW DAVSTACFLINRMP+ VL G+IPY+ + P K LFP+AP+IFGC C+VRD RP  TKL
Subjt:  IIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKL

Query:  DPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS-----------------TDVP--PSRLLI
        DPK+L+C+FLGYSR+QKGYRC+ P L +YLVS D+VF EDT F SSP+S    ED+   +Y+V +  P++                   ++P  P++  I
Subjt:  DPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLS-----------------TDVP--PSRLLI

Query:  SQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALD
         Q+YSRR  P  +D+C P+  PSS DP+   DLPI+L KGKR C   Y +++F+SY  LS S+   + S++S S+  +  +AL+HP W+NAM+EE+ AL+
Subjt:  SQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALD

Query:  DNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYM
        DN TW LV  P GKK +GCKWVFAVK++PDG+VARLKARLVA+GYAQ YG DYS TFSPVAKL + RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+
Subjt:  DNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYM

Query:  EQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLG
        EQPPGFVAQG   KVCRL+ +LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYV++IVIT ND  GIS LKTF+  +F+TKDLG
Subjt:  EQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLG

Query:  QLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHW
        +LKYFLGIEV RSKK ++LSQRKYVLDLL ETGK+  KP  TP++PN QL+  +G+   +PER RR+V KLNYLTVTR DI+Y++SVVSQF  +PT+ HW
Subjt:  QLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHW

Query:  VAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------------------------------
         A+EQILCYLK APG GILY   GHTRIECF  TDWAGS+ DRRST+GYCVF GGN V+WK+                                      
Subjt:  VAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT--------------------------------------

Query:  -----KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTK
             KLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKT EQLG  L K
Subjt:  -----KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTK

A0A438HPS2 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0051.9Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN  V++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D   +K W++DDARL LQ+KNSI S+I+GL  HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   ++ A+S+T+YFM  KK   EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL

Query:  RIE----SSQTGV---KNNNPRAPQR-------NSTDHRKPESVEIVCNYCR------------------------------------------------
        R E    S  T V   K  N    +R        + ++R  +S  IVC YC                                                 
Subjt:  RIE----SSQTGV---KNNNPRAPQR-------NSTDHRKPESVEIVCNYCR------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLS
                                                                          HP L VLKKL P+F +L SL+C+SC FAK HR S
Subjt:  ------------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLS

Query:  SSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENG
          PR++KRA + FELVHSD+WG CPV SQTGF+YFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S  +Y+  NG
Subjt:  SSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENG

Query:  IIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKL
        I+HQ+SC DTPSQNGVAERKNRHLLET RAL FQM V K FWVDAVSTACFLIN MP+ VL G+IPY+V+ P K LFP+ P+IFGC C+VRD RP  TKL
Subjt:  IIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKL

Query:  DPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPS
        DPK+L+C+FLGYSR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+V +  P++           S V S    P  +D+C P+
Subjt:  DPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSCPPS

Query:  MLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGC
          PSS DP+   DL I+LRKGKR C   Y +++F+SY  LS S+   + S++S S+  +  +AL+HP W+NA++EE+ AL+DN TW LV  P GKK +GC
Subjt:  MLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGC

Query:  KWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLR
        KWVFAVK+NPDG+VARLKARLVAKGYAQ YG DYS TFSPVAKL + RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQG   KVCRL+
Subjt:  KWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLR

Query:  NSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYL
         +LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVD+IVITGND  GIS LKTF+  +F+TKDLG+ KYFLGIEV RSKKG++L
Subjt:  NSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYL

Query:  SQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGIL
        SQRKYVLDLL ETGK+  KP  TP++PN QL+  +G+   +PERYRR+V KLNYLTVTR DI+Y++SVVSQF S+PT+ HW A+EQILCYLK A G GIL
Subjt:  SQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGIL

Query:  YKDHGHTRIECFFYTDWAGSREDRRSTS---------GYCVFIG--------GNSVSWKTKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGL
        Y   GHTRIECF   DWAGS+ DRRST+           C  I         G   +   KLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ L
Subjt:  YKDHGHTRIECFFYTDWAGSREDRRSTS---------GYCVFIG--------GNSVSWKTKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGL

Query:  VSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
        VSTGYVKT EQLGDI TKALNGTR+ Y CNKLGMI+I+APA
Subjt:  VSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0051.36Show/hide
Query:  LRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKM
        ++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LDFLYSGK  V RM++V   F   ++ A+S+T+YFM  KK   EL  L+PFSPDV+VQQ QRE+M
Subjt:  LRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKM

Query:  AVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVLRIESSQTGVKNN---------------NPRAPQRNSTDHRKPESVEIVCNYCR----------
        AVM FL+GL  EF  AK+QILS S I SL + F+RVLR E+  +    N               N R   R   +     S  IVC YC           
Subjt:  AVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVLRIESSQTGVKNN---------------NPRAPQRNSTDHRKPESVEIVCNYCR----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFC
            HPSL VLKKL P+F +L SL+C+SC FAK HR S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFC
Subjt:  ----HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFC

Query:  AFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLN
        AF  EIK Q++VS+K LR+DN  EY S+S  +Y+ +NGI+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW DAVSTACFLINRMP+ VL 
Subjt:  AFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLN

Query:  GEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYE
        G+IPY+V+ P K LFP+AP+IFGC C+VRD RP  TKLDPK+L+C+FLGYSR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+
Subjt:  GEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYE

Query:  VTSPTP------------SLSTDVP-------PSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKR--KCTYPVSSFISYHQLSPS
        V +  P            SL+   P       P++  I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LRKGKR  K  Y +++F+SY  LS S
Subjt:  VTSPTP------------SLSTDVP-------PSRLLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKR--KCTYPVSSFISYHQLSPS

Query:  TYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAK
        +   + S++S S+  +  +AL+HP W+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+N DG+VARLKARLVA+GYAQ YG DYS TFSPVAK
Subjt:  TYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAK

Query:  L-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG
        L + RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQG   KVCRL+ +LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  G
Subjt:  L-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG

Query:  IVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPE
        I+LLVVYVD+IVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+  KP  TP++PN QL+  +G+   +PE
Subjt:  IVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPE

Query:  RYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT
        RYRR+V KLNYLTVTR DI+Y++SVVSQF S+PT+ HW A+EQILCYLK APG GILY   GHTRIECF   DWAGS+ DRRST+GYCVF GGN V+WK+
Subjt:  RYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT

Query:  -------------------------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILT
                                                   KLWCDNQAALHIA+NP++HERTKHIEVDCHFIREKI++ LVSTGYVKT EQLGDI T
Subjt:  -------------------------------------------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILT

Query:  KALNGTRISYLCNKLGMIDIFAPA
        KALNGTR+ Y CNKLGMI+I+APA
Subjt:  KALNGTRISYLCNKLGMIDIFAPA

B0FBS2 Uncharacterized protein0.0e+0051.69Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        M + KN  V++++P+ SKITEHKLNGSNY +W +TI  YLRS   DDH+TE+PP D   +K W++DDARL+LQ+KNSI S+I+GL+ HCE VKEL+++LD
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL
        FLYSGK  V RM++V   F   ++ A+S+T+YFM  KK   EL  L+PFSPDV+VQQ QRE+MAVM FL+GL  EF  AK+QILS S I SL + F+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL

Query:  RIESSQTGVKNNNPRAPQRNSTDHRK---------------PESVEIVCNYCR-----------------------------------------------
        R E+  +    N   A + N+ + R+                 S  IVC YC                                                
Subjt:  RIESSQTGVKNNNPRAPQRNSTDHRK---------------PESVEIVCNYCR-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRL
                                                                           HPSL VLKKL P+F +L SL+C+SC FAK HR 
Subjt:  -------------------------------------------------------------------HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRL

Query:  SSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCEN
        S  PR++KRA + FELVHSD+WGPCPV SQTGFRYFVTFVDD SR+TW+Y MKNRSE+ SHFCAF  EIK Q++VS+K LR+DN  EY S+S  +Y+  N
Subjt:  SSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCEN

Query:  GIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTK
        GI+HQ+SC DTPSQNGVAERKNRHLLETARAL FQM V K FW DAVSTACFLINRMP+ VL G+IPY+V+ P K LFP+AP+IFGC C+VRD RP  TK
Subjt:  GIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTK

Query:  LDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRLL
        LDPK+L+C+FLGYSR+QKGYRC+ P L +YLVS DVVF EDT F SSP+S    ED+   +Y+V +  P            SL+   P       P++  
Subjt:  LDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTP------------SLSTDVP-------PSRLL

Query:  ISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTAL
        I QVYSRR  P  +D+C P+  PSS DP+   DLPI+LRKGKR C   Y +++F+SY  LS S+   + S++S S+  +  +AL+HP W+NAM+EE+ AL
Subjt:  ISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKC--TYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTAL

Query:  DDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVY
        +DN TW LV  P GKK +GCKWVFAVK+NPDG+VARLKARLVA+GYAQ YG DYS TFSPVAKL + RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY
Subjt:  DDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVY

Query:  MEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDL
        +EQPPGFVAQG   KVCRL+ +LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVD+IVITGND  GIS LKTF+  +F+TKDL
Subjt:  MEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDL

Query:  GQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDH
        G+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+  KP  TP++PN QL+  +G+   +PERYRR+V KLNYLTVTR DI+Y++SVVSQF S+PT+ H
Subjt:  GQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLV-KEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDH

Query:  WVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT-------------------------------------
        W A+EQILCYLK APG GILY   GHTRIECF   DWAGS+ DRRST+GYCVF GGN V+WK+                                     
Subjt:  WVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKT-------------------------------------

Query:  ------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMIDIFAPA
              KLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKT EQLGDI TKALNGTR+ Y CNKLGMI+I+APA
Subjt:  ------KLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMIDIFAPA

Q6L3Q0 Polyprotein, putative0.0e+0051.07Show/hide
Query:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD
        MA+ K+  V+ ++P  SKIT+ KLNGSNY DW + I  YLRS + DDH+ +DPP D   KK WLRDDARL LQI NSI++E++GLV+HCE VKEL+++L+
Subjt:  MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLD

Query:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL
        +LYSGK  + R++EV   F+R++++A+S+T+YFM  KK   EL +LLPFS D+KVQQ QRE+MA+M FL GL  EF  AK+QILS+S+I SL D F++VL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVL

Query:  RIES---------------------------------------------------------------SQTGVKNNNPR----------------------
        R ES                                                               +  GV NNN                        
Subjt:  RIES---------------------------------------------------------------SQTGVKNNNPR----------------------

Query:  ------------------------------------------------APQRN-------------------STD--------------HRKPESVEIV-
                                                        AP  N                   +TD              H+ P SV IV 
Subjt:  ------------------------------------------------APQRN-------------------STD--------------HRKPESVEIV-

Query:  --------------------------------------------------------------------------------------CN--------YCR-
                                                                                              C+        +CR 
Subjt:  --------------------------------------------------------------------------------------CN--------YCR-

Query:  -HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFH
         HPSL VLKKL P+F ++ S++C+SC FAK HR+S SPR +KRA   FELVHSD+WGPCPVVS+ GFRYFVTF+DD SR+TW+Y MKNRSE+ SHF  F 
Subjt:  -HPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFH

Query:  TEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEI
         EIK QFN S+  LR+DNA E+ S S  +Y+ + GI+HQSSC DTPSQNGVAERKNRHLLETAR L FQM V K FW D VSTA FLINRMPS+VLNG+I
Subjt:  TEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEI

Query:  PYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSL---CQGEDDNLFIYE
        PY VLFP K LFP+ PK+FG  C+VRDVRPH TKLDPK+LKC+FLGYSR+QKGYRCY PTL RY+VS DVVF E   F SSP +     Q ED+   IY 
Subjt:  PYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSL---CQGEDDNLFIYE

Query:  VTSPTPSLSTDVPPS-----------------RLLISQVYSRRPPPQPSDSCPPSMLPSS----CDPAPSD--DLPIALRKGKRKC--TYPVSSFISYHQ
         T        +VP S                 +  I QVYSRR     +D+CP   L SS     +P+P++  D+PIALRKGKR+C   Y +++FISY  
Subjt:  VTSPTPSLSTDVPPS-----------------RLLISQVYSRRPPPQPSDSCPPSMLPSS----CDPAPSD--DLPIALRKGKRKC--TYPVSSFISYHQ

Query:  LSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFS
        LSP++ + I SL+S  +  +  +AL+HP W +AM++E+ ALDDN TW+LV  P GKKA+GCKWVF +K+NPDG++ARLKARLVAKGYAQ YG DYS TFS
Subjt:  LSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFS

Query:  PVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR
        PVAKLT  RLF+S+AA+  W LHQL IKNAFLHGDLQEEVYMEQPPGFVAQG + KVC L+  LYGLKQSPRAWFGKFS+ +  FG+ KS  DHSVFYR+
Subjt:  PVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR

Query:  SEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVK-EGELC
        S  GI+LLVVYVD+IVIT +D  GISSLK FL   F+TKDLGQLKYFLGIEV RSKKGI+LSQRKY+LDLL ETGK   KP  TP++PN QL   +G+  
Subjt:  SEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVK-EGELC

Query:  KDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSV
         DPERYRRLV KLNYLTVTR DIS+++S+VSQFMS+PT+ HW A+EQILCYLK APG GI+Y+++ HTRIECF   DWAGS+ DRRST+GYCVF+GGN V
Subjt:  KDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSV

Query:  SWK---------------------TKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMI
        SW+                      KLWCDNQAALHIASNPV+HERTKHIEVDCHFIREKIQ+ L+ST YVKT EQL D+ TKAL G R++YLCNKLGMI
Subjt:  SWK---------------------TKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMI

Query:  DIFAPA
        +I+APA
Subjt:  DIFAPA

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-10429.28Show/hide
Query:  LSSLNCDSCQFAKFHRLSSSPRVDKRAI-APFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRT
        LS   C+ C   K  RL      DK  I  P  +VHSD+ GP   V+     YFV FVD  +     YL+K +S++ S F  F  + +  FN+ +  L  
Subjt:  LSSLNCDSCQFAKFHRLSSSPRVDKRAI-APFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRT

Query:  DNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPI
        DN  EY S+ +  +  + GI +  +   TP  NGV+ER  R + E AR +     + K FW +AV TA +LINR+PS  L  + + PY  ++  K  +  
Subjt:  DNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVL--NGEIPYRVLFPTKHLFPI

Query:  APKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDV-----------------VFFEDTP------FTSSPSSLCQGE--
          ++FG   +V  ++    K D KS K IF+GY     G++ +    ++++V+ DV                 VF +D+       F +    + Q E  
Subjt:  APKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDV-----------------VFFEDTP------FTSSPSSLCQGE--

Query:  ------------------------DDNLFIYEVTSPTPS--------LSTDVPPSRLLISQVYSRRPPPQPSDS----CPPSMLPSSC----------DP
                                +D+  I +   P  S        L      ++  +++   R+     ++S     P     S            +P
Subjt:  ------------------------DDNLFIYEVTSPTPS--------LSTDVPPSRLLISQVYSRRPPPQPSDS----CPPSMLPSSC----------DP

Query:  APSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSISNSAHKALSHPD---WQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFA
          +D + I  R+ +R  T P    ISY++   S    + +  +    + NS  +     D   W+ A+  E+ A   N TW +  RP  K  +  +WVF+
Subjt:  APSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLES--TSISNSAHKALSHPD---WQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFA

Query:  VKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYG
        VK N  G   R KARLVA+G+ Q Y  DY  TF+PVA++ +FR  LS+       +HQ+D+K AFL+G L+EE+YM  P G      SD VC+L  ++YG
Subjt:  VKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKL-TFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYG

Query:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQR
        LKQ+ R WF  F QAL       S+ D  ++   + +    + +++YVD++VI   D   +++ K +L  +F   DL ++K+F+GI +   +  IYLSQ 
Subjt:  LKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQR

Query:  KYVLDLLSETGKLGEKPSGTPIMP--NQQLVKEGELCKDPERYRRLVKKLNYLTV-TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILY
         YV  +LS+          TP+    N +L+   E C  P   R L+  L Y+ + TR D++ +++++S++ S    + W  ++++L YLK      +++
Subjt:  KYVLDLLSETGKLGEKPSGTPIMP--NQQLVKEGELCKDPERYRRLVKKLNYLTV-TRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILY

Query:  KDH--GHTRIECFFYTDWAGSREDRRSTSGYCV-FIGGNSVSWKTK------------------------LWC-------------------DNQAALHI
        K +     +I  +  +DWAGS  DR+ST+GY       N + W TK                        LW                    DNQ  + I
Subjt:  KDH--GHTRIECFFYTDWAGSREDRRSTSGYCV-FIGGNSVSWKTK------------------------LWC-------------------DNQAALHI

Query:  ASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMI
        A+NP  H+R KHI++  HF RE++Q+ ++   Y+ TE QL DI TK L   R   L +KLG++
Subjt:  ASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.4e-13734.38Show/hide
Query:  CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEY
        CD C F K HR+S      +R +   +LV+SD+ GP  + S  G +YFVTF+DD SR  W+Y++K + ++   F  FH  ++ +    +K LR+DN GEY
Subjt:  CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGEY

Query:  FSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCV
         S     Y   +GI H+ +   TP  NGVAER NR ++E  R++     + K FW +AV TAC+LINR PS  L  EIP RV +  K +     K+FGC 
Subjt:  FSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCV

Query:  CFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSR
         F    +   TKLD KS+ CIF+GY   + GYR + P  K+ + S DVVF E    T++  S  +   + +    VT P+ S +                
Subjt:  CFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSR

Query:  RPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIS----------NSAHKALSHPD---WQNAMIEE
           P  ++S    +      P    +    L +G  +  +P      +  L  S    + S    S             S  + LSHP+      AM EE
Subjt:  RPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIS----------NSAHKALSHPD---WQNAMIEE

Query:  MTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQ
        M +L  NGT+ LV  P GK+ + CKWVF +K + D  + R KARLV KG+ Q  G D+   FSPV K+T  R  LS+AA+    + QLD+K AFLHGDL+
Subjt:  MTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQ

Query:  EEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQF
        EE+YMEQP GF   G    VC+L  SLYGLKQ+PR W+ KF   +      K+ SD  V+++R SE   ++L++YVD+++I G D   I+ LK  L   F
Subjt:  EEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQF

Query:  YTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVK---------EGELCKDPERYRRLVKKLNYLTV-TRSDISY
          KDLG  +  LG++++R +  + ++LSQ KY+  +L        KP  TP+  + +L K         +G + K P  Y   V  L Y  V TR DI++
Subjt:  YTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVK---------EGELCKDPERYRRLVKKLNYLTV-TRSDISY

Query:  SISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTKL-------------------
        ++ VVS+F+ +P  +HW AV+ IL YL+   G  + +       ++ +   D AG  ++R+S++GY     G ++SW++KL                   
Subjt:  SISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTKL-------------------

Query:  -----------------------WCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKL
                               +CD+Q+A+ ++ N ++H RTKHI+V  H+IRE + D  +    + T E   D+LTK +   +   LC +L
Subjt:  -----------------------WCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKL

P92519 Uncharacterized mitochondrial protein AtMg008104.8e-3236.55Show/hide
Query:  LVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRR
        L++YVD+I++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRR

Query:  LVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK
        +V  L YLT+TR DISY++++V Q M  PT+  +  ++++L Y+K     G+    +    ++ F  +DWAG    RRST+G+C F+G N +SW  K
Subjt:  LVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-15133.43Show/hide
Query:  NGLLPEFGMAKTQILS-NSKIPSLDDAFTRVL---RIESSQTGVKNNNPRAPQRNSTDHRKPESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLN------
        NG+  EF  A  Q+   N+ +P L       L    I SSQ      +P +   +S+ H +            HP+  +L  +   + SLS LN      
Subjt:  NGLLPEFGMAKTQILS-NSKIPSLDDAFTRVL---RIESSQTGVKNNNPRAPQRNSTDHRKPESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLN------

Query:  -CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGE
         C  C   K +++  S +    +  P E ++SD+W   P++S   +RY+V FVD  +R TWLY +K +S++   F  F   ++N+F   I T  +DN GE
Subjt:  -CDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDNAGE

Query:  YFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGC
        +   +L  Y  ++GI H +S   TP  NG++ERK+RH++ET   L     + K +W  A + A +LINR+P+ +L  E P++ LF T   +    ++FGC
Subjt:  YFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGC

Query:  VCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDT-PFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVP--PSRLLIS
         C+   +RP++  KLD KS +C+FLGYS  Q  Y C      R  +S  V F E+  PF++  ++L   ++       V SP  +L T  P  P+     
Subjt:  VCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDT-PFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVP--PSRLLIS

Query:  QVYSRRPPPQPS-------------DSCPPSMLPSSCDPA------------------------------PSDDLPIALRK------------------G
          ++  PP  PS             DS   S  PSS +P                               P+++ P  L +                   
Subjt:  QVYSRRPPPQPS-------------DSCPPSMLPSSCDPA------------------------------PSDDLPIALRK------------------G

Query:  KRKCTYPVSSFISYHQLSP------------------------------STYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPA
            T P    I  H   P                                Y+   SL + S   +A +AL    W+NAM  E+ A   N TWDLV  P 
Subjt:  KRKCTYPVSSFISYHQLSP------------------------------STYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPA

Query:  GKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGA
            I GC+W+F  K N DG++ R KARLVAKGY Q  G DY+ TFSPV K T  R+ L +A    W + QLD+ NAFL G L ++VYM QPPGF+ +  
Subjt:  GKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGA

Query:  SDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVM
         + VC+LR +LYGLKQ+PRAW+ +    L+  G   S SD S+F  +  K IV ++VYVD+I+ITGND   + +    L  +F  KD  +L YFLGIE  
Subjt:  SDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVM

Query:  RSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQL-VKEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLK
        R   G++LSQR+Y+LDLL+ T  +  KP  TP+ P+ +L +  G    DP  YR +V  L YL  TR DISY+++ +SQFM  PT +H  A+++IL YL 
Subjt:  RSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQL-VKEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLK

Query:  AAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK-------------------------------------------LWCDN
          P  GI  K      +  +   DWAG ++D  ST+GY V++G + +SW +K                                           ++CDN
Subjt:  AAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK-------------------------------------------LWCDN

Query:  QAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGM
          A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LTK L+ T      +K+G+
Subjt:  QAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.3e-15034.23Show/hide
Query:  HPSLFVLKKLYPEFR------SLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSH
        HPSL +L  +           S   L+C  C   K H++  S      +  P E ++SD+W   P++S   +RY+V FVD  +R TWLY +K +S++   
Subjt:  HPSLFVLKKLYPEFR------SLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSH

Query:  FCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSV
        F  F + ++N+F   I TL +DN GE+    L  YL ++GI H +S   TP  NG++ERK+RH++E    L     V K +W  A S A +LINR+P+ +
Subjt:  FCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSV

Query:  LNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVF------FEDTPFTSSPSSLCQG
        L  + P++ LF     +    K+FGC C+   +RP++  KL+ KS +C F+GYS  Q  Y C      R   S  V F      F  T F  S S   + 
Subjt:  LNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHT-KLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVF------FEDTPFTSSPSSLCQG

Query:  EDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSC----PPSMLPSSCDPAPSDDLPIA--------------------------------
        +    +    T PT  L    PP  L      S RPP  PS  C      S LPSS   +PS   P A                                
Subjt:  EDDNLFIYEVTSPTPSLSTDVPPSRLLISQVYSRRPPPQPSDSC----PPSMLPSSCDPAPSDDLPIA--------------------------------

Query:  --LRKGKRKCTYPVSSFISYHQLSPST-------------------------------------------------------YAFITSLESTSISNSAHK
               +    P S   S H  +PST                                                       Y++ TSL + S   +A +
Subjt:  --LRKGKRKCTYPVSSFISYHQLSPST-------------------------------------------------------YAFITSLESTSISNSAHK

Query:  ALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSL
        A+    W+ AM  E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKGY Q  G DY+ TFSPV K T  R+ L +A    W +
Subjt:  ALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSL

Query:  HQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDA
         QLD+ NAFL G L +EVYM QPPGFV +   D VCRLR ++YGLKQ+PRAW+ +    L+  G   S SD S+F  +  + I+ ++VYVD+I+ITGND 
Subjt:  HQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRNSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDA

Query:  LGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQL-VKEGELCKDPERYRRLVKKLNYLTVTRSD
        + +      L  +F  K+   L YFLGIE  R  +G++LSQR+Y LDLL+ T  L  KP  TP+  + +L +  G    DP  YR +V  L YL  TR D
Subjt:  LGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQL-VKEGELCKDPERYRRLVKKLNYLTVTRSD

Query:  ISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK-----------------
        +SY+++ +SQ+M  PT DHW A++++L YL   P  GI  K      +  +   DWAG  +D  ST+GY V++G + +SW +K                 
Subjt:  ISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK-----------------

Query:  --------------------------LWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMI
                                  ++CDN  A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LTK L+         K+G+I
Subjt:  --------------------------LWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMI

Query:  DI
         +
Subjt:  DI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.7e-11340.81Show/hide
Query:  YPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQ
        + +S F+SY ++SP  ++F+  +      ++ ++A     W  AM +E+ A++   TW++ + P  KK IGCKWV+ +K N DGT+ R KARLVAKGY Q
Subjt:  YPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQ

Query:  IYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGAS---DKVCRLRNSLYGLKQSPRAWFGKFSQALVCF
          G D+  TFSPV KLT  +L L+++A   ++LHQLDI NAFL+GDL EE+YM+ PPG+ A QG S   + VC L+ S+YGLKQ+ R WF KFS  L+ F
Subjt:  IYGTDYSYTFSPVAKLT-FRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGAS---DKVCRLRNSLYGLKQSPRAWFGKFSQALVCF

Query:  GMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTP
        G  +S SDH+ F + +    + ++VYVD+I+I  N+   +  LK+ L+  F  +DLG LKYFLG+E+ RS  GI + QRKY LDLL ETG LG KPS  P
Subjt:  GMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTP

Query:  IMPNQQL-VKEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDR
        + P+       G    D + YRRL+ +L YL +TR DIS++++ +SQF  +P + H  AV +IL Y+K   G+G+ Y      +++ F    +   ++ R
Subjt:  IMPNQQL-VKEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDR

Query:  RSTSGYCVFIGGNSVSWKTK-------------------------------------------LWCDNQAALHIASNPVFHERTKHIEVDCHFIREK-IQ
        RST+GYC+F+G + +SWK+K                                           L+CDN AA+HIA+N VFHERTKHIE DCH +RE+ + 
Subjt:  RSTSGYCVFIGGNSVSWKTK-------------------------------------------LWCDNQAALHIASNPVFHERTKHIEVDCHFIREK-IQ

Query:  DGLVSTGYVKTEEQLG--DILTKALNGTRISYLCNKLGMIDIFA
           +S  +   +EQ G  + L+  L GT I Y+ +  G+  + A
Subjt:  DGLVSTGYVKTEEQLG--DILTKALNGTRISYLCNKLGMIDIFA

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.3e-1033.33Show/hide
Query:  YLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFI
        YLT+TR D++++++ +SQF S+       AV ++L Y+K   G+G+ Y      +++ F  +DWA   + RRS +G+C  +
Subjt:  YLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFI

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.7e-0530.23Show/hide
Query:  NRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLK
        NR ++E  R++  +  + K F  DA +TA  +IN+ PS+ +N  +P  V F +   +    + FGCV +   +     KL P++ K
Subjt:  NRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYRVLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLK

ATMG00810.1 DNA/RNA polymerases superfamily protein3.4e-3336.55Show/hide
Query:  LVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRR
        L++YVD+I++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRR

Query:  LVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK
        +V  L YLT+TR DISY++++V Q M  PT+  +  ++++L Y+K     G+    +    ++ F  +DWAG    RRST+G+C F+G N +SW  K
Subjt:  LVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSREDRRSTSGYCVFIGGNSVSWKTK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.6e-1750Show/hide
Query:  ALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAK-LTFRLFLSMA
        AL  P W  AM EE+ AL  N TW LV  P  +  +GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  +  T+SPV +  T R  L++A
Subjt:  ALSHPDWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAK-LTFRLFLSMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCTAATTATTACGATTGGCGTCAGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGAAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGTTCATTGTTGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTCCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTAACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGCGTCCTTCGCATTGAAAGCTCTCAGACTGGTGTCAAGAACAATAACCCTCGGGCACCTCAGAGGAAT
AGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTCATCCATCTTTGTTTGTGTTGAAGAAACTTTATCCAGAATTTAGGTCTTTGTCCTC
TTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGG
GTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAACTTGGTTATATTTAATGAAAAATCGTTCTGAGTTATTA
TCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATATTTTTCTCATTCTCTTGGCTCTTA
CTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGGAAAAATAGGCATTTACTTGAAACTGCCCGTGCTTTAT
CGTTTCAAATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTCTCTACAGCTTGTTTTTTGATTAATAGGATGCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGT
GTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTT
GAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGGTTTCGCCTGATGTTGTCTTTTTTGAAGATACAC
CCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGC
CTGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCC
CATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAATCCA
CATCTATTTCTAACTCTGCTCATAAAGCTTTGTCTCATCCTGACTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCT
CGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGATTGAAGGCTCGCCTTGTTGCCAAAGGTTATGC
TCAAATATATGGAACTGATTATTCATATACATTCTCTCCAGTTGCCAAATTAACTTTTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTG
ACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGCGAGTGATAAAGTATGTCGCCTTCGAAAC
TCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCG
CCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATAATATTGTTATTACTGGAAATGATGCATTGGGTATTTCATCTCTCAAAACTTTCCTTCAAGGTCAGT
TTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCT
GAGACAGGAAAATTAGGCGAAAAACCAAGTGGCACTCCAATAATGCCAAATCAACAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGT
TAAGAAGTTGAACTACTTAACAGTGACTCGATCAGACATTTCCTATTCTATAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGTTGCAGTAGAGC
AGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATATAAAGATCATGGACATACGAGAATTGAATGTTTTTTTTATACTGATTGGGCGGGGTCTCGTGAG
GATAGGAGATCAACTTCTGGATATTGTGTCTTTATAGGAGGAAACTCAGTTTCATGGAAGACTAAATTATGGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCC
AGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGAAGAACAATTGG
GAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGCTGGGCATGATCGACATATTTGCTCCAGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGACATAAAAAATTTGGTAGTCTCCAACGTTATTCCCTTGGCCTCTAAGATCACAGAACATAAGTTAAATGGATCTAATTATTACGATTGGCGTCAGACAATTTT
ATTTTATTTGAGAAGTACTGATATGGATGATCATATGACTGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGCTTCGTGATGATGCCCGTTTATATCTTCAGA
TCAAGAATTCAATTGAGAGTGAGATAATTGGATTGGTTGATCACTGTGAGTCTGTTAAAGAACTTTTGGAATTTTTGGATTTTCTATACTCAGGTAAAGAGCAAGTGCAT
AGAATGTTTGAAGTTTGTATGCAATTTTTTCGTGCGAAACAGAAAGCTGAGTCTGTCACCAGCTACTTTATGCGGCTTAAGAAGTTCATTGTTGAGCTTGGCTTGTTGTT
ACCTTTTAGTCCTGATGTTAAAGTTCAACAAGTTCAACGAGAGAAGATGGCTGTTATGATTTTCCTGAATGGACTCTTACCTGAATTTGGAATGGCAAAGACACAGATTC
TCTCTAACTCCAAGATTCCATCATTAGATGATGCCTTCACTCGCGTCCTTCGCATTGAAAGCTCTCAGACTGGTGTCAAGAACAATAACCCTCGGGCACCTCAGAGGAAT
AGTACTGATCATCGAAAACCAGAGTCTGTAGAGATTGTTTGTAACTACTGTCGTCATCCATCTTTGTTTGTGTTGAAGAAACTTTATCCAGAATTTAGGTCTTTGTCCTC
TTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGAGTCGATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGG
GTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACTTTTGTTGACGATCATTCTCGTCTAACTTGGTTATATTTAATGAAAAATCGTTCTGAGTTATTA
TCTCATTTTTGTGCCTTTCATACTGAAATAAAAAACCAATTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATATTTTTCTCATTCTCTTGGCTCTTA
CTTGTGTGAAAATGGCATTATTCATCAATCTTCCTGTGCTGACACTCCATCTCAAAATGGTGTTGCAGAGCGGAAAAATAGGCATTTACTTGAAACTGCCCGTGCTTTAT
CGTTTCAAATGCATGTTTCAAAAATCTTTTGGGTTGATGCTGTCTCTACAGCTTGTTTTTTGATTAATAGGATGCCTTCCTCTGTTCTTAATGGTGAGATTCCCTATCGT
GTTCTTTTTCCTACCAAGCATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTGTCTGTTTTGTTCGTGACGTTCGTCCTCATCATACTAAGTTAGATCCCAAATCCTT
GAAGTGTATCTTCTTGGGTTATTCACGTGTTCAAAAGGGTTATCGTTGTTATTGTCCTACCCTTAAAAGGTATCTGGTTTCGCCTGATGTTGTCTTTTTTGAAGATACAC
CCTTTACTTCATCACCATCGAGTTTGTGTCAGGGGGAGGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTTGTCTACTGATGTGCCTCCTTCCCGC
CTGTTGATTTCTCAAGTCTACTCCCGACGACCTCCACCACAACCTTCAGACTCATGTCCTCCATCAATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCC
CATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCAGTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAATCCA
CATCTATTTCTAACTCTGCTCATAAAGCTTTGTCTCATCCTGACTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCT
CGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGATTGAAGGCTCGCCTTGTTGCCAAAGGTTATGC
TCAAATATATGGAACTGATTATTCATATACATTCTCTCCAGTTGCCAAATTAACTTTTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTG
ACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGCGAGTGATAAAGTATGTCGCCTTCGAAAC
TCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCG
CCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATAATATTGTTATTACTGGAAATGATGCATTGGGTATTTCATCTCTCAAAACTTTCCTTCAAGGTCAGT
TTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCT
GAGACAGGAAAATTAGGCGAAAAACCAAGTGGCACTCCAATAATGCCAAATCAACAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGT
TAAGAAGTTGAACTACTTAACAGTGACTCGATCAGACATTTCCTATTCTATAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGTTGCAGTAGAGC
AGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGTGGGATCCTATATAAAGATCATGGACATACGAGAATTGAATGTTTTTTTTATACTGATTGGGCGGGGTCTCGTGAG
GATAGGAGATCAACTTCTGGATATTGTGTCTTTATAGGAGGAAACTCAGTTTCATGGAAGACTAAATTATGGTGTGATAATCAAGCTGCACTTCATATTGCATCTAATCC
AGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAGACCGAAGAACAATTGG
GAGATATTCTAACTAAAGCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGCTGGGCATGATCGACATATTTGCTCCAGCCTGA
Protein sequenceShow/hide protein sequence
MADIKNLVVSNVIPLASKITEHKLNGSNYYDWRQTILFYLRSTDMDDHMTEDPPKDAKQKKDWLRDDARLYLQIKNSIESEIIGLVDHCESVKELLEFLDFLYSGKEQVH
RMFEVCMQFFRAKQKAESVTSYFMRLKKFIVELGLLLPFSPDVKVQQVQREKMAVMIFLNGLLPEFGMAKTQILSNSKIPSLDDAFTRVLRIESSQTGVKNNNPRAPQRN
STDHRKPESVEIVCNYCRHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELL
SHFCAFHTEIKNQFNVSIKTLRTDNAGEYFSHSLGSYLCENGIIHQSSCADTPSQNGVAERKNRHLLETARALSFQMHVSKIFWVDAVSTACFLINRMPSSVLNGEIPYR
VLFPTKHLFPIAPKIFGCVCFVRDVRPHHTKLDPKSLKCIFLGYSRVQKGYRCYCPTLKRYLVSPDVVFFEDTPFTSSPSSLCQGEDDNLFIYEVTSPTPSLSTDVPPSR
LLISQVYSRRPPPQPSDSCPPSMLPSSCDPAPSDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSISNSAHKALSHPDWQNAMIEEMTALDDNGTWDLVS
RPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSYTFSPVAKLTFRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGASDKVCRLRN
SLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDNIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLS
ETGKLGEKPSGTPIMPNQQLVKEGELCKDPERYRRLVKKLNYLTVTRSDISYSISVVSQFMSSPTVDHWVAVEQILCYLKAAPGRGILYKDHGHTRIECFFYTDWAGSRE
DRRSTSGYCVFIGGNSVSWKTKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTEEQLGDILTKALNGTRISYLCNKLGMIDIFAPA