; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G19490 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G19490
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr1:15087578..15089820
RNA-Seq ExpressionCSPI01G19490
SyntenyCSPI01G19490
Gene Ontology termsGO:0007165 - signal transduction (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003953 - NAD+ nucleosidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]2.2e-29374.31Show/hide
Query:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAP+DDLPIALRKGKRKCTYPVSSFISYHQ                                                               
Subjt:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA
                                                                                                     LSPSTYA
Subjt:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA

Query:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
        FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
Subjt:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS

Query:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
        IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
Subjt:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL

Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN
        VVSRSSA+                          ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTK+LN
Subjt:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN

Query:  GTRISYLCNKLGMIDIFAPA
        GTRISYLCNKLGMIDIFAPA
Subjt:  GTRISYLCNKLGMIDIFAPA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]2.2e-29374.31Show/hide
Query:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAP+DDLPIALRKGKRKCTYPVSSFISYHQ                                                               
Subjt:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA
                                                                                                     LSPSTYA
Subjt:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA

Query:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
        FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
Subjt:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS

Query:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
        IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
Subjt:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL

Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN
        VVSRSSA+                          ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTK+LN
Subjt:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN

Query:  GTRISYLCNKLGMIDIFAPA
        GTRISYLCNKLGMIDIFAPA
Subjt:  GTRISYLCNKLGMIDIFAPA

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]2.2e-29374.31Show/hide
Query:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAP+DDLPIALRKGKRKCTYPVSSFISYHQ                                                               
Subjt:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA
                                                                                                     LSPSTYA
Subjt:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA

Query:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
        FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
Subjt:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS

Query:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
        IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
Subjt:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL

Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN
        VVSRSSA+                          ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTK+LN
Subjt:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN

Query:  GTRISYLCNKLGMIDIFAPA
        GTRISYLCNKLGMIDIFAPA
Subjt:  GTRISYLCNKLGMIDIFAPA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]2.2e-29374.31Show/hide
Query:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAP+DDLPIALRKGKRKCTYPVSSFISYHQ                                                               
Subjt:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA
                                                                                                     LSPSTYA
Subjt:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA

Query:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
        FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
Subjt:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS

Query:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
        IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
Subjt:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL

Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN
        VVSRSSA+                          ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTK+LN
Subjt:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN

Query:  GTRISYLCNKLGMIDIFAPA
        GTRISYLCNKLGMIDIFAPA
Subjt:  GTRISYLCNKLGMIDIFAPA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]2.2e-29374.31Show/hide
Query:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAP+DDLPIALRKGKRKCTYPVSSFISYHQ                                                               
Subjt:  MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA
                                                                                                     LSPSTYA
Subjt:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYA

Query:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
        FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS
Subjt:  FITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTS

Query:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
        IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL
Subjt:  IRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVL

Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN
        VVSRSSA+                          ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTK+LN
Subjt:  VVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLN

Query:  GTRISYLCNKLGMIDIFAPA
        GTRISYLCNKLGMIDIFAPA
Subjt:  GTRISYLCNKLGMIDIFAPA

TrEMBL top hitse value%identityAlignment
A0A438DTG5 Retrovirus-related Pol polyprotein from transposon RE14.6e-22571.13Show/hide
Query:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        D LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK++PDG+VARLKARLVA+GYAQ YG DYSDT
Subjt:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY
        FSPVAKL S+RLF+S+ A+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY
Subjt:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY

Query:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE
        ++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ET K+ AKP  TPM+PN QL+  +G+
Subjt:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE

Query:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN
           +PERYRR+VGKLNYLTVTRPDIAY+VSVV QF S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGN
Subjt:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN

Query:  LVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQ
        LV+WKSKKQ+VVSRSSA+                           T+PAKLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKTGEQ
Subjt:  LVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQ

Query:  LGDILTKSLNGTRISYLCNKLGMIDIFAPA
        LGDI TK+LNGTR+ Y CNKLGMI+I+APA
Subjt:  LGDILTKSLNGTRISYLCNKLGMIDIFAPA

A0A438F8M2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-22671.7Show/hide
Query:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        D LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDT
Subjt:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY
        FSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+E+VY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY
Subjt:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY

Query:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE
        ++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+
Subjt:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE

Query:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN
           +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGN
Subjt:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN

Query:  LVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQ
        LV+WKSKKQ+VVSRSSA+                           T+PAKLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKTGEQ
Subjt:  LVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQ

Query:  LGDILTKSLNGTRISYLCNKLGMIDIFAPA
        LGDI TK+LNGTR+ Y CNKLGMI+I+APA
Subjt:  LGDILTKSLNGTRISYLCNKLGMIDIFAPA

A0A438G922 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-22572.78Show/hide
Query:  SLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRL
        S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDTFSPVAKL S+RL
Subjt:  SLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRL

Query:  FLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVV
        F+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVV
Subjt:  FLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVV

Query:  YVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLV
        YVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+   +PERYRR+V
Subjt:  YVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGELCKDPERYRRLV

Query:  GKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVV
        GKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGNLV+WKSKKQ+VV
Subjt:  GKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVV

Query:  SRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGT
        SRSSA+                           T+PAKLWCDNQAALHIA+NPV+HERTKHIEVDCHFIREKI++ LVSTGYVKTGEQLGDI TK+LNGT
Subjt:  SRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGT

Query:  RISYLCNKLGMIDIFAPA
        R+ Y CNKLGMI+I+APA
Subjt:  RISYLCNKLGMIDIFAPA

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-945.4e-22671.51Show/hide
Query:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        D LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKWVFAVK+N DG+VARLKARLVA+GYAQ YG DYSDT
Subjt:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY
        FSPVAKL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY
Subjt:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY

Query:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE
        ++S  GI+LLVVYVDDIVITGND  GIS LKTF+  +F+TKDLG+LKYFLGIEV RSKKG++LSQRKYVLDLL ETGK+ AKP  TPM+PN QL+  +G+
Subjt:  RRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV-KEGE

Query:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN
           +PERYRR+VGKLNYLTVTRPDIAY+VSVVSQF S+PT+ HWAA+EQILCYLK APG GILY   GHTR+ECFSDADWAGS+ DRRST+GYCVF GGN
Subjt:  LCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGN

Query:  LVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQ
        LV+WKSKKQ+VVSRSSA+                           T+PAKLWCDNQAALHIA+NP++HERTKHIEVDCHFIREKI++ LVSTGYVKTGEQ
Subjt:  LVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQ

Query:  LGDILTKSLNGTRISYLCNKLGMIDIFAPA
        LGDI TK+LNGTR+ Y CNKLGMI+I+APA
Subjt:  LGDILTKSLNGTRISYLCNKLGMIDIFAPA

A0A438KNH0 Retrovirus-related Pol polyprotein from transposon RE17.3e-24753.29Show/hide
Query:  PSSCDPAPNDDLPIALRKGKR--KCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        P+   P+PN DLPIA+RKG R  +  +P+ +F+SYH+LS    AF++++ S S+P S HEALSHPGW+ AM++EM AL  NGTWDLV  P GK  +GC+W
Subjt:  PSSCDPAPNDDLPIALRKGKR--KCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESD---KLSPS
        V+AVK+ PDG V RLKARLVAK Y Q+YG+DY DTFS VAK+ S+RL LSM A   W L+QLDIKNAFLHGD+ EEVYMEQPPGFVAQGES    +L  S
Subjt:  VFAVKMNPDGTVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESD---KLSPS

Query:  TYAFITS---------------------------------------------------------------------------------------------
         Y    S                                                                                             
Subjt:  TYAFITS---------------------------------------------------------------------------------------------

Query:  --------LESTSI-----------PN------------------------------SVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCK
                LE T +           PN                              S HEALSHPGW+ AM++EM AL  NGT DLV  P GK  +GC+
Subjt:  --------LESTSI-----------PN------------------------------SVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCK

Query:  WVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRK
        WV+AVK+ PDG V RLKARLVAKGY Q+YG+DY DTFSPVAK+ S+RL LSMAA   W L+QLDIKNAFLHGDL EEVYMEQPPGFVAQGES  VCRLR+
Subjt:  WVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRK

Query:  SLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG-IVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYL
        SLYGLKQSPRAWF +FS  +  FGM +ST+DHSVFY  +  G  + LVVYVDDIVITG+D  GI  LK  L   F TKDLG+LKYFLGIE+ +S  G+ L
Subjt:  SLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG-IVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYL

Query:  SQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVK-EGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGIL
        SQRKY LD+L ETG L  KP  TPM PN +LV  +GE   DP RYRRLVGKLNYLT+TRPDI++ VSVVSQF+ SP   HW AV +IL Y+K+ PG+G+L
Subjt:  SQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVK-EGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGIL

Query:  YKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSADITVPA--------------------------KLWCDNQAALHIAS
        Y++ GHT+V  ++DADWAGS  DRRSTSGYCVF+GGNL+SWKSKKQ+VV+RSSA+    A                          KL CDNQAALHIAS
Subjt:  YKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSADITVPA--------------------------KLWCDNQAALHIAS

Query:  NPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA
        NPVFHERTKHIEVDCHFIREKI  G V+T +V + +QL DI TKSL G RI Y+CNKLG  D++AP+
Subjt:  NPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-8735.08Show/hide
Query:  IPNSVHEAL---SHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLS
        +PNS  E         W+ A+  E+ A   N TW +  RP  K  +  +WVF+VK N  G   R KARLVA+G+ Q Y  DY +TF+PVA+++S R  LS
Subjt:  IPNSVHEAL---SHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLS

Query:  MAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVY
        +       +HQ+D+K AFL+G L+EE+YM  P G      SD VC+L K++YGLKQ+ R WF  F QAL       S+ D  ++   + +    + +++Y
Subjt:  MAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVY

Query:  VDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMP--NQQLVKEGELCKDPERYRRLV
        VDD+VI   D   +++ K +L  +F   DL ++K+F+GI +   +  IYLSQ  YV  +LS+          TP+    N +L+   E C  P   R L+
Subjt:  VDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMP--NQQLVKEGELCKDPERYRRLV

Query:  GKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDH--GHTRVECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKK
        G L Y+ + TRPD+  +V+++S++ S    + W  ++++L YLK      +++K +     ++  + D+DWAGS  DR+ST+GY       NL+ W +K+
Subjt:  GKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDH--GHTRVECFSDADWAGSREDRRSTSGYCV-FVGGNLVSWKSKK

Query:  QNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKS
        QN V+ SS +                          +  P K++ DNQ  + IA+NP  H+R KHI++  HF RE++Q+ ++   Y+ T  QL DI TK 
Subjt:  QNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILTKS

Query:  LNGTRISYLCNKLGMI
        L   R   L +KLG++
Subjt:  LNGTRISYLCNKLGMI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-9639.34Show/hide
Query:  PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSM
        P S+ E LSHP       AM EEM +L  NGT+ LV  P GK+ + CKWVF +K + D  + R KARLV KG+ Q  G D+ + FSPV K+TSIR  LS+
Subjt:  PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSM

Query:  AATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVD
        AA+    + QLD+K AFLHGDL+EE+YMEQP GF   G+   VC+L KSLYGLKQ+PR W+ KF   +      K+ SD  V+++R SE   ++L++YVD
Subjt:  AATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVD

Query:  DIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVK---------EGELCKDP
        D++I G D   I+ LK  L   F  KDLG  +  LG++++R +  + ++LSQ KY+  +L       AKP  TP+  + +L K         +G + K P
Subjt:  DIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVK---------EGELCKDP

Query:  ERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSW
          Y   VG L Y  V TRPDIA++V VVS+F+ +P  +HW AV+ IL YL+   G  + +       ++ ++DAD AG  ++R+S++GY     G  +SW
Subjt:  ERYRRLVGKLNYLTV-TRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSW

Query:  KSKKQNVVSRSSADITVPAK-------------------------LWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDIL
        +SK Q  V+ S+ +    A                          ++CD+Q+A+ ++ N ++H RTKHI+V  H+IRE + D  +    + T E   D+L
Subjt:  KSKKQNVVSRSSADITVPAK-------------------------LWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDIL

Query:  TKSLNGTRISYLCNKL
        TK +   +   LC +L
Subjt:  TKSLNGTRISYLCNKL

P92519 Uncharacterized mitochondrial protein AtMg008103.0e-4039.9Show/hide
Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        +VG L YLT+TRPDI+Y+V++V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q 
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAD
         VSRSS +
Subjt:  VVSRSSAD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.9e-12444.68Show/hide
Query:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAK
        Y+   SL + S P +  +AL    W+NAM  E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKGY Q  G DY++TFSPV K
Subjt:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAK

Query:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG
         TSIR+ L +A    W + QLD+ NAFL G L ++VYM QPPGF+ +   + VC+LRK+LYGLKQ+PRAW+ +    L+  G   S SD S+F  +  K 
Subjt:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG

Query:  IVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPE
        IV ++VYVDDI+ITGND   + +    L  +F  KD  +L YFLGIE  R   G++LSQR+Y+LDLL+ T  + AKP  TPM P+ +L +  G    DP 
Subjt:  IVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPE

Query:  RYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
         YR +VG L YL  TRPDI+Y+V+ +SQFM  PT +H  A+++IL YL   P  GI  K      +  +SDADWAG ++D  ST+GY V++G + +SW S
Subjt:  RYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS

Query:  KKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT
        KKQ  V RSS +                          +T P  ++CDN  A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LT
Subjt:  KKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT

Query:  KSLNGTRISYLCNKLGM
        K L+ T      +K+G+
Subjt:  KSLNGTRISYLCNKLGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-12343.85Show/hide
Query:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAK
        Y++ TSL + S P +  +A+    W+ AM  E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKGY Q  G DY++TFSPV K
Subjt:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAK

Query:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG
         TSIR+ L +A    W + QLD+ NAFL G L +EVYM QPPGFV +   D VCRLRK++YGLKQ+PRAW+ +    L+  G   S SD S+F  +  + 
Subjt:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG

Query:  IVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPE
        I+ ++VYVDDI+ITGND + +      L  +F  K+   L YFLGIE  R  +G++LSQR+Y LDLL+ T  L AKP  TPM  + +L +  G    DP 
Subjt:  IVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-VKEGELCKDPE

Query:  RYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS
         YR +VG L YL  TRPD++Y+V+ +SQ+M  PT DHW A++++L YL   P  GI  K      +  +SDADWAG  +D  ST+GY V++G + +SW S
Subjt:  RYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKS

Query:  KKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT
        KKQ  V RSS +                          ++ P  ++CDN  A ++ +NPVFH R KHI +D HFIR ++Q G +   +V T +QL D LT
Subjt:  KKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVKTGEQLGDILT

Query:  KSLNGTRISYLCNKLGMIDI
        K L+         K+G+I +
Subjt:  KSLNGTRISYLCNKLGMIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-12844.86Show/hide
Query:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        +K+SP  ++F+  +     P++ +EA     W  AM +E+ A++   TW++ + P  KK IGCKWV+ +K N DGT+ R KARLVAKGY Q  G D+ +T
Subjt:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDH
        FSPV KLTS++L L+++A   ++LHQLDI NAFL+GDL EE+YM+ PPG+ A QG+S   + VC L+KS+YGLKQ+ R WF KFS  L+ FG  +S SDH
Subjt:  FSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDH

Query:  SVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-V
        + F + +    + ++VYVDDI+I  N+   +  LK+ L+  F  +DLG LKYFLG+E+ RS  GI + QRKY LDLL ETG LG KPS  PM P+     
Subjt:  SVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL-V

Query:  KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVF
          G    D + YRRL+G+L YL +TR DI+++V+ +SQF  +P + H  AV +IL Y+K   G+G+ Y      +++ FSDA +   ++ RRST+GYC+F
Subjt:  KEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVF

Query:  VGGNLVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREK-IQDGLVSTGYV
        +G +L+SWKSKKQ VVS+SSA+                          ++ P  L+CDN AA+HIA+N VFHERTKHIE DCH +RE+ +    +S  + 
Subjt:  VGGNLVSWKSKKQNVVSRSSAD--------------------------ITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREK-IQDGLVSTGYV

Query:  KTGEQLG--DILTKSLNGTRISYLCNKLGMIDIFA
           EQ G  + L+  L GT I Y+ +  G+  + A
Subjt:  KTGEQLG--DILTKSLNGTRISYLCNKLGMIDIFA

ATMG00240.1 Gag-Pol-related retrotransposon family protein9.4e-1338.27Show/hide
Query:  YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFV
        YLT+TRPD+ ++V+ +SQF S+       AV ++L Y+K   G+G+ Y      +++ F+D+DWA   + RRS +G+C  V
Subjt:  YLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.2e-4139.9Show/hide
Query:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR
        L++YVDDI++TG+    ++ L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+              DP  +R 
Subjt:  LVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRR

Query:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN
        +VG L YLT+TRPDI+Y+V++V Q M  PT+  +  ++++L Y+K     G+    +    V+ F D+DWAG    RRST+G+C F+G N++SW +K+Q 
Subjt:  LVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGRGILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAD
         VSRSS +
Subjt:  VVSRSSAD

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.0e-2246.15Show/hide
Query:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        +KL+P  Y+   +      P SV  AL  PGW  AM EE+ AL  N TW LV  P  +  +GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  + +T
Subjt:  DKLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVAKLTSIRLFLSMA
        +SPV +  +IR  L++A
Subjt:  FSPVAKLTSIRLFLSMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCCTTCATCATGTGATCCAGCGCCAAATGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCA
GTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGG
AGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGA
ACAGTGGCTCGTTTAAAGGCCCGCCTTGTTGCCAAAGATTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCTGGTTGCCAAGTTAACTTCCATTCGCCT
ATTTCTTTCCATGGATGCTACCAATAAATGGTCGTTGCATCAACTTGACATAAAGAATGCTTTTCTTCACGGTGATATTCAAGAGGAAGTTTATATGGAACAACCACCAG
GGTTTGTTGCTCAGGGAGAGAGTGATAAATTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCAT
CCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGT
GTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTC
CGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAA
GAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTG
GTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTAT
ATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCAAAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTT
TTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCAC
TCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAG
ACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGT
GGGATCCTATACAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGT
AGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTCGAGTGCTGATATTACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTC
ATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAG
ACCGGAGAACAATTGGGAGATATTCTAACTAAATCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGTTGGGCATGATCGACATATTTGCTCCAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCCTTCATCATGTGATCCAGCGCCAAATGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCGTTTCTTCCTTTATTTCCTATCACCA
GTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGG
AGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGA
ACAGTGGCTCGTTTAAAGGCCCGCCTTGTTGCCAAAGATTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCTGGTTGCCAAGTTAACTTCCATTCGCCT
ATTTCTTTCCATGGATGCTACCAATAAATGGTCGTTGCATCAACTTGACATAAAGAATGCTTTTCTTCACGGTGATATTCAAGAGGAAGTTTATATGGAACAACCACCAG
GGTTTGTTGCTCAGGGAGAGAGTGATAAATTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCAT
CCTGGCTGGCAAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGT
GTTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTC
CGGTTGCCAAGTTAACTTCCATTCGCCTATTTCTTTCCATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAA
GAGGAAGTTTATATGGAACAACCACCAGGGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTG
GTTTGGTAAGTTTAGTCAAGCCCTTGTATGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTAT
ATGTTGATGATATTGTTATTACTGGAAATGATGCATTGGGTATTTCGTCTCTCAAAACTTTCCTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTT
TTGGGCATTGAAGTGATGAGAAGCAAGAAAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCAC
TCCAATGATGCCAAATCAGCAACTTGTTAAAGAAGGAGAATTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAACAGTGACTCGACCAG
ACATTGCATATTCTGTAAGTGTTGTAAGTCAATTCATGTCTTCCCCTACAGTGGATCATTGGGCTGCAGTAGAGCAGATTTTGTGTTATCTAAAAGCTGCTCCTGGACGT
GGGATCCTATACAAAGATCATGGACATACGAGAGTTGAATGTTTTTCTGATGCTGATTGGGCGGGGTCTCGTGAGGATAGGAGATCGACTTCTGGATATTGTGTTTTTGT
AGGTGGAAACTTAGTTTCATGGAAGAGTAAGAAACAAAATGTTGTTTCTCGTTCGAGTGCTGATATTACCGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTC
ATATTGCATCTAATCCAGTATTTCATGAACGAACTAAACATATTGAGGTGGATTGTCACTTCATTCGTGAGAAAATCCAAGATGGGTTGGTGTCCACAGGATATGTGAAG
ACCGGAGAACAATTGGGAGATATTCTAACTAAATCTTTAAATGGAACAAGGATAAGCTATCTGTGCAACAAGTTGGGCATGATCGACATATTTGCTCCAGCTTGA
Protein sequenceShow/hide protein sequence
MLPSSCDPAPNDDLPIALRKGKRKCTYPVSSFISYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDG
TVARLKARLVAKDYAQIYGTDYSDTFSLVAKLTSIRLFLSMDATNKWSLHQLDIKNAFLHGDIQEEVYMEQPPGFVAQGESDKLSPSTYAFITSLESTSIPNSVHEALSH
PGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVAKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQ
EEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGISSLKTFLQGQFYTKDLGQLKYF
LGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGELCKDPERYRRLVGKLNYLTVTRPDIAYSVSVVSQFMSSPTVDHWAAVEQILCYLKAAPGR
GILYKDHGHTRVECFSDADWAGSREDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSADITVPAKLWCDNQAALHIASNPVFHERTKHIEVDCHFIREKIQDGLVSTGYVK
TGEQLGDILTKSLNGTRISYLCNKLGMIDIFAPA