; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21760 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21760
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr3:18103958..18106224
RNA-Seq ExpressionCSPI03G21760
SyntenyCSPI03G21760
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043391.1 pol protein [Cucumis melo var. makuwa]2.1e-25961.35Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  +  A+   P APVTH +     A MEQRF +L+  + + Q+  +  PA    PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL ++QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G +SGQKRKAEQ+ V VPQRN RSG  F  FQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TN++EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+++TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

KAA0053290.1 gag protease polyprotein [Cucumis melo var. makuwa]4.6e-25961.22Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP     A+   P APVTH +     A MEQRF +++  + + Q+  +  PAP   PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ++P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
         ++SGQKRKAEQ+ V VPQRN R G  F SFQQ     G+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TNR+EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+++TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGS+RLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

KAA0054678.1 gag protease polyprotein [Cucumis melo var. makuwa]1.7e-25861.35Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  +  A+   P APVTH +     A MEQRF +++  + + Q+  +  PAP   PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG   W TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G++SGQKRKAEQ+ V VPQRN R G  F SFQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TNR+EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV+HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+ +TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKK+GSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

KAA0067481.1 pol protein [Cucumis melo var. makuwa]7.8e-25961.48Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  + +A+   P APVTH +     A MEQRF +L+  + + QQ PA PPAP   PAP    VP A           
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A+LWLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +PTT A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G++SGQKRKAEQ+ V VPQRN R+G  F  FQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TN++EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV HA LEVEPL   LSVSTPSGE MLSKEK+K C+IEIAG V+++TLLVLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIE EP T PISRAPYRM P  LKELK+QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

TYK01613.1 pol protein [Cucumis melo var. makuwa]4.9e-26161.61Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  +  A+   P APVTH +     A MEQRF +++  + + Q+  +  PAP   PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G++SGQKRKAEQ+ V VPQRN R G  F SFQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TNR+EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV+HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+++TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

TrEMBL top hitse value%identityAlignment
A0A5A7TP96 Reverse transcriptase1.0e-25961.35Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  +  A+   P APVTH +     A MEQRF +L+  + + Q+  +  PA    PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL ++QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G +SGQKRKAEQ+ V VPQRN RSG  F  FQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TN++EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+++TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

A0A5A7UDK9 Gag protease polyprotein2.2e-25961.22Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP     A+   P APVTH +     A MEQRF +++  + + Q+  +  PAP   PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ++P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
         ++SGQKRKAEQ+ V VPQRN R G  F SFQQ     G+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TNR+EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+++TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGS+RLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

A0A5A7UI54 Gag protease polyprotein8.4e-25961.35Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  +  A+   P APVTH +     A MEQRF +++  + + Q+  +  PAP   PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG   W TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G++SGQKRKAEQ+ V VPQRN R G  F SFQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TNR+EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV+HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+ +TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKK+GSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

A0A5A7VPI8 Reverse transcriptase3.8e-25961.48Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  + +A+   P APVTH +     A MEQRF +L+  + + QQ PA PPAP   PAP    VP A           
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A+LWLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +PTT A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G++SGQKRKAEQ+ V VPQRN R+G  F  FQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TN++EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV HA LEVEPL   LSVSTPSGE MLSKEK+K C+IEIAG V+++TLLVLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIE EP T PISRAPYRM P  LKELK+QLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

A0A5D3BPI1 Reverse transcriptase2.4e-26161.61Show/hide
Query:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG
        MPP+RG RRGGR GRGRGAGR QP  +  A+   P APVTH +     A MEQRF +++  + + Q+  +  PAP   PAP     PA  P P A     
Subjt:  MPPKRGVRRGGRRGRGRGAGRNQP-TEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQG

Query:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC
             PQ +P+QLSA+ KHLRDFRKY+P TFDGSLEDPT+A++WLSS+E IF YM+CP++ +VQCA F+L DRG  WW TT RMLGGDV QITW Q K+ 
Subjt:  LAAQQPQILPNQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDC

Query:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK
        FY KFFSA+LRDAK Q FL L+QG MTVE+Y+ EFDMLSRFAPE++  E ARA++FV+GLR +I+G VRA +P T A+ALRLAVD+S+ +    ++ + +
Subjt:  FYTKFFSANLRDAKSQVFLELKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDK

Query:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT
        G++SGQKRKAEQ+ V VPQRN R G  F SFQQ    AG+  R KPLC TCGK HLGRCL GTR  +KC+QEGH ADRCPLR TG  Q +QGA  P +G 
Subjt:  GASSGQKRKAEQRIVGVPQRNLRSGDPFHSFQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGT

Query:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD
        +F TNR+EAEK GTVVTGTLPVLGH+AL LF SGSSHSFISS FV+HA LEVEPL   LSVSTPSGE MLSKEK+KAC+IEIAG V+++TL+VLDM DFD
Subjt:  IFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFISSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFD

Query:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR
        VILGMDWLA NHASIDCSRKEV F+PP+ +SFKFKG G+  LP+VISA++ASKLL+QGTW ILASVVDTRE + SL+SEPVVR+YPDVFPE+LPGLPPHR
Subjt:  VILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHR

Query:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------
        +++FAIELEP T PISRAPYRM PAELKELKVQLQ+LLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK                         
Subjt:  DIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK-------------------------

Query:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL
                                                                +FLDTFVIVFIDDIL+YSKTEAEHE+HL
Subjt:  --------------------------------------------------------DFLDTFVIVFIDDILVYSKTEAEHEKHL

SwissProt top hitse value%identityAlignment
P0CT41 Transposon Tf2-12 polyprotein1.7e-0628.7Show/hide
Query:  TSLTSEP----VVREYPDVFPE-DLPGLP-PHRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC
        +++  EP    + +E+ D+  E +   LP P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ 
Subjt:  TSLTSEP----VVREYPDVFPE-DLPGLP-PHRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC

Query:  IDYRELNK
        +DY+ LNK
Subjt:  IDYRELNK

P10394 Retrovirus-related Pol polyprotein from transposon 4128.2e-0932.56Show/hide
Query:  REYPDVFPEDLPGLPPHRDIDFAIELEPNTT--------------PISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDG------
        + +P++F   L  +       FA+E EP T               P+    YR   ++++E++ Q+QKL+    + PSVS + +P+L V KK        
Subjt:  REYPDVFPEDLPGLPPHRDIDFAIELEPNTT--------------PISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDG------

Query:  SMRLCIDYRELNKDFL-DTFVIVFIDDIL
          RL IDYR++NK  L D F +  IDDIL
Subjt:  SMRLCIDYRELNKDFL-DTFVIVFIDDIL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-1234.42Show/hide
Query:  KASKLLNQGTWSILASVVDTRE-------DETSLTSEPV--VREYPDVFPEDLPGLPP---HRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKL
        +AS L   G +S + S + + E       ++ +  + PV   ++Y ++   DLP  P    +  +   IE++P        PY +T    +E+   +QKL
Subjt:  KASKLLNQGTWSILASVVDTRE-------DETSLTSEPV--VREYPDVFPEDLPGLPP---HRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKL

Query:  LDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKDFL-DTFVIVFIDDIL
        LD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK  + D F +  ID++L
Subjt:  LDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKDFL-DTFVIVFIDDIL

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.2e-1234.42Show/hide
Query:  KASKLLNQGTWSILASVVDTRE-------DETSLTSEPV--VREYPDVFPEDLPGLPP---HRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKL
        +AS L   G +S + S + + E       ++ +  + PV   ++Y ++   DLP  P    +  +   IE++P        PY +T    +E+   +QKL
Subjt:  KASKLLNQGTWSILASVVDTRE-------DETSLTSEPV--VREYPDVFPEDLPGLPP---HRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKL

Query:  LDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKDFL-DTFVIVFIDDIL
        LD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK  + D F +  ID++L
Subjt:  LDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKDFL-DTFVIVFIDDIL

Q9UR07 Transposon Tf2-11 polyprotein1.7e-0628.7Show/hide
Query:  TSLTSEP----VVREYPDVFPE-DLPGLP-PHRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC
        +++  EP    + +E+ D+  E +   LP P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ 
Subjt:  TSLTSEP----VVREYPDVFPE-DLPGLP-PHRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKKDGSMRLC

Query:  IDYRELNK
        +DY+ LNK
Subjt:  IDYRELNK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTATGCCGCCAAAGAGAGGTGTACGTAGAGGAGGTCGTAGAGGCCGGGGTAGAGGAGCAGGTCGTAATCAGCCTACTGAGGGTCAAGCTGAACAGCGAATTCCTGTTGC
ACCCGTGACTCACGTCGAGTTTGATGCACTGTCTGCTCACATGGAGCAGAGGTTTACGGAACTTATGACGGCTATAGCTCAGAACCAGCAGGCACCTGCAATTCCACCTG
CACCAGTTGTTCCGCCTGCACCTGTGGTTCCCCTTGTACCAGCAGCTCCACCTACACCAGCAGCCCCTCCTGCACAAGGATTGGCTGCACAACAGCCACAGATACTACCG
AACCAGCTTTCTGCTAAGGTGAAACATTTGAGAGACTTTAGGAAATATGACCCTCAGACGTTTGATGGGTCACTGGAGGATCCTACTAAAGCGGAGTTGTGGTTGTCCTC
TGTGGAAGCCATATTTAATTACATGAGATGTCCAAAGGAGCATAGGGTTCAGTGTGCTGCTTTTCTTCTGAGGGACAGAGGTATTATTTGGTGGAGGACTACGATGCGTA
TGTTGGGTGGAGATGTGAGACAGATCACTTGGGATCAGTTGAAAGACTGCTTCTATACCAAGTTTTTCTCGGCTAACCTTAGAGACGCCAAAAGCCAGGTATTCTTGGAA
TTGAAGCAAGGACACATGACAGTTGAGGAGTATAACCAGGAGTTTGATATGCTGTCGCGCTTTGCCCCTGAACTTGTTGGTAATGAGCAGGCTAGAGCTGAGAGGTTTGT
CAAAGGATTGAGAGATGAGATTAGAGGTTTTGTGCGAGCACTAAAGCCTACTACCCAGGCTGAAGCGCTGCGTCTGGCAGTGGATATGAGTATTGGGAAGGATGAAATTC
AGGCAAGGGGTTCTGATAAGGGAGCGTCGTCTGGTCAGAAGAGGAAAGCAGAGCAGAGAATTGTGGGAGTTCCTCAGAGAAACTTGAGATCAGGCGATCCTTTTCACAGT
TTCCAGCAGAGTTCTGGTGGGGCAGGAGACACTACTAGAAAGAAGCCACTATGCAATACGTGTGGGAAATGCCACCTGGGTCGTTGTTTGATGGGAACGAGAGTCTTTTA
TAAGTGCAAGCAAGAGGGACACATGGCTGATCGATGTCCCTTGAGATCTACTGGGGCTGGACAGAGCAGTCAGGGAGCAAGACCTCCACAGCGGGGTACAATCTTTACCA
CTAATAGATCAGAAGCAGAGAAGGTCGGCACAGTGGTGACAGGTACATTACCAGTGTTAGGGCATTTTGCCTTAACCTTGTTTGGCTCAGGGTCTTCTCATTCATTTATT
TCATCGCTTTTTGTGACGCATGCATGCTTAGAGGTGGAACCCTTAGACTTTTTTTTGTCAGTGTCTACACCGTCTGGAGAAATTATGTTGTCTAAGGAAAAGATTAAAGC
ATGTAAAATTGAGATAGCGGGTCCTGTGCTGGACATAACCTTGTTAGTATTAGATATGCGTGACTTTGATGTAATTTTAGGCATGGATTGGCTAGCTACTAATCATGCTA
GTATTGATTGCTCTCGTAAGGAGGTTGTGTTCAGTCCCCCTACCGAATCTAGCTTTAAGTTCAAAGGGGTAGGAACCGTAGTATTGCCTAAAGTAATCTCAGCTATGAAA
GCTAGTAAACTGCTCAACCAGGGTACCTGGAGTATTTTGGCAAGTGTGGTGGATACTAGGGAAGATGAGACTTCTTTAACTTCAGAACCTGTGGTAAGAGAGTACCCAGA
TGTGTTTCCAGAAGATCTTCCAGGACTTCCGCCACATAGGGATATTGATTTTGCCATTGAGTTGGAGCCAAACACTACTCCTATTTCTAGAGCCCCTTATAGGATGACTC
CTGCTGAGTTGAAAGAACTGAAGGTACAGTTACAGAAGTTGCTTGACAAAGGTTTTATTCGACCTAGTGTGTCACCTTGGGGTGCACCAGTATTGTTTGTGAAGAAGAAG
GATGGGTCGATGCGTCTTTGCATTGACTATAGAGAGTTGAATAAAGATTTCTTAGACACTTTTGTGATAGTCTTCATTGATGATATTTTGGTTTATTCCAAGACTGAGGC
CGAACATGAGAAACACTTACATAAA
mRNA sequenceShow/hide mRNA sequence
GAATTATGCCGCCAAAGAGAGGTGTACGTAGAGGAGGTCGTAGAGGCCGGGGTAGAGGAGCAGGTCGTAATCAGCCTACTGAGGGTCAAGCTGAACAGCGAATTCCTGTT
GCACCCGTGACTCACGTCGAGTTTGATGCACTGTCTGCTCACATGGAGCAGAGGTTTACGGAACTTATGACGGCTATAGCTCAGAACCAGCAGGCACCTGCAATTCCACC
TGCACCAGTTGTTCCGCCTGCACCTGTGGTTCCCCTTGTACCAGCAGCTCCACCTACACCAGCAGCCCCTCCTGCACAAGGATTGGCTGCACAACAGCCACAGATACTAC
CGAACCAGCTTTCTGCTAAGGTGAAACATTTGAGAGACTTTAGGAAATATGACCCTCAGACGTTTGATGGGTCACTGGAGGATCCTACTAAAGCGGAGTTGTGGTTGTCC
TCTGTGGAAGCCATATTTAATTACATGAGATGTCCAAAGGAGCATAGGGTTCAGTGTGCTGCTTTTCTTCTGAGGGACAGAGGTATTATTTGGTGGAGGACTACGATGCG
TATGTTGGGTGGAGATGTGAGACAGATCACTTGGGATCAGTTGAAAGACTGCTTCTATACCAAGTTTTTCTCGGCTAACCTTAGAGACGCCAAAAGCCAGGTATTCTTGG
AATTGAAGCAAGGACACATGACAGTTGAGGAGTATAACCAGGAGTTTGATATGCTGTCGCGCTTTGCCCCTGAACTTGTTGGTAATGAGCAGGCTAGAGCTGAGAGGTTT
GTCAAAGGATTGAGAGATGAGATTAGAGGTTTTGTGCGAGCACTAAAGCCTACTACCCAGGCTGAAGCGCTGCGTCTGGCAGTGGATATGAGTATTGGGAAGGATGAAAT
TCAGGCAAGGGGTTCTGATAAGGGAGCGTCGTCTGGTCAGAAGAGGAAAGCAGAGCAGAGAATTGTGGGAGTTCCTCAGAGAAACTTGAGATCAGGCGATCCTTTTCACA
GTTTCCAGCAGAGTTCTGGTGGGGCAGGAGACACTACTAGAAAGAAGCCACTATGCAATACGTGTGGGAAATGCCACCTGGGTCGTTGTTTGATGGGAACGAGAGTCTTT
TATAAGTGCAAGCAAGAGGGACACATGGCTGATCGATGTCCCTTGAGATCTACTGGGGCTGGACAGAGCAGTCAGGGAGCAAGACCTCCACAGCGGGGTACAATCTTTAC
CACTAATAGATCAGAAGCAGAGAAGGTCGGCACAGTGGTGACAGGTACATTACCAGTGTTAGGGCATTTTGCCTTAACCTTGTTTGGCTCAGGGTCTTCTCATTCATTTA
TTTCATCGCTTTTTGTGACGCATGCATGCTTAGAGGTGGAACCCTTAGACTTTTTTTTGTCAGTGTCTACACCGTCTGGAGAAATTATGTTGTCTAAGGAAAAGATTAAA
GCATGTAAAATTGAGATAGCGGGTCCTGTGCTGGACATAACCTTGTTAGTATTAGATATGCGTGACTTTGATGTAATTTTAGGCATGGATTGGCTAGCTACTAATCATGC
TAGTATTGATTGCTCTCGTAAGGAGGTTGTGTTCAGTCCCCCTACCGAATCTAGCTTTAAGTTCAAAGGGGTAGGAACCGTAGTATTGCCTAAAGTAATCTCAGCTATGA
AAGCTAGTAAACTGCTCAACCAGGGTACCTGGAGTATTTTGGCAAGTGTGGTGGATACTAGGGAAGATGAGACTTCTTTAACTTCAGAACCTGTGGTAAGAGAGTACCCA
GATGTGTTTCCAGAAGATCTTCCAGGACTTCCGCCACATAGGGATATTGATTTTGCCATTGAGTTGGAGCCAAACACTACTCCTATTTCTAGAGCCCCTTATAGGATGAC
TCCTGCTGAGTTGAAAGAACTGAAGGTACAGTTACAGAAGTTGCTTGACAAAGGTTTTATTCGACCTAGTGTGTCACCTTGGGGTGCACCAGTATTGTTTGTGAAGAAGA
AGGATGGGTCGATGCGTCTTTGCATTGACTATAGAGAGTTGAATAAAGATTTCTTAGACACTTTTGTGATAGTCTTCATTGATGATATTTTGGTTTATTCCAAGACTGAG
GCCGAACATGAGAAACACTTACATAAA
Protein sequenceShow/hide protein sequence
IMPPKRGVRRGGRRGRGRGAGRNQPTEGQAEQRIPVAPVTHVEFDALSAHMEQRFTELMTAIAQNQQAPAIPPAPVVPPAPVVPLVPAAPPTPAAPPAQGLAAQQPQILP
NQLSAKVKHLRDFRKYDPQTFDGSLEDPTKAELWLSSVEAIFNYMRCPKEHRVQCAAFLLRDRGIIWWRTTMRMLGGDVRQITWDQLKDCFYTKFFSANLRDAKSQVFLE
LKQGHMTVEEYNQEFDMLSRFAPELVGNEQARAERFVKGLRDEIRGFVRALKPTTQAEALRLAVDMSIGKDEIQARGSDKGASSGQKRKAEQRIVGVPQRNLRSGDPFHS
FQQSSGGAGDTTRKKPLCNTCGKCHLGRCLMGTRVFYKCKQEGHMADRCPLRSTGAGQSSQGARPPQRGTIFTTNRSEAEKVGTVVTGTLPVLGHFALTLFGSGSSHSFI
SSLFVTHACLEVEPLDFFLSVSTPSGEIMLSKEKIKACKIEIAGPVLDITLLVLDMRDFDVILGMDWLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMK
ASKLLNQGTWSILASVVDTREDETSLTSEPVVREYPDVFPEDLPGLPPHRDIDFAIELEPNTTPISRAPYRMTPAELKELKVQLQKLLDKGFIRPSVSPWGAPVLFVKKK
DGSMRLCIDYRELNKDFLDTFVIVFIDDILVYSKTEAEHEKHLHK