CuGenDBv2

Lag0038926 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038926
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr2:31034230..31042513
RNA-Seq Expression: Lag0038926
Synteny: Lag0038926
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    %identity    Alignment
XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]    4.8e-265    40.7
Query:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSG
        VN V+    +C  CG  H    C      V FV N     Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++ PGF   Q  P Q +     + + 
Subjt:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSG

Query:  SSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQ
          L     E   R++  +    +S R +E+Q+GQLAN + +R QG LP  TE      KE  KAVTLRSGK L +          S +++V +K      
Subjt:  SSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQ

Query:  GVGGSNKDAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTE
         V     +   S  V D+     P  P  PYVPP+PFPQR K    D QF+KFL++ +Q  INIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTE
Subjt:  GVGGSNKDAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTE

Query:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK------------------------------------------------
        ECSAI++N LPPK +DPGSF+IP +IG  +  RALC LGAS++LMPLSV RK                                                
Subjt:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK------------------------------------------------

Query:  ---------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIK
                 LGRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP                                         D +E      
Subjt:  ---------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIK

Query:  ILEST-----------------------------------------------------IVETTIQDS---------------------------------
         LE+T                                                     IV  ++ D                                  
Subjt:  ILEST-----------------------------------------------------IVETTIQDS---------------------------------

Query:  ----ADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLP
             D H                                 + D    +PVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHFPLP
Subjt:  ----ADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLP

Query:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDN
        F+DQ+L+RLAG AYYCFLDGYSGYNQI+I+PED+EKTTFTCPYGTFAFR+  + F +           ++ FSD +E  +EVFMDDFSVFG SF SCLDN
Subjt:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDN

Query:  LGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------D
        L  VL+RCE+T+LVLNWEKCHFMV+EGIVLGH+IS  GLEVDRAKIE+IE+L PP +VKG++SFL                                  D
Subjt:  LGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------D

Query:  CRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------
        C  AF  LK  L+S PI+ AP+W+LPFE+MCDASD A                                                               
Subjt:  CRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------

Query:  -----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQ
                               EFDLEI+DK+G ENV+ADHLS L  QS      I++SFPDEQL AV V      PWYAD+ N+LV G+ P D+ + Q
Subjt:  -----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQ

Query:  KKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFT
        KKKF  D K ++W+E  +YK C+DG+IRRCV  D+ ++IL+ CHS   GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EMP T
Subjt:  KKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFT

Query:  YILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPY
         ILEVELFD+WGIDFM PF  S  N ++L+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+H  N     LL KY + H++A  Y
Subjt:  YILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPY

Query:  HPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL
        HPQ N Q E++NRE+K ILEK V  SRK+W+ +LD+ALWAYRT +KTPL
Subjt:  HPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]    3.7e-265    40.7
Query:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSG
        VN V+    +C  CG  H    C      V FV N     Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++ PGF   Q  P Q +     + + 
Subjt:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSG

Query:  SSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQ
          L     E   R++  +    +S R +E+Q+GQLAN + +R QG LP  TE      KE  KAVTLRSGK L +          S +++V +K      
Subjt:  SSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQ

Query:  GVGGSNKDAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTE
         V     +   S  V D+     P  P  PYVPP+PFPQR K    D QF+KFL++ +Q  INIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTE
Subjt:  GVGGSNKDAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTE

Query:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK------------------------------------------------
        ECSAI++N LPPK +DPGSF+IP +IG  +  RALC LGAS++LMPLSV RK                                                
Subjt:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK------------------------------------------------

Query:  ---------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIK
                 LGRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP                                         D +E      
Subjt:  ---------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIK

Query:  ILEST-----------------------------------------------------IVETTIQDS---------------------------------
         LE+T                                                     IV  ++ D                                  
Subjt:  ILEST-----------------------------------------------------IVETTIQDS---------------------------------

Query:  ----ADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLP
             D H                                 + D    +PVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHFPLP
Subjt:  ----ADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLP

Query:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDN
        F+DQ+L+RLAG AYYCFLDGYSGYNQI+I+PED+EKTTFTCPYGTFAFR+  + F +           ++ FSD +E  +EVFMDDFSVFG SF SCLDN
Subjt:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDN

Query:  LGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------D
        L  VL+RCE+T+LVLNWEKCHFMV+EGIVLGH+IS  GLEVDRAKIE+IE+L PP +VKG++SFL                                  D
Subjt:  LGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------D

Query:  CRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------
        C  AF  LK  L+S PI+ AP+W+LPFE+MCDASD A                                                               
Subjt:  CRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------

Query:  -----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQ
                               EFDLEI+DK+G ENV+ADHLS L  QS      I++SFPDEQL AV V      PWYAD+ N+LV G+ P D+ + Q
Subjt:  -----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQ

Query:  KKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFT
        KKKF  D K ++W+E  +YK C+DG+IRRCV  D+ ++IL+ CHS   GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EMP T
Subjt:  KKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFT

Query:  YILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPY
         ILEVELFD+WGIDFM PF  S  N ++L+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+H  N     LL KY + H++A  Y
Subjt:  YILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPY

Query:  HPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL
        HPQ N Q E++NRE+K ILEK V  SRK+W+ +LD+ALWAYRT +KTPL
Subjt:  HPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]    1.8e-264    40.62
Query:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSG
        VN V+    +C  CG  H    C      V FV N     Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++ PGF   Q  P Q +     + + 
Subjt:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSG

Query:  SSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQ
          L     E   R++  +    +S R +E+Q+GQLAN + +R QG LP  TE      KE  KAVTLRSGK L +          S +++V +K      
Subjt:  SSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQ

Query:  GVGGSNKDAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTE
         V     +   S  V D+     P  P  PYVPP+PFPQR K    D QF+KFL++ +Q  INIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTE
Subjt:  GVGGSNKDAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTE

Query:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK------------------------------------------------
        ECSAI++N LPPK +DPGSF+IP +IG  +  RALC LGAS++LMPLSV RK                                                
Subjt:  ECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK------------------------------------------------

Query:  ---------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIK
                 LGRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP                                         D +E      
Subjt:  ---------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIK

Query:  ILEST----------------------------------------------------------------------------IVETTIQD-----------
         LE+T                                                                             +  TI D           
Subjt:  ILEST----------------------------------------------------------------------------IVETTIQD-----------

Query:  ---SADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLP
             D H                                 + D    +PVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHFPLP
Subjt:  ---SADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLP

Query:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDN
        F+DQ+L+RLAG AYYCFLDGYSGYNQI+I+PED+EKTTFTCPYGTFAFR+  + F +           ++ FSD +E  +EVFMDDFSVFG SF SCLDN
Subjt:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDN

Query:  LGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------D
        L  VL+RCE+T+LVLNWEKCHFMV+EGI+LGH+IS  GLEVDRAKIE+IE+L PP +VKG++SFL                                  D
Subjt:  LGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------D

Query:  CRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------
        C  AF  LK  L+S PI+ AP+W+LPFE+MCDASD A                                                               
Subjt:  CRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------

Query:  -----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQ
                               EFDLEI+DK+G ENV+ADHLS L  QS      I++SFPDEQL AV V      PWYAD+ N+LV G+ P D+ + Q
Subjt:  -----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQ

Query:  KKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFT
        KKKF  D K ++W+E  +YK C+DG+IRRCV  D+ ++IL+ CHS   GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EMP T
Subjt:  KKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFT

Query:  YILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPY
         ILEVELFD+WGIDFM PF  S  N ++L+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+H  N     LL KY + H++A  Y
Subjt:  YILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPY

Query:  HPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL
        HPQ N Q E++NRE+K ILEK V  SRK+W+ +LD+ALWAYRT +KTPL
Subjt:  HPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]    9.1e-264    40.28
Query:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQ--QNKQVLPQQN
        VN V+    +C  CG  H      ++   V FV N     Q+NNPYSN YNPGWRNHPNFSW  QG+   + + ++ PGF      P+  Q+ ++  ++ 
Subjt:  VNQVTEE--ACVYCGEDHNYEFCPNNPTSVFFVGN-----QRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQ--QNKQVLPQQN

Query:  SGSSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELES
        + +S      E   R++  +    +S R +E+Q+GQLAN + +R QG LP  TE      KE  KAVTLRSGK L +      +    D   V +K  E 
Subjt:  SGSSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELES

Query:  GQGVGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEE
         +              +     P  P  PYVPP+PFPQR K    D QF+KFL++ +Q  INIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEE
Subjt:  GQGVGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEE

Query:  CSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK-------------------------------------------------
        CSAI++N LPPK +DPGSF+IP +IG  +  RALC LGAS++LMPLSV RK                                                 
Subjt:  CSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK-------------------------------------------------

Query:  --------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIKI
                LGRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP                                         D +E       
Subjt:  --------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIKI

Query:  LEST-----------------------------------------------------IVETTIQDS----------------------------------
        LE+T                                                     IV  ++ D                                   
Subjt:  LEST-----------------------------------------------------IVETTIQDS----------------------------------

Query:  ---ADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPF
            D H                                 + D    +PVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF
Subjt:  ---ADKH---------------------------------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPF

Query:  IDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNL
        +DQ+L+RLAG AYYCFLDGYSGYNQI+I+PED+EKTTFTCPYGTFAFR+  + F +           ++ FSD +E  +E+FMDDFSVFG SF SCLDNL
Subjt:  IDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNL

Query:  GHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DC
          VL+RCE+T+LVLNWEKCHFMV+EGIVLGH+IS  GLEVDRAKIE+IE+L PP +VKG++SFL                                  DC
Subjt:  GHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DC

Query:  RKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA----------------------------------------------------------------
          AF  LK  L+S PI+ AP+W+LPFE+MCDASD A                                                                
Subjt:  RKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA----------------------------------------------------------------

Query:  ----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQK
                              EFDLEI+DK+G ENV+ADHLS L  QS      I++SFPDEQL AV V      PWYAD+ N+LV G+ P D+ + QK
Subjt:  ----------------------EFDLEIKDKKGSENVIADHLSSLLKQS-----AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQK

Query:  KKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTY
        KKF  D K ++W+E  +YK C+DG+IRRCV  D+ ++IL+ CHS   GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EMP T 
Subjt:  KKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTY

Query:  ILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYH
        ILEVELFD+WGIDFM PF  S  N ++L+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+H  N     LL KY + H++A  YH
Subjt:  ILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYH

Query:  PQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL
        PQ N Q E++NRE+K ILEK V  SRK+W+ +LD+ALWAYRT +KTPL
Subjt:  PQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL

XP_042757945.1 uncharacterized protein LOC111885853 [Lactuca sativa]    1.8e-272    42.21
Query:  CVYCGEDHNYEFCPNNPTSVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQGSNMK------AQQKVNQPGFAKAQVLPQQNKQVLPQ-----QNSG
        C  C   H+Y  CP NP SVFF+G+Q    +NNPYS  YNPGWRNHPNFSWGGQ   M        Q+  N PGF +       N+QV PQ     Q SG
Subjt:  CVYCGEDHNYEFCPNNPTSVFFVGNQ----RNNPYSNFYNPGWRNHPNFSWGGQGSNMK------AQQKVNQPGFAKAQVLPQQNKQVLPQ-----QNSG

Query:  SS--------------LEAMMKEFMARID---VAIQSNQA-SMRVLELQVGQLANELMARPQGKLPLDTEHPRR-EGKEQVKAVTLRSGKPL--------
        SS               E  M EFM + D    A + NQA +MR LE Q+GQLA  L +R  G LP +T++P     K Q  A+TLRSGK L        
Subjt:  SS--------------LEAMMKEFMARID---VAIQSNQA-SMRVLELQVGQLANELMARPQGKLPLDTEHPRR-EGKEQVKAVTLRSGKPL--------

Query:  -----------EEPRKTQDIERNSDKSVVAEKELESGQGVGGSNKDAGASGSVPDV------EPPYVPPPPYVPP--LPFPQRQ-KPKNHDGQFKKFLEI
                   E  +K  +++ +  + V  E   +   G    NK+ G + S P V      +  +      V P  LPFP RQ K K  DGQFKKFLEI
Subjt:  -----------EEPRKTQDIERNSDKSVVAEKELESGQGVGGSNKDAGASGSVPDV------EPPYVPPPPYVPP--LPFPQRQ-KPKNHDGQFKKFLEI

Query:  LKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK----
        L Q  INIP VEA++Q+P YAKF+KD+LTKK+  GEFETV++T+ C++I++N LP K  DPGSF +P  I GK     LC LGASINLMPLS++R+    
Subjt:  LKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK----

Query:  -----------------------------------------------------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCS
                                                             LGRPFLAT  ALIDV+KGE+T+RV +E+  FN+FKA+K P  ME+CS
Subjt:  -----------------------------------------------------LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCS

Query:  FIKILESTIVETTIQDSADKH-------------------------------------------------------------------------------
        F++++++ +     Q S   H                                                                               
Subjt:  FIKILESTIVETTIQDSADKH-------------------------------------------------------------------------------

Query:  ------------LEDHGETTPVQCVPK------------------------KGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQM
                    L D    +P  C+ K                        KGG TVV N   E+I  RTVTGWR+C+DYR+LN ATRKDHFPLPFIDQM
Subjt:  ------------LEDHGETTPVQCVPK------------------------KGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQM

Query:  LDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVL
        LDRLAG+++YCFLDGYSGYNQI+IAPED+ KTTFTCP+GTFAFR+  + F +           +S FSDM+E+ VEVFMDDFSV G +F+SCL NL  VL
Subjt:  LDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVL

Query:  KRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DCRKAF
        ++C   +LVLNWEKCHFMVKEGIVLGH++S+ G+EVDRAKIE+IERLE P +VKGI+SFL                                   C++AF
Subjt:  KRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DCRKAF

Query:  ETLKSALISTPILCAPNWNLPFEVMCDASDAA--------------------------------------------------------------------
        + LK  L S P++ AP+W+ PF++M DASD A                                                                    
Subjt:  ETLKSALISTPILCAPNWNLPFEVMCDASDAA--------------------------------------------------------------------

Query:  ------------------EFDLEIKDKKGSENVIADHLSSLLKQS------AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKF
                          EFDLE++DKKG ENV+ADHLS L K+S       I DSFPDE++    ++V  + PWYA+I N+LV GV P    W QKKK 
Subjt:  ------------------EFDLEIKDKKGSENVIADHLSSLLKQS------AISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKF

Query:  KHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILE
          DAKF++WDE ++++   D + RRC+   + K+ILE+CH+S YGGHF G++T +R+LH GF+WP+LFKDA+ F K+CD CQR GN+G R EMP + I+E
Subjt:  KHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILE

Query:  VELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQA
        VELFDVWGIDFM PF+PS+G +++L+AVDYV KWVEA+AC ++DA+TV  FL+  IF+RFGTPRA++SDEGTH  N +L  +LAKY+IKHR+AT YHPQ 
Subjt:  VELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQA

Query:  NSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPLDTN
        N  AE +N+++K+ILEKVV+ SRK+W+ +LD+ LWAYRT Y+T L T+
Subjt:  NSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPLDTN

TrEMBL top hits    e value    %identity    Alignment
A0A2G9FWY3 Reverse transcriptase    1.9e-259    42.05
Query:  VNQVTE--EACVYCGEDHNYEFCPNNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNS
        VNQV      C  CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNFSW    GQGS  + QQ              QQ +Q  P Q  
Subjt:  VNQVTE--EACVYCGEDHNYEFCPNNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNS

Query:  GSSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTE-HPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELES
          SLE  + +FMA       S  A+ + ++ Q+GQLAN + +RPQG LP +TE +PR++GK Q +AVTLR+G+ L+E  K  +  ++ +K V++E   E 
Subjt:  GSSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTE-HPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELES

Query:  GQGVGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEE
        G+                +VE P                                L++  INIP  EA+EQ+P+Y KF+KDIL+KK+ LG++ETV+LTEE
Subjt:  GQGVGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEE

Query:  CSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLG----------ASINL------------------------------------MPLSVYRKLGR
        CSAI++N LPPK KDPGSFTIP +IG    GRALC LG           SI L                                    + + V   LGR
Subjt:  CSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLG----------ASINL------------------------------------MPLSVYRKLGR

Query:  PFLATGRALIDVQKGELTMRVCNE---------------------------------------EVKFNVFK---AMKY-----PDEMEDCSFIKILESTI
        PFLATGR LIDVQK    M+  NE                                       E  + V K   A KY      + +E  +  K+L+ +I
Subjt:  PFLATGRALIDVQKGELTMRVCNE---------------------------------------EVKFNVFK---AMKY-----PDEMEDCSFIKILESTI

Query:  VET----------------------------------------------------TIQD---------------------SADKH---------------
         E                                                     TI D                     S +                 
Subjt:  VET----------------------------------------------------TIQD---------------------SADKH---------------

Query:  -----------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIA
                   + D    +PVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPL FIDQMLDRLAG+ +YCFLDGYSGYNQI IA
Subjt:  -----------LEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIA

Query:  PEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVL
        PED+EK TFTCPYGTFAFR+  + F +           ++ F+DM+E+ +EVFMDDFSV+G SF  CL+NL  VLKRCEDT+L+LNWEKCHFMV+EGIVL
Subjt:  PEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVL

Query:  GHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLD----------------------------------CRKAFETLKSALISTPILCAPNWNLPFEVM
        GH++S  G+EVD+AK+E IE+L PP SVKG++SFL                                   CR AF  LK  LIS PI+  P+W+ PFE+M
Subjt:  GHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLD----------------------------------CRKAFETLKSALISTPILCAPNWNLPFEVM

Query:  CDASDAA--------------------------------------------------------------------------------------EFDLEIK
        CDASD A                                                                                      EFDLEI+
Subjt:  CDASDAA--------------------------------------------------------------------------------------EFDLEIK

Query:  DKKGSENVIADHLSSLL------KQSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEQFMYKQCSDGLIRR
        D+KG+EN IADHLS L       + + I+D+FPDEQL A+   V  D PWYADI N+L  G+ P D+  +QKKKF  D + ++WD+ F++KQ  D ++RR
Subjt:  DKKGSENVIADHLSSLL------KQSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEQFMYKQCSDGLIRR

Query:  CVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFMRPFLPSNGNVFML
        CV   +  +ILEQCH+SPYGGHF G RT  +IL  GFFWP LFKDAH F   CD CQR GN+  R EMP   ILEVELFDVWGIDFM PF+PS GN+++L
Subjt:  CVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFMRPFLPSNGNVFML

Query:  LAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISNREIKSILEKVVHPSRKN
        +AVDYV KWVEA A   +D+K V  F++ +IF RFGTPRA++SD GTH  N     LL+KY +KH+I+TPYHPQ + Q E+SNREIK ILEK V  +RK+
Subjt:  LAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISNREIKSILEKVVHPSRKN

Query:  WSFRLDEALWAYRTTYKTPL
        WS RLDEALWAYRT YKTP+
Subjt:  WSFRLDEALWAYRTTYKTPL

A0A2G9HWF8 Reverse transcriptase    3.7e-255    40.16
Query:  CGEDHNYEFCPNNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSGSSLEAMMKEFMA
        CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNFSW    GQGS  + QQ                      Q N+             
Subjt:  CGEDHNYEFCPNNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSGSSLEAMMKEFMA

Query:  RIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLE----EPRKTQDIERNSDKSVVAEKELESGQGVGGSNKD
                                                +PR++GK Q +AVTLR+G+ L+    EP K+++ E  S++    EKE+E+          
Subjt:  RIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLE----EPRKTQDIERNSDKSVVAEKELESGQGVGGSNKD

Query:  AGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLP
                   P  V  P  + P PFPQR + +    QF KFLE+ K+  INIP  EA+EQ+P+Y KF+KDIL+KK+RLG++ETV+LTEECSAI++N LP
Subjt:  AGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLP

Query:  PKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK---------------------------------------------------------LG
        PK KDPG              RALC LGASINLMP S+YR                                                          LG
Subjt:  PKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK---------------------------------------------------------LG

Query:  RPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIKILESTI------------VETTIQDSADKHLEDHGET---------------
        RPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++              +E  + D  D+  E+  E                
Subjt:  RPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIKILESTI------------VETTIQDSADKHLEDHGET---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------TPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAY
                                          +PVQCVPKKGG+TVV N  NE IPT+TVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +
Subjt:  ----------------------------------TPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAY

Query:  YCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLL----AFAMLQQHLAVYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHL
        YCFLDGYSGYNQI IAPED+EKTTFTCPYGTFAFR+       A A  Q+ +   ++ F+DM+E+ +EVFMDDFSV+G SF  CL+NL  VLKRCEDT+L
Subjt:  YCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLL----AFAMLQQHLAVYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHL

Query:  VLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLD----------------------------------CRKAFETLKSALI
        VLNWEKCHFMV+EGIVLGH++S  G+EVD+AK+E IE+L P  SVKG++SFL                                   C  AF+ LK  LI
Subjt:  VLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLD----------------------------------CRKAFETLKSALI

Query:  STPILCAPNWNLPFEVMCDASDAA----------------------------------------------------------------------------
        S PI+  P+W+ PFE+MCDASD A                                                                            
Subjt:  STPILCAPNWNLPFEVMCDASDAA----------------------------------------------------------------------------

Query:  ----------EFDLEIKDKKGSENVIADHLSSLL------KQSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFY
                  EFDLEI+D+KG EN IADHLS L       + + I+D+FPDEQL A+   V  D PWYADI N+L  G+ P D+  +QKKKF  D + ++
Subjt:  ----------EFDLEIKDKKGSENVIADHLSSLL------KQSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFY

Query:  WDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWG
        WD+ F++KQ  D ++RRCV   +  +I EQCH+SPYGGHF   RT  +IL  GFFWP LFKD H F   CD CQR GN+  R EMP   ILEVELFDVWG
Subjt:  WDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWG

Query:  IDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISN
        IDFM PF+PS GN+++L+AVDY+ KWVEA+A   +D+K V  F++ +IF RFGTPRA++SD GTH  N     LL+KY +KH+I+TPYHPQ + Q E+SN
Subjt:  IDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISN

Query:  REIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL
        REIK  LEK V  +RK+WS RLDEALWAYRT +KTP+
Subjt:  REIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL

A0A2G9IA86 DNA-directed DNA polymerase | e value: 2.0e-261 | % identity: 41.82
Query:  VNQV--TEEACVYCGEDHNYEFCPNNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSGSS
        VNQV  T   C  CGE H    CPN+  S+ FV N R   NNPYSN YNPGWR HPNFSW         Q++ + P F ++    QQ +Q  P Q    S
Subjt:  VNQV--TEEACVYCGEDHNYEFCPNNPTSVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSGSS

Query:  LEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTE-HPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQG
        LE  + +FMA       S   +++++E Q+GQLAN + +RPQG L  +TE +PR++GK Q +AVTLR+G+ L+E  K  +  ++  K V++EKE      
Subjt:  LEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTE-HPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQG

Query:  VGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSA
             K+  A                   PL   Q+QK K    QF KFLE+ K+  IN P  EA+EQ+P+Y KF+K IL+KK+RLG++ETV+LTEECSA
Subjt:  VGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSA

Query:  ILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK----------------------------------------------------
        I++N LPPK KDPGSFTIP +IG    GRALC LGASINLMP S+YR                                                     
Subjt:  ILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK----------------------------------------------------

Query:  -----LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIKILESTIVETTIQDSADKHL-----EDHGE----------------
             LGRPFLATGR LIDVQKG+LTMRV ++++ FNVFKAMK+P+E ++C  + + ++     +I D  ++ L     ED+ E                
Subjt:  -----LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIKILESTIVETTIQDSADKHL-----EDHGE----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAG
                                             +PVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG
Subjt:  ------------------------------------TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAG

Query:  QAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDT
        + +YCFLDGY           D+EKTTFTCPYGTFAFR+  + F +           ++ F+DM+E+ +EVFMDDFSV+G SF  CL+NL  VLKRCEDT
Subjt:  QAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDT

Query:  HLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLD----------------------------------CRKAFETLKSA
        +LVLNW+KCHFMV+EGIVL H++S  G+EV++AK+E IE+L PP SVKGI+SFL                                   C  AF  LK  
Subjt:  HLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLD----------------------------------CRKAFETLKSA

Query:  LISTPILCAPNWNLPFEVMCDASDAA------------------------------------------EFDLEIKDKKGSENVI-ADH------------
        LIS PI+  P+W+ PFE+MCDASD A                                           FD       G++ ++  DH            
Subjt:  LISTPILCAPNWNLPFEVMCDASDAA------------------------------------------EFDLEIKDKKGSENVI-ADH------------

Query:  ---LSSLLK---QSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILE
           L S  K    + I+D+FPDEQL A+   V  D PWY+DI N+L  G+ P D+  +QKKKF  D + ++WD+ F++KQ  D ++RRCV   +  +ILE
Subjt:  ---LSSLLK---QSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILE

Query:  QCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEA
        QCH+SPYGGHF G RT  +IL  GFFWP LFKDAH F   CD CQR GN+  R EMP   IL+VELFDVWGIDF+ PF+PS GN+++L+AVDYV KWVEA
Subjt:  QCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEA

Query:  IACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAY
        +A   +D+K V  F++ +IF RFGTPRA++SD G H  N      L+KY +KH+I TPYHPQ + Q E+SNREIK ILEK V  +R +WS RLDEALWAY
Subjt:  IACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAY

Query:  RTTYKTPL
        RT YKTP+
Subjt:  RTTYKTPL

A0A6P8CBX2 Reverse transcriptase | e value: 3.3e-259 | % identity: 40.46
Query:  CVYCGEDHN-YEFCPNNPTS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSGSSLEA
        C  C   H+  E    NP++      V FV N QR+N  PYSN YNPGWRNHPNFSW  + + +K       PGF K    P QN    P Q S S +E 
Subjt:  CVYCGEDHN-YEFCPNNPTS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSGSSLEA

Query:  MMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLE-EPRKTQDIERNSDKSVVAEKELESGQGVGG
        +M  +M + D  +Q+ QA++R LE Q+ Q++ +L  RP G LP +TE    E  + V A+ LRSGK LE   RK Q  E + +K    +K  E  Q    
Subjt:  MMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLE-EPRKTQDIERNSDKSVVAEKELESGQGVGG

Query:  SNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK
          K  G                PYVPP+PFP R K +  D QF KFL++ K+  INIP  EA++Q+P+YA+F+KD+LTKK++    E V LT ECS IL+
Subjt:  SNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSAILK

Query:  N---GLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK----------------------------------------------------
             LP K +D GSFT+P +IG       L   GASINLMPLS++RK                                                    
Subjt:  N---GLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRK----------------------------------------------------

Query:  -----LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIKILESTI------------VETTIQD-----SADKH----------
             LGRPFLATG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ + C  I I++  I            +E+ ++D       D+H          
Subjt:  -----LGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIKILESTI------------VETTIQD-----SADKH----------

Query:  -----------------------------------------------------------------LEDHGE-----------------------------
                                                                         L +H E                             
Subjt:  -----------------------------------------------------------------LEDHGE-----------------------------

Query:  -------------------------------------TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLA
                                              +PVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDHFPLPFIDQML++LA
Subjt:  -------------------------------------TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLA

Query:  GQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCED
        G  YYCFLDGYSGYNQI IAPED+EKTTFTCPYGTFAFR+  + F +           +S FSDM+E+ +E+FMDDFSVFG SF+SCL NLG VLKRC++
Subjt:  GQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCED

Query:  THLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DCRKAFETLKS
        T+L+LNWEKCHFMV+EGIVLGH++SK G+EVDRAK+E+IE+L PP S KG++SFL                                  +C +AF  LK 
Subjt:  THLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DCRKAFETLKS

Query:  ALISTPILCAPNWNLPFEVMCDASDAA-------------------------------------------------------------------------
         L S P++ APNW LPFE+MCDASD A                                                                         
Subjt:  ALISTPILCAPNWNLPFEVMCDASDAA-------------------------------------------------------------------------

Query:  -------------EFDLEIKDKKGSENVIADHLSSL---LKQSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFY
                     EFDLEI+D KG+ENV+ADHLS L      S I++ FPDEQL   E   ++  PWYADI N++V  +TP  +  +QKKKF HD K+++
Subjt:  -------------EFDLEIKDKKGSENVIADHLSSL---LKQSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFY

Query:  WDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWG
        WDE +++K C+D +IRRCV   +   I++ CHS   GGHF  +RT  +IL CGF+WP +F D   +   C  CQR GN+  R E+P   IL +ELFDVWG
Subjt:  WDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWG

Query:  IDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISN
        IDFM PF  S  N ++L+AVDYV KWVEA+A   +DA+ V RFL+ +IF+RFG PRA++SD G+H  N    KLL+KY + H+IATPYHPQ   Q E+SN
Subjt:  IDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISN

Query:  REIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL
        REIK ILEK V+ SRK+WS +LD+ALWAYRT +KTP+
Subjt:  REIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL

A0A6P8DLJ8 Reverse transcriptase | e value: 4.1e-246 | % identity: 40.14
Query:  AALVNQVTEEACVYCGEDHN-YEFCPNNPTS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVL
        +AL  QV    C  C   H+  E    NP++      V FV N QR+N  PYSN YNPGWRNHPNFSW  + + +K      + G       P QN    
Subjt:  AALVNQVTEEACVYCGEDHN-YEFCPNNPTS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVL

Query:  PQQNSGSSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEE-PRKTQDIERNSDKSVVAE
        P Q S S +E +M  +M + D  +Q+ QA++R LE+Q+ Q++ +L  RP G LP +TE    E  + V A+ LRSGK LE   RK Q  E + +K    +
Subjt:  PQQNSGSSLEAMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEE-PRKTQDIERNSDKSVVAE

Query:  KELESGQGVGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETV
        K  E  Q      K  G                 YVPP+PFP+R K +  D QF KFL++ K+  INIP  EA++Q+P+YA+F+KD+LTKK++    E V
Subjt:  KELESGQGVGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETV

Query:  SLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRKLGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEME
         LT E     K  +         F  PV     E+                 V   LGRPFLATG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ +
Subjt:  SLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCGLGASINLMPLSVYRKLGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEME

Query:  DCSFIKILESTI------------VETTIQD-----SADKH-----------------------------------------------------------
         C  I I++  I            +E+ ++D       D+H                                                           
Subjt:  DCSFIKILESTI------------VETTIQD-----SADKH-----------------------------------------------------------

Query:  ----------------LEDHGE------------------------------------------------------------------TTPVQCVPKKGG
                        L +H E                                                                   +PVQ VPKKGG
Subjt:  ----------------LEDHGE------------------------------------------------------------------TTPVQCVPKKGG

Query:  VTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQ
        +TVV N+ NELIPTRTVTGWRVC+DYR+LN ATRKDHFPLPFIDQML++L G  YYCFLDGYSGYNQI IAPED+EKTTFTCPYGTFAFR+  + F +  
Subjt:  VTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQ

Query:  QHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVK
                 +S FSDM+E+ +E+FMDDFSVFG SF+SCL NLG VLKRC++T+L+LNWEKCHFMV+EGIVLGH++SK G+EVDRAK+E+IE+L PP S K
Subjt:  QHLA---VYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVK

Query:  GIQSFL----------------------------------DCRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA----------------------
        G++SFL                                  +C +AF  LK  L S P++ APNW LPFE+MC ASD A                      
Subjt:  GIQSFL----------------------------------DCRKAFETLKSALISTPILCAPNWNLPFEVMCDASDAA----------------------

Query:  ----------------------------------------------------------------EFDLEIKDKKGSENVIADHLSSL---LKQSAISDSF
                                                                        EFDLEI+D KG+ENV+ADHLS L      S I++ F
Subjt:  ----------------------------------------------------------------EFDLEIKDKKGSENVIADHLSSL---LKQSAISDSF

Query:  PDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRI
        PDEQL   E   ++  PWYADI N++V  +TP  +  +QKKKF HD K+++WDE +++K C+D +IRRCV   +   I++ CHS   GGHF  +RT  +I
Subjt:  PDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEQFMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRI

Query:  LHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIF
        L CGF+WP +F D   +   C  CQR GN+  R E+P   IL +ELFDVWGIDFM PF  S  N ++L+AVDYV KWVEA+A   +DA+ V RFL+ +IF
Subjt:  LHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFMRPFLPSNGNVFMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIF

Query:  ARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL
        +R G PRA++SD G+H  N    KLL+KY + H+IATPYHPQ   Q E+SNREIK ILEK V+ SRK+WS +LD+ALWAYRT +KTP+
Subjt:  ARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDEALWAYRTTYKTPL

SwissProt top hits | e value | % identity | Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 3.9e-23 | % identity: 27.16
Query:  VETTIQDSADKHL---EDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ
        VE+ IQD  ++ +    +    +P+  VPKK      S K            +R+ +DYR+LN+ T  D  P+P +D++L +L    Y+  +D   G++Q
Subjt:  VETTIQDSADKHL---EDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQ

Query:  ITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLAVYVSNFSDMIESTVE----VFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVK
        I + PE   KT F+  +G + + +  + F  L+   A +    +D++   +     V++DD  VF  S    L +LG V ++    +L L  +KC F+ +
Subjt:  ITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLAVYVSNFSDMIESTVE----VFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVK

Query:  EGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL---------------------DCRK--------------AFETLKSALISTPILCAPNWN
        E   LGH ++ +G++ +  KIE I++   P   K I++FL                      C K              AF+ LK  +   PIL  P++ 
Subjt:  EGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL---------------------DCRK--------------AFETLKSALISTPILCAPNWN

Query:  LPFEVMCDASDAA
          F +  DASD A
Subjt:  LPFEVMCDASDAA

P10394 Retrovirus-related Pol polyprotein from transposon 412 | e value: 2.8e-26 | % identity: 25.27
Query:  KILESTIVETTIQDSADKHLEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYS
        K+++  IVE ++              +P+  VPKK              P      WR+ +DYR++NK    D FPLP ID +LD+L    Y+  LD  S
Subjt:  KILESTIVETTIQDSADKHLEDHGETTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYS

Query:  GYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHL-AVYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMV
        G++QI +    ++ T+F+   G++ F +      +       +    FS +  S   ++MDD  V G S +  L NL  V  +C + +L L+ EKC F +
Subjt:  GYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHL-AVYVSNFSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMV

Query:  KEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DCRKAFETLKSALISTPILCAPNWN
         E   LGH+ +  G+  D  K +VI+    P+     + F+                                  +C+KAF  LKS LI+  +L  P+++
Subjt:  KEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL----------------------------------DCRKAFETLKSALISTPILCAPNWN

Query:  LPFEVMCDASDAAEFDLEIKDKKGSENVIADHLSSLLKQSAISDSFPDEQLFAVEVKVVRDAPW
          F +  DAS  A   +  ++  G +  +A + S    +   + S  +++L A+   ++   P+
Subjt:  LPFEVMCDASDAAEFDLEIKDKKGSENVIADHLSSLLKQSAISDSFPDEQLFAVEVKVVRDAPW

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e value: 1.8e-20 | % identity: 26.36
Query:  WRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLAVYVSN-FSDMIESTV
        +RV +DYR+LN+ T  D +P+P +D++L +L    Y+  +D   G++QI +  E   KT F+   G + + +               ++N    ++    
Subjt:  WRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLAVYVSN-FSDMIESTV

Query:  EVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL---------------
         V++DD  +F  S    L+++  V  +  D +L L  +KC F+ KE   LGH ++ +G++ +  K++ I     P   K I++FL               
Subjt:  EVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFL---------------

Query:  ------DCRK--------------AFETLKSALISTPILCAPNWNLPFEVMCDASDAA
               C K              AFE LK+ +I  PIL  P++   F +  DAS+ A
Subjt:  ------DCRK--------------AFETLKSALISTPILCAPNWNLPFEVMCDASDAA

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 7.0e-25 | % identity: 28.98
Query:  TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFA
        ++PV  VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + 
Subjt:  TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFA

Query:  FRQCLLAFAMLQQHLAVYVSN-FSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVI
        +              A Y+++ F D+    V V++DD  +F  S +    +L  VL+R ++ +L++  +KC F  +E   LG+ I    +   + K   I
Subjt:  FRQCLLAFAMLQQHLAVYVSN-FSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVI

Query:  ERLEPPNSVKGIQSFL-----------DCR---------------------KAFETLKSALISTPILCAPNWNLPFEVMCDAS-DAAEFDLEIKDKKGSE
             P +VK  Q FL           +C                      KA E LK+AL ++P+L   N    + +  DAS D     LE  D K   
Subjt:  ERLEPPNSVKGIQSFL-----------DCR---------------------KAFETLKSALISTPILCAPNWNLPFEVMCDAS-DAAEFDLEIKDKKGSE

Query:  NVIADHLSSLLKQS
          +  + S  L+ +
Subjt:  NVIADHLSSLLKQS

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 2.0e-24 | % identity: 28.66
Query:  TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFA
        ++PV  VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + 
Subjt:  TTPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFA

Query:  FRQCLLAFAMLQQHLAVYVSN-FSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVI
        +              A Y+++ F D+    V V++DD  +F  S +    +L  VL+R ++ +L++  +KC F  +E   LG+ I    +   + K   I
Subjt:  FRQCLLAFAMLQQHLAVYVSN-FSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVI

Query:  ERLEPPNSVKGIQSFL-----------DCR---------------------KAFETLKSALISTPILCAPNWNLPFEVMCDAS-DAAEFDLEIKDKKGSE
             P +VK  Q FL           +C                      KA + LK AL ++P+L   N    + +  DAS D     LE  D K   
Subjt:  ERLEPPNSVKGIQSFL-----------DCR---------------------KAFETLKSALISTPILCAPNWNLPFEVMCDAS-DAAEFDLEIKDKKGSE

Query:  NVIADHLSSLLKQS
          +  + S  L+ +
Subjt:  NVIADHLSSLLKQS

Arabidopsis top hits | e value | % identity | Alignment
ATMG00750.1 GAG/POL/ENV polyprotein | e value: 8.5e-18 | % identity: 53.42
Query:  ILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFM-------RPFLPSNGNV
        +L  GF+WPT FKDAH F   CD CQR+GN   R+EMP  +ILEVE+FDVWGI FM       +P  P+ G +
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFM-------RPFLPSNGNV


Sequences
CDS sequence
ATGCCCTTGGTGAGGAGTCTTGGAGTTCTACACCATTTAACTGAAGACAAACCTCCAAAGGCAGAAGTGAACAACAAAGAAGGTAAAGCAGAAACTAATCCAATTTTTCA
GACTTGGCTCAACAATGATGGCTTGCTAACCTCTTGGGTACTCGGGATCATTTCTGAAGAAGTCTTAGACATGATTGAAGGATTGGATTCAGTCTACGAGGTTACTTCCA
CCTTTGGGAGCCGAATAGAGGAGAGATTGGAGACCATCAAGGAGACCATCGATTGCAAGCTGTTAAAGGGAATTAATCTTTTCTTGTTTTATCTTGTAGTCATGGACATG
CTTACTCAGATTGTAACTTGTGTTTTCATGCATCTTATGAACCAAGTAGCTTTAGGCGTTGGTGTGAATTTGGTGCATGAGCGATCCGCCTGGGGTAAGGGTCCTGGAGA
TCCAGCAGACCCCCAGAATCGTTTACTGCAGCGAAATCCACCGCTGGAGCAAAATGTGCAGCAAAATAATCAGACTGAGAATCCTATCTTGGTAGCGAACGATAGGACCG
GAGCCATTCGAGCATATGCTTTTCCAATGTTTGATGAGTTAAATCCAGGGATTGCACGTCCTCAAATTCAAGAGGAAAATTTTGAAATTAAACCGAATGTGACAATGATT
AGTCATCAGCAGCCACCAGCAGTGGAGCCTGCTGCATTGGTGAACCAAGTCACAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAA
TCCAACTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTA
ATATGAAAGCGCAACAAAAGGTGAACCAACCGGGATTTGCTAAAGCGCAGGTATTGCCCCAGCAAAATAAGCAGGTTTTGCCCCAGCAAAATTCGGGGAGTTCTCTCGAG
GCAATGATGAAAGAATTTATGGCTCGTATAGACGTCGCAATTCAAAGTAATCAAGCTTCAATGAGAGTCCTGGAATTACAAGTGGGTCAGCTAGCTAATGAGCTGATGGC
ACGACCTCAGGGGAAACTTCCCTTAGATACCGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGCCTAGAA
AGACCCAGGATATAGAAAGAAATAGTGATAAAAGTGTTGTTGCTGAGAAAGAGTTGGAGTCTGGTCAGGGTGTTGGAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCT
GTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCATGATGGTCAATTTAAAAAGTTTTT
AGAGATTCTTAAGCAATGGGATATAAATATCCCTTTAGTAGAAGCTATTGAGCAGTTGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAGAAGAAGAGGTTAG
GAGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCACCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTATCTATAGGT
GGAAAAGAATTAGGTAGAGCACTCTGTGGTTTAGGTGCGAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATT
AATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTATGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGACGAAATGGAAGATTGCTCTTTCA
TTAAGATTCTGGAGAGCACAATTGTTGAGACAACAATACAAGATTCGGCTGACAAACATTTGGAAGATCATGGAGAGACAACCCCTGTCCAATGTGTTCCTAAGAAAGGA
GGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTGATCCCAACTAGGACAGTAACTGGCTGGAGGGTTTGCATGGACTACAGGAGGCTTAATAAAGCTACCCGTAAGGA
CCATTTCCCTCTACCATTTATTGATCAGATGTTGGATAGATTGGCTGGTCAGGCCTACTACTGTTTCTTAGATGGTTACTCTGGGTATAACCAGATTACTATTGCTCCTG
AGGATAAGGAGAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTTAGGCAATGCCTTTTGGCCTTTGCAATGCTCCAACAACATTTAGCGGTGTATGTTAGCAAT
TTTTCTGATATGATTGAGTCTACTGTTGAGGTATTTATGGACGATTTCTCAGTGTTTGGAGGGTCTTTTCAGAGTTGTTTAGATAATTTAGGTCATGTGTTAAAAAGATG
TGAAGATACCCATCTAGTTCTTAATTGGGAAAAATGCCACTTCATGGTGAAGGAGGGCATAGTGTTAGGTCATAGGATTTCTAAGAATGGTCTAGAAGTTGATAGAGCAA
AAATAGAGGTGATTGAAAGACTAGAACCACCAAATTCAGTGAAAGGGATTCAGAGTTTTTTAGATTGTAGGAAGGCTTTTGAGACTTTAAAATCTGCTTTAATCTCAACA
CCCATTCTTTGTGCACCTAATTGGAATTTACCATTTGAGGTAATGTGTGATGCAAGTGATGCTGCGGAGTTTGACTTAGAGATAAAGGATAAGAAGGGATCAGAAAATGT
TATTGCAGATCACTTGTCATCTTTGCTGAAGCAATCTGCCATTTCAGATTCTTTTCCAGATGAACAACTTTTTGCTGTTGAGGTAAAGGTAGTCAGGGATGCCCCTTGGT
ATGCTGACATTGCCAACTTTTTGGTAAAGGGGGTCACTCCTATTGACATGGATTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCAAAGTTTTTCTATTGGGATGAGCAA
TTTATGTATAAGCAATGTTCTGACGGTCTTATTCGAAGGTGTGTTTCCAGTGATAAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAG
CGGTCAGAGGACAACTATGAGGATTCTGCATTGTGGATTCTTCTGGCCTACCTTATTTAAGGATGCCCATTGGTTCTACAAGCAATGTGATGTTTGCCAAAGGAGAGGAA
ATTTAGGGCCTAGAGATGAAATGCCTTTTACTTACATTTTAGAAGTTGAATTATTCGATGTATGGGGTATTGATTTTATGAGGCCATTTCTCCCTTCTAATGGCAATGTT
TTTATGTTATTGGCAGTTGATTACGTGTTCAAGTGGGTTGAGGCCATTGCATGCCATCAAAGTGATGCCAAGACAGTAGCAAGGTTCCTTCAATCGCACATCTTTGCACG
GTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATATTGTTAATAATATCTTAACTAAGCTTTTAGCTAAGTATGAGATTAAGCATAGGATAGCTACCCCTT
ATCACCCACAAGCAAATAGTCAAGCTGAGATTAGTAATAGGGAAATTAAATCTATTCTAGAGAAAGTAGTCCATCCATCTAGAAAGAATTGGTCTTTTAGGTTGGATGAG
GCTCTTTGGGCTTATAGGACAACCTATAAGACTCCTCTAGACACCAATCGCCGACTAGCTCCTCCCTCCAGACGCCATCTCCGCGCGACAACAACGACGCAGCACCCCTC
CACAGCCGCCGTCTCTCTCTCTCTGAAACCCACCTGCGAAATCCACGAAACCCACCTGCAAGAAACCCACCTGCGAAACCCACCGTTGCACGAAACCCACTCGCGTCGAC
CCCTCTCTCAAGACGCATCTCCGACGACAGCGAAGTGCAGACGACGCAGTACCCCTCCGCCGCCGCCGTCTCTCTCTGAAACCCACGAAACTCACCGCTGCACGAAACCC
ATTCGCGTCACTGTCGAACTTCACCGCTGCACCCTCGCACCGCCGCCGTTTCTCTCTCTCCCTCTTGCTCGTGATGGTTTCCGCAACAAGAATGGGATCTGA
mRNA sequence
ATGCCCTTGGTGAGGAGTCTTGGAGTTCTACACCATTTAACTGAAGACAAACCTCCAAAGGCAGAAGTGAACAACAAAGAAGGTAAAGCAGAAACTAATCCAATTTTTCA
GACTTGGCTCAACAATGATGGCTTGCTAACCTCTTGGGTACTCGGGATCATTTCTGAAGAAGTCTTAGACATGATTGAAGGATTGGATTCAGTCTACGAGGTTACTTCCA
CCTTTGGGAGCCGAATAGAGGAGAGATTGGAGACCATCAAGGAGACCATCGATTGCAAGCTGTTAAAGGGAATTAATCTTTTCTTGTTTTATCTTGTAGTCATGGACATG
CTTACTCAGATTGTAACTTGTGTTTTCATGCATCTTATGAACCAAGTAGCTTTAGGCGTTGGTGTGAATTTGGTGCATGAGCGATCCGCCTGGGGTAAGGGTCCTGGAGA
TCCAGCAGACCCCCAGAATCGTTTACTGCAGCGAAATCCACCGCTGGAGCAAAATGTGCAGCAAAATAATCAGACTGAGAATCCTATCTTGGTAGCGAACGATAGGACCG
GAGCCATTCGAGCATATGCTTTTCCAATGTTTGATGAGTTAAATCCAGGGATTGCACGTCCTCAAATTCAAGAGGAAAATTTTGAAATTAAACCGAATGTGACAATGATT
AGTCATCAGCAGCCACCAGCAGTGGAGCCTGCTGCATTGGTGAACCAAGTCACAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAA
TCCAACTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTA
ATATGAAAGCGCAACAAAAGGTGAACCAACCGGGATTTGCTAAAGCGCAGGTATTGCCCCAGCAAAATAAGCAGGTTTTGCCCCAGCAAAATTCGGGGAGTTCTCTCGAG
GCAATGATGAAAGAATTTATGGCTCGTATAGACGTCGCAATTCAAAGTAATCAAGCTTCAATGAGAGTCCTGGAATTACAAGTGGGTCAGCTAGCTAATGAGCTGATGGC
ACGACCTCAGGGGAAACTTCCCTTAGATACCGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGCCTAGAA
AGACCCAGGATATAGAAAGAAATAGTGATAAAAGTGTTGTTGCTGAGAAAGAGTTGGAGTCTGGTCAGGGTGTTGGAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCT
GTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCATGATGGTCAATTTAAAAAGTTTTT
AGAGATTCTTAAGCAATGGGATATAAATATCCCTTTAGTAGAAGCTATTGAGCAGTTGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAGAAGAAGAGGTTAG
GAGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTTAAGAATGGGCTACCACCCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTATCTATAGGT
GGAAAAGAATTAGGTAGAGCACTCTGTGGTTTAGGTGCGAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATT
AATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTATGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGACGAAATGGAAGATTGCTCTTTCA
TTAAGATTCTGGAGAGCACAATTGTTGAGACAACAATACAAGATTCGGCTGACAAACATTTGGAAGATCATGGAGAGACAACCCCTGTCCAATGTGTTCCTAAGAAAGGA
GGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTGATCCCAACTAGGACAGTAACTGGCTGGAGGGTTTGCATGGACTACAGGAGGCTTAATAAAGCTACCCGTAAGGA
CCATTTCCCTCTACCATTTATTGATCAGATGTTGGATAGATTGGCTGGTCAGGCCTACTACTGTTTCTTAGATGGTTACTCTGGGTATAACCAGATTACTATTGCTCCTG
AGGATAAGGAGAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTTAGGCAATGCCTTTTGGCCTTTGCAATGCTCCAACAACATTTAGCGGTGTATGTTAGCAAT
TTTTCTGATATGATTGAGTCTACTGTTGAGGTATTTATGGACGATTTCTCAGTGTTTGGAGGGTCTTTTCAGAGTTGTTTAGATAATTTAGGTCATGTGTTAAAAAGATG
TGAAGATACCCATCTAGTTCTTAATTGGGAAAAATGCCACTTCATGGTGAAGGAGGGCATAGTGTTAGGTCATAGGATTTCTAAGAATGGTCTAGAAGTTGATAGAGCAA
AAATAGAGGTGATTGAAAGACTAGAACCACCAAATTCAGTGAAAGGGATTCAGAGTTTTTTAGATTGTAGGAAGGCTTTTGAGACTTTAAAATCTGCTTTAATCTCAACA
CCCATTCTTTGTGCACCTAATTGGAATTTACCATTTGAGGTAATGTGTGATGCAAGTGATGCTGCGGAGTTTGACTTAGAGATAAAGGATAAGAAGGGATCAGAAAATGT
TATTGCAGATCACTTGTCATCTTTGCTGAAGCAATCTGCCATTTCAGATTCTTTTCCAGATGAACAACTTTTTGCTGTTGAGGTAAAGGTAGTCAGGGATGCCCCTTGGT
ATGCTGACATTGCCAACTTTTTGGTAAAGGGGGTCACTCCTATTGACATGGATTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCAAAGTTTTTCTATTGGGATGAGCAA
TTTATGTATAAGCAATGTTCTGACGGTCTTATTCGAAGGTGTGTTTCCAGTGATAAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAG
CGGTCAGAGGACAACTATGAGGATTCTGCATTGTGGATTCTTCTGGCCTACCTTATTTAAGGATGCCCATTGGTTCTACAAGCAATGTGATGTTTGCCAAAGGAGAGGAA
ATTTAGGGCCTAGAGATGAAATGCCTTTTACTTACATTTTAGAAGTTGAATTATTCGATGTATGGGGTATTGATTTTATGAGGCCATTTCTCCCTTCTAATGGCAATGTT
TTTATGTTATTGGCAGTTGATTACGTGTTCAAGTGGGTTGAGGCCATTGCATGCCATCAAAGTGATGCCAAGACAGTAGCAAGGTTCCTTCAATCGCACATCTTTGCACG
GTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATATTGTTAATAATATCTTAACTAAGCTTTTAGCTAAGTATGAGATTAAGCATAGGATAGCTACCCCTT
ATCACCCACAAGCAAATAGTCAAGCTGAGATTAGTAATAGGGAAATTAAATCTATTCTAGAGAAAGTAGTCCATCCATCTAGAAAGAATTGGTCTTTTAGGTTGGATGAG
GCTCTTTGGGCTTATAGGACAACCTATAAGACTCCTCTAGACACCAATCGCCGACTAGCTCCTCCCTCCAGACGCCATCTCCGCGCGACAACAACGACGCAGCACCCCTC
CACAGCCGCCGTCTCTCTCTCTCTGAAACCCACCTGCGAAATCCACGAAACCCACCTGCAAGAAACCCACCTGCGAAACCCACCGTTGCACGAAACCCACTCGCGTCGAC
CCCTCTCTCAAGACGCATCTCCGACGACAGCGAAGTGCAGACGACGCAGTACCCCTCCGCCGCCGCCGTCTCTCTCTGAAACCCACGAAACTCACCGCTGCACGAAACCC
ATTCGCGTCACTGTCGAACTTCACCGCTGCACCCTCGCACCGCCGCCGTTTCTCTCTCTCCCTCTTGCTCGTGATGGTTTCCGCAACAAGAATGGGATCTGA
Protein sequence
MPLVRSLGVLHHLTEDKPPKAEVNNKEGKAETNPIFQTWLNNDGLLTSWVLGIISEEVLDMIEGLDSVYEVTSTFGSRIEERLETIKETIDCKLLKGINLFLFYLVVMDM
LTQIVTCVFMHLMNQVALGVGVNLVHERSAWGKGPGDPADPQNRLLQRNPPLEQNVQQNNQTENPILVANDRTGAIRAYAFPMFDELNPGIARPQIQEENFEIKPNVTMI
SHQQPPAVEPAALVNQVTEEACVYCGEDHNYEFCPNNPTSVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGSNMKAQQKVNQPGFAKAQVLPQQNKQVLPQQNSGSSLE
AMMKEFMARIDVAIQSNQASMRVLELQVGQLANELMARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKSVVAEKELESGQGVGGSNKDAGASGS
VPDVEPPYVPPPPYVPPLPFPQRQKPKNHDGQFKKFLEILKQWDINIPLVEAIEQLPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIG
GKELGRALCGLGASINLMPLSVYRKLGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIKILESTIVETTIQDSADKHLEDHGETTPVQCVPKKG
GVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDKEKTTFTCPYGTFAFRQCLLAFAMLQQHLAVYVSN
FSDMIESTVEVFMDDFSVFGGSFQSCLDNLGHVLKRCEDTHLVLNWEKCHFMVKEGIVLGHRISKNGLEVDRAKIEVIERLEPPNSVKGIQSFLDCRKAFETLKSALIST
PILCAPNWNLPFEVMCDASDAAEFDLEIKDKKGSENVIADHLSSLLKQSAISDSFPDEQLFAVEVKVVRDAPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEQ
FMYKQCSDGLIRRCVSSDKAKEILEQCHSSPYGGHFSGQRTTMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPFTYILEVELFDVWGIDFMRPFLPSNGNV
FMLLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHIVNNILTKLLAKYEIKHRIATPYHPQANSQAEISNREIKSILEKVVHPSRKNWSFRLDE
ALWAYRTTYKTPLDTNRRLAPPSRRHLRATTTTQHPSTAAVSLSLKPTCEIHETHLQETHLRNPPLHETHSRRPLSQDASPTTAKCRRRSTPPPPPSLSETHETHRCTKP
IRVTVELHRCTLAPPPFLSLPLARDGFRNKNGI
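
The relationship between the CDS and protein sequences listed above can be spot-checked with a short sketch. It assumes the standard nuclear genetic code (NCBI translation table 1) applies to this gene, which the page does not state explicitly. The function name `translate` and the variables `cds_start` and `cds_end` are hypothetical names chosen for illustration. Only the first 30 and last 12 nucleotides of the CDS are copied in, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions. Assumed to apply here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Stop codons are rendered as '*', unknown codons as 'X', and any
    trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 and last 12 nucleotides of the CDS sequence listed above.
cds_start = "ATGCCCTTGGTGAGGAGTCTTGGAGTTCTA"
cds_end = "AATGGGATCTGA"

print(translate(cds_start))  # MPLVRSLGVL, the first ten residues of the protein
print(translate(cds_end))    # NGI*, the last three residues plus the stop codon
```

Pasting the full CDS into `translate` and dropping the trailing `*` should reproduce the protein sequence, provided the table-1 assumption holds.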