; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005548 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005548
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr6:21536755..21540921
RNA-Seq ExpressionLag0005548
SyntenyLag0005548
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]8.6e-24056.42Show/hide
Query:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPL
        +NNPYSN YNPGWR HPNFSW    N  Q      Q G   +   P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LP 
Subjt:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPL

Query:  DTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFK
        +TE +PR++GK Q +AVTLR+G+ L+E  K +   S    ++  EKE E                    VE P     P     PFPQR + +  + QF 
Subjt:  DTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFK

Query:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYR
        KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GRALCDLGA INLMP S+YR
Subjt:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYR

Query:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE
         LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E
Subjt:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE

Query:  MEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQ
         ++C                                                         S  R   S V++ +I+D      K L  H      GE  
Subjt:  MEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQ

Query:  TL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCV
        TL                               ADI+GIS SFCMHKI LE+    S+E +RRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCV
Subjt:  TL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCV

Query:  PKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFG
        PKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFG
Subjt:  PKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFG

Query:  LCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        LCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF   L NL  VLKRCEDT+L+LNWEKCHFMV+EGIVLGHK+S  G++VD+ K+E I+
Subjt:  LCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.1e-23756.21Show/hide
Query:  RNNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK
        +NNPYSN YNPGWR HPNFSW    GQG+  + QQ   QP        P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG 
Subjt:  RNNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK

Query:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG
        LP +TE +PR++GK Q +AVTLR+G+ L+E               V++K  +S +    SKE         +VE P     P     PFPQ+ + +  + 
Subjt:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG

Query:  QFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLS
        QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ET +LTEEC+AI++N LPPK KDPGSFTIP +IG    GRALCDLGA INLMP S
Subjt:  QFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLS

Query:  VYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY
        +YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+
Subjt:  VYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY

Query:  PDEMEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------G
        P+E ++C                                                         S  R   S V++ +I+D      K L  H      G
Subjt:  PDEMEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------G

Query:  EHQTL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPV
        E  TL                               ADI+GIS SFCMHKI LE+    S+E +RRLN  MKEVVKKE+IKWLDAGIIYPI+DS+WVSPV
Subjt:  EHQTL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPV

Query:  QCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRM
        QCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRM
Subjt:  QCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRM

Query:  PFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        PFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF   L NL  VLKRCEDT+LVLNWEKCHFMV+EGIVLGHK+S  G++VD+ K+E I+
Subjt:  PFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]4.4e-23657.35Show/hide
Query:  AMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPL------------EESRKTQDLNSNNYNIVVI
        +++KE+MA+ DAAIQS QAS+R LE+QVGQLANEL+ RP  KLP DTE P+REG EQ +A+ LRSGK +              S++T D        VV 
Subjt:  AMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPL------------EESRKTQDLNSNNYNIVVI

Query:  EKELESGQGAGGSKENAGASGSVPDVEPPYVPPPP--------YVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKK
        E+  ++   A   KE++     +  V+PP              Y P  PFPQR K K ++  F+KF++I K++HINIPLVEA++QMPNY KFLKD+LT +
Subjt:  EKELESGQGAGGSKENAGASGSVPDVEPPYVPPPP--------YVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKK

Query:  KRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVD
        ++  EF+ V L EECSAILKN +P K KDPGSFTIP+SIGGK+LGRALCDLG+ INLMPLS+Y+KLGIGEARPTTVTLQLADRS TYPEGKIED+L++VD
Subjt:  KRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVD

Query:  KFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI-----------------------------
        KFIFP DFIILDYEAD DVPIILGRPFL TGR L+DV KG +T+R+ +++V+FN+  +MKYP   E+CS +                             
Subjt:  KFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI-----------------------------

Query:  --------------------------------------------------------RILESTVIETTIQDSADKHLEKH--GEHQTLADIQGISQSFCMH
                                                                 I+ S  + +T +D   + L+KH      TLADI+GIS S CMH
Subjt:  --------------------------------------------------------RILESTVIETTIQDSADKHLEKH--GEHQTLADIQGISQSFCMH

Query:  KITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLP
        KI LEEG  +SIEQ+RRLNP MKEVV+KE++KWLDAGIIYPIA+S+ VSP+QCVPKKGG+TV++N++NELI TR V GWR+CMDYRRLNKATRK+HFPLP
Subjt:  KITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLP

Query:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLG
        FIDQMLDRLAG+++YCFLDGYSGYNQITI+PEDQEKTTFTCPYG FAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +E+FMDDFSV+G SF++ L NLG
Subjt:  FIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLG

Query:  KVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        KVL+RCE+ +LVLNWEKCHFMV EGIVLGHKIS+ G++VDR K+E I+
Subjt:  KVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]4.0e-22151.55Show/hide
Query:  QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSG-----NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARP
        Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF  K + P+           L     E   R +A +    +S R +E+Q+GQLAN + +R 
Subjt:  QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSG-----NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARP

Query:  QGKLPLDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKP
        QG LP  TE      KE  KAVTLRSGK L +                +  E   G      + N   S  V D+     P  P  PYVPP+PFPQR K 
Subjt:  QGKLPLDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKP

Query:  KNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARIN
           D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+IP +IG  +  RALCDLGA ++
Subjt:  KNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARIN

Query:  LMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVF
        LMPLSV RKLG+ E +PTT++LQLADRS+ YP G +E+VL+KV KFI PVDFI+L+ E D ++PIILGRPFLAT  A+IDV+ G LT++V  EEV+FN+F
Subjt:  LMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVF

Query:  KAMKYP-----------------------------------------DEMEDCSFIRILEST--------------------------------------
        +A KYP                                         D +E       LE+T                                      
Subjt:  KAMKYP-----------------------------------------DEMEDCSFIRILEST--------------------------------------

Query:  ---------------VIETTIQD-SADK-----HLEKHGEHQTLADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIA
                       ++  ++ D   DK      L K     T++D++GIS S CMH+I +E+     +E +RRLNP MKEVV+ EV+KWLDAGIIYPI+
Subjt:  ---------------VIETTIQD-SADK-----HLEKHGEHQTLADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIA

Query:  DSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPY
        DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRK+HFPLPF+DQ+L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPY
Subjt:  DSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPY

Query:  GTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREK
        GTFAFRRMPFGLCNAPATFQRCM+AIFSD +E  +EVFMDDFSVFG SF S L+NL +VL+RCE+T+LVLNWEKCHFMV+EGIVLGHKIS  GL+VDR K
Subjt:  GTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREK

Query:  VEVID
        +E+I+
Subjt:  VEVID

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]4.0e-22151.43Show/hide
Query:  QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSG-----NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARP
        Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF  K + P+           L     E   R +A +    +S R +E+Q+GQLAN + +R 
Subjt:  QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSG-----NSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARP

Query:  QGKLPLDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKP
        QG LP  TE      KE  KAVTLRSGK L +                +  E   G      + N   S  V D+     P  P  PYVPP+PFPQR K 
Subjt:  QGKLPLDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKP

Query:  KNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARIN
           D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+IP +IG  +  RALCDLGA ++
Subjt:  KNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARIN

Query:  LMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVF
        LMPLSV RKLG+ E +PTT++LQLADRS+ YP G +E+VL+KV KFI PVDFI+L+ E D ++PIILGRPFLAT  A+IDV+ G LT++V  EEV+FN+F
Subjt:  LMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVF

Query:  KAMKYP-----------------------------------------DEMEDCSFIRILEST--------------------------------------
        +A KYP                                         D +E       LE+T                                      
Subjt:  KAMKYP-----------------------------------------DEMEDCSFIRILEST--------------------------------------

Query:  ---------------VIETTIQD-SADK-----HLEKHGEHQTLADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIA
                       ++  ++ D   DK      L K     T++D++GIS S CMH+I +E+     +E +RRLNP MKEVV+ EV+KWLDAGIIYPI+
Subjt:  ---------------VIETTIQD-SADK-----HLEKHGEHQTLADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIA

Query:  DSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPY
        DS W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYR+LN  TRK+HFPLPF+DQ+L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPY
Subjt:  DSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPY

Query:  GTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREK
        GTFAFRRMPFGLCNAPATFQRCM+AIFSD +E  +E+FMDDFSVFG SF S L+NL +VL+RCE+T+LVLNWEKCHFMV+EGIVLGHKIS  GL+VDR K
Subjt:  GTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREK

Query:  VEVID
        +E+I+
Subjt:  VEVID

TrEMBL top hitse value%identityAlignment
A0A2G9H4Y0 Reverse transcriptase6.3e-22053.11Show/hide
Query:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPL
        +NNPYSN YNPGWR HPNFSW    N  Q      Q G   +   P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG L  
Subjt:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPL

Query:  DTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFK
        +TE +PR++ K Q +AVTLR+G+ L+E  K +   S    ++  +KE E                    VE P                           
Subjt:  DTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFK

Query:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYR
             L++LHINIP  EA+EQMP+Y KF+KDIL+KK+R+G+++TV+LTEECSAI++N LPPK KDPGSFTIP +IG    G+ALCDLGA INLM  S+YR
Subjt:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYR

Query:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE
         LG+GEA+PT++TLQL DRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E
Subjt:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE

Query:  MEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQ
         ++C                                                         S  R   S V++ +I++      K L  H      GE  
Subjt:  MEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQ

Query:  TL------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVT
        TL                         DI+ IS SFCMHKI LE+    S+E +RRLN  MKEVVKKE+IKWLDAGIIYPI+DS+W+SPVQCVPKKGG+T
Subjt:  TL------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVT

Query:  VVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPAT
        VV N  NELIPTRTVTGWRVCMDYR+LNKATRK+HFPL FIDQMLD+LAG+ +YCFLDGYSGYNQI+I  E+QEKTTFTCPYGTFAFRRMPFGLCNAPAT
Subjt:  VVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPAT

Query:  FQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        FQRCM+AIF+DM+E+ +EVFMDDFSV+G SF   L NL  VLKR EDT+LVLNWEKCHFMV+EGIVLGHK+S  G++VD+ K+E+I+
Subjt:  FQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

A0A2G9HH15 Reverse transcriptase8.2e-22053.26Show/hide
Query:  RNNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK
        +NNPYSN YNPGWR HPNFSW    GQG  ++ QQ     G   +   P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RP+  
Subjt:  RNNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK

Query:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG
        LP +TE +PR++ K Q +AVTLR+G  L+E                + KE                                       P + K      
Subjt:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG

Query:  QFKKFLEILKQLHINIPL-VEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPL
          K+ +   K   I  PL V+A+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GRALCDLGA INLMP 
Subjt:  QFKKFLEILKQLHINIPL-VEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPL

Query:  SVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMK
        S+YR LG+GEA+PT++TLQLADRS+TYP G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK
Subjt:  SVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMK

Query:  YPDEM---------------------------------------EDCSFIRILE------------------STVIETTIQDSAD---KHLEKH------
        +P+E                                        EDC  ++ L+                  S V++ +I++      K L  H      
Subjt:  YPDEM---------------------------------------EDCSFIRILE------------------STVIETTIQDSAD---KHLEKH------

Query:  GEHQTL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSP
        GE  TL                               ADI+GIS SFCMHKI LE+G   S+E +RRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSP
Subjt:  GEHQTL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSP

Query:  VQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRR
        VQCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI I PEDQEKTTFTCPYGTF FR+
Subjt:  VQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRR

Query:  MPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        MPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF   L NL  VLKRCEDT+L+LNWEKCHFMV+EGIVL HK+S  G++VD+ K+E I+
Subjt:  MPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

A0A2G9HYA0 Reverse transcriptase4.2e-24056.42Show/hide
Query:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPL
        +NNPYSN YNPGWR HPNFSW    N  Q      Q G   +   P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG LP 
Subjt:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGKLPL

Query:  DTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFK
        +TE +PR++GK Q +AVTLR+G+ L+E  K +   S    ++  EKE E                    VE P     P     PFPQR + +  + QF 
Subjt:  DTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFK

Query:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYR
        KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GRALCDLGA INLMP S+YR
Subjt:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLSVYR

Query:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE
         LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E
Subjt:  KLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE

Query:  MEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQ
         ++C                                                         S  R   S V++ +I+D      K L  H      GE  
Subjt:  MEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQ

Query:  TL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCV
        TL                               ADI+GIS SFCMHKI LE+    S+E +RRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCV
Subjt:  TL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCV

Query:  PKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFG
        PKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFG
Subjt:  PKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFG

Query:  LCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        LCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF   L NL  VLKRCEDT+L+LNWEKCHFMV+EGIVLGHK+S  G++VD+ K+E I+
Subjt:  LCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

A0A2G9HYD8 Reverse transcriptase5.1e-23856.21Show/hide
Query:  RNNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK
        +NNPYSN YNPGWR HPNFSW    GQG+  + QQ   QP        P Q    SLE  + +FMA       S  A+ + +E Q+GQLAN + +RPQG 
Subjt:  RNNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK

Query:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG
        LP +TE +PR++GK Q +AVTLR+G+ L+E               V++K  +S +    SKE         +VE P     P     PFPQ+ + +  + 
Subjt:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG

Query:  QFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLS
        QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++ET +LTEEC+AI++N LPPK KDPGSFTIP +IG    GRALCDLGA INLMP S
Subjt:  QFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLS

Query:  VYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY
        +YR LG+GEA+PT++TLQLADRS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+
Subjt:  VYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY

Query:  PDEMEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------G
        P+E ++C                                                         S  R   S V++ +I+D      K L  H      G
Subjt:  PDEMEDC---------------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------G

Query:  EHQTL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPV
        E  TL                               ADI+GIS SFCMHKI LE+    S+E +RRLN  MKEVVKKE+IKWLDAGIIYPI+DS+WVSPV
Subjt:  EHQTL-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPV

Query:  QCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRM
        QCVPKKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLPFIDQMLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRM
Subjt:  QCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRM

Query:  PFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        PFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF   L NL  VLKRCEDT+LVLNWEKCHFMV+EGIVLGHK+S  G++VD+ K+E I+
Subjt:  PFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

A0A2G9IA86 DNA-directed DNA polymerase7.4e-22154.22Show/hide
Query:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQ---QKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK
        +NNPYSN YNPGWR HPNFSW    NN Q Q    +  Q G    Q  P Q    SLE  + +FMA       S   +++ +E Q+GQLAN + +RPQG 
Subjt:  RNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQ---QKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAAIQSNQASMRALELQVGQLANELKARPQGK

Query:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG
        L  +TE +PR++GK Q +AVTLR+G+ L+E              VV E     G+     KE         +VE           PL   Q+QK K    
Subjt:  LPLDTE-HPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDG

Query:  QFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLS
        QF KFLE+ K+LHIN P  EA+EQMP+Y KF+K IL+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFTIP +IG    GRALCDLGA INLMP S
Subjt:  QFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGARINLMPLS

Query:  VYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY
        +YR LG+GEA+PT++TLQLA+RS+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKG+LTMRV ++++ FNVFKAMK+
Subjt:  VYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY

Query:  PDEMEDC-----------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQT
        P+E ++C                                                     S  R   S V++ +I++S     K L  H      GE  T
Subjt:  PDEMEDC-----------------------------------------------------SFIRILESTVIETTIQDSAD---KHLEKH------GEHQT

Query:  L-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVP
        L                               ADI+GIS SFCMHKI LE+    S+E +RRLNP MKEVVKKE+IKW+DAGIIYPI+DS+WVSPVQCVP
Subjt:  L-------------------------------ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVP

Query:  KKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGL
        KKGG+TVV N  NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLPFIDQMLDRLAG+ +YCFLDGY           DQEKTTFTCPYGTFAFRR+PFGL
Subjt:  KKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGL

Query:  CNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID
        CNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF   L NL  VLKRCEDT+LVLNW+KCHFMV+EGIVL HK+S  G++V++ K+E I+
Subjt:  CNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVID

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.67.1e-3529.06Show/hide
Query:  KFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMR----------VCNEEVKFNVFKAMKYPDEM--EDCSFIRILESTVIETTIQDSAD
        K +FP     L +   ++  ++LGR  LA  +A I  +  E+T+             +E+  F        PD M  +      ILES +          
Subjt:  KFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMR----------VCNEEVKFNVFKAMKYPDEM--EDCSFIRILESTVIETTIQDSAD

Query:  KHLEKHGEHQTLADIQGIS--QSFCMHKITLEEGSFRSIEQKRRL--------NPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSN
        +HL    + +  A +Q     Q     K+T    +  +I  K  L          A ++ V+ ++   L+ GII   ++S + SP+  VPKK      S 
Subjt:  KHLEKHGEHQTLADIQGIS--QSFCMHKITLEEGSFRSIEQKRRL--------NPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSN

Query:  KDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRC
        K            +R+ +DYR+LN+ T  +  P+P +D++L +L    Y+  +D   G++QI + PE   KT F+  +G + + RMPFGL NAPATFQRC
Subjt:  KDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRC

Query:  MLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVI
        M  I   ++     V++DD  VF  S    L++LG V ++    +L L  +KC F+ +E   LGH ++ +G++ + EK+E I
Subjt:  MLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVI

P10394 Retrovirus-related Pol polyprotein from transposon 4121.9e-3236.16Show/hide
Query:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSG
        E ++ +V K +   I+ P + S + SP+  VPKK              P      WR+ +DYR++NK    + FPLP ID +LD+L    Y+  LD  SG
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSG

Query:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVK
        ++QI +    ++ T+F+   G++ F R+PFGL  AP +FQR M   FS +  S   ++MDD  V G S +  L+NL +V  +C + +L L+ EKC F + 
Subjt:  YNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVK

Query:  EGIVLGHKISKNGLQVDREKVEVI
        E   LGHK +  G+  D +K +VI
Subjt:  EGIVLGHKISKNGLQVDREKVEVI

P20825 Retrovirus-related Pol polyprotein from transposon 2973.1e-3034.23Show/hide
Query:  VKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN
        V+ +V + L+ G+I   ++S + SP   VPKK   +  +NK            +RV +DYR+LN+ T  + +P+P +D++L +L    Y+  +D   G++
Subjt:  VKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYN

Query:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEG
        QI +  E   KT F+   G + + RMPFGL NAPATFQRCM  I   ++     V++DD  +F  S    L ++  V  +  D +L L  +KC F+ KE 
Subjt:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEG

Query:  IVLGHKISKNGLQVDREKVEVI
          LGH ++ +G++ +  KV+ I
Subjt:  IVLGHKISKNGLQVDREKVEVI

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein7.3e-3235.92Show/hide
Query:  ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRR
        ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G                   +R+C+DYR 
Subjt:  ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRR

Query:  LNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSV
        LNKAT  + FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+    V V++DD  +
Subjt:  LNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSV

Query:  FGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKI
        F  S +   ++L  VL+R ++ +L++  +KC F  +E   LG+ I
Subjt:  FGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKI

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.3e-3235.92Show/hide
Query:  ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRR
        ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G                   +R+C+DYR 
Subjt:  ADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRR

Query:  LNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSV
        LNKAT  + FPLP ID +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+    V V++DD  +
Subjt:  LNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSV

Query:  FGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKI
        F  S +   ++L  VL+R ++ +L++  +KC F  +E   LG+ I
Subjt:  FGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTATATCGAGTGGACAAACATATATCTACAGTGAGGGGAGTACAGCTGCAAGCTATAGTGGAGTGTCCAGTATGGATCTTGTGGTTCCACCAGACGGGGTAAAACC
CATAGGGTGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAAC
AGAAGATGAACCAGCCGGGATTTGCTAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAATTCTCTCGAGGCAATGATGAAAGAATTTATGGCTCGTACAGACGCTGCA
ATTCAAAGTAATCAAGCTTCGATGAGAGCCCTAGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTTGGATACTGAACACCC
TCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCATTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAATTATAATATTGTTG
TTATTGAAAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCTTATGTGCCGCCCCCACCT
TATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAAGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGT
AGAAGCTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTG
CTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGATCATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCA
AGGATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGG
TAAAATTGAGGACGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCAT
TTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGAC
GAAATGGAGGATTGCTCCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGACAACAATACAGGATTCGGCTGATAAGCATTTGGAAAAGCATGGAGAGCATCAGACATT
GGCTGATATTCAGGGTATTAGCCAATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGGTCCTTTAGGAGTATTGAGCAAAAAAGAAGGCTTAACCCTGCAATGAAAG
AGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATCTATCCAATTGCTGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTC
ACTGTGGTGAGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCAACCCGAAAGGAACATTT
CCCTCTACCATTTATCGACCAGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGCTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAAGATC
AGGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTCAGGAGGATGCCTTTCGGCCTCTGCAATGCTCCAGCAACATTTCAGCGGTGTATGTTAGCAATTTTT
TCTGATATGATTGAGTCCACTGTTGAGGTCTTTATGGACGATTTTTCAGTTTTTGGAGGGTCTTTTCAAAGTTTTTTAGAAAATTTAGGAAAGGTATTAAAAAGATGTGA
GGATACCCATCTAGTTCTTAATTGGGAGAAGTGCCACTTCATGGTGAAGGAGGGCATAGTGTTAGGACATAAAATTTCTAAAAATGGTCTACAAGTTGATAGAGAAAAAG
TTGAGGTTATTGATGTAGGATCGTATGGCAACACTAGAGGGGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTATATCGAGTGGACAAACATATATCTACAGTGAGGGGAGTACAGCTGCAAGCTATAGTGGAGTGTCCAGTATGGATCTTGTGGTTCCACCAGACGGGGTAAAACC
CATAGGGTGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAAC
AGAAGATGAACCAGCCGGGATTTGCTAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAATTCTCTCGAGGCAATGATGAAAGAATTTATGGCTCGTACAGACGCTGCA
ATTCAAAGTAATCAAGCTTCGATGAGAGCCCTAGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAAAGGCGAGGCCTCAAGGGAAACTTCCTTTGGATACTGAACACCC
TCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTAACTCTTAGGAGTGGTAAGCCATTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAGTAATAATTATAATATTGTTG
TTATTGAAAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAAAGAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCTTATGTGCCGCCCCCACCT
TATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAAGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGT
AGAAGCTATAGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAGTGTAGTG
CTATTCTTAAGAATGGGCTACCCCCCAAGGCTAAGGATCCAGGATCATTTACTATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGTGCA
AGGATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGG
TAAAATTGAGGACGTCTTAGTAAAAGTTGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTCGTCCAT
TTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGAC
GAAATGGAGGATTGCTCCTTCATTAGGATTCTAGAGAGCACAGTTATTGAGACAACAATACAGGATTCGGCTGATAAGCATTTGGAAAAGCATGGAGAGCATCAGACATT
GGCTGATATTCAGGGTATTAGCCAATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGGTCCTTTAGGAGTATTGAGCAAAAAAGAAGGCTTAACCCTGCAATGAAAG
AGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGGATCATCTATCCAATTGCTGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTC
ACTGTGGTGAGCAATAAAGATAATGAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCAACCCGAAAGGAACATTT
CCCTCTACCATTTATCGACCAGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGCTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAAGATC
AGGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTCAGGAGGATGCCTTTCGGCCTCTGCAATGCTCCAGCAACATTTCAGCGGTGTATGTTAGCAATTTTT
TCTGATATGATTGAGTCCACTGTTGAGGTCTTTATGGACGATTTTTCAGTTTTTGGAGGGTCTTTTCAAAGTTTTTTAGAAAATTTAGGAAAGGTATTAAAAAGATGTGA
GGATACCCATCTAGTTCTTAATTGGGAGAAGTGCCACTTCATGGTGAAGGAGGGCATAGTGTTAGGACATAAAATTTCTAAAAATGGTCTACAAGTTGATAGAGAAAAAG
TTGAGGTTATTGATGTAGGATCGTATGGCAACACTAGAGGGGGGTGA
Protein sequenceShow/hide protein sequence
MVISSGQTYIYSEGSTAASYSGVSSMDLVVPPDGVKPIGCNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFANKQALPQQNSGNSLEAMMKEFMARTDAA
IQSNQASMRALELQVGQLANELKARPQGKLPLDTEHPRREGKEQVKAVTLRSGKPLEESRKTQDLNSNNYNIVVIEKELESGQGAGGSKENAGASGSVPDVEPPYVPPPP
YVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGA
RINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPD
EMEDCSFIRILESTVIETTIQDSADKHLEKHGEHQTLADIQGISQSFCMHKITLEEGSFRSIEQKRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGV
TVVSNKDNELIPTRTVTGWRVCMDYRRLNKATRKEHFPLPFIDQMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIF
SDMIESTVEVFMDDFSVFGGSFQSFLENLGKVLKRCEDTHLVLNWEKCHFMVKEGIVLGHKISKNGLQVDREKVEVIDVGSYGNTRGG