; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001082 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001082
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:23891743..23896909
RNA-Seq ExpressionLag0001082
SyntenyLag0001082
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]1.7e-12943.41Show/hide
Query:  EDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGS-NKKVKSVLEVDGVSTIRA
        EDET SEAWERFKE+LRKCPHHGLPHCI+METFYNGLN+AT+ +VDASA GA+L KT+NEA+EILERI++N+CQW+DVR +  +K + VLEVD +S+I A
Subjt:  EDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGS-NKKVKSVLEVDGVSTIRA

Query:  NIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGNQRNNPYSHFYNPCWRNHPNFAGGGQGSNSQVPQAQQ
         +A + N L+N+ +   Q      PV  ++    +  ESCVYCGEEH ++ CP+NPAS+F+VGNQ              N   F         +V    +
Subjt:  NIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGNQRNNPYSHFYNPCWRNHPNFAGGGQGSNSQVPQAQQ

Query:  KMPAIQSNQASMRALEFQMEERKESSKTQDVENSSDKKVEKELESSKGAGGSNNDAGASSSVPDVDPPYVSPPPYVPPPPYVPPLPFPQRQ----RPKNQ
        +  AI  N       +  ++E+   S T  +     K            G +  D G+S ++  +              P    L    R       K +
Subjt:  KMPAIQSNQASMRALEFQMEERKESSKTQDVENSSDKKVEKELESSKGAGGSNNDAGASSSVPDVDPPYVSPPPYVPPPPYVPPLPFPQRQ----RPKNQ

Query:  NISVSTD----------IDQCREDDAPLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPTFGLAPEMDTPTRMSPTWMLWIIASVSNT-RWVVSHSVTR
        +I +  D          +D   + D P+   R  P  +   T  +  +    L +   +  F +   M  P        ++ +     T  W    S   
Subjt:  NISVSTD----------IDQCREDDAPLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPTFGLAPEMDTPTRMSPTWMLWIIASVSNT-RWVVSHSVTR

Query:  IRILENTIVETAMEDFA--NKHLEDHGEISVEDLETPPIKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAI----
            E++     +E  A   +       +  E  ++ P++PS+ EAP LDLKPL  +LKY YLG+ +TLPII+++ L S  E  L++ L++ + AI    
Subjt:  IRILENTIVETAMEDFA--NKHLEDHGEISVEDLETPPIKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAI----

Query:  -----------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRR
                         EEG  + IEQQRRL P MKEVV+KE++KWLDAGIIYPIA+S+ VS +QCVP KGG+TV+ N++NELIPTR V GW++CMDYRR
Subjt:  -----------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRR

Query:  LKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        L KATRKDHFPLPFIDQM DRLAGK++YCFLDGYSGYN ITI PEDQEKTTFTCPYG F+FRRMPFGLCNAPATF
Subjt:  LKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]5.8e-11433.4Show/hide
Query:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------
        R +  YA P  N   P I RP +   NFE+KP                                    GVS DA+RL LF +SL+D  KAW+        
Subjt:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------

Query:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER
                            AKLR +I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+AGG L+ K+  EA+E+LE 
Subjt:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER

Query:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----
        +++N+ QWS+ R   KKV  + +VDG++ + A +  L      +  V+    P +               SC  CG  H    C      V FV N    
Subjt:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----

Query:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQV--PQAQQKMP----AIQSNQASMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK
         Q+NNPYS+ YNP WRNHPNF+    G QGS+S+   P   Q  P    + QS + ++  L     ER E  + + D   SS++ VE +L       +S+
Subjt:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQV--PQAQQKMP----AIQSNQASMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK

Query:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD
        G G                    S    G  S                    +V+    +P P  P  PYVPP+PFPQR +           IDQ  E  
Subjt:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD

Query:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV
          +F+    N P  +  +  P             R   +F  + L  + +      L P++  P   S    +  +          ASVS     VS  +
Subjt:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV

Query:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------
                              + ILEN +++        DF    +E+  EI +                                             
Subjt:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------

Query:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG
                              E LET                                           PP  PS ++AP+L+LKPL  HL Y +LGE 
Subjt:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG

Query:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC
         TLP+IV+  L  E    LI++L+  +KAI                     E+     +E QRRL P MKEVV+ EV+KWLDAGIIYPI+DS W+S VQ 
Subjt:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC

Query:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF
        VP KGG+TVV N++NELIPTRTVTGW+VC+DYR+L   TRKDHFPLPF+DQ+ +RLAG AYYCFLDGYSGYN I+I PEDQEKTTFTCPYGTF+FRRMPF
Subjt:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF

Query:  GLCNAPATF
        GLCNAPATF
Subjt:  GLCNAPATF

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]5.8e-11433.4Show/hide
Query:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------
        R +  YA P  N   P I RP +   NFE+KP                                    GVS DA+RL LF +SL+D  KAW+        
Subjt:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------

Query:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER
                            AKLR +I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+AGG L+ K+  EA+E+LE 
Subjt:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER

Query:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----
        +++N+ QWS+ R   KKV  + +VDG++ + A +  L      +  V+    P +               SC  CG  H    C      V FV N    
Subjt:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----

Query:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQV--PQAQQKMP----AIQSNQASMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK
         Q+NNPYS+ YNP WRNHPNF+    G QGS+S+   P   Q  P    + QS + ++  L     ER E  + + D   SS++ VE +L       +S+
Subjt:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQV--PQAQQKMP----AIQSNQASMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK

Query:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD
        G G                    S    G  S                    +V+    +P P  P  PYVPP+PFPQR +           IDQ  E  
Subjt:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD

Query:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV
          +F+    N P  +  +  P             R   +F  + L  + +      L P++  P   S    +  +          ASVS     VS  +
Subjt:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV

Query:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------
                              + ILEN +++        DF    +E+  EI +                                             
Subjt:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------

Query:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG
                              E LET                                           PP  PS ++AP+L+LKPL  HL Y +LGE 
Subjt:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG

Query:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC
         TLP+IV+  L  E    LI++L+  +KAI                     E+     +E QRRL P MKEVV+ EV+KWLDAGIIYPI+DS W+S VQ 
Subjt:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC

Query:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF
        VP KGG+TVV N++NELIPTRTVTGW+VC+DYR+L   TRKDHFPLPF+DQ+ +RLAG AYYCFLDGYSGYN I+I PEDQEKTTFTCPYGTF+FRRMPF
Subjt:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF

Query:  GLCNAPATF
        GLCNAPATF
Subjt:  GLCNAPATF

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]2.6e-11433.4Show/hide
Query:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------
        R +  YA P  N   P I RP +   NFE+KP                                    GVS DA+RL LF +SL+D  KAW+        
Subjt:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------

Query:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER
                            AKLR +I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+AGG L+ K+  EA+E+LE 
Subjt:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER

Query:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----
        +++N+ QWS+ R   KKV  + +VDG++ + A +  L      +  V+    P +               SC  CG  H    C      V FV N    
Subjt:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----

Query:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQV--PQAQQKMP----AIQSNQASMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK
         Q+NNPYS+ YNP WRNHPNF+    G QGS+S+   P   Q  P    + QS + ++  L     ER E  + + D   SS++ VE +L       +S+
Subjt:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQV--PQAQQKMP----AIQSNQASMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK

Query:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD
        G G                    S    G  S                    +V+    +P P  P  PYVPP+PFPQR +           IDQ  E  
Subjt:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD

Query:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV
          +F+    N P  +  +  P             R   +F  + L  + +      L P++  P   S    +  +          ASVS     VS  +
Subjt:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV

Query:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------
                              + ILEN +++        DF    +E+  EI +                                             
Subjt:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------

Query:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG
                              E LET                                           PP  PS ++AP+L+LKPL  HL Y +LGE 
Subjt:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG

Query:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC
         TLP+IV+  L +E    LI++L+  +KAI                     E+     +E QRRL P MKEVV+ EV+KWLDAGIIYPI+DS W+S VQ 
Subjt:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC

Query:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF
        VP KGG+TVV N++NELIPTRTVTGW+VC+DYR+L   TRKDHFPLPF+DQ+ +RLAG AYYCFLDGYSGYN I+I PEDQEKTTFTCPYGTF+FRRMPF
Subjt:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF

Query:  GLCNAPATF
        GLCNAPATF
Subjt:  GLCNAPATF

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]2.0e-11433.4Show/hide
Query:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------
        R +  YA P  N   P I RP +   NFE+KP                                    GVS DA+RL LF +SL+D  KAW+        
Subjt:  RAIRAYAFPMFNELNPGIERPQIEVANFEMKP------------------------------------GVSRDALRLTLFSYSLRDGIKAWI--------

Query:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER
                            AKLR +I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+AGG L+ K+  EA+E+LE 
Subjt:  --------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILER

Query:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----
        +++N+ QWS+ R   KKV  + +VDG++ + A +  L      +  V+    P +               SC  CG  H      ++   V FV N    
Subjt:  ISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGN----

Query:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQ---VPQAQQKMPAIQSNQA---SMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK
         Q+NNPYS+ YNP WRNHPNF+    G QGS+S+    P  Q K+   +S Q+   ++  L     ER E  + + D   SS++ VE +L       +S+
Subjt:  -QRNNPYSHFYNPCWRNHPNFA---GGGQGSNSQ---VPQAQQKMPAIQSNQA---SMRALEFQMEERKESSKTQ-DVENSSDKKVEKELE------SSK

Query:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD
        G G                    S    G  SS                   +V+    +P P  P  PYVPP+PFPQR +           IDQ  E  
Subjt:  GAGG-------------------SNNDAGASSS-----------------VPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD

Query:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV
          +F+    N P  +  +  P             R   +F  + L  + +      L P++  P   S    +  +          ASVS     VS  +
Subjt:  APLFKPR--NQPLREHTSTYP------------NRGRSEFHLVLLCSQPTF----GLAPEMDTPTRMSPTWMLWII----------ASVSNTRWVVSHSV

Query:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------
                              + ILEN +++        DF    +E+  EI +                                             
Subjt:  --------------------TRIRILENTIVETAM----EDFANKHLEDHGEISV---------------------------------------------

Query:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG
                              E LET                                           PP  PS ++AP+L+LKPL  HL Y +LGE 
Subjt:  ----------------------EDLET-------------------------------------------PPIKPSLIEAPILDLKPLTDHLKYVYLGEG

Query:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC
         TLP+IV+  L  E    LI++L+  +KAI                     E+     +E QRRL P MKEVV+ EV+KWLDAGIIYPI+DS W+S VQ 
Subjt:  ETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQC

Query:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF
        VP KGG+TVV N++NELIPTRTVTGW+VC+DYR+L   TRKDHFPLPF+DQ+ +RLAG AYYCFLDGYSGYN I+I PEDQEKTTFTCPYGTF+FRRMPF
Subjt:  VPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPF

Query:  GLCNAPATF
        GLCNAPATF
Subjt:  GLCNAPATF

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase2.7e-10933.63Show/hide
Query:  DPPGVRFELDPEIERTFRRRRRE--QRRNQNPMDKPNPQLEQNGQKNNQAENPILVAKDRARAIRAYAFPMFNELNPGI---------------------
        +P   R++L  EIERTF+ RRR+  +   Q    K N  +       N     + + + R R     +  +F EL  G+                     
Subjt:  DPPGVRFELDPEIERTFRRRRRE--QRRNQNPMDKPNPQLEQNGQKNNQAENPILVAKDRARAIRAYAFPMFNELNPGI---------------------

Query:  ERPQIEVANF------EMKPGVSRDALRLTLFSYSL------------RDGIKAW----------------IAKLRTEIVGFRQLEDETFSEAWERFKEL
        E P   + NF        + GVS+DALRL LFS+SL             D I  W                IA LR EI+ FRQ   ET  EAW RF+++
Subjt:  ERPQIEVANF------EMKPGVSRDALRLTLFSYSL------------RDGIKAW----------------IAKLRTEIVGFRQLEDETFSEAWERFKEL

Query:  LRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVS
        LR CP+H +P  I++ TFY+GL    +  +D   G + L  T  E H +L  +  N  +    R +  K   V+EVD V+ + A I  L  ++KN  V  
Subjt:  LRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVS

Query:  HQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGNQR---NNPYSHFYNPCWRNHPNFA-----GGG------QGSNSQVPQ-AQQK
         Q  P                 +C  CGE H  + CP++  S+ FV N R   NNPYS+ YNP WR HPNF+     G G      QG   QV Q  Q+K
Subjt:  HQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGNQR---NNPYSHFYNPCWRNHPNFA-----GGG------QGSNSQVPQ-AQQK

Query:  MPAIQ--------SNQASMRALEFQMEE---------------------RKES------------SKTQDVENSSDKKVEKELESSKGAGGSNNDAGASS
         P+++        S  A+ + +E Q+ +                     R++S            ++ Q+V     K  EKE+ S +   G   +A    
Subjt:  MPAIQ--------SNQASMRALEFQMEE---------------------RKES------------SKTQDVENSSDKKVEKELESSKGAGGSNNDAGASS

Query:  SVPDVDPPYVS--------------------------------PPPYVPPPPYVPP---------------------LPFP-------QRQRPKN-----
           +  P YV                                 PP    P  +  P                     +P+           +P +     
Subjt:  SVPDVDPPYVS--------------------------------PPPYVPPPPYVPP---------------------LPFP-------QRQRPKN-----

Query:  ------------QNISVSTD----------IDQCREDDAPLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPTFGLAPEMDTPTRMSPTWMLWIIASVS
                    ++I V  D          +D   + + P+   R  P      T  +  + E  + +   Q TF +   M  P      + + +   ++
Subjt:  ------------QNISVSTD----------IDQCREDDAPLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPTFGLAPEMDTPTRMSPTWMLWIIASVS

Query:  NTRWVVSHSVTRI-RILENTIVETAMEDF-------ANKHLEDHGEISVEDLETPP---IKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMS
            +    +  + R L + + E   ED        A+K+ +  G  S+E   T P   +KPS+ E P L+LKPL  HL Y YLGE +TLP+I++S L  
Subjt:  NTRWVVSHSVTRI-RILENTIVETAMEDF-------ANKHLEDHGEISVEDLETPP---IKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMS

Query:  EDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNK
           + L+++L+  + AI                     E+G    +E QRRL P MKEVVKKE+IKWLDAGIIYPI+DS+WVS VQCVP KGG+TVV N 
Subjt:  EDEKALIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNK

Query:  DNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
         NELIPTRTVTGW+VCMDYR+L KATRKDHFPLPFIDQM DRLAGK +YCFLDGYSGYN I I PEDQEKTTFTCPYGTF+FR+MPFGLCNAPATF
Subjt:  DNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

A0A2G9HWF8 Reverse transcriptase1.5e-10734.06Show/hide
Query:  DPEIERTFRRRRREQRRNQNPMDKPNPQLEQNG---QKNNQAENPIL-VAKDRARAIRAYAFPMFNELNPGI---------------------ERPQIEV
        DPEIERTFR RRR+   +     +   ++E+N      +N    P+  +A  RAR  ++    +F EL  G+                     E P   +
Subjt:  DPEIERTFRRRRREQRRNQNPMDKPNPQLEQNG---QKNNQAENPIL-VAKDRARAIRAYAFPMFNELNPGI---------------------ERPQIEV

Query:  ANF------EMKPGVSRDALRLTLFSYSL------------RDGIKAWI----------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHH
         NF        + GVS+DALRL LFS+SL             D I  W+                A LR EI+ FRQ   ET  EAW RF+++LR CP+H
Subjt:  ANF------EMKPGVSRDALRLTLFSYSL------------RDGIKAWI----------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHH

Query:  GLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAV
         +P  I++ TFY+GL    +  +D   G + L  T  E H +L  +  N  +    R +  K   V+EVD V+ + A I  L  ++KN            
Subjt:  GLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAV

Query:  EPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGNQR---NNPYSHFYNPCWRNHPNFA---GGGQGSNSQVPQAQQ---KMPAIQSNQASMRA
                           CGE H  + CP++  S+ FV N R   NNPYS+ YNP WR HPNF+     GQGS  +  Q  Q   K    Q  +A  +A
Subjt:  EPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGNQR---NNPYSHFYNPCWRNHPNFA---GGGQGSNSQVPQAQQ---KMPAIQSNQASMRA

Query:  LEFQ-----MEERKESSKTQDVENSSDKKVEKELE--------------------------------------------------------------SSK
        +  +      E  KE +K+++ E +S++K EKE+E                                                              S K
Subjt:  LEFQ-----MEERKESSKTQDVENSSDKKVEKELE--------------------------------------------------------------SSK

Query:  GAGGSNNDAG----ASSSVPDVDPPYVSPP---------------PY--------VPPPPYVPPLPFPQRQ--RPKN--QNISVSTD----------IDQ
           G           S+ + +  PP +  P               PY        V   P    L    R    PK   ++I V  D          +D 
Subjt:  GAGGSNNDAG----ASSSVPDVDPPYVSPP---------------PY--------VPPPPYVPPLPFPQRQ--RPKN--QNISVSTD----------IDQ

Query:  CREDDAPLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPTFGLAPEMDTPTRMSPTWMLWIIASVSNTRWVVSHSVTRI-RILENTIVETAMED-----
          + + P+   R  P      T  +  + E  + +   Q TF +   M  P      + + +  +++  + +    +  + R L + + E   ED     
Subjt:  CREDDAPLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPTFGLAPEMDTPTRMSPTWMLWIIASVSNTRWVVSHSVTRI-RILENTIVETAMED-----

Query:  --FANKHLEDHGEISVEDLETPP---IKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAI----------------
           A+K+ +  G  S+E   T P   +KPS+ E P L+LKPL  HL Y YLGE +TLP+I++S L     + L+++L+  + AI                
Subjt:  --FANKHLEDHGEISVEDLETPP---IKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAI----------------

Query:  -----EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPL
             E+     +E QRRL P MKEVVKKE+IKWLDAGIIYPI+D +W+S VQCVP KGG+TVV N  NE IPT+TVTGW+VCMDYR+L KATRKDHFPL
Subjt:  -----EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPL

Query:  PFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        PFIDQM DRLAGK +YCFLDGYSGYN I I PEDQEKTTFTCPYGTF+FRR+PF LCNAPATF
Subjt:  PFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

A0A5B6VWJ0 Retroelement pol polyprotein-like1.1e-11038.62Show/hide
Query:  TLFSYSLRDGIKAWI----------------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSM
        TL  +SLRD  +AW+                            AKLR EI  F  ++DE+  EAWERFKELL+KCPHHG+PHCI++ETFYNGL   T+ +
Subjt:  TLFSYSLRDGIKAWI----------------------------AKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSM

Query:  VDASAGGALLEKTFNEAHEILERISTNSCQWSDVR-GSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCG
        VDASA GALL K++NEA+EI+ERI++N+ QW   R  S ++V  + EVD ++++ + ++ +++  KN+T        A  P    +        + VYCG
Subjt:  VDASAGGALLEKTFNEAHEILERISTNSCQWSDVR-GSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCG

Query:  EEHNYEFCPNNPASVFFVGNQRNNP-----YSHFYNPCWRNHPNFAGGGQGSN----------SQVPQAQQKMPAIQSNQASMRALEFQMEERKESSKTQ
        E H  E CP+NP SV+++GNQ  N       S+FYN  WRNH +F+   QG+           +Q+P   Q++  +   +AS  +LE  +   K      
Subjt:  EEHNYEFCPNNPASVFFVGNQRNNP-----YSHFYNPCWRNHPNFAGGGQGSN----------SQVPQAQQKMPAIQSNQASMRALEFQMEERKESSKTQ

Query:  DVENSSDKKVEKELESSKGAGGSNNDAGASSSVP-DVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDDAPLFKPR-NQPLR-EHT
        D    S     K LE+  G   +        ++P D + P      +              R     +  +V  + +Q    DA   +P    P+  E  
Subjt:  DVENSSDKKVEKELESSKGAGGSNNDAGASSSVP-DVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDDAPLFKPR-NQPLR-EHT

Query:  STYPNRGRSEFHLVLLCSQPTFGLAPEM------------DTPTRMSPTWMLWIIASVSNT--RWVVSHSVTRIRILENTIVETAMEDFANKHLEDHGEI
        ST P++  S     +   QPT  L  E+              P ++       I  ++  T   + V   V  I           M D     L    ++
Subjt:  STYPNRGRSEFHLVLLCSQPTFGLAPEM------------DTPTRMSPTWMLWIIASVSNT--RWVVSHSVTRIRILENTIVETAMEDFANKHLEDHGEI

Query:  SVEDLETPPIKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAIEEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAG
        + +   + P K S+ E P L+LK +           G T+           D + +   +   +  +E+G    I+ QRRL P MK+VVKKE+ KWLDAG
Subjt:  SVEDLETPPIKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAIEEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAG

Query:  IIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKT
        IIYPI DS+W+S VQCVP KGG+TVV NK NELIPTRTVTGW++C+DY +L KATRKDHFPLPF+DQM DRLAG+ YY FLDGYSGYNHIT+ P+DQ K 
Subjt:  IIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKT

Query:  TFTCPYGTFSFRRMPFGLCNAPATF
        TFTCPYGTF+FRRMPFGLCNAPATF
Subjt:  TFTCPYGTFSFRRMPFGLCNAPATF

A0A6A3BRM8 Reverse transcriptase4.9e-11133.3Show/hide
Query:  LDPEIERTFRRRRREQRRNQNPMDKPNPQLEQ-NGQKNNQAENPILVAKDRARAIRAYAFPMFNELNPGI------------ERPQIEVANFEMKPGVSR
        ++PEIER FRRRRR QR   N     N  + Q +G +N +A N +  A  R   IR +  PM ++LNPGI             R  +EV +   + GV  
Subjt:  LDPEIERTFRRRRREQRRNQNPMDKPNPQLEQ-NGQKNNQAENPILVAKDRARAIRAYAFPMFNELNPGI------------ERPQIEVANFEMKPGVSR

Query:  DALRLTLFSYSLRDGIKAWIA----------------------------KLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNM
        D L+L LF YSLRD  +AW++                            +LR EI  FRQ +DE+  E W+R+K LLRKC +HG     ++  FYNG+N 
Subjt:  DALRLTLFSYSLRDGIKAWIA----------------------------KLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNM

Query:  ATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVR-GSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEES
         T+ ++DASA G LL+K+  EA  IL+RI+ N  Q+   R GS ++     E++   ++   ++ + N LKN+                    Q ST+ +
Subjt:  ATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVR-GSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEES

Query:  CVYCGEEHNYEFCPNNPASVFFVGNQRNNPYSHFYNPCWRNHPNFAGGGQGS----------NSQVPQA-QQKMPA------------------------
                                   NNPYS+ YN  WR HPNF+ G QG+          N   PQ+ Q  MP                         
Subjt:  CVYCGEEHNYEFCPNNPASVFFVGNQRNNPYSHFYNPCWRNHPNFAGGGQGS----------NSQVPQA-QQKMPA------------------------

Query:  ---------------------------IQSNQASMRALEFQ-------MEERKESSKTQDVENS------SDKKVEKELESSKGAGGSNNDAGASSSVPD
                                   +QS+  S+RALE Q       ++ER++     D E+S      S+   E EL+  + +     +   S+ VPD
Subjt:  ---------------------------IQSNQASMRALEFQ-------MEERKESSKTQDVENS------SDKKVEKELESSKGAGGSNNDAGASSSVPD

Query:  VDPPYVSPPPYVPPPPYV-PPLPFPQRQRPKNQNISVSTDIDQCRE-----------DDAPLF----------KPRNQPLREHT------STYPNRGRSE
         D    +    +P      PP PFPQR +  N  +     +D   +           +  P++          K + + +   T      S  P +    
Subjt:  VDPPYVSPPPYVPPPPYV-PPLPFPQRQRPKNQNISVSTDIDQCRE-----------DDAPLF----------KPRNQPLREHT------STYPNRGRSE

Query:  FHLVLLCS--QPTFGLA-PEMDTPTRM--------------SPTWMLWIIASVSNTR--WVVSHSVTRI-------------------------------
           ++ CS     FG A  ++ +   +               PT ++  +A  S+ +    V   + R+                               
Subjt:  FHLVLLCS--QPTFGLA-PEMDTPTRM--------------SPTWMLWIIASVSNTR--WVVSHSVTRI-------------------------------

Query:  -RIL---------------ENTIVETAMEDFANKHLEDHGEISVED---LETPPIKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKA
         RIL               +N  ++ A  ++    ++D   +  +D   LE     PSL+ AP L+LK L  HLKYVYLG  ETLP+I++++L +  E++
Subjt:  -RIL---------------ENTIVETAMEDFANKHLEDHGEISVED---LETPPIKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKA

Query:  LIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELI
        L+ +L Q +KAI                     E+     IE +RRL P MK+VV KE++KWLDAG+IYPI++S+WVS VQC+P K G TVVTN+DNEL+
Subjt:  LIKLLQQFRKAI---------------------EEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELI

Query:  PTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        PTRTVTGW++CMDYR++ KAT+KDHFPLPFIDQM DRLAGKA+YCFLDGYSGYN I I PEDQE TTFTCPYGTF+FRRMPFGLCNAPATF
Subjt:  PTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

A0A6P8CDK6 Reverse transcriptase7.6e-11234.98Show/hide
Query:  LDPEIERTFRRRRREQRRNQNPMDKPNPQLEQNGQKNNQAENPILVAKDRARAIRAYAFPMFNELNPGIERPQIEVANFEMKP-----------------
        LDPEIERT  R RRE RR +        +L+     ++     I   +  ARA+R YA P    +   I RP I   NFE+KP                 
Subjt:  LDPEIERTFRRRRREQRRNQNPMDKPNPQLEQNGQKNNQAENPILVAKDRARAIRAYAFPMFNELNPGIERPQIEVANFEMKP-----------------

Query:  -------------------GVSRDALRLTLFSYSLRDGIKAWI----------------------------AKLRTEIVGFRQLEDETFSEAWERFKELL
                            V+ D +RL LF +SLRD  +AW                             A+LR EI  F +   E+  EAWERFKE +
Subjt:  -------------------GVSRDALRLTLFSYSLRDGIKAWI----------------------------AKLRTEIVGFRQLEDETFSEAWERFKELL

Query:  RKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSH
        RKCPHHGLP  + +E FY  L+   +S+VDA+AGGAL+ K ++EA  ++E +++++  W + R S  +V SV ++D ++ +   I+ L   +        
Subjt:  RKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGSNKKVKSVLEVDGVSTIRANIAMLANTLKNVTVVSH

Query:  QQPPAVEPVVISSRQQWSTEESCVYCGEEHN-YEFCPNNPAS------VFFVGN-QRNN--PYSHFYNPCWRNHPNFA---------------GGGQGSN
                              C  C   H+  E    NP++      V FV N QR+N  PYS+ YNP WRNHPNF+                 G   N
Subjt:  QQPPAVEPVVISSRQQWSTEESCVYCGEEHN-YEFCPNNPAS------VFFVGN-QRNN--PYSHFYNPCWRNHPNFA---------------GGGQGSN

Query:  SQVPQAQQKM------------PAIQSNQASMRALEFQMEE-------RKESSKTQDVENSSDKKVEKELESSKGAGGSNNDAGASSSVPDVDPP--YVS
        +   Q+Q +M              +Q+ QA++R LE Q+ +       R   S   + E +        L S K     N  A      P+ D     V 
Subjt:  SQVPQAQQKM------------PAIQSNQASMRALEFQMEE-------RKESSKTQDVENSSDKKVEKELESSKGAGGSNNDAGASSSVPDVDPP--YVS

Query:  PP--PYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDDAPLFKPR--NQPLREHTSTYPNRGRSEFHLVLLCSQPTF-GLAPEMDTPTRM-----S
         P    +   PYVPP+PFP R + +  +   +  +D        +FK    N P  E     P+  R  F   LL  +  F G  P M T  ++      
Subjt:  PP--PYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDDAPLFKPR--NQPLREHTSTYPNRGRSEFHLVLLCSQPTF-GLAPEMDTPTRM-----S

Query:  PTWMLWIIA--SVSNTRWVVSHSVTRI---------------------RILENTIVETA-------------------MEDFANKHLEDHGEISV-----
         T +   +A  S+   + +V + + ++                      IL    + T                    ++D+++   ++H E SV     
Subjt:  PTWMLWIIA--SVSNTRWVVSHSVTRI---------------------RILENTIVETA-------------------MEDFANKHLEDHGEISV-----

Query:  ------EDLETPPIKP--SLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFR
              E+L T   KP  SL ++P+L+LKPL  HLKY YLG  +TLPII++S L  + E+ L+ +L++ ++AI                     E     
Subjt:  ------EDLETPPIKP--SLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAI---------------------EEGSFR

Query:  RIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLA
         ++ QRRL P +KEVVKKEV+K LDAGIIYPI+DS WVS VQ VP KGG+TVV N+ N+LIPTRTVTGW+VC+DYR+L  ATRKDHFPLPFIDQM ++LA
Subjt:  RIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLA

Query:  GKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        G  YYCFLDGYSGYN I I PEDQEKTTFTCPYGTF+FRRMPFGLCNAPATF
Subjt:  GKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.7e-1330.99Show/hide
Query:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDG
        A ++ V+ ++   L+ GII          R    P    + VV  K +     +    +++ +DYR+L + T  D  P+P +D++  +L    Y+  +D 
Subjt:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDG

Query:  YSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
          G++ I +DPE   KT F+  +G + + RMPFGL NAPATF
Subjt:  YSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

P10394 Retrovirus-related Pol polyprotein from transposon 4127.8e-1331.65Show/hide
Query:  EVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSG
        E ++ +V K +   I+ P + S + S +  VP K              P      W++ +DYR++ K    D FPLP ID + D+L    Y+  LD  SG
Subjt:  EVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSG

Query:  YNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        ++ I +D   ++ T+F+   G++ F R+PFGL  AP +F
Subjt:  YNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

P31843 RNA-directed DNA polymerase homolog7.5e-1649.4Show/hide
Query:  KVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        ++C+DYR L K T K+ +P+P +D +FDRLA   ++  LD  SGY  + I   D+ KTT    YG+F FR MPFGL NA ATF
Subjt:  KVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.0e-1837.86Show/hide
Query:  KEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYS
        ++ + K V K LD   I P + S   S V  VP K G                   +++C+DYR L KAT  D FPLP ID +  R+     +  LD +S
Subjt:  KEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYS

Query:  GYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        GY+ I ++P+D+ KT F  P G + +  MPFGL NAP+TF
Subjt:  GYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.0e-1837.86Show/hide
Query:  KEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYS
        ++ + K V K LD   I P + S   S V  VP K G                   +++C+DYR L KAT  D FPLP ID +  R+     +  LD +S
Subjt:  KEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPMKGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYS

Query:  GYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF
        GY+ I ++P+D+ KT F  P G + +  MPFGL NAP+TF
Subjt:  GYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATTGAAAGGACATTCAGGAGAAGAAGGAGAGAGCAGCGTAGAAATCAAAATCCAATGGATAAGCCTAA
TCCACAGCTGGAGCAAAATGGACAGAAAAATAATCAGGCTGAGAATCCTATCCTAGTGGCGAAAGATAGGGCCAGAGCCATTCGAGCGTATGCTTTTCCGATGTTTAATG
AATTAAATCCAGGAATTGAACGTCCTCAAATTGAGGTAGCAAATTTTGAAATGAAACCGGGAGTGTCTAGAGATGCCCTTAGATTAACTTTGTTCTCGTATTCTCTTAGA
GATGGAATAAAGGCGTGGATTGCCAAATTGAGGACTGAGATAGTGGGATTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCG
AAAGTGTCCCCACCATGGTTTACCTCATTGTATAGAAATGGAAACATTTTACAATGGGTTAAACATGGCAACCCAAAGTATGGTCGATGCCTCGGCTGGAGGGGCCCTTT
TGGAAAAAACCTTTAATGAAGCCCATGAAATTTTAGAGAGAATATCGACCAATAGTTGTCAGTGGTCGGATGTTAGAGGCTCAAATAAAAAAGTTAAGAGTGTGTTAGAG
GTTGATGGTGTGTCTACCATTAGGGCTAATATTGCAATGTTAGCTAACACTCTTAAAAATGTGACAGTGGTTAGTCATCAGCAACCGCCAGCTGTGGAGCCTGTAGTCAT
CAGCAGCCGACAGCAGTGGAGTACAGAGGAATCTTGTGTTTATTGTGGTGAAGAGCACAACTACGAGTTCTGCCCTAACAATCCAGCTTCTGTGTTTTTTGTAGGGAATC
AAAGGAATAACCCTTATTCTCACTTTTATAATCCATGTTGGCGCAACCATCCCAACTTTGCAGGGGGAGGGCAAGGAAGCAATTCACAAGTCCCTCAAGCGCAGCAAAAG
ATGCCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTTCAAATGGAAGAGAGAAAAGAGTCCAGTAAGACCCAGGATGTTGAGAATAGTAGTGATAAAAA
AGTTGAAAAAGAGTTGGAGTCTAGTAAAGGTGCTGGAGGCAGCAACAATGATGCTGGAGCATCTAGTTCTGTTCCAGATGTGGATCCACCTTATGTGTCGCCCCCACCTT
ATGTACCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAGAATATTTCAGTGTCCACGGATATCGACCAATGCAGAGAGGACGAC
GCGCCTTTGTTCAAGCCCCGAAATCAGCCCTTAAGGGAACACACATCTACTTACCCCAATAGGGGAAGGAGTGAATTCCATCTTGTACTGTTATGTTCCCAGCCCACATT
CGGTCTTGCCCCTGAAATGGATACCCCCACTCGCATGTCTCCTACATGGATGCTTTGGATCATTGCATCTGTATCGAATACAAGGTGGGTCGTATCACATAGTGTCACCA
GGATAAGGATTCTGGAGAACACAATTGTTGAGACAGCAATGGAGGATTTTGCAAACAAGCATTTGGAAGATCATGGAGAGATTAGTGTAGAGGATTTAGAAACTCCTCCT
ATTAAGCCATCCCTGATTGAGGCACCCATTTTAGATTTGAAGCCCTTAACGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGACGTTGCCCATTATTGTTGCATC
AGATTTAATGTCGGAGGATGAAAAGGCATTAATAAAGTTGCTACAGCAATTTCGCAAAGCAATTGAGGAGGGATCCTTTAGGAGAATTGAGCAACAGAGAAGGCTTAAGC
CTGCCATGAAAGAGGTTGTTAAGAAGGAGGTGATCAAATGGTTGGATGCTGGGATTATCTATCCAATTGCAGATAGCAATTGGGTAAGTCGTGTCCAATGTGTTCCTATG
AAAGGAGGTGTAACGGTAGTGACTAATAAAGACAATGAGTTGATCCCAACCAGAACAGTAACTGGTTGGAAGGTTTGCATGGATTACAGGAGGCTTAAAAAAGCCACCCG
TAAGGACCACTTCCCTCTACCATTTATTGACCAGATGTTTGATAGATTGGCTGGTAAGGCCTACTACTGTTTCTTAGATGGTTACTCTGGGTATAACCACATTACCATTG
ATCCTGAGGATCAGGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTTCCTTTAGGCGAATGCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATTGAAAGGACATTCAGGAGAAGAAGGAGAGAGCAGCGTAGAAATCAAAATCCAATGGATAAGCCTAA
TCCACAGCTGGAGCAAAATGGACAGAAAAATAATCAGGCTGAGAATCCTATCCTAGTGGCGAAAGATAGGGCCAGAGCCATTCGAGCGTATGCTTTTCCGATGTTTAATG
AATTAAATCCAGGAATTGAACGTCCTCAAATTGAGGTAGCAAATTTTGAAATGAAACCGGGAGTGTCTAGAGATGCCCTTAGATTAACTTTGTTCTCGTATTCTCTTAGA
GATGGAATAAAGGCGTGGATTGCCAAATTGAGGACTGAGATAGTGGGATTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCG
AAAGTGTCCCCACCATGGTTTACCTCATTGTATAGAAATGGAAACATTTTACAATGGGTTAAACATGGCAACCCAAAGTATGGTCGATGCCTCGGCTGGAGGGGCCCTTT
TGGAAAAAACCTTTAATGAAGCCCATGAAATTTTAGAGAGAATATCGACCAATAGTTGTCAGTGGTCGGATGTTAGAGGCTCAAATAAAAAAGTTAAGAGTGTGTTAGAG
GTTGATGGTGTGTCTACCATTAGGGCTAATATTGCAATGTTAGCTAACACTCTTAAAAATGTGACAGTGGTTAGTCATCAGCAACCGCCAGCTGTGGAGCCTGTAGTCAT
CAGCAGCCGACAGCAGTGGAGTACAGAGGAATCTTGTGTTTATTGTGGTGAAGAGCACAACTACGAGTTCTGCCCTAACAATCCAGCTTCTGTGTTTTTTGTAGGGAATC
AAAGGAATAACCCTTATTCTCACTTTTATAATCCATGTTGGCGCAACCATCCCAACTTTGCAGGGGGAGGGCAAGGAAGCAATTCACAAGTCCCTCAAGCGCAGCAAAAG
ATGCCCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTTCAAATGGAAGAGAGAAAAGAGTCCAGTAAGACCCAGGATGTTGAGAATAGTAGTGATAAAAA
AGTTGAAAAAGAGTTGGAGTCTAGTAAAGGTGCTGGAGGCAGCAACAATGATGCTGGAGCATCTAGTTCTGTTCCAGATGTGGATCCACCTTATGTGTCGCCCCCACCTT
ATGTACCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAGAATATTTCAGTGTCCACGGATATCGACCAATGCAGAGAGGACGAC
GCGCCTTTGTTCAAGCCCCGAAATCAGCCCTTAAGGGAACACACATCTACTTACCCCAATAGGGGAAGGAGTGAATTCCATCTTGTACTGTTATGTTCCCAGCCCACATT
CGGTCTTGCCCCTGAAATGGATACCCCCACTCGCATGTCTCCTACATGGATGCTTTGGATCATTGCATCTGTATCGAATACAAGGTGGGTCGTATCACATAGTGTCACCA
GGATAAGGATTCTGGAGAACACAATTGTTGAGACAGCAATGGAGGATTTTGCAAACAAGCATTTGGAAGATCATGGAGAGATTAGTGTAGAGGATTTAGAAACTCCTCCT
ATTAAGCCATCCCTGATTGAGGCACCCATTTTAGATTTGAAGCCCTTAACGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGACGTTGCCCATTATTGTTGCATC
AGATTTAATGTCGGAGGATGAAAAGGCATTAATAAAGTTGCTACAGCAATTTCGCAAAGCAATTGAGGAGGGATCCTTTAGGAGAATTGAGCAACAGAGAAGGCTTAAGC
CTGCCATGAAAGAGGTTGTTAAGAAGGAGGTGATCAAATGGTTGGATGCTGGGATTATCTATCCAATTGCAGATAGCAATTGGGTAAGTCGTGTCCAATGTGTTCCTATG
AAAGGAGGTGTAACGGTAGTGACTAATAAAGACAATGAGTTGATCCCAACCAGAACAGTAACTGGTTGGAAGGTTTGCATGGATTACAGGAGGCTTAAAAAAGCCACCCG
TAAGGACCACTTCCCTCTACCATTTATTGACCAGATGTTTGATAGATTGGCTGGTAAGGCCTACTACTGTTTCTTAGATGGTTACTCTGGGTATAACCACATTACCATTG
ATCCTGAGGATCAGGAAAAAACCACTTTCACCTGCCCTTATGGGACGTTTTCCTTTAGGCGAATGCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTTAG
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDPEIERTFRRRRREQRRNQNPMDKPNPQLEQNGQKNNQAENPILVAKDRARAIRAYAFPMFNELNPGIERPQIEVANFEMKPGVSRDALRLTLFSYSLR
DGIKAWIAKLRTEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIEMETFYNGLNMATQSMVDASAGGALLEKTFNEAHEILERISTNSCQWSDVRGSNKKVKSVLE
VDGVSTIRANIAMLANTLKNVTVVSHQQPPAVEPVVISSRQQWSTEESCVYCGEEHNYEFCPNNPASVFFVGNQRNNPYSHFYNPCWRNHPNFAGGGQGSNSQVPQAQQK
MPAIQSNQASMRALEFQMEERKESSKTQDVENSSDKKVEKELESSKGAGGSNNDAGASSSVPDVDPPYVSPPPYVPPPPYVPPLPFPQRQRPKNQNISVSTDIDQCREDD
APLFKPRNQPLREHTSTYPNRGRSEFHLVLLCSQPTFGLAPEMDTPTRMSPTWMLWIIASVSNTRWVVSHSVTRIRILENTIVETAMEDFANKHLEDHGEISVEDLETPP
IKPSLIEAPILDLKPLTDHLKYVYLGEGETLPIIVASDLMSEDEKALIKLLQQFRKAIEEGSFRRIEQQRRLKPAMKEVVKKEVIKWLDAGIIYPIADSNWVSRVQCVPM
KGGVTVVTNKDNELIPTRTVTGWKVCMDYRRLKKATRKDHFPLPFIDQMFDRLAGKAYYCFLDGYSGYNHITIDPEDQEKTTFTCPYGTFSFRRMPFGLCNAPATF