; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036595 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036595
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:49004847..49010376
RNA-Seq ExpressionLag0036595
SyntenyLag0036595
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR021109 - Aspartic peptidase domain superfamily
IPR038765 - Papain-like cysteine peptidase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN00904.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.9e-14742.69Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN
        + TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN  V   Q  P         
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN

Query:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQAQQKVNQSGFAKMQ----MLQFKVIKLDESPGI
              C  CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNF+W    GQGS+ + QQ+  Q     MQ     L+  +I+   S   
Subjt:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQAQQKVNQSGFAKMQ----MLQFKVIKLDESPGI

Query:  ASKPLE----------------------EPRKTQDL---------------NKHGDKDVV---VEKELE------------------------FGQGAGG
          K +E                      EP   QD+                K  +K+V+    EKE+E                        F +    
Subjt:  ASKPLE----------------------EPRKTQDL---------------NKHGDKDVV---VEKELE------------------------FGQGAGG

Query:  SKKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVK
         K+ +G    V   E         +PP L  P         G    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TY +G I+D+LVK
Subjt:  SKKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVK

Query:  VDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------AMKYPDEMEDCSFIRILES------------TVIETAIQD-------------------
        VDKFIFP DF++LD E D +VPIILG PFLAT         AMK+P+E ++C  + + ++              +E A+ D                   
Subjt:  VDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------AMKYPDEMEDCSFIRILES------------TVIETAIQD-------------------

Query:  --------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEE
                 + +R AP   +KPS+ E PTL+LK L  HL Y YLGE +TLP+I+ S L     E L+++L+ ++  IGWT+ADI+GISPSFCMHKI LE+
Subjt:  --------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEE

Query:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQML
            SIE QRRLNP MKEVVKKE+IKWLDAGI+YPI                                TVTGWRVCMDYR+LNKATRKDH+PLPFIDQML
Subjt:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQML

Query:  NRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        +RLAG+ +YCFLDGYSGYNQI I PEDQEKTTFTCPYGTFAFRRM FGLCNAPATFQRCM+AIF+DM+E+ +E
Subjt:  NRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]8.0e-14641.77Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN
        + TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN  V   Q  P         
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN

Query:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI
              C  CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNF+W    GQG + +     QQ+V Q    K   L+  +I+   S   
Subjt:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI

Query:  ASKPLE----------------------EPRKTQDL----------------------NKHGDKDVVVE---KELE----------------FGQGAGGS
          K +E                      EP   QD                        K  +K+V+ E   KE+E                F +     
Subjt:  ASKPLE----------------------EPRKTQDL----------------------NKHGDKDVVVE---KELE----------------FGQGAGGS

Query:  KKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKV
        K+ +G   +V   E         +PP L  P         G    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TY  G IED+LVKV
Subjt:  KKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKV

Query:  DKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQD--
        DKFIFP DF++LD E D +VPIILG PFLAT                           AMK+P+E ++C  + + +               +E A+ D  
Subjt:  DKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQD--

Query:  -------------------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLAD
                                  + +R AP   +KPS+ E PTL+LK L  HL Y YLGE +TLP+I+ S L     E L+++L+ ++ AIGWT+AD
Subjt:  -------------------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLAD

Query:  IQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLN
        I+GISPSFCMHKI LE+G   S+E QRRLNP MKEVVKKE+IKWLDAGI+YPI                                TVTGWRVCMDYR+LN
Subjt:  IQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLN

Query:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        KATRKDH+PLPFIDQML+RLAG+ +YCFLDGYSGYNQI IVPEDQEKTTFTCPYGTF FR+M FGLCNAPATFQRCM+AIF+DM+E+ +E
Subjt:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]5.1e-16948.31Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPA-IEPAAV
        METFYNGLN  T+ +VDASA GA+L+KT++EAYEILERI++N+CQW+DVR    +K R VLEVD +S+I   LA + N L+N+ +       A +  AAV
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPA-IEPAAV

Query:  VNQVAEEACVYCGENHNYEFCPNNPASVFFVGNQRNNPYSNFYNPGWRNHPNFAWGGQGSSAQAQQKVNQSGFAKMQMLQFKVIKLDESPGIASKPLEEP
        +NQ A E+CVYCGE H ++ CP+NPAS+F+VGNQ                                  N+  F      +FKV+ L+E      K     
Subjt:  VNQVAEEACVYCGENHNYEFCPNNPASVFFVGNQRNNPYSNFYNPGWRNHPNFAWGGQGSSAQAQQKVNQSGFAKMQMLQFKVIKLDESPGIASKPLEEP

Query:  RKTQDLNKHGDKDVVVEKELEFGQGAGGSKKDVGASGSVPDVEPPYVSPPPYVPPLPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEA
                  +K  + EK+                             P  +  P+            GGK+LGRALCDLG+SINLMPLS+Y++LGIGEA
Subjt:  RKTQDLNKHGDKDVVVEKELEFGQGAGGSKKDVGASGSVPDVEPPYVSPPPYVPPLPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEA

Query:  RPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFI
        RPTTVTLQLADRS TY EGKIED+L++VDKFIFP DFIILDYEAD DVPIILG PFL T                           +MKYP   E+CS +
Subjt:  RPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFI

Query:  -RILESTVIETAIQDSANQ------------------------------RKAPPIKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEA
          + E    E      + Q                              RK+ P++PS+ EAP LDLK L  +LKY YLG+ +TLPII+ + L    E+ 
Subjt:  -RILESTVIETAIQDSANQ------------------------------RKAPPIKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEA

Query:  LIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGT---------------------------
        L++ L++++ AIGWTLADI+GISPS CMHKI LEEG  +SIEQQRRLNP MKEVV+KE++KWLDAGI+YPI                             
Subjt:  LIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGT---------------------------

Query:  ----VTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFS
            V GWR+CMDYRRLNKATRKDH+PLPFIDQML+RLAG+++YCFLDGYSGYNQITI PEDQEKTTFTCPYG FAFRRM FGLCNAPATFQRCM+AIF+
Subjt:  ----VTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFS

Query:  DMIESTIE
        DM+E+ +E
Subjt:  DMIESTIE

XP_023521781.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785639, partial [Cucurbita pepo subsp. pepo]2.1e-17048.47Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPA-IEPAAV
        METFYNGLN  T+ +VDASA GA+L+KT++EAYEILERI++N+CQW+DVR    +K + VLEVD +S+I   LA + N L+N+ +       A +  AAV
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPA-IEPAAV

Query:  VNQVAEEACVYCGENHNYEFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFAWGGQGS-SAQAQQKVN-QSGFAKMQMLQF----------K
        +NQ A E+CV CGE H ++ C +N  S+F+VGNQ      +NNP+SN YNPGWRNHPNF+W GQGS + Q   K N   GF     L +           
Subjt:  VNQVAEEACVYCGENHNYEFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHPNFAWGGQGS-SAQAQQKVN-QSGFAKMQMLQF----------K

Query:  VIKLDESPGIASKPL-EEPRKTQDLNKHGDKDVVVEKELEFGQGAGGSKKDVGASGSVPDVEP------------------PYVSPPPYVPPLPFPQRQK
          +    PG + + L +E     D      +  +   E++ GQ A   +    A   +P+                     P       +     P ++K
Subjt:  VIKLDESPGIASKPL-EEPRKTQDLNKHGDKDVVVEKELEFGQGAGGSKKDVGASGSVPDVEP------------------PYVSPPPYVPPLPFPQRQK

Query:  PRN------QNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFL
                   GGK+LGRALCDLG+SINLMPLS+Y++LGIGEARPTTVTLQLADRS TY EGKIED+L++VDKF FP DFIILDYEAD DVPIILG PFL
Subjt:  PRN------QNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFL

Query:  AT---------------------------AMKYPDEMEDCSFIRIL----------------------ESTVIETAIQDSANQ---------RKAPPIKP
         T                           +MKYP   ++CS +  L                         + + A+    N+         RK+ P++P
Subjt:  AT---------------------------AMKYPDEMEDCSFIRIL----------------------ESTVIETAIQDSANQ---------RKAPPIKP

Query:  SLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKK
        S+ EAP LDLK L  +LKY YLG+ +TLPII+ + L    E+ L++ L++++ AIGWTLADI+GISPS CMHKI LEEG  +SIEQQRRLNP MKEV   
Subjt:  SLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKK

Query:  EVIKWLDAGIVYPIGTVTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAP
          +   +   +     V GWR+CMDYRRLNKATRKDH+PLPFIDQML+RLAG+++YCFLDGYSGYNQITI PEDQEKTTFTCPY  FAFRRM FGLCNAP
Subjt:  EVIKWLDAGIVYPIGTVTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAP

Query:  ATFQRCMLAIFSDMIESTIE
        ATFQRCM+AIF+DM+E+ +E
Subjt:  ATFQRCMLAIFSDMIESTIE

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]3.6e-15443.49Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPA-IEPAAV
        METFYNGLN  T+ +VDASA GA+L+KT++EAYEILERI++N+CQW+DVR    +K R VLEVD +S+I   LA + N L+N+ +       A +  AAV
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGT-NKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPA-IEPAAV

Query:  VNQVAEEAC----VYCGEN-----HNYEFCPNNPASVFFVGNQ-RNNPYSNFYNPGWRNHPNFAWGGQGSSAQAQQKVNQSGFAKMQMLQFKVIKLDESP
        +NQ A E+C     Y  +N            N    V  + N+ RN P S    P     P                  + G  + Q ++ +  K   S 
Subjt:  VNQVAEEAC----VYCGEN-----HNYEFCPNNPASVFFVGNQ-RNNPYSNFYNPGWRNHPNFAWGGQGSSAQAQQKVNQSGFAKMQMLQFKVIKLDESP

Query:  GIASKPLEEPR--KTQDLNKHGDKDVVVEKELEFGQGAGGSKKDVGASGSVPDVEPPYVSPPP--------YVPPLPFPQRQK-----------------
        G   K   + R  +T D  +  +K VV E+  +    A   K+       +  V+PP              Y P  PFPQR K                 
Subjt:  GIASKPLEEPR--KTQDLNKHGDKDVVVEKELEFGQGAGGSKKDVGASGSVPDVEPPYVSPPP--------YVPPLPFPQRQK-----------------

Query:  -------------------------------------PRNQN-------------------------GGKELGRALCDLGASINLMPLSVYRRLGIGEAR
                                             P N+                          GGK+LGRALCDLG+SINLMPLS+Y++LGIGEAR
Subjt:  -------------------------------------PRNQN-------------------------GGKELGRALCDLGASINLMPLSVYRRLGIGEAR

Query:  PTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFI-
        PTTVTLQLADRS TY EGKIED+L++VDKFIFP DFIILDYEAD DVPIILG PFL T                           +MKYP   E+CS + 
Subjt:  PTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFI-

Query:  RILESTVIETAIQDSANQ------------------------------RKAPPIKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEAL
         + E    E      + Q                              RK+ P++PS+ EAP LDLK L  +LKY YLG+ +TLPII+ + L    E+ L
Subjt:  RILESTVIETAIQDSANQ------------------------------RKAPPIKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEAL

Query:  IKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGT----------------------------
        ++ L++++ AIGWTLADI+GISPS CMHKI LEEG  +SIEQQRRLNP MKEVV+KE++KWLDAGI+YPI                              
Subjt:  IKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGT----------------------------

Query:  ---VTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSD
           V GWR+CMDYRRLNKATRKDH+PLPFIDQML+RLAG+++YCFLDGYSGYNQITI PEDQEKTTFTCPYG FAFRRM FGLCNAPATFQRCM+AIF+D
Subjt:  ---VTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSD

Query:  MIESTIE
        M+E+ +E
Subjt:  MIESTIE

TrEMBL top hitse value%identityAlignment
A0A2G9G6G2 Reverse transcriptase9.2e-14842.69Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN
        + TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN  V   Q  P         
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN

Query:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQAQQKVNQSGFAKMQ----MLQFKVIKLDESPGI
              C  CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNF+W    GQGS+ + QQ+  Q     MQ     L+  +I+   S   
Subjt:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQAQQKVNQSGFAKMQ----MLQFKVIKLDESPGI

Query:  ASKPLE----------------------EPRKTQDL---------------NKHGDKDVV---VEKELE------------------------FGQGAGG
          K +E                      EP   QD+                K  +K+V+    EKE+E                        F +    
Subjt:  ASKPLE----------------------EPRKTQDL---------------NKHGDKDVV---VEKELE------------------------FGQGAGG

Query:  SKKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVK
         K+ +G    V   E         +PP L  P         G    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TY +G I+D+LVK
Subjt:  SKKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVK

Query:  VDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------AMKYPDEMEDCSFIRILES------------TVIETAIQD-------------------
        VDKFIFP DF++LD E D +VPIILG PFLAT         AMK+P+E ++C  + + ++              +E A+ D                   
Subjt:  VDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------AMKYPDEMEDCSFIRILES------------TVIETAIQD-------------------

Query:  --------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEE
                 + +R AP   +KPS+ E PTL+LK L  HL Y YLGE +TLP+I+ S L     E L+++L+ ++  IGWT+ADI+GISPSFCMHKI LE+
Subjt:  --------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEE

Query:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQML
            SIE QRRLNP MKEVVKKE+IKWLDAGI+YPI                                TVTGWRVCMDYR+LNKATRKDH+PLPFIDQML
Subjt:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQML

Query:  NRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        +RLAG+ +YCFLDGYSGYNQI I PEDQEKTTFTCPYGTFAFRRM FGLCNAPATFQRCM+AIF+DM+E+ +E
Subjt:  NRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

A0A2G9HH15 Reverse transcriptase3.9e-14641.77Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN
        + TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN  V   Q  P         
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN

Query:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI
              C  CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNF+W    GQG + +     QQ+V Q    K   L+  +I+   S   
Subjt:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI

Query:  ASKPLE----------------------EPRKTQDL----------------------NKHGDKDVVVE---KELE----------------FGQGAGGS
          K +E                      EP   QD                        K  +K+V+ E   KE+E                F +     
Subjt:  ASKPLE----------------------EPRKTQDL----------------------NKHGDKDVVVE---KELE----------------FGQGAGGS

Query:  KKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKV
        K+ +G   +V   E         +PP L  P         G    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TY  G IED+LVKV
Subjt:  KKDVGASGSVPDVEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKV

Query:  DKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQD--
        DKFIFP DF++LD E D +VPIILG PFLAT                           AMK+P+E ++C  + + +               +E A+ D  
Subjt:  DKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQD--

Query:  -------------------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLAD
                                  + +R AP   +KPS+ E PTL+LK L  HL Y YLGE +TLP+I+ S L     E L+++L+ ++ AIGWT+AD
Subjt:  -------------------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLAD

Query:  IQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLN
        I+GISPSFCMHKI LE+G   S+E QRRLNP MKEVVKKE+IKWLDAGI+YPI                                TVTGWRVCMDYR+LN
Subjt:  IQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLN

Query:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        KATRKDH+PLPFIDQML+RLAG+ +YCFLDGYSGYNQI IVPEDQEKTTFTCPYGTF FR+M FGLCNAPATFQRCM+AIF+DM+E+ +E
Subjt:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

A0A2G9HWC5 DNA-directed DNA polymerase8.9e-14341.68Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN
        + TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN  V   Q  P         
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN

Query:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI
              C  CGE++  + CP++  S+ FV N R   NNPYSN YNPGWR HPNF+W    GQGS+ +     QQ V Q    K   L+  +++   S   
Subjt:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI

Query:  ASKPLE----------------------EPRKTQD---------LNKHGDKDVVVEKE-----LEFGQGAGGSKKDVGASGSVPDVEPPYVSPPPYVPP-
          K +E                      EP   QD         L  H +       E     ++F +     K+ +G   +V   E         +PP 
Subjt:  ASKPLE----------------------EPRKTQD---------LNKHGDKDVVVEKE-----LEFGQGAGGSKKDVGASGSVPDVEPPYVSPPPYVPP-

Query:  LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCP
        L  P         G    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TY +G IED+LVKVDKFIFP D ++LD E D ++ IILG P
Subjt:  LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCP

Query:  FLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQDSANQ-------------------------
        FLAT                           AMK+P+E ++C  + + ++              +E A+ D  ++                         
Subjt:  FLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQDSANQ-------------------------

Query:  ----RKAPPIKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQR
             ++  +KPS+ E PTL+LK L  HL YVYLGE +TLP+I+ S L     E L+++L+ +  AIGWT+ADI+GISPSFCMHKI LE+    S+E QR
Subjt:  ----RKAPPIKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQR

Query:  RLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYC
        RLNP MKEVVKKE+IKWLDAGI+YPI                                TVTGWR CMDYR+LNKATRKDH+PLPFIDQML+RLAG+ +YC
Subjt:  RLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYC

Query:  FLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        FLDGYSGYNQI I PEDQEK TFTCPYGTFAFRRM FGLCNAPATFQRCM+AIF+DM+E+ +E
Subjt:  FLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

A0A2G9HWF8 Reverse transcriptase9.8e-14241.26Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN
        + TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN                   
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN

Query:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQAQQ--------KVNQSGFAKMQMLQF-------
                 CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNF+W    GQGS+ + QQ           Q G A+ Q +         
Subjt:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQAQQ--------KVNQSGFAKMQMLQF-------

Query:  -----------KVIKLDESPGIASKPLEEPRKT------------QDLNKHGDKDVVVEKEL-----------------EFGQGAGGSKKDVGASGSVPD
                   K +  +E       PLE  + T            Q L K   K + V K+L                 +F +     K+ +G   +V  
Subjt:  -----------KVIKLDESPGIASKPLEEPRKT------------QDLNKHGDKDVVVEKEL-----------------EFGQGAGGSKKDVGASGSVPD

Query:  VEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIIL
         E         +PP L  P R             RALCDLGASINLMP S+YR LG+ EA+PT++TLQLADRS+TY +G IED+LVKVDKFIFP DF++L
Subjt:  VEPPYVSPPPYVPP-LPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIIL

Query:  DYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQD--------------
        D E D +VPIILG PFLAT                           AMK+P+E ++C  + + ++              +E A+ D              
Subjt:  DYEADKDVPIILGCPFLAT---------------------------AMKYPDEMEDCSFIRILESTV------------IETAIQD--------------

Query:  -------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHK
                      + +R AP   +KPS+ E PTL+LK L  HL Y YLGE +TLP+I+ S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHK
Subjt:  -------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHK

Query:  ITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPF
        I LE+    S+E QRRLNP MKEVVKKE+IKWLDAGI+YPI                                TVTGWRVCMDYR+LNKATRKDH+PLPF
Subjt:  ITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG-------------------------------TVTGWRVCMDYRRLNKATRKDHYPLPF

Query:  IDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        IDQML+RLAG+ +YCFLDGYSGYNQI I PEDQEKTTFTCPYGTFAFRR+ F LCNAPATFQRCM+AIF+DM+E+ +E
Subjt:  IDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

A0A2G9HYA0 Reverse transcriptase8.1e-14440.02Show/hide
Query:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN
        + TFY+GL    +  +D   G + L+ T  E + +L  +  N  +    R T  K   V+EVD V+ +   +  +  ++KN  V   Q  P         
Subjt:  METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVN

Query:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI
              C  CGE H  + CP++  S+ FV N R   NNPYSN YNPGWR HPNF+W    GQGS+ +     QQ+V Q    K   L+  +I+   S   
Subjt:  QVAEEACVYCGENHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWRNHPNFAWG---GQGSSAQ----AQQKVNQSGFAKMQMLQFKVIKLDESPGI

Query:  ASKPLE----------------------EPRKTQDLNKHGDKDVVVE--KELE--FGQGAGGSKKDVGASGSVPDVEPPY-VSPPPYVPPLPFPQR----
          K +E                      EP   QD  K   + V +   +EL+    +     +K+V +     +VE P  VS P  + P PFPQR    
Subjt:  ASKPLE----------------------EPRKTQDLNKHGDKDVVVE--KELE--FGQGAGGSKKDVGASGSVPDVEPPY-VSPPPYVPPLPFPQR----

Query:  ---------------------------QKPRNQNGGKEL------------------------------------------------GRALCDLGASINL
                                   Q P      K++                                                GRALCDLGASINL
Subjt:  ---------------------------QKPRNQNGGKEL------------------------------------------------GRALCDLGASINL

Query:  MPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------
        MP S+YR LG+GEA+PT++TLQLADRS+TY +G IED+LVKVDKFIFP DF++LD E D +VPIILG PFLAT                           
Subjt:  MPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGCPFLAT---------------------------

Query:  AMKYPDEMEDCSFIRILESTV------------IETAIQD---------------------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKY
        AMK+P+E ++C  + + +               +E A+ D                            + +R  P   +KPS+ + PTL+LK L  HL Y
Subjt:  AMKYPDEMEDCSFIRILESTV------------IETAIQD---------------------------SANQRKAPP--IKPSLIEAPTLDLKLLSDHLKY

Query:  VYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG----
         YLGE +TLP+I+ S L     E L+++L+ ++ AIGWT+ADI+GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGI+YPI     
Subjt:  VYLGEGETLPIIVGSDLMPEHEEALIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIG----

Query:  ---------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFA
                                   TVTGWRVCMDYR+LNKATRKDH+PLPFIDQML+RLAG+ +YCFLDGYSGYNQI I PEDQEKTTFTCPYGTFA
Subjt:  ---------------------------TVTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFA

Query:  FRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        FRRM FGLCNAPATFQRCM+AIF+DM+E+ +E
Subjt:  FRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.8e-1641.67Show/hide
Query:  WRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMI
        +R+ +DYR+LN+ T  D +P+P +D++L +L    Y+  +D   G++QI + PE   KT F+  +G + + RM FGL NAPATFQRCM  I   ++
Subjt:  WRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMI

P20825 Retrovirus-related Pol polyprotein from transposon 2978.0e-1642.71Show/hide
Query:  WRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMI
        +RV +DYR+LN+ T  D YP+P +D++L +L    Y+  +D   G++QI +  E   KT F+   G + + RM FGL NAPATFQRCM  I   ++
Subjt:  WRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMI

P31843 RNA-directed DNA polymerase homolog3.1e-1539.47Show/hide
Query:  RVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE
        R+C+DYR L K T K+ YP+P +D + +RLA   ++  LD  SGY Q+ I   D+ KTT    YG+F FR M FGL NA ATF   M  +  + ++  + 
Subjt:  RVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIE

Query:  TVYACTGILESFYT
         VY    ++ + Y+
Subjt:  TVYACTGILESFYT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-2035.33Show/hide
Query:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGTVTG------------WRVCMDYRRLN
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   + P  +               +R+C+DYR LN
Subjt:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGTVTG------------WRVCMDYRRLN

Query:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDM
        KAT  D +PLP ID +L+R+     +  LD +SGY+QI + P+D+ KT F  P G + +  M FGL NAP+TF R M   F D+
Subjt:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDM

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-2035.33Show/hide
Query:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGTVTG------------WRVCMDYRRLN
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   + P  +               +R+C+DYR LN
Subjt:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGTVTG------------WRVCMDYRRLN

Query:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDM
        KAT  D +PLP ID +L+R+     +  LD +SGY+QI + P+D+ KT F  P G + +  M FGL NAP+TF R M   F D+
Subjt:  KATRKDHYPLPFIDQMLNRLAGQAYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDM

Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases5.6e-0436.73Show/hide
Query:  WSSVDVVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADITKQM
        ++  D VYMPFN  + HWV LC D +  +  + DS   L  DA +  ++
Subjt:  WSSVDVVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADITKQM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTTGATGCTTCAGCTGGAGGGGCACTTTTGGCAAAGACTTTTGATGAAGCCTACGAAATTTTAGA
AAGAATATCTACTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAGGAATGTATTAGAGGTTGATGGTGTGTCCACCATAAGGGTTGATCTTGCTA
TGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCAATAGAGCCTGCTGCAGTGGTGAACCAAGTGGCAGAAGAAGCATGTGTCTATTGT
GGTGAAAATCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAA
CCACCCCAACTTCGCATGGGGAGGACAAGGAAGCAGTGCGCAAGCACAACAAAAGGTGAATCAGTCGGGATTTGCTAAAATGCAGATGCTGCAATTCAAAGTAATCAAGC
TCGATGAGAGCCCTGGAATTGCAAGTAAGCCACTAGAAGAGCCTAGAAAAACCCAAGATTTAAATAAACATGGTGATAAAGACGTTGTTGTTGAGAAAGAGTTGGAGTTT
GGTCAGGGTGCTGGAGGCAGCAAGAAAGATGTTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCC
ACAAAGGCAAAAGCCTAGGAATCAGAACGGTGGAAAAGAGTTAGGTAGAGCACTTTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAGGCTAG
GTATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAATTAGCTGATAGGTCTATCACATATTCAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATA
TTTCCAGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTTGTCCATTTTTGGCTACTGCCATGAAGTATCCAGACGAAATGGAGGA
TTGTTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCAGCTAATCAAAGGAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCTA
CTTTAGATTTGAAGCTCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGTGAGACATTGCCCATTATTGTTGGATCAGACTTAATGCCGGAGCATGAAGAGGCC
TTAATAAAGTTGTTGCAGCAATACCGCAAGGCTATAGGTTGGACGTTGGCTGACATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATC
CTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGGGATCGTTTATCCAATTGGGACAG
TAACTGGCTGGAGGGTTTGCATGGACTACAGGAGGCTTAATAAAGCTACCCGTAAGGACCATTACCCTCTACCATTTATTGACCAGATGTTGAATAGATTGGCTGGTCAG
GCCTACTACTGTTTCTTAGATGGTTATTCTGGGTATAACCAGATTACCATTGTTCCTGAGGATCAGGAAAAAACCACTTTCACCTGTCCGTATGGGACGTTTGCTTTTAG
GCGAATGTCTTTTGGCCTTTGCAATGCTCCAGCAACATTTCAGCGGTGTATGTTGGCAATTTTTTCTGATATGATTGAGTCCACTATCGAGACTGTGTATGCTTGCACAG
GAATTCTAGAGTCTTTTTATACCATATTTCTACTCACTCTAGGAGCCCAACAAATACCTCGCTTTATCTTATTGCATAGGGTTTATAGTAAGAGAGGACTTAGGGTTTAT
GATTTAGGGAGCGGGGATGTGTACGAAGAACTCCTTCGTAGTGACCATGTGAACGACACGTTCGATTCGAGTAGATTCAAGACGGTCATTAACTACATAATGGGAGAACA
CACAGATTACAACGTTCCTTGGAGTTCTGTTGATGTTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGGGCGAATTTG
TGTTGACAGACTCCCTTACGGCACTGAATTCAGATGCAGACATAACGAAGCAGATGAATACGAGAAATGTACTAAAATTGTCTAAAACATCAAGAATTGTTCAATGGTTG
TCTGCATGTTTTTCTGTTATGACTCCTGTTCCCGCACCGGCTGCATTTGCAGACATGTCGCACTTCTCCCATAGACGGTATTCTCAACGTCTGACGACGACCAACTCTAG
GCACTCTTCGAGGAGGAAAGACTTACACATCGACGAATTGAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTTGATGCTTCAGCTGGAGGGGCACTTTTGGCAAAGACTTTTGATGAAGCCTACGAAATTTTAGA
AAGAATATCTACTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAGGAATGTATTAGAGGTTGATGGTGTGTCCACCATAAGGGTTGATCTTGCTA
TGATTGCTAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCAATAGAGCCTGCTGCAGTGGTGAACCAAGTGGCAGAAGAAGCATGTGTCTATTGT
GGTGAAAATCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAA
CCACCCCAACTTCGCATGGGGAGGACAAGGAAGCAGTGCGCAAGCACAACAAAAGGTGAATCAGTCGGGATTTGCTAAAATGCAGATGCTGCAATTCAAAGTAATCAAGC
TCGATGAGAGCCCTGGAATTGCAAGTAAGCCACTAGAAGAGCCTAGAAAAACCCAAGATTTAAATAAACATGGTGATAAAGACGTTGTTGTTGAGAAAGAGTTGGAGTTT
GGTCAGGGTGCTGGAGGCAGCAAGAAAGATGTTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCC
ACAAAGGCAAAAGCCTAGGAATCAGAACGGTGGAAAAGAGTTAGGTAGAGCACTTTGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAGGCTAG
GTATTGGTGAAGCTAGGCCTACCACAGTCACACTCCAATTAGCTGATAGGTCTATCACATATTCAGAGGGTAAAATTGAGGATGTCTTAGTAAAAGTGGATAAATTCATA
TTTCCAGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTCTAGGTTGTCCATTTTTGGCTACTGCCATGAAGTATCCAGACGAAATGGAGGA
TTGTTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCAGCTAATCAAAGGAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCTA
CTTTAGATTTGAAGCTCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGTGAGACATTGCCCATTATTGTTGGATCAGACTTAATGCCGGAGCATGAAGAGGCC
TTAATAAAGTTGTTGCAGCAATACCGCAAGGCTATAGGTTGGACGTTGGCTGACATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATC
CTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGGGATCGTTTATCCAATTGGGACAG
TAACTGGCTGGAGGGTTTGCATGGACTACAGGAGGCTTAATAAAGCTACCCGTAAGGACCATTACCCTCTACCATTTATTGACCAGATGTTGAATAGATTGGCTGGTCAG
GCCTACTACTGTTTCTTAGATGGTTATTCTGGGTATAACCAGATTACCATTGTTCCTGAGGATCAGGAAAAAACCACTTTCACCTGTCCGTATGGGACGTTTGCTTTTAG
GCGAATGTCTTTTGGCCTTTGCAATGCTCCAGCAACATTTCAGCGGTGTATGTTGGCAATTTTTTCTGATATGATTGAGTCCACTATCGAGACTGTGTATGCTTGCACAG
GAATTCTAGAGTCTTTTTATACCATATTTCTACTCACTCTAGGAGCCCAACAAATACCTCGCTTTATCTTATTGCATAGGGTTTATAGTAAGAGAGGACTTAGGGTTTAT
GATTTAGGGAGCGGGGATGTGTACGAAGAACTCCTTCGTAGTGACCATGTGAACGACACGTTCGATTCGAGTAGATTCAAGACGGTCATTAACTACATAATGGGAGAACA
CACAGATTACAACGTTCCTTGGAGTTCTGTTGATGTTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGGGCGAATTTG
TGTTGACAGACTCCCTTACGGCACTGAATTCAGATGCAGACATAACGAAGCAGATGAATACGAGAAATGTACTAAAATTGTCTAAAACATCAAGAATTGTTCAATGGTTG
TCTGCATGTTTTTCTGTTATGACTCCTGTTCCCGCACCGGCTGCATTTGCAGACATGTCGCACTTCTCCCATAGACGGTATTCTCAACGTCTGACGACGACCAACTCTAG
GCACTCTTCGAGGAGGAAAGACTTACACATCGACGAATTGAGGTGA
Protein sequenceShow/hide protein sequence
METFYNGLNGVTQGMVDASAGGALLAKTFDEAYEILERISTNSCQWSDVRGTNKKVRNVLEVDGVSTIRVDLAMIANALKNVTVISHQQPPAIEPAAVVNQVAEEACVYC
GENHNYEFCPNNPASVFFVGNQRNNPYSNFYNPGWRNHPNFAWGGQGSSAQAQQKVNQSGFAKMQMLQFKVIKLDESPGIASKPLEEPRKTQDLNKHGDKDVVVEKELEF
GQGAGGSKKDVGASGSVPDVEPPYVSPPPYVPPLPFPQRQKPRNQNGGKELGRALCDLGASINLMPLSVYRRLGIGEARPTTVTLQLADRSITYSEGKIEDVLVKVDKFI
FPVDFIILDYEADKDVPIILGCPFLATAMKYPDEMEDCSFIRILESTVIETAIQDSANQRKAPPIKPSLIEAPTLDLKLLSDHLKYVYLGEGETLPIIVGSDLMPEHEEA
LIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIVYPIGTVTGWRVCMDYRRLNKATRKDHYPLPFIDQMLNRLAGQ
AYYCFLDGYSGYNQITIVPEDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMLAIFSDMIESTIETVYACTGILESFYTIFLLTLGAQQIPRFILLHRVYSKRGLRVY
DLGSGDVYEELLRSDHVNDTFDSSRFKTVINYIMGEHTDYNVPWSSVDVVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADITKQMNTRNVLKLSKTSRIVQWL
SACFSVMTPVPAPAAFADMSHFSHRRYSQRLTTTNSRHSSRRKDLHIDELR