CuGenDBv2

Lag0000341 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000341
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr4:4668578..4670897
RNA-Seq Expression: Lag0000341
Synteny: Lag0000341
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
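
The genome location above is given as a chromosome name followed by a start..end coordinate pair. As a minimal sketch, the string can be split and the genomic span computed as below. The function names are hypothetical, and the span assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Span of a closed interval; assumes 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:4668578..4670897")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2320 bp on chr4; with 0-based half-open coordinates the span would be one base shorter.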


Homology
GenBank top hits | e value | % identity
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 6.9e-134 | 40.89
Query:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ
        E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                          +S   TPN AY   KR D++ISSWL+ SM+EEI++Q
Subjt:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ

Query:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV
        MLHC + KEIW+ L  IFSS  L Q M+FK KL  I+KG                           +DHILYIL GLGS+Y+SM+SVISA+    SV EV
Subjt:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV

Query:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP
        M+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Subjt:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP

Query:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------
           S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL +G+EYGG                                
Subjt:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------

Query:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL
                                               D   G+VLLQG L++GLY+F +    +PSHK    +  +T  +     + +V   SN+  L
Subjt:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL

Query:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS
        D+WHRRLGHP L  VK VL     N S    K  FC+ACA+ K H+LPFS S T YT PL                              +YTWIYFL S
Subjt:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS

Query:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR
        KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN I E  HR+I++  L LLS +++PL F DEAFS +++LINR
Subjt:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR

Query:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        LP+ VL   SPLE +F  KP++  L+ FG +C+P L P+ +SHKL+ RSTP TF+GY++SHKG
Subjt:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

KAF7814697.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora] | 1.9e-91 | 33.15
Query:  VKLTDDNFLLWKFHI---------------------QFALEGSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWD
        +KLT++NFL+WK  I                     +FA    +  +  N  Y      D+++ SWL+ SMTEE++++ + C TTK++W+ L   +S+  
Subjt:  VKLTDDNFLLWKFHI---------------------QFALEGSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWD

Query:  LVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIE--SKLVHT-D
          +  + + +L+  +KG                            DH+  I  GL  EYES V+ IS +    SV E+ ALL  QE R+E   K V +  
Subjt:  LVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIE--SKLVHT-D

Query:  TSLPYVNLSVQSKPADNDA--QKYNPPSFPPHFSGGNRGRWGDQSNRGGR------TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYN
         ++  V+LS ++ P+ N    Q +N        +G  RG + + + + GR      TW   NR QCQ+CGK  H AV CY R+       +         
Subjt:  TSLPYVNLSVQSKPADNDA--QKYNPPSFPPHFSGGNRGRWGDQSNRGGR------TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYN

Query:  QFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNL-----FVGTE---YGGDQAFG--------------RVLLQGTLHEGLYRFN-VSISQ
        Q NR P+   M  M+  P+   D  WYPDS  +NHLT++  NL     + GTE    G  Q                  + L+  +H      N +S+S+
Subjt:  QFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNL-----FVGTE---YGGDQAFG--------------RVLLQGTLHEGLYRFN-VSISQ

Query:  QPSHKPT----------VQALHSTTTILIHTAYLSVYSVSNSSKL----------------------DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKF
                         V++  +  T+L  +    +Y V + S L                      ++WH RLGHPS   V HVL     ++ +N    
Subjt:  QPSHKPT----------VQALHSTTTILIHTAYLSVYSVSNSSKL----------------------DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKF

Query:  QFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKSKSDAFDAFVHIEKL----LNLPIVQFPSDNGGE
          C AC + K+H+LPFS S T+Y+APL                              K+TW+Y LK+K DA  AF+  + L    LN  I    SD GGE
Subjt:  QFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKSKSDAFDAFVHIEKL----LNLPIVQFPSDNGGE

Query:  FLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCF
        +  F  +L  HGI  R S PHT  QNGIAE  H+HI +  L LL+H+SMPLKF D+AF  A+FLINRLP+ +LH RSPLE++F+TKP+Y FLK FG  C+
Subjt:  FLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCF

Query:  PCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        P L P+N  HK A++S   TFIGY+  HKG
Subjt:  PCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum] | 9.7e-96 | 34.72
Query:  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG------------------------
        G++ V  PN  +    R D+++ S+L+ SM+E    QM+ C T+ ++W  + Q+F++    +VM++K +LQT++KG                        
Subjt:  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG------------------------

Query:  --VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIES-KLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQS
           +D IL+IL G+G EYES+V  ++++V   S+ EV ALL   E RIE+  +    T+ P VN  V + P+   A+  N     P + G  RGR G   
Subjt:  --VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIES-KLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQS

Query:  NRGGR-TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNR-SPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEY
         RGGR  W+N  R  CQ+CG   H A  CY+R+     P +      S  QFNR SPS+P      T  +   +  WYPDS  ++H+T++ GNL V +EY
Subjt:  NRGGR-TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNR-SPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEY

Query:  GG----------------------------------------------------------------------DQAFGRVLLQGTLHEGLYRFNV-SISQQ
         G                                                                      D A   VLL+GTLH GLYRFN+ S    
Subjt:  GG----------------------------------------------------------------------DQAFGRVLLQGTLHEGLYRFNV-SISQQ

Query:  PSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----
        P H P    L S+ + +       +    N+  LD WH RLGHPS++TVK VL      +S N+    FC +C + K H LPF  S+T ++AP       
Subjt:  PSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----

Query:  ------------------------KYTWIYFLKSKSDAFDAFVHIEKL----LNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTH
                                +YTWIYFLK KS+    F++ +K      N  I    +D GGEF     + +S+GI  RFS P+TS+QNG+ E  H
Subjt:  ------------------------KYTWIYFLKSKSDAFDAFVHIEKL----LNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTH

Query:  RHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        RH+VDT L+LL+H+S+P +F ++AF  A++LINRLPS  L  +SP   ++  +PDYS L+ FG  CFPCL P+N +HKLA+RSTP TF+GY+  HKG
Subjt:  RHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 4.5e-93 | 31.59
Query:  NKVLIVKLTDDNFLLWKFHIQFALEGSSV--------------------VKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIF
        ++++ ++L DDNFL+WK+ I+ A+ G  +                    V  PN  +   +R D ++ SWL+ S+    + Q++ C +  E+W+ ++Q F
Subjt:  NKVLIVKLTDDNFLLWKFHIQFALEGSSV--------------------VKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIF

Query:  SSWDLVQVMKFKTKLQTIQK--------------------------GVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHT
        +S    +VM +K+++Q ++K                             DHIL I+ GLG EYES+++VIS+K    S+  V + L   E RI  K+   
Subjt:  SSWDLVQVMKFKTKLQTIQK--------------------------GVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHT

Query:  DTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGG--NRGRWGDQSNRGGRTWNNRNR----------IQCQLCGKFNHTAVKCYFRYAP------PSAP
        D S+ Y +      P+ +    +N   +P   S G  NR ++G      G   +NR R           QCQLC KF HT  +C++RY P      P+  
Subjt:  DTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGG--NRGRWGDQSNRGGRTWNNRNR----------IQCQLCGKFNHTAVKCYFRYAP------PSAP

Query:  PNPS--------------SFAPSYN----QFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------
        P P               S A + N        +  +  M  M+  P+  Q+  W+PDS  TNH+TH+ GNL  G EY G                    
Subjt:  PNPS--------------SFAPSYN----QFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------

Query:  ---------------------------------------------------DQAFGRVLLQGTLHEGLYRFNVS-----ISQQPSHKPTVQALHSTTTIL
                                                           D++   +LLQG LH+GLY+FN+S      +   S       L      L
Subjt:  ---------------------------------------------------DQAFGRVLLQGTLHEGLYRFNVS-----ISQQPSHKPTVQALHSTTTIL

Query:  IHTAYLSVYSVSNSS--KLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH--------------------
        +H         +NSS    D+WH+RLGHP+   V  VL   K   S  +     C AC + K+H+LPF  S T YT PL                     
Subjt:  IHTAYLSVYSVSNSS--KLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH--------------------

Query:  ---------KYTWIYFLKSKSDAFDAFV----HIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSS
                 +YTW+YFLK+KS   +AF+      E      +  F +D GGEF   K + E +GI  R S PHTS+QNGI E  HRHIV+  L LL+ +S
Subjt:  ---------KYTWIYFLKSKSDAFDAFV----HIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSS

Query:  MPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        +PLK+  +AFS A+FLINRLP+EVL  + P E +FN+KP+YS LK FG  CFP L P+N+ HKL +RS+P TF+GY+S HKG
Subjt:  MPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 6.9e-134 | 40.89
Query:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ
        E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                          +S   TPN AY   KR D++ISSWL+ SM+EEI++Q
Subjt:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ

Query:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV
        MLHC + KEIW+ L  IFSS  L Q M+FK KL  I+KG                           +DHILYIL GLGS+Y+SM+SVISA+    SV EV
Subjt:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV

Query:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP
        M+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Subjt:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP

Query:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------
           S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL +G+EYGG                                
Subjt:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------

Query:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL
                                               D   G+VLLQG L++GLY+F +    +PSHK    +  +T  +     + +V   SN+  L
Subjt:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL

Query:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS
        D+WHRRLGHP L  VK VL     N S    K  FC+ACA+ K H+LPFS S T YT PL                              +YTWIYFL S
Subjt:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS

Query:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR
        KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN I E  HR+I++  L LLS +++PL F DEAFS +++LINR
Subjt:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR

Query:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        LP+ VL   SPLE +F  KP++  L+ FG +C+P L P+ +SHKL+ RSTP TF+GY++SHKG
Subjt:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

TrEMBL top hits | e value | % identity
A0A2Z7AWA7 Integrase catalytic domain-containing protein | 4.7e-96 | 34.72
Query:  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG------------------------
        G++ V  PN  +    R D+++ S+L+ SM+E    QM+ C T+ ++W  + Q+F++    +VM++K +LQT++KG                        
Subjt:  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG------------------------

Query:  --VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIES-KLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQS
           +D IL+IL G+G EYES+V  ++++V   S+ EV ALL   E RIE+  +    T+ P VN  V + P+   A+  N     P + G  RGR G   
Subjt:  --VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIES-KLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQS

Query:  NRGGR-TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNR-SPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEY
         RGGR  W+N  R  CQ+CG   H A  CY+R+     P +      S  QFNR SPS+P      T  +   +  WYPDS  ++H+T++ GNL V +EY
Subjt:  NRGGR-TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNR-SPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEY

Query:  GG----------------------------------------------------------------------DQAFGRVLLQGTLHEGLYRFNV-SISQQ
         G                                                                      D A   VLL+GTLH GLYRFN+ S    
Subjt:  GG----------------------------------------------------------------------DQAFGRVLLQGTLHEGLYRFNV-SISQQ

Query:  PSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----
        P H P    L S+ + +       +    N+  LD WH RLGHPS++TVK VL      +S N+    FC +C + K H LPF  S+T ++AP       
Subjt:  PSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----

Query:  ------------------------KYTWIYFLKSKSDAFDAFVHIEKL----LNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTH
                                +YTWIYFLK KS+    F++ +K      N  I    +D GGEF     + +S+GI  RFS P+TS+QNG+ E  H
Subjt:  ------------------------KYTWIYFLKSKSDAFDAFVHIEKL----LNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTH

Query:  RHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        RH+VDT L+LL+H+S+P +F ++AF  A++LINRLPS  L  +SP   ++  +PDYS L+ FG  CFPCL P+N +HKLA+RSTP TF+GY+  HKG
Subjt:  RHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.2e-93 | 31.59
Query:  NKVLIVKLTDDNFLLWKFHIQFALEGSSV--------------------VKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIF
        ++++ ++L DDNFL+WK+ I+ A+ G  +                    V  PN  +   +R D ++ SWL+ S+    + Q++ C +  E+W+ ++Q F
Subjt:  NKVLIVKLTDDNFLLWKFHIQFALEGSSV--------------------VKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIF

Query:  SSWDLVQVMKFKTKLQTIQK--------------------------GVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHT
        +S    +VM +K+++Q ++K                             DHIL I+ GLG EYES+++VIS+K    S+  V + L   E RI  K+   
Subjt:  SSWDLVQVMKFKTKLQTIQK--------------------------GVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHT

Query:  DTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGG--NRGRWGDQSNRGGRTWNNRNR----------IQCQLCGKFNHTAVKCYFRYAP------PSAP
        D S+ Y +      P+ +    +N   +P   S G  NR ++G      G   +NR R           QCQLC KF HT  +C++RY P      P+  
Subjt:  DTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGG--NRGRWGDQSNRGGRTWNNRNR----------IQCQLCGKFNHTAVKCYFRYAP------PSAP

Query:  PNPS--------------SFAPSYN----QFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------
        P P               S A + N        +  +  M  M+  P+  Q+  W+PDS  TNH+TH+ GNL  G EY G                    
Subjt:  PNPS--------------SFAPSYN----QFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------

Query:  ---------------------------------------------------DQAFGRVLLQGTLHEGLYRFNVS-----ISQQPSHKPTVQALHSTTTIL
                                                           D++   +LLQG LH+GLY+FN+S      +   S       L      L
Subjt:  ---------------------------------------------------DQAFGRVLLQGTLHEGLYRFNVS-----ISQQPSHKPTVQALHSTTTIL

Query:  IHTAYLSVYSVSNSS--KLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH--------------------
        +H         +NSS    D+WH+RLGHP+   V  VL   K   S  +     C AC + K+H+LPF  S T YT PL                     
Subjt:  IHTAYLSVYSVSNSS--KLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH--------------------

Query:  ---------KYTWIYFLKSKSDAFDAFV----HIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSS
                 +YTW+YFLK+KS   +AF+      E      +  F +D GGEF   K + E +GI  R S PHTS+QNGI E  HRHIV+  L LL+ +S
Subjt:  ---------KYTWIYFLKSKSDAFDAFV----HIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSS

Query:  MPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        +PLK+  +AFS A+FLINRLP+EVL  + P E +FN+KP+YS LK FG  CFP L P+N+ HKL +RS+P TF+GY+S HKG
Subjt:  MPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

A0A438K147 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.1e-89 | 32.92
Query:  NKVLIVKLTDDNFLLWKFHIQFALEGSSVVK---------------------TPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQI
        N  L VKL + NFL+WK  I  A+ G  + K                          + + ++ D+++ SWL+ S++E I+ +++ C T+  +W  L Q 
Subjt:  NKVLIVKLTDDNFLLWKFHIQFALEGSSVVK---------------------TPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQI

Query:  FSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVH
        F+S    +  +FKT+LQ  +KG                           +DH+  IL GL ++YES ++ +  +    SV E+ ALL   E+R+E     
Subjt:  FSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVH

Query:  TDTS-LPYVNLSVQSKPADNDAQKYNPPSFPPHFSG--GNRGRWGD-------------------QSNRGG-------------------RTWNNRNRIQ
         D+S   +V  S   +  +   Q Y   +   + SG  G+ GR GD                   +SNRGG                     WN+ N+ +
Subjt:  TDTS-LPYVNLSVQSKPADNDAQKYNPPSFPPHFSG--GNRGRWGD-------------------QSNRGG-------------------RTWNNRNRIQ

Query:  ---CQLCGKFNHTAVKCYFRYAPP-SAPPNPSSFAPS---YNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGGD------
           CQLCGK  H   +CY+R+      P N S   PS   Y  F+     P++N ++   ++  D  WYPDS  +NH+T N  NL    E+ G       
Subjt:  ---CQLCGKFNHTAVKCYFRYAPP-SAPPNPSSFAPS---YNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGGD------

Query:  QAFGRVLLQ--GTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMS-INNMKFQFCD
           G       G + +GLY F+   S   + +PT Q+L  + +++  +    V   S SS  D+WH+RLG PS +T+K+VL   K N++ IN M   FC 
Subjt:  QAFGRVLLQ--GTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMS-INNMKFQFCD

Query:  ACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKSKSDAFDAFVH----IEKLLNLPIVQFPSDNGGEFLCF
        +C + K H  PFS S TTYT PL                              +++WI+ L++KS+A   FV+    +E   +L I    +D GGEF  F
Subjt:  ACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKSKSDAFDAFVH----IEKLLNLPIVQFPSDNGGEFLCF

Query:  KPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLC
        + +L  +GI  R S PHT QQNG+AE  HR IV+  L LL   S+PLKF DE+F   ++L NRLP+ VLH + P+E++F + PDYSFLK FG  CFP L 
Subjt:  KPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLC

Query:  PHNRSHKLAYRSTPSTFIGYNSSHKG
        P+N +HKL YRS   TF+GY+  HKG
Subjt:  PHNRSHKLAYRSTPSTFIGYNSSHKG

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.3e-134 | 40.89
Query:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ
        E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                          +S   TPN AY   KR D++ISSWL+ SM+EEI++Q
Subjt:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ

Query:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV
        MLHC + KEIW+ L  IFSS  L Q M+FK KL  I+KG                           +DHILYIL GLGS+Y+SM+SVISA+    SV EV
Subjt:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV

Query:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP
        M+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Subjt:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP

Query:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------
           S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL +G+EYGG                                
Subjt:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------

Query:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL
                                               D   G+VLLQG L++GLY+F +    +PSHK    +  +T  +     + +V   SN+  L
Subjt:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL

Query:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS
        D+WHRRLGHP L  VK VL     N S    K  FC+ACA+ K H+LPFS S T YT PL                              +YTWIYFL S
Subjt:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS

Query:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR
        KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN I E  HR+I++  L LLS +++PL F DEAFS +++LINR
Subjt:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR

Query:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        LP+ VL   SPLE +F  KP++  L+ FG +C+P L P+ +SHKL+ RSTP TF+GY++SHKG
Subjt:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.3e-134 | 40.89
Query:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ
        E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                          +S   TPN AY   KR D++ISSWL+ SM+EEI++Q
Subjt:  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE-------------------------GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQ

Query:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV
        MLHC + KEIW+ L  IFSS  L Q M+FK KL  I+KG                           +DHILYIL GLGS+Y+SM+SVISA+    SV EV
Subjt:  MLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEV

Query:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP
        M+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Subjt:  MALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKY---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP

Query:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------
           S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL +G+EYGG                                
Subjt:  --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGG--------------------------------

Query:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL
                                               D   G+VLLQG L++GLY+F +    +PSHK    +  +T  +     + +V   SN+  L
Subjt:  ---------------------------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKL

Query:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS
        D+WHRRLGHP L  VK VL     N S    K  FC+ACA+ K H+LPFS S T YT PL                              +YTWIYFL S
Subjt:  DIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS

Query:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR
        KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN I E  HR+I++  L LLS +++PL F DEAFS +++LINR
Subjt:  KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR

Query:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG
        LP+ VL   SPLE +F  KP++  L+ FG +C+P L P+ +SHKL+ RSTP TF+GY++SHKG
Subjt:  LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKG

SwissProt top hits | e value | % identity
P04146 Copia protein | 2.5e-17 | 27.34
Query:  YSVS--NSSKLDIWHRRLGHPSLSTVKHVLQ--LFKPNMSINNMKF--QFCDACAMEKTHSLPFS--PSSTTYTAPLH----------------------
        YS++  + +   +WH R GH S   +  + +  +F     +NN++   + C+ C   K   LPF      T    PL                       
Subjt:  YSVS--NSSKLDIWHRRLGHPSLSTVKHVLQ--LFKPNMSINNMKF--QFCDACAMEKTHSLPFS--PSSTTYTAPLH----------------------

Query:  -------KYTWIYFLKSKSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLC--FKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSS
                Y   Y +K KSD F  F       E   NL +V    DNG E+L    + F    GI+   + PHT Q NG++E   R I +    ++S + 
Subjt:  -------KYTWIYFLKSKSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLC--FKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSS

Query:  MPLKFQDEAFSIALFLINRLPSEVL--HGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGY
        +   F  EA   A +LINR+PS  L    ++P E+  N KP    L+ FG   +  +   N+  K   +S  S F+GY
Subjt:  MPLKFQDEAFSIALFLINRLPSEVL--HGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 7.1e-57 | 26.95
Query:  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKGV-----------------------
        G+      N  YT+ KR DK+I S ++ +++  +   +    T  +IW+ L +I+++     V + +T+L+   KG                        
Subjt:  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKGV-----------------------

Query:  ---EDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQSN
           ++ +  +L  L  EY+ ++  I+AK  P ++ E+   L   E++I +    T   +    +S ++    N+    N  +   + +  N  +   QS+
Subjt:  ---EDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQSN

Query:  RGGRTWNNRNRI---QCQLCGKFNHTAVKC----YFRYAPPSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFV
              NN+++    +CQ+CG   H+A +C    +F  +  S  P PS F P           PR N+ L +P       W  DS  T+H+T +F NL +
Subjt:  RGGRTWNNRNRI---QCQLCGKFNHTAVKC----YFRYAPPSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFV

Query:  GTEY-GGDQA--------------------------FGRVLLQGTLHEGL---YRF----NVSISQQPSHKPTVQALHSTTTILIHTAYLSVY-------
           Y GGD                               +L    +H+ L   YR      VS+   P+    V+ L++   +L       +Y       
Subjt:  GTEY-GGDQA--------------------------FGRVLLQGTLHEGL---YRF----NVSISQQPSHKPTVQALHSTTTILIHTAYLSVY-------

Query:  -------SVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH------------------------
               S S+ +    WH RLGHP+ S +  V+  +  ++   + KF  C  C + K++ +PFS S+   T PL                         
Subjt:  -------SVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH------------------------

Query:  ----KYTWIYFLKSKSDAFDAFVHIEKLL----NLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKF
            +YTW+Y LK KS   + F+  + LL       I  F SDNGGEF+    +   HGI+   S PHT + NG++E  HRHIV+T L LLSH+S+P  +
Subjt:  ----KYTWIYFLKSKSDAFDAFVHIEKLL----NLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKF

Query:  QDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKGLL
           AF++A++LINRLP+ +L   SP + +F T P+Y  L+ FG  C+P L P+N+ HKL  +S    F+GY+ +    L
Subjt:  QDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKGLL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.9e-58 | 26.86
Query:  IVKLTDDNFLLW--KFHIQF-----------------ALEGSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFS--SW
        + KLT  N+L+W  + H  F                 A  G+  V   N  YT+ +R DK+I S ++ +++  +   +    T  +IW+ L +I++  S+
Subjt:  IVKLTDDNFLLW--KFHIQF-----------------ALEGSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFS--SW

Query:  DLVQVMKFKTKLQTI-----QKGVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYN
          V  ++F T+   +         ++ +  +L  L  +Y+ ++  I+AK  P S+ E+   L  +E+++ +  +++   +P     V  +  + +  + N
Subjt:  DLVQVMKFKTKLQTI-----QKGVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYN

Query:  PPSFPPHFSGGNRGRWGDQSNRGGRTWNNRNRI---QCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNRSPSF----PRMNVMLTAPDINQDT
              + +  NR      S+ G R+ N + +    +CQ+C    H+A +C          P    F  + NQ   +  F    PR N+ + +P      
Subjt:  PPSFPPHFSGGNRGRWGDQSNRGGRTWNNRNRI---QCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNRSPSF----PRMNVMLTAPDINQDT

Query:  TWYPDSDTTNHLTHNFGNLFVGTEY-GGDQA--------------------------FGRVLLQGTLHEGL---YRF----NVSISQQPSHKPTVQALHS
         W  DS  T+H+T +F NL     Y GGD                              +VL    +H+ L   YR      VS+   P+    V+ L++
Subjt:  TWYPDSDTTNHLTHNFGNLFVGTEY-GGDQA--------------------------FGRVLLQGTLHEGL---YRF----NVSISQQPSHKPTVQALHS

Query:  TTTILIHTAYLSVYS--VSNSSKLDI------------WHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH---
           +L       +Y   +++S  + +            WH RLGHPSL+ +  V+      +   + K   C  C + K+H +PFS S+ T + PL    
Subjt:  TTTILIHTAYLSVYS--VSNSSKLDI------------WHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH---

Query:  -------------------------KYTWIYFLKSKSDAFDAFV----HIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECT
                                 +YTW+Y LK KS   D F+     +E      I    SDNGGEF+  + +L  HGI+   S PHT + NG++E  
Subjt:  -------------------------KYTWIYFLKSKSDAFDAFV----HIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECT

Query:  HRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKGLL
        HRHIV+  L LLSH+S+P  +   AFS+A++LINRLP+ +L  +SP + +F   P+Y  LK FG  C+P L P+NR HKL  +S    F+GY+ +    L
Subjt:  HRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKGLL

Arabidopsis top hits    e value    %identity    Alignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein    2.5e-04    31.25
Query:  HRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFG
        +R I++   ++L    +P  F+ +A + A+ +IN+ PS  ++   P E+ F + P YS+L+ FG
Subjt:  HRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFG


Sequences
CDS sequence
ATGTTTTGTCAGAGTGGTATCAGTGCTGCGATGGAGGAGTCTTCAGCTTCCTCTCAAATCTTTGGCTCTGGTAATAAGGTTTTGATTGTGAAACTTACTGATGATAACTT
CCTTTTGTGGAAATTTCATATTCAATTTGCTCTTGAGGGTTCTTCGGTGGTAAAAACCCCAAATCTAGCTTATACTAAATGTAAACGTCATGACAAAATAATTTCATCAT
GGCTTGTCGATTCGATGACAGAGGAAATTATTCATCAAATGCTTCACTGTGGAACAACGAAGGAAATCTGGGATTGTCTTGCTCAAATTTTCTCTTCTTGGGATCTTGTC
CAGGTGATGAAATTCAAAACTAAATTACAAACTATCCAGAAGGGAGTTGAGGATCATATTTTATATATCTTATATGGTCTCGGTTCTGAATACGAATCGATGGTTTCGGT
TATTTCGGCAAAGGTAGGTCCTCAATCGGTCCATGAGGTCATGGCCCTCTTGTTTACTCAAGAGAATAGGATTGAGAGTAAACTTGTTCACACTGATACTTCTCTACCAT
ATGTAAATCTCTCCGTTCAATCAAAACCTGCTGATAACGATGCTCAGAAATATAATCCACCTTCGTTTCCTCCTCATTTTAGTGGTGGTAACAGAGGACGTTGGGGTGAC
CAGTCCAATCGAGGAGGTAGAACATGGAATAATCGAAATAGAATTCAATGCCAATTGTGTGGGAAATTCAATCATACTGCAGTGAAGTGCTATTTTCGATATGCTCCTCC
CAGTGCTCCTCCCAATCCAAGTTCGTTTGCTCCTTCTTATAACCAATTTAATCGATCTCCTTCATTTCCTCGGATGAATGTTATGCTCACTGCTCCTGATATTAACCAAG
ATACGACTTGGTACCCTGACTCCGATACCACGAATCACCTTACTCATAACTTTGGGAATCTCTTCGTTGGAACTGAGTATGGTGGTGATCAAGCGTTTGGTCGAGTTCTG
CTCCAAGGGACTCTCCATGAGGGACTTTATCGATTCAATGTCTCTATCTCTCAACAACCATCTCATAAACCAACGGTTCAAGCTCTTCATTCCACCACCACTATTCTGAT
TCATACTGCTTATCTTTCTGTTTACTCTGTTTCAAATAGTTCTAAATTAGATATTTGGCATAGACGTCTAGGCCATCCAAGTTTGTCCACTGTCAAACATGTGTTACAGT
TGTTTAAACCAAATATGTCTATAAATAATATGAAGTTTCAATTCTGTGATGCTTGTGCAATGGAAAAAACTCACTCCCTACCCTTCTCTCCTTCCTCTACTACTTACACT
GCTCCTCTTCATAAATATACCTGGATTTATTTCTTAAAATCGAAGTCGGATGCCTTTGATGCTTTTGTTCATATTGAGAAACTTCTAAACTTACCAATTGTGCAATTTCC
ATCTGATAATGGTGGTGAGTTCCTATGTTTCAAACCATTTTTGGAGTCTCATGGCATTACTCGTAGGTTTTCTTATCCTCACACATCCCAACAAAATGGGATTGCAGAAT
GCACGCACAGACACATTGTTGATACTGACCTTGCCTTACTCTCTCATTCCTCAATGCCTCTAAAATTCCAGGATGAAGCGTTTTCTATCGCTCTGTTTTTAATTAATAGG
CTGCCTTCTGAAGTTCTTCATGGTAGGAGTCCCTTGGAAATCATCTTTAACACTAAACCTGACTATTCTTTTCTTAAGGCCTTTGGTTTCCAATGTTTTCCTTGTCTCTG
TCCACATAATCGATCTCATAAGTTAGCCTACAGGTCTACTCCTAGCACCTTTATTGGTTACAACTCATCTCATAAAGGTTTATTGTTGTTTGTCTTCTAA
mRNA sequence
ATGTTTTGTCAGAGTGGTATCAGTGCTGCGATGGAGGAGTCTTCAGCTTCCTCTCAAATCTTTGGCTCTGGTAATAAGGTTTTGATTGTGAAACTTACTGATGATAACTT
CCTTTTGTGGAAATTTCATATTCAATTTGCTCTTGAGGGTTCTTCGGTGGTAAAAACCCCAAATCTAGCTTATACTAAATGTAAACGTCATGACAAAATAATTTCATCAT
GGCTTGTCGATTCGATGACAGAGGAAATTATTCATCAAATGCTTCACTGTGGAACAACGAAGGAAATCTGGGATTGTCTTGCTCAAATTTTCTCTTCTTGGGATCTTGTC
CAGGTGATGAAATTCAAAACTAAATTACAAACTATCCAGAAGGGAGTTGAGGATCATATTTTATATATCTTATATGGTCTCGGTTCTGAATACGAATCGATGGTTTCGGT
TATTTCGGCAAAGGTAGGTCCTCAATCGGTCCATGAGGTCATGGCCCTCTTGTTTACTCAAGAGAATAGGATTGAGAGTAAACTTGTTCACACTGATACTTCTCTACCAT
ATGTAAATCTCTCCGTTCAATCAAAACCTGCTGATAACGATGCTCAGAAATATAATCCACCTTCGTTTCCTCCTCATTTTAGTGGTGGTAACAGAGGACGTTGGGGTGAC
CAGTCCAATCGAGGAGGTAGAACATGGAATAATCGAAATAGAATTCAATGCCAATTGTGTGGGAAATTCAATCATACTGCAGTGAAGTGCTATTTTCGATATGCTCCTCC
CAGTGCTCCTCCCAATCCAAGTTCGTTTGCTCCTTCTTATAACCAATTTAATCGATCTCCTTCATTTCCTCGGATGAATGTTATGCTCACTGCTCCTGATATTAACCAAG
ATACGACTTGGTACCCTGACTCCGATACCACGAATCACCTTACTCATAACTTTGGGAATCTCTTCGTTGGAACTGAGTATGGTGGTGATCAAGCGTTTGGTCGAGTTCTG
CTCCAAGGGACTCTCCATGAGGGACTTTATCGATTCAATGTCTCTATCTCTCAACAACCATCTCATAAACCAACGGTTCAAGCTCTTCATTCCACCACCACTATTCTGAT
TCATACTGCTTATCTTTCTGTTTACTCTGTTTCAAATAGTTCTAAATTAGATATTTGGCATAGACGTCTAGGCCATCCAAGTTTGTCCACTGTCAAACATGTGTTACAGT
TGTTTAAACCAAATATGTCTATAAATAATATGAAGTTTCAATTCTGTGATGCTTGTGCAATGGAAAAAACTCACTCCCTACCCTTCTCTCCTTCCTCTACTACTTACACT
GCTCCTCTTCATAAATATACCTGGATTTATTTCTTAAAATCGAAGTCGGATGCCTTTGATGCTTTTGTTCATATTGAGAAACTTCTAAACTTACCAATTGTGCAATTTCC
ATCTGATAATGGTGGTGAGTTCCTATGTTTCAAACCATTTTTGGAGTCTCATGGCATTACTCGTAGGTTTTCTTATCCTCACACATCCCAACAAAATGGGATTGCAGAAT
GCACGCACAGACACATTGTTGATACTGACCTTGCCTTACTCTCTCATTCCTCAATGCCTCTAAAATTCCAGGATGAAGCGTTTTCTATCGCTCTGTTTTTAATTAATAGG
CTGCCTTCTGAAGTTCTTCATGGTAGGAGTCCCTTGGAAATCATCTTTAACACTAAACCTGACTATTCTTTTCTTAAGGCCTTTGGTTTCCAATGTTTTCCTTGTCTCTG
TCCACATAATCGATCTCATAAGTTAGCCTACAGGTCTACTCCTAGCACCTTTATTGGTTACAACTCATCTCATAAAGGTTTATTGTTGTTTGTCTTCTAA
Protein sequence
MFCQSGISAAMEESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALEGSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLV
QVMKFKTKLQTIQKGVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGD
QSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGGDQAFGRVL
LQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYT
APLHKYTWIYFLKSKSDAFDAFVHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINR
LPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKGLLLFVF