; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041024 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041024
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr13:10959961..10966422
RNA-Seq ExpressionLag0041024
SyntenyLag0041024
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN71138.1 hypothetical protein VITISV_008660 [Vitis vinifera]7.6e-5826.97Show/hide
Query:  PPPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEY-------LGKISVSQFARDTVVFFEFHPTFCVVKDLATRQALLQGTLHEGLYKFN-----------
        P  E   D NWYPD GA+NH+T++  N+  S E+       +G  +VS+FA+D  VFFEFH   C VK   T+  L+ G + +GLY F+           
Subjt:  PPPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEY-------LGKISVSQFARDTVVFFEFHPTFCVVKDLATRQALLQGTLHEGLYKFN-----------

Query:  -------------------------------------------LPKPLSSPNTIVSQPKSN-------------------------------------NF
                                                   L K L  P  ++S                                           F
Subjt:  -------------------------------------------LPKPLSSPNTIVSQPKSN-------------------------------------NF

Query:  GKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQ
           I SLQT+ G EF+ F   L  +GI HRVSCP+T QQNG+ E KHR IV  GLTLL  + +PL FWD+ F T V+L NRLP+ VL     +E LF+  
Subjt:  GKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQ

Query:  PDYSTLKVFGCS------------------------------------------------HPIVTRSKVGIFKPKAYLT--------------TFVDIEP
        PDYS LKVFGCS                                                HP++TR+K GI KPK  L                +   E 
Subjt:  PDYSTLKVFGCS------------------------------------------------HPIVTRSKVGIFKPKAYLT--------------TFVDIEP

Query:  P-------KVKEVFKCSHWKNAIQ--DEYDALIKNNTWDLVPTP------LNQKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQF
        P       K + V K  H +      + +  ++K +T  +V T         +++D+NN FL+G L E V+M QP GF  +   +LVCRL KALYGLKQ 
Subjt:  P-------KVKEVFKCSHWKNAIQ--DEYDALIKNNTWDLVPTP------LNQKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQF

Query:  PWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLIRLDATTKFCYLSILSARFLGTSIWDSWKPKNRAEQRLGRLKQRERSGNQPIVRGDRDGDVRRWARF
        P AWFE+L   L S  F+++K+D SL  R         T    Y+  +  +++   +          + ++   K         +  G  DGD       
Subjt:  PWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLIRLDATTKFCYLSILSARFLGTSIWDSWKPKNRAEQRLGRLKQRERSGNQPIVRGDRDGDVRRWARF

Query:  GLIYGIYAIDVIYACVIGISDSIAYGRI-ITKMVDANPITTPMVSGPLL--SAHQDWASDLDDRKSTSGFCVF---------------------------
          ++G       Y   +G    +   R  ++  V+ + +     S  +L      DWASDLDDR+STSG CVF                           
Subjt:  GLIYGIYAIDVIYACVIGISDSIAYGRI-ITKMVDANPITTPMVSGPLL--SAHQDWASDLDDRKSTSGFCVF---------------------------

Query:  --------------------------------------FVVTSLHGDLRN-------------RRSLNVQHLPTSDQIVDVLTNLLSAVNFLKLRSKLNV
                                              +    LH   ++             R+ + V+H+P++DQ+ DV T  + +  F++ R KL +
Subjt:  --------------------------------------FVVTSLHGDLRN-------------RRSLNVQHLPTSDQIVDVLTNLLSAVNFLKLRSKLNV

Query:  REPSSIGLRGGV
           S++ LRG V
Subjt:  REPSSIGLRGGV

KAG8480454.1 hypothetical protein CXB51_024676 [Gossypium anomalum]4.4e-5829.53Show/hide
Query:  ISVSQFARDTVVFFEFHPTFCVVKDLATRQALLQGTLHEGLYKFNLPK----PLSSP--------------NTIVSQPKSN--------------NFGKP
        ISV QFA+D  V+FEFHP  C VKD+ TR+ LL G +H+GLY+F+L +     + SP              ++ +  P SN              N G P
Subjt:  ISVSQFARDTVVFFEFHPTFCVVKDLATRQALLQGTLHEGLYKFNLPK----PLSSP--------------NTIVSQPKSN--------------NFGKP

Query:  I-----------------------LSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLIN
        +                          +T+GG EF+     L   G+ HRV+CPYTS+QNG+VER+HRHIV++GLTLL+ +S+PL +W D FS +  LIN
Subjt:  I-----------------------LSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLIN

Query:  RLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC---------------------------------------------------------------------
        RLP+ VL   SP EKL++ QPDY+ L+ + C                                                                     
Subjt:  RLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC---------------------------------------------------------------------

Query:  ---------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDA
                                                            HP+ TRSK GIFKP+ + +   + EP  + E F+ + W  A + EY A
Subjt:  ---------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDA

Query:  LIKNNTWDLVPTPLN-------------------------------------------------------------------QKIDINNVFLHGILSETV
        LI N+TWDLVP P                                                                     +++D+NN FL+G L E +
Subjt:  LIKNNTWDLVPTPLN-------------------------------------------------------------------QKIDINNVFLHGILSETV

Query:  YMDQPAGF--HVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLIRLDATTKFCYLSI
        YM QP GF  H  G   LVCRL KALYGLKQ P AWF +L  FL +  F+ SKAD SL  R++G       T+F Y+ +
Subjt:  YMDQPAGF--HVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLIRLDATTKFCYLSI

KAG8491907.1 hypothetical protein CXB51_015260 [Gossypium anomalum]2.6e-5830.32Show/hide
Query:  PPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEYLGK-------------------------------------------ISVSQFARDTVVFFEFHPTFC
        P   V D  WYPDSGATNH+T    N+   S Y G                                            +SV QFA+D VV+FEFHP FC
Subjt:  PPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEYLGK-------------------------------------------ISVSQFARDTVVFFEFHPTFC

Query:  VVKDLATRQALLQGTLHEGLYKFNL--------------PKP-----------------LSSP--NTIVSQPKSNN--------FGKPILSLQTNGGAEF
         VKD+ TR+ LL G +H GLYKF+               P P                 L  P  N +V   +S N        FG  I  LQT+ G E+
Subjt:  VVKDLATRQALLQGTLHEGLYKFNL--------------PKP-----------------LSSP--NTIVSQPKSNN--------FGKPILSLQTNGGAEF

Query:  KRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPS-------------------------------
        +  +  L   GI HRV+ P+TS+QNG+VERKHRH+VD+GLT+L+ +S+ L FW   F+ +V L+N LP+                               
Subjt:  KRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPS-------------------------------

Query:  ---------------------IVLGG------------LSPLEKLF----QKQPDYSTLKV--------------------FGCSHPIVTRSKVGIFKPK
                             +V+G              SP+++ F     + P Y +  +                     G  HP+ TRSK GIFKPK
Subjt:  ---------------------IVLGG------------LSPLEKLF----QKQPDYSTLKV--------------------FGCSHPIVTRSKVGIFKPK

Query:  AYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTPLNQK--------------------------------------------------
         + +T  + EP  + E  +   WK A   EY+AL+ N+T DL+P P ++K                                                  
Subjt:  AYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTPLNQK--------------------------------------------------

Query:  -----------------IDINNVFLHGILSETVYMDQPAGFHV--KGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSG
                         +DINN FL+G L E +YM QP GF     G   LVC+L KALYGLKQ P AWF +L  FL +  F  SKAD SL   RSG
Subjt:  -----------------IDINNVFLHGILSETVYMDQPAGFHV--KGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSG

RVW87716.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.8e-6029.07Show/hide
Query:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF
        E +QD NWYPDSGAT+HLT +L N+   S++          GK                                     +SVS+FA D  VFFEFHPT 
Subjt:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF

Query:  CVVKDLATRQALLQGTL------------------------------HEG-------LYKFNLP-------------------------------KPLSS
        C VKDL+TR  L+ G L                              H         L K NLP                               KPL  
Subjt:  CVVKDLATRQALLQGTL------------------------------HEG-------LYKFNLP-------------------------------KPLSS

Query:  PNTIVSQPKSN-------------------------NFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSS
         +T +  P S                          N    I ++Q++ G E++ F   L ++GI HR+SCPYT +QNG+ ERKHRHIV+ G+ LL+ +S
Subjt:  PNTIVSQPKSN-------------------------NFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSS

Query:  MPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC---------------------------------------------------
        +P  +WD+ F TSV+LINRLP+ VL   SPLE LF ++P YS LKVFGC                                                   
Subjt:  MPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC---------------------------------------------------

Query:  ----------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYD
                                                            SH ++TRSK GIFKPKAYL   +   P  V E  + SHWK A+ DEY 
Subjt:  ----------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYD

Query:  ALIKNNTWDLVPTPLNQK-------------------------------------------------------------------IDINNVFLHGILSET
        AL++NNTWDLVP P ++K                                                                   +D+NN FL+G L E 
Subjt:  ALIKNNTWDLVPTPLNQK-------------------------------------------------------------------IDINNVFLHGILSET

Query:  VYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL
        ++M QP GF      + VC+L K+LYGLKQ P AWFE+L   L  L F ++K+D SL
Subjt:  VYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL

RVX03305.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.8e-5828.17Show/hide
Query:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF
        E +QD NWY DSGAT+HLT +L N+   S++          GK                                     +SVS+FA D  VFFEFHPT 
Subjt:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF

Query:  CVVKDLATRQALLQGTLHEGLYKFN---LPKPL------------SSPNTIVSQPKS------NNFGKP-------------------------------
        C VKDL+TR  L+   L  GLY F+   L  PL            S   T+ + P S      N  G P                               
Subjt:  CVVKDLATRQALLQGTLHEGLYKFN---LPKPL------------SSPNTIVSQPKS------NNFGKP-------------------------------

Query:  --------------------------------ILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDV
                                        I ++Q++ G E++ F   L ++GI HR+SCPYT +QNG+ ERKHRHIV+ G+ LL+ +S+P  +WD+ 
Subjt:  --------------------------------ILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDV

Query:  FSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC------------------------------------------------------------
        F TSV+LINRLP+ VL   SPLE LF ++P YS LKVFGC                                                            
Subjt:  FSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC------------------------------------------------------------

Query:  ------------------------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVF
                                                                          SH ++TRSK GIFKPKAYL   +   P  V E  
Subjt:  ------------------------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVF

Query:  KCSHWKNAIQDEYDALIKNNTWDLVPTPLNQK-------------------------------------------------------------------I
        + SHWK  + DEY AL++NNTWDLVP P ++K                                                                   +
Subjt:  KCSHWKNAIQDEYDALIKNNTWDLVPTPLNQK-------------------------------------------------------------------I

Query:  DINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL
        D+NN FL+G L E ++M QP GF      + VC+L K+LYGLKQ P AWFE+L   L  L F ++K+D SL
Subjt:  DINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL

TrEMBL top hitse value%identityAlignment
A0A438FHJ8 Retrovirus-related Pol polyprotein from transposon RE25.9e-5641.82Show/hide
Query:  PPPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEYLGKISVSQF------ARDTVVFFEFHPTFCVVKDLATR--QALLQGTLHEGLYKFNLPKPLSSPNT
        P  E   D NWY +SGA+NH+T +  NM  S+E+  +  V+Q        RD +  F+          LA R  Q+LL+  L   L  F L     +  T
Subjt:  PPPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEYLGKISVSQF------ARDTVVFFEFHPTFCVVKDLATR--QALLQGTLHEGLYKFNLPKPLSSPNT

Query:  IVS--QPKSNNFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLG
         V+        F   I SLQTN G EF+ F   L  +GI HRVSCP T QQNG+VERKHR IV+ GLTLL  +S+PL FWD+ F   V+L NRLP++VL 
Subjt:  IVS--QPKSNNFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLG

Query:  GLSPLEKLFQKQPDYSTLKVFGCS----------HPIVTRSKVGIF-----KPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTP
           P+E LF+  PDYS LKVFGCS          H +  RS+   F     K K Y       EP  V    +   WK A+  EYDAL +NNTW LV  P
Subjt:  GLSPLEKLFQKQPDYSTLKVFGCS----------HPIVTRSKVGIF-----KPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTP

Query:  -----------------LN---QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWA
                         +N   +K+D+NNVFL+  L E V+M QP GF  +   +LVC+L KALYGLKQ P A
Subjt:  -----------------LN---QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWA

A0A438HTD6 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-6029.07Show/hide
Query:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF
        E +QD NWYPDSGAT+HLT +L N+   S++          GK                                     +SVS+FA D  VFFEFHPT 
Subjt:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF

Query:  CVVKDLATRQALLQGTL------------------------------HEG-------LYKFNLP-------------------------------KPLSS
        C VKDL+TR  L+ G L                              H         L K NLP                               KPL  
Subjt:  CVVKDLATRQALLQGTL------------------------------HEG-------LYKFNLP-------------------------------KPLSS

Query:  PNTIVSQPKSN-------------------------NFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSS
         +T +  P S                          N    I ++Q++ G E++ F   L ++GI HR+SCPYT +QNG+ ERKHRHIV+ G+ LL+ +S
Subjt:  PNTIVSQPKSN-------------------------NFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSS

Query:  MPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC---------------------------------------------------
        +P  +WD+ F TSV+LINRLP+ VL   SPLE LF ++P YS LKVFGC                                                   
Subjt:  MPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC---------------------------------------------------

Query:  ----------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYD
                                                            SH ++TRSK GIFKPKAYL   +   P  V E  + SHWK A+ DEY 
Subjt:  ----------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYD

Query:  ALIKNNTWDLVPTPLNQK-------------------------------------------------------------------IDINNVFLHGILSET
        AL++NNTWDLVP P ++K                                                                   +D+NN FL+G L E 
Subjt:  ALIKNNTWDLVPTPLNQK-------------------------------------------------------------------IDINNVFLHGILSET

Query:  VYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL
        ++M QP GF      + VC+L K+LYGLKQ P AWFE+L   L  L F ++K+D SL
Subjt:  VYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL

A0A438J300 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-5828.17Show/hide
Query:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF
        E +QD NWY DSGAT+HLT +L N+   S++          GK                                     +SVS+FA D  VFFEFHPT 
Subjt:  EKVQDCNWYPDSGATNHLTNSLGNMFVSSEY---------LGK-------------------------------------ISVSQFARDTVVFFEFHPTF

Query:  CVVKDLATRQALLQGTLHEGLYKFN---LPKPL------------SSPNTIVSQPKS------NNFGKP-------------------------------
        C VKDL+TR  L+   L  GLY F+   L  PL            S   T+ + P S      N  G P                               
Subjt:  CVVKDLATRQALLQGTLHEGLYKFN---LPKPL------------SSPNTIVSQPKS------NNFGKP-------------------------------

Query:  --------------------------------ILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDV
                                        I ++Q++ G E++ F   L ++GI HR+SCPYT +QNG+ ERKHRHIV+ G+ LL+ +S+P  +WD+ 
Subjt:  --------------------------------ILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDV

Query:  FSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC------------------------------------------------------------
        F TSV+LINRLP+ VL   SPLE LF ++P YS LKVFGC                                                            
Subjt:  FSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGC------------------------------------------------------------

Query:  ------------------------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVF
                                                                          SH ++TRSK GIFKPKAYL   +   P  V E  
Subjt:  ------------------------------------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVF

Query:  KCSHWKNAIQDEYDALIKNNTWDLVPTPLNQK-------------------------------------------------------------------I
        + SHWK  + DEY AL++NNTWDLVP P ++K                                                                   +
Subjt:  KCSHWKNAIQDEYDALIKNNTWDLVPTPLNQK-------------------------------------------------------------------I

Query:  DINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL
        D+NN FL+G L E ++M QP GF      + VC+L K+LYGLKQ P AWFE+L   L  L F ++K+D SL
Subjt:  DINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSL

A0A803NRU8 Uncharacterized protein7.7e-5628.77Show/hide
Query:  QDCNWYPDSGATNHLTNSLGNMFVSSEYLGK---------------------------------------------ISVSQFARDTVVFFEFHPTFCVVK
        +DCNWYPDSGATNH T  L N+  ++EY G+                                             +SVS+FARD  V FEFH   C VK
Subjt:  QDCNWYPDSGATNHLTNSLGNMFVSSEYLGK---------------------------------------------ISVSQFARDTVVFFEFHPTFCVVK

Query:  DLATRQALLQGTLHEGLYKFN------LPKPLSSPN-TIVSQPKSNNF------------------------------------------GKPILSLQTN
        D  T+  LL GTLH GLY F+      LP    S N  I+ +  SN F                                          GK I   Q++
Subjt:  DLATRQALLQGTLHEGLYKFN------LPKPLSSPN-TIVSQPKSNNF------------------------------------------GKPILSLQTN

Query:  GGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFG
         G E++ F   L+  GI HR  CP T +QNG+VERKHR IV+ GL+LL+ +SMPL FWD+ F  +V+L NRLP+ VL  LSP+E LF  +PDY  LK+FG
Subjt:  GGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFG

Query:  C---------------------------------------------------------------------------------------------------
        C                                                                                                   
Subjt:  C---------------------------------------------------------------------------------------------------

Query:  ----------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVP
                                                +H + TR+K GI+KPKA L   V  EP  VK   K   W NA+ +E  AL KN TW  VP
Subjt:  ----------------------------------------SHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVP

Query:  TP---------------LN----------------------------------------------------QKIDINNVFLHGILSETVYMDQPAGFHVK
         P               LN                                                    Q++D+NN FL+G L E VYM QP GF + 
Subjt:  TP---------------LN----------------------------------------------------QKIDINNVFLHGILSETVYMDQPAGFHVK

Query:  GATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRS
         A  LVC+L KALYGLKQ P AWF +L   L++  F +SK+D SL  + +
Subjt:  GATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRS

A5AD69 Integrase catalytic domain-containing protein3.7e-5826.97Show/hide
Query:  PPPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEY-------LGKISVSQFARDTVVFFEFHPTFCVVKDLATRQALLQGTLHEGLYKFN-----------
        P  E   D NWYPD GA+NH+T++  N+  S E+       +G  +VS+FA+D  VFFEFH   C VK   T+  L+ G + +GLY F+           
Subjt:  PPPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEY-------LGKISVSQFARDTVVFFEFHPTFCVVKDLATRQALLQGTLHEGLYKFN-----------

Query:  -------------------------------------------LPKPLSSPNTIVSQPKSN-------------------------------------NF
                                                   L K L  P  ++S                                           F
Subjt:  -------------------------------------------LPKPLSSPNTIVSQPKSN-------------------------------------NF

Query:  GKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQ
           I SLQT+ G EF+ F   L  +GI HRVSCP+T QQNG+ E KHR IV  GLTLL  + +PL FWD+ F T V+L NRLP+ VL     +E LF+  
Subjt:  GKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQKQ

Query:  PDYSTLKVFGCS------------------------------------------------HPIVTRSKVGIFKPKAYLT--------------TFVDIEP
        PDYS LKVFGCS                                                HP++TR+K GI KPK  L                +   E 
Subjt:  PDYSTLKVFGCS------------------------------------------------HPIVTRSKVGIFKPKAYLT--------------TFVDIEP

Query:  P-------KVKEVFKCSHWKNAIQ--DEYDALIKNNTWDLVPTP------LNQKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQF
        P       K + V K  H +      + +  ++K +T  +V T         +++D+NN FL+G L E V+M QP GF  +   +LVCRL KALYGLKQ 
Subjt:  P-------KVKEVFKCSHWKNAIQ--DEYDALIKNNTWDLVPTP------LNQKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQF

Query:  PWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLIRLDATTKFCYLSILSARFLGTSIWDSWKPKNRAEQRLGRLKQRERSGNQPIVRGDRDGDVRRWARF
        P AWFE+L   L S  F+++K+D SL  R         T    Y+  +  +++   +          + ++   K         +  G  DGD       
Subjt:  PWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLIRLDATTKFCYLSILSARFLGTSIWDSWKPKNRAEQRLGRLKQRERSGNQPIVRGDRDGDVRRWARF

Query:  GLIYGIYAIDVIYACVIGISDSIAYGRI-ITKMVDANPITTPMVSGPLL--SAHQDWASDLDDRKSTSGFCVF---------------------------
          ++G       Y   +G    +   R  ++  V+ + +     S  +L      DWASDLDDR+STSG CVF                           
Subjt:  GLIYGIYAIDVIYACVIGISDSIAYGRI-ITKMVDANPITTPMVSGPLL--SAHQDWASDLDDRKSTSGFCVF---------------------------

Query:  --------------------------------------FVVTSLHGDLRN-------------RRSLNVQHLPTSDQIVDVLTNLLSAVNFLKLRSKLNV
                                              +    LH   ++             R+ + V+H+P++DQ+ DV T  + +  F++ R KL +
Subjt:  --------------------------------------FVVTSLHGDLRN-------------RRSLNVQHLPTSDQIVDVLTNLLSAVNFLKLRSKLNV

Query:  REPSSIGLRGGV
           S++ LRG V
Subjt:  REPSSIGLRGGV

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.1e-0637.04Show/hide
Query:  KIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLI
        ++D+   FL+G L E +YM  P G  +   +  VC+L KA+YGLKQ    WFE     L    F+NS  D  +     G I
Subjt:  KIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-1443.59Show/hide
Query:  QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRR
        +++D+   FLHG L E +YM+QP GF V G   +VC+L K+LYGLKQ P  W+ +   F+ S  +L + +DP + F+R
Subjt:  QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-1436.28Show/hide
Query:  GKPILSLQTNGGAEF--KRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQ
        G+ +  L+++ G E+  + F     +HGI H  + P T Q NG+ ER +R IV+   ++L  + +P +FW +   T+ +LINR PS+ L    P      
Subjt:  GKPILSLQTNGGAEF--KRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLFQ

Query:  KQPDYSTLKVFGC
        K+  YS LKVFGC
Subjt:  KQPDYSTLKVFGC

P92520 Uncharacterized mitochondrial protein AtMg008202.4e-0648.53Show/hide
Query:  IVTRSKVGIFK--PKAYL--TTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTPLNQKI
        ++TRSK GI K  PK  L  TT +  EP  V    K   W  A+Q+E DAL +N TW LVP P+NQ I
Subjt:  IVTRSKVGIFK--PKAYL--TTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTPLNQKI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-2547.83Show/hide
Query:  NNFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLF
        N F   I +  ++ G EF         HGI+H  S P+T + NG+ ERKHRHIV+ GLTLLSH+S+P T+W   F+ +V+LINRLP+ +L   SP +KLF
Subjt:  NNFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLF

Query:  QKQPDYSTLKVFGCS
           P+Y  L+VFGC+
Subjt:  QKQPDYSTLKVFGCS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.1e-1447.5Show/hide
Query:  QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSG
        +++D+NN FL G L++ VYM QP GF  K   + VC+L KALYGLKQ P AW+  L  +L ++ F+NS +D SL   + G
Subjt:  QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.2e-2744.37Show/hide
Query:  TLHEGLYKFNLPKPLSSPNTIVSQPKSNNFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDD
        T +  LY       +     I      N F   I +L ++ G EF      L  HGI+H  S P+T + NG+ ERKHRHIV++GLTLLSH+S+P T+W  
Subjt:  TLHEGLYKFNLPKPLSSPNTIVSQPKSNNFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDD

Query:  VFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGCS
         FS +V+LINRLP+ +L   SP +KLF + P+Y  LKVFGC+
Subjt:  VFSTSVFLINRLPSIVLGGLSPLEKLFQKQPDYSTLKVFGCS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.1e-1447.5Show/hide
Query:  QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSG
        +++D+NN FL G L++ VYM QP GF  K     VCRL KA+YGLKQ P AW+  L  +L ++ F+NS +D SL   + G
Subjt:  QKIDINNVFLHGILSETVYMDQPAGFHVKGATSLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSG

Arabidopsis top hitse value%identityAlignment
ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.7e-0748.53Show/hide
Query:  IVTRSKVGIFK--PKAYL--TTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTPLNQKI
        ++TRSK GI K  PK  L  TT +  EP  V    K   W  A+Q+E DAL +N TW LVP P+NQ I
Subjt:  IVTRSKVGIFK--PKAYL--TTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTPLNQKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCTGAGTAGAACCCAACGTGGGCATGATCTTGATGAGCATATCAGTGACGATTCTGAACCACCACCTGAGAAGGTCCAGGATTGTAACTGGTATCCTGATTCCGG
AGCTACAAACCATTTAACCAACAGCCTCGGTAATATGTTTGTGAGCTCTGAATATCTTGGAAAAATCAGTGTCAGTCAGTTTGCTAGAGATACTGTTGTTTTCTTCGAAT
TTCATCCCACTTTTTGTGTTGTGAAGGATCTAGCAACTAGACAGGCACTTCTTCAAGGGACTCTACATGAAGGGCTATATAAGTTCAACCTGCCAAAGCCTTTGTCGTCA
CCAAACACCATTGTCTCTCAACCTAAGTCTAATAATTTTGGGAAACCTATTCTTTCTCTTCAAACTAATGGGGGTGCTGAATTTAAGCGTTTTATTCCTCTTCTTCAAAC
TCATGGCATAAATCATCGAGTCTCATGTCCTTATACATCTCAACAAAATGGTATAGTTGAGCGAAAGCACAGACATATTGTTGATGTTGGTCTTACATTATTATCTCATT
CATCTATGCCTTTAACGTTTTGGGATGATGTCTTTTCTACAAGTGTCTTTCTTATTAACAGGCTACCTTCTATAGTTCTTGGTGGCTTGAGTCCCTTGGAGAAGCTCTTC
CAGAAGCAACCAGATTATTCCACACTTAAGGTGTTTGGATGTTCTCATCCTATAGTTACTCGAAGTAAAGTAGGCATATTCAAACCTAAGGCTTACCTAACTACTTTTGT
TGATATAGAACCTCCTAAAGTTAAAGAGGTTTTTAAGTGTTCTCATTGGAAGAATGCTATACAAGATGAATATGATGCTCTTATTAAGAATAATACTTGGGATCTTGTTC
CTACACCTTTGAATCAAAAGATTGATATTAATAATGTCTTCTTGCATGGTATATTATCTGAGACTGTGTATATGGATCAGCCCGCTGGTTTTCATGTTAAGGGTGCTACT
TCTTTAGTTTGTCGGCTATGCAAAGCTTTATATGGTCTTAAACAATTTCCATGGGCTTGGTTTGAGCGCCTGAGTTTGTTTCTTAACTCTCTTTGTTTTCTAAATTCTAA
GGCTGATCCTTCTTTATTGTTTCGACGCTCGGGACTCATTCGTCTCGATGCTACGACGAAATTTTGCTATTTAAGCATCCTTTCGGCTAGGTTTTTGGGGACTTCGATTT
GGGACTCTTGGAAGCCGAAAAACAGGGCAGAACAGAGGCTTGGAAGGCTGAAACAGAGGGAGCGAAGTGGAAATCAACCCATTGTTCGTGGGGATCGTGACGGGGACGTT
CGCCGTTGGGCACGTTTTGGATTAATTTACGGTATTTATGCTATTGATGTCATTTATGCATGTGTTATAGGGATATCGGATTCAATTGCATATGGGAGAATCATAACTAA
AATGGTTGATGCAAATCCAATTACTACTCCTATGGTCAGTGGTCCCTTATTATCTGCCCATCAAGATTGGGCTTCTGACCTTGATGATCGTAAGTCAACTTCTGGCTTTT
GTGTTTTCTTTGTGGTAACCTCATTACATGGGGATCTAAGAAACAGACGGAGTTTGAACGTGCAACATTTACCAACTTCTGATCAGATTGTCGATGTTCTCACAAATCTG
TTGTCTGCTGTCAATTTTCTAAAGCTTCGGTCCAAGCTCAATGTTCGAGAGCCCTCTTCCATTGGCTTGAGAGGGGGTGTTAATGTTAAGACAACCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATCTGAGTAGAACCCAACGTGGGCATGATCTTGATGAGCATATCAGTGACGATTCTGAACCACCACCTGAGAAGGTCCAGGATTGTAACTGGTATCCTGATTCCGG
AGCTACAAACCATTTAACCAACAGCCTCGGTAATATGTTTGTGAGCTCTGAATATCTTGGAAAAATCAGTGTCAGTCAGTTTGCTAGAGATACTGTTGTTTTCTTCGAAT
TTCATCCCACTTTTTGTGTTGTGAAGGATCTAGCAACTAGACAGGCACTTCTTCAAGGGACTCTACATGAAGGGCTATATAAGTTCAACCTGCCAAAGCCTTTGTCGTCA
CCAAACACCATTGTCTCTCAACCTAAGTCTAATAATTTTGGGAAACCTATTCTTTCTCTTCAAACTAATGGGGGTGCTGAATTTAAGCGTTTTATTCCTCTTCTTCAAAC
TCATGGCATAAATCATCGAGTCTCATGTCCTTATACATCTCAACAAAATGGTATAGTTGAGCGAAAGCACAGACATATTGTTGATGTTGGTCTTACATTATTATCTCATT
CATCTATGCCTTTAACGTTTTGGGATGATGTCTTTTCTACAAGTGTCTTTCTTATTAACAGGCTACCTTCTATAGTTCTTGGTGGCTTGAGTCCCTTGGAGAAGCTCTTC
CAGAAGCAACCAGATTATTCCACACTTAAGGTGTTTGGATGTTCTCATCCTATAGTTACTCGAAGTAAAGTAGGCATATTCAAACCTAAGGCTTACCTAACTACTTTTGT
TGATATAGAACCTCCTAAAGTTAAAGAGGTTTTTAAGTGTTCTCATTGGAAGAATGCTATACAAGATGAATATGATGCTCTTATTAAGAATAATACTTGGGATCTTGTTC
CTACACCTTTGAATCAAAAGATTGATATTAATAATGTCTTCTTGCATGGTATATTATCTGAGACTGTGTATATGGATCAGCCCGCTGGTTTTCATGTTAAGGGTGCTACT
TCTTTAGTTTGTCGGCTATGCAAAGCTTTATATGGTCTTAAACAATTTCCATGGGCTTGGTTTGAGCGCCTGAGTTTGTTTCTTAACTCTCTTTGTTTTCTAAATTCTAA
GGCTGATCCTTCTTTATTGTTTCGACGCTCGGGACTCATTCGTCTCGATGCTACGACGAAATTTTGCTATTTAAGCATCCTTTCGGCTAGGTTTTTGGGGACTTCGATTT
GGGACTCTTGGAAGCCGAAAAACAGGGCAGAACAGAGGCTTGGAAGGCTGAAACAGAGGGAGCGAAGTGGAAATCAACCCATTGTTCGTGGGGATCGTGACGGGGACGTT
CGCCGTTGGGCACGTTTTGGATTAATTTACGGTATTTATGCTATTGATGTCATTTATGCATGTGTTATAGGGATATCGGATTCAATTGCATATGGGAGAATCATAACTAA
AATGGTTGATGCAAATCCAATTACTACTCCTATGGTCAGTGGTCCCTTATTATCTGCCCATCAAGATTGGGCTTCTGACCTTGATGATCGTAAGTCAACTTCTGGCTTTT
GTGTTTTCTTTGTGGTAACCTCATTACATGGGGATCTAAGAAACAGACGGAGTTTGAACGTGCAACATTTACCAACTTCTGATCAGATTGTCGATGTTCTCACAAATCTG
TTGTCTGCTGTCAATTTTCTAAAGCTTCGGTCCAAGCTCAATGTTCGAGAGCCCTCTTCCATTGGCTTGAGAGGGGGTGTTAATGTTAAGACAACCCATTGA
Protein sequenceShow/hide protein sequence
MYLSRTQRGHDLDEHISDDSEPPPEKVQDCNWYPDSGATNHLTNSLGNMFVSSEYLGKISVSQFARDTVVFFEFHPTFCVVKDLATRQALLQGTLHEGLYKFNLPKPLSS
PNTIVSQPKSNNFGKPILSLQTNGGAEFKRFIPLLQTHGINHRVSCPYTSQQNGIVERKHRHIVDVGLTLLSHSSMPLTFWDDVFSTSVFLINRLPSIVLGGLSPLEKLF
QKQPDYSTLKVFGCSHPIVTRSKVGIFKPKAYLTTFVDIEPPKVKEVFKCSHWKNAIQDEYDALIKNNTWDLVPTPLNQKIDINNVFLHGILSETVYMDQPAGFHVKGAT
SLVCRLCKALYGLKQFPWAWFERLSLFLNSLCFLNSKADPSLLFRRSGLIRLDATTKFCYLSILSARFLGTSIWDSWKPKNRAEQRLGRLKQRERSGNQPIVRGDRDGDV
RRWARFGLIYGIYAIDVIYACVIGISDSIAYGRIITKMVDANPITTPMVSGPLLSAHQDWASDLDDRKSTSGFCVFFVVTSLHGDLRNRRSLNVQHLPTSDQIVDVLTNL
LSAVNFLKLRSKLNVREPSSIGLRGGVNVKTTH