; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018465 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018465
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr5:27860186..27865846
RNA-Seq ExpressionLag0018465
SyntenyLag0018465
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]1.7e-10934.31Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETISVD---GVTKPNPAYEIWFEKDQALITLINATLSPV--------RPSLCNWI----------------LKSDLQS
        +S +LKAH L G++DG    P + +  +      + NP Y+IW  +DQAL+TL+NATLS            S   W+                LKS L +
Subjt:  VSPLLKAHKLFGFLDGKISAPAETISVD---GVTKPNPAYEIWFEKDQALITLINATLSPV--------RPSLCNWI----------------LKSDLQS

Query:  ITKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVS--------------
        I+K   +SI+SY+Q+IK   + LA+VSV+I+ ED+ IY LNGLP  YN FKT++RT+S++ + EE++ ++  EE  ++   K   S              
Subjt:  ITKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVS--------------

Query:  -----------------------------------------------------SQTLAMNANFLAVVVVILEIAEDLVVMAVVVADLVV-------ELAA
                                                              Q+   + N   VV  I        +      D          +L A
Subjt:  -----------------------------------------------------SQTLAMNANFLAVVVVILEIAEDLVVMAVVVADLVV-------ELAA

Query:  LAASTTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTHTGQ----------------------TNGAST-------
        ++A+   G    PN      W +D+G   H+T+DLAN+    EY G+ N+T+ +GQAL ++H+GQ                      TN  S        
Subjt:  LAASTTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTHTGQ----------------------TNGAST-------

Query:  ----------------------FHGPSINGLYPLTTQS------------------SPHVPSHL------------TAQLGTKSHSSTWHDRLGHPNNST
                              F GPS +GLYPL T S                  + H  +H             TA LG +  +  WHDRLGHP+ +T
Subjt:  ----------------------FHGPSINGLYPLTTQS------------------SPHVPSHL------------TAQLGTKSHSSTWHDRLGHPNNST

Query:  LASVLRFLNVN-NSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVE
        L S+L   ++         C HCL GKM KL FPLS++ S+ PL+L+HSD+WGP+P  S +   YY+SF+DD S                          
Subjt:  LASVLRFLNVN-NSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVE

Query:  TLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEI
                  RSDGGGEY    L     + GI H++SCPHTP+QNGIAERKHRHI+E  L LLS++S+PLKYW  AF+ A ++INR+P+  L + SP+E 
Subjt:  TLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEI

Query:  LFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTS-PTAS
        LFH PPDYT L+ FG ACYPLL+PY  +K+QP+TTQC F+GY L YKG+ CLD ST ++Y++RHV+FDE  +PF    S PT++
Subjt:  LFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTS-PTAS

PKU76601.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]1.3e-10936.81Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETI-SVDGVTKPNPAYEIWFEKDQ----ALITLINATLSPVRPSL--CN--W-ILKSDLQSITK--------------
        V  L +A+   GFLDG  S P++ I S  G + PNPAY  W   DQ    AL ++I+ T+ P   S+  C+  W IL   LQS T+              
Subjt:  VSPLLKAHKLFGFLDGKISAPAETI-SVDGVTKPNPAYEIWFEKDQ----ALITLINATLSPVRPSL--CN--W-ILKSDLQSITK--------------

Query:  LPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAMNANFLAVV---
        +  +++  Y+  IK  V+ LAA    I  E++  Y L+GLPS+Y  FKT++RT  Q  S ++L+ L+ SEE  L  +   E+ S  ++ N+  LA     
Subjt:  LPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAMNANFLAVV---

Query:  -------------------------------VVILEIAEDLVVMAVVVADLVVELAALAASTT-YGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSS
                                        +  +I       AV       +       TT    S VP   +   W  DSG +AHLT D + +Q S 
Subjt:  -------------------------------VVILEIAEDLVVMAVVVADLVVELAALAASTT-YGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSS

Query:  EYNGEANVTVGSGQALPVTHTG---------------------------------------------------QTNGASTFHGPSINGLYPLTTQSSPHV
         Y G   VT+G+GQ LP+ +TG                                                   +TN      GP INGLYPL    +   
Subjt:  EYNGEANVTVGSGQALPVTHTG---------------------------------------------------QTNGASTFHGPSINGLYPLTTQSSPHV

Query:  PSHLTAQLGTKSHSSTWHDRLGHPNNSTLASV-LRFLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDD
         +   A +  ++  + WH RLGHP+  TLA++  +F N+ N S  + C  C  GK  +L F  S + SS+P EL+HSDVWGPS   S  G RYY+SFIDD
Subjt:  PSHLTAQLGTKSHSSTWHDRLGHPNNSTLASV-LRFLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDD

Query:  LSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKY
         S+YTWIYPL  KSDV     +F+  +E    T ++  R+DGGGE++NN  +TY    GI+HQ +CP+TP QNG+AERKHRHI E    LL ++ +P   
Subjt:  LSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKY

Query:  WFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVF
        W      AV +INRLPSP   N+SP+EIL+ +PPDYTHL+VFG  CYP L+PY +HK  P +  CVFIGY    KGY CLDP   +++ SRHVVF+E +F
Subjt:  WFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVF

Query:  PFSSFTSPTASSSSSPLPQPSSPIPSLL
        PF    +   S+S+      ++PIP LL
Subjt:  PFSSFTSPTASSSSSPLPQPSSPIPSLL

PKU76715.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]6.5e-10936.36Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAET-ISVDGVTKPNPAYEIWFEKDQ----ALITLINATLSPVRPSL--CN--W-ILKSDLQSITK--------------
        V  L +A+   GFLDG  S P +T IS  G + PNPAY  W   DQ    AL ++I+ T+ P   S+  C+  W IL   LQS T+              
Subjt:  VSPLLKAHKLFGFLDGKISAPAET-ISVDGVTKPNPAYEIWFEKDQ----ALITLINATLSPVRPSL--CN--W-ILKSDLQSITK--------------

Query:  LPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAMNANFLAVV---
        +  +++  Y+  IK  V+ LAA    I  E++  Y L+GLPS+Y  FKT++RT  Q  S ++L+ L+ SEE  L  +   E+ S  ++ N+  LA     
Subjt:  LPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAMNANFLAVV---

Query:  -----------------VVILEIAEDLVVMAVVVADLVVELAALAASTTYGQSSVPNVANNQV-------------WLSDSGCNAHLTSDLANMQVSSEY
                                      + +   +  +    A    Y      N                   W  DSG +AHLTSD   +Q S  Y
Subjt:  -----------------VVILEIAEDLVVMAVVVADLVVELAALAASTTYGQSSVPNVANNQV-------------WLSDSGCNAHLTSDLANMQVSSEY

Query:  NGEANVTVGSGQALPVTHTG---------------------------------------------------QTNGASTFHGPSINGLYPLTTQSSPHVPS
         G   VTVG+GQ L + +TG                                                   +T+      GP INGLYPL    +    +
Subjt:  NGEANVTVGSGQALPVTHTG---------------------------------------------------QTNGASTFHGPSINGLYPLTTQSSPHVPS

Query:  HLTAQLGTKSHSSTWHDRLGHPNNSTLASV-LRFLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLS
           A +  ++ S+ WH RLGHP+ +TLA++  +F  + N+S  ++C  C  GK  +L F  S + SS+P EL+HSDVWGPS   S +G RYY+SFIDD S
Subjt:  HLTAQLGTKSHSSTWHDRLGHPNNSTLASV-LRFLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLS

Query:  RYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWF
        +YTWIYPL  KSDV     +F   +E    T ++  R+DGGGE++NN  + Y    GI+HQ +CP+TP QNG+AERKHRHI E    LL ++S+P   W 
Subjt:  RYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWF

Query:  HAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPF
             AV +INRLPSP   N+SP EIL+ +PPDYT+L+VFG  CYP L+PY  HK  P +  CVFIGY    KGY CLDP   +++ SRHVVF+E +FPF
Subjt:  HAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPF

Query:  SSFTSPTASSSSSPLPQPSSPIPSLL
            +   S+S+     P++PIP LL
Subjt:  SSFTSPTASSSSSPLPQPSSPIPSLL

PKU83090.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]5.7e-10534.76Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTK-PNPAYEIWFEKDQALITLINATLSP-VRPSLCN-------WI----------------LKSDLQSIT
        V  L +A+    FL+   S P + I    +T   NP Y  W   DQ L   + +T+SP + P + N       W                 LK++L +I+
Subjt:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTK-PNPAYEIWFEKDQALITLINATLSP-VRPSLCN-------WI----------------LKSDLQSIT

Query:  KLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEV--SSQTLAM--------
         + T ++  Y+  IK +V+++A    V+D ED+ +Y LNGLP SY  FKT +RT     S ++L+ L+ SEE  +       +  +   +A+        
Subjt:  KLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEV--SSQTLAM--------

Query:  ------NANFLAVVVVILEIAEDLVVMAVVVADLVVELAALAASTTY--GQSSVPNVANNQV----------WLSDSGCNAHLTSDLANMQVSSEYNGEA
              N N    V    +        +     +  +    AA+  +   ++ VP   N  +          W  DSG ++HLT+ L N+ V+  YNG  
Subjt:  ------NANFLAVVVVILEIAEDLVVMAVVVADLVVELAALAASTTY--GQSSVPNVANNQV----------WLSDSGCNAHLTSDLANMQVSSEYNGEA

Query:  NVTVGSGQALPVTHTG---------QTNGASTFH------------------------------------------GPSINGLYPLTTQSSPHVPSHLTA
        +VT+G G+++ + H+G         + N A   H                                          GP  +GLYP+    +    S LTA
Subjt:  NVTVGSGQALPVTHTG---------QTNGASTFH------------------------------------------GPSINGLYPLTTQSSPHVPSHLTA

Query:  QLGTKSHSSTWHDRLGHPNNSTLASVLR---FLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRY
            K     WH+RLGHP+   L  V +    LN++ +  K +C  C+  K  KL F  S +    PLE++HSDVWGP+P  S+ G R+Y+ FIDD +R+
Subjt:  QLGTKSHSSTWHDRLGHPNNSTLASVLR---FLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRY

Query:  TWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHA
        TW+YP+ +K++V SI + FK +VE L S ++K  RSDGG EY N+    +    GI HQ SCP+TPEQNG+AERKHRHI+E   +LL  +SVP KYW  A
Subjt:  TWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHA

Query:  FACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSS
           AV++INR+PS T  N+SPFE+++ + P+Y ++R FGCACYPL+ P + HKLQPR   CVF+GY   YKGY CLD S+ ++++SRHV FDEH+FPFS+
Subjt:  FACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSS

Query:  FTSPTASSSSSPLPQPSSPIPS
          + ++ +  +   QPS  IP+
Subjt:  FTSPTASSSSSPLPQPSSPIPS

PKU87026.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]1.8e-10635.64Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETISV-DGVTKPNPAYEIWFEKDQALITLINATLS-PVRPSLCN-------W----------------ILKSDLQSIT
        ++ +L+A+    FLD KI  P+  +   +G + PNPAY  W   DQ L+  I +T+S  V P + N       W                 LK++L +IT
Subjt:  VSPLLKAHKLFGFLDGKISAPAETISV-DGVTKPNPAYEIWFEKDQALITLINATLS-PVRPSLCN-------W----------------ILKSDLQSIT

Query:  KLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQ--------------
         +  +S+  Y+  IK +V+++AA    +D ED+ +Y LNGLP  Y  FKT +RT     S + L+ L+ SEE  +       ++ Q              
Subjt:  KLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQ--------------

Query:  ----TLAMNANFLAVVVVILEIAEDLVVMAVVVADLVVELAALAASTTYGQSSVPNVANNQV----WLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGS
             L  N+N  +       I +         A+    + +     +   S    VA ++     W  DSG ++HLT+ L N+Q+S+ Y     +TVG 
Subjt:  ----TLAMNANFLAVVVVILEIAEDLVVMAVVVADLVVELAALAASTTYGQSSVPNVANNQV----WLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGS

Query:  GQALPVTHTG---------------------------------QTNGAS------------------TFHGPSINGLYPLTTQSSPHVPSHLTAQLGTKS
        G+++P+ H+G                                 + N  S                     GP   GLYP+ + SS    + L+A   T +
Subjt:  GQALPVTHTG---------------------------------QTNGAS------------------TFHGPSINGLYPLTTQSSPHVPSHLTAQLGTKS

Query:  HSSTWHDRLGHPNNSTLASVLRF-LNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTY
         S+ WH RLGHP+   L ++ +   ++N S   S C  C   K  KL F  S + S+  LELIHSDVWGPSP  S    +YY+ F+DD SR+TW++P+ +
Subjt:  HSSTWHDRLGHPNNSTLASVLRF-LNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTY

Query:  KSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFII
        KS+V +I   FK  +E L S K+K  R+DGG EYVN+ L+ +  ++GI HQ SCP+TPEQNG+AERKHRH+IE    LL  SSVP KYW  A   A ++I
Subjt:  KSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFII

Query:  NRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASS
        NR+PSPT+ N SP+E L+H+ P Y HLR FGC C+PL   ++ HKLQP+    VF+GY   YKGY CLD +T +I+ISRH  F+E  FPFS     T + 
Subjt:  NRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASS

Query:  SSSPLPQ
        S S  P+
Subjt:  SSSPLPQ

TrEMBL top hitse value%identityAlignment
A0A2N9F8N5 Uncharacterized protein1.7e-11837.42Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAE-TISVDG--VTKPNPAYEIWFEKDQALITLINATLSP-VRPSLCN-------W----------------ILKSDLQS
        +S +L+A+ +  F+DG + +P+   +  +G   T  NP ++ W  +DQAL+TLIN+TLSP V P +         W                 LK +L +
Subjt:  VSPLLKAHKLFGFLDGKISAPAE-TISVDG--VTKPNPAYEIWFEKDQALITLINATLSP-VRPSLCN-------W----------------ILKSDLQS

Query:  ITKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHAL-----------------------
        + K  +ES++SY+Q+IK+  +KL AV  +ID EDL    L GLP  +  F +++RTR+   SFEE+ VL+ +EE +L                       
Subjt:  ITKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHAL-----------------------

Query:  --DQQVKSEVSSQTLAM---------------------NANFLAVVVVILEIAEDLVVMAVVVADLVVEL--------------AALAASTTYGQSSVPN
          +++V S  +S + A                      NA F        E +  L  +   +    ++               A LAA  +   SS   
Subjt:  --DQQVKSEVSSQTLAM---------------------NANFLAVVVVILEIAEDLVVMAVVVADLVVEL--------------AALAASTTYGQSSVPN

Query:  VANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTH--TGQ-----------------------------------------------
        +   + WL+D+G   HLTS+L N+   + Y G   V VG+GQA+P+ +  TGQ                                               
Subjt:  VANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTH--TGQ-----------------------------------------------

Query:  --TNGASTFHGPSINGLYPLTTQSSPHVP------SHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL----NVNNSSMKSACIHCLNGKMCKLSFPL
           +G   + G S NGLYP+ TQ  P VP      S + A L +K+    WH RLGHP++  L S +  L    +VNN  ++  C HCL GKM KL F  
Subjt:  --TNGASTFHGPSINGLYPLTTQSSPHVP------SHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL----NVNNSSMKSACIHCLNGKMCKLSFPL

Query:  SSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQ
        S   S+ PLEL+HSDVWGP+P  S NG R+Y+ F+DD SR++W+Y L  KSDV      F+  VE  LS K+K  R+D GGEY +N   T+   +GI+H 
Subjt:  SSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQ

Query:  KSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTT
         SCPHTP+QNG+ ERKHRH+IE AL LLS + + + YW +A + AV +INRLP+PTL +++P+E+LFHKPPD  HLR FGC C+P LRPY +HKLQPRTT
Subjt:  KSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTT

Query:  QCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASSSSSPLPQPSS
         C+F+GYP   KGY+CLDP T R+YISRHV+F+E  F        +  S SSP P  SS
Subjt:  QCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASSSSSPLPQPSS

A0A2N9G7E3 Integrase catalytic domain-containing protein4.0e-12037.47Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTKP--NPAYEIWFEKDQALITLINATLSPVRPSLC--------NW----------------ILKSDLQSI
        ++  LKA+KL   +DG    P E  + D    P  N  +  W  KDQALI++I ATLSP   +L          W                 LK DL SI
Subjt:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTKP--NPAYEIWFEKDQALITLINATLSPVRPSLC--------NW----------------ILKSDLQSI

Query:  TKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAM---------
         K   ESIN Y+Q+IK+  +KL AV V I+ E++    L+GLP+ +  F + +RTR+   SFEELHVLM  EE +L++  +S   S  LAM         
Subjt:  TKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAM---------

Query:  --------------------------------------NAN----------------------FLAVVVVILEIAEDL---VVMAVVVADLVVELAALAA
                                              N+N                         +       A D    +  A        +LAA+A 
Subjt:  --------------------------------------NAN----------------------FLAVVVVILEIAEDL---VVMAVVVADLVVELAALAA

Query:  STTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTHTGQT-------------------------------------
        S+        N +++  W+SD+G   H T DLAN+Q + +YNG   VTVG+GQ LP+TH G +                                     
Subjt:  STTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTHTGQT-------------------------------------

Query:  --------------NGASTFHGPSINGLYPL--------------TTQSSPHVPS-HLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL---NVNNSSM
                      +G   + G +  GLYP+               T +   +PS H +A   TK  SSTWH RLGHPN+  L SV + L    +++SS 
Subjt:  --------------NGASTFHGPSINGLYPL--------------TTQSSPHVPS-HLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL---NVNNSSM

Query:  KSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGG
         S C HC  GKM +L F  S ++++ PL+L+HSDVWGP+P  SING+RYY+SFIDD S++TW +PL +KS V S    FK  +E LL+ K+K  R+D GG
Subjt:  KSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGG

Query:  EYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGC
        EY ++  + Y    GI HQ SCPHTP+QNG+AERKHRHIIE AL L+S+SS+PL YW +AFA ++F+INRLP+ +L  +SP+E+LFH PPDY+  +VFGC
Subjt:  EYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGC

Query:  ACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASSSSSPLP
        +CYPLL PY  HKLQ ++ +C+F+GY    KGY+CLDP+  ++ +SR+V FDE  FPFS+ TS T ++ ++P P
Subjt:  ACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASSSSSPLP

A0A2N9H0D6 Uncharacterized protein2.2e-11837.98Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAE-TISVDG--VTKPNPAYEIWFEKDQALITLINATLSPVRPSLCNWILKSDLQSITKLPTESINSYVQRIKDIVNKLA
        +S +LKA+ +  F+DG + +P+   ++ +G   T  NP ++IW  +DQAL+TLIN+TLS    S+       +L ++ K  +E+++SY+Q++K+  +KL 
Subjt:  VSPLLKAHKLFGFLDGKISAPAE-TISVDG--VTKPNPAYEIWFEKDQALITLINATLSPVRPSLCNWILKSDLQSITKLPTESINSYVQRIKDIVNKLA

Query:  AVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAMNANFLAVVVVILEIA-----------------
        AV  +ID E++    L GLP  Y  F + +RTR++  +FEE+ VL+ +EE ++ +   S     ++AM A+         +++                 
Subjt:  AVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAMNANFLAVVVVILEIA-----------------

Query:  ---------EDLVVMAVVVADLVVELAALAASTTYGQSSVPNVAN----------NQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTH
                 +                ++ +   + G      +            +++  +  G   HLTS+L N+ V + + G   V VG+GQ++P+ +
Subjt:  ---------EDLVVMAVVVADLVVELAALAASTTYGQSSVPNVAN----------NQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTH

Query:  TGQ------------------------TNGASTFHGPSINGLYPLTTQSSPH----------VPSHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL-
         G                         ++G   + G S NGLYP+ T    H          V   ++A L +K+    WH RLGHP++  L S +  L 
Subjt:  TGQ------------------------TNGASTFHGPSINGLYPLTTQSSPH----------VPSHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL-

Query:  ---NVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTK
           +VNN  ++  C HC+ GKM KL F  S  +S+ PLELIHSDVWGP+P  S NG RYY+ F+DD SR++W+Y L +KSDV +  + FK  +E  LS K
Subjt:  ---NVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTK

Query:  VKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPP
        +K  R+D GGEY +N    Y   +GI HQ SCPHTP+QNGI ERKHRHI+E A+ LLS +S+P+ +W HA   A+ +INR+P+P L ++SP+E+LFHKPP
Subjt:  VKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPP

Query:  DYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSS---FTSPTASSS
        D THL+ FGC C+P LRPY  HKLQPRTT C+F+GYP   KGY+CLD  T R YISRHV+F+E  F  SS     SPT+SSS
Subjt:  DYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSS---FTSPTASSS

A0A2N9HZ49 Uncharacterized protein6.8e-12037.14Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETIS-VDG--VTKPNPAYEIWFEKDQALITLINATLSPVRPSLCNWILKSDLQSITKLPTESINSYVQRIKDIVNKLA
        +S +LKA+ +  F+DG I +P++ ++  +G   T  NP +++W  +DQAL+TLINATLS   PS+ + +L +      K  ++S++SY+QR+K+  +KL 
Subjt:  VSPLLKAHKLFGFLDGKISAPAETIS-VDG--VTKPNPAYEIWFEKDQALITLINATLSPVRPSLCNWILKSDLQSITKLPTESINSYVQRIKDIVNKLA

Query:  AVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAM--------------------------------
        AV  +ID E++    L GLP  Y  F + +RTR++  SFEE+ VL+ +EE +L +   S     ++AM                                
Subjt:  AVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAM--------------------------------

Query:  ---------NANFLAVVVVILEIAEDLVVMAVVVADLVVE-----------------LAALAASTTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQV
                 NA F        E ++ +  +   V    ++                 LAA+A+++   Q+        + WL+D+G   HLTS+L N+  
Subjt:  ---------NANFLAVVVVILEIAEDLVVMAVVVADLVVE-----------------LAALAASTTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQV

Query:  SSEYNGEANVTVGSGQALPVTH--TGQ-------------------------------------------------TNGASTFHGPSINGLYPLTTQSSP
         + Y G   V VG+GQ++P+ +  TGQ                                                  +G   + G S NGLYP+ TQ  P
Subjt:  SSEYNGEANVTVGSGQALPVTH--TGQ-------------------------------------------------TNGASTFHGPSINGLYPLTTQSSP

Query:  HVP---------SHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL----NVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPEL
         VP         S + A L +++    WH RLGHP++  L S +  L    +VNN  ++  C HCL GKM KL F  S   S++PLEL+HSDVWGP+P  
Subjt:  HVP---------SHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFL----NVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPEL

Query:  SINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEM
        S NG R+Y+ F+DD SR++W+Y L  KSDV      F+  VE LLS K+K  R+  GGEY +N +  +    GI H  SCPHTP+QNGI ERKHRH+IE 
Subjt:  SINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEM

Query:  ALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTER
        AL LLS + + + +W +A + AV IINRLP+P L +++P+E+LFHKPPD THL+ FGC C+P LRPY +HKLQPR+T C+F+GYP   KGY+CLDP T R
Subjt:  ALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTER

Query:  IYISRHVVFDEHVFPFSSFTSPTASSSSSPLPQPS
        +YISRHV+F+E  F        +  S S+P P  S
Subjt:  IYISRHVVFDEHVFPFSSFTSPTASSSSSPLPQPS

A0A2N9J0Y8 Integrase catalytic domain-containing protein3.4e-11937.57Show/hide
Query:  LSVSPLLKAHKLFGFLDGKISAPAETISVD-GVTKPNPAYEIWFEKDQALITLINAT--------------------------LSPVRPSLCNWILKSDL
        L ++ +L A+ +   LDG +  P++ ++ + G+   NP + IW +KD+AL+TL+ +T                           S  R ++ N  LK +L
Subjt:  LSVSPLLKAHKLFGFLDGKISAPAETISVD-GVTKPNPAYEIWFEKDQALITLINAT--------------------------LSPVRPSLCNWILKSDL

Query:  QSITKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAM------
        QSI K   E+++SY+QRIK + +KL+AV V  D E+L    L GLP  +  F + +RTR    S E+L VL+ +EE ++++  +S +S+  LAM      
Subjt:  QSITKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAM------

Query:  -----NANFLAVVVVILEIAEDLVVMAVVVADLVVELAALAASTTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVT
             N  F              +  A    +   +LAA+A++     S++    N + WL+D+G   H+T++  N+   + Y G+  V+VG+GQ LP+ 
Subjt:  -----NANFLAVVVVILEIAEDLVVMAVVVADLVVELAALAASTTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVT

Query:  HTGQT----------------NGASTFHGPSINGLYPLTTQSSPHVPSHLTAQLGTKSHSS----TWHDRLGHPNNSTLASVLRFLNVNNSS--MKSACI
        +                     G   + G S NG+YP+  QS   +P+         S SS     WH RLGHP+   LA+V    + N+SS  +K  C 
Subjt:  HTGQT----------------NGASTFHGPSINGLYPLTTQSSPHVPSHLTAQLGTKSHSS----TWHDRLGHPNNSTLASVLRFLNVNNSS--MKSACI

Query:  HCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNN
        HCL GKM +L FP S+   + P EL+H+D+WGP+P ++ N  R+Y+ F+D+ +++TW+Y L +KS+   + +QF+  ++T  S  +K  R+D GGE+++ 
Subjt:  HCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNN

Query:  TLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPL
            +    GI+H  SCPHTP+QNG+AERKHRH+++ ALALLS+S +P+ YW +A + A  +IN+LP+P LGN+SP+E L+H  PD THL+ FGC C+PL
Subjt:  TLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPL

Query:  LRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASSSSSPLPQPSSPIPSLLRLFLI
        L PY SHKL P+TT CVFIGYPL  KGY CLDP+T RIY SRHV+F+E VFP     S   S+++S     S+ + S L+  L+
Subjt:  LRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASSSSSPLPQPSSPIPSLLRLFLI

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-3932.21Show/hide
Query:  KSHSSTWHDRLGHPNNSTLASVLRFLNVNNSSMKS-------ACIHCLNGKMCKLSFP--LSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLS
        K++   WH+R GH ++  L  + R    ++ S+ +        C  CLNGK  +L F      ++   PL ++HSDV GP   ++++   Y++ F+D  +
Subjt:  KSHSSTWHDRLGHPNNSTLASVLRFLNVNNSSMKS-------ACIHCLNGKMCKLSFP--LSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLS

Query:  RYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWF
         Y   Y + YKSDV S+   F    E   + KV     D G EY++N +R +  K GI +  + PHTP+ NG++ER  R I E A  ++S + +   +W 
Subjt:  RYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWF

Query:  HAFACAVFIINRLPSPTL--GNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGY-PLGYKGYLCLDPSTERIYISRHVVFDE
         A   A ++INR+PS  L   +++P+E+  +K P   HLRVFG   Y  ++     K   ++ + +F+GY P G+K +   D   E+  ++R VV DE
Subjt:  HAFACAVFIINRLPSPTL--GNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGY-PLGYKGYLCLDPSTERIYISRHVVFDE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-4835.82Show/hide
Query:  WHDRLGHPNNSTLASVLR--FLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSD
        WH R+GH +   L  + +   ++    +    C +CL GK  ++SF  SS      L+L++SDV GP    S+ G++Y+++FIDD SR  W+Y L  K  
Subjt:  WHDRLGHPNNSTLASVLR--FLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKSD

Query:  VPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRL
        V  +  +F   VE     K+K  RSD GGEY +     Y   +GI H+K+ P TP+ NG+AER +R I+E   ++L  + +P  +W  A   A ++INR 
Subjt:  VPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRL

Query:  PSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDE
        PS  L    P  +  +K   Y+HL+VFGC  +  +      KL  ++  C+FIGY     GY   DP  +++  SR VVF E
Subjt:  PSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDE

Q07163 Transposon TyH3 Gag-Pol polyprotein2.7e-1727.44Show/hide
Query:  HDRLGHPNNSTLA-----SVLRFLNVNNSSMKSA----CIHCLNGKMCK----LSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYT
        H  L H N  T+      + + + N ++    SA    C  CL GK  K        L    S  P + +H+D++GP   L  +   Y+ISF D+ +++ 
Subjt:  HDRLGHPNNSTLA-----SVLRFLNVNNSSMKSA----CIHCLNGKMCK----LSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYT

Query:  WIYPL--TYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFH
        W+YPL    +  +  + +    F++      V   + D G EY N TL  + EK GI    +       +G+AER +R +++     L  S +P   WF 
Subjt:  WIYPL--TYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFH

Query:  AFACAVFIINRLPSP
        A   +  + N L SP
Subjt:  AFACAVFIINRLPSP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-8733.2Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTKPNPAYEIWFEKDQALITLINATLS-PVRPSLCN-------W----------------ILKSDLQSITK
        V  L   ++L GFLDG  + P  TI  D   + NP Y  W  +D+ + + +   +S  V+P++         W                 L++ L+  TK
Subjt:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTKPNPAYEIWFEKDQALITLINATLS-PVRPSLCN-------W----------------ILKSDLQSITK

Query:  LPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHV-LMNSEEHAL-------------------------
          T++I+ Y+Q +    ++LA +   +D ++     L  LP  Y      +  +    +  E+H  L+N E   L                         
Subjt:  LPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHV-LMNSEEHAL-------------------------

Query:  ----------DQQVKSEVSSQTLAMNANFLAVVVVILEIAEDLVVMAVV--VADLVVELAALAASTTYGQSSVP--------NVA-----NNQVWLSDSG
                  D +  +  S      + NF               +  V    A    +L    +S    Q   P        N+A     ++  WL DSG
Subjt:  ----------DQQVKSEVSSQTLAMNANFLAVVVVILEIAEDLVVMAVV--VADLVVELAALAASTTYGQSSVP--------NVA-----NNQVWLSDSG

Query:  CNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTHTGQTN---------------------------------------------------GASTFHGP
           H+TSD  N+ +   Y G  +V V  G  +P++HTG T+                                                   G     G 
Subjt:  CNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTHTGQTN---------------------------------------------------GASTFHGP

Query:  SINGLYPLTTQSSPHVPSHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLR--FLNVNNSSMK-SACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGP
        + + LY     SS   P  L A   +K+  S+WH RLGHP  S L SV+    L+V N S K  +C  CL  K  K+ F  S+  S+ PLE I+SDVW  
Subjt:  SINGLYPLTTQSSPHVPSHLTAQLGTKSHSSTWHDRLGHPNNSTLASVLR--FLNVNNSSMK-SACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGP

Query:  SPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRH
        SP LS +  RYY+ F+D  +RYTW+YPL  KS V      FK  +E    T++ TF SD GGE+V   L  YF ++GI H  S PHTPE NG++ERKHRH
Subjt:  SPELSINGSRYYISFIDDLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRH

Query:  IIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDP
        I+E  L LLS +S+P  YW +AFA AV++INRLP+P L   SPF+ LF   P+Y  LRVFGCACYP LRPY  HKL  ++ QCVF+GY L    YLCL  
Subjt:  IIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDP

Query:  STERIYISRHVVFDEHVFPFSSF
         T R+YISRHV FDE+ FPFS++
Subjt:  STERIYISRHVVFDEHVFPFSSF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.3e-8733.1Show/hide
Query:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTKPNPAYEIWFEKDQALITLINATLS-PVRPSLCNWILKSDL-QSITKLPTESINSYVQRIKDIV--NKL
        V  L   ++L GFLDG    P  TI  D V + NP Y  W  +D+ + + I   +S  V+P++      + + +++ K+       +V +++ I   ++L
Subjt:  VSPLLKAHKLFGFLDGKISAPAETISVDGVTKPNPAYEIWFEKDQALITLINATLS-PVRPSLCNWILKSDL-QSITKLPTESINSYVQRIKDIV--NKL

Query:  AAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHV-LMNSEEHALDQQVKSEVSSQTLAMNANFLAVVVVILEIAED-------------
        A +   +D ++     L  LP  Y      +  +    S  E+H  L+N E   L        S++ + + AN +          ++             
Subjt:  AAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHV-LMNSEEHALDQQVKSEVSSQTLAMNANFLAVVVVILEIAED-------------

Query:  ---------------------------LVVMAVVVADLVVELAALAASTTYGQSSVP--------NVA-----NNQVWLSDSGCNAHLTSDLANMQVSSE
                                   +  +    A    +L    ++T   QS+ P        N+A     N   WL DSG   H+TSD  N+     
Subjt:  ---------------------------LVVMAVVVADLVVELAALAASTTYGQSSVP--------NVA-----NNQVWLSDSGCNAHLTSDLANMQVSSE

Query:  YNGEANVTVGSGQALPVTHTGQTN---------------------------------------------------GASTFHGPSINGLYPLTTQSSPHVP
        Y G  +V +  G  +P+THTG  +                                                   G     G + + LY     SS  V 
Subjt:  YNGEANVTVGSGQALPVTHTGQTN---------------------------------------------------GASTFHGPSINGLYPLTTQSSPHVP

Query:  SHLTAQLGTKSHSSTWHDRLGHPNNSTLASVL--RFLNVNNSSMK-SACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFID
          + A   +K+  S+WH RLGHP+ + L SV+    L V N S K  +C  C   K  K+ F  S+  SS PLE I+SDVW  SP LSI+  RYY+ F+D
Subjt:  SHLTAQLGTKSHSSTWHDRLGHPNNSTLASVL--RFLNVNNSSMK-SACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFID

Query:  DLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLK
          +RYTW+YPL  KS V      FK  VE    T++ T  SD GGE+V   LR Y  ++GI H  S PHTPE NG++ERKHRHI+EM L LLS +SVP  
Subjt:  DLSRYTWIYPLTYKSDVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLK

Query:  YWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHV
        YW +AF+ AV++INRLP+P L  +SPF+ LF +PP+Y  L+VFGCACYP LRPY  HKL+ ++ QC F+GY L    YLCL   T R+Y SRHV FDE  
Subjt:  YWFHAFACAVFIINRLPSPTLGNRSPFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHV

Query:  FPFSS-----FTSPTASSSSSP
        FPFS+      TS    S S+P
Subjt:  FPFSS-----FTSPTASSSSSP

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.7e-0431.65Show/hide
Query:  KSHSSTWHDRLGHPNNSTLASVLR--FLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWG-PSPELS
        K  +  WH RL H +   +  +++  FL+ +  S    C  C+ GK  +++F      +  PL+ +HSD+WG PS  LS
Subjt:  KSHSSTWHDRLGHPNNSTLASVLR--FLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWG-PSPELS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTTAACCTTTGGGTGTCCTCTAGTTTCAGTTTCGTATCAATTCGGGCTCGAGACAACACTTATTTAAAGGATTTCAGTGCTGTTTGGTATTTTCCAGCAAGGTA
CCCGCCCCTGTTCGTGTGTGGTCATTGGGGAGAGGAAGAGCATGACGCGTATATCATGTGCTGGGTGTGTGTGCGGAGCGTGACACAGAAATCACGTGTGGGTGTGAGTG
CGGAGCGTGACGCGAAAATCACGTGTGAGTGGCTCTACGCATGGCGCGTTATAATACATGTGTTGGGTGGGTTTGCGGAGCGTGACGCGGAAATCACGTATTATGCAGGT
GACGAACTTTATGTGCCCGGTGATATAGATGAGGAGGTTCTGTTGTTGGATGAAGATGCGAGATCTGACTCGTATTCTAATATGAAGTTGTTTCTGTGGAGTTGTGGGGT
CGTTACGCTGCCGAAATTTTCGGAGCTCTCGCGGCGATCATTAGCAGGAGTAGTAACTGCTCAGCAGCAGCGGCGCAGAAGGTCGAAGAAGAGAAATAGGTGTTTCAGAC
TAAGGGAGAAGAAGGGAGGAGAGAAGAGAAGAGAAGGGAGAAAGGAGAGAGAGGAAGAAGGCTATGGACCATCTGGACAAGGGAGAACTGGAAAAGGAAAAATTATCAGT
CTCTTCTTCCTCAGAACAGTCGCTCAGTTCTCCGATTGTTCTTTTGACGAATATCAGCAATCTGATCACCGTTCGTCTGGATTCATCCAATTACGTTCTTTGGCACTTTC
AGTTTCGCCTCTTCTCAAAGCTCACAAGTTATTTGGCTTTCTTGATGGTAAAATTTCTGCTCCAGCTGAGACGATTTCGGTAGATGGAGTTACTAAACCTAATCCGGCTT
ATGAGATTTGGTTTGAGAAGGATCAGGCGTTGATTACACTTATCAATGCGACACTGTCTCCGGTCCGCCCTAGCTTATGCAATTGGATTCTGAAATCTGATTTGCAAAGC
ATCACGAAGTTACCTACTGAATCCATCAATTCTTATGTTCAACGGATCAAAGATATTGTAAATAAACTTGCTGCTGTGTCTGTTGTCATTGATCAAGAGGATCTTGCGAT
TTATGCACTAAACGGACTTCCGTCTTCTTATAATGTGTTTAAAACTACTGTTCGTACAAGATCTCAGCATCATTCTTTTGAGGAGCTACATGTATTGATGAATTCTGAAG
AACACGCACTTGATCAACAAGTCAAAAGTGAGGTTTCCTCTCAAACTCTTGCAATGAATGCTAATTTTCTGGCAGTGGTCGTGGTAATTCTGGAAATAGCAGAGGATTTA
GTGGTAATGGCCGTGGTCGTGGCAGATCTGGTGGTAGAGCTTGCTGCACTTGCTGCTTCAACAACCTATGGACAATCGTCTGTTCCAAATGTAGCAAACAATCAAGTTTG
GCTATCAGACTCAGGCTGCAATGCACATCTCACATCAGATTTGGCAAATATGCAAGTTTCGTCTGAATATAATGGTGAGGCGAATGTAACAGTTGGCAGTGGTCAGGCCT
TACCAGTCACACACACAGGACAAACGAACGGGGCAAGTACTTTCCATGGGCCTAGTATTAACGGTCTATATCCTTTGACCACTCAAAGTTCTCCTCATGTTCCTAGTCAC
CTCACTGCTCAACTAGGAACCAAGTCTCATAGCTCCACTTGGCATGACCGATTAGGCCATCCGAATAATTCTACTCTTGCCTCTGTTTTACGTTTCCTAAATGTTAATAA
TTCTTCTATGAAATCTGCTTGTATTCATTGTTTGAATGGCAAAATGTGCAAGCTTTCTTTTCCATTGTCTAGTTCATATTCGTCTTATCCTTTAGAGCTAATACACAGTG
ATGTATGGGGGCCCTCTCCCGAACTTTCAATAAATGGATCTCGCTATTATATTTCCTTTATTGATGACTTGTCACGCTATACTTGGATATATCCTCTCACTTACAAGTCG
GATGTTCCTAGCATTGTTTCACAATTCAAACCTTTTGTTGAAACATTATTATCTACTAAAGTCAAAACCTTTAGGAGTGATGGAGGGGGTGAGTATGTTAATAATACTCT
TCGTACTTATTTTGAGAAATATGGTATTATGCATCAAAAATCATGTCCCCATACACCCGAACAAAACGGTATTGCCGAACGGAAGCACCGTCATATCATTGAAATGGCTC
TTGCCTTATTGTCAAAGTCTTCTGTTCCTCTTAAGTATTGGTTTCATGCCTTTGCTTGTGCCGTTTTTATTATTAACCGGTTGCCTTCCCCTACTCTTGGCAATCGTTCT
CCTTTTGAGATTCTCTTTCACAAGCCACCTGACTATACACACTTAAGAGTCTTTGGTTGTGCTTGTTATCCCCTTCTTCGTCCTTATGTGTCTCACAAGCTTCAACCCCG
CACAACTCAGTGTGTCTTTATTGGATATCCTCTTGGATACAAAGGATATCTGTGTCTTGATCCTAGTACTGAACGAATATATATCTCCCGACATGTTGTCTTTGATGAAC
ATGTTTTTCCCTTCTCTTCCTTTACTTCTCCTACTGCTTCTTCGAGTTCTTCTCCTCTACCTCAGCCTTCATCTCCTATTCCGTCTTTACTTCGTTTATTTCTCATTCTC
TTGAACAAGCTGTTGAGGAACCAACAAATGTGTCTGCTGTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTTAACCTTTGGGTGTCCTCTAGTTTCAGTTTCGTATCAATTCGGGCTCGAGACAACACTTATTTAAAGGATTTCAGTGCTGTTTGGTATTTTCCAGCAAGGTA
CCCGCCCCTGTTCGTGTGTGGTCATTGGGGAGAGGAAGAGCATGACGCGTATATCATGTGCTGGGTGTGTGTGCGGAGCGTGACACAGAAATCACGTGTGGGTGTGAGTG
CGGAGCGTGACGCGAAAATCACGTGTGAGTGGCTCTACGCATGGCGCGTTATAATACATGTGTTGGGTGGGTTTGCGGAGCGTGACGCGGAAATCACGTATTATGCAGGT
GACGAACTTTATGTGCCCGGTGATATAGATGAGGAGGTTCTGTTGTTGGATGAAGATGCGAGATCTGACTCGTATTCTAATATGAAGTTGTTTCTGTGGAGTTGTGGGGT
CGTTACGCTGCCGAAATTTTCGGAGCTCTCGCGGCGATCATTAGCAGGAGTAGTAACTGCTCAGCAGCAGCGGCGCAGAAGGTCGAAGAAGAGAAATAGGTGTTTCAGAC
TAAGGGAGAAGAAGGGAGGAGAGAAGAGAAGAGAAGGGAGAAAGGAGAGAGAGGAAGAAGGCTATGGACCATCTGGACAAGGGAGAACTGGAAAAGGAAAAATTATCAGT
CTCTTCTTCCTCAGAACAGTCGCTCAGTTCTCCGATTGTTCTTTTGACGAATATCAGCAATCTGATCACCGTTCGTCTGGATTCATCCAATTACGTTCTTTGGCACTTTC
AGTTTCGCCTCTTCTCAAAGCTCACAAGTTATTTGGCTTTCTTGATGGTAAAATTTCTGCTCCAGCTGAGACGATTTCGGTAGATGGAGTTACTAAACCTAATCCGGCTT
ATGAGATTTGGTTTGAGAAGGATCAGGCGTTGATTACACTTATCAATGCGACACTGTCTCCGGTCCGCCCTAGCTTATGCAATTGGATTCTGAAATCTGATTTGCAAAGC
ATCACGAAGTTACCTACTGAATCCATCAATTCTTATGTTCAACGGATCAAAGATATTGTAAATAAACTTGCTGCTGTGTCTGTTGTCATTGATCAAGAGGATCTTGCGAT
TTATGCACTAAACGGACTTCCGTCTTCTTATAATGTGTTTAAAACTACTGTTCGTACAAGATCTCAGCATCATTCTTTTGAGGAGCTACATGTATTGATGAATTCTGAAG
AACACGCACTTGATCAACAAGTCAAAAGTGAGGTTTCCTCTCAAACTCTTGCAATGAATGCTAATTTTCTGGCAGTGGTCGTGGTAATTCTGGAAATAGCAGAGGATTTA
GTGGTAATGGCCGTGGTCGTGGCAGATCTGGTGGTAGAGCTTGCTGCACTTGCTGCTTCAACAACCTATGGACAATCGTCTGTTCCAAATGTAGCAAACAATCAAGTTTG
GCTATCAGACTCAGGCTGCAATGCACATCTCACATCAGATTTGGCAAATATGCAAGTTTCGTCTGAATATAATGGTGAGGCGAATGTAACAGTTGGCAGTGGTCAGGCCT
TACCAGTCACACACACAGGACAAACGAACGGGGCAAGTACTTTCCATGGGCCTAGTATTAACGGTCTATATCCTTTGACCACTCAAAGTTCTCCTCATGTTCCTAGTCAC
CTCACTGCTCAACTAGGAACCAAGTCTCATAGCTCCACTTGGCATGACCGATTAGGCCATCCGAATAATTCTACTCTTGCCTCTGTTTTACGTTTCCTAAATGTTAATAA
TTCTTCTATGAAATCTGCTTGTATTCATTGTTTGAATGGCAAAATGTGCAAGCTTTCTTTTCCATTGTCTAGTTCATATTCGTCTTATCCTTTAGAGCTAATACACAGTG
ATGTATGGGGGCCCTCTCCCGAACTTTCAATAAATGGATCTCGCTATTATATTTCCTTTATTGATGACTTGTCACGCTATACTTGGATATATCCTCTCACTTACAAGTCG
GATGTTCCTAGCATTGTTTCACAATTCAAACCTTTTGTTGAAACATTATTATCTACTAAAGTCAAAACCTTTAGGAGTGATGGAGGGGGTGAGTATGTTAATAATACTCT
TCGTACTTATTTTGAGAAATATGGTATTATGCATCAAAAATCATGTCCCCATACACCCGAACAAAACGGTATTGCCGAACGGAAGCACCGTCATATCATTGAAATGGCTC
TTGCCTTATTGTCAAAGTCTTCTGTTCCTCTTAAGTATTGGTTTCATGCCTTTGCTTGTGCCGTTTTTATTATTAACCGGTTGCCTTCCCCTACTCTTGGCAATCGTTCT
CCTTTTGAGATTCTCTTTCACAAGCCACCTGACTATACACACTTAAGAGTCTTTGGTTGTGCTTGTTATCCCCTTCTTCGTCCTTATGTGTCTCACAAGCTTCAACCCCG
CACAACTCAGTGTGTCTTTATTGGATATCCTCTTGGATACAAAGGATATCTGTGTCTTGATCCTAGTACTGAACGAATATATATCTCCCGACATGTTGTCTTTGATGAAC
ATGTTTTTCCCTTCTCTTCCTTTACTTCTCCTACTGCTTCTTCGAGTTCTTCTCCTCTACCTCAGCCTTCATCTCCTATTCCGTCTTTACTTCGTTTATTTCTCATTCTC
TTGAACAAGCTGTTGAGGAACCAACAAATGTGTCTGCTGTTATGA
Protein sequenceShow/hide protein sequence
MVFNLWVSSSFSFVSIRARDNTYLKDFSAVWYFPARYPPLFVCGHWGEEEHDAYIMCWVCVRSVTQKSRVGVSAERDAKITCEWLYAWRVIIHVLGGFAERDAEITYYAG
DELYVPGDIDEEVLLLDEDARSDSYSNMKLFLWSCGVVTLPKFSELSRRSLAGVVTAQQQRRRRSKKRNRCFRLREKKGGEKRREGRKEREEEGYGPSGQGRTGKGKIIS
LFFLRTVAQFSDCSFDEYQQSDHRSSGFIQLRSLALSVSPLLKAHKLFGFLDGKISAPAETISVDGVTKPNPAYEIWFEKDQALITLINATLSPVRPSLCNWILKSDLQS
ITKLPTESINSYVQRIKDIVNKLAAVSVVIDQEDLAIYALNGLPSSYNVFKTTVRTRSQHHSFEELHVLMNSEEHALDQQVKSEVSSQTLAMNANFLAVVVVILEIAEDL
VVMAVVVADLVVELAALAASTTYGQSSVPNVANNQVWLSDSGCNAHLTSDLANMQVSSEYNGEANVTVGSGQALPVTHTGQTNGASTFHGPSINGLYPLTTQSSPHVPSH
LTAQLGTKSHSSTWHDRLGHPNNSTLASVLRFLNVNNSSMKSACIHCLNGKMCKLSFPLSSSYSSYPLELIHSDVWGPSPELSINGSRYYISFIDDLSRYTWIYPLTYKS
DVPSIVSQFKPFVETLLSTKVKTFRSDGGGEYVNNTLRTYFEKYGIMHQKSCPHTPEQNGIAERKHRHIIEMALALLSKSSVPLKYWFHAFACAVFIINRLPSPTLGNRS
PFEILFHKPPDYTHLRVFGCACYPLLRPYVSHKLQPRTTQCVFIGYPLGYKGYLCLDPSTERIYISRHVVFDEHVFPFSSFTSPTASSSSSPLPQPSSPIPSLLRLFLIL
LNKLLRNQQMCLLL