CuGenDBv2

Clc04G01155 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G01155
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Integrase catalytic domain-containing protein
Genome location: ClcChr04:3235465..3239703
RNA-Seq Expression: Clc04G01155
Synteny: Clc04G01155
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
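
The genome location field above uses a "chromosome:start..end" notation. As a minimal sketch, the span it covers could be computed as follows. The function names parse_location and span_length are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("ClcChr04:3235465..3239703")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 4239 bases on ClcChr04; with 0-based, half-open coordinates the length would be one less.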


Homology
GenBank top hits (e value, % identity, alignment)
KAG7578768.1 GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa] (e value: 3.5e-196; % identity: 36.5)
Query:  KDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPT------------------------------------------------
        K  QRP+CT CG++GH+V KCYKLHGYP GYKS       NP   ++Q+TP+                                                
Subjt:  KDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPT------------------------------------------------

Query:  -------------ANAQPKPSPSQQQQQQLTLQSK-------------------------------------EWILDSGASRHICNDRSLFQNWNQVFDI
                     +NA  + SP Q +Q    L SK                                      WI+DSGA+ H+C + SLF + N + + 
Subjt:  -------------ANAQPKPSPSQQQQQQLTLQSK-------------------------------------EWILDSGASRHICNDRSLFQNWNQVFDI

Query:  AVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLK----------------------MIGKVNNKHGLYLLNF-IDSSNHHTTAGVSC
         V L N  QI ++  G V++S+ L+L +VLF+P F  NLIS    L+                      MIGK   ++ LY L+    SS   ++  + C
Subjt:  AVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLK----------------------MIGKVNNKHGLYLLNF-IDSSNHHTTAGVSC

Query:  AISI----ETWHHFLDHLSPKCLSLLKDTLSLPR------------------------------------------PFKHSTYSGYKYFLTIVDNCSRFT
         +++      WH  L H S   L  L + LS+ +                                          PF   ++  +KYFLT+VD+C+R T
Subjt:  AISI----ETWHHFLDHLSPKCLSLLKDTLSLPR------------------------------------------PFKHSTYSGYKYFLTIVDNCSRFT

Query:  WTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTA
        W YL+++KSD   I P F+  VET ++  +K  RSDNA EL FT+L  +KG  H FSCV+ PQQNSVVERKHQH+ NVARAL FQS +PI++W DCI T+
Subjt:  WTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTA

Query:  TFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDE-----------------------------LNLGNTPHTAEEQVV--------------
         +LINRT  PLL+NK+PFE+L +K   Y  L+ FGC+                                  LNL     +    V+              
Subjt:  TFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDE-----------------------------LNLGNTPHTAEEQVV--------------

Query:  ------FNEGIVQNPSTTITTDSTKIIEP----NNIVEPNEAANPPHD--ITVGLRRSTRRHQPVGFLRDYHCNLLQG-QVLNTTTLYSINNYLSYDKLS
              FN  I+  P    T+ S  +  P    N+++  N  ++   D  I V   R  R  +   +L DYHCNL+     ++  T + +++ L Y KL+
Subjt:  ------FNEGIVQNPSTTITTDSTKIIEP----NNIVEPNEAANPPHD--ITVGLRRSTRRHQPVGFLRDYHCNLLQG-QVLNTTTLYSINNYLSYDKLS

Query:  ALHQNFIFNISSIVLPSYYNQAV----------------------NIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPV
          +Q FI NIS+   P  + +AV                      ++V LP G + IGC+WVY                   + Y Q+EG+D+IDTFSPV
Subjt:  ALHQNFIFNISSIVLPSYYNQAV----------------------NIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPV

Query:  AKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY---------------------------------YQDLKSSSSNTLSKSNYSLFT
        AK+VTVKLLL L+A  GW L QMD+ NAFL+G+L EE+YM LP GY                                 + D+  ++  T S+S+++LF 
Subjt:  AKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY---------------------------------YQDLKSSSSNTLSKSNYSLFT

Query:  KGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGI
        K   + FIALLVYVDDILI   S   ++ +K++L + F LKDLG AKYFLGLE++++  GI +SQRKY L +LE  G L  KP S P  S ++LT  +G 
Subjt:  KGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGI

Query:  PLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLG
           L DA+ YR LIG LLYL I+R +++FAVHKLS ++++P + HL AAH ++RYLKG  G+ +F +A ++ +L+A+ D+DW +C DTRR  T FC FLG
Subjt:  PLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLG

Query:  DSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSH
         SL+SWKSKKQ T SRSSAE+EYRA A  + EL W+S +LKDLH+  P    ++CD+ A + IA+N  FHE TKHI+IDCH VRDK+  G LK++ V + 
Subjt:  DSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSH

Query:  SQLANMFTKPL
        +QL ++FTK L
Subjt:  SQLANMFTKPL

KYP61022.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] (e value: 1.3e-187; % identity: 38.22)
Query:  KDGFVDETENAKKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQS----KEWI
        K GF   T  + K +  R     R ICT+CG  GH ++ CY+ HG+PPG K   ++ A      +     T + Q   S   +    +   +      WI
Subjt:  KDGFVDETENAKKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQS----KEWI

Query:  LDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLIS----------------------DRLSLKMIGKVN
        LDSGA+ H+ +  S F +++ +  I V L  G  +   + G V+ + S  L DVL++P F YNLIS                      +  ++K IG V+
Subjt:  LDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLIS----------------------DRLSLKMIGKVN

Query:  NKHGLYLLNFIDSSNHHTTAG-----VSCAIS-IETWHHFLDHLSPKCLS----LLKDT------------LSLPRPFKHSTYS----------------
           GLY  +F  S+ HH +         C+I  I+ WH  + HLS + L     L  DT              LP P  HS  S                
Subjt:  NKHGLYLLNFIDSSNHHTTAG-----VSCAIS-IETWHHFLDHLSPKCLS----LLKDT------------LSLPRPFKHSTYS----------------

Query:  ----GYKYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVAR
            G+KYFLTIVD+ +RFTW +LM++KS+    +  FI  VET F K +KV R+DN LE   T  F++KG IHQ +CVE PQQN +VERKHQHL NV R
Subjt:  ----GYKYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVAR

Query:  ALFFQSRVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELN---------------LGNTPHT--------------
        +L FQ+ +P  FW   ++ ATFLIN    P L N SPFE LY    +   L VFGC+  S  +                LG  PHT              
Subjt:  ALFFQSRVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELN---------------LGNTPHT--------------

Query:  AEEQVVFNEG------------------IVQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTLYS
            V+F+E                   I  N     T  S+ I+E ++    ++ ++PP      LRRSTR  +P  +L+D+H     G   +T+T +S
Subjt:  AEEQVVFNEG------------------IVQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTLYS

Query:  -------INNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVN----------------------IVPLPNGHRVIGCKWVY-------------------
               ++++LSYD LS    +++F+ISS+  P  + +A                        +  LP     IGC+WVY                   
Subjt:  -------INNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVN----------------------IVPLPNGHRVIGCKWVY-------------------

Query:  RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFG-------------------------YYQDLKS---SS
        + Y Q EG+DF DTFSPVAK+ TV+LLLSL A   W L Q+D+NNAFL+G+L EEVYMQLP G                         +Y  L S     
Subjt:  RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFG-------------------------YYQDLKS---SS

Query:  SNTLSKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFP
            S S++SLF K S ++  A+L+YVDDI++ G   TEI  + +LL + F +KDLGN KYFLGLE++++  GI++ QRKY L +L D+G LA KP S P
Subjt:  SNTLSKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFP

Query:  FASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLD
           ++ L+A++G PL   D ++YRRL+G L+YL  +RP++++AV +LS +V+ P + H  A   +LRYLKGT G  IFL+  ++ QL+A+ DSDW  C D
Subjt:  FASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLD

Query:  TRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKI
        TRR +T F  +LGDSLISWKSKKQ TVSRSS+EAEYRA A  + EL W+S++LKD HI+  + ++++CDN + + IA+NP FHE TKHI+IDCH VRDK+
Subjt:  TRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKI

Query:  INGALKILPVRSHSQLANMFTKPLN
          G LK+LPV S  QLA++ TKPL+
Subjt:  INGALKILPVRSHSQLANMFTKPLN

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum] (e value: 2.8e-193; % identity: 38.02)
Query:  RPICTNCGIKGHVVDKCYKLHGYPPG---YKSRINENAENPPQNSSQS---TPTANAQPKPSPSQQQQQQL-----------------------------
        R IC++C  + H VDKCYKLHGYPPG   +KS+I++ + +  Q SS S     T       S +Q Q +QL                             
Subjt:  RPICTNCGIKGHVVDKCYKLHGYPPG---YKSRINENAENPPQNSSQS---TPTANAQPKPSPSQQQQQQL-----------------------------

Query:  ---------TLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLIS-----------------
                  +  K+WI+D+GA+ HIC   S+F++ ++     V L N   I V   G V V+ +L+L +VL++P F +NL+S                 
Subjt:  ---------TLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLIS-----------------

Query:  -----DRLSLKMIGKVNNKHGLYLL----NFIDSSNHHTTAGVSCAISIETWHHFLDHLSPKCLSLLKDTLSLPR-------------------------
             D   ++MIG       LY+L     F+ S   +T    S     E WH  + H S   LS LK+ L++                           
Subjt:  -----DRLSLKMIGKVNNKHGLYLL----NFIDSSNHHTTAGVSCAISIETWHHFLDHLSPKCLSLLKDTLSLPR-------------------------

Query:  ---------------PFKHSTYSGYKYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVER
                       PF  ++  G+++F TIVD+ SR+TW Y+++SKSD L I P F  +V T F  T+K  RSDNA EL F + FA  G  H  SCVER
Subjt:  ---------------PFKHSTYSGYKYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVER

Query:  PQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDE-------------------
        PQQNSVVERKHQH+ NVARAL FQS +P+ +W DCI T+ +LINRT  P+L++K+PFE+L+ K  +Y  L+VFGC+  +                     
Subjt:  PQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDE-------------------

Query:  ----------LNLGNTPHTAEEQVVFNEGI--VQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTT
                  LNL          V+F+E     QN S    +D T  + P++ + P+  A+          R++R H     LRDYHC  +     +T+T
Subjt:  ----------LNLGNTPHTAEEQVVFNEGI--VQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTT

Query:  LYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAV----------------------NIVPLPNGHRVIGCKWVYRR-------------------YN
         + I+  ++Y KLS+ H+ F+ NISSI+ P+ ++QAV                      +IV LP G   +GC+WVY+                    Y 
Subjt:  LYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAV----------------------NIVPLPNGHRVIGCKWVYRR-------------------YN

Query:  QKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGYYQD-----------------LKSSS-------SNTL----
        Q+EG+D+++TFSPVAK+VTV+ LL+L A  GW L+Q+D+NNAFL+G+L EEVYM LP G+  +                 LK +S       S+TL    
Subjt:  QKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGYYQD-----------------LKSSS-------SNTL----

Query:  ---SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPF
           S ++ SLF +   + F+AL+VYVDDI+I        + +K  L S F LKDLGN KYFLG+E+++ST G+ I QR Y + +L ++G L  KP + P 
Subjt:  ---SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPF

Query:  ASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDT
         +N KL   +G  L+  D +SYRRLIG LLYL I+RP++ FAV+KLS YV+ P   H+ AA ++L+Y+KGT GQ +F +++++ +L+A+ D+DW +CLDT
Subjt:  ASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDT

Query:  RRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKII
        RR VT +C FLG+SLISW++KKQ TVSRSSAEAEYR+ A  + E+ W+  +L DL + +   T++FCD+ A V IA+NP FHE TKHI IDCH VR+K+ 
Subjt:  RRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKII

Query:  NGALKILPVRSHSQLANMFTKPL
           +K++ V S  QLA++FTKPL
Subjt:  NGALKILPVRSHSQLANMFTKPL

RVW21404.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 1.3e-187; % identity: 39.66)
Query:  KKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYK--------SRINENAE-----------------------------NPPQNSSQSTPTANAQPKPSPS
        K    R  C+ CG +GH+ DKCYKL GYPPG+K        S +  N+E                                 +S+ S  T N+   PS S
Subjt:  KKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYK--------SRINENAE-----------------------------NPPQNSSQSTPTANAQPKPSPS

Query:  QQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNK-----
             ++ +Q+K WI++SGA+ H+CND SLF +   V ++ V L  G  + +D +G+V +S+ + L +VLF+P F YNL+S+    KMIGK + K     
Subjt:  QQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNK-----

Query:  --------------------HGLYLLNFIDSSNHHTTAGVSCAISIETWHHFLDHLSPKCLSLLKDTLSLP--RPFKHSTYSGYKYFLTIVDNCSRFTWT
                             GL  +   DSS   T   V C ++ +    ++  L+ +C S   D L L    PF   +  GYK+FLTIVD+ SR TW 
Subjt:  --------------------HGLYLLNFIDSSNHHTTAGVSCAISIETWHHFLDHLSPKCLSLLKDTLSLP--RPFKHSTYSGYKYFLTIVDNCSRFTWT

Query:  YLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATF
        Y++++KS+    +P F A V+  F K +K  RSDNA EL  +N + + G IH  SCVE PQQNSVVERKHQH+ NVARAL FQS +PI +W DCILTA +
Subjt:  YLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATF

Query:  LINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPH-----------------------------TAEEQVVFNEGIV----QNP--S
        LINRT  P L+NK+PFE+L+DK  +Y  LRVFGC+     L    T                               +    V+F+E I      NP  S
Subjt:  LINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPH-----------------------------TAEEQVVFNEGIV----QNP--S

Query:  TTITTD--STKII-------EPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQ--GQVLNTTTLYSINNYLSYDKLSALHQNFIFNISS
          I++D    +++       + ++ V P   + PP  +     R TR  +   +L+DYHC+L+     V   +T + I ++LSYDKLS+ ++ F  ++S 
Subjt:  TTITTD--STKII-------EPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQ--GQVLNTTTLYSINNYLSYDKLSALHQNFIFNISS

Query:  IVLPSYYNQAVNIVPLPNGHRVIGCKWVYRRYNQKEGIDFIDTFSP------VAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGYY
        I  P  + +A  I   P     + C+      N+   I  +   S       +AK+VTVKLLL++ A  GW L Q+D+NNAFL+G+L EEVYM+LP GY 
Subjt:  IVLPSYYNQAVNIVPLPNGHRVIGCKWVYRRYNQKEGIDFIDTFSP------VAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGYY

Query:  QDLKSSSSNTL--------------------------------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKY
        +  +S  SN +                                S SN+SLF K     FIALLVYVDD++I   +   I  +K+ L   F LKDLG+ KY
Subjt:  QDLKSSSSNTL--------------------------------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKY

Query:  FLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSA
        FLGLE+++S+ GI +SQRKY L +L D G+L  K  S P  +N+KL+   G+  +L D S YRRL+G LLYL ++RP++S AV +LS ++++P   HL A
Subjt:  FLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSA

Query:  AHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFP
        A  +LRYLKG  G  +F  + +  +L AY DSDW  C D+RR VT FC FLG+SL+SWKSKKQ  VSRSSAEAEYRA A  S E+TW+  +LKD  I+  
Subjt:  AHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFP

Query:  ALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN
        A  L+FCDN + + IA NP FHE TKHI+IDCH VRDK+ +G LK + V +  QLA++ TK L+
Subjt:  ALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN

RVW51959.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 3.0e-195; % identity: 39.59)
Query:  KKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYK--------SRINENAE-----------------------------NPPQNSSQSTPTANAQPKPSPS
        K    R  C++CG +GH  DKCYKL GYPPG+K        S +  N+E                                 +S+ S  T N+   PS S
Subjt:  KKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYK--------SRINENAE-----------------------------NPPQNSSQSTPTANAQPKPSPS

Query:  QQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNKHGLYL
             ++ +Q+K WI+DSGA+ H+CND SLF +   V ++ V L  G  + +D +G+V +S+ + L +VLF+P F YNL+S+    KMIGK + K  LY 
Subjt:  QQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNKHGLYL

Query:  L---NFIDSSNHHTTAGVSCAISIETWHHFLDH--------------------LSP---------KCLSLLK---------DTLSLP--RPFKHSTYSGY
        L   +F+        + +  +  +  WH  L H                    L+P         +CL  +          D L L    PF   +  GY
Subjt:  L---NFIDSSNHHTTAGVSCAISIETWHHFLDH--------------------LSP---------KCLSLLK---------DTLSLP--RPFKHSTYSGY

Query:  KYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQS
        K+FLTIVD+ SR TW Y++++KS+    +P F A V+  F K +K  RSDNA EL  +N + + G IH  SCVE PQQNSVVERKHQH+ NVARAL FQS
Subjt:  KYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQS

Query:  RVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAEEQVVFNEGIVQNPSTTITTDSTKIIEPNNIVEP
         +P+ +W DCILTA +LINRT  P L+NK+PFE+L+DK  +Y  LRVFGC+     L    T  +   +       +  P      D +  + P  I +P
Subjt:  RVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAEEQVVFNEGIVQNPSTTITTDSTKIIEPNNIVEP

Query:  NEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQ--GQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYY---------------------
             P         R TR  +   +L+DYHC+L+     V   +T + I ++LSYDKLS  ++ F  ++S I  PS +                     
Subjt:  NEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQ--GQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYY---------------------

Query:  -NQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYM
         N+  +IV LP G   +GCKWV+                   + Y Q+EGID++DTFSPVAK+VTVKLLL++ A  GW L Q+D+NNAFL+G+L EEVYM
Subjt:  -NQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYM

Query:  QLPFGYYQDLKSSSSNTL--------------------------------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLK
        +LP GY +  +S  SN +                                S S++SLF K     FIALLVYVDD                         
Subjt:  QLPFGYYQDLKSSSSNTL--------------------------------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLK

Query:  DLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKP
        DLG+ KYFLGLE+++S+ GI +SQRKY L +L D G+L  K  S P  +N+KL+   G+  +L D S YRRL+G LLYL ++RP++S+AV +LS ++++P
Subjt:  DLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKP

Query:  YSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLK
           HL AA  +LRYLKG  G  +F    +  +L AY DSDW  C D+RR VT FC FLG+SL+SWKSKKQ  VSRSSAEAEYRA A  S E+TW+  +LK
Subjt:  YSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLK

Query:  DLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN
        D  I+  A  L+FCDN + + +A NP FHE TKHI+IDCH VRDK+ +G LK + V +  QLA++ TK L+
Subjt:  DLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EHN7 Integrase catalytic domain-containing protein (e value: 4.6e-194; % identity: 36.33)
Query:  DGFVDETENAKKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTA----NAQPKPS------PSQQQQQQLTLQ
        + F   T+N+K+  Q  +KD    IC++CG KGH  DKCYKLHGYPPG++S+   N     Q SS + P +    N Q  P+        QQ    LT Q
Subjt:  DGFVDETENAKKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTA----NAQPKPS------PSQQQQQQLTLQ

Query:  SK--------------------------------------------------------------EWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGF
        ++                                                              +W++D+GA+ H+      +   + V +I+V L NG 
Subjt:  SK--------------------------------------------------------------EWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGF

Query:  QIKVDYIGNVRVSESLMLNDVLFLPKFAYNLIS----------------------DRLSLKMIGKVNNKHGLYLLNFIDSSNHHTTAGVSCAI-------
         + V +IG+V+++ +L+L DVL +P F +NLIS                      D +  +MIG     +GLYLL+F   S +   A +S          
Subjt:  QIKVDYIGNVRVSESLMLNDVLFLPKFAYNLIS----------------------DRLSLKMIGKVNNKHGLYLLNFIDSSNHHTTAGVSCAI-------

Query:  ----------SIETWH-----------HFLDHLSPKCLSLLKDTLS------------LPRPFKH--------------------STYSGYKYFLTIVDN
                   I  WH           HFL  + P  +SL  +  S            LP P K+                     T  GY+YFLT+VD+
Subjt:  ----------SIETWH-----------HFLDHLSPKCLSLLKDTLS------------LPRPFKH--------------------STYSGYKYFLTIVDN

Query:  CSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGD
        C+R TW YLMRSKSD   ++  FI +++T F   IK  RSDN  E H    +A+KG IHQ SCVE PQQNSVVERKHQH+ NVAR+L FQS +P+++WG 
Subjt:  CSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGD

Query:  CILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAEEQ-----------------------------VVFNEGI----
        CI TA +LINR   P+LSNKSPFE L  K  +Y  L+VFGC+  +  L+   T      Q                             VVF+E I    
Subjt:  CILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAEEQ-----------------------------VVFNEGI----

Query:  VQNPSTTITT----------------DSTKIIE-----------------------PNNIVEPN--------------EAANPPHDITVGLRRSTRRHQP
         Q P    TT                 S  II                        P + + P+              E  +P   ++  LRRSTR H+P
Subjt:  VQNPSTTITT----------------DSTKIIE-----------------------PNNIVEPN--------------EAANPPHDITVGLRRSTRRHQP

Query:  VGFLRDYHCNLL-------QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQA----------------------VNIVPLPNGHRVIG
          +L+DYHC L           + ++ T Y ++  LSYD LS  H+NF  ++++I  PS ++QA                        + PLP G   IG
Subjt:  VGFLRDYHCNLL-------QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQA----------------------VNIVPLPNGHRVIG

Query:  CKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY------------
        CKWVY                   + Y Q+EG+D+ +TFSPVAK  TV+ LL++ ++  W L Q+D+NNAFL+G+L EEVYM LP G+            
Subjt:  CKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY------------

Query:  ----YQDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGI
               LK +S       S+T+       S S+YSLFT+  G  FIALLVYVDDILI       +  +K  L + F LKDLGN K+FLGLE+++ST GI
Subjt:  ----YQDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGI

Query:  YISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAG
         + QRKY L IL DSG L  KP + P   NLK++ + G    L D S YRRLIG LLYL ++RP++S++V +LS +++KP   HL+AA+ +LRY+KGT+G
Subjt:  YISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAG

Query:  QRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVV
        Q +F  + ++ QLKA+ DSDW  C DTRR +T +C ++G SLISWKSKKQ TVSRSSAEAEYRA A V  EL W+  +L +L    P   L+FCD+ A +
Subjt:  QRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVV

Query:  SIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLNMLFD
         IA NP +HE TKHI++DCH +R+KI  G ++ L V S +QLA++ TK L ++ D
Subjt:  SIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLNMLFD

A0A2N9EL12 Integrase catalytic domain-containing protein (e value: 1.6e-199; % identity: 37.14)
Query:  QRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQL----------------------------------
        +RP CT+CG+ GH VDKCYKLHG+PPGYK+R   +A N    S  + PTA    + S SQ  Q Q                                   
Subjt:  QRPICTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQL----------------------------------

Query:  ------------------TLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMI
                           ++   WILD+GA+ H+    S            V L NG  + V +IG V++S SL+L DVL +P F +NLIS     +MI
Subjt:  ------------------TLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMI

Query:  GKVNNKHGLYLL-----------------------NFIDSSNHHTTAGVSCAISIETWH-----------HFLDHLSPKCLSLLKDT-------------
        G     +GLY+L                       +F   S H T A VS    ++ WH           HFL    P   ++ K++             
Subjt:  GKVNNKHGLYLL-----------------------NFIDSSNHHTTAGVSCAISIETWH-----------HFLDHLSPKCLSLLKDT-------------

Query:  LSLPR------------------PFKHSTYSGYKYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTI
        L  P                   P+  ST+ G+KYFLTIVD+CSR TW YLM SK+D   ++  F  ++ET F+  IK  RSDN LE   ++ F++KG I
Subjt:  LSLPR------------------PFKHSTYSGYKYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTI

Query:  HQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDEL----------
        HQ SCV+ PQQNSVVERKHQHL NVARAL FQS VP+ FWGD IL A +LINR   PLL NK+PFE+L     +Y  L+VFGC+  +  L          
Subjt:  HQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDEL----------

Query:  -------------------NLGNTPHTAEEQVVFNEGI--------VQNPSTTITTDSTK-------IIEPNNIVE------------------------
                           +L          V+F+E          + NP T+ +T S         ++ P N  E                        
Subjt:  -------------------NLGNTPHTAEEQVVFNEGI--------VQNPSTTITTDSTK-------IIEPNNIVE------------------------

Query:  --------------------------------------------------PNEAANPPHDI------------TVGLRRSTRRHQPVGFLRDYHCNLL--
                                                          P + + P + +            +  LR+S+R  +   +L+DYHCNL   
Subjt:  --------------------------------------------------PNEAANPPHDI------------TVGLRRSTRRHQPVGFLRDYHCNLL--

Query:  --QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVN----------------------IVPLPNGHRVIGCKWVY------------
               +    + I + LSY  LS  H+ F   +S+   P +Y++A++                      +  L  G   IGCKWVY            
Subjt:  --QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVN----------------------IVPLPNGHRVIGCKWVY------------

Query:  -------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY----------------YQDLKSSS-----
               + YNQ+EGID+ +TFSPVAK+VTV+  +++ A+ GW L Q+D+NNAFL+G+L EEVYM LP GY                   LK +S     
Subjt:  -------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY----------------YQDLKSSS-----

Query:  --SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGF
          S+TL       SK +YSLFTK  GS+FIALLVYVDDILI   +PT +T + T L   F LKDLG+AKYFLGLEL++S  GI + QRKY L ILED+GF
Subjt:  --SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGF

Query:  LAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYV
        LA KP  FP   ++KL+   G  L   D + YRRL+G LLYL ++RP++S++V +LS ++ +P + HL AAH +L+YLKG+ GQ +F  ATN+ QLKA+ 
Subjt:  LAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYV

Query:  DSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKI
        DSDW  C DTRR VT FC FLGDSLISW+SKKQ+ VSRSSAEAEYRA A+ + E+TW+  +L+D  I+     L+FC+N A + IA NP FHE TKHI++
Subjt:  DSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKI

Query:  DCHFVRDKIINGALKILPVRSHSQLANMFTKPLNML
        DCHF+RDKI  G LK L + S  QLA++FTKPL  +
Subjt:  DCHFVRDKIINGALKILPVRSHSQLANMFTKPLNML

A0A2N9H2Y3 Integrase catalytic domain-containing protein (e value: 1.4e-198; % identity: 36.79)
Query:  DGFVDETENAKKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSR-------------------INENAENPPQNSSQSTP--------TANA
        + F   T+N K+  Q  +KD    IC++CG KGH  DKCYKLHGYPPG++S+                     +NA++ P  ++ S          TA A
Subjt:  DGFVDETENAKKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSR-------------------INENAENPPQNSSQSTP--------TANA

Query:  QPKPSPSQQQQQQL-----------------------------------------TLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKV
        Q     S  Q  Q                                          T  S +W++D+GA  H+      +   + V +I+V L NG  + V
Subjt:  QPKPSPSQQQQQQL-----------------------------------------TLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKV

Query:  DYIGNVRVSESLMLNDVLFLPKFAYNLIS----------------------DRLSLKMIGKVNNKHGLYLLNFIDSSNHHTTAGVSCAIS----------
         +IG+V+++ +L+L +VL +P F +NLIS                      D +  +MIG     +GLYLL+   SS+  TTA    + S          
Subjt:  DYIGNVRVSESLMLNDVLFLPKFAYNLIS----------------------DRLSLKMIGKVNNKHGLYLLNFIDSSNHHTTAGVSCAIS----------

Query:  --------IETWH-----------HFL---------------------------------DHLSPKCLSLLKDTLSLPRPFKHSTYSGYKYFLTIVDNCS
                I  WH           HFL                                 +HLS K   LL   + +  P+   T  GY+YFLT+VD+C+
Subjt:  --------IETWH-----------HFL---------------------------------DHLSPKCLSLLKDTLSLPRPFKHSTYSGYKYFLTIVDNCS

Query:  RFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCI
        R TW YLMRSKSD   ++  FI ++ T F   IK  RSDN  E H  + +A+KG IHQ SCVE PQQNSVVERKHQH+ NVARAL FQS +P+++WG CI
Subjt:  RFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCI

Query:  LTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELN-----------------------------LGNTPHTAEEQVVFNEGIV----Q
         TA +LINR   P+LSNKSPFE L  K  +Y  L+VFGC+  +  L+                             L          VVF+E I     Q
Subjt:  LTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELN-----------------------------LGNTPHTAEEQVVFNEGIV----Q

Query:  NPSTTITT----------------------------------------------DSTKIIEPNNIVEPN----EAANPPHDITVGLRRSTRRHQPVGFLR
         P    +T                                              D++ +++ N+   P+    E  +P   ++  LRRSTR H+P  +L+
Subjt:  NPSTTITT----------------------------------------------DSTKIIEPNNIVEPN----EAANPPHDITVGLRRSTRRHQPVGFLR

Query:  DYHCNLL-------QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQA----------------------VNIVPLPNGHRVIGCKWVY
        DYHC L           + ++   Y ++  LSYD LS  H+NF  ++++I+ PS+++QA                        + PLP G   IGCKWVY
Subjt:  DYHCNLL-------QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQA----------------------VNIVPLPNGHRVIGCKWVY

Query:  -------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY----------------Y
                           + Y Q+EG+D+ +TFSPVAK  TV+ LL++ +   W L Q+D+NNAFL+G+L EEVYM LP G+                 
Subjt:  -------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGY----------------Y

Query:  QDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQR
          LK +S       S+T+       SKS+YSLFT+  G++FIALLVYVDDILI   +  ++ ++K  L + F LKDLGN KYFLGLE+++ST GI + QR
Subjt:  QDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQR

Query:  KYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFL
        KY L IL DSG L  KP + P   NLK++ + G    LDD S YRRL+G LLYL ++RP++S++V KLS +++KP S HLSAA+ +LRY+KGT+GQ +F 
Subjt:  KYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFL

Query:  AATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATN
         + +  QLKA+ DSDW  CLDTRR +T +C ++GDSLISWKSKKQ TVSRSSAEAEYRA A V  EL W+  +L +L    P   L+FCD+ A + IA N
Subjt:  AATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATN

Query:  PTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPL
        P +HE TKHI++DCH +R+KI  G ++ L V S +QLA++ TK L
Subjt:  PTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPL

A0A2N9IF64 Integrase catalytic domain-containing protein | e value: 3.6e-199 | %identity: 38.56
Query:  KKDSQRPICT--NCGIKGHVVDKCYKLHGYPPGYKSR--------------INENA-----ENPPQNSSQ-------------------STP--------
        +++ +RP CT  +CG+ GH VDKCYKLHG+PPGYK+R                 NA     E  P   SQ                   S P        
Subjt:  KKDSQRPICT--NCGIKGHVVDKCYKLHGYPPGYKSR--------------INENA-----ENPPQNSSQ-------------------STP--------

Query:  -TANA----------------------QPKPSP-------SQQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVR
         T NA                       P  +P       S       +++   WILD+GA+ H+ +  S F     +    V L NG  + V +IG V+
Subjt:  -TANA----------------------QPKPSP-------SQQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVR

Query:  VSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNKHGLYLLNFIDSSNHHTTAGVSCAISIETWHHFLDHLSPKCLSLLKDTLSLPRPFKHSTYSGYKY
        +S SL+L DVL +P F +NLIS     +MIG    K   + L+F  +   H T+     I  + W                       P+   T+ G+KY
Subjt:  VSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNKHGLYLLNFIDSSNHHTTAGVSCAISIETWHHFLDHLSPKCLSLLKDTLSLPRPFKHSTYSGYKY

Query:  FLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRV
        FLTIVD+CSR TW YLM SK     ++  F  +VET F+  IK  RSDN LE   ++ F++KG IHQ SCV+ PQQNSVVERKHQHL NVARA+ FQS +
Subjt:  FLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRV

Query:  PIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAE-----------------------------EQVVFN
        P+ FWG+CIL A +LINR   P+L  K+P+EVL  K   Y  L+VFGC+  +  L+   T   A+                               VVF+
Subjt:  PIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAE-----------------------------EQVVFN

Query:  EGI--------VQNPSTTITTDSTKIIEPNNIVEP---------------------------------NEAANPPHDITV--------------------
        E I        + NP +++ + S     P + V P                                  E  NPP  + V                    
Subjt:  EGI--------VQNPSTTITTDSTKIIEPNNIVEP---------------------------------NEAANPPHDITV--------------------

Query:  ------GLRRSTRRHQPVGFLRDYHCNLL----QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVN--------------------
               +R+S+R  +P  +L+DYHC+L        + +  T+Y I + LSY KLSA H+ F   IS+ + P +Y++AV                     
Subjt:  ------GLRRSTRRHQPVGFLRDYHCNLL----QGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVN--------------------

Query:  --IVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPF
          +  LP G + IGCKWVY                   + YNQ+EGID+ +TFSPVAK+VTV+  ++L A+ GW + Q+D+NNAFL+G+L EEV+M LP 
Subjt:  --IVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPF

Query:  GY----------------YQDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAK
        G+                   LK +S       S+TL       SK +YSLFTK  G +F+ALLVYVDDILI       +T +   L  HF LKDLG AK
Subjt:  GY----------------YQDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAK

Query:  YFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLS
        YFLGLEL+++  GI + QRKY L IL+D+GFL  KP  FP   +LKL+   G    L D + YRRLIG LLYL ++RP++S++V +LS ++ +P + HL 
Subjt:  YFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLS

Query:  AAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINF
        AAH +L+YLKG+ GQ +F ++  + QLKA+ DSDW  C DTRR VT FC FLGDSLISW+SKKQ+ VSRSSAEAEYRA A+ + E+TW+  +LKD HI+ 
Subjt:  AAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINF

Query:  PALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLNML
        P   ++FCDN A + IA+NP FHE TKHI++DCHF+RDKI  G LK L V S  QLA++FTKPL  +
Subjt:  PALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLNML

A0A438EW68 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.4e-195 | %identity: 39.59
Query:  KKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYK--------SRINENAE-----------------------------NPPQNSSQSTPTANAQPKPSPS
        K    R  C++CG +GH  DKCYKL GYPPG+K        S +  N+E                                 +S+ S  T N+   PS S
Subjt:  KKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYK--------SRINENAE-----------------------------NPPQNSSQSTPTANAQPKPSPS

Query:  QQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNKHGLYL
             ++ +Q+K WI+DSGA+ H+CND SLF +   V ++ V L  G  + +D +G+V +S+ + L +VLF+P F YNL+S+    KMIGK + K  LY 
Subjt:  QQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVNNKHGLYL

Query:  L---NFIDSSNHHTTAGVSCAISIETWHHFLDH--------------------LSP---------KCLSLLK---------DTLSLP--RPFKHSTYSGY
        L   +F+        + +  +  +  WH  L H                    L+P         +CL  +          D L L    PF   +  GY
Subjt:  L---NFIDSSNHHTTAGVSCAISIETWHHFLDH--------------------LSP---------KCLSLLK---------DTLSLP--RPFKHSTYSGY

Query:  KYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQS
        K+FLTIVD+ SR TW Y++++KS+    +P F A V+  F K +K  RSDNA EL  +N + + G IH  SCVE PQQNSVVERKHQH+ NVARAL FQS
Subjt:  KYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQS

Query:  RVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAEEQVVFNEGIVQNPSTTITTDSTKIIEPNNIVEP
         +P+ +W DCILTA +LINRT  P L+NK+PFE+L+DK  +Y  LRVFGC+     L    T  +   +       +  P      D +  + P  I +P
Subjt:  RVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCVDISDELNLGNTPHTAEEQVVFNEGIVQNPSTTITTDSTKIIEPNNIVEP

Query:  NEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQ--GQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYY---------------------
             P         R TR  +   +L+DYHC+L+     V   +T + I ++LSYDKLS  ++ F  ++S I  PS +                     
Subjt:  NEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQ--GQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPSYY---------------------

Query:  -NQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYM
         N+  +IV LP G   +GCKWV+                   + Y Q+EGID++DTFSPVAK+VTVKLLL++ A  GW L Q+D+NNAFL+G+L EEVYM
Subjt:  -NQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYM

Query:  QLPFGYYQDLKSSSSNTL--------------------------------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLK
        +LP GY +  +S  SN +                                S S++SLF K     FIALLVYVDD                         
Subjt:  QLPFGYYQDLKSSSSNTL--------------------------------SKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLK

Query:  DLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKP
        DLG+ KYFLGLE+++S+ GI +SQRKY L +L D G+L  K  S P  +N+KL+   G+  +L D S YRRL+G LLYL ++RP++S+AV +LS ++++P
Subjt:  DLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKP

Query:  YSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLK
           HL AA  +LRYLKG  G  +F    +  +L AY DSDW  C D+RR VT FC FLG+SL+SWKSKKQ  VSRSSAEAEYRA A  S E+TW+  +LK
Subjt:  YSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLK

Query:  DLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN
        D  I+  A  L+FCDN + + +A NP FHE TKHI+IDCH VRDK+ +G LK + V +  QLA++ TK L+
Subjt:  DLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN

SwissProt top hits (e value, %identity, alignment)
P04146 Copia protein | e value: 3.8e-68 | %identity: 25
Query:  KKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRI-NENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHICNDRS
        K K   +     +  C +CG +GH+   C+        YK  + N+N EN  Q       TA +       ++      + +  ++LDSGAS H+ ND S
Subjt:  KKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRI-NENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHICNDRS

Query:  LFQNWNQV---FDIAVALHNGFQIKVDY-IGNVRVSESLMLNDVLFLPKFAYNLISDR--------LSLKMIGKVNNKHGLY------LLNFIDSSNHHT
        L+ +  +V     IAVA    F       I  +R    + L DVLF  + A NL+S +        +     G   +K+GL       +LN +   N   
Subjt:  LFQNWNQV---FDIAVALHNGFQIKVDY-IGNVRVSESLMLNDVLFLPKFAYNLISDR--------LSLKMIGKVNNKHGLY------LLNFIDSSNHHT

Query:  -TAGVSCAISIETWHHFLDHLS-----------------------------PKCLS---------LLKDTLSLPRPF--KHS---------TYSGYKYFL
         +       +   WH    H+S                               CL+          LKD   + RP    HS         T     YF+
Subjt:  -TAGVSCAISIETWHHFLDHLS-----------------------------PKCLS---------LLKDTLSLPRPF--KHS---------TYSGYKYFL

Query:  TIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALEL---HFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSR
          VD  + +  TYL++ KSD   +   F+A  E HF+  +     DN  E            KG  +  +    PQ N V ER  + +   AR +   ++
Subjt:  TIVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALEL---HFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSR

Query:  VPIRFWGDCILTATFLINRTSIPLL--SNKSPFEVLYDKDVNYPSLRVFGCV--------------------------------------------DISD
        +   FWG+ +LTAT+LINR     L  S+K+P+E+ ++K      LRVFG                                               + D
Subjt:  VPIRFWGDCILTATFLINRTSIPLL--SNKSPFEVLYDKDVNYPSLRVFGCV--------------------------------------------DISD

Query:  ELNLGNTPHTAEEQVVFNEGIVQNPSTTITTDSTKIIE---PNNIVE-----------PNEAANPPHD----ITVGLRRSTRRHQPVGFL----------
        E N+ N+    + + VF +   ++ +     DS KII+   PN   E            +E  N P+D    I       ++    + FL          
Subjt:  ELNLGNTPHTAEEQVVFNEGIVQNPSTTITTDSTKIIE---PNNIVE-----------PNEAANPPHD----ITVGLRRSTRRHQPVGFL----------

Query:  --------RDYHCNLLQG-------------------------------------QVLNTTTLYSINNYLSYDKLSALHQNFIFN---------------
                RD H N  +G                                     + L T    S N   +      L+ + IFN               
Subjt:  --------RDYHCNLLQG-------------------------------------QVLNTTTLYSINNYLSYDKLSALHQNFIFN---------------

Query:  ------ISSIVLPSYYNQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDI
              I++ +     N    I   P    ++  +WV+                   R + QK  ID+ +TF+PVA+I + + +LSL   +   + QMD+
Subjt:  ------ISSIVLPSYYNQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDI

Query:  NNAFLNGELFEEVYMQLPFGYYQDLKSSSSNT--LSKSNYSL-------------------------------FTKGSGSSFIALLVYVDDILIIGPSPT
          AFLNG L EE+YM+LP    Q +  +S N   L+K+ Y L                                 KG+ +  I +L+YVDD++I     T
Subjt:  NNAFLNGELFEEVYMQLPFGYYQDLKSSSSNT--LSKSNYSL-------------------------------FTKGSGSSFIALLVYVDDILIIGPSPT

Query:  EITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDD--ASSYRRLIGSLLYLQI-
         +   K  L   F + DL   K+F+G+ +      IY+SQ  Y  +IL           S P  S +         LN D+   +  R LIG L+Y+ + 
Subjt:  EITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDD--ASSYRRLIGSLLYLQI-

Query:  SRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLK--AYVDSDWDSCLDTRRFVTDFC-KFLGDSLISWKSKKQATVSRSSA
        +RP+++ AV+ LS Y +K  SE       +LRYLKGT   ++       F+ K   YVDSDW      R+  T +  K    +LI W +K+Q +V+ SS 
Subjt:  SRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLK--AYVDSDWDSCLDTRRFVTDFC-KFLGDSLISWKSKKQATVSRSSA

Query:  EAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPL
        EAEY A      E  W+  +L  ++I       ++ DN   +SIA NP+ H+  KHI I  HF R+++ N  + +  + + +QLA++FTKPL
Subjt:  EAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 5.2e-86 | %identity: 26.3
Query:  AKKKNQLRKKDSQRPICTNCGIKGHVVDKC---YKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHICN
        A+ K++ R K   R  C NC   GH    C    K  G   G K+  ++N     QN+       N        +++   L+    EW++D+ AS H   
Subjt:  AKKKNQLRKKDSQRPICTNCGIKGHVVDKC---YKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHICN

Query:  DRSLFQNWNQVFDIAVALHNGFQIKVDYIGNV----RVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVN---------NKHGLYLLNFIDSSNHHTTA
         R LF  +       V + N    K+  IG++     V  +L+L DV  +P    NLIS  ++L   G  +          K  L +   +     + T 
Subjt:  DRSLFQNWNQVFDIAVALHNGFQIKVDYIGNV----RVSESLMLNDVLFLPKFAYNLISDRLSLKMIGKVN---------NKHGLYLLNFIDSSNHHTTA

Query:  GVSC---------AISIETWHHFLDHLSPKCLSLL--KDTLSLPR----------------------------------------PFKHSTYSGYKYFLT
           C          IS++ WH  + H+S K L +L  K  +S  +                                        P +  +  G KYF+T
Subjt:  GVSC---------AISIETWHHFLDHLSPKCLSLL--KDTLSLPR----------------------------------------PFKHSTYSGYKYFLT

Query:  IVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALEL---HFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRV
         +D+ SR  W Y++++K     +  +F ALVE    + +K  RSDN  E     F    ++ G  H+ +    PQ N V ER ++ +    R++   +++
Subjt:  IVDNCSRFTWTYLMRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALEL---HFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRV

Query:  PIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGC-----------VDISDE------LNLGNTPH------------TAEEQVVFN
        P  FWG+ + TA +LINR+    L+ + P  V  +K+V+Y  L+VFGC             + D+      +  G+                    VVF 
Subjt:  PIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGC-----------VDISDE------LNLGNTPH------------TAEEQVVFN

Query:  E---------------GIVQN----PSTT-------ITTD--STKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTL
        E               GI+ N    PST+        TTD  S +  +P  ++E  E  +   +      +   +HQP   LR      ++ +   +T  
Subjt:  E---------------GIVQN----PSTT-------ITTD--STKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTL

Query:  Y---------SINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPV
                  S+   LS+ + + L +     + S+      N    +V LP G R + CKWV+                   + + QK+GIDF + FSPV
Subjt:  Y---------SINNYLSYDKLSALHQNFIFNISSIVLPSYYNQAVNIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPV

Query:  AKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGYYQDLKSSSSNTLSKSNYSL------------------------------FTKGS
         K+ +++ +LSL AS   ++ Q+D+  AFL+G+L EE+YM+ P G+    K      L+KS Y L                              F + S
Subjt:  AKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGYYQDLKSSSSNTLSKSNYSL------------------------------FTKGS

Query:  GSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLEL--SQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIP
         ++FI LL+YVDD+LI+G     I  +K  L   F +KDLG A+  LG+++   +++  +++SQ KY  ++LE       KP S P A +LKL+     P
Subjt:  GSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLEL--SQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIP

Query:  LNLDDASS-----YRRLIGSLLYLQI-SRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDF
          +++  +     Y   +GSL+Y  + +RP+++ AV  +S ++  P  EH  A   +LRYL+GT G  +    ++   LK Y D+D    +D R+  T +
Subjt:  LNLDDASS-----YRRLIGSLLYLQI-SRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDF

Query:  CKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKIL
                ISW+SK Q  V+ S+ EAEY A      E+ W+   L++L ++     +V+CD+ + + ++ N  +H  TKHI +  H++R+ + + +LK+L
Subjt:  CKFLGDSLISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKIL

Query:  PVRSHSQLANMFTK
         + ++   A+M TK
Subjt:  PVRSHSQLANMFTK

P92519 Uncharacterized mitochondrial protein AtMg00810 | e value: 8.5e-44 | %identity: 41.92
Query:  LLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLT-ATAGIPLNLDDAS
        LL+YVDDIL+ G S T +  +   L S F +KDLG   YFLG+++     G+++SQ KY  QIL ++G L  KP S P    L  + +TA  P    D S
Subjt:  LLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLT-ATAGIPLNLDDAS

Query:  SYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKS
         +R ++G+L YL ++RP++S+AV+ +   + +P          +LRY+KGT    +++   +   ++A+ DSDW  C  TRR  T FC FLG ++ISW +
Subjt:  SYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKS

Query:  KKQATVSRSSAEAEYRAFAMVSSELTWVS
        K+Q TVSRSS E EYRA A+ ++ELTW S
Subjt:  KKQATVSRSSAEAEYRAFAMVSSELTWVS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 5.2e-102 | %identity: 26.81
Query:  CTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHIC---NDRSLFQNWNQVFDIAV
        C  CG++GH   +C +L  +        + N++ PP   +   P AN     SP           S  W+LDSGA+ HI    N+ SL Q +    D+ V
Subjt:  CTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHIC---NDRSLFQNWNQVFDIAV

Query:  ALHNGFQIKVDYIGNVRV---SESLMLNDVLFLPKFAYNLIS-------DRLSLKM---------------IGKVNNKHGLYLLNFIDSSNHHTTAGVSC
        A  +G  I + + G+  +   S  L L+++L++P    NLIS       + +S++                + +   K  LY      S      A  S 
Subjt:  ALHNGFQIKVDYIGNVRV---SESLMLNDVLFLPKFAYNLIS-------DRLSLKM---------------IGKVNNKHGLYLLNFIDSSNHHTTAGVSC

Query:  AISIETWHHFLDHLSPKCLSLLKDTLSL---------------------PRPFKHST---------------------YSGYKYFLTIVDNCSRFTWTYL
          +  +WH  L H +P  L+ +    SL                       PF  ST                     +  Y+Y++  VD+ +R+TW Y 
Subjt:  AISIETWHHFLDHLSPKCLSLLKDTLSL---------------------PRPFKHST---------------------YSGYKYFLTIVDNCSRFTWTYL

Query:  MRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALE-LHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFL
        ++ KS        F  L+E  F   I  F SDN  E +     F+  G  H  S    P+ N + ERKH+H+      L   + +P  +W      A +L
Subjt:  MRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALE-LHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFL

Query:  INRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCV-----------DISDE------------------LNLGNTPHTAEEQVVFNE--------------
        INR   PLL  +SPF+ L+    NY  LRVFGC             + D+                  L+L  +       V F+E              
Subjt:  INRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGCV-----------DISDE------------------LNLGNTPHTAEEQVVFNE--------------

Query:  --------GIVQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRS---------------------------------------TRRHQPVGFL
                  V +P TT+ T  T ++   +  +P+ AA PP   +   R S                                       T+ H      
Subjt:  --------GIVQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRS---------------------------------------TRRHQPVGFL

Query:  RDYHCNLLQGQV----------------------------------------------------LNTTTL--------------YSINNYLSYDK-----
        ++   N    Q+                                                    LNT ++              YS+   L+ +      
Subjt:  RDYHCNLLQGQV----------------------------------------------------LNTTTL--------------YSINNYLSYDK-----

Query:  LSALHQNFIFN-ISSIVLPSYYNQAVNIVPLPNGH-RVIGCKWVYRR-------------------YNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGW
        + AL      N + S +     N   ++VP P  H  ++GC+W++ +                   YNQ+ G+D+ +TFSPV K  +++++L +     W
Subjt:  LSALHQNFIFN-ISSIVLPSYYNQAVNIVPLPNGH-RVIGCKWVYRR-------------------YNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGW

Query:  DLVQMDINNAFLNGELFEEVYM--------------------------QLPFGYYQDLKS---SSSNTLSKSNYSLFTKGSGSSFIALLVYVDDILIIGP
         + Q+D+NNAFL G L ++VYM                          Q P  +Y +L++   +     S S+ SLF    G S + +LVYVDDILI G 
Subjt:  DLVQMDINNAFLNGELFEEVYM--------------------------QLPFGYYQDLKS---SSSNTLSKSNYSLFTKGSGSSFIALLVYVDDILIIGP

Query:  SPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQI
         PT +      L   F +KD     YFLG+E  +   G+++SQR+Y L +L  +  +  KP + P A + KL+  +G    L D + YR ++GSL YL  
Subjt:  SPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSLLYLQI

Query:  SRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAE
        +RP++S+AV++LS ++  P  EHL A   +LRYL GT    IFL   N   L AY D+DW    D       +  +LG   ISW SKKQ  V RSS EAE
Subjt:  SRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRSSAEAE

Query:  YRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN
        YR+ A  SSE+ W+  +L +L I      +++CDN     +  NP FH   KHI ID HF+R+++ +GAL+++ V +H QLA+  TKPL+
Subjt:  YRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 6.2e-95 | %identity: 26.36
Query:  CTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHIC---NDRSLFQNWNQVFDIAV
        C  C ++GH   +C +LH +         ++  N  Q++S  TP    QP+ + +          +  W+LDSGA+ HI    N+ S  Q +    D+ +
Subjt:  CTNCGIKGHVVDKCYKLHGYPPGYKSRINENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHIC---NDRSLFQNWNQVFDIAV

Query:  ALHNGFQIKVDYIGNVRV---SESLMLNDVLFLPKFAYNLIS-------DRLSLKM---------------IGKVNNKHGLYLLNFIDSSNHHTTAGVSC
        A  +G  I + + G+  +   S SL LN VL++P    NLIS       +R+S++                + +   K  LY      S      A    
Subjt:  ALHNGFQIKVDYIGNVRV---SESLMLNDVLFLPKFAYNLIS-------DRLSLKM---------------IGKVNNKHGLYLLNFIDSSNHHTTAGVSC

Query:  AISIETWHHFLDHLSPKCLSLLKDTLSLP---------------------RPFKHST----------YS-----------GYKYFLTIVDNCSRFTWTYL
          +  +WH  L H S   L+ +    SLP                      PF +ST          YS            Y+Y++  VD+ +R+TW Y 
Subjt:  AISIETWHHFLDHLSPKCLSLLKDTLSLP---------------------RPFKHST----------YS-----------GYKYFLTIVDNCSRFTWTYL

Query:  MRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALE-LHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFL
        ++ KS        F +LVE  F   I    SDN  E +   +  +  G  H  S    P+ N + ERKH+H+  +   L   + VP  +W      A +L
Subjt:  MRSKSDALYIVPRFIALVETHFSKTIKVFRSDNALE-LHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFL

Query:  INRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGC----------------------------------------------------------------VDI
        INR   PLL  +SPF+ L+ +  NY  L+VFGC                                                                V  
Subjt:  INRTSIPLLSNKSPFEVLYDKDVNYPSLRVFGC----------------------------------------------------------------VDI

Query:  SDELNLGNTPH----------------------------------------------------------------------TAEEQVVFNEG----IVQN
        S E    + P+                                                                      TA+     N      I+ N
Subjt:  SDELNLGNTPH----------------------------------------------------------------------TAEEQVVFNEG----IVQN

Query:  P----------------------STTITTDSTKIIEPNNIVEPNEAANP-------PHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTLYSINN
        P                      S  I T ST I EPN+    + +  P       P  I V  +     H      +D        Q  +  T  + N+
Subjt:  P----------------------STTITTDSTKIIEPNNIVEPNEAANP-------PHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTLYSINN

Query:  YLSYDKLSALHQNFIFNISSIVLPSYYNQAVNIV-PLPNGHRVIGCKWVYRR-------------------YNQKEGIDFIDTFSPVAKIVTVKLLLSLT
               +     +   + S +     N   ++V P P    ++GC+W++ +                   YNQ+ G+D+ +TFSPV K  +++++L + 
Subjt:  YLSYDKLSALHQNFIFNISSIVLPSYYNQAVNIV-PLPNGHRVIGCKWVYRR-------------------YNQKEGIDFIDTFSPVAKIVTVKLLLSLT

Query:  ASFGWDLVQMDINNAFLNGELFEEVYM--------------------------QLPFGYYQDLKS---SSSNTLSKSNYSLFTKGSGSSFIALLVYVDDI
            W + Q+D+NNAFL G L +EVYM                          Q P  +Y +L++   +     S S+ SLF    G S I +LVYVDDI
Subjt:  ASFGWDLVQMDINNAFLNGELFEEVYM--------------------------QLPFGYYQDLKS---SSSNTLSKSNYSLFTKGSGSSFIALLVYVDDI

Query:  LIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSL
        LI G     +      L   F +K+  +  YFLG+E  +   G+++SQR+Y L +L  +  L  KP + P A++ KLT  +G    L D + YR ++GSL
Subjt:  LIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSL

Query:  LYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRS
         YL  +RP++S+AV++LS Y+  P  +H +A   +LRYL GT    IFL   N   L AY D+DW    D       +  +LG   ISW SKKQ  V RS
Subjt:  LYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRS

Query:  SAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN
        S EAEYR+ A  SSEL W+  +L +L I      +++CDN     +  NP FH   KHI +D HF+R+++ +GAL+++ V +H QLA+  TKPL+
Subjt:  SAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN

Arabidopsis top hits (e value, %identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 3.5e-101 | %identity: 38.7
Query:  IVQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPS
        +V +   + ++ S  I+   NI   N+   P       +  S RR +   +L+DY+C+ +      + T++ I+ +LSY+K+S L+ +F+  I+    PS
Subjt:  IVQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTLYSINNYLSYDKLSALHQNFIFNISSIVLPS

Query:  YYNQAV----------------------NIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFG
         YN+A                        I  LP   + IGCKWVY                   + Y Q+EGIDFI+TFSPV K+ +VKL+L+++A + 
Subjt:  YYNQAV----------------------NIVPLPNGHRVIGCKWVY-------------------RRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFG

Query:  WDLVQMDINNAFLNGELFEEVYMQLPFGY-------------------YQDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDI
        + L Q+DI+NAFLNG+L EE+YM+LP GY                      LK +S       S TL       S S+++ F K + + F+ +LVYVDDI
Subjt:  WDLVQMDINNAFLNGELFEEVYMQLPFGY-------------------YQDLKSSS-------SNTL-------SKSNYSLFTKGSGSSFIALLVYVDDI

Query:  LIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSL
        +I   +   +  +K+ L+S F L+DLG  KYFLGLE+++S  GI I QRKY L +L+++G L  KP+S P   ++  +A +G   +  DA +YRRLIG L
Subjt:  LIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLTATAGIPLNLDDASSYRRLIGSL

Query:  LYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRS
        +YLQI+R ++SFAV+KLS +   P   H  A   +L Y+KGT GQ +F ++    QL+ + D+ + SC DTRR    +C FLG SLISWKSKKQ  VS+S
Subjt:  LYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKSKKQATVSRS

Query:  SAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKII
        SAEAEYRA +  + E+ W++   ++L +     TL+FCDN A + IATN  FHE TKHI+ DCH VR++ +
Subjt:  SAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKII

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e value: 5.3e-17 | %identity: 47.56
Query:  LYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFL
        +YL I+RP+++FAV++LS + +   +  + A + +L Y+KGT GQ +F +AT++ QLKA+ DSDW SC DTRR VT FC  +
Subjt:  LYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFL

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 6.0e-45 | %identity: 41.92
Query:  LLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLT-ATAGIPLNLDDAS
        LL+YVDDIL+ G S T +  +   L S F +KDLG   YFLG+++     G+++SQ KY  QIL ++G L  KP S P    L  + +TA  P    D S
Subjt:  LLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNLKLT-ATAGIPLNLDDAS

Query:  SYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKS
         +R ++G+L YL ++RP++S+AV+ +   + +P          +LRY+KGT    +++   +   ++A+ DSDW  C  TRR  T FC FLG ++ISW +
Subjt:  SYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDSLISWKS

Query:  KKQATVSRSSAEAEYRAFAMVSSELTWVS
        K+Q TVSRSS E EYRA A+ ++ELTW S
Subjt:  KKQATVSRSSAEAEYRAFAMVSSELTWVS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 1.8e-04 | %identity: 28.17
Query:  NQAVNIVPLPNGHRVIGCKWVYRR-------------------YNQKEGIDFIDTFSPVAKIVTVKLLLSL
        N+   +VP P    ++GCKWV++                    ++Q+EGI F++T+SPV +  T++ +L++
Subjt:  NQAVNIVPLPNGHRVIGCKWVYRR-------------------YNQKEGIDFIDTFSPVAKIVTVKLLLSL


Sequences
CDS sequence
ATGGGGGACAATATCGGTGATGAAGCTTCAATTGGTGCATTGTTGAATCCGTATTCCTTGCATCATTCTTTCACCTCAACCAGTGTTTTGGTTACTCAGCCGTTGATCGG
TGCTTCCAATTATGGTTCTTGGAATCGTGCCATGATTATGGTTCTCTCAAGTAAGAACAAAGATGGTTTCGTTGATGAAACAGAGAATGCTAAAAAGAAAAATCAGTTAC
GAAAGAAGGATTCTCAGCGACCTATTTGTACAAATTGTGGCATTAAAGGTCATGTAGTTGATAAATGCTACAAGCTTCATGGTTATCCCCCAGGATATAAATCAAGAATT
AATGAGAATGCTGAAAATCCTCCACAAAATAGTTCTCAATCTACACCTACTGCAAACGCTCAACCAAAACCGAGCCCATCACAGCAGCAACAGCAACAACTCACACTGCA
GTCTAAGGAATGGATTCTAGATTCTGGCGCTTCTCGACATATCTGCAATGATCGTTCTTTATTTCAAAATTGGAATCAAGTTTTTGATATTGCAGTTGCTTTGCATAATG
GTTTTCAAATTAAGGTTGACTATATTGGGAATGTTCGTGTATCTGAATCATTGATGTTGAATGATGTTCTCTTTTTACCAAAATTCGCTTACAACTTGATATCAGACAGA
CTTTCCTTGAAGATGATTGGCAAGGTTAACAATAAACATGGACTCTATTTGCTCAACTTTATTGACAGCTCCAATCATCATACTACTGCTGGTGTTTCTTGTGCCATTTC
TATTGAAACTTGGCATCACTTTTTGGACCATTTATCTCCTAAATGTTTATCCTTGTTAAAAGATACTTTGTCTTTACCAAGACCCTTTAAACACTCGACTTATAGTGGAT
ATAAATACTTTTTGACTATTGTTGATAATTGTTCTCGCTTCACATGGACTTATTTGATGCGTTCCAAATCTGATGCTTTGTATATTGTTCCACGCTTCATTGCTCTTGTC
GAGACACATTTCTCCAAGACCATCAAAGTTTTTCGATCAGACAATGCACTAGAACTTCATTTTACTAATCTTTTTGCTGCAAAAGGAACGATTCATCAATTTTCTTGTGT
AGAACGACCACAACAGAACTCTGTCGTTGAGAGAAAGCACCAACACCTTTTCAATGTTGCTCGAGCATTATTCTTCCAGTCTAGAGTTCCAATCAGATTTTGGGGGGATT
GTATATTAACAGCAACATTCCTTATCAATCGAACCTCAATCCCCTTGTTGTCAAATAAGTCTCCCTTTGAAGTGTTATATGACAAAGATGTTAATTACCCTTCTTTAAGA
GTGTTTGGATGTGTTGACATTTCTGATGAACTTAACCTTGGGAATACTCCTCATACAGCAGAAGAGCAAGTGGTTTTTAATGAAGGAATTGTGCAAAATCCTTCTACCAC
CATCACTACTGACTCCACTAAAATTATTGAGCCCAACAATATAGTTGAACCTAATGAAGCTGCCAATCCTCCACATGACATTACTGTTGGTCTAAGAAGATCAACAAGAA
GGCATCAACCAGTTGGTTTTCTTCGAGATTACCATTGTAATTTGCTTCAAGGCCAAGTTTTGAACACCACAACTCTATATTCCATCAACAATTACTTGTCTTACGACAAA
CTATCTGCTTTACATCAGAACTTTATTTTCAATATATCATCAATTGTTTTGCCTTCTTATTATAATCAAGCTGTCAATATTGTTCCTCTACCTAATGGTCATCGTGTTAT
TGGTTGCAAATGGGTGTATCGCAGATATAATCAAAAAGAAGGAATCGATTTCATTGATACTTTCTCCCCAGTAGCAAAAATTGTTACAGTCAAGTTATTATTATCCTTGA
CTGCTTCTTTTGGATGGGATTTGGTGCAAATGGATATCAACAATGCCTTTCTAAATGGGGAGTTATTCGAAGAAGTTTATATGCAACTTCCTTTTGGATATTATCAAGAT
TTGAAATCATCCTCTAGCAACACACTTTCGAAATCCAATTATTCCCTCTTTACAAAAGGCAGTGGTAGTTCCTTTATTGCTCTTTTGGTCTATGTAGATGATATCTTGAT
TATTGGACCATCCCCTACTGAAATCACAACTGTGAAAACTCTTCTCCGATCTCATTTCTTATTAAAAGATTTGGGGAATGCAAAATACTTCCTAGGCCTCGAATTATCAC
AGTCTACAATGGGAATTTACATCTCCCAAAGAAAATATCGTCTTCAAATATTGGAAGATAGTGGTTTTTTAGCAGTTAAACCAACATCTTTCCCATTTGCATCCAATTTA
AAGTTGACTGCTACTGCTGGCATTCCATTGAATTTGGATGATGCTTCCTCCTATAGAAGATTGATTGGGAGCCTTCTATATCTACAAATATCACGACCAAATGTTTCCTT
TGCAGTTCATAAACTTAGCCCATATGTTGCTAAACCATATTCTGAACATTTGTCTGCTGCTCATCACTTACTGCGCTATTTGAAAGGTACTGCTGGGCAAAGAATTTTCT
TAGCTGCTACCAATAACTTCCAACTAAAAGCATATGTTGATTCTGATTGGGATTCATGCTTAGATACCAGAAGATTTGTTACCGATTTTTGCAAATTTTTGGGTGATTCA
TTGATATCTTGGAAGTCAAAGAAACAAGCTACCGTGTCAAGATCTTCAGCTGAAGCAGAATATCGAGCTTTTGCTATGGTTTCCAGTGAGCTGACTTGGGTCTCTCATGT
TTTGAAAGATCTCCATATTAATTTCCCTGCACTAACTCTGGTTTTCTGTGACAATGCAGCTGTTGTTTCCATTGCCACTAATCCAACCTTTCATGAGTGCACTAAGCACA
TTAAGATCGATTGTCATTTTGTTCGGGACAAGATTATTAATGGAGCTTTGAAGATTTTGCCTGTTCGTTCTCATTCACAACTTGCAAACATGTTTACAAAACCCTTAAAT
ATGCTCTTCGACTGA
mRNA sequence
ATGGGGGACAATATCGGTGATGAAGCTTCAATTGGTGCATTGTTGAATCCGTATTCCTTGCATCATTCTTTCACCTCAACCAGTGTTTTGGTTACTCAGCCGTTGATCGG
TGCTTCCAATTATGGTTCTTGGAATCGTGCCATGATTATGGTTCTCTCAAGTAAGAACAAAGATGGTTTCGTTGATGAAACAGAGAATGCTAAAAAGAAAAATCAGTTAC
GAAAGAAGGATTCTCAGCGACCTATTTGTACAAATTGTGGCATTAAAGGTCATGTAGTTGATAAATGCTACAAGCTTCATGGTTATCCCCCAGGATATAAATCAAGAATT
AATGAGAATGCTGAAAATCCTCCACAAAATAGTTCTCAATCTACACCTACTGCAAACGCTCAACCAAAACCGAGCCCATCACAGCAGCAACAGCAACAACTCACACTGCA
GTCTAAGGAATGGATTCTAGATTCTGGCGCTTCTCGACATATCTGCAATGATCGTTCTTTATTTCAAAATTGGAATCAAGTTTTTGATATTGCAGTTGCTTTGCATAATG
GTTTTCAAATTAAGGTTGACTATATTGGGAATGTTCGTGTATCTGAATCATTGATGTTGAATGATGTTCTCTTTTTACCAAAATTCGCTTACAACTTGATATCAGACAGA
CTTTCCTTGAAGATGATTGGCAAGGTTAACAATAAACATGGACTCTATTTGCTCAACTTTATTGACAGCTCCAATCATCATACTACTGCTGGTGTTTCTTGTGCCATTTC
TATTGAAACTTGGCATCACTTTTTGGACCATTTATCTCCTAAATGTTTATCCTTGTTAAAAGATACTTTGTCTTTACCAAGACCCTTTAAACACTCGACTTATAGTGGAT
ATAAATACTTTTTGACTATTGTTGATAATTGTTCTCGCTTCACATGGACTTATTTGATGCGTTCCAAATCTGATGCTTTGTATATTGTTCCACGCTTCATTGCTCTTGTC
GAGACACATTTCTCCAAGACCATCAAAGTTTTTCGATCAGACAATGCACTAGAACTTCATTTTACTAATCTTTTTGCTGCAAAAGGAACGATTCATCAATTTTCTTGTGT
AGAACGACCACAACAGAACTCTGTCGTTGAGAGAAAGCACCAACACCTTTTCAATGTTGCTCGAGCATTATTCTTCCAGTCTAGAGTTCCAATCAGATTTTGGGGGGATT
GTATATTAACAGCAACATTCCTTATCAATCGAACCTCAATCCCCTTGTTGTCAAATAAGTCTCCCTTTGAAGTGTTATATGACAAAGATGTTAATTACCCTTCTTTAAGA
GTGTTTGGATGTGTTGACATTTCTGATGAACTTAACCTTGGGAATACTCCTCATACAGCAGAAGAGCAAGTGGTTTTTAATGAAGGAATTGTGCAAAATCCTTCTACCAC
CATCACTACTGACTCCACTAAAATTATTGAGCCCAACAATATAGTTGAACCTAATGAAGCTGCCAATCCTCCACATGACATTACTGTTGGTCTAAGAAGATCAACAAGAA
GGCATCAACCAGTTGGTTTTCTTCGAGATTACCATTGTAATTTGCTTCAAGGCCAAGTTTTGAACACCACAACTCTATATTCCATCAACAATTACTTGTCTTACGACAAA
CTATCTGCTTTACATCAGAACTTTATTTTCAATATATCATCAATTGTTTTGCCTTCTTATTATAATCAAGCTGTCAATATTGTTCCTCTACCTAATGGTCATCGTGTTAT
TGGTTGCAAATGGGTGTATCGCAGATATAATCAAAAAGAAGGAATCGATTTCATTGATACTTTCTCCCCAGTAGCAAAAATTGTTACAGTCAAGTTATTATTATCCTTGA
CTGCTTCTTTTGGATGGGATTTGGTGCAAATGGATATCAACAATGCCTTTCTAAATGGGGAGTTATTCGAAGAAGTTTATATGCAACTTCCTTTTGGATATTATCAAGAT
TTGAAATCATCCTCTAGCAACACACTTTCGAAATCCAATTATTCCCTCTTTACAAAAGGCAGTGGTAGTTCCTTTATTGCTCTTTTGGTCTATGTAGATGATATCTTGAT
TATTGGACCATCCCCTACTGAAATCACAACTGTGAAAACTCTTCTCCGATCTCATTTCTTATTAAAAGATTTGGGGAATGCAAAATACTTCCTAGGCCTCGAATTATCAC
AGTCTACAATGGGAATTTACATCTCCCAAAGAAAATATCGTCTTCAAATATTGGAAGATAGTGGTTTTTTAGCAGTTAAACCAACATCTTTCCCATTTGCATCCAATTTA
AAGTTGACTGCTACTGCTGGCATTCCATTGAATTTGGATGATGCTTCCTCCTATAGAAGATTGATTGGGAGCCTTCTATATCTACAAATATCACGACCAAATGTTTCCTT
TGCAGTTCATAAACTTAGCCCATATGTTGCTAAACCATATTCTGAACATTTGTCTGCTGCTCATCACTTACTGCGCTATTTGAAAGGTACTGCTGGGCAAAGAATTTTCT
TAGCTGCTACCAATAACTTCCAACTAAAAGCATATGTTGATTCTGATTGGGATTCATGCTTAGATACCAGAAGATTTGTTACCGATTTTTGCAAATTTTTGGGTGATTCA
TTGATATCTTGGAAGTCAAAGAAACAAGCTACCGTGTCAAGATCTTCAGCTGAAGCAGAATATCGAGCTTTTGCTATGGTTTCCAGTGAGCTGACTTGGGTCTCTCATGT
TTTGAAAGATCTCCATATTAATTTCCCTGCACTAACTCTGGTTTTCTGTGACAATGCAGCTGTTGTTTCCATTGCCACTAATCCAACCTTTCATGAGTGCACTAAGCACA
TTAAGATCGATTGTCATTTTGTTCGGGACAAGATTATTAATGGAGCTTTGAAGATTTTGCCTGTTCGTTCTCATTCACAACTTGCAAACATGTTTACAAAACCCTTAAAT
ATGCTCTTCGACTGA
Protein sequence
MGDNIGDEASIGALLNPYSLHHSFTSTSVLVTQPLIGASNYGSWNRAMIMVLSSKNKDGFVDETENAKKKNQLRKKDSQRPICTNCGIKGHVVDKCYKLHGYPPGYKSRI
NENAENPPQNSSQSTPTANAQPKPSPSQQQQQQLTLQSKEWILDSGASRHICNDRSLFQNWNQVFDIAVALHNGFQIKVDYIGNVRVSESLMLNDVLFLPKFAYNLISDR
LSLKMIGKVNNKHGLYLLNFIDSSNHHTTAGVSCAISIETWHHFLDHLSPKCLSLLKDTLSLPRPFKHSTYSGYKYFLTIVDNCSRFTWTYLMRSKSDALYIVPRFIALV
ETHFSKTIKVFRSDNALELHFTNLFAAKGTIHQFSCVERPQQNSVVERKHQHLFNVARALFFQSRVPIRFWGDCILTATFLINRTSIPLLSNKSPFEVLYDKDVNYPSLR
VFGCVDISDELNLGNTPHTAEEQVVFNEGIVQNPSTTITTDSTKIIEPNNIVEPNEAANPPHDITVGLRRSTRRHQPVGFLRDYHCNLLQGQVLNTTTLYSINNYLSYDK
LSALHQNFIFNISSIVLPSYYNQAVNIVPLPNGHRVIGCKWVYRRYNQKEGIDFIDTFSPVAKIVTVKLLLSLTASFGWDLVQMDINNAFLNGELFEEVYMQLPFGYYQD
LKSSSSNTLSKSNYSLFTKGSGSSFIALLVYVDDILIIGPSPTEITTVKTLLRSHFLLKDLGNAKYFLGLELSQSTMGIYISQRKYRLQILEDSGFLAVKPTSFPFASNL
KLTATAGIPLNLDDASSYRRLIGSLLYLQISRPNVSFAVHKLSPYVAKPYSEHLSAAHHLLRYLKGTAGQRIFLAATNNFQLKAYVDSDWDSCLDTRRFVTDFCKFLGDS
LISWKSKKQATVSRSSAEAEYRAFAMVSSELTWVSHVLKDLHINFPALTLVFCDNAAVVSIATNPTFHECTKHIKIDCHFVRDKIINGALKILPVRSHSQLANMFTKPLN
MLFD