; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038653 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038653
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:22395829..22399906
RNA-Seq ExpressionLag0038653
SyntenyLag0038653
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN67403.1 hypothetical protein VITISV_025614 [Vitis vinifera]2.8e-10028.29Show/hide
Query:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL
        +LN A  IKLDRNN++LW+     ++ +     H+ G  +CP   T           +   NP++ MW   D+M++ W+Y+S+T EI  Q+ G +++   
Subjt:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL

Query:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDF-----------------------------
        W AL+  +   + ++   L+   Q TRKG+  M EY+  +KS ADNL   G           L G  G D+                             
Subjt:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDF-----------------------------

Query:  -LCSCLVLMRSTHL-LWWYSNQTYQRSYR-GRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQTKGNNPAALIA
         + + L   +  H      S Q  Q  +   RG   GR +SS  +  CQ+CGK GHT   CYHRF+ NF     N    Q +  N+ NQ +     A++A
Subjt:  -LCSCLVLMRSTHL-LWWYSNQTYQRSYR-GRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQTKGNNPAALIA

Query:  TLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG--------------------------------------LMHVPQISKNLISISRLTMDNSVIVEF
        +   + D AW+ D+GA++HL+  +  LS    Y G                                      L H+ +    L++ + L      +  F
Subjt:  TLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG--------------------------------------LMHVPQISKNLISISRLTMDNSVIVEF

Query:  HDSFCAVKDKETG--------KVLLEGTLKY--------ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLSP-TGRVYTSRHVCFNEQDFPYSQL
        H +   +    T         ++L   +  Y         CYP +RPY   K  +R+++CVF+GY SNHKGY CL+P TGR+Y +RHV F+E  FP+   
Subjt:  HDSFCAVKDKETG--------KVLLEGTLKY--------ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLSP-TGRVYTSRHVCFNEQDFPYSQL

Query:  FSQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSSTRTI-------FTGITSSH-----------TSKLGITKP
         S P  +         ++LP       S+ P S   S          PST+SP  +   SST ++       F  I++S             +K GI+K 
Subjt:  FSQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSSTRTI-------FTGITSSH-----------TSKLGITKP

Query:  KQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHE
        K  F               EPT+   ++    W  AM++EFS+L RN TW LVP     +++G KW++KLK   +G+++R+KARLVAQGF+QT G+++ E
Subjt:  KQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHE

Query:  TYSPVI----------------------------------------------------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLF
        T+SPV+                                                                K APRAWY+KL  +LL W F  S+ADSS+F
Subjt:  TYSPVI----------------------------------------------------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLF

Query:  YFRQNQQVLFLLVYVDDMLI--------------------------------------------------------------------------------
               VL LL+YVDD+L+                                                                                
Subjt:  YFRQNQQVLFLLVYVDDMLI--------------------------------------------------------------------------------

Query:  -------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGP
                                             A P   HWL++KR+LRYL+GTL  G+ +Q ++S+ + GY+DADW +CP DR+S GGY +FLGP
Subjt:  -------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGP

Query:  SLISWSSKKQQVVARSSTESEYRALAHVACEL
        +L+SWSS KQ+VV+RSS ESEYRALA    E+
Subjt:  SLISWSSKKQQVVARSSTESEYRALAHVACEL

CAN68489.1 hypothetical protein VITISV_037543 [Vitis vinifera]2.7e-8724.83Show/hide
Query:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL
        +LN A  IKLDRNN++LW+     ++ +     H+ G  +CP   T           +   NP++ MW   D+M++ W+Y+S+T EI  Q+ G +++   
Subjt:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL

Query:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDF----------------------LCSCLVL
        W AL+  +   + ++   L+   Q TRKG+  M EY+  +KS ADNL   G           L G  G D+                      + + L  
Subjt:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDF----------------------LCSCLVL

Query:  MRSTHL-LWWYSNQTYQRSYR-GRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQTKGNNPAALIATLEHLRDT
         +  H      S Q  Q  +   RG   GR +SS  +  CQ+CGK GHT   CYHRF+ NF     N    Q +  N+ NQ +     A++A+   + D 
Subjt:  MRSTHL-LWWYSNQTYQRSYR-GRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQTKGNNPAALIATLEHLRDT

Query:  AWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCAVKDKETG
        AW+ D+GA++HL+  +  LS    Y G                                 ++HVP I+ NLIS+S+   DN+   EFH  F  VKD+ T 
Subjt:  AWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCAVKDKETG

Query:  KVLLEGTLKYA-----------------------------------------------------------------------------------------
        K+LL+G+L++                                                                                          
Subjt:  KVLLEGTLKYA-----------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------CYPCLRPYQHQKFDFRTTKCVFIGYISN
                                                                                CYP +RPY   K  +R+++CVF+GY SN
Subjt:  ------------------------------------------------------------------------CYPCLRPYQHQKFDFRTTKCVFIGYISN

Query:  HKGYRCLSP-TGRVYTSRHVCFNEQDFPYSQLFSQPQATELTTQHSIL--SWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTS-PSYSTSTSSTRTI
        HKGY CL+P TGR+Y +RHV F+E  FP+     Q  +       + L  S  P++     +T  TS    T    +    P     P    STS     
Subjt:  HKGYRCLSP-TGRVYTSRHVCFNEQDFPYSQLFSQPQATELTTQHSIL--SWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTS-PSYSTSTSSTRTI

Query:  FTGITSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIE
            T+ H     +K GI+K K  F               EPT+   ++    W  AM++EFS+L RN TW LVP     +++G KW++KLK   +G+++
Subjt:  FTGITSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIE

Query:  RHKARLVAQGFSQTPGIDFHETYSPVIKPA---------------PRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI--------
        R+KARLVAQGF+QT G+D+ ET+SPV+K +               P     KL  +LL W F  S+ADSS+F       VL LL+YVDD+L+        
Subjt:  RHKARLVAQGFSQTPGIDFHETYSPVIKPA---------------PRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHV
                 A P   HWL++KR+LRYL+GTL  G+ +Q ++S+ + GY+DADW +CP DR+S GGY +FLGP+L+SWSS KQ+VV+RSS ESEYRALA  
Subjt:  ---------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHV

Query:  ACEL
          E+
Subjt:  ACEL

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]6.9e-9924.98Show/hide
Query:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE
        N L  + S+KLDRNN+ LW+++ +P+++  +L+G++ G   CPE            + +K  N  +  W A DQ L+GW+ NSMT EIA+Q+  CET+K+
Subjt:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE

Query:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRS--THLLWW------------
        LWD  +   G    SQ  YLK      RKG  KM +YL  MK+  D L L G         ++   G D   + +V+  S  T L W             
Subjt:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRS--THLLWW------------

Query:  -----------------------YSNQTYQRSYRG---RGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAA
                               +  ++   ++RG   RG   GRGR    K  CQVCG + H    C+HRF+K ++ + ++ G     ++        A
Subjt:  -----------------------YSNQTYQRSYRG---RGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAA

Query:  LIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCA
         +A+   + D  WY DSGASNH+T         TE+ G                             +++VP I+KNL+S+S+L  DN+++VEF ++ C 
Subjt:  LIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCA

Query:  VKDKETGKVLLEGTLK------------------------------------------------------------------------------------
        VKDK TGKV+L+G LK                                                                                    
Subjt:  VKDKETGKVLLEGTLK------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRC
                                                                         ACYPCL+PY   K  + TT+CVF+GY ++HKGY+C
Subjt:  ----------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRC

Query:  LSPTGRVYTSRHVCFNEQDFPYSQLFSQPQAT------------ELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSS--
        L+  GR++ SRHV FNE  FP+   F   ++              L T  +++    +   E E+   T+ E+S     N D + +   PS   +T    
Subjt:  LSPTGRVYTSRHVCFNEQDFPYSQLFSQPQAT------------ELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSS--

Query:  ---TRTIFTGITSSHT---------SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWI
           T+    G  S +T         SK GI KPK  +  L++T        +EP +   +L+ P W +AM++EF +L  N+TW LVP+  + ++V +KW+
Subjt:  ---TRTIFTGITSSHT---------SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWI

Query:  FKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-------------------------------------------------------------
        FK K   +GS+ER KARLVA+GF QT GID+ ET+SPVI                                                             
Subjt:  FKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-------------------------------------------------------------

Query:  ---KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI-----------------------------------------------
           K APRAW+D L+  LL+W F  +K+DSSLF  +    + FLL+YVDD+++                                               
Subjt:  ---KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI-----------------------------------------------

Query:  ---------------------------------------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPA
                                                                             ++P   HW  +KR+LRYLQGT++  L ++P+
Subjt:  ---------------------------------------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPA

Query:  SSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA
        + + + G+SDADW     DRKS+ G CVFLG +LISWSS+KQ+VV+RSSTESEYRALA +A E+A
Subjt:  SSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]9.3e-9624.67Show/hide
Query:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE
        N L    S+KLDR+N+ LW+++ + +++  +L+G++ G + CPE            + +K  NP++  WIA DQ L+GWL NSM ++IA+Q+  CET+K+
Subjt:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE

Query:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRSTHL-LWW-------------
        LWD  +   G    S+  YLK     TRKG  KM EYL  MK+ +D L L G         ++   G D   + +V+  S  + L W             
Subjt:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRSTHL-LWW-------------

Query:  -----------------YSNQTYQR-------------SYRGRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFE-----KNFNNTQNAQGSSSNSNN
                         ++N+T  R             ++RG   GRG+GR S +K  CQVC  TGH    C +RF+     +N++   + QGS S    
Subjt:  -----------------YSNQTYQR-------------SYRGRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFE-----KNFNNTQNAQGSSSNSNN

Query:  QTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVI
                A IA+  H +D  WY DSGA+NH+T          E+ G                             +++VPQI+KNL+S+S+LT DN+++
Subjt:  QTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVI

Query:  VEFHDSFCAVKDKETGKVLLEGTLK---------------------------------------------------------------------------
        VEF  + C+VKDK TG+ LL+G LK                                                                           
Subjt:  VEFHDSFCAVKDKETGKVLLEGTLK---------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYIS
                                                                                ACYPCL+PY   K  F TT+CVF+GY +
Subjt:  -----------------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYIS

Query:  NHKGYRCLSPTGRVYTSRHVCFNEQDFPYSQLF---SQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNV--------DVQPSTTSPSYST
        +HKGY+C++  GR++ SRHV FNE  FP+   F     P  T       +L       T  ++  P ++  S    H++        + Q  ++    +T
Subjt:  NHKGYRCLSPTGRVYTSRHVCFNEQDFPYSQLF---SQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNV--------DVQPSTTSPSYST

Query:  STSSTRTI---------------FTGI---------TSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRN
        + SST+ I                TG          +++H     SK GI KPK  +  +++T ++      EP S   +L  P W +AM +E+ +L  N
Subjt:  STSSTRTI---------------FTGI---------TSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRN

Query:  QTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-----------------------------------------
         TW LVP+  + +++ +KWIFK K  ++GSIER KARLVA+GF QT G+DF ET+SPV+                                         
Subjt:  QTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-----------------------------------------

Query:  -----------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI---------------------------
                               K APRAWYD LR TL++W F  +K D+SLF+ +      FLL+YVDD+++                           
Subjt:  -----------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI---------------------------

Query:  -----------------------------------------------------------------------------------------AAPKQAHWLSL
                                                                                                 + P   HW  +
Subjt:  -----------------------------------------------------------------------------------------AAPKQAHWLSL

Query:  KRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA
        KR+LRYLQGT +  L ++P++++ + G+ DADW     DRKS GG CVFLG +L+SW+S+KQ+VV+RSSTESEYR+LA +  E++
Subjt:  KRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA

RVX12711.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.3e-8926.92Show/hide
Query:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL
        +LN A  IKLDRNN++LW+     ++ +     H+ G  +CP   T           +   NP++ MW   D+M++ W+Y+S+T EI  Q+ G +++   
Subjt:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL

Query:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDFLCSCLVL--------MRSTH--LLWWYSN
        W AL+  +   + ++   L+   Q TRKG+  M EY+  +KS ADNL   G           L G  G D+      L        + S H  LL     
Subjt:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDFLCSCLVL--------MRSTH--LLWWYSN

Query:  QTYQRSY---------------------------------RGRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQ
         ++Q S                                    RG   GR +SS  +  CQ+CGK GHT   CYHRF+ NF     N    Q +  N+ NQ
Subjt:  QTYQRSY---------------------------------RGRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQ

Query:  TKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQISKNLISISRLTMDN
         +     A++A+   + D AW+ D+GA++HL+  +  LS    Y G                                 ++HVP I+ NLIS+S+   DN
Subjt:  TKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQISKNLISISRLTMDN

Query:  SVIVEFHDSFCAVKDKETGKVLLEGTLKYACY-------PCLRPYQHQKFD--------------------FRTTKCVFIGYISNHKGYRCLSP------
        +   EFH  F  VKD+ T K+LL+G+L++  Y       P    +    +D                      T    FI ++ +   +  + P      
Subjt:  SVIVEFHDSFCAVKDKETGKVLLEGTLKYACY-------PCLRPYQHQKFD--------------------FRTTKCVFIGYISNHKGYRCLSP------

Query:  -----------TGRVYTSRHVCFNEQDFPYSQLFSQPQATELTTQHSILSWLPINYTEHE---STFPTSHENSTG-------------GLHNVDVQPSTT
                       + SR  C    +    + FS   AT     H I S     YT  +   +     H   TG              + +  V P  +
Subjt:  -----------TGRVYTSRHVCFNEQDFPYSQLFSQPQATELTTQHSILSWLPINYTEHE---STFPTSHENSTG-------------GLHNVDVQPSTT

Query:  SPSYSTSTSSTRT-IFTGITSSHTSKLG--ITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKW
        +P  S+S  +  T  F   +S   S L   +T+ K     +S+          EPT+   ++    W  AM++EFS+L RN TW LVP     +++G KW
Subjt:  SPSYSTSTSSTRT-IFTGITSSHTSKLG--ITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKW

Query:  IFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI------------------------------------------------------------
        ++KLK   +G+++R+KARLVAQGF+QT G+D+ ET+SPV+                                                            
Subjt:  IFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI------------------------------------------------------------

Query:  ----KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI----------------------------------------------
            K APRAWY+KL  +LL W F  S+ADSS+F       VL LL+YVDD+L+                                              
Subjt:  ----KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI----------------------------------------------

Query:  -----------------------------------------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQ
                                                                               A P   HWL++KR+LRYL+GTL  G+ +Q
Subjt:  -----------------------------------------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQ

Query:  PASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL
         ++S+ + GY+DADW +CP DR+S GGY +FLGP+L+SWSS KQ+VV+RSS ESEYRALA    E+
Subjt:  PASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL

TrEMBL top hitse value%identityAlignment
A0A2Z6MBG6 Integrase catalytic domain-containing protein3.3e-9924.98Show/hide
Query:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE
        N L  + S+KLDRNN+ LW+++ +P+++  +L+G++ G   CPE            + +K  N  +  W A DQ L+GW+ NSMT EIA+Q+  CET+K+
Subjt:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE

Query:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRS--THLLWW------------
        LWD  +   G    SQ  YLK      RKG  KM +YL  MK+  D L L G         ++   G D   + +V+  S  T L W             
Subjt:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRS--THLLWW------------

Query:  -----------------------YSNQTYQRSYRG---RGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAA
                               +  ++   ++RG   RG   GRGR    K  CQVCG + H    C+HRF+K ++ + ++ G     ++        A
Subjt:  -----------------------YSNQTYQRSYRG---RGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAA

Query:  LIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCA
         +A+   + D  WY DSGASNH+T         TE+ G                             +++VP I+KNL+S+S+L  DN+++VEF ++ C 
Subjt:  LIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCA

Query:  VKDKETGKVLLEGTLK------------------------------------------------------------------------------------
        VKDK TGKV+L+G LK                                                                                    
Subjt:  VKDKETGKVLLEGTLK------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRC
                                                                         ACYPCL+PY   K  + TT+CVF+GY ++HKGY+C
Subjt:  ----------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRC

Query:  LSPTGRVYTSRHVCFNEQDFPYSQLFSQPQAT------------ELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSS--
        L+  GR++ SRHV FNE  FP+   F   ++              L T  +++    +   E E+   T+ E+S     N D + +   PS   +T    
Subjt:  LSPTGRVYTSRHVCFNEQDFPYSQLFSQPQAT------------ELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSS--

Query:  ---TRTIFTGITSSHT---------SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWI
           T+    G  S +T         SK GI KPK  +  L++T        +EP +   +L+ P W +AM++EF +L  N+TW LVP+  + ++V +KW+
Subjt:  ---TRTIFTGITSSHT---------SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWI

Query:  FKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-------------------------------------------------------------
        FK K   +GS+ER KARLVA+GF QT GID+ ET+SPVI                                                             
Subjt:  FKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-------------------------------------------------------------

Query:  ---KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI-----------------------------------------------
           K APRAW+D L+  LL+W F  +K+DSSLF  +    + FLL+YVDD+++                                               
Subjt:  ---KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI-----------------------------------------------

Query:  ---------------------------------------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPA
                                                                             ++P   HW  +KR+LRYLQGT++  L ++P+
Subjt:  ---------------------------------------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPA

Query:  SSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA
        + + + G+SDADW     DRKS+ G CVFLG +LISWSS+KQ+VV+RSSTESEYRALA +A E+A
Subjt:  SSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA

A0A2Z6P4D5 Integrase catalytic domain-containing protein4.5e-9624.67Show/hide
Query:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE
        N L    S+KLDR+N+ LW+++ + +++  +L+G++ G + CPE            + +K  NP++  WIA DQ L+GWL NSM ++IA+Q+  CET+K+
Subjt:  NLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKE

Query:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRSTHL-LWW-------------
        LWD  +   G    S+  YLK     TRKG  KM EYL  MK+ +D L L G         ++   G D   + +V+  S  + L W             
Subjt:  LWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG---------LRGDQGTDFLCSCLVLMRSTHL-LWW-------------

Query:  -----------------YSNQTYQR-------------SYRGRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFE-----KNFNNTQNAQGSSSNSNN
                         ++N+T  R             ++RG   GRG+GR S +K  CQVC  TGH    C +RF+     +N++   + QGS S    
Subjt:  -----------------YSNQTYQR-------------SYRGRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFE-----KNFNNTQNAQGSSSNSNN

Query:  QTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVI
                A IA+  H +D  WY DSGA+NH+T          E+ G                             +++VPQI+KNL+S+S+LT DN+++
Subjt:  QTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG-----------------------------LMHVPQISKNLISISRLTMDNSVI

Query:  VEFHDSFCAVKDKETGKVLLEGTLK---------------------------------------------------------------------------
        VEF  + C+VKDK TG+ LL+G LK                                                                           
Subjt:  VEFHDSFCAVKDKETGKVLLEGTLK---------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYIS
                                                                                ACYPCL+PY   K  F TT+CVF+GY +
Subjt:  -----------------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYIS

Query:  NHKGYRCLSPTGRVYTSRHVCFNEQDFPYSQLF---SQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNV--------DVQPSTTSPSYST
        +HKGY+C++  GR++ SRHV FNE  FP+   F     P  T       +L       T  ++  P ++  S    H++        + Q  ++    +T
Subjt:  NHKGYRCLSPTGRVYTSRHVCFNEQDFPYSQLF---SQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNV--------DVQPSTTSPSYST

Query:  STSSTRTI---------------FTGI---------TSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRN
        + SST+ I                TG          +++H     SK GI KPK  +  +++T ++      EP S   +L  P W +AM +E+ +L  N
Subjt:  STSSTRTI---------------FTGI---------TSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRN

Query:  QTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-----------------------------------------
         TW LVP+  + +++ +KWIFK K  ++GSIER KARLVA+GF QT G+DF ET+SPV+                                         
Subjt:  QTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI-----------------------------------------

Query:  -----------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI---------------------------
                               K APRAWYD LR TL++W F  +K D+SLF+ +      FLL+YVDD+++                           
Subjt:  -----------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLI---------------------------

Query:  -----------------------------------------------------------------------------------------AAPKQAHWLSL
                                                                                                 + P   HW  +
Subjt:  -----------------------------------------------------------------------------------------AAPKQAHWLSL

Query:  KRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA
        KR+LRYLQGT +  L ++P++++ + G+ DADW     DRKS GG CVFLG +L+SW+S+KQ+VV+RSSTESEYR+LA +  E++
Subjt:  KRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA

A0A803PEH4 Uncharacterized protein9.0e-9726.77Show/hide
Query:  LNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKELW
        LNQ  S+KLDRNN+ LW+ +   I++ ++L+G+L+G  +CP    ++         T++ NPEY+ WI  DQ+L+GWLY+SMT  IA++V G  +   L 
Subjt:  LNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKELW

Query:  DALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTGLRGDQ------------GTDF-LCSCLVLMRSTHLLWWYSNQTYQRSY--
          L+  YG  + S+ D  + +IQ TRKG++ M+EYL   K+++   N+  L GD             G D    S +V + +     W   Q    S+  
Subjt:  DALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTGLRGDQ------------GTDF-LCSCLVLMRSTHLLWWYSNQTYQRSY--

Query:  --------------------------------RGRG------------------------RGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFNNTQ
                                        RGRG                        RGRGRG  SGS+  CQV GK GHT AVCY+RF++++  + 
Subjt:  --------------------------------RGRG------------------------RGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFNNTQ

Query:  NAQGSSSNSNNQTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG----------------------------------LMHVPQIS
             + N   QT  NN +A +AT E L   AW+ DSGASNH+T+D ANL+ K +Y G                                  ++ VP+I+
Subjt:  NAQGSSSNSNNQTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG----------------------------------LMHVPQIS

Query:  KNLISISRLTMDNSVIVEFHDSFCAVKDKETGKVLLEGTLK-----------------------------------------------------------
        KNL+S+S+L  DN+V++EF+ +FC VKDK T KVLL G LK                                                           
Subjt:  KNLISISRLTMDNSVIVEFHDSFCAVKDKETGKVLLEGTLK-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGY
                                                                            C+PCLR YQ  KF F + KCV +GY  ++KGY
Subjt:  ------------------------------------------------------------------YACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGY

Query:  RCLSPTGRVYTSRHVCFNEQDFPYSQLF--------------------------------------SQPQATELTTQHSILSWLPINYTEHESTFPTSHE
        +CLSPTGR+Y S+ V FNE  FP+   F                                      + P+A+  T+  S  +  PI + +  S+  +SH+
Subjt:  RCLSPTGRVYTSRHVCFNEQDFPYSQLF--------------------------------------SQPQATELTTQHSILSWLPINYTEHESTFPTSHE

Query:  NSTGGL-------HNVDVQPSTTSPSYSTSTSSTRTIFTGITSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFS
          +          HN+    +   P+ +       T       SH      K+GI KP+        +  Q + +  EP S V ++  P W KAM  EF 
Subjt:  NSTGGL-------HNVDVQPSTTSPSYSTSTSSTRTIFTGITSSH----TSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFS

Query:  SLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVIKPAPRAWYDKLRITLL---DWKFSQSKADSSLFYFR
        +L    T  LVP S  Y+LVGNKW++++K  A+G++ R KARLVA+GF Q PGI++ ET+SP+IK A      ++ +T++    W+ S SKA +SLF+++
Subjt:  SLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVIKPAPRAWYDKLRITLL---DWKFSQSKADSSLFYFR

Query:  QNQQVLFLLVYVDDMLIA---------------------------------APKQAHWLSLKR-------------------------------------
          + ++  L+YVDD++I                                  A + A  L L +                                     
Subjt:  QNQQVLFLLVYVDDMLIA---------------------------------APKQAHWLSLKR-------------------------------------

Query:  ------------VLRYL--------QGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVAC
                     L+YL         GT   GL +  +  +S+ G+SDADW  CP D +SV GYCV+LG +L+SWSSKKQ VV+RSSTESEYRALAHVA 
Subjt:  ------------VLRYL--------QGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVAC

Query:  ELA
        E++
Subjt:  ELA

A0A803PYD1 Uncharacterized protein8.7e-12430.33Show/hide
Query:  PMGNPSWTNLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGT--KMHNPEYDMWIAADQMLVGWLYNSMTVEIAS
        P+  P ++N L+Q  S+KLDRNN+ LW+ +   I++ ++L G LTG+  CP  +  +P ++EE   +   + N EY+  I  DQ+L+GWLY SMT  I S
Subjt:  PMGNPSWTNLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGT--KMHNPEYDMWIAADQMLVGWLYNSMTVEIAS

Query:  QVTGCETTKELWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-------LRGD--QGTDFLCSCLVL------------M
        +V GC +   LW AL+E YG Q+ +  D L+  +Q TRKG+  M EYL   +  AD L + G       LRG+   G D     +V+            +
Subjt:  QVTGCETTKELWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-------LRGD--QGTDFLCSCLVL------------M

Query:  RSTHLLWWYSNQTYQR--------------------------SYRGRG---------------------------RGRGRGRSSGSKLICQVCGKTGHTT
        +ST L +    +  Q                            ++GRG                           RGRGRG    SK  CQVCGK GH+ 
Subjt:  RSTHLLWWYSNQTYQR--------------------------SYRGRG---------------------------RGRGRGRSSGSKLICQVCGKTGHTT

Query:  AVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------
        A+C +RF++++         S  + +Q K N  + L+AT + L D +WY DSGA+NHLT D   L  K EY G                           
Subjt:  AVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------

Query:  -------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCAVKDKETGKVLLEGTLK--------------------------------------------
               L+HVP I+KNLISIS LT DN V VEF   FC VKD+ TGKV+L+ TLK                                            
Subjt:  -------LMHVPQISKNLISISRLTMDNSVIVEFHDSFCAVKDKETGKVLLEGTLK--------------------------------------------

Query:  ------------------------------------------------------------------------------------------YACYPCLRPY
                                                                                                    C+P LRPY
Subjt:  ------------------------------------------------------------------------------------------YACYPCLRPY

Query:  QHQKFDFRTTKCVFIGYISNHKGYRCLSPTGRVYTSRHVCFNEQDFP-YSQLFSQPQATELTTQHSILSWL----PINYTEHESTFP-------------
        Q  KF + + KC+ +GY   HKGY+CLSP GR+Y SR+V FNE +FP +S  F+  Q  +L T  +  SW     PI  T    T P             
Subjt:  QHQKFDFRTTKCVFIGYISNHKGYRCLSPTGRVYTSRHVCFNEQDFP-YSQLFSQPQATELTTQHSILSWL----PINYTEHESTFP-------------

Query:  ---TSHENSTG--------------GLHNVDVQPSTTSPSYSTSTSSTRTIFTGITSSHTSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPA
           +SH + +G               +H+  +      P   T+T+ T  + T       +K GI KPK        +  +I      P S   +L    
Subjt:  ---TSHENSTG--------------GLHNVDVQPSTTSPSYSTSTSSTRTIFTGITSSHTSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPA

Query:  WHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI--------------------------
        W+ AM +EF +L R +TW LVP S   ++VG KWIF+ K  A+GS +R KARLVA+GF Q PG+DF ET+SPV+                          
Subjt:  WHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI--------------------------

Query:  --------------------------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLIAAP---------
                                              K APRAWYD+LR  LL WKF  SKADSS F  ++ Q  + LLVYVDD+++            
Subjt:  --------------------------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLIAAP---------

Query:  --------------------------------KQAHWLS------------------------------------------LKRVLRYLQGTLHVGLSLQ
                                         Q+ ++S                                          +KR+LRYL+GT H GL L 
Subjt:  --------------------------------KQAHWLS------------------------------------------LKRVLRYLQGTLHVGLSLQ

Query:  PASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA
        P+ ++ L+G+SD DW  CP DRKSV GYCV+LG SLISWSSKKQ VVARSSTESEYRALA +A E++
Subjt:  PASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA

A5B1N8 Integrase catalytic domain-containing protein1.3e-10028.29Show/hide
Query:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL
        +LN A  IKLDRNN++LW+     ++ +     H+ G  +CP   T           +   NP++ MW   D+M++ W+Y+S+T EI  Q+ G +++   
Subjt:  LLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCETTKEL

Query:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDF-----------------------------
        W AL+  +   + ++   L+   Q TRKG+  M EY+  +KS ADNL   G           L G  G D+                             
Subjt:  WDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTG-----------LRGDQGTDF-----------------------------

Query:  -LCSCLVLMRSTHL-LWWYSNQTYQRSYR-GRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQTKGNNPAALIA
         + + L   +  H      S Q  Q  +   RG   GR +SS  +  CQ+CGK GHT   CYHRF+ NF     N    Q +  N+ NQ +     A++A
Subjt:  -LCSCLVLMRSTHL-LWWYSNQTYQRSYR-GRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFN----NTQNAQGSSSNSNNQTKGNNPAALIA

Query:  TLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG--------------------------------------LMHVPQISKNLISISRLTMDNSVIVEF
        +   + D AW+ D+GA++HL+  +  LS    Y G                                      L H+ +    L++ + L      +  F
Subjt:  TLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG--------------------------------------LMHVPQISKNLISISRLTMDNSVIVEF

Query:  HDSFCAVKDKETG--------KVLLEGTLKY--------ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLSP-TGRVYTSRHVCFNEQDFPYSQL
        H +   +    T         ++L   +  Y         CYP +RPY   K  +R+++CVF+GY SNHKGY CL+P TGR+Y +RHV F+E  FP+   
Subjt:  HDSFCAVKDKETG--------KVLLEGTLKY--------ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLSP-TGRVYTSRHVCFNEQDFPYSQL

Query:  FSQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSSTRTI-------FTGITSSH-----------TSKLGITKP
         S P  +         ++LP       S+ P S   S          PST+SP  +   SST ++       F  I++S             +K GI+K 
Subjt:  FSQPQATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSSTRTI-------FTGITSSH-----------TSKLGITKP

Query:  KQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHE
        K  F               EPT+   ++    W  AM++EFS+L RN TW LVP     +++G KW++KLK   +G+++R+KARLVAQGF+QT G+++ E
Subjt:  KQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHE

Query:  TYSPVI----------------------------------------------------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLF
        T+SPV+                                                                K APRAWY+KL  +LL W F  S+ADSS+F
Subjt:  TYSPVI----------------------------------------------------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLF

Query:  YFRQNQQVLFLLVYVDDMLI--------------------------------------------------------------------------------
               VL LL+YVDD+L+                                                                                
Subjt:  YFRQNQQVLFLLVYVDDMLI--------------------------------------------------------------------------------

Query:  -------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGP
                                             A P   HWL++KR+LRYL+GTL  G+ +Q ++S+ + GY+DADW +CP DR+S GGY +FLGP
Subjt:  -------------------------------------AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGP

Query:  SLISWSSKKQQVVARSSTESEYRALAHVACEL
        +L+SWSS KQ+VV+RSS ESEYRALA    E+
Subjt:  SLISWSSKKQQVVARSSTESEYRALAHVACEL

SwissProt top hitse value%identityAlignment
P0CV72 Secreted RxLR effector protein 1611.5e-1643.59Show/hide
Query:  VLFLLVYVDDMLIAA----------PKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVA
        +++L+V     L AA          P   HW +LKRVLRYLQ T   GL    A +  L+GYSDADW      R+S  GY   L    +SW SKKQ+ VA
Subjt:  VLFLLVYVDDMLIAA----------PKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVA

Query:  RSSTESEYRALAHVACE
         SSTE EY AL+    E
Subjt:  RSSTESEYRALAHVACE

P92519 Uncharacterized mitochondrial protein AtMg008102.8e-1849.46Show/hide
Query:  PKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL
        P  A +  LKRVLRY++GT+  GL +   S +++  + D+DW  C   R+S  G+C FLG ++ISWS+K+Q  V+RSSTE+EYRALA  A EL
Subjt:  PKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL

P92520 Uncharacterized mitochondrial protein AtMg008203.3e-1944.44Show/hide
Query:  SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQ
        SK GI K    +     TT +      EP S + +L  P W +AM+EE  +L+RN+TW LVP     +++G KW+FK K  ++G+++R KARLVA+GF Q
Subjt:  SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQ

Query:  TPGIDFHETYSPVIKPA
          GI F ETYSPV++ A
Subjt:  TPGIDFHETYSPVIKPA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.2e-4325Show/hide
Query:  ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLS-PTGRVYTSRHVCFNEQDFPYSQLFSQPQATELTTQHSILSWLP-------------------
        ACYP LRPY   K D ++ +CVF+GY      Y CL   T R+Y SRHV F+E  FP+S   +     +   + S   W P                   
Subjt:  ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLS-PTGRVYTSRHVCFNEQDFPYSQLFSQPQATELTTQHSILSWLP-------------------

Query:  ------------------INYTEHESTF-----------------------PTSHENSTGGLHNVD-------------------VQPSTTSPSYSTSTS
                          ++ +  +S+F                       PT  +  T    N                      Q S++SPS +TS S
Subjt:  ------------------INYTEHESTF-----------------------PTSHENSTGGLHNVD-------------------VQPSTTSPSYSTSTS

Query:  STRTIFT---------------------GITSSHT----SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLV
        S+ T  T                        ++H+    +K GI KP   +       A+      EP + + +L    W  AM  E ++   N TWDLV
Subjt:  STRTIFT---------------------GITSSHT----SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLV

Query:  PFSPKY-HLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI----------------------------------------------
        P  P +  +VG +WIF  K  ++GS+ R+KARLVA+G++Q PG+D+ ET+SPVI                                              
Subjt:  PFSPKY-HLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI----------------------------------------------

Query:  ------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLIAA------------------------------
                          K APRAWY +LR  LL   F  S +D+SLF  ++ + ++++LVYVDD+LI                                
Subjt:  ------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLIAA------------------------------

Query:  ---------------------------------------------------------------------------------------PKQAHWLSLKRVL
                                                                                               P + H  +LKR+L
Subjt:  ---------------------------------------------------------------------------------------PKQAHWLSLKRVL

Query:  RYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL
        RYL GT + G+ L+  +++SL  YSDADW     D  S  GY V+LG   ISWSSKKQ+ V RSSTE+EYR++A+ + E+
Subjt:  RYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.2e-1721.92Show/hide
Query:  MGNPSWTNLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVT
        + N S  N +N +   KL   N+++W      +   Y+L G L G       +T +PP+    +     NP+Y  W   D+++   +  ++++ +   V+
Subjt:  MGNPSWTNLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVT

Query:  GCETTKELWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTGLRGDQGTDF--------------------------------
           T  ++W+ L++ Y   +      L+  ++Q  KG   + +Y+  + +  D L L G   D                                     
Subjt:  GCETTKELWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTGLRGDQGTDF--------------------------------

Query:  -----------LCSCLVLMRSTHLLWWYSNQTYQRSYRGRGRGR--GRGRSSGSKL--------------------ICQVCGKTGHTTAVCYHRFEKNFN
                   + S  V+  + + +   +  T   +  G    R   R  ++ SK                      CQ+CG  GH+   C     ++F 
Subjt:  -----------LCSCLVLMRSTHLLWWYSNQTYQRSYRGRGRGR--GRGRSSGSKL--------------------ICQVCGKTGHTTAVCYHRFEKNFN

Query:  NTQNAQGSSSNSNNQTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQ
        ++ N+Q   S          P A +A         W LDSGA++H+T+D  NLSL   Y G                                 +++VP 
Subjt:  NTQNAQGSSSNSNNQTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQ

Query:  ISKNLISISRLTMDNSVIVEFHDSFCAVKDKETGKVLLEGTLKYACY
        I KNLIS+ RL   N V VEF  +   VKD  TG  LL+G  K   Y
Subjt:  ISKNLISISRLTMDNSVIVEFHDSFCAVKDKETGKVLLEGTLKYACY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-4225.73Show/hide
Query:  ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLS-PTGRVYTSRHVCFNEQDFPYSQL-----FSQPQATELT---TQHSILSWLPI----------
        ACYP LRPY   K + ++ +C F+GY      Y CL  PTGR+YTSRHV F+E+ FP+S        SQ Q ++       H+ L   P+          
Subjt:  ACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLS-PTGRVYTSRHVCFNEQDFPYSQL-----FSQPQATELT---TQHSILSWLPI----------

Query:  --------------------------------------NYTEHESTFPTSHENSTGG-------LHNVDV-QPSTTSPSY-----------------STS
                                                  H    PT+  + T         L+N +   PS  SP+                  STS
Subjt:  --------------------------------------NYTEHESTFPTSHENSTGG-------LHNVDV-QPSTTSPSY-----------------STS

Query:  TSSTRTIFTGITS-----------------------SHT----SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQT
         S   +  +  TS                       +H+    +K GI KP Q +   +   A       EP + + ++    W +AM  E ++   N T
Subjt:  TSSTRTIFTGITS-----------------------SHT----SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQT

Query:  WDLV-PFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI------------------------------------------
        WDLV P  P   +VG +WIF  K  ++GS+ R+KARLVA+G++Q PG+D+ ET+SPVI                                          
Subjt:  WDLV-PFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVI------------------------------------------

Query:  ----------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLIAA--------------------------
                              K APRAWY +LR  LL   F  S +D+SLF  ++ + ++++LVYVDD+LI                            
Subjt:  ----------------------KPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFLLVYVDDMLIAA--------------------------

Query:  -------------------------------------------------------------------------------------------PKQAHWLSL
                                                                                                   P   HW +L
Subjt:  -------------------------------------------------------------------------------------------PKQAHWLSL

Query:  KRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL
        KRVLRYL GT   G+ L+  +++SL  YSDADW     D  S  GY V+LG   ISWSSKKQ+ V RSSTE+EYR++A+ + EL
Subjt:  KRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.8e-1623.35Show/hide
Query:  TNLL--NQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCET
        TN+L  N +   KL   N+++W      +   Y+L G L G       +T +PP+    +     NP+Y  W   D+++   +  ++++ +   V+   T
Subjt:  TNLL--NQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQMLVGWLYNSMTVEIASQVTGCET

Query:  TKELWDALKEFYGVQA---SSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNL------------------NLTGLR----GDQGTDFLCSCLVLMRST
          ++W+ L++ Y   +    +Q  ++ R  Q    G  K  ++   ++   +NL                  +LT +       +      +   ++  T
Subjt:  TKELWDALKEFYGVQA---SSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNL------------------NLTGLR----GDQGTDFLCSCLVLMRST

Query:  HLLWWYSNQTYQRSYRGRGRGRG-----------RGRSSGSKL----------ICQVCGKTGHTTAVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAA
          +  + N    R+   RG  R            +  SSGS+            CQ+C   GH+   C    +  F +T N Q S+S          P A
Subjt:  HLLWWYSNQTYQRSYRGRGRGRG-----------RGRSSGSKL----------ICQVCGKTGHTTAVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAA

Query:  LIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQISKNLISISRLTMDNSVIVEFHD
         +A         W LDSGA++H+T+D  NLS    Y G                                 +++VP I KNLIS+ RL   N V VEF  
Subjt:  LIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAG---------------------------------LMHVPQISKNLISISRLTMDNSVIVEFHD

Query:  SFCAVKDKETGKVLLEGTLKYACY
        +   VKD  TG  LL+G  K   Y
Subjt:  SFCAVKDKETGKVLLEGTLKYACY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.0e-2824.34Show/hide
Query:  EPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPV--------------
        EP+++  +     W  AM +E  ++    TW++    P    +G KW++K+K  ++G+IER+KARLVA+G++Q  GIDF ET+SPV              
Subjt:  EPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPV--------------

Query:  ------------------------------------------------------IKPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFL--LV
                                                              +K A R W+ K  +TL+ + F QS +D +  YF +    LFL  LV
Subjt:  ------------------------------------------------------IKPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQVLFL--LV

Query:  YVDDMLIA--------------------------------------------------------------------------------------------
        YVDD++I                                                                                             
Subjt:  YVDDMLIA--------------------------------------------------------------------------------------------

Query:  -------------------------APKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVV
                                 AP+ AH  ++ ++L Y++GT+  GL     + + L  +SDA + +C   R+S  GYC+FLG SLISW SKKQQVV
Subjt:  -------------------------APKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVV

Query:  ARSSTESEYRALAHVACEL
        ++SS E+EYRAL+    E+
Subjt:  ARSSTESEYRALAHVACEL

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.3e-0636.21Show/hide
Query:  AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYC
        +A + A   ++ +VL Y++GT+  GL     S + L  ++D+DW +CP  R+SV G+C
Subjt:  AAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYC

ATMG00810.1 DNA/RNA polymerases superfamily protein2.0e-1949.46Show/hide
Query:  PKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL
        P  A +  LKRVLRY++GT+  GL +   S +++  + D+DW  C   R+S  G+C FLG ++ISWS+K+Q  V+RSSTE+EYRALA  A EL
Subjt:  PKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACEL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.3e-2044.44Show/hide
Query:  SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQ
        SK GI K    +     TT +      EP S + +L  P W +AM+EE  +L+RN+TW LVP     +++G KW+FK K  ++G+++R KARLVA+GF Q
Subjt:  SKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAWHKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQ

Query:  TPGIDFHETYSPVIKPA
          GI F ETYSPV++ A
Subjt:  TPGIDFHETYSPVIKPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAACACCAACCCAATTATCAAGAAGTCTGGACCTGTGGAAAAAGACCCTACTACCTTTGAGGTGAAATATGGAAAAACTCCAATGGGCAATCCATCTTGGACCAA
TTTGCTCAACCAAGCTACTTCAATCAAGCTTGACAGAAACAACTTTATGTTGTGGCAAAACATAGCTATCCCTATTCTTAAAAGCTACCAGCTGAATGGGCACCTTACTG
GTAAGAGTGTTTGTCCTGAGATGACAACTGTGATCCCACCATCCGAAGAGGAACCCGAAGGTACGAAGATGCATAACCCTGAATATGACATGTGGATAGCTGCTGATCAA
ATGCTGGTGGGATGGTTGTATAACTCCATGACCGTGGAGATAGCATCTCAAGTCACGGGTTGTGAAACAACTAAAGAACTATGGGATGCTCTTAAAGAATTCTATGGAGT
GCAAGCATCATCTCAACAAGACTATCTAAAAAGGATGATTCAACAAACAAGAAAAGGCAACTCAAAAATGACTGAATACTTAGCTTTAATGAAAAGCTTTGCTGACAATC
TCAACTTGACAGGTCTCCGTGGAGACCAAGGAACTGATTTCCTATGTAGTTGCTTGGTCTTGATGAGGAGTACACACCTATTGTGGTGGTACTCAAATCAAACCTACCAA
AGAAGCTATCGTGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAAGTTCTGGCTCGAAATTAATTTGCCAAGTTTGTGGCAAAACTGGTCATACAACTGCTGTTTGCTATCA
TCGATTTGAAAAGAATTTCAATAATACTCAGAATGCCCAAGGATCATCTTCAAATTCTAACAACCAGACCAAAGGAAACAATCCAGCAGCCTTAATAGCCACTCTTGAGC
ATCTAAGGGACACCGCTTGGTACTTGGATAGTGGAGCAAGCAACCACTTGACAGCTGATCTAGCTAATCTGAGTTTGAAAACTGAATATGCAGGTCTTATGCATGTACCA
CAAATAAGTAAAAATTTGATTAGCATATCAAGATTAACTATGGATAACTCTGTGATTGTTGAGTTTCATGACTCTTTTTGTGCTGTTAAGGACAAGGAAACGGGGAAAGT
TCTTCTGGAAGGGACTCTTAAATATGCATGTTATCCTTGTCTCAGGCCGTATCAACATCAAAAGTTTGACTTTCGCACTACTAAGTGTGTCTTTATTGGTTATATCAGCA
ATCACAAGGGCTATAGATGTCTTAGTCCAACGGGTCGCGTATATACTTCAAGGCATGTGTGTTTTAATGAGCAAGACTTTCCTTATTCTCAGTTATTTTCTCAGCCTCAA
GCTACTGAATTGACTACTCAGCATTCAATTTTGTCTTGGCTGCCTATTAACTACACTGAGCATGAATCTACATTTCCAACCTCTCATGAAAACTCTACTGGTGGTCTTCA
TAATGTTGATGTTCAGCCTTCCACCACTTCTCCAAGCTATTCTACTAGTACTAGCTCCACACGAACCATATTTACTGGAATTACTTCTTCTCACACAAGCAAGCTTGGTA
TTACTAAACCAAAGCAGGTTTTTGGTTGTCTTTCTCAAACAACAGCTCAAATTGATTGGTCTTGCATTGAACCAACCTCTCATGTTGTTTCTCTCACAGTTCCAGCTTGG
CACAAAGCTATGAAAGAGGAATTCTCTTCCTTGACACGTAATCAGACTTGGGATTTAGTTCCATTCTCTCCAAAGTATCACTTAGTTGGTAACAAGTGGATTTTTAAACT
AAAACGGGTTGCTAATGGTTCTATTGAAAGACACAAGGCAAGACTAGTTGCTCAAGGTTTTTCTCAAACGCCGGGTATAGATTTTCATGAAACCTACAGTCCTGTGATCA
AGCCGGCTCCCCGTGCTTGGTATGACAAGCTTAGAATTACCTTGCTTGACTGGAAATTCTCACAATCCAAGGCTGATTCTTCATTATTTTATTTTAGGCAAAATCAGCAA
GTGTTATTTCTCTTAGTGTATGTTGATGATATGTTAATAGCAGCACCGAAGCAAGCTCACTGGTTGTCACTTAAACGTGTACTACGTTATCTTCAAGGTACTCTTCATGT
TGGCCTCTCCTTACAACCAGCTTCTTCTATTTCTCTTATTGGATATTCAGATGCAGATTGGGTTGCTTGCCCCATTGATAGGAAATCAGTTGGAGGCTATTGTGTGTTCC
TCGGGCCCTCTCTGATCTCATGGTCGTCTAAAAAGCAACAAGTGGTTGCTCGTTCTAGCACTGAGTCTGAATACCGTGCTCTTGCTCATGTTGCTTGTGAACTTGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTAACACCAACCCAATTATCAAGAAGTCTGGACCTGTGGAAAAAGACCCTACTACCTTTGAGGTGAAATATGGAAAAACTCCAATGGGCAATCCATCTTGGACCAA
TTTGCTCAACCAAGCTACTTCAATCAAGCTTGACAGAAACAACTTTATGTTGTGGCAAAACATAGCTATCCCTATTCTTAAAAGCTACCAGCTGAATGGGCACCTTACTG
GTAAGAGTGTTTGTCCTGAGATGACAACTGTGATCCCACCATCCGAAGAGGAACCCGAAGGTACGAAGATGCATAACCCTGAATATGACATGTGGATAGCTGCTGATCAA
ATGCTGGTGGGATGGTTGTATAACTCCATGACCGTGGAGATAGCATCTCAAGTCACGGGTTGTGAAACAACTAAAGAACTATGGGATGCTCTTAAAGAATTCTATGGAGT
GCAAGCATCATCTCAACAAGACTATCTAAAAAGGATGATTCAACAAACAAGAAAAGGCAACTCAAAAATGACTGAATACTTAGCTTTAATGAAAAGCTTTGCTGACAATC
TCAACTTGACAGGTCTCCGTGGAGACCAAGGAACTGATTTCCTATGTAGTTGCTTGGTCTTGATGAGGAGTACACACCTATTGTGGTGGTACTCAAATCAAACCTACCAA
AGAAGCTATCGTGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAAGTTCTGGCTCGAAATTAATTTGCCAAGTTTGTGGCAAAACTGGTCATACAACTGCTGTTTGCTATCA
TCGATTTGAAAAGAATTTCAATAATACTCAGAATGCCCAAGGATCATCTTCAAATTCTAACAACCAGACCAAAGGAAACAATCCAGCAGCCTTAATAGCCACTCTTGAGC
ATCTAAGGGACACCGCTTGGTACTTGGATAGTGGAGCAAGCAACCACTTGACAGCTGATCTAGCTAATCTGAGTTTGAAAACTGAATATGCAGGTCTTATGCATGTACCA
CAAATAAGTAAAAATTTGATTAGCATATCAAGATTAACTATGGATAACTCTGTGATTGTTGAGTTTCATGACTCTTTTTGTGCTGTTAAGGACAAGGAAACGGGGAAAGT
TCTTCTGGAAGGGACTCTTAAATATGCATGTTATCCTTGTCTCAGGCCGTATCAACATCAAAAGTTTGACTTTCGCACTACTAAGTGTGTCTTTATTGGTTATATCAGCA
ATCACAAGGGCTATAGATGTCTTAGTCCAACGGGTCGCGTATATACTTCAAGGCATGTGTGTTTTAATGAGCAAGACTTTCCTTATTCTCAGTTATTTTCTCAGCCTCAA
GCTACTGAATTGACTACTCAGCATTCAATTTTGTCTTGGCTGCCTATTAACTACACTGAGCATGAATCTACATTTCCAACCTCTCATGAAAACTCTACTGGTGGTCTTCA
TAATGTTGATGTTCAGCCTTCCACCACTTCTCCAAGCTATTCTACTAGTACTAGCTCCACACGAACCATATTTACTGGAATTACTTCTTCTCACACAAGCAAGCTTGGTA
TTACTAAACCAAAGCAGGTTTTTGGTTGTCTTTCTCAAACAACAGCTCAAATTGATTGGTCTTGCATTGAACCAACCTCTCATGTTGTTTCTCTCACAGTTCCAGCTTGG
CACAAAGCTATGAAAGAGGAATTCTCTTCCTTGACACGTAATCAGACTTGGGATTTAGTTCCATTCTCTCCAAAGTATCACTTAGTTGGTAACAAGTGGATTTTTAAACT
AAAACGGGTTGCTAATGGTTCTATTGAAAGACACAAGGCAAGACTAGTTGCTCAAGGTTTTTCTCAAACGCCGGGTATAGATTTTCATGAAACCTACAGTCCTGTGATCA
AGCCGGCTCCCCGTGCTTGGTATGACAAGCTTAGAATTACCTTGCTTGACTGGAAATTCTCACAATCCAAGGCTGATTCTTCATTATTTTATTTTAGGCAAAATCAGCAA
GTGTTATTTCTCTTAGTGTATGTTGATGATATGTTAATAGCAGCACCGAAGCAAGCTCACTGGTTGTCACTTAAACGTGTACTACGTTATCTTCAAGGTACTCTTCATGT
TGGCCTCTCCTTACAACCAGCTTCTTCTATTTCTCTTATTGGATATTCAGATGCAGATTGGGTTGCTTGCCCCATTGATAGGAAATCAGTTGGAGGCTATTGTGTGTTCC
TCGGGCCCTCTCTGATCTCATGGTCGTCTAAAAAGCAACAAGTGGTTGCTCGTTCTAGCACTGAGTCTGAATACCGTGCTCTTGCTCATGTTGCTTGTGAACTTGCCTAG
Protein sequenceShow/hide protein sequence
MANTNPIIKKSGPVEKDPTTFEVKYGKTPMGNPSWTNLLNQATSIKLDRNNFMLWQNIAIPILKSYQLNGHLTGKSVCPEMTTVIPPSEEEPEGTKMHNPEYDMWIAADQ
MLVGWLYNSMTVEIASQVTGCETTKELWDALKEFYGVQASSQQDYLKRMIQQTRKGNSKMTEYLALMKSFADNLNLTGLRGDQGTDFLCSCLVLMRSTHLLWWYSNQTYQ
RSYRGRGRGRGRGRSSGSKLICQVCGKTGHTTAVCYHRFEKNFNNTQNAQGSSSNSNNQTKGNNPAALIATLEHLRDTAWYLDSGASNHLTADLANLSLKTEYAGLMHVP
QISKNLISISRLTMDNSVIVEFHDSFCAVKDKETGKVLLEGTLKYACYPCLRPYQHQKFDFRTTKCVFIGYISNHKGYRCLSPTGRVYTSRHVCFNEQDFPYSQLFSQPQ
ATELTTQHSILSWLPINYTEHESTFPTSHENSTGGLHNVDVQPSTTSPSYSTSTSSTRTIFTGITSSHTSKLGITKPKQVFGCLSQTTAQIDWSCIEPTSHVVSLTVPAW
HKAMKEEFSSLTRNQTWDLVPFSPKYHLVGNKWIFKLKRVANGSIERHKARLVAQGFSQTPGIDFHETYSPVIKPAPRAWYDKLRITLLDWKFSQSKADSSLFYFRQNQQ
VLFLLVYVDDMLIAAPKQAHWLSLKRVLRYLQGTLHVGLSLQPASSISLIGYSDADWVACPIDRKSVGGYCVFLGPSLISWSSKKQQVVARSSTESEYRALAHVACELA