; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041077 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041077
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr13:11722601..11730143
RNA-Seq ExpressionLag0041077
SyntenyLag0041077
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65540.1 hypothetical protein VITISV_029946 [Vitis vinifera]7.5e-13232.79Show/hide
Query:  SSATDSSSPYFLHHSDTSILILVSDLL--TDDNYVTWSRSMLLALSIRNKLGLIDGTLPK-PTGDLLSAS-NRGNNVVIAWILNSVSKGISSSTMFPDST
        S   D SSPYFLH+ D   L LVS  L  +  NY +W RSM+ AL+ +NKLG IDGT+ +    DLL+   +R N++VI+W+ NSV K I+ S ++ ++T
Subjt:  SSATDSSSPYFLHHSDTSILILVSDLL--TDDNYVTWSRSMLLALSIRNKLGLIDGTLPK-PTGDLLSAS-NRGNNVVIAWILNSVSKGISSSTMFPDST

Query:  QAIWLDLKDQFQ-----------------------------RKNGIWDEYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLM
          IW DL ++F                              R+  +WDE   ++       CNCGG     +  Q E +M  L+GLNESF P R+QILLM
Subjt:  QAIWLDLKDQFQ-----------------------------RKNGIWDEYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLM

Query:  DPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKGGTYPNSSSHSPW
         P P ++K FSL+ QEE QRSL +   PA  T  V+       + +   N+ +SRK RP+CTHC + GHT+D CYK+HGYPP +R +    PN S  +  
Subjt:  DPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKGGTYPNSSSHSPW

Query:  IIDS------------------DALTHICFNK--------------ASFTTLFPVATYVV-LPDNTRISV----NYAGSVVLLGSICLHRVLFVLEFQYN
        + +S                    LTH   N+              ASF    P+   +    D+++  +       G++ LL S     + FV     N
Subjt:  IIDS------------------DALTHICFNK--------------ASFTTLFPVATYVV-LPDNTRISV----NYAGSVVLLGSICLHRVLFVLEFQYN

Query:  LISISALTYD---NSIMVNFSTGYCEIRERSTLKTISKGSLHDG----LFMLDD--SNIALNLVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIE
          +   L +D   N     F   +C+I     + T      HDG    L ++DD   N  ++L+   S  + + P  +                  L I+
Subjt:  LISISALTYD---NSIMVNFSTGYCEIRERSTLKTISKGSLHDG----LFMLDD--SNIALNLVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIE

Query:  PSRCDSFVPCDIC-LLAKQRKLSFFIII-----NLLLHLLIWYTLIFGARLPLLHMLSRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDY
          R D+    ++  L  +   L FF  +     N ++     + L        L+  S +P+ +W +C+LT VYLIN+ PS +L  +TP+ +L    P Y
Subjt:  PSRCDSFVPCDIC-LLAKQRKLSFFIII-----NLLLHLLIWYTLIFGARLPLLHMLSRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDY

Query:  SLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL---DNFSAKTDPPTIPTSIHTNPTSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTSRPTKLST
        S +K F CLC++STL + R KF+P+A+P +F+GYP    D FS K   P +P S   +P+  +  S P+S   S +   P+T SH +T  R+SR ++   
Subjt:  SLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL---DNFSAKTDPPTIPTSIHTNPTSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTSRPTKLST

Query:  YLKDYHCSFLTDSPF--PTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYK
        YL DYHC     +P    + ST YPL   I Y KLSPS+RA S++IST      Y +A+    W+ AM+AEL+A+E+N+TW++ +LPP K   GCKW+Y+
Subjt:  YLKDYHCSFLTDSPF--PTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYK

Query:  INI-------RLMALSNVTKPVWWQRWPLIQLDVNNAFLHGDLIEEVYMDLSLGY----QPNVP------------------------------------
        +         R +     +  + +  W L  LDVNNAFLHGDL EEV+M L  GY    +P +P                                    
Subjt:  INI-------RLMALSNVTKPVWWQRWPLIQLDVNNAFLHGDLIEEVYMDLSLGY----QPNVP------------------------------------

Query:  --------------------------VPSKGERL----------------------------------------------LIKDTGLIGAKPEAVPMDPR
                                  + +   ++                                              L+ +TG +G KP   PM P 
Subjt:  --------------------------VPSKGERL----------------------------------------------LIKDTGLIGAKPEAVPMDPR

Query:  LKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVS-------KPSFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTE
        ++L Q + +LL   S YRRLIG L+YLTI+RPD+ ++ NKLSQ++S       + +F DSDWA+C D+R S  G+CI L DSL+SW+SKK  TV RSS E
Subjt:  LKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVS-------KPSFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTE

Query:  DEYRALAHTTCEL
         EY A+AH TCEL
Subjt:  DEYRALAHTTCEL

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]2.0e-14030.4Show/hide
Query:  LQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP-TGDLLSAS-NRGNNVVIAWILNSVSKGISSSTMFP
        L +  ++  DSSSPY+LH+ D   L LVS+ L   NY TW R+M++AL+ +NKLG ID ++ +P + DLL  S  R N++VI+WILNSV++ I+ S M+ 
Subjt:  LQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP-TGDLLSAS-NRGNNVVIAWILNSVSKGISSSTMFP

Query:  DSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRP--RCTCGRCNCGGHEAMEKLFQF---EYLMSILMGLNESFGP
         + + IW DL ++F   N                              +WDE   Y+P   CTCG        +M + F +   E +M  LMGLN+S+  
Subjt:  DSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRP--RCTCGRCNCGGHEAMEKLFQF---EYLMSILMGLNESFGP

Query:  TRSQILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLA-----VAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQ-
         R+Q+L+++P P I+K F+L+ QEE QRS+      A +  +     V    +        QN+   R  R IC+HC  + HT+D CYKLHGYPP + + 
Subjt:  TRSQILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLA-----VAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQ-

Query:  -----KGGTYPNSSS-------------------------------------------HSP------------------------WIIDSDALTHICFNK
             +G  + + +S                                           H P                        WI+D+ A  HIC + 
Subjt:  -----KGGTYPNSSS-------------------------------------------HSP------------------------WIIDSDALTHICFNK

Query:  ASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLDDSNIA
        + F +   + + VVLP+   I V  AG+V +  ++ L  VL+V  FQ+NL+S+S+LT +++  V+F +  C+I++ S ++ I  G     L++L   +  
Subjt:  ASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLDDSNIA

Query:  LNLVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL------LLHLLIW------------------
        L   +C +     +  LWH ++GH SF  L  LK++L+IE    D    C  C L+KQR+L      N+      LLH+  W                  
Subjt:  LNLVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL------LLHLLIW------------------

Query:  -----YTLI-------------------------------------------FGARLPLLH-----------------------------MLSRVPLHFW
             YT +                                           F A+  + H                               S +PL +W
Subjt:  -----YTLI-------------------------------------------FGARLPLLH-----------------------------MLSRVPLHFW

Query:  SECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYP-------LDNFSAKTDPPTIPTSIHTNPTS
         +CI T VYLINRTPS +L  +TP+ +L+G LP YS +KVF CLC+ASTL ++R KF+P+AI  +F+GYP       L N        +     H N T 
Subjt:  SECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYP-------LDNFSAKTDPPTIPTSIHTNPTS

Query:  PDMNSIPSSETD-----SPSIDI-PNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHK
        P  N+ P S +D     SPS  I P+  +      RTSRP    ++L+DYHC +   +P  T ST +P+H  + Y+KLS S+RA   NIS+      + +
Subjt:  PDMNSIPSSETD-----SPSIDI-PNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHK

Query:  ALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVWWQR---------WPLI
        A+    W++AM  EL+A+E NHTW++VSLP  K  +GC+W+YK              RL+A               S V K V  +          W LI
Subjt:  ALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVWWQR---------WPLI

Query:  QLDVNNAFLHGDLIEEVYMDLSLGYQPNVPVPSKG-----------------------------------------------------------------
        QLDVNNAFLHGDL EEVYM L  G+     +PS+                                                                  
Subjt:  QLDVNNAFLHGDLIEEVYMDLSLGYQPNVPVPSKG-----------------------------------------------------------------

Query:  ---------------------------------------------ERLLIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRP
                                                        L+ + GL+G KP   PM+   KL Q + ++L   +SYRRLIG LLYLTI+RP
Subjt:  ---------------------------------------------ERLLIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRP

Query:  DIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRA
        D+ F  NKLSQYVS P                                  +F D+DW +CLDTR S  GYC+ LG+SL+SW++KKQ TV RSS E EYR+
Subjt:  DIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRA

Query:  LAHTTCELI
        LA +TCE++
Subjt:  LAHTTCELI

RVW81690.1 Copia protein [Vitis vinifera]1.5e-12428.26Show/hide
Query:  IPNPGPGALQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPTGDLL---SASNRGNNVVIAWILNSVSK
        +P+      Q  ++ + DSSSPY+LH SD    +LVS++   +NYV WSRS+++AL+++NK+  IDG++  P+ D L   +A  R NN+V++W++NS+SK
Subjt:  IPNPGPGALQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPTGDLL---SASNRGNNVVIAWILNSVSK

Query:  GISSSTMFPDSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYR--PRCTCGR---CNCGGHEAMEKLFQFEYLMSIL
         I +S +F  S   +W +LK ++ R +G                              WDEY++YR  P CTCG+   C C     ++   Q +Y++  L
Subjt:  GISSSTMFPDSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYR--PRCTCGR---CNCGGHEAMEKLFQFEYLMSIL

Query:  MGLNESFGPTRSQILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPI--CTHCGVQGHTIDHCYKLHGYP
        +GLN+S+   RSQ+LLM P P +SK FSL+ QEE QR    L   A      A    ++ Q + Q  +FK +  +    CTHCG  GHT+D C++LHGYP
Subjt:  MGLNESFGPTRSQILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPI--CTHCGVQGHTIDHCYKLHGYP

Query:  PRYRQKGG-------------------------------------------TYPNSS---------------------------------------SHSP
        P +    G                                             PNSS                                       S S 
Subjt:  PRYRQKGG-------------------------------------------TYPNSS---------------------------------------SHSP

Query:  WIIDSDALTH-ICFNKASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISK
        W++D+ A  H IC      +T  P+   + LP+ T +   + G+V L  S+ LH VL V  F +NL+S+S LT  +++ + F+  +C ++++S  K I  
Subjt:  WIIDSDALTH-ICFNKASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISK

Query:  GSLHDGLFMLDDSNIALNLVVCASVTQKLSP----------SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL-----
            +GL+ L  ++       C S     +P           +WH +LGH+S    H LK I        D    CDIC LAKQR+L F +  +      
Subjt:  GSLHDGLFMLDDSNIALNLVVCASVTQKLSP----------SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL-----

Query:  -LLHLLIW---------------------------YTLIFGARLPLL-----------------------------------------------------
         L+H  IW                           Y +   + +P L                                                     
Subjt:  -LLHLLIW---------------------------YTLIFGARLPLL-----------------------------------------------------

Query:  ------HMLS---------RVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNF
              H+LS          +PL FWS+C+LT  ++INR P+ +L+ +TP+ +L    P+ S ++VF CLCFASTL+++R KF  +A   IF+GYP D  
Subjt:  ------HMLS---------RVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNF

Query:  SAK-TDPPTIPTSIHTN-------------PTSPDMNS----IPSSETDSP-------------SIDIPNTSSHVSTP----------------------
          K  D  T    +  N             P  P+ +S    +  S +DSP             S   PN S  +S P                      
Subjt:  SAK-TDPPTIPTSIHTN-------------PTSPDMNS----IPSSETDSP-------------SIDIPNTSSHVSTP----------------------

Query:  -----RRTSRPTKLSTYLKDYHCSFLTDSP-----------FPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAM
             R+++R  +  +YL+DY+C  ++ S             PT    Y LH ++  ++LS S++A   +I+     + Y +A     W+ AMQ EL A+
Subjt:  -----RRTSRPTKLSTYLKDYHCSFLTDSP-----------FPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAM

Query:  ETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA--------------LSNVTK---------PVWWQRWPLIQLDVNNAFLHGDLIEEVY
        E N TW +V+LP NK  IGCKW++K+             RL+A               S V K             ++W L QLDVNNAFLHGDL EEVY
Subjt:  ETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA--------------LSNVTK---------PVWWQRWPLIQLDVNNAFLHGDLIEEVY

Query:  MDLSLG--------------------------------------------------------------YQPNVPVPSKG----ERL--------------
        M+L  G                                                              Y  +V + S      +RL              
Subjt:  MDLSLG--------------------------------------------------------------YQPNVPVPSKG----ERL--------------

Query:  ---------------------------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKPS----
                                   +++DTGL G+KP A PM+  LKL   + +  +  S YRRLIG LLYLT++RPD+A++   LSQ+++KP+    
Subjt:  ---------------------------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKPS----

Query:  ------------------------------FVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL
                                      F DSDWA C+DTR S  G+ I LG+SL+SW+SKKQ TV RSS E EYRALA TTCE+
Subjt:  ------------------------------FVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.7e-13429.89Show/hide
Query:  SSATDSSSPYFLHHSDTSILILVSDLL--TDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPTGDLLSAS--NRGNNVVIAWILNSVSKGISSSTMFPDST
        S   D SSPYFLH+ D   L LVS  L  +  NY +W RSM+ AL+ +NKLG IDGT+ +P    L AS  +R N++VI+W+ NSV K I+ S ++ ++ 
Subjt:  SSATDSSSPYFLHHSDTSILILVSDLL--TDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPTGDLLSAS--NRGNNVVIAWILNSVSKGISSSTMFPDST

Query:  QAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLM
          IW DL ++F + +G                             +WDE   ++       CNCGG     +  Q E +M  L+GLNESF P ++QILLM
Subjt:  QAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLM

Query:  DPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKGGTYPNSS-----
        +P P ++K FSL+ QEE QRSL     PA  T  V+       + +   N+ +SRK RP+CTHC + GHT+D CYK+HGY P +R +    PN S     
Subjt:  DPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKGGTYPNSS-----

Query:  ------------------SHSP---------------------------------------------------------WIIDSDALTHICFNKASFTTL
                          S SP                                                         WI+DS A  H+C N + F ++
Subjt:  ------------------SHSP---------------------------------------------------------WIIDSDALTHICFNKASFTTL

Query:  FPVAT-YVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLDDS---NIALN
           ++  V LP  T+I +   G++ L   + L  VL++  FQ+NLISISALT  N    +F+  +C I++ S  K I  G     L++LD S   +I+  
Subjt:  FPVAT-YVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLDDS---NIALN

Query:  LVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL------LLHLLIW--------------------
         VV  + +  ++  LWH +L H S   L +LK  L ++ S  ++ + C IC LAKQ++L F    NL      L+H  IW                    
Subjt:  LVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL------LLHLLIW--------------------

Query:  -----------------------YTLI---FG--------------------ARLPLLHML-----------------------------SRVPLHFWSE
                               ++++   FG                     +L +LH                               S +P+ +W +
Subjt:  -----------------------YTLI---FG--------------------ARLPLLHML-----------------------------SRVPLHFWSE

Query:  CILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL-------------------------------
        C+LT VYLINR PS +L  +TP+ +L+   P YS +K F CLC++STL + R KF+P+A+P +F+GYP                                
Subjt:  CILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL-------------------------------

Query:  ---------DNFSAKTDPPTIPTSIHTNPTSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPF--PTYSTKYPLHQYI
                  +F +K   P +P S   +P+  +  S P++   S +   P+T+SH +T  R+SR ++   YL DYHC   + +P    + ST YPL   I
Subjt:  ---------DNFSAKTDPPTIPTSIHTNPTSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPF--PTYSTKYPLHQYI

Query:  LYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA------------
         Y KLSPS+RA S++IST      Y +A+    W+ AM+AEL+A+E+N+TW++ +LPP K  +GCKW+Y++             RL+A            
Subjt:  LYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA------------

Query:  ---------------LSNVTKPVWWQRWPLIQLDVN-----------------NAFLHGDLIEE------------------------VYMD--------
                         ++      +R PL+  ++                   A   G LI E                        VY+D        
Subjt:  ---------------LSNVTKPVWWQRWPLIQLDVN-----------------NAFLHGDLIEE------------------------VYMD--------

Query:  --------------------LSLGYQPNVPVPSKGERLLI----------KDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRP
                             +L Y   + V    + +LI           +TG +G KP   PM P ++L Q + +LL   + YRRLIG L+YLTI+RP
Subjt:  --------------------LSLGYQPNVPVPSKGERLLI----------KDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRP

Query:  DIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRA
        D+ ++ NKLSQ++S+P                                  +F DSDWA+C D+R S  G+CI L DSL+SW+SKKQ TV RSS E EYRA
Subjt:  DIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRA

Query:  LAHTTCEL
        +AH TCEL
Subjt:  LAHTTCEL

RVX06074.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.6e-13531.58Show/hide
Query:  GALQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP-TGDLL-SASNRGNNVVIAWILNSVSKGISSSTM
        G +   +    DSSSPYFLH+ D   L LVS+LLT  NY TW R+ML+AL+ +NK+G +DGT+ +P + DL+  A NR N+++ +WI+N VS+ I+ S +
Subjt:  GALQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP-TGDLL-SASNRGNNVVIAWILNSVSKGISSSTM

Query:  FPDSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRS
        + DS   IW DL D+F + NG                             +WDE   ++P      C+CGG          EY++  LMGLN+S+   R 
Subjt:  FPDSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRS

Query:  QILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAH---PVSKTVQNNQ---QQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPP--RYRQK
        QIL+MDP PA++K FSL+ QEE  R++         + + +H   P++    +N         K+R+ R  C++CG QGH  D CYKL GYPP  +++ K
Subjt:  QILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLAVAH---PVSKTVQNNQ---QQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPP--RYRQK

Query:  GGTYPNSSS-------------------------------------------------------------------HSPWIIDSDALTHICFNKASFTTL
        G   PNSSS                                                                   +  WIIDS A  H+C   + F + 
Subjt:  GGTYPNSSS-------------------------------------------------------------------HSPWIIDSDALTHICFNKASFTTL

Query:  FPVATY-VVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLD-DSNIALNLV
          V    V LP    + ++  GSV+L   + L  VLFV  F+YNL+S+SA T   S+ + F+   C I++ S  K I KGS    L+ LD DS +A    
Subjt:  FPVATY-VVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLD-DSNIALNLV

Query:  VCAS-VTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPC----DICLLAKQRKLSFFIIINLLLHLLIWYTLI-------------------
        V AS +      SLWHS+LGH SF  L  L+ +L  + S      PC    D+ LLA  +   FF+ I      + W  ++                   
Subjt:  VCAS-VTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPC----DICLLAKQRKLSFFIIINLLLHLLIWYTLI-------------------

Query:  -FGARLPLL----------------------------------------HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGS
         FG  +  +                                        H+L         S +P+ +WS+CILT VYLINRTPS  L  +TP+ +L+  
Subjt:  -FGARLPLL----------------------------------------HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGS

Query:  LPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSAK-TDPPTIPTSI--------------HTNP-TSPDMNSIPSSETDSPSIDIPNTS
        L DYS ++VF CLC+ STL ANR+KF+P+A   +F+GYP      K  D  T   SI               TNP +S D++S    +   P I   N  
Subjt:  LPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSAK-TDPPTIPTSI--------------HTNP-TSPDMNSIPSSETDSPSIDIPNTS

Query:  SHVSTPR----------RTSRPTKLS---TYLKDYHCSFLTD-SPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAE
        S    PR           +SRPT++S   +YLKDYHCS +   +   T+ST +P+  ++ Y KLSPSY+  SL++S       + KA     W+ AM  E
Subjt:  SHVSTPR----------RTSRPTKLS---TYLKDYHCSFLTD-SPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAE

Query:  LEAMETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA--------------LSNVTKPVW---------WQRWPLIQLDVNNAFLHGDLI
        LEA+E N TW++VSLP  KH +GCKW+YKI             RL+A               S V K V           + W L QLDVNNAFLHGDL 
Subjt:  LEAMETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA--------------LSNVTKPVW---------WQRWPLIQLDVNNAFLHGDLI

Query:  EEVYMDLSLGY-QPNVPVPSKGERL---------------------------------------------------------------------------
        EEVYM L  GY +    +PS    L                                                                           
Subjt:  EEVYMDLSLGY-QPNVPVPSKGERL---------------------------------------------------------------------------

Query:  -----------------------------------LIKDTGLIGAKPEAVPMDPRLKL-QQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQY
                                           L+ D G +G K  + PM+  +KL     VDL D +S YRRL+G LLYLT++RPDI++   +LSQ+
Subjt:  -----------------------------------LIKDTGLIGAKPEAVPMDPRLKL-QQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQY

Query:  VSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCE
        +S+P                                  ++ DSDWA C D+R S  G+C+ LG+SLVSW+SKKQ  V RSS E EYRA+A+T+ E
Subjt:  VSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCE

TrEMBL top hitse value%identityAlignment
A0A2N9FH27 Uncharacterized protein2.1e-14030.6Show/hide
Query:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD
        SSPY+LH SD S LILV+  LT DNY +W RSM + LSI+NKLG +DG++  P    +  L S  NR N VVI WILN VSK I ++ ++  +   IW  
Subjt:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD

Query:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP
        L+++F + N                             G+W+E + YR  P CTC   C+CG    +   +Q   LM  LMGLNE+F P R QILLMDP 
Subjt:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP

Query:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT
        P I K FSL+ QEE QRS+  + LP V + A+    S + +  Q +  ++ +K +P CTHCG  GHT+D CYKLHGYPP Y+ KG             GT
Subjt:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT

Query:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD
                                                                                              P S S + W+ID+ 
Subjt:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD

Query:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG
        A  H+  + +  +T+   V T V LP+   +SV + G+V+L  S+ L  VL V  F  NLIS+S L + +   + F + YC I+  +  + I  G L  G
Subjt:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG

Query:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L
        L++L    D S+ A          N V   S   + +P  LWH +LGH SF  LH L   +   PS   +   CDIC LAKQ++L F       + N  L
Subjt:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L

Query:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------
        +H  IW            Y L                      PLL                                                      
Subjt:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------

Query:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL--DNF
            H+L         S +P  FW ECILT  Y+INR PS +L+ +TP+ +L    P YS +KVF CL +ASTL ++R+KF  +A+P +FMGYP    + 
Subjt:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL--DNF

Query:  SAKTDPPTIPTSI--------HTNPT-----------------------SPDMNSIPSSETDSP-----SIDIP--------------NTSSHVSTP---
           T  P+ P  +        H  PT                       S DM   P    D P     S D+P              + +  +S P   
Subjt:  SAKTDPPTIPTSI--------HTNPT-----------------------SPDMNSIPSSETDSP-----SIDIP--------------NTSSHVSTP---

Query:  --RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPN
          R+++R T + +YL+DYHC   + +  P  +  YP+ + + Y+ LSP+++A ++ I+     +FYH+A+   HW +AM+ ELEA+E NHTW++ +LP  
Subjt:  --RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPN

Query:  KHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL------SLGYQ
        KH IGCKW+YKI             RL+A               S V K V           + W L QLDVNNAFLHG+L EEV+M L      S G  
Subjt:  KHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL------SLGYQ

Query:  PNVPV-----------------------------------------------------------------------------------------------
        PN+                                                                                                 
Subjt:  PNVPV-----------------------------------------------------------------------------------------------

Query:  -PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------
           KG  L        +++D+G +G+KP  +PM+  LKL +    LL   S+YRRLIG LLYLT++RPDIA++ + LSQ++S P                
Subjt:  -PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------

Query:  ------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL
                           F DSDWA C DTR S  G+CI LGDSL+SW+SKKQT V RSS E EYRA++  TCEL
Subjt:  ------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL

A0A2N9G0F1 Integrase catalytic domain-containing protein1.7e-13730.28Show/hide
Query:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD
        SSPY+LH SD S LILV+  LT DNY +W RSM + LSI+NKLG +DG++  P    +  L S  NR N VVI WILN VSK I ++ ++  +   IW  
Subjt:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD

Query:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP
        L+++F + N                             G+W+E + YR  P CTC   C+CG    +   +Q   LM  LMGLNE+F P R QILLMDP 
Subjt:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP

Query:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT
        P I K FSL+ QEE QRS+  + LP V + A+    S + +  Q +  ++ +K +P CTHCG  GHT+D CYKLHGYPP Y+ KG             GT
Subjt:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT

Query:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD
                                                                                              P S S + W+ID+ 
Subjt:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD

Query:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG
        A  H+  + +  +T+   V T V LP+   +SV + G+V+L  S+ L  VL V  F  NLIS+S L + +   + F + YC I+  +  + I  G L  G
Subjt:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG

Query:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L
        L++L    D S+ A          N V   S   + +P  LWH +LGH SF  LH L   +   PS   +   CDIC LAKQ++L F       + N  L
Subjt:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L

Query:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------
        +H  IW            Y L                      PLL                                                      
Subjt:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------

Query:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSA
            H+L         S +P  FW ECILT  Y+INR PS +L+ +TP+ +L    P YS +KVF CL +ASTL ++R+KF  +A+P +FMGYP      
Subjt:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSA

Query:  K------------------------------TDPPTIPTSIHT-NPTSPDMNSIPSSET------------------------------DS---PSIDIP
        K                              T PP   + + T  P+ P    I  S                                DS   P +DI 
Subjt:  K------------------------------TDPPTIPTSIHT-NPTSPDMNSIPSSET------------------------------DS---PSIDIP

Query:  NTSSHVSTP----RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETN
         T      P    R+++R T + +YL+DYHC   + +  P  +  YP+ + + Y+ LSP+++A ++ I+     +FYH+A+   HW +AM+ ELEA+E N
Subjt:  NTSSHVSTP----RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETN

Query:  HTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL
        HTW++ +LP  KH IGCKW+YKI             RL+A               S V K V           + W L QLDVNNAFLHG+L EEV+M L
Subjt:  HTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL

Query:  ------SLGYQPNVPV------------------------------------------------------------------------------------
              S G  PN+                                                                                      
Subjt:  ------SLGYQPNVPV------------------------------------------------------------------------------------

Query:  ------------PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP-----
                      KG  L        +++D+G +G+KP  +PM+  LKL +    LL   S+YRRLIG LLYLT++RPDIA++ + LSQ++S P     
Subjt:  ------------PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP-----

Query:  -----------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL
                                      F DSDWA C DTR S  G+CI LGDSL+SW+SKKQT V RSS E EYRA++  TCEL
Subjt:  -----------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL

A0A2N9GNR1 Integrase catalytic domain-containing protein2.9e-14531.6Show/hide
Query:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD
        SSPY+LH SD S LILV+  LT DNY +W RSM + LSI+NKLG +DG++  P    +  L S  NR N VVI WILN VSK I ++ ++  +   IW  
Subjt:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD

Query:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP
        L+++F + N                             G+W+E + YR  P CTC   C+CG    +   +Q   LM  LMGLNE+F P R QILLMDP 
Subjt:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP

Query:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT
        P I K FSL+ QEE QRS+  + LP V + A+    S + +  Q +  ++ +K +P CTHCG  GHT+D CYKLHGYPP Y+ KG             GT
Subjt:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT

Query:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD
                                                                                              P S S + W+ID+ 
Subjt:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD

Query:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG
        A  H+  + +  +T+   V T V LP+   +SV + G+V+L  S+ L  VL V  F  NLIS+S L + +   + F + YC I+  +  + I  G L  G
Subjt:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG

Query:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L
        L++L    D S+ A          N V   S   + +P  LWH +LGH SF  LH L   +   PS   +   CDIC LAKQ++L F       + N  L
Subjt:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L

Query:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------
        +H  IW            Y L                      PLL                                                      
Subjt:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------

Query:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSA
            H+L         S +P  FW ECILT  Y+INR PS +L+ +TP+ +L    P YS +KVF CL +ASTL ++R+KF  +A+P +FMGYP   F  
Subjt:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSA

Query:  KTDPPTIPTSIHTNPTSPDMNSIPSSETDS---PSIDIPNTSSHVSTP----RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSY
        K  PP    S    P   D   +P    DS   P +DI  T      P    R+++R T + +YL+DYHC   + +  P  +  YP+ + + Y+ LSP++
Subjt:  KTDPPTIPTSIHTNPTSPDMNSIPSSETDS---PSIDIPNTSSHVSTP----RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSY

Query:  RALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKP
        +A ++ I+     +FYH+A+   HW +AM+ ELEA+E NHTW++ +LP  KH IGCKW+YKI             RL+A               S V K 
Subjt:  RALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKP

Query:  VW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL------SLGYQPNVPV---------------------------------------------
        V           + W L QLDVNNAFLHG+L EEV+M L      S G  PN+                                               
Subjt:  VW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL------SLGYQPNVPV---------------------------------------------

Query:  ---------------------------------------------------PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYL
                                                             KG  L        +++D+G +G+KP  +PM+  LKL +    LL   
Subjt:  ---------------------------------------------------PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYL

Query:  SSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQ
        S+YRRLIG LLYLT++RPDIA++ + LSQ++S P                                   F DSDWA C DTR S  G+CI LGDSL+SW+
Subjt:  SSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQ

Query:  SKKQTTVFRSSTEDEYRALAHTTCEL
        SKKQT V RSS E EYRA++  TCEL
Subjt:  SKKQTTVFRSSTEDEYRALAHTTCEL

A0A2N9GP35 Integrase catalytic domain-containing protein2.9e-14531.6Show/hide
Query:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD
        SSPY+LH SD S LILV+  LT DNY +W RSM + LSI+NKLG +DG++  P    +  L S  NR N VVI WILN VSK I ++ ++  +   IW  
Subjt:  SSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP----TGDLLSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLD

Query:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP
        L+++F + N                             G+W+E + YR  P CTC   C+CG    +   +Q   LM  LMGLNE+F P R QILLMDP 
Subjt:  LKDQFQRKN-----------------------------GIWDEYVTYR--PRCTC-GRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPP

Query:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT
        P I K FSL+ QEE QRS+  + LP V + A+    S + +  Q +  ++ +K +P CTHCG  GHT+D CYKLHGYPP Y+ KG             GT
Subjt:  PAISKAFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKG-------------GT

Query:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD
                                                                                              P S S + W+ID+ 
Subjt:  -------------------------------------------------------------------------------------YPNSSSHSPWIIDSD

Query:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG
        A  H+  + +  +T+   V T V LP+   +SV + G+V+L  S+ L  VL V  F  NLIS+S L + +   + F + YC I+  +  + I  G L  G
Subjt:  ALTHICFNKASFTTLFP-VATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG

Query:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L
        L++L    D S+ A          N V   S   + +P  LWH +LGH SF  LH L   +   PS   +   CDIC LAKQ++L F       + N  L
Subjt:  LFML----DDSNIA---------LNLVVCASVTQKLSP-SLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF-----FIIINL-L

Query:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------
        +H  IW            Y L                      PLL                                                      
Subjt:  LHLLIW------------YTLIF----------------GARLPLL------------------------------------------------------

Query:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSA
            H+L         S +P  FW ECILT  Y+INR PS +L+ +TP+ +L    P YS +KVF CL +ASTL ++R+KF  +A+P +FMGYP   F  
Subjt:  ----HML---------SRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSA

Query:  KTDPPTIPTSIHTNPTSPDMNSIPSSETDS---PSIDIPNTSSHVSTP----RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSY
        K  PP    S    P   D   +P    DS   P +DI  T      P    R+++R T + +YL+DYHC   + +  P  +  YP+ + + Y+ LSP++
Subjt:  KTDPPTIPTSIHTNPTSPDMNSIPSSETDS---PSIDIPNTSSHVSTP----RRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSY

Query:  RALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKP
        +A ++ I+     +FYH+A+   HW +AM+ ELEA+E NHTW++ +LP  KH IGCKW+YKI             RL+A               S V K 
Subjt:  RALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKP

Query:  VW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL------SLGYQPNVPV---------------------------------------------
        V           + W L QLDVNNAFLHG+L EEV+M L      S G  PN+                                               
Subjt:  VW---------WQRWPLIQLDVNNAFLHGDLIEEVYMDL------SLGYQPNVPV---------------------------------------------

Query:  ---------------------------------------------------PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYL
                                                             KG  L        +++D+G +G+KP  +PM+  LKL +    LL   
Subjt:  ---------------------------------------------------PSKGERL--------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYL

Query:  SSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQ
        S+YRRLIG LLYLT++RPDIA++ + LSQ++S P                                   F DSDWA C DTR S  G+CI LGDSL+SW+
Subjt:  SSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQ

Query:  SKKQTTVFRSSTEDEYRALAHTTCEL
        SKKQT V RSS E EYRA++  TCEL
Subjt:  SKKQTTVFRSSTEDEYRALAHTTCEL

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 89.6e-14130.4Show/hide
Query:  LQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP-TGDLLSAS-NRGNNVVIAWILNSVSKGISSSTMFP
        L +  ++  DSSSPY+LH+ D   L LVS+ L   NY TW R+M++AL+ +NKLG ID ++ +P + DLL  S  R N++VI+WILNSV++ I+ S M+ 
Subjt:  LQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKP-TGDLLSAS-NRGNNVVIAWILNSVSKGISSSTMFP

Query:  DSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRP--RCTCGRCNCGGHEAMEKLFQF---EYLMSILMGLNESFGP
         + + IW DL ++F   N                              +WDE   Y+P   CTCG        +M + F +   E +M  LMGLN+S+  
Subjt:  DSTQAIWLDLKDQFQRKNG-----------------------------IWDEYVTYRP--RCTCGRCNCGGHEAMEKLFQF---EYLMSILMGLNESFGP

Query:  TRSQILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLA-----VAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQ-
         R+Q+L+++P P I+K F+L+ QEE QRS+      A +  +     V    +        QN+   R  R IC+HC  + HT+D CYKLHGYPP + + 
Subjt:  TRSQILLMDPPPAISKAFSLIAQEELQRSLPLLPLPAVITLA-----VAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQ-

Query:  -----KGGTYPNSSS-------------------------------------------HSP------------------------WIIDSDALTHICFNK
             +G  + + +S                                           H P                        WI+D+ A  HIC + 
Subjt:  -----KGGTYPNSSS-------------------------------------------HSP------------------------WIIDSDALTHICFNK

Query:  ASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLDDSNIA
        + F +   + + VVLP+   I V  AG+V +  ++ L  VL+V  FQ+NL+S+S+LT +++  V+F +  C+I++ S ++ I  G     L++L   +  
Subjt:  ASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLDDSNIA

Query:  LNLVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL------LLHLLIW------------------
        L   +C +     +  LWH ++GH SF  L  LK++L+IE    D    C  C L+KQR+L      N+      LLH+  W                  
Subjt:  LNLVVCASVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINL------LLHLLIW------------------

Query:  -----YTLI-------------------------------------------FGARLPLLH-----------------------------MLSRVPLHFW
             YT +                                           F A+  + H                               S +PL +W
Subjt:  -----YTLI-------------------------------------------FGARLPLLH-----------------------------MLSRVPLHFW

Query:  SECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYP-------LDNFSAKTDPPTIPTSIHTNPTS
         +CI T VYLINRTPS +L  +TP+ +L+G LP YS +KVF CLC+ASTL ++R KF+P+AI  +F+GYP       L N        +     H N T 
Subjt:  SECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYP-------LDNFSAKTDPPTIPTSIHTNPTS

Query:  PDMNSIPSSETD-----SPSIDI-PNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHK
        P  N+ P S +D     SPS  I P+  +      RTSRP    ++L+DYHC +   +P  T ST +P+H  + Y+KLS S+RA   NIS+      + +
Subjt:  PDMNSIPSSETD-----SPSIDI-PNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHK

Query:  ALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVWWQR---------WPLI
        A+    W++AM  EL+A+E NHTW++VSLP  K  +GC+W+YK              RL+A               S V K V  +          W LI
Subjt:  ALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI-----------RLMA--------------LSNVTKPVWWQR---------WPLI

Query:  QLDVNNAFLHGDLIEEVYMDLSLGYQPNVPVPSKG-----------------------------------------------------------------
        QLDVNNAFLHGDL EEVYM L  G+     +PS+                                                                  
Subjt:  QLDVNNAFLHGDLIEEVYMDLSLGYQPNVPVPSKG-----------------------------------------------------------------

Query:  ---------------------------------------------ERLLIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRP
                                                        L+ + GL+G KP   PM+   KL Q + ++L   +SYRRLIG LLYLTI+RP
Subjt:  ---------------------------------------------ERLLIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRP

Query:  DIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRA
        D+ F  NKLSQYVS P                                  +F D+DW +CLDTR S  GYC+ LG+SL+SW++KKQ TV RSS E EYR+
Subjt:  DIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRA

Query:  LAHTTCELI
        LA +TCE++
Subjt:  LAHTTCELI

SwissProt top hitse value%identityAlignment
P0CV72 Secreted RxLR effector protein 1612.2e-0934.13Show/hide
Query:  YRRLIGWLLYL-TISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQS
        Y   +G ++YL  ++RPD+A     LSQ+ S P                                   + D+DWA  +++R ST GY   L    VSW+S
Subjt:  YRRLIGWLLYL-TISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQS

Query:  KKQTTVFRSSTEDEYRALAHTTCELI
        KKQ TV  SSTEDEY AL+  T E +
Subjt:  KKQTTVFRSSTEDEYRALAHTTCELI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-1821.11Show/hide
Query:  LSRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSAKTDPPTIPTSIHTNP
        ++++P  FW E + T  YLINR+PS  L ++ P  V       YS +KVF C  FA      R+K   ++IP IF+GY  + F  +   P     I +  
Subjt:  LSRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSAKTDPPTIPTSIHTNP

Query:  ---------TSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTS------------------------RPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQ
                 T+ DM+    +      + IP+TS++ ++   T+                           +  T  ++ H            S +YP  +
Subjt:  ---------TSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTS------------------------RPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQ

Query:  YILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKIN--------------------------
        Y+L   +S      SL     +  +        +   KAMQ E+E+++ N T+ +V LP  K  + CKW++K+                           
Subjt:  YILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKIN--------------------------

Query:  --------IRLMALSNVTKPVWWQRWPLIQLDVNNAFLHGDLIEEVYMDLSLGYQ--------------------------------------------P
                +++ ++  +          + QLDV  AFLHGDL EE+YM+   G++                                            P
Subjt:  --------IRLMALSNVTKPVWWQRWPLIQLDVNNAFLHGDLIEEVYMDLSLGYQ--------------------------------------------P

Query:  NVPVPSKGERLLI-------------KDTGLI------------------------------------------------------GAKPEAVPMDPRLK
         V      E   I             KD GLI                                                       AKP + P+   LK
Subjt:  NVPVPSKGERLLI-------------KDTGLI------------------------------------------------------GAKPEAVPMDPRLK

Query:  L----------QQFNVDLLDYLSSYRRLIGWLLY-LTISRPDIAFTDNKLSQYVSKP---------------------------------SFVDSDWASC
        L          ++ N+  + Y S+    +G L+Y +  +RPDIA     +S+++  P                                  + D+D A  
Subjt:  L----------QQFNVDLLDYLSSYRRLIGWLLY-LTISRPDIAFTDNKLSQYVSKP---------------------------------SFVDSDWASC

Query:  LDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCELI
        +D R S+ GY        +SWQSK Q  V  S+TE EY A   T  E+I
Subjt:  LDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCELI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.2e-0425.76Show/hide
Query:  SSSHSPWIIDSDALTHICFNKASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSIC----------LHRVLFVLEFQYNLISISALTYDNSIMVNFSTG
        S   S W++D+ A  H    +  F          V   NT  S      +  +G IC          L  V  V + + NLIS  AL  D      + + 
Subjt:  SSSHSPWIIDSDALTHICFNKASFTTLFPVATYVVLPDNTRISVNYAGSVVLLGSIC----------LHRVLFVLEFQYNLISISALTYDNSIMVNFSTG

Query:  YCEIRERSTLKTISKGSLHDGLFMLDDSNIALNLVVCA----SVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF
        +   + R     ++KGSL     +   +    N  +C     +   ++S  LWH ++GH+S   L IL     I  ++  +  PCD CL  KQ ++SF
Subjt:  YCEIRERSTLKTISKGSLHDGLFMLDDSNIALNLVVCA----SVTQKLSPSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSF

P92519 Uncharacterized mitochondrial protein AtMg008102.6e-1834.38Show/hide
Query:  LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP--------------------------------
        ++ + G++  KP + P+  +L          D  S +R ++G L YLT++RPDI++  N + Q + +P                                
Subjt:  LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP--------------------------------

Query:  --SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL
          +F DSDWA C  TR ST G+C  LG +++SW +K+Q TV RSSTE EYRALA T  EL
Subjt:  --SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1735Show/hide
Query:  LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP--------------------------------
        L+  T +I AKP   PM P  KL  ++   L   + YR ++G L YL  +RPDI++  N+LSQ++  P                                
Subjt:  LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP--------------------------------

Query:  --SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL
          ++ D+DWA   D   ST GY + LG   +SW SKKQ  V RSSTE EYR++A+T+ E+
Subjt:  --SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.8e-0732.47Show/hide
Query:  VPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL
        +P  +W       VYLINR P+ +L+ ++P+  L G+ P+Y  ++VF C C+      N+ K   ++   +F+GY L
Subjt:  VPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.8e-4321.4Show/hide
Query:  LTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPTGDL-----------LSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLDLKDQFQRKNGIWD
        LT  NY+ WSR +          G +DG+ P P   +            +   R + ++ + IL ++S  +  +     +   IW  L+  +   N  + 
Subjt:  LTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPTGDL-----------LSASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLDLKDQFQRKNGIWD

Query:  EYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPPPAISKAFS-LIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQN--
             R      +    G    + +   E +  +L  L + + P   QI   D PP++++    LI +E    +L    +  +    V H  + T +N  
Subjt:  EYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPPPAISKAFS-LIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQN--

Query:  ------NQQQNNFKSRKVRPI-----------------CTHCGVQGHTIDHCYKLHGYPPRYRQKGGTYP--------NSSSHSP-----WIIDSDALTH
              N   NN +S   +P                  C  C VQGH+   C +LH +     Q+  T P        N + +SP     W++DS A  H
Subjt:  ------NQQQNNFKSRKVRPI-----------------CTHCGVQGHTIDHCYKLHGYPPRYRQKGGTYP--------NSSSHSP-----WIIDSDALTH

Query:  IC--FNKASFTTLFPVATYVVLPDNTRISVNYAGSVVL---LGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG
        I   FN  SF   +     V++ D + I + + GS  L     S+ L++VL+V     NLIS+  L   N + V F     ++++ +T   + +G   D 
Subjt:  IC--FNKASFTTLFPVATYVVLPDNTRISVNYAGSVVL---LGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDG

Query:  LFMLDDSNIALNLVVC--ASVTQKLSPSLWHSKLGHLSFPCLHILKDILH------IEPSRCDSFVPCDICLLAKQRKLSF-------------------
        L+   +  IA +  V   AS   K + S WHS+LGH   P L IL  ++       + PS     + C  C + K  K+ F                   
Subjt:  LFMLDDSNIALNLVVC--ASVTQKLSPSLWHSKLGHLSFPCLHILKDILH------IEPSRCDSFVPCDICLLAKQRKLSF-------------------

Query:  --------------------------------------FIIINLLL----------------------------HLLIWYTL-----------------I
                                              FII   L+                            H +  +T                  I
Subjt:  --------------------------------------FIIINLLL----------------------------HLLIWYTL-----------------I

Query:  FGARLPLLHMLSRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL----------
            L LL   S VP  +W       VYLINR P+ +L+ Q+P+  L G  P+Y  +KVF C C+      NR K   ++    FMGY L          
Subjt:  FGARLPLLHMLSRVPLHFWSECILTVVYLINRTPSHVLKWQTPYVVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPL----------

Query:  ----------------------DNFSAKTD--------------------------PPTIPTSIHTNPTSP-----------DMNSIPSSETDSPSIDIP
                               NF   T                           PP +   + T+P  P             +++PSS   SPS   P
Subjt:  ----------------------DNFSAKTD--------------------------PPTIPTSIHTNPTSP-----------DMNSIPSSETDSPSIDIP

Query:  NTSSH-----VSTPRRTSRPTKLSTYLKD-----------YHCSFLTDSP-----FPTYST--------------------------------KYPLHQY
           SH      + P +T      S  L +              S L  SP      PT ST                                + P++ +
Subjt:  NTSSH-----VSTPRRTSRPTKLSTYLKD-----------YHCSFLTDSP-----FPTYST--------------------------------KYPLHQY

Query:  ILYTKLSPSYR------ALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVV-SLPPNKHFIGCKWIY-----------KINIRLMALSNV
         + T+     R      + + +++ +   +   +A+  D W++AM +E+ A   NHTW++V   PP+   +GC+WI+           +   RL+A    
Subjt:  ILYTKLSPSYR------ALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVV-SLPPNKHFIGCKWIY-----------KINIRLMALSNV

Query:  TKP-----------------------VWWQRWPLIQLDVNNAFLHGDLIEEVYMD---------------------------------------LSLGYQ
         +P                          + WP+ QLDVNNAFL G L +EVYM                                        L++G+ 
Subjt:  TKP-----------------------VWWQRWPLIQLDVNNAFLHGDLIEEVYMD---------------------------------------LSLGYQ

Query:  PNV------------------------------------PVPSKGERLLIKD---------------------------------TGLIGAKPEAVPMDP
         ++                                     + +  +R  +K+                                 T ++ AKP A PM  
Subjt:  PNV------------------------------------PVPSKGERLLIKD---------------------------------TGLIGAKPEAVPMDP

Query:  RLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCST
          KL   +   L   + YR ++G L YL  +RPD+++  N+LSQY+  P                                  ++ D+DWA   D   ST
Subjt:  RLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP----------------------------------SFVDSDWASCLDTRCST

Query:  IGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL
         GY + LG   +SW SKKQ  V RSSTE EYR++A+T+ EL
Subjt:  IGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).6.8e-2229.82Show/hide
Query:  ALSSATDSSSPYFL-----HHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPT--GDLLSASNRGNNVVIAWILNSVSKGISSSTM
        ++S  +D  SPY+L     H SD SI  L  D   +DNYV W       L +  K G IDGTLPKP     L     + N +V+ W++NS++  +  S M
Subjt:  ALSSATDSSSPYFL-----HHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPT--GDLLSASNRGNNVVIAWILNSVSKGISSSTM

Query:  FPDSTQAIWLDLK-----------------------------DQFQRKNGIWDEYVTYR--PRCTCGRCNC----GGHEAMEKLFQFEYLMSILMGLNES
        + ++   +W DL+                             + F + + +W E   Y   P C CG CNC       EA EK  ++E+LM   + LN+ 
Subjt:  FPDSTQAIWLDLK-----------------------------DQFQRKNGIWDEYVTYR--PRCTCGRCNC----GGHEAMEKLFQFEYLMSILMGLNES

Query:  FGPTRSQILLMDPPPAISKAFSLIAQEE
        F    ++I+   PPP++ +AF+++   E
Subjt:  FGPTRSQILLMDPPPAISKAFSLIAQEE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-3828.6Show/hide
Query:  TSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPF
        +S  ++ +PS+   +   D+P  S H S  RRT +P     YL+DY+C  +         T + + Q++ Y K+SP Y +  + I+       Y++A  F
Subjt:  TSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTSRPTKLSTYLKDYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPF

Query:  DHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA--------------LSNVTKPVWWQ---------RWPLIQLDV
          W  AM  E+ AMET HTW + +LPPNK  IGCKW+YKI             RL+A               S V K    +          + L QLD+
Subjt:  DHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKI-----------NIRLMA--------------LSNVTKPVWWQ---------RWPLIQLDV

Query:  NNAFLHGDLIEEVYMDLSLGY--------QPNVP------------------------------VPSKGERL----------------------------
        +NAFL+GDL EE+YM L  GY         PN                                V S  +                              
Subjt:  NNAFLHGDLIEEVYMDLSLGY--------QPNVP------------------------------VPSKGERL----------------------------

Query:  ----------------------------------------------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDI
                                                      L+ +TGL+G KP +VPMDP +     +        +YRRLIG L+YL I+R DI
Subjt:  ----------------------------------------------LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDI

Query:  AFTDNKLSQYVSKPS----------------------------------FVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALA
        +F  NKLSQ+   P                                   F D+ + SC DTR ST GYC+ LG SL+SW+SKKQ  V +SS E EYRAL+
Subjt:  AFTDNKLSQYVSKPS----------------------------------FVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALA

Query:  HTTCELI
          T E++
Subjt:  HTTCELI

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.3e-0432.93Show/hide
Query:  LYLTISRPDIAFTDNKLSQYVSK----------------------------------PSFVDSDWASCLDTRCSTIGYCIIL
        +YLTI+RPD+ F  N+LSQ+ S                                    +F DSDWASC DTR S  G+C ++
Subjt:  LYLTISRPDIAFTDNKLSQYVSK----------------------------------PSFVDSDWASCLDTRCSTIGYCIIL

ATMG00810.1 DNA/RNA polymerases superfamily protein1.8e-1934.38Show/hide
Query:  LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP--------------------------------
        ++ + G++  KP + P+  +L          D  S +R ++G L YLT++RPDI++  N + Q + +P                                
Subjt:  LIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQYVSKP--------------------------------

Query:  --SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL
          +F DSDWA C  TR ST G+C  LG +++SW +K+Q TV RSSTE EYRALA T  EL
Subjt:  --SFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCEL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.0e-0635.71Show/hide
Query:  KLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI
        KL+P Y +L++  +     +    AL    W +AMQ EL+A+  N TW +V  P N++ +GCKW++K  +
Subjt:  KLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGAACTGAGATCACATCCGAATGAGAGGGATCCTGAGGACATGAAAGCATGGTCGCAGGGAAATGTCAGGACCGTTAAGGGTCGAATCCGGATTCCGAATCCTGG
GCCTGGGGCGTTACAGATGGCTCTCTCTTCTGCCACTGATTCTTCGAGCCCTTACTTTCTTCACCATTCTGACACCTCCATTCTTATTCTTGTTTCTGATTTGCTCACTG
ATGATAATTATGTCACTTGGAGTCGATCTATGTTGTTGGCGCTTTCAATTCGGAACAAATTGGGATTGATTGATGGCACATTGCCTAAACCTACTGGAGATCTTCTCTCC
GCATCGAATCGGGGCAATAATGTCGTTATTGCTTGGATTTTGAACTCTGTATCCAAAGGTATTTCTTCTAGCACAATGTTTCCTGATTCCACACAGGCGATTTGGCTTGA
TCTCAAAGATCAATTTCAACGCAAGAATGGCATTTGGGATGAATATGTTACATATCGCCCTAGATGCACTTGCGGGCGATGCAATTGTGGAGGGCATGAAGCCATGGAGA
AATTATTTCAATTCGAGTACCTGATGAGCATCTTGATGGGATTGAATGAATCCTTCGGTCCTACACGTTCTCAGATATTGCTCATGGATCCTCCACCAGCGATTTCGAAG
GCTTTTTCTCTGATTGCTCAAGAAGAGCTTCAACGATCTCTTCCTCTACTTCCTTTGCCAGCTGTAATTACCCTCGCCGTTGCTCATCCTGTTTCAAAGACTGTTCAGAA
CAATCAGCAACAAAATAATTTCAAATCTCGCAAAGTGCGCCCAATTTGCACTCACTGTGGAGTGCAAGGTCATACAATTGATCATTGCTACAAACTTCATGGCTATCCTC
CTAGATATCGCCAAAAGGGAGGTACCTATCCGAATTCCTCTTCTCATAGCCCTTGGATAATCGATTCGGACGCATTAACTCATATTTGTTTTAACAAAGCATCATTTACT
ACTTTATTTCCTGTTGCAACTTATGTGGTTCTACCAGATAATACACGTATCTCTGTGAATTATGCTGGTTCTGTGGTTCTACTTGGTTCGATATGTCTTCATCGAGTTTT
GTTTGTACTTGAATTTCAGTACAACTTGATCTCCATCAGTGCTTTGACCTATGATAATTCCATTATGGTCAATTTCTCTACTGGTTATTGTGAAATTCGGGAAAGATCCA
CTTTGAAGACGATTAGCAAGGGTAGTTTACATGATGGTCTTTTTATGCTCGATGACAGTAACATTGCCCTTAATTTAGTTGTTTGTGCGTCTGTTACACAGAAGTTATCA
CCCAGTTTGTGGCATTCCAAGCTTGGTCATCTCTCATTTCCTTGTTTACACATTTTGAAAGATATTTTGCATATTGAACCATCTCGATGTGATTCCTTCGTTCCTTGTGA
TATTTGTCTTTTGGCTAAACAGAGAAAACTATCTTTTTTTATAATAATAAACTTGCTTCTGCACCTTTTGATTTGGTACACGCTGATATTTGGGGCCCGTTTGCCACTCC
TTCATATGCTGTCTCGTGTTCCTTTACATTTCTGGAGTGAATGCATATTGACTGTTGTATATTTGATTAATCGGACTCCATCACATGTTTTGAAATGGCAAACTCCTTAT
GTTGTCTTGAATGGATCTTTGCCTGATTATTCGTTGATGAAAGTCTTTAGATGTCTCTGCTTTGCATCCACTCTATCTGCTAATCGGTCTAAGTTTGCTCCTCAAGCTAT
ACCTGCTATTTTTATGGGATATCCGCTTGACAATTTTTCCGCTAAGACTGATCCCCCAACCATCCCTACGTCTATACATACCAATCCTACCTCCCCAGACATGAATTCTA
TACCTTCTTCCGAAACTGATTCCCCATCCATTGATATTCCAAACACATCCTCTCATGTCTCAACACCTAGGCGCACCTCGAGGCCAACTAAATTGTCTACTTATCTCAAA
GATTATCATTGTTCCTTCCTTACCGATTCCCCTTTTCCGACTTACTCCACCAAATATCCTTTACACCAATATATTTTGTATACCAAACTTTCCCCCTCTTACCGAGCCTT
GTCTCTTAATATTTCTACCCATTATAACCACCAATTTTACCACAAAGCCCTACCTTTTGATCATTGGAAAAAGGCTATGCAAGCTGAGTTAGAGGCTATGGAAACCAATC
ATACTTGGAATGTTGTTTCTCTTCCTCCAAACAAACATTTTATTGGATGCAAATGGATTTATAAGATAAACATAAGGCTGATGGCTCTATCGAACGTTACAAAGCCCGTT
TGGTGGCAAAGGTGGCCTTTGATACAACTTGATGTTAATAATGCTTTCCTCCATGGTGATTTGATTGAGGAAGTCTATATGGATTTATCTTTGGGATATCAACCCAATGT
CCCTGTTCCTAGTAAGGGGGAGCGTCTTCTCATTAAGGATACAGGTCTTATTGGTGCTAAACCTGAAGCAGTTCCTATGGATCCTCGTTTAAAGTTGCAACAATTTAATG
TTGATTTACTTGATTACCTTTCTTCTTATAGAAGGCTTATTGGATGGTTGCTTTATTTGACTATTTCTAGACCTGATATTGCTTTCACGGATAATAAACTCAGTCAATAT
GTTTCAAAGCCATCCTTTGTTGATTCTGATTGGGCTTCTTGCCTTGACACTCGGTGCTCTACCATTGGTTATTGCATAATCTTGGGTGATTCATTGGTTTCATGGCAATC
AAAGAAACAAACTACTGTTTTTAGATCTTCTACAGAGGATGAGTATCGCGCATTGGCTCATACTACTTGTGAACTTATATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGAACTGAGATCACATCCGAATGAGAGGGATCCTGAGGACATGAAAGCATGGTCGCAGGGAAATGTCAGGACCGTTAAGGGTCGAATCCGGATTCCGAATCCTGG
GCCTGGGGCGTTACAGATGGCTCTCTCTTCTGCCACTGATTCTTCGAGCCCTTACTTTCTTCACCATTCTGACACCTCCATTCTTATTCTTGTTTCTGATTTGCTCACTG
ATGATAATTATGTCACTTGGAGTCGATCTATGTTGTTGGCGCTTTCAATTCGGAACAAATTGGGATTGATTGATGGCACATTGCCTAAACCTACTGGAGATCTTCTCTCC
GCATCGAATCGGGGCAATAATGTCGTTATTGCTTGGATTTTGAACTCTGTATCCAAAGGTATTTCTTCTAGCACAATGTTTCCTGATTCCACACAGGCGATTTGGCTTGA
TCTCAAAGATCAATTTCAACGCAAGAATGGCATTTGGGATGAATATGTTACATATCGCCCTAGATGCACTTGCGGGCGATGCAATTGTGGAGGGCATGAAGCCATGGAGA
AATTATTTCAATTCGAGTACCTGATGAGCATCTTGATGGGATTGAATGAATCCTTCGGTCCTACACGTTCTCAGATATTGCTCATGGATCCTCCACCAGCGATTTCGAAG
GCTTTTTCTCTGATTGCTCAAGAAGAGCTTCAACGATCTCTTCCTCTACTTCCTTTGCCAGCTGTAATTACCCTCGCCGTTGCTCATCCTGTTTCAAAGACTGTTCAGAA
CAATCAGCAACAAAATAATTTCAAATCTCGCAAAGTGCGCCCAATTTGCACTCACTGTGGAGTGCAAGGTCATACAATTGATCATTGCTACAAACTTCATGGCTATCCTC
CTAGATATCGCCAAAAGGGAGGTACCTATCCGAATTCCTCTTCTCATAGCCCTTGGATAATCGATTCGGACGCATTAACTCATATTTGTTTTAACAAAGCATCATTTACT
ACTTTATTTCCTGTTGCAACTTATGTGGTTCTACCAGATAATACACGTATCTCTGTGAATTATGCTGGTTCTGTGGTTCTACTTGGTTCGATATGTCTTCATCGAGTTTT
GTTTGTACTTGAATTTCAGTACAACTTGATCTCCATCAGTGCTTTGACCTATGATAATTCCATTATGGTCAATTTCTCTACTGGTTATTGTGAAATTCGGGAAAGATCCA
CTTTGAAGACGATTAGCAAGGGTAGTTTACATGATGGTCTTTTTATGCTCGATGACAGTAACATTGCCCTTAATTTAGTTGTTTGTGCGTCTGTTACACAGAAGTTATCA
CCCAGTTTGTGGCATTCCAAGCTTGGTCATCTCTCATTTCCTTGTTTACACATTTTGAAAGATATTTTGCATATTGAACCATCTCGATGTGATTCCTTCGTTCCTTGTGA
TATTTGTCTTTTGGCTAAACAGAGAAAACTATCTTTTTTTATAATAATAAACTTGCTTCTGCACCTTTTGATTTGGTACACGCTGATATTTGGGGCCCGTTTGCCACTCC
TTCATATGCTGTCTCGTGTTCCTTTACATTTCTGGAGTGAATGCATATTGACTGTTGTATATTTGATTAATCGGACTCCATCACATGTTTTGAAATGGCAAACTCCTTAT
GTTGTCTTGAATGGATCTTTGCCTGATTATTCGTTGATGAAAGTCTTTAGATGTCTCTGCTTTGCATCCACTCTATCTGCTAATCGGTCTAAGTTTGCTCCTCAAGCTAT
ACCTGCTATTTTTATGGGATATCCGCTTGACAATTTTTCCGCTAAGACTGATCCCCCAACCATCCCTACGTCTATACATACCAATCCTACCTCCCCAGACATGAATTCTA
TACCTTCTTCCGAAACTGATTCCCCATCCATTGATATTCCAAACACATCCTCTCATGTCTCAACACCTAGGCGCACCTCGAGGCCAACTAAATTGTCTACTTATCTCAAA
GATTATCATTGTTCCTTCCTTACCGATTCCCCTTTTCCGACTTACTCCACCAAATATCCTTTACACCAATATATTTTGTATACCAAACTTTCCCCCTCTTACCGAGCCTT
GTCTCTTAATATTTCTACCCATTATAACCACCAATTTTACCACAAAGCCCTACCTTTTGATCATTGGAAAAAGGCTATGCAAGCTGAGTTAGAGGCTATGGAAACCAATC
ATACTTGGAATGTTGTTTCTCTTCCTCCAAACAAACATTTTATTGGATGCAAATGGATTTATAAGATAAACATAAGGCTGATGGCTCTATCGAACGTTACAAAGCCCGTT
TGGTGGCAAAGGTGGCCTTTGATACAACTTGATGTTAATAATGCTTTCCTCCATGGTGATTTGATTGAGGAAGTCTATATGGATTTATCTTTGGGATATCAACCCAATGT
CCCTGTTCCTAGTAAGGGGGAGCGTCTTCTCATTAAGGATACAGGTCTTATTGGTGCTAAACCTGAAGCAGTTCCTATGGATCCTCGTTTAAAGTTGCAACAATTTAATG
TTGATTTACTTGATTACCTTTCTTCTTATAGAAGGCTTATTGGATGGTTGCTTTATTTGACTATTTCTAGACCTGATATTGCTTTCACGGATAATAAACTCAGTCAATAT
GTTTCAAAGCCATCCTTTGTTGATTCTGATTGGGCTTCTTGCCTTGACACTCGGTGCTCTACCATTGGTTATTGCATAATCTTGGGTGATTCATTGGTTTCATGGCAATC
AAAGAAACAAACTACTGTTTTTAGATCTTCTACAGAGGATGAGTATCGCGCATTGGCTCATACTACTTGTGAACTTATATGA
Protein sequenceShow/hide protein sequence
MNELRSHPNERDPEDMKAWSQGNVRTVKGRIRIPNPGPGALQMALSSATDSSSPYFLHHSDTSILILVSDLLTDDNYVTWSRSMLLALSIRNKLGLIDGTLPKPTGDLLS
ASNRGNNVVIAWILNSVSKGISSSTMFPDSTQAIWLDLKDQFQRKNGIWDEYVTYRPRCTCGRCNCGGHEAMEKLFQFEYLMSILMGLNESFGPTRSQILLMDPPPAISK
AFSLIAQEELQRSLPLLPLPAVITLAVAHPVSKTVQNNQQQNNFKSRKVRPICTHCGVQGHTIDHCYKLHGYPPRYRQKGGTYPNSSSHSPWIIDSDALTHICFNKASFT
TLFPVATYVVLPDNTRISVNYAGSVVLLGSICLHRVLFVLEFQYNLISISALTYDNSIMVNFSTGYCEIRERSTLKTISKGSLHDGLFMLDDSNIALNLVVCASVTQKLS
PSLWHSKLGHLSFPCLHILKDILHIEPSRCDSFVPCDICLLAKQRKLSFFIIINLLLHLLIWYTLIFGARLPLLHMLSRVPLHFWSECILTVVYLINRTPSHVLKWQTPY
VVLNGSLPDYSLMKVFRCLCFASTLSANRSKFAPQAIPAIFMGYPLDNFSAKTDPPTIPTSIHTNPTSPDMNSIPSSETDSPSIDIPNTSSHVSTPRRTSRPTKLSTYLK
DYHCSFLTDSPFPTYSTKYPLHQYILYTKLSPSYRALSLNISTHYNHQFYHKALPFDHWKKAMQAELEAMETNHTWNVVSLPPNKHFIGCKWIYKINIRLMALSNVTKPV
WWQRWPLIQLDVNNAFLHGDLIEEVYMDLSLGYQPNVPVPSKGERLLIKDTGLIGAKPEAVPMDPRLKLQQFNVDLLDYLSSYRRLIGWLLYLTISRPDIAFTDNKLSQY
VSKPSFVDSDWASCLDTRCSTIGYCIILGDSLVSWQSKKQTTVFRSSTEDEYRALAHTTCELI