; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031835 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031835
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr11:15885083..15887283
RNA-Seq ExpressionLag0031835
SyntenyLag0031835
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]2.3e-12540.65Show/hide
Query:  TIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEVATQVM
        ++KLDR N+ LWK+L LP++R  KL+G++LGT+ CP E IT                    SS +S   N  +  W   DQ LLGW+ NSMT+E+ATQ++
Subjt:  TIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEVATQVM

Query:  GYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWSEMQAE
            +K LW+  Q+L G  +R++  YL+  F   RKG  KM DYL  MK   D L  AG+PVS+ +LI Q L GLD EYNPVV  +  +  +SW ++QA+
Subjt:  GYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWSEMQAE

Query:  LLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRP
        LL FE R+E Q    T+++   N + N+A                 N ++ RG++ NN  RG    N    RGGRGR         G S +  CQVCG  
Subjt:  LLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRP

Query:  GHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNE
         H+A+ C+HRFDK +S + +  G+                                S N FLA+  +V D +WY DSGAS+HVT       +  E+ G  
Subjt:  GHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNE

Query:  QVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT-----LSSS
         +++GNG KL I  TGS+ L      L+L + L VP+I KNL+SVS+L  DNN+  EF  N C +KDK +G+V+LKG+L+DGLYQL   +      +S  
Subjt:  QVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT-----LSSS

Query:  PSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQK
         S  RRLGHP +KVLD V+  C + V  ++   FC +CQ+GK H LPF  S+SHAQ+P +L+H+D+WGP P+ +  GF++Y+ F+DD SRFTW+YPLKQK
Subjt:  PSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQK

Query:  NDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
        ++   AF  F  L + QFN RIK  Q D G E+  + +     G  F
Subjt:  NDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]4.3e-12440.71Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV
        L    ++KLDR N+ LW+++ LPI+R  +L+G++LG K CP E IT                    ++ +S   NP +E W   DQ LLGWL NSMT  +
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV

Query:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS
        ATQ++    +  LW+  Q+L G  +R++  YL+  F  TRKG  KM DYL  MK  AD L  AG+P+S+ +LI Q L GLD EYNPVV  +  +  +SW 
Subjt:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS

Query:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQ-RGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC
        ++QA+LL FE R+E Q    T+++   N + N+A K       + +RG   N NN  RG N N     GSNF G RG  GRGR +           +  C
Subjt:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQ-RGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC

Query:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR
        QVCG   H+A+ C++RFDK +S +       N S N    G++                     N FLA+  ++ D +WY DSGAS+HVT   +   N  
Subjt:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR

Query:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQL---ESVQT
        E+ G   +I+GNG KL I  TGS+ L      L+L + L VP I KNL+SVS+L  DNN+  EF  N C +KDK +G+ +L+GIL+DGLYQL   +S   
Subjt:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQL---ESVQT

Query:  LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYP
        +S   S  R+LGHP +KVLD V++ CN+ +  ++   FC +CQ+GK H LPF  S SHA++  +L+H+D+WGP P+ S  GF++Y+ F+DD +RFTW+YP
Subjt:  LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYP

Query:  LKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
        LKQK+D   AF  F  +V+ QF+ +IK  Q D G E+  + +     G  F
Subjt:  LKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]7.6e-12139.85Show/hide
Query:  NQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMT
        N L ++I ++ LDR NF LWK+L LPI+R  +L+G++LGTK CP + IT                AEAS        NP +  W   DQ +LGWL N+MT
Subjt:  NQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMT

Query:  SEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDI
        +  A+Q++    +K LWE  Q+L    +R+   YLR  F  TRKG  KM DYL  MK  AD L  AGSP+++ +LI Q L GLD +YNP+V  +  + ++
Subjt:  SEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDI

Query:  SWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKG-LTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNR
        SW ++QA+LL FE RL+   +     + + N + N+A+K        NH   W                  GS+F   RG  G+GR            + 
Subjt:  SWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKG-LTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNR

Query:  PICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLV
         ICQVC + GH A+ C HR+DK ++ +   N N                                  N FLA+     D  WY DSGAS+HVT   +   
Subjt:  PICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLV

Query:  NPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLE--SV
           E  G   +I+GNG KL I  +GS+ L +    L+L + L VP I KNL+SVS+LT DNN+  EF  + C +KDK +G+VLL+GIL+DGLYQL   S 
Subjt:  NPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLE--SV

Query:  QT-------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDD
        QT       LS   S  R+LGHP++ VLD V++ CN+    ++  +FC +CQ GK+H LPF  S+SHAQ+  +LIH+D+WGP P+ S  GF++Y+ F+DD
Subjt:  QT-------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDD

Query:  HSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
         SRFTW+YPLKQK+D   AF  F  +V+ QFN RIK  Q D G EF  + +     G  F
Subjt:  HSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]5.8e-12140.43Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV
        L    ++KLDR NF LWK+L LP++R  K +G++LGTK CP + +T                   S   T  + NP Y+ W   DQ LLGWL NSMT ++
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV

Query:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS
        ATQV+    +K LW+  Q+L G  +R+   YL+  F  T K   KM  YL  MK  AD L  AGSP+SS +L+ Q L GLD EYNPVV  +  + +ISW 
Subjt:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS

Query:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQ
        + QA+LL FE RL+ Q     +++   N S N ASK                  N+ G N    + G    N    RGGRGR      +      RPICQ
Subjt:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQ

Query:  VCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPRE
        +CG+ GH A  CY+RFDK                      +Y    +YA            S + F+A+P    D  WY DSGAS+HVT     L +  E
Subjt:  VCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPRE

Query:  YGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT----
          G   +++GNG KL I  +GST L D    ++L N L VP+I KNL+SVS+LT DNN   EF  N+C +KDK +G+ LLKG L+DGLYQL + +     
Subjt:  YGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT----

Query:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR
              +S      R+LGHP +KVL+ V+++ N+ +  ++   FC +CQFGK H LPF  S+SHA++P DLIH+D+WGP P+ S   F++Y+ FLDD SR
Subjt:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR

Query:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHR
        FTW++PLKQK++   AF  F  LV+ QFN +IK  + D G E+  + +
Subjt:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHR

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]1.7e-12039.27Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV
        L  I ++KLDR N+ LWK+L LP++R  K +G++LGTK CP + +T                    S+  S   NP ++ W+  DQ LLGWL NSM  ++
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV

Query:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS
        ATQ++    +K LW+  Q+L G  +++   YL+  F  TRKG  KM +YL  MK  +D L  +GSP+S+ +L+ Q L GLD EYNPVV  +  + ++SW 
Subjt:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS

Query:  EMQAELLVFEKRLELQTAQKTSVS-FSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC
        ++QA+LL FE RL+    Q  + S  + N S N A+K        H+RG              N +R  SNF G RG  G+GR            +   C
Subjt:  EMQAELLVFEKRLELQTAQKTSVS-FSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC

Query:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR
        QVC   GH A+ C +RFD+ ++         N S      G++                     + F+A+P    D  WY DSGAS+HVT   +      
Subjt:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR

Query:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT---
        E+ G   +++GNG KL I  +GST L    + L+L + L VP I KNL+SVS+LT DNN++ EF  N C +KDK +G+ LLKG L+DGLYQL  V     
Subjt:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT---

Query:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR
              +S   S  R+LGHP +KVL+ V+++CN+ +  ++   FC +CQFGK H LPF  S+SH Q+P  LIHSD+WGP P+ SP GF++Y+ F+DD SR
Subjt:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR

Query:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
        FTW++PLKQK+D   AF  F  L + QFN +IK  Q D G E+  + +     G  F
Subjt:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)2.1e-12440.71Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV
        L    ++KLDR N+ LW+++ LPI+R  +L+G++LG K CP E IT                    ++ +S   NP +E W   DQ LLGWL NSMT  +
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV

Query:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS
        ATQ++    +  LW+  Q+L G  +R++  YL+  F  TRKG  KM DYL  MK  AD L  AG+P+S+ +LI Q L GLD EYNPVV  +  +  +SW 
Subjt:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS

Query:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQ-RGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC
        ++QA+LL FE R+E Q    T+++   N + N+A K       + +RG   N NN  RG N N     GSNF G RG  GRGR +           +  C
Subjt:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQ-RGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC

Query:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR
        QVCG   H+A+ C++RFDK +S +       N S N    G++                     N FLA+  ++ D +WY DSGAS+HVT   +   N  
Subjt:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR

Query:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQL---ESVQT
        E+ G   +I+GNG KL I  TGS+ L      L+L + L VP I KNL+SVS+L  DNN+  EF  N C +KDK +G+ +L+GIL+DGLYQL   +S   
Subjt:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQL---ESVQT

Query:  LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYP
        +S   S  R+LGHP +KVLD V++ CN+ +  ++   FC +CQ+GK H LPF  S SHA++  +L+H+D+WGP P+ S  GF++Y+ F+DD +RFTW+YP
Subjt:  LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYP

Query:  LKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
        LKQK+D   AF  F  +V+ QF+ +IK  Q D G E+  + +     G  F
Subjt:  LKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-12139.85Show/hide
Query:  NQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMT
        N L ++I ++ LDR NF LWK+L LPI+R  +L+G++LGTK CP + IT                AEAS        NP +  W   DQ +LGWL N+MT
Subjt:  NQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMT

Query:  SEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDI
        +  A+Q++    +K LWE  Q+L    +R+   YLR  F  TRKG  KM DYL  MK  AD L  AGSP+++ +LI Q L GLD +YNP+V  +  + ++
Subjt:  SEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDI

Query:  SWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKG-LTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNR
        SW ++QA+LL FE RL+   +     + + N + N+A+K        NH   W                  GS+F   RG  G+GR            + 
Subjt:  SWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKG-LTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNR

Query:  PICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLV
         ICQVC + GH A+ C HR+DK ++ +   N N                                  N FLA+     D  WY DSGAS+HVT   +   
Subjt:  PICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLV

Query:  NPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLE--SV
           E  G   +I+GNG KL I  +GS+ L +    L+L + L VP I KNL+SVS+LT DNN+  EF  + C +KDK +G+VLL+GIL+DGLYQL   S 
Subjt:  NPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLE--SV

Query:  QT-------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDD
        QT       LS   S  R+LGHP++ VLD V++ CN+    ++  +FC +CQ GK+H LPF  S+SHAQ+  +LIH+D+WGP P+ S  GF++Y+ F+DD
Subjt:  QT-------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDD

Query:  HSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
         SRFTW+YPLKQK+D   AF  F  +V+ QFN RIK  Q D G EF  + +     G  F
Subjt:  HSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)2.8e-12140.43Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV
        L    ++KLDR NF LWK+L LP++R  K +G++LGTK CP + +T                   S   T  + NP Y+ W   DQ LLGWL NSMT ++
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV

Query:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS
        ATQV+    +K LW+  Q+L G  +R+   YL+  F  T K   KM  YL  MK  AD L  AGSP+SS +L+ Q L GLD EYNPVV  +  + +ISW 
Subjt:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS

Query:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQ
        + QA+LL FE RL+ Q     +++   N S N ASK                  N+ G N    + G    N    RGGRGR      +      RPICQ
Subjt:  EMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQ

Query:  VCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPRE
        +CG+ GH A  CY+RFDK                      +Y    +YA            S + F+A+P    D  WY DSGAS+HVT     L +  E
Subjt:  VCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPRE

Query:  YGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT----
          G   +++GNG KL I  +GST L D    ++L N L VP+I KNL+SVS+LT DNN   EF  N+C +KDK +G+ LLKG L+DGLYQL + +     
Subjt:  YGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT----

Query:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR
              +S      R+LGHP +KVL+ V+++ N+ +  ++   FC +CQFGK H LPF  S+SHA++P DLIH+D+WGP P+ S   F++Y+ FLDD SR
Subjt:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR

Query:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHR
        FTW++PLKQK++   AF  F  LV+ QFN +IK  + D G E+  + +
Subjt:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHR

A0A2K3NEN7 Copia-like polyprotein (Fragment)8.2e-12139.27Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV
        L  I ++KLDR N+ LWK+L LP++R  K +G++LGTK CP + +T                    S+  S   NP ++ W+  DQ LLGWL NSM  ++
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV

Query:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS
        ATQ++    +K LW+  Q+L G  +++   YL+  F  TRKG  KM +YL  MK  +D L  +GSP+S+ +L+ Q L GLD EYNPVV  +  + ++SW 
Subjt:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWS

Query:  EMQAELLVFEKRLELQTAQKTSVS-FSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC
        ++QA+LL FE RL+    Q  + S  + N S N A+K        H+RG              N +R  SNF G RG  G+GR            +   C
Subjt:  EMQAELLVFEKRLELQTAQKTSVS-FSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPIC

Query:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR
        QVC   GH A+ C +RFD+ ++         N S      G++                     + F+A+P    D  WY DSGAS+HVT   +      
Subjt:  QVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPR

Query:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT---
        E+ G   +++GNG KL I  +GST L    + L+L + L VP I KNL+SVS+LT DNN++ EF  N C +KDK +G+ LLKG L+DGLYQL  V     
Subjt:  EYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT---

Query:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR
              +S   S  R+LGHP +KVL+ V+++CN+ +  ++   FC +CQFGK H LPF  S+SH Q+P  LIHSD+WGP P+ SP GF++Y+ F+DD SR
Subjt:  ------LSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSR

Query:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
        FTW++PLKQK+D   AF  F  L + QFN +IK  Q D G E+  + +     G  F
Subjt:  FTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.1e-12540.65Show/hide
Query:  TIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEVATQVM
        ++KLDR N+ LWK+L LP++R  KL+G++LGT+ CP E IT                    SS +S   N  +  W   DQ LLGW+ NSMT+E+ATQ++
Subjt:  TIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEVATQVM

Query:  GYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWSEMQAE
            +K LW+  Q+L G  +R++  YL+  F   RKG  KM DYL  MK   D L  AG+PVS+ +LI Q L GLD EYNPVV  +  +  +SW ++QA+
Subjt:  GYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRGDISWSEMQAE

Query:  LLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRP
        LL FE R+E Q    T+++   N + N+A                 N ++ RG++ NN  RG    N    RGGRGR         G S +  CQVCG  
Subjt:  LLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRP

Query:  GHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNE
         H+A+ C+HRFDK +S + +  G+                                S N FLA+  +V D +WY DSGAS+HVT       +  E+ G  
Subjt:  GHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNE

Query:  QVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT-----LSSS
         +++GNG KL I  TGS+ L      L+L + L VP+I KNL+SVS+L  DNN+  EF  N C +KDK +G+V+LKG+L+DGLYQL   +      +S  
Subjt:  QVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQT-----LSSS

Query:  PSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQK
         S  RRLGHP +KVLD V+  C + V  ++   FC +CQ+GK H LPF  S+SHAQ+P +L+H+D+WGP P+ +  GF++Y+ F+DD SRFTW+YPLKQK
Subjt:  PSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQK

Query:  NDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF
        ++   AF  F  L + QFN RIK  Q D G E+  + +     G  F
Subjt:  NDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECYRLGQAF

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.8e-1719.59Show/hide
Query:  VVSNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTR-KGNTKMTDYLRLMKTHADNLGQAGSPVSSR
        ++ N + ++W   ++     +   ++            A+ + E +  ++  +S A +  LR+     +      +  +  +       L  AG+ +   
Subjt:  VVSNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTR-KGNTKMTDYLRLMKTHADNLGQAGSPVSSR

Query:  NLISQVLLGLDEEYNPVVAMIHGRGDISWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSN
        + IS +L+ L   Y+ ++  I      + SE    L   + RL     Q+  +   HN   + + K +    +N+N  +  N    R   P    +G S 
Subjt:  NLISQVLLGLDEEYNPVVAMIHGRGDISWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSN

Query:  FNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANP
        +                        +  C  CGR GH+   C+H   K    N+N+     +    S+                   F+    N    N 
Subjt:  FNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANP

Query:  ETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLL
          + +  + +DSGAS H+  D +   +  E     ++ +    +   +           H ++LE+ L   + A NL+SV RL Q+  +  EF  +   +
Subjt:  ETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLL

Query:  KDKASGRVLLKGILRD-GLYQLESVQTLSSSPSLFR----RLGHPA-SKVLDF----VIRECNLPVKRNEALQFCSSCQFGKAHNLPFP--LSNSHAQKP
               V   G+L +  +   ++    +   + FR    R GH +  K+L+     +  + +L      + + C  C  GK   LPF      +H ++P
Subjt:  KDKASGRVLLKGILRD-GLYQLESVQTLSSSPSLFR----RLGHPA-SKVLDF----VIRECNLPVKRNEALQFCSSCQFGKAHNLPFP--LSNSHAQKP

Query:  FDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFV--HIHRECYRLGQAFH
          ++HSD+ GP    + D   ++++F+D  + +   Y +K K+D FS F+ F+   +  FN ++     DNGRE++   + + C + G ++H
Subjt:  FDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFV--HIHRECYRLGQAFH

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-2523.22Show/hide
Query:  EAWITTDQLLLGWLYNSMTSEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYL-RQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVL
        E W   D+     +   ++ +V   ++  + A+ +W  +++L+  ++   + YL +Q +       T    +L +       L   G  +   +    +L
Subjt:  EAWITTDQLLLGWLYNSMTSEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYL-RQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVL

Query:  LGLDEEY-NPVVAMIHGRGDISWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRG
          L   Y N    ++HG+  I   ++ + LL+ EK                          +   P N  +        +  Q  +N          N G
Subjt:  LGLDEEY-NPVVAMIHGRGDISWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRG

Query:  R-GGRGRGYGNYGSNYGNSNRPICQVCGRPGHLALTCYHRFDKEFSPNQNR-NGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVI
        R G RG+      S   N     C  C +PGH    C         PN  +  G  +   N  N+   V N +     +       +   P         
Subjt:  R-GGRGRGYGNYGSNYGNSNRPICQVCGRPGHLALTCYHRFDKEFSPNQNR-NGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVI

Query:  DSNWYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTKLPISFTGSTYL-TDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDK
        +S W VD+ AS H T    +L      G    V +GN +   I+  G   + T+    L L++   VPD+  NL+S   L +D   Y  +  N      K
Subjt:  DSNWYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTKLPISFTGSTYL-TDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDK

Query:  ASGRVLLKGILRDGLY---------QLESVQTLSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHS
         S  V+ KG+ R  LY         +L + Q   S     +R+GH + K L  + ++  +   +   ++ C  C FGK H + F  S+       DL++S
Subjt:  ASGRVLLKGILRDGLY---------QLESVQTLSSSPSLFRRLGHPASKVLDFVIRECNLPVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHS

Query:  DLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECY
        D+ GP  ++S  G ++++ F+DD SR  W+Y LK K+  F  F+ F  LV+ +   ++K  +SDNG E+     E Y
Subjt:  DLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFVHIHRECY

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein8.4e-1422.65Show/hide
Query:  ANPETVIDSN------WYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTK--LPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNL
        + P   IDSN        +DSGAS  +    + L +      N ++ I +  K  +PI+  G+ +    +   +    L  P+IA +L+S+S LT + N+
Subjt:  ANPETVIDSN------WYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTK--LPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNL

Query:  YFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQTLSSS--------------------PSLFRRLGHPASKVLDFVIRECNLPVKRNEALQF-----
           F  N     +++ G VL   +     Y L     + S                     P + R LGH   + +   +++  +   +   +++     
Subjt:  YFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQTLSSS--------------------PSLFRRLGHPASKVLDFVIRECNLPVKRNEALQF-----

Query:  --CSSCQFGKA----HNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPL--KQKNDAFSAFKHFIQLVQTQFNSRIKAFQ
          C  C  GK+    H     L    + +PF  +H+D++GP          ++I F D+ +RF W+YPL  +++    + F   +  ++ QFN+R+   Q
Subjt:  --CSSCQFGKA----HNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPL--KQKNDAFSAFKHFIQLVQTQFNSRIKAFQ

Query:  SDNGREFVH
         D G E+ +
Subjt:  SDNGREFVH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-6730.52Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV
        +N     KL   N+L+W      +   Y+L G L G+   PP  I     GT + P                  NP Y  W   D+L+   +  +++  V
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITTDQLLLGWLYNSMTSEV

Query:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRG-DISW
           V     A  +WE ++ ++   S      LR   +Q  KG   + DY++ + T  D L   G P+     + +VL  L EEY PV+  I  +    + 
Subjt:  ATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGRG-DISW

Query:  SEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNP-QRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPI
        +E+   LL  E ++ L  +  T +  + N    ++ +  T   NN+N   N   +N+   N + P Q+  +NF+ N                  N ++P 
Subjt:  SEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNP-QRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPI

Query:  ---CQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNL
           CQ+CG  GH A  C                        S   +++ + N   S  P + F        LA       +NW +DSGA+ H+T D+NNL
Subjt:  ---CQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNL

Query:  VNPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQ--LES
           + Y G + V++ +G+ +PIS TGST L+  S  L+L N L VP+I KNL+SV RL   N +  EF      +KD  +G  LL+G  +D LY+  + S
Subjt:  VNPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQ--LES

Query:  VQTLS--SSP-------SLFRRLGHPASKVLDFVIRECNLPVKRNEALQF--CSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYI
         Q +S  +SP       S   RLGHPA  +L+ VI   +L V  N + +F  CS C   K++ +PF  S  ++ +P + I+SD+W  +P+ S D +R+Y+
Subjt:  VQTLS--SSP-------SLFRRLGHPASKVLDFVIRECNLPVKRNEALQF--CSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYI

Query:  LFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFV
        +F+D  +R+TWLYPLKQK+     F  F  L++ +F +RI  F SDNG EFV
Subjt:  LFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.4e-5531.6Show/hide
Query:  NPLYEAWITTDQLLLGWLYNSMTSEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLIS
        NP Y  W   D+L+   +  +++  V   V     A  +WE ++ ++   S      LR                     T  D L   G P+     + 
Subjt:  NPLYEAWITTDQLLLGWLYNSMTSEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLIS

Query:  QVLLGLDEEYNPVVAMIHGRG-DISWSEMQAELLVFE-KRLELQTAQKTSVSFSHNTSVNMASKGLTNP-PNNHNRGWNVNPNNQRGQ-NPNNPQRGGSN
        +VL  L ++Y PV+  I  +    S +E+   L+  E K L L +A+   +      + N+ +   TN   N +NRG N N NN   + N   P   GS 
Subjt:  QVLLGLDEEYNPVVAMIHGRG-DISWSEMQAELLVFE-KRLELQTAQKTSVSFSHNTSVNMASKGLTNP-PNNHNRGWNVNPNNQRGQ-NPNNPQRGGSN

Query:  FNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANP
         +  + +   GR                CQ+C   GH A  C         P  ++   F  + NQ  S      T+      P      NS  P+ AN 
Subjt:  FNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGRPGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANP

Query:  ETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLL
              NW +DSGA+ H+T D+NNL   + Y G + V+I +G+ +PI+ TGS  L   S  L L   L VP+I KNL+SV RL   N +  EF      +
Subjt:  ETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTKLPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLL

Query:  KDKASGRVLLKGILRDGLYQ--LESVQTLS--SSP-------SLFRRLGHPASKVLDFVIRECNLPV-KRNEALQFCSSCQFGKAHNLPFPLSNSHAQKP
        KD  +G  LL+G  +D LY+  + S Q +S  +SP       S   RLGHP+  +L+ VI   +LPV   +  L  CS C   K+H +PF  S   + KP
Subjt:  KDKASGRVLLKGILRDGLYQ--LESVQTLS--SSP-------SLFRRLGHPASKVLDFVIRECNLPV-KRNEALQFCSSCQFGKAHNLPFPLSNSHAQKP

Query:  FDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFV
         + I+SD+W  +P+ S D +R+Y++F+D  +R+TWLYPLKQK+     F  F  LV+ +F +RI    SDNG EFV
Subjt:  FDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFV

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.7e-0624.77Show/hide
Query:  MNTTTFSSPPLNQLLNQI-----TTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYE-A
        M  TT SS   +  + QI      T+ L++ N+ +W+ L   +  S+ + GH+ G+                S PT                  P+ E  
Subjt:  MNTTTFSSPPLNQLLNQI-----TTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYE-A

Query:  WITTDQLLLGWLYNSMTSEVATQVMGYN-NAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLG
        W   D L+  W+Y ++T  +   ++     A+DLW +++NLF     A         + T   +  + +Y + +K+ +D L    SP+S R L+  +L G
Subjt:  WITTDQLLLGWLYNSMTSEVATQVMGYN-NAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLG

Query:  LDEEYNPVVAMIHGRGDI-SWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRG
        L E+Y+ ++ +I  +    S++E ++ LL+ E RL    + K+  S SH    ++++   T P          + NN      +N  RG S    NRG G
Subjt:  LDEEYNPVVAMIHGRGDI-SWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRG

Query:  GRGRGYGNYGSNYGNSNRPICQVCGRP
             Y N  +N    N+P   + G P
Subjt:  GRGRGYGNYGSNYGNSNRPICQVCGRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAACGTCTCACTTAACAGTACTGCACCACTGATGAACACAACTACGTTTAGCTCACCACCGCTCAATCAACTGTTGAATCAAATTACTACGATCAAGTTGGATCG
AGGAAATTTCTTGTTGTGGAAAAATCTTGCTCTCCCAATTTTGCGAAGTTACAAGCTCGAAGGGCATTTACTCGGAACAAAGCCCTGTCCGCCTGAAATTATTACTCAAA
CTACTGGAGGTACTGTCAGCAATCCCACAGAAAATGCAGCTGGAGCAGAAGCTTCAAGCTCTACAACATCAGTGGTCTCAAATCCACTATATGAAGCATGGATTACGACT
GATCAATTGCTATTAGGATGGCTGTACAACTCCATGACATCTGAGGTTGCCACTCAAGTTATGGGATACAACAATGCAAAGGATCTGTGGGAAGCCATCCAGAATCTCTT
TGGAATTCAATCGAGAGCAGAGGAGGATTATCTCCGGCAAACATTTCAGCAGACACGGAAAGGTAATACTAAAATGACAGATTATTTGAGACTAATGAAAACTCATGCCG
ATAATCTAGGGCAAGCTGGGAGTCCTGTCTCTTCGAGAAATCTTATTTCTCAAGTGTTGCTCGGCTTAGACGAGGAGTATAACCCTGTTGTCGCTATGATTCATGGCAGA
GGAGATATATCTTGGTCCGAAATGCAAGCAGAACTACTTGTCTTTGAGAAACGACTGGAACTACAGACGGCTCAAAAAACCTCGGTCTCATTCAGTCATAATACGTCTGT
CAACATGGCAAGCAAAGGCTTGACCAATCCTCCCAATAACCACAACCGAGGGTGGAATGTCAATCCTAACAATCAGAGAGGTCAGAATCCTAACAATCCACAGAGAGGAG
GATCAAATTTTAATGGAAACAGAGGTCGTGGTGGAAGAGGCAGAGGCTATGGCAACTACGGTAGCAACTACGGTAACTCTAATCGACCTATTTGTCAAGTCTGTGGTAGG
CCAGGACATCTAGCTCTCACTTGTTATCATAGGTTTGACAAAGAGTTTAGTCCAAATCAAAATAGGAATGGCAACTTCAATATCTCTAATAATCAGTCGAATTCTGGAAA
TTATGTTGGGAATACTAATTATGCTGGAAGTTCTGTTCCTGCTACGACTTTTGTTGCAAATTCAGGAAACCCGTTTTTAGCAAATCCTGAAACTGTGATTGACTCAAATT
GGTACGTGGATAGCGGCGCCTCGAGCCATGTCACTGGTGATTACAACAACCTGGTTAATCCAAGAGAATATGGAGGTAATGAGCAAGTCATTATAGGTAATGGTACTAAG
TTGCCTATTTCTTTTACTGGAAGCACTTATTTAACTGATGGGTCTCATGTTCTTAGCCTCGAAAACACACTTTTAGTGCCTGATATTGCTAAGAATTTGGTCAGTGTTTC
AAGGTTAACACAAGATAATAATCTATACTTTGAATTTCATGGAAATTTTTGTCTTCTAAAGGACAAGGCCTCGGGTCGGGTGCTGCTGAAAGGAATCCTTAGAGATGGCT
TGTATCAGCTAGAGAGTGTTCAAACCTTATCTTCAAGCCCTTCGCTATTTCGAAGGCTTGGTCATCCAGCCTCTAAGGTTCTTGATTTTGTTATTAGAGAATGTAACCTT
CCAGTTAAAAGAAATGAAGCTTTACAGTTTTGCTCATCCTGTCAATTTGGTAAAGCTCACAACCTTCCTTTTCCCTTGTCTAACAGTCATGCGCAGAAACCTTTTGATTT
AATTCATTCTGACCTTTGGGGTCCAACACCTGTTCAATCCCCTGATGGTTTTCGTTTTTATATATTATTCTTGGATGATCACAGCCGTTTCACATGGTTATATCCTCTTA
AACAGAAGAATGATGCGTTCTCTGCTTTTAAACACTTCATTCAGCTGGTTCAAACTCAGTTTAATAGCAGAATAAAGGCTTTCCAGTCAGATAATGGGAGAGAATTTGTA
CATATTCATCGTGAATGCTACAGGTTGGGTCAGGCTTTCCACCCTCTTTTTTTATCTTCAACCATTCTCACACCACCAATTTGCTGCCGGCGACCGACCACCGGCAACCG
TCGCCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCAACGTCTCACTTAACAGTACTGCACCACTGATGAACACAACTACGTTTAGCTCACCACCGCTCAATCAACTGTTGAATCAAATTACTACGATCAAGTTGGATCG
AGGAAATTTCTTGTTGTGGAAAAATCTTGCTCTCCCAATTTTGCGAAGTTACAAGCTCGAAGGGCATTTACTCGGAACAAAGCCCTGTCCGCCTGAAATTATTACTCAAA
CTACTGGAGGTACTGTCAGCAATCCCACAGAAAATGCAGCTGGAGCAGAAGCTTCAAGCTCTACAACATCAGTGGTCTCAAATCCACTATATGAAGCATGGATTACGACT
GATCAATTGCTATTAGGATGGCTGTACAACTCCATGACATCTGAGGTTGCCACTCAAGTTATGGGATACAACAATGCAAAGGATCTGTGGGAAGCCATCCAGAATCTCTT
TGGAATTCAATCGAGAGCAGAGGAGGATTATCTCCGGCAAACATTTCAGCAGACACGGAAAGGTAATACTAAAATGACAGATTATTTGAGACTAATGAAAACTCATGCCG
ATAATCTAGGGCAAGCTGGGAGTCCTGTCTCTTCGAGAAATCTTATTTCTCAAGTGTTGCTCGGCTTAGACGAGGAGTATAACCCTGTTGTCGCTATGATTCATGGCAGA
GGAGATATATCTTGGTCCGAAATGCAAGCAGAACTACTTGTCTTTGAGAAACGACTGGAACTACAGACGGCTCAAAAAACCTCGGTCTCATTCAGTCATAATACGTCTGT
CAACATGGCAAGCAAAGGCTTGACCAATCCTCCCAATAACCACAACCGAGGGTGGAATGTCAATCCTAACAATCAGAGAGGTCAGAATCCTAACAATCCACAGAGAGGAG
GATCAAATTTTAATGGAAACAGAGGTCGTGGTGGAAGAGGCAGAGGCTATGGCAACTACGGTAGCAACTACGGTAACTCTAATCGACCTATTTGTCAAGTCTGTGGTAGG
CCAGGACATCTAGCTCTCACTTGTTATCATAGGTTTGACAAAGAGTTTAGTCCAAATCAAAATAGGAATGGCAACTTCAATATCTCTAATAATCAGTCGAATTCTGGAAA
TTATGTTGGGAATACTAATTATGCTGGAAGTTCTGTTCCTGCTACGACTTTTGTTGCAAATTCAGGAAACCCGTTTTTAGCAAATCCTGAAACTGTGATTGACTCAAATT
GGTACGTGGATAGCGGCGCCTCGAGCCATGTCACTGGTGATTACAACAACCTGGTTAATCCAAGAGAATATGGAGGTAATGAGCAAGTCATTATAGGTAATGGTACTAAG
TTGCCTATTTCTTTTACTGGAAGCACTTATTTAACTGATGGGTCTCATGTTCTTAGCCTCGAAAACACACTTTTAGTGCCTGATATTGCTAAGAATTTGGTCAGTGTTTC
AAGGTTAACACAAGATAATAATCTATACTTTGAATTTCATGGAAATTTTTGTCTTCTAAAGGACAAGGCCTCGGGTCGGGTGCTGCTGAAAGGAATCCTTAGAGATGGCT
TGTATCAGCTAGAGAGTGTTCAAACCTTATCTTCAAGCCCTTCGCTATTTCGAAGGCTTGGTCATCCAGCCTCTAAGGTTCTTGATTTTGTTATTAGAGAATGTAACCTT
CCAGTTAAAAGAAATGAAGCTTTACAGTTTTGCTCATCCTGTCAATTTGGTAAAGCTCACAACCTTCCTTTTCCCTTGTCTAACAGTCATGCGCAGAAACCTTTTGATTT
AATTCATTCTGACCTTTGGGGTCCAACACCTGTTCAATCCCCTGATGGTTTTCGTTTTTATATATTATTCTTGGATGATCACAGCCGTTTCACATGGTTATATCCTCTTA
AACAGAAGAATGATGCGTTCTCTGCTTTTAAACACTTCATTCAGCTGGTTCAAACTCAGTTTAATAGCAGAATAAAGGCTTTCCAGTCAGATAATGGGAGAGAATTTGTA
CATATTCATCGTGAATGCTACAGGTTGGGTCAGGCTTTCCACCCTCTTTTTTTATCTTCAACCATTCTCACACCACCAATTTGCTGCCGGCGACCGACCACCGGCAACCG
TCGCCGCTAG
Protein sequenceShow/hide protein sequence
MTNVSLNSTAPLMNTTTFSSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLLGTKPCPPEIITQTTGGTVSNPTENAAGAEASSSTTSVVSNPLYEAWITT
DQLLLGWLYNSMTSEVATQVMGYNNAKDLWEAIQNLFGIQSRAEEDYLRQTFQQTRKGNTKMTDYLRLMKTHADNLGQAGSPVSSRNLISQVLLGLDEEYNPVVAMIHGR
GDISWSEMQAELLVFEKRLELQTAQKTSVSFSHNTSVNMASKGLTNPPNNHNRGWNVNPNNQRGQNPNNPQRGGSNFNGNRGRGGRGRGYGNYGSNYGNSNRPICQVCGR
PGHLALTCYHRFDKEFSPNQNRNGNFNISNNQSNSGNYVGNTNYAGSSVPATTFVANSGNPFLANPETVIDSNWYVDSGASSHVTGDYNNLVNPREYGGNEQVIIGNGTK
LPISFTGSTYLTDGSHVLSLENTLLVPDIAKNLVSVSRLTQDNNLYFEFHGNFCLLKDKASGRVLLKGILRDGLYQLESVQTLSSSPSLFRRLGHPASKVLDFVIRECNL
PVKRNEALQFCSSCQFGKAHNLPFPLSNSHAQKPFDLIHSDLWGPTPVQSPDGFRFYILFLDDHSRFTWLYPLKQKNDAFSAFKHFIQLVQTQFNSRIKAFQSDNGREFV
HIHRECYRLGQAFHPLFLSSTILTPPICCRRPTTGNRRR