; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005008 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005008
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:9466341..9468362
RNA-Seq ExpressionLag0005008
SyntenyLag0005008
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]3.9e-12241.3Show/hide
Query:  AGTTGFNNPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQY
        A   G NN   N L + + S+KLDR N+ LWK+L LP++R  KL+G++ G+E CP +F++SS +                                N  +
Subjt:  AGTTGFNNPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQY

Query:  ESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLL
          W A DQ LLGW+ NSM+ E+ATQ++  E ++ LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+  D L  AG+PVST  LI Q L 
Subjt:  ESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLL

Query:  GLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGY
        GLD E+NPVV  +  +  ++W ++QA+LL FE R+E  N L T+LT++   + N+A   D  G+          S   N+   N RG  G RGRG+S   
Subjt:  GLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGY

Query:  GGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANP
              K  CQVCG   H A+ C+ RF+K YS                     +  A      + N F+AS ++V D +WY DSGASNHVT       + 
Subjt:  GGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANP

Query:  VDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTP
         ++ G   + VGNG KL I   G++ L     SLNL + L VPNI KNL+S+SKLA DN+I VEF ++ C VK K TGKV+LKG+L DGLY     K  P
Subjt:  VDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTP

Query:  VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELI
                                AFV   SV     K+ WH+RLGHP+ KVL+ VL+SC +    +++  FC+ACQYGK H LPF +S S A    EL+
Subjt:  VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELI

Query:  HTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        HTD+WGPAPI ++ GFKYYV F+DDFSRF WIYPLKQKS+T  AF  F  + +NQF   I+
Subjt:  HTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]9.2e-12442.5Show/hide
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL
        L    S+KLDR N+ LW+++ LPI+R  +L+G++ G + CP +F+++               A SS+               NP++E W A DQ LLGWL
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL

Query:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ
         NSM+  +ATQ++  E +  LW   Q L G  +R++  YL+  F  +RKG +KM DYL  MK+ AD L  AG+P+ST  LI Q L GLD E+NPVV  + 
Subjt:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ

Query:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGYGGYNNNKPTCQVCG
         +  ++W ++QA+LL FE R+E  N+L T+LT++   + N+A   D  G R +  N   GS   N+   N RG  G RGRGRS         K TCQVCG
Subjt:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGYGGYNNNKPTCQVCG

Query:  KIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNG
           H A+ C+ RF+K YS             R N   N  +  +       N F+AS +++ D +WY DSGASNHVT   +   N  ++ G   + VGNG
Subjt:  KIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNG

Query:  NKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKES
         KL+I   G++ L     SLNL + L VP I KNL+S+SKLA DN+I VEF ++ C VK K TGK +L+G+L DGLY                + S+K+S
Subjt:  NKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKES

Query:  VNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQ
                      S  V++   K+ WH++LGHP+ KVL+ VLKSCN+    ++   FC+ACQYGK H LPF  S S A    EL+HTD+WGPAPI S+ 
Subjt:  VNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQ

Query:  GFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        GFKYYV FIDDF+RF WIYPLKQKSDT  AF  F  MV+NQF   I+
Subjt:  GFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]7.1e-11640.46Show/hide
Query:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLL
        N L ++I S+ LDR NF LWK+L LPI+R  +L+G++ G++ CP +F++S+ A  +                            +NP +  W A DQ +L
Subjt:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLL

Query:  GWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA
        GWL N+M+   A+Q++  E ++ LW   Q L    +R+   YLR  F  +RKG  KM DYL  MK  AD L  AGSP++   LI Q L GLD ++NP+V 
Subjt:  GWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA

Query:  MIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQV
         +  ++ ++W ++QA+LL FE RL+  N+      ++   + N+A        RG+  N           RG+ RG+  R     RG G  +N+   CQV
Subjt:  MIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQV

Query:  CGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNT-NPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTV
        C K GHTA+ C  R++K Y+G    N                   A V  Q T N F+AS     D  WY DSGASNHVT   +      +  G   + V
Subjt:  CGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNT-NPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTV

Query:  GNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSK
        GNG KLKI   G++ L     +LNL + L VP I KNL+S+SKL  DN+I VEF +  C VK K TGKVLL+G+L DGLY   +          S + +K
Subjt:  GNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSK

Query:  KESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPIT
           V                   +  K+ WH++LGHPS  VL+ VLK CN+ T  ++  KFC+ACQ GKSH LPF +S S A    ELIHTD+WGPAPI 
Subjt:  KESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPIT

Query:  STQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        S  GFKYYV FIDD SRF WIYPLKQKSDT  AF  F  MV+NQF   I+
Subjt:  STQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]1.6e-12041.02Show/hide
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL
        L    S+KLDR NF LWK+L LP++R  K +G++ G++ CP +FV +S   TE                            +NP Y+ W A DQ LLGWL
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL

Query:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ
         NSM+ ++ATQV+  E ++ LW   Q L G  +R+   YL+  F  + K  +KM  YL  MK+ AD L  AGSP+S+  L+ Q L GLD E+NPVV  + 
Subjt:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ

Query:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQVCGK
         +  I+W + QA+LL FE RL+  N       I+   S N A+  ++GG +        GSR G  G  ++   G RGR R          +P CQ+CGK
Subjt:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQVCGK

Query:  IGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNGN
         GHTA  CY RF+K Y+       G+GS+                     + FVASP    D  WY DSGASNHVT     + +  +  G   + VGNG 
Subjt:  IGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNGN

Query:  KLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKESV
        KLKI   G+T L+D    +NL N L VP I KNL+S+SKL  DN+  VEF +++C VK K TGK LLKG L DGLY   + K  P               
Subjt:  KLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKESV

Query:  NKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQG
        NK   + +              K++WH++LGHP+ KVLE VLK  N+    ++   FC+ACQ+GK H LPF  S S A    +LIHTD+WGPAPI S   
Subjt:  NKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQG

Query:  FKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        FKYYV F+DDFSRF WI+PLKQKS+T  AF+ F  +V+NQF   I+
Subjt:  FKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]2.1e-11540.03Show/hide
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL
        L  I S+KLDR N+ LWK+L LP++R  K +G++ G++ CP +FV+S+    +                            VNP ++ W+A DQ LLGWL
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL

Query:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ
         NSM+ ++ATQ++  E ++ LW   Q L G  +++   YL+  F  +RKG +KM +YL  MK+ +D L  +GSP+S   L+ Q L GLD E+NPVV  + 
Subjt:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ

Query:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGYGGYNNNKPTCQVCG
         ++ ++W ++QA+LL FE RL+  N   + LT++   S N A   +  G +           +GN+ R N RG  G RG+GR       +N K  CQVC 
Subjt:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGYGGYNNNKPTCQVCG

Query:  KIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNG
          GHTA+ C  RF++ Y+G       D                      + + FVASP    D  WY DSGASNHVT   +      ++ G   + VGNG
Subjt:  KIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNG

Query:  NKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKES
         KLKI   G+T L    N+LNL + L VP I KNL+S+SKL  DN+I+VEF  + C VK K TG+ LLKG L DGLY    V         S + +K   
Subjt:  NKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKES

Query:  VNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQ
        V                   +  K+ WH++LGHP+ KVLE VLK CN+    ++   FC+ACQ+GK H LPF +S S       LIH+D+WGPAPI S  
Subjt:  VNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQ

Query:  GFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        GFKYYV FIDDFSRF WI+PLKQKSDT  AF  F  + +NQF   I+
Subjt:  GFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)4.4e-12442.5Show/hide
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL
        L    S+KLDR N+ LW+++ LPI+R  +L+G++ G + CP +F+++               A SS+               NP++E W A DQ LLGWL
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL

Query:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ
         NSM+  +ATQ++  E +  LW   Q L G  +R++  YL+  F  +RKG +KM DYL  MK+ AD L  AG+P+ST  LI Q L GLD E+NPVV  + 
Subjt:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ

Query:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGYGGYNNNKPTCQVCG
         +  ++W ++QA+LL FE R+E  N+L T+LT++   + N+A   D  G R +  N   GS   N+   N RG  G RGRGRS         K TCQVCG
Subjt:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGYGGYNNNKPTCQVCG

Query:  KIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNG
           H A+ C+ RF+K YS             R N   N  +  +       N F+AS +++ D +WY DSGASNHVT   +   N  ++ G   + VGNG
Subjt:  KIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNG

Query:  NKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKES
         KL+I   G++ L     SLNL + L VP I KNL+S+SKLA DN+I VEF ++ C VK K TGK +L+G+L DGLY                + S+K+S
Subjt:  NKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKES

Query:  VNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQ
                      S  V++   K+ WH++LGHP+ KVL+ VLKSCN+    ++   FC+ACQYGK H LPF  S S A    EL+HTD+WGPAPI S+ 
Subjt:  VNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQ

Query:  GFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        GFKYYV FIDDF+RF WIYPLKQKSDT  AF  F  MV+NQF   I+
Subjt:  GFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-11640.46Show/hide
Query:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLL
        N L ++I S+ LDR NF LWK+L LPI+R  +L+G++ G++ CP +F++S+ A  +                            +NP +  W A DQ +L
Subjt:  NQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLL

Query:  GWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA
        GWL N+M+   A+Q++  E ++ LW   Q L    +R+   YLR  F  +RKG  KM DYL  MK  AD L  AGSP++   LI Q L GLD ++NP+V 
Subjt:  GWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA

Query:  MIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQV
         +  ++ ++W ++QA+LL FE RL+  N+      ++   + N+A        RG+  N           RG+ RG+  R     RG G  +N+   CQV
Subjt:  MIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQV

Query:  CGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNT-NPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTV
        C K GHTA+ C  R++K Y+G    N                   A V  Q T N F+AS     D  WY DSGASNHVT   +      +  G   + V
Subjt:  CGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNT-NPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTV

Query:  GNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSK
        GNG KLKI   G++ L     +LNL + L VP I KNL+S+SKL  DN+I VEF +  C VK K TGKVLL+G+L DGLY   +          S + +K
Subjt:  GNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSK

Query:  KESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPIT
           V                   +  K+ WH++LGHPS  VL+ VLK CN+ T  ++  KFC+ACQ GKSH LPF +S S A    ELIHTD+WGPAPI 
Subjt:  KESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPIT

Query:  STQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        S  GFKYYV FIDD SRF WIYPLKQKSDT  AF  F  MV+NQF   I+
Subjt:  STQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)7.9e-12141.02Show/hide
Query:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL
        L    S+KLDR NF LWK+L LP++R  K +G++ G++ CP +FV +S   TE                            +NP Y+ W A DQ LLGWL
Subjt:  LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWL

Query:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ
         NSM+ ++ATQV+  E ++ LW   Q L G  +R+   YL+  F  + K  +KM  YL  MK+ AD L  AGSP+S+  L+ Q L GLD E+NPVV  + 
Subjt:  YNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQ

Query:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQVCGK
         +  I+W + QA+LL FE RL+  N       I+   S N A+  ++GG +        GSR G  G  ++   G RGR R          +P CQ+CGK
Subjt:  GRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQVCGK

Query:  IGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNGN
         GHTA  CY RF+K Y+       G+GS+                     + FVASP    D  WY DSGASNHVT     + +  +  G   + VGNG 
Subjt:  IGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNGN

Query:  KLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKESV
        KLKI   G+T L+D    +NL N L VP I KNL+S+SKL  DN+  VEF +++C VK K TGK LLKG L DGLY   + K  P               
Subjt:  KLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKESV

Query:  NKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQG
        NK   + +              K++WH++LGHP+ KVLE VLK  N+    ++   FC+ACQ+GK H LPF  S S A    +LIHTD+WGPAPI S   
Subjt:  NKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQG

Query:  FKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        FKYYV F+DDFSRF WI+PLKQKS+T  AF+ F  +V+NQF   I+
Subjt:  FKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.9e-12241.3Show/hide
Query:  AGTTGFNNPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQY
        A   G NN   N L + + S+KLDR N+ LWK+L LP++R  KL+G++ G+E CP +F++SS +                                N  +
Subjt:  AGTTGFNNPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQY

Query:  ESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLL
          W A DQ LLGW+ NSM+ E+ATQ++  E ++ LW   Q L G  +R++  YL+  F   RKG +KM DYL  MK+  D L  AG+PVST  LI Q L 
Subjt:  ESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLL

Query:  GLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGY
        GLD E+NPVV  +  +  ++W ++QA+LL FE R+E  N L T+LT++   + N+A   D  G+          S   N+   N RG  G RGRG+S   
Subjt:  GLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRG-NGNRGRGRSRGY

Query:  GGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANP
              K  CQVCG   H A+ C+ RF+K YS                     +  A      + N F+AS ++V D +WY DSGASNHVT       + 
Subjt:  GGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANP

Query:  VDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTP
         ++ G   + VGNG KL I   G++ L     SLNL + L VPNI KNL+S+SKLA DN+I VEF ++ C VK K TGKV+LKG+L DGLY     K  P
Subjt:  VDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTP

Query:  VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELI
                                AFV   SV     K+ WH+RLGHP+ KVL+ VL+SC +    +++  FC+ACQYGK H LPF +S S A    EL+
Subjt:  VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELI

Query:  HTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR
        HTD+WGPAPI ++ GFKYYV F+DDFSRF WIYPLKQKS+T  AF  F  + +NQF   I+
Subjt:  HTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFGWTIR

A0A803PEH4 Uncharacterized protein3.0e-12841.51Show/hide
Query:  MASANSTGISALSAGTTGFNNPPLNQL--------LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASS
        +++A+S   S+++  ++  N    +QL        LNQ  S+KLDR N+ LWK +   I+R ++L G+LSG+  CPP+FV     +T+            
Subjt:  MASANSTGISALSAGTTGFNNPPLNQL--------LNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASS

Query:  SQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHA
                         NP+YE+W+  DQLL+GWLY+SM+  +AT+VMG  +A +L   ++ L+G  S+++ D  R + Q +RKGS  M++YLR  K+ +
Subjt:  SQTGASVASRSTPVAAVNPQYESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHA

Query:  DNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLE-LQN-TLKTSLTISHGVSVNMATSKDAGGQ-RGSQQNFPNGSR
        + L  AG P     L++ VL GLD E+  +V  I+ R   TW E+Q  LL F+ ++E LQN TL ++   S     NMA   +  G+ RG Q    + + 
Subjt:  DNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLE-LQN-TLKTSLTISHGVSVNMATSKDAGGQ-RGSQQNFPNGSR

Query:  QGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTD
         G F   N RG  NR RGR RG G  + ++PTCQV GK GHTA +CY RF++ Y G    N        P++Q    Q      + N + FVA+P+ +  
Subjt:  QGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTD

Query:  PNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLS-DGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKG
          W+ADSGASNH+T+D  N+    DY G E V VGNG+KL+I+ IGN  L+ +  N L L++ L VP IAKNLVS+SKLA DN++ +EF+ +FCLVK K 
Subjt:  PNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLS-DGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKG

Query:  TGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKESVNKHGVSGVGAFVISNSVNV---------VVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVN
        T KVLL GVL D LY  DS        Y+   +             + AF IS   NV         +   DV H+RLGHPS+KVL  VL+S N+    N
Subjt:  TGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKESVNKHGVSGVGAFVISNSVNV---------VVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVN

Query:  ESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF
             CDACQYGK+HALPF +S ++A +  +LIHTDLWGPAPI S     YY+ F+DD+SR+ W+YPLK KSD  AAF  F A+V+NQF
Subjt:  ESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.8e-1224.33Show/hide
Query:  NRGRGRSRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHV
        NR     + + G +  K  C  CG+ GH    C+          +I N  +  N     Q   S   AF+  +  N  V     + +  +  DSGAS+H+
Subjt:  NRGRGRSRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHV

Query:  TADYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGL
          D +   + V+      + V    +   +           + + LE+ L     A NL+S+ +L ++  + +EF  S   +   G   V   G+LN   
Subjt:  TADYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGL

Query:  YCFDSVKVTPVGAYKSERWSKKESVN-KHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPS-LKVLE----SVLKSCNLPTKVNESVKFCDACQYGKSHAL
            +V V    AY         S+N KH           N+        +WH+R GH S  K+LE    ++    +L   +  S + C+ C  GK   L
Subjt:  YCFDSVKVTPVGAYKSERWSKKESVN-KHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPS-LKVLE----SVLKSCNLPTKVNESVKFCDACQYGKSHAL

Query:  PFPNSLSQASTKFEL--IHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF
        PF     +   K  L  +H+D+ GP    +     Y+V+F+D F+ +   Y +K KSD  + F  F+A  +  F
Subjt:  PFPNSLSQASTKFEL--IHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.3e-2725.27Show/hide
Query:  ESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYL-RQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVL
        E W  +D+     +   +S +V   ++  + A+ +W  ++ L+  ++   + YL +Q++            +L V       L   G  +        +L
Subjt:  ESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYL-RQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVL

Query:  LGLDEEF-NPVVAMIQGRVGITWSEMQAELLVFEK-RLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSR
          L   + N    ++ G+  I   ++ + LL+ EK R + +N  +  +T   G S           QR S           N+GR   RG  ++ R +SR
Subjt:  LGLDEEF-NPVVAMIQGRVGITWSEMQAELLVFEK-RLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSR

Query:  GYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVA-SQNTNPFVASPD-----TVTDPNWYADSGASNHVTA
            YN N+P        GH        F ++   P+   +G G     + Q N    AA V  + N   F+   +     +  +  W  D+ AS+H T 
Subjt:  GYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVA-SQNTNPFVASPD-----TVTDPNWYADSGASNHVTA

Query:  DYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLSDGRN-SLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLY
          +     V  +    V +GN +  KI+ IG+ C+      +L L++   VP++  NL  IS +A D D Y  +  +      KG+  V+ KGV    LY
Subjt:  DYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLSDGRN-SLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLY

Query:  CFDSVKVTPVGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLS
                                N     G       N+    +S D+WHKR+GH S K L+ + K   +      +VK CD C +GK H + F  S  
Subjt:  CFDSVKVTPVGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLS

Query:  QASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFG
        +     +L+++D+ GP  I S  G KY+V FIDD SR +W+Y LK K      F  F A+V+ + G
Subjt:  QASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQFG

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein5.8e-1223.83Show/hide
Query:  SRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEY---SGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHV--T
        +R Y   N++KP       I  ++   + R N ++   S    Q   D        Q   S+P   + S +  P           +   DSGAS  +  +
Subjt:  SRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEY---SGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHV--T

Query:  ADYNNIANPVDYEGNECVTVGNGNK--LKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVK---AKGTGKVLLKGVL
        A Y + A P     N  + + +  K  + I+ IGN   +    +      L  PNIA +L+S+S+LA  N        + C  +    +  G VL   V 
Subjt:  ADYNNIANPVDYEGNECVTVGNGNK--LKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVK---AKGTGKVLLKGVL

Query:  NDGLYCFDSVKVTP--VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKV--------NESVKFCDAC
        +   Y      + P  +        +K +SVNK+    +                  H+ LGH + + ++  LK  N  T +        N S   C  C
Subjt:  NDGLYCFDSVKVTP--VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKV--------NESVKFCDAC

Query:  QYGKS----HALPFPNSLSQASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTA--AFSHFLAMVKNQF
          GKS    H         ++   F+ +HTD++GP          Y++ F D+ +RF W+YPL  + + +    F+  LA +KNQF
Subjt:  QYGKS----HALPFPNSLSQASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTA--AFSHFLAMVKNQF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-6329.18Show/hide
Query:  NNPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAV
        N   LN  ++ +T  KL   N+L+W      +   Y+L G L GS + PP  + + AA                               VNP Y  W   
Subjt:  NNPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVNPQYESWVAV

Query:  DQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEF
        D+L+   +  ++S  V   V     A  +W  +++++   S      LR   +Q  KG+  + DY++ + +  D L   G P+     + +VL  L EE+
Subjt:  DQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEF

Query:  NPVVAMIQGR-VGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNN-
         PV+  I  +    T +E+   LL  E ++         L +S    + +  +  +     +  N  NG+R   +   N   N    +  S  +   NN 
Subjt:  NPVVAMIQGR-VGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNN-

Query:  NKP---TCQVCGKIGHTALMC--YQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANP
        +KP    CQ+CG  GH+A  C   Q F    +  Q           P S   P QP A +A       + SP   +  NW  DSGA++H+T+D+NN++  
Subjt:  NKP---TCQVCGKIGHTALMC--YQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANP

Query:  VDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTP
          Y G + V V +G+ + IS  G+T LS     LNL N L VPNI KNL+S+ +L   N + VEF  +   VK   TG  LL+G   D LY +      P
Subjt:  VDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTP

Query:  VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKF--CDACQYGKSHALPFPNSLSQASTKFE
        V  + S   S K                        +   WH RLGHP+  +L SV+ + +L + +N S KF  C  C   KS+ +PF  S   ++   E
Subjt:  VGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKF--CDACQYGKSHALPFPNSLSQASTKFE

Query:  LIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF
         I++D+W  +PI S   ++YYV+F+D F+R+ W+YPLKQKS     F  F  +++N+F
Subjt:  LIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.7e-5328.62Show/hide
Query:  VAAVNPQYESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTR
        V  VNP Y  W   D+L+   +  ++S  V   V     A  +W  +++++   S      LR +                   +  D L   G P+   
Subjt:  VAAVNPQYESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTR

Query:  SLISQVLLGLDEEFNPVVAMIQGR-VGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNR
          + +VL  L +++ PV+  I  +    + +E+   L+  E +L   N+ +     ++ V+     +      RG  +N+ N + + N  + +  G+ + 
Subjt:  SLISQVLLGLDEEFNPVVAMIQGR-VGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNR

Query:  GRGRSRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTA
         R + + Y G       CQ+C   GH+A  C Q    + +  Q Q+          S   P QP A +A    +P+ A+       NW  DSGA++H+T+
Subjt:  GRGRSRGYGGYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTA

Query:  DYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYC
        D+NN++    Y G + V + +G+ + I+  G+  L     SL+L   L VPNI KNL+S+ +L   N + VEF  +   VK   TG  LL+G   D LY 
Subjt:  DYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLSDGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYC

Query:  FDSVKVTPVGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKF--CDACQYGKSHALPFPNSL
        +       V  + S   SK                         +   WH RLGHPSL +L SV+ + +LP  +N S K   C  C   KSH +PF NS 
Subjt:  FDSVKVTPVGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSKDVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKF--CDACQYGKSHALPFPNSL

Query:  SQASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF
          +S   E I++D+W  +PI S   ++YYV+F+D F+R+ W+YPLKQKS     F  F ++V+N+F
Subjt:  SQASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHFLAMVKNQF

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.2e-0526.96Show/hide
Query:  SWVAVDQLLLGWLYNSMSP-EVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLL
        +W   D ++   LY +++P +     +    ++D+W  ++  F     A    L    +    G +++ADY R MK  AD+L     PV+ R+L+  VL 
Subjt:  SWVAVDQLLLGWLYNSMSP-EVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLL

Query:  GLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLT-ISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGY
        GL+ +F+ ++ +I+ R     S   A  ++ E+   L+  +K + T + H  S  +    +A      Q++   G++ G  GRG  RGN N  RGR   +
Subjt:  GLDEEFNPVVAMIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLT-ISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGY

Query:  GGYN
          YN
Subjt:  GGYN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.9e-0624.19Show/hide
Query:  WVAVDQLLLGWLYNSMSPEVATQVMGYE-NAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLG
        W   D L+  W+Y +++  +   ++     A+DLW +++ LF     A         + +    L + +Y + +KS +D L    SP+S R L+  +L G
Subjt:  WVAVDQLLLGWLYNSMSPEVATQVMGYE-NAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLG

Query:  LDEEFNPVVAMIQGRVGI-TWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYG
        L E+++ ++ +I+ +    +++E ++ LL+ E R  L N  K+SL+ ++  S++         Q    Q + N +   N GRG  +   NRG G S G  
Subjt:  LDEEFNPVVAMIQGRVGI-TWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYG

Query:  GYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDP
          NNN        ++       Y      Y  P       G  F       P QP  +++  +  P   SP ++ +P
Subjt:  GYNNNKPTCQVCGKIGHTALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDP

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.4e-0835.82Show/hide
Query:  VWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWG
        +WH RL H S + +E ++K   L +    S+KFC+ C YGK+H + F           + +H+DLWG
Subjt:  VWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAGCGCCAATTCAACCGGAATTTCTGCCCTGTCTGCCGGAACAACTGGATTCAACAATCCACCGCTGAATCAGTTGTTAAATCAAATCACGTCTATAAAGTTGGA
TCGAGGAAACTTCTTATTGTGGAAGAATTTGGCACTACCGATTCTGAGGAGCTATAAGCTTGAAGGCCATTTATCAGGTAGTGAATCTTGTCCTCCTAAATTCGTCTCAT
CATCTGCTGCAGAGACAGAAGACGTATCTCTCCAAGCACAAGGAGGTGCTTCGAGCTCACAGACTGGTGCAAGTGTGGCTTCTAGGTCCACTCCCGTCGCCGCAGTGAAT
CCTCAATACGAGTCATGGGTTGCGGTAGACCAGCTTCTGTTAGGGTGGCTTTATAATTCCATGTCACCGGAAGTCGCCACCCAGGTAATGGGTTATGAGAATGCTCAAGA
TTTGTGGGCAGCTGTACAGGAGCTTTTTGGTGTTCAGTCAAGGGCCGAGGAGGATTATCTCCGCCAAGTGTTTCAACAGTCGAGGAAAGGAAGCTTGAAGATGGCAGACT
ACTTACGAGTAATGAAGAGCCATGCTGATAATCTTGGCCAAGCCGGCAGTCCGGTAAGTACACGGTCCCTAATCTCTCAAGTTCTGTTAGGGCTTGATGAGGAATTTAAT
CCTGTTGTAGCGATGATCCAAGGGAGAGTTGGGATTACATGGTCTGAGATGCAGGCCGAATTACTAGTATTCGAAAAAAGACTTGAGCTACAAAATACTTTGAAAACTTC
TCTAACCATTAGTCATGGAGTTTCTGTGAATATGGCTACTAGCAAAGATGCAGGAGGACAACGAGGAAGTCAACAAAATTTCCCAAATGGCAGTCGTCAAGGCAACTTTG
GTCGAGGAAATCAAAGAGGGAATGGAAATCGTGGTCGAGGAAGGTCCAGAGGTTATGGAGGCTACAATAACAACAAACCAACTTGTCAAGTGTGTGGCAAGATTGGGCAC
ACTGCACTGATGTGTTACCAACGATTCAACAAAGAGTATTCTGGTCCACAAATTCAAAACAGAGGAGATGGAAGCAACTTTCGTCCCAATAGTCAGATAAATCCTTCGCA
GCCAGCTGCATTTGTTGCCAGTCAAAATACCAATCCATTTGTAGCCTCCCCAGATACAGTGACAGACCCAAATTGGTATGCAGATAGTGGAGCGTCAAACCATGTCACTG
CAGACTACAACAATATAGCCAATCCAGTCGACTATGAAGGTAATGAATGTGTAACAGTTGGGAATGGTAATAAGCTGAAAATATCATGTATTGGCAATACTTGTTTATCT
GATGGAAGAAATAGTCTTAATCTGGAAAATACCCTGTGTGTACCCAATATTGCAAAGAACTTAGTGAGTATTTCTAAGTTGGCTAGAGACAACGATATTTATGTTGAATT
TCATGATAGTTTTTGTCTTGTTAAGGCCAAGGGTACGGGCAAGGTGCTGCTGAAAGGGGTGCTTAACGATGGACTATACTGTTTTGACAGTGTGAAGGTTACTCCAGTTG
GTGCTTATAAATCAGAAAGGTGGAGCAAGAAAGAATCAGTCAACAAACATGGAGTTTCTGGTGTTGGTGCTTTTGTTATTTCAAATAGTGTTAATGTTGTAGTCTCGAAA
GATGTATGGCATAAACGGTTAGGACATCCCTCTTTAAAAGTTCTTGAATCAGTATTAAAGAGTTGTAATCTGCCCACTAAAGTCAATGAATCAGTAAAATTTTGTGATGC
TTGTCAATATGGCAAATCCCATGCCCTACCCTTCCCAAACTCTTTGTCACAAGCCTCTACTAAGTTTGAACTCATACATACAGACCTTTGGGGACCTGCACCTATAACTT
CGACACAAGGGTTCAAATATTATGTCTTGTTTATAGATGACTTCAGTAGATTTGTATGGATCTATCCCTTAAAACAGAAAAGCGATACTACAGCAGCCTTTAGTCACTTC
CTAGCAATGGTGAAAAATCAGTTTGGTTGGACGATACGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAGCGCCAATTCAACCGGAATTTCTGCCCTGTCTGCCGGAACAACTGGATTCAACAATCCACCGCTGAATCAGTTGTTAAATCAAATCACGTCTATAAAGTTGGA
TCGAGGAAACTTCTTATTGTGGAAGAATTTGGCACTACCGATTCTGAGGAGCTATAAGCTTGAAGGCCATTTATCAGGTAGTGAATCTTGTCCTCCTAAATTCGTCTCAT
CATCTGCTGCAGAGACAGAAGACGTATCTCTCCAAGCACAAGGAGGTGCTTCGAGCTCACAGACTGGTGCAAGTGTGGCTTCTAGGTCCACTCCCGTCGCCGCAGTGAAT
CCTCAATACGAGTCATGGGTTGCGGTAGACCAGCTTCTGTTAGGGTGGCTTTATAATTCCATGTCACCGGAAGTCGCCACCCAGGTAATGGGTTATGAGAATGCTCAAGA
TTTGTGGGCAGCTGTACAGGAGCTTTTTGGTGTTCAGTCAAGGGCCGAGGAGGATTATCTCCGCCAAGTGTTTCAACAGTCGAGGAAAGGAAGCTTGAAGATGGCAGACT
ACTTACGAGTAATGAAGAGCCATGCTGATAATCTTGGCCAAGCCGGCAGTCCGGTAAGTACACGGTCCCTAATCTCTCAAGTTCTGTTAGGGCTTGATGAGGAATTTAAT
CCTGTTGTAGCGATGATCCAAGGGAGAGTTGGGATTACATGGTCTGAGATGCAGGCCGAATTACTAGTATTCGAAAAAAGACTTGAGCTACAAAATACTTTGAAAACTTC
TCTAACCATTAGTCATGGAGTTTCTGTGAATATGGCTACTAGCAAAGATGCAGGAGGACAACGAGGAAGTCAACAAAATTTCCCAAATGGCAGTCGTCAAGGCAACTTTG
GTCGAGGAAATCAAAGAGGGAATGGAAATCGTGGTCGAGGAAGGTCCAGAGGTTATGGAGGCTACAATAACAACAAACCAACTTGTCAAGTGTGTGGCAAGATTGGGCAC
ACTGCACTGATGTGTTACCAACGATTCAACAAAGAGTATTCTGGTCCACAAATTCAAAACAGAGGAGATGGAAGCAACTTTCGTCCCAATAGTCAGATAAATCCTTCGCA
GCCAGCTGCATTTGTTGCCAGTCAAAATACCAATCCATTTGTAGCCTCCCCAGATACAGTGACAGACCCAAATTGGTATGCAGATAGTGGAGCGTCAAACCATGTCACTG
CAGACTACAACAATATAGCCAATCCAGTCGACTATGAAGGTAATGAATGTGTAACAGTTGGGAATGGTAATAAGCTGAAAATATCATGTATTGGCAATACTTGTTTATCT
GATGGAAGAAATAGTCTTAATCTGGAAAATACCCTGTGTGTACCCAATATTGCAAAGAACTTAGTGAGTATTTCTAAGTTGGCTAGAGACAACGATATTTATGTTGAATT
TCATGATAGTTTTTGTCTTGTTAAGGCCAAGGGTACGGGCAAGGTGCTGCTGAAAGGGGTGCTTAACGATGGACTATACTGTTTTGACAGTGTGAAGGTTACTCCAGTTG
GTGCTTATAAATCAGAAAGGTGGAGCAAGAAAGAATCAGTCAACAAACATGGAGTTTCTGGTGTTGGTGCTTTTGTTATTTCAAATAGTGTTAATGTTGTAGTCTCGAAA
GATGTATGGCATAAACGGTTAGGACATCCCTCTTTAAAAGTTCTTGAATCAGTATTAAAGAGTTGTAATCTGCCCACTAAAGTCAATGAATCAGTAAAATTTTGTGATGC
TTGTCAATATGGCAAATCCCATGCCCTACCCTTCCCAAACTCTTTGTCACAAGCCTCTACTAAGTTTGAACTCATACATACAGACCTTTGGGGACCTGCACCTATAACTT
CGACACAAGGGTTCAAATATTATGTCTTGTTTATAGATGACTTCAGTAGATTTGTATGGATCTATCCCTTAAAACAGAAAAGCGATACTACAGCAGCCTTTAGTCACTTC
CTAGCAATGGTGAAAAATCAGTTTGGTTGGACGATACGCTGA
Protein sequenceShow/hide protein sequence
MASANSTGISALSAGTTGFNNPPLNQLLNQITSIKLDRGNFLLWKNLALPILRSYKLEGHLSGSESCPPKFVSSSAAETEDVSLQAQGGASSSQTGASVASRSTPVAAVN
PQYESWVAVDQLLLGWLYNSMSPEVATQVMGYENAQDLWAAVQELFGVQSRAEEDYLRQVFQQSRKGSLKMADYLRVMKSHADNLGQAGSPVSTRSLISQVLLGLDEEFN
PVVAMIQGRVGITWSEMQAELLVFEKRLELQNTLKTSLTISHGVSVNMATSKDAGGQRGSQQNFPNGSRQGNFGRGNQRGNGNRGRGRSRGYGGYNNNKPTCQVCGKIGH
TALMCYQRFNKEYSGPQIQNRGDGSNFRPNSQINPSQPAAFVASQNTNPFVASPDTVTDPNWYADSGASNHVTADYNNIANPVDYEGNECVTVGNGNKLKISCIGNTCLS
DGRNSLNLENTLCVPNIAKNLVSISKLARDNDIYVEFHDSFCLVKAKGTGKVLLKGVLNDGLYCFDSVKVTPVGAYKSERWSKKESVNKHGVSGVGAFVISNSVNVVVSK
DVWHKRLGHPSLKVLESVLKSCNLPTKVNESVKFCDACQYGKSHALPFPNSLSQASTKFELIHTDLWGPAPITSTQGFKYYVLFIDDFSRFVWIYPLKQKSDTTAAFSHF
LAMVKNQFGWTIR