; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005870 (gene) of Snake gourd v1 genome

Gene IDTan0005870
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationLG07:20102462..20110416
RNA-Seq ExpressionTan0005870
SyntenyTan0005870
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]3.8e-14242.94Show/hide
Query:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA
        ++KL+R N+ LWK+L LP++R  KL+ ++LGT+ CP  F++               SS+S+ + N  +  W   DQ L+ W+ NSM +E+ATQ++ C T+
Subjt:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA

Query:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE
        K LWD  Q L G  +R++  YL   F   RKG MKM +YL  MK   D L  A +PVST  LI Q L GLD EYNPVVV +  +  +SW D Q  L  FE
Subjt:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE

Query:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP
         R+E  N   N  + + NA+ N+A        R   +  S NN  R +N+   RG        GRGRG +  N   CQVCG   H A+ C++RF+K YS 
Subjt:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP

Query:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL
        S   N  +  ++QG                        N FLA+  ++ D  WY +SGAS+HVT       +  E+ G +S++VGNG +  I  TG+S L
Subjt:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL

Query:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH
         S    L L ++L  PN+ KNL+S+S+LA DN+I +EF +  C VKDK TGK +LKG LK+GLY L+ T                    K N  AF    
Subjt:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH

Query:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW
              K  WHR LGHP+ K+LD V+ SC + V  ++   FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++++SGF+YYV F+DD+SRF W
Subjt:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW

Query:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        +YPLKQKS+T  AF  F  + + QFN  I+  Q D G E+  V +L  + GI+ R SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Subjt:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]1.7e-14242.86Show/hide
Query:  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMI
        NS   N L + V ++KL+R N+ LW+++ LPI+R  +L+ ++LG K CP  F++ A               +S+   NP +E W   DQ L+ WL NSM 
Subjt:  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMI

Query:  SEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI
          +ATQ++ C T+  LWD  Q L G  +R++  YL   F  +RKG MKM +YL  MK  AD L  A +P+ST  LI Q L GLD EYNPVVV +  +  +
Subjt:  SEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI

Query:  SWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTA
        SW D Q  L  FE R+E    Q NS++   N ++N   N   +S  +  +  S NN+  SNNN   RG   N RG   GRG   + +  CQVCG   H A
Subjt:  SWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTA

Query:  LVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNG
        + C+ RF+K YS S   N  +  ++QG                        N FLA+  +I D  WY +SGAS+HVT       N  E+ G +S+IVGNG
Subjt:  LVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNG

Query:  SQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESK
         +  I  TG+S L S    L L ++L  P + KNL+S+S+LA DN+I +EF +  C VKDK TGK +L+G LK+GLY L+                    
Subjt:  SQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESK

Query:  MYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRY
          K++S   S+        K  WHR LGHP+ K+LD V++SCN+ +  +++  FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++S+SGF+Y
Subjt:  MYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRY

Query:  YVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQAS
        YV F+DD++RF W+YPLKQKSDT +AF  F  MV+ QF+  I++ Q D G E+  V +   + GI+ R SCPYTSQQNGRAERKHRH+ E  LTLLAQA 
Subjt:  YVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQAS

Query:  MPLVFWW
        MPL +WW
Subjt:  MPLVFWW

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]1.4e-13641.79Show/hide
Query:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA
        ++ L+R NF LWK+L LPI+R  +L+ ++LGTK CP  F++ A   G                INP +  W   DQ ++ WL N+M +  A+Q++ C T+
Subjt:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA

Query:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE
        K LW+  Q L    +R+   YL   F  +RKG  KM +YL  MK  AD L  A SP++   LI Q L GLD +YNP+VV +  +  +SW D Q  L  FE
Subjt:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE

Query:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP
         RL+  N+  N    + NA+ N+A             N +    N  N+  S RG +  +   GRG+G  SN+  ICQVC K GHTA+ C +R++K Y+ 
Subjt:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP

Query:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL
        S+  N   +  +                          N FLA+     D  WY +SGAS+HVT          E  G +S+IVGNG++  I  +G+S L
Subjt:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL

Query:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH
             NL L +VL  P + KNL+S+S+L  DN+I +EF +  C VKDK TGK LL+G LK+GLY L+N                 S+  K+  V  SV  
Subjt:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH

Query:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW
              K  WHR LGHPS  +LD V++ CN+    +++  FC +CQ GKSH LPF  S S A +  EL+++DVWGPAP+ S SGF+YYV F+DD SRF W
Subjt:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW

Query:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        +YPLKQKSDT +AF  F  MV+ QFN  I+  Q D G EF  V ++  + GIK R SCPYTSQQNGRAERKHRH+ E  LTLLAQA+M L +WW
Subjt:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]2.5e-13841.49Show/hide
Query:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM
        L    ++KL+R NF LWK+L LP++R  K + ++LGTK CP  F++               S ++T  INP Y+ W   DQ L+ WL NSM  ++ATQV+
Subjt:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM

Query:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND
         C T+K LWD  Q L G  +R+   YL   F  + K  MKM +YL  MK  AD L  A SP+S+  L+ Q L GLD EYNPVVV +  +  ISW D Q  
Subjt:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND

Query:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN
        L  FE RL+  N   N    + NAS N A    +   +     +      R +N+   RG      GRGR R  +   RPICQ+CGK GHTA  CY RF+
Subjt:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN

Query:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT
        K Y         ++ N    G  + S+                  F+A+P    D  WY +SGAS+HVT   G L +  E  G +S++VGNG +  I  +
Subjt:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT

Query:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA
        G++ L     ++ L+NVL  P + KNL+S+S+L  DN+  +EF + YC VKDK TGK LLKG LK+GLY L+              ++ E    K+    
Subjt:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA

Query:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY
         S+        K  WHR LGHP+ K+L+ V++  N+ +  +++  FC +CQ+GK H LPF  S S A +  +L+++DVWGPAP+LS S F+YYV FLDD+
Subjt:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY

Query:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        SRF W++PLKQKS+T +AF  F  +V+ QFN  I+  + D G E+  V +     GI+ + SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Subjt:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]1.4e-13641.77Show/hide
Query:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM
        L  + ++KL+R N+ LWK+L LP++R  K + ++LGTK CP  F++               S++ +  +NP ++ WM  DQ L+ WL NSM  ++ATQ++
Subjt:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM

Query:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND
         C T+K LWD  Q L G  +++   YL   F  +RKG MKM EYL  MK  +D L  + SP+S   L+ Q L GLD EYNPVVV +  +  +SW D Q  
Subjt:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND

Query:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN
        L  FE RL+  N   N    + NAS N A     +++ +  + +S  N+ RS N    RG        GRG+G  SN +  CQVC   GHTA+ C  RF+
Subjt:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN

Query:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT
        + Y   T +N  ++ ++QG           + + FVASP               D  WY +SGAS+HVT          E+ G +S++VGNG +  I  +
Subjt:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT

Query:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA
        G++ L +    L L +VL  P + KNL+S+S+L  DN+I++EF    C VKDK TG+ LLKG LK+GLY L+       D+S     SN     K+  V 
Subjt:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA

Query:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY
         SV        K  WHR LGHP+ K+L+ V++ CN+ +  +++  FC +CQ+GK H LPF  S S   +   L++SDVWGPAP+LS SGF+YYV F+DD+
Subjt:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY

Query:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        SRF W++PLKQKSDT +AF  F  + + QFN  I+  Q D G E+  V ++  + GI+ R SCPYTSQQNGRAERKHRH+VE  LTLLAQA MPL +WW
Subjt:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)8.2e-14342.86Show/hide
Query:  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMI
        NS   N L + V ++KL+R N+ LW+++ LPI+R  +L+ ++LG K CP  F++ A               +S+   NP +E W   DQ L+ WL NSM 
Subjt:  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMI

Query:  SEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI
          +ATQ++ C T+  LWD  Q L G  +R++  YL   F  +RKG MKM +YL  MK  AD L  A +P+ST  LI Q L GLD EYNPVVV +  +  +
Subjt:  SEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI

Query:  SWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTA
        SW D Q  L  FE R+E    Q NS++   N ++N   N   +S  +  +  S NN+  SNNN   RG   N RG   GRG   + +  CQVCG   H A
Subjt:  SWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTA

Query:  LVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNG
        + C+ RF+K YS S   N  +  ++QG                        N FLA+  +I D  WY +SGAS+HVT       N  E+ G +S+IVGNG
Subjt:  LVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNG

Query:  SQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESK
         +  I  TG+S L S    L L ++L  P + KNL+S+S+LA DN+I +EF +  C VKDK TGK +L+G LK+GLY L+                    
Subjt:  SQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESK

Query:  MYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRY
          K++S   S+        K  WHR LGHP+ K+LD V++SCN+ +  +++  FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++S+SGF+Y
Subjt:  MYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRY

Query:  YVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQAS
        YV F+DD++RF W+YPLKQKSDT +AF  F  MV+ QF+  I++ Q D G E+  V +   + GI+ R SCPYTSQQNGRAERKHRH+ E  LTLLAQA 
Subjt:  YVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQAS

Query:  MPLVFWW
        MPL +WW
Subjt:  MPLVFWW

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-946.7e-13741.79Show/hide
Query:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA
        ++ L+R NF LWK+L LPI+R  +L+ ++LGTK CP  F++ A   G                INP +  W   DQ ++ WL N+M +  A+Q++ C T+
Subjt:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA

Query:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE
        K LW+  Q L    +R+   YL   F  +RKG  KM +YL  MK  AD L  A SP++   LI Q L GLD +YNP+VV +  +  +SW D Q  L  FE
Subjt:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE

Query:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP
         RL+  N+  N    + NA+ N+A             N +    N  N+  S RG +  +   GRG+G  SN+  ICQVC K GHTA+ C +R++K Y+ 
Subjt:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP

Query:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL
        S+  N   +  +                          N FLA+     D  WY +SGAS+HVT          E  G +S+IVGNG++  I  +G+S L
Subjt:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL

Query:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH
             NL L +VL  P + KNL+S+S+L  DN+I +EF +  C VKDK TGK LL+G LK+GLY L+N                 S+  K+  V  SV  
Subjt:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH

Query:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW
              K  WHR LGHPS  +LD V++ CN+    +++  FC +CQ GKSH LPF  S S A +  EL+++DVWGPAP+ S SGF+YYV F+DD SRF W
Subjt:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW

Query:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        +YPLKQKSDT +AF  F  MV+ QFN  I+  Q D G EF  V ++  + GIK R SCPYTSQQNGRAERKHRH+ E  LTLLAQA+M L +WW
Subjt:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)1.2e-13841.49Show/hide
Query:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM
        L    ++KL+R NF LWK+L LP++R  K + ++LGTK CP  F++               S ++T  INP Y+ W   DQ L+ WL NSM  ++ATQV+
Subjt:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM

Query:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND
         C T+K LWD  Q L G  +R+   YL   F  + K  MKM +YL  MK  AD L  A SP+S+  L+ Q L GLD EYNPVVV +  +  ISW D Q  
Subjt:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND

Query:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN
        L  FE RL+  N   N    + NAS N A    +   +     +      R +N+   RG      GRGR R  +   RPICQ+CGK GHTA  CY RF+
Subjt:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN

Query:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT
        K Y         ++ N    G  + S+                  F+A+P    D  WY +SGAS+HVT   G L +  E  G +S++VGNG +  I  +
Subjt:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT

Query:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA
        G++ L     ++ L+NVL  P + KNL+S+S+L  DN+  +EF + YC VKDK TGK LLKG LK+GLY L+              ++ E    K+    
Subjt:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA

Query:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY
         S+        K  WHR LGHP+ K+L+ V++  N+ +  +++  FC +CQ+GK H LPF  S S A +  +L+++DVWGPAP+LS S F+YYV FLDD+
Subjt:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY

Query:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        SRF W++PLKQKS+T +AF  F  +V+ QFN  I+  + D G E+  V +     GI+ + SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Subjt:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

A0A2K3NEN7 Copia-like polyprotein (Fragment)6.7e-13741.77Show/hide
Query:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM
        L  + ++KL+R N+ LWK+L LP++R  K + ++LGTK CP  F++               S++ +  +NP ++ WM  DQ L+ WL NSM  ++ATQ++
Subjt:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM

Query:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND
         C T+K LWD  Q L G  +++   YL   F  +RKG MKM EYL  MK  +D L  + SP+S   L+ Q L GLD EYNPVVV +  +  +SW D Q  
Subjt:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND

Query:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN
        L  FE RL+  N   N    + NAS N A     +++ +  + +S  N+ RS N    RG        GRG+G  SN +  CQVC   GHTA+ C  RF+
Subjt:  LFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN

Query:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT
        + Y   T +N  ++ ++QG           + + FVASP               D  WY +SGAS+HVT          E+ G +S++VGNG +  I  +
Subjt:  KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFT

Query:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA
        G++ L +    L L +VL  P + KNL+S+S+L  DN+I++EF    C VKDK TG+ LLKG LK+GLY L+       D+S     SN     K+  V 
Subjt:  GNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVA

Query:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY
         SV        K  WHR LGHP+ K+L+ V++ CN+ +  +++  FC +CQ+GK H LPF  S S   +   L++SDVWGPAP+LS SGF+YYV F+DD+
Subjt:  FSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY

Query:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        SRF W++PLKQKSDT +AF  F  + + QFN  I+  Q D G E+  V ++  + GI+ R SCPYTSQQNGRAERKHRH+VE  LTLLAQA MPL +WW
Subjt:  SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.8e-14242.94Show/hide
Query:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA
        ++KL+R N+ LWK+L LP++R  KL+ ++LGT+ CP  F++               SS+S+ + N  +  W   DQ L+ W+ NSM +E+ATQ++ C T+
Subjt:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTA

Query:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE
        K LWD  Q L G  +R++  YL   F   RKG MKM +YL  MK   D L  A +PVST  LI Q L GLD EYNPVVV +  +  +SW D Q  L  FE
Subjt:  KDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFE

Query:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP
         R+E  N   N  + + NA+ N+A        R   +  S NN  R +N+   RG        GRGRG +  N   CQVCG   H A+ C++RF+K YS 
Subjt:  KRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP

Query:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL
        S   N  +  ++QG                        N FLA+  ++ D  WY +SGAS+HVT       +  E+ G +S++VGNG +  I  TG+S L
Subjt:  STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL

Query:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH
         S    L L ++L  PN+ KNL+S+S+LA DN+I +EF +  C VKDK TGK +LKG LK+GLY L+ T                    K N  AF    
Subjt:  TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH

Query:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW
              K  WHR LGHP+ K+LD V+ SC + V  ++   FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++++SGF+YYV F+DD+SRF W
Subjt:  KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW

Query:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW
        +YPLKQKS+T  AF  F  + + QFN  I+  Q D G E+  V +L  + GI+ R SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Subjt:  VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-3327.89Show/hide
Query:  NYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFL
        +Y RS+NN  + G    S+ R + R  N      C  C + GH    C N    +   S  +N            NT++      NV +          L
Subjt:  NYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFL

Query:  ATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL-TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDT
        + PE+     W  ++ ASHH  T   +L      G   ++ +GN S S I   G+ C+ T+    L L++V   P++  NL  IS +A D D Y  +   
Subjt:  ATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL-TSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDT

Query:  YCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH-----KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVN
                      K  L +G   +A  + +               +Y+ N+    +C        ++++   WH+ +GH S K L  + +   +     
Subjt:  YCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCH-----KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVN

Query:  EEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDN
             C+ C +GK H + F  S  R     +LVYSDV GP  + S  G +Y+V F+DD SR +WVY LK K      FQ F A+V+ +    ++  +SDN
Subjt:  EEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDN

Query:  GTEFL--RVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFW
        G E+      + CS  GI+   + P T Q NG AER +R +VE   ++L  A +P  FW
Subjt:  GTEFL--RVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFW

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein1.8e-2226.97Show/hide
Query:  SIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEG-LYCLANTLVKPVDISQP
        +I+       PI   GN               L  PN+A +L+S+S LA  N       +T  + +  GT   +L   +K G  Y L+   + P  IS+ 
Subjt:  SIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEG-LYCLANTLVKPVDISQP

Query:  ILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVL-------VNEEHYFCNSCQYGKS----HALPFSISESRASKKFEL
         +++    + K+ SV        NK      HR LGH + + +   ++   +  L        N   Y C  C  GKS    H     +    + + F+ 
Subjt:  ILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVL-------VNEEHYFCNSCQYGKS----HALPFSISESRASKKFEL

Query:  VYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTG--NAFQHFLAMVQTQFNGNIQSFQSDNGTEFLR--VHQLCSQLGIKSRYSCPYTSQQ
        +++D++GP   L  S   Y++ F D+ +RF WVYPL  + +    N F   LA ++ QFN  +   Q D G+E+    +H+  +  GI + Y+    S+ 
Subjt:  VYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTG--NAFQHFLAMVQTQFNGNIQSFQSDNGTEFLR--VHQLCSQLGIKSRYSCPYTSQQ

Query:  NGRAERKHRHLVETCLTLLAQASMPLVFWW
        +G AER +R L+  C TLL  + +P   W+
Subjt:  NGRAERKHRHLVETCLTLLAQASMPLVFWW

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein1.8e-2226.97Show/hide
Query:  SIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEG-LYCLANTLVKPVDISQP
        +I+       PI   GN               L  PN+A +L+S+S LA  N       +T  + +  GT   +L   +K G  Y L+   + P  IS+ 
Subjt:  SIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEG-LYCLANTLVKPVDISQP

Query:  ILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVL-------VNEEHYFCNSCQYGKS----HALPFSISESRASKKFEL
         +++    + K+ SV        NK      HR LGH + + +   ++   +  L        N   Y C  C  GKS    H     +    + + F+ 
Subjt:  ILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVL-------VNEEHYFCNSCQYGKS----HALPFSISESRASKKFEL

Query:  VYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTG--NAFQHFLAMVQTQFNGNIQSFQSDNGTEFLR--VHQLCSQLGIKSRYSCPYTSQQ
        +++D++GP   L  S   Y++ F D+ +RF WVYPL  + +    N F   LA ++ QFN  +   Q D G+E+    +H+  +  GI + Y+    S+ 
Subjt:  VYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTG--NAFQHFLAMVQTQFNGNIQSFQSDNGTEFLR--VHQLCSQLGIKSRYSCPYTSQQ

Query:  NGRAERKHRHLVETCLTLLAQASMPLVFWW
        +G AER +R L+  C TLL  + +P   W+
Subjt:  NGRAERKHRHLVETCLTLLAQASMPLVFWW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.4e-8532.54Show/hide
Query:  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMI
        N+  LN  ++ VT  KL   N+L+W      +   Y+L   L G+   PP                A+  +++   +NP Y  W   D+L+ S +  ++ 
Subjt:  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMI

Query:  SEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI
          V   V    TA  +W+ ++ ++   S      L    +Q  KG   + +Y++ +    D L     P+     + +VL  L EEY PV+  I  K   
Subjt:  SEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI

Query:  SWADAQNDLFMFEKRLEFQNTQRNSVSFSH--NASVNMAINRGTQSQRQQQQNYSPNNY-NRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPI---CQVCG
           D    L    +RL    ++  +VS +     + N   +R T +          N Y NR+NNNNS+    P  +        N+ ++P    CQ+CG
Subjt:  SWADAQNDLFMFEKRLEFQNTQRNSVSFSH--NASVNMAINRGTQSQRQQQQNYSPNNY-NRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPI---CQVCG

Query:  KIGHTALVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNP--FLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGM
          GH+A  C          S  Q+  S  N Q           Q P     SP TP  P   LA        +W  +SGA+HH+T+DF NL+    Y G 
Subjt:  KIGHTALVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNP--FLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGM

Query:  DSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQP
        D ++V +GS  PI+ TG++ L++    L L N+L  PN+ KNLIS+ RL   N + +EF      VKD  TG  LL+G  K+ LY              P
Subjt:  DSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQP

Query:  ILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYF--CNSCQYGKSHALPFSISESRASKKFELVYSDVWGPA
        I SS    ++ + S         +K T + WH  LGHP+  IL+SVI + +L VL N  H F  C+ C   KS+ +PFS S   +++  E +YSDVW  +
Subjt:  ILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYF--CNSCQYGKSHALPFSISESRASKKFELVYSDVWGPA

Query:  PVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVE
        P+LS   +RYYV+F+D ++R+ W+YPLKQKS     F  F  +++ +F   I +F SDNG EF+ + +  SQ GI    S P+T + NG +ERKHRH+VE
Subjt:  PVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVE

Query:  TCLTLLAQASMPLVFW
        T LTLL+ AS+P  +W
Subjt:  TCLTLLAQASMPLVFW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-8332.2Show/hide
Query:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM
        +N     KL   N+L+W      +   Y+L   L G+ P PP                A+  +++   +NP Y  W   D+L+ S +  ++   V   V 
Subjt:  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVM

Query:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND
           TA  +W+ ++ ++     A   Y          G++    ++       D L     P+     + +VL  L ++Y PV+  I  K      D    
Subjt:  GCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQND

Query:  LFMFEKRLEFQNTQRNSVSFSHNASV--NMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPI---CQVCGKIGHTALVC
        L    +RL  + ++  +++ +    +  N+  +R T + R Q       NYN  NNNN      P+S G    R  N   +P    CQ+C   GH+A  C
Subjt:  LFMFEKRLEFQNTQRNSVSFSHNASV--NMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPI---CQVCGKIGHTALVC

Query:  YNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQS
                 P  +Q + +   QQ   P T        N+ V SP   +N             W  +SGA+HH+T+DF NL+    Y G D +++ +GS  
Subjt:  YNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQS

Query:  PITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYK
        PIT TG++ L +   +L L  VL  PN+ KNLIS+ RL   N + +EF      VKD  TG  LL+G  K+ LY              PI SS    M+ 
Subjt:  PITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYK

Query:  NNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYF--CNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYY
              S C   +K T + WH  LGHPS  IL+SVI + +LPVL N  H    C+ C   KSH +PFS S   +SK  E +YSDVW  +P+LS   +RYY
Subjt:  NNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYF--CNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYY

Query:  VLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASM
        V+F+D ++R+ W+YPLKQKS   + F  F ++V+ +F   I +  SDNG EF+ +    SQ GI    S P+T + NG +ERKHRH+VE  LTLL+ AS+
Subjt:  VLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASM

Query:  PLVFW
        P  +W
Subjt:  PLVFW

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.4e-0925.9Show/hide
Query:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQV--MGCN
        T+ L + N+ +W+ L   +  S+ +  H+ G+    PM      TE                      + W   D L+  W+Y ++   +   +  +GC 
Subjt:  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQV--MGCN

Query:  TAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI-SWADAQNDLF
        TA+DLW  ++ LF     A         + +   ++ + EY + +K  +D L    SP+S R L+  +L GL E+Y+ ++  I+ K    S+ +A++ L 
Subjt:  TAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEI-SWADAQNDLF

Query:  MFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYN--RSNNNNSQRGGTPNSRGRGRGRGYNSNN
        M E RL   N  ++S+S +++ S++  +    + Q +  Q Y  NN N  R  +    RGG     G   GR YN+NN
Subjt:  MFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYN--RSNNNNSQRGGTPNSRGRGRGRGYNSNN

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.6e-0530Show/hide
Query:  WHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPV
        WH  L H S + ++ +++   L         FC  C YGK+H + FS  +       + V+SD+WG   V
Subjt:  WHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGCCTTATTGAATGAATCCTCGTCGTTCTCTACTGGTGCACCCCATTTTAACAGTCCACCACTCAACCAACTCTTAAATCAGGTAACTACTATCAAATTGGA
AAGAGGGAATTTCCTTCTATGGAAAAATCTAGCATTACCAATCCTTCGTAGCTACAAACTCGAGAGTCATCTCCTGGGAACCAAGCCTTGCCCTCCTATGTTTCTATCTC
AAGCTAGAACCGAAGGGAATGTAACAGTCGAAGGCGCCTCCTTTAGCTCAGAATCAACTACATCAATCAATCCTCTCTATGAAGCATGGATGACAGTTGATCAGCTACTG
ATCAGTTGGCTTTATAATTCAATGATTTCAGAAGTCGCAACACAGGTTATGGGGTGCAACACGGCCAAAGACCTGTGGGATGTTATTCAGCTCTTATTTGGAGTTCAATC
ACGAGCAGAGGAAGATTACCTGTGTCAAACATTTCAACAATCACGCAAAGGTAATATGAAAATGTCTGAATATTTAAGGATTATGAAGTGTCATGCTGACAATCTAGGGC
AAGCTAGAAGCCCTGTTTCCACTCGATCTTTGATATCACAGGTTTTACTTGGACTTGACGAAGAATACAACCCAGTTGTGGTTGGCATTCAAGGTAAATATGAGATATCA
TGGGCAGATGCTCAAAACGATCTTTTTATGTTTGAAAAACGACTGGAGTTTCAGAATACTCAACGAAACAGTGTGTCCTTCAGTCACAATGCCTCAGTCAATATGGCAAT
AAATAGAGGAACACAATCGCAGAGGCAACAACAACAAAATTATTCTCCAAACAATTACAATCGGTCGAACAACAACAACAGTCAACGAGGAGGAACCCCAAATTCCAGAG
GACGTGGTCGAGGTAGAGGATATAACTCTAACAATCGACCAATATGTCAAGTTTGTGGGAAAATAGGACATACTGCACTCGTTTGTTACAATCGTTTCAATAAGGAATAT
TCTCCAAGCACAAATCAGAACAGACAAAGTCAACCTAATCAACAAGGTTTTGGACCGAATACCTCCTCCTCATCATTTCAGGCTCCAAATGTTTTTGTGGCCAGTCCTAA
CACTCCGAGTAACCCCTTCTTGGCCACTCCAGAAACTATTGGTGACCCCTCTTGGTATGCTAATAGTGGGGCTTCACATCATGTGACAACTGACTTTGGAAACCTAGCCA
ATCCAATTGAATATGGAGGTATGGACTCAATCATTGTAGGTAATGGTTCACAGTCTCCTATTACCTTCACTGGCAATTCATGTTTAACTTCTGGTAAATACAATCTGCGT
TTGCAAAATGTGTTATGTGCACCTAATATGGCTAAGAATTTGATTAGCATTTCGAGACTTGCTCAAGATAATGATATTTATATTGAGTTTCATGACACCTATTGTGTTGT
TAAGGACAAGGGCACGGGCAAACAACTTTTGAAAGGGGATCTTAAAGAAGGTCTATACTGCCTTGCGAATACCTTAGTCAAACCAGTGGACATCTCTCAGCCAATATTGA
GTAGTAATGAGTCCAAAATGTACAAAAATAATAGTGTTGCTTTTTCTGTTTGTCACAAACCCAACAAGGTGACCAAGACTTTTTGGCACAGACATCTTGGTCATCCTTCA
ACTAAAATTTTAGACTCTGTCATTCGTTCTTGTAATCTTCCTGTTTTGGTTAATGAAGAACACTATTTTTGTAACTCTTGTCAGTATGGTAAATCACATGCTCTACCGTT
TTCGATATCAGAGTCTCGCGCATCTAAGAAATTTGAATTGGTTTATTCTGATGTATGGGGACCTGCACCTGTTTTATCTACCTCTGGTTTCCGATATTATGTGCTATTCC
TTGATGATTACAGTAGATTTGTGTGGGTTTATCCCTTAAAACAAAAATCAGACACTGGTAATGCGTTTCAACACTTCTTGGCTATGGTCCAGACTCAATTCAATGGTAAC
ATTCAGTCATTTCAGTCTGATAATGGCACAGAATTCTTGAGAGTTCATCAACTCTGTAGTCAGCTGGGAATTAAGTCACGGTATTCGTGTCCTTACACTTCTCAACAGAA
TGGCAGGGCTGAACGGAAGCATAGACACTTGGTTGAGACTTGCTTAACTCTGTTAGCTCAGGCTTCGATGCCTCTTGTATTTTGGTGGTGGGAGCTTCTTGGTCGCGAAT
CGATCGATCAATGGCCTCCCCACACCTACATTGCAAGGTCAGTCACCACGTTTTCTTCTCACAGGTAA
mRNA sequenceShow/hide mRNA sequence
AAAAGATTTTGGTGGCATTAGGAACGGAAGACTCGAGCGATGAGATGGTAATATGAAAGAGGGAAAAAACGAGAGCCCCCAAATCAAATCCCTAGCCTTTTGCAATTCGC
TAGCCCTAGGCCCCAAATCGCTATGCCGTCATTGGTAATATGTATTTTTCAGGTCTTACAAGCCCTAGCTGCAGCTTCCGCCATCTCTTCCTCTTCTTCCACCAGCAACT
CCAATCTTCCTCCACCTCCTAAAGATGCATGAAGTCTTGTGTACCAACGACTACTTCCTCGATGGAAATCTCTCTCTCACTCCCACCTGTCTCCAATTCGAATTTCGATA
TCAAAAGTTGATCAAGTGGATGCAACGCGGTTAGATATTGAAATGTCGGCCATGCTGAAAGAACAACTGGTTAAGGTTTTTGCTTTGATGAAGATAGGCAAGCCCACACC
GAGAATTTCTCTCATTAATTTGCAGTATAGAGATGAGCGTGCAATAAAAATTCCACGAAAAGATGCAAGTTTCTTTGGAAATACTTTCAAGAGACGGAAAGACTAAATAT
GTTCAGAGTAGTAGAGTTCAAACAATCTTGCACCATGTTTTGGTACACTGATGAAACCACGATAGACAATACTAATTTCATACAATCCTAGGAGTCGAGCAAGAGCATAC
AGTTAGAAGCATTTCATGTTTTTAAGACAAAGCCCACAAAGGATTGATCCACTCTCCAAGAGCCCTGTTATTGGTGAAAGGAGCTGGCCCAAAGGAATACATCTAGAATG
TTCACACCAACAAATTTGTTAGGAGGATTCCTCTTTCTCCTCTAAGTTTTTGCCATCTGTCCCTTGTTTGTTACTCTCAGAATTATTCTTTTATTATTTTATTCTTTGGC
TCTTCTGCCCTTATATAACCGATGTATTATTATCGAGAAATGGTGAATAGAAATTGAGGCTTCCTCACAATTTTCTCTGTGTTAATTCTCAAACTTGGTATCAAAGCTTC
AATGGCCAACGCCTTATTGAATGAATCCTCGTCGTTCTCTACTGGTGCACCCCATTTTAACAGTCCACCACTCAACCAACTCTTAAATCAGGTAACTACTATCAAATTGG
AAAGAGGGAATTTCCTTCTATGGAAAAATCTAGCATTACCAATCCTTCGTAGCTACAAACTCGAGAGTCATCTCCTGGGAACCAAGCCTTGCCCTCCTATGTTTCTATCT
CAAGCTAGAACCGAAGGGAATGTAACAGTCGAAGGCGCCTCCTTTAGCTCAGAATCAACTACATCAATCAATCCTCTCTATGAAGCATGGATGACAGTTGATCAGCTACT
GATCAGTTGGCTTTATAATTCAATGATTTCAGAAGTCGCAACACAGGTTATGGGGTGCAACACGGCCAAAGACCTGTGGGATGTTATTCAGCTCTTATTTGGAGTTCAAT
CACGAGCAGAGGAAGATTACCTGTGTCAAACATTTCAACAATCACGCAAAGGTAATATGAAAATGTCTGAATATTTAAGGATTATGAAGTGTCATGCTGACAATCTAGGG
CAAGCTAGAAGCCCTGTTTCCACTCGATCTTTGATATCACAGGTTTTACTTGGACTTGACGAAGAATACAACCCAGTTGTGGTTGGCATTCAAGGTAAATATGAGATATC
ATGGGCAGATGCTCAAAACGATCTTTTTATGTTTGAAAAACGACTGGAGTTTCAGAATACTCAACGAAACAGTGTGTCCTTCAGTCACAATGCCTCAGTCAATATGGCAA
TAAATAGAGGAACACAATCGCAGAGGCAACAACAACAAAATTATTCTCCAAACAATTACAATCGGTCGAACAACAACAACAGTCAACGAGGAGGAACCCCAAATTCCAGA
GGACGTGGTCGAGGTAGAGGATATAACTCTAACAATCGACCAATATGTCAAGTTTGTGGGAAAATAGGACATACTGCACTCGTTTGTTACAATCGTTTCAATAAGGAATA
TTCTCCAAGCACAAATCAGAACAGACAAAGTCAACCTAATCAACAAGGTTTTGGACCGAATACCTCCTCCTCATCATTTCAGGCTCCAAATGTTTTTGTGGCCAGTCCTA
ACACTCCGAGTAACCCCTTCTTGGCCACTCCAGAAACTATTGGTGACCCCTCTTGGTATGCTAATAGTGGGGCTTCACATCATGTGACAACTGACTTTGGAAACCTAGCC
AATCCAATTGAATATGGAGGTATGGACTCAATCATTGTAGGTAATGGTTCACAGTCTCCTATTACCTTCACTGGCAATTCATGTTTAACTTCTGGTAAATACAATCTGCG
TTTGCAAAATGTGTTATGTGCACCTAATATGGCTAAGAATTTGATTAGCATTTCGAGACTTGCTCAAGATAATGATATTTATATTGAGTTTCATGACACCTATTGTGTTG
TTAAGGACAAGGGCACGGGCAAACAACTTTTGAAAGGGGATCTTAAAGAAGGTCTATACTGCCTTGCGAATACCTTAGTCAAACCAGTGGACATCTCTCAGCCAATATTG
AGTAGTAATGAGTCCAAAATGTACAAAAATAATAGTGTTGCTTTTTCTGTTTGTCACAAACCCAACAAGGTGACCAAGACTTTTTGGCACAGACATCTTGGTCATCCTTC
AACTAAAATTTTAGACTCTGTCATTCGTTCTTGTAATCTTCCTGTTTTGGTTAATGAAGAACACTATTTTTGTAACTCTTGTCAGTATGGTAAATCACATGCTCTACCGT
TTTCGATATCAGAGTCTCGCGCATCTAAGAAATTTGAATTGGTTTATTCTGATGTATGGGGACCTGCACCTGTTTTATCTACCTCTGGTTTCCGATATTATGTGCTATTC
CTTGATGATTACAGTAGATTTGTGTGGGTTTATCCCTTAAAACAAAAATCAGACACTGGTAATGCGTTTCAACACTTCTTGGCTATGGTCCAGACTCAATTCAATGGTAA
CATTCAGTCATTTCAGTCTGATAATGGCACAGAATTCTTGAGAGTTCATCAACTCTGTAGTCAGCTGGGAATTAAGTCACGGTATTCGTGTCCTTACACTTCTCAACAGA
ATGGCAGGGCTGAACGGAAGCATAGACACTTGGTTGAGACTTGCTTAACTCTGTTAGCTCAGGCTTCGATGCCTCTTGTATTTTGGTGGTGGGAGCTTCTTGGTCGCGAA
TCGATCGATCAATGGCCTCCCCACACCTACATTGCAAGGTCAGTCACCACGTTTTCTTCTCACAGGTAAACACTTAGACTTTGCTAATCTAAGGGTGTTTGGGTGTGCTT
GCTTTCCAAATTTACGACCCTATCAGAGGCACAAGTTTGATTTTCACTCTCAGCGGTGTGTTTATCTTGGTCCCAATCCAACTCATAAAGGTTTCAGTGCAAGAACTCTG
CTGGCAGGATTTTTGTTACTCACCATGTGATATTTAATGAGGTTGATTTCCCTTTTACAGATGCTACTTGGGCCACACCTTCAGCTCCTCTCACGAGTACCAGTCCTCCC
TTGTCTCCTTCCCCTGCTACCTGGTTTCCCTCCTACCCTGTGCCTCTCCCCAATACCTCCTCTCAGCATACCGTCAGTATACCTGCCTCATCGACCTCACAACCCTCTCC
CAACCTTCCGCATTCCTCCCCGCTACTTGCCTCTCGGTCCTCCAGCAGTCCTCTGACACCTTGCTCTGCTCCCACCTGCCCCTCTCTATCTCCTAGTCCTCATCCCTCCT
TGTCTCCAAATCATCCTATCCTCTCTGTTGATTCTTCCTTTGACAGTTCTGCTCCCCCTTCATCCTCCTTCTATCCTGACATTCCTTCTTCTCCTTTACCCTTGTCAGCT
CCGTCTACTATGCCCCAGCCTTCACATCCTATGGTTACTCGTGGGAAAGCTGGCATTTTTTAAGCCTAAAGCTTGGTTATCCACAAAGCCTCAGGTTGACTGGTCCCTTA
CCGAACCCACGCGTGTTCAGGTTGACTGGTCCCCAATGGAAGGCTGCTATGGACCAGGAATACACTGCTCTAATGCAAAATCATACCTGGGAGTTGGTCCCACCTGATCC
TATATATAATATCATTGGAAACAAGTGGATCTTTCGGATCAAACGGAATGCTGATGGATCCATTCAAAGGTACAAAGCCCGCCTTGTAGCCAAGGGCTTTCATCAGAATC
CTGGGGTTGACTTCTTTGAGACGTTCAGTCCGGTTGTCAAATTCTCCACCATTCGAGTTGTTCTCAACTTGGCTGTCACAAATAATTGGAGGTTGCGGCAACTCGACTTC
AACAATACATTTCTTAATGGTTCTCTCACTGAGGATGTCTATATGCAACAACCACCTGGCTATGTGGATCCCGTTCATCCCTCACATATTTGCAAGCTGACCAAGGCCAT
TTATGGTTTGAAGCAAGCTCCGAGAGCTTGGAATTCTACCTTGAAATCTGTCTTACTTGACTGGGGATTCGTGAATTCCAAGGCTGATACATCCCTCTTCATATATAGTT
CTGGTCAGTCCATTCTACTTCTTTTAGTGTATGTTGATGATGTGGTTCTCACGGGCAATGATGCTGTTTTGATGGATAATCTGGTCACCTCTCTGGATCAACGTTTTGCT
CTTAAAGATTTGGGTCCGTTGAGCTATTTCTTGGGCATTCAGGTACAATATGTTGAATCTGGCATTATTTTAACTCAATCTCAGTATGCCACTGATCTTCTCCTTCGCCT
TGACTGCCCTGCATTGAAACCGGCGCCCTCCCCAAGTGTTGTGGGCAAATTTCTATCTGTCAATAGTGGAACTCCTGCCTAATCCGTTCATCTATCGGAGCACTATTGGT
GCGCTTGATGCCTCACGAACACACGCCGACGATCATCCTACATTGTCAACTATCTCGAGCGGTTTCTTGGTCTCCAACCGATGAACACTGACAGGCTGTGAAGCGTGTCT
TGCGCTACATCTCAGGCACAAAACATTATGGTCTACTGATTCAACCCAGCTCTGATCAATCCATCCATGCCTTTTCCGATGCCGACTAGGCCTCTAATCCTGACGACAGA
CGTTCAGTGGTTGCTTACTGTGTTTTTATTGGTAATAGTCTCATCTCTTGGTCCTCGAAGAAGCAATCCGTCGTGGCGAGGTCAAGCACGGAGTCAGAATATCGTGCCTT
GGCTCATGCTTCCACTGAGATTATATGGCTTCAACAACTTCTTGGTGAATTAGGTGTTCAATCCTCAGCTCCACCAATTATATGGTGTGACAACTTAAGTGCCAGTGCCT
TGGCTGCCAATCCTGTTTTTCACGCCAGGACCAAACATATCGAAATCGACGTACACTTTGTTCGTGATCAAGTACTCCGTGGTGCACTTGAGATACGTTACGTCCCTACT
TCTGATCAAGTGGCCAACTGCTTGACCAAGCCATTGTCACACTCTCAGTTCTCTATGTTTCGATCCAAACTTGGGGTCACTTCGTTACCCTCTCGTTTCCGGGGGGTATT
GAGGTTAAAAGCCCACCTTGTGAGACCAGTGACTCAACATGCAGACAAAGCCAACAAAGGATTGATCCACTCTCCAAGAGCCCTGTTATTGGTGAAAGGAGCTGGCCCAA
AGGAATACATCTAGAATGTTCACACCAACAAATTTGTTACGAGGATTCCTCTCTCCTCTAAGTTTTTGCCATCTGTCCCTTGTTTGTTACTCTCAGAATTATTCTTTTAT
TCTTTTATTCTTTGGCTCTTCTGCCCTTATATAACCAGTGTATTATCATCGAGAAATGGTGAATAGAAATTGAGGCTTCCTCAAAATTTTCTGTGTTAATTCTCAAACTA
TCCTCGTCACAAATCCAAGCAAGCTTCTAAGGCTGTTTACTGATTTCAAAACTGATAAAGAGGATGAACAGTTTGAGGCTGACAAAGCTCACGTTGTTAGAGAAATTGCT
GCGCTTGAATCGAAAGGTCCATGAGAAAGAGGGAGATTCTTATATGAATATCTTGTATTTTTTTTCTTCCCTTCTCTATCCAATTTGGATGTCAATTATAATAGCTAAAT
CAGCTCTCCAAATGATGACTTTGATCAAATGTTTTACAATATTAATAATGGGCTTTATATATATTTATATATTCGCCTTTTTCTCTGTATTCTAACGGTTTCCACGTGTG
TGTATATTATTATAAGGACACTGATTGTAC
Protein sequenceShow/hide protein sequence
MANALLNESSSFSTGAPHFNSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLL
ISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEIS
WADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEY
SPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLR
LQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPS
TKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGN
IQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWWWELLGRESIDQWPPHTYIARSVTTFSSHR