; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008107 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008107
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:12344293..12349995
RNA-Seq ExpressionLag0008107
SyntenyLag0008107
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]2.0e-17438.45Show/hide
Query:  TIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLYNSMTS
        ++KLDR N+ LWK+L +P++R  KL+G++LGT+ CP  FI                           ++ D     N  +  W A DQ LLGW+ NSMT+
Subjt:  TIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLYNSMTS

Query:  EVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT
        E+AT               Q L G  +R++  +L+  F   RKG MKM DYL  MK   D L LAG+PVS  +L+ Q L GLD EYN +V  +  + +++
Subjt:  EVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT

Query:  WAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCG--------
        W +LQA+LL FE R+E  N + N    + NA+AN+A     N      K+SNN   +W   N R  G+RG GRGRG      + +  CQVCG        
Subjt:  WAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCG--------

Query:  -------------------KNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL
                           K    N F+ +Q      ++ D +WY DSGASNHVT   +   + TE+ GK  ++VGNGEKL I   GS  L      L+L
Subjt:  -------------------KNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL

Query:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN
          +L VP I KNL+S+SKLA DNN+ +EF EN C VKDK TG+V+LKG LKDGLYQL G K +NP +                              +V+
Subjt:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN

Query:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY
        V   K+ WHRRLGHP  KVLD +++ C ++V  +++  FCE+CQ+GK H++ F   +  A    +LVH D+WGPAP+++  G++YY+ F+DD+SR+ W+Y
Subjt:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY

Query:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR
        PLKQKS+ +                                  L  E GI  R SCPYTS QN RAERKHRH+ E GLTLLA A MPLHYWW+AF  A  
Subjt:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR

Query:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLS-----ILQWFPQVEL
        L+N LP+ V   +SP  L+L K+ D+  LK FGC+CYPCL+PY QHK  +HT +CV+LG S+SHKGY+C+N  G     RH+  +         F     
Subjt:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLS-----ILQWFPQVEL

Query:  YQPNSSNPSSQQQPATPCLQNQPEYHSPI--SANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGI
            + N  S   P         +   PI  + N    +T  S   ++D  Q N+ P   S  +T           +    +S     +H + TR+K+GI
Subjt:  YQPNSSNPSSQQQPATPCLQNQPEYHSPI--SANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGI

Query:  FKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
         KPK  ++   +      EP   ++ALS P WK AM  EF AL+ N TW LVPY    N+V +KW+F+ K   DGS++  KAR
Subjt:  FKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]2.2e-16837.96Show/hide
Query:  SPPLNQLLNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQ
        SP  N  L  I ++KLDR N+ LWK+L + ++R  KL+G++LGT  CP  F+    +S K                            NP +  W+A DQ
Subjt:  SPPLNQLLNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQ

Query:  LLLGWLYNSMTSEVA---------------TQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA
         LLGWL NSM  ++A               TQ L G  +++   +L+  F  +RKG MKM +YL  MK  +D L LAGSP+SN +L+ Q L GLD EYN 
Subjt:  LLLGWLYNSMTSEVA---------------TQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA

Query:  IVAMIQGRASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTC
        +V  +  + +++W ++QA+LL FE RL+  N   N    + NASAN A            K    GN+  +  N R + +RG   GRG    +N   Q C
Subjt:  IVAMIQGRASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTC

Query:  QVCGKNQQPNVFMTTQP-------------------TATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSN
           G       +   +P                    A+P    D  WY DSGA+NHVT   D      E+ GK  ++VGNGEKL I   GS  L    N
Subjt:  QVCGKNQQPNVFMTTQP-------------------TATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSN

Query:  VLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSI
         L+L  VL VP+I KNL+S+SKL  DNN+ +EF  N C VKDK TGQ +LKG LKDGLYQL                                   SN  
Subjt:  VLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSI

Query:  PYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRY
        P V + V K+ WHR+LGHP  KVLD ++KDCN+++  ++   FCE+CQFGK H++ F   +        L+H D+WGPAP+LS  G++YY+ F+DD+SR+
Subjt:  PYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRY

Query:  VWLYPLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFL
         W++PLKQKSD +                                  +  E GI  R SCPYTS QN RAERKHRHV E+GLTLLA A MPL YWW+AF 
Subjt:  VWLYPLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFL

Query:  AATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELY
         A  L+N LP+SV   +SP  L+  ++ D+ ALK FGC+CYPCL+PY QHK  FHT +CV++G S+SHKGY+C+N  G     RH+  +    FP    +
Subjt:  AATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELY

Query:  QPNSSNP----SSQQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPS---------SSPDTVLAPQQPD----QNSELASSSSPCP
          ++ NP    +       P           I  +        +H+ ++  N  N E V S         SS   + A    D     NS +  +     
Subjt:  QPNSSNP----SSQQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPS---------SSPDTVLAPQQPD----QNSELASSSSPCP

Query:  QP----THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKA
        Q     TH M TR+K GI KPK  ++   + D    EP  V++AL  P WK AMD E+ ALV N TW+LVPY    N++ +KWIF+ K  +DGSI+  KA
Subjt:  QP----THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKA

Query:  R
        R
Subjt:  R

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]4.4e-15338.14Show/hide
Query:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY
        L    ++KLDR N+ LW+++ +PI+R  +L+G++LG K CP  FI             TA  SS +               NP++E W A DQ LLGWL 
Subjt:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY

Query:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
        NSMT  +AT               Q L G  +R++  +L+  F  +RKG MKM DYL  MK  AD L LAG+P+S  +L+ Q L GLD EYN +V  +  
Subjt:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYR----GRGRGRGYNNYNNNNRQTCQVC
        + +++W +LQA+LL FE R+E  N++ N    + NA+AN+A      S  +  + ++N N   ++ N RG+ +R    GRGRGR +       + TCQVC
Subjt:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYR----GRGRGRGYNNYNNNNRQTCQVC

Query:  G---------------------------KNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNL
        G                           K    N F+ +Q      +I D +WY DSGASNHVT   D   N +E+ GK  +IVGNGEKL I   GS  L
Subjt:  G---------------------------KNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNL

Query:  TDGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFV
              L+L  +L VP+I KNL+S+SKLA DNN+ +EF EN C VKDK TG+ +L+G LKDGLYQL                                  
Subjt:  TDGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFV

Query:  VSNSIPYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLD
          +S  YV++   K+ WHR+LGHP  KVLD ++K CN+++  ++   FCE+CQ+GK H + F      A    +LVH D+WGPAP++S  G++YY+ F+D
Subjt:  VSNSIPYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLD

Query:  DYSRYVWLYPLKQKSD---------------------ALLCG-------------ELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYW
        D++R+ W+YPLKQKSD                      + C              E GI  R SCPYTS QN RAERKHRH+ E GLTLLA A MPL+YW
Subjt:  DYSRYVWLYPLKQKSD---------------------ALLCG-------------ELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYW

Query:  WDAFLAATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGRHITLSILQWFPQVELY
        W+AF  A  L+N LP+SV + KSP  LL  ++ D+ +LK FGC+CYP L+PY +HK  FHT +CV+LG S+SHKGY+C+N  GR   + I +     E +
Subjt:  WDAFLAATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGRHITLSILQWFPQVELY

Query:  QPNSSNPSSQQQPATPCLQNQPEYHSPISANTTPCH---TPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQP-----DQNSELAS-------SSSPCPQP
         P      + + P    L   P  H P+ A  TP     TP  +   +++++  +     SS D +   Q       D  ++LA+       + +     
Subjt:  QPNSSNPSSQQQPATPCLQNQPEYHSPISANTTPCH---TPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQP-----DQNSELAS-------SSSPCPQP

Query:  THPMITRAKAGIFKP
        TH M TR+K GI KP
Subjt:  THPMITRAKAGIFKP

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]1.6e-17138.48Show/hide
Query:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY
        L    ++KLDR NF LWK+L +P++R  K +G++LGTK CP  F+     + K                            NP Y+ W A DQ LLGWL 
Subjt:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY

Query:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
        NSMT ++AT               Q L G  +R+   +L+  F  + K  MKM  YL  MK  AD L LAGSP+S+ +L+ Q L GLD EYN +V  +  
Subjt:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCGK--
        + +++W + QA+LL FE RL+  N   N    + NASAN A           +KN + GN+  +    RG+  RG   GRG    +   R  CQ+CGK  
Subjt:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCGK--

Query:  -------------NQQPNVFMTTQPT-----ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEH
                       + N +   + +     A+P    D  WY DSGASNHVT     + +  E  GK  ++VGNGEKL I   GS  L D    ++L +
Subjt:  -------------NQQPNVFMTTQPT-----ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEH

Query:  VLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVV
        VL VPEI KNL+S+SKL  DNN  +EF EN+C VKDK TG+ +LKG LKDGLYQL   K               E  T  D              Y+++ 
Subjt:  VLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVV

Query:  VSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPL
          K+IWHR+LGHP  KVL+ ++KD N+++  ++   FCE+CQFGK H++ F   +  A    DL+H D+WGPAP+LS   ++YY+ FLDD+SR+ W++PL
Subjt:  VSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPL

Query:  KQKSDAL---------------------------------LCG-ELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLV
        KQKS+ +                                  C  + GI  + SCPYTS QN RAERKHRHV E+GLTLLA A MPL YWW+AF  A  L+
Subjt:  KQKSDAL---------------------------------LCG-ELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLV

Query:  NGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELYQPNSSN
        N LP+SV   +SP  L+  K+ D+TALK FGC+CYPCL+PY QHK  FHT +CV+LG S+SHKGY+C+N  G     RH+  +    FP  E +  ++ N
Subjt:  NGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELYQPNSSN

Query:  PSSQQQPATPCLQNQPEYHSPISANTTPCHT------------------PQSHTPDT--DVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQP-
        P       TP     P + + I+ N T   T                   QS   DT    ++NN     +         +  ++ S+  + ++P PQ  
Subjt:  PSSQQQPATPCLQNQPEYHSPISANTTPCHT------------------PQSHTPDT--DVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQP-

Query:  ---THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
           TH M TR+KAG++KPK  ++   +      EP  V +ALS+P W  AMD E+ AL+ N TW+LVP++   NV+ +KWIF+ K  ADG+I+  KAR
Subjt:  ---THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]5.3e-16737.8Show/hide
Query:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY
        L  I ++KLDR N+ LWK+L +P++R  K +G++LGTK CP  F+    +S K                            NP ++ W+A DQ LLGWL 
Subjt:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY

Query:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
        NSM  ++AT               Q L G  +++   +L+  F  +RKG MKM +YL  MK  +D L L+GSP+SN +L+ Q L GLD EYN +V  +  
Subjt:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVC----
        + +++W ++QA+LL FE RL+  N   N    + NASAN A            K    GN+  +  N R + +RG   GRG    +N     CQVC    
Subjt:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVC----

Query:  ---------------GKNQQPNVFMTTQPT---ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL
                       G+N           +   A+P    D  WY DSGASNHVT   D      E+ GK  ++VGNGEKL I   GS  L    N L+L
Subjt:  ---------------GKNQQPNVFMTTQPT---ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL

Query:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN
          VL VP+I KNL+S+SKL  DNN+++EF  N C VKDK TGQ +LKG LKDGLYQL  +  +                             SN  P V 
Subjt:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN

Query:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY
        + V K+ WHR+LGHP  KVL+ ++KDCN+++  ++   FCE+CQFGK H++ F   +        L+H D+WGPAP+LS  G++YY+ F+DD+SR+ W++
Subjt:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY

Query:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR
        PLKQKSD +                                  +  E GI  R SCPYTS QN RAERKHRHVVE+GLTLLA A MPL YWW+AF  A  
Subjt:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR

Query:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGRHITLSILQWFPQVELYQPNSSNPSS
        L+N L +SV   +SP  L+  ++ D+ ALK FGC+CYPCL+PY QHK  FHT +CV++G S+SHKG    +  G    +              +    + 
Subjt:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGRHITLSILQWFPQVELYQPNSSNPSS

Query:  QQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGIFKPK-AWLSKQQ
        Q   +T    N  E H+  S +    +T    T D +V+        +S   T  + QQ  Q++            TH + TR+K GI KPK  ++   +
Subjt:  QQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGIFKPK-AWLSKQQ

Query:  IDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
         D    EP  V++AL  P WK AMD E+ AL+ N TW+LVP+    N++ +KWIF+ K  +DGSI+  KAR
Subjt:  IDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

TrEMBL top hitse value%identityAlignment
A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)7.7e-17238.48Show/hide
Query:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY
        L    ++KLDR NF LWK+L +P++R  K +G++LGTK CP  F+     + K                            NP Y+ W A DQ LLGWL 
Subjt:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY

Query:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
        NSMT ++AT               Q L G  +R+   +L+  F  + K  MKM  YL  MK  AD L LAGSP+S+ +L+ Q L GLD EYN +V  +  
Subjt:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCGK--
        + +++W + QA+LL FE RL+  N   N    + NASAN A           +KN + GN+  +    RG+  RG   GRG    +   R  CQ+CGK  
Subjt:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCGK--

Query:  -------------NQQPNVFMTTQPT-----ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEH
                       + N +   + +     A+P    D  WY DSGASNHVT     + +  E  GK  ++VGNGEKL I   GS  L D    ++L +
Subjt:  -------------NQQPNVFMTTQPT-----ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEH

Query:  VLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVV
        VL VPEI KNL+S+SKL  DNN  +EF EN+C VKDK TG+ +LKG LKDGLYQL   K               E  T  D              Y+++ 
Subjt:  VLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVV

Query:  VSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPL
          K+IWHR+LGHP  KVL+ ++KD N+++  ++   FCE+CQFGK H++ F   +  A    DL+H D+WGPAP+LS   ++YY+ FLDD+SR+ W++PL
Subjt:  VSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPL

Query:  KQKSDAL---------------------------------LCG-ELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLV
        KQKS+ +                                  C  + GI  + SCPYTS QN RAERKHRHV E+GLTLLA A MPL YWW+AF  A  L+
Subjt:  KQKSDAL---------------------------------LCG-ELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLV

Query:  NGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELYQPNSSN
        N LP+SV   +SP  L+  K+ D+TALK FGC+CYPCL+PY QHK  FHT +CV+LG S+SHKGY+C+N  G     RH+  +    FP  E +  ++ N
Subjt:  NGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELYQPNSSN

Query:  PSSQQQPATPCLQNQPEYHSPISANTTPCHT------------------PQSHTPDT--DVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQP-
        P       TP     P + + I+ N T   T                   QS   DT    ++NN     +         +  ++ S+  + ++P PQ  
Subjt:  PSSQQQPATPCLQNQPEYHSPISANTTPCHT------------------PQSHTPDT--DVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQP-

Query:  ---THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
           TH M TR+KAG++KPK  ++   +      EP  V +ALS+P W  AMD E+ AL+ N TW+LVP++   NV+ +KWIF+ K  ADG+I+  KAR
Subjt:  ---THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

A0A2K3NEN7 Copia-like polyprotein (Fragment)2.6e-16737.8Show/hide
Query:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY
        L  I ++KLDR N+ LWK+L +P++R  K +G++LGTK CP  F+    +S K                            NP ++ W+A DQ LLGWL 
Subjt:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY

Query:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
        NSM  ++AT               Q L G  +++   +L+  F  +RKG MKM +YL  MK  +D L L+GSP+SN +L+ Q L GLD EYN +V  +  
Subjt:  NSMTSEVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVC----
        + +++W ++QA+LL FE RL+  N   N    + NASAN A            K    GN+  +  N R + +RG   GRG    +N     CQVC    
Subjt:  RASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVC----

Query:  ---------------GKNQQPNVFMTTQPT---ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL
                       G+N           +   A+P    D  WY DSGASNHVT   D      E+ GK  ++VGNGEKL I   GS  L    N L+L
Subjt:  ---------------GKNQQPNVFMTTQPT---ATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL

Query:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN
          VL VP+I KNL+S+SKL  DNN+++EF  N C VKDK TGQ +LKG LKDGLYQL  +  +                             SN  P V 
Subjt:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN

Query:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY
        + V K+ WHR+LGHP  KVL+ ++KDCN+++  ++   FCE+CQFGK H++ F   +        L+H D+WGPAP+LS  G++YY+ F+DD+SR+ W++
Subjt:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY

Query:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR
        PLKQKSD +                                  +  E GI  R SCPYTS QN RAERKHRHVVE+GLTLLA A MPL YWW+AF  A  
Subjt:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR

Query:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGRHITLSILQWFPQVELYQPNSSNPSS
        L+N L +SV   +SP  L+  ++ D+ ALK FGC+CYPCL+PY QHK  FHT +CV++G S+SHKG    +  G    +              +    + 
Subjt:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGRHITLSILQWFPQVELYQPNSSNPSS

Query:  QQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGIFKPK-AWLSKQQ
        Q   +T    N  E H+  S +    +T    T D +V+        +S   T  + QQ  Q++            TH + TR+K GI KPK  ++   +
Subjt:  QQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGIFKPK-AWLSKQQ

Query:  IDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
         D    EP  V++AL  P WK AMD E+ AL+ N TW+LVP+    N++ +KWIF+ K  +DGSI+  KAR
Subjt:  IDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

A0A2Z6MBG6 Integrase catalytic domain-containing protein9.7e-17538.45Show/hide
Query:  TIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLYNSMTS
        ++KLDR N+ LWK+L +P++R  KL+G++LGT+ CP  FI                           ++ D     N  +  W A DQ LLGW+ NSMT+
Subjt:  TIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLYNSMTS

Query:  EVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT
        E+AT               Q L G  +R++  +L+  F   RKG MKM DYL  MK   D L LAG+PVS  +L+ Q L GLD EYN +V  +  + +++
Subjt:  EVAT---------------QELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVT

Query:  WAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCG--------
        W +LQA+LL FE R+E  N + N    + NA+AN+A     N      K+SNN   +W   N R  G+RG GRGRG      + +  CQVCG        
Subjt:  WAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCG--------

Query:  -------------------KNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL
                           K    N F+ +Q      ++ D +WY DSGASNHVT   +   + TE+ GK  ++VGNGEKL I   GS  L      L+L
Subjt:  -------------------KNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHL

Query:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN
          +L VP I KNL+S+SKLA DNN+ +EF EN C VKDK TG+V+LKG LKDGLYQL G K +NP +                              +V+
Subjt:  EHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVN

Query:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY
        V   K+ WHRRLGHP  KVLD +++ C ++V  +++  FCE+CQ+GK H++ F   +  A    +LVH D+WGPAP+++  G++YY+ F+DD+SR+ W+Y
Subjt:  VVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLY

Query:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR
        PLKQKS+ +                                  L  E GI  R SCPYTS QN RAERKHRH+ E GLTLLA A MPLHYWW+AF  A  
Subjt:  PLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATR

Query:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLS-----ILQWFPQVEL
        L+N LP+ V   +SP  L+L K+ D+  LK FGC+CYPCL+PY QHK  +HT +CV+LG S+SHKGY+C+N  G     RH+  +         F     
Subjt:  LVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLS-----ILQWFPQVEL

Query:  YQPNSSNPSSQQQPATPCLQNQPEYHSPI--SANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGI
            + N  S   P         +   PI  + N    +T  S   ++D  Q N+ P   S  +T           +    +S     +H + TR+K+GI
Subjt:  YQPNSSNPSSQQQPATPCLQNQPEYHSPI--SANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGI

Query:  FKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
         KPK  ++   +      EP   ++ALS P WK AM  EF AL+ N TW LVPY    N+V +KW+F+ K   DGS++  KAR
Subjt:  FKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

A0A2Z6P4D5 Integrase catalytic domain-containing protein1.0e-16837.96Show/hide
Query:  SPPLNQLLNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQ
        SP  N  L  I ++KLDR N+ LWK+L + ++R  KL+G++LGT  CP  F+    +S K                            NP +  W+A DQ
Subjt:  SPPLNQLLNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQ

Query:  LLLGWLYNSMTSEVA---------------TQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA
         LLGWL NSM  ++A               TQ L G  +++   +L+  F  +RKG MKM +YL  MK  +D L LAGSP+SN +L+ Q L GLD EYN 
Subjt:  LLLGWLYNSMTSEVA---------------TQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNA

Query:  IVAMIQGRASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTC
        +V  +  + +++W ++QA+LL FE RL+  N   N    + NASAN A            K    GN+  +  N R + +RG   GRG    +N   Q C
Subjt:  IVAMIQGRASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQTC

Query:  QVCGKNQQPNVFMTTQP-------------------TATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSN
           G       +   +P                    A+P    D  WY DSGA+NHVT   D      E+ GK  ++VGNGEKL I   GS  L    N
Subjt:  QVCGKNQQPNVFMTTQP-------------------TATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSN

Query:  VLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSI
         L+L  VL VP+I KNL+S+SKL  DNN+ +EF  N C VKDK TGQ +LKG LKDGLYQL                                   SN  
Subjt:  VLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSI

Query:  PYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRY
        P V + V K+ WHR+LGHP  KVLD ++KDCN+++  ++   FCE+CQFGK H++ F   +        L+H D+WGPAP+LS  G++YY+ F+DD+SR+
Subjt:  PYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRY

Query:  VWLYPLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFL
         W++PLKQKSD +                                  +  E GI  R SCPYTS QN RAERKHRHV E+GLTLLA A MPL YWW+AF 
Subjt:  VWLYPLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFL

Query:  AATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELY
         A  L+N LP+SV   +SP  L+  ++ D+ ALK FGC+CYPCL+PY QHK  FHT +CV++G S+SHKGY+C+N  G     RH+  +    FP    +
Subjt:  AATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEG-----RHITLSILQWFPQVELY

Query:  QPNSSNP----SSQQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPS---------SSPDTVLAPQQPD----QNSELASSSSPCP
          ++ NP    +       P           I  +        +H+ ++  N  N E V S         SS   + A    D     NS +  +     
Subjt:  QPNSSNP----SSQQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPS---------SSPDTVLAPQQPD----QNSELASSSSPCP

Query:  QP----THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKA
        Q     TH M TR+K GI KPK  ++   + D    EP  V++AL  P WK AMD E+ ALV N TW+LVPY    N++ +KWIF+ K  +DGSI+  KA
Subjt:  QP----THPMITRAKAGIFKPK-AWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKA

Query:  R
        R
Subjt:  R

A0A803PM38 Uncharacterized protein9.8e-16739.14Show/hide
Query:  PPLNQLLNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQL
        P     LNQ   +KLDR NF LW+ +   I+R ++L+G+L GT P P  F+                 SS+    S +S        NP +E W+  DQL
Subjt:  PPLNQLLNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQL

Query:  LLGWLYNSMTSEV---------------ATQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAI
        LLGWLY SMT  +               A +ELFG  S+A+ D  R   Q +RKG + M DYLR  +  AD L LAG P     LVS VL GLD EY  +
Subjt:  LLGWLYNSMTSEV---------------ATQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAI

Query:  VAMIQGRASVTWAELQAELLVFEKRLELQNTVKNT---VAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQ
        V +I+ R S TW +LQ  LL  + ++E  ++   +        N SA++A NKG + P     N NN NR   H N RG+  R RGRG       +  R 
Subjt:  VAMIQGRASVTWAELQAELLVFEKRLELQNTVKNT---VAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQ

Query:  TCQVCGKNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNL-TDGSNVLHLEHVLCVPEIAKN
        TCQVCGK          +                 GASNH+TS  + ++   EY GKE V V NG +LPI +IG  +L T  ++ L L+ +L VP I KN
Subjt:  TCQVCGKNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNL-TDGSNVLHLEHVLCVPEIAKN

Query:  LVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSN-SIPYVNVVVS--KQIWH
        L+S+SKL  DNNV +EF  + C VKDK TGQVVLKG LKDGLYQ        P ST++ +  R  +       +    V SN + P  N ++   K  WH
Subjt:  LVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSN-SIPYVNVVVS--KQIWH

Query:  RRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPLKQKSDAL
        RRLGHP+ +VLD+++   N++  +N S  FC++CQ GKSH + F +   RA++  +LVH D+WGP+P++S   +RYYI F+DD+SRY W+YPLK KS+AL
Subjt:  RRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPLKQKSDAL

Query:  LC----------------------------------GELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSV
                                             + GI  ++ CP+TS QN RAERKHRH+VEMGLTLLA A +P  YWWDAF  A  L+N LPT V
Subjt:  LC----------------------------------GELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSV

Query:  LNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGR-HITLSILQWFPQVELYQPNSSNPSSQQQPATPC
        L  K+P E+L  ++ D+  LKVFG SC+PCLR YQ HKF FH+ +CV LG S  HKGY+C++  GR +I+  ++  F + E   P  S   +  +P TP 
Subjt:  LNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKEGR-HITLSILQWFPQVELYQPNSSNPSSQQQPATPC

Query:  ----------------LQNQPEYHSPISANTTP-----CHTPQSHTPDTDVNQNNSEPVPSSSPDTV-----LAPQQPDQNSELASSSSPCPQ-------
                          +Q ++ S I  N T        T     PD    Q N      S    +     +  QQ    + L S++ P          
Subjt:  ----------------LQNQPEYHSPISANTTP-----CHTPQSHTPDTDVNQNNSEPVPSSSPDTV-----LAPQQPDQNSELASSSSPCPQ-------

Query:  ----PTHPMITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
             THPMITRAKAGIFKPK +L++ +     +EP  +++AL    W  AM +E  AL +N TW LVP  P ++++ NKW+++ KRNADGS Q  KAR
Subjt:  ----PTHPMITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.0e-1722.46Show/hide
Query:  MKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQ--GRASVTWAELQAELLVFEKRLELQNTVKNT--------VAFSQNASANM
        M +  +  +       L  AG+ +   + +S +L+ L   Y+ I+  I+     ++T A ++  LL  ++ ++++N   +T        V  + N   N 
Subjt:  MKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQ--GRASVTWAELQAELLVFEKRLELQNTVKNT--------VAFSQNASANM

Query:  AINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGR-GRGYNNYNNNNRQTCQVCGKNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNI
                P++  K ++       H  + G+  +      R  NN N  N +  QV         FM  +   T   + +  +  DSGAS+H+ ++    
Subjt:  AINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGR-GRGYNNYNNNNRQTCQVCGKNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNI

Query:  SNPTE-YGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGL
        ++  E     +  +   GE +  T  G   L +   +  LE VL   E A NL+S+ +L Q+  + IEF        DKS G  + K  L   + +  G+
Subjt:  SNPTE-YGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGL

Query:  KTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVVVSKQIWHRRLGHPAE-KVLD----SIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSL
            P+     N+     N     N                    ++WH R GH ++ K+L+    ++  D +L   +  S   CE C  GK   + F  
Subjt:  KTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVVVSKQIWHRRLGHPAE-KVLD----SIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSL

Query:  FTGRASSRFDL--VHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPLKQKSDAL------------------------------------LCGELGIH
           +   +  L  VH D+ GP   ++ D   Y+++F+D ++ Y   Y +K KSD                                       C + GI 
Subjt:  FTGRASSRFDL--VHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPLKQKSDAL------------------------------------LCGELGIH

Query:  SRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSVL--NGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFS
           + P+T   N  +ER  R + E   T+++ A +   +W +A L AT L+N +P+  L  + K+P E+  +KK     L+VFG + Y  ++  +Q KF 
Subjt:  SRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSVL--NGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFS

Query:  FHTEQCVYLG
          + + +++G
Subjt:  FHTEQCVYLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.7e-3622.5Show/hide
Query:  THADNLGLAGSPVSNRNLVSQVLLGLDEEY-NAIVAMIQGRASVTWAELQAELLVFEK-RLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNG
        T   NLG+    +   +    +L  L   Y N    ++ G+ ++   ++ + LL+ EK R + +N  +  +   +  S   + N  G S  +    + + 
Subjt:  THADNLGLAGSPVSNRNLVSQVLLGLDEEY-NAIVAMIQGRASVTWAELQAELLVFEK-RLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNG

Query:  NRSWTHYNQRGNGYRGRG-----RGRGYNNYNNNNRQTCQVCGKNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVG
        +R    YN    G+  R      +G+G  +   N+  T  +   N    +F+  +      +  +  W  D+ AS+H T   D        G    V +G
Subjt:  NRSWTHYNQRGNGYRGRG-----RGRGYNNYNNNNRQTCQVCGKNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVG

Query:  NGEKLPITYIGS-CNLTDGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLR
        N     I  IG  C  T+    L L+ V  VP++  NL+S   L +D       ++ + L K      V+ KG  +  LY+            + +   +
Subjt:  NGEKLPITYIGS-CNLTDGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLR

Query:  QENNTGDDRNIGSVFVVSNSIPYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPA
         E N   D                   +S  +WH+R+GH +EK L  + K   +      +   C+ C FGK H V F   + R  +  DLV+ D+ GP 
Subjt:  QENNTGDDRNIGSVFVVSNSIPYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPA

Query:  PVLSGDGYRYYILFLDDYSRYVWLYPLK---------QKSDALL---------------------------CGELGIHSRYSCPYTSAQNDRAERKHRHV
         + S  G +Y++ F+DD SR +W+Y LK         QK  AL+                           C   GI    + P T   N  AER +R +
Subjt:  PVLSGDGYRYYILFLDDYSRYVWLYPLK---------QKSDALL---------------------------CGELGIHSRYSCPYTSAQNDRAERKHRHV

Query:  VEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKE
        VE   ++L  A +P  +W +A   A  L+N  P+  L  + P  +  +K++ ++ LKVFGC  +  +   Q+ K    +  C+++G      GYR  +  
Subjt:  VEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKE

Query:  GRHITLSILQWFPQVELYQPNSSNPSSQQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPC
         + +  S    F + E+      +   +       +      ++P SA +T            +V++   +P      + +   +Q D+  E     +  
Subjt:  GRHITLSILQWFPQVELYQPNSSNPSSQQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASSSSPC

Query:  PQPTHPMITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMP---RWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
         +   P+    +  +   +   ++  +     EP  +++ LS P   +   AM  E  +L KN T+ LV        +  KW+F++K++ D  +  YKAR
Subjt:  PQPTHPMITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMP---RWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

P92520 Uncharacterized mitochondrial protein AtMg008201.1e-1041.76Show/hide
Query:  MITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
        M+TR+KAGI K     S         EP  V  AL  P W  AM  E  AL +N TW LVP   + N++G KW+F+ K ++DG++   KAR
Subjt:  MITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-10529.67Show/hide
Query:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY
        +N     KL   N+L+W      +   Y+L G L G+   PP  I          T+A  R                    NP Y  W   D+L+   + 
Subjt:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY

Query:  NSMTSEV---------------ATQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG
         +++  V                 ++++   S      LR   +Q  KG   + DY++ + T  D L L G P+ +   V +VL  L EEY  ++  I  
Subjt:  NSMTSEV---------------ATQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQG

Query:  R-ASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQT------C
        +    T  E+   LL  E ++         +A S      +  N   +     T N+NNGNR+   Y+ R N    +   +   N++ NN Q+      C
Subjt:  R-ASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRGRGYNNYNNNNRQT------C

Query:  QVCG--------------------KNQQPNVFMTTQPTAT---PETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLT
        Q+CG                      Q P+ F   QP A        +  NW  DSGA++H+TS+F+N+S    Y G + V+V +G  +PI++ GS +L+
Subjt:  QVCG--------------------KNQQPNVFMTTQPTAT---PETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLT

Query:  DGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVV
          S  L+L ++L VP I KNL+S+ +L   N V +EF      VKD +TG  +L+G  KD LY+   + +  P+S   S   +  +++            
Subjt:  DGSNVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVV

Query:  SNSIPYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHF--CESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFL
                       WH RLGHPA  +L+S++ + +L V +N S  F  C  C   KS+ V FS  T  ++   + ++ D+W  +P+LS D YRYY++F+
Subjt:  SNSIPYVNVVVSKQIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHF--CESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFL

Query:  DDYSRYVWLYPLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHY
        D ++RY WLYPLKQKS                                        + GI    S P+T   N  +ERKHRH+VE GLTLL+HAS+P  Y
Subjt:  DDYSRYVWLYPLKQKSDAL----------------------------------LCGELGIHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHY

Query:  WWDAFLAATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKE------GRHI-------
        W  AF  A  L+N LPT +L  +SP + L     ++  L+VFGC+CYP LRPY QHK    + QCV+LG S +   Y C++ +       RH+       
Subjt:  WWDAFLAATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMNKE------GRHI-------

Query:  -------TLSILQ---------WFPQVEL-------------------------------YQPNSSN---------PSSQQ---------QPATPCLQNQ
               TLS +Q         W P   L                                Q +SSN         PSS +         QP T   Q Q
Subjt:  -------TLSILQ---------WFPQVEL-------------------------------YQPNSSN---------PSSQQ---------QPATPCLQNQ

Query:  PEYHSPISANTTPCHTPQSHTP-------DTDVNQNNSEPVPSSS---------PDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGIFKPKAWLS
         + HS  S NT+  + P + +P        T    ++S P P++S         P ++L    P   +++ ++++  P  TH M TRAKAGI KP    S
Subjt:  PEYHSPISANTTPCHTPQSHTP-------DTDVNQNNSEPVPSSS---------PDTVLAPQQPDQNSELASSSSPCPQPTHPMITRAKAGIFKPKAWLS

Query:  KQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPS-LNVVGNKWIFRIKRNADGSIQCYKAR
              A +EP     AL   RW+ AM +E +A + N TW LVP  PS + +VG +WIF  K N+DGS+  YKAR
Subjt:  KQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPS-LNVVGNKWIFRIKRNADGSIQCYKAR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.3e-9829Show/hide
Query:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY
        +N     KL   N+L+W      +   Y+L G L G+ P PP  I          T+A  R                    NP Y  W   D+L+   + 
Subjt:  LNQITTIKLDRGNFLLWKNLAMPILRSYKLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLY

Query:  NSMTSEVATQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELL
         +++  V         +    + LR+ +     G++    ++    T  D L L G P+ +   V +VL  L ++Y  ++  I  +    +  E+   L+
Subjt:  NSMTSEVATQELFGVQSRAEEDFLRQTFQQSRKGNMKMCDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGR-ASVTWAELQAELL

Query:  VFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRG-RGYNNYNNNNRQTCQVCG-------KNQQPNVFM
          E +L   N+ +         +AN+  ++  N+ R   +N+   NR++ + N R N ++    G R  N         CQ+C        +  Q + F 
Subjt:  VFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSWTHYNQRGNGYRGRGRG-RGYNNYNNNNRQTCQVCG-------KNQQPNVFM

Query:  TT---QPTATPETIADP-------------NWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEHVLCVPEIAKN
        +T   Q + +P T   P             NW  DSGA++H+TS+F+N+S    Y G + V++ +G  +PIT+ GS +L   S  L L  VL VP I KN
Subjt:  TT---QPTATPETIADP-------------NWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGSNVLHLEHVLCVPEIAKN

Query:  LVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVVVSKQIWHRRL
        L+S+ +L   N V +EF      VKD +TG  +L+G  KD LY+        P+++S +                S+F    S        +   WH RL
Subjt:  LVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVVVSKQIWHRRL

Query:  GHPAEKVLDSIVKDCNLQVKVNESSHF--CESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPLKQKS---D
        GHP+  +L+S++ + +L V +N S     C  C   KSH V FS  T  +S   + ++ D+W  +P+LS D YRYY++F+D ++RY WLYPLKQKS   D
Subjt:  GHPAEKVLDSIVKDCNLQVKVNESSHF--CESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPLKQKS---D

Query:  ALLCGELGIHSRY-------------------------------SCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSVL
          +  +  + +R+                               S P+T   N  +ERKHRH+VEMGLTLL+HAS+P  YW  AF  A  L+N LPT +L
Subjt:  ALLCGELGIHSRY-------------------------------SCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSVL

Query:  NGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMN-KEGRHITLSILQWFPQVELYQPNSSNPSSQQQ------
          +SP + L  +  ++  LKVFGC+CYP LRPY +HK    ++QC ++G S +   Y C++   GR  T   +Q+  +   +   +   S+ Q+      
Subjt:  NGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLGMSSSHKGYRCMN-KEGRHITLSILQWFPQVELYQPNSSNPSSQQQ------

Query:  ----------------PATPCL-------QNQPEYHSPI--------------------SANTTPCHTPQSHT--PDTDVNQNNSEPVPSSSPDTVLAPQ
                        PA PCL          P   SP+                    S  T P H     T  P    N N++ P+ ++      +P 
Subjt:  ----------------PATPCL-------QNQPEYHSPI--------------------SANTTPCHTPQSHT--PDTDVNQNNSEPVPSSSPDTVLAPQ

Query:  QPDQNSEL------------------------ASSSSPCPQP------------------THPMITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSM
         P+QNS L                        +SS+S  P P                  TH M TRAK GI KP    S      A +EP     A+  
Subjt:  QPDQNSEL------------------------ASSSSPCPQP------------------THPMITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSM

Query:  PRWKTAMDNEFSALVKNDTWSLV-PYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
         RW+ AM +E +A + N TW LV P  PS+ +VG +WIF  K N+DGS+  YKAR
Subjt:  PRWKTAMDNEFSALVKNDTWSLV-PYDPSLNVVGNKWIFRIKRNADGSIQCYKAR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-0834.04Show/hide
Query:  EPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKARCGGCVATLSRYRKSGKFTESVTTLGKLAS
        EP+   +A     W  AMD+E  A+    TW +    P+   +G KW+++IK N+DG+I+ YKAR    VA     ++   F E+ + + KL S
Subjt:  EPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKARCGGCVATLSRYRKSGKFTESVTTLGKLAS

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.4e-0837.5Show/hide
Query:  QIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPV
        ++WH RL H +++ ++ +VK   L      S  FCE C +GK+H V FS       +  D VH DLWG   V
Subjt:  QIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.0e-1241.76Show/hide
Query:  MITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR
        M+TR+KAGI K     S         EP  V  AL  P W  AM  E  AL +N TW LVP   + N++G KW+F+ K ++DG++   KAR
Subjt:  MITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGCGGGGCCCCTTTGTTCAAGACTCGGAGTCAGCACTTCAGAGAACAACCTTTACTAACCCTAAAACAGATCCCTCTTTATTTCTACTCTCTCCCTCGGTTCC
CTTTTCAATCGGTGCCTTCATCACAAGTCCACATCGTCGGCCACTATTGAAGCTCCACTTCCGGCGATGTCTTCACGCGATCTCTAGCGAACGTCTTCGTCGAGCAGCGC
ACATCCAGCAACGTCTCCTAGCGAGCAGAGCAGCAACAGCAAGCACCGTCTTTGTTCGTCCTCTACGGCGGCGTCCATGGTGGTGCAGCGGTAGTTCATGTTTTCGGCTG
CGGCAGCTGTCCCAAAATATTATTCCAAAATTTGTTGCATTTGATTTTGCCCTAAGGGCCTCTCTGTATAAATGTAGCTATGAGCTACTGTTGAAAAAAATTAAGAAGGA
AGATAGAAAATTCTCCTCTGCTCTCTGTGTTTTATCTTGGTATCAAAGCTTCAAAATGGCCAACGCCTCGTTGACAGCTTCACCCACCATGACGGTTGGCACCACAACCT
TCACAAGCCCGCCTCTTAATCAACTGTTGAACCAGATTACCACTATCAAGCTGGATCGTGGAAATTTTCTGTTATGGAAGAATTTGGCCATGCCCATCCTTCGGAGCTAC
AAGCTTGAAGGCCATTTACTGGGAACTAAACCATGTCCCCCGATGTTTATTCAACAAGAAGGAGAATCAGCTAAGGGTGTTACTGAAGCAACCGCAAGAGCATCGAGCTC
TCAAACTGGAGCAAGTAGAGCTTCCACCCCTGATGAGAGGATGATTGCAAATCCTCAATACGAAGCCTGGGTAGCTGTTGATCAACTACTCCTCGGATGGTTGTATAACT
CCATGACCTCAGAGGTTGCTACACAGGAACTCTTCGGAGTACAGTCAAGGGCAGAAGAGGATTTTCTTCGCCAGACGTTTCAACAATCAAGGAAAGGTAACATGAAAATG
TGTGATTATCTTCGTTTAATGAAAACTCATGCCGACAATCTGGGACTTGCTGGGAGCCCGGTGTCGAATAGAAATTTGGTTTCACAGGTTTTGCTAGGATTGGATGAGGA
ATATAATGCGATTGTGGCCATGATACAAGGAAGGGCCAGTGTAACTTGGGCAGAACTCCAGGCTGAGTTGCTAGTGTTTGAAAAAAGACTGGAGTTACAAAACACGGTGA
AGAATACAGTGGCTTTTAGCCAAAATGCATCGGCCAACATGGCGATCAACAAAGGAGGAAACTCTCCAAGACAACCCACTAAGAATTCAAACAATGGCAATCGGTCATGG
ACACATTATAACCAGAGGGGAAATGGCTATCGCGGTCGTGGCAGAGGCCGTGGATATAATAACTACAACAACAACAATCGTCAAACCTGTCAAGTGTGTGGAAAGAATCA
ACAACCTAATGTGTTCATGACCACTCAACCAACAGCAACGCCCGAGACTATAGCTGATCCTAATTGGTATGCTGATAGTGGGGCCTCAAATCATGTGACAAGCAACTTTG
ACAACATTTCCAACCCTACTGAGTATGGAGGTAAGGAATGTGTGATAGTAGGAAATGGAGAAAAATTACCTATAACATATATAGGATCCTGCAACTTGACTGATGGTAGC
AATGTGTTGCATTTAGAACATGTTTTATGTGTACCTGAAATTGCCAAGAACCTAGTTAGTATGTCTAAGCTGGCACAAGATAATAATGTTTATATTGAGTTTCATGAGAA
TTTCTGCCTTGTTAAGGACAAGAGTACGGGACAAGTGGTGCTGAAAGGAGCTCTTAAAGATGGGCTTTATCAACTTCAAGGACTAAAGACGAAGAATCCTCTGTCTACCT
CTACTTCTAATTACCTGCGGCAAGAAAATAACACTGGAGATGATAGAAATATAGGGTCTGTTTTTGTTGTTTCAAACTCAATTCCATATGTTAATGTTGTTGTGTCTAAA
CAAATATGGCACAGAAGGCTTGGTCATCCTGCTGAAAAAGTGTTAGATTCTATAGTTAAAGATTGTAATCTTCAAGTTAAAGTCAATGAATCCTCTCATTTTTGTGAGTC
CTGCCAGTTTGGGAAGTCTCATGTTGTTAAATTCTCATTATTTACGGGTCGAGCATCTAGTCGTTTTGATCTTGTTCACATTGACCTTTGGGGTCCAGCCCCTGTTCTTT
CAGGTGATGGGTACAGATATTACATACTATTCTTGGATGACTACAGTCGCTATGTATGGTTGTATCCTTTGAAACAAAAGAGCGATGCACTGTTATGTGGTGAACTGGGG
ATTCATTCTCGTTACTCTTGCCCATATACTTCAGCACAAAACGATAGAGCGGAAAGGAAACATCGCCATGTTGTAGAAATGGGACTCACATTATTAGCACATGCATCAAT
GCCATTACATTACTGGTGGGATGCCTTTCTCGCTGCTACAAGATTGGTCAATGGGCTGCCAACTTCAGTGTTAAATGGTAAGTCTCCGATGGAACTCTTGCTGTCTAAGA
AGCTTGATTTTACTGCTTTAAAAGTCTTTGGGTGCTCATGCTATCCATGTTTAAGGCCTTATCAACAACATAAATTCTCTTTTCATACTGAACAATGTGTTTACCTTGGT
ATGAGTTCCTCTCACAAAGGCTACAGGTGTATGAATAAGGAAGGCAGGCACATCACCCTCTCTATCCTACAATGGTTCCCTCAAGTTGAACTCTATCAACCAAATTCGTC
CAACCCATCCTCACAACAACAACCAGCCACACCGTGTCTCCAAAACCAACCTGAGTATCATTCTCCTATATCTGCCAATACCACACCCTGTCATACACCACAGTCTCATA
CTCCAGACACTGATGTTAACCAAAACAATTCTGAACCAGTACCCTCCTCCTCTCCAGACACTGTCTTAGCCCCACAGCAACCTGATCAGAACTCAGAACTGGCCTCTTCT
TCCAGTCCCTGTCCTCAGCCCACACATCCTATGATTACCCGTGCAAAAGCTGGAATCTTTAAACCGAAAGCCTGGTTATCGAAGCAACAGATTGATTGGGCCCTCACTGA
GCCCACACGAGTACAGGATGCTTTGTCCATGCCTCGATGGAAGACTGCAATGGACAATGAATTCTCGGCTCTAGTAAAAAATGATACCTGGTCGCTGGTGCCTTATGATC
CCTCCTTGAATGTAGTAGGCAATAAGTGGATATTCAGAATTAAGAGAAATGCAGATGGCTCCATACAATGTTATAAGGCTCGTTGTGGGGGTTGCGTCGCGACGCTGTCG
CGATACCGCAAATCTGGAAAATTCACCGAGAGCGTCACGACGCTGGGAAAACTAGCTAGCTATGCTCGCTATCATGATCTCAAATCTGCACGTCTTCGCAGAAGCTCATC
TGAAAATATGGAGAAATGGAGGAAGAAAGGAAGGAAACACAAGGAGAAGAAAAGAGGAAAAAAAAATATTCCGGCGACGACAATCGACGGTGAATGGCGACAGGCAACGG
CTCCGGTGGCGGTGGCGGTGATGGTTGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGCGGGGCCCCTTTGTTCAAGACTCGGAGTCAGCACTTCAGAGAACAACCTTTACTAACCCTAAAACAGATCCCTCTTTATTTCTACTCTCTCCCTCGGTTCC
CTTTTCAATCGGTGCCTTCATCACAAGTCCACATCGTCGGCCACTATTGAAGCTCCACTTCCGGCGATGTCTTCACGCGATCTCTAGCGAACGTCTTCGTCGAGCAGCGC
ACATCCAGCAACGTCTCCTAGCGAGCAGAGCAGCAACAGCAAGCACCGTCTTTGTTCGTCCTCTACGGCGGCGTCCATGGTGGTGCAGCGGTAGTTCATGTTTTCGGCTG
CGGCAGCTGTCCCAAAATATTATTCCAAAATTTGTTGCATTTGATTTTGCCCTAAGGGCCTCTCTGTATAAATGTAGCTATGAGCTACTGTTGAAAAAAATTAAGAAGGA
AGATAGAAAATTCTCCTCTGCTCTCTGTGTTTTATCTTGGTATCAAAGCTTCAAAATGGCCAACGCCTCGTTGACAGCTTCACCCACCATGACGGTTGGCACCACAACCT
TCACAAGCCCGCCTCTTAATCAACTGTTGAACCAGATTACCACTATCAAGCTGGATCGTGGAAATTTTCTGTTATGGAAGAATTTGGCCATGCCCATCCTTCGGAGCTAC
AAGCTTGAAGGCCATTTACTGGGAACTAAACCATGTCCCCCGATGTTTATTCAACAAGAAGGAGAATCAGCTAAGGGTGTTACTGAAGCAACCGCAAGAGCATCGAGCTC
TCAAACTGGAGCAAGTAGAGCTTCCACCCCTGATGAGAGGATGATTGCAAATCCTCAATACGAAGCCTGGGTAGCTGTTGATCAACTACTCCTCGGATGGTTGTATAACT
CCATGACCTCAGAGGTTGCTACACAGGAACTCTTCGGAGTACAGTCAAGGGCAGAAGAGGATTTTCTTCGCCAGACGTTTCAACAATCAAGGAAAGGTAACATGAAAATG
TGTGATTATCTTCGTTTAATGAAAACTCATGCCGACAATCTGGGACTTGCTGGGAGCCCGGTGTCGAATAGAAATTTGGTTTCACAGGTTTTGCTAGGATTGGATGAGGA
ATATAATGCGATTGTGGCCATGATACAAGGAAGGGCCAGTGTAACTTGGGCAGAACTCCAGGCTGAGTTGCTAGTGTTTGAAAAAAGACTGGAGTTACAAAACACGGTGA
AGAATACAGTGGCTTTTAGCCAAAATGCATCGGCCAACATGGCGATCAACAAAGGAGGAAACTCTCCAAGACAACCCACTAAGAATTCAAACAATGGCAATCGGTCATGG
ACACATTATAACCAGAGGGGAAATGGCTATCGCGGTCGTGGCAGAGGCCGTGGATATAATAACTACAACAACAACAATCGTCAAACCTGTCAAGTGTGTGGAAAGAATCA
ACAACCTAATGTGTTCATGACCACTCAACCAACAGCAACGCCCGAGACTATAGCTGATCCTAATTGGTATGCTGATAGTGGGGCCTCAAATCATGTGACAAGCAACTTTG
ACAACATTTCCAACCCTACTGAGTATGGAGGTAAGGAATGTGTGATAGTAGGAAATGGAGAAAAATTACCTATAACATATATAGGATCCTGCAACTTGACTGATGGTAGC
AATGTGTTGCATTTAGAACATGTTTTATGTGTACCTGAAATTGCCAAGAACCTAGTTAGTATGTCTAAGCTGGCACAAGATAATAATGTTTATATTGAGTTTCATGAGAA
TTTCTGCCTTGTTAAGGACAAGAGTACGGGACAAGTGGTGCTGAAAGGAGCTCTTAAAGATGGGCTTTATCAACTTCAAGGACTAAAGACGAAGAATCCTCTGTCTACCT
CTACTTCTAATTACCTGCGGCAAGAAAATAACACTGGAGATGATAGAAATATAGGGTCTGTTTTTGTTGTTTCAAACTCAATTCCATATGTTAATGTTGTTGTGTCTAAA
CAAATATGGCACAGAAGGCTTGGTCATCCTGCTGAAAAAGTGTTAGATTCTATAGTTAAAGATTGTAATCTTCAAGTTAAAGTCAATGAATCCTCTCATTTTTGTGAGTC
CTGCCAGTTTGGGAAGTCTCATGTTGTTAAATTCTCATTATTTACGGGTCGAGCATCTAGTCGTTTTGATCTTGTTCACATTGACCTTTGGGGTCCAGCCCCTGTTCTTT
CAGGTGATGGGTACAGATATTACATACTATTCTTGGATGACTACAGTCGCTATGTATGGTTGTATCCTTTGAAACAAAAGAGCGATGCACTGTTATGTGGTGAACTGGGG
ATTCATTCTCGTTACTCTTGCCCATATACTTCAGCACAAAACGATAGAGCGGAAAGGAAACATCGCCATGTTGTAGAAATGGGACTCACATTATTAGCACATGCATCAAT
GCCATTACATTACTGGTGGGATGCCTTTCTCGCTGCTACAAGATTGGTCAATGGGCTGCCAACTTCAGTGTTAAATGGTAAGTCTCCGATGGAACTCTTGCTGTCTAAGA
AGCTTGATTTTACTGCTTTAAAAGTCTTTGGGTGCTCATGCTATCCATGTTTAAGGCCTTATCAACAACATAAATTCTCTTTTCATACTGAACAATGTGTTTACCTTGGT
ATGAGTTCCTCTCACAAAGGCTACAGGTGTATGAATAAGGAAGGCAGGCACATCACCCTCTCTATCCTACAATGGTTCCCTCAAGTTGAACTCTATCAACCAAATTCGTC
CAACCCATCCTCACAACAACAACCAGCCACACCGTGTCTCCAAAACCAACCTGAGTATCATTCTCCTATATCTGCCAATACCACACCCTGTCATACACCACAGTCTCATA
CTCCAGACACTGATGTTAACCAAAACAATTCTGAACCAGTACCCTCCTCCTCTCCAGACACTGTCTTAGCCCCACAGCAACCTGATCAGAACTCAGAACTGGCCTCTTCT
TCCAGTCCCTGTCCTCAGCCCACACATCCTATGATTACCCGTGCAAAAGCTGGAATCTTTAAACCGAAAGCCTGGTTATCGAAGCAACAGATTGATTGGGCCCTCACTGA
GCCCACACGAGTACAGGATGCTTTGTCCATGCCTCGATGGAAGACTGCAATGGACAATGAATTCTCGGCTCTAGTAAAAAATGATACCTGGTCGCTGGTGCCTTATGATC
CCTCCTTGAATGTAGTAGGCAATAAGTGGATATTCAGAATTAAGAGAAATGCAGATGGCTCCATACAATGTTATAAGGCTCGTTGTGGGGGTTGCGTCGCGACGCTGTCG
CGATACCGCAAATCTGGAAAATTCACCGAGAGCGTCACGACGCTGGGAAAACTAGCTAGCTATGCTCGCTATCATGATCTCAAATCTGCACGTCTTCGCAGAAGCTCATC
TGAAAATATGGAGAAATGGAGGAAGAAAGGAAGGAAACACAAGGAGAAGAAAAGAGGAAAAAAAAATATTCCGGCGACGACAATCGACGGTGAATGGCGACAGGCAACGG
CTCCGGTGGCGGTGGCGGTGATGGTTGTTTGA
Protein sequenceShow/hide protein sequence
MRGRGPFVQDSESALQRTTFTNPKTDPSLFLLSPSVPFSIGAFITSPHRRPLLKLHFRRCLHAISSERLRRAAHIQQRLLASRAATASTVFVRPLRRRPWWCSGSSCFRL
RQLSQNIIPKFVAFDFALRASLYKCSYELLLKKIKKEDRKFSSALCVLSWYQSFKMANASLTASPTMTVGTTTFTSPPLNQLLNQITTIKLDRGNFLLWKNLAMPILRSY
KLEGHLLGTKPCPPMFIQQEGESAKGVTEATARASSSQTGASRASTPDERMIANPQYEAWVAVDQLLLGWLYNSMTSEVATQELFGVQSRAEEDFLRQTFQQSRKGNMKM
CDYLRLMKTHADNLGLAGSPVSNRNLVSQVLLGLDEEYNAIVAMIQGRASVTWAELQAELLVFEKRLELQNTVKNTVAFSQNASANMAINKGGNSPRQPTKNSNNGNRSW
THYNQRGNGYRGRGRGRGYNNYNNNNRQTCQVCGKNQQPNVFMTTQPTATPETIADPNWYADSGASNHVTSNFDNISNPTEYGGKECVIVGNGEKLPITYIGSCNLTDGS
NVLHLEHVLCVPEIAKNLVSMSKLAQDNNVYIEFHENFCLVKDKSTGQVVLKGALKDGLYQLQGLKTKNPLSTSTSNYLRQENNTGDDRNIGSVFVVSNSIPYVNVVVSK
QIWHRRLGHPAEKVLDSIVKDCNLQVKVNESSHFCESCQFGKSHVVKFSLFTGRASSRFDLVHIDLWGPAPVLSGDGYRYYILFLDDYSRYVWLYPLKQKSDALLCGELG
IHSRYSCPYTSAQNDRAERKHRHVVEMGLTLLAHASMPLHYWWDAFLAATRLVNGLPTSVLNGKSPMELLLSKKLDFTALKVFGCSCYPCLRPYQQHKFSFHTEQCVYLG
MSSSHKGYRCMNKEGRHITLSILQWFPQVELYQPNSSNPSSQQQPATPCLQNQPEYHSPISANTTPCHTPQSHTPDTDVNQNNSEPVPSSSPDTVLAPQQPDQNSELASS
SSPCPQPTHPMITRAKAGIFKPKAWLSKQQIDWALTEPTRVQDALSMPRWKTAMDNEFSALVKNDTWSLVPYDPSLNVVGNKWIFRIKRNADGSIQCYKARCGGCVATLS
RYRKSGKFTESVTTLGKLASYARYHDLKSARLRRSSSENMEKWRKKGRKHKEKKRGKKNIPATTIDGEWRQATAPVAVAVMVV