; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035476 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035476
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:22280080..22285989
RNA-Seq ExpressionLag0035476
SyntenyLag0035476
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]1.1e-16831.48Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIV-SIIEGKDGVTWHDLYSTLLTFENNLVRLN--VVNPPSGIVGEATT----------N
        + EYL  +K    SL   G  ++    +  +L GL  +Y   V S+I   D  +  ++ + L+  E+ + + N  + + PS  V  +             
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIV-SIIEGKDGVTWHDLYSTLLTFENNLVRLN--VVNPPSGIVGEATT----------N

Query:  YAYNNKMQPA---------------GGNQSNTGQNKNY-GKTNRNE-RGRG-RGRFS---PYRGN--VSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTN
        YA N++   +               GG     G N NY G++NR   RGRG +G F    P+  +    KP CQ+CGK GH  A CY+RFD  +   Q  
Subjt:  YAYNNKMQPA---------------GGNQSNTGQNKNY-GKTNRNE-RGRG-RGRFS---PYRGN--VSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTN

Query:  SGNQGG-------SNRNSAYVTTPEVVTDPSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCP-TQKPMSLNHILLVPQIT
        S            S + +  + T EV +D +W  DSGA++H+T   +NL    E+ G++ + VGNG  L + HIG S    P + KP+ LNH+L VP IT
Subjt:  SGNQGG-------SNRNSAYVTTPEVVTDPSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCP-TQKPMSLNHILLVPQIT

Query:  QNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYV
        +NLLSV++  +DN  + EFHS  CFVKD+ T+ V++ G ++DGLY              S+++L  T+ L  +   S   SSF S V     + +SST+ 
Subjt:  QNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYV

Query:  SHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRL
               D+WH+RLGHPS   +  V    NV+  +  +   +FC SC  GK H  PF  S +T ++ LE+++ DLWGP+ VLS +GY++YI FVD ++R 
Subjt:  SHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRL

Query:  THIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFS
        + IF L++K+EA+  FV +K  VE +F+ KIK  Q+DWGGEFR F ++L E GI  R  CPHT +QNG+ E KHR IVE GLTLL  AS+PL++W ++F 
Subjt:  THIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFS

Query:  SSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG----------------------
        + VYL N LPT IL++  P+E  +K  PDYS LKVFG +CFP LRPY +HK Q+ S +C FLGYS  HKGYKC+SS+G                      
Subjt:  SSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------NN------------------------------------------------
                                                          NN                                                
Subjt:  --------------------------------------------------NN------------------------------------------------

Query:  -------------------------------IDVINQF--------------------------------------------------------------
                                       +DV N F                                                              
Subjt:  -------------------------------IDVINQF--------------------------------------------------------------

Query:  -----------------------------------VKELNKEFALKDLGDLAYFLGIEACPTS-----------------------------AVTGKHIT
                                           + +LN EF+LKDLG++ YFLGI+   T+                               TG  + 
Subjt:  -----------------------------------VKELNKEFALKDLGDLAYFLGIEACPTS-----------------------------AVTGKHIT

Query:  ANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCV
          DG  + D   YRS +GALQYVT TRPE+S+ +NK+ Q++QNPT  HW+ VKR+LRYL+GTL HGL++ KS +  ++ + DADW   LDD++S +GHCV
Subjt:  ANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCV

Query:  FIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYV
        F+G +LISW SKKQ +V+RSS E EYRSLA   AEITW+++LL+E+     + P++WCDN+S + L+ANPV H+++KH+E+D++FVR+KVI KE+EVR+V
Subjt:  FIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYV

Query:  PSEKQIADGLTKPLSEFKFTELKNKLNV
        PS  Q+AD LTK +S  +F E ++KL +
Subjt:  PSEKQIADGLTKPLSEFKFTELKNKLNV

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.0e-20134.66Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN
        M +YL  MK     LKLAG PV+ S L+   L+GL++EY P+V  +  +  ++W DL + LLTFE+ + +LN +   + +   AT N A  N+    G +
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN

Query:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHIT
         +N  +    G  +R  RG GRG     RG   K  CQVCG   H A  C+HRFDK Y+    ++G+    + N A++ +   V D  W  DSGA++H+T
Subjt:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHIT

Query:  AGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGL
           +     TE+HGK+SL VGNG++L +   GSS +     K ++L+ IL VP IT+NLLSV++L  DNN  VEF  + CFVKDK T KVIL+G+LKDGL
Subjt:  AGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGL

Query:  YQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFC
        YQL                                 S  K N          S +VS     K+ WH+RLGHP+ +V+  V ++  V  P S N   SFC
Subjt:  YQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFC

Query:  DSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRP
        ++CQ+GK HLLPF+ SSS   + LE+V+ D+WGP+P+++++G+K+Y+ FVDD++R T I+PLK K+E V  F+++K L EN+F K+IKV Q D GGE++P
Subjt:  DSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRP

Query:  FANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCL
              E GI+FR  CP+TS+QNG  E KHRHI E GLTLLAQA MPL YWW+AFS++VYLIN LP+ +  N SP     +K PDY LLK FG AC+PCL
Subjt:  FANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCL

Query:  RPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG------------------------------------------------------------------
        +PY  HK Q+H+T+CVFLGYS SHKGYKCL+S G                                                                  
Subjt:  RPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG------------------------------------------------------------------

Query:  -------------NN------------IDVINQ-------------------------------------------------------------------
                     NN            +D+  Q                                                                   
Subjt:  -------------NN------------IDVINQ-------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------FVKELNKEFALKDL
                                                                                              F+K+LN  F+LKDL
Subjt:  --------------------------------------------------------------------------------------FVKELNKEFALKDL

Query:  GDLAYFLGIEA-----------------------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLH
        G L YFLGIE                              CPT  +TG+  T  +G+ + DPT++R AIG LQY+THT P+I++ +NKLSQY+ +P+  H
Subjt:  GDLAYFLGIEA-----------------------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLH

Query:  WQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDC
        WQ +KR+LRYL+GT+N+ L+I  S D  I  +SDADW  S+DD+KS++G CVF+G +LISWSS+KQ+VV+RSSTESEYR+LA  AAEI WI++LL E++ 
Subjt:  WQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDC

Query:  KDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPP
           + PI+WCDN+SA +LA+NPV H++SKH+EID+H++RD+V+  E+ V YVP+  QIAD LTKPLS  +F++L++KL V+  P
Subjt:  KDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]2.7e-18632.12Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN
        M EYL  MK  S  LKLAG P++ S LM   L+GL+AEY P+V  +  +  ++W D+ + LL FE+   RL+  N  SG+   A+ N+A   + +   GN
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN

Query:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYT--NVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSH
        + N+  N           GRG+GR       +S   CQVC   GH A  C +RFD+ YT  N  T +  QG    +SA++ +P    D  W  DSGA +H
Subjt:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYT--NVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSH

Query:  ITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKD
        +T          E++GK+SL VGNG++LK+   GS+ +N      ++L+ +L VPQIT+NLLSV++LT DNN  VEF ++ C VKDK T + +L+G LKD
Subjt:  ITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKD

Query:  GLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKIS
        GLYQL                           C+                      Y+S     K+ WH++LGHP+ +V+  V K+ NV    S +++ S
Subjt:  GLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKIS

Query:  FCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEF
        FC++CQFGK HLLPF+ SSS   + L +++ D+WGP+P+LS +G+K+Y+ F+DD++R T IFPLK K++ +  F+++K L EN+F KKIK+ Q D GGE+
Subjt:  FCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEF

Query:  RPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFP
        +       E GI+FR  CP+TS+QNG  E KHRH+ E+GLTLLAQA MPLRYWW+AFS++VYLIN LP+ +  N SP    +K+ PDY+ LK FG AC+P
Subjt:  RPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFP

Query:  CLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS------------------------------------------------------------------
        CL+PY  HK QFH+T+CVF+GYS SHKGYKC++S                                                                  
Subjt:  CLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------SGNNIDV
                                                                                                     +G+NI  
Subjt:  ---------------------------------------------------------------------------------------------SGNNIDV

Query:  INQFVKELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEIS
        +  F  +LN  ++LKDLG L YFLG+E                             ACPT  VTG+   A +G++M++PTLYR AIGALQY+T+TRP+I+
Subjt:  INQFVKELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEIS

Query:  YIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAH
        + +NKLSQY+  PT  HWQ +KR+LRYL+GT NH L+I  S +  I  + DADW  S DD+KS  G CVF+G +L+SW+S+KQ+VV+RSSTESEYRSLA 
Subjt:  YIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAH

Query:  TAAEITWIQ-------------------ALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTK
          AE++                       LL E+    ++ P++WCDN+SA +LA+NPV H++SKH+EID+H++RD+V+  ++ + YVP+  QIAD LTK
Subjt:  TAAEITWIQ-------------------ALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTK

Query:  PLSEFKFTELKNKLNVVAPP
        PL   +F  +++KL V   P
Subjt:  PLSEFKFTELKNKLNVVAPP

RVW39271.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.6e-16831.88Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIE-GKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGG
        M +Y+  +K  + SL   GEPV+    +  +L GL ++Y  +V+ I   +D ++   ++S LL FE  L +   +      +   + NYA ++  +  GG
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIE-GKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGG

Query:  NQSNTGQNKNYGKTNRNERGRGRGRFSPYRG-----NVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNS---GNQGGSNRNSAYVTTPEVVTDPSWLV
         + N G+  N+  TN N RGRGRG      G     +  +P CQ+CGKFGH+  +CYHRFD  + + Q N+    N G SN   A V +   + D +W +
Subjt:  NQSNTGQNKNYGKTNRNERGRGRGRFSPYRG-----NVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNS---GNQGGSNRNSAYVTTPEVVTDPSWLV

Query:  DSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIG-SSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKV
        DSGA+ H+T  + NL   T Y G D +T+GNG  L +S+ G + L + P      L  +  VP I+ NL+SVA+   DNN  +EFHS+  FVKD  T++V
Subjt:  DSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIG-SSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKV

Query:  ILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFP
        + QG L++GLY  K P++  +      ++  ST                          F  ST  + R    ++WH RLGH +  +++ +  N NV   
Subjt:  ILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFP

Query:  SSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVF
        S    K + C SCQ  KSH LP   SS   S+ LE+VY D+WGP+ V ST+G K++I FVDDY+R T ++ L+SK +A+  F ++K  VEN+F+ KIK  
Subjt:  SSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVF

Query:  QSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLK
        QSD GGEFR F +FL+E GI  R  CP+ S QNG VE KHRH+VE GL LLA A +PL++W  AF ++ +LIN +P+ +L N SP    +K+ PDY  L+
Subjt:  QSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLK

Query:  VFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS----------------------------------------------------------
        VFG  C+P +RPY +HK Q+ S KCVFLGYS  HKGY CL +                                                          
Subjt:  VFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------SGNNIDVINQFVKELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHITANDGQ
                           +G++  +I Q + +L+  FAL+DLG L+YFLGIE                             A  T    G  ++  DG 
Subjt:  -------------------SGNNIDVINQFVKELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHITANDGQ

Query:  VMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGAS
        +M+D T+YRS +GALQY T TR +I++ +NK  Q++  PT+ HW +VKR+LRYLKGT  HGL +  S    + AY+DADWG   DD++S +G+ V++G +
Subjt:  VMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGAS

Query:  LISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQ
        L+SW++ KQ+VV+ SS ESEYR LA   AEI W QALL+E+       P ++ DN+SA  +A NPVFH+++KH+EID+HF+RD+V+  +++++Y+PS  Q
Subjt:  LISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQ

Query:  IADGLTKPLSEFKFTELKNKLNVVAPPLSLSGAPPP
         AD LTK L+  +F  L++ L +V  P SL G   P
Subjt:  IADGLTKPLSEFKFTELKNKLNVVAPPLSLSGAPPP

RVW52450.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.7e-16832.46Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIE-GKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGG
        M +Y+  +K  + SL   GE V+    +  +L GL ++Y  +V+ I   +D ++   ++S LL FE  L +   +      +   + NYA ++  +  GG
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIE-GKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGG

Query:  NQSNTGQNKNYGKTNRNERGRGRGRFSPYRG-----NVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNS---GNQGGSNRNSAYVTTPEVVTDPSWLV
         + N G+  N+  TN N R RGRG      G     +  +P CQ+CGKFGH+  +CYHRFD  + + Q N+    N G SN   A V +   + D +W +
Subjt:  NQSNTGQNKNYGKTNRNERGRGRGRFSPYRG-----NVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNS---GNQGGSNRNSAYVTTPEVVTDPSWLV

Query:  DSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIG-SSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKV
        D GA+ H+T  + NL   T Y G D +T+GNG  L +S+ G + L + P      L  +  VP I+ NL+SVA+   DNN  +EFHS+  FVKD  T++V
Subjt:  DSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIG-SSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKV

Query:  ILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFP
        + QG L++GLY  K P++  +      ++  ST                          F  ST  + R    ++WH RL                    
Subjt:  ILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFP

Query:  SSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVF
               + C SCQ  KSH LP   SS   S+ LE+VY D+WGP+ V ST+G K++I FVDDY+R T ++ L+SK +A+  F ++K  VEN+F+ KIK  
Subjt:  SSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVF

Query:  QSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLK
        QSD GGEFR F +FL+E GI  R  CP+ S QNG VE KHRH+VE GL LLA A +PL++W  AF ++ +LIN +P+ +L N SP    +K+ PDY  L+
Subjt:  QSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLK

Query:  VFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS----------------------------------------------------------
        VFG  C+P ++PY +HK Q+ S KCVFLGYS  HKGY CL +                                                          
Subjt:  VFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------SGNNIDVINQFVKELNKEFALKDLGDLAYFL
                                                                             +G++  +I Q + +L+  FAL+DLG L+YFL
Subjt:  ---------------------------------------------------------------------SGNNIDVINQFVKELNKEFALKDLGDLAYFL

Query:  GIE-----------------------------ACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRV
        GIE                             A  T    G  ++  DG +M+D T+YRS +GALQY T TRP+I++ +NK  Q++  PT+ HW +VKR+
Subjt:  GIE-----------------------------ACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRV

Query:  LRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPI
        LRYLKGT  HGL++  S    I AY+DADWG   DD++S +G+ V++G +L+SW++ KQ+VV+RSS ESEYR LA   AEI W QALL+E+       P 
Subjt:  LRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPI

Query:  IWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSGAPPPPLR
        ++ DN+SA  +A NPVFH+++KH+EID+HF+RD+V+  +++++Y+PS  Q AD LTK L+  +F  L++ L +V  P SL G   P ++
Subjt:  IWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSGAPPPPLR

TrEMBL top hitse value%identityAlignment
A0A2Z6MBG6 Integrase catalytic domain-containing protein5.0e-20234.66Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN
        M +YL  MK     LKLAG PV+ S L+   L+GL++EY P+V  +  +  ++W DL + LLTFE+ + +LN +   + +   AT N A  N+    G +
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN

Query:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHIT
         +N  +    G  +R  RG GRG     RG   K  CQVCG   H A  C+HRFDK Y+    ++G+    + N A++ +   V D  W  DSGA++H+T
Subjt:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHIT

Query:  AGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGL
           +     TE+HGK+SL VGNG++L +   GSS +     K ++L+ IL VP IT+NLLSV++L  DNN  VEF  + CFVKDK T KVIL+G+LKDGL
Subjt:  AGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGL

Query:  YQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFC
        YQL                                 S  K N          S +VS     K+ WH+RLGHP+ +V+  V ++  V  P S N   SFC
Subjt:  YQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFC

Query:  DSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRP
        ++CQ+GK HLLPF+ SSS   + LE+V+ D+WGP+P+++++G+K+Y+ FVDD++R T I+PLK K+E V  F+++K L EN+F K+IKV Q D GGE++P
Subjt:  DSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRP

Query:  FANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCL
              E GI+FR  CP+TS+QNG  E KHRHI E GLTLLAQA MPL YWW+AFS++VYLIN LP+ +  N SP     +K PDY LLK FG AC+PCL
Subjt:  FANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCL

Query:  RPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG------------------------------------------------------------------
        +PY  HK Q+H+T+CVFLGYS SHKGYKCL+S G                                                                  
Subjt:  RPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG------------------------------------------------------------------

Query:  -------------NN------------IDVINQ-------------------------------------------------------------------
                     NN            +D+  Q                                                                   
Subjt:  -------------NN------------IDVINQ-------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------FVKELNKEFALKDL
                                                                                              F+K+LN  F+LKDL
Subjt:  --------------------------------------------------------------------------------------FVKELNKEFALKDL

Query:  GDLAYFLGIEA-----------------------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLH
        G L YFLGIE                              CPT  +TG+  T  +G+ + DPT++R AIG LQY+THT P+I++ +NKLSQY+ +P+  H
Subjt:  GDLAYFLGIEA-----------------------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLH

Query:  WQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDC
        WQ +KR+LRYL+GT+N+ L+I  S D  I  +SDADW  S+DD+KS++G CVF+G +LISWSS+KQ+VV+RSSTESEYR+LA  AAEI WI++LL E++ 
Subjt:  WQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDC

Query:  KDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPP
           + PI+WCDN+SA +LA+NPV H++SKH+EID+H++RD+V+  E+ V YVP+  QIAD LTKPLS  +F++L++KL V+  P
Subjt:  KDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPP

A0A2Z6P4D5 Integrase catalytic domain-containing protein1.3e-18632.12Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN
        M EYL  MK  S  LKLAG P++ S LM   L+GL+AEY P+V  +  +  ++W D+ + LL FE+   RL+  N  SG+   A+ N+A   + +   GN
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN

Query:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYT--NVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSH
        + N+  N           GRG+GR       +S   CQVC   GH A  C +RFD+ YT  N  T +  QG    +SA++ +P    D  W  DSGA +H
Subjt:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYT--NVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSH

Query:  ITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKD
        +T          E++GK+SL VGNG++LK+   GS+ +N      ++L+ +L VPQIT+NLLSV++LT DNN  VEF ++ C VKDK T + +L+G LKD
Subjt:  ITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKD

Query:  GLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKIS
        GLYQL                           C+                      Y+S     K+ WH++LGHP+ +V+  V K+ NV    S +++ S
Subjt:  GLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKIS

Query:  FCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEF
        FC++CQFGK HLLPF+ SSS   + L +++ D+WGP+P+LS +G+K+Y+ F+DD++R T IFPLK K++ +  F+++K L EN+F KKIK+ Q D GGE+
Subjt:  FCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEF

Query:  RPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFP
        +       E GI+FR  CP+TS+QNG  E KHRH+ E+GLTLLAQA MPLRYWW+AFS++VYLIN LP+ +  N SP    +K+ PDY+ LK FG AC+P
Subjt:  RPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFP

Query:  CLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS------------------------------------------------------------------
        CL+PY  HK QFH+T+CVF+GYS SHKGYKC++S                                                                  
Subjt:  CLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSS------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------SGNNIDV
                                                                                                     +G+NI  
Subjt:  ---------------------------------------------------------------------------------------------SGNNIDV

Query:  INQFVKELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEIS
        +  F  +LN  ++LKDLG L YFLG+E                             ACPT  VTG+   A +G++M++PTLYR AIGALQY+T+TRP+I+
Subjt:  INQFVKELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEIS

Query:  YIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAH
        + +NKLSQY+  PT  HWQ +KR+LRYL+GT NH L+I  S +  I  + DADW  S DD+KS  G CVF+G +L+SW+S+KQ+VV+RSSTESEYRSLA 
Subjt:  YIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAH

Query:  TAAEITWIQ-------------------ALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTK
          AE++                       LL E+    ++ P++WCDN+SA +LA+NPV H++SKH+EID+H++RD+V+  ++ + YVP+  QIAD LTK
Subjt:  TAAEITWIQ-------------------ALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTK

Query:  PLSEFKFTELKNKLNVVAPP
        PL   +F  +++KL V   P
Subjt:  PLSEFKFTELKNKLNVVAPP

A0A803PEH4 Uncharacterized protein5.5e-17733.07Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLN---------VVNPPSGIVGEATTNYAYN
        MSEYL   K  S  L LAG+P   ++L++ VL GL+AEYL IV  IE +   TW +L   LL+F++ + RL            + P   +   T N    
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLN---------VVNPPSGIVGEATTNYAYN

Query:  NKMQPAGGNQSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTN---SGNQGG--SNRNSAYVTTPEVVTD
           Q    + ++ G   N   T+   RGRGRG      G+ S+PTCQV GK+GH+AA+CY+RFD+ Y     N   + N+ G  +N +SA+V TPEV+  
Subjt:  NKMQPAGGNQSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTN---SGNQGG--SNRNSAYVTTPEVVTD

Query:  PSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKA
         +W  DSGA++HIT+   NL  K +Y+GK+S+ VGNG +L+++HIG+  +N  +   + L  +LLVP+I +NL+SV++L  DNN  +EF+S+ C VKDK 
Subjt:  PSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKA

Query:  TRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSN
        T+KV+L G+LKD LYQL  P               S+   + ++ LS++  S  SNV    +S   S  +S      DV H+RLGHPS +V++ V ++ N
Subjt:  TRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSN

Query:  VSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKK
        VS   S N   + CD+CQ+GK+H LPFR S++    +L++++ DLWGP+P+ S   + +YI FVDDY+R T ++PLK K++A++ F+++K LVEN+F KK
Subjt:  VSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKK

Query:  IKVFQSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDY
        IK  +SD GGE++PF + ++  GIEF+HPCPHTS QNG  + KHRH VEMGLTLLAQA+                                         
Subjt:  IKVFQSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDY

Query:  SLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG----------------------------------------------------
                 CFPCLR YQSHKFQFHS KCV LGYS S+KGYKCLS +G                                                    
Subjt:  SLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------NNIDVINQFVKELNKEFALKDLGDLAYFLGIEA------------------
                                                         N+   +  F+ ELNK+F LKD+G L YFLGIEA                  
Subjt:  -------------------------------------------------NNIDVINQFVKELNKEFALKDLGDLAYFLGIEA------------------

Query:  ------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQ
                    CP     GK ++  DG+ M +PTLYRS +GALQY+ HTRP+I+Y                            GT   GL+I  SD   
Subjt:  ------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQ

Query:  IVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKS
        I  +SDADW C  DD +S+AG+CV++G +L+SWSSKKQ VV+RSSTESEYR+LAH AAEI+WI++LL E++   +   + W DN+SA +LAANPVFH+++
Subjt:  IVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKS

Query:  KHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG
        KH+EID H+VRDKV+ K++E+RY+PS  QIAD LTK LS  +F  L +KL V   P  L G
Subjt:  KHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG

A0A803QCY3 Uncharacterized protein2.0e-18736.2Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN
        M EYL   K  + SL LAGEP   + L + VLS L+  YL +V  I+ +   +W +L   LL+FE+ + RL                           GN
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGN

Query:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNSGNQG--GSNRN--SAYVTTPEVVTDPSWLVDSGAT
         ++ G++ N G   R+   RGRG     R N SKPTCQVCGK+ HSA +CY+ FD  Y     +S NQ   G N N  SA++ TPE +   +W  DSGA+
Subjt:  QSNTGQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNSGNQG--GSNRN--SAYVTTPEVVTDPSWLVDSGAT

Query:  SHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGIL
        ++ITA    +  K EY GK+ +TVGNGD+L +SH G+  +   T + + LN +LLVP I +N LSV++LT DN+  +EFHS+ CFVKD ATR+V+LQG+L
Subjt:  SHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGIL

Query:  KDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEK
        KDGLYQL+ P                      ++ L    S F  +   PF        V H                                      
Subjt:  KDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEK

Query:  ISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGG
          FCD+CQ+GKSH LPF+ S+S   ++L++V+ DLWGPSP+ S   +K+Y+ FVDD TR T I+PLK+K+EA   F+ +K L EN+FE+KIK  ++D GG
Subjt:  ISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGG

Query:  EFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSAC
        E++  ++F+   GI F H CPHTS QNG  E KHRHIVEMGLTLLAQ+ MPL+YWWDAFS++VYLIN LPTPIL++ +P E  +KK PDY  LK FG AC
Subjt:  EFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSAC

Query:  FPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG----------------------NN--------------------------------------
        FPCLRPYQ+HKFQFHS KCV LGYS +HKGYKCLS +G                      NN                                      
Subjt:  FPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSG----------------------NN--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------IDVIN-------------------------QFVKELNKEF---------------------ALKDLGDLAYFLG
                                  +D+ N                          FV +LNK                        ++DL     F+ 
Subjt:  --------------------------IDVIN-------------------------QFVKELNKEF---------------------ALKDLGDLAYFLG

Query:  IEACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADW
        +++CPT A+ GK ++ NDG+ + D T Y+S IG LQY+ HTRP+I+Y +NKL                                            + DW
Subjt:  IEACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADW

Query:  GCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHF
         C  DD+KS+AG+CV++G +L+SWSSKKQ VV+RSSTESEYR+L H +A+I+WI++LL EI    ++T + WCDN+ A +LA+NPVFH++ KH+EIDIHF
Subjt:  GCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHF

Query:  VRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG
        VRDKV+ KE+EVRY+PS  Q+AD LTK L+  +F  L +K+  +  PL L G
Subjt:  VRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG

A0A803QD60 Uncharacterized protein1.5e-17741Show/hide
Query:  SGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVAR
        + N    N +SA+V  PE++   +W  DSGA++HIT+   ++  K EY GK+ +TVGNG +L +SH+ + ++   + + + L  +LLVP+I +NLLSV++
Subjt:  SGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVAR

Query:  LTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSK-
        LT DNN  +EF+S  C VKDK T+KV+LQG+L+DGLYQL+ P        +   +  ST  ++S    S    SF S VV   KS  + + +    VSK 
Subjt:  LTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSK-

Query:  DVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLK
        DVW +RLGHPS RV+S V  +  V    S NE + FCD+CQFGKSH LPF+ S+S  + +L++V+ DLWGPSPV+S  G++ YI F+DD TR T I+PLK
Subjt:  DVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLK

Query:  SKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLIN
        +KAEA   F+++K   E + E+KIK  ++D GGE++   N + E GI+F H CPHTS QNG  + KHRH+VEMGLTLLAQA MPL+YWW+ F ++VYLIN
Subjt:  SKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLIN

Query:  CLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSGNNIDVINQFVKELNKEFALKDLGD-----
         LPTPIL N SP E  Y K PDY+ LKVFG ACFPC+RPYQ+HKFQFHS K V LGYS SHKGY+CL+ +G      N    EL   F L  L +     
Subjt:  CLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSGNNIDVINQFVKELNKEFALKDLGD-----

Query:  -------------------------LAYFLGI----EACPT----------------SAVTGKHITAND---GQVMNDPTLYRSAIGALQYVTHT-----
                                 L   LG      + PT                ++V+G  ++ N+      + +P  +    G +   + +     
Subjt:  -------------------------LAYFLGI----EACPT----------------SAVTGKHITAND---GQVMNDPTLYRSAIGALQYVTHT-----

Query:  RPEISYIINKLSQYLQNPTTL----HWQAVKRVLRYLKGTLNH-------------------------------------------------GLYIPK--
         P   +++ +  + +  P        W+ V +V   ++  L+H                                                  LY+ +  
Subjt:  RPEISYIINKLSQYLQNPTTL----HWQAVKRVLRYLKGTLNH-------------------------------------------------GLYIPK--

Query:  -----------------------------SDDSQ---IVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWI
                                     SD  Q   I  YSDADW C  DD+KSIAG+CV+ G +L+ WSSKKQ VVARSSTESEYR+LAH AAEI+WI
Subjt:  -----------------------------SDDSQ---IVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWI

Query:  QALLNEIDCKDIQTP--IIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG
        ++LL E+    +QTP  + WCDN+ A +LA+NPVFH+++KH+EIDIHF+RDKV+ KE+++RY+PS  Q+AD LTK LS  +F  L +KL V+  PL L G
Subjt:  QALLNEIDCKDIQTP--IIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.8e-3934.54Show/hide
Query:  INQFVKELNKEFALKDLGDLAYFLGIEA--------CPTSAVTGKHIT----------------------ANDGQVMNDPTLYRSAIGALQYV-THTRPE
        +N F + L ++F + DL ++ +F+GI             SA   K ++                       N  +  N P   RS IG L Y+   TRP+
Subjt:  INQFVKELNKEFALKDLGDLAYFLGIEA--------CPTSAVTGKHIT----------------------ANDGQVMNDPTLYRSAIGALQYV-THTRPE

Query:  ISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPK--SDDSQIVAYSDADWGCSLDDKKSIAGHCV-FIGASLISWSSKKQRVVARSSTESEY
        ++  +N LS+Y     +  WQ +KRVLRYLKGT++  L   K  + +++I+ Y D+DW  S  D+KS  G+       +LI W++K+Q  VA SSTE+EY
Subjt:  ISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPK--SDDSQIVAYSDADWGCSLDDKKSIAGHCV-FIGASLISWSSKKQRVVARSSTESEY

Query:  RSLAHTAAEITWIQALLNEIDCKDIQTPI-IWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNK
         +L     E  W++ LL  I+ K ++ PI I+ DN   +S+A NP  H ++KH++I  HF R++V    I + Y+P+E Q+AD  TKPL   +F EL++K
Subjt:  RSLAHTAAEITWIQALLNEIDCKDIQTPI-IWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNK

Query:  LNVV
        L ++
Subjt:  LNVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-5020.5Show/hide
Query:  EATTNYAYNNKMQPAGGNQSNT----GQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYH-RFDKEYTNVQTNSGNQGGSNRNS--
        + T+    N KM+    NQ       G+ ++Y +++ N    G    S  R       C  C + GH    C + R  K  T+ Q N  N     +N+  
Subjt:  EATTNYAYNNKMQPAGGNQSNT----GQNKNYGKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYH-RFDKEYTNVQTNSGNQGGSNRNS--

Query:  --AYVTTPEVV-----TDPSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQD
           ++   E        +  W+VD+ A+ H T  +++L  +       ++ +GN    K++ IG   I       + L  +  VP +  NL+S   L +D
Subjt:  --AYVTTPEVV-----TDPSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQD

Query:  NNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQ
           Y  + ++  +   K +  VI +G+ +  LY+    +   +L                                            +  ++S D+WH+
Subjt:  NNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQ

Query:  RLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEA
        R+GH SE+ + +++K S +S+   T  K   CD C FGK H + F+ SS     IL++VY D+ GP  + S  G K++++F+DD +R   ++ LK+K + 
Subjt:  RLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEA

Query:  VSKFVEYKKLVENKFEKKIKVFQSDWGGEF--RPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLP
           F ++  LVE +  +K+K  +SD GGE+  R F  +    GI      P T + NG+ E  +R IVE   ++L  A +P  +W +A  ++ YLIN  P
Subjt:  VSKFVEYKKLVENKFEKKIKVFQSDWGGEF--RPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLP

Query:  TPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYK--------------------------------------
        +  L    P      K   YS LKVFG   F  +   Q  K    S  C+F+GY     GY+                                      
Subjt:  TPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYK--------------------------------------

Query:  ---CLSSSGNN-----------------------------------------------------------------------------------------
            + S+ NN                                                                                         
Subjt:  ---CLSSSGNN-----------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------IDV--
                                                                                                       +DV  
Subjt:  -----------------------------------------------------------------------------------------------IDV--

Query:  ------------------------------------------------INQFVK----------------------------------------------
                                                         + F+K                                              
Subjt:  ------------------------------------------------INQFVK----------------------------------------------

Query:  --ELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHI----------TANDGQVMNDPTLYRSAIGALQY-VT
          +L+K F +KDLG     LG++                             A P S     H+              G +   P  Y SA+G+L Y + 
Subjt:  --ELNKEFALKDLGDLAYFLGIE-----------------------------ACPTSAVTGKHI----------TANDGQVMNDPTLYRSAIGALQY-VT

Query:  HTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTES
         TRP+I++ +  +S++L+NP   HW+AVK +LRYL+GT    L    SD   +  Y+DAD    +D++KS  G+        ISW SK Q+ VA S+TE+
Subjt:  HTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTES

Query:  EYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKF
        EY +   T  E+ W++  L E+     +  +++CD+ SA+ L+ N ++H+++KH+++  H++R+ V  + ++V  + + +  AD LTK +   KF
Subjt:  EYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKF

P92519 Uncharacterized mitochondrial protein AtMg008109.8e-3838.89Show/hide
Query:  SGNNIDVINQFVKELNKEFALKDLGDLAYFLGIEACPTSAVTGKHITANDGQVMN----------------------------DPTLYRSAIGALQYVTH
        +G++  ++N  + +L+  F++KDLG + YFLGI+     +      T    Q++N                            DP+ +RS +GALQY+T 
Subjt:  SGNNIDVINQFVKELNKEFALKDLGDLAYFLGIEACPTSAVTGKHITANDGQVMN----------------------------DPTLYRSAIGALQYVTH

Query:  TRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESE
        TRP+ISY +N + Q +  PT   +  +KRVLRY+KGT+ HGLYI K+    + A+ D+DW      ++S  G C F+G ++ISWS+K+Q  V+RSSTE+E
Subjt:  TRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESE

Query:  YRSLAHTAAEITWIQA
        YR+LA TAAE+TW  A
Subjt:  YRSLAHTAAEITWIQA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-11826.22Show/hide
Query:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKD-GVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGG
        + +Y+  +      L L G+P+     +  VL  L  EY P++  I  KD   T  +++  LL  E+ ++ ++     S  V   T N A +++      
Subjt:  MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKD-GVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGG

Query:  NQSNTGQNKNYGKTNRNERGR----GRGRFSPYRGNVSKP---TCQVCGKFGHSAAIC--YHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWL
        N +N  +N  Y   N N   +        F P   N SKP    CQ+CG  GHSA  C     F     + Q  S       R +  + +P   +  +WL
Subjt:  NQSNTGQNKNYGKTNRNERGR----GRGRFSPYRGNVSKP---TCQVCGKFGHSAAIC--YHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWL

Query:  VDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKV
        +DSGAT HIT+   NL++   Y G D + V +G  + +SH GS+ ++  + +P++L++IL VP I +NL+SV RL   N   VEF  +   VKD  T   
Subjt:  VDSGATSHITAGLQNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKV

Query:  ILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFP
        +LQG  KD LY+  +    P  +  S    PS++   S+                                    WH RLGHP+  +++ V  N ++S  
Subjt:  ILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFP

Query:  SSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVF
        + +++ +S C  C   KS+ +PF +S+  +++ LE +Y D+W  SP+LS   Y++Y+ FVD +TR T ++PLK K++    F+ +K L+EN+F+ +I  F
Subjt:  SSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVF

Query:  QSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLK
         SD GGEF     +  + GI      PHT E NG+ E KHRHIVE GLTLL+ AS+P  YW  AF+ +VYLIN LPTP+L   SP +K +  +P+Y  L+
Subjt:  QSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLK

Query:  VFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCL------------------------------------------------------------
        VFG AC+P LRPY  HK    S +CVFLGYS +   Y CL                                                            
Subjt:  VFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCL------------------------------------------------------------

Query:  -------------------------------------------------------------------SSSGNN---------------------------
                                                                           ++S NN                           
Subjt:  -------------------------------------------------------------------SSSGNN---------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------IDVINQFVK-----------------
                                                                                  +DV N F++                 
Subjt:  --------------------------------------------------------------------------IDVINQFVK-----------------

Query:  --------------------------------------------------------------------------------ELNKEFALKDLGDLAYFLGI
                                                                                         L++ F++KD  +L YFLGI
Subjt:  --------------------------------------------------------------------------------ELNKEFALKDLGDLAYFLGI

Query:  EA-------------------CPTSAVTGKHITA----------NDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLR
        EA                     T+ +T K +T             G  + DPT YR  +G+LQY+  TRP+ISY +N+LSQ++  PT  H QA+KR+LR
Subjt:  EA-------------------CPTSAVTGKHITA----------NDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLR

Query:  YLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIW
        YL GT NHG+++ K +   + AYSDADW    DD  S  G+ V++G   ISWSSKKQ+ V RSSTE+EYRS+A+T++E+ WI +LL E+  +  + P+I+
Subjt:  YLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIW

Query:  CDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLS
        CDNV A  L ANPVFHS+ KH+ ID HF+R++V +  + V +V +  Q+AD LTKPLS   F    +K+ V   P S
Subjt:  CDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-12126.82Show/hide
Query:  LKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKD-GVTWHDLYSTLLTFENNLVRLN---VVNPPSGIVGEATTNYAYNNKMQPAGGNQSNTGQNKNY
        L L G+P+     +  VL  L  +Y P++  I  KD   +  +++  L+  E+ L+ LN   VV   + +V    TN            NQ+N G N+NY
Subjt:  LKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKD-GVTWHDLYSTLLTFENNLVRLN---VVNPPSGIVGEATTNYAYNNKMQPAGGNQSNTGQNKNY

Query:  GKTNRNERGRGRGRFSPYRGNVSKP-----TCQVCGKFGHSAAIC--YHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHITAGL
           N N     +   S  R +  +P      CQ+C   GHSA  C   H+F       Q+ S       R +  V +P      +WL+DSGAT HIT+  
Subjt:  GKTNRNERGRGRGRFSPYRGNVSKP-----TCQVCGKFGHSAAIC--YHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHITAGL

Query:  QNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPT-QKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQ
         NL+    Y G D + + +G  + ++H GS+  + PT  + + LN +L VP I +NL+SV RL   N   VEF  +   VKD  T   +LQG  KD LY+
Subjt:  QNLNMKTEYHGKDSLTVGNGDRLKVSHIGSSLINCPT-QKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQ

Query:  LKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDS
           P+   Q      VS+ ++ C K+T                                    WH RLGHPS  +++ V  N ++   + +++ +S C  
Subjt:  LKLPLLGPQLIHRSNVSLPSTRCLKSTSCLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDS

Query:  CQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRPFA
        C   KSH +PF  S+ T+S+ LE +Y D+W  SP+LS   Y++Y+ FVD +TR T ++PLK K++    F+ +K LVEN+F+ +I    SD GGEF    
Subjt:  CQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLSTAGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRPFA

Query:  NFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRP
        ++L + GI      PHT E NG+ E KHRHIVEMGLTLL+ AS+P  YW  AFS +VYLIN LPTP+L   SP +K + + P+Y  LKVFG AC+P LRP
Subjt:  NFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRYWWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRP

Query:  YQSHKFQFHSTKCVFLGYSGSHKGYKCL------------------------------------------------------------------------
        Y  HK +  S +C F+GYS +   Y CL                                                                        
Subjt:  YQSHKFQFHSTKCVFLGYSGSHKGYKCL------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  SSS--------------------------------------------------------------------------------GNN--------------
        SSS                                                                                GN+              
Subjt:  SSS--------------------------------------------------------------------------------GNN--------------

Query:  ------------------------------------------------------------------IDVINQFVK-------------------------
                                                                          +DV N F++                         
Subjt:  ------------------------------------------------------------------IDVINQFVK-------------------------

Query:  ------------------------------------------------------------------------ELNKEFALKDLGDLAYFLGIEA------
                                                                                 L++ F++K+  DL YFLGIEA      
Subjt:  ------------------------------------------------------------------------ELNKEFALKDLGDLAYFLGIEA------

Query:  -----------------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNH
                                 T   T   +T + G  + DPT YR  +G+LQY+  TRP++SY +N+LSQY+  PT  HW A+KRVLRYL GT +H
Subjt:  -----------------------CPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNH

Query:  GLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALS
        G+++ K +   + AYSDADW    DD  S  G+ V++G   ISWSSKKQ+ V RSSTE+EYRS+A+T++E+ WI +LL E+  +    P+I+CDNV A  
Subjt:  GLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALS

Query:  LAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG
        L ANPVFHS+ KH+ +D HF+R++V +  + V +V +  Q+AD LTKPLS   F     K+ V+  P S  G
Subjt:  LAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-4534.6Show/hide
Query:  NNIDVINQFVKELNKEFALKDLGDLAYFLGIEAC---------------------------PTSAVTGKHIT--ANDGQVMNDPTLYRSAIGALQYVTHT
        NN   +++   +L   F L+DLG L YFLG+E                             P+S      +T  A+ G    D   YR  IG L Y+  T
Subjt:  NNIDVINQFVKELNKEFALKDLGDLAYFLGIEAC---------------------------PTSAVTGKHIT--ANDGQVMNDPTLYRSAIGALQYVTHT

Query:  RPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEY
        R +IS+ +NKLSQ+ + P   H QAV ++L Y+KGT+  GL+     + Q+  +SDA +    D ++S  G+C+F+G SLISW SKKQ+VV++SS E+EY
Subjt:  RPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESEY

Query:  RSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLS
        R+L+    E+ W+     E+     +  +++CDN +A+ +A N VFH ++KH+E D H VR++ +  +  + Y        DG T+ LS
Subjt:  RSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVPSEKQIADGLTKPLS

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.9e-1338.27Show/hide
Query:  YVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFI
        Y+T TRP++++ +N+LSQ+     T   QAV +VL Y+KGT+  GL+   + D Q+ A++D+DW    D ++S+ G C  +
Subjt:  YVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFI

ATMG00300.1 Gag-Pol-related retrotransposon family protein4.9e-0840.79Show/hide
Query:  VWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWG-PSPVLS
        +WH RL H S+R M L+ K   +   SS    + FC+ C +GK+H + F     TT   L+ V+ DLWG PS  LS
Subjt:  VWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWG-PSPVLS

ATMG00810.1 DNA/RNA polymerases superfamily protein7.0e-3938.89Show/hide
Query:  SGNNIDVINQFVKELNKEFALKDLGDLAYFLGIEACPTSAVTGKHITANDGQVMN----------------------------DPTLYRSAIGALQYVTH
        +G++  ++N  + +L+  F++KDLG + YFLGI+     +      T    Q++N                            DP+ +RS +GALQY+T 
Subjt:  SGNNIDVINQFVKELNKEFALKDLGDLAYFLGIEACPTSAVTGKHITANDGQVMN----------------------------DPTLYRSAIGALQYVTH

Query:  TRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESE
        TRP+ISY +N + Q +  PT   +  +KRVLRY+KGT+ HGLYI K+    + A+ D+DW      ++S  G C F+G ++ISWS+K+Q  V+RSSTE+E
Subjt:  TRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDDKKSIAGHCVFIGASLISWSSKKQRVVARSSTESE

Query:  YRSLAHTAAEITWIQA
        YR+LA TAAE+TW  A
Subjt:  YRSLAHTAAEITWIQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGAATATCTAGGAATCATGAAACAAACCTCCAAAAGTCTAAAACTTGCTGGTGAACCAGTTACTTTCAGTTATCTAATGTCTTGTGTGCTTTCAGGATTGGAAGC
AGAGTATTTACCGATTGTCAGCATTATTGAAGGCAAGGATGGCGTCACTTGGCATGATCTTTATTCTACACTTCTTACTTTTGAGAATAACTTGGTGCGGTTGAATGTTG
TTAATCCTCCAAGTGGAATTGTTGGAGAAGCCACAACCAATTATGCCTACAACAACAAAATGCAACCTGCTGGTGGTAACCAAAGCAATACTGGACAGAACAAAAATTAT
GGGAAAACTAATCGCAATGAGAGAGGAAGAGGCCGTGGAAGATTCTCTCCATATCGAGGTAATGTGTCCAAACCAACATGTCAAGTTTGTGGAAAGTTTGGACACTCTGC
TGCTATTTGTTACCATAGGTTTGACAAAGAATATACCAATGTGCAAACTAATTCTGGTAACCAAGGAGGAAGCAACAGAAATTCAGCCTATGTAACAACTCCGGAGGTGG
TTACAGACCCTAGTTGGTTGGTTGACAGTGGGGCTACCAGTCACATCACTGCTGGTCTTCAGAATCTGAACATGAAAACTGAATATCATGGTAAGGATTCTCTAACAGTA
GGTAATGGAGATAGACTTAAAGTTTCTCATATAGGCTCTAGTTTGATCAATTGTCCTACCCAAAAACCTATGTCTTTAAATCACATACTTCTTGTCCCTCAAATTACTCA
AAACCTTCTTAGTGTTGCTCGTTTAACTCAAGATAATAATTGTTATGTTGAATTTCATTCATCTCATTGTTTTGTCAAGGACAAGGCAACAAGGAAAGTAATATTACAGG
GGATACTTAAGGATGGACTCTACCAGTTGAAGCTTCCTCTCCTAGGGCCTCAACTCATTCACAGAAGCAATGTTTCTTTGCCCAGCACAAGATGTTTGAAGTCTACTAGT
TGTCTATCTTCCTATGTTAGTAGTTTCAAGTCAAATGTTGTTTTTCCTTTCAAGTCCTTTGTTTCTTCTACTTATGTAAGTCATAGACAAGTCTCAAAAGATGTGTGGCA
CCAAAGATTAGGTCATCCATCTGAGAGAGTTATGTCTTTAGTTTCAAAAAACAGCAATGTATCTTTTCCTTCTAGTACTAATGAAAAAATAAGCTTTTGTGACTCATGTC
AATTTGGGAAATCTCATCTCCTGCCCTTTCGAAAGTCTAGTTCAACAACAAGTCAAATTCTCGAAATTGTTTATTGTGACTTATGGGGTCCTTCACCAGTCTTGTCAACT
GCTGGATATAAATTCTATATAAGTTTTGTTGATGATTATACTCGATTAACCCACATTTTTCCTCTCAAAAGCAAAGCAGAAGCCGTATCCAAATTTGTTGAGTATAAGAA
ATTAGTAGAAAACAAGTTTGAGAAGAAGATTAAAGTATTTCAAAGTGATTGGGGAGGTGAGTTTAGACCATTTGCCAATTTTCTAAGAGAGTGTGGGATTGAGTTTAGAC
ATCCTTGTCCTCATACAAGTGAGCAAAACGGAATTGTTGAAAATAAGCATAGACACATTGTTGAAATGGGTCTAACTTTATTAGCTCAAGCTTCTATGCCCTTGAGATAT
TGGTGGGATGCTTTTAGTTCATCAGTTTATCTTATAAATTGTTTACCAACACCTATTCTTAACAATATTTCTCCTCTAGAAAAGGCATATAAGAAGACACCTGATTACTC
TCTACTTAAAGTCTTTGGAAGTGCATGTTTTCCTTGTCTAAGGCCATATCAATCTCACAAATTCCAGTTTCATAGCACTAAATGTGTATTCTTAGGCTACAGTGGTTCAC
ACAAAGGCTATAAATGTCTTAGCTCAAGTGGAAACAACATCGATGTTATCAACCAATTTGTGAAGGAGCTAAATAAAGAGTTTGCTTTGAAAGACCTAGGAGATCTTGCT
TATTTTCTAGGAATTGAAGCTTGTCCTACTTCAGCAGTAACAGGTAAACATATTACAGCCAATGATGGACAGGTAATGAATGATCCTACCTTGTATAGAAGTGCTATTGG
AGCTCTACAATATGTAACACACACTCGACCAGAAATCTCATATATTATCAACAAGTTAAGTCAATATTTGCAAAATCCCACTACATTACATTGGCAAGCTGTGAAAAGAG
TATTGAGATATTTAAAGGGAACATTAAATCATGGTTTGTACATACCCAAATCAGATGACTCTCAAATTGTTGCCTATTCTGATGCAGATTGGGGTTGTAGCCTTGATGAC
AAAAAATCTATTGCTGGTCACTGTGTATTCATTGGAGCATCACTAATTTCTTGGTCCTCGAAGAAGCAACGTGTAGTAGCGAGATCAAGTACAGAATCTGAATATCGTTC
CTTAGCACATACAGCAGCTGAAATTACTTGGATTCAAGCCCTTCTAAATGAAATTGATTGCAAAGATATTCAAACACCAATAATTTGGTGTGATAATGTAAGTGCTTTAT
CACTAGCAGCAAATCCAGTTTTTCATTCCAAAAGCAAGCATGTTGAAATTGATATACACTTTGTTAGAGACAAAGTGATTGCTAAAGAAATAGAAGTGAGATATGTACCT
TCAGAGAAACAAATTGCAGATGGGTTGACAAAACCCTTGTCAGAATTCAAGTTTACTGAGCTGAAGAACAAACTCAACGTTGTTGCACCCCCGTTGAGTTTGAGTGGGGC
TCCGCCCCCTCCCCTTCGCGTGTGTTTAACCGCTGCACGACCCAGCCGCCGCCGTCGCTTCAGTCGCTGTCGCCACGCCCTGCTGTTGCACCGCCCCATTGCCCTTTCGC
CCGAATCCCCCTCTCTCTCGCTGCGATCTCCCTCTCTCCCGGGTCTTCGTGTGAAGGTCGCGAAGCCAGCCAATCGCCGTCGTGCAGCCGCTCCCGTTTCGTTGTCGCTT
TCCGTCGCGCAGCCTAAGCCCCGCCGCCCTCTCTCTCCCTCGTCCCCCTCTCGATTCAGACAGCAGCTCGGAGCCTCATTTCCTCGGGTTTTCGCCTCTGTCCAGAAGCG
TCATTGGGCGTTTTCAGCGTCATTTAGTGATTCTGGTGAGTTTAAGCGACTTTCTAAGATGATAATGCTACAGCGTGAGCTGGCAGAGGCAGAGTGGAAGCTGGAATTTG
TTGATTGCTGGAGCTGTTTTCCACGTGGAGGGCTTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGAATATCTAGGAATCATGAAACAAACCTCCAAAAGTCTAAAACTTGCTGGTGAACCAGTTACTTTCAGTTATCTAATGTCTTGTGTGCTTTCAGGATTGGAAGC
AGAGTATTTACCGATTGTCAGCATTATTGAAGGCAAGGATGGCGTCACTTGGCATGATCTTTATTCTACACTTCTTACTTTTGAGAATAACTTGGTGCGGTTGAATGTTG
TTAATCCTCCAAGTGGAATTGTTGGAGAAGCCACAACCAATTATGCCTACAACAACAAAATGCAACCTGCTGGTGGTAACCAAAGCAATACTGGACAGAACAAAAATTAT
GGGAAAACTAATCGCAATGAGAGAGGAAGAGGCCGTGGAAGATTCTCTCCATATCGAGGTAATGTGTCCAAACCAACATGTCAAGTTTGTGGAAAGTTTGGACACTCTGC
TGCTATTTGTTACCATAGGTTTGACAAAGAATATACCAATGTGCAAACTAATTCTGGTAACCAAGGAGGAAGCAACAGAAATTCAGCCTATGTAACAACTCCGGAGGTGG
TTACAGACCCTAGTTGGTTGGTTGACAGTGGGGCTACCAGTCACATCACTGCTGGTCTTCAGAATCTGAACATGAAAACTGAATATCATGGTAAGGATTCTCTAACAGTA
GGTAATGGAGATAGACTTAAAGTTTCTCATATAGGCTCTAGTTTGATCAATTGTCCTACCCAAAAACCTATGTCTTTAAATCACATACTTCTTGTCCCTCAAATTACTCA
AAACCTTCTTAGTGTTGCTCGTTTAACTCAAGATAATAATTGTTATGTTGAATTTCATTCATCTCATTGTTTTGTCAAGGACAAGGCAACAAGGAAAGTAATATTACAGG
GGATACTTAAGGATGGACTCTACCAGTTGAAGCTTCCTCTCCTAGGGCCTCAACTCATTCACAGAAGCAATGTTTCTTTGCCCAGCACAAGATGTTTGAAGTCTACTAGT
TGTCTATCTTCCTATGTTAGTAGTTTCAAGTCAAATGTTGTTTTTCCTTTCAAGTCCTTTGTTTCTTCTACTTATGTAAGTCATAGACAAGTCTCAAAAGATGTGTGGCA
CCAAAGATTAGGTCATCCATCTGAGAGAGTTATGTCTTTAGTTTCAAAAAACAGCAATGTATCTTTTCCTTCTAGTACTAATGAAAAAATAAGCTTTTGTGACTCATGTC
AATTTGGGAAATCTCATCTCCTGCCCTTTCGAAAGTCTAGTTCAACAACAAGTCAAATTCTCGAAATTGTTTATTGTGACTTATGGGGTCCTTCACCAGTCTTGTCAACT
GCTGGATATAAATTCTATATAAGTTTTGTTGATGATTATACTCGATTAACCCACATTTTTCCTCTCAAAAGCAAAGCAGAAGCCGTATCCAAATTTGTTGAGTATAAGAA
ATTAGTAGAAAACAAGTTTGAGAAGAAGATTAAAGTATTTCAAAGTGATTGGGGAGGTGAGTTTAGACCATTTGCCAATTTTCTAAGAGAGTGTGGGATTGAGTTTAGAC
ATCCTTGTCCTCATACAAGTGAGCAAAACGGAATTGTTGAAAATAAGCATAGACACATTGTTGAAATGGGTCTAACTTTATTAGCTCAAGCTTCTATGCCCTTGAGATAT
TGGTGGGATGCTTTTAGTTCATCAGTTTATCTTATAAATTGTTTACCAACACCTATTCTTAACAATATTTCTCCTCTAGAAAAGGCATATAAGAAGACACCTGATTACTC
TCTACTTAAAGTCTTTGGAAGTGCATGTTTTCCTTGTCTAAGGCCATATCAATCTCACAAATTCCAGTTTCATAGCACTAAATGTGTATTCTTAGGCTACAGTGGTTCAC
ACAAAGGCTATAAATGTCTTAGCTCAAGTGGAAACAACATCGATGTTATCAACCAATTTGTGAAGGAGCTAAATAAAGAGTTTGCTTTGAAAGACCTAGGAGATCTTGCT
TATTTTCTAGGAATTGAAGCTTGTCCTACTTCAGCAGTAACAGGTAAACATATTACAGCCAATGATGGACAGGTAATGAATGATCCTACCTTGTATAGAAGTGCTATTGG
AGCTCTACAATATGTAACACACACTCGACCAGAAATCTCATATATTATCAACAAGTTAAGTCAATATTTGCAAAATCCCACTACATTACATTGGCAAGCTGTGAAAAGAG
TATTGAGATATTTAAAGGGAACATTAAATCATGGTTTGTACATACCCAAATCAGATGACTCTCAAATTGTTGCCTATTCTGATGCAGATTGGGGTTGTAGCCTTGATGAC
AAAAAATCTATTGCTGGTCACTGTGTATTCATTGGAGCATCACTAATTTCTTGGTCCTCGAAGAAGCAACGTGTAGTAGCGAGATCAAGTACAGAATCTGAATATCGTTC
CTTAGCACATACAGCAGCTGAAATTACTTGGATTCAAGCCCTTCTAAATGAAATTGATTGCAAAGATATTCAAACACCAATAATTTGGTGTGATAATGTAAGTGCTTTAT
CACTAGCAGCAAATCCAGTTTTTCATTCCAAAAGCAAGCATGTTGAAATTGATATACACTTTGTTAGAGACAAAGTGATTGCTAAAGAAATAGAAGTGAGATATGTACCT
TCAGAGAAACAAATTGCAGATGGGTTGACAAAACCCTTGTCAGAATTCAAGTTTACTGAGCTGAAGAACAAACTCAACGTTGTTGCACCCCCGTTGAGTTTGAGTGGGGC
TCCGCCCCCTCCCCTTCGCGTGTGTTTAACCGCTGCACGACCCAGCCGCCGCCGTCGCTTCAGTCGCTGTCGCCACGCCCTGCTGTTGCACCGCCCCATTGCCCTTTCGC
CCGAATCCCCCTCTCTCTCGCTGCGATCTCCCTCTCTCCCGGGTCTTCGTGTGAAGGTCGCGAAGCCAGCCAATCGCCGTCGTGCAGCCGCTCCCGTTTCGTTGTCGCTT
TCCGTCGCGCAGCCTAAGCCCCGCCGCCCTCTCTCTCCCTCGTCCCCCTCTCGATTCAGACAGCAGCTCGGAGCCTCATTTCCTCGGGTTTTCGCCTCTGTCCAGAAGCG
TCATTGGGCGTTTTCAGCGTCATTTAGTGATTCTGGTGAGTTTAAGCGACTTTCTAAGATGATAATGCTACAGCGTGAGCTGGCAGAGGCAGAGTGGAAGCTGGAATTTG
TTGATTGCTGGAGCTGTTTTCCACGTGGAGGGCTTCATTAA
Protein sequenceShow/hide protein sequence
MSEYLGIMKQTSKSLKLAGEPVTFSYLMSCVLSGLEAEYLPIVSIIEGKDGVTWHDLYSTLLTFENNLVRLNVVNPPSGIVGEATTNYAYNNKMQPAGGNQSNTGQNKNY
GKTNRNERGRGRGRFSPYRGNVSKPTCQVCGKFGHSAAICYHRFDKEYTNVQTNSGNQGGSNRNSAYVTTPEVVTDPSWLVDSGATSHITAGLQNLNMKTEYHGKDSLTV
GNGDRLKVSHIGSSLINCPTQKPMSLNHILLVPQITQNLLSVARLTQDNNCYVEFHSSHCFVKDKATRKVILQGILKDGLYQLKLPLLGPQLIHRSNVSLPSTRCLKSTS
CLSSYVSSFKSNVVFPFKSFVSSTYVSHRQVSKDVWHQRLGHPSERVMSLVSKNSNVSFPSSTNEKISFCDSCQFGKSHLLPFRKSSSTTSQILEIVYCDLWGPSPVLST
AGYKFYISFVDDYTRLTHIFPLKSKAEAVSKFVEYKKLVENKFEKKIKVFQSDWGGEFRPFANFLRECGIEFRHPCPHTSEQNGIVENKHRHIVEMGLTLLAQASMPLRY
WWDAFSSSVYLINCLPTPILNNISPLEKAYKKTPDYSLLKVFGSACFPCLRPYQSHKFQFHSTKCVFLGYSGSHKGYKCLSSSGNNIDVINQFVKELNKEFALKDLGDLA
YFLGIEACPTSAVTGKHITANDGQVMNDPTLYRSAIGALQYVTHTRPEISYIINKLSQYLQNPTTLHWQAVKRVLRYLKGTLNHGLYIPKSDDSQIVAYSDADWGCSLDD
KKSIAGHCVFIGASLISWSSKKQRVVARSSTESEYRSLAHTAAEITWIQALLNEIDCKDIQTPIIWCDNVSALSLAANPVFHSKSKHVEIDIHFVRDKVIAKEIEVRYVP
SEKQIADGLTKPLSEFKFTELKNKLNVVAPPLSLSGAPPPPLRVCLTAARPSRRRRFSRCRHALLLHRPIALSPESPSLSLRSPSLPGLRVKVAKPANRRRAAAPVSLSL
SVAQPKPRRPLSPSSPSRFRQQLGASFPRVFASVQKRHWAFSASFSDSGEFKRLSKMIMLQRELAEAEWKLEFVDCWSCFPRGGLH