; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020526 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020526
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr06:15921593..15923845
RNA-Seq ExpressionPay0020526
SyntenyPay0020526
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01369.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.3e-20251.41Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYG----------------------------DSFDDVRSFLGSASFYRRFIKDFSKIALPLTNL
        MPFGLCNA  TFQRCM+SIF D++E  IEVFMDDFTVYG                             S  +VRSFLG A FYRRFIKDFSKI  PL  L
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYG----------------------------DSFDDVRSFLGSASFYRRFIKDFSKIALPLTNL

Query:  WKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHAIYYASRTLN---------------------QFRSYI
         +KDV F  D  CK AFD+LK+ L S PI+Q PNWNL FEIMCDASN A+GAVL Q I K  H IYYASRTL+                     +FRSY+
Subjt:  WKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHAIYYASRTLN---------------------QFRSYI

Query:  IGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK-------------DFPDEHLFQTNLQAPWYADIVNYLVT-----
        +G+ VI+Y+DHA +KY +SKKE+KPRL+RW+LLLQEFNL I+D++G+ N VADHL               FPDE+LF     +PWYADIVNYLVT     
Subjt:  IGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK-------------DFPDEHLFQTNLQAPWYADIVNYLVT-----

Query:  -------------------------------------------------------GHFSPKRTTRKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRR
                                                               GHF PKRT RK+L+ G FW  LF +++ FCK+C +CQ+TG+L  R
Subjt:  -------------------------------------------------------GHFSPKRTTRKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRR

Query:  NEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVSNIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQH
        ++M L  ++ C++F + G+DFMGP P S+   YILL VDYVSKWVEA  TRTND+ +V  F+ S+IF+RFG+PRAIISD+GTHFCNR +E L +KY V H
Subjt:  NEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVSNIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQH

Query:  RISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVYGKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDL
        R+S+ YHPQTN QA+  NRE+K+ILEKTV+   K+WS  L+DALWAYRTAYKT I  SP++LV+GK CH+P+E+EH+AYWAI++CN+++ +  E + L L
Subjt:  RISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVYGKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDL

Query:  LELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVVIDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSV
         EL+E+R +AYENS+I+K+KTKL HD+ + RK F +GQKV LY+  +KL P KLRS+W+GPFV+ +  T G V I + +TGKIFKVNGHRLK F E   V
Subjt:  LELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVVIDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSV

Query:  QQCSIETL
         Q  I  L
Subjt:  QQCSIETL

XP_031392147.1 uncharacterized protein LOC116204210 [Punica granatum]2.6e-20750.47Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLC+A GTFQRCMMSIF D IE CIEVFMDDFTV+GDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
                +RSFLG A FYRRFIKDFSKIA PL +L +KD  F+   NC++AFD LK+ L S PI+Q P+W L FEIM DAS+ A+GAVL Q +DK+ H 
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------
        IYYAS+TL+                     +F+SY++GS ++++TDHA +K+ ++KKESKPRL+RW+LLLQEF+L IKDRKGS NSVADHL         
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------

Query:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
              FPDEHLF    + PWYAD+VN++VT                                                            GHF PKRT 
Subjt:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        RKI+DSGF+W+TLF ++  FCK C  CQR G++SRRNEM    ++ C+VF + GMDFMGP PSSF + YILL VDYVSKWVEA  TRTND+ +V  FL S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIFSRFGIPRAIISDQGTHFCNR++EAL +KYGV HR+++ YHPQ+N QA+  NRE+K+ILEKTVN   K+WSL L+DALWAYRTAYKT I  SP++L++
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+EIEH+A+WA++QCN+ L  + E++   L ELEE+ L+AY+N+ ++KE+ KLLHDK +LRK+F IGQKV L++  +KLMP KLRS+W GPFVV
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY
         +  + GV+ I+NL+T +IFKVNGHRLK F E  +V       LS P+Y
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY

XP_031393661.1 uncharacterized protein LOC116205261 [Punica granatum]1.1e-20850.73Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLC+A GTFQRCMMSIF D IE CIEVFMDDFTV+GDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
                +RSFLG A FYRRFIKDFSKIA PL +L +KD  F+   NC++AFD LK+ L S PI+Q P+W L FEIM DAS+ A+GAVL Q +DK+ H 
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------
        IYYAS+TL+                     +FRSY++GS ++++TDHA +K+ ++KKESKPRL+RW+LLLQEF+L IKDRKGS NSVADHL         
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------

Query:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
              FPDEHLF    + PWYAD+VN++VT                                                            GHF PKRT 
Subjt:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        RKI+DSGF+W+TLF ++  FCK C  CQR G++SRRNEM    ++ C+VF + GMDFMGP PSSF + YILL VDYVSKWVEA  TRTND+ +V  FL S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIFSRFGIPRAIISDQGTHFCNR++EAL +KYGV HR+++ YHPQ+N QA+  NRE+K+ILEKTVN   K+WSL L+DALWAYRTAYKT I  SP++L++
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+EIEH+A+WA++QCN+ L  + E++   L ELEE+RL+AY+N+ ++KE+ KLLHDK +LRK+F IGQKV L++  +KLMP KLRS+W GPFVV
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY
         +  + GV+ I+NL+T +IFKVNGHRLK F E  +V       LS P+Y
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY

XP_031402684.1 uncharacterized protein LOC116212259 [Punica granatum]1.1e-20850.73Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLC+A GTFQRCMMSIF D IE CIEVFMDDFTV+GDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
                +RSFLG A FYRRFIKDFSKIA PL +L +KD  F+   NC++AFD LK+ L S PI+Q P+W L FEIM DAS+ A+GAVL Q +DK+ H 
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------
        IYYAS+TL+                     +FRSY++GS ++++TDHA +K+ ++KKESKPRL+RW+LLLQEF+L IKDRKGS NSVADHL         
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------

Query:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
              FPDEHLF    + PWYAD+VN++VT                                                            GHF PKRT 
Subjt:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        RKI+DSGF+W+TLF ++  FCK C  CQR G++SRRNEM    ++ C+VF + GMDFMGP PSSF + YILL VDYVSKWVEA  TRTND+ +V  FL S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIFSRFGIPRAIISDQGTHFCNR++EAL +KYGV HR+++ YHPQ+N QA+  NRE+K+ILEKTVN   K+WSL L+DALWAYRTAYKT I  SP++L++
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+EIEH+A+WA++QCN+ L  + E++   L ELEE+RL+AY+N+ ++KE+ KLLHDK +LRK+F IGQKV L++  +KLMP KLRS+W GPFVV
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY
         +  + GV+ I+NL+T +IFKVNGHRLK F E  +V       LS P+Y
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY

XP_038978516.1 uncharacterized protein LOC120108856 [Phoenix dactylifera]1.6e-20149.6Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLCNA  TFQRCM+SIF D+IE  IEVFMDDFTVYGDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
               +VRSFLG A FYRRFIKDFSK+ALPL  L +K++ F  D+ CK AFD LK+ L S P++Q PNWN+ FEIMCDAS  A+GA L Q I K  HA
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL---------
        IYY S TLN                     +FRSY++G+ VI+Y+DHA ++Y + KKE+KPRL+RW+LLL EF+L IKD++G+ N VADHL         
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL---------

Query:  ----KDFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
            + FPDE LF T++  PWYA++VNYLVT                                                            GHF  KRT 
Subjt:  ----KDFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        RK+L+ GF+W +LF +S+SFCKSC +CQ+TG++S+RNEM    ++ C++F + G+DFMGP P SF Y+YILL VDYVSKWVEA  TRT+DS +V+ F+ S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIFSRFGIP A+ISD+GTHFCNRT+EAL RKY V H++S+ YHPQT+ QA+  NREIK+ILEKTVN   K+WSL L+DALWAYRTAYKT I  SP++LV+
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+E+EHKAYWAI+  N+ + E+ E + L L ELEE+R  AYE++RI+KEKTK  HDK I RKEF++GQKV LY+  ++L P KLRS+W+GPFVV
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLYT
         +    G V I++L T K+FKVNGHRLK F E   V+  +   L  P+YT
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLYT

TrEMBL top hitse value%identityAlignment
A0A2G9G7U9 DNA-directed DNA polymerase1.6e-20251.41Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYG----------------------------DSFDDVRSFLGSASFYRRFIKDFSKIALPLTNL
        MPFGLCNA  TFQRCM+SIF D++E  IEVFMDDFTVYG                             S  +VRSFLG A FYRRFIKDFSKI  PL  L
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYG----------------------------DSFDDVRSFLGSASFYRRFIKDFSKIALPLTNL

Query:  WKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHAIYYASRTLN---------------------QFRSYI
         +KDV F  D  CK AFD+LK+ L S PI+Q PNWNL FEIMCDASN A+GAVL Q I K  H IYYASRTL+                     +FRSY+
Subjt:  WKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHAIYYASRTLN---------------------QFRSYI

Query:  IGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK-------------DFPDEHLFQTNLQAPWYADIVNYLVT-----
        +G+ VI+Y+DHA +KY +SKKE+KPRL+RW+LLLQEFNL I+D++G+ N VADHL               FPDE+LF     +PWYADIVNYLVT     
Subjt:  IGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK-------------DFPDEHLFQTNLQAPWYADIVNYLVT-----

Query:  -------------------------------------------------------GHFSPKRTTRKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRR
                                                               GHF PKRT RK+L+ G FW  LF +++ FCK+C +CQ+TG+L  R
Subjt:  -------------------------------------------------------GHFSPKRTTRKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRR

Query:  NEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVSNIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQH
        ++M L  ++ C++F + G+DFMGP P S+   YILL VDYVSKWVEA  TRTND+ +V  F+ S+IF+RFG+PRAIISD+GTHFCNR +E L +KY V H
Subjt:  NEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVSNIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQH

Query:  RISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVYGKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDL
        R+S+ YHPQTN QA+  NRE+K+ILEKTV+   K+WS  L+DALWAYRTAYKT I  SP++LV+GK CH+P+E+EH+AYWAI++CN+++ +  E + L L
Subjt:  RISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVYGKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDL

Query:  LELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVVIDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSV
         EL+E+R +AYENS+I+K+KTKL HD+ + RK F +GQKV LY+  +KL P KLRS+W+GPFV+ +  T G V I + +TGKIFKVNGHRLK F E   V
Subjt:  LELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVVIDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSV

Query:  QQCSIETL
         Q  I  L
Subjt:  QQCSIETL

A0A6P6TWR2 uncharacterized protein LOC1137046585.9e-19749Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLCNA  TFQRCM+SIF +++EK IEVFMDDF+VYGDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
               +VRSFLG A FYRRFIKDFSKI  PL  L +KDV F  +D C+KAF+ LK+ L S PI+Q P+WNL FEIMCDAS+ A+GAVL Q + K  H 
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADH----------
        IYYASR LN                     +FRSY++G+ VI+++DHA ++Y ++KK++KPRL+RW+LLLQEF+L I+D+KGS N VADH          
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADH----------

Query:  --LKD-FPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
          LKD FP+EHLF  N Q PWYAD+VNYLVT                                                            GHF PKRT 
Subjt:  --LKD-FPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
         K+L+SGF+W +LF +++ FCKSC  CQR G+++RR++M    +I  ++F + G+DFMGP P+SF +LYILL VDYVSKWVEA  TRTNDS +V+ F+ S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIF RFG+PRAI+SD+GTHFCNRTI AL RKYGV HR+S+ YHPQTN QA+  NREIK+ILEK V    K+WS  L DALWAYRTAYKT I  SP++LV+
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+E EHKA+WAI+QCN++L EA  ++ LDL ELEE+R +AYEN+ I+KEK++  HD++I RK FE+GQKV LY   +KL P KLRS+W+GPF+V
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLP
             +G V I++  T   F VNGHRLK ++E  S ++     L  P
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLP

A0A6P8D4X0 Reverse transcriptase1.3e-20750.47Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLC+A GTFQRCMMSIF D IE CIEVFMDDFTV+GDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
                +RSFLG A FYRRFIKDFSKIA PL +L +KD  F+   NC++AFD LK+ L S PI+Q P+W L FEIM DAS+ A+GAVL Q +DK+ H 
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------
        IYYAS+TL+                     +F+SY++GS ++++TDHA +K+ ++KKESKPRL+RW+LLLQEF+L IKDRKGS NSVADHL         
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------

Query:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
              FPDEHLF    + PWYAD+VN++VT                                                            GHF PKRT 
Subjt:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        RKI+DSGF+W+TLF ++  FCK C  CQR G++SRRNEM    ++ C+VF + GMDFMGP PSSF + YILL VDYVSKWVEA  TRTND+ +V  FL S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIFSRFGIPRAIISDQGTHFCNR++EAL +KYGV HR+++ YHPQ+N QA+  NRE+K+ILEKTVN   K+WSL L+DALWAYRTAYKT I  SP++L++
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+EIEH+A+WA++QCN+ L  + E++   L ELEE+ L+AY+N+ ++KE+ KLLHDK +LRK+F IGQKV L++  +KLMP KLRS+W GPFVV
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY
         +  + GV+ I+NL+T +IFKVNGHRLK F E  +V       LS P+Y
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY

A0A6P8DJV3 Reverse transcriptase5.2e-20950.73Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLC+A GTFQRCMMSIF D IE CIEVFMDDFTV+GDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
                +RSFLG A FYRRFIKDFSKIA PL +L +KD  F+   NC++AFD LK+ L S PI+Q P+W L FEIM DAS+ A+GAVL Q +DK+ H 
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------
        IYYAS+TL+                     +FRSY++GS ++++TDHA +K+ ++KKESKPRL+RW+LLLQEF+L IKDRKGS NSVADHL         
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------

Query:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
              FPDEHLF    + PWYAD+VN++VT                                                            GHF PKRT 
Subjt:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        RKI+DSGF+W+TLF ++  FCK C  CQR G++SRRNEM    ++ C+VF + GMDFMGP PSSF + YILL VDYVSKWVEA  TRTND+ +V  FL S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIFSRFGIPRAIISDQGTHFCNR++EAL +KYGV HR+++ YHPQ+N QA+  NRE+K+ILEKTVN   K+WSL L+DALWAYRTAYKT I  SP++L++
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+EIEH+A+WA++QCN+ L  + E++   L ELEE+RL+AY+N+ ++KE+ KLLHDK +LRK+F IGQKV L++  +KLMP KLRS+W GPFVV
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY
         +  + GV+ I+NL+T +IFKVNGHRLK F E  +V       LS P+Y
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY

A0A6P8E830 Reverse transcriptase5.2e-20950.73Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGLC+A GTFQRCMMSIF D IE CIEVFMDDFTV+GDSFD                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA
                +RSFLG A FYRRFIKDFSKIA PL +L +KD  F+   NC++AFD LK+ L S PI+Q P+W L FEIM DAS+ A+GAVL Q +DK+ H 
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHA

Query:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------
        IYYAS+TL+                     +FRSY++GS ++++TDHA +K+ ++KKESKPRL+RW+LLLQEF+L IKDRKGS NSVADHL         
Subjt:  IYYASRTLN---------------------QFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLK--------

Query:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT
              FPDEHLF    + PWYAD+VN++VT                                                            GHF PKRT 
Subjt:  -----DFPDEHLFQTNLQAPWYADIVNYLVT------------------------------------------------------------GHFSPKRTT

Query:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        RKI+DSGF+W+TLF ++  FCK C  CQR G++SRRNEM    ++ C+VF + GMDFMGP PSSF + YILL VDYVSKWVEA  TRTND+ +V  FL S
Subjt:  RKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        NIFSRFGIPRAIISDQGTHFCNR++EAL +KYGV HR+++ YHPQ+N QA+  NRE+K+ILEKTVN   K+WSL L+DALWAYRTAYKT I  SP++L++
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV
        GK CH+P+EIEH+A+WA++QCN+ L  + E++   L ELEE+RL+AY+N+ ++KE+ KLLHDK +LRK+F IGQKV L++  +KLMP KLRS+W GPFVV
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVV

Query:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY
         +  + GV+ I+NL+T +IFKVNGHRLK F E  +V       LS P+Y
Subjt:  IDFSTFGVVSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLY

SwissProt top hitse value%identityAlignment
O93209 Pro-Pol polyprotein7.3e-1925.99Show/hide
Query:  LDSGFFWKTLFANSFSFCKSCANCQRTGSLSRR---NEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS
        +   ++W  +  +  SF  +C  C+    L+ +    + ++H     D FY   MD++GPLP S  Y+++L+VVD  + +    PT+   S    + L  
Subjt:  LDSGFFWKTLFANSFSFCKSCANCQRTGSLSRR---NEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRTNDSVIVSRFLVS

Query:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY
        N  +   IP+ + SDQG+ F +       ++  +Q   S+PYHPQ++ + +  N EIK +L K +  +   W   ++    A    +  +   +P +L++
Subjt:  NIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKLVY

Query:  GKSCHIPIEIEHKAYWAIRQCNLSLLE
        G  C++P   +    W  R+  L+LL+
Subjt:  GKSCHIPIEIEHKAYWAIRQCNLSLLE

P04323 Retrovirus-related Pol polyprotein from transposon 17.63.8e-2328.87Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        MPFGL NA  TFQRCM  I    + K   V++DD  V+  S D                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPF-LIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQ-------
               ++++FLG   +YR+FI +F+ IA P+T   KK++     +     AF  LK  +   PIL+ P++   F +  DAS++ALGAVL Q       
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPF-LIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQ-------

Query:  -------------MIDKKLHAIYYASRTLNQFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL
                      I+K+L AI +A++T   FR Y++G    I +DH  + +    K+   +L RW + L EF+  IK  KG  N VAD L
Subjt:  -------------MIDKKLHAIYYASRTLNQFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL

P10394 Retrovirus-related Pol polyprotein from transposon 4124.9e-2336.22Show/hide
Query:  DDVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHAIYYASR
        D  R F+   ++YRRFIK+F+  +  +T L KK+VPF   D C+KAF  LK +L++  +LQ P+++  F I  DAS  A GAVL Q  +     + YASR
Subjt:  DDVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQMIDKKLHAIYYASR

Query:  TLNQ---------------------FRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL
           +                     FR YI G    + TDH  + Y  S      +L R  L L+E+N T++  KG +N VAD L
Subjt:  TLNQ---------------------FRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL

P20825 Retrovirus-related Pol polyprotein from transposon 2977.6e-2429.59Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSF----------------------------------------------------------
        MPFGL NA  TFQRCM +I    + K   V++DD  ++  S                                                           
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSF----------------------------------------------------------

Query:  ------DDVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCK----KAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQ----
               ++R+FLG   +YR+FI +++ IA P+T+  KK       D  K    +AF+ LK  ++  PILQ P++   F +  DASNLALGAVL Q    
Subjt:  ------DDVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCK----KAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAVLRQ----

Query:  ----------------MIDKKLHAIYYASRTLNQFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL
                         I+K+L AI +A++T   FR Y++G   +I +DH  +++  + KE   +L RW + L E+   I   KG  NSVAD L
Subjt:  ----------------MIDKKLHAIYYASRTLNQFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus8.9e-2526.53Show/hide
Query:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------
        +PFGL NA   FQR +  I  + I K   V++DD  V+ + +D                                                         
Subjt:  MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFD---------------------------------------------------------

Query:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWK-----------KDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAV
               +++ FLG  S+YR+FI+D++K+A PLTNL +             VP  +D+   ++F+DLK  L S+ IL  P +   F +  DASN A+GAV
Subjt:  -------DVRSFLGSASFYRRFIKDFSKIALPLTNLWK-----------KDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSFEIMCDASNLALGAV

Query:  LRQMIDKKLHAIYYASRTLNQ---------------------FRSYIIGSPVI-IYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVA
        L Q    +   I Y SR+LN+                      R+Y+ G+  I +YTDH  + + +  +    +L RW   ++E+N  +  + G +N VA
Subjt:  LRQMIDKKLHAIYYASRTLNQ---------------------FRSYIIGSPVI-IYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVA

Query:  DHLKDFPDE-HLFQTNLQAPWYADIVNYLVTGHFSPKRTTRKI
        D L   P + +   T+L A    D +  L T H +   ++R I
Subjt:  DHLKDFPDE-HLFQTNLQAPWYADIVNYLVTGHFSPKRTTRKI

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.3e-0638.24Show/hide
Query:  DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSF
        ++R FLG   +YRRF+K++ KI  PLT L KK+      +    AF  LK  + + P+L  P+  L F
Subjt:  DVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTGGTCTTTGTAATGCACAGGGCACATTCCAACGTTGCATGATGAGCATATTTTTCGACTTTATTGAAAAATGCATTGAAGTTTTCATGGATGATTTCACCGT
TTATGGTGATAGTTTTGATGATGTTAGATCCTTCCTTGGTAGTGCCAGCTTTTATAGACGTTTTATAAAAGACTTTTCTAAAATTGCTTTGCCTCTAACTAATCTCTGGA
AAAAAGATGTACCATTTCTAATTGATGACAATTGTAAGAAGGCATTTGATGATCTCAAACAAAGGTTAGTCTCTACCCCTATCCTTCAATCTCCTAATTGGAATTTATCT
TTCGAAATAATGTGTGATGCAAGCAACTTAGCATTAGGAGCTGTTTTACGACAAATGATAGATAAAAAATTGCATGCTATATACTATGCATCTAGGACCCTTAACCAATT
TAGAAGCTACATTATTGGTTCCCCAGTAATTATTTACACTGATCATGCAACGGTTAAGTATCCTGTATCAAAAAAAGAATCAAAACCAAGGCTTGTTCGATGGGTTTTAC
TTTTGCAAGAATTCAACCTAACCATCAAGGATAGAAAAGGATCCAACAATTCTGTAGCCGACCATCTTAAAGACTTCCCTGATGAACATCTCTTTCAAACAAATCTTCAA
GCACCATGGTACGCCGATATTGTAAACTACTTAGTCACGGGTCACTTTAGTCCTAAAAGAACGACTAGGAAAATTTTAGATAGTGGATTCTTTTGGAAAACATTATTTGC
AAATTCTTTTTCATTTTGTAAATCATGTGCAAACTGTCAAAGAACTGGATCTCTATCTAGGAGAAATGAAATGCTCCTTCACCTCGTTATTACTTGTGATGTTTTTTATA
TTTGTGGCATGGATTTTATGGGTCCCCTTCCTTCTTCTTTTAGATATCTTTACATTCTATTAGTCGTTGACTATGTATCGAAGTGGGTTGAAGCAATCCCCACTAGGACT
AATGATTCTGTCATTGTCTCAAGATTTCTAGTTTCTAACATATTTTCTAGATTTGGCATCCCAAGGGCAATCATTAGTGATCAAGGAACACACTTTTGCAATCGGACCAT
TGAAGCCTTGAGGAGAAAATATGGTGTTCAACATCGTATTTCCTCACCATACCATCCTCAAACGAACGAACAAGCTAAAACTTTTAATCGAGAGATAAAAAATATCCTAG
AGAAAACCGTCAACACAAAAAGTAAGAATTGGAGCCTCCACCTCAACGATGCACTTTGGGCATACCGAACAGCCTATAAAACTACAATTGACACATCTCCGTTCAAGCTT
GTGTACGGTAAGTCTTGTCATATCCCAATAGAAATAGAACATAAAGCATATTGGGCAATTAGACAATGCAATTTATCTCTCTTAGAAGCCGATGAGAAAAAATTCCTAGA
TTTGCTAGAATTAGAAGAATTGAGATTAAAGGCATATGAAAATTCTAGGATTCACAAAGAAAAGACTAAACTTTTGCATGATAAAAAAATCCTAAGAAAAGAATTTGAAA
TAGGGCAAAAAGTTCCTTTATATAATTTTTCTATTAAACTCATGCCCAGAAAGTTAAGATCTAAATGGCTTGGTCCTTTTGTTGTAATTGATTTCTCTACTTTTGGTGTT
GTTTCCATAAAAAATCTTGACACGGGAAAAATTTTCAAAGTGAATGGGCATAGATTAAAAATATTCCATGAAAGGCAATCCGTACAACAATGTTCTATAGAAACACTTTC
TCTCCCCCTCTACACATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTTGGTCTTTGTAATGCACAGGGCACATTCCAACGTTGCATGATGAGCATATTTTTCGACTTTATTGAAAAATGCATTGAAGTTTTCATGGATGATTTCACCGT
TTATGGTGATAGTTTTGATGATGTTAGATCCTTCCTTGGTAGTGCCAGCTTTTATAGACGTTTTATAAAAGACTTTTCTAAAATTGCTTTGCCTCTAACTAATCTCTGGA
AAAAAGATGTACCATTTCTAATTGATGACAATTGTAAGAAGGCATTTGATGATCTCAAACAAAGGTTAGTCTCTACCCCTATCCTTCAATCTCCTAATTGGAATTTATCT
TTCGAAATAATGTGTGATGCAAGCAACTTAGCATTAGGAGCTGTTTTACGACAAATGATAGATAAAAAATTGCATGCTATATACTATGCATCTAGGACCCTTAACCAATT
TAGAAGCTACATTATTGGTTCCCCAGTAATTATTTACACTGATCATGCAACGGTTAAGTATCCTGTATCAAAAAAAGAATCAAAACCAAGGCTTGTTCGATGGGTTTTAC
TTTTGCAAGAATTCAACCTAACCATCAAGGATAGAAAAGGATCCAACAATTCTGTAGCCGACCATCTTAAAGACTTCCCTGATGAACATCTCTTTCAAACAAATCTTCAA
GCACCATGGTACGCCGATATTGTAAACTACTTAGTCACGGGTCACTTTAGTCCTAAAAGAACGACTAGGAAAATTTTAGATAGTGGATTCTTTTGGAAAACATTATTTGC
AAATTCTTTTTCATTTTGTAAATCATGTGCAAACTGTCAAAGAACTGGATCTCTATCTAGGAGAAATGAAATGCTCCTTCACCTCGTTATTACTTGTGATGTTTTTTATA
TTTGTGGCATGGATTTTATGGGTCCCCTTCCTTCTTCTTTTAGATATCTTTACATTCTATTAGTCGTTGACTATGTATCGAAGTGGGTTGAAGCAATCCCCACTAGGACT
AATGATTCTGTCATTGTCTCAAGATTTCTAGTTTCTAACATATTTTCTAGATTTGGCATCCCAAGGGCAATCATTAGTGATCAAGGAACACACTTTTGCAATCGGACCAT
TGAAGCCTTGAGGAGAAAATATGGTGTTCAACATCGTATTTCCTCACCATACCATCCTCAAACGAACGAACAAGCTAAAACTTTTAATCGAGAGATAAAAAATATCCTAG
AGAAAACCGTCAACACAAAAAGTAAGAATTGGAGCCTCCACCTCAACGATGCACTTTGGGCATACCGAACAGCCTATAAAACTACAATTGACACATCTCCGTTCAAGCTT
GTGTACGGTAAGTCTTGTCATATCCCAATAGAAATAGAACATAAAGCATATTGGGCAATTAGACAATGCAATTTATCTCTCTTAGAAGCCGATGAGAAAAAATTCCTAGA
TTTGCTAGAATTAGAAGAATTGAGATTAAAGGCATATGAAAATTCTAGGATTCACAAAGAAAAGACTAAACTTTTGCATGATAAAAAAATCCTAAGAAAAGAATTTGAAA
TAGGGCAAAAAGTTCCTTTATATAATTTTTCTATTAAACTCATGCCCAGAAAGTTAAGATCTAAATGGCTTGGTCCTTTTGTTGTAATTGATTTCTCTACTTTTGGTGTT
GTTTCCATAAAAAATCTTGACACGGGAAAAATTTTCAAAGTGAATGGGCATAGATTAAAAATATTCCATGAAAGGCAATCCGTACAACAATGTTCTATAGAAACACTTTC
TCTCCCCCTCTACACATAA
Protein sequenceShow/hide protein sequence
MPFGLCNAQGTFQRCMMSIFFDFIEKCIEVFMDDFTVYGDSFDDVRSFLGSASFYRRFIKDFSKIALPLTNLWKKDVPFLIDDNCKKAFDDLKQRLVSTPILQSPNWNLS
FEIMCDASNLALGAVLRQMIDKKLHAIYYASRTLNQFRSYIIGSPVIIYTDHATVKYPVSKKESKPRLVRWVLLLQEFNLTIKDRKGSNNSVADHLKDFPDEHLFQTNLQ
APWYADIVNYLVTGHFSPKRTTRKILDSGFFWKTLFANSFSFCKSCANCQRTGSLSRRNEMLLHLVITCDVFYICGMDFMGPLPSSFRYLYILLVVDYVSKWVEAIPTRT
NDSVIVSRFLVSNIFSRFGIPRAIISDQGTHFCNRTIEALRRKYGVQHRISSPYHPQTNEQAKTFNREIKNILEKTVNTKSKNWSLHLNDALWAYRTAYKTTIDTSPFKL
VYGKSCHIPIEIEHKAYWAIRQCNLSLLEADEKKFLDLLELEELRLKAYENSRIHKEKTKLLHDKKILRKEFEIGQKVPLYNFSIKLMPRKLRSKWLGPFVVIDFSTFGV
VSIKNLDTGKIFKVNGHRLKIFHERQSVQQCSIETLSLPLYT