; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0015821 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0015821
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:13664269..13666401
RNA-Seq ExpressionCmc01g0015821
SyntenyCmc01g0015821
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]1.8e-20654.38Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+R GT  TLYE+WKGRK +VK+FHIFGS CYILADRE  +K D KS+ G+FLGYS N RAYRVFN+RT  VME+INVVVDD +  
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE
         K+  +E+   S + +A  +       E+++    E + N   KR+         S+ +QK HP   IIGDP+ G+  R ++    ++++++ C+ S IE
Subjt:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE

Query:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF
        P +V               EL QFKR  VW LVP+ +  N+IGTKWIFKNKT+E G + RNKARLVAQGY Q+EGVDFDETFAPVA+LE+IRLLL ++C 
Subjt:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF

Query:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF
          FKLYQMDVKSAFLNGYLNEEVY              ++Y+L KALYGLKQAPRAWYE LT +L ++GY +   DKTLF+ + + +L++AQIYVDDI+F
Subjt:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF

Query:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT
        GG    ++ +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ +YAKNIVK FG++ + +KRTP  TH K+++D  G +VD  LYRSMIGSLLYLT
Subjt:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT

Query:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA
        ASRPDI YAVG+CA                                                     RKSTSGGCF+LGNNL+SWFSKK+NC+SLSTAEA
Subjt:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA

Query:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        EYI AG   +QL+WMK ML EY + QDVMTLYCDNMSAI+ISKNPVQHSRTKHIDIRHH+IR+L+++K+ITL+H     Q+AD FTK LDA  FE L
Subjt:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

AAO73523.1 gag-pol polyprotein [Glycine max]3.1e-20654.38Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+R GT  TLYE+WKGRK +VK+FHIFGS CYILADRE  +K D KS+ G+FLGYS N RAYRVFN+RT  VME+INVVVDD +  
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE
         K+  +E+   S + +A  +       E+++    E + N   KR+         S+ +QK HP   IIGDP+ G+  R ++    ++++++ C+ S IE
Subjt:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE

Query:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF
        P +V               EL QFKR  VW LVP+ +  N+IGTKWIFKNKT+E G + RNKARLVAQGY Q+EGVDFDETFAPVA+LE+IRLLL ++C 
Subjt:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF

Query:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF
          FKLYQMDVKSAFLNGYLNEEVY              ++Y+L KALYGLKQAPRAWYE LT +L ++GY +   DKTLF+ + + +L++AQIYVDDI+F
Subjt:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF

Query:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT
        GG    ++ +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ +YAKNIVK FG++ + +KRTP  TH K+++D  G +VD K YRSMIGSLLYLT
Subjt:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT

Query:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA
        ASRPDI YAVG+CA                                                     RKSTSGGCF+LGNNL+SWFSKK+NC+SLSTAEA
Subjt:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA

Query:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        EYI AG   +QL+WMK ML EY + QDVMTLYCDNMSAI+ISKNPVQHSRTKHIDIRHH+IR+L+++K+ITL+H     Q+AD FTK LDA  FE L
Subjt:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

AAO73529.1 gag-pol polyprotein [Glycine max]8.1e-20754.66Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+R GT  TLYE+WKGRK  VK+FHIFGS CYILADRE  +K D KS+ G+FLGYS N RAYRVFN+RT  VME+INVVVDD    
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE
         K+  +E+   S + +A T+       E+++    E + N   KR          S  +QK HP   IIGDP+ G+  R ++    ++++++ C+ S IE
Subjt:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE

Query:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF
        P +V               EL QFKR  VW LVP+ +  N+IGTKWIFKNKT+E G + RNKARLVAQGY Q+EGVDFDETFAPVA+LE+IRLLL ++C 
Subjt:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF

Query:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF
          FKLYQMDVKSAFLNGYLNEE Y              ++Y+L KALYGLKQAPRAWYE LT +L ++GY +   DKTLF+ + + +L++AQIYVDDI+F
Subjt:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF

Query:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT
        GG    ++ +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ KYAKNIVK FG++ + +KRTP  TH K+++D  G +VD  LYRSMIGSLLYLT
Subjt:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT

Query:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA
        ASRPDI YAVG+CA                                                     RKSTSGGCF+LGNNL+SWFSKK+NC+SLSTAEA
Subjt:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA

Query:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        EYI AG   +QL+WMK ML EY + QDVMTLYCDNMSAI+ISKNPVQHSRTKHIDIRHH+IREL+++K+ITL+H     Q+AD FTK LDA  FE L
Subjt:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

MCH79363.1 gag-pol polyprotein [Trifolium medium]3.1e-20654.44Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+RSGT+ TLYELWKGRK  VK+FH+FGS CYILADRE  +K D KSE G+FLGYS N RAYRV N+RT+++ME+INVVVDD   +
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEED-EASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAI
         K  D E D   S++ +  T      P+ D E +  +L+P  ++K         + S  +QKNHP   IIG P+ GI  RR +     + I++ C+ S I
Subjt:  DKQIDDEED-EASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAI

Query:  EPTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISC
        EP +V               EL QFKR  VW LVP+    N+IGTKW+++NK+DE+G V RNKARLVAQGY+QVEG+DFDETFAPVA+LE+IRLL+ ++C
Subjt:  EPTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISC

Query:  FQNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDII
           FKLYQMDVKSAFLNGYL+EEVY              ++YKL KALYGLKQAPRAWYE LT++L  +GY +   DKTLF+   + +L++AQIYVDDI+
Subjt:  FQNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDII

Query:  FGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYL
        FGG    +V +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ KYAKNIVK FG++ + YKRTP  TH K+T D  G+ VD  +Y+SMIGSLLYL
Subjt:  FGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYL

Query:  TASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAE
        TASRPDI +AVG+CA                                                     RKSTSGGCFFLGNNL+SWFSKK+NC+SLSTAE
Subjt:  TASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAE

Query:  AEYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        AEYI AG   +QL+WMK ML +Y + QDVMTL+CDN+SAI+ISKNP+QHSRTKHIDIRHHFIR+L+E  ++TL+H   + QLAD FTK LDA  +E L
Subjt:  AEYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

TYK23179.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.0e-23888.93Show/hide
Query:  FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYN
        FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTE+VMETINVVVDDYN
Subjt:  FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYN

Query:  NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSA
        NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTE TNIELSP SISKRAT EGTLTILSSHV+KNHPLSSIIGDPSAGIIARRKDK               
Subjt:  NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSA

Query:  IEPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSA
              ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLL              V S 
Subjt:  IEPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSA

Query:  FLNGYLNEEVYYIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKL
        F          YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDT+KSEFEMSLVGKL
Subjt:  FLNGYLNEEVYYIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKL

Query:  SCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAVGICAR
        SCFLGLQIKQRSE IFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAVGICAR
Subjt:  SCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAVGICAR

TrEMBL top hitse value%identityAlignment
A0A392LWM0 Gag-pol polyprotein (Fragment)1.5e-20654.44Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+RSGT+ TLYELWKGRK  VK+FH+FGS CYILADRE  +K D KSE G+FLGYS N RAYRV N+RT+++ME+INVVVDD   +
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEED-EASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAI
         K  D E D   S++ +  T      P+ D E +  +L+P  ++K         + S  +QKNHP   IIG P+ GI  RR +     + I++ C+ S I
Subjt:  DKQIDDEED-EASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAI

Query:  EPTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISC
        EP +V               EL QFKR  VW LVP+    N+IGTKW+++NK+DE+G V RNKARLVAQGY+QVEG+DFDETFAPVA+LE+IRLL+ ++C
Subjt:  EPTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISC

Query:  FQNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDII
           FKLYQMDVKSAFLNGYL+EEVY              ++YKL KALYGLKQAPRAWYE LT++L  +GY +   DKTLF+   + +L++AQIYVDDI+
Subjt:  FQNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDII

Query:  FGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYL
        FGG    +V +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ KYAKNIVK FG++ + YKRTP  TH K+T D  G+ VD  +Y+SMIGSLLYL
Subjt:  FGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYL

Query:  TASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAE
        TASRPDI +AVG+CA                                                     RKSTSGGCFFLGNNL+SWFSKK+NC+SLSTAE
Subjt:  TASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAE

Query:  AEYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        AEYI AG   +QL+WMK ML +Y + QDVMTL+CDN+SAI+ISKNP+QHSRTKHIDIRHHFIR+L+E  ++TL+H   + QLAD FTK LDA  +E L
Subjt:  AEYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

A0A5D3DIW3 Gag-pol polyprotein1.9e-23888.93Show/hide
Query:  FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYN
        FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTE+VMETINVVVDDYN
Subjt:  FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYN

Query:  NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSA
        NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTE TNIELSP SISKRAT EGTLTILSSHV+KNHPLSSIIGDPSAGIIARRKDK               
Subjt:  NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSA

Query:  IEPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSA
              ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLL              V S 
Subjt:  IEPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSA

Query:  FLNGYLNEEVYYIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKL
        F          YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDT+KSEFEMSLVGKL
Subjt:  FLNGYLNEEVYYIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKL

Query:  SCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAVGICAR
        SCFLGLQIKQRSE IFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAVGICAR
Subjt:  SCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAVGICAR

Q84VH6 Gag-pol polyprotein3.9e-20754.66Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+R GT  TLYE+WKGRK  VK+FHIFGS CYILADRE  +K D KS+ G+FLGYS N RAYRVFN+RT  VME+INVVVDD    
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE
         K+  +E+   S + +A T+       E+++    E + N   KR          S  +QK HP   IIGDP+ G+  R ++    ++++++ C+ S IE
Subjt:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE

Query:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF
        P +V               EL QFKR  VW LVP+ +  N+IGTKWIFKNKT+E G + RNKARLVAQGY Q+EGVDFDETFAPVA+LE+IRLLL ++C 
Subjt:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF

Query:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF
          FKLYQMDVKSAFLNGYLNEE Y              ++Y+L KALYGLKQAPRAWYE LT +L ++GY +   DKTLF+ + + +L++AQIYVDDI+F
Subjt:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF

Query:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT
        GG    ++ +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ KYAKNIVK FG++ + +KRTP  TH K+++D  G +VD  LYRSMIGSLLYLT
Subjt:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT

Query:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA
        ASRPDI YAVG+CA                                                     RKSTSGGCF+LGNNL+SWFSKK+NC+SLSTAEA
Subjt:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA

Query:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        EYI AG   +QL+WMK ML EY + QDVMTLYCDNMSAI+ISKNPVQHSRTKHIDIRHH+IREL+++K+ITL+H     Q+AD FTK LDA  FE L
Subjt:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

Q84VI2 Gag-pol polyprotein1.5e-20654.38Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+R GT  TLYE+WKGRK +VK+FHIFGS CYILADRE  +K D KS+ G+FLGYS N RAYRVFN+RT  VME+INVVVDD +  
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE
         K+  +E+   S + +A  +       E+++    E + N   KR+         S+ +QK HP   IIGDP+ G+  R ++    ++++++ C+ S IE
Subjt:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE

Query:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF
        P +V               EL QFKR  VW LVP+ +  N+IGTKWIFKNKT+E G + RNKARLVAQGY Q+EGVDFDETFAPVA+LE+IRLLL ++C 
Subjt:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF

Query:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF
          FKLYQMDVKSAFLNGYLNEEVY              ++Y+L KALYGLKQAPRAWYE LT +L ++GY +   DKTLF+ + + +L++AQIYVDDI+F
Subjt:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF

Query:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT
        GG    ++ +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ +YAKNIVK FG++ + +KRTP  TH K+++D  G +VD K YRSMIGSLLYLT
Subjt:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT

Query:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA
        ASRPDI YAVG+CA                                                     RKSTSGGCF+LGNNL+SWFSKK+NC+SLSTAEA
Subjt:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA

Query:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        EYI AG   +QL+WMK ML EY + QDVMTLYCDNMSAI+ISKNPVQHSRTKHIDIRHH+IR+L+++K+ITL+H     Q+AD FTK LDA  FE L
Subjt:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

Q84VI4 Gag-pol polyprotein8.7e-20754.38Show/hide
Query:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN
        AEA+NT C+IHNR+T+R GT  TLYE+WKGRK +VK+FHIFGS CYILADRE  +K D KS+ G+FLGYS N RAYRVFN+RT  VME+INVVVDD +  
Subjt:  AEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYNNN

Query:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE
         K+  +E+   S + +A  +       E+++    E + N   KR+         S+ +QK HP   IIGDP+ G+  R ++    ++++++ C+ S IE
Subjt:  DKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE

Query:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF
        P +V               EL QFKR  VW LVP+ +  N+IGTKWIFKNKT+E G + RNKARLVAQGY Q+EGVDFDETFAPVA+LE+IRLLL ++C 
Subjt:  PTSV---------------ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCF

Query:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF
          FKLYQMDVKSAFLNGYLNEEVY              ++Y+L KALYGLKQAPRAWYE LT +L ++GY +   DKTLF+ + + +L++AQIYVDDI+F
Subjt:  QNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIF

Query:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT
        GG    ++ +F+  M+SEFEMSLVG+L+ FLGLQ+KQ  + IF+SQ +YAKNIVK FG++ + +KRTP  TH K+++D  G +VD  LYRSMIGSLLYLT
Subjt:  GGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLT

Query:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA
        ASRPDI YAVG+CA                                                     RKSTSGGCF+LGNNL+SWFSKK+NC+SLSTAEA
Subjt:  ASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFSKKKNCISLSTAEA

Query:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
        EYI AG   +QL+WMK ML EY + QDVMTLYCDNMSAI+ISKNPVQHSRTKHIDIRHH+IR+L+++K+ITL+H     Q+AD FTK LDA  FE L
Subjt:  EYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-7032.92Show/hide
Query:  EPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSAF
        E  + EL   K  N WT+  + +  NI+ ++W+F  K +E G  IR KARLVA+G+ Q   +D++ETFAPVA++ + R +LS+    N K++QMDVK+AF
Subjt:  EPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSAF

Query:  LNGYLNEEVYY------------IYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFI--NRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDT
        LNG L EE+Y             + KLNKA+YGLKQA R W+E     L E  +     D+ ++I         I   +YVDD++      T +NNF   
Subjt:  LNGYLNEEVYY------------IYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFI--NRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDT

Query:  MKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVD-HKLYRSMIGSLLY-LTASRPDIAYAVGI
        +  +F M+ + ++  F+G++I+ + + I++SQ  Y K I+  F ++      TP    +KI  +++    D +   RS+IG L+Y +  +RPD+  AV I
Subjt:  MKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVD-HKLYRSMIGSLLY-LTASRPDIAYAVGI

Query:  CA-------------------------------------------------------RKSTSGGCFFLGN-NLVSWFSKKKNCISLSTAEAEYIVAGIEF
         +                                                       RKST+G  F + + NL+ W +K++N ++ S+ EAEY+      
Subjt:  CA-------------------------------------------------------RKSTSGGCFFLGN-NLVSWFSKKKNCISLSTAEAEYIVAGIEF

Query:  TQLIWMKNMLNEYGI-IQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL
         + +W+K +L    I +++ + +Y DN   I I+ NP  H R KHIDI++HF RE ++N +I L++ P  +QLAD FTKPL A  F  L
Subjt:  TQLIWMKNMLNEYGI-IQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-7627.59Show/hide
Query:  FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYN
        F  EAV T C++ NR             +W  ++++  +  +FG   +    +E   K D KS   +F+GY      YR+++   + V+ + +VV   + 
Subjt:  FLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVDDYN

Query:  NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQK-----NHPLSS------IIGDPSAGIIARRKDKVDYL
         ++ +   +  E  +  I P    + +P      T+ E + + +S++    G +      + +      HP         +       + +RR    +Y+
Subjt:  NNDKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQK-----NHPLSS------IIGDPSAGIIARRKDKVDYL

Query:  KMIADLCYTSAIEPTS------------VELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEA
         +  D    S  E  S             E+   ++   + LV        +  KW+FK K D    ++R KARLV +G+ Q +G+DFDE F+PV K+ +
Subjt:  KMIADLCYTSAIEPTS------------VELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEA

Query:  IRLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTS-THLI
        IR +LS++   + ++ Q+DVK+AFL+G L EE+Y               + KLNK+LYGLKQAPR WY     ++  + Y +  +D  ++  R S  + I
Subjt:  IRLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVY--------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTS-THLI

Query:  VAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQI--KQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHK
        +  +YVDD++  G  K L+      +   F+M  +G     LG++I  ++ S  +++SQ+KY + +++ F +  ++   TP   H K+++ +    V+ K
Subjt:  VAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQI--KQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHK

Query:  ------LYRSMIGSLLY-LTASRPDIAYAVGICA----------------------------------------------------RKSTSGGCFFLGNN
               Y S +GSL+Y +  +RPDIA+AVG+ +                                                    RKS++G  F     
Subjt:  ------LYRSMIGSLLY-LTASRPDIAYAVGICA----------------------------------------------------RKSTSGGCFFLGNN

Query:  LVSWFSKKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQL
         +SW SK + C++LST EAEYI A     ++IW+K  L E G+ Q    +YCD+ SAID+SKN + H+RTKHID+R+H+IRE+++++ + +     N   
Subjt:  LVSWFSKKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQL

Query:  ADTFTKPLDATMFE
        AD  TK +    FE
Subjt:  ADTFTKPLDATMFE

P92519 Uncharacterized mitochondrial protein AtMg008102.0e-1930.04Show/hide
Query:  IYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDI-VGIAVDHKLYRS
        +YVDDI+  G   TL+N  I  + S F M  +G +  FLG+QIK     +F+SQ KYA+ I+ N G+   +   TP     K+   +      D   +RS
Subjt:  IYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDI-VGIAVDHKLYRS

Query:  MIGSLLYLTASRPDIAYAVGI-----------------------------------------------------CARKSTSGGCFFLGNNLVSWFSKKKN
        ++G+L YLT +RPDI+YAV I                                                       R+ST+G C FLG N++SW +K++ 
Subjt:  MIGSLLYLTASRPDIAYAVGI-----------------------------------------------------CARKSTSGGCFFLGNNLVSWFSKKKN

Query:  CISLSTAEAEYIVAGIEFTQLIW
         +S S+ E EY    +   +L W
Subjt:  CISLSTAEAEYIVAGIEFTQLIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-6633.33Show/hide
Query:  NVWTLV-PKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVY--
        + W LV P      I+G +WIF  K +  G + R KARLVA+GY Q  G+D+ ETF+PV K  +IR++L ++  +++ + Q+DV +AFL G L ++VY  
Subjt:  NVWTLV-PKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVY--

Query:  ------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGK
                    Y+ KL KALYGLKQAPRAWY  L  YL   G+    +D +LF+ +    ++   +YVDDI+  G   TL++N +D +   F +    +
Subjt:  ------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGK

Query:  LSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAV----------------
        L  FLG++ K+    + +SQ++Y  +++    +  ++   TP     K++        D   YR ++GSL YL  +RPDI+YAV                
Subjt:  LSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAV----------------

Query:  ---------------GICARK----------------------STSGGCFFLGNNLVSWFSKKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGI-I
                       GI  +K                      ST+G   +LG++ +SW SKK+  +  S+ EAEY       +++ W+ ++L E GI +
Subjt:  ---------------GICARK----------------------STSGGCFFLGNNLVSWFSKKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGI-I

Query:  QDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEH
             +YCDN+ A  +  NPV HSR KHI I +HFIR  +++  + + H   + QLADT TKPL  T F++
Subjt:  QDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.0e-6632.91Show/hide
Query:  NVWTLV-PKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVY--
        + W LV P      I+G +WIF  K +  G + R KARLVA+GY Q  G+D+ ETF+PV K  +IR++L ++  +++ + Q+DV +AFL G L +EVY  
Subjt:  NVWTLV-PKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVY--

Query:  ------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGK
                    Y+ +L KA+YGLKQAPRAWY  L  YL   G+    +D +LF+ +    +I   +YVDDI+  G    L+ + +D +   F +     
Subjt:  ------------YIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGK

Query:  LSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAV----------------
        L  FLG++ K+  + + +SQ++Y  +++    +  ++   TP  T  K+T        D   YR ++GSL YL  +RPD++YAV                
Subjt:  LSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAV----------------

Query:  ---------------GICARK----------------------STSGGCFFLGNNLVSWFSKKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGI-I
                       GI  +K                      ST+G   +LG++ +SW SKK+  +  S+ EAEY       ++L W+ ++L E GI +
Subjt:  ---------------GICARK----------------------STSGGCFFLGNNLVSWFSKKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGI-I

Query:  QDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEH
             +YCDN+ A  +  NPV HSR KHI + +HFIR  +++  + + H   + QLADT TKPL    F++
Subjt:  QDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.1e-6332.29Show/hide
Query:  IARRKDKVDYLKMIADLCYTSAIEPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAI
        IA+ K+   Y +    L +  A++    E+   +  + W +         IG KW++K K +  G + R KARLVA+GY Q EG+DF ETF+PV KL ++
Subjt:  IARRKDKVDYLKMIADLCYTSAIEPTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAI

Query:  RLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVYY------------------IYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTH
        +L+L+IS   NF L+Q+D+ +AFLNG L+EE+Y                   +  L K++YGLKQA R W+   ++ L   G+ +  +D T F+  T+T 
Subjt:  RLLLSISCFQNFKLYQMDVKSAFLNGYLNEEVYY------------------IYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTH

Query:  LIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHK
         +   +YVDDII        V+     +KS F++  +G L  FLGL+I + +  I I Q+KYA +++   GL   +    P       +    G  VD K
Subjt:  LIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHK

Query:  LYRSMIGSLLYLTASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFS
         YR +IG L+YL  +R DI++AV   +                                                     R+ST+G C FLG +L+SW S
Subjt:  LYRSMIGSLLYLTASRPDIAYAVGICA-----------------------------------------------------RKSTSGGCFFLGNNLVSWFS

Query:  KKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGI-IQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRE
        KK+  +S S+AEAEY        +++W+     E  + +     L+CDN +AI I+ N V H RTKHI+   H +RE
Subjt:  KKKNCISLSTAEAEYIVAGIEFTQLIWMKNMLNEYGI-IQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein1.5e-2030.04Show/hide
Query:  IYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDI-VGIAVDHKLYRS
        +YVDDI+  G   TL+N  I  + S F M  +G +  FLG+QIK     +F+SQ KYA+ I+ N G+   +   TP     K+   +      D   +RS
Subjt:  IYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRSECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDI-VGIAVDHKLYRS

Query:  MIGSLLYLTASRPDIAYAVGI-----------------------------------------------------CARKSTSGGCFFLGNNLVSWFSKKKN
        ++G+L YLT +RPDI+YAV I                                                       R+ST+G C FLG N++SW +K++ 
Subjt:  MIGSLLYLTASRPDIAYAVGI-----------------------------------------------------CARKSTSGGCFFLGNNLVSWFSKKKN

Query:  CISLSTAEAEYIVAGIEFTQLIW
         +S S+ E EY    +   +L W
Subjt:  CISLSTAEAEYIVAGIEFTQLIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.0e-1344.3Show/hide
Query:  ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSIS
        EL    R   W LVP     NI+G KW+FK K    G + R KARLVA+G+ Q EG+ F ET++PV +   IR +L+++
Subjt:  ELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAAAGTTTACCACCGCATTTTTTTGGCAGAAGCAGTTAACACAACTTGCCATATTCACAACAGAATTACTATTCGATCTGGAACTAATGTGACCTTATAT
GAGCTATGGAAAGGCAGGAAGCTTAATGTTAAATATTTTCATATCTTCGGAAGTACATGTTATATTCTTGCTGACAGAGAATATCATCAGAAGTGGGATGCAAAG
TCAGAACATGGACTATTCCTTGGATATTCCCAGAACAGGAGAGCTTATAGAGTCTTCAACAATCGAACTGAATTGGTTATGGAAACAATCAATGTTGTGGTTGAT
GATTATAACAATAATGATAAGCAAATTGATGACGAGGAGGATGAAGCATCTGAGGAGACTATAGCTCCAACATCTACACCTATTGTTGTACCCAAAGAGGATACT
GAGGTAACTAATATAGAGTTAAGCCCTAATTCTATATCAAAAAGGGCCACTGCTGAAGGGACGTTAACAATTCTTTCATCACATGTCCAGAAAAATCATCCCTTA
AGCTCAATTATCGGTGATCCCTCTGCTGGAATTATCGCTAGAAGGAAGGACAAAGTAGATTATCTGAAAATGATAGCTGACTTGTGTTATACTTCAGCTATTGAA
CCTACATCAGTTGAGTTACTACAGTTCAAGCGTAAAAATGTATGGACTTTGGTTCCTAAACTTGATGAGGCAAACATCATAGGAACCAAGTGGATCTTTAAAAAT
AAAACCGATGAATCTGGGTGTGTAATAAGGAACAAAGCTCGTTTGGTGGCTCAGGGCTATGCACAGGTAGAAGGTGTTGATTTTGATGAAACCTTTGCACCTGTG
GCTAAACTTGAAGCTATTCGCCTGTTGCTCAGTATATCATGTTTCCAAAATTTTAAATTGTATCAAATGGACGTTAAAAGTGCTTTTCTGAATGGATACTTGAAT
GAAGAAGTCTATTATATCTACAAGCTTAATAAAGCTCTATATGGATTAAAGCAAGCACCTAGGGCTTGGTATGAATGTTTAACAATGTATCTGGGTGAGAAAGGA
TATTCCAGGGAAGAAACTGACAAGACACTGTTTATTAATAGAACAAGCACTCATCTCATTGTAGCTCAAATCTATGTTGATGATATTATCTTTGGTGGATTTCCT
AAAACACTTGTTAATAACTTCATTGACACAATGAAATCAGAATTCGAAATGAGCTTAGTAGGCAAATTGTCTTGCTTTCTGGGGTTGCAGATCAAACAGAGAAGT
GAATGTATATTTATATCGCAAAAGAAGTATGCCAAGAACATAGTCAAGAATTTTGGTCTAGATCAGTCACAATACAAAAGGACTCCAACTACGACACATGCTAAA
ATTACCGAGGATATTGTTGGTATCGCAGTAGATCATAAACTGTACAGGAGCATGATTGGGAGCCTCTTATATTTAACAGCAAGCAGACCTGATATTGCCTATGCT
GTTGGAATATGTGCTCGGAAAAGCACCTCTGGTGGATGTTTCTTTCTTGGAAATAATCTTGTTTCATGGTTCAGTAAGAAAAAAAATTGTATATCTCTTTCTACA
GCAGAAGCTGAGTATATAGTTGCAGGGATTGAGTTTACTCAATTGATATGGATGAAAAACATGTTGAATGAATATGGGATTATCCAAGATGTTATGACTTTATAT
TGTGATAATATGAGTGCTATAGATATATCGAAAAATCCAGTCCAGCATAGTCGAACTAAGCACATTGATATAAGACATCATTTTATTAGAGAGCTCATTGAAAAT
AAGATTATTACATTGCAACACTTTCCCGCGAACTCACAATTGGCAGATACTTTTACTAAACCCCTTGATGCAACCATGTTTGAGCATTTACACGTTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAAAGTTTACCACCGCATTTTTTTGGCAGAAGCAGTTAACACAACTTGCCATATTCACAACAGAATTACTATTCGATCTGGAACTAATGTGACCTTATAT
GAGCTATGGAAAGGCAGGAAGCTTAATGTTAAATATTTTCATATCTTCGGAAGTACATGTTATATTCTTGCTGACAGAGAATATCATCAGAAGTGGGATGCAAAG
TCAGAACATGGACTATTCCTTGGATATTCCCAGAACAGGAGAGCTTATAGAGTCTTCAACAATCGAACTGAATTGGTTATGGAAACAATCAATGTTGTGGTTGAT
GATTATAACAATAATGATAAGCAAATTGATGACGAGGAGGATGAAGCATCTGAGGAGACTATAGCTCCAACATCTACACCTATTGTTGTACCCAAAGAGGATACT
GAGGTAACTAATATAGAGTTAAGCCCTAATTCTATATCAAAAAGGGCCACTGCTGAAGGGACGTTAACAATTCTTTCATCACATGTCCAGAAAAATCATCCCTTA
AGCTCAATTATCGGTGATCCCTCTGCTGGAATTATCGCTAGAAGGAAGGACAAAGTAGATTATCTGAAAATGATAGCTGACTTGTGTTATACTTCAGCTATTGAA
CCTACATCAGTTGAGTTACTACAGTTCAAGCGTAAAAATGTATGGACTTTGGTTCCTAAACTTGATGAGGCAAACATCATAGGAACCAAGTGGATCTTTAAAAAT
AAAACCGATGAATCTGGGTGTGTAATAAGGAACAAAGCTCGTTTGGTGGCTCAGGGCTATGCACAGGTAGAAGGTGTTGATTTTGATGAAACCTTTGCACCTGTG
GCTAAACTTGAAGCTATTCGCCTGTTGCTCAGTATATCATGTTTCCAAAATTTTAAATTGTATCAAATGGACGTTAAAAGTGCTTTTCTGAATGGATACTTGAAT
GAAGAAGTCTATTATATCTACAAGCTTAATAAAGCTCTATATGGATTAAAGCAAGCACCTAGGGCTTGGTATGAATGTTTAACAATGTATCTGGGTGAGAAAGGA
TATTCCAGGGAAGAAACTGACAAGACACTGTTTATTAATAGAACAAGCACTCATCTCATTGTAGCTCAAATCTATGTTGATGATATTATCTTTGGTGGATTTCCT
AAAACACTTGTTAATAACTTCATTGACACAATGAAATCAGAATTCGAAATGAGCTTAGTAGGCAAATTGTCTTGCTTTCTGGGGTTGCAGATCAAACAGAGAAGT
GAATGTATATTTATATCGCAAAAGAAGTATGCCAAGAACATAGTCAAGAATTTTGGTCTAGATCAGTCACAATACAAAAGGACTCCAACTACGACACATGCTAAA
ATTACCGAGGATATTGTTGGTATCGCAGTAGATCATAAACTGTACAGGAGCATGATTGGGAGCCTCTTATATTTAACAGCAAGCAGACCTGATATTGCCTATGCT
GTTGGAATATGTGCTCGGAAAAGCACCTCTGGTGGATGTTTCTTTCTTGGAAATAATCTTGTTTCATGGTTCAGTAAGAAAAAAAATTGTATATCTCTTTCTACA
GCAGAAGCTGAGTATATAGTTGCAGGGATTGAGTTTACTCAATTGATATGGATGAAAAACATGTTGAATGAATATGGGATTATCCAAGATGTTATGACTTTATAT
TGTGATAATATGAGTGCTATAGATATATCGAAAAATCCAGTCCAGCATAGTCGAACTAAGCACATTGATATAAGACATCATTTTATTAGAGAGCTCATTGAAAAT
AAGATTATTACATTGCAACACTTTCCCGCGAACTCACAATTGGCAGATACTTTTACTAAACCCCTTGATGCAACCATGTTTGAGCATTTACACGTTCTGTAA
Protein sequenceShow/hide protein sequence
MLKVYHRIFLAEAVNTTCHIHNRITIRSGTNVTLYELWKGRKLNVKYFHIFGSTCYILADREYHQKWDAKSEHGLFLGYSQNRRAYRVFNNRTELVMETINVVVD
DYNNNDKQIDDEEDEASEETIAPTSTPIVVPKEDTEVTNIELSPNSISKRATAEGTLTILSSHVQKNHPLSSIIGDPSAGIIARRKDKVDYLKMIADLCYTSAIE
PTSVELLQFKRKNVWTLVPKLDEANIIGTKWIFKNKTDESGCVIRNKARLVAQGYAQVEGVDFDETFAPVAKLEAIRLLLSISCFQNFKLYQMDVKSAFLNGYLN
EEVYYIYKLNKALYGLKQAPRAWYECLTMYLGEKGYSREETDKTLFINRTSTHLIVAQIYVDDIIFGGFPKTLVNNFIDTMKSEFEMSLVGKLSCFLGLQIKQRS
ECIFISQKKYAKNIVKNFGLDQSQYKRTPTTTHAKITEDIVGIAVDHKLYRSMIGSLLYLTASRPDIAYAVGICARKSTSGGCFFLGNNLVSWFSKKKNCISLST
AEAEYIVAGIEFTQLIWMKNMLNEYGIIQDVMTLYCDNMSAIDISKNPVQHSRTKHIDIRHHFIRELIENKIITLQHFPANSQLADTFTKPLDATMFEHLHVL