; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G007450 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G007450
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionCCHC-type domain-containing protein
Genome locationCmo_Chr18:9150990..9154927
RNA-Seq ExpressionCmoCh18G007450
SyntenyCmoCh18G007450
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004491 - methylmalonate-semialdehyde dehydrogenase (acylating) activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW50731.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.4e-14648.38Show/hide
Query:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK---------------------------------NDEVICYECK
        MT+EI++   ++E E KKKKSIALK+     E+EDV +E      DD+A  TR+ +K                                   ++IC++CK
Subjt:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK---------------------------------NDEVICYECK

Query:  KPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKD
        KPGHI+ DCPL K  +K+  KKAM ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ 
Subjt:  KPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKD

Query:  GGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS--
        GG VTFGDN KG+IIG+                                                             +E + G L + D  ++  +   
Subjt:  GGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS--

Query:  -KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV
         K+E S         ++ E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV
Subjt:  -KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV

Query:  PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFEN
        PRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++
Subjt:  PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFEN

Query:  VEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGL
          FP+HV+KLKKALYGLKQAPRAWY+RL+                                                F+KCMHSEFEMSMMGEL++FLGL
Subjt:  VEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGL

Query:  QIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        QIKQLK+G FINQ KY KDLLKRF     K+ +TPMS+S KLD DEK
Subjt:  QIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.5e-14375.99Show/hide
Query:  EGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNK
        E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNK
Subjt:  EGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNK

Query:  MDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLK
        MDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLK
Subjt:  MDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLK

Query:  QAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTK
        QAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGLQIKQLK+G FINQ KY K
Subjt:  QAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTK

Query:  DLLKRFKFNGGKIARTPMSTSTKLDKDEK
        DLLKRF     K+ +TPMS+S KLD DEK
Subjt:  DLLKRFKFNGGKIARTPMSTSTKLDKDEK

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.3e-1751.82Show/hide
Query:  KKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTI
        KKAM ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+G I
Subjt:  KKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTI

Query:  ERNFGDLLVS
              L+ S
Subjt:  ERNFGDLLVS

RVW80634.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.1e-17554.17Show/hide
Query:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC
        MT+EI++   ++E E KKKKSIALK+     E+EDV +E      DD+A  TR+ +K                                    ++IC++C
Subjt:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC

Query:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK
        KKPGHI+ DCPL K  +K+  KKAM ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+
Subjt:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK

Query:  DGGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS-
         GG VTFGDN KG+IIG+                                                             +E + G L + D  ++  +  
Subjt:  DGGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS-

Query:  --KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL
          K+E S         ++ E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWEL
Subjt:  --KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL

Query:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE
        VPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF+
Subjt:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE

Query:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG
        +  FP+HV+KLKKALYGLKQAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLG
Subjt:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG

Query:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        LQIKQLK+G FINQ KY KDLLKRF     K+ +TPMS+S KLD DEK
Subjt:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

RVW93906.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.5e-17253.55Show/hide
Query:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC
        MT+EI++   ++E E KKKKSIALK+     E+EDV +E      DD+A  TR+ +K                                    ++IC++C
Subjt:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC

Query:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK
        KKPGHI+ DCPL K  +K+  KKAM ATW +S+ES     ++EVAN CFMA  D  DE             SK++KW+LDSGCSRHMTG+ SKF  L+K+
Subjt:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK

Query:  DGGLVTFGDNKKGKIIGKGTIER-----------------------------------------------------------NFGDLLVSDNGKEIVT--
         GG VTFGDN KG+IIG+  +E+                                                           + G L + D  ++  +  
Subjt:  DGGLVTFGDNKKGKIIGKGTIER-----------------------------------------------------------NFGDLLVSDNGKEIVT--

Query:  --SKEEVSL--------KEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL
           KEE  L        + E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWEL
Subjt:  --SKEEVSL--------KEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL

Query:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE
        VPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETF  VARLEAIRMLLAFA +K+F+LYQMDVKS FLNG+I EEVYVEQPP F+
Subjt:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE

Query:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG
        +  FP+HV+KLKKALYGLKQAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLG
Subjt:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG

Query:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        LQIKQLK+G F+NQ KY KDLLKRF     K+ +TPMS+  KLD DEK
Subjt:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

RVW98982.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.6e-17654.25Show/hide
Query:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK---------------------------------NDEVICYECK
        MT+EI++   ++E E KKKKSIALK+     E+EDV +E      DD+A  TR+ +K                                   ++IC++CK
Subjt:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK---------------------------------NDEVICYECK

Query:  KPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKD
        KPGHI+ DCPL K  +K+  KKAM ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ 
Subjt:  KPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKD

Query:  GGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS--
        GG VTFGDN KG+IIG+                                                             +E + G L + D  ++  +   
Subjt:  GGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS--

Query:  -KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV
         K+E S         ++ E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV
Subjt:  -KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV

Query:  PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFEN
        PRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++
Subjt:  PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFEN

Query:  VEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGL
          FP+HV+KLKKALYGLKQAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGL
Subjt:  VEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGL

Query:  QIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        QIKQLK+G FINQ KY KDLLKRF     K+ +TPMS+S KLD DEK
Subjt:  QIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

TrEMBL top hitse value%identityAlignment
A0A2N9G5J4 CCHC-type domain-containing protein4.8e-17257.87Show/hide
Query:  MTHEITMNGHMEEES-KKKKSIALKSI--KVDSEDEDVLDEDDVAYFTR------------------------EKSKNDEVICYECKKPGHIRTDCPLLK
        MT+E+ MN  +EEE  K KK+ ALKS     D+ +E+  +E+++A  TR                        E SK +   CY+CKK GH + +CP + 
Subjt:  MTHEITMNGHMEEES-KKKKSIALKSI--KVDSEDEDVLDEDDVAYFTR------------------------EKSKNDEVICYECKKPGHIRTDCPLLK

Query:  SSK-KSKKKAMKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMT
          K K KKKA+K TWDDSDES S    SD EVAN C + + ++ +  EDE                         DEVCL   S K KW+LDSGCSRHMT
Subjt:  SSK-KSKKKAMKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMT

Query:  GNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEVSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NL
        G+ +KF +L+ KDGG V FGDN KGKIIG   I  +   + +S+  K+ V     EE +L    +  +PK W    SHPK+LI+G++E+GV TRS + N+
Subjt:  GNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEVSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NL

Query:  FNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLE
         NN+AF+SQIEPK++ +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLE
Subjt:  FNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLE

Query:  AIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDML
        AIRMLLAFA +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRAWY+RLS FLI   F  GKLDTTLF+     DML
Subjt:  AIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDML

Query:  LVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        +VQIYVDDIIFGSTN +LC+EF+K M  EFEMSMMGEL FFLGLQIKQ +DGIF+NQ KY  DLLKRF     K   TPMS STKLDKDEK
Subjt:  LVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

A0A2N9J511 CCHC-type domain-containing protein4.8e-17257.87Show/hide
Query:  MTHEITMNGHMEEES-KKKKSIALKSI--KVDSEDEDVLDEDDVAYFTR------------------------EKSKNDEVICYECKKPGHIRTDCPLLK
        MT+E+ MN  +EEE  K KK+ ALKS     D+ +E+  +E+++A  TR                        E SK +   CY+CKK GH + +CP + 
Subjt:  MTHEITMNGHMEEES-KKKKSIALKSI--KVDSEDEDVLDEDDVAYFTR------------------------EKSKNDEVICYECKKPGHIRTDCPLLK

Query:  SSK-KSKKKAMKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMT
          K K KKKA+K TWDDSDES S    SD EVAN C + + ++ +  EDE                         DEVCL   S K KW+LDSGCSRHMT
Subjt:  SSK-KSKKKAMKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMT

Query:  GNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEVSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NL
        G+ +KF +L+ KDGG V FGDN KGKIIG   I  +   + +S+  K+ V     EE +L    +  +PK W    SHPK+LI+G++E+GV TRS + N+
Subjt:  GNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEVSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NL

Query:  FNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLE
         NN+AF+SQIEPK++ +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLE
Subjt:  FNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLE

Query:  AIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDML
        AIRMLLAFA +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRAWY+RLS FLI   F  GKLDTTLF+     DML
Subjt:  AIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDML

Query:  LVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        +VQIYVDDIIFGSTN +LC+EF+K M  EFEMSMMGEL FFLGLQIKQ +DGIF+NQ KY  DLLKRF     K   TPMS STKLDKDEK
Subjt:  LVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

A0A438H7V2 Retrovirus-related Pol polyprotein from transposon RE15.4e-17654.17Show/hide
Query:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC
        MT+EI++   ++E E KKKKSIALK+     E+EDV +E      DD+A  TR+ +K                                    ++IC++C
Subjt:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC

Query:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK
        KKPGHI+ DCPL K  +K+  KKAM ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+
Subjt:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK

Query:  DGGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS-
         GG VTFGDN KG+IIG+                                                             +E + G L + D  ++  +  
Subjt:  DGGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS-

Query:  --KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL
          K+E S         ++ E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWEL
Subjt:  --KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL

Query:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE
        VPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF+
Subjt:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE

Query:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG
        +  FP+HV+KLKKALYGLKQAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLG
Subjt:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG

Query:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        LQIKQLK+G FINQ KY KDLLKRF     K+ +TPMS+S KLD DEK
Subjt:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

A0A438IB12 Retrovirus-related Pol polyprotein from transposon RE13.6e-17253.55Show/hide
Query:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC
        MT+EI++   ++E E KKKKSIALK+     E+EDV +E      DD+A  TR+ +K                                    ++IC++C
Subjt:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK----------------------------------NDEVICYEC

Query:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK
        KKPGHI+ DCPL K  +K+  KKAM ATW +S+ES     ++EVAN CFMA  D  DE             SK++KW+LDSGCSRHMTG+ SKF  L+K+
Subjt:  KKPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKK

Query:  DGGLVTFGDNKKGKIIGKGTIER-----------------------------------------------------------NFGDLLVSDNGKEIVT--
         GG VTFGDN KG+IIG+  +E+                                                           + G L + D  ++  +  
Subjt:  DGGLVTFGDNKKGKIIGKGTIER-----------------------------------------------------------NFGDLLVSDNGKEIVT--

Query:  --SKEEVSL--------KEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL
           KEE  L        + E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWEL
Subjt:  --SKEEVSL--------KEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWEL

Query:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE
        VPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETF  VARLEAIRMLLAFA +K+F+LYQMDVKS FLNG+I EEVYVEQPP F+
Subjt:  VPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFE

Query:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG
        +  FP+HV+KLKKALYGLKQAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLG
Subjt:  NVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLG

Query:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        LQIKQLK+G F+NQ KY KDLLKRF     K+ +TPMS+  KLD DEK
Subjt:  LQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

A0A438IQK9 Retrovirus-related Pol polyprotein from transposon RE14.2e-17654.25Show/hide
Query:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK---------------------------------NDEVICYECK
        MT+EI++   ++E E KKKKSIALK+     E+EDV +E      DD+A  TR+ +K                                   ++IC++CK
Subjt:  MTHEITMNGHMEE-ESKKKKSIALKSIKVDSEDEDVLDE------DDVAYFTREKSK---------------------------------NDEVICYECK

Query:  KPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKD
        KPGHI+ DCPL K  +K+  KKAM ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ 
Subjt:  KPGHIRTDCPLLK-SSKKSKKKAMKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKD

Query:  GGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS--
        GG VTFGDN KG+IIG+                                                             +E + G L + D  ++  +   
Subjt:  GGLVTFGDNKKGKIIGKGT-----------------------------------------------------------IERNFGDLLVSDNGKEIVTS--

Query:  -KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV
         K+E S         ++ E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV
Subjt:  -KEEVS---------LKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV

Query:  PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFEN
        PRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++
Subjt:  PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFEN

Query:  VEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGL
          FP+HV+KLKKALYGLKQAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGL
Subjt:  VEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGL

Query:  QIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        QIKQLK+G FINQ KY KDLLKRF     K+ +TPMS+S KLD DEK
Subjt:  QIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.8e-4735.84Show/hide
Query:  WILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKS
        W  A+  ELN  + N  W +  RP N +I+ ++WVF  K +E GN +R KARLVA+G+ Q+  IDYEETFAPVAR+ + R +L+     N  ++QMDVK+
Subjt:  WILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKS

Query:  AFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFI--KIKENDMLLVQIYVDDIIFGSTNPSLCEE
        AFLNG + EE+Y+  P G        +V KL KA+YGLKQA R W++     L   +F    +D  ++I  K   N+ + V +YVDD++  + + +    
Subjt:  AFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFI--KIKENDMLLVQIYVDDIIFGSTNPSLCEE

Query:  FAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMST--STKLDKDEKDFNLLLRNLIYMRLREYLNIC
        F + +  +F M+ + E+  F+G++I+  +D I+++Q  Y K +L +F         TP+ +  + +L   ++D N   R+LI   +  Y+ +C
Subjt:  FAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMST--STKLDKDEKDFNLLLRNLIYMRLREYLNIC

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.6e-4635.79Show/hide
Query:  EPKSLKDA----ENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL
        EP+SLK+     E ++  + AMQEE+   ++N  ++LV  P     +  KWVF+ K D +  ++R KARLV +G+ Q++GID++E F+PV ++ +IR +L
Subjt:  EPKSLKDA----ENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL

Query:  AFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIK-IKENDMLLVQIY
        + A+  +  + Q+DVK+AFL+G + EE+Y+EQP GFE     H V KL K+LYGLKQAPR WY +  +F+    +     D  ++ K   EN+ +++ +Y
Subjt:  AFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIK-IKENDMLLVQIY

Query:  VDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQI--KQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK
        VDD++    +  L  +    +   F+M  +G     LG++I  ++    ++++QEKY + +L+RF     K   TP++   KL K
Subjt:  VDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQI--KQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK

P92520 Uncharacterized mitochondrial protein AtMg008207.4e-2146.72Show/hide
Query:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGI
        ++++ IN  N   +L   + I  EPKS+  A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI
Subjt:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGI

Query:  DYEETFAPVARLEAIRMLLAFA
         + ET++PV R   IR +L  A
Subjt:  DYEETFAPVARLEAIRMLLAFA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.1e-5639.79Show/hide
Query:  LAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAI
        ++  ++ EP++   A  DE W  AM  E+N    N  W+LV P PS+ +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +I
Subjt:  LAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAI

Query:  RMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLV
        R++L  A  +++ + Q+DV +AFL G + ++VY+ QPPGF + + P++V KL+KALYGLKQAPRAWY  L N+L+   F     DT+LF+  +   ++ +
Subjt:  RMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLV

Query:  QIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKL
         +YVDDI+    +P+L       +   F +    EL +FLG++ K++  G+ ++Q +Y  DLL R      K   TPM+ S KL
Subjt:  QIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-5439.35Show/hide
Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP++   A  D+ W  AM  E+N    N  W+LV P P + +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDI
          +++ + Q+DV +AFL G + +EVY+ QPPGF + + P +V +L+KA+YGLKQAPRAWY  L  +L+   F     DT+LF+  +   ++ + +YVDDI
Subjt:  SYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDI

Query:  IFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKL
        +    +  L +     +   F +    +L +FLG++ K++  G+ ++Q +YT DLL R      K   TPM+TS KL
Subjt:  IFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.1e-5539.22Show/hide
Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EP +  +A+    W  AM +E+   E    WE+   P N   IG KWV++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEF----PHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYV
          NF L+Q+D+ +AFLNG + EE+Y++ PPG+   +     P+ V  LKK++YGLKQA R W+ + S  LIG  F     D T F+KI     L V +YV
Subjt:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEF----PHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYV

Query:  DDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK-------DEKDFNLLLRN
        DDII  S N +  +E    + S F++  +G L +FLGL+I +   GI I Q KY  DLL      G K +  PM  S            D K +  L+  
Subjt:  DDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK-------DEKDFNLLLRN

Query:  LIYMRL
        L+Y+++
Subjt:  LIYMRL

ATMG00810.1 DNA/RNA polymerases superfamily protein3.7e-0732.14Show/hide
Query:  IYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMST----------STKLDKDEKDF
        +YVDDI+   ++ +L       + S F M  +G + +FLG+QIK    G+F++Q KY + +L     N G +   PMST          ST    D  DF
Subjt:  IYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMST----------STKLDKDEKDF

Query:  NLLLRNLIYMRL
          ++  L Y+ L
Subjt:  NLLLRNLIYMRL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.3e-2246.72Show/hide
Query:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGI
        ++++ IN  N   +L   + I  EPKS+  A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI
Subjt:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGI

Query:  DYEETFAPVARLEAIRMLLAFA
         + ET++PV R   IR +L  A
Subjt:  DYEETFAPVARLEAIRMLLAFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACATGAGATTACCATGAACGGGCACATGGAAGAAGAATCCAAAAAGAAGAAAAGTATAGCCCTAAAGTCCATCAAAGTTGACTCTGAAGATGAGGACGTC
CTTGATGAAGATGATGTCGCCTACTTCACACGTGAGAAGAGCAAAAACGATGAGGTAATATGTTATGAATGTAAAAAGCCGGGTCATATAAGAACGGATTGTCCT
CTCCTCAAATCATCTAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGGGATGATAGTGATGAAAGTGCAAGTGGAAGTGATGAAGAAGTAGCAAATTTTTGC
TTCATGGCTCATAGTGACAAAGAGGATGAACAAGAAGATGAACAAGAAGATGAGGTTTGTTTGAAAGCCTCCAAGAAAAATAAGTGGTACTTGGATAGTGGCTGC
TCAAGACACATGACGGGTAATCCATCCAAGTTTGTGAATCTTTCCAAGAAGGATGGTGGCTTAGTAACCTTTGGTGATAACAAGAAGGGTAAAATAATTGGTAAG
GGTACTATAGAAAGAAATTTTGGAGACTTACTCGTTAGTGACAACGGCAAGGAAATTGTTACGAGTAAAGAAGAGGTGAGCTTAAAGGAAGAAGGTTCTTCATCA
ATGCCAAAAGAATGGAGGTATGCCTTGTCTCATCCCAAAGACTTGATTCTTGGTGATCTCGAACAAGGTGTGAAAACTCGCTCTTCTATTAATTTATTTAATAAT
CTTGCTTTTGTTTCTCAAATTGAACCAAAAAGTCTTAAGGATGCCGAAAATGATGAGTTTTGGATTTTAGCCATGCAAGAAGAGCTAAATCAATTTGAAAGGAAC
AAAGTTTGGGAATTAGTCCCGAGGCCATCTAATACTTCTATTATTGGAACCAAATGGGTTTTTAGAAATAAAATGGATGAAAATGGGAATATCATGAGAAATAAA
GCTAGACTTGTAGCTCAAGGTTATTGTCAAGAGGAAGGCATAGATTATGAAGAAACTTTTGCACCCGTTGCTAGATTAGAAGCTATTAGAATGTTACTTGCTTTT
GCTTCTTACAAAAATTTTGTATTATATCAAATGGATGTGAAAAGTGCATTTTTGAATGGTTATATTATGGAGGAAGTTTATGTAGAACAACCTCCCGGATTTGAA
AATGTAGAATTTCCTCATCATGTCTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGTAATTTTCTTATTGGG
AATGATTTTAAAATGGGCAAACTCGACACTACACTCTTTATTAAGATTAAAGAAAACGATATGCTATTAGTGCAAATATATGTGGATGATATCATATTTGGTTCT
ACTAATCCTTCTTTATGTGAAGAATTTGCTAAATGTATGCATAGTGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTTCTTGGACTTCAAATCAAACAA
CTCAAGGATGGTATCTTCATCAATCAAGAGAAATACACTAAAGATTTGCTCAAAAGATTCAAGTTCAATGGAGGTAAGATTGCAAGAACTCCCATGAGCACATCC
ACTAAGCTTGACAAGGATGAAAAAGATTTCAATCTTCTCCTAAGGAATCTCATTTACATGCGGTTAAGAGAATATTTAAATATTTGCTTGGAACTATTGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACACATGAGATTACCATGAACGGGCACATGGAAGAAGAATCCAAAAAGAAGAAAAGTATAGCCCTAAAGTCCATCAAAGTTGACTCTGAAGATGAGGACGTC
CTTGATGAAGATGATGTCGCCTACTTCACACGTGAGAAGAGCAAAAACGATGAGGTAATATGTTATGAATGTAAAAAGCCGGGTCATATAAGAACGGATTGTCCT
CTCCTCAAATCATCTAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGGGATGATAGTGATGAAAGTGCAAGTGGAAGTGATGAAGAAGTAGCAAATTTTTGC
TTCATGGCTCATAGTGACAAAGAGGATGAACAAGAAGATGAACAAGAAGATGAGGTTTGTTTGAAAGCCTCCAAGAAAAATAAGTGGTACTTGGATAGTGGCTGC
TCAAGACACATGACGGGTAATCCATCCAAGTTTGTGAATCTTTCCAAGAAGGATGGTGGCTTAGTAACCTTTGGTGATAACAAGAAGGGTAAAATAATTGGTAAG
GGTACTATAGAAAGAAATTTTGGAGACTTACTCGTTAGTGACAACGGCAAGGAAATTGTTACGAGTAAAGAAGAGGTGAGCTTAAAGGAAGAAGGTTCTTCATCA
ATGCCAAAAGAATGGAGGTATGCCTTGTCTCATCCCAAAGACTTGATTCTTGGTGATCTCGAACAAGGTGTGAAAACTCGCTCTTCTATTAATTTATTTAATAAT
CTTGCTTTTGTTTCTCAAATTGAACCAAAAAGTCTTAAGGATGCCGAAAATGATGAGTTTTGGATTTTAGCCATGCAAGAAGAGCTAAATCAATTTGAAAGGAAC
AAAGTTTGGGAATTAGTCCCGAGGCCATCTAATACTTCTATTATTGGAACCAAATGGGTTTTTAGAAATAAAATGGATGAAAATGGGAATATCATGAGAAATAAA
GCTAGACTTGTAGCTCAAGGTTATTGTCAAGAGGAAGGCATAGATTATGAAGAAACTTTTGCACCCGTTGCTAGATTAGAAGCTATTAGAATGTTACTTGCTTTT
GCTTCTTACAAAAATTTTGTATTATATCAAATGGATGTGAAAAGTGCATTTTTGAATGGTTATATTATGGAGGAAGTTTATGTAGAACAACCTCCCGGATTTGAA
AATGTAGAATTTCCTCATCATGTCTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGTAATTTTCTTATTGGG
AATGATTTTAAAATGGGCAAACTCGACACTACACTCTTTATTAAGATTAAAGAAAACGATATGCTATTAGTGCAAATATATGTGGATGATATCATATTTGGTTCT
ACTAATCCTTCTTTATGTGAAGAATTTGCTAAATGTATGCATAGTGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTTCTTGGACTTCAAATCAAACAA
CTCAAGGATGGTATCTTCATCAATCAAGAGAAATACACTAAAGATTTGCTCAAAAGATTCAAGTTCAATGGAGGTAAGATTGCAAGAACTCCCATGAGCACATCC
ACTAAGCTTGACAAGGATGAAAAAGATTTCAATCTTCTCCTAAGGAATCTCATTTACATGCGGTTAAGAGAATATTTAAATATTTGCTTGGAACTATTGATTTAG
Protein sequenceShow/hide protein sequence
MTHEITMNGHMEEESKKKKSIALKSIKVDSEDEDVLDEDDVAYFTREKSKNDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESASGSDEEVANFC
FMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIVTSKEEVSLKEEGSSS
MPKEWRYALSHPKDLILGDLEQGVKTRSSINLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNK
ARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIG
NDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTS
TKLDKDEKDFNLLLRNLIYMRLREYLNICLELLI