; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G001760 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G001760
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionCCHC-type domain-containing protein
Genome locationCmo_Chr14:790115..798789
RNA-Seq ExpressionCmoCh14G001760
SyntenyCmoCh14G001760
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004491 - methylmalonate-semialdehyde dehydrogenase (acylating) activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.4e-15776.54Show/hide
Query:  EGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNK
        E S  +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNK
Subjt:  EGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNK

Query:  MDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLK
        MDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLK
Subjt:  MDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLK

Query:  QAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTK
        QAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGLQIKQLK+G FINQ KY K
Subjt:  QAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTK

Query:  DLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS
        DLLKRF     K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM+S
Subjt:  DLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.0e-1752.43Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDL
        M ATW +S+ES+    E EVAN CFMA  D ++          SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+G I      L
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDL

Query:  LVS
        + S
Subjt:  LVS

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.1e-15676.54Show/hide
Query:  EGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNK
        E S  +PK W++ ++HP+D I+G+   GV TRSS+ N+ NNLAF+SQI+PK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNK
Subjt:  EGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNK

Query:  MDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLK
        MDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLK
Subjt:  MDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLK

Query:  QAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTK
        QAPRAWY+RLS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL+FFLGLQIKQLK+G FINQ KY K
Subjt:  QAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTK

Query:  DLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS
        DLLKRF     K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM+S
Subjt:  DLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS

RVW80634.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.3e-17659.74Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------
        M ATW +S+ES+    E EVAN CFMA  D ++          SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+          
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------

Query:  ---------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSSMPKE
                                                           +E + G L + D  ++  + ++               ++ E S  +PK+
Subjt:  ---------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSSMPKE

Query:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR
        W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNKMDENG I+R
Subjt:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR

Query:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR
        NKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRAWY+R
Subjt:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR

Query:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN
        LS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGLQIKQLK+G FINQ KY KDLLKRF   
Subjt:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN

Query:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS
          K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM+S
Subjt:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS

RVW93906.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.5e-17059.02Show/hide
Query:  MKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIER-----
        M ATW +S+ES     ++EVAN CFMA  D  DE         SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+  +E+     
Subjt:  MKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIER-----

Query:  ------------------------------------------------------NFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSSMPKE
                                                              + G L + D  ++  +     KEE  L        + E S  +PK+
Subjt:  ------------------------------------------------------NFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSSMPKE

Query:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR
        W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNKMDENG I+R
Subjt:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR

Query:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR
        NKARLVAQGY QEEGIDYEETF  VARLEAIRMLLAFA +K+F+LYQMDVKS FLNG+I EEVYVEQPP F++  FP+HV+KLKKALYGLKQAPRAWY+R
Subjt:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR

Query:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN
        LS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGLQIKQLK+G F+NQ KY KDLLKRF   
Subjt:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN

Query:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS
          K+ +TPMS+  KLD DEKGKS+D   YRGMIG LLYLTA +PDIM+S
Subjt:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS

RVW98982.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.6e-17560.29Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------
        M ATW +S+ES+    E EVAN CFMA  D ++          SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+          
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------

Query:  ---------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSSMPKE
                                                           +E + G L + D  ++  +     KE+  L        + E S  +PK+
Subjt:  ---------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSSMPKE

Query:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR
        W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNKMDENG I+R
Subjt:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR

Query:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR
        NKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRAWY+R
Subjt:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR

Query:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN
        LS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGLQIKQLK+G FINQ KY KDLLKRF   
Subjt:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN

Query:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS
          K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM S
Subjt:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS

TrEMBL top hitse value%identityAlignment
A0A2N9ERY5 CCHC-type domain-containing protein5.9e-17364.71Show/hide
Query:  MKATWDDSDESAS---GSDEEVAN-------------------FCFMAHSDKEDEQE--------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS
        +K TWDDSDES S    SD EVAN                   FC +A +D E   E        DEVCL   S K KW+LDSGCSRHMTG+ +KF +L+
Subjt:  MKATWDDSDESAS---GSDEEVAN-------------------FCFMAHSDKEDEQE--------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS

Query:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI
         KDGG V FGDN KGKIIG   I  +   + +S+  K+ V     EE +L    +  +PK W    SHPK+LI+G++E GV TRS + ++ NN+AF+SQI
Subjt:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI

Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EPK++ +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLEAIRMLLAFA 
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDII
        +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRAWY+RLS FLI   F  GKLDTTLF+     DML+VQIYVDDII
Subjt:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDII

Query:  FGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYL
        FGSTN +LC+EF+K M  EFEMSMMGEL FFLGLQIKQ +DGIF+NQ KY  DLLKRF     K   TPMS STKLDKDEKGK VD+K YRGMIGSLLYL
Subjt:  FGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYL

Query:  TASRPDIMFS
        TASRPDIMFS
Subjt:  TASRPDIMFS

A0A2N9G3V4 CCHC-type domain-containing protein5.9e-17364.71Show/hide
Query:  MKATWDDSDESAS---GSDEEVAN-------------------FCFMAHSDKEDEQE--------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS
        +K TWDDSDES S    SD EVAN                   FC +A +D E   E        DEVCL   S K KW+LDSGCSRHMTG+ +KF +L+
Subjt:  MKATWDDSDESAS---GSDEEVAN-------------------FCFMAHSDKEDEQE--------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS

Query:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI
         KDGG V FGDN KGKIIG   I  +   + +S+  K+ V     EE +L    +  +PK W    SHPK+LI+G++E GV TRS + ++ NN+AF+SQI
Subjt:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI

Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EPK++ +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLEAIRMLLAFA 
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDII
        +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRAWY+RLS FLI   F  GKLDTTLF+     DML+VQIYVDDII
Subjt:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDII

Query:  FGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYL
        FGSTN +LC+EF+K M  EFEMSMMGEL FFLGLQIKQ +DGIF+NQ KY  DLLKRF     K   TPMS STKLDKDEKGK VD+K YRGMIGSLLYL
Subjt:  FGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYL

Query:  TASRPDIMFS
        TASRPDIMFS
Subjt:  TASRPDIMFS

A0A2N9IDJ4 CCHC-type domain-containing protein6.5e-17264.57Show/hide
Query:  MKATWDDSDESAS---GSDEEVAN-------------------FCFMAHSDKEDEQE--------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS
        +K TWDDSDES S    SD EVAN                   FC +A +D E   E        DEVCL   S K KW+LDSGCSRHMTG+ +KF +L+
Subjt:  MKATWDDSDESAS---GSDEEVAN-------------------FCFMAHSDKEDEQE--------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS

Query:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIVTSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEP
         KDGG V FGDN KGKIIG           +V D   E     EE +L    +  +PK W    +HPK+LI+G++E GV TRS + ++ NN+AF+SQIEP
Subjt:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIVTSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEP

Query:  KSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYK
        K++ +A  DE WILAMQEELNQFERNKVW L PRP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLEAIRMLLAFA +K
Subjt:  KSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYK

Query:  NFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFG
        NF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRAWY+RLS FLI   F  GKLDTTLF+     DML+VQIYVDDIIFG
Subjt:  NFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFG

Query:  STNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTA
        STN +LC+EF+K M  EFEMSMMGEL FFLGLQIKQ +DGIF+NQ KY  DLLKRF     K   TPMS STKLDKDEKGK VD+K YRGMIGSLLYLTA
Subjt:  STNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTA

Query:  SRPDIMFS
        SRPDIMFS
Subjt:  SRPDIMFS

A0A438H7V2 Retrovirus-related Pol polyprotein from transposon RE12.6e-17659.74Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------
        M ATW +S+ES+    E EVAN CFMA  D ++          SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+          
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------

Query:  ---------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSSMPKE
                                                           +E + G L + D  ++  + ++               ++ E S  +PK+
Subjt:  ---------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSSMPKE

Query:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR
        W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNKMDENG I+R
Subjt:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR

Query:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR
        NKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRAWY+R
Subjt:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR

Query:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN
        LS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGLQIKQLK+G FINQ KY KDLLKRF   
Subjt:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN

Query:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS
          K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM+S
Subjt:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS

A0A438IQK9 Retrovirus-related Pol polyprotein from transposon RE11.3e-17560.29Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------
        M ATW +S+ES+    E EVAN CFMA  D ++          SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+          
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT--------

Query:  ---------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSSMPKE
                                                           +E + G L + D  ++  +     KE+  L        + E S  +PK+
Subjt:  ---------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSSMPKE

Query:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR
        W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELVPRPSN S+IGTKWVFRNKMDENG I+R
Subjt:  WRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMR

Query:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR
        NKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRAWY+R
Subjt:  NKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDR

Query:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN
        LS FL+   FKMGK+DTTLFIK KE DMLLVQIYVDDIIFG+TN SLCE+F+KCMHSEFEMSMMGEL++FLGLQIKQLK+G FINQ KY KDLLKRF   
Subjt:  LSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFN

Query:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS
          K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM S
Subjt:  GGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFS

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.9e-4936.24Show/hide
Query:  WILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKS
        W  A+  ELN  + N  W +  RP N +I+ ++WVF  K +E GN +R KARLVA+G+ Q+  IDYEETFAPVAR+ + R +L+     N  ++QMDVK+
Subjt:  WILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKS

Query:  AFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFI--KIKENDMLLVQIYVDDIIFGSTNPSLCEE
        AFLNG + EE+Y+  P G        +V KL KA+YGLKQA R W++     L   +F    +D  ++I  K   N+ + V +YVDD++  + + +    
Subjt:  AFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFI--KIKENDMLLVQIYVDDIIFGSTNPSLCEE

Query:  FAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKT-YRGMIGSLLY-LTASRPDI
        F + +  +F M+ + E+  F+G++I+  +D I+++Q  Y K +L +F         TP+   +K++ +      D  T  R +IG L+Y +  +RPD+
Subjt:  FAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKT-YRGMIGSLLY-LTASRPDI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-4935.4Show/hide
Query:  EPKSLKDA----ENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL
        EP+SLK+     E ++  + AMQEE+   ++N  ++LV  P     +  KWVF+ K D +  ++R KARLV +G+ Q++GID++E F+PV ++ +IR +L
Subjt:  EPKSLKDA----ENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL

Query:  AFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIK-IKENDMLLVQIY
        + A+  +  + Q+DVK+AFL+G + EE+Y+EQP GFE     H V KL K+LYGLKQAPR WY +  +F+    +     D  ++ K   EN+ +++ +Y
Subjt:  AFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIK-IKENDMLLVQIY

Query:  VDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQI--KQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK-------DEKGKSVD
        VDD++    +  L  +    +   F+M  +G     LG++I  ++    ++++QEKY + +L+RF     K   TP++   KL K       +EKG    
Subjt:  VDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQI--KQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK-------DEKGKSVD

Query:  IKTYRGMIGSLLY-LTASRPDI
        +  Y   +GSL+Y +  +RPDI
Subjt:  IKTYRGMIGSLLY-LTASRPDI

P25600 Putative transposon Ty5-1 protein YCL074W1.3e-2331.03Show/hide
Query:  MDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSL
        MDV +AFLN  + E +YV+QPPGF N   P +V++L   +YGLKQAP  W + ++N L    F   + +  L+ +   +  + + +YVDD++  + +P +
Subjt:  MDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSL

Query:  CEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDG-IFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLY-LTASRPD
         +   + +   + M  +G++  FLGL I Q  +G I ++ + Y        + N  K+ +TP+  S  L +       DI  Y+ ++G LL+     RPD
Subjt:  CEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDG-IFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLY-LTASRPD

Query:  IMF
        I +
Subjt:  IMF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-6339.94Show/hide
Query:  LAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAI
        ++  ++ EP++   A  DE W  AM  E+N    N  W+LV P PS+ +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +I
Subjt:  LAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAI

Query:  RMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLV
        R++L  A  +++ + Q+DV +AFL G + ++VY+ QPPGF + + P++V KL+KALYGLKQAPRAWY  L N+L+   F     DT+LF+  +   ++ +
Subjt:  RMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLV

Query:  QIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRG
         +YVDDI+    +P+L       +   F +    EL +FLG++ K++  G+ ++Q +Y  DLL R      K   TPM+ S KL      K  D   YRG
Subjt:  QIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRG

Query:  MIGSLLYLTASRPDIMFS
        ++GSL YL  +RPDI ++
Subjt:  MIGSLLYLTASRPDIMFS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.5e-6139.23Show/hide
Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP++   A  D+ W  AM  E+N    N  W+LV P P + +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELV-PRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDI
          +++ + Q+DV +AFL G + +EVY+ QPPGF + + P +V +L+KA+YGLKQAPRAWY  L  +L+   F     DT+LF+  +   ++ + +YVDDI
Subjt:  SYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDI

Query:  IFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLY
        +    +  L +     +   F +    +L +FLG++ K++  G+ ++Q +YT DLL R      K   TPM+TS KL      K  D   YRG++GSL Y
Subjt:  IFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGSLLY

Query:  LTASRPDIMFS
        L  +RPD+ ++
Subjt:  LTASRPDIMFS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.4e-6241.4Show/hide
Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EP +  +A+    W  AM +E+   E    WE+   P N   IG KWV++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEF----PHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYV
          NF L+Q+D+ +AFLNG + EE+Y++ PPG+   +     P+ V  LKK++YGLKQA R W+ + S  LIG  F     D T F+KI     L V +YV
Subjt:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEF----PHHVYKLKKALYGLKQAPRAWYDRLSNFLIGNDFKMGKLDTTLFIKIKENDMLLVQIYV

Query:  DDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGS
        DDII  S N +  +E    + S F++  +G L +FLGL+I +   GI I Q KY  DLL      G K +  PM  S        G  VD K YR +IG 
Subjt:  DDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKTYRGMIGS

Query:  LLYLTASRPDIMFS
        L+YL  +R DI F+
Subjt:  LLYLTASRPDIMFS

ATMG00810.1 DNA/RNA polymerases superfamily protein1.4e-1234.17Show/hide
Query:  IYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK---GKSVDIKTY
        +YVDDI+   ++ +L       + S F M  +G + +FLG+QIK    G+F++Q KY + +L     N G +   PMST   L  +      K  D   +
Subjt:  IYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK---GKSVDIKTY

Query:  RGMIGSLLYLTASRPDIMFS
        R ++G+L YLT +RPDI ++
Subjt:  RGMIGSLLYLTASRPDIMFS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.5e-2246.72Show/hide
Query:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGI
        ++++ IN  N   +L   + I  EPKS+  A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI
Subjt:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMDENGNIMRNKARLVAQGYCQEEGI

Query:  DYEETFAPVARLEAIRMLLAFA
         + ET++PV R   IR +L  A
Subjt:  DYEETFAPVARLEAIRMLLAFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCTACTTGGGATGATAGTGATGAAAGTGCAAGTGGAAGTGATGAAGAAGTAGCAAATTTTTGCTTCATGGCTCATAGTGACAAAGAGGATGAACAAGAAGATGA
GGTTTGTTTGAAAGCCTCCAAGAAAAATAAGTGGTACTTGGATAGTGGCTGCTCAAGACACATGACGGGTAATCCATCCAAGTTTGTGAATCTTTCCAAGAAGGATGGTG
GCTTAGTAACCTTTGGTGATAACAAGAAGGGTAAAATAATTGGTAAGGGTACTATAGAAAGAAATTTTGGAGACTTACTCGTTAGTGACAACGGCAAGGAAATTGTTACG
AGTAAAGAAGAGATGAGCTTAAAGGAAGAAGGTTCTTCATCAATGCCAAAAGAATGGAGGTATGCCTTGTCTCATCCCAAGGACTTGATTCTTGGTGATCTCGAACAAGG
TGTGAAAACTCGCTCTTCTATTAATTTATTTAATAATCTTGCTTTTGTTTCTCAAATTGAACCAAAAAGTCTTAAGGATGCCGAAAATGATGAGTTTTGGATTTTAGCCA
TGCAAGAAGAGCTAAATCAATTTGAAAGGAACAAAGTTTGGGAATTAGTCCCGAGGCCATCTAATACTTCTATTATTGGAACCAAATGGGTTTTTAGAAATAAAATGGAT
GAAAATGGGAATATCATGAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAGGAAGGCATAGATTATGAAGAAACTTTTGCACCCGTTGCTAGATTAGAAGC
TATTAGAATGTTACTTGCTTTTGCTTCTTACAAAAATTTTGTATTATATCAAATGGATGTGAAAAGTGCATTTTTGAATGGTTATATTATGGAGGAAGTTTATGTAGAAC
AACCTCCCGGATTTGAAAATGTAGAATTTCCTCATCATGTCTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGTAAT
TTTCTTATTGGGAATGATTTTAAAATGGGCAAACTCGACACTACACTCTTTATTAAGATTAAAGAAAACGATATGCTATTGGTGCAAATATATGTGGATGATATCATATT
TGGTTCTACTAATCCTTCTTTATGTGAAGAATTTGCTAAATGTATGCATAGTGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTTCTTGGACTTCAAATCAAAC
AACTCAAGGATGGTATCTTCATCAATCAAGAGAAATACACTAAAGATTTGCTCAAAAGATTCAAGTTCAATGGAGGTAAGATTGCAAGAACTCCCATGAGCACATCCACT
AAGCTTGACAAGGATGAAAAAGGTAAAAGTGTGGATATTAAAACTTATCGAGGTATGATTGGATCTTTACTTTACTTGACCGCTAGTAGACCCGATATTATGTTTAGTCA
GCAGCAGCGAAGAACCGAAGCTTCATATCTCGATCCGATCGCCGTCTTCACTGCTCTGAAACAACATTTTTGTCACGCAATTGATAGGCGAAAGCTTTTAATCCCTTGTC
TGCCTCCAGAAAGGAGCCTACCAATTGTAGCCATCGCTAAAGAGAGAAAGGGATGTACCTCTGCTTCTGTAAGGAGAGTCCGAAACGACCGATTATTCTTTGTGAATTGG
GGACGAAAACTTTGGAAGGAACTTCGCTTGATAAGGAGAAGACGGATCAGTAGGGTGCAGGCGGAGGTGACGGCGGATGCCGGCGAGGCGGGCGGAGATGGGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCTACTTGGGATGATAGTGATGAAAGTGCAAGTGGAAGTGATGAAGAAGTAGCAAATTTTTGCTTCATGGCTCATAGTGACAAAGAGGATGAACAAGAAGATGA
GGTTTGTTTGAAAGCCTCCAAGAAAAATAAGTGGTACTTGGATAGTGGCTGCTCAAGACACATGACGGGTAATCCATCCAAGTTTGTGAATCTTTCCAAGAAGGATGGTG
GCTTAGTAACCTTTGGTGATAACAAGAAGGGTAAAATAATTGGTAAGGGTACTATAGAAAGAAATTTTGGAGACTTACTCGTTAGTGACAACGGCAAGGAAATTGTTACG
AGTAAAGAAGAGATGAGCTTAAAGGAAGAAGGTTCTTCATCAATGCCAAAAGAATGGAGGTATGCCTTGTCTCATCCCAAGGACTTGATTCTTGGTGATCTCGAACAAGG
TGTGAAAACTCGCTCTTCTATTAATTTATTTAATAATCTTGCTTTTGTTTCTCAAATTGAACCAAAAAGTCTTAAGGATGCCGAAAATGATGAGTTTTGGATTTTAGCCA
TGCAAGAAGAGCTAAATCAATTTGAAAGGAACAAAGTTTGGGAATTAGTCCCGAGGCCATCTAATACTTCTATTATTGGAACCAAATGGGTTTTTAGAAATAAAATGGAT
GAAAATGGGAATATCATGAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAGGAAGGCATAGATTATGAAGAAACTTTTGCACCCGTTGCTAGATTAGAAGC
TATTAGAATGTTACTTGCTTTTGCTTCTTACAAAAATTTTGTATTATATCAAATGGATGTGAAAAGTGCATTTTTGAATGGTTATATTATGGAGGAAGTTTATGTAGAAC
AACCTCCCGGATTTGAAAATGTAGAATTTCCTCATCATGTCTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGTAAT
TTTCTTATTGGGAATGATTTTAAAATGGGCAAACTCGACACTACACTCTTTATTAAGATTAAAGAAAACGATATGCTATTGGTGCAAATATATGTGGATGATATCATATT
TGGTTCTACTAATCCTTCTTTATGTGAAGAATTTGCTAAATGTATGCATAGTGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTTCTTGGACTTCAAATCAAAC
AACTCAAGGATGGTATCTTCATCAATCAAGAGAAATACACTAAAGATTTGCTCAAAAGATTCAAGTTCAATGGAGGTAAGATTGCAAGAACTCCCATGAGCACATCCACT
AAGCTTGACAAGGATGAAAAAGGTAAAAGTGTGGATATTAAAACTTATCGAGGTATGATTGGATCTTTACTTTACTTGACCGCTAGTAGACCCGATATTATGTTTAGTCA
GCAGCAGCGAAGAACCGAAGCTTCATATCTCGATCCGATCGCCGTCTTCACTGCTCTGAAACAACATTTTTGTCACGCAATTGATAGGCGAAAGCTTTTAATCCCTTGTC
TGCCTCCAGAAAGGAGCCTACCAATTGTAGCCATCGCTAAAGAGAGAAAGGGATGTACCTCTGCTTCTGTAAGGAGAGTCCGAAACGACCGATTATTCTTTGTGAATTGG
GGACGAAAACTTTGGAAGGAACTTCGCTTGATAAGGAGAAGACGGATCAGTAGGGTGCAGGCGGAGGTGACGGCGGATGCCGGCGAGGCGGGCGGAGATGGGCCTTAACT
GCCGTAGGCACAGGCCCTACGATGAAGATGGACAGAGATCATGAATGCGACCCCGCAATGAGACGAATATTCTTAGGTAAGTATTTCGAACTTATTTTTTGCATTTCGAA
ACTTTGCTGTAGATGCATTAATTTTATATGTTAATAGAGGAAACATGACCATAGAAGAAATTTTTAGGTTTTAATGGAATAGGAATACTTCATCTATTTGTTCGTCACCA
TCACTGTGTAAGGAAAATGGGTACGATGCTAGAAATTTTCGTTGCTTATAGATTGATGATTAGGATATTTACTCTTTTCGTTGGGTCCGATATTCAAATTTTCATTACCA
CATTGGTTGTATTTAAATTTTTTCTCATTTAAGTTTAACAAAAAAAAATAAAGAAATGTAATTTTTTACTTTTAAAATTGAAGTGTACCTTTCTTTTATAAAGTGTCTAT
AGTTTCACGATTTTAAAATATATTTACTAGAGAGAGTTGTTTTAAAACGCATGGAGAGTGGTCGTCCACATTGCCCAATATTTGATTCTGATACTATTTATAATAGCCCA
AATGCAACGCAGACAAATATTGTCTACTTTATTCTGTTACATATAATCATGTCTGCTTCACAGTTTAAAAATGCGTCTAGCAATGAGAGGTTTCCACATCCATATATGGA
ATAATTTGCTCCCCTAGTTGATGTGAGATCTCACGCTATTAGCTTCAACGAAATGGTAAAAACGGAACCATAAGAAACAGAGCTTAAAAACTCAAATATGGAAGATAAGG
AAGTGAAGGAAAGCCATGGTCTTAGGTTCACTAATAAGCCCAACATAACAACATTGTCAATGAAGCTATAAAAGAAAAAAAAAAGGAAATGAGTGATATGAACATCCAGT
AACCTCAACATACTGAGAATCTTAAAATACATCAATAAATGGCGTTGAAATAAGGCAGCGGACTCAGACTATGGGTTACCTGTAATCTTTCCGAAACAAAGAAAGAATGA
TGCATTTGAAGTGAGATATTTGTGTTCCTTCTTCCCATTCCACCCGGTGCTCTCTATCTCTGTTCAACTCCTACAGTAGAAGATTGCTGGGAAATGGGTTTCACCAACAC
CAGATGAAGCTGATGGAATGGCAGTCGAAGAAGAAGAAGAAGAAGAAGAAGAAG
Protein sequenceShow/hide protein sequence
MKATWDDSDESASGSDEEVANFCFMAHSDKEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIVT
SKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSINLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVPRPSNTSIIGTKWVFRNKMD
ENGNIMRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAWYDRLSN
FLIGNDFKMGKLDTTLFIKIKENDMLLVQIYVDDIIFGSTNPSLCEEFAKCMHSEFEMSMMGELSFFLGLQIKQLKDGIFINQEKYTKDLLKRFKFNGGKIARTPMSTST
KLDKDEKGKSVDIKTYRGMIGSLLYLTASRPDIMFSQQQRRTEASYLDPIAVFTALKQHFCHAIDRRKLLIPCLPPERSLPIVAIAKERKGCTSASVRRVRNDRLFFVNW
GRKLWKELRLIRRRRISRVQAEVTADAGEAGGDGP