; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh20G010010 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh20G010010
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionCCHC-type domain-containing protein
Genome locationCma_Chr20:6134477..6138222
RNA-Seq ExpressionCmaCh20G010010
SyntenyCmaCh20G010010
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW50731.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.2e-14155.84Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----
        M ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+      
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----

Query:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS
                                                               +E + G L + D  ++  +     KE+  L        + E S  
Subjt:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS

Query:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG
        +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV RPSN S+IGTKWVFRNKMDENG
Subjt:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA
         I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRA
Subjt:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA

Query:  ------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRP
                     EFEMSMMGEL++FLGLQIKQLK G FINQ KY KDLLKRF     K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRP
Subjt:  ------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRP

Query:  DIMFS
        DIM+S
Subjt:  DIMFS

RVW74396.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.7e-13055.6Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----
        M ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+      
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----

Query:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS
                                                               +E + G L + D  ++  +     KEE  L        + E S  
Subjt:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS

Query:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG
        +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV RPSN S+IGTKWVFRNKMDENG
Subjt:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA
         I+RNKARLVAQGY QEEGI+YEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKK LYGLKQAPRA
Subjt:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA

Query:  CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK
        CEFEMSMMGEL+FFLGLQIKQLK G FINQ KY KDLLKRF     K+ +TPMS+S K D DEK
Subjt:  CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK

RVW80634.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.5e-13550.45Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----
        M ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+      
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----

Query:  -------------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSS
                                                               +E + G L + D  ++  + ++               ++ E S  
Subjt:  -------------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSS

Query:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG
        +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV RPSN S+IGTKWVFRNKMDENG
Subjt:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA
         I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRA
Subjt:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA

Query:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR
                                                                     EFEMSMMGEL++FLGLQIKQLK G FINQ KY KDLLKR
Subjt:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR

Query:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS
        F     K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM+S
Subjt:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS

RVW93906.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.4e-13049.73Show/hide
Query:  MKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIER-
        M ATW +S+ES     ++EVAN CFMA  D  DE             SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+  +E+ 
Subjt:  MKATWDDSDES-ASGSDEEVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIER-

Query:  ----------------------------------------------------------NFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS
                                                                  + G L + D  ++  +     KEE  L        + E S  
Subjt:  ----------------------------------------------------------NFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS

Query:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG
        +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV RPSN S+IGTKWVFRNKMDENG
Subjt:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA
         I+RNKARLVAQGY QEEGIDYEETF  VARLEAIRMLLAFA +K+F+LYQMDVKS FLNG+I EEVYVEQPP F++  FP+HV+KLKKALYGLKQAPRA
Subjt:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA

Query:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR
                                                                     EFEMSMMGEL++FLGLQIKQLK G F+NQ KY KDLLKR
Subjt:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR

Query:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS
        F     K+ +TPMS+  KLD DEKGKS+D   YRGMIG LLYLTA +PDIM+S
Subjt:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS

RVW98982.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.7e-13550.99Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----
        M ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+      
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----

Query:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS
                                                               +E + G L + D  ++  +     KE+  L        + E S  
Subjt:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS

Query:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG
        +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV RPSN S+IGTKWVFRNKMDENG
Subjt:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA
         I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRA
Subjt:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA

Query:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR
                                                                     EFEMSMMGEL++FLGLQIKQLK G FINQ KY KDLLKR
Subjt:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR

Query:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS
        F     K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM S
Subjt:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS

TrEMBL top hitse value%identityAlignment
A0A2N9ERY5 CCHC-type domain-containing protein1.9e-13956.27Show/hide
Query:  MKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS
        +K TWDDSDES S    SD EVAN C + + ++ +  EDE                         DEVCL   S K KW+LDSGCSRHMTG+ +KF +L+
Subjt:  MKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS

Query:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI
         KDGG V FGDN KGKIIG   I  +   + +S+  K+ V     EE +L    +  +PK W    SHPK+LI+G++E GV TRS + ++ NN+AF+SQI
Subjt:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI

Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EPK++ +A  DE WILAMQEELNQFERNKVW L  RP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLEAIRMLLAFA 
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA-------------------------------------------
        +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRA                                           
Subjt:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA-------------------------------------------

Query:  --------C---------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYL
                C         EFEMSMMGEL FFLGLQIKQ ++GIF+NQ KY  DLLKRF     K   TPMS STKLDKDEKGK VD+K YRGMIGSLLYL
Subjt:  --------C---------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYL

Query:  TASRPDIMFS
        TASRPDIMFS
Subjt:  TASRPDIMFS

A0A2N9G3V4 CCHC-type domain-containing protein1.9e-13956.27Show/hide
Query:  MKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS
        +K TWDDSDES S    SD EVAN C + + ++ +  EDE                         DEVCL   S K KW+LDSGCSRHMTG+ +KF +L+
Subjt:  MKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS

Query:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI
         KDGG V FGDN KGKIIG   I  +   + +S+  K+ V     EE +L    +  +PK W    SHPK+LI+G++E GV TRS + ++ NN+AF+SQI
Subjt:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIV--TSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQI

Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EPK++ +A  DE WILAMQEELNQFERNKVW L  RP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLEAIRMLLAFA 
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA-------------------------------------------
        +KNF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRA                                           
Subjt:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA-------------------------------------------

Query:  --------C---------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYL
                C         EFEMSMMGEL FFLGLQIKQ ++GIF+NQ KY  DLLKRF     K   TPMS STKLDKDEKGK VD+K YRGMIGSLLYL
Subjt:  --------C---------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYL

Query:  TASRPDIMFS
        TASRPDIMFS
Subjt:  TASRPDIMFS

A0A2N9IDJ4 CCHC-type domain-containing protein2.8e-13856.1Show/hide
Query:  MKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS
        +K TWDDSDES S    SD EVAN C + + ++ +  EDE                         DEVCL   S K KW+LDSGCSRHMTG+ +KF +L+
Subjt:  MKATWDDSDESAS---GSDEEVANFCFMAHSDKEDEQEDEQE-----------------------DEVCL-KASKKNKWYLDSGCSRHMTGNPSKFVNLS

Query:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIVTSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEP
         KDGG V FGDN KGKIIG           +V D   E     EE +L    +  +PK W    +HPK+LI+G++E GV TRS + ++ NN+AF+SQIEP
Subjt:  KKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGKEIVTSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEP

Query:  KSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYK
        K++ +A  DE WILAMQEELNQFERNKVW L  RP + S+IGTKWVFRNK DE G I+RNKARLVAQGY QEEGIDY ET+APVARLEAIRMLLAFA +K
Subjt:  KSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYK

Query:  NFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA---------------------------------------------
        NF L+QMDVKSAFLNG+I EEVYVEQPPGFEN EFP+HV+KL KALYGLKQAPRA                                             
Subjt:  NFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA---------------------------------------------

Query:  ------C---------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTA
              C         EFEMSMMGEL FFLGLQIKQ ++GIF+NQ KY  DLLKRF     K   TPMS STKLDKDEKGK VD+K YRGMIGSLLYLTA
Subjt:  ------C---------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTA

Query:  SRPDIMFS
        SRPDIMFS
Subjt:  SRPDIMFS

A0A438ESK8 Retrovirus-related Pol polyprotein from transposon RE12.0e-14155.84Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----
        M ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+      
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----

Query:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS
                                                               +E + G L + D  ++  +     KE+  L        + E S  
Subjt:  -------------------------------------------------------IERNFGDLLVSDNGKEIVT----SKEEMSL--------KEEGSSS

Query:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG
        +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV RPSN S+IGTKWVFRNKMDENG
Subjt:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA
         I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRA
Subjt:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA

Query:  ------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRP
                     EFEMSMMGEL++FLGLQIKQLK G FINQ KY KDLLKRF     K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRP
Subjt:  ------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRP

Query:  DIMFS
        DIM+S
Subjt:  DIMFS

A0A438H7V2 Retrovirus-related Pol polyprotein from transposon RE17.5e-13650.45Show/hide
Query:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----
        M ATW +S+ES+    E EVAN CFMA  D ++              SK++KW+LDSGCSRHMTG+ SKF  L+K+ GG VTFGDN KG+IIG+      
Subjt:  MKATWDDSDESASGSDE-EVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGT----

Query:  -------------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSS
                                                               +E + G L + D  ++  + ++               ++ E S  
Subjt:  -------------------------------------------------------IERNFGDLLVSDNGKEIVTSKE------------EMSLKEEGSSS

Query:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG
        +PK+W++ ++HP+D I+G+   GV+TRSS+ N+ NNLAF+SQIEPK++KDA  DE W++AMQEELNQFER++VWELV RPSN S+IGTKWVFRNKMDENG
Subjt:  MPKEWRYALSHPKDLILGDLEQGVKTRSSI-NLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA
         I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+F+LYQMDVKSAFLNG+I EEVYVEQPPGF++  FP+HV+KLKKALYGLKQAPRA
Subjt:  NIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA

Query:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR
                                                                     EFEMSMMGEL++FLGLQIKQLK G FINQ KY KDLLKR
Subjt:  ------------------------------------------------------------CEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKR

Query:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS
        F     K+ +TPMS+S KLD DEKGKS+D   YRGMIGSLLYLTASRPDIM+S
Subjt:  FKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-3432.33Show/hide
Query:  WILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKS
        W  A+  ELN  + N  W +  RP N +I+ ++WVF  K +E GN IR KARLVA+G+ Q+  IDYEETFAPVAR+ + R +L+     N  ++QMDVK+
Subjt:  WILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKS

Query:  AFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA-----------CE-------------------------------------------
        AFLNG + EE+Y+  P G        +V KL KA+YGLKQA R            CE                                           
Subjt:  AFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRA-----------CE-------------------------------------------

Query:  --------FEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTK---LDKDEKGKSVDIKAYRGMIGSLLY-LTASRPDI
                F M+ + E+  F+G++I+  ++ I+++Q  Y K +L +F         TP+ +      L+ DE   +      R +IG L+Y +  +RPD+
Subjt:  --------FEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTK---LDKDEKGKSVDIKAYRGMIGSLLY-LTASRPDI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-3531.68Show/hide
Query:  EPKSLKDA----ENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL
        EP+SLK+     E ++  + AMQEE+   ++N  ++LV  P     +  KWVF+ K D +  ++R KARLV +G+ Q++GID++E F+PV ++ +IR +L
Subjt:  EPKSLKDA----ENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLL

Query:  AFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPR----------------------------------------
        + A+  +  + Q+DVK+AFL+G + EE+Y+EQP GFE     H V KL K+LYGLKQAPR                                        
Subjt:  AFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPR----------------------------------------

Query:  ---------------------ACEFEMSMMGELSFFLGLQI--KQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK-------DEKGKSVD
                             +  F+M  +G     LG++I  ++    ++++QEKY + +L+RF     K   TP++   KL K       +EKG    
Subjt:  ---------------------ACEFEMSMMGELSFFLGLQI--KQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDK-------DEKGKSVD

Query:  IKAYRGMIGSLLY-LTASRPDI
        +  Y   +GSL+Y +  +RPDI
Subjt:  IKAYRGMIGSLLY-LTASRPDI

P92520 Uncharacterized mitochondrial protein AtMg008206.5e-2045.9Show/hide
Query:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI
        ++++ IN  N   +L   + I  EPKS+  A  D  W  AMQEEL+   RNK W LV  P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI
Subjt:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI

Query:  DYEETFAPVARLEAIRMLLAFA
         + ET++PV R   IR +L  A
Subjt:  DYEETFAPVARLEAIRMLLAFA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-4434.28Show/hide
Query:  LAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV-HRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAI
        ++  ++ EP++   A  DE W  AM  E+N    N  W+LV   PS+ +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +I
Subjt:  LAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELV-HRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAI

Query:  RMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAC----------------------------------
        R++L  A  +++ + Q+DV +AFL G + ++VY+ QPPGF + + P++V KL+KALYGLKQAPRA                                   
Subjt:  RMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAC----------------------------------

Query:  --------------------------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRG
                                   F +    EL +FLG++ K++  G+ ++Q +Y  DLL R      K   TPM+ S KL      K  D   YRG
Subjt:  --------------------------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRG

Query:  MIGSLLYLTASRPDIMFS
        ++GSL YL  +RPDI ++
Subjt:  MIGSLLYLTASRPDIMFS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.8e-4434.08Show/hide
Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELV-HRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP++   A  D+ W  AM  E+N    N  W+LV   P + +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELV-HRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAC-----------------------------------------
          +++ + Q+DV +AFL G + +EVY+ QPPGF + + P +V +L+KA+YGLKQAPRA                                          
Subjt:  SYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRAC-----------------------------------------

Query:  -------------------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLY
                            F +    +L +FLG++ K++  G+ ++Q +YT DLL R      K   TPM+TS KL      K  D   YRG++GSL Y
Subjt:  -------------------EFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLY

Query:  LTASRPDIMFS
        L  +RPD+ ++
Subjt:  LTASRPDIMFS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-4334.6Show/hide
Query:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EP +  +A+    W  AM +E+   E    WE+   P N   IG KWV++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++
Subjt:  EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEF----PHHVYKLKKALYGLKQAPR----------------------------------------
          NF L+Q+D+ +AFLNG + EE+Y++ PPG+   +     P+ V  LKK++YGLKQA R                                        
Subjt:  YKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEF----PHHVYKLKKALYGLKQAPR----------------------------------------

Query:  ---------------------ACEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIG
                             +C F++  +G L +FLGL+I +   GI I Q KY  DLL      G K +  PM  S        G  VD KAYR +IG
Subjt:  ---------------------ACEFEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIG

Query:  SLLYLTASRPDIMFS
         L+YL  +R DI F+
Subjt:  SLLYLTASRPDIMFS

ATMG00810.1 DNA/RNA polymerases superfamily protein5.3e-0935.51Show/hide
Query:  FEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK---GKSVDIKAYRGMIGSLLYLTASRPDIMFS-DILMQ
        F M  +G + +FLG+QIK   +G+F++Q KY + +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++ +I+ Q
Subjt:  FEMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEK---GKSVDIKAYRGMIGSLLYLTASRPDIMFS-DILMQ

Query:  ILPEAYL
         + E  L
Subjt:  ILPEAYL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.6e-2145.9Show/hide
Query:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI
        ++++ IN  N   +L   + I  EPKS+  A  D  W  AMQEEL+   RNK W LV  P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI
Subjt:  KTRSSINLFN---NLAFVSQI--EPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI

Query:  DYEETFAPVARLEAIRMLLAFA
         + ET++PV R   IR +L  A
Subjt:  DYEETFAPVARLEAIRMLLAFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCTACTTGGGATGATAGTGATGAAAGTGCAAGTGGAAGTGATGAAGAAGTTGCAAATTTTTGCTTCATGGCTCATAGTGACAAAGAGGATGAACAAGAAGATGA
ACAAGAAGATGAGGTTTGTTTGAAAGCCTCCAAGAAAAATAAGTGGTACTTGGATAGTGGTTGCTCAAGACACATGACGGGTAATCCATCCAAGTTTGTCAATCTTTCCA
AGAAGGATGGTGGCTTAGTAACCTTTGGTGATAACAAGAAGGGTAAAATAATTGGTAAGGGTACTATAGAAAGAAATTTTGGAGATTTACTCGTTAGTGACAACGGCAAA
GAAATTGTTACGAGTAAAGAAGAGATGAGCTTAAAGGAAGAAGGTTCTTCATCAATGCCAAAAGAATGGAGGTATGCCTTGTCTCATCCCAAGGACTTGATTCTTGGTGA
TCTCGAACAAGGTGTGAAAACTCGCTCTTCTATTAATTTATTTAATAATCTTGCTTTTGTTTCTCAAATTGAACCAAAAAGTCTTAAGGATGCCGAAAATGATGAGTTTT
GGATTTTAGCCATGCAAGAAGAGCTAAATCAATTTGAAAGGAACAAAGTTTGGGAATTAGTCCATAGGCCATCTAATACTTCTATTATTGGAACCAAATGGGTTTTTAGA
AATAAAATGGATGAAAATGGGAATATCATTAGAAATAAAGCTAGGCTTGTAGCTCAAGGTTATTGTCAAGAGGAAGGCATAGATTATGAAGAAACTTTTGCACCCGTTGC
TAGATTAGAAGCTATTAGAATGTTACTTGCTTTTGCTTCTTACAAAAATTTTGTATTATATCAAATGGATGTGAAAAGTGCATTTTTGAATGGTTATATTATGGAGGAAG
TTTATGTAGAACAACCTCCCGGATTTGAAAATGTAGAATTTCCTCATCATGTCTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGTGAGTTT
GAGATGAGTATGATGGGAGAACTCAGTTTCTTTCTTGGACTTCAAATCAAACAACTCAAGAATGGTATCTTCATCAATCAAGAGAAATACACTAAAGATTTGCTCAAAAG
ATTCAAGTTCAATGGAGGTAAGATTGCAAGAACTCCCATGAGCACATCCACTAAGCTTGACAAGGATGAAAAAGGTAAAAGTGTGGATATTAAAGCTTATCGAGGTATGA
TTGGATCTTTACTTTACTTGACCGCTAGTAGACCCGATATTATGTTTAGTGATATTCTGATGCAGATTTTGCCGGAAGCTTACTTGATCGTAAAAGTACTAGTGGAACTT
GTCAATTTCTTGGTAGTTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCTACTTGGGATGATAGTGATGAAAGTGCAAGTGGAAGTGATGAAGAAGTTGCAAATTTTTGCTTCATGGCTCATAGTGACAAAGAGGATGAACAAGAAGATGA
ACAAGAAGATGAGGTTTGTTTGAAAGCCTCCAAGAAAAATAAGTGGTACTTGGATAGTGGTTGCTCAAGACACATGACGGGTAATCCATCCAAGTTTGTCAATCTTTCCA
AGAAGGATGGTGGCTTAGTAACCTTTGGTGATAACAAGAAGGGTAAAATAATTGGTAAGGGTACTATAGAAAGAAATTTTGGAGATTTACTCGTTAGTGACAACGGCAAA
GAAATTGTTACGAGTAAAGAAGAGATGAGCTTAAAGGAAGAAGGTTCTTCATCAATGCCAAAAGAATGGAGGTATGCCTTGTCTCATCCCAAGGACTTGATTCTTGGTGA
TCTCGAACAAGGTGTGAAAACTCGCTCTTCTATTAATTTATTTAATAATCTTGCTTTTGTTTCTCAAATTGAACCAAAAAGTCTTAAGGATGCCGAAAATGATGAGTTTT
GGATTTTAGCCATGCAAGAAGAGCTAAATCAATTTGAAAGGAACAAAGTTTGGGAATTAGTCCATAGGCCATCTAATACTTCTATTATTGGAACCAAATGGGTTTTTAGA
AATAAAATGGATGAAAATGGGAATATCATTAGAAATAAAGCTAGGCTTGTAGCTCAAGGTTATTGTCAAGAGGAAGGCATAGATTATGAAGAAACTTTTGCACCCGTTGC
TAGATTAGAAGCTATTAGAATGTTACTTGCTTTTGCTTCTTACAAAAATTTTGTATTATATCAAATGGATGTGAAAAGTGCATTTTTGAATGGTTATATTATGGAGGAAG
TTTATGTAGAACAACCTCCCGGATTTGAAAATGTAGAATTTCCTCATCATGTCTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGTGAGTTT
GAGATGAGTATGATGGGAGAACTCAGTTTCTTTCTTGGACTTCAAATCAAACAACTCAAGAATGGTATCTTCATCAATCAAGAGAAATACACTAAAGATTTGCTCAAAAG
ATTCAAGTTCAATGGAGGTAAGATTGCAAGAACTCCCATGAGCACATCCACTAAGCTTGACAAGGATGAAAAAGGTAAAAGTGTGGATATTAAAGCTTATCGAGGTATGA
TTGGATCTTTACTTTACTTGACCGCTAGTAGACCCGATATTATGTTTAGTGATATTCTGATGCAGATTTTGCCGGAAGCTTACTTGATCGTAAAAGTACTAGTGGAACTT
GTCAATTTCTTGGTAGTTCCTTAG
Protein sequenceShow/hide protein sequence
MKATWDDSDESASGSDEEVANFCFMAHSDKEDEQEDEQEDEVCLKASKKNKWYLDSGCSRHMTGNPSKFVNLSKKDGGLVTFGDNKKGKIIGKGTIERNFGDLLVSDNGK
EIVTSKEEMSLKEEGSSSMPKEWRYALSHPKDLILGDLEQGVKTRSSINLFNNLAFVSQIEPKSLKDAENDEFWILAMQEELNQFERNKVWELVHRPSNTSIIGTKWVFR
NKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFVLYQMDVKSAFLNGYIMEEVYVEQPPGFENVEFPHHVYKLKKALYGLKQAPRACEF
EMSMMGELSFFLGLQIKQLKNGIFINQEKYTKDLLKRFKFNGGKIARTPMSTSTKLDKDEKGKSVDIKAYRGMIGSLLYLTASRPDIMFSDILMQILPEAYLIVKVLVEL
VNFLVVP