; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G012700 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G012700
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCmo_Chr10:13088354..13092458
RNA-Seq ExpressionCmoCh10G012700
SyntenyCmoCh10G012700
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG93649.1 wall-associated kinase 2 [Prunus dulcis]2.3e-12744.74Show/hide
Query:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIPSPPTPPSPPIPSPPTPPSPPIPSPTTP--------SSPPPSPDSPT
        M  + F +SRDV F E+ FP+    QT +  PS+ ++PL    +    PP S P+ P P     P+P TP  PP  + + P        SS  P   SPT
Subjt:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIPSPPTPPSPPIPSPPTPPSPPIPSPTTP--------SSPPPSPDSPT

Query:  NSN---PIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSP---GTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAM
         +    P PP+  APLR STR K  PAWH DY +S  +     SS P    TGT+YPL  +LS+S FSP+ R+FLA I+   EP TYD+AV DP W  AM
Subjt:  NSN---PIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSP---GTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAM

Query:  NDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHG
          E+ AL  N+TWSLVPLP G+K IGC+WVY+IKYNSDG++ERYKARLVAKGYTQVEG+DY ETFSPTAKLTTLRCLL +AA+R WF HQLDVQNAFLHG
Subjt:  NDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHG

Query:  NLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------
        +LDEEVYM  PPGLRRQGEN VCRL+KSLYGLKQASRNWFS FST IQNAG+ Q KADYSL                                       
Subjt:  NLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYI
                                                                        ++WKSKKQTNVSRSSA+AEYRAMA TCLELTWLRYI
Subjt:  ---------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYI

Query:  LQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
        LQDL V   EPA L+CDNQAAL+IAANPVFHERTKHIEIDCHIVREKL AGII   +V ++ QLAD+ TKALGR  F  +  KL
Subjt:  LQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

CAN68148.1 hypothetical protein VITISV_035665 [Vitis vinifera]6.3e-14146.6Show/hide
Query:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIP------SPPTPPSPPIPSPPT---------------PPSPPIPSPT
        +Q+    +SRDV F E+ FPF S+S  S     +L +PL   S+ +   P S+P      +PP     P+ SPP+                P P  PSP+
Subjt:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIP------SPPTPPSPPIPSPPT---------------PPSPPIPSPT

Query:  TPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGD
        + SSPP  P  P+N++   P    PLRRSTR  QPPAWH DY MS+ +NH ++ SS   GTRYPL  +LSF RFSP  RAFLAL+T+QTEP ++++A  D
Subjt:  TPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGD

Query:  PLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDV
        P W+QAM+ E+ ALERN+TW +VPLP GHK IGCRWVYKIKY+SDG++ERYKARLVAKGYTQV GIDY ETFSPTAKLTTLRCLLTVAA+R W+ HQLDV
Subjt:  PLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDV

Query:  QNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------
         NAFLHGNL EEVYM+ PPGLRRQGEN VCRL KS+YGLKQASRNWFS F+ T+++AGY Q KADYSL                                
Subjt:  QNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLE
                                                                               ++WKSKKQTNVSRSSAEAEYRAMANTCLE
Subjt:  ----------------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLE

Query:  LTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
        LTWLRYIL+DL V L +PA L+CDNQAAL+IAANPVFHERTKHIEIDCHIVREKLQAG+I+PCYVSTKMQLADV TKALGR+QF+FL  KL
Subjt:  LTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

KAD4180152.1 hypothetical protein E3N88_28743 [Mikania micrantha]4.8e-13346.61Show/hide
Query:  MQSHTFFISRDVKFCEDDFPFSSAS--QTSTLAPSTLIVPLHDPSYSNIHPPPSIP-SPPTPPSPPIPSP-PTPPSPPI----PSPTTPSSPPPSPDSPT
        + S  FF+SRDVKF E  FPFSS S   +S+L P+      H+P  ++  P PS P      PS  IPS  P   SP +     S T+ S+  P+   PT
Subjt:  MQSHTFFISRDVKFCEDDFPFSSAS--QTSTLAPSTLIVPLHDPSYSNIHPPPSIP-SPPTPPSPPIPSP-PTPPSPPI----PSPTTPSSPPPSPDSPT

Query:  NSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAA
        N N  PP     LRRS+R KQ PAWH  Y M S  NH + ++S   GTRYPL  +LSFSRFSP+ R FL  IT+QTEP++YDEA+  P WQQAM  E+ A
Subjt:  NSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAA

Query:  LERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEV
        L+ N+TWSLVPLP+GHK IGCRWVYKIKYNSDG++ERYKARLVAKGYTQVEGIDY ETFSPTAKLTTLRCLLTVA AR WFTHQLDVQNAFLHG+L E V
Subjt:  LERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEV

Query:  YMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------------
        YM+ PPG  ++G+N VCRL+KSLYGLKQASRNWFS FS T+Q AGYTQ KADYSL                                             
Subjt:  YMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNV
                                                                  ++WKSKKQ NVSRSSAEAEYRAMANTCLELTWLRY+LQDL V
Subjt:  ---------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNV

Query:  PLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
        PLS P  LYCDN+AALHIAANPVFHERTKHIEIDCHIVR+K   G+I P Y+ T++QLAD+ TKALGR QF+ L++KL
Subjt:  PLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

PNX93906.1 hypothetical protein L195_g017068 [Trifolium pratense]2.6e-13145.77Show/hide
Query:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPP-PSIPSPPTPPSPPIPSPPTPPSPPIPSPTTPSSPPPSPDS-PTNSN-PIP
        ++  FF+SRDV+FCE DFP    S  +T  P+++          + HPP  ++   P P    +PS      PP P   TPS+  P  DS PT S+ P P
Subjt:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPP-PSIPSPPTPPSPPIPSPPTPPSPPIPSPTTPSSPPPSPDS-PTNSN-PIP

Query:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH
        P +  P +RRS R K PP WH+DY MS  VN  +S  +  +GTRYPL HYLS+SR S T   FLA IT+  EP++YD+AV DP WQ AMN E+ AL++N+
Subjt:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH

Query:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP
        TW+LVPLP GHK IGC+WVYKIKY SDG++ERYKARLVAKGYTQVEGIDY ETFSPTAK+TTLRCLLTVAA+R WF HQLDVQNAFLHG+L E VYM  P
Subjt:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP

Query:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------
        PGLRRQGEN VCRL+KSLYGLKQASRNWFS FS  IQ AGY Q KADYSL                                                  
Subjt:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP
                                                             ++WKSKKQ  VSRSSAE+EYRAMANTCLELTWLR+ILQDL V  + P
Subjt:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP

Query:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
          L+CDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAG+I P YV T+ QLADV TKALG+ QF  L++KL
Subjt:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

PNX93928.1 hypothetical protein L195_g017092, partial [Trifolium pratense]6.5e-13046.66Show/hide
Query:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPS-IPSPPTPPSPPIPSPPTPPSPPIPSPTTPS--SPPPSPDSPTNSNPIP
        ++ TFF+SRDVKFCE +FP    S  +T  P+  ++  H PSY  I   PS   S        IPS   P SP   +  T S  SP   P   T     P
Subjt:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPS-IPSPPTPPSPPIPSPPTPPSPPIPSPTTPS--SPPPSPDSPTNSNPIP

Query:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH
        P    P +R+S R K PP WH DY MS+ VN   S  + G+GTRYPL HYLS+SR S +  AFLA IT+  EP++YD+AV DPLWQ AMN E+ ALE+N+
Subjt:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH

Query:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP
        TWSLVPLP GHK IGC+WVYKIKY SDG++ERYKARLVAKGYTQVEGIDY ETFSPTAK+TTLRCLLTVAAAR WF HQLDVQNAFLHG+L E VYM  P
Subjt:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP

Query:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------
        PGLRRQGEN VCRL+KSLYGLKQASRNWFS FS  IQ AGY Q KADYSL                                                  
Subjt:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP
                                                             ++WKSKKQ  VSRSSAE+EYRAMANTCLELTWLR+ILQDL V  + P
Subjt:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP

Query:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
          L+CDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGII P YV T+ QLADV TKALG+ QF  L+ KL
Subjt:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

TrEMBL top hitse value%identityAlignment
A0A2K3MSX0 Integrase catalytic domain-containing protein (Fragment)3.2e-13046.66Show/hide
Query:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPS-IPSPPTPPSPPIPSPPTPPSPPIPSPTTPS--SPPPSPDSPTNSNPIP
        ++ TFF+SRDVKFCE +FP    S  +T  P+  ++  H PSY  I   PS   S        IPS   P SP   +  T S  SP   P   T     P
Subjt:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPS-IPSPPTPPSPPIPSPPTPPSPPIPSPTTPS--SPPPSPDSPTNSNPIP

Query:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH
        P    P +R+S R K PP WH DY MS+ VN   S  + G+GTRYPL HYLS+SR S +  AFLA IT+  EP++YD+AV DPLWQ AMN E+ ALE+N+
Subjt:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH

Query:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP
        TWSLVPLP GHK IGC+WVYKIKY SDG++ERYKARLVAKGYTQVEGIDY ETFSPTAK+TTLRCLLTVAAAR WF HQLDVQNAFLHG+L E VYM  P
Subjt:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP

Query:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------
        PGLRRQGEN VCRL+KSLYGLKQASRNWFS FS  IQ AGY Q KADYSL                                                  
Subjt:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP
                                                             ++WKSKKQ  VSRSSAE+EYRAMANTCLELTWLR+ILQDL V  + P
Subjt:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP

Query:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
          L+CDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGII P YV T+ QLADV TKALG+ QF  L+ KL
Subjt:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

A0A2K3MT28 Uncharacterized protein1.3e-13145.77Show/hide
Query:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPP-PSIPSPPTPPSPPIPSPPTPPSPPIPSPTTPSSPPPSPDS-PTNSN-PIP
        ++  FF+SRDV+FCE DFP    S  +T  P+++          + HPP  ++   P P    +PS      PP P   TPS+  P  DS PT S+ P P
Subjt:  QSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPP-PSIPSPPTPPSPPIPSPPTPPSPPIPSPTTPSSPPPSPDS-PTNSN-PIP

Query:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH
        P +  P +RRS R K PP WH+DY MS  VN  +S  +  +GTRYPL HYLS+SR S T   FLA IT+  EP++YD+AV DP WQ AMN E+ AL++N+
Subjt:  PDTSAP-LRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNH

Query:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP
        TW+LVPLP GHK IGC+WVYKIKY SDG++ERYKARLVAKGYTQVEGIDY ETFSPTAK+TTLRCLLTVAA+R WF HQLDVQNAFLHG+L E VYM  P
Subjt:  TWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLP

Query:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------
        PGLRRQGEN VCRL+KSLYGLKQASRNWFS FS  IQ AGY Q KADYSL                                                  
Subjt:  PGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP
                                                             ++WKSKKQ  VSRSSAE+EYRAMANTCLELTWLR+ILQDL V  + P
Subjt:  ----------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEP

Query:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
          L+CDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAG+I P YV T+ QLADV TKALG+ QF  L++KL
Subjt:  ALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

A0A4Y1QPA0 Wall-associated kinase 21.1e-12744.74Show/hide
Query:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIPSPPTPPSPPIPSPPTPPSPPIPSPTTP--------SSPPPSPDSPT
        M  + F +SRDV F E+ FP+    QT +  PS+ ++PL    +    PP S P+ P P     P+P TP  PP  + + P        SS  P   SPT
Subjt:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIPSPPTPPSPPIPSPPTPPSPPIPSPTTP--------SSPPPSPDSPT

Query:  NSN---PIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSP---GTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAM
         +    P PP+  APLR STR K  PAWH DY +S  +     SS P    TGT+YPL  +LS+S FSP+ R+FLA I+   EP TYD+AV DP W  AM
Subjt:  NSN---PIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSP---GTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAM

Query:  NDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHG
          E+ AL  N+TWSLVPLP G+K IGC+WVY+IKYNSDG++ERYKARLVAKGYTQVEG+DY ETFSPTAKLTTLRCLL +AA+R WF HQLDVQNAFLHG
Subjt:  NDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHG

Query:  NLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------
        +LDEEVYM  PPGLRRQGEN VCRL+KSLYGLKQASRNWFS FST IQNAG+ Q KADYSL                                       
Subjt:  NLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYI
                                                                        ++WKSKKQTNVSRSSA+AEYRAMA TCLELTWLRYI
Subjt:  ---------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYI

Query:  LQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
        LQDL V   EPA L+CDNQAAL+IAANPVFHERTKHIEIDCHIVREKL AGII   +V ++ QLAD+ TKALGR  F  +  KL
Subjt:  LQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

A0A5N6N393 Integrase catalytic domain-containing protein2.3e-13346.61Show/hide
Query:  MQSHTFFISRDVKFCEDDFPFSSAS--QTSTLAPSTLIVPLHDPSYSNIHPPPSIP-SPPTPPSPPIPSP-PTPPSPPI----PSPTTPSSPPPSPDSPT
        + S  FF+SRDVKF E  FPFSS S   +S+L P+      H+P  ++  P PS P      PS  IPS  P   SP +     S T+ S+  P+   PT
Subjt:  MQSHTFFISRDVKFCEDDFPFSSAS--QTSTLAPSTLIVPLHDPSYSNIHPPPSIP-SPPTPPSPPIPSP-PTPPSPPI----PSPTTPSSPPPSPDSPT

Query:  NSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAA
        N N  PP     LRRS+R KQ PAWH  Y M S  NH + ++S   GTRYPL  +LSFSRFSP+ R FL  IT+QTEP++YDEA+  P WQQAM  E+ A
Subjt:  NSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAA

Query:  LERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEV
        L+ N+TWSLVPLP+GHK IGCRWVYKIKYNSDG++ERYKARLVAKGYTQVEGIDY ETFSPTAKLTTLRCLLTVA AR WFTHQLDVQNAFLHG+L E V
Subjt:  LERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEV

Query:  YMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------------
        YM+ PPG  ++G+N VCRL+KSLYGLKQASRNWFS FS T+Q AGYTQ KADYSL                                             
Subjt:  YMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNV
                                                                  ++WKSKKQ NVSRSSAEAEYRAMANTCLELTWLRY+LQDL V
Subjt:  ---------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNV

Query:  PLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
        PLS P  LYCDN+AALHIAANPVFHERTKHIEIDCHIVR+K   G+I P Y+ T++QLAD+ TKALGR QF+ L++KL
Subjt:  PLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

A5BNR5 Integrase catalytic domain-containing protein3.0e-14146.6Show/hide
Query:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIP------SPPTPPSPPIPSPPT---------------PPSPPIPSPT
        +Q+    +SRDV F E+ FPF S+S  S     +L +PL   S+ +   P S+P      +PP     P+ SPP+                P P  PSP+
Subjt:  MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIP------SPPTPPSPPIPSPPT---------------PPSPPIPSPT

Query:  TPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGD
        + SSPP  P  P+N++   P    PLRRSTR  QPPAWH DY MS+ +NH ++ SS   GTRYPL  +LSF RFSP  RAFLAL+T+QTEP ++++A  D
Subjt:  TPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGD

Query:  PLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDV
        P W+QAM+ E+ ALERN+TW +VPLP GHK IGCRWVYKIKY+SDG++ERYKARLVAKGYTQV GIDY ETFSPTAKLTTLRCLLTVAA+R W+ HQLDV
Subjt:  PLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDV

Query:  QNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------
         NAFLHGNL EEVYM+ PPGLRRQGEN VCRL KS+YGLKQASRNWFS F+ T+++AGY Q KADYSL                                
Subjt:  QNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKADYSL--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLE
                                                                               ++WKSKKQTNVSRSSAEAEYRAMANTCLE
Subjt:  ----------------------------------------------------------------------ELTWKSKKQTNVSRSSAEAEYRAMANTCLE

Query:  LTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
        LTWLRYIL+DL V L +PA L+CDNQAAL+IAANPVFHERTKHIEIDCHIVREKLQAG+I+PCYVSTKMQLADV TKALGR+QF+FL  KL
Subjt:  LTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-4124.7Show/hide
Query:  WQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQN
        W++A+N E+ A + N+TW++   P     +  RWV+ +KYN  G+  RYKARLVA+G+TQ   IDY ETF+P A++++ R +L++        HQ+DV+ 
Subjt:  WQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQN

Query:  AFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQ-----------------------------------------
        AFL+G L EE+YM LP G+    +N VC+L+K++YGLKQA+R WF +F   ++   +                                           
Subjt:  AFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQ-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------FKADYSLE---------------------------------LTWKSKKQTNVSRSSAEAEYRAMANT
                                         FK + + E                                 + W +K+Q +V+ SS EAEY A+   
Subjt:  ---------------------------------FKADYSLE---------------------------------LTWKSKKQTNVSRSSAEAEYRAMANT

Query:  CLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL
          E  WL+++L  +N+ L  P  +Y DNQ  + IA NP  H+R KHI+I  H  RE++Q  +I   Y+ T+ QLAD+ TK L   +F  L+DKL
Subjt:  CLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQLADVLTKALGRQQFDFLKDKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-3943.55Show/hide
Query:  LITSQTEPKTYDEAVGDPLWQQ---AMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTT
        LI+   EP++  E +  P   Q   AM +E+ +L++N T+ LV LP G + + C+WV+K+K + D  + RYKARLV KG+ Q +GID+ E FSP  K+T+
Subjt:  LITSQTEPKTYDEAVGDPLWQQ---AMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTT

Query:  LRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQG-ENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKAD
        +R +L++AA+      QLDV+ AFLHG+L+EE+YM  P G    G ++ VC+L+KSLYGLKQA R W+  F + +++  Y +  +D
Subjt:  LRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQG-ENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKAD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-1738.14Show/hide
Query:  LTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQL
        ++W+SK Q  V+ S+ EAEY A   T  E+ WL+  LQ+L +   E  ++YCD+Q+A+ ++ N ++H RTKHI++  H +RE +    +K   +ST    
Subjt:  LTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCYVSTKMQL

Query:  ADVLTKALGRQQFDFLKD
        AD+LTK + R +F+  K+
Subjt:  ADVLTKALGRQQFDFLKD

P92520 Uncharacterized mitochondrial protein AtMg008202.9e-2451.46Show/hide
Query:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLL
        T + EPK+   A+ DP W QAM +E+ AL RN TW LVP P+    +GC+WV+K K +SDG+++R KARLVAKG+ Q EGI + ET+SP  +  T+R +L
Subjt:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLL

Query:  TVA
         VA
Subjt:  TVA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-5328.4Show/hide
Query:  MQSHTFFISRDVKFCEDDFPFSSASQT------------------STLAPSTLIVPLHDPSYSNIHPPPSIPSPPTPP---------------SPPIPSP
        +Q+   +ISR V+F E+ FPFS+   T                  +TL   T ++P   PS S+ H   + PS P+ P               S   PS 
Subjt:  MQSHTFFISRDVKFCEDDFPFSSASQT------------------STLAPSTLIVPLHDPSYSNIHPPPSIPSPPTPP---------------SPPIPSP

Query:  PTPPSP------PIPSPT-----TPSSPPPSPDSPTNSNP--IPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSP------GTGTRYPLHHYL
        P P +P      P   PT     T SS   S ++PTN +P  +    S P + S+ +  P         S     +     P          + PL+ + 
Subjt:  PTPPSP------PIPSPT-----TPSSPPPSPDSPTNSNP--IPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSP------GTGTRYPLHHYL

Query:  SFSR-----FSPTQRAFLAL-ITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAI-GCRWVYKIKYNSDGSVERYKARLVAKGYTQ
          +R       P  +  LA+ + +++EP+T  +A+ D  W+ AM  EI A   NHTW LVP P  H  I GCRW++  KYNSDGS+ RYKARLVAKGY Q
Subjt:  SFSR-----FSPTQRAFLAL-ITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAI-GCRWVYKIKYNSDGSVERYKARLVAKGYTQ

Query:  VEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHKSLYGLKQASRNWF--------------
          G+DY ETFSP  K T++R +L VA  R W   QLDV NAFL G L ++VYMS PPG + +   N VC+L K+LYGLKQA R W+              
Subjt:  VEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHKSLYGLKQASRNWF--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---SIFSTT--------------IQNAGYTQFKADYSLE-------------------------------------------------------------
           S++S T              +Q   +T+    Y++                                                              
Subjt:  ---SIFSTT--------------IQNAGYTQFKADYSLE-------------------------------------------------------------

Query:  ----------LTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIK
                  ++W SKKQ  V RSS EAEYR++ANT  E+ W+  +L +L + L+ P ++YCDN  A ++ ANPVFH R KHI ID H +R ++Q+G ++
Subjt:  ----------LTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIK

Query:  PCYVSTKMQLADVLTKALGRQQFDFLKDKLEGVQIRFVISPPS
          +VST  QLAD LTK L R  F     K+   ++     PPS
Subjt:  PCYVSTKMQLADVLTKALGRQQFDFLKDKLEGVQIRFVISPPS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.4e-5529.69Show/hide
Query:  HPPPSIPSPPTP-PSPPIPSPPTPPSPPIPSPTTPSSPPPSP-DSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLH
        +P P+ PSP +P  + P+P  P   SP IP+P+T  S P SP  S T++ P+PP   AP       + P            VN  + ++    G R P  
Subjt:  HPPPSIPSPPTP-PSPPIPSPPTPPSPPIPSPTTPSSPPPSP-DSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLH

Query:  HYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAI-GCRWVYKIKYNSDGSVERYKARLVAKGYTQVEG
         Y           ++   + + +EP+T  +A+ D  W+QAM  EI A   NHTW LVP P     I GCRW++  K+NSDGS+ RYKARLVAKGY Q  G
Subjt:  HYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAI-GCRWVYKIKYNSDGSVERYKARLVAKGYTQVEG

Query:  IDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKA
        +DY ETFSP  K T++R +L VA  R W   QLDV NAFL G L +EVYMS PPG + +   + VCRL K++YGLKQA R W+    T +   G+    +
Subjt:  IDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKA

Query:  DYSL------------------------------------------------------------------------------------------------
        D SL                                                                                                
Subjt:  DYSL------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCY
               ++W SKKQ  V RSS EAEYR++ANT  EL W+  +L +L + LS P ++YCDN  A ++ ANPVFH R KHI +D H +R ++Q+G ++  +
Subjt:  ------ELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPCY

Query:  VSTKMQLADVLTKALGRQQFDFLKDKLEGVQIRFVISPPS
        VST  QLAD LTK L R  F     K+  +++     PPS
Subjt:  VSTKMQLADVLTKALGRQQFDFLKDKLEGVQIRFVISPPS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-6632.68Show/hide
Query:  TTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVG
        +T SS      S    N +P  +     R TR    PA+ +DY             S  + T + +  +LS+ + SP   +FL  I    EP TY+EA  
Subjt:  TTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVG

Query:  DPLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLD
          +W  AM+DEI A+E  HTW +  LP   K IGC+WVYKIKYNSDG++ERYKARLVAKGYTQ EGID+ ETFSP  KLT+++ +L ++A   +  HQLD
Subjt:  DPLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLD

Query:  VQNAFLHGNLDEEVYMSLPPG-LRRQGE----NTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKAD-----------------------------
        + NAFL+G+LDEE+YM LPPG   RQG+    N VC L KS+YGLKQASR WF  FS T+   G+ Q  +D                             
Subjt:  VQNAFLHGNLDEEVYMSLPPG-LRRQGE----NTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQFKAD-----------------------------

Query:  -------------------------------------------YSLEL----------------------------------------------------
                                                   Y+L+L                                                    
Subjt:  -------------------------------------------YSLEL----------------------------------------------------

Query:  ------------------------------------------------------------------------------TWKSKKQTNVSRSSAEAEYRAM
                                                                                      +WKSKKQ  VS+SSAEAEYRA+
Subjt:  ------------------------------------------------------------------------------TWKSKKQTNVSRSSAEAEYRAM

Query:  ANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREK
        +    E+ WL    ++L +PLS+P LL+CDN AA+HIA N VFHERTKHIE DCH VRE+
Subjt:  ANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.1e-2551.46Show/hide
Query:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLL
        T + EPK+   A+ DP W QAM +E+ AL RN TW LVP P+    +GC+WV+K K +SDG+++R KARLVAKG+ Q EGI + ET+SP  +  T+R +L
Subjt:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIKYNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLL

Query:  TVA
         VA
Subjt:  TVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCTCACACATTCTTTATCAGCCGTGATGTCAAATTTTGTGAAGATGATTTTCCTTTTTCATCAGCTTCACAAACTTCGACATTAGCTCCTTCGACTCTTATTGT
ACCACTTCATGATCCATCCTACTCAAACATCCATCCTCCACCTTCTATTCCTTCACCTCCTACTCCTCCTTCACCTCCTATTCCTTCACCTCCTACTCCTCCTTCACCTC
CTATCCCTTCACCTACTACTCCGTCGTCTCCTCCACCTTCTCCAGATTCGCCCACTAATTCCAATCCTATCCCACCTGATACATCAGCTCCACTTCGACGTTCTACTCGT
ACTAAACAGCCTCCAGCTTGGCATAAGGATTATGAGATGTCTTCTGGAGTCAATCATTTAACCTCTAGCTCAAGTCCCGGCACTGGCACCAGGTATCCCCTTCATCATTA
CCTTTCATTCTCTCGTTTTTCTCCTACTCAACGTGCTTTTCTAGCTCTTATTACATCCCAGACAGAACCTAAAACCTATGACGAGGCAGTTGGCGACCCGTTATGGCAGC
AGGCTATGAATGATGAAATTGCAGCTTTGGAACGTAATCATACATGGTCTCTCGTTCCTCTACCACTTGGTCATAAAGCTATTGGTTGTCGTTGGGTGTACAAAATTAAA
TACAACTCTGATGGTTCTGTTGAACGTTATAAAGCTCGACTAGTAGCAAAGGGGTACACTCAGGTTGAAGGTATTGATTACACAGAAACATTTTCCCCTACAGCGAAACT
TACTACACTTCGTTGCTTACTCACTGTTGCTGCTGCTCGAAAATGGTTCACCCATCAGTTGGATGTTCAAAATGCCTTTCTCCATGGTAATCTAGACGAGGAAGTTTATA
TGTCTTTACCACCAGGTCTTCGCCGACAGGGGGAGAATACAGTATGTCGGCTCCATAAATCTCTTTATGGATTAAAACAGGCTTCTCGCAATTGGTTCTCCATATTTTCT
ACAACTATACAAAATGCAGGCTACACTCAGTTCAAAGCAGATTACTCTTTAGAGTTAACTTGGAAGTCTAAAAAGCAGACTAATGTGTCCAGATCATCAGCAGAAGCCGA
GTATCGAGCTATGGCAAATACTTGTTTAGAGTTAACTTGGTTAAGATACATTCTTCAAGACTTGAATGTTCCACTGTCCGAACCAGCATTATTATATTGTGATAATCAAG
CAGCATTACATATAGCAGCCAATCCAGTTTTTCATGAACGTACGAAACACATTGAAATAGATTGTCATATAGTTCGAGAAAAATTACAAGCTGGAATCATCAAACCGTGT
TATGTTTCGACCAAAATGCAATTGGCAGATGTTCTTACTAAAGCTTTGGGAAGACAGCAATTTGACTTTTTGAAGGACAAGTTGGAAGGAGTGCAGATCAGATTTGTGAT
CTCACCACCTTCTTCTTTCTTGTTTATTGACTTCCCAATCAAATTGCAGTATGTGTTCAGGTTCCATGACGACAGAAAGAGAGAAATAAATGAAGAATCCAGCCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAATCTCACACATTCTTTATCAGCCGTGATGTCAAATTTTGTGAAGATGATTTTCCTTTTTCATCAGCTTCACAAACTTCGACATTAGCTCCTTCGACTCTTATTGT
ACCACTTCATGATCCATCCTACTCAAACATCCATCCTCCACCTTCTATTCCTTCACCTCCTACTCCTCCTTCACCTCCTATTCCTTCACCTCCTACTCCTCCTTCACCTC
CTATCCCTTCACCTACTACTCCGTCGTCTCCTCCACCTTCTCCAGATTCGCCCACTAATTCCAATCCTATCCCACCTGATACATCAGCTCCACTTCGACGTTCTACTCGT
ACTAAACAGCCTCCAGCTTGGCATAAGGATTATGAGATGTCTTCTGGAGTCAATCATTTAACCTCTAGCTCAAGTCCCGGCACTGGCACCAGGTATCCCCTTCATCATTA
CCTTTCATTCTCTCGTTTTTCTCCTACTCAACGTGCTTTTCTAGCTCTTATTACATCCCAGACAGAACCTAAAACCTATGACGAGGCAGTTGGCGACCCGTTATGGCAGC
AGGCTATGAATGATGAAATTGCAGCTTTGGAACGTAATCATACATGGTCTCTCGTTCCTCTACCACTTGGTCATAAAGCTATTGGTTGTCGTTGGGTGTACAAAATTAAA
TACAACTCTGATGGTTCTGTTGAACGTTATAAAGCTCGACTAGTAGCAAAGGGGTACACTCAGGTTGAAGGTATTGATTACACAGAAACATTTTCCCCTACAGCGAAACT
TACTACACTTCGTTGCTTACTCACTGTTGCTGCTGCTCGAAAATGGTTCACCCATCAGTTGGATGTTCAAAATGCCTTTCTCCATGGTAATCTAGACGAGGAAGTTTATA
TGTCTTTACCACCAGGTCTTCGCCGACAGGGGGAGAATACAGTATGTCGGCTCCATAAATCTCTTTATGGATTAAAACAGGCTTCTCGCAATTGGTTCTCCATATTTTCT
ACAACTATACAAAATGCAGGCTACACTCAGTTCAAAGCAGATTACTCTTTAGAGTTAACTTGGAAGTCTAAAAAGCAGACTAATGTGTCCAGATCATCAGCAGAAGCCGA
GTATCGAGCTATGGCAAATACTTGTTTAGAGTTAACTTGGTTAAGATACATTCTTCAAGACTTGAATGTTCCACTGTCCGAACCAGCATTATTATATTGTGATAATCAAG
CAGCATTACATATAGCAGCCAATCCAGTTTTTCATGAACGTACGAAACACATTGAAATAGATTGTCATATAGTTCGAGAAAAATTACAAGCTGGAATCATCAAACCGTGT
TATGTTTCGACCAAAATGCAATTGGCAGATGTTCTTACTAAAGCTTTGGGAAGACAGCAATTTGACTTTTTGAAGGACAAGTTGGAAGGAGTGCAGATCAGATTTGTGAT
CTCACCACCTTCTTCTTTCTTGTTTATTGACTTCCCAATCAAATTGCAGTATGTGTTCAGGTTCCATGACGACAGAAAGAGAGAAATAAATGAAGAATCCAGCCGTTGA
Protein sequenceShow/hide protein sequence
MQSHTFFISRDVKFCEDDFPFSSASQTSTLAPSTLIVPLHDPSYSNIHPPPSIPSPPTPPSPPIPSPPTPPSPPIPSPTTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTR
TKQPPAWHKDYEMSSGVNHLTSSSSPGTGTRYPLHHYLSFSRFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPLGHKAIGCRWVYKIK
YNSDGSVERYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLLTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFS
TTIQNAGYTQFKADYSLELTWKSKKQTNVSRSSAEAEYRAMANTCLELTWLRYILQDLNVPLSEPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKLQAGIIKPC
YVSTKMQLADVLTKALGRQQFDFLKDKLEGVQIRFVISPPSSFLFIDFPIKLQYVFRFHDDRKREINEESSR