; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G012540 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G012540
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCmo_Chr09:10810615..10815444
RNA-Seq ExpressionCmoCh09G012540
SyntenyCmoCh09G012540
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD1840611.1 unnamed protein product [Ananas comosus var. bracteatus]5.0e-14652.26Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHD-QFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLM
        +I+SWLTHSVE D+A+G+IHAKTA+QVW DL D QFSQKNAPAIFQIQ SIAT+SQG+M++++Y+ KLKALWDELE YR P+TCN  + H +Q +ED+LM
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHD-QFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLM

Query:  QLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQDLAIGKMIGSGKQFGGLYHISSSPIKS---------SAHQVSQSSDLWHL-----RLGY
        Q LMGLN+SYK VRSNILMM+PLPNVRQAYS ++QE+ QRQ   +   +             +S IK+         S H + +   L +      + G+
Subjt:  QLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQDLAIGKMIGSGKQFGGLYHISSSPIKS---------SAHQVSQSSDLWHL-----RLGY

Query:  P---CGHKGYKLYHMQSHKFFISR--------DVKFCEDDFPFSS---------ASQTSTLAP--------------------STPVVPLHDPSYSNIHP
            C  K  K    Q ++   S+        +     DD    S         A   ST+                      S P + L   S ++   
Subjt:  P---CGHKGYKLYHMQSHKFFISR--------DVKFCEDDFPFSS---------ASQTSTLAP--------------------STPVVPLHDPSYSNIHP

Query:  PPSIP-SPPIPPSPPIPLPTTPSSPPPS----PDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSP
        P + P  PP+ P    PLP   + P P      +S  +S P P +    LRRSTR +QPPAWH++Y MS+  NH +  +SP   TRYPL H+LS S FS 
Subjt:  PPSIP-SPPIPPSPPIPLPTTPSSPPPS----PDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSP

Query:  TQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTA
        + R FLA ITSQ EP TY +AV  P WQ AM+ E+ AL+RN+TWSLV LP  HK IGCRWVYKIKY+SDG++E YKARLVAKGYTQ+ G+DY ETFSPTA
Subjt:  TQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTA

Query:  KLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK
        KLTTLRCL T+AAAR WFT QLDVQNAFLHGNL E VYM  PPGLRRQGEN VCRL+KSLYGLKQASRNWFS+ S  +++A ++QSKADYSLFTK +
Subjt:  KLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK

CAN68148.1 hypothetical protein VITISV_035665 [Vitis vinifera]2.6e-14236.11Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTH+VE+DIA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQR-------------------------------------------------------------
         LMGLN+SYK VRSNILMMSPLPNVRQAYSL+VQEEMQR                                                             
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQR-------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------QDLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL
                                                                     QDL  GKMIGSGKQ GGLY++  S  KS    VSQ SDL
Subjt:  -------------------------------------------------------------QDLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL

Query:  WHLR------------------------------------------------------------------------------------------------
        WHLR                                                                                                
Subjt:  WHLR------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPF-SSASQTSTLAPSTPV--
                                                       LGYP G KGYK+  +Q+ K  +SRDV F E+ FPF SS+SQ+   +PS P+  
Subjt:  -----------------------------------------------LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPF-SSASQTSTLAPSTPV--

Query:  --------VPLHDPSYSNIHPPP-----SIPSPP-----IP----------PSPPIPLPTTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHK
                 P+  P +S    PP      + SPP     +P          P P  P P++ SSPP  P  P+N++   P    PLRRSTR  QPPAWH 
Subjt:  --------VPLHDPSYSNIHPPP-----SIPSPP-----IP----------PSPPIPLPTTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHK

Query:  DYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKI
        DY MS+  NH ++ SS   GTRYPL  +LSF  FSP  RAFLAL+T+QTEP ++++A  DP W+QAM+ E+ ALERN+TW +VPLPPGHK IGCRWVYKI
Subjt:  DYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKI

Query:  KYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLK
        KY+SDG++E YKARLVAKGYTQV GIDY ETFSPTAKLTTLRCL TVAA+R W+ HQLDV NAFLHGNL EEVYM+ PPGLRRQGEN VCRL KS+YGLK
Subjt:  KYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLK

Query:  QASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK
        QASRNWFS F+ T+++AGY QSKADYSLFTKS+
Subjt:  QASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK

PNX93906.1 hypothetical protein L195_g017068 [Trifolium pratense]5.5e-16145.54Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTHSVE D+AKG+IHAKTA+QVW D  DQFSQKN PAI+QIQ S+A++SQGTM++STYFTK+K LWDELE+YRT  TC+Q + H +QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ------------------------------------------------------------
         LMGLN SY TVRSNILMMSPLPNVRQAYSL++QEE QRQ                                                            
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDLWHLRL-----------
                                                      DLA GKMIGSG+Q GGLY++S       +HQVSQ+SD+WH+RL           
Subjt:  ----------------------------------------------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDLWHLRL-----------

Query:  ----------------------------GYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFP-FSSASQTSTLAPSTPVVPLHDPSYSNIHPPPSIPSPPI
                                    GYP G KGYK+Y  ++  FF+SRDV+FCE DFP   + S+ ++++   P   L D   S I  P  +PS   
Subjt:  ----------------------------GYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFP-FSSASQTSTLAPSTPVVPLHDPSYSNIHPPPSIPSPPI

Query:  PPSPPIPLPTTPSSPPPSPDS-PTNSN-PIPPDTSAP-LRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITS
           PP P   TPS+  P  DS PT S+ P PP +  P +RRS R K PP WH+DY MS   N  +S  +  +GTRYPL HYLS+S  S T   FLA IT+
Subjt:  PPSPPIPLPTTPSSPPPSPDS-PTNSN-PIPPDTSAP-LRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITS

Query:  QTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTV
          EP++YD+AV DP WQ AMN E+ AL++N+TW+LVPLPPGHK IGC+WVYKIKY SDG++E YKARLVAKGYTQVEGIDY ETFSPTAK+TTLRCL TV
Subjt:  QTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTV

Query:  AAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTK
        AA+R WF HQLDVQNAFLHG+L E VYM  PPGLRRQGEN VCRL+KSLYGLKQASRNWFS FS  IQ AGY QSKADYSLFTK
Subjt:  AAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTK

RVW69506.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.7e-14448.8Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTH+VE+DIA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMI-GSG-----------------------------------------
         LMGLN+SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQ       + +I   + G G                                         
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMI-GSG-----------------------------------------

Query:  ----------------KQFGGLYHISSSPIKSSAHQVSQSSD-------------------LWHLRLG------YPCGHKGYKLYHMQSHKFF-------
                        + FG     S++  KS     S SS                    L H   G         G   + + HM    +        
Subjt:  ----------------KQFGGLYHISSSPIKSSAHQVSQSSD-------------------LWHLRLG------YPCGHKGYKLYHMQSHKFF-------

Query:  --ISRDVKFCEDDFPFSSASQ--------------TSTLAP---------STPVVPLHDPSYSNIHPPPSIPSPPIPPSPPIPLPTTP-----SSPPPSP
          +   + F EDD+  SS SQ               ST  P         STP +  H+P  S       +P P  P S   PLP++P     SSPP  P
Subjt:  --ISRDVKFCEDDFPFSSASQ--------------TSTLAP---------STPVVPLHDPSYSNIHPPPSIPSPPIPPSPPIPLPTTP-----SSPPPSP

Query:  DSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMND
          P+N++   P    PLRRSTR  QPPAWH DY MS+  NH ++ SS                              +Q EP ++++A  DP W QAM+ 
Subjt:  DSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMND

Query:  EIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNL
        ++ ALERN+TW +VPL PGHK IGCRWVYKIKY+SDG++E YKARLVAKGYTQV GIDY ETFSPTAKLTTLRCL TVAA+R W+ HQLDV NAFLHGNL
Subjt:  EIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNL

Query:  DEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK
         EEVYM+ PPGLRRQGEN VCRL KS+YGLKQASRNWFS F+ T+++ GY QSKADYSLFTKS+
Subjt:  DEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK

RVX15598.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]6.6e-13850.58Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTH VE+DIA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL-WHLRLGYPCGHKGY
         LMGL++SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQ       + +I   +   ++ G            S H V +   L +H +    C  +G+
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL-WHLRLGYPCGHKGY

Query:  -----KLYHMQSHKFFISRDVK-FCEDDFPFSSASQTSTLAPST------------------PVVPLHDPSYSNIHPPPSIPS------------PPIPP
             +L +  ++K    R  + F   + P ++A+++  ++ ST                   +  L+  +  NI    +                 + P
Subjt:  -----KLYHMQSHKFFISRDVK-FCEDDFPFSSASQTSTLAPST------------------PVVPLHDPSYSNIHPPPSIPS------------PPIPP

Query:  S--PPIPLPTTPS----------SPPPSPDS----------PTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLS
        S    + LP              S  P  DS          P+N++   P    PLRRSTR  QPPAWH DY MS+  NH ++ SS              
Subjt:  S--PPIPLPTTPS----------SPPPSPDS----------PTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLS

Query:  FSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTE
                        +Q EP ++++A  DP W+QAM+ E+ ALERN+TW +VPLPPGHK IGCRWVYKIKY+ DG++E YKA LVAKGYTQV GIDY E
Subjt:  FSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTE

Query:  TFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFT
        TFSPTAKLTTLRCL TVAA+R W+ HQLDV NAFLHGNL EEVYM+ PPGLRRQGEN VCRL KS+YGLKQASRNWFS F+ T+++AGY QSKADYSLFT
Subjt:  TFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFT

Query:  KSK
        KS+
Subjt:  KSK

TrEMBL top hitse value%identityAlignment
A0A2K3MT28 Uncharacterized protein2.7e-16145.54Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTHSVE D+AKG+IHAKTA+QVW D  DQFSQKN PAI+QIQ S+A++SQGTM++STYFTK+K LWDELE+YRT  TC+Q + H +QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ------------------------------------------------------------
         LMGLN SY TVRSNILMMSPLPNVRQAYSL++QEE QRQ                                                            
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDLWHLRL-----------
                                                      DLA GKMIGSG+Q GGLY++S       +HQVSQ+SD+WH+RL           
Subjt:  ----------------------------------------------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDLWHLRL-----------

Query:  ----------------------------GYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFP-FSSASQTSTLAPSTPVVPLHDPSYSNIHPPPSIPSPPI
                                    GYP G KGYK+Y  ++  FF+SRDV+FCE DFP   + S+ ++++   P   L D   S I  P  +PS   
Subjt:  ----------------------------GYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFP-FSSASQTSTLAPSTPVVPLHDPSYSNIHPPPSIPSPPI

Query:  PPSPPIPLPTTPSSPPPSPDS-PTNSN-PIPPDTSAP-LRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITS
           PP P   TPS+  P  DS PT S+ P PP +  P +RRS R K PP WH+DY MS   N  +S  +  +GTRYPL HYLS+S  S T   FLA IT+
Subjt:  PPSPPIPLPTTPSSPPPSPDS-PTNSN-PIPPDTSAP-LRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITS

Query:  QTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTV
          EP++YD+AV DP WQ AMN E+ AL++N+TW+LVPLPPGHK IGC+WVYKIKY SDG++E YKARLVAKGYTQVEGIDY ETFSPTAK+TTLRCL TV
Subjt:  QTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTV

Query:  AAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTK
        AA+R WF HQLDVQNAFLHG+L E VYM  PPGLRRQGEN VCRL+KSLYGLKQASRNWFS FS  IQ AGY QSKADYSLFTK
Subjt:  AAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTK

A0A438GBE7 Retrovirus-related Pol polyprotein from transposon RE12.3e-14448.8Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTH+VE+DIA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMI-GSG-----------------------------------------
         LMGLN+SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQ       + +I   + G G                                         
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMI-GSG-----------------------------------------

Query:  ----------------KQFGGLYHISSSPIKSSAHQVSQSSD-------------------LWHLRLG------YPCGHKGYKLYHMQSHKFF-------
                        + FG     S++  KS     S SS                    L H   G         G   + + HM    +        
Subjt:  ----------------KQFGGLYHISSSPIKSSAHQVSQSSD-------------------LWHLRLG------YPCGHKGYKLYHMQSHKFF-------

Query:  --ISRDVKFCEDDFPFSSASQ--------------TSTLAP---------STPVVPLHDPSYSNIHPPPSIPSPPIPPSPPIPLPTTP-----SSPPPSP
          +   + F EDD+  SS SQ               ST  P         STP +  H+P  S       +P P  P S   PLP++P     SSPP  P
Subjt:  --ISRDVKFCEDDFPFSSASQ--------------TSTLAP---------STPVVPLHDPSYSNIHPPPSIPSPPIPPSPPIPLPTTP-----SSPPPSP

Query:  DSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMND
          P+N++   P    PLRRSTR  QPPAWH DY MS+  NH ++ SS                              +Q EP ++++A  DP W QAM+ 
Subjt:  DSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMND

Query:  EIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNL
        ++ ALERN+TW +VPL PGHK IGCRWVYKIKY+SDG++E YKARLVAKGYTQV GIDY ETFSPTAKLTTLRCL TVAA+R W+ HQLDV NAFLHGNL
Subjt:  EIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNL

Query:  DEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK
         EEVYM+ PPGLRRQGEN VCRL KS+YGLKQASRNWFS F+ T+++ GY QSKADYSLFTKS+
Subjt:  DEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK

A0A438K345 Retrovirus-related Pol polyprotein from transposon RE13.2e-13850.58Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTH VE+DIA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL-WHLRLGYPCGHKGY
         LMGL++SYK VRSNILMMSPLPNVRQAYSL+VQEEMQRQ       + +I   +   ++ G            S H V +   L +H +    C  +G+
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQ-------DLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL-WHLRLGYPCGHKGY

Query:  -----KLYHMQSHKFFISRDVK-FCEDDFPFSSASQTSTLAPST------------------PVVPLHDPSYSNIHPPPSIPS------------PPIPP
             +L +  ++K    R  + F   + P ++A+++  ++ ST                   +  L+  +  NI    +                 + P
Subjt:  -----KLYHMQSHKFFISRDVK-FCEDDFPFSSASQTSTLAPST------------------PVVPLHDPSYSNIHPPPSIPS------------PPIPP

Query:  S--PPIPLPTTPS----------SPPPSPDS----------PTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLS
        S    + LP              S  P  DS          P+N++   P    PLRRSTR  QPPAWH DY MS+  NH ++ SS              
Subjt:  S--PPIPLPTTPS----------SPPPSPDS----------PTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLS

Query:  FSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTE
                        +Q EP ++++A  DP W+QAM+ E+ ALERN+TW +VPLPPGHK IGCRWVYKIKY+ DG++E YKA LVAKGYTQV GIDY E
Subjt:  FSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTE

Query:  TFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFT
        TFSPTAKLTTLRCL TVAA+R W+ HQLDV NAFLHGNL EEVYM+ PPGLRRQGEN VCRL KS+YGLKQASRNWFS F+ T+++AGY QSKADYSLFT
Subjt:  TFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFT

Query:  KSK
        KS+
Subjt:  KSK

A0A6V7QCA8 Uncharacterized protein2.4e-14652.26Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHD-QFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLM
        +I+SWLTHSVE D+A+G+IHAKTA+QVW DL D QFSQKNAPAIFQIQ SIAT+SQG+M++++Y+ KLKALWDELE YR P+TCN  + H +Q +ED+LM
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHD-QFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLM

Query:  QLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQDLAIGKMIGSGKQFGGLYHISSSPIKS---------SAHQVSQSSDLWHL-----RLGY
        Q LMGLN+SYK VRSNILMM+PLPNVRQAYS ++QE+ QRQ   +   +             +S IK+         S H + +   L +      + G+
Subjt:  QLLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQRQDLAIGKMIGSGKQFGGLYHISSSPIKS---------SAHQVSQSSDLWHL-----RLGY

Query:  P---CGHKGYKLYHMQSHKFFISR--------DVKFCEDDFPFSS---------ASQTSTLAP--------------------STPVVPLHDPSYSNIHP
            C  K  K    Q ++   S+        +     DD    S         A   ST+                      S P + L   S ++   
Subjt:  P---CGHKGYKLYHMQSHKFFISR--------DVKFCEDDFPFSS---------ASQTSTLAP--------------------STPVVPLHDPSYSNIHP

Query:  PPSIP-SPPIPPSPPIPLPTTPSSPPPS----PDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSP
        P + P  PP+ P    PLP   + P P      +S  +S P P +    LRRSTR +QPPAWH++Y MS+  NH +  +SP   TRYPL H+LS S FS 
Subjt:  PPSIP-SPPIPPSPPIPLPTTPSSPPPS----PDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSP

Query:  TQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTA
        + R FLA ITSQ EP TY +AV  P WQ AM+ E+ AL+RN+TWSLV LP  HK IGCRWVYKIKY+SDG++E YKARLVAKGYTQ+ G+DY ETFSPTA
Subjt:  TQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTA

Query:  KLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK
        KLTTLRCL T+AAAR WFT QLDVQNAFLHGNL E VYM  PPGLRRQGEN VCRL+KSLYGLKQASRNWFS+ S  +++A ++QSKADYSLFTK +
Subjt:  KLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK

A5BNR5 Integrase catalytic domain-containing protein1.3e-14236.11Show/hide
Query:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ
        MI+SWLTH+VE+DIA+GIIHAKTA +VWVDL DQFSQKNAPA+FQIQ SIATMSQGTM ++ YFTK+KALWDELE YR+P TCNQ Q H++QREED+LMQ
Subjt:  MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQ

Query:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQR-------------------------------------------------------------
         LMGLN+SYK VRSNILMMSPLPNVRQAYSL+VQEEMQR                                                             
Subjt:  LLMGLNQSYKTVRSNILMMSPLPNVRQAYSLLVQEEMQR-------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------QDLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL
                                                                     QDL  GKMIGSGKQ GGLY++  S  KS    VSQ SDL
Subjt:  -------------------------------------------------------------QDLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDL

Query:  WHLR------------------------------------------------------------------------------------------------
        WHLR                                                                                                
Subjt:  WHLR------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPF-SSASQTSTLAPSTPV--
                                                       LGYP G KGYK+  +Q+ K  +SRDV F E+ FPF SS+SQ+   +PS P+  
Subjt:  -----------------------------------------------LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPF-SSASQTSTLAPSTPV--

Query:  --------VPLHDPSYSNIHPPP-----SIPSPP-----IP----------PSPPIPLPTTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHK
                 P+  P +S    PP      + SPP     +P          P P  P P++ SSPP  P  P+N++   P    PLRRSTR  QPPAWH 
Subjt:  --------VPLHDPSYSNIHPPP-----SIPSPP-----IP----------PSPPIPLPTTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHK

Query:  DYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKI
        DY MS+  NH ++ SS   GTRYPL  +LSF  FSP  RAFLAL+T+QTEP ++++A  DP W+QAM+ E+ ALERN+TW +VPLPPGHK IGCRWVYKI
Subjt:  DYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKI

Query:  KYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLK
        KY+SDG++E YKARLVAKGYTQV GIDY ETFSPTAKLTTLRCL TVAA+R W+ HQLDV NAFLHGNL EEVYM+ PPGLRRQGEN VCRL KS+YGLK
Subjt:  KYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLK

Query:  QASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK
        QASRNWFS F+ T+++AGY QSKADYSLFTKS+
Subjt:  QASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.4e-3640.35Show/hide
Query:  WQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQN
        W++A+N E+ A + N+TW++   P     +  RWV+ +KYN  G+   YKARLVA+G+TQ   IDY ETF+P A++++ R + ++        HQ+DV+ 
Subjt:  WQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQN

Query:  AFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK
        AFL+G L EE+YM LP G+    +N VC+L+K++YGLKQA+R WF +F   ++   +  S  D  ++   K
Subjt:  AFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-3941.67Show/hide
Query:  LITSQTEPKTYDEAVGDPLWQQ---AMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTT
        LI+   EP++  E +  P   Q   AM +E+ +L++N T+ LV LP G + + C+WV+K+K + D  +  YKARLV KG+ Q +GID+ E FSP  K+T+
Subjt:  LITSQTEPKTYDEAVGDPLWQQ---AMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTT

Query:  LRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQG-ENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTK
        +R + ++AA+      QLDV+ AFLHG+L+EE+YM  P G    G ++ VC+L+KSLYGLKQA R W+  F + +++  Y ++ +D  ++ K
Subjt:  LRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQG-ENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTK

P92520 Uncharacterized mitochondrial protein AtMg008201.2e-2249.51Show/hide
Query:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLF
        T + EPK+   A+ DP W QAM +E+ AL RN TW LVP P     +GC+WV+K K +SDG+++  KARLVAKG+ Q EGI + ET+SP  +  T+R + 
Subjt:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLF

Query:  TVA
         VA
Subjt:  TVA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-4833.21Show/hide
Query:  LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPFSSASQT------------------STLAPSTPVVPLHDPSYSNIH---PPPSIPSPPIPPS----
        LGY      Y   H+Q+ + +ISR V+F E+ FPFS+   T                  +TL   TPV+P   PS S+ H    PPS PS P   S    
Subjt:  LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPFSSASQT------------------STLAPSTPVVPLHDPSYSNIH---PPPSIPSPPIPPS----

Query:  -----------PPIPLPTTPSSPPPSP-------------------DSPTNSNP--IPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSP----
                   P  P PT P    P P                   ++PTN +P  +    S P + S+ +  P         S     +     P    
Subjt:  -----------PPIPLPTTPSSPPPSP-------------------DSPTNSNP--IPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSP----

Query:  --GTGTRYPLHHYLSFSH-----FSPTQRAFLAL-ITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAI-GCRWVYKIKYNSDGSV
              + PL+ +   +        P  +  LA+ + +++EP+T  +A+ D  W+ AM  EI A   NHTW LVP PP H  I GCRW++  KYNSDGS+
Subjt:  --GTGTRYPLHHYLSFSH-----FSPTQRAFLAL-ITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAI-GCRWVYKIKYNSDGSV

Query:  ECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHKSLYGLKQASRNWF
          YKARLVAKGY Q  G+DY ETFSP  K T++R +  VA  R W   QLDV NAFL G L ++VYMS PPG + +   N VC+L K+LYGLKQA R W+
Subjt:  ECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHKSLYGLKQASRNWF

Query:  SIFSTTIQNAGYTQSKADYSLFT----KSKVIAWQCCVKVGATFGDVT----GVMNLDHQFSFR----------------ALGSHETQAPTVAPIGSSTE
              +   G+  S +D SLF     KS V        +  T  D T     + NL  +FS +                  G H +Q   +  + + T 
Subjt:  SIFSTTIQNAGYTQSKADYSLFT----KSKVIAWQCCVKVGATFGDVT----GVMNLDHQFSFR----------------ALGSHETQAPTVAPIGSSTE

Query:  VIVDAPFGSIAQTPEALTPEGASVGGTALT
        +I   P      TP A +P+ +   GT LT
Subjt:  VIVDAPFGSIAQTPEALTPEGASVGGTALT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-4535.63Show/hide
Query:  LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPFS------SASQ-----------TSTLAPSTPVV----PLHDPSYSNIHPPPSIPSP---------
        +GY      Y   H+ + + + SR V+F E  FPFS      S SQ           + T  P+TP+V    P   P       PPS PSP         
Subjt:  LGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPFS------SASQ-----------TSTLAPSTPVV----PLHDPSYSNIHPPPSIPSP---------

Query:  --------------PIPPSPPIPLPT-----------------TPSSPPPSPDSPTNSNPIP-----------PDTS-----APLRRSTRTKQ-PPAWHK
                      P  PS   P PT                  P+   PSP+SP  ++P+P           P TS     +P   ST T   PP    
Subjt:  --------------PIPPSPPIPLPT-----------------TPSSPPPSPDSPTNSNPIP-----------PDTS-----APLRRSTRTKQ-PPAWHK

Query:  DYEMSSGANHLTSSSSPGT----GTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAI-GCR
           +   A    ++ S  T    G R P   Y           ++   + + +EP+T  +A+ D  W+QAM  EI A   NHTW LVP PP    I GCR
Subjt:  DYEMSSGANHLTSSSSPGT----GTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAI-GCR

Query:  WVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHK
        W++  K+NSDGS+  YKARLVAKGY Q  G+DY ETFSP  K T++R +  VA  R W   QLDV NAFL G L +EVYMS PPG + +   + VCRL K
Subjt:  WVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPG-LRRQGENTVCRLHK

Query:  SLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLF
        ++YGLKQA R W+    T +   G+  S +D SLF
Subjt:  SLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.6e-6046.32Show/hide
Query:  TTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVG
        +T SS      S    N +P  +     R TR    PA+ +DY   S A+           T + +  +LS+   SP   +FL  I    EP TY+EA  
Subjt:  TTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLHHYLSFSHFSPTQRAFLALITSQTEPKTYDEAVG

Query:  DPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLD
          +W  AM+DEI A+E  HTW +  LPP  K IGC+WVYKIKYNSDG++E YKARLVAKGYTQ EGID+ ETFSP  KLT+++ +  ++A   +  HQLD
Subjt:  DPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLFTVAAARKWFTHQLD

Query:  VQNAFLHGNLDEEVYMSLPPG-LRRQGE----NTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSKVIAWQC
        + NAFL+G+LDEE+YM LPPG   RQG+    N VC L KS+YGLKQASR WF  FS T+   G+ QS +D++ F K     + C
Subjt:  VQNAFLHGNLDEEVYMSLPPG-LRRQGE----NTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSKVIAWQC

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.9e-2449.51Show/hide
Query:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLF
        T + EPK+   A+ DP W QAM +E+ AL RN TW LVP P     +GC+WV+K K +SDG+++  KARLVAKG+ Q EGI + ET+SP  +  T+R + 
Subjt:  TSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTAKLTTLRCLF

Query:  TVA
         VA
Subjt:  TVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACA
AAAGAATGCTCCAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAAC
TGGAAGCGTACCGCACACCATTTACCTGTAATCAATGTCAAATACATATTGATCAACGCGAAGAAGACAAGTTGATGCAATTGCTCATGGGGCTCAATCAGTCTTATAAA
ACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAACGTGAGGCAAGCCTATTCATTACTTGTACAAGAAGAGATGCAGCGTCAGGACTTGGCTATAGGGAAGAT
GATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAATCTTCAGCTCATCAAGTATCTCAGTCATCTGATTTGTGGCATTTACGCCTAG
GATATCCTTGTGGTCATAAAGGTTACAAGTTGTATCACATGCAATCTCACAAATTCTTTATCAGCCGTGATGTTAAATTTTGTGAAGATGATTTTCCTTTTTCATCAGCT
TCACAAACTTCGACATTAGCTCCTTCGACTCCTGTTGTACCACTTCATGATCCATCCTACTCAAACATCCATCCTCCACCTTCTATTCCTTCACCTCCTATTCCTCCTTC
ACCTCCTATCCCTTTACCTACTACTCCGTCCTCTCCTCCACCTTCTCCAGATTCGCCCACTAATTCCAATCCTATCCCACCTGATACATCAGCTCCACTTCGACGTTCTA
CTCGTACTAAACAGCCTCCAGCTTGGCATAAGGATTATGAGATGTCTTCTGGAGCCAATCATTTAACCTCTAGCTCAAGTCCCGGCACTGGCACCAGGTATCCCCTTCAT
CATTACCTTTCATTCTCTCATTTTTCTCCTACTCAACGTGCTTTTCTAGCTCTTATTACATCCCAGACAGAACCTAAAACCTATGACGAGGCAGTTGGCGACCCGTTATG
GCAGCAGGCTATGAATGATGAAATTGCAGCTTTGGAACGTAATCATACATGGTCTCTCGTTCCTCTACCACCTGGTCATAAAGCTATTGGTTGTCGTTGGGTGTACAAAA
TTAAATACAACTCTGATGGTTCTGTTGAATGTTATAAAGCTCGACTTGTAGCAAAGGGATACACTCAAGTTGAAGGTATTGATTACACAGAAACATTTTCCCCTACAGCG
AAACTTACTACACTTCGTTGCTTATTCACTGTTGCTGCTGCTCGGAAATGGTTCACCCATCAGTTGGATGTTCAAAATGCCTTTCTCCATGGTAATCTAGACGAGGAAGT
TTATATGTCTTTACCGCCAGGTCTTCGCCGACAGGGGGAGAATACAGTATGTCGGCTCCATAAATCTCTTTATGGATTAAAACAGGCTTCTCGCAATTGGTTCTCCATAT
TTTCTACAACTATACAAAATGCAGGCTACACTCAGTCCAAAGCAGATTACTCTTTGTTTACCAAGAGTAAAGTGATTGCTTGGCAATGTTGTGTGAAAGTGGGTGCCACT
TTTGGTGATGTCACCGGCGTTATGAATTTGGATCATCAATTTTCATTTCGTGCTCTTGGATCACATGAAACACAAGCCCCAACAGTTGCACCAATTGGAAGTAGTACAGA
AGTCATAGTGGATGCACCGTTTGGAAGCATTGCTCAAACTCCGGAGGCTCTAACCCCGGAGGGTGCATCGGTTGGAGGCACTGCTCTAACCCCGGAGGGTGCATCGGTTG
GAGGCACTGCTCTAACCCCAGAGGGTGCATCGGTTGGAGGCACTGCTCTAACCCCAGAGGGTGCATCGGTTGGAGGCACTGCTCTAACCCCTGAAGGTGCATCGGTTGGA
GGCACTGCACTAACCCCAACCATTACACCAATTGGAAGTAGCACACAAGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATAATATCTTGGTTAACTCATTCCGTTGAAGCAGATATCGCTAAAGGCATTATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCACGATCAATTCTCACA
AAAGAATGCTCCAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAAC
TGGAAGCGTACCGCACACCATTTACCTGTAATCAATGTCAAATACATATTGATCAACGCGAAGAAGACAAGTTGATGCAATTGCTCATGGGGCTCAATCAGTCTTATAAA
ACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAACGTGAGGCAAGCCTATTCATTACTTGTACAAGAAGAGATGCAGCGTCAGGACTTGGCTATAGGGAAGAT
GATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAATCTTCAGCTCATCAAGTATCTCAGTCATCTGATTTGTGGCATTTACGCCTAG
GATATCCTTGTGGTCATAAAGGTTACAAGTTGTATCACATGCAATCTCACAAATTCTTTATCAGCCGTGATGTTAAATTTTGTGAAGATGATTTTCCTTTTTCATCAGCT
TCACAAACTTCGACATTAGCTCCTTCGACTCCTGTTGTACCACTTCATGATCCATCCTACTCAAACATCCATCCTCCACCTTCTATTCCTTCACCTCCTATTCCTCCTTC
ACCTCCTATCCCTTTACCTACTACTCCGTCCTCTCCTCCACCTTCTCCAGATTCGCCCACTAATTCCAATCCTATCCCACCTGATACATCAGCTCCACTTCGACGTTCTA
CTCGTACTAAACAGCCTCCAGCTTGGCATAAGGATTATGAGATGTCTTCTGGAGCCAATCATTTAACCTCTAGCTCAAGTCCCGGCACTGGCACCAGGTATCCCCTTCAT
CATTACCTTTCATTCTCTCATTTTTCTCCTACTCAACGTGCTTTTCTAGCTCTTATTACATCCCAGACAGAACCTAAAACCTATGACGAGGCAGTTGGCGACCCGTTATG
GCAGCAGGCTATGAATGATGAAATTGCAGCTTTGGAACGTAATCATACATGGTCTCTCGTTCCTCTACCACCTGGTCATAAAGCTATTGGTTGTCGTTGGGTGTACAAAA
TTAAATACAACTCTGATGGTTCTGTTGAATGTTATAAAGCTCGACTTGTAGCAAAGGGATACACTCAAGTTGAAGGTATTGATTACACAGAAACATTTTCCCCTACAGCG
AAACTTACTACACTTCGTTGCTTATTCACTGTTGCTGCTGCTCGGAAATGGTTCACCCATCAGTTGGATGTTCAAAATGCCTTTCTCCATGGTAATCTAGACGAGGAAGT
TTATATGTCTTTACCGCCAGGTCTTCGCCGACAGGGGGAGAATACAGTATGTCGGCTCCATAAATCTCTTTATGGATTAAAACAGGCTTCTCGCAATTGGTTCTCCATAT
TTTCTACAACTATACAAAATGCAGGCTACACTCAGTCCAAAGCAGATTACTCTTTGTTTACCAAGAGTAAAGTGATTGCTTGGCAATGTTGTGTGAAAGTGGGTGCCACT
TTTGGTGATGTCACCGGCGTTATGAATTTGGATCATCAATTTTCATTTCGTGCTCTTGGATCACATGAAACACAAGCCCCAACAGTTGCACCAATTGGAAGTAGTACAGA
AGTCATAGTGGATGCACCGTTTGGAAGCATTGCTCAAACTCCGGAGGCTCTAACCCCGGAGGGTGCATCGGTTGGAGGCACTGCTCTAACCCCGGAGGGTGCATCGGTTG
GAGGCACTGCTCTAACCCCAGAGGGTGCATCGGTTGGAGGCACTGCTCTAACCCCAGAGGGTGCATCGGTTGGAGGCACTGCTCTAACCCCTGAAGGTGCATCGGTTGGA
GGCACTGCACTAACCCCAACCATTACACCAATTGGAAGTAGCACACAAGCCTGA
Protein sequenceShow/hide protein sequence
MIISWLTHSVEADIAKGIIHAKTAHQVWVDLHDQFSQKNAPAIFQIQNSIATMSQGTMALSTYFTKLKALWDELEAYRTPFTCNQCQIHIDQREEDKLMQLLMGLNQSYK
TVRSNILMMSPLPNVRQAYSLLVQEEMQRQDLAIGKMIGSGKQFGGLYHISSSPIKSSAHQVSQSSDLWHLRLGYPCGHKGYKLYHMQSHKFFISRDVKFCEDDFPFSSA
SQTSTLAPSTPVVPLHDPSYSNIHPPPSIPSPPIPPSPPIPLPTTPSSPPPSPDSPTNSNPIPPDTSAPLRRSTRTKQPPAWHKDYEMSSGANHLTSSSSPGTGTRYPLH
HYLSFSHFSPTQRAFLALITSQTEPKTYDEAVGDPLWQQAMNDEIAALERNHTWSLVPLPPGHKAIGCRWVYKIKYNSDGSVECYKARLVAKGYTQVEGIDYTETFSPTA
KLTTLRCLFTVAAARKWFTHQLDVQNAFLHGNLDEEVYMSLPPGLRRQGENTVCRLHKSLYGLKQASRNWFSIFSTTIQNAGYTQSKADYSLFTKSKVIAWQCCVKVGAT
FGDVTGVMNLDHQFSFRALGSHETQAPTVAPIGSSTEVIVDAPFGSIAQTPEALTPEGASVGGTALTPEGASVGGTALTPEGASVGGTALTPEGASVGGTALTPEGASVG
GTALTPTITPIGSSTQA