CuGenDBv2

Clc11G12810 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G12810
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase
Genome location: ClcChr11:22197771..22202483
RNA-Seq Expression: Clc11G12810
Synteny: Clc11G12810
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR041588 - Integrase zinc-binding domain
IPR039417 - Papain-like cysteine endopeptidase
IPR038765 - Papain-like cysteine peptidase superfamily
IPR036397 - Ribonuclease H superfamily
IPR025660 - Cysteine peptidase, histidine active site
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
IPR000668 - Peptidase C1A, papain C-terminal
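
The genome location field above packs a chromosome name and a coordinate range into one string. As a minimal sketch, the string can be split into its parts and the span length computed. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(loc: str):
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumption: start and end are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


print(parse_location("ClcChr11:22197771..22202483"))
```

Under the inclusive-coordinate assumption the gene spans 4713 bases on ClcChr11.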


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035480.1 pol protein [Cucumis melo var. makuwa] (e value: 1.0e-198; % identity: 67.63)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL  DLER  I V++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------

Query:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
          + +FS+SSD GL +  RLC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVS+C VCQQVKA RQK AGLL PL+V EWK
Subjt:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------------GDKVFLKVAPMKGVLRF
        GQTERLNQ+LEDMLRACAL+FPGSWD HLHLMEFAYNNSYQATIGMAPFEALY                              GDKVFLKVAPM+GVLRF
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------------GDKVFLKVAPMKGVLRF

Query:  GKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQ
         ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY E+PV+VLAREVK+LRN++I LVKVLW+NH+
Subjt:  GKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQ

Query:  CEEATWEREDEMRSKHPEF
         EEATWERED+MR    EF
Subjt:  CEEATWEREDEMRSKHPEF

KAA0040695.1 pol protein [Cucumis melo var. makuwa] (e value: 1.9e-200; % identity: 67.77)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +DLER  IAV++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------

Query:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
          + +FS+SSD GL +   LC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVS+C VCQQVKA RQK AGLL PL++ EWK
Subjt:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTK AHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVS++D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------GDKVFLKVAPMKGVLRFGKKGKL
        GQTERLNQ+LEDMLRACAL+FPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY                        GDKVFLKVAPM+GVLRF ++GKL
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------GDKVFLKVAPMKGVLRFGKKGKL

Query:  SLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATW
        S +F+GPFEILE++GPVAY L LPPS STVH+V HVSMLR+Y+ DPSHV+D+EPL+++++LSY E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATW
Subjt:  SLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATW

Query:  EREDEMRSKHPEFFQ
        ERED+MRS++PE F+
Subjt:  EREDEMRSKHPEFFQ

KAA0043669.1 pol protein [Cucumis melo var. makuwa] (e value: 1.4e-198; % identity: 64.18)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +D+ER  I V++G VT+                                  
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------

Query:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
            +FS+SSD GL +  RLC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVSKC VCQQVKA RQK AGLL PL++ EWK
Subjt:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------
        GQTERLNQ+LEDMLRACAL+FP SWDSHLHLMEFAYNNSYQATIGMAPFEALY                                               
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------

Query:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE
                     GDKVFLKVAPMKGVLRF ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY 
Subjt:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE

Query:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFF
        E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATWERED+MRS++PE F
Subjt:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFF

KAA0051357.1 pol protein [Cucumis melo var. makuwa] (e value: 7.2e-200; % identity: 64.31)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +DLER  IAV++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------

Query:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
            +FS+SSD GLS+ GRLC P+ S VK ELL EAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVSKC VCQQVK  RQK AGLL PL++ EWK
Subjt:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQ D
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG----------------------------------------------
        GQTERLNQ+LEDMLRACAL+FPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG                                              
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG----------------------------------------------

Query:  --------------DKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE
                      DKVFLKVAPMKGVLRF ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY 
Subjt:  --------------DKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE

Query:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ
        E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATWERED+MRS++PE F++
Subjt:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ

KAA0051368.1 pol protein [Cucumis melo var. makuwa] (e value: 4.2e-200; % identity: 64.49)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +DLER  IAV++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------

Query:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
          + +FS+SSD GL +  RLC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVSKC VCQQVKA RQK AGLL PL++ EWK
Subjt:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------
        GQTERLNQ+LEDMLRACAL+FPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY                                               
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------

Query:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE
                     GDKVFLKVAPMKGVLRF ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY 
Subjt:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE

Query:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ
        E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATWERED+MRS++PE F++
Subjt:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SWR6 Reverse transcriptase (e value: 5.1e-199; % identity: 67.63)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL  DLER  I V++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------

Query:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
          + +FS+SSD GL +  RLC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVS+C VCQQVKA RQK AGLL PL+V EWK
Subjt:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------------GDKVFLKVAPMKGVLRF
        GQTERLNQ+LEDMLRACAL+FPGSWD HLHLMEFAYNNSYQATIGMAPFEALY                              GDKVFLKVAPM+GVLRF
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------------GDKVFLKVAPMKGVLRF

Query:  GKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQ
         ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY E+PV+VLAREVK+LRN++I LVKVLW+NH+
Subjt:  GKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQ

Query:  CEEATWEREDEMRSKHPEF
         EEATWERED+MR    EF
Subjt:  CEEATWEREDEMRSKHPEF

A0A5A7TGX4 Reverse transcriptase (e value: 9.2e-201; % identity: 67.77)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +DLER  IAV++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------

Query:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
          + +FS+SSD GL +   LC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVS+C VCQQVKA RQK AGLL PL++ EWK
Subjt:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTK AHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVS++D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------GDKVFLKVAPMKGVLRFGKKGKL
        GQTERLNQ+LEDMLRACAL+FPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY                        GDKVFLKVAPM+GVLRF ++GKL
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY------------------------GDKVFLKVAPMKGVLRFGKKGKL

Query:  SLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATW
        S +F+GPFEILE++GPVAY L LPPS STVH+V HVSMLR+Y+ DPSHV+D+EPL+++++LSY E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATW
Subjt:  SLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATW

Query:  EREDEMRSKHPEFFQ
        ERED+MRS++PE F+
Subjt:  EREDEMRSKHPEFFQ

A0A5A7TR61 Pol protein (e value: 6.6e-199; % identity: 64.18)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +D+ER  I V++G VT+                                  
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------

Query:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
            +FS+SSD GL +  RLC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVSKC VCQQVKA RQK AGLL PL++ EWK
Subjt:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------
        GQTERLNQ+LEDMLRACAL+FP SWDSHLHLMEFAYNNSYQATIGMAPFEALY                                               
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------

Query:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE
                     GDKVFLKVAPMKGVLRF ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY 
Subjt:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE

Query:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFF
        E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATWERED+MRS++PE F
Subjt:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFF

A0A5A7U7V9 Reverse transcriptase (e value: 2.1e-200; % identity: 64.49)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +DLER  IAV++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVT-----------------------------------

Query:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
          + +FS+SSD GL +  RLC P+ S VK ELLSEAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVSKC VCQQVKA RQK AGLL PL++ EWK
Subjt:  --SHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQTD
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------
        GQTERLNQ+LEDMLRACAL+FPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY                                               
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------

Query:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE
                     GDKVFLKVAPMKGVLRF ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY 
Subjt:  -------------GDKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE

Query:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ
        E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATWERED+MRS++PE F++
Subjt:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ

A0A5A7UAA8 Reverse transcriptase (e value: 3.5e-200; % identity: 64.31)
Query:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------
        MRQRRWLELVKDYD +ILYHPGKANVVADALSRK  HSAALIT Q PL +DLER  IAV++G VT                                   
Subjt:  MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSH---------------------------------

Query:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK
            +FS+SSD GLS+ GRLC P+ S VK ELL EAH+SPFS+HPGSTKMYQDLKR YWW NMKREVAEFVSKC VCQQVK  RQK AGLL PL++ EWK
Subjt:  ----KFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWK

Query:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD
        WENVSMDFI GLP+T +G TVIWV+VDRLTKSAHF+PGKST++ SKWAQ+Y+ E+VRLHGVPVSIVSD+D RF S FWK L+TA+GTRLDFST FHPQ D
Subjt:  WENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTD

Query:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG----------------------------------------------
        GQTERLNQ+LEDMLRACAL+FPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG                                              
Subjt:  GQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG----------------------------------------------

Query:  --------------DKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE
                      DKVFLKVAPMKGVLRF ++GKLS +F+GPFEILE++GPVAY LALPPS STVH+VFHVSMLR+Y+ DPSHV+D+EPL+++++LSY 
Subjt:  --------------DKVFLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYE

Query:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ
        E+PV+VLAREVK LRN++I LVKVLW+NH+ EEATWERED+MRS++PE F++
Subjt:  EKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWEREDEMRSKHPEFFQQ

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 2.4e-44; % identity: 26.41)
Query:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY
        R  RW   ++D++ +I Y PG AN +ADALSR    +  +                ITD       T    D + + +     +       +     ++ 
Subjt:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY

Query:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA
          ++  P  + +   ++ + H     IHPG   +   + R + W  +++++ E+V  C  CQ  K+   K  G L P+   E  WE++SMDFI  LP+++
Subjt:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA

Query:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA
         G+  ++V+VDR +K A  +P   + +  + A+++ + V+   G P  I++D D  F S  WK         + FS  + PQTDGQTER NQ +E +LR 
Subjt:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA

Query:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV
             P +W  H+ L++ +YNN+  +   M PFE ++                                                           GD V
Subjt:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV

Query:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY
         +K     G L   K  KL+  F GPF +L++ GP  Y L LP S   +  + FHVS L +Y
Subjt:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY

P0CT35 Transposon Tf2-2 polyprotein (e value: 2.4e-44; % identity: 26.41)
Query:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY
        R  RW   ++D++ +I Y PG AN +ADALSR    +  +                ITD       T    D + + +     +       +     ++ 
Subjt:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY

Query:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA
          ++  P  + +   ++ + H     IHPG   +   + R + W  +++++ E+V  C  CQ  K+   K  G L P+   E  WE++SMDFI  LP+++
Subjt:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA

Query:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA
         G+  ++V+VDR +K A  +P   + +  + A+++ + V+   G P  I++D D  F S  WK         + FS  + PQTDGQTER NQ +E +LR 
Subjt:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA

Query:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV
             P +W  H+ L++ +YNN+  +   M PFE ++                                                           GD V
Subjt:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV

Query:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY
         +K     G L   K  KL+  F GPF +L++ GP  Y L LP S   +  + FHVS L +Y
Subjt:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY

P0CT36 Transposon Tf2-3 polyprotein (e value: 2.4e-44; % identity: 26.41)
Query:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY
        R  RW   ++D++ +I Y PG AN +ADALSR    +  +                ITD       T    D + + +     +       +     ++ 
Subjt:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY

Query:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA
          ++  P  + +   ++ + H     IHPG   +   + R + W  +++++ E+V  C  CQ  K+   K  G L P+   E  WE++SMDFI  LP+++
Subjt:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA

Query:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA
         G+  ++V+VDR +K A  +P   + +  + A+++ + V+   G P  I++D D  F S  WK         + FS  + PQTDGQTER NQ +E +LR 
Subjt:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA

Query:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV
             P +W  H+ L++ +YNN+  +   M PFE ++                                                           GD V
Subjt:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV

Query:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY
         +K     G L   K  KL+  F GPF +L++ GP  Y L LP S   +  + FHVS L +Y
Subjt:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY

P0CT41 Transposon Tf2-12 polyprotein (e value: 2.4e-44; % identity: 26.41)
Query:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY
        R  RW   ++D++ +I Y PG AN +ADALSR    +  +                ITD       T    D + + +     +       +     ++ 
Subjt:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY

Query:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA
          ++  P  + +   ++ + H     IHPG   +   + R + W  +++++ E+V  C  CQ  K+   K  G L P+   E  WE++SMDFI  LP+++
Subjt:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA

Query:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA
         G+  ++V+VDR +K A  +P   + +  + A+++ + V+   G P  I++D D  F S  WK         + FS  + PQTDGQTER NQ +E +LR 
Subjt:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA

Query:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV
             P +W  H+ L++ +YNN+  +   M PFE ++                                                           GD V
Subjt:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV

Query:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY
         +K     G L   K  KL+  F GPF +L++ GP  Y L LP S   +  + FHVS L +Y
Subjt:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY

Q9UR07 Transposon Tf2-11 polyprotein (e value: 2.4e-44; % identity: 26.41)
Query:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY
        R  RW   ++D++ +I Y PG AN +ADALSR    +  +                ITD       T    D + + +     +       +     ++ 
Subjt:  RQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAAL----------------ITDQ------TPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSY

Query:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA
          ++  P  + +   ++ + H     IHPG   +   + R + W  +++++ E+V  C  CQ  K+   K  G L P+   E  WE++SMDFI  LP+++
Subjt:  HGRLCFPAGSVVKDELLSEAHNSPFSIHPGSTKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTA

Query:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA
         G+  ++V+VDR +K A  +P   + +  + A+++ + V+   G P  I++D D  F S  WK         + FS  + PQTDGQTER NQ +E +LR 
Subjt:  KGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVRLHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRA

Query:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV
             P +W  H+ L++ +YNN+  +   M PFE ++                                                           GD V
Subjt:  CALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY-----------------------------------------------------------GDKV

Query:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY
         +K     G L   K  KL+  F GPF +L++ GP  Y L LP S   +  + FHVS L +Y
Subjt:  FLKVAPMKGVLRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTV-HNVFHVSMLRRY

Arabidopsis top hits (e value, % identity, alignment)
AT3G48340.1 Cysteine proteinases superfamily protein (e value: 3.6e-32; % identity: 62.18)
Query:  IVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINMKR-
        +V IDG+E VPEN ENAL++ VANQPVSVAIDA   DFQFY   V +       S GTELNHGV A+GYG +E G  YWI+RNSWG  W E  YI ++R 
Subjt:  IVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINMKR-

Query:  --SPEGLCGIAMEASYPIK
           PEG CGIAMEASYPIK
Subjt:  --SPEGLCGIAMEASYPIK

AT3G48350.1 Cysteine proteinases superfamily protein (e value: 1.7e-29; % identity: 55.28)
Query:  ISSPIVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYIN
        I    V IDG+E VPEN E  L++ VA+QPVSVAIDA   DFQ Y   V       I   GT+LNHGVV +GYG T++GT YWI+RNSWG  W E  Y+ 
Subjt:  ISSPIVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYIN

Query:  MKR---SPEGLCGIAMEASYPIK
        ++R     EG CGIAMEASYP K
Subjt:  MKR---SPEGLCGIAMEASYPIK

AT4G36880.1 cysteine proteinase1 (e value: 7.1e-28; % identity: 52.42)
Query:  SSPIVKIDGYESVP-ENENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINM
        +S +V IDGYE VP ++E AL + ++ QPVSVAI+A GR FQ Y   + +       S GT L+H VVA+GYG +E+G DYWI+RNSWG  W E+ YI M
Subjt:  SSPIVKIDGYESVP-ENENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINM

Query:  KR----SPEGLCGIAMEASYPIKF
        +R    S  G CGIA+EASYP+K+
Subjt:  KR----SPEGLCGIAMEASYPIKF

AT5G43060.1 Granulin repeat cysteine protease family protein (e value: 9.9e-30; % identity: 57.38)
Query:  SSPIVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINM
        ++ +V ID YE VPEN E +L + +A+QP+SVAI+A GR FQ Y   V   L       GTEL+HGVVA+GYG TE+G DYWI+RNSWG  W E  YI M
Subjt:  SSPIVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINM

Query:  KR---SPEGLCGIAMEASYPIK
         R   +P G CGIAMEASYPIK
Subjt:  KR---SPEGLCGIAMEASYPIK

AT5G50260.1 Cysteine proteinases superfamily protein (e value: 3.0e-34; % identity: 60.66)
Query:  SSPIVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINM
        ++P+V IDG+E VP+N E+ LM+ VANQPVSVAIDA G DFQFY   V +         GTELNHGV  +GYGTT DGT YWI++NSWG  W E  YI M
Subjt:  SSPIVKIDGYESVPEN-ENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTDYWILRNSWGIGWREDDYINM

Query:  KRS---PEGLCGIAMEASYPIK
        +R     EGLCGIAMEASYP+K
Subjt:  KRS---PEGLCGIAMEASYPIK
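
The % identity reported for each hit above can be related to the alignment rows. The sketch below assumes the usual BLAST-style layout, in which the unlabeled middle row shows a letter where the two residues are identical, "+" for a conservative substitution, and a space otherwise. It also assumes identity is counted over all alignment columns, gaps included; the page does not say how its figures were computed. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of alignment columns marked identical in the midline.

    query_row gives the alignment length (gap characters included);
    midline is the unlabeled row between the Query and Subjt rows.
    Trailing spaces that were stripped from the midline are restored.
    """
    length = len(query_row)
    if length == 0:
        raise ValueError("empty alignment row")
    padded = midline.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    return round(100.0 * identical / length, 2)


# Final segment of the first GenBank hit (KAA0035480.1).
print(percent_identity("CEEATWEREDEMRSKHPEF", " EEATWERED+MR    EF"))
```

This value covers one 19-column segment only. A figure comparable to the reported per-hit % identity would need the rows of every segment of that hit concatenated first.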


Sequences
CDS sequence
ATGAGACAGCGCAGGTGGTTGGAGTTAGTAAAGGATTATGATGTAAAAATCCTGTACCATCCCGGAAAAGCAAATGTAGTAGCAGATGCCTTAAGCAGAAAGGCAGTTCA
TTCAGCAGCTCTGATTACTGACCAGACCCCACTATGCAAGGACCTTGAGCGAGTAGGCATTGCAGTGGCGCTAGGAGAAGTCACTTCCCATAAGTTCTCGGTATCCTCTG
ATAGTGGCCTCTCATACCACGGACGCTTGTGTTTTCCAGCAGGTAGTGTGGTTAAAGATGAACTGTTGTCGGAAGCTCATAACTCCCCATTTTCGATACATCCGGGCAGT
ACCAAGATGTATCAGGACCTGAAACGTCATTATTGGTGGCTCAATATGAAGAGGGAAGTGGCTGAATTCGTTAGCAAGTGTCCTGTATGCCAGCAAGTGAAGGCCTCGAG
GCAGAAGCAGGCAGGATTGTTGCATCCCCTGACGGTACTCGAGTGGAAATGGGAGAATGTATCTATGGATTTCATAGTAGGTTTGCCTAAAACGGCGAAAGGTCACACTG
TGATATGGGTCATTGTAGACAGACTCACCAAGTCAGCACATTTTATCCCAGGAAAGTCCACCTTTTCAGTAAGTAAGTGGGCACAAATTTATATAAAGGAAGTGGTAAGG
TTACACGGAGTACCAGTTTCTATAGTGTCTGACAAAGACCCTCGCTTCATGTCTAACTTTTGGAAGAGCCTCCGGACTGCGTTGGGCACTCGGTTGGATTTTAGTACGAC
TTTTCACCCCCAGACGGATGGACAAACGGAACGTTTGAATCAGATACTAGAGGACATGTTGCGTGCTTGCGCCCTAGACTTTCCAGGAAGTTGGGACTCCCACTTGCACC
TAATGGAGTTTGCCTACAATAACAGTTATCAAGCTACCATTGGCATGGCGCCATTCGAAGCCTTATATGGTGATAAGGTATTCTTGAAAGTGGCACCTATGAAGGGTGTC
TTGAGGTTCGGGAAAAAGGGGAAGTTAAGCCTACAGTTCATTGGACCGTTTGAAATATTGGAGCAAGTTGGCCCCGTGGCCTACTGTTTAGCCTTGCCACCGTCGTTCTC
CACAGTTCACAACGTGTTCCATGTCTCCATGCTTCGAAGGTATATGGTAGATCCATCCCACGTGATCGATTTTGAGCCTTTACAGTTGAACAAAGATTTAAGTTATGAGG
AAAAGCCGGTGCAAGTTCTTGCAAGGGAAGTGAAGGTTTTGCGCAACCGGAAAATAGCACTGGTGAAAGTCCTCTGGCAGAACCACCAGTGTGAGGAGGCAACATGGGAA
CGAGAGGATGAAATGAGGTCAAAGCATCCGGAGTTTTTCCAACAAAAAAAAAAGAGAACTCTCTCCCTCTTCTCTCTCCCTCTCTGCGTCACTCTTTCCCCCTTCCTCTC
TCGATTTTTCTTCGCCGCCGCCCTTCCTTCTTTTTTTCCGTTTTATCTCTCCACTGCCACACTCTCTTCTACCTCTGCTTCCGCCGTCTCTCGCTCTCAGTCGCCGCTCG
CTTTCCCGGAAGTTCACGCTCTCTCTCGCCCAGGATCACGCTCTCTCGCTCAGGCTCTCGCTCTCTCGCTCTCCCACTCTCTCGCTCGGACTCAAGCTCTCCCGCTTTCC
AGCTCTCCCGCTCTCTCGCTCAGAATCTCGTTCTCGGCTGCGATTAACTGCTTGCCTGATCGTCAGTCGCCGGATCAGTATTCCCTAAATGAGTATCTCAAACCTGAGAT
CGGAGGAAATTGCGAGTTTAGCCACGTTTGGTTTGTTCAGGAATTTTGTGAGAGTTGCGGTGTGGTTAGCCTTATTGATCAATCATCAAGTGTCCATAGGAGGTTGTGTT
CCTTCGGGTTCACTGGAGGTTGTGTTCCTTCGAGTCCACTGGAGGTTGTGTTCCTTCAGGTCCACTGGAGGATGTGTTCATTCGGGTTCACTGGAGGATGTGTTCCTTCG
GGTTCACTAGAGGTTGTGTTTCTTCGGGTTCACCAGAGGTTGTGTTTCTTCGGGTACACCAAAGGTTGTGTTCCTTCGGGATCACCAGAGATATCTTCACCAATAGTTAA
AATTGATGGATATGAAAGCGTACCTGAAAACGAGAATGCTCTGATGCAAGTCGTCGCAAACCAACCAGTCTCAGTCGCCATTGACGCCGCGGGAAGAGATTTCCAATTTT
ACTGGCGGCAAGTTCCTAGTTCACTCGAGCAATCCATCGATTCACTCGGAACAGAGCTTAATCATGGAGTGGTGGCGATTGGGTATGGAACAACCGAAGACGGAACAGAT
TATTGGATCTTGAGGAATTCATGGGGAATTGGATGGAGAGAGGACGATTATATAAATATGAAGCGATCGCCCGAAGGTCTCTGTGGAATAGCCATGGAAGCTTCTTATCC
CATCAAGTTCTAG
mRNA sequence
ATGAGACAGCGCAGGTGGTTGGAGTTAGTAAAGGATTATGATGTAAAAATCCTGTACCATCCCGGAAAAGCAAATGTAGTAGCAGATGCCTTAAGCAGAAAGGCAGTTCA
TTCAGCAGCTCTGATTACTGACCAGACCCCACTATGCAAGGACCTTGAGCGAGTAGGCATTGCAGTGGCGCTAGGAGAAGTCACTTCCCATAAGTTCTCGGTATCCTCTG
ATAGTGGCCTCTCATACCACGGACGCTTGTGTTTTCCAGCAGGTAGTGTGGTTAAAGATGAACTGTTGTCGGAAGCTCATAACTCCCCATTTTCGATACATCCGGGCAGT
ACCAAGATGTATCAGGACCTGAAACGTCATTATTGGTGGCTCAATATGAAGAGGGAAGTGGCTGAATTCGTTAGCAAGTGTCCTGTATGCCAGCAAGTGAAGGCCTCGAG
GCAGAAGCAGGCAGGATTGTTGCATCCCCTGACGGTACTCGAGTGGAAATGGGAGAATGTATCTATGGATTTCATAGTAGGTTTGCCTAAAACGGCGAAAGGTCACACTG
TGATATGGGTCATTGTAGACAGACTCACCAAGTCAGCACATTTTATCCCAGGAAAGTCCACCTTTTCAGTAAGTAAGTGGGCACAAATTTATATAAAGGAAGTGGTAAGG
TTACACGGAGTACCAGTTTCTATAGTGTCTGACAAAGACCCTCGCTTCATGTCTAACTTTTGGAAGAGCCTCCGGACTGCGTTGGGCACTCGGTTGGATTTTAGTACGAC
TTTTCACCCCCAGACGGATGGACAAACGGAACGTTTGAATCAGATACTAGAGGACATGTTGCGTGCTTGCGCCCTAGACTTTCCAGGAAGTTGGGACTCCCACTTGCACC
TAATGGAGTTTGCCTACAATAACAGTTATCAAGCTACCATTGGCATGGCGCCATTCGAAGCCTTATATGGTGATAAGGTATTCTTGAAAGTGGCACCTATGAAGGGTGTC
TTGAGGTTCGGGAAAAAGGGGAAGTTAAGCCTACAGTTCATTGGACCGTTTGAAATATTGGAGCAAGTTGGCCCCGTGGCCTACTGTTTAGCCTTGCCACCGTCGTTCTC
CACAGTTCACAACGTGTTCCATGTCTCCATGCTTCGAAGGTATATGGTAGATCCATCCCACGTGATCGATTTTGAGCCTTTACAGTTGAACAAAGATTTAAGTTATGAGG
AAAAGCCGGTGCAAGTTCTTGCAAGGGAAGTGAAGGTTTTGCGCAACCGGAAAATAGCACTGGTGAAAGTCCTCTGGCAGAACCACCAGTGTGAGGAGGCAACATGGGAA
CGAGAGGATGAAATGAGGTCAAAGCATCCGGAGTTTTTCCAACAAAAAAAAAAGAGAACTCTCTCCCTCTTCTCTCTCCCTCTCTGCGTCACTCTTTCCCCCTTCCTCTC
TCGATTTTTCTTCGCCGCCGCCCTTCCTTCTTTTTTTCCGTTTTATCTCTCCACTGCCACACTCTCTTCTACCTCTGCTTCCGCCGTCTCTCGCTCTCAGTCGCCGCTCG
CTTTCCCGGAAGTTCACGCTCTCTCTCGCCCAGGATCACGCTCTCTCGCTCAGGCTCTCGCTCTCTCGCTCTCCCACTCTCTCGCTCGGACTCAAGCTCTCCCGCTTTCC
AGCTCTCCCGCTCTCTCGCTCAGAATCTCGTTCTCGGCTGCGATTAACTGCTTGCCTGATCGTCAGTCGCCGGATCAGTATTCCCTAAATGAGTATCTCAAACCTGAGAT
CGGAGGAAATTGCGAGTTTAGCCACGTTTGGTTTGTTCAGGAATTTTGTGAGAGTTGCGGTGTGGTTAGCCTTATTGATCAATCATCAAGTGTCCATAGGAGGTTGTGTT
CCTTCGGGTTCACTGGAGGTTGTGTTCCTTCGAGTCCACTGGAGGTTGTGTTCCTTCAGGTCCACTGGAGGATGTGTTCATTCGGGTTCACTGGAGGATGTGTTCCTTCG
GGTTCACTAGAGGTTGTGTTTCTTCGGGTTCACCAGAGGTTGTGTTTCTTCGGGTACACCAAAGGTTGTGTTCCTTCGGGATCACCAGAGATATCTTCACCAATAGTTAA
AATTGATGGATATGAAAGCGTACCTGAAAACGAGAATGCTCTGATGCAAGTCGTCGCAAACCAACCAGTCTCAGTCGCCATTGACGCCGCGGGAAGAGATTTCCAATTTT
ACTGGCGGCAAGTTCCTAGTTCACTCGAGCAATCCATCGATTCACTCGGAACAGAGCTTAATCATGGAGTGGTGGCGATTGGGTATGGAACAACCGAAGACGGAACAGAT
TATTGGATCTTGAGGAATTCATGGGGAATTGGATGGAGAGAGGACGATTATATAAATATGAAGCGATCGCCCGAAGGTCTCTGTGGAATAGCCATGGAAGCTTCTTATCC
CATCAAGTTCTAG
Protein sequence
MRQRRWLELVKDYDVKILYHPGKANVVADALSRKAVHSAALITDQTPLCKDLERVGIAVALGEVTSHKFSVSSDSGLSYHGRLCFPAGSVVKDELLSEAHNSPFSIHPGS
TKMYQDLKRHYWWLNMKREVAEFVSKCPVCQQVKASRQKQAGLLHPLTVLEWKWENVSMDFIVGLPKTAKGHTVIWVIVDRLTKSAHFIPGKSTFSVSKWAQIYIKEVVR
LHGVPVSIVSDKDPRFMSNFWKSLRTALGTRLDFSTTFHPQTDGQTERLNQILEDMLRACALDFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGDKVFLKVAPMKGV
LRFGKKGKLSLQFIGPFEILEQVGPVAYCLALPPSFSTVHNVFHVSMLRRYMVDPSHVIDFEPLQLNKDLSYEEKPVQVLAREVKVLRNRKIALVKVLWQNHQCEEATWE
REDEMRSKHPEFFQQKKKRTLSLFSLPLCVTLSPFLSRFFFAAALPSFFPFYLSTATLSSTSASAVSRSQSPLAFPEVHALSRPGSRSLAQALALSLSHSLARTQALPLS
SSPALSLRISFSAAINCLPDRQSPDQYSLNEYLKPEIGGNCEFSHVWFVQEFCESCGVVSLIDQSSSVHRRLCSFGFTGGCVPSSPLEVVFLQVHWRMCSFGFTGGCVPS
GSLEVVFLRVHQRLCFFGYTKGCVPSGSPEISSPIVKIDGYESVPENENALMQVVANQPVSVAIDAAGRDFQFYWRQVPSSLEQSIDSLGTELNHGVVAIGYGTTEDGTD
YWILRNSWGIGWREDDYINMKRSPEGLCGIAMEASYPIKF