; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018822 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018822
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationChr04:9201795..9225221
RNA-Seq ExpressionHG10018822
SyntenyHG10018822
Gene Ontology termsGO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455144.1 PREDICTED: probable ubiquitin-like-specific protease 2A isoform X1 [Cucumis melo]1.7e-13762.72Show/hide
Query:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS
        MTRTS+S   SS  R GR +G+GGGKRFSVFDFSEED RVEKVSRSLLGKFSARRSSPVT+HQFL CF KGAKSVSRNLSDELI IDA VGK AN+DSFS
Subjt:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS

Query:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL
        ED+SYELIH DSE                          D SLEGGGS KQEI ETNDLL SRSSTNEDD TV+FPDFVIYEGNWCTTSKLIFSCSCI+ 
Subjt:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL

Query:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDIS-----------GIELLKFSVCDRLWSESEKAIRSLNLRYNDLWN
        QGSA+SGLQRTFD+EWAVSDIIGIESEWC RVETAIVNLRLKGKHFT A NSNDIS           GIELLKFSVCD LWSESEKAIR+LN+RYNDLWN
Subjt:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDIS-----------GIELLKFSVCDRLWSESEKAIRSLNLRYNDLWN

Query:  ADY---------------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV--------------------------
        ADY                           EFVDTFEEVIYPKGDPDAVTISKRDLELLKPG FINDTIIDFYV                          
Subjt:  ADY---------------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV--------------------------

Query:  -----------------------------------------NLHWSLVVICHPGDV
                                                 +LHWSLVVICHPG+V
Subjt:  -----------------------------------------NLHWSLVVICHPGDV

XP_008455146.1 PREDICTED: probable ubiquitin-like-specific protease 2A isoform X2 [Cucumis melo]4.7e-14064.27Show/hide
Query:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS
        MTRTS+S   SS  R GR +G+GGGKRFSVFDFSEED RVEKVSRSLLGKFSARRSSPVT+HQFL CF KGAKSVSRNLSDELI IDA VGK AN+DSFS
Subjt:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS

Query:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL
        ED+SYELIH DSE                          D SLEGGGS KQEI ETNDLL SRSSTNEDD TV+FPDFVIYEGNWCTTSKLIFSCSCI+ 
Subjt:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL

Query:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY--------
        QGSA+SGLQRTFD+EWAVSDIIGIESEWC RVETAIVNLRLKGKHFT A NSNDISGIELLKFSVCD LWSESEKAIR+LN+RYNDLWNADY        
Subjt:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY--------

Query:  -------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-------------------------------------
                           EFVDTFEEVIYPKGDPDAVTISKRDLELLKPG FINDTIIDFYV                                     
Subjt:  -------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-------------------------------------

Query:  ------------------------------NLHWSLVVICHPGDV
                                      +LHWSLVVICHPG+V
Subjt:  ------------------------------NLHWSLVVICHPGDV

XP_038888439.1 probable ubiquitin-like-specific protease 2A isoform X1 [Benincasa hispida]7.8e-14365.53Show/hide
Query:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED
        MTRTS  SSS SRKGR +GDGGGKRFSVFDFSEED+RVEKVSRSLLGKFSAR+SSPV +HQFLQCFAKGA+SVSRNLSD  I IDA VGKSAN+DSFSED
Subjt:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED

Query:  VSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQG
        VS ELIHNDSE                          D SLEG GSTKQEIFETND LLS SSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSC+CI+LQG
Subjt:  VSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQG

Query:  SAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------
        SAVSGLQRTFD+EWAVSD++GIESEWC RVETAIVNLRLKGKH TRAANSNDISGIELLKFSVCD LWSE EKAIR+LNLRYNDLWNADY          
Subjt:  SAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------

Query:  ---------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-----------------------------------------
                       EFVDTFEEVIYP+GDPDAVTISKRDLELLKP TFINDTIIDFYV                                         
Subjt:  ---------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-----------------------------------------

Query:  --------------------------NLHWSLVVICHPGDV
                                  +LHWSLVVICHPG+V
Subjt:  --------------------------NLHWSLVVICHPGDV

XP_038888440.1 probable ubiquitin-like-specific protease 2A isoform X2 [Benincasa hispida]4.7e-14865.85Show/hide
Query:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED
        MTRTS  SSS SRKGR +GDGGGKRFSVFDFSEED+RVEKVSRSLLGKFSAR+SSPV +HQFLQCFAKGA+SVSRNLSD  I IDA VGKSAN+DSFSED
Subjt:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED

Query:  VSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQG
        VS ELIHNDSE                          D SLEG GSTKQEIFETND LLS SSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSC+CI+LQG
Subjt:  VSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQG

Query:  SAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------
        SAVSGLQRTFD+EWAVSD++GIESEWC RVETAIVNLRLKGKH TRAANSNDISGIELLKFSVCD LWSE EKAIR+LNLRYNDLWNADY          
Subjt:  SAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------

Query:  ---------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-----------------------------------------
                       EFVDTFEEVIYP+GDPDAVTISKRDLELLKP TFINDTIIDFYV                                         
Subjt:  ---------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-----------------------------------------

Query:  --------------------------NLHWSLVVICHPGDVTKNVITYPRY
                                  +LHWSLVVICHPG+V KNVITYP Y
Subjt:  --------------------------NLHWSLVVICHPGDVTKNVITYPRY

XP_038888442.1 uncharacterized protein LOC120078284 isoform X4 [Benincasa hispida]3.5e-14376.94Show/hide
Query:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED
        MTRTS  SSS SRKGR +GDGGGKRFSVFDFSEED+RVEKVSRSLLGKFSAR+SSPV +HQFLQCFAKGA+SVSRNLSD  I IDA VGKSAN+DSFSED
Subjt:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED

Query:  VSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQG
        VS ELIHNDSE                          D SLEG GSTKQEIFETND LLS SSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSC+CI+LQG
Subjt:  VSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQG

Query:  SAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------
        SAVSGLQRTFD+EWAVSD++GIESEWC RVETAIVNLRLKGKH TRAANSNDISGIELLKFSVCD LWSE EKAIR+LNLRYNDLWNADY          
Subjt:  SAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------

Query:  ---------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN
                       EFVDTFEEVIYP+GDPDAVTISKRDLELLKP TFINDTIIDFYVN
Subjt:  ---------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN

TrEMBL top hitse value%identityAlignment
A0A0A0K633 ULP_PROTEASE domain-containing protein9.0e-13764.53Show/hide
Query:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS
        MTRTS+S    S  RK R  GDGGGKRFSVFDFSEED RVEKVSR LLGKFSARRSSPVT+HQFL CF KGAKSVSRNLSDELI IDA VGKSAN DSFS
Subjt:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS

Query:  EDVSYELIHNDSE------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGL
        EDVS ELIH DSE                  D  LEGGGS KQEI ETND+LLSRSSTNEDDVTV+FPDFVIYEGNWCTTSKLIFSCSCI+ +GSA+SGL
Subjt:  EDVSYELIHNDSE------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGL

Query:  QRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------------
        QRTFD+EWA+SDIIGIESEWC RVETAIVNL LKGKHFTRA NS DISGIELLKFSVCD LWSESEKAIR+LNLRYNDLWNAD+                
Subjt:  QRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY----------------

Query:  -----------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV---------------------------------------------
                   EFVDTFEEVIYP GDPDAVTISKRDLELLKPG FINDTIIDFYV                                             
Subjt:  -----------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV---------------------------------------------

Query:  ----------------------NLHWSLVVICHPGDV
                              +LHWSLVVICHPG+V
Subjt:  ----------------------NLHWSLVVICHPGDV

A0A1S3C0Z2 probable ubiquitin-like-specific protease 2A isoform X18.1e-13862.72Show/hide
Query:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS
        MTRTS+S   SS  R GR +G+GGGKRFSVFDFSEED RVEKVSRSLLGKFSARRSSPVT+HQFL CF KGAKSVSRNLSDELI IDA VGK AN+DSFS
Subjt:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS

Query:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL
        ED+SYELIH DSE                          D SLEGGGS KQEI ETNDLL SRSSTNEDD TV+FPDFVIYEGNWCTTSKLIFSCSCI+ 
Subjt:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL

Query:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDIS-----------GIELLKFSVCDRLWSESEKAIRSLNLRYNDLWN
        QGSA+SGLQRTFD+EWAVSDIIGIESEWC RVETAIVNLRLKGKHFT A NSNDIS           GIELLKFSVCD LWSESEKAIR+LN+RYNDLWN
Subjt:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDIS-----------GIELLKFSVCDRLWSESEKAIRSLNLRYNDLWN

Query:  ADY---------------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV--------------------------
        ADY                           EFVDTFEEVIYPKGDPDAVTISKRDLELLKPG FINDTIIDFYV                          
Subjt:  ADY---------------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV--------------------------

Query:  -----------------------------------------NLHWSLVVICHPGDV
                                                 +LHWSLVVICHPG+V
Subjt:  -----------------------------------------NLHWSLVVICHPGDV

A0A1S3C1G7 probable ubiquitin-like-specific protease 2A isoform X22.3e-14064.27Show/hide
Query:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS
        MTRTS+S   SS  R GR +G+GGGKRFSVFDFSEED RVEKVSRSLLGKFSARRSSPVT+HQFL CF KGAKSVSRNLSDELI IDA VGK AN+DSFS
Subjt:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS

Query:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL
        ED+SYELIH DSE                          D SLEGGGS KQEI ETNDLL SRSSTNEDD TV+FPDFVIYEGNWCTTSKLIFSCSCI+ 
Subjt:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL

Query:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY--------
        QGSA+SGLQRTFD+EWAVSDIIGIESEWC RVETAIVNLRLKGKHFT A NSNDISGIELLKFSVCD LWSESEKAIR+LN+RYNDLWNADY        
Subjt:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY--------

Query:  -------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-------------------------------------
                           EFVDTFEEVIYPKGDPDAVTISKRDLELLKPG FINDTIIDFYV                                     
Subjt:  -------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-------------------------------------

Query:  ------------------------------NLHWSLVVICHPGDV
                                      +LHWSLVVICHPG+V
Subjt:  ------------------------------NLHWSLVVICHPGDV

A0A5A7SPN7 Putative ubiquitin-like-specific protease 2A isoform X22.3e-14064.27Show/hide
Query:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS
        MTRTS+S   SS  R GR +G+GGGKRFSVFDFSEED RVEKVSRSLLGKFSARRSSPVT+HQFL CF KGAKSVSRNLSDELI IDA VGK AN+DSFS
Subjt:  MTRTSNSS--SSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFS

Query:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL
        ED+SYELIH DSE                          D SLEGGGS KQEI ETNDLL SRSSTNEDD TV+FPDFVIYEGNWCTTSKLIFSCSCI+ 
Subjt:  EDVSYELIHNDSE--------------------------DWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRL

Query:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY--------
        QGSA+SGLQRTFD+EWAVSDIIGIESEWC RVETAIVNLRLKGKHFT A NSNDISGIELLKFSVCD LWSESEKAIR+LN+RYNDLWNADY        
Subjt:  QGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADY--------

Query:  -------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-------------------------------------
                           EFVDTFEEVIYPKGDPDAVTISKRDLELLKPG FINDTIIDFYV                                     
Subjt:  -------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYV-------------------------------------

Query:  ------------------------------NLHWSLVVICHPGDV
                                      +LHWSLVVICHPG+V
Subjt:  ------------------------------NLHWSLVVICHPGDV

A0A6J1CK91 probable ubiquitin-like-specific protease 2A isoform X32.5e-10249.68Show/hide
Query:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED
        M R+S  SSS SRKG   G GG KRF VFDFS++D RVE+VS+SLLGKFS   SS VT++QFLQCFAKGA S+  N+S E I IDA V K A++DS  ED
Subjt:  MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSED

Query:  VSYELIHNDSE----------------DWSLEGG------------------------------------------------GSTKQEIFETNDLLLSRS
        VSYE+ H +S+                D   EG                                                 G  ++EIFET+ +LLSR+
Subjt:  VSYELIHNDSE----------------DWSLEGG------------------------------------------------GSTKQEIFETNDLLLSRS

Query:  STNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVS-GLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKF
        + NEDDV V+FPDFVIYEG  CTT+KLIFS SCI+LQGS V+ G QRTFD +WA+SDI+GIESEWC RVETAIVNL LKGK+ TRA N+N+ISGIE +KF
Subjt:  STNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVS-GLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISGIELLKF

Query:  SVCDRLWSESEKAIRSLNLRYNDLWNADY-------------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN-
        S+CD LWSE EKAIRSLNL+YNDLWNA+Y                         EF++ FEEVIYPKGD DAVTI+KRDLELLKPG FINDTIIDFY+N 
Subjt:  SVCDRLWSESEKAIRSLNLRYNDLWNADY-------------------------EFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN-

Query:  ---------------------------------------------------LHWSLVVICHPGDV
                                                           LHWSLVVICHPG+V
Subjt:  ---------------------------------------------------LHWSLVVICHPGDV

SwissProt top hitse value%identityAlignment
Q0WKV8 Probable ubiquitin-like-specific protease 2A2.7e-5332.08Show/hide
Query:  KRFSVFDFSEEDERVEKVSRSLLGKFSA----RRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSEDVSYELIHNDSEDWSLE-GG
        K   VFD+S+ED+RVE+ S+ LL KF +    +    + +++FL+CFAK  +S S+ L   +I ++  V +  +    S D + +LI   S       G 
Subjt:  KRFSVFDFSEEDERVEKVSRSLLGKFSA----RRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSEDVSYELIHNDSEDWSLE-GG

Query:  GSTKQEIFETNDLLLSRSSTN----------EDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAI
         S        ND + +  +TN          E+   ++ PD +IY   +CT SKL FS +C+ ++ S+V+  + TF  +W + DII IES+WC  VETA 
Subjt:  GSTKQEIFETNDLLLSRSSTN----------EDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAI

Query:  VNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLW------------------NADYEFVDTFEEVIYPKGDPDAVTISKRDL
        VN+ LK +       + DISGI+LLKFSV D  WS+  + IRSL+ RY ++W                   +     D+FE+++YP+G+PDAV + K+D+
Subjt:  VNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLW------------------NADYEFVDTFEEVIYPKGDPDAVTISKRDL

Query:  ELLKPGTFINDTIIDFYV-------------------------------------------------------------------NLHWSLVVICHPGDV
        ELLKP  FINDTIIDFY+                                                                   + HWSLV+ICHPG++
Subjt:  ELLKPGTFINDTIIDFYV-------------------------------------------------------------------NLHWSLVVICHPGDV

Query:  TKNVITYPRYRASCIWIL--LKGATEG
          + +  P+ R  CI  L  +KG+ +G
Subjt:  TKNVITYPRYRASCIWIL--LKGATEG

Q8L7S0 Probable ubiquitin-like-specific protease 2B3.5e-2933.11Show/hide
Query:  SEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAID-AVVGKSANSD-SFSEDVSYELIHNDSEDWSLEGGGSTKQEIFETN
        ++ D RVE  S  L G      +S  ++ Q    F+    S S    D + AID ++  +SA S+ S SED        D EDW  E   + +++I    
Subjt:  SEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAID-AVVGKSANSD-SFSEDVSYELIHNDSEDWSLEGGGSTKQEIFETN

Query:  DLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISG
        DL  +   T+E         +VI +   C  S +IFSC+ I+++    +  +  F  E+ V DI+ I+  W   V   I+ +R+  K      + N    
Subjt:  DLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISG

Query:  IELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADYE-------------------FVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN
        +E LK +V +  W   ++ I SL+++Y  +WN D E                   F + FE+V+YPKGDPDAV+I KRD+ELL+P TF+NDTIIDFY+N
Subjt:  IELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADYE-------------------FVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN

Arabidopsis top hitse value%identityAlignment
AT1G09730.1 Cysteine proteinases superfamily protein8.0e-2930.32Show/hide
Query:  SEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAID-AVVGKSANSD-SFSEDVSYELIHNDSEDWSLEGGGSTKQE-----
        ++ D RVE  S  L G      +S  ++ Q    F+    S S    D + AID ++  +SA S+ S SED      +  S  + ++  GS   +     
Subjt:  SEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAID-AVVGKSANSD-SFSEDVSYELIHNDSEDWSLEGGGSTKQE-----

Query:  -IFETNDLLLSRSSTNEDDV-----TVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHF
         ++   D +      +E+ +      ++  ++VI +   C  S +IFSC+ I+++    +  +  F  E+ V DI+ I+  W   V   I+ +R+  K  
Subjt:  -IFETNDLLLSRSSTNEDDV-----TVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHF

Query:  TRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADYE-------------------FVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFI
            + N    +E LK +V +  W   ++ I SL+++Y  +WN D E                   F + FE+V+YPKGDPDAV+I KRD+ELL+P TF+
Subjt:  TRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADYE-------------------FVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFI

Query:  NDTIIDFYVN
        NDTIIDFY+N
Subjt:  NDTIIDFYVN

AT1G09730.2 Cysteine proteinases superfamily protein2.5e-3033.11Show/hide
Query:  SEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAID-AVVGKSANSD-SFSEDVSYELIHNDSEDWSLEGGGSTKQEIFETN
        ++ D RVE  S  L G      +S  ++ Q    F+    S S    D + AID ++  +SA S+ S SED        D EDW  E   + +++I    
Subjt:  SEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAID-AVVGKSANSD-SFSEDVSYELIHNDSEDWSLEGGGSTKQEIFETN

Query:  DLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISG
        DL  +   T+E         +VI +   C  S +IFSC+ I+++    +  +  F  E+ V DI+ I+  W   V   I+ +R+  K      + N    
Subjt:  DLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVNLRLKGKHFTRAANSNDISG

Query:  IELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADYE-------------------FVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN
        +E LK +V +  W   ++ I SL+++Y  +WN D E                   F + FE+V+YPKGDPDAV+I KRD+ELL+P TF+NDTIIDFY+N
Subjt:  IELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADYE-------------------FVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVN

AT4G33620.1 Cysteine proteinases superfamily protein1.4e-5237.42Show/hide
Query:  KRFSVFDFSEEDERVEKVSRSLLGKFSA----RRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSEDVSYELIHNDSEDWSLE-GG
        K   VFD+S+ED+RVE+ S+ LL KF +    +    + +++FL+CFAK  +S S+ L   +I ++  V +  +    S D + +LI   S       G 
Subjt:  KRFSVFDFSEEDERVEKVSRSLLGKFSA----RRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSEDVSYELIHNDSEDWSLE-GG

Query:  GSTKQEIFETNDLLLSRSSTN----------EDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAI
         S        ND + +  +TN          E+   ++ PD +IY   +CT SKL FS +C+ ++ S+V+  + TF  +W + DII IES+WC  VETA 
Subjt:  GSTKQEIFETNDLLLSRSSTN----------EDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAI

Query:  VNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLW------------------NADYEFVDTFEEVIYPKGDPDAVTISKRDL
        VN+ LK +       + DISGI+LLKFSV D  WS+  + IRSL+ RY ++W                   +     D+FE+++YP+G+PDAV + K+D+
Subjt:  VNLRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLW------------------NADYEFVDTFEEVIYPKGDPDAVTISKRDL

Query:  ELLKPGTFINDTIIDFYV
        ELLKP  FINDTIIDFY+
Subjt:  ELLKPGTFINDTIIDFYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCGGACCTCCAACTCATCTTCATCGGGAAGCAGAAAGGGCAGAGCCAAAGGCGATGGCGGCGGAAAGAGATTTTCCGTTTTCGACTTCAGCGAAGAGGAC
GAGCGCGTCGAGAAAGTCTCTCGAAGCTTGCTCGGCAAGTTTTCTGCCCGCAGGAGCTCTCCCGTTACCGAACATCAGTTTCTCCAATGCTTTGCAAAAGGTGCC
AAAAGTGTAAGCAGGAATCTTAGCGATGAGCTCATTGCTATTGATGCTGTAGTTGGAAAAAGTGCCAACAGCGATAGTTTCTCTGAAGATGTTAGCTACGAACTC
ATTCACAATGATTCTGAAGATTGGTCACTTGAAGGTGGTGGATCTACAAAGCAGGAGATTTTTGAAACCAATGATCTATTGCTTTCTCGTTCTTCAACTAATGAG
GATGATGTGACAGTTGTTTTCCCTGATTTTGTCATCTATGAAGGTAACTGGTGTACAACATCGAAGTTAATATTTTCTTGTAGCTGCATTAGGCTTCAAGGTTCA
GCAGTGAGTGGGTTGCAAAGAACATTTGATACGGAATGGGCTGTCTCTGACATTATTGGCATTGAGTCAGAGTGGTGTTGTAGGGTTGAAACTGCAATTGTTAAT
CTTCGTCTCAAAGGAAAGCATTTCACGAGGGCTGCAAATTCAAATGACATTTCAGGCATAGAGTTATTGAAGTTTTCTGTTTGTGACCGTCTTTGGTCTGAAAGT
GAAAAAGCAATCAGATCATTGAATCTTAGATATAATGACTTATGGAATGCAGACTATGAATTTGTTGATACTTTTGAAGAGGTCATCTATCCAAAGGGAGATCCT
GATGCTGTGACCATTAGTAAGAGAGACCTTGAGCTTCTGAAGCCAGGGACATTTATTAACGATACTATCATTGACTTTTATGTTAATCTCCATTGGAGTTTGGTT
GTCATCTGCCATCCTGGTGACGTGACAAAAAATGTGATAACTTATCCAAGGTACCGTGCATCTTGCATATGGATTCTATTAAAGGGAGCCACAGAGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTCGGACCTCCAACTCATCTTCATCGGGAAGCAGAAAGGGCAGAGCCAAAGGCGATGGCGGCGGAAAGAGATTTTCCGTTTTCGACTTCAGCGAAGAGGAC
GAGCGCGTCGAGAAAGTCTCTCGAAGCTTGCTCGGCAAGTTTTCTGCCCGCAGGAGCTCTCCCGTTACCGAACATCAGTTTCTCCAATGCTTTGCAAAAGGTGCC
AAAAGTGTAAGCAGGAATCTTAGCGATGAGCTCATTGCTATTGATGCTGTAGTTGGAAAAAGTGCCAACAGCGATAGTTTCTCTGAAGATGTTAGCTACGAACTC
ATTCACAATGATTCTGAAGATTGGTCACTTGAAGGTGGTGGATCTACAAAGCAGGAGATTTTTGAAACCAATGATCTATTGCTTTCTCGTTCTTCAACTAATGAG
GATGATGTGACAGTTGTTTTCCCTGATTTTGTCATCTATGAAGGTAACTGGTGTACAACATCGAAGTTAATATTTTCTTGTAGCTGCATTAGGCTTCAAGGTTCA
GCAGTGAGTGGGTTGCAAAGAACATTTGATACGGAATGGGCTGTCTCTGACATTATTGGCATTGAGTCAGAGTGGTGTTGTAGGGTTGAAACTGCAATTGTTAAT
CTTCGTCTCAAAGGAAAGCATTTCACGAGGGCTGCAAATTCAAATGACATTTCAGGCATAGAGTTATTGAAGTTTTCTGTTTGTGACCGTCTTTGGTCTGAAAGT
GAAAAAGCAATCAGATCATTGAATCTTAGATATAATGACTTATGGAATGCAGACTATGAATTTGTTGATACTTTTGAAGAGGTCATCTATCCAAAGGGAGATCCT
GATGCTGTGACCATTAGTAAGAGAGACCTTGAGCTTCTGAAGCCAGGGACATTTATTAACGATACTATCATTGACTTTTATGTTAATCTCCATTGGAGTTTGGTT
GTCATCTGCCATCCTGGTGACGTGACAAAAAATGTGATAACTTATCCAAGGTACCGTGCATCTTGCATATGGATTCTATTAAAGGGAGCCACAGAGGGCTGA
Protein sequenceShow/hide protein sequence
MTRTSNSSSSGSRKGRAKGDGGGKRFSVFDFSEEDERVEKVSRSLLGKFSARRSSPVTEHQFLQCFAKGAKSVSRNLSDELIAIDAVVGKSANSDSFSEDVSYEL
IHNDSEDWSLEGGGSTKQEIFETNDLLLSRSSTNEDDVTVVFPDFVIYEGNWCTTSKLIFSCSCIRLQGSAVSGLQRTFDTEWAVSDIIGIESEWCCRVETAIVN
LRLKGKHFTRAANSNDISGIELLKFSVCDRLWSESEKAIRSLNLRYNDLWNADYEFVDTFEEVIYPKGDPDAVTISKRDLELLKPGTFINDTIIDFYVNLHWSLV
VICHPGDVTKNVITYPRYRASCIWILLKGATEG