; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004898 (gene) of Snake gourd v1 genome

Gene IDTan0004898
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionmetalloendoproteinase 1-like
Genome locationLG09:53301698..53302627
RNA-Seq ExpressionTan0004898
SyntenyTan0004898
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0030198 - extracellular matrix organization (biological process)
GO:0030574 - collagen catabolic process (biological process)
GO:0031012 - extracellular matrix (cellular component)
GO:0031225 - anchored component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001818 - Peptidase M10, metallopeptidase
IPR002477 - Peptidoglycan binding-like
IPR006026 - Peptidase, metallopeptidase
IPR021158 - Peptidase M10A, cysteine switch, zinc binding site
IPR021190 - Peptidase M10A
IPR024079 - Metallopeptidase, catalytic domain superfamily
IPR033739 - Peptidase M10A, catalytic domain
IPR036365 - PGBD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023772.1 Metalloendoproteinase 3-MMP, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-15486.08Show/hide
Query:  KPLVLILFIFLPLCFSLP---------LPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHL
        KPL+L  FIFLPLC SLP         LPQVSPFAFLNDLQG KKGDNVKGISKLKNFF YYGYLNH  N TG+LND D D FDD LE AIKTYQ+YFHL
Subjt:  KPLVLILFIFLPLCFSLP---------LPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHL

Query:  NPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHF
        NPTG+LN ETISQLATPRCGVPDIVNGTTGR+L EHDD+  H H +HLPHVVSHYAFFPG+RRWPSSKYRLTYAF+PGTR DAKAPV RAFATWAR THF
Subjt:  NPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHF

Query:  KFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKG
        KFSLTTNY+RA+LKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKG
Subjt:  KFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKG

Query:  LNADDIKGIKVLYNRR
        LNADDIKGIKVLYN+R
Subjt:  LNADDIKGIKVLYNRR

XP_004139164.1 metalloendoproteinase 3-MMP [Cucumis sativus]3.2e-15689.22Show/hide
Query:  LVLILFIFLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLND-DDNDAFDDLLESAIKTYQQYFHLNPTGSLNAET
        L+LI+ IFLPLCFSLPL QVSPFAFLNDLQG KKGDNVKGISKLKNFFRYYGYLNH+ NATGHL D D ND FDD LESAIKTYQQYFHLNPTGSLNAET
Subjt:  LVLILFIFLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLND-DDNDAFDDLLESAIKTYQQYFHLNPTGSLNAET

Query:  ISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTTNYRR
        +SQLATPRCG PDI+N TTGRMLSE  DN     +HHLPH VSHYAFFPGR RWPS+KYRLTYAF+PGTRADAKAPVARAFATWARNTHFKF+L TNYRR
Subjt:  ISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTTNYRR

Query:  ADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDIKGIK
        ADLKIGFY+GNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSG+TKGLN DDIKGIK
Subjt:  ADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDIKGIK

Query:  VLYNRR
        VLYNRR
Subjt:  VLYNRR

XP_008443669.1 PREDICTED: metalloendoproteinase 2-MMP-like [Cucumis melo]2.7e-15587.42Show/hide
Query:  KPLVLILFI--FLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL
        K L ++LFI  FLPLCFSLPL QVSPFAFLNDLQG KKGDNVKGISKLKNFFRYYGYLNH+ NATGHL D D +D FDD LE AIKTYQQYFHLNPTGSL
Subjt:  KPLVLILFI--FLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL

Query:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTT
        NAETISQLATPRCG PDI+N +TGRMLSEH++N     +HHLPH VSHYAFFPGRRRWPS+KYRLTYAF+PGTRADAKAPV RAFATWARNTHFKFSL T
Subjt:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTT

Query:  NYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDI
        NYRRADLKIGFY+GNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHSTV+NAIMYPYI+SG+TKGLNADDI
Subjt:  NYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDI

Query:  KGIKVLYNRR
        KGIKVLYNRR
Subjt:  KGIKVLYNRR

XP_023516970.1 metalloendoproteinase 1-like [Cucurbita pepo subsp. pepo]4.6e-15586.35Show/hide
Query:  KPLVLILFIFLPLCFSLPLPQ------VSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPT
        KPL+L  FIFLPLC SLPLP+      VSPFAFLNDLQG KKGDNVKGISKLKNFF YYGYLN RTN T ++N DDND FDD LE AIKTYQQYFHLNPT
Subjt:  KPLVLILFIFLPLCFSLPLPQ------VSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPT

Query:  GSLNAETISQLATPRCGVPDIVNGTTGRMLSEH--DDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFK
        G+LN ETISQLATPRCGVPDIVNGTTGR+L EH  DD+ D  H++HLPHVVSHYAFFPG+RRWPSSKYRLTYAF+P TR DAKAPV RAFATWAR THFK
Subjt:  GSLNAETISQLATPRCGVPDIVNGTTGRMLSEH--DDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFK

Query:  FSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGL
        FSLTTNY+RA+LKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGL
Subjt:  FSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGL

Query:  NADDIKGIKVLYNRR
        NADDIKGIKVLYNRR
Subjt:  NADDIKGIKVLYNRR

XP_038880397.1 metalloendoproteinase 1-like [Benincasa hispida]7.1e-15687.26Show/hide
Query:  KPLVLILF--IFLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL
        KPLV++LF  IFLPLCFSLPL QVSPFAFLNDLQG KKGD+V GISKLKNFF YYGYLNHR N TGHL + D +D FDD LESAIKTYQQYFHLNPTG L
Subjt:  KPLVLILF--IFLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL

Query:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDN----YDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKF
        + ETISQLATPRCGVPDI+NGTT RMLSEHD++    +DH H+HHLPH VSHYAFFPGRRRWPS+KYRLTYAF+PGTRADAKAPVARAFATWARNTHFKF
Subjt:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDN----YDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKF

Query:  SLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLN
        SL TNYRRADLKIGFY GNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHS V+NAIMYPYIKSGTTKGLN
Subjt:  SLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLN

Query:  ADDIKGIKVLYNRR
        ADDIKGIKVLYNRR
Subjt:  ADDIKGIKVLYNRR

TrEMBL top hitse value%identityAlignment
A0A0A0M0J6 ZnMc domain-containing protein1.5e-15689.22Show/hide
Query:  LVLILFIFLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLND-DDNDAFDDLLESAIKTYQQYFHLNPTGSLNAET
        L+LI+ IFLPLCFSLPL QVSPFAFLNDLQG KKGDNVKGISKLKNFFRYYGYLNH+ NATGHL D D ND FDD LESAIKTYQQYFHLNPTGSLNAET
Subjt:  LVLILFIFLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLND-DDNDAFDDLLESAIKTYQQYFHLNPTGSLNAET

Query:  ISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTTNYRR
        +SQLATPRCG PDI+N TTGRMLSE  DN     +HHLPH VSHYAFFPGR RWPS+KYRLTYAF+PGTRADAKAPVARAFATWARNTHFKF+L TNYRR
Subjt:  ISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTTNYRR

Query:  ADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDIKGIK
        ADLKIGFY+GNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSG+TKGLN DDIKGIK
Subjt:  ADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDIKGIK

Query:  VLYNRR
        VLYNRR
Subjt:  VLYNRR

A0A1S3B8J8 metalloendoproteinase 2-MMP-like1.3e-15587.42Show/hide
Query:  KPLVLILFI--FLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL
        K L ++LFI  FLPLCFSLPL QVSPFAFLNDLQG KKGDNVKGISKLKNFFRYYGYLNH+ NATGHL D D +D FDD LE AIKTYQQYFHLNPTGSL
Subjt:  KPLVLILFI--FLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL

Query:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTT
        NAETISQLATPRCG PDI+N +TGRMLSEH++N     +HHLPH VSHYAFFPGRRRWPS+KYRLTYAF+PGTRADAKAPV RAFATWARNTHFKFSL T
Subjt:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTT

Query:  NYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDI
        NYRRADLKIGFY+GNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHSTV+NAIMYPYI+SG+TKGLNADDI
Subjt:  NYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDI

Query:  KGIKVLYNRR
        KGIKVLYNRR
Subjt:  KGIKVLYNRR

A0A5A7T4D6 Metalloendoproteinase 2-MMP-like1.3e-15587.42Show/hide
Query:  KPLVLILFI--FLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL
        K L ++LFI  FLPLCFSLPL QVSPFAFLNDLQG KKGDNVKGISKLKNFFRYYGYLNH+ NATGHL D D +D FDD LE AIKTYQQYFHLNPTGSL
Subjt:  KPLVLILFI--FLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDD-NDAFDDLLESAIKTYQQYFHLNPTGSL

Query:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTT
        NAETISQLATPRCG PDI+N +TGRMLSEH++N     +HHLPH VSHYAFFPGRRRWPS+KYRLTYAF+PGTRADAKAPV RAFATWARNTHFKFSL T
Subjt:  NAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTT

Query:  NYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDI
        NYRRADLKIGFY+GNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHSTV+NAIMYPYI+SG+TKGLNADDI
Subjt:  NYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDI

Query:  KGIKVLYNRR
        KGIKVLYNRR
Subjt:  KGIKVLYNRR

A0A6J1H9F9 metalloendoproteinase 1-like3.2e-15485.27Show/hide
Query:  KPLVLILFIFLPLCFSLP------------LPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY
        KPL+L  FIFLPLC SLP            LPQVSPFAFLNDLQG KKGDNVKGISKLKNFF YYGYLNH  N TG+LND D D FDD LESAIKTYQ+Y
Subjt:  KPLVLILFIFLPLCFSLP------------LPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY

Query:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARN
        FHLNPTG+LN ETISQLATPRCGVPDIVNGTTGR+L EHDD+  H H +HL HVVSHYAFFPG+RRWPSSKYRLTYAF+PGTR DAKAPV RAFATWAR 
Subjt:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARN

Query:  THFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGT
        THFKFSLTTNY+RA+LKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGT
Subjt:  THFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGT

Query:  TKGLNADDIKGIKVLYNRR
        TKGLNADDIKGIKVLYN+R
Subjt:  TKGLNADDIKGIKVLYNRR

A0A6J1JJJ4 LOW QUALITY PROTEIN: metalloendoproteinase 1-like1.3e-14782.81Show/hide
Query:  KPLVLILFIFLPLCFSLP------------LPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY
        KPLVL  FIFLPLC SLP            LPQVSPFAFLNDL G KKGDNVKGISKLKNFF YYGYLNH TN T + N D NDAFDD LESAIKTYQQY
Subjt:  KPLVLILFIFLPLCFSLP------------LPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY

Query:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRML-SEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWAR
        FHLNPTG+LN ETISQLATPRCGVPDIVN TT  +L  E DD+ D  H +HLPHVVSHYAFFPG+RRWPSSKYRLTYAF+P TR DAKA V RAF  WAR
Subjt:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRML-SEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWAR

Query:  NTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSG
         THFKFSLTTNY+RA+LKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAV+GRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSG
Subjt:  NTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSG

Query:  TTKGLNADDIKGIKVLYNRR
        TTKGLNADDIKGIKVLY +R
Subjt:  TTKGLNADDIKGIKVLYNRR

SwissProt top hitse value%identityAlignment
O04529 Metalloendoproteinase 2-MMP1.2e-6043.25Show/hide
Query:  NDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTT---GRMLS
        ++  G   G NV G+ ++K +F+ +GY+      +G+  DD    FDD+L++A++ YQ  F+LN TG L+A TI  +  PRCG PD+VNGT+   G    
Subjt:  NDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTT---GRMLS

Query:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGT--RADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG
          + N+   H     H V  Y  FPG  RWP ++  LTYAF P      + K+  +RAF  W+  T   F+L+ ++  +D+ IGFY G+HGDG PFDG  
Subjt:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGT--RADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG

Query:  GTLAHAFAPTDGRFHYDSTEKWAVG-------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
        GTLAHAF+P  G+FH D+ E W V        +V    DL++VA+HEIGHLLGLGHS+V+ +IMYP I +G  K  L  DD++GI+ LY
Subjt:  GTLAHAFAPTDGRFHYDSTEKWAVG-------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY

O23507 Metalloendoproteinase 1-MMP1.5e-4740Show/hide
Query:  GDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHY
        G +V G+S+LK +   +GY+N  +          +D FD  LESAI  YQ+   L  TG L+  T++ ++ PRCGV D        M   +D        
Subjt:  GDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHY

Query:  HHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPG------TRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAF
            H  +HY +F G+ +W  ++  LTYA          T  D K    RAF+ W+      F    ++  ADLKIGFY G+HGDG PFDG  GTLAHAF
Subjt:  HHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPG------TRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAF

Query:  APTDGRFHYDSTEKWAV-----GAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
        AP +GR H D+ E W V     G+     DL++VA HEIGHLLGLGHS+ ++A+MYP ++  T K  L  DD+ G+  LY
Subjt:  APTDGRFHYDSTEKWAV-----GAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY

P29136 Metalloendoproteinase 11.2e-5746.21Show/hide
Query:  GDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHY
        G N KG+S +KN+F + GY+ +  +       DDN  FDD L SAIKTYQ+ ++LN TG  +  T+ Q+ TPRCGVPDI+  T         +    F  
Subjt:  GDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHY

Query:  HHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRAD--AKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTD
             ++S Y FF    RW +   +LTYAF P  R D   K+ +ARAF+ W    +  F  TT+Y  A++KI F   NHGD YPFDGPGG L HAFAPTD
Subjt:  HHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRAD--AKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTD

Query:  GRFHYDSTEKWAVGA------VRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
        GR H+D+ E W          V   +DL++VA+HEIGHLLGLGHS+   AIMYP I   T K  L  DDI GI+ LY
Subjt:  GRFHYDSTEKWAVGA------VRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY

Q5XF51 Metalloendoproteinase 3-MMP7.6e-6042.76Show/hide
Query:  AFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLS
        +FLN   G   G    G+  LK +F+++GY+   TN +G+  DD    FDD+L++A++ YQ+ F LN TG L+  T+  +  PRCG PD+VNGT+     
Subjt:  AFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLS

Query:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVP--GTRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG
                       H V HY+FFPG  RWP ++  LTYAF P      + K+  +RAF  W   T   F+    +  +D+ IGFY G HGDG PFDGP 
Subjt:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVP--GTRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG

Query:  GTLAHAFAPTDGRFHYDSTEKWAVG--------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
         TLAHAF+P  G FH D  E W V         +V    DL++VA+HEIGHLLGLGHS+V+ +IMYP I++G  K  L  DD++G++ LY
Subjt:  GTLAHAFAPTDGRFHYDSTEKWAVG--------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY

Q9ZUJ5 Metalloendoproteinase 5-MMP6.0e-5740.55Show/hide
Query:  LVLILFIF----LPLCFSLPLPQVSPFAFLN----------DLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY
        L +++F F    +   F   +  + P  FLN           L G   G+N+ G+SKLK +FR +GY+      TG+  DD    FDD+L+SAI TYQ+ 
Subjt:  LVLILFIF----LPLCFSLPLPQVSPFAFLN----------DLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY

Query:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRA--DAKAPVARAFATWA
        F+L  TG L++ T+ Q+  PRCG PD+++G +              +   +      Y+FFPG+ RWP  K  LTYAF P      + K   +RAF  WA
Subjt:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRA--DAKAPVARAFATWA

Query:  RNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAV--GAVRGR-------YDLQTVALHEIGHLLGLGHSTVKN
          T   F+ + +  RAD+ IGF+ G HGDG PFDG  GTLAHA +P  G  H D  E W +  G +  R        DL++VA+HEIGHLLGLGHS+V++
Subjt:  RNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAV--GAVRGR-------YDLQTVALHEIGHLLGLGHSTVKN

Query:  AIMYPYIKSGTTK-GLNADDIKGIKVLY
        AIM+P I  G  K  L  DDI+GI+ LY
Subjt:  AIMYPYIKSGTTK-GLNADDIKGIKVLY

Arabidopsis top hitse value%identityAlignment
AT1G24140.1 Matrixin family protein5.4e-6142.76Show/hide
Query:  AFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLS
        +FLN   G   G    G+  LK +F+++GY+   TN +G+  DD    FDD+L++A++ YQ+ F LN TG L+  T+  +  PRCG PD+VNGT+     
Subjt:  AFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLS

Query:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVP--GTRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG
                       H V HY+FFPG  RWP ++  LTYAF P      + K+  +RAF  W   T   F+    +  +D+ IGFY G HGDG PFDGP 
Subjt:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVP--GTRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG

Query:  GTLAHAFAPTDGRFHYDSTEKWAVG--------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
         TLAHAF+P  G FH D  E W V         +V    DL++VA+HEIGHLLGLGHS+V+ +IMYP I++G  K  L  DD++G++ LY
Subjt:  GTLAHAFAPTDGRFHYDSTEKWAVG--------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY

AT1G59970.1 Matrixin family protein4.3e-5840.55Show/hide
Query:  LVLILFIF----LPLCFSLPLPQVSPFAFLN----------DLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY
        L +++F F    +   F   +  + P  FLN           L G   G+N+ G+SKLK +FR +GY+      TG+  DD    FDD+L+SAI TYQ+ 
Subjt:  LVLILFIF----LPLCFSLPLPQVSPFAFLN----------DLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQY

Query:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRA--DAKAPVARAFATWA
        F+L  TG L++ T+ Q+  PRCG PD+++G +              +   +      Y+FFPG+ RWP  K  LTYAF P      + K   +RAF  WA
Subjt:  FHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRA--DAKAPVARAFATWA

Query:  RNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAV--GAVRGR-------YDLQTVALHEIGHLLGLGHSTVKN
          T   F+ + +  RAD+ IGF+ G HGDG PFDG  GTLAHA +P  G  H D  E W +  G +  R        DL++VA+HEIGHLLGLGHS+V++
Subjt:  RNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDGRFHYDSTEKWAV--GAVRGR-------YDLQTVALHEIGHLLGLGHSTVKN

Query:  AIMYPYIKSGTTK-GLNADDIKGIKVLY
        AIM+P I  G  K  L  DDI+GI+ LY
Subjt:  AIMYPYIKSGTTK-GLNADDIKGIKVLY

AT1G70170.1 matrix metalloproteinase8.3e-6243.25Show/hide
Query:  NDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTT---GRMLS
        ++  G   G NV G+ ++K +F+ +GY+      +G+  DD    FDD+L++A++ YQ  F+LN TG L+A TI  +  PRCG PD+VNGT+   G    
Subjt:  NDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTT---GRMLS

Query:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGT--RADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG
          + N+   H     H V  Y  FPG  RWP ++  LTYAF P      + K+  +RAF  W+  T   F+L+ ++  +D+ IGFY G+HGDG PFDG  
Subjt:  EHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGT--RADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPG

Query:  GTLAHAFAPTDGRFHYDSTEKWAVG-------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
        GTLAHAF+P  G+FH D+ E W V        +V    DL++VA+HEIGHLLGLGHS+V+ +IMYP I +G  K  L  DD++GI+ LY
Subjt:  GTLAHAFAPTDGRFHYDSTEKWAVG-------AVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY

AT2G45040.1 Matrixin family protein4.6e-4439.05Show/hide
Query:  ISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHV
        I ++K   + YGYL             + ++ D   E A+  YQ+   L  TG  +++T+SQ+  PRCG PD V   T                    H 
Subjt:  ISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHV

Query:  VSHYAFFPGRRRWPSS-KYRLTYAFVPGTRADAKAPV------ARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDG
           Y +FPGR RW      +LTYAF         AP        RAF  WA      F  T +Y  AD+KIGF+ G+HGDG PFDG  G LAH F+P +G
Subjt:  VSHYAFFPGRRRWPSS-KYRLTYAFVPGTRADAKAPV------ARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAFAPTDG

Query:  RFHYDSTEKWAVGAVRGR----YDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
        R H D  E WAV     +     DL++VA+HEIGH+LGLGHS+VK+A MYP +K  + K  LN DD+ G++ LY
Subjt:  RFHYDSTEKWAVGAVRGR----YDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY

AT4G16640.1 Matrixin family protein1.1e-4840Show/hide
Query:  GDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHY
        G +V G+S+LK +   +GY+N  +          +D FD  LESAI  YQ+   L  TG L+  T++ ++ PRCGV D        M   +D        
Subjt:  GDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATPRCGVPDIVNGTTGRMLSEHDDNYDHFHY

Query:  HHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPG------TRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAF
            H  +HY +F G+ +W  ++  LTYA          T  D K    RAF+ W+      F    ++  ADLKIGFY G+HGDG PFDG  GTLAHAF
Subjt:  HHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPG------TRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYPFDGPGGTLAHAF

Query:  APTDGRFHYDSTEKWAV-----GAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY
        AP +GR H D+ E W V     G+     DL++VA HEIGHLLGLGHS+ ++A+MYP ++  T K  L  DD+ G+  LY
Subjt:  APTDGRFHYDSTEKWAV-----GAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTK-GLNADDIKGIKVLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAACCTTTGGTTCTTATCTTATTCATCTTCCTTCCCCTTTGCTTTTCTCTTCCATTGCCCCAAGTTTCTCCATTTGCATTTCTTAATGACCTCCAAGGCTCCAA
GAAAGGTGATAATGTCAAAGGAATATCTAAGCTTAAGAACTTTTTTCGTTATTATGGTTATTTGAACCATCGAACCAACGCTACAGGTCATCTTAACGATGACGATAATG
ATGCCTTTGACGATCTCCTCGAGTCTGCCATCAAAACCTACCAACAATACTTTCATCTCAATCCTACTGGATCTTTGAATGCCGAGACGATATCCCAACTTGCAACGCCT
CGGTGCGGCGTTCCAGATATTGTCAATGGAACTACCGGTCGAATGCTTTCAGAACACGACGATAATTATGACCATTTTCACTACCACCACCTCCCCCACGTTGTATCTCA
CTATGCCTTCTTTCCTGGAAGGCGTAGGTGGCCATCGTCCAAATACCGCCTTACTTATGCGTTTGTTCCAGGCACTCGTGCTGATGCCAAGGCGCCAGTGGCTCGAGCGT
TCGCGACGTGGGCTCGAAACACTCACTTTAAGTTTTCATTGACCACAAACTATAGAAGAGCGGACTTGAAGATAGGATTCTATAAAGGCAACCATGGAGATGGCTATCCA
TTCGATGGCCCCGGAGGGACTTTGGCACATGCCTTTGCTCCAACGGATGGGAGGTTTCATTATGATTCGACTGAGAAATGGGCAGTTGGGGCAGTGAGAGGGCGATATGA
CTTGCAAACGGTGGCTTTGCATGAAATTGGACACCTTCTTGGACTTGGACATAGCACTGTTAAAAATGCTATAATGTATCCTTATATCAAATCTGGGACTACTAAAGGTT
TGAATGCAGATGACATCAAAGGAATCAAGGTTCTGTACAATCGACGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAACCTTTGGTTCTTATCTTATTCATCTTCCTTCCCCTTTGCTTTTCTCTTCCATTGCCCCAAGTTTCTCCATTTGCATTTCTTAATGACCTCCAAGGCTCCAA
GAAAGGTGATAATGTCAAAGGAATATCTAAGCTTAAGAACTTTTTTCGTTATTATGGTTATTTGAACCATCGAACCAACGCTACAGGTCATCTTAACGATGACGATAATG
ATGCCTTTGACGATCTCCTCGAGTCTGCCATCAAAACCTACCAACAATACTTTCATCTCAATCCTACTGGATCTTTGAATGCCGAGACGATATCCCAACTTGCAACGCCT
CGGTGCGGCGTTCCAGATATTGTCAATGGAACTACCGGTCGAATGCTTTCAGAACACGACGATAATTATGACCATTTTCACTACCACCACCTCCCCCACGTTGTATCTCA
CTATGCCTTCTTTCCTGGAAGGCGTAGGTGGCCATCGTCCAAATACCGCCTTACTTATGCGTTTGTTCCAGGCACTCGTGCTGATGCCAAGGCGCCAGTGGCTCGAGCGT
TCGCGACGTGGGCTCGAAACACTCACTTTAAGTTTTCATTGACCACAAACTATAGAAGAGCGGACTTGAAGATAGGATTCTATAAAGGCAACCATGGAGATGGCTATCCA
TTCGATGGCCCCGGAGGGACTTTGGCACATGCCTTTGCTCCAACGGATGGGAGGTTTCATTATGATTCGACTGAGAAATGGGCAGTTGGGGCAGTGAGAGGGCGATATGA
CTTGCAAACGGTGGCTTTGCATGAAATTGGACACCTTCTTGGACTTGGACATAGCACTGTTAAAAATGCTATAATGTATCCTTATATCAAATCTGGGACTACTAAAGGTT
TGAATGCAGATGACATCAAAGGAATCAAGGTTCTGTACAATCGACGTTGA
Protein sequenceShow/hide protein sequence
MAKPLVLILFIFLPLCFSLPLPQVSPFAFLNDLQGSKKGDNVKGISKLKNFFRYYGYLNHRTNATGHLNDDDNDAFDDLLESAIKTYQQYFHLNPTGSLNAETISQLATP
RCGVPDIVNGTTGRMLSEHDDNYDHFHYHHLPHVVSHYAFFPGRRRWPSSKYRLTYAFVPGTRADAKAPVARAFATWARNTHFKFSLTTNYRRADLKIGFYKGNHGDGYP
FDGPGGTLAHAFAPTDGRFHYDSTEKWAVGAVRGRYDLQTVALHEIGHLLGLGHSTVKNAIMYPYIKSGTTKGLNADDIKGIKVLYNRR