; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G018610 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G018610
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionmetalloendoproteinase 2-MMP-like
Genome locationCG_Chr09:35835761..35845003
RNA-Seq ExpressionClCG09G018610
SyntenyClCG09G018610
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0030198 - extracellular matrix organization (biological process)
GO:0030574 - collagen catabolic process (biological process)
GO:0031012 - extracellular matrix (cellular component)
GO:0031225 - anchored component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001818 - Peptidase M10, metallopeptidase
IPR002477 - Peptidoglycan binding-like
IPR006026 - Peptidase, metallopeptidase
IPR021190 - Peptidase M10A
IPR024079 - Metallopeptidase, catalytic domain superfamily
IPR033739 - Peptidase M10A, catalytic domain
IPR036365 - PGBD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037451.1 metalloendoproteinase 2-MMP-like [Cucumis melo var. makuwa]1.1e-11072.79Show/hide
Query:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS
        ARV HT I S+L+ SRIGNNI GI  VKLYL+RYGYL TNV +T+ N  FD LLE AIK FQKYHSLNVSG+LDKETLTLMS PRC + DI+H+NN+  +
Subjt:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS

Query:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP
         IQ+NS++FHSH+TF PGNPKWPISKY+L YTFL+ FPN+F  PV NAM+QW +FS F FS     + ADITFNFVRGNHGDG+PF+GKGG LAHAFGP 
Subjt:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP

Query:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        DGRVHFD DE W DGSV G  NVGMV LHELGHVLGL HST RDAIMWPYM+ G+QTRGLQFDDI+GIQTLY
Subjt:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

KGN52411.1 hypothetical protein Csa_008387 [Cucumis sativus]2.3e-10564.24Show/hide
Query:  VKDLDMDDVNGLWE-LYDGFDDADRKGHNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCS
        +   D++ ++G  + L+  +    RKGHN+EG H++KTYLQ YGYLSK YN ID NGVY+NA+D+ LESS+KKYQKFF+LN+SGILD ETLRQMSQ RCS
Subjt:  VKDLDMDDVNGLWE-LYDGFDDADRKGHNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCS

Query:  VPDIFENDDNETSVRTSDLHLRSKYTFFPGKPKWPSSTKYSLKYSFIKNFPEEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFR
        VPD FE+DDNETS+ TS+LH+ S++ FFPG+PKWP S  YSL +SFI NFP  FK  V +AFLAWYE+SRF F+EV +  ++DIK+SFEVGDHGDG+PFR
Subjt:  VPDIFENDDNETSVRTSDLHLRSKYTFFPGKPKWPSSTKYSLKYSFIKNFPEEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFR

Query:  KGSGVLAHAFGPGDGRFHFNADQSFSVQVRYDKYHVRTVALHELGHSLGLGHSNSEDAIMFPSIPPNFSKGLDMDDVNGLWELYNGFH
           GVLAHAF P DGR HFN D+ FS +V   KYHVR+VALHELGHSLGL H+   DAIM+P++PPNF+K ++ DDVNGLW LY+ FH
Subjt:  KGSGVLAHAFGPGDGRFHFNADQSFSVQVRYDKYHVRTVALHELGHSLGLGHSNSEDAIMFPSIPPNFSKGLDMDDVNGLWELYNGFH

RWR93789.1 Peptidase M10 [Cinnamomum micranthum f. kanehirae]8.3e-11934.28Show/hide
Query:  LQESRIGNNIVGIQDVKLYLQRYGYLTNVEST----NPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNF
        L+ S+ G  I  +  +K YL+++GYL    +T    + + FDD LE AIK +Q    LN++G +D  T   M  PRC VPDI+  N  +     + ST+ 
Subjt:  LQESRIGNNIVGIQDVKLYLQRYGYLTNVEST----NPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNF

Query:  H--SHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPV-------TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP
        H  SHF FF   PKWP SKY LTY F   +P N +  +        +A  +W   + FTF        AD+   F  G+HGDG PF+G GG LAH+F P 
Subjt:  H--SHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPV-------TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP

Query:  DGRVHFDADESW-VDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLYTLFHPISTHDLDHIHKSSHFLFPQHLL
         G  HFD+DE W ++ S S  +++  V +HE GH+LGL H++  +AIM+P + +G +   L  DDI GI+ LY         +   +  S  ++      
Subjt:  DGRVHFDADESW-VDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLYTLFHPISTHDLDHIHKSSHFLFPQHLL

Query:  GSRKDHNIEGIHSLRKNYNIIDTNGA------HNNTFDHHLESAVKKYQKFFKLNESGILDVETLYQMSESRCSVPDIFEKD------DNETSKLHIGSK
                  ++++ ++ +   TN A       +++FD  LESA+K YQ+ F LN +G +D  T  QM  SRC VPD+             ++ LH  S 
Subjt:  GSRKDHNIEGIHSLRKNYNIIDTNGA------HNNTFDHHLESAVKKYQKFFKLNESGILDVETLYQMSESRCSVPDIFEKD------DNETSKLHIGSK

Query:  YTFFPGRIKWASWKKYQLKYSF----IRNFPEEFKESV-SAAFMIWYERSRFNFTEVVENEDADIRISFEVGNHGDLHPFTKE--VLAHTFGPGDGRF--
         +FFP    W    K  L Y F      + P E  ES+ ++AF  W   + F+   + +   A IRI F  G HGDL+PF     +LA+ + P    +  
Subjt:  YTFFPGRIKWASWKKYQLKYSF----IRNFPEEFKESV-SAAFMIWYERSRFNFTEVVENEDADIRISFEVGNHGDLHPFTKE--VLAHTFGPGDGRF--

Query:  ---HFNAEQSFS-VEVTYGKYHVRTLALHELGHALGLAHSTNEDAIMFPSLSP-----------------NVVKDLDMDDVNGLWELYDGFDDAD--RKG
           HF+A+  ++   +T   +H++++ +HE+GH LGL  S+   A+M P L                   N    L + ++  +  L D     +   KG
Subjt:  ---HFNAEQSFS-VEVTYGKYHVRTLALHELGHALGLAHSTNEDAIMFPSLSP-----------------NVVKDLDMDDVNGLWELYDGFDDAD--RKG

Query:  HNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCSVPDIFENDDNETSVRTSDLHLRSKYTF
          ++G+H +K YL+ +GY+    NT    G  D++FDD LES++K YQ +F LN +G LD  T +QM  PRC V D+        ++ ++ LH  S+Y+F
Subjt:  HNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCSVPDIFENDDNETSVRTSDLHLRSKYTF

Query:  FPGKPKWPSSTKYSLKYSFIKNFP----EEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFRKGSGVLAHAFGPGDGRFHFNADQ
        F   P+WPSS K  L Y F++       +  +     AF  W   + F F E      ADIK+ F    HGD   F    G LAHA+ P  G FHF+AD+
Subjt:  FPGKPKWPSSTKYSLKYSFIKNFP----EEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFRKGSGVLAHAFGPGDGRFHFNADQ

Query:  SFSVQVRYDK-YHVRTVALHELGHSLGLGHSNSEDAIMFPSIP
         +++     + + V +VA+HE+GH LGL HS+  +AIMFPSIP
Subjt:  SFSVQVRYDK-YHVRTVALHELGHSLGLGHSNSEDAIMFPSIP

XP_004142465.1 metalloendoproteinase 5-MMP [Cucumis sativus]7.7e-11776.01Show/hide
Query:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNP-NVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSA
        ARV HT ISS+LQ SRIGNNI GI +VKLYL+RYGYLTNVESTN  N FD LLE AIK FQKYHSLNVSG++D+ETLTLMS PRC +PDI+HN N+  + 
Subjt:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNP-NVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSA

Query:  IQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPD
        +Q+NS++FHSHFTFFP N KWP+SKY+L YTFLD+FPN+F  PV NAMEQW +FS F FS A  +Q ADITFNFVRGNHGDGYPF+GKGG LAHAFGP D
Subjt:  IQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPD

Query:  GRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        GRVHFD DE W DGSV G  NVGMV LHELGHVLGL HST RDAIMWPYM AG+QTRGLQFDDI+GIQTLY
Subjt:  GRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

XP_008458910.1 PREDICTED: metalloendoproteinase 2-MMP-like [Cucumis melo]8.3e-11172.79Show/hide
Query:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS
        ARV HT I S+L+ SRIGNNI GI  VKLYL+RYGYL TNV +T+ N  FD LLE AIK FQKYHSLNVSG+LDKETLTLMS PRC + DI+HNNN+  +
Subjt:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS

Query:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP
         IQ+NS++FHSH+TF PGNPKWPISKY+L YTFL+ FPN+F  PV NAM+QW +FS F FS     +  DITFNFVRGNHGDG+PF+GKGG LAHAFGP 
Subjt:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP

Query:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        DGRVHFD DE W DGSV G  NVGMV LHELGHVLGL HST RDAIMWPYM+ G+QTRGLQFDDI+GIQTLY
Subjt:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

TrEMBL top hitse value%identityAlignment
A0A0A0KU02 ZnMc domain-containing protein3.7e-11776.01Show/hide
Query:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNP-NVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSA
        ARV HT ISS+LQ SRIGNNI GI +VKLYL+RYGYLTNVESTN  N FD LLE AIK FQKYHSLNVSG++D+ETLTLMS PRC +PDI+HN N+  + 
Subjt:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNP-NVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSA

Query:  IQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPD
        +Q+NS++FHSHFTFFP N KWP+SKY+L YTFLD+FPN+F  PV NAMEQW +FS F FS A  +Q ADITFNFVRGNHGDGYPF+GKGG LAHAFGP D
Subjt:  IQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPD

Query:  GRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        GRVHFD DE W DGSV G  NVGMV LHELGHVLGL HST RDAIMWPYM AG+QTRGLQFDDI+GIQTLY
Subjt:  GRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

A0A0A0KX75 ZnMc domain-containing protein1.1e-10564.24Show/hide
Query:  VKDLDMDDVNGLWE-LYDGFDDADRKGHNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCS
        +   D++ ++G  + L+  +    RKGHN+EG H++KTYLQ YGYLSK YN ID NGVY+NA+D+ LESS+KKYQKFF+LN+SGILD ETLRQMSQ RCS
Subjt:  VKDLDMDDVNGLWE-LYDGFDDADRKGHNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCS

Query:  VPDIFENDDNETSVRTSDLHLRSKYTFFPGKPKWPSSTKYSLKYSFIKNFPEEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFR
        VPD FE+DDNETS+ TS+LH+ S++ FFPG+PKWP S  YSL +SFI NFP  FK  V +AFLAWYE+SRF F+EV +  ++DIK+SFEVGDHGDG+PFR
Subjt:  VPDIFENDDNETSVRTSDLHLRSKYTFFPGKPKWPSSTKYSLKYSFIKNFPEEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFR

Query:  KGSGVLAHAFGPGDGRFHFNADQSFSVQVRYDKYHVRTVALHELGHSLGLGHSNSEDAIMFPSIPPNFSKGLDMDDVNGLWELYNGFH
           GVLAHAF P DGR HFN D+ FS +V   KYHVR+VALHELGHSLGL H+   DAIM+P++PPNF+K ++ DDVNGLW LY+ FH
Subjt:  KGSGVLAHAFGPGDGRFHFNADQSFSVQVRYDKYHVRTVALHELGHSLGLGHSNSEDAIMFPSIPPNFSKGLDMDDVNGLWELYNGFH

A0A1S3C931 metalloendoproteinase 2-MMP-like4.0e-11172.79Show/hide
Query:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS
        ARV HT I S+L+ SRIGNNI GI  VKLYL+RYGYL TNV +T+ N  FD LLE AIK FQKYHSLNVSG+LDKETLTLMS PRC + DI+HNNN+  +
Subjt:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS

Query:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP
         IQ+NS++FHSH+TF PGNPKWPISKY+L YTFL+ FPN+F  PV NAM+QW +FS F FS     +  DITFNFVRGNHGDG+PF+GKGG LAHAFGP 
Subjt:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP

Query:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        DGRVHFD DE W DGSV G  NVGMV LHELGHVLGL HST RDAIMWPYM+ G+QTRGLQFDDI+GIQTLY
Subjt:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

A0A3S3R1B8 Peptidase M104.0e-11934.28Show/hide
Query:  LQESRIGNNIVGIQDVKLYLQRYGYLTNVEST----NPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNF
        L+ S+ G  I  +  +K YL+++GYL    +T    + + FDD LE AIK +Q    LN++G +D  T   M  PRC VPDI+  N  +     + ST+ 
Subjt:  LQESRIGNNIVGIQDVKLYLQRYGYLTNVEST----NPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNF

Query:  H--SHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPV-------TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP
        H  SHF FF   PKWP SKY LTY F   +P N +  +        +A  +W   + FTF        AD+   F  G+HGDG PF+G GG LAH+F P 
Subjt:  H--SHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPV-------TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP

Query:  DGRVHFDADESW-VDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLYTLFHPISTHDLDHIHKSSHFLFPQHLL
         G  HFD+DE W ++ S S  +++  V +HE GH+LGL H++  +AIM+P + +G +   L  DDI GI+ LY         +   +  S  ++      
Subjt:  DGRVHFDADESW-VDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLYTLFHPISTHDLDHIHKSSHFLFPQHLL

Query:  GSRKDHNIEGIHSLRKNYNIIDTNGA------HNNTFDHHLESAVKKYQKFFKLNESGILDVETLYQMSESRCSVPDIFEKD------DNETSKLHIGSK
                  ++++ ++ +   TN A       +++FD  LESA+K YQ+ F LN +G +D  T  QM  SRC VPD+             ++ LH  S 
Subjt:  GSRKDHNIEGIHSLRKNYNIIDTNGA------HNNTFDHHLESAVKKYQKFFKLNESGILDVETLYQMSESRCSVPDIFEKD------DNETSKLHIGSK

Query:  YTFFPGRIKWASWKKYQLKYSF----IRNFPEEFKESV-SAAFMIWYERSRFNFTEVVENEDADIRISFEVGNHGDLHPFTKE--VLAHTFGPGDGRF--
         +FFP    W    K  L Y F      + P E  ES+ ++AF  W   + F+   + +   A IRI F  G HGDL+PF     +LA+ + P    +  
Subjt:  YTFFPGRIKWASWKKYQLKYSF----IRNFPEEFKESV-SAAFMIWYERSRFNFTEVVENEDADIRISFEVGNHGDLHPFTKE--VLAHTFGPGDGRF--

Query:  ---HFNAEQSFS-VEVTYGKYHVRTLALHELGHALGLAHSTNEDAIMFPSLSP-----------------NVVKDLDMDDVNGLWELYDGFDDAD--RKG
           HF+A+  ++   +T   +H++++ +HE+GH LGL  S+   A+M P L                   N    L + ++  +  L D     +   KG
Subjt:  ---HFNAEQSFS-VEVTYGKYHVRTLALHELGHALGLAHSTNEDAIMFPSLSP-----------------NVVKDLDMDDVNGLWELYDGFDDAD--RKG

Query:  HNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCSVPDIFENDDNETSVRTSDLHLRSKYTF
          ++G+H +K YL+ +GY+    NT    G  D++FDD LES++K YQ +F LN +G LD  T +QM  PRC V D+        ++ ++ LH  S+Y+F
Subjt:  HNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCSVPDIFENDDNETSVRTSDLHLRSKYTF

Query:  FPGKPKWPSSTKYSLKYSFIKNFP----EEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFRKGSGVLAHAFGPGDGRFHFNADQ
        F   P+WPSS K  L Y F++       +  +     AF  W   + F F E      ADIK+ F    HGD   F    G LAHA+ P  G FHF+AD+
Subjt:  FPGKPKWPSSTKYSLKYSFIKNFP----EEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFRKGSGVLAHAFGPGDGRFHFNADQ

Query:  SFSVQVRYDK-YHVRTVALHELGHSLGLGHSNSEDAIMFPSIP
         +++     + + V +VA+HE+GH LGL HS+  +AIMFPSIP
Subjt:  SFSVQVRYDK-YHVRTVALHELGHSLGLGHSNSEDAIMFPSIP

A0A5D3BRW3 Metalloendoproteinase 2-MMP-like5.2e-11172.79Show/hide
Query:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS
        ARV HT I S+L+ SRIGNNI GI  VKLYL+RYGYL TNV +T+ N  FD LLE AIK FQKYHSLNVSG+LDKETLTLMS PRC + DI+H+NN+  +
Subjt:  ARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYL-TNVESTNPN-VFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQS

Query:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP
         IQ+NS++FHSH+TF PGNPKWPISKY+L YTFL+ FPN+F  PV NAM+QW +FS F FS     + ADITFNFVRGNHGDG+PF+GKGG LAHAFGP 
Subjt:  AIQINSTNFHSHFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPP

Query:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        DGRVHFD DE W DGSV G  NVGMV LHELGHVLGL HST RDAIMWPYM+ G+QTRGLQFDDI+GIQTLY
Subjt:  DGRVHFDADESWVDGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

SwissProt top hitse value%identityAlignment
O04529 Metalloendoproteinase 2-MMP2.1e-4836.88Show/hide
Query:  DISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTN-PNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNI----TQSAIQ
        D  SN      G N+ G+  +K Y QR+GY+    S N  + FDD+L+ A++++Q   +LNV+G LD  T+  +  PRC  PD+++  ++     +   +
Subjt:  DISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTN-PNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNI----TQSAIQ

Query:  INSTNFHSH----FTFFPGNPKWPISKYHLTYTFLDNFP--NNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAF
        +N +  H H    +T FPG P+WP ++  LTY F    P      +  + A  +W   +   F+ +    T+DIT  F  G+HGDG PF+G  G LAHAF
Subjt:  INSTNFHSH----FTFFPGNPKWPISKYHLTYTFLDNFP--NNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAF

Query:  GPPDGRVHFDADESWVDG-------SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
         PP G+ H DADE+WV         SV+ + ++  V +HE+GH+LGLGHS+  ++IM+P +  G +   L  DD++GIQ LY
Subjt:  GPPDGRVHFDADESWVDG-------SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

O23507 Metalloendoproteinase 1-MMP2.0e-4337.87Show/hide
Query:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS
        S L + +IG+++ G+ ++K YL R+GY+ +      +VFD  LE AI ++Q+   L ++G LD  T+TLMS PRC V D       T   I  +  +  +
Subjt:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS

Query:  HFTFFPGNPKWPISKYHLTYTF-----LDNFPNNFIAPV-TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVH
        H+T+F G PKW  ++  LTY       LD   +  +  V   A  QW      +F       TAD+   F  G+HGDG PF+G  G LAHAF P +GR+H
Subjt:  HFTFFPGNPKWPISKYHLTYTF-----LDNFPNNFIAPV-TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVH

Query:  FDADESW-VDGSVSGSFNVGM----VVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
         DA E+W VD  + GS  V +    V  HE+GH+LGLGHS+   A+M+P +    +   L  DD+ G+  LY
Subjt:  FDADESW-VDGSVSGSFNVGM----VVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

P29136 Metalloendoproteinase 12.0e-4641Show/hide
Query:  GNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHSHFTFFPGN
        G N  G+ +VK Y    GY+ N    + N FDD L  AIK +QK ++LNV+G  D  TL  +  PRC VPDI+ N N T S   I      S +TFF   
Subjt:  GNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHSHFTFFPGN

Query:  PKWPISKYHLTYTFL--DNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVHFDADESWV----
        P+W      LTY F       + F + +  A  +W       F      +TA+I   F   NHGD YPF+G GG L HAF P DGR HFDADE WV    
Subjt:  PKWPISKYHLTYTFL--DNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVHFDADESWV----

Query:  --DGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
             V+ +F++  V +HE+GH+LGLGHS+   AIM+P +    +   L  DDI GI+ LY
Subjt:  --DGSVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

Q5XF51 Metalloendoproteinase 3-MMP2.2e-4535.92Show/hide
Query:  NLQESRIGNNIVGIQDVKLYLQRYGYL--TNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQI------
        N      G    G+  +K Y Q +GY+  TN+     + FDD+L+ A++++Q+   LNV+G+LD+ TL  +  PRC  PD+++  +   S  +       
Subjt:  NLQESRIGNNIVGIQDVKLYLQRYGYL--TNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQI------

Query:  -NSTNFHS--HFTFFPGNPKWPISKYHLTYTFLDNFPNNFI-----APVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHA
             FH+  H++FFPG P+WP ++  LTY F    P N +     +  + A  +W   +  TF+   R  T+DI+  F  G HGDG PF+G    LAHA
Subjt:  -NSTNFHS--HFTFFPGNPKWPISKYHLTYTFLDNFPNNFI-----APVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHA

Query:  FGPPDGRVHFDADESWV------DG--SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        F PP G  H D +E+W+      DG  SVS + ++  V +HE+GH+LGLGHS+   +IM+P +  G +   L  DD++G+Q LY
Subjt:  FGPPDGRVHFDADESWV------DG--SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

Q9ZUJ5 Metalloendoproteinase 5-MMP1.5e-4638.24Show/hide
Query:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS
        S L    IG NI G+  +K Y +R+GY+T   +   + FDD+L+ AI  +QK  +L V+G LD  TL  + +PRC  PD++   +       + +T    
Subjt:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS

Query:  HFTFFPGNPKWPISKYHLTYTFL--DNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVHFDAD
         ++FFPG P+WP  K  LTY F   +N  +      + A  +W   +   F+ +     ADI   F  G HGDG PF+G  G LAHA  PP G +H D D
Subjt:  HFTFFPGNPKWPISKYHLTYTFL--DNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVHFDAD

Query:  ESWV--DGSVSGSF-------NVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        E W+  +G +S          ++  V +HE+GH+LGLGHS+  DAIM+P ++ GD+   L  DDI+GIQ LY
Subjt:  ESWV--DGSVSGSF-------NVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

Arabidopsis top hitse value%identityAlignment
AT1G24140.1 Matrixin family protein1.5e-4635.92Show/hide
Query:  NLQESRIGNNIVGIQDVKLYLQRYGYL--TNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQI------
        N      G    G+  +K Y Q +GY+  TN+     + FDD+L+ A++++Q+   LNV+G+LD+ TL  +  PRC  PD+++  +   S  +       
Subjt:  NLQESRIGNNIVGIQDVKLYLQRYGYL--TNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQI------

Query:  -NSTNFHS--HFTFFPGNPKWPISKYHLTYTFLDNFPNNFI-----APVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHA
             FH+  H++FFPG P+WP ++  LTY F    P N +     +  + A  +W   +  TF+   R  T+DI+  F  G HGDG PF+G    LAHA
Subjt:  -NSTNFHS--HFTFFPGNPKWPISKYHLTYTFLDNFPNNFI-----APVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHA

Query:  FGPPDGRVHFDADESWV------DG--SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        F PP G  H D +E+W+      DG  SVS + ++  V +HE+GH+LGLGHS+   +IM+P +  G +   L  DD++G+Q LY
Subjt:  FGPPDGRVHFDADESWV------DG--SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

AT1G59970.1 Matrixin family protein1.1e-4738.24Show/hide
Query:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS
        S L    IG NI G+  +K Y +R+GY+T   +   + FDD+L+ AI  +QK  +L V+G LD  TL  + +PRC  PD++   +       + +T    
Subjt:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS

Query:  HFTFFPGNPKWPISKYHLTYTFL--DNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVHFDAD
         ++FFPG P+WP  K  LTY F   +N  +      + A  +W   +   F+ +     ADI   F  G HGDG PF+G  G LAHA  PP G +H D D
Subjt:  HFTFFPGNPKWPISKYHLTYTFL--DNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVHFDAD

Query:  ESWV--DGSVSGSF-------NVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
        E W+  +G +S          ++  V +HE+GH+LGLGHS+  DAIM+P ++ GD+   L  DDI+GIQ LY
Subjt:  ESWV--DGSVSGSF-------NVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

AT1G70170.1 matrix metalloproteinase1.5e-4936.88Show/hide
Query:  DISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTN-PNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNI----TQSAIQ
        D  SN      G N+ G+  +K Y QR+GY+    S N  + FDD+L+ A++++Q   +LNV+G LD  T+  +  PRC  PD+++  ++     +   +
Subjt:  DISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTN-PNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNI----TQSAIQ

Query:  INSTNFHSH----FTFFPGNPKWPISKYHLTYTFLDNFP--NNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAF
        +N +  H H    +T FPG P+WP ++  LTY F    P      +  + A  +W   +   F+ +    T+DIT  F  G+HGDG PF+G  G LAHAF
Subjt:  INSTNFHSH----FTFFPGNPKWPISKYHLTYTFLDNFP--NNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAF

Query:  GPPDGRVHFDADESWVDG-------SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
         PP G+ H DADE+WV         SV+ + ++  V +HE+GH+LGLGHS+  ++IM+P +  G +   L  DD++GIQ LY
Subjt:  GPPDGRVHFDADESWVDG-------SVSGSFNVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY

AT2G45040.1 Matrixin family protein1.2e-3836.12Show/hide
Query:  IHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCSVPDIFENDDNETSVRTSDLHLRSKYTFFPGKP
        I  +K +LQ YGYL +   + D +           E ++ +YQK   L  +G  D +TL Q+  PRC  PD       +   +T+  H   KY +FPG+P
Subjt:  IHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCSVPDIFENDDNETSVRTSDLHLRSKYTFFPGKP

Query:  KWPSSTKYSLKYSFIKN------FPEEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFRKGSGVLAHAFGPGDGRFHFNADQSFS
        +W       L Y+F +        P + +     AF  W       F E +  V ADIK+ F  GDHGDG PF    GVLAH F P +GR H +  ++++
Subjt:  KWPSSTKYSLKYSFIKN------FPEEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFRKGSGVLAHAFGPGDGRFHFNADQSFS

Query:  VQVRYDKYHV----RTVALHELGHSLGLGHSNSEDAIMFPSIPPNFSK-GLDMDDVNGLWELY
        V    +K  V     +VA+HE+GH LGLGHS+ +DA M+P++ P   K  L+MDDV G+  LY
Subjt:  VQVRYDKYHV----RTVALHELGHSLGLGHSNSEDAIMFPSIPPNFSK-GLDMDDVNGLWELY

AT4G16640.1 Matrixin family protein1.4e-4437.87Show/hide
Query:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS
        S L + +IG+++ G+ ++K YL R+GY+ +      +VFD  LE AI ++Q+   L ++G LD  T+TLMS PRC V D       T   I  +  +  +
Subjt:  SNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS

Query:  HFTFFPGNPKWPISKYHLTYTF-----LDNFPNNFIAPV-TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVH
        H+T+F G PKW  ++  LTY       LD   +  +  V   A  QW      +F       TAD+   F  G+HGDG PF+G  G LAHAF P +GR+H
Subjt:  HFTFFPGNPKWPISKYHLTYTF-----LDNFPNNFIAPV-TNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVH

Query:  FDADESW-VDGSVSGSFNVGM----VVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY
         DA E+W VD  + GS  V +    V  HE+GH+LGLGHS+   A+M+P +    +   L  DD+ G+  LY
Subjt:  FDADESW-VDGSVSGSFNVGM----VVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACGTGTTCTTCACACCGATATATCCTCCAATCTTCAAGAAAGTCGCATAGGTAACAATATCGTAGGAATCCAAGATGTCAAGCTTTACCTTCAACGTTACGGTTA
CTTAACCAACGTTGAGAGTACCAATCCTAACGTATTCGATGATCTCCTAGAGTTTGCCATTAAAATATTTCAAAAATATCACAGCCTCAACGTGAGTGGCATTTTGGATA
AGGAGACATTAACTTTAATGTCTCAGCCTCGATGCGAAGTTCCAGATATCTTGCACAATAATAATATCACTCAAAGTGCCATTCAAATAAATAGTACCAACTTTCATTCT
CATTTCACATTCTTTCCGGGAAATCCGAAGTGGCCGATTTCGAAATACCATTTGACGTACACGTTTCTCGACAATTTCCCGAATAACTTCATAGCGCCGGTGACAAACGC
AATGGAGCAATGGGGGATGTTCAGCAAGTTCACATTCTCAGCAGCTGCACGGTCTCAAACAGCGGACATCACATTTAACTTTGTGAGAGGAAACCATGGGGATGGTTATC
CATTTGAAGGAAAAGGAGGAGCTTTGGCGCATGCTTTTGGACCACCGGACGGGAGGGTGCACTTTGATGCGGACGAAAGTTGGGTGGATGGGTCCGTTAGTGGTTCGTTT
AATGTGGGGATGGTAGTGTTGCATGAGCTTGGGCATGTGCTCGGGCTTGGCCACAGCACCACTCGAGATGCCATTATGTGGCCCTACATGAACGCCGGTGATCAGACCAG
GGGCTTACAGTTTGATGATATTCAAGGCATCCAAACTTTGTATACATTATTTCACCCGATTTCGACACATGATCTCGATCACATCCATAAATCATCTCACTTTCTATTTC
CTCAACATCTTCTGGGAAGTCGTAAGGATCACAACATCGAAGGAATCCATAGCTTAAGAAAAAATTATAACATTATCGATACCAATGGCGCTCATAATAACACCTTCGAC
CACCACCTAGAATCCGCCGTAAAAAAATACCAAAAATTCTTCAAGCTTAACGAGAGTGGAATTTTAGACGTGGAGACATTGTACCAAATGTCAGAGTCCCGTTGTTCGGT
TCCCGACATATTCGAGAAGGACGACAATGAGACGAGTAAACTCCACATAGGAAGCAAGTACACATTTTTTCCCGGGAGAATAAAATGGGCGAGTTGGAAGAAATACCAAT
TAAAATACTCATTCATTCGGAATTTCCCAGAAGAGTTTAAGGAGTCGGTGAGTGCGGCGTTTATGATATGGTATGAACGCAGCCGATTTAATTTCACAGAAGTTGTTGAG
AATGAAGATGCGGATATAAGAATAAGCTTTGAGGTAGGAAACCATGGAGATTTGCATCCTTTCACGAAGGAAGTTTTGGCACATACGTTTGGGCCTGGGGATGGGAGATT
TCACTTCAATGCTGAACAATCTTTTTCTGTTGAAGTTACATATGGTAAGTATCATGTGAGAACTTTGGCACTTCATGAGCTCGGACATGCACTTGGGCTGGCGCACAGCA
CCAATGAAGATGCTATCATGTTTCCCTCTCTATCTCCTAATGTTGTTAAGGATTTAGATATGGACGATGTTAATGGACTGTGGGAATTATATGATGGATTTGATGATGCT
GATCGTAAGGGTCACAACATTGAAGGAATTCATAGCGTCAAAACGTACCTCCAACATTATGGTTACCTAAGCAAAAAATATAACACTATCGATCCCAATGGCGTTTATGA
TAACGCCTTCGATGACCACCTAGAATCATCTGTAAAAAAATATCAAAAGTTCTTCAAGCTTAACGAGAGCGGAATTTTAGACGTGGAGACATTGCGCCAAATGTCACAGC
CTCGATGTTCGGTTCCGGACATATTCGAGAACGACGACAACGAAACGAGTGTGAGGACGAGTGATCTCCACTTAAGAAGTAAGTACACATTTTTCCCAGGGAAACCAAAA
TGGCCGAGTTCGACGAAATACTCTCTAAAATACTCATTTATTAAGAATTTCCCAGAAGAGTTTAAAGTGGGAGTGAATGAGGCATTTTTGGCATGGTATGAACAAAGCCG
ATTTAGGTTCTCAGAAGTGGATAAGAATGTAAAGGCAGATATAAAAGTAAGCTTTGAAGTGGGAGACCATGGAGATGGGTATCCTTTCCGGAAGGGATCAGGTGTTTTGG
CACATGCGTTTGGGCCTGGAGATGGGAGATTTCACTTCAATGCGGATCAATCTTTTTCAGTTCAAGTTAGATATGATAAGTATCATGTGAGGACTGTTGCACTGCATGAG
CTTGGACATAGCCTTGGTCTAGGGCACAGCAACAGTGAAGACGCCATCATGTTTCCCTCTATACCTCCTAATTTTAGTAAGGGTTTAGATATGGACGATGTTAATGGACT
GTGGGAATTATATAATGGATTTCATGATGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACGTGTTCTTCACACCGATATATCCTCCAATCTTCAAGAAAGTCGCATAGGTAACAATATCGTAGGAATCCAAGATGTCAAGCTTTACCTTCAACGTTACGGTTA
CTTAACCAACGTTGAGAGTACCAATCCTAACGTATTCGATGATCTCCTAGAGTTTGCCATTAAAATATTTCAAAAATATCACAGCCTCAACGTGAGTGGCATTTTGGATA
AGGAGACATTAACTTTAATGTCTCAGCCTCGATGCGAAGTTCCAGATATCTTGCACAATAATAATATCACTCAAAGTGCCATTCAAATAAATAGTACCAACTTTCATTCT
CATTTCACATTCTTTCCGGGAAATCCGAAGTGGCCGATTTCGAAATACCATTTGACGTACACGTTTCTCGACAATTTCCCGAATAACTTCATAGCGCCGGTGACAAACGC
AATGGAGCAATGGGGGATGTTCAGCAAGTTCACATTCTCAGCAGCTGCACGGTCTCAAACAGCGGACATCACATTTAACTTTGTGAGAGGAAACCATGGGGATGGTTATC
CATTTGAAGGAAAAGGAGGAGCTTTGGCGCATGCTTTTGGACCACCGGACGGGAGGGTGCACTTTGATGCGGACGAAAGTTGGGTGGATGGGTCCGTTAGTGGTTCGTTT
AATGTGGGGATGGTAGTGTTGCATGAGCTTGGGCATGTGCTCGGGCTTGGCCACAGCACCACTCGAGATGCCATTATGTGGCCCTACATGAACGCCGGTGATCAGACCAG
GGGCTTACAGTTTGATGATATTCAAGGCATCCAAACTTTGTATACATTATTTCACCCGATTTCGACACATGATCTCGATCACATCCATAAATCATCTCACTTTCTATTTC
CTCAACATCTTCTGGGAAGTCGTAAGGATCACAACATCGAAGGAATCCATAGCTTAAGAAAAAATTATAACATTATCGATACCAATGGCGCTCATAATAACACCTTCGAC
CACCACCTAGAATCCGCCGTAAAAAAATACCAAAAATTCTTCAAGCTTAACGAGAGTGGAATTTTAGACGTGGAGACATTGTACCAAATGTCAGAGTCCCGTTGTTCGGT
TCCCGACATATTCGAGAAGGACGACAATGAGACGAGTAAACTCCACATAGGAAGCAAGTACACATTTTTTCCCGGGAGAATAAAATGGGCGAGTTGGAAGAAATACCAAT
TAAAATACTCATTCATTCGGAATTTCCCAGAAGAGTTTAAGGAGTCGGTGAGTGCGGCGTTTATGATATGGTATGAACGCAGCCGATTTAATTTCACAGAAGTTGTTGAG
AATGAAGATGCGGATATAAGAATAAGCTTTGAGGTAGGAAACCATGGAGATTTGCATCCTTTCACGAAGGAAGTTTTGGCACATACGTTTGGGCCTGGGGATGGGAGATT
TCACTTCAATGCTGAACAATCTTTTTCTGTTGAAGTTACATATGGTAAGTATCATGTGAGAACTTTGGCACTTCATGAGCTCGGACATGCACTTGGGCTGGCGCACAGCA
CCAATGAAGATGCTATCATGTTTCCCTCTCTATCTCCTAATGTTGTTAAGGATTTAGATATGGACGATGTTAATGGACTGTGGGAATTATATGATGGATTTGATGATGCT
GATCGTAAGGGTCACAACATTGAAGGAATTCATAGCGTCAAAACGTACCTCCAACATTATGGTTACCTAAGCAAAAAATATAACACTATCGATCCCAATGGCGTTTATGA
TAACGCCTTCGATGACCACCTAGAATCATCTGTAAAAAAATATCAAAAGTTCTTCAAGCTTAACGAGAGCGGAATTTTAGACGTGGAGACATTGCGCCAAATGTCACAGC
CTCGATGTTCGGTTCCGGACATATTCGAGAACGACGACAACGAAACGAGTGTGAGGACGAGTGATCTCCACTTAAGAAGTAAGTACACATTTTTCCCAGGGAAACCAAAA
TGGCCGAGTTCGACGAAATACTCTCTAAAATACTCATTTATTAAGAATTTCCCAGAAGAGTTTAAAGTGGGAGTGAATGAGGCATTTTTGGCATGGTATGAACAAAGCCG
ATTTAGGTTCTCAGAAGTGGATAAGAATGTAAAGGCAGATATAAAAGTAAGCTTTGAAGTGGGAGACCATGGAGATGGGTATCCTTTCCGGAAGGGATCAGGTGTTTTGG
CACATGCGTTTGGGCCTGGAGATGGGAGATTTCACTTCAATGCGGATCAATCTTTTTCAGTTCAAGTTAGATATGATAAGTATCATGTGAGGACTGTTGCACTGCATGAG
CTTGGACATAGCCTTGGTCTAGGGCACAGCAACAGTGAAGACGCCATCATGTTTCCCTCTATACCTCCTAATTTTAGTAAGGGTTTAGATATGGACGATGTTAATGGACT
GTGGGAATTATATAATGGATTTCATGATGTTTAA
Protein sequenceShow/hide protein sequence
MARVLHTDISSNLQESRIGNNIVGIQDVKLYLQRYGYLTNVESTNPNVFDDLLEFAIKIFQKYHSLNVSGILDKETLTLMSQPRCEVPDILHNNNITQSAIQINSTNFHS
HFTFFPGNPKWPISKYHLTYTFLDNFPNNFIAPVTNAMEQWGMFSKFTFSAAARSQTADITFNFVRGNHGDGYPFEGKGGALAHAFGPPDGRVHFDADESWVDGSVSGSF
NVGMVVLHELGHVLGLGHSTTRDAIMWPYMNAGDQTRGLQFDDIQGIQTLYTLFHPISTHDLDHIHKSSHFLFPQHLLGSRKDHNIEGIHSLRKNYNIIDTNGAHNNTFD
HHLESAVKKYQKFFKLNESGILDVETLYQMSESRCSVPDIFEKDDNETSKLHIGSKYTFFPGRIKWASWKKYQLKYSFIRNFPEEFKESVSAAFMIWYERSRFNFTEVVE
NEDADIRISFEVGNHGDLHPFTKEVLAHTFGPGDGRFHFNAEQSFSVEVTYGKYHVRTLALHELGHALGLAHSTNEDAIMFPSLSPNVVKDLDMDDVNGLWELYDGFDDA
DRKGHNIEGIHSVKTYLQHYGYLSKKYNTIDPNGVYDNAFDDHLESSVKKYQKFFKLNESGILDVETLRQMSQPRCSVPDIFENDDNETSVRTSDLHLRSKYTFFPGKPK
WPSSTKYSLKYSFIKNFPEEFKVGVNEAFLAWYEQSRFRFSEVDKNVKADIKVSFEVGDHGDGYPFRKGSGVLAHAFGPGDGRFHFNADQSFSVQVRYDKYHVRTVALHE
LGHSLGLGHSNSEDAIMFPSIPPNFSKGLDMDDVNGLWELYNGFHDV