; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G04360 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G04360
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function, DUF547
Genome locationClcChr01:4185091..4196407
RNA-Seq ExpressionClc01G04360
SyntenyClc01G04360
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0031012 - extracellular matrix (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001818 - Peptidase M10, metallopeptidase
IPR002477 - Peptidoglycan binding-like
IPR006026 - Peptidase, metallopeptidase
IPR006869 - Domain of unknown function DUF547
IPR021190 - Peptidase M10A
IPR024079 - Metallopeptidase, catalytic domain superfamily
IPR025757 - Ternary complex factor MIP1, leucine-zipper
IPR033739 - Peptidase M10A, catalytic domain
IPR036365 - PGBD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579407.1 hypothetical protein SDJN03_23855, partial [Cucurbita argyrosperma subsp. sororia]3.9e-28789.64Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSGFDMH+RGEE ASG RELRDYLAS  VH RHRRSRSSSDRNSNV RGGVLHSNSKN RSD QASPLS SGIRA SPLHE +T+ NDNSS+K RASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKR SNAGPNSL GGK +IS GK SG AKVREA SQ K+TSL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        N AKKKA SF +VK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA
        RRLALFHKAAYNIGG IISANAIEQSIF FKSPRIGWWLETIISTALRKKSGEERQLISSKLGL S QPLVCFGLCTGASSDPVLKVYTASN+KEELE+A
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        KR+FLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW+SENVDGKLHESIQKC + +TGKKAS IIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

XP_008437070.1 PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo]4.5e-30794.99Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+IS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGASSDPVLKVYTASN+KEELE A
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKWVS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

XP_011654811.1 uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus]3.1e-30895.16Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRG  LHSNSKNDRSD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFG KS+IS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAGSFP+VKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGASSDPVLKVYTASN+KEELEVA
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKWVSENVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

XP_016903702.1 PREDICTED: uncharacterized protein LOC103482606 isoform X1 [Cucumis melo]3.0e-30391.51Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA                      QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+IS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPS
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPS

Query:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFW
        KLSEEMVRCMAFIYCSLHRVASNKA+KKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFW
Subjt:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFW

Query:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTG
        INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTG
Subjt:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTG

Query:  ASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRF
        ASSDPVLKVYTASN+KEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKWVS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRF
Subjt:  ASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRF

Query:  R
        R
Subjt:  R

XP_038874743.1 uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida]5.0e-30694.47Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSGFDMHMRGEE+AS KRELRD+LASQRVH  HRRSRSSSDRNSNVFRGGVLHS+SKNDRSD QASPLS SGIRA SPLHE S ++NDNSS+KQRASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS++S GKTSGTAKVREAFSQ KR SL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHR AS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAGSFP+VKQPQCGPVEEQFGG KAMLEIHCIST+NSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGASSDPVLKVYTASN+KEELEVA
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLERFAREASI S+EL KWVSENVDGKLHESIQKC EHRTGKKASQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

TrEMBL top hitse value%identityAlignment
A0A1S3AT95 uncharacterized protein LOC103482606 isoform X22.2e-30794.99Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+IS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA
        RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGASSDPVLKVYTASN+KEELE A
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKWVS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

A0A1S4E657 uncharacterized protein LOC103482606 isoform X11.5e-30391.51Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSG DMHMRGEE+ASGKRELRDYLASQRVH RHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLS SGIRA SPLHE+STD NDNSSTKQRASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA                      QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+IS GKTSGTAKVREAFSQ KRTSL++LKDHLFECPS
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPS

Query:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFW
        KLSEEMVRCMAFIYCSLHRVASNKA+KKAGSFP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLE VNVSKMGIDAQTAFW
Subjt:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFW

Query:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTG
        INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTG
Subjt:  INVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTG

Query:  ASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRF
        ASSDPVLKVYTASN+KEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKWVS+NVDGKL ESIQKC EHRTGKK SQIIEWLPYSSRF
Subjt:  ASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRF

Query:  R
        R
Subjt:  R

A0A6J1DMR5 uncharacterized protein LOC111021990 isoform X14.3e-28790.17Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSGFD  MRGEET +GKRELRDYLASQRVH RHRRSRSSSDRNSNVFRGGVLHSN KND+SD QASPLS SGIRA SPLHE ST  NDNSS+K RASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKT-SGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNI+ GKT SGT+KVRE  SQ KRTSL+TLKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKT-SGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVA

Query:  SNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGS
        SNKA+KK GS P+VKQPQCGP+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN YRVLVEQLE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIP  S
Subjt:  SNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGS

Query:  LRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEV
        LRRLALFHKAAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGL SPQPLVCFGLCTGASSDPVLKVYTASN+KEELEV
Subjt:  LRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEV

Query:  AKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        AKRDFLQANIVVKKSKKVFLPKVLERFAREASIS +EL K VS+NVD +LH+SIQKC +HRTGKKASQIIEWLPYSSRFR
Subjt:  AKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

A0A6J1E2A4 uncharacterized protein LOC1114301772.8e-28689.29Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSGFDMH+RGEE ASG RELRDYLAS  VH RHRRSRSSSDRNSNV RGGVLHSNSKN RSD QASPLS SGIRA SPLHE +T+ NDNS +K RASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKR SNAGPNSL GGK +IS GK SG AKVREA S  K+TSL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        N AKKKA SF +VK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA
        RRLALFHKAAYNIGG IISANAIEQSIF FKSPRIGWWLETIISTALRKKSGEERQLISSKLGL S QPLVCFGLCTGASSDPVLKVYTASN+KEELE+A
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        KR+FLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW+SENVDGKLHESIQKC + +TGKKAS IIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

A0A6J1I221 uncharacterized protein LOC1114691319.5e-28789.46Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        MSGFDMH+RGEE ASG RELRDYLAS  VH RHRRSRSSSDRNSNV RGGVLHSNSKN RSD QASPLS SGIRA SPLHE +T+ NDNSS+K RASLEN
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVN+LGKR SNAGPNSL GGK +IS  K SG AKVRE  SQ K+TSL+TLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        N AKKKA SFP+VK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLE VNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA
        RRLALFHKAAYNIGG IISANAIEQSIF FKSPRIGWWLETIISTALRKKSGEERQLISSKLGL S QPLVCFGLCTGASSDPVLKVYTASN+KEELE+A
Subjt:  RRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR
        KR+FLQANIVVKKSKKVFLPKVLERFAREASISS+ELPKW+SENVDGKLHESIQKC + +TGKKAS IIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHRTGKKASQIIEWLPYSSRFR

SwissProt top hitse value%identityAlignment
O04529 Metalloendoproteinase 2-MMP1.5e-7449.15Show/hide
Query:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI--------------
        W  F+ F     G  V+G+  +KKY  RFGY+P     NF+D FDD   +A+ LYQ    L+VTG+LD+ TI  I+ PRCG  D++              
Subjt:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI--------------

Query:  KINNNNTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV
        ++N + T +H+ +RY  F G+PRW R+   LTYA  P   +    + E++ V  R+F RWS V  LNFT S  + +SDI IGFY GDHGDGE FDGVLG 
Subjt:  KINNNNTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV

Query:  LAHAFSPENGRLHLDAAERWAVDFEQE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF
        LAHAFSP +G+ HLDA E W V  + +       AVDLESV  HEIGH+LGL HS+V+ES+MYP+++   +KVDL  DDVEGIQYLYG NPNF
Subjt:  LAHAFSPENGRLHLDAAERWAVDFEQE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF

O23507 Metalloendoproteinase 1-MMP1.2e-8952.8Show/hide
Query:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIH
        +N  W +F+R +D   GS V+G+SELK+YL+RFGY+       FSD FD    SA+ LYQ  LGL +TG+LD+ T+  +  PRCG+SD   +  NN  +H
Subjt:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIH

Query:  STRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGR
        +T  Y +FNG+P+W R  TLTYA+S  + ++YLTS +++ V RR+FS+WS+VIP++F E  D+ ++D++IGFY GDHGDG  FDGVLG LAHAF+PENGR
Subjt:  STRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGR

Query:  LHLDAAERWAVDFE-QEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSSSS
        LHLDAAE W VD + +  S+VAVDLESV THEIGH+LGL HS+ + +VMYPSL PR KKVDL +DDV G+  LYG NP  +L S  +SE SI +G+ S  
Subjt:  LHLDAAERWAVDFE-QEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSSSS

Query:  INTNFFFLLLFYLLVWVGSLFF
          +  F   + Y+L+ VG + F
Subjt:  INTNFFFLLLFYLLVWVGSLFF

Q5XF51 Metalloendoproteinase 3-MMP8.8e-6441.09Show/hide
Query:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYL-PIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTR
        W +F  F     G + +G+  LK+Y   FGY+       NF+D FDD   +A+ +YQ    L+VTG LD  T+  ++ PRCG  D++   N  +T+HS R
Subjt:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYL-PIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTR

Query:  R------------------YAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFD
        +                  Y+FF G+PRW R+   LTYA  P   +    + E++ V  R+F+RW  V PL FT    + +SDI IGFY G+HGDGE FD
Subjt:  R------------------YAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFD

Query:  GVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKV----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFK
        G +  LAHAFSP  G  HLD  E W V  E     +    AVDLESV  HEIGH+LGL HS+V+ S+MYP++    +KVDL  DDVEG+QYLYG NPNF 
Subjt:  GVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKV----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFK

Query:  -LKSFLESEKSINSGSS--------SSSINTN---FFFLLLFYLLVWV
          +S   S +  ++G S        S S+ TN   ++F ++F L +++
Subjt:  -LKSFLESEKSINSGSS--------SSSINTN---FFFLLLFYLLVWV

Q8GWW6 Metalloendoproteinase 4-MMP9.1e-8556.73Show/hide
Query:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRS--STLTY
        E+K++L ++GYL   PQN  SD  D  F  AL+ YQ  LGL +TGK DS+T++ I+ PRCG  D   +       H+ ++Y +F G+PRW R     LTY
Subjt:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRS--STLTY

Query:  ALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKVAV
        A S +    YL  ++IR+V RR+F +W++VIP++F E+ DY  +DI+IGF+ GDHGDGE FDGVLGVLAH FSPENGRLHLD AE WAVDF++EKS VAV
Subjt:  ALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKVAV

Query:  DLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSS
        DLESV  HEIGHVLGL HS+VK++ MYP+L PR KKV+L +DDV G+Q LYGTNPNF L S L SE S N    S
Subjt:  DLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSS

Q9ZUJ5 Metalloendoproteinase 5-MMP1.4e-6943.77Show/hide
Query:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA
        P S++F T    +P    L+A  N  AW  F++      G  +NG+S+LK+Y  RFGY  I    N +D FDD   SA+  YQ    L VTGKLDS T+ 
Subjt:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA

Query:  SIMSPRCGMSDLI---KINNNNTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY
         I+ PRCG  DLI      N    + +T +Y+FF G+PRW  R   LTYA +P   +    + E+++V  R+F+RW+ V PLNFT S     +DI IGF+
Subjt:  SIMSPRCGMSDLI---KINNNNTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY

Query:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG
         G+HGDGE FDG +G LAHA SP  G LHLD  E W +   +   ++      VDLESV  HEIGH+LGL HS+V++++M+P++S   +KV+L  DD+EG
Subjt:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG

Query:  IQYLYGTNPNFKLKSFLESEKSINSGSSS
        IQ+LYG NPN        S +S ++G  S
Subjt:  IQYLYGTNPNFKLKSFLESEKSINSGSSS

Arabidopsis top hitse value%identityAlignment
AT1G59970.1 Matrixin family protein1.0e-7043.77Show/hide
Query:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA
        P S++F T    +P    L+A  N  AW  F++      G  +NG+S+LK+Y  RFGY  I    N +D FDD   SA+  YQ    L VTGKLDS T+ 
Subjt:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA

Query:  SIMSPRCGMSDLI---KINNNNTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY
         I+ PRCG  DLI      N    + +T +Y+FF G+PRW  R   LTYA +P   +    + E+++V  R+F+RW+ V PLNFT S     +DI IGF+
Subjt:  SIMSPRCGMSDLI---KINNNNTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY

Query:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG
         G+HGDGE FDG +G LAHA SP  G LHLD  E W +   +   ++      VDLESV  HEIGH+LGL HS+V++++M+P++S   +KV+L  DD+EG
Subjt:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG

Query:  IQYLYGTNPNFKLKSFLESEKSINSGSSS
        IQ+LYG NPN        S +S ++G  S
Subjt:  IQYLYGTNPNFKLKSFLESEKSINSGSSS

AT1G70170.1 matrix metalloproteinase1.0e-7549.15Show/hide
Query:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI--------------
        W  F+ F     G  V+G+  +KKY  RFGY+P     NF+D FDD   +A+ LYQ    L+VTG+LD+ TI  I+ PRCG  D++              
Subjt:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI--------------

Query:  KINNNNTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV
        ++N + T +H+ +RY  F G+PRW R+   LTYA  P   +    + E++ V  R+F RWS V  LNFT S  + +SDI IGFY GDHGDGE FDGVLG 
Subjt:  KINNNNTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV

Query:  LAHAFSPENGRLHLDAAERWAVDFEQE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF
        LAHAFSP +G+ HLDA E W V  + +       AVDLESV  HEIGH+LGL HS+V+ES+MYP+++   +KVDL  DDVEGIQYLYG NPNF
Subjt:  LAHAFSPENGRLHLDAAERWAVDFEQE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF

AT2G45040.1 Matrixin family protein6.4e-8656.73Show/hide
Query:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRS--STLTY
        E+K++L ++GYL   PQN  SD  D  F  AL+ YQ  LGL +TGK DS+T++ I+ PRCG  D   +       H+ ++Y +F G+PRW R     LTY
Subjt:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRS--STLTY

Query:  ALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKVAV
        A S +    YL  ++IR+V RR+F +W++VIP++F E+ DY  +DI+IGF+ GDHGDGE FDGVLGVLAH FSPENGRLHLD AE WAVDF++EKS VAV
Subjt:  ALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKVAV

Query:  DLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSS
        DLESV  HEIGHVLGL HS+VK++ MYP+L PR KKV+L +DDV G+Q LYGTNPNF L S L SE S N    S
Subjt:  DLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSS

AT4G16640.1 Matrixin family protein8.7e-9152.8Show/hide
Query:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIH
        +N  W +F+R +D   GS V+G+SELK+YL+RFGY+       FSD FD    SA+ LYQ  LGL +TG+LD+ T+  +  PRCG+SD   +  NN  +H
Subjt:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINNNNTTIH

Query:  STRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGR
        +T  Y +FNG+P+W R  TLTYA+S  + ++YLTS +++ V RR+FS+WS+VIP++F E  D+ ++D++IGFY GDHGDG  FDGVLG LAHAF+PENGR
Subjt:  STRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGR

Query:  LHLDAAERWAVDFE-QEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSSSS
        LHLDAAE W VD + +  S+VAVDLESV THEIGH+LGL HS+ + +VMYPSL PR KKVDL +DDV G+  LYG NP  +L S  +SE SI +G+ S  
Subjt:  LHLDAAERWAVDFE-QEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINSGSSSSS

Query:  INTNFFFLLLFYLLVWVGSLFF
          +  F   + Y+L+ VG + F
Subjt:  INTNFFFLLLFYLLVWVGSLFF

AT5G47380.1 Protein of unknown function, DUF5472.5e-14651.35Show/hide
Query:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN
        M GFD++  G +    +R   D       H   R   +SS+R+ +    G   S S N+ +  QAS +  +  + P PLH       +N S+  RASLE 
Subjt:  MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHG-KHESRKH-
        D+E L LRLQQE+SMR +LERAMGRASS+LSPGHRH A Q  +LI+EIELLE EV NRE HVLSLYRSIFE  VS+  S+Q+S  +SPAH  K   RK  
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHG-KHESRKH-

Query:  PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQ----AKRTSLQTLKDHLFECPSKLSEEMVRCMAFIY
        P++IS+AFCSS  FPL P     ++ D  ++TS    +S F  + N     TS +++ +  F +     K  S +TLKDHL++CP+KLSE+MV+CM+ +Y
Subjt:  PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQ----AKRTSLQTLKDHLFECPSKLSEEMVRCMAFIY

Query:  ----CSLHRVASNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMH
            CS       K      S   V  P+    E++    ++M+E+  IS++  +FS+ +YAINNYR+LVEQLE V +++M  +A+ AFWIN+YNALLMH
Subjt:  ----CSLHRVASNKAKKKAGSFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMH

Query:  AYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVY
        AYLAYG+P  SLRRLALFHK+AYNIGGHII+AN IE SIF F++PR G WLETIISTALRKK  E++  + S   L  P+PLVCF LC GA SDPVLK Y
Subjt:  AYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLETIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVY

Query:  TASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHR-TGKKASQIIEWLPYSSRFR
        TASN+KEEL+ +KR+FL AN+VVK  KKV LPK++ERF +EAS+S ++L +W+ +N D KL ESIQKC + +   KKASQ++EWLPYSS+FR
Subjt:  TASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLHESIQKCTEHR-TGKKASQIIEWLPYSSRFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAACAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTACCGCCATAGGCGATCTAG
AAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACACACAGGCATCCCCACTTTCTGCAAGTGGTATCA
GAGCACCAAGTCCACTACATGAGAGGTCTACAGATATCAATGATAATTCATCAACTAAACAGCGAGCGTCTTTGGAAAATGATATTGAGTTGCTACAGCTGCGCTTGCAA
CAAGAGAGATCTATGCGAAGTATGCTAGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGACGAAGGATTTGATCTCAGAAAT
TGAATTACTTGAAGAAGAGGTCGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGAAGTATTTTTGAAAATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTCA
CAGCCTCTCCAGCTCATGGGAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCGAGGAAGTTTCCTTTGGGACCTTTGCAACCTTTCTCT
GTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTTGGAGGTAAAAGCAACATAAGTATGGGGAAAACTTCAGGCACTGCAAAGGTTCGTGAAGC
CTTTTCGCAGGCGAAGAGAACTTCTCTGCAAACTCTAAAAGACCATCTTTTTGAGTGTCCAAGTAAATTATCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCT
CTCTTCATAGAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTCCTTTCCTGAAGTTAAACAGCCCCAATGTGGACCGGTGGAGGAACAATTTGGGGGTGGGAAAGCA
ATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAAAATGTGAATGTCAG
TAAGATGGGGATCGATGCTCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTTATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTAAGAAGGTTGG
CTTTGTTCCATAAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAATAGGATGGTGGCTTGAA
ACTATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTAATCTCTTCAAAATTGGGCCTTCATAGTCCTCAACCTCTTGTTTGCTTTGGCCTCTGCAC
TGGTGCCTCTTCAGATCCTGTGCTGAAAGTGTACACTGCATCAAATATTAAAGAGGAACTGGAGGTGGCCAAAAGGGATTTTCTCCAAGCAAATATAGTTGTGAAGAAGT
CAAAGAAAGTATTCCTACCAAAGGTGCTCGAGAGATTTGCACGTGAAGCATCCATCAGCTCAGAAGAACTCCCGAAATGGGTTTCTGAAAATGTCGACGGAAAACTCCAC
GAGTCCATACAGAAATGTACGGAACATCGGACTGGCAAGAAGGCATCTCAGATCATTGAGTGGCTACCTTACAGCTCAAGGTTCCGAACTTTGCCGGACTTCACCACTCT
CGACGCCGATAACAACAACTACGCATGGAGAAACTTCGCTCGGTTCCTCGACGCTGGCAAAGGCAGCGAAGTGAACGGCATGTCGGAACTGAAGAAATATTTGAATCGAT
TCGGTTACCTTCCGATTCCTCCTCAAAACAACTTCTCCGATTTCTTCGACGATCAATTCGTATCGGCGTTGATTCTCTATCAGAATCGCTTAGGTTTATCAGTCACTGGA
AAACTCGATTCCGAAACAATCGCAAGCATCATGTCGCCTAGATGCGGAATGAGTGACCTAATTAAAATCAACAACAACAACACAACAATTCACTCAACGCGTCGATACGC
TTTCTTCAACGGCCAACCGAGATGGATTCGATCCTCAACTCTAACGTACGCTCTCTCACCAGATTACACAATCGAATACCTAACTTCATCAGAAATCCGCAAGGTCGTTC
GACGATCGTTCTCGCGGTGGTCCGCAGTGATTCCGTTGAACTTCACCGAATCCTCTGATTACGAATCGTCAGATATCCGAATCGGATTCTACCGCGGCGATCACGGCGAT
GGAGAGGCGTTCGACGGAGTATTAGGCGTTTTGGCACACGCGTTTTCACCGGAAAACGGAAGGCTGCACTTAGATGCGGCGGAGCGCTGGGCGGTGGATTTCGAGCAGGA
GAAATCGAAAGTAGCTGTGGATTTGGAATCGGTTGTAACGCATGAGATAGGGCATGTTCTTGGACTGGCTCATTCCGCAGTGAAGGAATCTGTGATGTATCCAAGTTTGA
GTCCGCGAGGGAAGAAAGTGGACCTTAGGATCGATGACGTAGAAGGAATTCAGTATTTGTATGGTACGAACCCTAATTTCAAATTGAAATCCTTCTTGGAATCCGAAAAA
TCTATCAACAGTGGATCATCATCTTCTTCAATCAACACTAATTTCTTCTTCTTACTGTTATTCTACTTGTTGGTTTGGGTTGGGTCTCTGTTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAACAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTACCGCCATAGGCGATCTAG
AAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACACACAGGCATCCCCACTTTCTGCAAGTGGTATCA
GAGCACCAAGTCCACTACATGAGAGGTCTACAGATATCAATGATAATTCATCAACTAAACAGCGAGCGTCTTTGGAAAATGATATTGAGTTGCTACAGCTGCGCTTGCAA
CAAGAGAGATCTATGCGAAGTATGCTAGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGACGAAGGATTTGATCTCAGAAAT
TGAATTACTTGAAGAAGAGGTCGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGAAGTATTTTTGAAAATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCAGTCA
CAGCCTCTCCAGCTCATGGGAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCGAGGAAGTTTCCTTTGGGACCTTTGCAACCTTTCTCT
GTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTTGGAGGTAAAAGCAACATAAGTATGGGGAAAACTTCAGGCACTGCAAAGGTTCGTGAAGC
CTTTTCGCAGGCGAAGAGAACTTCTCTGCAAACTCTAAAAGACCATCTTTTTGAGTGTCCAAGTAAATTATCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCT
CTCTTCATAGAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTCCTTTCCTGAAGTTAAACAGCCCCAATGTGGACCGGTGGAGGAACAATTTGGGGGTGGGAAAGCA
ATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAAAATGTGAATGTCAG
TAAGATGGGGATCGATGCTCAAACTGCATTCTGGATTAATGTGTATAATGCTCTTCTTATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTAAGAAGGTTGG
CTTTGTTCCATAAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTTCTTCAAATCTCCCCGAATAGGATGGTGGCTTGAA
ACTATCATTTCAACTGCGCTGAGGAAGAAGTCTGGGGAAGAAAGGCAACTAATCTCTTCAAAATTGGGCCTTCATAGTCCTCAACCTCTTGTTTGCTTTGGCCTCTGCAC
TGGTGCCTCTTCAGATCCTGTGCTGAAAGTGTACACTGCATCAAATATTAAAGAGGAACTGGAGGTGGCCAAAAGGGATTTTCTCCAAGCAAATATAGTTGTGAAGAAGT
CAAAGAAAGTATTCCTACCAAAGGTGCTCGAGAGATTTGCACGTGAAGCATCCATCAGCTCAGAAGAACTCCCGAAATGGGTTTCTGAAAATGTCGACGGAAAACTCCAC
GAGTCCATACAGAAATGTACGGAACATCGGACTGGCAAGAAGGCATCTCAGATCATTGAGTGGCTACCTTACAGCTCAAGGTTCCGAACTTTGCCGGACTTCACCACTCT
CGACGCCGATAACAACAACTACGCATGGAGAAACTTCGCTCGGTTCCTCGACGCTGGCAAAGGCAGCGAAGTGAACGGCATGTCGGAACTGAAGAAATATTTGAATCGAT
TCGGTTACCTTCCGATTCCTCCTCAAAACAACTTCTCCGATTTCTTCGACGATCAATTCGTATCGGCGTTGATTCTCTATCAGAATCGCTTAGGTTTATCAGTCACTGGA
AAACTCGATTCCGAAACAATCGCAAGCATCATGTCGCCTAGATGCGGAATGAGTGACCTAATTAAAATCAACAACAACAACACAACAATTCACTCAACGCGTCGATACGC
TTTCTTCAACGGCCAACCGAGATGGATTCGATCCTCAACTCTAACGTACGCTCTCTCACCAGATTACACAATCGAATACCTAACTTCATCAGAAATCCGCAAGGTCGTTC
GACGATCGTTCTCGCGGTGGTCCGCAGTGATTCCGTTGAACTTCACCGAATCCTCTGATTACGAATCGTCAGATATCCGAATCGGATTCTACCGCGGCGATCACGGCGAT
GGAGAGGCGTTCGACGGAGTATTAGGCGTTTTGGCACACGCGTTTTCACCGGAAAACGGAAGGCTGCACTTAGATGCGGCGGAGCGCTGGGCGGTGGATTTCGAGCAGGA
GAAATCGAAAGTAGCTGTGGATTTGGAATCGGTTGTAACGCATGAGATAGGGCATGTTCTTGGACTGGCTCATTCCGCAGTGAAGGAATCTGTGATGTATCCAAGTTTGA
GTCCGCGAGGGAAGAAAGTGGACCTTAGGATCGATGACGTAGAAGGAATTCAGTATTTGTATGGTACGAACCCTAATTTCAAATTGAAATCCTTCTTGGAATCCGAAAAA
TCTATCAACAGTGGATCATCATCTTCTTCAATCAACACTAATTTCTTCTTCTTACTGTTATTCTACTTGTTGGTTTGGGTTGGGTCTCTGTTTTTCTGA
Protein sequenceShow/hide protein sequence
MSGFDMHMRGEETASGKRELRDYLASQRVHYRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDTQASPLSASGIRAPSPLHERSTDINDNSSTKQRASLENDIELLQLRLQ
QERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFS
VNDLGKRTSNAGPNSLFGGKSNISMGKTSGTAKVREAFSQAKRTSLQTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGSFPEVKQPQCGPVEEQFGGGKA
MLEIHCISTNNSQFSRASYAINNYRVLVEQLENVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKAAYNIGGHIISANAIEQSIFFFKSPRIGWWLE
TIISTALRKKSGEERQLISSKLGLHSPQPLVCFGLCTGASSDPVLKVYTASNIKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSEELPKWVSENVDGKLH
ESIQKCTEHRTGKKASQIIEWLPYSSRFRTLPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTG
KLDSETIASIMSPRCGMSDLIKINNNNTTIHSTRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGD
GEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFEQEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEK
SINSGSSSSSINTNFFFLLLFYLLVWVGSLFF