; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10009299 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10009299
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function, DUF547
Genome locationChr06:4555341..4567256
RNA-Seq ExpressionHG10009299
SyntenyHG10009299
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0031012 - extracellular matrix (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001818 - Peptidase M10, metallopeptidase
IPR002477 - Peptidoglycan binding-like
IPR006026 - Peptidase, metallopeptidase
IPR006869 - Domain of unknown function DUF547
IPR021190 - Peptidase M10A
IPR024079 - Metallopeptidase, catalytic domain superfamily
IPR025757 - Ternary complex factor MIP1, leucine-zipper
IPR033739 - Peptidase M10A, catalytic domain
IPR036365 - PGBD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579407.1 hypothetical protein SDJN03_23855, partial [Cucurbita argyrosperma subsp. sororia]4.0e-26885.49Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFDMH+RGEE ASG RELRDYLAS  VH+RHRRSRSSSDRNSNV RGGVLHSNSKN RSD QASPLSTSGIRA+SPLHE + +FNDNS +K RASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKR SNAGP+SL GGK DIS+ K SG AKVREA SQ K+TSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        N AKKKA  F +VK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
        RRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPS QPLVCFGLCTGASSDPVLKVYTASNVKEELE+A
Subjt:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        KR+FLQANIVVKKSKKVFLPKVLER AREASISSDELPKW+SENVDGKLHESIQKCM+ +TGKKAS IIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

XP_008437070.1 PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo]2.5e-28690.67Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS+ KTSGTAKVREAFSQ KRTSLR+LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
        RRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELE A
Subjt:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGKL ESIQKCMEHRTGKK SQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

XP_011654811.1 uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus]1.7e-28790.85Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRG  LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFG KSDIS+ KTSGTAKVREAFSQ KRTSLR+LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAG FP+VKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
        RRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
Subjt:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVSENVDGKL ESIQKCMEHRTGKK SQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

XP_016903702.1 PREDICTED: uncharacterized protein LOC103482606 isoform X1 [Cucumis melo]1.7e-28287.35Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA                      QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS+ KTSGTAKVREAFSQ KRTSLR+LKDHLFECPS
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPS

Query:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
        KLSEEMVRCMAFIYCSLHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
Subjt:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW

Query:  INVYNALLMHAYLAYGIPHGSLRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG
        INVYNALLMHAYLAYGIPHGSLRRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG
Subjt:  INVYNALLMHAYLAYGIPHGSLRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG

Query:  ASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRF
        ASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGKL ESIQKCMEHRTGKK SQIIEWLPYSSRF
Subjt:  ASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRF

Query:  R
        R
Subjt:  R

XP_038874743.1 uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida]6.1e-28590.16Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFDMHMRGEE AS KRELRD+LASQRVHS HRRSRSSSDRNSNVFRGGVLHS+SKNDRSD QASPLSTSGIRA+SPLHE S + NDNS +KQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSD+S+ KTSGTAKVREAFSQ KR SLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHR AS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAG FP+VKQPQCGPVEEQFGG KAMLEIHCIST+NSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
        RRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
Subjt:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLER AREASI SDEL KWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

TrEMBL top hitse value%identityAlignment
A0A1S3AT95 uncharacterized protein LOC103482606 isoform X21.2e-28690.67Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS+ KTSGTAKVREAFSQ KRTSLR+LKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        NKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
        RRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELE A
Subjt:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        KRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGKL ESIQKCMEHRTGKK SQIIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

A0A1S4E657 uncharacterized protein LOC103482606 isoform X18.1e-28387.35Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSG DMHMRGEE ASGKRELRDYLASQRVHSRHRRSRSSSD+NSN FRGG LHSNSKNDRSD QASPLSTSGIRA+SPLHE+S DFNDNS TKQRASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA                      QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPS
        QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKSDIS+ KTSGTAKVREAFSQ KRTSLR+LKDHLFECPS
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPS

Query:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
        KLSEEMVRCMAFIYCSLHRVASNKA+KKAG FP+VKQPQ  PVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW
Subjt:  KLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFW

Query:  INVYNALLMHAYLAYGIPHGSLRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG
        INVYNALLMHAYLAYGIPHGSLRRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG
Subjt:  INVYNALLMHAYLAYGIPHGSLRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG

Query:  ASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRF
        ASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLER AREASISSDELPKWVS+NVDGKL ESIQKCMEHRTGKK SQIIEWLPYSSRF
Subjt:  ASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRF

Query:  R
        R
Subjt:  R

A0A6J1DMR5 uncharacterized protein LOC111021990 isoform X14.0e-26685.69Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFD  MRGEE  +GKRELRDYLASQRVH+RHRRSRSSSDRNSNVFRGGVLHSN KND+SD QASPLSTSGIRAQSPLHE S  FNDNS +K RASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKT-SGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGP+SLFGGKS+I++ KT SGT+KVRE  SQ KRTSLRTLKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKT-SGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVA

Query:  SNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGS
        SNKA+KK G  P+VKQPQCGP+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN YRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIP  S
Subjt:  SNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGS

Query:  LRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEV
        LRRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEV
Subjt:  LRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEV

Query:  AKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        AKRDFLQANIVVKKSKKVFLPKVLER AREASIS DEL K VS+NVD +LH+SIQKCM+HRTGKKASQIIEWLPYSSRFR
Subjt:  AKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

A0A6J1E2A4 uncharacterized protein LOC1114301771.2e-26785.32Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFDMH+RGEE ASG RELRDYLAS  VH+RHRRSRSSSDRNSNV RGGVLHSNSKN RSD QASPLSTSGIRA+SPLHE + +FNDNS +K RASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVNDLGKR SNAGP+SL GGK DIS+ K SG AKVREA S  K+TSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        N AKKKA  F +VK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
        RRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPS QPLVCFGLCTGASSDPVLKVYTASNVKEELE+A
Subjt:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        KR+FLQANIVVKKSKKVFLPKVLER AREASISSDELPKW+SENVDGKLHESIQKCM+ +TGKKAS IIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

A0A6J1I221 uncharacterized protein LOC1114691312.5e-26885.32Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        MSGFDMH+RGEE ASG RELRDYLAS  VH+RHRRSRSSSDRNSNV RGGVLHSNSKN RSD QASPLSTSGIRA+SPLHE + +FNDNS +K RASLEN
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
        ISSAFCSSRKFPLGPLQPFSVN+LGKR SNAGP+SL GGK DIS+ K SG AKVRE  SQ K+TSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS
Subjt:  ISSAFCSSRKFPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVAS

Query:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL
        N AKKKA  FP+VK+P+ GPVEEQ G  KAMLEIHCISTNN+QFSRASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPHGSL
Subjt:  NKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSL

Query:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA
        RRLALFHK                             WLETIISTALRKKSGEERQLISSKLGLPS QPLVCFGLCTGASSDPVLKVYTASNVKEELE+A
Subjt:  RRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVA

Query:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR
        KR+FLQANIVVKKSKKVFLPKVLER AREASISSDELPKW+SENVDGKLHESIQKC++ +TGKKAS IIEWLPYSSRFR
Subjt:  KRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHRTGKKASQIIEWLPYSSRFR

SwissProt top hitse value%identityAlignment
O04529 Metalloendoproteinase 2-MMP1.2e-7349.49Show/hide
Query:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI-----------KIN
        W  F+ F     G  V+G+  +KKY  RFGY+P     NF+D FDD   +A+ LYQ    L+VTG+LD+ TI  I+ PRCG  D++           K  
Subjt:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI-----------KIN

Query:  SNTTTTTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV
            + T +H+ +RY  F G+PRW R+   LTYA  P   +    + E++ V  R+F RWS V  LNFT S  + +SDI IGFY GDHGDGE FDGVLG 
Subjt:  SNTTTTTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV

Query:  LAHAFSPENGRLHLDAAERWAVDFDKE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF
        LAHAFSP +G+ HLDA E W V  D +       AVDLESV  HEIGH+LGL HS+V+ES+MYP+++   +KVDL  DDVEGIQYLYG NPNF
Subjt:  LAHAFSPENGRLHLDAAERWAVDFDKE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF

O23507 Metalloendoproteinase 1-MMP9.4e-8752.47Show/hide
Query:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTT
        +N  W +F+R +D   GS V+G+SELK+YL+RFGY+       FSD FD    SA+ LYQ  LGL +TG+LD+ T+  +  PRCG+SD       T    
Subjt:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTT

Query:  TIHSTRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPE
         +H+T  Y +FNG+P+W R  TLTYA+S  + ++YLTS +++ V RR+FS+WS+VIP++F E  D+ ++D++IGFY GDHGDG  FDGVLG LAHAF+PE
Subjt:  TIHSTRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPE

Query:  NGRLHLDAAERWAVDFD-KEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSS
        NGRLHLDAAE W VD D K  S+VAVDLESV THEIGH+LGL HS+ + +VMYPSL PR KKVDL +DDV G+  LYG NP  +L S  +SE SI  G+ 
Subjt:  NGRLHLDAAERWAVDFD-KEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSS

Query:  SSINTNLFFLFFFYLFILVWSLFF
        S  +  L   F  Y+ ++V  + F
Subjt:  SSINTNLFFLFFFYLFILVWSLFF

Q5XF51 Metalloendoproteinase 3-MMP1.6e-6241Show/hide
Query:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYL-PIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIK----INSNTTTT
        W +F  F     G + +G+  LK+Y   FGY+       NF+D FDD   +A+ +YQ    L+VTG LD  T+  ++ PRCG  D++     ++S   T 
Subjt:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYL-PIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIK----INSNTTTT

Query:  TT--------IHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVL
                   H+ + Y+FF G+PRW R+   LTYA  P   +    + E++ V  R+F+RW  V PL FT    + +SDI IGFY G+HGDGE FDG +
Subjt:  TT--------IHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVL

Query:  GVLAHAFSPENGRLHLDAAERWAVDFDKEKSKV----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFK-LK
          LAHAFSP  G  HLD  E W V  +     +    AVDLESV  HEIGH+LGL HS+V+ S+MYP++    +KVDL  DDVEG+QYLYG NPNF   +
Subjt:  GVLAHAFSPENGRLHLDAAERWAVDFDKEKSKV----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFK-LK

Query:  SFLESEKSINAGSSS---------SINTNLFFLFFFYLF
        S   S +  + G S          S+ TNL   +F+ +F
Subjt:  SFLESEKSINAGSSS---------SINTNLFFLFFFYLF

Q8GWW6 Metalloendoproteinase 4-MMP8.0e-8656.38Show/hide
Query:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRWIRS--ST
        E+K++L ++GYL   PQN  SD  D  F  AL+ YQ  LGL +TGK DS+T++ I+ PRCG  D ++       T   H+ ++Y +F G+PRW R     
Subjt:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRWIRS--ST

Query:  LTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSK
        LTYA S +    YL  ++IR+V RR+F +W++VIP++F E+ DY  +DI+IGF+ GDHGDGE FDGVLGVLAH FSPENGRLHLD AE WAVDFD+EKS 
Subjt:  LTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSK

Query:  VAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSSSSINT
        VAVDLESV  HEIGHVLGL HS+VK++ MYP+L PR KKV+L +DDV G+Q LYGTNPNF L S L SE S N    S I +
Subjt:  VAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSSSSINT

Q9ZUJ5 Metalloendoproteinase 5-MMP8.0e-7043.77Show/hide
Query:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA
        P S++F T    +P    L+A  N  AW  F++      G  +NG+S+LK+Y  RFGY  I    N +D FDD   SA+  YQ    L VTGKLDS T+ 
Subjt:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA

Query:  SIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY
         I+ PRCG  DLI   S       + +T +Y+FF G+PRW  R   LTYA +P   +    + E+++V  R+F+RW+ V PLNFT S     +DI IGF+
Subjt:  SIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY

Query:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG
         G+HGDGE FDG +G LAHA SP  G LHLD  E W +   +   ++      VDLESV  HEIGH+LGL HS+V++++M+P++S   +KV+L  DD+EG
Subjt:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG

Query:  IQYLYGTNPNFKLKSFLESEKSINAGSSS
        IQ+LYG NPN        S +S + G  S
Subjt:  IQYLYGTNPNFKLKSFLESEKSINAGSSS

Arabidopsis top hitse value%identityAlignment
AT1G59970.1 Matrixin family protein5.7e-7143.77Show/hide
Query:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA
        P S++F T    +P    L+A  N  AW  F++      G  +NG+S+LK+Y  RFGY  I    N +D FDD   SA+  YQ    L VTGKLDS T+ 
Subjt:  PYSSRFRT----LPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIA

Query:  SIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY
         I+ PRCG  DLI   S       + +T +Y+FF G+PRW  R   LTYA +P   +    + E+++V  R+F+RW+ V PLNFT S     +DI IGF+
Subjt:  SIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRW-IRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFY

Query:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG
         G+HGDGE FDG +G LAHA SP  G LHLD  E W +   +   ++      VDLESV  HEIGH+LGL HS+V++++M+P++S   +KV+L  DD+EG
Subjt:  RGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSKV-----AVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEG

Query:  IQYLYGTNPNFKLKSFLESEKSINAGSSS
        IQ+LYG NPN        S +S + G  S
Subjt:  IQYLYGTNPNFKLKSFLESEKSINAGSSS

AT1G70170.1 matrix metalloproteinase8.5e-7549.49Show/hide
Query:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI-----------KIN
        W  F+ F     G  V+G+  +KKY  RFGY+P     NF+D FDD   +A+ LYQ    L+VTG+LD+ TI  I+ PRCG  D++           K  
Subjt:  WRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLI-----------KIN

Query:  SNTTTTTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV
            + T +H+ +RY  F G+PRW R+   LTYA  P   +    + E++ V  R+F RWS V  LNFT S  + +SDI IGFY GDHGDGE FDGVLG 
Subjt:  SNTTTTTTIHSTRRYAFFNGQPRWIRS-STLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGV

Query:  LAHAFSPENGRLHLDAAERWAVDFDKE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF
        LAHAFSP +G+ HLDA E W V  D +       AVDLESV  HEIGH+LGL HS+V+ES+MYP+++   +KVDL  DDVEGIQYLYG NPNF
Subjt:  LAHAFSPENGRLHLDAAERWAVDFDKE---KSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNF

AT2G45040.1 Matrixin family protein5.7e-8756.38Show/hide
Query:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRWIRS--ST
        E+K++L ++GYL   PQN  SD  D  F  AL+ YQ  LGL +TGK DS+T++ I+ PRCG  D ++       T   H+ ++Y +F G+PRW R     
Subjt:  ELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRWIRS--ST

Query:  LTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSK
        LTYA S +    YL  ++IR+V RR+F +W++VIP++F E+ DY  +DI+IGF+ GDHGDGE FDGVLGVLAH FSPENGRLHLD AE WAVDFD+EKS 
Subjt:  LTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSK

Query:  VAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSSSSINT
        VAVDLESV  HEIGHVLGL HS+VK++ MYP+L PR KKV+L +DDV G+Q LYGTNPNF L S L SE S N    S I +
Subjt:  VAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSSSSINT

AT4G16640.1 Matrixin family protein6.7e-8852.47Show/hide
Query:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTT
        +N  W +F+R +D   GS V+G+SELK+YL+RFGY+       FSD FD    SA+ LYQ  LGL +TG+LD+ T+  +  PRCG+SD       T    
Subjt:  NNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVTGKLDSETIASIMSPRCGMSDLIKINSNTTTTT

Query:  TIHSTRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPE
         +H+T  Y +FNG+P+W R  TLTYA+S  + ++YLTS +++ V RR+FS+WS+VIP++F E  D+ ++D++IGFY GDHGDG  FDGVLG LAHAF+PE
Subjt:  TIHSTRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRIGFYRGDHGDGEAFDGVLGVLAHAFSPE

Query:  NGRLHLDAAERWAVDFD-KEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSS
        NGRLHLDAAE W VD D K  S+VAVDLESV THEIGH+LGL HS+ + +VMYPSL PR KKVDL +DDV G+  LYG NP  +L S  +SE SI  G+ 
Subjt:  NGRLHLDAAERWAVDFD-KEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGTNPNFKLKSFLESEKSINAGSS

Query:  SSINTNLFFLFFFYLFILVWSLFF
        S  +  L   F  Y+ ++V  + F
Subjt:  SSINTNLFFLFFFYLFILVWSLFF

AT5G47380.1 Protein of unknown function, DUF5471.6e-12948.31Show/hide
Query:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN
        M GFD++  G +    +R   D       H   R   +SS+R+ +    G   S S N+ +  QAS + T+  +   PLH       +N  +  RASLE 
Subjt:  MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHG-KHESRKH-
        D+E L LRLQQE+SMR +LERAMGRASS+LSPGHRH A Q  +LI+EIELLE EV NRE HVLSLYRSIFE  VS+  S+Q+S  +SPAH  K   RK  
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHG-KHESRKH-

Query:  PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQ----AKRTSLRTLKDHLFECPSKLSEEMVRCMAFIY
        P++IS+AFCSS  FPL P     ++ D  ++TS    SS F  ++ I S  TS +++ +  F +     K  S RTLKDHL++CP+KLSE+MV+CM+ +Y
Subjt:  PSIISSAFCSSRKFPLGPLQPF-SVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQ----AKRTSLRTLKDHLFECPSKLSEEMVRCMAFIY

Query:  ----CSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMH
            CS       K          V  P+    E++    ++M+E+  IS++  +FS+ +YAINNYR+LVEQLE+V +++M  +A+ AFWIN+YNALLMH
Subjt:  ----CSLHRVASNKAKKKAGFFPEVKQPQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMH

Query:  AYLAYGIPHGSLRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVY
        AYLAYG+P  SLRRLALFHK                             WLETIISTALRKK  E++  + S   L  P+PLVCF LC GA SDPVLK Y
Subjt:  AYLAYGIPHGSLRRLALFHK-----------------------------WLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVY

Query:  TASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHR-TGKKASQIIEWLPYSSRFR
        TASNVKEEL+ +KR+FL AN+VVK  KKV LPK++ER  +EAS+S D+L +W+ +N D KL ESIQKC++ +   KKASQ++EWLPYSS+FR
Subjt:  TASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQKCMEHR-TGKKASQIIEWLPYSSRFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAATAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTCCCGCCATAGGCGA
TCCAGAAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACCCACAAGCATCCCCACTTTCTACG
AGCGGTATCAGAGCACAAAGTCCTCTACATGAAAGGTCTGCAGATTTCAATGATAATTCCCCAACTAAACAGCGAGCGTCTTTGGAAAATGATATTGAGTTGCTA
CAGCTGCGCTTGCAACAAGAGAGATCTATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGACG
AAGGATTTGATCTCAGAAATTGAATTGCTTGAAGAAGAGGTTGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGGAGTATTTTTGAAAATTGTGTTAGTAAG
CCATCTTCTCAGCAAAATTCAGTCACAGCCTCTCCTGCTCATGGAAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCGAGGAAG
TTTCCCTTGGGACCTTTGCAACCTTTCTCTGTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAGTTCCTTGTTTGGAGGTAAAAGCGACATAAGTTCC
AGAAAGACTTCAGGCACTGCAAAGGTTCGTGAAGCCTTTTCGCAGGCGAAGAGAACTTCTCTGCGAACTCTAAAAGATCATCTTTTTGAGTGTCCAAGTAAATTA
TCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCTCTCTTCATAGAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTTCTTCCCTGAAGTTAAACAG
CCCCAGTGTGGACCAGTGGAGGAACAATTTGGGGGTGGGAAAGCAATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCA
ATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAGAAAGTGAATGTCAGTAAGATGGGGATCGATGCCCAAACTGCATTCTGGATTAATGTGTATAATGCTCTT
CTAATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTGAGAAGGTTGGCTTTGTTCCATAAGTGGCTTGAAACCATCATTTCAACTGCGCTGAGGAAG
AAGTCTGGGGAAGAAAGGCAACTTATCTCTTCAAAATTGGGCCTTCCTAGTCCTCAACCTCTTGTTTGCTTTGGCCTCTGCACTGGTGCCTCTTCAGATCCTGTG
CTGAAAGTGTACACTGCATCAAATGTTAAAGAGGAACTGGAAGTGGCCAAAAGGGATTTTCTCCAAGCGAATATAGTCGTGAAGAAGTCAAAGAAAGTATTCCTA
CCAAAGGTGCTCGAAAGACTTGCACGCGAAGCATCCATCAGCTCAGATGAACTCCCGAAATGGGTTTCTGAAAATGTCGATGGGAAACTCCACGAGTCGATACAG
AAATGTATGGAACATCGGACTGGCAAGAAGGCATCTCAAATCATTGAGTGGTTACCTTACAGCTCAAGGTTCCGAACTTTGCCGGACTTCACCACTCTCGACGCC
GATAACAACAACTACGCATGGAGAAACTTCGCTCGGTTCCTCGATGCTGGCAAAGGCAGCGAAGTGAACGGCATGTCGGAACTGAAGAAGTATTTGAATCGATTC
GGTTACCTTCCGATTCCTCCTCAAAACAACTTCTCCGATTTCTTCGACGATCAATTCGTATCGGCTTTGATTCTCTATCAGAATCGTTTAGGTTTATCAGTCACT
GGAAAACTTGATTCCGAAACAATCGCAAGCATCATGTCGCCGAGATGCGGAATGAGTGACCTAATTAAAATCAACAGCAACACAACAACAACAACAACGATTCAT
TCAACGCGTCGATATGCTTTCTTCAACGGCCAACCGAGATGGATTCGATCCTCAACTCTAACATACGCTCTCTCACCAGATTACACAATCGAATACCTAACTTCA
TCAGAAATCCGCAAAGTCGTCCGACGATCGTTCTCGCGGTGGTCCGCAGTGATTCCGTTGAACTTTACCGAATCATCCGATTACGAATCGTCTGATATCCGAATC
GGATTCTACCGCGGCGATCATGGTGACGGAGAGGCGTTCGACGGAGTATTAGGCGTTTTGGCGCACGCGTTTTCACCGGAAAACGGAAGGCTACACTTAGATGCG
GCGGAGAGGTGGGCGGTGGATTTCGATAAGGAGAAATCGAAAGTAGCTGTGGATTTGGAATCGGTAGTAACGCATGAGATAGGGCATGTTCTTGGACTGGCTCAC
TCTGCGGTGAAGGAATCTGTGATGTATCCAAGTTTGAGTCCGCGAGGGAAGAAAGTGGACCTTAGGATCGATGATGTGGAAGGAATTCAGTATTTGTACGGTACA
AACCCTAATTTCAAATTGAAATCCTTCTTGGAATCTGAAAAATCTATCAACGCTGGATCATCTTCTTCAATCAACACTAATTTATTCTTCTTATTCTTCTTCTAC
TTGTTCATTTTGGTTTGGTCTCTGTTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATTCGATATGCATATGCGTGGTGAAGAAATAGCTTCTGGGAAGAGAGAACTTAGAGATTATTTAGCGTCTCAACGTGTTCATTCCCGCCATAGGCGA
TCCAGAAGTTCTTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCTAATAGCAAGAATGATCGGAGTGACCCACAAGCATCCCCACTTTCTACG
AGCGGTATCAGAGCACAAAGTCCTCTACATGAAAGGTCTGCAGATTTCAATGATAATTCCCCAACTAAACAGCGAGCGTCTTTGGAAAATGATATTGAGTTGCTA
CAGCTGCGCTTGCAACAAGAGAGATCTATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCCGGGCATAGGCACTTAGCCCAGACG
AAGGATTTGATCTCAGAAATTGAATTGCTTGAAGAAGAGGTTGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGGAGTATTTTTGAAAATTGTGTTAGTAAG
CCATCTTCTCAGCAAAATTCAGTCACAGCCTCTCCTGCTCATGGAAAGCATGAATCAAGAAAACACCCCAGTATCATTTCAAGTGCGTTTTGTTCGTCGAGGAAG
TTTCCCTTGGGACCTTTGCAACCTTTCTCTGTAAATGACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAGTTCCTTGTTTGGAGGTAAAAGCGACATAAGTTCC
AGAAAGACTTCAGGCACTGCAAAGGTTCGTGAAGCCTTTTCGCAGGCGAAGAGAACTTCTCTGCGAACTCTAAAAGATCATCTTTTTGAGTGTCCAAGTAAATTA
TCAGAGGAGATGGTGAGGTGCATGGCTTTTATATACTGCTCTCTTCATAGAGTGGCATCAAACAAGGCTAAAAAAAAGGCAGGTTTCTTCCCTGAAGTTAAACAG
CCCCAGTGTGGACCAGTGGAGGAACAATTTGGGGGTGGGAAAGCAATGCTGGAAATACATTGCATATCAACCAATAACAGCCAGTTCTCTCGTGCTTCATACGCA
ATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAGAAAGTGAATGTCAGTAAGATGGGGATCGATGCCCAAACTGCATTCTGGATTAATGTGTATAATGCTCTT
CTAATGCATGCCTATTTGGCATATGGAATCCCTCATGGCTCTCTGAGAAGGTTGGCTTTGTTCCATAAGTGGCTTGAAACCATCATTTCAACTGCGCTGAGGAAG
AAGTCTGGGGAAGAAAGGCAACTTATCTCTTCAAAATTGGGCCTTCCTAGTCCTCAACCTCTTGTTTGCTTTGGCCTCTGCACTGGTGCCTCTTCAGATCCTGTG
CTGAAAGTGTACACTGCATCAAATGTTAAAGAGGAACTGGAAGTGGCCAAAAGGGATTTTCTCCAAGCGAATATAGTCGTGAAGAAGTCAAAGAAAGTATTCCTA
CCAAAGGTGCTCGAAAGACTTGCACGCGAAGCATCCATCAGCTCAGATGAACTCCCGAAATGGGTTTCTGAAAATGTCGATGGGAAACTCCACGAGTCGATACAG
AAATGTATGGAACATCGGACTGGCAAGAAGGCATCTCAAATCATTGAGTGGTTACCTTACAGCTCAAGGTTCCGAACTTTGCCGGACTTCACCACTCTCGACGCC
GATAACAACAACTACGCATGGAGAAACTTCGCTCGGTTCCTCGATGCTGGCAAAGGCAGCGAAGTGAACGGCATGTCGGAACTGAAGAAGTATTTGAATCGATTC
GGTTACCTTCCGATTCCTCCTCAAAACAACTTCTCCGATTTCTTCGACGATCAATTCGTATCGGCTTTGATTCTCTATCAGAATCGTTTAGGTTTATCAGTCACT
GGAAAACTTGATTCCGAAACAATCGCAAGCATCATGTCGCCGAGATGCGGAATGAGTGACCTAATTAAAATCAACAGCAACACAACAACAACAACAACGATTCAT
TCAACGCGTCGATATGCTTTCTTCAACGGCCAACCGAGATGGATTCGATCCTCAACTCTAACATACGCTCTCTCACCAGATTACACAATCGAATACCTAACTTCA
TCAGAAATCCGCAAAGTCGTCCGACGATCGTTCTCGCGGTGGTCCGCAGTGATTCCGTTGAACTTTACCGAATCATCCGATTACGAATCGTCTGATATCCGAATC
GGATTCTACCGCGGCGATCATGGTGACGGAGAGGCGTTCGACGGAGTATTAGGCGTTTTGGCGCACGCGTTTTCACCGGAAAACGGAAGGCTACACTTAGATGCG
GCGGAGAGGTGGGCGGTGGATTTCGATAAGGAGAAATCGAAAGTAGCTGTGGATTTGGAATCGGTAGTAACGCATGAGATAGGGCATGTTCTTGGACTGGCTCAC
TCTGCGGTGAAGGAATCTGTGATGTATCCAAGTTTGAGTCCGCGAGGGAAGAAAGTGGACCTTAGGATCGATGATGTGGAAGGAATTCAGTATTTGTACGGTACA
AACCCTAATTTCAAATTGAAATCCTTCTTGGAATCTGAAAAATCTATCAACGCTGGATCATCTTCTTCAATCAACACTAATTTATTCTTCTTATTCTTCTTCTAC
TTGTTCATTTTGGTTTGGTCTCTGTTTTTCTGA
Protein sequenceShow/hide protein sequence
MSGFDMHMRGEEIASGKRELRDYLASQRVHSRHRRSRSSSDRNSNVFRGGVLHSNSKNDRSDPQASPLSTSGIRAQSPLHERSADFNDNSPTKQRASLENDIELL
QLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLISEIELLEEEVANREQHVLSLYRSIFENCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSRK
FPLGPLQPFSVNDLGKRTSNAGPSSLFGGKSDISSRKTSGTAKVREAFSQAKRTSLRTLKDHLFECPSKLSEEMVRCMAFIYCSLHRVASNKAKKKAGFFPEVKQ
PQCGPVEEQFGGGKAMLEIHCISTNNSQFSRASYAINNYRVLVEQLEKVNVSKMGIDAQTAFWINVYNALLMHAYLAYGIPHGSLRRLALFHKWLETIISTALRK
KSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERLAREASISSDELPKWVSENVDGKLHESIQ
KCMEHRTGKKASQIIEWLPYSSRFRTLPDFTTLDADNNNYAWRNFARFLDAGKGSEVNGMSELKKYLNRFGYLPIPPQNNFSDFFDDQFVSALILYQNRLGLSVT
GKLDSETIASIMSPRCGMSDLIKINSNTTTTTTIHSTRRYAFFNGQPRWIRSSTLTYALSPDYTIEYLTSSEIRKVVRRSFSRWSAVIPLNFTESSDYESSDIRI
GFYRGDHGDGEAFDGVLGVLAHAFSPENGRLHLDAAERWAVDFDKEKSKVAVDLESVVTHEIGHVLGLAHSAVKESVMYPSLSPRGKKVDLRIDDVEGIQYLYGT
NPNFKLKSFLESEKSINAGSSSSINTNLFFLFFFYLFILVWSLFF