; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004217 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004217
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionOTU domain-containing protein
Genome locationscaffold92:891267..894320
RNA-Seq ExpressionMS004217
SyntenyMS004217
Gene Ontology termsGO:0016579 - protein deubiquitination (biological process)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
InterPro domainsIPR003323 - OTU domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012908.1 OTU domain-containing protein [Cucurbita argyrosperma subsp. argyrosperma]3.2e-14174.72Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+++CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

XP_022139286.1 OTU domain-containing protein DDB_G0284757-like [Momordica charantia]7.7e-16485.47Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSAD
        WLGPSPVHSPYG +                   +SNGVDKMGNSSSYPN+LENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSAD
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSAD

Query:  EEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLI
        EEMSDHQRLLDR LQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQII QLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADW   
Subjt:  EEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLI

Query:  YLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                               FGVKIFVITSFKDTCSIEILPQVQKSKR
Subjt:  YLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

XP_022966748.1 uncharacterized protein LOC111466368 isoform X1 [Cucurbita maxima]1.4e-14175Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+E+CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

XP_022966749.1 uncharacterized protein LOC111466368 isoform X2 [Cucurbita maxima]1.4e-14175Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+E+CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

XP_022966752.1 uncharacterized protein LOC111466368 isoform X4 [Cucurbita maxima]1.4e-14175Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+E+CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

TrEMBL top hitse value%identityAlignment
A0A6J1CC82 OTU domain-containing protein DDB_G0284757-like3.7e-16485.47Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSAD
        WLGPSPVHSPYG +                   +SNGVDKMGNSSSYPN+LENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSAD
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSAD

Query:  EEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLI
        EEMSDHQRLLDR LQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQII QLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADW   
Subjt:  EEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLI

Query:  YLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                               FGVKIFVITSFKDTCSIEILPQVQKSKR
Subjt:  YLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

A0A6J1G0U6 OTU domain-containing protein DDB_G0284757-like isoform X21.5e-14174.72Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+++CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

A0A6J1HNU1 uncharacterized protein LOC111466368 isoform X46.9e-14275Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+E+CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

A0A6J1HT37 uncharacterized protein LOC111466368 isoform X16.9e-14275Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+E+CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

A0A6J1HUQ0 uncharacterized protein LOC111466368 isoform X26.9e-14275Show/hide
Query:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD
        METY++DPDV RWGLHLLDVCTFTNDSSR+TVTEY +DPSYSQV YVNEGYCEPC VNLENDEAIAHAFQEEISRIDSIE SGVS++GED LQASVLAQD
Subjt:  METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQD

Query:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA
        WLGPS  H P+G E                     N V+KMGN SSY  NT++N F+E+CS YSLEIMDES+LDGEVGKRLNQIVPVPHVPKTIEKIPSA
Subjt:  WLGPSPVHSPYGSE-------------------NSNGVDKMGNSSSY-PNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSA

Query:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL
        DEEMSDHQRLL+R LQLYEL+ENK+QGDGNCQFRALSDQLYRSPEHHD VREQII QLK  R+IY GYVPMAYD+YLKKMSKKGEWGDHVTLQAAADW  
Subjt:  DEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVL

Query:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                                FGVKIFVITSFKDTCSIEILP+VQKSKR
Subjt:  IYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

SwissProt top hitse value%identityAlignment
Q0V869 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 117.0e-3537.84Show/hide
Query:  SPVHSPYGSENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDES-TLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVEN
        +P  +   S  ++G     ++SS+ +++ ++  +D ++  +   DES   +G++GKRL+ +  +PH P+   +IP  ++   DH+ LL   L  Y L E 
Subjt:  SPVHSPYGSENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDES-TLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVEN

Query:  KVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLL
        +++GDGNCQFRAL+DQL+R+ ++H  VR+ ++ QLK +R +Y  YVPM Y  Y +KM K GEWGDHVTLQAAAD                          
Subjt:  KVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLL

Query:  QFGVKIFVITSFKDTCSIEILP
        +F  KI ++TSF+D   IEILP
Subjt:  QFGVKIFVITSFKDTCSIEILP

Q54P70 OTU domain-containing protein DDB_G02847575.2e-1435.76Show/hide
Query:  NSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPV--PHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENK-VQGDGNCQFRALSDQL
        ++SS P+ +     +  S    EI     L+G V K +N    +   +V   +  +P + E     QRL +R L+LY L  +K + GDGNCQ  ALSDQL
Subjt:  NSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPV--PHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENK-VQGDGNCQFRALSDQL

Query:  YRSPEHHDFVREQIIVQLKARRDIY---GGYV-----PMAYDDYLKKMSKKGEWGDHVTLQAAAD
        Y    H   VR+ I+  L+  +D     G  +        +DDY   MSK G WGDH+TL AAA+
Subjt:  YRSPEHHDFVREQIIVQLKARRDIY---GGYV-----PMAYDDYLKKMSKKGEWGDHVTLQAAAD

Q8LBW2 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 91.6e-7949.42Show/hide
Query:  YNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQDWLG
        Y  DPD  RWGLH L+VCT TN  S S+VT Y      +Q GYV EGY +P    ++ND  IA  +Q+E+SR+   EASG+++        SV+AQDW  
Subjt:  YNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQDWLG

Query:  PSPVHSPYG-------------SENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRL
        P       G             + N N  DK      +     +   +D S+ S+EI +ES    EVGKRLNQ++P+ HVPK   ++PS DE++SDH+RL
Subjt:  PSPVHSPYG-------------SENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRL

Query:  LDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNI
          R LQLY LVENK++GDGNCQFR+LSDQLYRSPEHH+FVREQ++ QL   R+IY GYVPMAY+DYLK M + GEWGDHVTLQAAAD             
Subjt:  LDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNI

Query:  CNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                      FGV++FVITSFKDTC IEILP  QKS R
Subjt:  CNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

Q9LZF7 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 102.0e-5040.59Show/hide
Query:  YSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVS--NTGEDHLQASVL--------AQDWLGPSPVHSPYGSENSNGVDKMGNSSSYPN
        + Q G     Y +  + +++NDE IA   Q++  +++  E++  S  N  + H Q               W   SP          N  D+ G S    N
Subjt:  YSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVS--NTGEDHLQASVL--------AQDWLGPSPVHSPYGSENSNGVDKMGNSSSYPN

Query:  TLENSFSEDCS--LYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHD
            S   D    +YS E  D+   DGE G+RLNQ+VP+P++PK   +IP  +E +SDH+RL +R L++++  E KV GDGNCQFRAL+DQLY++ + H 
Subjt:  TLENSFSEDCS--LYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHD

Query:  FVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQ
         VR QI+ QLK+R D Y GYVPM + DYL+KMS+ GEWGDHVTLQAAAD                           + VKI V+TSFKDTC IEILP  Q
Subjt:  FVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQ

Query:  KSK
        +SK
Subjt:  KSK

Q9SGA5 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 128.9e-4648.82Show/hide
Query:  MGNSSSYPNTLENSFSEDCSLYSLEIMDE-STLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQL
        MG+SSS  +      +ED  + +  + +E S LDG VG+RL+ + PVPHVP+    IP+ ++   DHQRLL R L +Y L E KV GDGNCQFRALSDQL
Subjt:  MGNSSSYPNTLENSFSEDCSLYSLEIMDE-STLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQL

Query:  YRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCS
        YRSPE+H  VR +++ QLK  R +Y  YVPM Y  Y KKM K GEWGDH+TLQAAAD                          +F  KI ++TSF+DTC 
Subjt:  YRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCS

Query:  IEILPQVQKSK
        IEI+PQ Q  K
Subjt:  IEILPQVQKSK

Arabidopsis top hitse value%identityAlignment
AT3G02070.1 Cysteine proteinases superfamily protein6.3e-4748.82Show/hide
Query:  MGNSSSYPNTLENSFSEDCSLYSLEIMDE-STLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQL
        MG+SSS  +      +ED  + +  + +E S LDG VG+RL+ + PVPHVP+    IP+ ++   DHQRLL R L +Y L E KV GDGNCQFRALSDQL
Subjt:  MGNSSSYPNTLENSFSEDCSLYSLEIMDE-STLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQL

Query:  YRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCS
        YRSPE+H  VR +++ QLK  R +Y  YVPM Y  Y KKM K GEWGDH+TLQAAAD                          +F  KI ++TSF+DTC 
Subjt:  YRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCS

Query:  IEILPQVQKSK
        IEI+PQ Q  K
Subjt:  IEILPQVQKSK

AT5G03330.1 Cysteine proteinases superfamily protein1.4e-5140.59Show/hide
Query:  YSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVS--NTGEDHLQASVL--------AQDWLGPSPVHSPYGSENSNGVDKMGNSSSYPN
        + Q G     Y +  + +++NDE IA   Q++  +++  E++  S  N  + H Q               W   SP          N  D+ G S    N
Subjt:  YSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVS--NTGEDHLQASVL--------AQDWLGPSPVHSPYGSENSNGVDKMGNSSSYPN

Query:  TLENSFSEDCS--LYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHD
            S   D    +YS E  D+   DGE G+RLNQ+VP+P++PK   +IP  +E +SDH+RL +R L++++  E KV GDGNCQFRAL+DQLY++ + H 
Subjt:  TLENSFSEDCS--LYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHD

Query:  FVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQ
         VR QI+ QLK+R D Y GYVPM + DYL+KMS+ GEWGDHVTLQAAAD                           + VKI V+TSFKDTC IEILP  Q
Subjt:  FVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQ

Query:  KSK
        +SK
Subjt:  KSK

AT5G03330.2 Cysteine proteinases superfamily protein1.4e-5140.59Show/hide
Query:  YSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVS--NTGEDHLQASVL--------AQDWLGPSPVHSPYGSENSNGVDKMGNSSSYPN
        + Q G     Y +  + +++NDE IA   Q++  +++  E++  S  N  + H Q               W   SP          N  D+ G S    N
Subjt:  YSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVS--NTGEDHLQASVL--------AQDWLGPSPVHSPYGSENSNGVDKMGNSSSYPN

Query:  TLENSFSEDCS--LYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHD
            S   D    +YS E  D+   DGE G+RLNQ+VP+P++PK   +IP  +E +SDH+RL +R L++++  E KV GDGNCQFRAL+DQLY++ + H 
Subjt:  TLENSFSEDCS--LYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHD

Query:  FVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQ
         VR QI+ QLK+R D Y GYVPM + DYL+KMS+ GEWGDHVTLQAAAD                           + VKI V+TSFKDTC IEILP  Q
Subjt:  FVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQ

Query:  KSK
        +SK
Subjt:  KSK

AT5G04250.1 Cysteine proteinases superfamily protein1.1e-8049.42Show/hide
Query:  YNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQDWLG
        Y  DPD  RWGLH L+VCT TN  S S+VT Y      +Q GYV EGY +P    ++ND  IA  +Q+E+SR+   EASG+++        SV+AQDW  
Subjt:  YNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQDWLG

Query:  PSPVHSPYG-------------SENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRL
        P       G             + N N  DK      +     +   +D S+ S+EI +ES    EVGKRLNQ++P+ HVPK   ++PS DE++SDH+RL
Subjt:  PSPVHSPYG-------------SENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRL

Query:  LDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNI
          R LQLY LVENK++GDGNCQFR+LSDQLYRSPEHH+FVREQ++ QL   R+IY GYVPMAY+DYLK M + GEWGDHVTLQAAAD             
Subjt:  LDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNI

Query:  CNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                      FGV++FVITSFKDTC IEILP  QKS R
Subjt:  CNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR

AT5G04250.2 Cysteine proteinases superfamily protein1.1e-8049.42Show/hide
Query:  YNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQDWLG
        Y  DPD  RWGLH L+VCT TN  S S+VT Y      +Q GYV EGY +P    ++ND  IA  +Q+E+SR+   EASG+++        SV+AQDW  
Subjt:  YNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQDWLG

Query:  PSPVHSPYG-------------SENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRL
        P       G             + N N  DK      +     +   +D S+ S+EI +ES    EVGKRLNQ++P+ HVPK   ++PS DE++SDH+RL
Subjt:  PSPVHSPYG-------------SENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRL

Query:  LDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNI
          R LQLY LVENK++GDGNCQFR+LSDQLYRSPEHH+FVREQ++ QL   R+IY GYVPMAY+DYLK M + GEWGDHVTLQAAAD             
Subjt:  LDRSLQLYELVENKVQGDGNCQFRALSDQLYRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNI

Query:  CNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR
                      FGV++FVITSFKDTC IEILP  QKS R
Subjt:  CNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKSKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACGTATAATATTGATCCTGACGTAAGGAGATGGGGCCTTCATCTTTTGGATGTTTGCACCTTTACAAATGACAGTTCTCGCAGTACTGTTACTGAATACAGTTA
TGATCCAAGTTATAGTCAAGTGGGATATGTTAATGAAGGTTATTGTGAGCCGTGTAATGTGAATTTGGAGAATGATGAGGCTATAGCACACGCTTTTCAAGAAGAGATAT
CTAGAATCGATTCTATAGAAGCTTCTGGGGTATCTAATACTGGGGAAGACCATCTGCAAGCATCTGTTCTTGCACAGGATTGGCTTGGTCCTTCGCCTGTGCATAGCCCT
TATGGTAGTGAGAACAGTAACGGAGTGGACAAAATGGGAAACTCCTCTTCATATCCTAACACACTAGAAAATTCTTTTTCAGAGGATTGCTCTTTATATTCGTTGGAAAT
AATGGATGAGTCTACACTTGATGGTGAAGTGGGCAAAAGGTTGAATCAGATTGTTCCTGTTCCTCATGTACCCAAGACCATTGAGAAGATACCCTCAGCTGATGAAGAGA
TGTCGGATCATCAACGGCTCCTTGATAGGTCATTGCAGTTGTACGAGCTGGTTGAGAATAAGGTTCAAGGAGATGGTAACTGCCAGTTTCGTGCTTTATCAGATCAACTT
TACCGATCTCCCGAGCACCATGATTTTGTGAGAGAACAAATTATAGTGCAGCTAAAGGCTCGTCGAGATATATATGGGGGATATGTTCCAATGGCTTATGACGACTATCT
GAAGAAGATGAGCAAGAAAGGAGAATGGGGTGATCATGTTACACTACAAGCTGCTGCAGACTGGGTATTGATATATCTTCTTCTTATTGCACTGAACATTTGTAATAACT
TGGCATATAATTGGTTTTTGCATTTGTTGCAGTTTGGAGTTAAGATATTTGTAATTACATCATTCAAGGATACATGTTCAATTGAAATACTTCCACAAGTTCAAAAGTCC
AAACGAAGTAAGATT
mRNA sequenceShow/hide mRNA sequence
ATGGAAACGTATAATATTGATCCTGACGTAAGGAGATGGGGCCTTCATCTTTTGGATGTTTGCACCTTTACAAATGACAGTTCTCGCAGTACTGTTACTGAATACAGTTA
TGATCCAAGTTATAGTCAAGTGGGATATGTTAATGAAGGTTATTGTGAGCCGTGTAATGTGAATTTGGAGAATGATGAGGCTATAGCACACGCTTTTCAAGAAGAGATAT
CTAGAATCGATTCTATAGAAGCTTCTGGGGTATCTAATACTGGGGAAGACCATCTGCAAGCATCTGTTCTTGCACAGGATTGGCTTGGTCCTTCGCCTGTGCATAGCCCT
TATGGTAGTGAGAACAGTAACGGAGTGGACAAAATGGGAAACTCCTCTTCATATCCTAACACACTAGAAAATTCTTTTTCAGAGGATTGCTCTTTATATTCGTTGGAAAT
AATGGATGAGTCTACACTTGATGGTGAAGTGGGCAAAAGGTTGAATCAGATTGTTCCTGTTCCTCATGTACCCAAGACCATTGAGAAGATACCCTCAGCTGATGAAGAGA
TGTCGGATCATCAACGGCTCCTTGATAGGTCATTGCAGTTGTACGAGCTGGTTGAGAATAAGGTTCAAGGAGATGGTAACTGCCAGTTTCGTGCTTTATCAGATCAACTT
TACCGATCTCCCGAGCACCATGATTTTGTGAGAGAACAAATTATAGTGCAGCTAAAGGCTCGTCGAGATATATATGGGGGATATGTTCCAATGGCTTATGACGACTATCT
GAAGAAGATGAGCAAGAAAGGAGAATGGGGTGATCATGTTACACTACAAGCTGCTGCAGACTGGGTATTGATATATCTTCTTCTTATTGCACTGAACATTTGTAATAACT
TGGCATATAATTGGTTTTTGCATTTGTTGCAGTTTGGAGTTAAGATATTTGTAATTACATCATTCAAGGATACATGTTCAATTGAAATACTTCCACAAGTTCAAAAGTCC
AAACGAAGTAAGATT
Protein sequenceShow/hide protein sequence
METYNIDPDVRRWGLHLLDVCTFTNDSSRSTVTEYSYDPSYSQVGYVNEGYCEPCNVNLENDEAIAHAFQEEISRIDSIEASGVSNTGEDHLQASVLAQDWLGPSPVHSP
YGSENSNGVDKMGNSSSYPNTLENSFSEDCSLYSLEIMDESTLDGEVGKRLNQIVPVPHVPKTIEKIPSADEEMSDHQRLLDRSLQLYELVENKVQGDGNCQFRALSDQL
YRSPEHHDFVREQIIVQLKARRDIYGGYVPMAYDDYLKKMSKKGEWGDHVTLQAAADWVLIYLLLIALNICNNLAYNWFLHLLQFGVKIFVITSFKDTCSIEILPQVQKS
KRSKI