; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004911 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004911
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionervatamin-B
Genome locationscaffold176:1399703..1400750
RNA-Seq ExpressionMS004911
SyntenyMS004911
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AKO60151.1 cysteine proteinase 1, partial [Citrullus lanatus]7.6e-14783.61Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        YQKW+ KYGREYKS EE E+RF IYQ NVQYID FNSLN SYTLA+N FADLTNDEFKTTYLG+ TDW PDT F+YGN+V LPTNVDWRKE AVTP+KDQ
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGI KIKTGKL+SLSEQEL+DCDV SGNQGC+GG+M KAFEFIKK G+TTE EYPYRG+E+VCNKQKVRY + TISGYEKVP ND
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EKSLKAAVANQPVSVAIDAGGYDFQFYSGG+FSGNCGKQLNHGV IVGYGE   K+YWLVKNSWGT WGE GY+RMK +S+DKRGTCGIAM ASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

XP_022140756.1 ervatamin-B [Momordica charantia]3.0e-17599.67Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIV LPTNVDWRKEGAVTPIKDQ
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

XP_023513224.1 ervatamin-B-like [Cucurbita pepo subsp. pepo]1.4e-13778.6Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        Y+KW++K+ REYKS EE+E+RF +YQ NVQYID FNSLN SYTLA+N FADLTNDEFKTTYLGY T   PDTCF+Y ++  LPT+VDWR E AVTPIKDQ
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGI KI+TGKL SLSEQEL+DCD+ISGNQGC GGFM KAFE+IK+ G+TTE+EYPYRG+E  CN QKVRYHS TISGYEKVP N+
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EK LKAAVANQPVSVAIDAGGYDFQFYS GIFSG+CGKQLNHGV IVGYGE    +YWLVKNSWGT WGE GY+RMK +S DKRG CGIAM+ASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

XP_038902648.1 ervatamin-B-like [Benincasa hispida]4.8e-14180.33Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        YQKW+ KYGR+YKS EE E+RF IYQ NVQYID FNSL+ SYTLA+N   DLTNDEFK TYLGY TDW PDTCF+YGN+V LPTNV+WRKEGAVTPI +Q
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCG+CWAFSAVAAVEGI KIKTGKL+SLSEQEL+DCDV SGNQGC+GGFM KAF+FIKK  +TTE EYPYRG+E+ CNKQKVR H+  ISGYEKVPAND
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGE-DVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EKSLKA VANQPVS+AIDAGGYDFQFYSGG+FSGNCGKQLNHGV IVGY +  + KSYWLVKNSWGT+WGE GY+RMKS+S+DKRGTCGIAM ASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGE-DVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

XP_038902939.1 ervatamin-B [Benincasa hispida]2.9e-14682.94Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        YQKW+ KYGR+YKS EE E+RF IYQ NVQYID FNSL+ SYTLA+N FADLTNDEFK TYLGY TDW PDTCF+YGN+V LPTNV+WRKEGAVTPIK+Q
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGI KIKTGKL+SLSEQEL+DCDV SGNQGC+GGFM KAF+FIKK G+TTE EYPYRG+E+ CNKQKVR H+  ISGYEKVPAND
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EKSLKAAVANQPVSVAIDAGGYDFQFYSGG+FSGNCGKQLNHGV IVGYG+   KSYWLVKNSWGT WGE GY+RMK +S+DKRGTCGIAM ASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

TrEMBL top hitse value%identityAlignment
A0A0A0LJV6 Uncharacterized protein6.5e-13677.26Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        YQKW+DKYGR+YKS EE E+RF IYQ+NVQYID FNS+N S+TLA+N FADLTN+EFK TYLGY T   PDTCF+YGN+V LPTNVDWR+EGAVTPIK+Q
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGI KIK GKL+SLSEQEL+DCDV SGNQGC+GG+M KAFEFIK+ G+TTE EYPY+G E+ CN+QK +Y   +ISGYEKVP ND
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EKSLKAAVANQPVSVAIDA G +FQFYSGGIFSGNCG QLNHGV IVGYGE   ++YWLVKNSWGT WGE GY+RMK +S+D++GTCGIAM ASYP KD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

A0A384S0D9 Cysteine proteinase 1 (Fragment)3.7e-14783.61Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        YQKW+ KYGREYKS EE E+RF IYQ NVQYID FNSLN SYTLA+N FADLTNDEFKTTYLG+ TDW PDT F+YGN+V LPTNVDWRKE AVTP+KDQ
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGI KIKTGKL+SLSEQEL+DCDV SGNQGC+GG+M KAFEFIKK G+TTE EYPYRG+E+VCNKQKVRY + TISGYEKVP ND
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EKSLKAAVANQPVSVAIDAGGYDFQFYSGG+FSGNCGKQLNHGV IVGYGE   K+YWLVKNSWGT WGE GY+RMK +S+DKRGTCGIAM ASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

A0A6J1CH04 ervatamin-B1.4e-17599.67Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIV LPTNVDWRKEGAVTPIKDQ
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

A0A6J1FYZ3 ervatamin-B-like1.6e-13777.59Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        Y+KW++K+ REYKS EE+E+RF +YQ NVQYID FNSLN SYTLA+N FADLTNDEFKTTYLGY T   PDTCF+Y +++ LPT+VDWR E AVTP+KDQ
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGI KI+TGKL SLSEQEL+DCD+ISGNQGC GGFM KAFE+IK+ G+TTE+EYPYRG+E  CN QKVRYHS TISGYEKVP N+
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EK LKAAVA+QPVSVAIDAGGYDFQFYS GIFSG+CGKQLNHGV IVGYGE    +YWLVKNSWGT WGE GY+RMK +S DKRG CGIAM+ASYP KD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

A0A6J1J793 ervatamin-B2.5e-13576.59Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ
        Y+KW++K+ REYKS EE+E+RF +YQ NVQYID FNSLN SYTLA+N FADLTNDEFK TYLGY TD   DTCF+Y +++ LP +VDWR E AVTP+KDQ
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQ

Query:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND
        GQCGSCWAFSAVAAVEGI KI+TGKL SLSEQEL+DCD+  GNQGC GGFM KAFE+IK+ G+TTE+EYPYRG+E  CN QKVRYHS TISGYEKVP N+
Subjt:  GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPAND

Query:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD
        EK LKAAVA+QPVSVAIDAGGYDFQFYS GIFSG+CGKQLNHGV IVGYGE    +YWLVKNSWGT WGE GY+RMK +S DKRG CGIAM+ASYPIKD
Subjt:  EKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD

SwissProt top hitse value%identityAlignment
A2XQE8 Senescence-specific cysteine protease SAG394.7e-9154.46Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFK--TTYLGYLTDWS-PDTCFKYG--NIVKLPTNVDWRKEGAVT
        +++W+ +YGR Y+   E+ +RF ++++NV +I+ FN+ N ++ L  N FADLTNDEF+   T  G++   +   T F+Y   NI  LP  VDWR +GAVT
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFK--TTYLGYLTDWS-PDTCFKYG--NIVKLPTNVDWRKEGAVT

Query:  PIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEF-IKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYE
        PIKDQGQCG CWAFSAVAA+EGI K+ TGKL+SLSEQEL+DCDV   +QGC GG M  AF+F IK  G+TTE  YPY   ++ C  + V    A+I GYE
Subjt:  PIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEF-IKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYE

Query:  KVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGE-DVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDA
         VPAN+E +L  AVANQPVSVA+D G   FQFY GG+ +G+CG  L+HG+  +GYG+   G  YWL+KNSWGT+WGE G++RM+ + SDKRG CG+AM+ 
Subjt:  KVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGE-DVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDA

Query:  SYP
        SYP
Subjt:  SYP

P12412 Vignain2.1e-9155.02Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLG--------YLTDWSPDTCFKYGNIVKLPTNVDWRKEG
        Y++W   +      G E+ KRF ++++NV ++   N +++ Y L  N FAD+TN EF++TY G        +         F Y  +  +P +VDWRK+G
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLG--------YLTDWSPDTCFKYGNIVKLPTNVDWRKEG

Query:  AVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATIS
        AVT +KDQGQCGSCWAFS + AVEGI +IKT KLVSLSEQEL+DCD    NQGC+GG M  AFEFIK K GITTE  YPY   E  C++ KV   + +I 
Subjt:  AVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATIS

Query:  GYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIA
        G+E VP NDE +L  AVANQPVSVAIDAGG DFQFYS G+F+G+C   LNHGV IVGYG  V G +YW+V+NSWG  WGE GY+RM+ N S K G CGIA
Subjt:  GYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIA

Query:  MDASYPIKD
        M ASYPIK+
Subjt:  MDASYPIKD

P25803 Vignain9.5e-9255.02Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDW--------SPDTCFKYGNIVKLPTNVDWRKEG
        Y++W   +      G E+ KRF ++++N+ ++   N +++ Y L  N FAD+TN EF++TY G   +           +  F Y  +V +P +VDWRK+G
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDW--------SPDTCFKYGNIVKLPTNVDWRKEG

Query:  AVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATIS
        AVT +KDQGQCGSCWAFS V AVEGI +IKT KLV+LSEQEL+DCD    NQGC+GG M  AFEFIK K GITTE  YPY+  E  C+  KV   + +I 
Subjt:  AVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATIS

Query:  GYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIA
        G+E VPANDE +L  AVANQPVSVAIDAGG DFQFYS G+F+G+C   LNHGV IVGYG  V G +YW+V+NSWG  WGE+GY+RM+ N S K G CGIA
Subjt:  GYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIA

Query:  MDASYPIKD
        M  SYPIK+
Subjt:  MDASYPIKD

Q7XWK5 Senescence-specific cysteine protease SAG394.7e-9154.46Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFK--TTYLGYLTDWS-PDTCFKYG--NIVKLPTNVDWRKEGAVT
        +++W+ +YGR Y+   E+ +RF ++++NV +I+ FN+ N ++ L  N FADLTNDEF+   T  G++   +   T F+Y   NI  LP  VDWR +GAVT
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFK--TTYLGYLTDWS-PDTCFKYG--NIVKLPTNVDWRKEGAVT

Query:  PIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEF-IKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYE
        PIKDQGQCG CWAFSAVAA+EGI K+ TGKL+SLSEQEL+DCDV   +QGC GG M  AF+F IK  G+TTE  YPY   ++ C  + V    A+I GYE
Subjt:  PIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEF-IKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYE

Query:  KVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGE-DVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDA
         VPAN+E +L  AVANQPVSVA+D G   FQFY GG+ +G+CG  L+HG+  +GYG+   G  YWL+KNSWGT+WGE G++RM+ + SDKRG CG+AM+ 
Subjt:  KVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGE-DVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDA

Query:  SYP
        SYP
Subjt:  SYP

Q9STL4 KDEL-tailed cysteine endopeptidase CEP22.1e-9155.66Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTD----------WSPDTCFKYGNIVKLPTNVDWRK
        Y +W   +    +S  EREKRF +++ NV ++   N  NRSY L  N FADLT +EFK  Y G               S    + + N+ KLP++VDWRK
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTD----------WSPDTCFKYGNIVKLPTNVDWRK

Query:  EGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHSAT
        +GAVT IK+QG+CGSCWAFS VAAVEGI KIKT KLVSLSEQEL+DCD    N+GC+GG M  AFEFIKK  GITTE  YPY G++  C+  K      T
Subjt:  EGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHSAT

Query:  ISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGI
        I G+E VP NDE +L  AVANQPVSVAIDAG  DFQFYS G+F+G+CG +LNHGV  VGYG + GK YW+V+NSWG  WGE GY++++    +  G CGI
Subjt:  ISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGI

Query:  AMDASYPIK
        AM+ASYPIK
Subjt:  AMDASYPIK

Arabidopsis top hitse value%identityAlignment
AT1G06260.1 Cysteine proteinases superfamily protein8.0e-9455.74Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTD------WSPDTCFKYGNIVKLPTNVDWRKEGAV
        ++KW+  + + Y   +E   RF IYQSNVQ IDY NSL+  + L DN FAD+TN EFK  +LG  T            C   GN+   P  VDWR +GAV
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTD------WSPDTCFKYGNIVKLPTNVDWRKEGAV

Query:  TPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATISGY
        TPI++QG+CG CWAFSAVAA+EGI KIKTG LVSLSEQ+L+DCDV + N+GCSGG M  AFEFIK   G+ TE +YPY G+E  C+++K +    TI GY
Subjt:  TPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATISGY

Query:  EKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDA
        +KV A +E SL+ A A QPVSV IDAGG+ FQ YS G+F+  CG  LNHGVT+VGYG +  + YW+VKNSWGT WGE GY+RM+   S+  G CGIAM A
Subjt:  EKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDA

Query:  SYPIK
        SYP++
Subjt:  SYPIK

AT3G48340.1 Cysteine proteinases superfamily protein1.5e-9255.66Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTD----------WSPDTCFKYGNIVKLPTNVDWRK
        Y +W   +    +S  EREKRF +++ NV ++   N  NRSY L  N FADLT +EFK  Y G               S    + + N+ KLP++VDWRK
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTD----------WSPDTCFKYGNIVKLPTNVDWRK

Query:  EGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHSAT
        +GAVT IK+QG+CGSCWAFS VAAVEGI KIKT KLVSLSEQEL+DCD    N+GC+GG M  AFEFIKK  GITTE  YPY G++  C+  K      T
Subjt:  EGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHSAT

Query:  ISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGI
        I G+E VP NDE +L  AVANQPVSVAIDAG  DFQFYS G+F+G+CG +LNHGV  VGYG + GK YW+V+NSWG  WGE GY++++    +  G CGI
Subjt:  ISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGI

Query:  AMDASYPIK
        AM+ASYPIK
Subjt:  AMDASYPIK

AT4G35350.1 xylem cysteine peptidase 15.9e-8952.96Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLG-----YLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVT
        ++ W+ ++ + YKS EE+  RF +++ N+ +ID  N+   SY L  N FADLT++EFK  YLG     +     P   F+Y +I  LP +VDWRK+GAV 
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLG-----YLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVT

Query:  PIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHSATISGYE
        P+KDQGQCGSCWAFS VAAVEGI +I TG L SLSEQEL+DCD  + N GC+GG M  AF++I    G+  E +YPY   E +C +QK      TISGYE
Subjt:  PIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHSATISGYE

Query:  KVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDAS
         VP ND++SL  A+A+QPVSVAI+A G DFQFY GG+F+G CG  L+HGV  VGYG   G  Y +VKNSWG  WGE G++RMK N+    G CGI   AS
Subjt:  KVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDAS

Query:  YPIK
        YP K
Subjt:  YPIK

AT5G45890.1 senescence-associated gene 121.7e-9152.58Show/hide
Query:  KWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSL--NRSYTLADNMFADLTNDEFKTTYLGY----------LTDWSPDTCFKYGNIVK--LPTNVDW
        +W+ K+GR Y   +E   R+ ++++NV+ I++ NS+   R++ LA N FADLTNDEF++ Y G+           T  SP   F+Y N+    LP +VDW
Subjt:  KWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSL--NRSYTLADNMFADLTNDEFKTTYLGY----------LTDWSPDTCFKYGNIVK--LPTNVDW

Query:  RKEGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHS
        RK+GAVTPIK+QG CG CWAFSAVAA+EG T+IK GKL+SLSEQ+L+DCD  + + GC GG M  AFE IK   G+TTE  YPY+G +  CN +K    +
Subjt:  RKEGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKI-GITTEKEYPYRGVENVCNKQKVRYHS

Query:  ATISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGT
         +I+GYE VP NDE++L  AVA+QPVSV I+ GG+DFQFYS G+F+G C   L+H VT +GYGE   G  YW++KNSWGT WGE GY+R++ +  DK+G 
Subjt:  ATISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGT

Query:  CGIAMDASYP
        CG+AM ASYP
Subjt:  CGIAMDASYP

AT5G50260.1 Cysteine proteinases superfamily protein4.1e-9053.4Show/hide
Query:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLG--------YLTDWSPDTCFKYGNIVKLPTNVDWRKEG
        Y++W   +    +S EE+ KRF +++ NV++I   N  ++SY L  N F D+T++EF+ TY G        +  +      F Y N+  LPT+VDWRK G
Subjt:  YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLG--------YLTDWSPDTCFKYGNIVKLPTNVDWRKEG

Query:  AVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATIS
        AVTP+K+QGQCGSCWAFS V AVEGI +I+T KL SLSEQEL+DCD  + NQGC+GG M  AFEFIK K G+T+E  YPY+  +  C+  K      +I 
Subjt:  AVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIK-KIGITTEKEYPYRGVENVCNKQKVRYHSATIS

Query:  GYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIA
        G+E VP N E  L  AVANQPVSVAIDAGG DFQFYS G+F+G CG +LNHGV +VGYG  + G  YW+VKNSWG  WGE GY+RM+     K G CGIA
Subjt:  GYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDV-GKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIA

Query:  MDASYPIKD
        M+ASYP+K+
Subjt:  MDASYPIKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TACCAGAAATGGATCGATAAATATGGTCGAGAATACAAGAGTGGAGAGGAGCGGGAGAAGAGGTTTCCGATTTATCAGTCTAATGTTCAATACATTGACTACTTCAATTC
TCTGAATCGTTCATATACTCTGGCTGACAACATGTTTGCAGACCTTACAAATGATGAGTTTAAGACAACTTATTTGGGATATCTAACTGATTGGTCTCCTGACACATGCT
TCAAATATGGCAACATTGTTAAGTTGCCTACTAATGTTGACTGGAGAAAGGAAGGCGCAGTAACTCCGATAAAGGACCAAGGCCAATGCGGGAGTTGCTGGGCGTTCTCG
GCGGTAGCAGCTGTGGAAGGCATCACCAAAATAAAAACAGGAAAGTTGGTGTCTCTATCAGAACAAGAGCTCCTGGACTGCGATGTTATCTCGGGGAACCAGGGATGTAG
TGGTGGTTTCATGCCCAAAGCATTTGAGTTCATCAAGAAAATTGGAATCACTACAGAAAAAGAATATCCATACAGGGGAGTTGAAAATGTATGCAACAAACAAAAAGTGA
GATACCACTCTGCGACAATAAGTGGGTATGAAAAAGTACCTGCCAATGATGAGAAAAGCCTAAAAGCTGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGG
GGGTATGATTTTCAGTTCTATTCTGGCGGAATCTTCTCAGGGAATTGTGGAAAGCAACTCAATCATGGAGTGACAATAGTTGGGTATGGGGAAGACGTCGGTAAAAGTTA
CTGGCTTGTCAAGAATTCATGGGGTACTAGCTGGGGTGAATATGGTTATGTAAGAATGAAAAGTAATTCAAGTGATAAGCGAGGTACTTGTGGCATAGCCATGGATGCTA
GCTACCCCATCAAAGAC
mRNA sequenceShow/hide mRNA sequence
TACCAGAAATGGATCGATAAATATGGTCGAGAATACAAGAGTGGAGAGGAGCGGGAGAAGAGGTTTCCGATTTATCAGTCTAATGTTCAATACATTGACTACTTCAATTC
TCTGAATCGTTCATATACTCTGGCTGACAACATGTTTGCAGACCTTACAAATGATGAGTTTAAGACAACTTATTTGGGATATCTAACTGATTGGTCTCCTGACACATGCT
TCAAATATGGCAACATTGTTAAGTTGCCTACTAATGTTGACTGGAGAAAGGAAGGCGCAGTAACTCCGATAAAGGACCAAGGCCAATGCGGGAGTTGCTGGGCGTTCTCG
GCGGTAGCAGCTGTGGAAGGCATCACCAAAATAAAAACAGGAAAGTTGGTGTCTCTATCAGAACAAGAGCTCCTGGACTGCGATGTTATCTCGGGGAACCAGGGATGTAG
TGGTGGTTTCATGCCCAAAGCATTTGAGTTCATCAAGAAAATTGGAATCACTACAGAAAAAGAATATCCATACAGGGGAGTTGAAAATGTATGCAACAAACAAAAAGTGA
GATACCACTCTGCGACAATAAGTGGGTATGAAAAAGTACCTGCCAATGATGAGAAAAGCCTAAAAGCTGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGG
GGGTATGATTTTCAGTTCTATTCTGGCGGAATCTTCTCAGGGAATTGTGGAAAGCAACTCAATCATGGAGTGACAATAGTTGGGTATGGGGAAGACGTCGGTAAAAGTTA
CTGGCTTGTCAAGAATTCATGGGGTACTAGCTGGGGTGAATATGGTTATGTAAGAATGAAAAGTAATTCAAGTGATAAGCGAGGTACTTGTGGCATAGCCATGGATGCTA
GCTACCCCATCAAAGAC
Protein sequenceShow/hide protein sequence
YQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVKLPTNVDWRKEGAVTPIKDQGQCGSCWAFS
AVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPANDEKSLKAAVANQPVSVAIDAG
GYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD