; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001234 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001234
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCysteine proteinase 1
Genome locationscaffold8:40687775..40688963
RNA-Seq ExpressionSpg001234
SyntenySpg001234
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AKO60151.1 cysteine proteinase 1, partial [Citrullus lanatus]4.6e-16487.15Show/hide
Query:  MASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGYQTDWSPDTCFRYGNIV
        MASM MD  PG  S  L+DRYQ WM+KYGREYKSREEWE+RF IYQLNVQYID FNSLNHSYTLAEN+FADLTNDEFKTTYLG++TDW PDT FRYGN+V
Subjt:  MASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGYQTDWSPDTCFRYGNIV

Query:  NLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKTGLTTEREYPYRGMEAVCNK
        NLPT+VDWRKE AVTP+KDQGQCGSCWAFSAVAAVEGINKIKTGKL+SLSEQELVDCDV SGNQGCNGG+MYKAF+FIKKTGLTTE EYPYRG+E+VCNK
Subjt:  NLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKTGLTTEREYPYRGMEAVCNK

Query:  QKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDS
        QKVRY +V ISGYEKVP NDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSG CGKQLNHGV  VGYGEA NKTYWLVKNSWGTDWGESGY+RMKRDS
Subjt:  QKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDS

Query:  RDKRGTCGIAMEASYPVKD
         DKRGTCGIAM ASYP+KD
Subjt:  RDKRGTCGIAMEASYPVKD

XP_022140756.1 ervatamin-B [Momordica charantia]4.5e-16783.38Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE
        ME+Y MI NVG M LIL VFWT SMAS+  D+PPG GS+ ++DRYQ W++KYGREYKS EE E+RF IYQ NVQYIDYFNSLN SYTLA+N FADLTNDE
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE

Query:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ
        FKTTYLGY TDWSPDTCF+YGNIVNLPT+VDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGI KIKTGKLVSLSEQEL+DCDVISGNQGC+GGFM KAF+
Subjt:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ

Query:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT
        FIKK G+TTE+EYPYRG+E VCNKQKVRY+S  ISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGG+FSG CGKQLNHGVT VGYGE   K+
Subjt:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT

Query:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        YWLVKNSWGT WGE GYVRMK +S DKRGTCGIAM+ASYP+KD
Subjt:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

XP_023513224.1 ervatamin-B-like [Cucurbita pepo subsp. pepo]6.3e-16180.76Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE
        ME+YK IWN+GL SLILWV  TPSMASM  DSP    SN L+DRY+ WMNK+ REYKSREE ERRF +YQLNVQYID FNSLNHSYTLAEN+FADLTNDE
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE

Query:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ
        FKTTYLGYQT   PDTCFRY ++ +LPT VDWR E AVTPIKDQGQCGSCWAFSAVAAVEGI+KI+TGKL SLSEQELVDCD+ISGNQGC+GGFM KAF+
Subjt:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ

Query:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT
        +IK++GLTTEREYPYRG+EA CN QKVRY+SV ISGYEKVP N+EK LKAAVANQPVSVAIDAGGYDFQFYS G+FSG CGKQLNHGV  VGYGE  + T
Subjt:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT

Query:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        YWLVKNSWGT+WGESGY+RMKRDS DKRG CGIAMEASYP+KD
Subjt:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

XP_038902648.1 ervatamin-B-like [Benincasa hispida]4.5e-16782.27Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE
        ME+Y+MIWNVGLMSLILWV WTP+M  M MD PPG  S  L+ RYQ WM+KYGR+YKSREEWERRF IYQLNVQYID FNSL+HSYTLAENN  DLTNDE
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE

Query:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ
        FK TYLGY+TDW PDTCFRYGN+V+LPT+V+WRKEGAVTPI +QGQCG+CWAFSAVAAVEGINKIKTGKL+SLSEQELVDCDV SGNQGCNGGFM KAFQ
Subjt:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ

Query:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEAR-NK
        FIKKT LTTE EYPYRG+E+ CNKQKVR ++V ISGYEKVPANDEKSLKA VANQPVS+AIDAGGYDFQFYSGGVFSG CGKQLNHGV  VGY +A  +K
Subjt:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEAR-NK

Query:  TYWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        +YWLVKNSWGT+WGESGY+RMK DS DKRGTCGIAM ASYP+KD
Subjt:  TYWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

XP_038902939.1 ervatamin-B [Benincasa hispida]3.4e-17585.71Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE
        ME+Y+MIWNVGLMSLILWV WTP+M  M MD PPG  S  L+ RYQ WM+KYGR+YKSREEWERRF IYQLNVQYID FNSL+HSYTLAENNFADLTNDE
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE

Query:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ
        FK TYLGY+TDW PDTCFRYGN+V+LPT+V+WRKEGAVTPIK+QGQCGSCWAFSAVAAVEGINKIKTGKL+SLSEQELVDCDV SGNQGCNGGFM KAFQ
Subjt:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ

Query:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT
        FIKKTGLTTE EYPYRG+E+ CNKQKVR ++V ISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSG CGKQLNHGV  VGYG+A NK+
Subjt:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT

Query:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        YWLVKNSWGTDWGESGY+RMKRDS DKRGTCGIAM ASYP+KD
Subjt:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

TrEMBL top hitse value%identityAlignment
A0A1S3C828 ervatamin-B-like2.8e-15978.9Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPG-PGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTND
        ME+YKMIW+V L+SLILWVFWTP+  SM MD P G   S +L+DRYQ WM+KYGR+YKSREEWE+RF IYQ NVQYID FNSLNHSYTLAENNF DLTN+
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPG-PGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTND

Query:  EFKTTYLGYQTDWSP--DTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYK
        EF  TYLGY+T   P  DT FRYGN+VNLPT+VDWRKEGAVTPIK+QGQCGSCWAFSAVAAVEGINKIK GKL+SLSEQELVDCDV SGNQGCNGG+MYK
Subjt:  EFKTTYLGYQTDWSP--DTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYK

Query:  AFQFIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEAR
        AF+FIKKTGLTTE EYPY    + C+KQK +Y SV+ISGYEKVP NDEKSL+AAVA QPVSVAIDAGG DFQFYSGG+FSG CGKQLNHGV  VGYGE  
Subjt:  AFQFIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEAR

Query:  NKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        N+ YWLVKNSWGT WGESGY+RM RDS DK+GTCGIAM ASYP+KD
Subjt:  NKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

A0A384S0D9 Cysteine proteinase 1 (Fragment)2.2e-16487.15Show/hide
Query:  MASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGYQTDWSPDTCFRYGNIV
        MASM MD  PG  S  L+DRYQ WM+KYGREYKSREEWE+RF IYQLNVQYID FNSLNHSYTLAEN+FADLTNDEFKTTYLG++TDW PDT FRYGN+V
Subjt:  MASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGYQTDWSPDTCFRYGNIV

Query:  NLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKTGLTTEREYPYRGMEAVCNK
        NLPT+VDWRKE AVTP+KDQGQCGSCWAFSAVAAVEGINKIKTGKL+SLSEQELVDCDV SGNQGCNGG+MYKAF+FIKKTGLTTE EYPYRG+E+VCNK
Subjt:  NLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKTGLTTEREYPYRGMEAVCNK

Query:  QKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDS
        QKVRY +V ISGYEKVP NDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSG CGKQLNHGV  VGYGEA NKTYWLVKNSWGTDWGESGY+RMKRDS
Subjt:  QKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDS

Query:  RDKRGTCGIAMEASYPVKD
         DKRGTCGIAM ASYP+KD
Subjt:  RDKRGTCGIAMEASYPVKD

A0A5A7SQK0 Ervatamin-B-like2.8e-15978.9Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPG-PGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTND
        ME+YKMIW+V L+SLILWVFWTP+  SM MD P G   S +L+DRYQ WM+KYGR+YKSREEWE+RF IYQ NVQYID FNSLNHSYTLAENNF DLTN+
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPG-PGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTND

Query:  EFKTTYLGYQTDWSP--DTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYK
        EF  TYLGY+T   P  DT FRYGN+VNLPT+VDWRKEGAVTPIK+QGQCGSCWAFSAVAAVEGINKIK GKL+SLSEQELVDCDV SGNQGCNGG+MYK
Subjt:  EFKTTYLGYQTDWSP--DTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYK

Query:  AFQFIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEAR
        AF+FIKKTGLTTE EYPY    + C+KQK +Y SV+ISGYEKVP NDEKSL+AAVA QPVSVAIDAGG DFQFYSGG+FSG CGKQLNHGV  VGYGE  
Subjt:  AFQFIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEAR

Query:  NKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        N+ YWLVKNSWGT WGESGY+RM RDS DK+GTCGIAM ASYP+KD
Subjt:  NKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

A0A6J1CH04 ervatamin-B2.2e-16783.38Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE
        ME+Y MI NVG M LIL VFWT SMAS+  D+PPG GS+ ++DRYQ W++KYGREYKS EE E+RF IYQ NVQYIDYFNSLN SYTLA+N FADLTNDE
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE

Query:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ
        FKTTYLGY TDWSPDTCF+YGNIVNLPT+VDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGI KIKTGKLVSLSEQEL+DCDVISGNQGC+GGFM KAF+
Subjt:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ

Query:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT
        FIKK G+TTE+EYPYRG+E VCNKQKVRY+S  ISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGG+FSG CGKQLNHGVT VGYGE   K+
Subjt:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT

Query:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        YWLVKNSWGT WGE GYVRMK +S DKRGTCGIAM+ASYP+KD
Subjt:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

A0A6J1FYZ3 ervatamin-B-like5.2e-16179.88Show/hide
Query:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE
        ME+YK IWN+GL SLILW+  TPSMASM  DSP    SN L+DRY+ WMNK+ REYKSREE ERRF +YQLNVQYID FNSLNHSYTLAEN+FADLTNDE
Subjt:  MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDE

Query:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ
        FKTTYLGYQT   PDTCFRY ++++LPT VDWR E AVTP+KDQGQCGSCWAFSAVAAVEGI+KI+TGKL SLSEQELVDCD+ISGNQGC+GGFM KAF+
Subjt:  FKTTYLGYQTDWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQ

Query:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT
        +IK++GLTTEREYPYRG+EA CN QKVRY+SV ISGYEKVP N+EK LKAAVA+QPVSVAIDAGGYDFQFYS G+FSG CGKQLNHGV  VGYGE  + T
Subjt:  FIKKTGLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT

Query:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD
        YWLVKNSWGT+WGESGY+RMKRDS DKRG CGIAMEASYP KD
Subjt:  YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVKD

SwissProt top hitse value%identityAlignment
A2XQE8 Senescence-specific cysteine protease SAG395.8e-9358.36Show/hide
Query:  RYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG---YQTDWSPDTCFRYG--NIVNLPTDVDWRKEGAV
        R++ WM +YGR Y+   E  RRF +++ NV +I+ FN+ NH++ L  N FADLTNDEF+ T        +     T FRY   NI  LP  VDWR +GAV
Subjt:  RYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG---YQTDWSPDTCFRYG--NIVNLPTDVDWRKEGAV

Query:  TPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQF-IKKTGLTTEREYPYRGMEAVCNKQKVRYNSVA-ISG
        TPIKDQGQCG CWAFSAVAA+EGI K+ TGKL+SLSEQELVDCDV   +QGC GG M  AF+F IK  GLTTE  YPY   +  C   K   NSVA I G
Subjt:  TPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQF-IKKTGLTTEREYPYRGMEAVCNKQKVRYNSVA-ISG

Query:  YEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAM
        YE VPAN+E +L  AVANQPVSVA+D G   FQFY GGV +G CG  L+HG+ A+GYG+A + T YWL+KNSWGT WGE+G++RM++D  DKRG CG+AM
Subjt:  YEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAM

Query:  EASYP
        E SYP
Subjt:  EASYP

O65493 Cysteine protease XCP14.9e-9254.98Show/hide
Query:  SNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG-----YQTDWSPDTCFRYGNIVNLPTDVDW
        ++KL + +++WM+++ + YKS EE   RF +++ N+ +ID  N+  +SY L  N FADLT++EFK  YLG     +     P   FRY +I +LP  VDW
Subjt:  SNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG-----YQTDWSPDTCFRYGNIVNLPTDVDW

Query:  RKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQKVRYNS
        RK+GAV P+KDQGQCGSCWAFS VAAVEGIN+I TG L SLSEQEL+DCD  + N GCNGG M  AFQ+I  T GL  E +YPY   E +C +QK     
Subjt:  RKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQKVRYNS

Query:  VAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTC
        V ISGYE VP ND++SL  A+A+QPVSVAI+A G DFQFY GGVF+G CG  L+HGV AVGYG ++   Y +VKNSWG  WGE G++RMKR++    G C
Subjt:  VAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTC

Query:  GIAMEASYPVK
        GI   ASYP K
Subjt:  GIAMEASYPVK

Q7XWK5 Senescence-specific cysteine protease SAG397.6e-9358.69Show/hide
Query:  RYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEF---KTTYLGYQTDWSPDTCFRYG--NIVNLPTDVDWRKEGAV
        R++ WM +YGR Y+   E  RRF +++ NV +I+ FN+ NH++ L  N FADLTNDEF   KT      +     T FRY   NI  LP  VDWR +GAV
Subjt:  RYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEF---KTTYLGYQTDWSPDTCFRYG--NIVNLPTDVDWRKEGAV

Query:  TPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQF-IKKTGLTTEREYPYRGMEAVCNKQKVRYNSVA-ISG
        TPIKDQGQCG CWAFSAVAA+EGI K+ TGKL+SLSEQELVDCDV   +QGC GG M  AF+F IK  GLTTE  YPY   +  C   K   NSVA I G
Subjt:  TPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQF-IKKTGLTTEREYPYRGMEAVCNKQKVRYNSVA-ISG

Query:  YEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAM
        YE VPAN+E +L  AVANQPVSVA+D G   FQFY GGV +G CG  L+HG+ A+GYG+A + T YWL+KNSWGT WGE+G++RM++D  DKRG CG+AM
Subjt:  YEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAM

Query:  EASYP
        E SYP
Subjt:  EASYP

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP14.9e-9255.56Show/hide
Query:  NKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG--------YQTDWSPDTCFRYGNIVNLPTDV
        N L + Y+ W + +    +S EE  +RF +++ NV++I   N  + SY L  N F D+T++EF+ TY G        +Q +      F Y N+  LPT V
Subjt:  NKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG--------YQTDWSPDTCFRYGNIVNLPTDV

Query:  DWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIK-KTGLTTEREYPYRGMEAVCNKQKVRY
        DWRK GAVTP+K+QGQCGSCWAFS V AVEGIN+I+T KL SLSEQELVDCD  + NQGCNGG M  AF+FIK K GLT+E  YPY+  +  C+  K   
Subjt:  DWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIK-KTGLTTEREYPYRGMEAVCNKQKVRY

Query:  NSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKR
          V+I G+E VP N E  L  AVANQPVSVAIDAGG DFQFYS GVF+G CG +LNHGV  VGYG   + T YW+VKNSWG +WGE GY+RM+R  R K 
Subjt:  NSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKR

Query:  GTCGIAMEASYPVKD
        G CGIAMEASYP+K+
Subjt:  GTCGIAMEASYPVKD

Q9FJ47 Senescence-specific cysteine protease SAG123.1e-9454.43Show/hide
Query:  LKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSL--NHSYTLAENNFADLTNDEFKTTYLGY----------QTDWSPDTCFRYGNIVN--L
        ++ R+  WM K+GR Y   +E   R+ +++ NV+ I++ NS+    ++ LA N FADLTNDEF++ Y G+          QT  SP   FRY N+ +  L
Subjt:  LKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSL--NHSYTLAENNFADLTNDEFKTTYLGY----------QTDWSPDTCFRYGNIVN--L

Query:  PTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQ
        P  VDWRK+GAVTPIK+QG CG CWAFSAVAA+EG  +IK GKL+SLSEQ+LVDCD  + + GC GG M  AF+ IK T GLTTE  YPY+G +A CN +
Subjt:  PTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQ

Query:  KVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARN-KTYWLVKNSWGTDWGESGYVRMKRDS
        K    + +I+GYE VP NDE++L  AVA+QPVSV I+ GG+DFQFYS GVF+G C   L+H VTA+GYGE+ N   YW++KNSWGT WGESGY+R+++D 
Subjt:  KVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARN-KTYWLVKNSWGTDWGESGYVRMKRDS

Query:  RDKRGTCGIAMEASYP
        +DK+G CG+AM+ASYP
Subjt:  RDKRGTCGIAMEASYP

Arabidopsis top hitse value%identityAlignment
AT1G06260.1 Cysteine proteinases superfamily protein4.0e-9753.08Show/hide
Query:  NVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGY
        N+ L  LI +V     + S  +DS        LK R++ W+  + + Y  R+EW  RF IYQ NVQ IDY NSL+  + L +N FAD+TN EFK  +LG 
Subjt:  NVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGY

Query:  QTD------WSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFI
         T            C   GN+   P  VDWR +GAVTPI++QG+CG CWAFSAVAA+EGINKIKTG LVSLSEQ+L+DCDV + N+GC+GG M  AF+FI
Subjt:  QTD------WSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFI

Query:  KKT-GLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTY
        K   GL TE +YPY G+E  C+++K +   V I GY+KV A +E SL+ A A QPVSV IDAGG+ FQ YS GVF+  CG  LNHGVT VGYG   ++ Y
Subjt:  KKT-GLTTEREYPYRGMEAVCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTY

Query:  WLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVK
        W+VKNSWGT WGE GY+RM+R   +  G CGIAM ASYP++
Subjt:  WLVKNSWGTDWGESGYVRMKRDSRDKRGTCGIAMEASYPVK

AT3G48340.1 Cysteine proteinases superfamily protein7.8e-9355.27Show/hide
Query:  LKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGYQTD----------WSPDTCFRYGNIVNLPTDV
        L   Y  W + +    +S  E E+RF +++ NV ++   N  N SY L  N FADLT +EFK  Y G               S    + + N+  LP+ V
Subjt:  LKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGYQTD----------WSPDTCFRYGNIVNLPTDV

Query:  DWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQKVRY
        DWRK+GAVT IK+QG+CGSCWAFS VAAVEGINKIKT KLVSLSEQELVDCD    N+GCNGG M  AF+FIKK  G+TTE  YPY G++  C+  K   
Subjt:  DWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQKVRY

Query:  NSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDSRDKRG
          V I G+E VP NDE +L  AVANQPVSVAIDAG  DFQFYS GVF+G CG +LNHGV AVGYG  R K YW+V+NSWG +WGE GY++++R+  +  G
Subjt:  NSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDSRDKRG

Query:  TCGIAMEASYPVK
         CGIAMEASYP+K
Subjt:  TCGIAMEASYPVK

AT4G35350.1 xylem cysteine peptidase 13.5e-9354.98Show/hide
Query:  SNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG-----YQTDWSPDTCFRYGNIVNLPTDVDW
        ++KL + +++WM+++ + YKS EE   RF +++ N+ +ID  N+  +SY L  N FADLT++EFK  YLG     +     P   FRY +I +LP  VDW
Subjt:  SNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG-----YQTDWSPDTCFRYGNIVNLPTDVDW

Query:  RKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQKVRYNS
        RK+GAV P+KDQGQCGSCWAFS VAAVEGIN+I TG L SLSEQEL+DCD  + N GCNGG M  AFQ+I  T GL  E +YPY   E +C +QK     
Subjt:  RKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQKVRYNS

Query:  VAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTC
        V ISGYE VP ND++SL  A+A+QPVSVAI+A G DFQFY GGVF+G CG  L+HGV AVGYG ++   Y +VKNSWG  WGE G++RMKR++    G C
Subjt:  VAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGTC

Query:  GIAMEASYPVK
        GI   ASYP K
Subjt:  GIAMEASYPVK

AT5G45890.1 senescence-associated gene 122.2e-9554.43Show/hide
Query:  LKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSL--NHSYTLAENNFADLTNDEFKTTYLGY----------QTDWSPDTCFRYGNIVN--L
        ++ R+  WM K+GR Y   +E   R+ +++ NV+ I++ NS+    ++ LA N FADLTNDEF++ Y G+          QT  SP   FRY N+ +  L
Subjt:  LKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSL--NHSYTLAENNFADLTNDEFKTTYLGY----------QTDWSPDTCFRYGNIVN--L

Query:  PTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQ
        P  VDWRK+GAVTPIK+QG CG CWAFSAVAA+EG  +IK GKL+SLSEQ+LVDCD  + + GC GG M  AF+ IK T GLTTE  YPY+G +A CN +
Subjt:  PTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKT-GLTTEREYPYRGMEAVCNKQ

Query:  KVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARN-KTYWLVKNSWGTDWGESGYVRMKRDS
        K    + +I+GYE VP NDE++L  AVA+QPVSV I+ GG+DFQFYS GVF+G C   L+H VTA+GYGE+ N   YW++KNSWGT WGESGY+R+++D 
Subjt:  KVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARN-KTYWLVKNSWGTDWGESGYVRMKRDS

Query:  RDKRGTCGIAMEASYP
        +DK+G CG+AM+ASYP
Subjt:  RDKRGTCGIAMEASYP

AT5G50260.1 Cysteine proteinases superfamily protein3.5e-9355.56Show/hide
Query:  NKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG--------YQTDWSPDTCFRYGNIVNLPTDV
        N L + Y+ W + +    +S EE  +RF +++ NV++I   N  + SY L  N F D+T++EF+ TY G        +Q +      F Y N+  LPT V
Subjt:  NKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLG--------YQTDWSPDTCFRYGNIVNLPTDV

Query:  DWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIK-KTGLTTEREYPYRGMEAVCNKQKVRY
        DWRK GAVTP+K+QGQCGSCWAFS V AVEGIN+I+T KL SLSEQELVDCD  + NQGCNGG M  AF+FIK K GLT+E  YPY+  +  C+  K   
Subjt:  DWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIK-KTGLTTEREYPYRGMEAVCNKQKVRY

Query:  NSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKR
          V+I G+E VP N E  L  AVANQPVSVAIDAGG DFQFYS GVF+G CG +LNHGV  VGYG   + T YW+VKNSWG +WGE GY+RM+R  R K 
Subjt:  NSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKT-YWLVKNSWGTDWGESGYVRMKRDSRDKR

Query:  GTCGIAMEASYPVKD
        G CGIAMEASYP+K+
Subjt:  GTCGIAMEASYPVKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCATATAAAATGATTTGGAATGTGGGTTTGATGTCTCTGATTCTCTGGGTTTTCTGGACACCCTCAATGGCATCCATGCCAATGGACAGCCCTCCAGGACCTGG
CTCCAATAAATTAAAAGACAGGTACCAGAATTGGATGAATAAATACGGTCGAGAATACAAGAGCAGAGAGGAGTGGGAGCGGAGATTCGCAATTTATCAGTTGAATGTTC
AGTACATTGACTACTTCAATTCTCTGAATCATTCATATACTCTGGCAGAAAACAACTTTGCAGACCTCACAAATGATGAGTTTAAGACAACTTACCTGGGGTATCAAACT
GATTGGTCTCCTGATACATGCTTCAGATATGGAAATATTGTTAATTTGCCTACTGATGTTGACTGGAGAAAGGAAGGTGCAGTTACTCCAATAAAGGATCAAGGCCAATG
TGGGAGTTGCTGGGCGTTCTCTGCAGTAGCAGCAGTAGAAGGCATCAACAAAATAAAAACAGGCAAATTGGTGTCTCTATCAGAACAAGAGCTTGTGGACTGCGACGTCA
TCTCAGGGAACCAGGGATGCAATGGTGGTTTCATGTACAAAGCATTTCAGTTCATCAAGAAAACTGGACTCACTACAGAAAGAGAATATCCATACAGGGGAATGGAAGCT
GTATGCAACAAACAAAAAGTGAGATACAACTCTGTGGCAATAAGTGGATATGAAAAAGTACCAGCCAATGATGAGAAAAGCTTAAAAGCTGCAGTTGCTAACCAGCCAGT
CTCTGTAGCAATTGATGCAGGGGGATATGATTTTCAATTCTATTCTGGTGGAGTCTTCTCAGGGATTTGTGGAAAGCAGCTCAATCATGGAGTGACAGCAGTTGGGTATG
GGGAAGCTAGAAATAAAACTTACTGGCTTGTCAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTACGTAAGAATGAAACGTGATTCAAGGGATAAGAGAGGTACT
TGTGGCATAGCTATGGAGGCTAGCTACCCTGTCAAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCATATAAAATGATTTGGAATGTGGGTTTGATGTCTCTGATTCTCTGGGTTTTCTGGACACCCTCAATGGCATCCATGCCAATGGACAGCCCTCCAGGACCTGG
CTCCAATAAATTAAAAGACAGGTACCAGAATTGGATGAATAAATACGGTCGAGAATACAAGAGCAGAGAGGAGTGGGAGCGGAGATTCGCAATTTATCAGTTGAATGTTC
AGTACATTGACTACTTCAATTCTCTGAATCATTCATATACTCTGGCAGAAAACAACTTTGCAGACCTCACAAATGATGAGTTTAAGACAACTTACCTGGGGTATCAAACT
GATTGGTCTCCTGATACATGCTTCAGATATGGAAATATTGTTAATTTGCCTACTGATGTTGACTGGAGAAAGGAAGGTGCAGTTACTCCAATAAAGGATCAAGGCCAATG
TGGGAGTTGCTGGGCGTTCTCTGCAGTAGCAGCAGTAGAAGGCATCAACAAAATAAAAACAGGCAAATTGGTGTCTCTATCAGAACAAGAGCTTGTGGACTGCGACGTCA
TCTCAGGGAACCAGGGATGCAATGGTGGTTTCATGTACAAAGCATTTCAGTTCATCAAGAAAACTGGACTCACTACAGAAAGAGAATATCCATACAGGGGAATGGAAGCT
GTATGCAACAAACAAAAAGTGAGATACAACTCTGTGGCAATAAGTGGATATGAAAAAGTACCAGCCAATGATGAGAAAAGCTTAAAAGCTGCAGTTGCTAACCAGCCAGT
CTCTGTAGCAATTGATGCAGGGGGATATGATTTTCAATTCTATTCTGGTGGAGTCTTCTCAGGGATTTGTGGAAAGCAGCTCAATCATGGAGTGACAGCAGTTGGGTATG
GGGAAGCTAGAAATAAAACTTACTGGCTTGTCAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTACGTAAGAATGAAACGTGATTCAAGGGATAAGAGAGGTACT
TGTGGCATAGCTATGGAGGCTAGCTACCCTGTCAAAGACTGA
Protein sequenceShow/hide protein sequence
MESYKMIWNVGLMSLILWVFWTPSMASMPMDSPPGPGSNKLKDRYQNWMNKYGREYKSREEWERRFAIYQLNVQYIDYFNSLNHSYTLAENNFADLTNDEFKTTYLGYQT
DWSPDTCFRYGNIVNLPTDVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGKLVSLSEQELVDCDVISGNQGCNGGFMYKAFQFIKKTGLTTEREYPYRGMEA
VCNKQKVRYNSVAISGYEKVPANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGICGKQLNHGVTAVGYGEARNKTYWLVKNSWGTDWGESGYVRMKRDSRDKRGT
CGIAMEASYPVKD