; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021580 (gene) of Snake gourd v1 genome

Gene IDTan0021580
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCysteine proteinase 1
Genome locationLG05:7373534..7374708
RNA-Seq ExpressionTan0021580
SyntenyTan0021580
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AKO60151.1 cysteine proteinase 1, partial [Citrullus lanatus]6.1e-15682.5Show/hide
Query:  MASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTCFRYGSIV
        MAS+  D  PGS S  L+DRYQKWM+KYGREYKSREEWE+RF IYQLNVQYID FNSLNHSYTLAEN+FADL NDEFKTTYLG++  W PDT FRYG++V
Subjt:  MASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTCFRYGSIV

Query:  NLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKNGITTEREYPYRGIDDVCNK
        NLPTNVDWRK+ AVTP+KDQGQCGSCWAFSAVAAVEGIN+IKTG L+SLSEQELVDCD+ SGNQGCNGG+M KAF+FIKK G+TTE EYPYRGI+ VCNK
Subjt:  NLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKNGITTEREYPYRGIDDVCNK

Query:  EKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLG
        +K+R  +VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVF G CG+QLNHGVAIVGY G+ASN++YWLVKNSWGTDWGESGYIRMK  
Subjt:  EKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLG

Query:  SSDKRGTCGIAMEASYPVKD
        S+DKRGTCGIAM ASYP+KD
Subjt:  SSDKRGTCGIAMEASYPVKD

XP_022140756.1 ervatamin-B [Momordica charantia]7.7e-15978.49Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        ME+Y MI N+G M+LIL VFWT SMAS+A+D  PG GS  ++DRYQKW++KYGREYKS EE E+RF IYQ NVQYID FNSLN SYTLA+N FADL NDE
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ
        FKTTYLGY   W PDTCF+YG+IVNLPTNVDWRK+GAVTPIKDQGQCGSCWAFSAVAAVEGI +IKTG L+SLSEQEL+DCD+ SGNQGC+GGFM KAF+
Subjt:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ

Query:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE
        FIKK GITTE+EYPYRG+++VCNK+K+R HS TISGYEKVP NDEKSLKAAVANQPVSVAIDAGGYDFQFYSGG+F G CG+QLNHGV IVGY G+   +
Subjt:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE

Query:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        SYWLVKNSWGT WGE GY+RMK  SSDKRGTCGIAM+ASYP+KD
Subjt:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

XP_023513224.1 ervatamin-B-like [Cucurbita pepo subsp. pepo]4.8e-15376.74Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        ME+Y+ IWN+G   LILWV  TPSMAS+A D    S S GL+DRY+KWMNK+ REYKSREE ERRF +YQLNVQYID FNSLNHSYTLAEN+FADL NDE
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ
        FKTTYLGYQ    PDTCFRY  + +LPT+VDWR + AVTPIKDQGQCGSCWAFSAVAAVEGI++I+TG L SLSEQELVDCDI SGNQGC+GGFMNKAF+
Subjt:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ

Query:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE
        +IK++G+TTEREYPYRGI+  CN +K+R HSVTISGYEKVP+N+EK LKAAVANQPVSVAIDAGGYDFQFYS G+F G CG+QLNHGVAIVGY G+  + 
Subjt:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE

Query:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        +YWLVKNSWGT+WGESGYIRMK  S DKRG CGIAMEASYP+KD
Subjt:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

XP_038902648.1 ervatamin-B-like [Benincasa hispida]3.7e-16178.2Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        ME+YRMIWN+G M LILWV WTP+M  +A D  PGS S  L+ RYQKWM+KYGR+YKSREEWERRF IYQLNVQYID FNSL+HSYTLAENN  DL NDE
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ
        FK TYLGY+  W PDTCFRYG++V+LPTNV+WRK+GAVTPI +QGQCG+CWAFSAVAAVEGIN+IKTG L+SLSEQELVDCD+ SGNQGCNGGFM+KAFQ
Subjt:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ

Query:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE
        FIKK  +TTE EYPYRGI+  CNK+K+R H+V ISGYEKVP NDEKSLKA VANQPVS+AIDAGGYDFQFYSGGVF G CG+QLNHGVAIVGY     ++
Subjt:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE

Query:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        SYWLVKNSWGT+WGESGYIRMK  S+DKRGTCGIAM ASYP+KD
Subjt:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

XP_038902939.1 ervatamin-B [Benincasa hispida]1.5e-16781.69Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        ME+YRMIWN+G M LILWV WTP+M  +A D  PGS S  L+ RYQKWM+KYGR+YKSREEWERRF IYQLNVQYID FNSL+HSYTLAENNFADL NDE
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ
        FK TYLGY+  W PDTCFRYG++V+LPTNV+WRK+GAVTPIK+QGQCGSCWAFSAVAAVEGIN+IKTG L+SLSEQELVDCD+ SGNQGCNGGFM+KAFQ
Subjt:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ

Query:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE
        FIKK G+TTE EYPYRGI+  CNK+K+R H+V ISGYEKVP NDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVF G CG+QLNHGVAIVGY G+ASN+
Subjt:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE

Query:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        SYWLVKNSWGTDWGESGYIRMK  S+DKRGTCGIAM ASYP+KD
Subjt:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

TrEMBL top hitse value%identityAlignment
A0A0A0LJV6 Uncharacterized protein4.9e-15175.37Show/hide
Query:  MIW-NLGSMFLILWVFWTPSMASIAKDCNPGSG-STGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKT
        M W N+  +FLILWVFWTP + S+A D + GS  S+ ++DRYQKWM+KYGR+YKSREEWERRF IYQ NVQYID FNS+NHS+TLAENNFADL N+EFK 
Subjt:  MIW-NLGSMFLILWVFWTPSMASIAKDCNPGSG-STGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKT

Query:  TYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK
        TYLGY+    PDTCFRYG++VNLPTNVDWR++GAVTPIK+QGQCGSCWAFSAVAAVEGIN+IK G L+SLSEQELVDCD+TSGNQGCNGG+M KAF+FIK
Subjt:  TYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK

Query:  KNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYW
        + G+TTE EYPY+G +  CN++K +   V+ISGYEKVPVNDEKSLKAAVANQPVSVAIDA G +FQFYSGG+F G CG QLNHGVAIVGY G+ SN++YW
Subjt:  KNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYW

Query:  LVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        LVKNSWGTDWGESGYIRMK  S+D++GTCGIAM ASYP KD
Subjt:  LVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

A0A384S0D9 Cysteine proteinase 1 (Fragment)2.9e-15682.5Show/hide
Query:  MASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTCFRYGSIV
        MAS+  D  PGS S  L+DRYQKWM+KYGREYKSREEWE+RF IYQLNVQYID FNSLNHSYTLAEN+FADL NDEFKTTYLG++  W PDT FRYG++V
Subjt:  MASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTCFRYGSIV

Query:  NLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKNGITTEREYPYRGIDDVCNK
        NLPTNVDWRK+ AVTP+KDQGQCGSCWAFSAVAAVEGIN+IKTG L+SLSEQELVDCD+ SGNQGCNGG+M KAF+FIKK G+TTE EYPYRGI+ VCNK
Subjt:  NLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKNGITTEREYPYRGIDDVCNK

Query:  EKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLG
        +K+R  +VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVF G CG+QLNHGVAIVGY G+ASN++YWLVKNSWGTDWGESGYIRMK  
Subjt:  EKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLG

Query:  SSDKRGTCGIAMEASYPVKD
        S+DKRGTCGIAM ASYP+KD
Subjt:  SSDKRGTCGIAMEASYPVKD

A0A6J1CH04 ervatamin-B3.7e-15978.49Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        ME+Y MI N+G M+LIL VFWT SMAS+A+D  PG GS  ++DRYQKW++KYGREYKS EE E+RF IYQ NVQYID FNSLN SYTLA+N FADL NDE
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ
        FKTTYLGY   W PDTCF+YG+IVNLPTNVDWRK+GAVTPIKDQGQCGSCWAFSAVAAVEGI +IKTG L+SLSEQEL+DCD+ SGNQGC+GGFM KAF+
Subjt:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ

Query:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE
        FIKK GITTE+EYPYRG+++VCNK+K+R HS TISGYEKVP NDEKSLKAAVANQPVSVAIDAGGYDFQFYSGG+F G CG+QLNHGV IVGY G+   +
Subjt:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE

Query:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        SYWLVKNSWGT WGE GY+RMK  SSDKRGTCGIAM+ASYP+KD
Subjt:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

A0A6J1FYZ3 ervatamin-B-like6.8e-15375.87Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        ME+Y+ IWN+G   LILW+  TPSMAS+A D    S S GL+DRY+KWMNK+ REYKSREE ERRF +YQLNVQYID FNSLNHSYTLAEN+FADL NDE
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ
        FKTTYLGYQ    PDTCFRY  +++LPT+VDWR + AVTP+KDQGQCGSCWAFSAVAAVEGI++I+TG L SLSEQELVDCDI SGNQGC+GGFMNKAF+
Subjt:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ

Query:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE
        +IK++G+TTEREYPYRGI+  CN +K+R HSVTISGYEKVP N+EK LKAAVA+QPVSVAIDAGGYDFQFYS G+F G CG+QLNHGVAIVGY G+  + 
Subjt:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE

Query:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        +YWLVKNSWGT+WGESGYIRMK  S DKRG CGIAMEASYP KD
Subjt:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

A0A6J1J793 ervatamin-B1.7e-15175.29Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        ME+Y+ IWN+G   LILWV  TPSMAS+A D    S S GL+DRY+KWMNK+ REYKSREE ERRF +YQLNVQYID FNSLNHSYTLAEN+FADL NDE
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ
        FK TYLGYQ     DTCFRY  +++LP +VDWR + AVTP+KDQGQCGSCWAFSAVAAVEGI++I+TG L SLSEQELVDCDIT GNQGC+GGFMNKAF+
Subjt:  FKTTYLGYQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQ

Query:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE
        +IK++G+TTEREYPYRGI+  CN +K+R HSVTISGYEKVP+N+EK LKAAVA+QPVSVAIDAGGYDFQFYS G+F G CG+QLNHGVAIVGY G+  + 
Subjt:  FIKKNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNE

Query:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        +YWLVKNSWGT+WGESGYIRMK  S DKRG CGIAMEASYP+KD
Subjt:  SYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

SwissProt top hitse value%identityAlignment
A2XQE8 Senescence-specific cysteine protease SAG391.3e-9254.92Show/hide
Query:  SGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTC-----FRYG--SIVNLPT
        S    +  R+++WM +YGR Y+   E  RRFE+++ NV +I+ FN+ NH++ L  N FADL NDEF+ T      G+ P T      FRY   +I  LP 
Subjt:  SGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTC-----FRYG--SIVNLPT

Query:  NVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTEREYPYRGIDDVCNKEKM
         VDWR  GAVTPIKDQGQCG CWAFSAVAA+EGI ++ TG L+SLSEQELVDCD+   +QGC GG M+ AF+FI KN G+TTE  YPY   DD C   K 
Subjt:  NVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTEREYPYRGIDDVCNKEKM

Query:  RCHSV-TISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSS
          +SV +I GYE VP N+E +L  AVANQPVSVA+D G   FQFY GGV  G CG  L+HG+  +GYG  +    YWL+KNSWGT WGE+G++RM+   S
Subjt:  RCHSV-TISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSS

Query:  DKRGTCGIAMEASYP
        DKRG CG+AME SYP
Subjt:  DKRGTCGIAMEASYP

O65039 Vignain1.2e-9355.84Show/hide
Query:  YQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDWRKDG
        Y++W + +    +S  E ++RF +++ N  ++   N ++  Y L  N FAD+ N EF+ TY G        ++ G   +  F Y  +  +P +VDWRK G
Subjt:  YQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDWRKDG

Query:  AVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHSVTIS
        AVT +KDQGQCGSCWAFS + AVEGINQIKT  L+SLSEQELVDCD T  NQGCNGG M+ AF+FIK + GITTE  YPY   D  C+  K    +V+I 
Subjt:  AVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHSVTIS

Query:  GYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIA
        G+E VP NDE +L  AVANQPVSVAIDAGG DFQFYS GVF G CG +L+HGVAIVGYG       YW VKNSWG +WGE GYIRM+ G SDK G CGIA
Subjt:  GYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIA

Query:  MEASYPVK
        MEASYP+K
Subjt:  MEASYPVK

P12412 Vignain1.7e-9250.71Show/hide
Query:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE
        M   +++W + S+ L+L V    S     KD         L D Y++W + +    +S  E  +RF +++ NV ++   N ++  Y L  N FAD+ N E
Subjt:  MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDE

Query:  FKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNG
        F++TY G        ++        F Y  + ++P +VDWRK GAVT +KDQGQCGSCWAFS + AVEGINQIKT  L+SLSEQELVDCD    NQGCNG
Subjt:  FKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNG

Query:  GFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIV
        G M  AF+FIK K GITTE  YPY   +  C++ K+   +V+I G+E VPVNDE +L  AVANQPVSVAIDAGG DFQFYS GVF G C   LNHGVAIV
Subjt:  GFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIV

Query:  GYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
        GYG      +YW+V+NSWG +WGE GYIRM+   S K G CGIAM ASYP+K+
Subjt:  GYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

Q7XWK5 Senescence-specific cysteine protease SAG391.3e-9254.6Show/hide
Query:  SGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTC-----FRYG--SIVNLPT
        S    +  R+++WM +YGR Y+   E  RRFE+++ NV +I+ FN+ NH++ L  N FADL NDEF+  ++    G+ P T      FRY   +I  LP 
Subjt:  SGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWFPDTC-----FRYG--SIVNLPT

Query:  NVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTEREYPYRGIDDVCNKEKM
         VDWR  GAVTPIKDQGQCG CWAFSAVAA+EGI ++ TG L+SLSEQELVDCD+   +QGC GG M+ AF+FI KN G+TTE  YPY   DD C   K 
Subjt:  NVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTEREYPYRGIDDVCNKEKM

Query:  RCHSV-TISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSS
          +SV +I GYE VP N+E +L  AVANQPVSVA+D G   FQFY GGV  G CG  L+HG+  +GYG  +    YWL+KNSWGT WGE+G++RM+   S
Subjt:  RCHSV-TISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSS

Query:  DKRGTCGIAMEASYP
        DKRG CG+AME SYP
Subjt:  DKRGTCGIAMEASYP

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP13.4e-9355.27Show/hide
Query:  LKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDW
        L + Y++W + +    +S EE  +RF +++ NV++I   N  + SY L  N F D+ ++EF+ TY G        +Q        F Y ++  LPT+VDW
Subjt:  LKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDW

Query:  RKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHS
        RK+GAVTP+K+QGQCGSCWAFS V AVEGINQI+T  L SLSEQELVDCD T+ NQGCNGG M+ AF+FIK K G+T+E  YPY+  D+ C+  K     
Subjt:  RKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHS

Query:  VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGT
        V+I G+E VP N E  L  AVANQPVSVAIDAGG DFQFYS GVF G+CG +LNHGVA+VGYG       YW+VKNSWG +WGE GYIRM+ G   K G 
Subjt:  VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGT

Query:  CGIAMEASYPVKD
        CGIAMEASYP+K+
Subjt:  CGIAMEASYPVKD

Arabidopsis top hitse value%identityAlignment
AT1G06260.1 Cysteine proteinases superfamily protein1.4e-9452.05Show/hide
Query:  NLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGY
        NL    LI +V     + S+  D +       LK R++KW+  + + Y  R+EW  RF IYQ NVQ ID  NSL+  + L +N FAD+ N EFK  +LG 
Subjt:  NLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGY

Query:  QNGWF------PDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFI
                      C   G   N+P  VDWR  GAVTPI++QG+CG CWAFSAVAA+EGIN+IKTG L+SLSEQ+L+DCD+ + N+GC+GG M  AF+FI
Subjt:  QNGWF------PDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFI

Query:  KKN-GITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNES
        K N G+ TE +YPY GI+  C++EK +   VTI GY+KV  N E SL+ A A QPVSV IDAGG+ FQ YS GVF   CG  LNHGV +VGYG +  ++ 
Subjt:  KKN-GITTEREYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNES

Query:  YWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVK
        YW+VKNSWGT WGE GYIRM+ G S+  G CGIAM ASYP++
Subjt:  YWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGIAMEASYPVK

AT1G47128.1 Granulin repeat cysteine protease family protein3.1e-8950.45Show/hide
Query:  MASIAKDCNPGSGSTGLKDR------YQKWMNKYGR--EYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--YQNGWFP
        M+ I+ D   G  +TG +        Y+ W+ K+G+     S  E +RRFEI++ N++++D  N  N SY L    FADL NDE+++ YLG   +     
Subjt:  MASIAKDCNPGSGSTGLKDR------YQKWMNKYGR--EYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--YQNGWFP

Query:  DTCFRYGSIV--NLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTER
         T  RY + V   LP ++DWRK GAV  +KDQG CGSCWAFS + AVEGINQI TG L++LSEQELVDCD TS N+GCNGG M+ AF+FI KN GI T++
Subjt:  DTCFRYGSIV--NLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTER

Query:  EYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGT
        +YPY+G+D  C++ +     VTI  YE VP   E+SLK AVA+QP+S+AI+AGG  FQ Y  G+F G CG QL+HGV  VGYG + + + YW+V+NSWG 
Subjt:  EYPYRGIDDVCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGT

Query:  DWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD
         WGESGY+RM    +   G CGIA+E SYP+K+
Subjt:  DWGESGYIRMKLGSSDKRGTCGIAMEASYPVKD

AT3G48340.1 Cysteine proteinases superfamily protein4.7e-9055.56Show/hide
Query:  GLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYG--SIVNLPTN
        GL   Y +W + +    +S  E E+RF +++ NV ++   N  N SY L  N FADL  +EFK  Y G         Q        F Y   ++  LP++
Subjt:  GLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYG--SIVNLPTN

Query:  VDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTEREYPYRGIDDVCNKEKMR
        VDWRK GAVT IK+QG+CGSCWAFS VAAVEGIN+IKT  L+SLSEQELVDCD T  N+GCNGG M  AF+FIKKN GITTE  YPY GID  C+  K  
Subjt:  VDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKN-GITTEREYPYRGIDDVCNKEKMR

Query:  CHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDK
           VTI G+E VP NDE +L  AVANQPVSVAIDAG  DFQFYS GVF G CG +LNHGVA VGYG +   + YW+V+NSWG +WGE GYI+++    + 
Subjt:  CHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDK

Query:  RGTCGIAMEASYPVK
         G CGIAMEASYP+K
Subjt:  RGTCGIAMEASYPVK

AT4G35350.1 xylem cysteine peptidase 11.5e-9155.34Show/hide
Query:  LKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWF-----PDTCFRYGSIVNLPTNVDWRKD
        L + ++ WM+++ + YKS EE   RFE+++ N+ +ID  N+  +SY L  N FADL ++EFK  YLG     F     P   FRY  I +LP +VDWRK 
Subjt:  LKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQNGWF-----PDTCFRYGSIVNLPTNVDWRKD

Query:  GAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQF-IKKNGITTEREYPYRGIDDVCNKEKMRCHSVTI
        GAV P+KDQGQCGSCWAFS VAAVEGINQI TG L SLSEQEL+DCD T+ N GCNGG M+ AFQ+ I   G+  E +YPY   + +C ++K     VTI
Subjt:  GAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQF-IKKNGITTEREYPYRGIDDVCNKEKMRCHSVTI

Query:  SGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGI
        SGYE VP ND++SL  A+A+QPVSVAI+A G DFQFY GGVF GKCG  L+HGVA VGYG    ++ Y +VKNSWG  WGE G+IRMK  +    G CGI
Subjt:  SGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGTCGI

Query:  AMEASYPVK
           ASYP K
Subjt:  AMEASYPVK

AT5G50260.1 Cysteine proteinases superfamily protein2.4e-9455.27Show/hide
Query:  LKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDW
        L + Y++W + +    +S EE  +RF +++ NV++I   N  + SY L  N F D+ ++EF+ TY G        +Q        F Y ++  LPT+VDW
Subjt:  LKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLG--------YQNGWFPDTCFRYGSIVNLPTNVDW

Query:  RKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHS
        RK+GAVTP+K+QGQCGSCWAFS V AVEGINQI+T  L SLSEQELVDCD T+ NQGCNGG M+ AF+FIK K G+T+E  YPY+  D+ C+  K     
Subjt:  RKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIK-KNGITTEREYPYRGIDDVCNKEKMRCHS

Query:  VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGT
        V+I G+E VP N E  L  AVANQPVSVAIDAGG DFQFYS GVF G+CG +LNHGVA+VGYG       YW+VKNSWG +WGE GYIRM+ G   K G 
Subjt:  VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRGT

Query:  CGIAMEASYPVKD
        CGIAMEASYP+K+
Subjt:  CGIAMEASYPVKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCATACAGAATGATTTGGAATCTGGGTTCGATGTTTCTGATTCTCTGGGTTTTCTGGACACCCTCAATGGCATCCATAGCAAAGGACTGCAATCCAGGATCTGG
TTCCACTGGCTTAAAAGACAGGTACCAGAAATGGATGAATAAATACGGTCGAGAATACAAGAGCAGAGAGGAGTGGGAGCGGAGATTCGAAATTTATCAGTTGAATGTTC
AGTACATTGACCTCTTTAATTCTCTGAATCATTCCTATACTCTGGCTGAAAATAACTTTGCAGACCTCAGAAATGATGAGTTTAAGACAACTTACTTGGGGTATCAAAAT
GGTTGGTTTCCTGATACATGCTTCAGATATGGAAGTATTGTTAATTTGCCTACTAATGTTGACTGGAGAAAGGATGGTGCAGTTACTCCAATAAAGGATCAAGGCCAATG
TGGGAGTTGCTGGGCGTTCTCTGCAGTAGCAGCTGTGGAAGGCATCAACCAAATAAAAACAGGCACATTGCTGTCTCTATCAGAACAAGAGCTTGTGGACTGCGACATCA
CCTCGGGGAACCAGGGATGCAATGGTGGTTTCATGAACAAAGCATTTCAGTTCATCAAGAAAAATGGAATCACTACAGAAAGAGAGTATCCATACAGAGGAATTGATGAT
GTATGCAACAAAGAAAAAATGAGATGCCACTCTGTGACAATAAGTGGATATGAAAAAGTACCTGTCAATGATGAGAAAAGCTTAAAAGCTGCAGTTGCGAACCAGCCAGT
CTCTGTAGCAATTGATGCAGGGGGATATGATTTCCAGTTCTATTCTGGTGGAGTCTTCCAAGGGAAATGTGGACAGCAACTCAATCATGGAGTGGCAATAGTAGGGTATG
GGGGGAAAGCTAGCAATGAATCTTACTGGCTTGTCAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTACATAAGAATGAAACTTGGTTCAAGTGATAAGCGGGGT
ACTTGTGGCATAGCTATGGAGGCTAGCTACCCCGTCAAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCATACAGAATGATTTGGAATCTGGGTTCGATGTTTCTGATTCTCTGGGTTTTCTGGACACCCTCAATGGCATCCATAGCAAAGGACTGCAATCCAGGATCTGG
TTCCACTGGCTTAAAAGACAGGTACCAGAAATGGATGAATAAATACGGTCGAGAATACAAGAGCAGAGAGGAGTGGGAGCGGAGATTCGAAATTTATCAGTTGAATGTTC
AGTACATTGACCTCTTTAATTCTCTGAATCATTCCTATACTCTGGCTGAAAATAACTTTGCAGACCTCAGAAATGATGAGTTTAAGACAACTTACTTGGGGTATCAAAAT
GGTTGGTTTCCTGATACATGCTTCAGATATGGAAGTATTGTTAATTTGCCTACTAATGTTGACTGGAGAAAGGATGGTGCAGTTACTCCAATAAAGGATCAAGGCCAATG
TGGGAGTTGCTGGGCGTTCTCTGCAGTAGCAGCTGTGGAAGGCATCAACCAAATAAAAACAGGCACATTGCTGTCTCTATCAGAACAAGAGCTTGTGGACTGCGACATCA
CCTCGGGGAACCAGGGATGCAATGGTGGTTTCATGAACAAAGCATTTCAGTTCATCAAGAAAAATGGAATCACTACAGAAAGAGAGTATCCATACAGAGGAATTGATGAT
GTATGCAACAAAGAAAAAATGAGATGCCACTCTGTGACAATAAGTGGATATGAAAAAGTACCTGTCAATGATGAGAAAAGCTTAAAAGCTGCAGTTGCGAACCAGCCAGT
CTCTGTAGCAATTGATGCAGGGGGATATGATTTCCAGTTCTATTCTGGTGGAGTCTTCCAAGGGAAATGTGGACAGCAACTCAATCATGGAGTGGCAATAGTAGGGTATG
GGGGGAAAGCTAGCAATGAATCTTACTGGCTTGTCAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTACATAAGAATGAAACTTGGTTCAAGTGATAAGCGGGGT
ACTTGTGGCATAGCTATGGAGGCTAGCTACCCCGTCAAAGACTGA
Protein sequenceShow/hide protein sequence
MESYRMIWNLGSMFLILWVFWTPSMASIAKDCNPGSGSTGLKDRYQKWMNKYGREYKSREEWERRFEIYQLNVQYIDLFNSLNHSYTLAENNFADLRNDEFKTTYLGYQN
GWFPDTCFRYGSIVNLPTNVDWRKDGAVTPIKDQGQCGSCWAFSAVAAVEGINQIKTGTLLSLSEQELVDCDITSGNQGCNGGFMNKAFQFIKKNGITTEREYPYRGIDD
VCNKEKMRCHSVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFQGKCGQQLNHGVAIVGYGGKASNESYWLVKNSWGTDWGESGYIRMKLGSSDKRG
TCGIAMEASYPVKD