; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021335 (gene) of Chayote v1 genome

Gene IDSed0021335
OrganismSechium edule (Chayote v1)
Descriptionervatamin-B
Genome locationLG05:7515931..7517581
RNA-Seq ExpressionSed0021335
SyntenySed0021335
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN62053.1 hypothetical protein Csa_006706 [Cucumis sativus]6.1e-13771.47Show/hide
Query:  MIW-NVVLMFLILCVFWTPS--SMASD-------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFME
        M W NV L+FLIL VFWTP   SMA D       S  ++DRY++W+DKYGR+Y S EE E RF IYQ+N +YID FNS+NHS+TLAEN+FADLTN+EF  
Subjt:  MIW-NVVLMFLILCVFWTPS--SMASD-------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFME

Query:  TYLGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA
        TYLGY++V  PDTCFRYG+  NL   VDWR++GAVTP+KNQG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GGYMYKAFEFI 
Subjt:  TYLGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA

Query:  RTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWL
        RTGL TE EYPY+G E  CN+ K K   V+ISGYEKVP NDEKSL+AAVANQPVSVAIDA G+N QFYS GIFS +CG +LNHGV +VGYGE SN  YWL
Subjt:  RTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWL

Query:  VKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        VKNSWGT WGESGYI+++RDS DR+GTCGIAM ASYP KD
Subjt:  VKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

XP_004148072.2 ervatamin-B [Cucumis sativus]6.1e-13771.47Show/hide
Query:  MIW-NVVLMFLILCVFWTPS--SMASD-------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFME
        M W NV L+FLIL VFWTP   SMA D       S  ++DRY++W+DKYGR+Y S EE E RF IYQ+N +YID FNS+NHS+TLAEN+FADLTN+EF  
Subjt:  MIW-NVVLMFLILCVFWTPS--SMASD-------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFME

Query:  TYLGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA
        TYLGY++V  PDTCFRYG+  NL   VDWR++GAVTP+KNQG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GGYMYKAFEFI 
Subjt:  TYLGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA

Query:  RTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWL
        RTGL TE EYPY+G E  CN+ K K   V+ISGYEKVP NDEKSL+AAVANQPVSVAIDA G+N QFYS GIFS +CG +LNHGV +VGYGE SN  YWL
Subjt:  RTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWL

Query:  VKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        VKNSWGT WGESGYI+++RDS DR+GTCGIAM ASYP KD
Subjt:  VKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

XP_008458487.1 PREDICTED: ervatamin-B-like [Cucumis melo]7.9e-13770.67Show/hide
Query:  MIWNVVLMFLILCVFWTPS---------SMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMET
        MIW+V L+ LIL VFWTP+         S +S+SG L+DRY++W+DKYGR+Y S EE E RF IYQ+N +YID FNSLNHSYTLAEN+F DLTN+EFM T
Subjt:  MIWNVVLMFLILCVFWTPS---------SMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMET

Query:  YLGYQSVCPP--DTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFI
        YLGY++V  P  DT FRYG+  NL   VDWRK+GAVTP+KNQG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GGYMYKAFEFI
Subjt:  YLGYQSVCPP--DTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFI

Query:  ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYW
         +TGL TE EYPY      C++ K K  +V+ISGYEKVP NDEKSLQAAVA QPVSVAIDAGG++ QFYS GIFS +CGK+LNHGV +VGYGEDSN  YW
Subjt:  ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYW

Query:  LVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        LVKNSWGT WGESGYI++ RDS D++GTCGIAM ASYP+KD
Subjt:  LVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

XP_022140756.1 ervatamin-B [Momordica charantia]2.5e-13871.6Show/hide
Query:  MIWNVVLMFLILCVFWTPS--SMASD------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETY
        MI NV  M+LILCVFWT S  S+A D      S  ++DRY++WIDKYGREY SGEE+E RF IYQSN +YIDYFNSLN SYTLA+N FADLTNDEF  TY
Subjt:  MIWNVVLMFLILCVFWTPS--SMASD------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETY

Query:  LGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART
        LGY +   PDTCF+YG+  NL   VDWRK+GAVTP+K+QG CGSCWAFSAVAAVEGI KIK G LVSLSEQEL+DCD  S NQGCSGG+M KAFEFI + 
Subjt:  LGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART

Query:  GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVK
        G+ TEKEYPYRG+E  CN+ KV+ H+ TISGYEKVPANDEKSL+AAVANQPVSVAIDAGG + QFYS GIFS +CGK+LNHGVT+VGYGED    YWLVK
Subjt:  GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVK

Query:  NSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        NSWGT WGE GY++++ +S+D+RGTCGIAM+ASYP+KD
Subjt:  NSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

XP_038902939.1 ervatamin-B [Benincasa hispida]4.2e-13871.3Show/hide
Query:  MIWNVVLMFLILCVFWTPSSM--------ASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETY
        MIWNV LM LIL V WTP+ +         S SG L+ RY++W+ KYGR+Y S EE E RF IYQ N +YID FNSL+HSYTLAEN+FADLTNDEF ETY
Subjt:  MIWNVVLMFLILCVFWTPSSM--------ASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETY

Query:  LGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART
        LGY++   PDTCFRYG+  +L   V+WRK+GAVTP+KNQG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GG+M KAF+FI +T
Subjt:  LGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART

Query:  GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVK
        GL TE EYPYRGIE  CN+ KV+ HTV ISGYEKVPANDEKSL+AAVANQPVSVAIDAGG + QFYS G+FS +CGK+LNHGV +VGYG+ SN  YWLVK
Subjt:  GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVK

Query:  NSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        NSWGT WGESGYI+++RDS D+RGTCGIAM ASYP+KD
Subjt:  NSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

TrEMBL top hitse value%identityAlignment
A0A0A0LJV6 Uncharacterized protein2.9e-13771.47Show/hide
Query:  MIW-NVVLMFLILCVFWTPS--SMASD-------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFME
        M W NV L+FLIL VFWTP   SMA D       S  ++DRY++W+DKYGR+Y S EE E RF IYQ+N +YID FNS+NHS+TLAEN+FADLTN+EF  
Subjt:  MIW-NVVLMFLILCVFWTPS--SMASD-------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFME

Query:  TYLGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA
        TYLGY++V  PDTCFRYG+  NL   VDWR++GAVTP+KNQG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GGYMYKAFEFI 
Subjt:  TYLGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA

Query:  RTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWL
        RTGL TE EYPY+G E  CN+ K K   V+ISGYEKVP NDEKSL+AAVANQPVSVAIDA G+N QFYS GIFS +CG +LNHGV +VGYGE SN  YWL
Subjt:  RTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWL

Query:  VKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        VKNSWGT WGESGYI+++RDS DR+GTCGIAM ASYP KD
Subjt:  VKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

A0A1S3C828 ervatamin-B-like3.8e-13770.67Show/hide
Query:  MIWNVVLMFLILCVFWTPS---------SMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMET
        MIW+V L+ LIL VFWTP+         S +S+SG L+DRY++W+DKYGR+Y S EE E RF IYQ+N +YID FNSLNHSYTLAEN+F DLTN+EFM T
Subjt:  MIWNVVLMFLILCVFWTPS---------SMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMET

Query:  YLGYQSVCPP--DTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFI
        YLGY++V  P  DT FRYG+  NL   VDWRK+GAVTP+KNQG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GGYMYKAFEFI
Subjt:  YLGYQSVCPP--DTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFI

Query:  ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYW
         +TGL TE EYPY      C++ K K  +V+ISGYEKVP NDEKSLQAAVA QPVSVAIDAGG++ QFYS GIFS +CGK+LNHGV +VGYGEDSN  YW
Subjt:  ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYW

Query:  LVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        LVKNSWGT WGESGYI++ RDS D++GTCGIAM ASYP+KD
Subjt:  LVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

A0A384S0D9 Cysteine proteinase 1 (Fragment)1.2e-13375.32Show/hide
Query:  SDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPPDTCFRYGDTANLSAEVDWRKD
        S SG L+DRY++W+ KYGREY S EE E RF IYQ N +YID FNSLNHSYTLAENSFADLTNDEF  TYLG+++   PDT FRYG+  NL   VDWRK+
Subjt:  SDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPPDTCFRYGDTANLSAEVDWRKD

Query:  GAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIARTGLATEKEYPYRGIEGECNQHKVKEHTVTIS
         AVTPVK+QG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GGYMYKAFEFI +TGL TE EYPYRGIE  CN+ KV+  TVTIS
Subjt:  GAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIARTGLATEKEYPYRGIEGECNQHKVKEHTVTIS

Query:  GYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAM
        GYEKVP NDEKSL+AAVANQPVSVAIDAGG + QFYS G+FS +CGK+LNHGV +VGYGE SN  YWLVKNSWGT WGESGYI+++RDS D+RGTCGIAM
Subjt:  GYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAM

Query:  EASYPVKD
         ASYP+KD
Subjt:  EASYPVKD

A0A5A7SQK0 Ervatamin-B-like3.8e-13770.67Show/hide
Query:  MIWNVVLMFLILCVFWTPS---------SMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMET
        MIW+V L+ LIL VFWTP+         S +S+SG L+DRY++W+DKYGR+Y S EE E RF IYQ+N +YID FNSLNHSYTLAEN+F DLTN+EFM T
Subjt:  MIWNVVLMFLILCVFWTPS---------SMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMET

Query:  YLGYQSVCPP--DTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFI
        YLGY++V  P  DT FRYG+  NL   VDWRK+GAVTP+KNQG CGSCWAFSAVAAVEGINKIK G L+SLSEQELVDCD  S NQGC+GGYMYKAFEFI
Subjt:  YLGYQSVCPP--DTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFI

Query:  ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYW
         +TGL TE EYPY      C++ K K  +V+ISGYEKVP NDEKSLQAAVA QPVSVAIDAGG++ QFYS GIFS +CGK+LNHGV +VGYGEDSN  YW
Subjt:  ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYW

Query:  LVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        LVKNSWGT WGESGYI++ RDS D++GTCGIAM ASYP+KD
Subjt:  LVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

A0A6J1CH04 ervatamin-B1.2e-13871.6Show/hide
Query:  MIWNVVLMFLILCVFWTPS--SMASD------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETY
        MI NV  M+LILCVFWT S  S+A D      S  ++DRY++WIDKYGREY SGEE+E RF IYQSN +YIDYFNSLN SYTLA+N FADLTNDEF  TY
Subjt:  MIWNVVLMFLILCVFWTPS--SMASD------SGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETY

Query:  LGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART
        LGY +   PDTCF+YG+  NL   VDWRK+GAVTP+K+QG CGSCWAFSAVAAVEGI KIK G LVSLSEQEL+DCD  S NQGCSGG+M KAFEFI + 
Subjt:  LGYQSVCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART

Query:  GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVK
        G+ TEKEYPYRG+E  CN+ KV+ H+ TISGYEKVPANDEKSL+AAVANQPVSVAIDAGG + QFYS GIFS +CGK+LNHGVT+VGYGED    YWLVK
Subjt:  GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVK

Query:  NSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD
        NSWGT WGE GY++++ +S+D+RGTCGIAM+ASYP+KD
Subjt:  NSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD

SwissProt top hitse value%identityAlignment
A2XQE8 Senescence-specific cysteine protease SAG397.3e-8548.94Show/hide
Query:  VLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEF--METYLGY-QSVCPPD
        +L  L LC     +   SD   +  R+ RW+ +YGR Y    E+  RF ++++N  +I+ FN+ NH++ L  N FADLTNDEF   +T  G+  S     
Subjt:  VLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEF--METYLGY-QSVCPPD

Query:  TCFRYGDT--ANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKE
        T FRY +     L A VDWR  GAVTP+K+QG CG CWAFSAVAA+EGI K+  G L+SLSEQELVDCD +  +QGC GG M  AF+FI +  GL TE  
Subjt:  TCFRYGDT--ANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKE

Query:  YPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPYWLVKNSWGTG
        YPY   + +C    V     +I GYE VPAN+E +L  AVANQPVSVA+D G    QFY  G+ +  CG  L+HG+  +GYG+ S+   YWL+KNSWGT 
Subjt:  YPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPYWLVKNSWGTG

Query:  WGESGYIKIQRDSNDRRGTCGIAMEASYP
        WGE+G++++++D +D+RG CG+AME SYP
Subjt:  WGESGYIKIQRDSNDRRGTCGIAMEASYP

Q7XWK5 Senescence-specific cysteine protease SAG391.5e-8549.24Show/hide
Query:  VLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEF--METYLGY-QSVCPPD
        +L  L LC     +   SD   +  R+ RW+ +YGR Y    E+  RF ++++N  +I+ FN+ NH++ L  N FADLTNDEF  M+T  G+  S     
Subjt:  VLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEF--METYLGY-QSVCPPD

Query:  TCFRYGDT--ANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKE
        T FRY +     L A VDWR  GAVTP+K+QG CG CWAFSAVAA+EGI K+  G L+SLSEQELVDCD +  +QGC GG M  AF+FI +  GL TE  
Subjt:  TCFRYGDT--ANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKE

Query:  YPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPYWLVKNSWGTG
        YPY   + +C    V     +I GYE VPAN+E +L  AVANQPVSVA+D G    QFY  G+ +  CG  L+HG+  +GYG+ S+   YWL+KNSWGT 
Subjt:  YPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPYWLVKNSWGTG

Query:  WGESGYIKIQRDSNDRRGTCGIAMEASYP
        WGE+G++++++D +D+RG CG+AME SYP
Subjt:  WGESGYIKIQRDSNDRRGTCGIAMEASYP

Q9FJ47 Senescence-specific cysteine protease SAG124.1e-8848.67Show/hide
Query:  IWNVVLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSL--NHSYTLAENSFADLTNDEFMETYLGYQSVC
        I+  V +F   C   T S    +  +++ R+  W+ K+GR Y   +E+  R+ ++++N E I++ NS+    ++ LA N FADLTNDEF   Y G++ V 
Subjt:  IWNVVLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSL--NHSYTLAENSFADLTNDEFMETYLGYQSVC

Query:  PPD-------TCFRYGDTAN--LSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA
                  + FRY + ++  L   VDWRK GAVTP+KNQG CG CWAFSAVAA+EG  +IK G L+SLSEQ+LVDCD N  + GC GG M  AFE I 
Subjt:  PPD-------TCFRYGDTAN--LSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA

Query:  RT-GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPY
         T GL TE  YPY+G +  CN  K      +I+GYE VP NDE++L  AVA+QPVSV I+ GG + QFYS+G+F+  C   L+H VT +GYGE +N + Y
Subjt:  RT-GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPY

Query:  WLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYP
        W++KNSWGT WGESGY++IQ+D  D++G CG+AM+ASYP
Subjt:  WLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYP

Q9FMH8 Probable cysteine protease RD21B2.1e-8451.39Show/hide
Query:  TPSSMASDSGVLKDRYRRWIDKYGR----EYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPP-DTCFRY----
        T  +  SDS V +  Y  W+ ++G+    + G G E++ RF I++ N  +ID  N+ N SY L    FADLTN+E+   YLG +       T  RY    
Subjt:  TPSSMASDSGVLKDRYRRWIDKYGR----EYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPP-DTCFRY----

Query:  GDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIE
        GD   L   VDWRK+GAV  VK+QG CGSCWAFS + AVEGINKI  G L+SLSEQELVDCD  S NQGC+GG M  AFEFI +  G+ TE +YPY+  +
Subjt:  GDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIE

Query:  GECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIK
        G C+Q++     VTI  YE VP N E SL+ A+A+QP+SVAI+AGG   Q YS+G+F   CG  L+HGV  VGYG ++   YW+V+NSWG  WGESGYIK
Subjt:  GECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIK

Query:  IQRDSNDRRGTCGIAMEASYPVK
        + R+     G CGIAMEASYP+K
Subjt:  IQRDSNDRRGTCGIAMEASYPVK

Q9STL4 KDEL-tailed cysteine endopeptidase CEP29.5e-8552.72Show/hide
Query:  LKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLG-----YQSVCPP-----DTCFRYGDTANLSAEV
        L   Y RW   +     S  E+E RF +++ N  ++   N  N SY L  N FADLT +EF   Y G     ++ +  P        + + + + L + V
Subjt:  LKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLG-----YQSVCPP-----DTCFRYGDTANLSAEV

Query:  DWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIEGECNQHKVKE
        DWRK GAVT +KNQG CGSCWAFS VAAVEGINKIK   LVSLSEQELVDCD    N+GC+GG M  AFEFI +  G+ TE  YPY GI+G+C+  K   
Subjt:  DWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIEGECNQHKVKE

Query:  HTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRG
          VTI G+E VP NDE +L  AVANQPVSVAIDAG S+ QFYS G+F+  CG  LNHGV  VGYG +    YW+V+NSWG  WGE GYIKI+R+ ++  G
Subjt:  HTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRG

Query:  TCGIAMEASYPVK
         CGIAMEASYP+K
Subjt:  TCGIAMEASYPVK

Arabidopsis top hitse value%identityAlignment
AT1G06260.1 Cysteine proteinases superfamily protein3.8e-9752.91Show/hide
Query:  NVVLMFLILCVFWTPSSMASDSGV------LKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQS
        N+ L  LI  V       + DS V      LK R+ +W+  + + YG  +E   RF IYQSN + IDY NSL+  + L +N FAD+TN EF   +LG  +
Subjt:  NVVLMFLILCVFWTPSSMASDSGV------LKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQS

Query:  -----------VCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAF
                   VC P          N+   VDWR  GAVTP++NQG CG CWAFSAVAA+EGINKIK G LVSLSEQ+L+DCD  + N+GCSGG M  AF
Subjt:  -----------VCPPDTCFRYGDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAF

Query:  EFI-ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN
        EFI    GLATE +YPY GIEG C+Q K K   VTI GY+KV A +E SLQ A A QPVSV IDAGG   Q YS+G+F+ +CG  LNHGVTVVGYG + +
Subjt:  EFI-ARTGLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN

Query:  NPYWLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVK
          YW+VKNSWGTGWGE GYI+++R  ++  G CGIAM ASYP++
Subjt:  NPYWLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVK

AT1G47128.1 Granulin repeat cysteine protease family protein3.1e-8349.68Show/hide
Query:  YRRWIDKYGREYGSGE--EQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPPD--TCFRY----GDTANLSAEVDWRKDG
        Y  W+ K+G+        E++ RF I++ N  ++D  N  N SY L    FADLTNDE+   YLG +     +  T  RY    GD   L   +DWRK G
Subjt:  YRRWIDKYGREYGSGE--EQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPPD--TCFRY----GDTANLSAEVDWRKDG

Query:  AVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIEGECNQHKVKEHTVTIS
        AV  VK+QGGCGSCWAFS + AVEGIN+I  G L++LSEQELVDCD  S N+GC+GG M  AFEFI +  G+ T+K+YPY+G++G C+Q +     VTI 
Subjt:  AVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIEGECNQHKVKEHTVTIS

Query:  GYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAM
         YE VP   E+SL+ AVA+QP+S+AI+AGG   Q Y +GIF   CG +L+HGV  VGYG ++   YW+V+NSWG  WGESGY+++ R+     G CGIA+
Subjt:  GYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAM

Query:  EASYPVKD
        E SYP+K+
Subjt:  EASYPVKD

AT3G48340.1 Cysteine proteinases superfamily protein6.8e-8652.72Show/hide
Query:  LKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLG-----YQSVCPP-----DTCFRYGDTANLSAEV
        L   Y RW   +     S  E+E RF +++ N  ++   N  N SY L  N FADLT +EF   Y G     ++ +  P        + + + + L + V
Subjt:  LKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLG-----YQSVCPP-----DTCFRYGDTANLSAEV

Query:  DWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIEGECNQHKVKE
        DWRK GAVT +KNQG CGSCWAFS VAAVEGINKIK   LVSLSEQELVDCD    N+GC+GG M  AFEFI +  G+ TE  YPY GI+G+C+  K   
Subjt:  DWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIEGECNQHKVKE

Query:  HTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRG
          VTI G+E VP NDE +L  AVANQPVSVAIDAG S+ QFYS G+F+  CG  LNHGV  VGYG +    YW+V+NSWG  WGE GYIKI+R+ ++  G
Subjt:  HTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRG

Query:  TCGIAMEASYPVK
         CGIAMEASYP+K
Subjt:  TCGIAMEASYPVK

AT5G43060.1 Granulin repeat cysteine protease family protein1.5e-8551.39Show/hide
Query:  TPSSMASDSGVLKDRYRRWIDKYGR----EYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPP-DTCFRY----
        T  +  SDS V +  Y  W+ ++G+    + G G E++ RF I++ N  +ID  N+ N SY L    FADLTN+E+   YLG +       T  RY    
Subjt:  TPSSMASDSGVLKDRYRRWIDKYGR----EYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPP-DTCFRY----

Query:  GDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIE
        GD   L   VDWRK+GAV  VK+QG CGSCWAFS + AVEGINKI  G L+SLSEQELVDCD  S NQGC+GG M  AFEFI +  G+ TE +YPY+  +
Subjt:  GDTANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIART-GLATEKEYPYRGIE

Query:  GECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIK
        G C+Q++     VTI  YE VP N E SL+ A+A+QP+SVAI+AGG   Q YS+G+F   CG  L+HGV  VGYG ++   YW+V+NSWG  WGESGYIK
Subjt:  GECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIK

Query:  IQRDSNDRRGTCGIAMEASYPVK
        + R+     G CGIAMEASYP+K
Subjt:  IQRDSNDRRGTCGIAMEASYPVK

AT5G45890.1 senescence-associated gene 122.9e-8948.67Show/hide
Query:  IWNVVLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSL--NHSYTLAENSFADLTNDEFMETYLGYQSVC
        I+  V +F   C   T S    +  +++ R+  W+ K+GR Y   +E+  R+ ++++N E I++ NS+    ++ LA N FADLTNDEF   Y G++ V 
Subjt:  IWNVVLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSL--NHSYTLAENSFADLTNDEFMETYLGYQSVC

Query:  PPD-------TCFRYGDTAN--LSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA
                  + FRY + ++  L   VDWRK GAVTP+KNQG CG CWAFSAVAA+EG  +IK G L+SLSEQ+LVDCD N  + GC GG M  AFE I 
Subjt:  PPD-------TCFRYGDTAN--LSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIA

Query:  RT-GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPY
         T GL TE  YPY+G +  CN  K      +I+GYE VP NDE++L  AVA+QPVSV I+ GG + QFYS+G+F+  C   L+H VT +GYGE +N + Y
Subjt:  RT-GLATEKEYPYRGIEGECNQHKVKEHTVTISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSN-NPY

Query:  WLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYP
        W++KNSWGT WGESGY++IQ+D  D++G CG+AM+ASYP
Subjt:  WLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTGGAATGTGGTATTGATGTTTCTGATTCTCTGTGTTTTCTGGACACCTTCCTCAATGGCATCCGACTCCGGTGTTTTAAAAGACAGGTACCGGAGATGGATAGA
TAAATACGGTCGAGAATACGGGAGCGGAGAGGAGCAGGAGTGGAGATTCGCAATCTACCAGTCGAATGCTGAGTACATTGACTACTTCAATTCTCTGAATCATTCATATA
CTCTTGCTGAAAATAGCTTTGCAGACCTCACAAATGATGAGTTTATGGAAACTTACTTGGGGTATCAATCTGTTTGCCCACCTGATACATGCTTCAGATATGGAGATACT
GCTAATTTGTCTGCTGAGGTTGACTGGAGAAAAGATGGTGCAGTTACTCCGGTAAAGAATCAAGGCGGATGCGGGAGTTGCTGGGCGTTCTCTGCAGTAGCAGCAGTGGA
AGGTATCAACAAAATAAAAAACGGCACATTGGTGTCCCTATCAGAGCAAGAGCTGGTGGACTGCGACTTCAACTCGGCGAACCAGGGATGCAGCGGCGGATACATGTACA
AAGCATTCGAGTTCATCGCGAGAACCGGACTCGCCACAGAAAAAGAATATCCATACAGAGGAATTGAAGGTGAATGCAACCAACACAAAGTGAAAGAGCACACTGTGACA
ATAAGTGGATATGAAAAAGTACCTGCCAATGATGAGAAAAGCTTACAGGCTGCAGTTGCCAACCAGCCAGTCTCTGTAGCCATTGATGCAGGGGGGTCTAACTTGCAGTT
CTATTCTGCTGGGATCTTCTCAGAGCACTGTGGAAAGCGGCTCAATCATGGAGTCACTGTGGTTGGGTATGGAGAGGATAGCAATAACCCTTACTGGCTCGTCAAGAATT
CATGGGGTACTGGCTGGGGTGAATCTGGGTACATAAAGATTCAACGAGACTCGAATGATAGGCGCGGTACTTGCGGCATCGCTATGGAGGCTAGCTACCCTGTCAAAGAC
TGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTGGAATGTGGTATTGATGTTTCTGATTCTCTGTGTTTTCTGGACACCTTCCTCAATGGCATCCGACTCCGGTGTTTTAAAAGACAGGTACCGGAGATGGATAGA
TAAATACGGTCGAGAATACGGGAGCGGAGAGGAGCAGGAGTGGAGATTCGCAATCTACCAGTCGAATGCTGAGTACATTGACTACTTCAATTCTCTGAATCATTCATATA
CTCTTGCTGAAAATAGCTTTGCAGACCTCACAAATGATGAGTTTATGGAAACTTACTTGGGGTATCAATCTGTTTGCCCACCTGATACATGCTTCAGATATGGAGATACT
GCTAATTTGTCTGCTGAGGTTGACTGGAGAAAAGATGGTGCAGTTACTCCGGTAAAGAATCAAGGCGGATGCGGGAGTTGCTGGGCGTTCTCTGCAGTAGCAGCAGTGGA
AGGTATCAACAAAATAAAAAACGGCACATTGGTGTCCCTATCAGAGCAAGAGCTGGTGGACTGCGACTTCAACTCGGCGAACCAGGGATGCAGCGGCGGATACATGTACA
AAGCATTCGAGTTCATCGCGAGAACCGGACTCGCCACAGAAAAAGAATATCCATACAGAGGAATTGAAGGTGAATGCAACCAACACAAAGTGAAAGAGCACACTGTGACA
ATAAGTGGATATGAAAAAGTACCTGCCAATGATGAGAAAAGCTTACAGGCTGCAGTTGCCAACCAGCCAGTCTCTGTAGCCATTGATGCAGGGGGGTCTAACTTGCAGTT
CTATTCTGCTGGGATCTTCTCAGAGCACTGTGGAAAGCGGCTCAATCATGGAGTCACTGTGGTTGGGTATGGAGAGGATAGCAATAACCCTTACTGGCTCGTCAAGAATT
CATGGGGTACTGGCTGGGGTGAATCTGGGTACATAAAGATTCAACGAGACTCGAATGATAGGCGCGGTACTTGCGGCATCGCTATGGAGGCTAGCTACCCTGTCAAAGAC
TGA
Protein sequenceShow/hide protein sequence
MIWNVVLMFLILCVFWTPSSMASDSGVLKDRYRRWIDKYGREYGSGEEQEWRFAIYQSNAEYIDYFNSLNHSYTLAENSFADLTNDEFMETYLGYQSVCPPDTCFRYGDT
ANLSAEVDWRKDGAVTPVKNQGGCGSCWAFSAVAAVEGINKIKNGTLVSLSEQELVDCDFNSANQGCSGGYMYKAFEFIARTGLATEKEYPYRGIEGECNQHKVKEHTVT
ISGYEKVPANDEKSLQAAVANQPVSVAIDAGGSNLQFYSAGIFSEHCGKRLNHGVTVVGYGEDSNNPYWLVKNSWGTGWGESGYIKIQRDSNDRRGTCGIAMEASYPVKD