; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017417 (gene) of Snake gourd v1 genome

Gene IDTan0017417
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCysteine Protease
Genome locationLG02:92636130..92637441
RNA-Seq ExpressionTan0017417
SyntenyTan0017417
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029224.1 hypothetical protein SDJN02_07562, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-12566.17Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVPFLLIVLVSGL ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRFNVFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+Y+ L+G+R     GGFMY  AT+LPS IDWR++G VN+IK Q  C SCWAF+ VAAVE INQIKT +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMK------------AGVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILR
        A DFIK NGGI TE++YPY+ +   CRS  ++SP VTIDGYE VPEN ENA+M+            AGVF+GYCG  L+H VV IGYGTTE+G DYWI+R
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMK------------AGVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILR

Query:  NSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        NSWG+GWG++GY+R+KRG++   G CGI  EASYPIK
Subjt:  NSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

XP_008444390.1 PREDICTED: vignain-like [Cucumis melo]1.5e-12464.84Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVP LLIVLVSGL+ESFE  E +LA+E SLWQLY RW NHH ISR+ K+KHKRF+VFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+++ L+GRR+    GGFMY   T+LPSS+DWR++G VN IK Q TC SCWAF+ VAAVEAIN+IKT +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT
        A DFIK NGGI TE++YPY+ S   CRS  ++SP V IDGYE VPEN E+A+M+A                      GVF+GYCG  L+H VV IGYGTT
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT

Query:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        EDG DYWI+RNSWG+GWG+DGY+R+KRG++ P G CGIA EASYPIK
Subjt:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

XP_022996765.1 vignain-like [Cucurbita maxima]1.3e-12058.89Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KF+LVPFLLIVLVSGL ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRFNVFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+Y+ L+G+R     GGFMY  AT+LPS IDWR++G VN+IK Q  C SCWAF+ VAAVE INQIKT +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA---------------------------------------------
        A DFIK NGGI TE+NYPY+ +   CRS  ++SP VTIDGYE VPEN ENA+M+A                                             
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA---------------------------------------------

Query:  -------GVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
               GVF+GYCG  L+H VV IGYGTTE+G DYW++RNSWG+GWG++GY+R+KRG++   G CGI  EASYPIK
Subjt:  -------GVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

XP_023545499.1 vignain-like [Cucurbita pepo subsp. pepo]1.0e-12059.31Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVPFLLIVLVSGL ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRFNVFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+Y+ L+G+R     GGFMY  AT+LPS IDWR++G VN+IK Q  C SCWAF+ VAAVE INQIKT +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA---------------------------------------------
        A DFIK+NGGI TE+NYPY+ +   CRS   +SP VTIDGYE VPEN ENA+M+A                                             
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA---------------------------------------------

Query:  ------GVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
              GVF+GYCG  L+H VV IGYGTTE+G DYW++RNSWG+GWG++GY+R+KRG++   G CGI  EASYPIK
Subjt:  ------GVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

XP_038885064.1 LOW QUALITY PROTEIN: vignain-like [Benincasa hispida]4.6e-12162.54Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLL+P LLIVL+SG +ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRF VFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+Y+ L+GRR+E    GFMY  AT+LPS IDWR++G VN IK Q  C SCWAF+ VAAVE IN+I+T +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT
        A DFI+ NGGI TE++YPY+ S   CRS  ++SP V IDGYE +PEN E+A+M+A                      GVF+GYCG  L+H VV IGYGTT
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT

Query:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        EDG DYWI+RNSWG+GWG++GY+R+KRG++ P G CGIA EASYPIK
Subjt:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

TrEMBL top hitse value%identityAlignment
A0A0A0LMU4 Uncharacterized protein1.1e-12062.82Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVP LLIVLVSGL+ESFE  E +LA+E SLWQLY RW  HH ISR+ K+KHKRF+VFK+N  HVF VNQM+KPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+Y+ L+ RR+    GGFMY   T+LPSS+DWR++G VN +K Q  C SCWAF+ VAAVE IN+IKT +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT
        A DFIK NGGI TE++YPY+ S   CRS  ++SP V IDGYE VPEN E+A+M+A                      GVF+GYCG  L+H VV IGYGTT
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT

Query:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        EDG DYW++RNSWG+GWG+DGY+R+KRG++   G CGIA EASYPIK
Subjt:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

A0A1S3BA70 vignain-like7.4e-12564.84Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVP LLIVLVSGL+ESFE  E +LA+E SLWQLY RW NHH ISR+ K+KHKRF+VFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+++ L+GRR+    GGFMY   T+LPSS+DWR++G VN IK Q TC SCWAF+ VAAVEAIN+IKT +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT
        A DFIK NGGI TE++YPY+ S   CRS  ++SP V IDGYE VPEN E+A+M+A                      GVF+GYCG  L+H VV IGYGTT
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT

Query:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        EDG DYWI+RNSWG+GWG+DGY+R+KRG++ P G CGIA EASYPIK
Subjt:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

A0A6J1GHN5 vignain-like4.4e-11761.38Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI K LLVPF LIVL+SGL++SFE  E +LA++ SLW+LY RWS+HH ISR  K+KHKR+NVFK+NA HV  VNQMNKPYKL+LNKF DMSN EFV +Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+++Y+ L+G+R+E    GFMY  AT+LPS IDWR++G V++IK+Q  C SCWAF+ VAAVE INQIKT +LLSLSEQEL+DC+ R+ GC GGFM T
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT
        A +FI+ NGGI +E+NYPY  +   CRS  M SP VTIDG+E VPEN ENA+M+A                      GVF+G CG  L+H VVVIGYGTT
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT

Query:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        + G DYW +RNSWG+GWG+DGY+R+KRG++DP G CGI  EASYP+K
Subjt:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

A0A6J1K7P4 vignain-like6.5e-12158.89Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KF+LVPFLLIVLVSGL ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRFNVFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+S+Y+ L+G+R     GGFMY  AT+LPS IDWR++G VN+IK Q  C SCWAF+ VAAVE INQIKT +LLSLSEQEL+DC+ R+ GCNGGFM  
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA---------------------------------------------
        A DFIK NGGI TE+NYPY+ +   CRS  ++SP VTIDGYE VPEN ENA+M+A                                             
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA---------------------------------------------

Query:  -------GVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
               GVF+GYCG  L+H VV IGYGTTE+G DYW++RNSWG+GWG++GY+R+KRG++   G CGI  EASYPIK
Subjt:  -------GVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

A0A6J1KIL0 vignain-like3.6e-11962.82Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        M I K LLVPF LIVLVSGL++SFE  E +LA++ SLW+LY RWS+HH ISR  K+KHKR+NVFK+NA HV  VNQMNKPYKL+LNKF DMSN EFV LY
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT
        A SN+++Y+ L+GRR+E    GFMY  AT+LPS IDWR++G VN+IK Q  C SCWAF+ VAAVE INQIKT +LLSLSEQEL+DC+ R+ GC GGFM T
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNT

Query:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT
        A +FI+ NGGI +E+NYPY  +   CRS  M SP VTIDG+E VPEN ENA+M+A                      GVF+GYCG  L+H VVVIGYGTT
Subjt:  ALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTT

Query:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        + G DYW +RNSWG+GWG+DGY+R+KRG++DP G CGIA EASYP+K
Subjt:  EDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

SwissProt top hitse value%identityAlignment
O65039 Vignain1.2e-10052.89Show/hide
Query:  IRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYAS
        ++KF+L+  L + LV  ++ESF+ HE +L SE SLW LY RW +HH +SRS  +K KRFNVFK NA HV   N+M+KPYKL+LNKF DM+N EF   Y+ 
Subjt:  IRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYAS

Query:  SNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDR-RSAGCNGGFMNTA
        S + +++   G  + N  G FMY     +P+S+DWRKKG V  +K+Q  C SCWAF+ + AVE INQIKT KL+SLSEQEL+DCD  ++ GCNGG M+ A
Subjt:  SNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDR-RSAGCNGGFMNTA

Query:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTE
         +FIK+ GGITTE NYPY   D  C     N+P V+IDG+E+VPENDENA++KA                      GVF G CG  LDH V ++GYGTT 
Subjt:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTE

Query:  DGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        DG  YW ++NSWG  WG+ GY+R++RGI D  G CGIA EASYPIK
Subjt:  DGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

P12412 Vignain1.4e-9952.15Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MA++K L V  L + LV G++ SF+ HE DL SE SLW LY RW +HH +SRS  +KHKRFNVFK N  HV   N+M+KPYKL+LNKF DM+N EF   Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDR-RSAGCNGGFMN
        A S ++++K   G   ++  G FMY    ++P+S+DWRKKG V ++K+Q  C SCWAF+ + AVE INQIKT KL+SLSEQEL+DCD+  + GCNGG M 
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDR-RSAGCNGGFMN

Query:  TALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGT
        +A +FIK+ GGITTE NYPY   +  C    +N   V+IDG+E+VP NDENA++KA                      GVF G C   L+H V ++GYGT
Subjt:  TALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGT

Query:  TEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN
        T DG +YWI+RNSWG  WG+ GY+R++R I    G CGIA  ASYPIKN
Subjt:  TEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN

P25803 Vignain5.7e-9851.86Show/hide
Query:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MA +K L V  L   LV G++ SF+ H+ DLASE SLW LY RW +HH +SRS  +KHKRFNVFK N  HV   N+M+KPYKL+LNKF DM+N EF   Y
Subjt:  MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDR-RSAGCNGGFMN
        A S +++ +   G   EN  G FMY    ++P S+DWRKKG V ++K+Q  C SCWAF+ V AVE INQIKT KL++LSEQEL+DCD+  + GCNGG M 
Subjt:  ASSNMSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDR-RSAGCNGGFMN

Query:  TALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGT
        +A +FIK+ GGITTE NYPY   +  C +  +N   V+IDG+E+VP NDE+A++KA                      GVF G C   L+H V ++GYGT
Subjt:  TALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGT

Query:  TEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN
        T DG +YWI+RNSWG  WG+ GY+R++R I    G CGIA   SYPIKN
Subjt:  TEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP12.2e-9750.14Show/hide
Query:  KFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN
        K  +V  L +++V   ++  + H  D+ SE SLW+LY RW +HH ++RS ++K KRFNVFK N +H+ + N+ +K YKL+LNKFGDM+++EF + YA SN
Subjt:  KFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN

Query:  MSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALD
        + +++   G +K  +   FMYA+   LP+S+DWRK G V  +KNQ  C SCWAF+ V AVE INQI+TKKL SLSEQEL+DCD  ++ GCNGG M+ A +
Subjt:  MSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALD

Query:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDG
        FIKE GG+T+E  YPY  SD+ C +   N+P V+IDG+EDVP+N E+ +MKA                      GVF G CG  L+H V V+GYGTT DG
Subjt:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDG

Query:  IDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN
          YWI++NSWG  WG+ GY+R++RGI+   G CGIA EASYP+KN
Subjt:  IDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN

Q9STL5 KDEL-tailed cysteine endopeptidase CEP31.2e-9550.73Show/hide
Query:  IVLVSGL-----SESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMSYY
        IVL+S L     S+ F+  E +L +E ++W+LY RW  HH +SR+  +  KRFNVF+ N  HV + N+ NKPYKL++N+F D+++ EF   YA SN+ ++
Subjt:  IVLVSGL-----SESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMSYY

Query:  KNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALDFIKE
        + L G ++ +  GGFMY + T +PSS+DWR+KG V E+KNQ  C SCWAF+ VAAVE IN+I+T KL+SLSEQEL+DCD   + GC GG M  A +FIK 
Subjt:  KNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALDFIKE

Query:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDGIDY
        NGGI TE+ YPY +SD QFCR+  +    VTIDG+E VPENDE  ++KA                      GVF G CG  L+H VV++GYG T++G  Y
Subjt:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDGIDY

Query:  WILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        WI+RNSWG  WG+ GY+RI+RGI +  G+CGIA EASYP K
Subjt:  WILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

Arabidopsis top hitse value%identityAlignment
AT3G19390.1 Granulin repeat cysteine protease family protein3.6e-7143.35Show/hide
Query:  LLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRW-SNHHEISRSHKQKHKRFNVFKQNARHVFKVNQM-NKPYKLRLNKFGDMSNDEFVKLYASSN
        L+   LLI L  G   + E   N+  + R    +Y RW   + +      +K +RF +FK N + V + + + N+ Y++ L +F D++NDEF  +Y  S 
Subjt:  LLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRW-SNHHEISRSHKQKHKRFNVFKQNARHVFKVNQM-NKPYKLRLNKFGDMSNDEFVKLYASSN

Query:  MSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRR-SAGCNGGFMNTALD
        M   +      K      ++Y    +LP +IDWR KG VN +K+Q +C SCWAF+ + AVE INQIKT +L+SLSEQEL+DCD   + GC GG M+ A  
Subjt:  MSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRR-SAGCNGGFMNTALD

Query:  FIKENGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTED
        FI ENGGI TE++YPY  +D   C S   N+  VTIDGYEDVP+NDE ++ KA                      GVF G CG SLDH VV +GYG +E 
Subjt:  FIKENGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTED

Query:  GIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN
        G DYWI+RNSWG  WG+ GY +++R I++ SG+CG+A  ASYP K+
Subjt:  GIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN

AT3G48340.1 Cysteine proteinases superfamily protein4.3e-9350.29Show/hide
Query:  KFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN
        K LL+ FL  +++   +  F+  + ++ SE  L  LY RW +HH + RS  ++ KRFNVF+ N  HV   N+ N+ YKL+LNKF D++ +EF   Y  SN
Subjt:  KFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN

Query:  MSYYKNLYGRRKENRLGGFMYAH--ATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTA
        + +++ L G ++ ++   FMY H   + LPSS+DWRKKG V EIKNQ  C SCWAF+ VAAVE IN+IKT KL+SLSEQEL+DCD +++ GCNGG M  A
Subjt:  MSYYKNLYGRRKENRLGGFMYAH--ATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTA

Query:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTE
         +FIK+NGGITTED+YPY   D  C +   N   VTIDG+EDVPENDENA++KA                      GVF G CG  L+H V  +GYG +E
Subjt:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTE

Query:  DGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
         G  YWI+RNSWG  WG+ GY++I+R I +P G+CGIA EASYPIK
Subjt:  DGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

AT3G48350.1 Cysteine proteinases superfamily protein8.4e-9750.73Show/hide
Query:  IVLVSGL-----SESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMSYY
        IVL+S L     S+ F+  E +L +E ++W+LY RW  HH +SR+  +  KRFNVF+ N  HV + N+ NKPYKL++N+F D+++ EF   YA SN+ ++
Subjt:  IVLVSGL-----SESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMSYY

Query:  KNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALDFIKE
        + L G ++ +  GGFMY + T +PSS+DWR+KG V E+KNQ  C SCWAF+ VAAVE IN+I+T KL+SLSEQEL+DCD   + GC GG M  A +FIK 
Subjt:  KNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALDFIKE

Query:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDGIDY
        NGGI TE+ YPY +SD QFCR+  +    VTIDG+E VPENDE  ++KA                      GVF G CG  L+H VV++GYG T++G  Y
Subjt:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDGIDY

Query:  WILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK
        WI+RNSWG  WG+ GY+RI+RGI +  G+CGIA EASYP K
Subjt:  WILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIK

AT4G36880.1 cysteine proteinase11.3e-7342.74Show/hide
Query:  LLVPFLLIVLVSG----LSESFEL-HENDLASERSLWQLYRRWSNHHEISRSH-----KQKHKRFNVFKQNARHVFKVNQMNK--PYKLRLNKFGDMSND
        LL+ ++++ L SG    +++  +L  +    ++  +  +Y +WS  H  + ++       + KRFN+FK N R +   N+ NK   YKL L KF D++ND
Subjt:  LLVPFLLIVLVSG----LSESFEL-HENDLASERSLWQLYRRWSNHHEISRSH-----KQKHKRFNVFKQNARHVFKVNQMNK--PYKLRLNKFGDMSND

Query:  EFVKLYASSNMSYYKNLYGRRKENRLGGFMYAHATN---LPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRR-S
        E+ KLY  +     + +   +  N+     Y+ A N   +P ++DWR+KG VN IK+Q TC SCWAF+  AAVE IN+I T +L+SLSEQEL+DCD+  +
Subjt:  EFVKLYASSNMSYYKNLYGRRKENRLGGFMYAHATN---LPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRR-S

Query:  AGCNGGFMNTALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDH
         GCNGG M+ A  FI +NGG+ TE +YPY      C S++ NS  V+IDGYEDVP  DE A+ KA                      G+F G CG +LDH
Subjt:  AGCNGGFMNTALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDH

Query:  AVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGI-QDPSGQCGIATEASYPIK
        AVV +GYG +E+G+DYWI+RNSWG  WG++GY+R++R +    SG+CGIA EASYP+K
Subjt:  AVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGI-QDPSGQCGIATEASYPIK

AT5G50260.1 Cysteine proteinases superfamily protein1.5e-9850.14Show/hide
Query:  KFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN
        K  +V  L +++V   ++  + H  D+ SE SLW+LY RW +HH ++RS ++K KRFNVFK N +H+ + N+ +K YKL+LNKFGDM+++EF + YA SN
Subjt:  KFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN

Query:  MSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALD
        + +++   G +K  +   FMYA+   LP+S+DWRK G V  +KNQ  C SCWAF+ V AVE INQI+TKKL SLSEQEL+DCD  ++ GCNGG M+ A +
Subjt:  MSYYKNLYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCD-RRSAGCNGGFMNTALD

Query:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDG
        FIKE GG+T+E  YPY  SD+ C +   N+P V+IDG+EDVP+N E+ +MKA                      GVF G CG  L+H V V+GYGTT DG
Subjt:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKA----------------------GVFEGYCGPSLDHAVVVIGYGTTEDG

Query:  IDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN
          YWI++NSWG  WG+ GY+R++RGI+   G CGIA EASYP+KN
Subjt:  IDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATAAGAAAATTCCTTTTGGTTCCATTTCTTTTAATTGTTCTAGTTTCAGGATTGAGTGAGAGCTTTGAATTACATGAAAATGATCTAGCATCAGAAAGAAGCCT
ATGGCAACTATATCGGAGATGGAGCAACCACCATGAAATCTCAAGGAGTCATAAGCAGAAGCACAAACGTTTCAATGTGTTTAAACAAAATGCGAGACATGTGTTCAAAG
TGAACCAAATGAACAAACCATACAAATTGAGGTTGAACAAGTTTGGAGATATGTCAAATGATGAGTTTGTGAAGTTATATGCTTCGTCTAATATGAGTTACTACAAAAAC
TTATATGGTAGGAGAAAAGAAAATAGATTAGGAGGATTTATGTATGCACATGCTACAAATCTTCCATCCTCCATTGATTGGAGGAAAAAAGGAGTTGTGAATGAAATCAA
GAATCAATACACATGTCAGAGCTGTTGGGCGTTTGCAGTAGTGGCGGCTGTTGAAGCAATCAACCAAATTAAAACAAAGAAGCTATTATCTCTGTCAGAACAAGAACTGA
TCGATTGTGATCGAAGAAGTGCGGGTTGTAATGGAGGATTCATGAACACTGCTTTAGATTTCATAAAAGAAAATGGTGGGATCACCACTGAGGACAACTATCCCTATTAT
AATTCAGACCAATTTTGTCGCTCATACATGATGAATTCACCGAGAGTAACAATTGATGGATATGAAGATGTGCCTGAAAACGATGAGAATGCTATGATGAAAGCTGGAGT
GTTCGAAGGATATTGTGGACCATCACTTGATCACGCAGTGGTGGTGATTGGCTATGGAACAACTGAAGACGGAATAGATTATTGGATTTTGAGGAACTCGTGGGGAATTG
GATGGGGAGACGATGGTTACATGAGGATCAAAAGAGGAATCCAAGATCCATCGGGTCAATGTGGCATAGCCACAGAAGCATCTTATCCCATTAAGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCATAAGAAAATTCCTTTTGGTTCCATTTCTTTTAATTGTTCTAGTTTCAGGATTGAGTGAGAGCTTTGAATTACATGAAAATGATCTAGCATCAGAAAGAAGCCT
ATGGCAACTATATCGGAGATGGAGCAACCACCATGAAATCTCAAGGAGTCATAAGCAGAAGCACAAACGTTTCAATGTGTTTAAACAAAATGCGAGACATGTGTTCAAAG
TGAACCAAATGAACAAACCATACAAATTGAGGTTGAACAAGTTTGGAGATATGTCAAATGATGAGTTTGTGAAGTTATATGCTTCGTCTAATATGAGTTACTACAAAAAC
TTATATGGTAGGAGAAAAGAAAATAGATTAGGAGGATTTATGTATGCACATGCTACAAATCTTCCATCCTCCATTGATTGGAGGAAAAAAGGAGTTGTGAATGAAATCAA
GAATCAATACACATGTCAGAGCTGTTGGGCGTTTGCAGTAGTGGCGGCTGTTGAAGCAATCAACCAAATTAAAACAAAGAAGCTATTATCTCTGTCAGAACAAGAACTGA
TCGATTGTGATCGAAGAAGTGCGGGTTGTAATGGAGGATTCATGAACACTGCTTTAGATTTCATAAAAGAAAATGGTGGGATCACCACTGAGGACAACTATCCCTATTAT
AATTCAGACCAATTTTGTCGCTCATACATGATGAATTCACCGAGAGTAACAATTGATGGATATGAAGATGTGCCTGAAAACGATGAGAATGCTATGATGAAAGCTGGAGT
GTTCGAAGGATATTGTGGACCATCACTTGATCACGCAGTGGTGGTGATTGGCTATGGAACAACTGAAGACGGAATAGATTATTGGATTTTGAGGAACTCGTGGGGAATTG
GATGGGGAGACGATGGTTACATGAGGATCAAAAGAGGAATCCAAGATCCATCGGGTCAATGTGGCATAGCCACAGAAGCATCTTATCCCATTAAGAACTAG
Protein sequenceShow/hide protein sequence
MAIRKFLLVPFLLIVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMSYYKN
LYGRRKENRLGGFMYAHATNLPSSIDWRKKGVVNEIKNQYTCQSCWAFAVVAAVEAINQIKTKKLLSLSEQELIDCDRRSAGCNGGFMNTALDFIKENGGITTEDNYPYY
NSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAGVFEGYCGPSLDHAVVVIGYGTTEDGIDYWILRNSWGIGWGDDGYMRIKRGIQDPSGQCGIATEASYPIKN