; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000593 (gene) of Snake gourd v1 genome

Gene IDTan0000593
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCysteine Protease
Genome locationLG02:91692553..91693961
RNA-Seq ExpressionTan0000593
SyntenyTan0000593
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142960.1 vignain [Cucumis sativus]4.0e-11762.36Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVP LL+VLVSGL+ESFE  E +LA+E SLWQLY RW  HH ISR+ K+KHKRF+VFK+N  HVF VNQM+KPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL
        A SN+S+Y+ L+                           AVN +K Q  C SCWAF+ VAAVEGIN+IKTN+LLSLSEQEL+DC+ R+ GCNGGFM  A 
Subjt:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL

Query:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT
        DFIK NGGI TE++YPY+ S   CRS  ++SP V IDGYE VPEN E+A+M+AVA+QPV V I A+G DFQFY   S GVF+GYCG  L+H VV IGYGT
Subjt:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT

Query:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        TED TDYW++RNSWG+ WG+DGY+++KRG++   G CGIA EASYPIK
Subjt:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

XP_008444390.1 PREDICTED: vignain-like [Cucumis melo]2.3e-12063.51Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVP LL+VLVSGL+ESFE  E +LA+E SLWQLY RW NHH ISR+ K+KHKRF+VFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL
        A SN+S+++ L+                           AVN IK Q TC SCWAF+ VAAVE IN+IKTN+LLSLSEQEL+DC+ R+ GCNGGFM  A 
Subjt:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL

Query:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT
        DFIK NGGI TE++YPY+ S   CRS  ++SP V IDGYE VPEN E+A+M+AVA+QPV V I A+G DFQFYW+   GVF+GYCG  L+H VV IGYGT
Subjt:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT

Query:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        TED TDYWI+RNSWG+ WG+DGY+++KRG++ P G CGIA EASYPIK
Subjt:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

XP_022996765.1 vignain-like [Cucurbita maxima]2.1e-11859.52Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KF+LVPFLL+VLVSGL ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRFNVFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY-------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTALDF
        A SN+S+Y+ L+                         AVN+IK Q  C SCWAF+ VAAVEGINQIKTN+LLSLSEQEL+DC+ R+ GCNGGFM  A DF
Subjt:  ASSNMSYYKNLY-------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTALDF

Query:  IKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASL------------------------
        IK NGGI TE+NYPY+ +   CRS  ++SP VTIDGYE VPEN ENA+M+AVA+QPV V I A G DFQFY +ASL                        
Subjt:  IKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASL------------------------

Query:  ---GVFEGYCGPSLDHAVVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
           GVF+GYCG  L+H VV IGYGTTE+ TDYW++RNSWG+ WG++GY+++KRG++   G CGI  EASYPIK
Subjt:  ---GVFEGYCGPSLDHAVVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

XP_023545499.1 vignain-like [Cucurbita pepo subsp. pepo]1.6e-11859.95Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVPFLL+VLVSGL ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRFNVFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY-------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTALDF
        A SN+S+Y+ L+                         AVN+IK Q  C SCWAF+ VAAVEGINQIKTN+LLSLSEQEL+DC+ R+ GCNGGFM  A DF
Subjt:  ASSNMSYYKNLY-------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTALDF

Query:  IKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASL------------------------
        IK+NGGI TE+NYPY+ +   CRS   +SP VTIDGYE VPEN ENA+M+AVA+QPV V I A G DFQFY +ASL                        
Subjt:  IKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASL------------------------

Query:  --GVFEGYCGPSLDHAVVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
          GVF+GYCG  L+H VV IGYGTTE+ TDYW++RNSWG+ WG++GY+++KRG++   G CGI  EASYPIK
Subjt:  --GVFEGYCGPSLDHAVVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

XP_038885064.1 LOW QUALITY PROTEIN: vignain-like [Benincasa hispida]1.4e-11761.32Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLL+P LL+VL+SG +ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRF VFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTA
        A SN+S+Y+ L+                            AVN IK Q  C SCWAF+ VAAVEGIN+I+TN+LLSLSEQEL+DC+ R+ GCNGGFM  A
Subjt:  ASSNMSYYKNLY----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTA

Query:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG
         DFI+ NGGI TE++YPY+ S   CRS  ++SP V IDGYE +PEN E+A+M+AVA+QPV V I A+G DFQFYW+   GVF+GYCG  L+H VV IGYG
Subjt:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG

Query:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        TTED TDYWI+RNSWG+ WG++GY+++KRG++ P G CGIA EASYPIK
Subjt:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

TrEMBL top hitse value%identityAlignment
A0A0A0LMU4 Uncharacterized protein1.9e-11762.36Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVP LL+VLVSGL+ESFE  E +LA+E SLWQLY RW  HH ISR+ K+KHKRF+VFK+N  HVF VNQM+KPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL
        A SN+S+Y+ L+                           AVN +K Q  C SCWAF+ VAAVEGIN+IKTN+LLSLSEQEL+DC+ R+ GCNGGFM  A 
Subjt:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL

Query:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT
        DFIK NGGI TE++YPY+ S   CRS  ++SP V IDGYE VPEN E+A+M+AVA+QPV V I A+G DFQFY   S GVF+GYCG  L+H VV IGYGT
Subjt:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT

Query:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        TED TDYW++RNSWG+ WG+DGY+++KRG++   G CGIA EASYPIK
Subjt:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

A0A1S3BA70 vignain-like1.1e-12063.51Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KFLLVP LL+VLVSGL+ESFE  E +LA+E SLWQLY RW NHH ISR+ K+KHKRF+VFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL
        A SN+S+++ L+                           AVN IK Q TC SCWAF+ VAAVE IN+IKTN+LLSLSEQEL+DC+ R+ GCNGGFM  A 
Subjt:  ASSNMSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTAL

Query:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT
        DFIK NGGI TE++YPY+ S   CRS  ++SP V IDGYE VPEN E+A+M+AVA+QPV V I A+G DFQFYW+   GVF+GYCG  L+H VV IGYGT
Subjt:  DFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGT

Query:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        TED TDYWI+RNSWG+ WG+DGY+++KRG++ P G CGIA EASYPIK
Subjt:  TEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

A0A6J1GHN5 vignain-like7.6e-11460.46Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI K LLVPFL+ VL+SGL++SFE  E +LA++ SLW+LY RWS+HH ISR  K+KHKR+NVFK+NA HV  VNQMNKPYKL+LNKF DMSN EFV +Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTA
        A SN+++Y+ L+                            AV++IK+Q  C SCWAF+ VAAVEGINQIKTN+LLSLSEQEL+DC+ R+ GC GGFM TA
Subjt:  ASSNMSYYKNLY----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTA

Query:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG
         +FI+ NGGI +E+NYPY  +   CRS  M SP VTIDG+E VPEN ENA+M+AVA+QPV V I A G DFQFYW+   GVF+G CG  L+H VVVIGYG
Subjt:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG

Query:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        TT+  TDYW +RNSWG+ WG+DGY+++KRG++DP G CGI  EASYP+K
Subjt:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

A0A6J1K7P4 vignain-like1.0e-11859.52Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MAI KF+LVPFLL+VLVSGL ESFE  E +LA+E SLW+LY RW NHH ISR  K+KHKRFNVFK+N  HVF VNQMNKPYKL+LNKF DMSN EFV  Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY-------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTALDF
        A SN+S+Y+ L+                         AVN+IK Q  C SCWAF+ VAAVEGINQIKTN+LLSLSEQEL+DC+ R+ GCNGGFM  A DF
Subjt:  ASSNMSYYKNLY-------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTALDF

Query:  IKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASL------------------------
        IK NGGI TE+NYPY+ +   CRS  ++SP VTIDGYE VPEN ENA+M+AVA+QPV V I A G DFQFY +ASL                        
Subjt:  IKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASL------------------------

Query:  ---GVFEGYCGPSLDHAVVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
           GVF+GYCG  L+H VV IGYGTTE+ TDYW++RNSWG+ WG++GY+++KRG++   G CGI  EASYPIK
Subjt:  ---GVFEGYCGPSLDHAVVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

A0A6J1KIL0 vignain-like1.4e-11561.6Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        M I K LLVPFL+ VLVSGL++SFE  E +LA++ SLW+LY RWS+HH ISR  K+KHKR+NVFK+NA HV  VNQMNKPYKL+LNKF DMSN EFV LY
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKNLY----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTA
        A SN+++Y+ L+                            AVN+IK Q  C SCWAF+ VAAVEGINQIKTN+LLSLSEQEL+DC+ R+ GC GGFM TA
Subjt:  ASSNMSYYKNLY----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTA

Query:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG
         +FI+ NGGI +E+NYPY  +   CRS  M SP VTIDG+E VPEN ENA+M+AVA+QPV V I A G DFQFYW+   GVF+GYCG  L+H VVVIGYG
Subjt:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG

Query:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        TT+  TDYW +RNSWG+ WG+DGY+++KRG++DP G CGIA EASYP+K
Subjt:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

SwissProt top hitse value%identityAlignment
O65039 Vignain1.2e-10054.18Show/hide
Query:  IRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYAS
        ++KF+L+  L + LV  ++ESF+ HE +L SE SLW LY RW +HH +SRS  +K KRFNVFK NA HV   N+M+KPYKL+LNKF DM+N EF   Y+ 
Subjt:  IRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYAS

Query:  SNMSYYKN-----------LY----------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDR-RSAGCNGGFMNTALD
        S + +++            +Y                AV  +K+Q  C SCWAF+ + AVEGINQIKTNKL+SLSEQELVDCD  ++ GCNGG M+ A +
Subjt:  SNMSYYKN-----------LY----------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDR-RSAGCNGGFMNTALD

Query:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTT
        FIK+ GGITTE NYPY   D  C     N+P V+IDG+E+VPENDENA++KAVA+QPV V I A G+DFQFY   S GVF G CG  LDH V ++GYGTT
Subjt:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTT

Query:  EDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
         D T YW ++NSWG  WG+ GY++++RGI D  G CGIA EASYPIK
Subjt:  EDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

P12412 Vignain7.8e-10053.71Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MA++K L V  L + LV G++ SF+ HE DL SE SLW LY RW +HH +SRS  +KHKRFNVFK N  HV   N+M+KPYKL+LNKF DM+N EF   Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSYYKN-----------LY----------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDR-RSAGCNGGFMNTA
        A S ++++K            +Y                AV ++K+Q  C SCWAF+ + AVEGINQIKTNKL+SLSEQELVDCD+  + GCNGG M +A
Subjt:  ASSNMSYYKN-----------LY----------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDR-RSAGCNGGFMNTA

Query:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG
         +FIK+ GGITTE NYPY   +  C    +N   V+IDG+E+VP NDENA++KAVA+QPV V I A G+DFQFY   S GVF G C   L+H V ++GYG
Subjt:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG

Query:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN
        TT D T+YWI+RNSWG  WG+ GY++++R I    G CGIA  ASYPIKN
Subjt:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN

P25803 Vignain1.3e-9752.57Show/hide
Query:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY
        MA +K L V  L   LV G++ SF+ H+ DLASE SLW LY RW +HH +SRS  +KHKRFNVFK N  HV   N+M+KPYKL+LNKF DM+N EF   Y
Subjt:  MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLY

Query:  ASSNMSY---------------------------YKNLYAVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDR-RSAGCNGGFMNTA
        A S +++                           ++   AV ++K+Q  C SCWAF+ V AVEGINQIKTNKL++LSEQELVDCD+  + GCNGG M +A
Subjt:  ASSNMSY---------------------------YKNLYAVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDR-RSAGCNGGFMNTA

Query:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG
         +FIK+ GGITTE NYPY   +  C +  +N   V+IDG+E+VP NDE+A++KAVA+QPV V I A G+DFQFY   S GVF G C   L+H V ++GYG
Subjt:  LDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYG

Query:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN
        TT D T+YWI+RNSWG  WG+ GY++++R I    G CGIA   SYPIKN
Subjt:  TTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP11.2e-9550.87Show/hide
Query:  KFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN
        K  +V  L M++V   ++  + H  D+ SE SLW+LY RW +HH ++RS ++K KRFNVFK N +H+ + N+ +K YKL+LNKFGDM+++EF + YA SN
Subjt:  KFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN

Query:  MSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFI
        + +++                              AV  +KNQ  C SCWAF+ V AVEGINQI+T KL SLSEQELVDCD  ++ GCNGG M+ A +FI
Subjt:  MSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFI

Query:  KENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTED
        KE GG+T+E  YPY  SD+ C +   N+P V+IDG+EDVP+N E+ +MKAVA+QPV V I A G+DFQFY   S GVF G CG  L+H V V+GYGTT D
Subjt:  KENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTED

Query:  ETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN
         T YWI++NSWG  WG+ GY++++RGI+   G CGIA EASYP+KN
Subjt:  ETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN

Q9STL5 KDEL-tailed cysteine endopeptidase CEP31.2e-9250.29Show/hide
Query:  LLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMS
        +L+ FL ++     S+ F+  E +L +E ++W+LY RW  HH +SR+  +  KRFNVF+ N  HV + N+ NKPYKL++N+F D+++ EF   YA SN+ 
Subjt:  LLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMS

Query:  YYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFIKE
        +++ L                            AV E+KNQ  C SCWAF+ VAAVEGIN+I+TNKL+SLSEQELVDCD   + GC GG M  A +FIK 
Subjt:  YYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFIKE

Query:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTEDE
        NGGI TE+ YPY +SD QFCR+  +    VTIDG+E VPENDE  ++KAVAHQPV V I A  +DFQ Y   S GVF G CG  L+H VV++GYG T++ 
Subjt:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTEDE

Query:  TDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        T YWI+RNSWG  WG+ GY++I+RGI +  G+CGIA EASYP K
Subjt:  TDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

Arabidopsis top hitse value%identityAlignment
AT3G48340.1 Cysteine proteinases superfamily protein7.3e-9351.01Show/hide
Query:  KFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN
        K LL+ FL  +++   +  F+  + ++ SE  L  LY RW +HH + RS  ++ KRFNVF+ N  HV   N+ N+ YKL+LNKF D++ +EF   Y  SN
Subjt:  KFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN

Query:  MSYYKNLY-----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALD
        + +++ L                              AV EIKNQ  C SCWAF+ VAAVEGIN+IKTNKL+SLSEQELVDCD +++ GCNGG M  A +
Subjt:  MSYYKNLY-----------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALD

Query:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTT
        FIK+NGGITTED+YPY   D  C +   N   VTIDG+EDVPENDENA++KAVA+QPV V I A  +DFQFY   S GVF G CG  L+H V  +GYG +
Subjt:  FIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTT

Query:  EDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        E    YWI+RNSWG  WG+ GY+KI+R I +P G+CGIA EASYPIK
Subjt:  EDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

AT3G48350.1 Cysteine proteinases superfamily protein8.6e-9450.29Show/hide
Query:  LLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMS
        +L+ FL ++     S+ F+  E +L +E ++W+LY RW  HH +SR+  +  KRFNVF+ N  HV + N+ NKPYKL++N+F D+++ EF   YA SN+ 
Subjt:  LLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMS

Query:  YYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFIKE
        +++ L                            AV E+KNQ  C SCWAF+ VAAVEGIN+I+TNKL+SLSEQELVDCD   + GC GG M  A +FIK 
Subjt:  YYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFIKE

Query:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTEDE
        NGGI TE+ YPY +SD QFCR+  +    VTIDG+E VPENDE  ++KAVAHQPV V I A  +DFQ Y   S GVF G CG  L+H VV++GYG T++ 
Subjt:  NGGITTEDNYPYYNSD-QFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTEDE

Query:  TDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        T YWI+RNSWG  WG+ GY++I+RGI +  G+CGIA EASYP K
Subjt:  TDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

AT4G36880.1 cysteine proteinase15.8e-7443.98Show/hide
Query:  LLVPFLLMVLVSG----LSESFEL-HENDLASERSLWQLYRRWSNHHEISRSH-----KQKHKRFNVFKQNARHVFKVNQMNK--PYKLRLNKFGDMSND
        LL+ ++++ L SG    +++  +L  +    ++  +  +Y +WS  H  + ++       + KRFN+FK N R +   N+ NK   YKL L KF D++ND
Subjt:  LLVPFLLMVLVSG----LSESFEL-HENDLASERSLWQLYRRWSNHHEISRSH-----KQKHKRFNVFKQNARHVFKVNQMNK--PYKLRLNKFGDMSND

Query:  EFVKLYASS--------------NMSY--------------YKNLYAVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRR-SAGCN
        E+ KLY  +              N  Y              ++   AVN IK+Q TC SCWAF+  AAVEGIN+I T +L+SLSEQELVDCD+  + GCN
Subjt:  EFVKLYASS--------------NMSY--------------YKNLYAVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRR-SAGCN

Query:  GGFMNTALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHA
        GG M+ A  FI +NGG+ TE +YPY      C S++ NS  V+IDGYEDVP  DE A+ KA+++QPV V I A G  FQ Y     G+F G CG +LDHA
Subjt:  GGFMNTALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHA

Query:  VVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGI-QDPSGQCGIATEASYPIK
        VV +GYG +E+  DYWI+RNSWG RWG++GY++++R +    SG+CGIA EASYP+K
Subjt:  VVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGI-QDPSGQCGIATEASYPIK

AT5G43060.1 Granulin repeat cysteine protease family protein7.6e-7442.78Show/hide
Query:  LVPFLLMVLVSGLS-----------ESFELHENDLASERSLWQLYRRWSNHHEISRSHK-----QKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDM
        L P +L++ + G+S           E+  +      S+  + ++Y  W   H   + ++     +K +RF +FK N R + + N  N  YKL L +F D+
Subjt:  LVPFLLMVLVSGLS-----------ESFELHENDLASERSLWQLYRRWSNHHEISRSHK-----QKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDM

Query:  SNDEFVKLYASSNMSY----------------------YKNLYAVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRR-SAGCNGGF
        +N+E+  +Y  +  +                       ++   AV ++K+Q +C SCWAF+ + AVEGIN+I T  L+SLSEQELVDCD   + GCNGG 
Subjt:  SNDEFVKLYASSNMSY----------------------YKNLYAVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRR-SAGCNGGF

Query:  MNTALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVV
        M+ A +FI +NGGI TE +YPY  +D  C     N+  VTID YEDVPEN E ++ KA+AHQP+ V I A G  FQ Y   S GVF+G CG  LDH VV 
Subjt:  MNTALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVV

Query:  IGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK
        +GYG TE+  DYWI+RNSWG RWG+ GY+K+ R I+ P+G+CGIA EASYPIK
Subjt:  IGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIK

AT5G50260.1 Cysteine proteinases superfamily protein8.3e-9750.87Show/hide
Query:  KFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN
        K  +V  L M++V   ++  + H  D+ SE SLW+LY RW +HH ++RS ++K KRFNVFK N +H+ + N+ +K YKL+LNKFGDM+++EF + YA SN
Subjt:  KFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSN

Query:  MSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFI
        + +++                              AV  +KNQ  C SCWAF+ V AVEGINQI+T KL SLSEQELVDCD  ++ GCNGG M+ A +FI
Subjt:  MSYYKNLY---------------------------AVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCD-RRSAGCNGGFMNTALDFI

Query:  KENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTED
        KE GG+T+E  YPY  SD+ C +   N+P V+IDG+EDVP+N E+ +MKAVA+QPV V I A G+DFQFY   S GVF G CG  L+H V V+GYGTT D
Subjt:  KENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDENAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTED

Query:  ETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN
         T YWI++NSWG  WG+ GY++++RGI+   G CGIA EASYP+KN
Subjt:  ETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATAAGAAAATTCCTTTTGGTTCCATTTCTTTTAATGGTTCTAGTTTCAGGATTGAGTGAGAGCTTTGAATTACATGAAAATGATCTAGCATCAGAAAGAAGCCT
ATGGCAACTATATCGGAGATGGAGCAACCACCATGAAATCTCAAGGAGTCATAAGCAGAAGCACAAACGTTTCAATGTGTTTAAACAAAATGCGAGGCATGTGTTCAAAG
TGAACCAAATGAACAAACCATACAAATTGAGGTTGAACAAGTTCGGAGATATGTCAAATGATGAGTTTGTGAAGTTATATGCTTCATCTAATATGAGTTACTACAAAAAC
TTATATGCTGTGAATGAAATCAAGAATCAATACACATGTCAGAGTTGTTGGGCGTTTGCAGTAGTGGCTGCTGTTGAAGGAATCAACCAAATTAAAACGAATAAGCTATT
ATCTCTATCAGAACAAGAATTGGTCGATTGTGATCGAAGAAGTGCGGGTTGTAATGGAGGATTCATGAACACTGCTTTAGATTTTATAAAAGAAAATGGTGGAATCACCA
CTGAGGACAACTATCCCTACTATAATTCAGACCAATTTTGTCGTTCATACATGATGAATTCACCGAGAGTAACAATTGATGGATATGAAGATGTGCCTGAAAACGACGAG
AATGCTATGATGAAAGCTGTTGCACATCAACCAGTGATAGTTGGCATAGCCGCCTCAGGAGCAGATTTCCAATTCTATTGGAAGGCAAGTTTGGGAGTGTTCGAAGGATA
TTGTGGACCATCACTTGATCACGCAGTGGTGGTGATTGGCTACGGAACAACTGAAGACGAAACAGATTATTGGATTTTGAGGAACTCGTGGGGAATTAGATGGGGAGACG
ATGGTTACATGAAGATCAAAAGAGGGATCCAAGATCCATCGGGTCAATGTGGCATAGCCACAGAAGCATCTTATCCCATTAAGAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCATAAGAAAATTCCTTTTGGTTCCATTTCTTTTAATGGTTCTAGTTTCAGGATTGAGTGAGAGCTTTGAATTACATGAAAATGATCTAGCATCAGAAAGAAGCCT
ATGGCAACTATATCGGAGATGGAGCAACCACCATGAAATCTCAAGGAGTCATAAGCAGAAGCACAAACGTTTCAATGTGTTTAAACAAAATGCGAGGCATGTGTTCAAAG
TGAACCAAATGAACAAACCATACAAATTGAGGTTGAACAAGTTCGGAGATATGTCAAATGATGAGTTTGTGAAGTTATATGCTTCATCTAATATGAGTTACTACAAAAAC
TTATATGCTGTGAATGAAATCAAGAATCAATACACATGTCAGAGTTGTTGGGCGTTTGCAGTAGTGGCTGCTGTTGAAGGAATCAACCAAATTAAAACGAATAAGCTATT
ATCTCTATCAGAACAAGAATTGGTCGATTGTGATCGAAGAAGTGCGGGTTGTAATGGAGGATTCATGAACACTGCTTTAGATTTTATAAAAGAAAATGGTGGAATCACCA
CTGAGGACAACTATCCCTACTATAATTCAGACCAATTTTGTCGTTCATACATGATGAATTCACCGAGAGTAACAATTGATGGATATGAAGATGTGCCTGAAAACGACGAG
AATGCTATGATGAAAGCTGTTGCACATCAACCAGTGATAGTTGGCATAGCCGCCTCAGGAGCAGATTTCCAATTCTATTGGAAGGCAAGTTTGGGAGTGTTCGAAGGATA
TTGTGGACCATCACTTGATCACGCAGTGGTGGTGATTGGCTACGGAACAACTGAAGACGAAACAGATTATTGGATTTTGAGGAACTCGTGGGGAATTAGATGGGGAGACG
ATGGTTACATGAAGATCAAAAGAGGGATCCAAGATCCATCGGGTCAATGTGGCATAGCCACAGAAGCATCTTATCCCATTAAGAACTAA
Protein sequenceShow/hide protein sequence
MAIRKFLLVPFLLMVLVSGLSESFELHENDLASERSLWQLYRRWSNHHEISRSHKQKHKRFNVFKQNARHVFKVNQMNKPYKLRLNKFGDMSNDEFVKLYASSNMSYYKN
LYAVNEIKNQYTCQSCWAFAVVAAVEGINQIKTNKLLSLSEQELVDCDRRSAGCNGGFMNTALDFIKENGGITTEDNYPYYNSDQFCRSYMMNSPRVTIDGYEDVPENDE
NAMMKAVAHQPVIVGIAASGADFQFYWKASLGVFEGYCGPSLDHAVVVIGYGTTEDETDYWILRNSWGIRWGDDGYMKIKRGIQDPSGQCGIATEASYPIKN