; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G03170 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G03170
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCysteine Protease
Genome locationClcChr08:7409171..7411107
RNA-Seq ExpressionClc08G03170
SyntenyClc08G03170
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025660 - Cysteine peptidase, histidine active site
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020106.1 hypothetical protein SDJN02_16788, partial [Cucurbita argyrosperma subsp. argyrosperma]7.7e-10157.27Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M ++K +LVP  LI LI GL +SFEF+ +ELE+  SLWKLY +WS HH ISR   E+  R+ VF+ENA HV  VNQMNK YKLKLN+F DMSN+EF ++Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI +YR L   R+     G  GFMYE+  +LP  IDWR  GAV+ +K Q  G  G      A+ AVEGINQI+T QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVSGVFNGNCGVVLNHAVVAVGY-----GPDYWIMK
        G+META++FIR+NGGI +E NYPY   +G C SS     +VTIDG+++VP  NE+ALMQAVANQPVSGVF+G+CG  LNH VV +GY     G DYW ++
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVSGVFNGNCGVVLNHAVVAVGY-----GPDYWIMK

Query:  NSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        NSWG GWGE GY+RM+RG   P GLCGI ++AS+P+K
Subjt:  NSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

XP_004142960.1 vignain [Cucumis sativus]3.1e-10255.97Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M + KF+LVPL+LI L+ GL ESFEF+ KEL ++ESLW+LY +W  HH ISR   E+  RF VF+EN  HV  VNQM+K YKLKLN+F DMSN+EF + Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI++YR L + R+  G     GFMYEQ  +LP S+DWR  GAV  VK Q  G  G      ++ AVEGIN+I+T QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G+ME AFDFI++NGGI TE +YPY+  +G C SS     +V IDGY++VP  NEDALMQAVANQPVS               GVF+G CG  LNH VVA+
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        GY     G DYW+++NSWG GWGE GYVRM+RG     GLCGIA++AS+PIK
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

XP_008444390.1 PREDICTED: vignain-like [Cucumis melo]1.8e-10255.97Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M + KF+LVPL+LI L+ GL ESFEF+ KEL ++ESLW+LY +W  HH ISR   E+  RF VF+EN  HV  VNQMNK YKLKLN+F DMSN+EF + Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI+++R L   R+  G     GFMYEQ  +LP S+DWR  GAV  +K Q  G  G      ++ AVE IN+I+T QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G+ME AFDFI++NGGI TE +YPY+  +G C SS     +V IDGY++VP  NEDALMQAVANQPVS               GVF+G CG  LNH VVA+
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        GY     G DYWI++NSWG GWGE GYVRM+RG   P GLCGIA++AS+PIK
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

XP_038885064.1 LOW QUALITY PROTEIN: vignain-like [Benincasa hispida]9.7e-10457.1Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M + KF+L+PL+LI LI G  ESFEF+ KEL ++ESLWKLY +W  HH ISR   E+  RFKVF+EN  HV  VNQMNK YKLKLN+F DMSN+EF + Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI++YR L   R+     G  GFMYE+  +LP  IDWR  GAV G+K Q  G  G      ++ AVEGIN+IRT QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G+ME AFDFIR+NGGI TE +YPY+  +G C SS     +V IDGY+++P  NEDALMQAVANQPVS               GVF+G CG  LNH VVA+
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        GY     G DYWI++NSWG GWGE GYVRM+RG   P GLCGIA++AS+PIK
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

XP_038885798.1 vignain-like [Benincasa hispida]3.5e-10157.14Show/hide
Query:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN
        KF++VPLVLI LI GLCESF FEG ELES+ESLW LY++WS HH ISR   E   RFKVF++NAK+V KVNQMNKS KLKLN+F DMS+ EF + + NSN
Subjt:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN

Query:  INYYRNL-KKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIGC----AIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGGGYME
        I YY+NL  K ++A GGG   GFMYE   +LP SIDWR  GAVT +K+Q  G   C    A+  VEGI+QI+T +LV LSEQE+VDCD  DGGCGGG+ +
Subjt:  INYYRNL-KKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIGC----AIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGGGYME

Query:  TAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGN--CGVVLNHAVVAVGY
        +AF+F+ +N GITTE NYPY A+  YC S  +    VTIDGY+NVPP NE+AL +AVANQPV+               G+F  N  CG  ++H VV VGY
Subjt:  TAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGN--CGVVLNHAVVAVGY

Query:  -----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
             G DYWI+KNSWG  WG  GY++MQRGA  P  +CG+A++ S+PIK
Subjt:  -----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

TrEMBL top hitse value%identityAlignment
A0A0A0LMU4 Uncharacterized protein1.5e-10255.97Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M + KF+LVPL+LI L+ GL ESFEF+ KEL ++ESLW+LY +W  HH ISR   E+  RF VF+EN  HV  VNQM+K YKLKLN+F DMSN+EF + Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI++YR L + R+  G     GFMYEQ  +LP S+DWR  GAV  VK Q  G  G      ++ AVEGIN+I+T QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G+ME AFDFI++NGGI TE +YPY+  +G C SS     +V IDGY++VP  NEDALMQAVANQPVS               GVF+G CG  LNH VVA+
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        GY     G DYW+++NSWG GWGE GYVRM+RG     GLCGIA++AS+PIK
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

A0A1S3BA70 vignain-like8.9e-10355.97Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M + KF+LVPL+LI L+ GL ESFEF+ KEL ++ESLW+LY +W  HH ISR   E+  RF VF+EN  HV  VNQMNK YKLKLN+F DMSN+EF + Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI+++R L   R+  G     GFMYEQ  +LP S+DWR  GAV  +K Q  G  G      ++ AVE IN+I+T QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G+ME AFDFI++NGGI TE +YPY+  +G C SS     +V IDGY++VP  NEDALMQAVANQPVS               GVF+G CG  LNH VVA+
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        GY     G DYWI++NSWG GWGE GYVRM+RG   P GLCGIA++AS+PIK
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

A0A6J1GHN5 vignain-like1.9e-9754.55Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M ++K +LVP  LI LI GL +SFEF+ +EL +  SLWKLY +WS HH ISR   E+  R+ VF+ENA HV  VNQMNK YKLKLN+F DMSN+EF ++Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI +YR L   R+     G  GFMYE+  +LP  IDWR  GAV+ +K Q  G  G      A+ AVEGINQI+T QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G+META++FIR+NGGI +E NYPY   +G C SS     +VTIDG+++VP  NE+ALMQAVANQPVS               GVF+G+CG  LNH VV +
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        GY     G DYW ++NSWG GWGE GY+RM+RG   P GLCGI ++AS+P+K
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

A0A6J1KIL0 vignain-like3.3e-9754.83Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M ++K +LVP  LI L+ GL +SFEF+ +EL +  SLWKLY +WS HH ISR   E+  R+ VF+ENA HV  VNQMNK YKLKLN+F DMSN+EF +LY
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A SNI +YR L   R+     G  GFMYE+  +LP  IDWR  GAV  +K Q  G  G      A+ AVEGINQI+T QL+ LSEQEL+DC+  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G+META++FIR+NGGI +E NYPY   +G C SS     +VTIDG+++VP  NE+ALMQAVANQPVS               GVF+G CG  LNH VV +
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        GY     G DYW ++NSWG GWGE GY+RM+RG   P GLCGIA++AS+P+K
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

A0A7N2L9B0 Uncharacterized protein1.9e-9753.13Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M L KF LV L L  L+L L +  +F+ ++L S+E LW+LY KW  HH  S+   E+  RF VF+EN KHVHKVN+MNK YKLKLN+F DM+N EF+S Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG
        A+S ++++R  +  R+        GFM+E++ NLP S+DWR  GAVTGVK Q  G  G       + AVEGINQI+TK+LVPLSEQELVDCD  + GC G
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGG

Query:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV
        G ME AF+FI+Q GG+TTE NYPY A+   C SS     +V IDGY+NVP  +E+ALM+AVANQPVS               GVF G+CG  LNH V  V
Subjt:  GYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAV

Query:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK-----KKKIVSKEEI
        GY     G  YWI+KNSWG  WGE GY+RMQRG N   GLCGIA++AS+P+K     KKK   K+E+
Subjt:  GY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK-----KKKIVSKEEI

SwissProt top hitse value%identityAlignment
O65039 Vignain2.3e-9252.27Show/hide
Query:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN
        + KFIL+ L L  L+L + ESF+F  KELES+ESLW LY +W  HH +SR+  E+  RF VF+ NA HVH  N+M+K YKLKLN+F DM+N EF + Y+ 
Subjt:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN

Query:  SNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGG
        S + ++R  +     GG  G   FMYE+V  +P S+DWR  GAVT VK Q  G  G       I AVEGINQI+T +LV LSEQELVDCD   + GC GG
Subjt:  SNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGG

Query:  YMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVG
         M+ AF+FI+Q GGITTEANYPY A  G C  S      V+IDG++NVP  +E+AL++AVANQPVS               GVF G+CG  L+H V  VG
Subjt:  YMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVG

Query:  Y-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIKK
        Y     G  YW +KNSWG  WGE GY+RM+RG +   GLCGIA++AS+PIKK
Subjt:  Y-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIKK

P12412 Vignain9.8e-9151.27Show/hide
Query:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY
        M + K + V L L  L+LG+  SF+F  K+LES+ESLW LY +W  HH +SR+  E+  RF VF+ N  HVH  N+M+K YKLKLN+F DM+N EF S Y
Subjt:  MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLY

Query:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDG-GCG
        A S +N+++  +     G   G   FMYE+V ++P S+DWR  GAVT VK Q  G  G       I AVEGINQI+T +LV LSEQELVDCD+ +  GC 
Subjt:  ANSNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDG-GCG

Query:  GGYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVA
        GG ME+AF+FI+Q GGITTE+NYPY A++G C  S      V+IDG++NVP  +E+AL++AVANQPVS               GVF G+C   LNH V  
Subjt:  GGYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVA

Query:  VGY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        VGY     G +YWI++NSWG  WGE GY+RMQR  +   GLCGIA+ AS+PIK
Subjt:  VGY-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

P25803 Vignain3.4e-9151Show/hide
Query:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN
        K +L  ++   L+LG+  SF+F  K+L S+ESLW LY +W  HH +SR+  E+  RF VF+ N  HVH  N+M+K YKLKLN+F DM+N EF S YA S 
Subjt:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN

Query:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDG-GCGGGYM
        +N+ R  + T    G      FMYE+V+++PPS+DWR  GAVT VK Q  G  G       + AVEGINQI+T +LV LSEQELVDCD+ +  GC GG M
Subjt:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCDQMDG-GCGGGYM

Query:  ETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVGY-
        E+AF+FI+Q GGITTE+NYPY A++G C +S      V+IDG++NVP  +EDAL++AVANQPVS               GVF G+C   LNH V  VGY 
Subjt:  ETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVGY-

Query:  ----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
            G +YWI++NSWG  WGE GY+RMQR  +   GLCGIA+  S+PIK
Subjt:  ----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP11.1e-8648.15Show/hide
Query:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN
        + +FI++ L ++ ++L   +  +F  K++ES+ SLW+LY +W  HH ++R+  E+  RF VF+ N KH+H+ N+ +KSYKLKLN+FGDM++ EF   YA 
Subjt:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN

Query:  SNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGG
        SNI ++R  +  +KA      K FMY  V  LP S+DWR  GAVT VK+Q  G  G       + AVEGINQIRTK+L  LSEQELVDCD   + GC GG
Subjt:  SNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGG

Query:  YMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVG
         M+ AF+FI++ GG+T+E  YPY A    C ++     +V+IDG+++VP  +ED LM+AVANQPVS               GVF G CG  LNH V  VG
Subjt:  YMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVG

Query:  Y-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        Y     G  YWI+KNSWG  WGE GY+RMQRG     GLCGIA++AS+P+K
Subjt:  Y-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

Q9STL5 KDEL-tailed cysteine endopeptidase CEP31.5e-8647.62Show/hide
Query:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN
        K   + L+    +L   + F+F+ KELE++E++WKLY +W  HH++SR   E   RF VF  N  HVH+ N+ NK YKLK+NRF D+++ EF S YA SN
Subjt:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN

Query:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKS-QDLGAIGC--AIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYMET
        + ++R L+  ++  G     GFMYE V  +P S+DWR  GAVT VK+ QD G+      + AVEGIN+IRT +LV LSEQELVDCD + + GC GG ME 
Subjt:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKS-QDLGAIGC--AIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYMET

Query:  AFDFIRQNGGITTEANYPYNAKQ-GYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVGY--
        AF+FI+ NGGI TE  YPY++    +C ++S     VTIDG+++VP  +E+ L++AVA+QPVS               GVF G CG  LNH VV VGY  
Subjt:  AFDFIRQNGGITTEANYPYNAKQ-GYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVGY--

Query:  ---GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIKKKKIVSKEE
           G  YWI++NSWG  WGEGGYVR++RG +   G CGIA++AS+P K     S  E
Subjt:  ---GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIKKKKIVSKEE

Arabidopsis top hitse value%identityAlignment
AT1G47128.1 Granulin repeat cysteine protease family protein3.6e-6442.86Show/hide
Query:  SKESLWKLYRKWSIHHNISRTP---MERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSNINYYRNLKKTRKAGGGGGGKGFMYE
        S+  +  +Y  W + H  +++    +E+  RF++F++N + V + N+ N SY+L L RF D++N E+ S Y  +         K  K G       +   
Subjt:  SKESLWKLYRKWSIHHNISRTP---MERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSNINYYRNLKKTRKAGGGGGGKGFMYE

Query:  QVLNLPPSIDWRTMGAVTGVKSQDLGAIGC----AIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYMETAFDFIRQNGGITTEANYPYNAKQG
            LP SIDWR  GAV  VK Q  G   C     IGAVEGINQI T  L+ LSEQELVDCD   + GC GG M+ AF+FI +NGGI T+ +YPY    G
Subjt:  QVLNLPPSIDWRTMGAVTGVKSQDLGAIGC----AIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYMETAFDFIRQNGGITTEANYPYNAKQG

Query:  YCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPV---------------SGVFNGNCGVVLNHAVVAVGY----GPDYWIMKNSWGAGWGEGGYVRM
         C        +VTID Y++VP Y+E++L +AVA+QP+               SG+F+G+CG  L+H VVAVGY    G DYWI++NSWG  WGE GY+RM
Subjt:  YCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPV---------------SGVFNGNCGVVLNHAVVAVGY----GPDYWIMKNSWGAGWGEGGYVRM

Query:  QRGANPPAGLCGIAIDASFPIK
         R     +G CGIAI+ S+PIK
Subjt:  QRGANPPAGLCGIAIDASFPIK

AT3G19390.1 Granulin repeat cysteine protease family protein5.6e-6541.26Show/hide
Query:  ILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPM-ERFNRFKVFEENAKHVHKVNQM-NKSYKLKLNRFGDMSNFEFASLYANSN
        ++  ++LI L LG   + E    E E++    ++Y +W + +  +   + E+  RF++F++N K V + + + N++Y++ L RF D++N EF ++Y  S 
Subjt:  ILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPM-ERFNRFKVFEENAKHVHKVNQM-NKSYKLKLNRFGDMSNFEFASLYANSN

Query:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYM
        +   R   K         G+ ++Y+   +LP +IDWR  GAV  VK Q  G+ G      AIGAVEGINQI+T +L+ LSEQELVDCD   + GCGGG M
Subjt:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYM

Query:  ETAFDFIRQNGGITTEANYPYNAKQ-GYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPV---------------SGVFNGNCGVVLNHAVVAVGY
        + AF FI +NGGI TE +YPY A     C S      +VTIDGY++VP  +E +L +A+ANQP+               SGVF G CG  L+H VVAVGY
Subjt:  ETAFDFIRQNGGITTEANYPYNAKQ-GYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPV---------------SGVFNGNCGVVLNHAVVAVGY

Query:  ----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
            G DYWI++NSWG+ WGE GY +++R     +G CG+A+ AS+P K
Subjt:  ----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

AT3G48340.1 Cysteine proteinases superfamily protein5.7e-8648.01Show/hide
Query:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN
        + K +L+ L  + ++   C  F+++ KE+ES+E L  LY +W  HH++ R+  ER  RF VF  N  HVH  N+ N+SYKLKLN+F D++  EF + Y  
Subjt:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN

Query:  SNINYYRNLKKTRKAGGGGGGKGFMY--EQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCG
        SNI ++R L+  ++     G K FMY  E +  LP S+DWR  GAVT +K+Q  G  G       + AVEGIN+I+T +LV LSEQELVDCD + + GC 
Subjt:  SNINYYRNLKKTRKAGGGGGGKGFMY--EQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCG

Query:  GGYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVA
        GG ME AF+FI++NGGITTE +YPY    G C +S     LVTIDG+++VP  +E+AL++AVANQPVS               GVF G+CG  LNH V A
Subjt:  GGYMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVA

Query:  VGYGPD----YWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        VGYG +    YWI++NSWGA WGEGGY++++R  + P G CGIA++AS+PIK
Subjt:  VGYGPD----YWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK

AT3G48350.1 Cysteine proteinases superfamily protein1.0e-8747.62Show/hide
Query:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN
        K   + L+    +L   + F+F+ KELE++E++WKLY +W  HH++SR   E   RF VF  N  HVH+ N+ NK YKLK+NRF D+++ EF S YA SN
Subjt:  KFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSN

Query:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKS-QDLGAIGC--AIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYMET
        + ++R L+  ++  G     GFMYE V  +P S+DWR  GAVT VK+ QD G+      + AVEGIN+IRT +LV LSEQELVDCD + + GC GG ME 
Subjt:  INYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKS-QDLGAIGC--AIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGGYMET

Query:  AFDFIRQNGGITTEANYPYNAKQ-GYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVGY--
        AF+FI+ NGGI TE  YPY++    +C ++S     VTIDG+++VP  +E+ L++AVA+QPVS               GVF G CG  LNH VV VGY  
Subjt:  AFDFIRQNGGITTEANYPYNAKQ-GYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVGY--

Query:  ---GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIKKKKIVSKEE
           G  YWI++NSWG  WGEGGYVR++RG +   G CGIA++AS+P K     S  E
Subjt:  ---GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIKKKKIVSKEE

AT5G50260.1 Cysteine proteinases superfamily protein8.0e-8848.15Show/hide
Query:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN
        + +FI++ L ++ ++L   +  +F  K++ES+ SLW+LY +W  HH ++R+  E+  RF VF+ N KH+H+ N+ +KSYKLKLN+FGDM++ EF   YA 
Subjt:  LNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYAN

Query:  SNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGG
        SNI ++R  +  +KA      K FMY  V  LP S+DWR  GAVT VK+Q  G  G       + AVEGINQIRTK+L  LSEQELVDCD   + GC GG
Subjt:  SNINYYRNLKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIG-----CAIGAVEGINQIRTKQLVPLSEQELVDCD-QMDGGCGGG

Query:  YMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVG
         M+ AF+FI++ GG+T+E  YPY A    C ++     +V+IDG+++VP  +ED LM+AVANQPVS               GVF G CG  LNH V  VG
Subjt:  YMETAFDFIRQNGGITTEANYPYNAKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVS---------------GVFNGNCGVVLNHAVVAVG

Query:  Y-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK
        Y     G  YWI+KNSWG  WGE GY+RMQRG     GLCGIA++AS+P+K
Subjt:  Y-----GPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCTCAACAAATTTATTCTAGTTCCTCTTGTTTTGATTCCTCTAATTTTAGGATTGTGTGAGAGCTTTGAATTTGAAGGAAAAGAGTTGGAATCAAAAGAAAGTCT
ATGGAAACTCTACAGGAAATGGAGTATTCACCATAACATCTCAAGGACACCAATGGAGAGGTTTAACCGATTCAAAGTGTTTGAAGAGAATGCGAAGCATGTGCACAAAG
TGAATCAGATGAACAAATCCTACAAGTTGAAGTTGAATAGATTTGGTGATATGTCAAATTTTGAGTTTGCGAGCTTATATGCTAACTCGAATATTAACTACTATAGAAAC
TTAAAAAAAACACGAAAGGCTGGTGGCGGTGGCGGTGGCAAAGGATTCATGTATGAGCAAGTCCTGAATCTTCCACCTTCTATTGATTGGAGAACTATGGGAGCTGTCAC
TGGTGTGAAGTCTCAAGACTTGGGTGCTATAGGATGTGCCATAGGTGCAGTGGAAGGAATAAACCAAATAAGAACAAAGCAATTAGTACCTCTGTCAGAACAAGAGCTTG
TGGATTGTGATCAAATGGATGGAGGTTGTGGTGGAGGATACATGGAAACTGCTTTTGATTTCATAAGGCAAAATGGTGGAATTACAACTGAGGCTAATTATCCCTACAAT
GCTAAACAAGGCTATTGTACCTCTAGCTCATCTAGGATGAATTTAGTCACAATTGATGGATATCAGAACGTACCTCCATACAACGAGGATGCTTTGATGCAAGCTGTGGC
GAACCAACCAGTGTCCGGAGTGTTTAATGGAAACTGTGGAGTGGTGCTTAATCATGCAGTGGTAGCAGTTGGATATGGACCAGATTATTGGATAATGAAGAACTCGTGGG
GTGCTGGATGGGGAGAAGGAGGTTATGTAAGGATGCAACGAGGAGCGAATCCACCAGCTGGGCTATGTGGAATAGCCATTGATGCCTCCTTTCCCATTAAGAAGAAGAAG
ATCGTGAGTAAGGAAGAGATTTTGTTCGCCATCAACAGCGTTCGGTTCGCTATTCTGTACCGGCGTAGGTTTCTTCGTCTTGCACTGGGTCTGTACGCCATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTCTCAACAAATTTATTCTAGTTCCTCTTGTTTTGATTCCTCTAATTTTAGGATTGTGTGAGAGCTTTGAATTTGAAGGAAAAGAGTTGGAATCAAAAGAAAGTCT
ATGGAAACTCTACAGGAAATGGAGTATTCACCATAACATCTCAAGGACACCAATGGAGAGGTTTAACCGATTCAAAGTGTTTGAAGAGAATGCGAAGCATGTGCACAAAG
TGAATCAGATGAACAAATCCTACAAGTTGAAGTTGAATAGATTTGGTGATATGTCAAATTTTGAGTTTGCGAGCTTATATGCTAACTCGAATATTAACTACTATAGAAAC
TTAAAAAAAACACGAAAGGCTGGTGGCGGTGGCGGTGGCAAAGGATTCATGTATGAGCAAGTCCTGAATCTTCCACCTTCTATTGATTGGAGAACTATGGGAGCTGTCAC
TGGTGTGAAGTCTCAAGACTTGGGTGCTATAGGATGTGCCATAGGTGCAGTGGAAGGAATAAACCAAATAAGAACAAAGCAATTAGTACCTCTGTCAGAACAAGAGCTTG
TGGATTGTGATCAAATGGATGGAGGTTGTGGTGGAGGATACATGGAAACTGCTTTTGATTTCATAAGGCAAAATGGTGGAATTACAACTGAGGCTAATTATCCCTACAAT
GCTAAACAAGGCTATTGTACCTCTAGCTCATCTAGGATGAATTTAGTCACAATTGATGGATATCAGAACGTACCTCCATACAACGAGGATGCTTTGATGCAAGCTGTGGC
GAACCAACCAGTGTCCGGAGTGTTTAATGGAAACTGTGGAGTGGTGCTTAATCATGCAGTGGTAGCAGTTGGATATGGACCAGATTATTGGATAATGAAGAACTCGTGGG
GTGCTGGATGGGGAGAAGGAGGTTATGTAAGGATGCAACGAGGAGCGAATCCACCAGCTGGGCTATGTGGAATAGCCATTGATGCCTCCTTTCCCATTAAGAAGAAGAAG
ATCGTGAGTAAGGAAGAGATTTTGTTCGCCATCAACAGCGTTCGGTTCGCTATTCTGTACCGGCGTAGGTTTCTTCGTCTTGCACTGGGTCTGTACGCCATGTAG
Protein sequenceShow/hide protein sequence
MGLNKFILVPLVLIPLILGLCESFEFEGKELESKESLWKLYRKWSIHHNISRTPMERFNRFKVFEENAKHVHKVNQMNKSYKLKLNRFGDMSNFEFASLYANSNINYYRN
LKKTRKAGGGGGGKGFMYEQVLNLPPSIDWRTMGAVTGVKSQDLGAIGCAIGAVEGINQIRTKQLVPLSEQELVDCDQMDGGCGGGYMETAFDFIRQNGGITTEANYPYN
AKQGYCTSSSSRMNLVTIDGYQNVPPYNEDALMQAVANQPVSGVFNGNCGVVLNHAVVAVGYGPDYWIMKNSWGAGWGEGGYVRMQRGANPPAGLCGIAIDASFPIKKKK
IVSKEEILFAINSVRFAILYRRRFLRLALGLYAM