; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004675 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004675
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCysteine Protease
Genome locationchr6:6015322..6016626
RNA-Seq ExpressionLag0004675
SyntenyLag0004675
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142960.1 vignain [Cucumis sativus]8.8e-13167.92Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI KFL V + LIVL SG+AE+FEFD KELA++ESLW+LYERW K+H+ISR+ +EKH+RF+VF EN  +VF VNQM+KPYKL+LNKFADMSNYEFVN +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA
        ARSNISHY K   R+  A  FMYE+  D+PSS+DWR+RGAV  +K+Q   CGSCWAFS+VAAVEGIN+IKT QLLSLSEQELLDC+  + GC GG    A
Subjt:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA

Query:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE
        F+FI++NGGI T+++YPY    G C SS  +SP V IDGYE VP N E+ALM+AVANQPVSV I + G DFQFY QGVFDGYCG  L+H +VAIGYGTTE
Subjt:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE

Query:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        DGTDYWLV+NSWGV WGE GY R+KRG+E  +G+CGIA  ASYP+K
Subjt:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

XP_008444390.1 PREDICTED: vignain-like [Cucumis melo]1.2e-13268.21Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI KFL V + LIVL SG+AE+FEFD KELA++ESLW+LYERW  +H+ISR+ +EKH+RF+VF EN  +VF VNQMNKPYKL+LNKFADMSNYEFVN +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA
        ARSNISH+ K   R+  A  FMYE+  D+PSS+DWR+RGAV  IK+Q T CGSCWAFS+VAAVE IN+IKT QLLSLSEQELLDC+  + GC GG    A
Subjt:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA

Query:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE
        F+FI++NGGI T+++YPY    G C SS  +SP V IDGYE VP N E+ALM+AVANQPVSV I + G DFQFYWQGVFDGYCG  L+H +VAIGYGTTE
Subjt:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE

Query:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        DGTDYW+V+NSWGV WGE GY R+KRG+E P+G+CGIA  ASYP+K
Subjt:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

XP_023002122.1 vignain-like [Cucurbita maxima]8.3e-12967.72Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        M I+K L V   LIVL SG+A++FEFD +ELA+  SLWKLYERWS +H+ISR+ +EKH+R+NVF ENA +V  VNQMNKPYKL+LNKFADMSNYEFVNL+
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA
        ARSNI+HY     +R+E AS FMYE+  D+PS IDWR+RGAV  IK Q   CGSCWAFSAVAAVEGINQIKT QLLSLSEQELLDC++ + GC GG    
Subjt:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA

Query:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT
        A+ FI +NGGI +++NYPY    G C SS   SP VTIDG+E VP N ENALM+AVANQPVSV+I + G DFQFYWQGVFDGYCG  L+H +V IGYGTT
Subjt:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT

Query:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        + GTDYW V+NSWGV WGE GY R+KRG+EDP+G+CGIA  ASYP+K
Subjt:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

XP_023537428.1 vignain-like [Cucurbita pepo subsp. pepo]4.8e-12968.01Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI+K L V   LIVL SG+A++FEFD +ELA+  SLWKLYERWS +H+ISR+ +EKH+R+NVF ENA +V  VNQMNKPYKL+LNKFADMSNYEFVNL+
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA
        ARSNI+HY     KR+E AS FMYE+  D+PS IDWR+RGAV+ IK Q   CGSCWAFSAVAAVEGINQIKT QLLSLSEQELLDC++ + GC GG    
Subjt:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA

Query:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT
        A+ FI +NGGI +++NYPY    G C SS   SP VTIDG+E VP N ENALM+AVANQPVSV+I + G DFQFYWQGVFDG CG  L+H +V IGYGTT
Subjt:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT

Query:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        +DGTDYW V+NSWGV WGE GY R+KRG+EDP+G+CGI   ASYP+K
Subjt:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

XP_038885064.1 LOW QUALITY PROTEIN: vignain-like [Benincasa hispida]1.1e-13368.59Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI KFL + + LIVL SG AE+FEFD KELA++ESLWKLYERW  +H+ISRD +EKH+RF VF EN  +VF VNQMNKPYKL+LNKFADMSNYEFVN +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSK----RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA
        ARSNISHY K    R+E  S FMYEE  D+PS IDWR+RGAV  IK+Q   CGSCWAFS+VAAVEGIN+I+T QLLSLSEQELLDC+  + GC GG    
Subjt:  ARSNISHYSK----RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA

Query:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT
        AF+FI +NGGI T+++YPY    G C SS  +SP V IDGYE +P N E+ALM+AVANQPVSV I + G DFQFYWQGVFDGYCG  L+H +VAIGYGTT
Subjt:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT

Query:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        EDGTDYW+V+NSWGV WGE+GY R+KRG+E P+G+CGIA  ASYP+K
Subjt:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

TrEMBL top hitse value%identityAlignment
A0A0A0LMU4 Uncharacterized protein4.3e-13167.92Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI KFL V + LIVL SG+AE+FEFD KELA++ESLW+LYERW K+H+ISR+ +EKH+RF+VF EN  +VF VNQM+KPYKL+LNKFADMSNYEFVN +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA
        ARSNISHY K   R+  A  FMYE+  D+PSS+DWR+RGAV  +K+Q   CGSCWAFS+VAAVEGIN+IKT QLLSLSEQELLDC+  + GC GG    A
Subjt:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA

Query:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE
        F+FI++NGGI T+++YPY    G C SS  +SP V IDGYE VP N E+ALM+AVANQPVSV I + G DFQFY QGVFDGYCG  L+H +VAIGYGTTE
Subjt:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE

Query:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        DGTDYWLV+NSWGV WGE GY R+KRG+E  +G+CGIA  ASYP+K
Subjt:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

A0A1S3BA70 vignain-like6.0e-13368.21Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI KFL V + LIVL SG+AE+FEFD KELA++ESLW+LYERW  +H+ISR+ +EKH+RF+VF EN  +VF VNQMNKPYKL+LNKFADMSNYEFVN +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA
        ARSNISH+ K   R+  A  FMYE+  D+PSS+DWR+RGAV  IK+Q T CGSCWAFS+VAAVE IN+IKT QLLSLSEQELLDC+  + GC GG    A
Subjt:  ARSNISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAA

Query:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE
        F+FI++NGGI T+++YPY    G C SS  +SP V IDGYE VP N E+ALM+AVANQPVSV I + G DFQFYWQGVFDGYCG  L+H +VAIGYGTTE
Subjt:  FEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTE

Query:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        DGTDYW+V+NSWGV WGE GY R+KRG+E P+G+CGIA  ASYP+K
Subjt:  DGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

A0A6J1GHN5 vignain-like2.0e-12867.44Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI+K L V   LIVL SG+A++FEFD +ELA+  SLWKLYERWS +H+ISR+ +EKH+R+NVF ENA +V  VNQMNKPYKL+LNKFADMSNYEFVN++
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA
        ARSNI+HY     KR+E AS FMYE+  D+PS IDWR+RGAV+ IK Q   CGSCWAFSAVAAVEGINQIKT QLLSLSEQELLDC++ + GC GG    
Subjt:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA

Query:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT
        A+ FI +NGGI +++NYPY    G C SS   SP VTIDG+E VP N ENALM+AVANQPVSV+I + G DFQFYWQGVFDG CG  L+H +V IGYGTT
Subjt:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT

Query:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        + GTDYW V+NSWGV WGE GY R+KRG+EDP+G+CGI   ASYP+K
Subjt:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

A0A6J1K7P4 vignain-like1.7e-12462.3Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MAI+KF+ V   LIVL SG+ E+FEFD KELA++ESLW+LYERW  +H+ISR  +EKH+RFNVF EN  +VF VNQMNKPYKL+LNKFADMSN EFV+ +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSK-RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAAFE
        ARSNISHY K   + +  FMYE+  D+PS IDWR+RGAV  IK+Q   CGSCWAFS VAAVEGINQIKT QLLSLSEQELLDC+  + GC GG    AF+
Subjt:  ARSNISHYSK-RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAAFE

Query:  FIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFY------------W---------------
        FI++NGGI T++NYPY    G C SS  +SP VTIDGYE VP N ENALM+AVANQPVSV I + G DFQFY            W               
Subjt:  FIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFY------------W---------------

Query:  ---QGVFDGYCGPWLDHQIVAIGYGTTEDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
           QGVFDGYCG  L+H +VAIGYGTTE+GTDYW+V+NSWGV WGE+GY R+KRG+E  +G+CGI   ASYP+K
Subjt:  ---QGVFDGYCGPWLDHQIVAIGYGTTEDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

A0A6J1KIL0 vignain-like4.0e-12967.72Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        M I+K L V   LIVL SG+A++FEFD +ELA+  SLWKLYERWS +H+ISR+ +EKH+R+NVF ENA +V  VNQMNKPYKL+LNKFADMSNYEFVNL+
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA
        ARSNI+HY     +R+E AS FMYE+  D+PS IDWR+RGAV  IK Q   CGSCWAFSAVAAVEGINQIKT QLLSLSEQELLDC++ + GC GG    
Subjt:  ARSNISHY----SKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASA

Query:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT
        A+ FI +NGGI +++NYPY    G C SS   SP VTIDG+E VP N ENALM+AVANQPVSV+I + G DFQFYWQGVFDGYCG  L+H +V IGYGTT
Subjt:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT

Query:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        + GTDYW V+NSWGV WGE GY R+KRG+EDP+G+CGIA  ASYP+K
Subjt:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

SwissProt top hitse value%identityAlignment
O65039 Vignain2.7e-11458.6Show/hide
Query:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN
        KF+ ++++L ++ + + E+F+F  KEL S+ESLW LYERW  +H++SR   EK +RFNVF  NA +V   N+M+KPYKL+LNKFADM+N+EF N ++ S 
Subjt:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN

Query:  ISHYSKRK---ESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF
        + H+   +        FMYE+   +P+S+DWRK+GAVT +K Q   CGSCWAFS + AVEGINQIKT +L+SLSEQEL+DCD+  N GC GG    AFEF
Subjt:  ISHYSKRK---ESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF

Query:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT
        I+Q GGITT++NYPY A +G C  S  N+P V+IDG+E VP NDENAL+KAVANQPVSV I + GSDFQFY +GVF G CG  LDH +  +GYGTT DGT
Subjt:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT

Query:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
         YW VKNSWG EWGEKGY R++RGI D +G+CGIA  ASYP+K
Subjt:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

P12412 Vignain8.6e-11357.47Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MA+ K L+V ++L ++  G+A +F+F  K+L S+ESLW LYERW  +H++SR   EKH+RFNVF  N  +V   N+M+KPYKL+LNKFADM+N+EF + +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSKRKES---ASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASA
        A S ++H+   + S   +  FMYE+   +P+S+DWRK+GAVT +K Q   CGSCWAFS + AVEGINQIKT +L+SLSEQEL+DCD  +N GC GG   +
Subjt:  ARSNISHYSKRKES---ASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASA

Query:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT
        AFEFI+Q GGITT+SNYPY A+EG C  S  N   V+IDG+E VP NDENAL+KAVANQPVSV I + GSDFQFY +GVF G C   L+H +  +GYGTT
Subjt:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT

Query:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS
         DGT+YW+V+NSWG EWGE+GY R++R I   +G+CGIA  ASYP+K+
Subjt:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS

P25803 Vignain8.9e-11057.18Show/hide
Query:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF
        MA  K L+V V    L  G+A +F+F  K+LAS+ESLW LYERW  +H++SR   EKH+RFNVF  N  +V   N+M+KPYKL+LNKFADM+N+EF + +
Subjt:  MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISH---YSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASA
        A S ++H   +         FMYE+   +P S+DWRK+GAVT +K Q   CGSCWAFS V AVEGINQIKT +L++LSEQEL+DCD  +N GC GG   +
Subjt:  ARSNISH---YSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASA

Query:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT
        AFEFI+Q GGITT+SNYPY A+EG C +S  N   V+IDG+E VP NDE+AL+KAVANQPVSV I + GSDFQFY +GVF G C   L+H +  +GYGTT
Subjt:  AFEFIEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTT

Query:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS
         DGT+YW+V+NSWG EWGE GY R++R I   +G+CGIA   SYP+K+
Subjt:  EDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS

P43156 Thiol protease SEN1021.7e-10554.94Show/hide
Query:  FVSVALIVL-FSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQ-MNKPYKLRLNKFADMSNYEFVNLFARSNI
        F+++AL+ L F  +A++  F  K+LAS++SLW LYE+W  +H+++RD  EK++RFNVF EN +++   NQ  + PYKL LNKF DM+N EF + +A S I
Subjt:  FVSVALIVL-FSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQ-MNKPYKLRLNKFADMSNYEFVNLFARSNI

Query:  SHYSKRK---ESASPFMYEETRDIP-SSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCD-SWDNGCGGGDASAAFEF
         H+  ++   ++   FMYE    +P +SIDWR +GAVT +K Q   CGSCWAFS +A+VEGINQIKT +L+SLSEQEL+DCD S++ GC GG    AFEF
Subjt:  SHYSKRK---ESASPFMYEETRDIP-SSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCD-SWDNGCGGGDASAAFEF

Query:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT
        I++N GITT+ +YPY  ++G C S++ NSP V+IDG++ VP N+ENALM+AVANQP+SV+I + G  FQFY +GVF G CG  LDH +  +GYG T DGT
Subjt:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT

Query:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS
         YW+VKNSWG EWGE GY R++RGI D +G CGIA  ASYP+K+
Subjt:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS

Q9FGR9 KDEL-tailed cysteine endopeptidase CEP14.0e-10252.33Show/hide
Query:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN
        +F+ +++ ++++     +  +F  K++ S+ SLW+LYERW  +H+++R   EK +RFNVF  N +++   N+ +K YKL+LNKF DM++ EF   +A SN
Subjt:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN

Query:  ISH---YSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF
        I H   +   K++   FMY     +P+S+DWRK GAVT +K Q   CGSCWAFS V AVEGINQI+TK+L SLSEQEL+DCD+  N GC GG    AFEF
Subjt:  ISH---YSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF

Query:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT
        I++ GG+T++  YPY A +  C ++  N+P V+IDG+E VP N E+ LMKAVANQPVSV I + GSDFQFY +GVF G CG  L+H +  +GYGTT DGT
Subjt:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT

Query:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS
         YW+VKNSWG EWGEKGY R++RGI   +G+CGIA  ASYP+K+
Subjt:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS

Arabidopsis top hitse value%identityAlignment
AT1G20850.1 xylem cysteine peptidase 21.4e-8147.52Show/hide
Query:  FDGKELASKESLWKLYERW-SKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSNISHYSKRKE-SASPFMYEET
        +  ++L S + L +L+E W S +        EK  RF VF +N +++   N+  K Y L LN+FAD+S+ EF  ++         + +E S + F Y + 
Subjt:  FDGKELASKESLWKLYERW-SKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSNISHYSKRKE-SASPFMYEET

Query:  RDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCD-SWDNGCGGGDASAAFEFIEQNGGITTDSNYPYVAREGYC
          +P S+DWRK+GAV ++K Q + CGSCWAFS VAAVEGIN+I T  L +LSEQEL+DCD +++NGC GG    AFE+I +NGG+  + +YPY   EG C
Subjt:  RDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCD-SWDNGCGGGDASAAFEFIEQNGGITTDSNYPYVAREGYC

Query:  MSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGTDYWLVKNSWGVEWGEKGYGRLK
              S  VTI+G++ VP NDE +L+KA+A+QP+SV I + G +FQFY  GVFDG CG  LDH + A+GYG+++ G+DY +VKNSWG +WGEKGY RLK
Subjt:  MSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGTDYWLVKNSWGVEWGEKGYGRLK

Query:  RGIEDPKGICGIATGASYPVKS
        R    P+G+CGI   AS+P K+
Subjt:  RGIEDPKGICGIATGASYPVKS

AT3G19390.1 Granulin repeat cysteine protease family protein1.6e-8247.11Show/hide
Query:  LFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERW-----SKYHSISRDPREKHQRFNVFNENARYVFRVNQM-NKPYKLRLNKFADMSNYEFVNLF
        L  SV LI L  G     E    E  ++    ++YERW       Y+ +     EK +RF +F +N ++V   + + N+ Y++ L +FAD++N EF  ++
Subjt:  LFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERW-----SKYHSISRDPREKHQRFNVFNENARYVFRVNQM-NKPYKLRLNKFADMSNYEFVNLF

Query:  ARSNISHYSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCD-SWDNGCGGGDASAAFE
         RS +   ++       ++Y+    +P +IDWR +GAV  +K Q + CGSCWAFSA+ AVEGINQIKT +L+SLSEQEL+DCD S+++GCGGG    AF+
Subjt:  ARSNISHYSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCD-SWDNGCGGGDASAAFE

Query:  FIEQNGGITTDSNYPYVARE-GYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTED
        FI +NGGI T+ +YPY+A +   C S  +N+  VTIDGYE VP NDE +L KA+ANQP+SV I + G  FQ Y  GVF G CG  LDH +VA+GYG +E 
Subjt:  FIEQNGGITTDSNYPYVARE-GYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTED

Query:  GTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS
        G DYW+V+NSWG  WGE GY +L+R I++  G CG+A  ASYP KS
Subjt:  GTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS

AT3G48340.1 Cysteine proteinases superfamily protein6.0e-10155.43Show/hide
Query:  LIVLFS----GMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSNISHY
        LI LFS      A  F++D KE+ S+E L  LY+RW  +HS+ R   E+ +RFNVF  N  +V   N+ N+ YKL+LNKFAD++  EF N +  SNI H+
Subjt:  LIVLFS----GMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSNISHY

Query:  ---SKRKESASPFMY--EETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEFIE
              K  +  FMY  E    +PSS+DWRK+GAVT+IK Q   CGSCWAFS VAAVEGIN+IKT +L+SLSEQEL+DCD+  N GC GG    AFEFI+
Subjt:  ---SKRKESASPFMY--EETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEFIE

Query:  QNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGTDY
        +NGGITT+ +YPY   +G C +S  N   VTIDG+E VP NDENAL+KAVANQPVSV I +  SDFQFY +GVF G CG  L+H + A+GYG +E G  Y
Subjt:  QNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGTDY

Query:  WLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        W+V+NSWG EWGE GY +++R I++P+G CGIA  ASYP+K
Subjt:  WLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

AT3G48350.1 Cysteine proteinases superfamily protein4.1e-10252.33Show/hide
Query:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN
        K  F+ +   +     ++ F+FD KEL ++E++WKLYERW  +HS+SR   E  +RFNVF  N  +V R N+ NKPYKL++N+FAD++++EF + +A SN
Subjt:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN

Query:  ISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF
        + H+      K  +  FMYE    +PSS+DWR++GAVT++K Q   CGSCWAFS VAAVEGIN+I+T +L+SLSEQEL+DCD+ +N GC GG    AFEF
Subjt:  ISHYSK---RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF

Query:  IEQNGGITTDSNYPYVARE-GYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDG
        I+ NGGI T+  YPY + +  +C ++      VTIDG+E+VP NDE  L+KAVA+QPVSV I +  SDFQ Y +GVF G CG  L+H +V +GYG T++G
Subjt:  IEQNGGITTDSNYPYVARE-GYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDG

Query:  TDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK
        T YW+V+NSWG EWGE GY R++RGI + +G CGIA  ASYP K
Subjt:  TDYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVK

AT5G50260.1 Cysteine proteinases superfamily protein2.9e-10352.33Show/hide
Query:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN
        +F+ +++ ++++     +  +F  K++ S+ SLW+LYERW  +H+++R   EK +RFNVF  N +++   N+ +K YKL+LNKF DM++ EF   +A SN
Subjt:  KFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSN

Query:  ISH---YSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF
        I H   +   K++   FMY     +P+S+DWRK GAVT +K Q   CGSCWAFS V AVEGINQI+TK+L SLSEQEL+DCD+  N GC GG    AFEF
Subjt:  ISH---YSKRKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDN-GCGGGDASAAFEF

Query:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT
        I++ GG+T++  YPY A +  C ++  N+P V+IDG+E VP N E+ LMKAVANQPVSV I + GSDFQFY +GVF G CG  L+H +  +GYGTT DGT
Subjt:  IEQNGGITTDSNYPYVAREGYCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGT

Query:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS
         YW+VKNSWG EWGEKGY R++RGI   +G+CGIA  ASYP+K+
Subjt:  DYWLVKNSWGVEWGEKGYGRLKRGIEDPKGICGIATGASYPVKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTACCAAATTTCTTTTCGTTTCTGTTGCTTTGATTGTTCTATTTTCAGGAATGGCTGAAACCTTTGAATTTGATGGGAAGGAATTGGCATCAAAAGAAAGTCT
ATGGAAGTTGTATGAGAGATGGAGTAAATATCATTCAATCTCAAGGGATCCAAGGGAGAAGCATCAACGTTTCAACGTGTTTAACGAGAATGCGAGATACGTATTCAGAG
TGAACCAAATGAACAAACCATACAAGCTGAGGCTAAACAAATTTGCAGATATGTCCAATTATGAGTTTGTGAACTTGTTTGCTCGATCCAATATTAGCCACTACAGCAAG
AGAAAAGAAAGTGCCAGTCCATTCATGTACGAGGAGACTAGAGATATTCCATCATCCATTGATTGGAGGAAAAGAGGAGCTGTCACTAAAATCAAGCAGCAAACCACCCA
TTGTGGTAGTTGTTGGGCGTTTTCAGCTGTGGCTGCAGTCGAAGGAATCAATCAAATCAAAACGAAACAACTATTATCTTTGTCAGAGCAGGAGCTGCTCGATTGTGATT
CATGGGACAATGGTTGTGGAGGAGGAGACGCGTCTGCTGCGTTTGAATTCATAGAGCAAAATGGTGGAATCACCACTGATAGCAACTATCCCTACGTTGCTAGAGAAGGA
TACTGTATGTCATCAATGAGGAATTCACCAAAGGTAACAATTGATGGATACGAATATGTTCCGCCAAACGACGAGAATGCTCTGATGAAAGCAGTGGCGAACCAGCCAGT
CTCAGTCACCATAGCATCGAGAGGAAGTGATTTTCAATTCTATTGGCAGGGAGTGTTCGATGGATATTGTGGACCATGGCTTGATCATCAAATAGTGGCGATTGGCTACG
GAACAACTGAAGATGGAACAGATTATTGGCTCGTGAAAAACTCATGGGGAGTTGAATGGGGAGAGAAAGGCTACGGTAGGTTGAAACGAGGAATCGAGGATCCAAAAGGA
ATATGTGGAATAGCCACGGGAGCCTCTTATCCCGTCAAGTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATTACCAAATTTCTTTTCGTTTCTGTTGCTTTGATTGTTCTATTTTCAGGAATGGCTGAAACCTTTGAATTTGATGGGAAGGAATTGGCATCAAAAGAAAGTCT
ATGGAAGTTGTATGAGAGATGGAGTAAATATCATTCAATCTCAAGGGATCCAAGGGAGAAGCATCAACGTTTCAACGTGTTTAACGAGAATGCGAGATACGTATTCAGAG
TGAACCAAATGAACAAACCATACAAGCTGAGGCTAAACAAATTTGCAGATATGTCCAATTATGAGTTTGTGAACTTGTTTGCTCGATCCAATATTAGCCACTACAGCAAG
AGAAAAGAAAGTGCCAGTCCATTCATGTACGAGGAGACTAGAGATATTCCATCATCCATTGATTGGAGGAAAAGAGGAGCTGTCACTAAAATCAAGCAGCAAACCACCCA
TTGTGGTAGTTGTTGGGCGTTTTCAGCTGTGGCTGCAGTCGAAGGAATCAATCAAATCAAAACGAAACAACTATTATCTTTGTCAGAGCAGGAGCTGCTCGATTGTGATT
CATGGGACAATGGTTGTGGAGGAGGAGACGCGTCTGCTGCGTTTGAATTCATAGAGCAAAATGGTGGAATCACCACTGATAGCAACTATCCCTACGTTGCTAGAGAAGGA
TACTGTATGTCATCAATGAGGAATTCACCAAAGGTAACAATTGATGGATACGAATATGTTCCGCCAAACGACGAGAATGCTCTGATGAAAGCAGTGGCGAACCAGCCAGT
CTCAGTCACCATAGCATCGAGAGGAAGTGATTTTCAATTCTATTGGCAGGGAGTGTTCGATGGATATTGTGGACCATGGCTTGATCATCAAATAGTGGCGATTGGCTACG
GAACAACTGAAGATGGAACAGATTATTGGCTCGTGAAAAACTCATGGGGAGTTGAATGGGGAGAGAAAGGCTACGGTAGGTTGAAACGAGGAATCGAGGATCCAAAAGGA
ATATGTGGAATAGCCACGGGAGCCTCTTATCCCGTCAAGTCTTAG
Protein sequenceShow/hide protein sequence
MAITKFLFVSVALIVLFSGMAETFEFDGKELASKESLWKLYERWSKYHSISRDPREKHQRFNVFNENARYVFRVNQMNKPYKLRLNKFADMSNYEFVNLFARSNISHYSK
RKESASPFMYEETRDIPSSIDWRKRGAVTKIKQQTTHCGSCWAFSAVAAVEGINQIKTKQLLSLSEQELLDCDSWDNGCGGGDASAAFEFIEQNGGITTDSNYPYVAREG
YCMSSMRNSPKVTIDGYEYVPPNDENALMKAVANQPVSVTIASRGSDFQFYWQGVFDGYCGPWLDHQIVAIGYGTTEDGTDYWLVKNSWGVEWGEKGYGRLKRGIEDPKG
ICGIATGASYPVKS