; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024789 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024789
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionaspartic proteinase CDR1-like
Genome locationtig00002486:2795438..2807402
RNA-Seq ExpressionSgr024789
SyntenySgr024789
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582237.1 Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia]4.6e-10858.79Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW
        +MV T VGFTARLIH DSPLSP YDH M +TARI+   HRS  RLN LY +  ++        L   LVHEGGEYLMSF IGNP S+V+GFADTSNGLIW
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW

Query:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------
        VQC +C+S C+PEKGP  TKF  SKSFTYE+ P GS+ CNSLTGFQTCNS D+ CKYRL YEDNS TSG LS DSFSFD  DGK VDV Y          
Subjt:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------

Query:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS
                              +L + K S+                       GQ  LLYPN DAY+VKV+GISV  D+  LDGVFDVYDV  GWIIDS
Subjt:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS

Query:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GTTYSSL+TDAFDRLL KF TLP L ++K+DPRN+FELCFAANAND+E+FPDVTVH  GA+ +L
Subjt:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

KAG7018636.1 Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-10858.79Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW
        +MV T VGFTARLIH DSPLSP YDH M +TARI+   HRS  RLN LY +  ++        L   LVHEGGEYLMSF IGNP S+V+GFADTSNGLIW
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW

Query:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------
        VQC +C+S C+PEKGP  TKF  SKSFTYE+ P GS+ CNSLTGFQTCNS D+ CKYRL YEDNS TSG LS DSFSFD  DGK VDV Y          
Subjt:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------

Query:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS
                              +L + K S+                       GQ  LLYPN DAY+VKV+GISV  D+  LDGVFDVYDV  GWIIDS
Subjt:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS

Query:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GTTYSSL+TDAFDRLL KF TLP L ++K+DPRN+FELCFAANAND+E+FPDVTVH  GA+ +L
Subjt:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

XP_022137990.1 aspartic proteinase CDR1-like [Momordica charantia]7.1e-10957.84Show/hide
Query:  STQSPYMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAF--------NKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADT
        ST    ++ T VGFTA LIH DSPLSP Y+H++ DTARI+   HRS  RLN LY H             L   LVHEGGEYLMSF+IGNPPSRV+GFADT
Subjt:  STQSPYMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAF--------NKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADT

Query:  SNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---
        SNGLIWVQC +C S C+ EKG   TKF SSKS TYE  P GS+FCNSLTGFQTCNS DK CKY+LEYEDNS T+GILS DSFSFD  DGKLVDV Y    
Subjt:  SNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---

Query:  ----------------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGG
                                    +L + K S+                       GQ  LLYPNLDAY+VKV+G+S+  DELYLDGV DV+DVG 
Subjt:  ----------------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGG

Query:  GWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GWI+DSG TYSSL+TDAFD L+DK +  P LPKRKDDPRN+FE+CFA N +DLES P VTVHF GAD VL
Subjt:  GWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

XP_022979793.1 aspartic proteinase CDR1-like [Cucurbita maxima]3.5e-10859.07Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW
        +MV T VGFTARLIH DSPLSP YDH M  TA I+   HRS  RLN LY +  +K        L   LVHEGGEYLMSF IGNPPS+V+GFADTSNGLIW
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW

Query:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------
        VQC +C+SQC+PEKGP  TKF  SKSFTYE+ P GS+ CNSLTGFQTCNS D+ CKYRL YEDNS TSG LS DSFSFD  DGK VDV Y          
Subjt:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------

Query:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS
                              +L + K S+                       GQ  LLYPN DAY+VKV+GISV  D+  L+GVFDVYDV  GWIIDS
Subjt:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS

Query:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GTTYSSL+TDAFDRLL KF+TLP L ++K+DPRN+FELCFAANAND+E+FP VTVHF GA+ +L
Subjt:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

XP_023528351.1 aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]3.2e-10959.07Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW
        +MV T VGFTARLIH DSP+SP YDH M +TA+I+   HRS  RLN LY +  +K        L   LVHEGGEYLMSF IGNPPS+V+GFADTSNGLIW
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW

Query:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------
        VQC +C+SQC+PEKGP  TKF  SKSFTYE+ P GS+ CNSLTGFQTCNS D+ CKYRL YEDNS TSG LS DSFSFD  DGK VDV Y          
Subjt:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------

Query:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS
                              +L + K S+                       GQ  LLYPN DAY+VKV+GISV  D+  LDGVFDVYDV  GWIIDS
Subjt:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS

Query:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GTTYSSL+TDAFDRLL KF+TLP L ++K+DPRN+FELCFAANAND+E+FPDVTVH  GA+ +L
Subjt:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

TrEMBL top hitse value%identityAlignment
A0A0A0L7U3 Peptidase A1 domain-containing protein9.3e-10758.04Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLY------QHAFNK--CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLI
        +MV   VGFTARLIHHDSPLSP Y+HTM DTARI+   HRS  RLN LY      ++A +    L   LV+EGGEYLMSF IGNP S+V+GF DTSNGLI
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLY------QHAFNK--CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLI

Query:  WVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF--------
        WVQC NC+SQC+PEK   TTKF SSKSFTYE+ P GS+FCNSLTGFQTCNS DK CKYRL Y DN ATSGILS DSF FD  DG LVDV +         
Subjt:  WVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF--------

Query:  -----------------------ELWLFKCSF---------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWII
                               +L + K S+                        GQ  LLYPN DAY+VKV+GIS+ +DE + DGVFDVY+V  GWII
Subjt:  -----------------------ELWLFKCSF---------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWII

Query:  DSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLESFPDVTVHFYGADFVL
        D+G TYSSL+TDAFD LL KFLTL   P+RKDDP+ +FELCF   NANDLESFPDVTVHF GAD +L
Subjt:  DSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLESFPDVTVHFYGADFVL

A0A5D3CXD4 Aspartic proteinase CDR1-like3.0e-10557.18Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK----------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNG
        +MV   VGFTARLIHHDSPLSP Y+H M  TARI+   HRS  RL+ LY    NK           L   LV+EGGEYLMSF IGNPPS+V+GF DTSNG
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK----------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNG

Query:  LIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF------
        LIWVQC NC+SQC+PEK   TTKF SSKSFTYE+ P GS+FCNSLTGF+TCNS DK CKYRL Y DN ATSGILS DSF FD  DGKLVDV +       
Subjt:  LIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF------

Query:  -------------------------ELWLFKCSF---------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGW
                                 +L + K S+                        GQ  LLYPN DAY+VKV+GIS+ +DE + DGVFDVYDV  GW
Subjt:  -------------------------ELWLFKCSF---------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGW

Query:  IIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLESFPDVTVHFYGADFVL
        IID+G TYSSL+TDAFD LL KFL L   P+RK+DP+++FELCF  ANANDLESFPD TVHF GAD +L
Subjt:  IIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLESFPDVTVHFYGADFVL

A0A6J1C870 aspartic proteinase CDR1-like3.4e-10957.84Show/hide
Query:  STQSPYMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAF--------NKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADT
        ST    ++ T VGFTA LIH DSPLSP Y+H++ DTARI+   HRS  RLN LY H             L   LVHEGGEYLMSF+IGNPPSRV+GFADT
Subjt:  STQSPYMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAF--------NKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADT

Query:  SNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---
        SNGLIWVQC +C S C+ EKG   TKF SSKS TYE  P GS+FCNSLTGFQTCNS DK CKY+LEYEDNS T+GILS DSFSFD  DGKLVDV Y    
Subjt:  SNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---

Query:  ----------------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGG
                                    +L + K S+                       GQ  LLYPNLDAY+VKV+G+S+  DELYLDGV DV+DVG 
Subjt:  ----------------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGG

Query:  GWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GWI+DSG TYSSL+TDAFD L+DK +  P LPKRKDDPRN+FE+CFA N +DLES P VTVHF GAD VL
Subjt:  GWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

A0A6J1GWK9 aspartic proteinase CDR1-like1.2e-10658.24Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW
        +MV T VGFTARLIH DSPLSP Y+H M +TARI+   HRS  RLN LY +  ++        L   LVHEGGEYLMSF IGNP S+V+GFADTSNGLIW
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW

Query:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------
        VQC +C+S CD EKGP  TK   SKSFTYE+ P GS+ CNSLTGFQTCNS D+ CKYRL YEDNS TSG LS DSFSFD  DGK VDV Y          
Subjt:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------

Query:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS
                              +L + K S+                       GQ  LLYPN DAY+VKV+GISV  D+  LDGVFDVYDV  GWIIDS
Subjt:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS

Query:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GTTYSSL+TDAFDRLL KF TLP L ++K+DPRN+FELCFAANAND+E+FPDVTVH  GA+ +L
Subjt:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

A0A6J1IXB1 aspartic proteinase CDR1-like1.7e-10859.07Show/hide
Query:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW
        +MV T VGFTARLIH DSPLSP YDH M  TA I+   HRS  RLN LY +  +K        L   LVHEGGEYLMSF IGNPPS+V+GFADTSNGLIW
Subjt:  YMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNK-------CLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIW

Query:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------
        VQC +C+SQC+PEKGP  TKF  SKSFTYE+ P GS+ CNSLTGFQTCNS D+ CKYRL YEDNS TSG LS DSFSFD  DGK VDV Y          
Subjt:  VQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYF---------

Query:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS
                              +L + K S+                       GQ  LLYPN DAY+VKV+GISV  D+  L+GVFDVYDV  GWIIDS
Subjt:  ----------------------ELWLFKCSF--------------------NRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDS

Query:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        GTTYSSL+TDAFDRLL KF+TLP L ++K+DPRN+FELCFAANAND+E+FP VTVHF GA+ +L
Subjt:  GTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

SwissProt top hitse value%identityAlignment
Q3EBM5 Probable aspartic protease At2g356152.2e-2828.11Show/hide
Query:  FTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKG
        F+  LIH DSPLSP+Y+  +  T R+  A  RS  R            L+  L+   GE+ MS  IG PP +V   ADT + L WVQC  C  QC  E G
Subjt:  FTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKG

Query:  PTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQT-CNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKCSFNRRG----------
        P    F   KS TY+  P  S  C +L+  +  C+  +  CKYR  Y D S + G ++ ++ S D+  G  V    F   +F C +N  G          
Subjt:  PTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQT-CNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKCSFNRRG----------

Query:  -------------------------------QNSLLYPNLDA--------------------------YHVKVMGISVSDDEL-YLDGVFDVYDVG----
                                        N     NL                            Y++ +  ISV   ++ Y    ++  D G    
Subjt:  -------------------------------QNSLLYPNLDA--------------------------YHVKVMGISVSDDEL-YLDGVFDVYDVG----

Query:  --GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGAD
          G  IIDSGTT + L+   FD+            KR  DP+     CF + + ++   P++TVHF GAD
Subjt:  --GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGAD

Q6XBF8 Aspartic proteinase CDR13.7e-2827.79Show/hide
Query:  VGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQ---CPNCSSQC
        +GFTA LIH DSP SP Y+     + R++ A HRS  R+    +       +  L    GEYLM+  IG PP  ++  ADT + L+W Q   C +C +Q 
Subjt:  VGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQ---CPNCSSQC

Query:  DPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKC------SFNRRG
        DP   P T       S TY+ V   S  C +L    +C++ D  C Y L Y DNS T G ++ D+ +  + D + + +      +  C      +FN++G
Subjt:  DPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKC------SFNRRG

Query:  QNSL------------LYPNLDA----------------------------------------------YHVKVMGISVSDDELYLDGVFDVYDVGGGWI
           +            L  ++D                                               Y++ +  ISV   ++   G  D     G  I
Subjt:  QNSL------------LYPNLDA----------------------------------------------YHVKVMGISVSDDELYLDGVFDVYDVGGGWI

Query:  IDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        IDSGTT + L T+ +  L D  +   +  ++K DP++   LC++A   DL+  P +T+HF GAD  L
Subjt:  IDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL

Q766C2 Aspartic proteinase nepenthesin-23.3e-1626.01Show/hide
Query:  PLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSA
        P+    GEYLM+  IG P S      DT + LIW QC  C +QC  +  P    F+   S ++  +P  S +C  L   +TCN+ +  C+Y   Y D S 
Subjt:  PLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSA

Query:  TSGILSFDSFSFDNPD--------------------GKLVDVDYFELWL--------FKCSFNRRGQNS---------------------LLYPNLDA--
        T G ++ ++F+F+                         L+ + +  L L        F       G +S                     L++ +L+   
Subjt:  TSGILSFDSFSFDNPD--------------------GKLVDVDYFELWL--------FKCSFNRRGQNS---------------------LLYPNLDA--

Query:  YHVKVMGISVSDDELYL-DGVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANAN-DLESFPDVTVHFYG
        Y++ + GI+V  D L +    F + D G GG IIDSGTT + L  DA++ +   F     LP   D+  +    CF   ++      P++++ F G
Subjt:  YHVKVMGISVSDDELYL-DGVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANAN-DLESFPDVTVHFYG

Q9LNJ3 Aspartyl protease family protein 26.8e-1427.8Show/hide
Query:  FNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCP---NCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCK
        F+  +   L    GEY     +G P   V    DT + ++W+QC     C SQ DP        F   KS TY  +P  S  C  L     CN+  K C 
Subjt:  FNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCP---NCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCK

Query:  YRLEYEDNSATSGILSFDSFSF--------------DNP-------------DGKL----VDVDYFELWLFKCSFNRRGQNS------------------
        Y++ Y D S T G  S ++ +F              DN               GKL         F      C  +R   +                   
Subjt:  YRLEYEDNSATSGILSFDSFSF--------------DNP-------------DGKL----VDVDYFELWLFKCSFNRRGQNS------------------

Query:  --LLYPNLDA-YHVKVMGISVSDDEL--YLDGVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLESFP
          L  P LD  Y+V ++GISV    +      +F +  +G GG IIDSGT+ + L   A+  + D F       KR  D  + F+ CF  +N N+++  P
Subjt:  --LLYPNLDA-YHVKVMGISVSDDEL--YLDGVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLESFP

Query:  DVTVHFYGADFVL
         V +HF GAD  L
Subjt:  DVTVHFYGADFVL

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.2e-1327.08Show/hide
Query:  GEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILS
        GEY     +G P   +    DT + + W+QC  C + C  +  P    F+ + S TY+ +   +  C SL     C S    C Y++ Y D S T G L+
Subjt:  GEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILS

Query:  FDSFSFDNPDGKLVDV-----------------------------DYFELWLFK-CSFNR-RGQNSLLYPN------------------LDA-YHVKVMG
         D+ +F N  GK+ +V                             +  +   F  C  +R  G++S L  N                  +D  Y+V + G
Subjt:  FDSFSFDNPDGKLVDV-----------------------------DYFELWLFK-CSFNR-RGQNSLLYPN------------------LDA-YHVKVMG

Query:  ISVSDDELYL-DGVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYG
         SV  +++ L D +FDV   G GG I+D GT  + L+T A++ L D FL L V  K+     + F+ C+  ++      P V  HF G
Subjt:  ISVSDDELYL-DGVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYG

Arabidopsis top hitse value%identityAlignment
AT1G31450.1 Eukaryotic aspartyl protease family protein4.0e-2538.22Show/hide
Query:  TARLIHHDSPLSPLYD--HTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEK
        T  LIH DSP SPLY+  HT+ D  R+  A  RS  R     +      L+  L+  GGEY MS  IG PPS+V   ADT + L WVQC  C  QC  + 
Subjt:  TARLIHHDSPLSPLYD--HTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEK

Query:  GPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQT-CNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKCSFNRRG
         P    F   KS TY+     S  C +L+  +  C+     CKYR  Y DNS T G ++ ++ S D+  G  V    F   +F C +N  G
Subjt:  GPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQT-CNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKCSFNRRG

AT1G64830.1 Eukaryotic aspartyl protease family protein1.2e-2126.44Show/hide
Query:  FSTQSPYMVLTGV------GFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSS-FRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADT
        F+T    ++L+ V      GFT  LIH DSP SP Y+     + R++ A  RS+   L      A     +  +    GEYLM+  IG PP  +L  ADT
Subjt:  FSTQSPYMVLTGV------GFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSS-FRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADT

Query:  SNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSF------------------
         + LIW QC  C   C  +  P    F   +S TY  V   S  C +L    +C++ +  C Y + Y DNS T G ++ D+                   
Subjt:  SNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSF------------------

Query:  -------SFDNPDGKLVDV----------------DYFELWLF----------KCSFNRRG--------QNSLLYPNLDAYH-VKVMGISVSDDELYLDG
               +FD     ++ +                  F   L           K +F   G          S++  +   Y+ + +  ISV   ++    
Subjt:  -------SFDNPDGKLVDV----------------DYFELWLF----------KCSFNRRG--------QNSLLYPNLDAYH-VKVMGISVSDDELYLDG

Query:  VFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESF--PDVTVHFYGADFVL
           ++  G G  +IDSGTT + L ++ F   L+  +   +  +R  DP     LC+     D  SF  PD+TVHF G D  L
Subjt:  VFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESF--PDVTVHFYGADFVL

AT2G03200.1 Eukaryotic aspartyl protease family protein4.0e-1726.67Show/hide
Query:  LRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYED
        ++ P     GE+LM   IGNP  +     DT + LIW QC  C ++C  +  P    F   KS +Y  V   S  CN+L     CN     C+Y   Y D
Subjt:  LRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYED

Query:  NSATSGILSFDSFSFDNPD---------GKLVDVDYF------------------ELWLFKCSF------NRRGQNSLLYPNLDA---------------
         S+T G+L+ ++F+F++ +         G   + D F                  +L   K S+      +    +SL   +L +               
Subjt:  NSATSGILSFDSFSFDNPD---------GKLVDVDYF------------------ELWLFKCSF------NRRGQNSLLYPNLDA---------------

Query:  --------------YHVKVMGISVSDDELYLD-GVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLES
                      Y++++ GI+V    L ++   F++ + G GG IIDSGTT + L+  AF  L ++F +   LP   D      +LCF   +A    +
Subjt:  --------------YHVKVMGISVSDDELYLD-GVFDVYDVG-GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCF-AANANDLES

Query:  FPDVTVHFYGADFVL
         P +  HF GAD  L
Subjt:  FPDVTVHFYGADFVL

AT2G35615.1 Eukaryotic aspartyl protease family protein1.6e-2928.11Show/hide
Query:  FTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKG
        F+  LIH DSPLSP+Y+  +  T R+  A  RS  R            L+  L+   GE+ MS  IG PP +V   ADT + L WVQC  C  QC  E G
Subjt:  FTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQCPNCSSQCDPEKG

Query:  PTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQT-CNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKCSFNRRG----------
        P    F   KS TY+  P  S  C +L+  +  C+  +  CKYR  Y D S + G ++ ++ S D+  G  V    F   +F C +N  G          
Subjt:  PTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQT-CNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKCSFNRRG----------

Query:  -------------------------------QNSLLYPNLDA--------------------------YHVKVMGISVSDDEL-YLDGVFDVYDVG----
                                        N     NL                            Y++ +  ISV   ++ Y    ++  D G    
Subjt:  -------------------------------QNSLLYPNLDA--------------------------YHVKVMGISVSDDEL-YLDGVFDVYDVG----

Query:  --GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGAD
          G  IIDSGTT + L+   FD+            KR  DP+     CF + + ++   P++TVHF GAD
Subjt:  --GGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGAD

AT5G33340.1 Eukaryotic aspartyl protease family protein2.6e-2927.79Show/hide
Query:  VGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQ---CPNCSSQC
        +GFTA LIH DSP SP Y+     + R++ A HRS  R+    +       +  L    GEYLM+  IG PP  ++  ADT + L+W Q   C +C +Q 
Subjt:  VGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPSRVLGFADTSNGLIWVQ---CPNCSSQC

Query:  DPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKC------SFNRRG
        DP   P T       S TY+ V   S  C +L    +C++ D  C Y L Y DNS T G ++ D+ +  + D + + +      +  C      +FN++G
Subjt:  DPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLFKC------SFNRRG

Query:  QNSL------------LYPNLDA----------------------------------------------YHVKVMGISVSDDELYLDGVFDVYDVGGGWI
           +            L  ++D                                               Y++ +  ISV   ++   G  D     G  I
Subjt:  QNSL------------LYPNLDA----------------------------------------------YHVKVMGISVSDDELYLDGVFDVYDVGGGWI

Query:  IDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL
        IDSGTT + L T+ +  L D  +   +  ++K DP++   LC++A   DL+  P +T+HF GAD  L
Subjt:  IDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHFYGADFVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CACCGTTAGTTCTTCATCCGTTTCTTCGACTCTTTGCCCATTGGAATGCAGAAGAAGGTCATGCACCACCAGAATTTTTCTCAACTCAATCACCGTATATGGTGCTAACT
GGAGTTGGCTTCACTGCACGTTTGATTCACCATGACTCACCTTTATCACCGCTCTACGATCACACAATGAAAGACACGGCACGGATCAAGGTAGCCACTCACCGTTCCAG
TTTCAGGCTGAATTGTCTATACCAACATGCTTTCAATAAATGCCTTAGACACCCATTGGTTCACGAAGGTGGCGAGTACCTTATGAGTTTCTACATTGGAAATCCTCCAA
GTCGAGTGCTAGGGTTTGCAGACACATCTAATGGTCTCATTTGGGTGCAATGCCCAAACTGCAGTAGTCAATGTGACCCAGAAAAAGGCCCCACCACCACTAAGTTCCAC
TCCTCCAAATCCTTCACCTATGAGGTGGTGCCATGGGGCTCTGACTTTTGCAATTCCTTGACTGGCTTCCAAACCTGCAATTCTCCTGACAAATGTTGCAAATACAGATT
AGAATATGAAGATAATTCTGCGACAAGTGGAATTCTTTCATTTGATAGTTTTAGTTTTGATAACCCAGATGGGAAACTTGTGGATGTTGACTACTTTGAACTTTGGCTGT
TCAAATGTTCCTTCAACAGGAGGGGTCAAAATTCTCTGTTATATCCAAATTTAGACGCTTATCATGTGAAGGTTATGGGAATTAGTGTGAGCGATGATGAGCTCTATTTA
GATGGAGTTTTTGACGTATATGATGTCGGAGGTGGATGGATCATTGATTCAGGAACAACATACTCAAGTCTTAAAACAGATGCATTTGATAGGTTGCTAGATAAATTCCT
TACACTACCAGTTTTACCAAAGAGAAAAGATGACCCTAGAAACAAATTTGAATTGTGCTTTGCAGCAAATGCCAATGATTTAGAGTCATTTCCAGATGTTACAGTTCATT
TTTATGGTGCAGATTTTGTTCTTAAATTTCTATGTCGGGGAACTTTCAGCTGCAAAACTACTATGCTGGGTCCGACCTTGAAGCTCAAAGCTCTGCGACGCTTTCTTGAG
GACGAAGCCTTCCCGTCGGCGGCTTCAGATTCATGCACCGGCATAATCTGCGGCAAAGTGGAATCACAGAGTCGAGGACGGTTGAAGCTGCCACTTTGTTCACTTCCATT
GCACGTTCGAGTTCCAGTGGTAGCCGATAATTCCGATGGCCATGGCGGTTCTGCTGTCGGAGGCCTTAGCAGCGCCACCATCACGCTTCAGCCTTTTATTTTTTGCGCGA
GTTCGGCAGCGGGAGCTCCTCAGAGCGAGTTCCTCTTCCGTCTGTGTAGCTTCGCAATTTCACTTTCTCTTCCACTCTTCTCTTCCCTCTTCCCCACTCTACCCTTCCTC
ACTCTTCTTGTCCATCTCTATTACAATGCAACTTTCAAGAGTCCTAAAACCTTTTCTTCCATTCAATCGGTTTTCTCTTCGACATTGCGAGGCTCGTCAATGGCGGACTC
CGAGGATCTTCTCCAGCCACGGCCTGAATCCGTGCATTATTTCTGCTTCGCGATTTCACTTTCTCTTCCACTCTTCTCTTCCCTCTTCCCCACTCTGCCCGTCCTCACTC
TTCTTGTCCATCTCTATTACAATGCAACTTTCAAGAGTCCTGAAACATTTTCTTCCATTCAATCGGTTTTCTCTTTGACATTGCAAGGCTCGTCAATGGCGGACTTCGAG
GATCTTCTCCAGCCACGGCCTGAATCCGTGCATTATCTCTGCTTCGCGATTTCACTTTCTCTGCCACTCTTCTCTTCCCTCTTCCCCACTCTGCCCGTCCTCACTCTTCT
TGTCCATCTCTATTACAATGCAACTTTCAAGAGTCCTGAAACATTTTCTTCCATTCAATCGGTTTTCTCTTCGACATTGCAAAGCTCGTCAATGGCGGACTCCGAGGATC
TTCTCCAGCCACGGCCTGAATCCGTGCATTATCTCTGCTTCGCGATTTCACTTTCTCTTCCACTCTTCTCTTCCCTCTTCCCCACTCTGCCCGTCCTCACTCTTCTTGTC
CATCTCTATTACAATGCAACTTTCAAGAGTCCTGAAACATTTTCTTCCATTCAATCGGTTTTCTCTTCGACATTGCGAGGCTCGTCAATGGCGGACTCCGAGGATCTTCT
CCAGCCACGGCCTGAATCCGTGCATTATCTCTGGTAA
mRNA sequenceShow/hide mRNA sequence
CACCGTTAGTTCTTCATCCGTTTCTTCGACTCTTTGCCCATTGGAATGCAGAAGAAGGTCATGCACCACCAGAATTTTTCTCAACTCAATCACCGTATATGGTGCTAACT
GGAGTTGGCTTCACTGCACGTTTGATTCACCATGACTCACCTTTATCACCGCTCTACGATCACACAATGAAAGACACGGCACGGATCAAGGTAGCCACTCACCGTTCCAG
TTTCAGGCTGAATTGTCTATACCAACATGCTTTCAATAAATGCCTTAGACACCCATTGGTTCACGAAGGTGGCGAGTACCTTATGAGTTTCTACATTGGAAATCCTCCAA
GTCGAGTGCTAGGGTTTGCAGACACATCTAATGGTCTCATTTGGGTGCAATGCCCAAACTGCAGTAGTCAATGTGACCCAGAAAAAGGCCCCACCACCACTAAGTTCCAC
TCCTCCAAATCCTTCACCTATGAGGTGGTGCCATGGGGCTCTGACTTTTGCAATTCCTTGACTGGCTTCCAAACCTGCAATTCTCCTGACAAATGTTGCAAATACAGATT
AGAATATGAAGATAATTCTGCGACAAGTGGAATTCTTTCATTTGATAGTTTTAGTTTTGATAACCCAGATGGGAAACTTGTGGATGTTGACTACTTTGAACTTTGGCTGT
TCAAATGTTCCTTCAACAGGAGGGGTCAAAATTCTCTGTTATATCCAAATTTAGACGCTTATCATGTGAAGGTTATGGGAATTAGTGTGAGCGATGATGAGCTCTATTTA
GATGGAGTTTTTGACGTATATGATGTCGGAGGTGGATGGATCATTGATTCAGGAACAACATACTCAAGTCTTAAAACAGATGCATTTGATAGGTTGCTAGATAAATTCCT
TACACTACCAGTTTTACCAAAGAGAAAAGATGACCCTAGAAACAAATTTGAATTGTGCTTTGCAGCAAATGCCAATGATTTAGAGTCATTTCCAGATGTTACAGTTCATT
TTTATGGTGCAGATTTTGTTCTTAAATTTCTATGTCGGGGAACTTTCAGCTGCAAAACTACTATGCTGGGTCCGACCTTGAAGCTCAAAGCTCTGCGACGCTTTCTTGAG
GACGAAGCCTTCCCGTCGGCGGCTTCAGATTCATGCACCGGCATAATCTGCGGCAAAGTGGAATCACAGAGTCGAGGACGGTTGAAGCTGCCACTTTGTTCACTTCCATT
GCACGTTCGAGTTCCAGTGGTAGCCGATAATTCCGATGGCCATGGCGGTTCTGCTGTCGGAGGCCTTAGCAGCGCCACCATCACGCTTCAGCCTTTTATTTTTTGCGCGA
GTTCGGCAGCGGGAGCTCCTCAGAGCGAGTTCCTCTTCCGTCTGTGTAGCTTCGCAATTTCACTTTCTCTTCCACTCTTCTCTTCCCTCTTCCCCACTCTACCCTTCCTC
ACTCTTCTTGTCCATCTCTATTACAATGCAACTTTCAAGAGTCCTAAAACCTTTTCTTCCATTCAATCGGTTTTCTCTTCGACATTGCGAGGCTCGTCAATGGCGGACTC
CGAGGATCTTCTCCAGCCACGGCCTGAATCCGTGCATTATTTCTGCTTCGCGATTTCACTTTCTCTTCCACTCTTCTCTTCCCTCTTCCCCACTCTGCCCGTCCTCACTC
TTCTTGTCCATCTCTATTACAATGCAACTTTCAAGAGTCCTGAAACATTTTCTTCCATTCAATCGGTTTTCTCTTTGACATTGCAAGGCTCGTCAATGGCGGACTTCGAG
GATCTTCTCCAGCCACGGCCTGAATCCGTGCATTATCTCTGCTTCGCGATTTCACTTTCTCTGCCACTCTTCTCTTCCCTCTTCCCCACTCTGCCCGTCCTCACTCTTCT
TGTCCATCTCTATTACAATGCAACTTTCAAGAGTCCTGAAACATTTTCTTCCATTCAATCGGTTTTCTCTTCGACATTGCAAAGCTCGTCAATGGCGGACTCCGAGGATC
TTCTCCAGCCACGGCCTGAATCCGTGCATTATCTCTGCTTCGCGATTTCACTTTCTCTTCCACTCTTCTCTTCCCTCTTCCCCACTCTGCCCGTCCTCACTCTTCTTGTC
CATCTCTATTACAATGCAACTTTCAAGAGTCCTGAAACATTTTCTTCCATTCAATCGGTTTTCTCTTCGACATTGCGAGGCTCGTCAATGGCGGACTCCGAGGATCTTCT
CCAGCCACGGCCTGAATCCGTGCATTATCTCTGGTAA
Protein sequenceShow/hide protein sequence
PLVLHPFLRLFAHWNAEEGHAPPEFFSTQSPYMVLTGVGFTARLIHHDSPLSPLYDHTMKDTARIKVATHRSSFRLNCLYQHAFNKCLRHPLVHEGGEYLMSFYIGNPPS
RVLGFADTSNGLIWVQCPNCSSQCDPEKGPTTTKFHSSKSFTYEVVPWGSDFCNSLTGFQTCNSPDKCCKYRLEYEDNSATSGILSFDSFSFDNPDGKLVDVDYFELWLF
KCSFNRRGQNSLLYPNLDAYHVKVMGISVSDDELYLDGVFDVYDVGGGWIIDSGTTYSSLKTDAFDRLLDKFLTLPVLPKRKDDPRNKFELCFAANANDLESFPDVTVHF
YGADFVLKFLCRGTFSCKTTMLGPTLKLKALRRFLEDEAFPSAASDSCTGIICGKVESQSRGRLKLPLCSLPLHVRVPVVADNSDGHGGSAVGGLSSATITLQPFIFCAS
SAAGAPQSEFLFRLCSFAISLSLPLFSSLFPTLPFLTLLVHLYYNATFKSPKTFSSIQSVFSSTLRGSSMADSEDLLQPRPESVHYFCFAISLSLPLFSSLFPTLPVLTL
LVHLYYNATFKSPETFSSIQSVFSLTLQGSSMADFEDLLQPRPESVHYLCFAISLSLPLFSSLFPTLPVLTLLVHLYYNATFKSPETFSSIQSVFSSTLQSSSMADSEDL
LQPRPESVHYLCFAISLSLPLFSSLFPTLPVLTLLVHLYYNATFKSPETFSSIQSVFSSTLRGSSMADSEDLLQPRPESVHYLW