; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009792 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009792
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationchr9:42413082..42414594
RNA-Seq ExpressionLag0009792
SyntenyLag0009792
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus]2.3e-12257.92Show/hide
Query:  MLDYRKPLMSPVSKFS--ILFFSIFFFLSVHIAF----GD------------GNDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNR
        ML YRKP MSP+S F     FF +FFFLS   +F    GD             ++D++E ++ +LLHRHHPQV+EK+HG+ K   V ER+KDIHEHDHNR
Subjt:  MLDYRKPLMSPVSKFS--ILFFSIFFFLSVHIAF----GD------------GNDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNR

Query:  HRFISESLNRTQTE-KKLKAERQTTME---------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANP
        HR IS+S+N+ Q E  +L+AE +   E         P ++ TPIGM+M SGAD+G+SEYFVELKVGTP Q  MLIADTGSDLTW+KCRYRRC GNCS+N 
Subjt:  HRFISESLNRTQTE-KKLKAERQTTME---------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANP

Query:  AHMIRNENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATG
         H  +NE K+RF  AF AN SSSF+ ++C    C +D A LFA+ +C  PTSPC YDY Y+GG+SA+GIFA ETLTV LTNGKEK+LH+++IGCTES  G
Subjt:  AHMIRNENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATG

Query:  RVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS---LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVM
         VF GADGV+GLGT+ YS  YKA +NANGGGFSYCLVDHL++  + SY VLG P  ++ A+TS   LP+  MT+T+L++G+ +S++YGV LIGISA+G+M
Subjt:  RVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS---LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVM

Query:  LNIP
        LNIP
Subjt:  LNIP

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo]8.2e-12058.19Show/hide
Query:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAF----GD------GNDDQEE--AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISE
        ML YRKP MSP+S F   FF + FFLS   +F    GD       NDD++E   +R +LLHRHHPQVSEKL+G+ K   +HER+KDIHEHD NRHR IS+
Subjt:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAF----GD------GNDDQEE--AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISE

Query:  SLNRTQTEK---KLKAERQTTME-------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRN
        S+N+ Q E    + +AE  T +E       P ++ TPIGMKM SGAD+G+SEYFV+LKVGTP Q  MLIADTGSDLTW+KCRYRRC GNCS N  H  +N
Subjt:  SLNRTQTEK---KLKAERQTTME-------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRN

Query:  ENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGA
        E K+RF  A  AN+SS+F+ ++C    C ++ A LFA+ +C TPTSPC YDY Y+GG+SA+GIFA ETLTV LTNGKEK+L +++IGCTE   G VF+GA
Subjt:  ENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGA

Query:  DGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS--LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        DGV+GLGT+ YS  YKA +NANGGGFSYCLVDHL++  + SY VLG P  ++ A+TS   P   M++T+L++G+ +S++YGV LIGISADG MLNIP
Subjt:  DGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS--LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

XP_022943788.1 aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata]2.9e-10955.12Show/hide
Query:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRT
        ML Y  P MSP+S   ++FF   FFLSVH+AF DG++ Q++        V+L+++HRHHP V EKL+GE + LG  +R +DIHEHDHNR R IS S+  +
Subjt:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRT

Query:  QTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRS
        +T+++L         P  S  PI +K+ SG D+G +EYFV+ +VGTPPQK +LI DTGSDLTW+KCRYRRC+GNC+A+  H  R E+K +F   F AN S
Subjt:  QTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRS

Query:  SSFQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPY
        SSF+ ITCG   C+ D   LFA+PDC+ P++PC YDY Y GG +A G+FA ET+TV LTNGKEK+LHDTLIGCTE       +G DG+LGLGT  +SF +
Subjt:  SSFQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPY

Query:  KARQNANGGGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        +A  + NGGGFSYCL+DHLS++ +TSY +LG PPA     A   P GNMTF  L +G  F++YYGVGLIGIS DGV LNIP
Subjt:  KARQNANGGGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

XP_022943789.1 aspartic proteinase NANA, chloroplast-like isoform X2 [Cucurbita moschata]5.5e-10855.23Show/hide
Query:  MSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKA
        MSP+S   ++FF   FFLSVH+AF DG++ Q++        V+L+++HRHHP V EKL+GE + LG  +R +DIHEHDHNR R IS S+  ++T+++L  
Subjt:  MSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKA

Query:  ERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC
               P  S  PI +K+ SG D+G +EYFV+ +VGTPPQK +LI DTGSDLTW+KCRYRRC+GNC+A+  H  R E+K +F   F AN SSSF+ ITC
Subjt:  ERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC

Query:  GKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANG
        G   C+ D   LFA+PDC+ P++PC YDY Y GG +A G+FA ET+TV LTNGKEK+LHDTLIGCTE       +G DG+LGLGT  +SF ++A  + NG
Subjt:  GKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANG

Query:  GGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        GGFSYCL+DHLS++ +TSY +LG PPA     A   P GNMTF  L +G  F++YYGVGLIGIS DGV LNIP
Subjt:  GGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida]1.1e-13263.61Show/hide
Query:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLK
        ML YRKP MSP+S F + F  +FFFLSV IAFGDG+ DQE  V+L+LLHRHHPQVSEKLHG+ K   +++R+KDI EHD  R++ IS SLNR + +++L+
Subjt:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLK

Query:  AERQTTME-----PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSS
         E     E     P  S TPIG+KM SG+DYG+SEYFV+LKVGTPPQ  MLIADTGSDLTW+KCRYRRC+GNCS+NP H  RNE K RF  AF AN SSS
Subjt:  AERQTTME-----PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSS

Query:  FQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKA
        F+ I C  K C +D A LF++ +C+TPTSPC YDY YSGG+SA+G+FAIETLTV LTNGKEK+LH+++IGCTES  GR+F GADGV+GLGT+ YSF YKA
Subjt:  FQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKA

Query:  RQNANGGGFSYCLVDHLSNNFSTSYLVLGPPA----VASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
         +NANGGGF+YCLVDHLS+  +TSY +LG P      A+ A++  P+GNM+FT+LF+G+ +S++YGV L+GISADGVMLNIP
Subjt:  RQNANGGGFSYCLVDHLSNNFSTSYLVLGPPA----VASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

TrEMBL top hitse value%identityAlignment
A0A0A0KG92 Peptidase A1 domain-containing protein1.1e-12257.92Show/hide
Query:  MLDYRKPLMSPVSKFS--ILFFSIFFFLSVHIAF----GD------------GNDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNR
        ML YRKP MSP+S F     FF +FFFLS   +F    GD             ++D++E ++ +LLHRHHPQV+EK+HG+ K   V ER+KDIHEHDHNR
Subjt:  MLDYRKPLMSPVSKFS--ILFFSIFFFLSVHIAF----GD------------GNDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNR

Query:  HRFISESLNRTQTE-KKLKAERQTTME---------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANP
        HR IS+S+N+ Q E  +L+AE +   E         P ++ TPIGM+M SGAD+G+SEYFVELKVGTP Q  MLIADTGSDLTW+KCRYRRC GNCS+N 
Subjt:  HRFISESLNRTQTE-KKLKAERQTTME---------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANP

Query:  AHMIRNENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATG
         H  +NE K+RF  AF AN SSSF+ ++C    C +D A LFA+ +C  PTSPC YDY Y+GG+SA+GIFA ETLTV LTNGKEK+LH+++IGCTES  G
Subjt:  AHMIRNENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATG

Query:  RVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS---LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVM
         VF GADGV+GLGT+ YS  YKA +NANGGGFSYCLVDHL++  + SY VLG P  ++ A+TS   LP+  MT+T+L++G+ +S++YGV LIGISA+G+M
Subjt:  RVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS---LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVM

Query:  LNIP
        LNIP
Subjt:  LNIP

A0A1S3C2F3 aspartic proteinase CDR14.0e-12058.19Show/hide
Query:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAF----GD------GNDDQEE--AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISE
        ML YRKP MSP+S F   FF + FFLS   +F    GD       NDD++E   +R +LLHRHHPQVSEKL+G+ K   +HER+KDIHEHD NRHR IS+
Subjt:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAF----GD------GNDDQEE--AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISE

Query:  SLNRTQTEK---KLKAERQTTME-------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRN
        S+N+ Q E    + +AE  T +E       P ++ TPIGMKM SGAD+G+SEYFV+LKVGTP Q  MLIADTGSDLTW+KCRYRRC GNCS N  H  +N
Subjt:  SLNRTQTEK---KLKAERQTTME-------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRN

Query:  ENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGA
        E K+RF  A  AN+SS+F+ ++C    C ++ A LFA+ +C TPTSPC YDY Y+GG+SA+GIFA ETLTV LTNGKEK+L +++IGCTE   G VF+GA
Subjt:  ENKERFTKAFHANRSSSFQPITCGK-KCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGA

Query:  DGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS--LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        DGV+GLGT+ YS  YKA +NANGGGFSYCLVDHL++  + SY VLG P  ++ A+TS   P   M++T+L++G+ +S++YGV LIGISADG MLNIP
Subjt:  DGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS--LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X22.7e-10855.23Show/hide
Query:  MSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKA
        MSP+S   ++FF   FFLSVH+AF DG++ Q++        V+L+++HRHHP V EKL+GE + LG  +R +DIHEHDHNR R IS S+  ++T+++L  
Subjt:  MSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKA

Query:  ERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC
               P  S  PI +K+ SG D+G +EYFV+ +VGTPPQK +LI DTGSDLTW+KCRYRRC+GNC+A+  H  R E+K +F   F AN SSSF+ ITC
Subjt:  ERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC

Query:  GKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANG
        G   C+ D   LFA+PDC+ P++PC YDY Y GG +A G+FA ET+TV LTNGKEK+LHDTLIGCTE       +G DG+LGLGT  +SF ++A  + NG
Subjt:  GKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANG

Query:  GGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        GGFSYCL+DHLS++ +TSY +LG PPA     A   P GNMTF  L +G  F++YYGVGLIGIS DGV LNIP
Subjt:  GGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X11.4e-10955.12Show/hide
Query:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRT
        ML Y  P MSP+S   ++FF   FFLSVH+AF DG++ Q++        V+L+++HRHHP V EKL+GE + LG  +R +DIHEHDHNR R IS S+  +
Subjt:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRT

Query:  QTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRS
        +T+++L         P  S  PI +K+ SG D+G +EYFV+ +VGTPPQK +LI DTGSDLTW+KCRYRRC+GNC+A+  H  R E+K +F   F AN S
Subjt:  QTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRS

Query:  SSFQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPY
        SSF+ ITCG   C+ D   LFA+PDC+ P++PC YDY Y GG +A G+FA ET+TV LTNGKEK+LHDTLIGCTE       +G DG+LGLGT  +SF +
Subjt:  SSFQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPY

Query:  KARQNANGGGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        +A  + NGGGFSYCL+DHLS++ +TSY +LG PPA     A   P GNMTF  L +G  F++YYGVGLIGIS DGV LNIP
Subjt:  KARQNANGGGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

A0A6J1FY96 aspartic proteinase NANA, chloroplast-like isoform X35.6e-10654.07Show/hide
Query:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRT
        ML Y  P MSP+S   ++FF   FFLSVH+AF DG++ Q++        V+L+++HRHHP V EKL+GE + LG  +R +DIHEHDHNR R IS S+  +
Subjt:  MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEE-------AVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRT

Query:  QTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRS
        +T+++L         P  S  PI +K+ SG D+G +EYFV+ +VGTPPQK +LI DTGSDLTW+KCRYRRC+GNC+A+  H  R E+K +F   F AN S
Subjt:  QTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRS

Query:  SSFQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPY
        SSF+ ITCG   C+ D   LFA+PDC+ P++PC YDY Y GG +A G+FA ET+TV LTNGKEK+LHDTLIGCTE       +G DG+LGLGT  +SF +
Subjt:  SSFQPITCGKK-CMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPY

Query:  KARQNANGGGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        +A  + NGGGFSYCL+DHLS++ +TSY +LG PPA     A   P GNMTF  L +G  F++YYGVGLIGIS   +ML  P
Subjt:  KARQNANGGGFSYCLVDHLSNNFSTSYLVLG-PPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

SwissProt top hitse value%identityAlignment
Q9LEW3 Aspartyl protease AED12.1e-1730.03Show/hide
Query:  LHGEAKFLGVHERL--KDIHEHDHNRHRFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTW
        +HG    L    R+   +I   D  R   I   L++    +   +E ++T  P+           SG   G+  Y V + +GTP     L+ DTGSDLTW
Subjt:  LHGEAKFLGVHERL--KDIHEHDHNRHRFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTW

Query:  VKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEK
         +C    C+G+C +      + E K      F+ + SS++Q ++C     +D     A        S C Y   Y   S  +G  A E  T  LTN    
Subjt:  VKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEK

Query:  ELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNY-Y
         L D   GC E+  G +F+G  G+LGLG    S P +     N   FSYCL    SN  ST +L  G   +         S ++ FT +    F S + Y
Subjt:  ELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNY-Y

Query:  GVGLIGISADGVMLNI-PLAFGT
        G+ +IGIS     L I P +F T
Subjt:  GVGLIGISADGVMLNI-PLAFGT

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 24.5e-2026.92Show/hide
Query:  NDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQT-EKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVE
        +D+      L LLHR             +F  V  R      H H  H  +    +R     +++  +   + +        G  + SG D G+ EYFV 
Subjt:  NDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQT-EKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVE

Query:  LKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGG
        + VG+PP+   ++ D+GSD+ WV+C+           P  +   ++   F  A    +S S+  ++CG    D       + +    +  CRY+  Y  G
Subjt:  LKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGG

Query:  SSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS
        S  +G  A+ETLT   T      + +  +GC     G +F GA G+LG+G    SF  +      GG F YCLV   ++  ST  LV G  A        
Subjt:  SSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATS

Query:  LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        LP G  ++  L       ++Y VGL G+   GV + +P
Subjt:  LPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

Q9LNJ3 Aspartyl protease family protein 29.0e-2133.85Show/hide
Query:  SPTPIGM--KMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCR-YRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGK-KCMD
        +P P G    + SG   G+ EYF  L VGTP +   ++ DTGSD+ W++C   RRC                       F   +S ++  I C    C  
Subjt:  SPTPIGM--KMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCR-YRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGK-KCMD

Query:  -DYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYC
         D AG      C T    C Y   Y  GS   G F+ ETLT R        +    +GC     G +F GA G+LGLG    SFP +     N   FSYC
Subjt:  -DYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYC

Query:  LVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADG
        LVD  S +   S +V G  AV SR A         FT L        +Y VGL+GIS  G
Subjt:  LVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADG

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.6e-1726.3Show/hide
Query:  RFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKER
        RF  E ++R+  +     + +   E  ++P      + SGA  G+ EYF  + VGTP ++  L+ DTGSD+ W++C        C+         +  ++
Subjt:  RFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKER

Query:  FTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGL
            F+   SS+++ +TC          L     C+  ++ C Y   Y  GS   G  A +T+T     G   ++++  +GC     G +F GA G+LGL
Subjt:  FTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGL

Query:  GTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
        G  + S   + +  +    FSYCLVD  S          G  +     +  L  G+ T   L   +    +Y VGL G S  G  + +P
Subjt:  GTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

Q9LTW4 Aspartic proteinase NANA, chloroplast1.7e-4837.31Show/hide
Query:  QEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVG
        ++ +VRL+L HR           +        R++D+   D  RH  IS   N T                      + M + SG DYG ++YF E++VG
Subjt:  QEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVG

Query:  TPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC-GKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSA
        TP +K  ++ DTGS+LTWV CRYR              R ++  R    F A+ S SF+ + C  + C  D   LF+L  C TP++PC YDY Y+ GS+A
Subjt:  TPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC-GKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSA

Query:  RGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPS
        +G+FA ET+TV LTNG+   L   LIGC+ S TG+ F+GADGVLGL  + +SF   A  +  G  FSYCLVDHLSN   ++YL+ G     S  +T    
Subjt:  RGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPS

Query:  GNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
           T  +L        +Y + +IGIS    ML+IP
Subjt:  GNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein1.7e-2731.11Show/hide
Query:  DIHEHDHNRHRFISESLNRTQTEKKLKAERQTTME------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGN
        D+   D  R + +    N+++ +K  K  ++ T +      P  SP  +   + SG   G+ EYF+++ VGTPP+   LI DTGSDL W++C        
Subjt:  DIHEHDHNRHRFISESLNRTQTEKKLKAERQTTME------PSSSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGN

Query:  CSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPD----CKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLT----NGKEKELH
            P +   ++N       +    S+SF+ ITC     D    L + PD    C++    C Y Y Y   S+  G FA+ET TV LT       E ++ 
Subjt:  CSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPD----CKTPTSPCRYDYGYSGGSSARGIFAIETLTVRLT----NGKEKELH

Query:  DTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFS--NYYGV
        + + GC     G +F GA G+LGLG    SF  +  Q+  G  FSYCLVD  SN   +S L+ G           L   N+ FT    G+  S   +Y +
Subjt:  DTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFS--NYYGV

Query:  GLIGISADGVMLNIP
         +  I   G  L+IP
Subjt:  GLIGISADGVMLNIP

AT3G12700.1 Eukaryotic aspartyl protease family protein1.2e-4937.31Show/hide
Query:  QEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVG
        ++ +VRL+L HR           +        R++D+   D  RH  IS   N T                      + M + SG DYG ++YF E++VG
Subjt:  QEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKAERQTTMEPSSSPTPIGMKMFSGADYGASEYFVELKVG

Query:  TPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC-GKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSA
        TP +K  ++ DTGS+LTWV CRYR              R ++  R    F A+ S SF+ + C  + C  D   LF+L  C TP++PC YDY Y+ GS+A
Subjt:  TPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITC-GKKCMDDYAGLFALPDCKTPTSPCRYDYGYSGGSSA

Query:  RGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPS
        +G+FA ET+TV LTNG+   L   LIGC+ S TG+ F+GADGVLGL  + +SF   A  +  G  FSYCLVDHLSN   ++YL+ G     S  +T    
Subjt:  RGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPS

Query:  GNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP
           T  +L        +Y + +IGIS    ML+IP
Subjt:  GNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIP

AT3G25700.1 Eukaryotic aspartyl protease family protein4.0e-3235.11Show/hide
Query:  SGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTP
        SGA  G+ +YFV+L++G PPQ  +LIADTGSDL WVKC   R   NCS +    +           F    SS+F P      C D    L   PD + P
Subjt:  SGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTP

Query:  T-------SPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGC-----TESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVD
                S C Y+YGY+ GS   G+FA ET +++ ++GKE  L     GC      +S +G  F GA+GV+GLG    SF  +  +   G  FSYCL+D
Subjt:  T-------SPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGC-----TESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVD

Query:  HLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNI
        +  +   TSYL++G                + FT L        +Y V L  +  +G  L I
Subjt:  HLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNI

AT3G25700.2 Eukaryotic aspartyl protease family protein3.4e-2337.93Show/hide
Query:  SGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTP
        SGA  G+ +YFV+L++G PPQ  +LIADTGSDL WVKC   R   NCS +    +           F    SS+F P      C D    L   PD + P
Subjt:  SGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKTP

Query:  T-------SPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADG-VLGLGTTI
                S C Y+YGY+ GS   G+FA ET +++ ++GKE  L     GC    +G+   G  G V+  GTT+
Subjt:  T-------SPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADG-VLGLGTTI

AT3G59080.1 Eukaryotic aspartyl protease family protein8.7e-2731.76Show/hide
Query:  RLKDIHEHDHNRHRFISESLNRTQTEKKLKAERQ---TTMEPSSSPTPIGM---KMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRC
        +++D+        R + ++   T ++K+ K +++   TT   SS     G     + SG   G+ EYF+++ VG+PP+   LI DTGSDL W++C    C
Subjt:  RLKDIHEHDHNRHRFISESLNRTQTEKKLKAERQ---TTMEPSSSPTPIGM---KMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRC

Query:  MGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPD----CKTPTSPCRYDYGYSGGSSARGIFAIETLTVRL-TNGKEKELH
              N A              +    S+S++ ITC     D    L + PD    CK+    C Y Y Y   S+  G FA+ET TV L TNG   EL+
Subjt:  MGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPD----CKTPTSPCRYDYGYSGGSSARGIFAIETLTVRL-TNGKEKELH

Query:  DT---LIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGE--FFSNY
        +    + GC     G +F GA G+LGLG    SF  +  Q+  G  FSYCLVD  S+   +S L+ G           L   N+ FT    G+      +
Subjt:  DT---LIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASRAATSLPSGNMTFTELFMGE--FFSNY

Query:  YGVGLIGISADGVMLNIP
        Y V +  I   G +LNIP
Subjt:  YGVGLIGISADGVMLNIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGATTACAGGAAGCCATTAATGTCGCCTGTTTCAAAATTTTCGATCCTTTTCTTCTCCATCTTCTTTTTCCTCTCCGTTCACATTGCATTCGGCGACGGCAATGA
TGACCAAGAAGAGGCGGTCAGGCTGGAACTGCTGCACCGCCACCATCCACAAGTCTCCGAGAAGCTTCACGGCGAGGCGAAGTTTCTGGGAGTCCACGAACGCTTGAAGG
ATATTCACGAACACGACCACAATCGCCACCGTTTCATCTCGGAGTCGCTGAATCGGACTCAAACCGAGAAGAAATTGAAGGCGGAGAGGCAGACGACGATGGAGCCTTCG
TCGTCGCCAACGCCGATCGGGATGAAAATGTTCTCAGGCGCCGATTATGGAGCCAGCGAGTACTTCGTGGAATTGAAAGTCGGAACGCCGCCGCAGAAGTGCATGTTGAT
CGCGGATACCGGAAGTGACCTAACGTGGGTAAAATGCAGATACCGGCGGTGCATGGGGAATTGCAGCGCCAATCCCGCTCATATGATTCGAAACGAAAATAAAGAGAGAT
TCACTAAGGCGTTTCATGCCAATAGATCTTCCTCTTTCCAGCCGATCACTTGCGGCAAGAAATGTATGGATGATTATGCCGGTCTCTTCGCCCTCCCGGATTGTAAGACC
CCTACCAGCCCCTGTCGCTATGATTACGGCTACTCAGGTGGATCAAGTGCAAGGGGAATATTCGCAATCGAGACCCTAACGGTGCGCCTAACAAACGGAAAAGAGAAGGA
ACTCCACGACACTTTAATCGGCTGCACCGAATCAGCGACTGGCAGAGTCTTCGAAGGAGCCGACGGCGTCCTTGGCTTAGGGACTACCATATACTCCTTCCCCTACAAAG
CCAGACAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTCGTCGACCACCTCAGCAACAACTTCTCCACCAGCTACCTCGTCCTCGGCCCCCCCGCCGTCGCCTCCCGT
GCCGCCACCTCCCTCCCCTCCGGCAACATGACCTTCACCGAGCTATTTATGGGAGAATTCTTCAGCAACTACTACGGCGTGGGCCTCATCGGCATCTCCGCCGACGGGGT
CATGCTCAACATCCCCCTCGCGTTTGGGACATCACCGACGGCGGCGGCACCATCGTCGACTCCGGCACCAGCCTCACCATGCTGGCGGCGCCGGCCTTCGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGATTACAGGAAGCCATTAATGTCGCCTGTTTCAAAATTTTCGATCCTTTTCTTCTCCATCTTCTTTTTCCTCTCCGTTCACATTGCATTCGGCGACGGCAATGA
TGACCAAGAAGAGGCGGTCAGGCTGGAACTGCTGCACCGCCACCATCCACAAGTCTCCGAGAAGCTTCACGGCGAGGCGAAGTTTCTGGGAGTCCACGAACGCTTGAAGG
ATATTCACGAACACGACCACAATCGCCACCGTTTCATCTCGGAGTCGCTGAATCGGACTCAAACCGAGAAGAAATTGAAGGCGGAGAGGCAGACGACGATGGAGCCTTCG
TCGTCGCCAACGCCGATCGGGATGAAAATGTTCTCAGGCGCCGATTATGGAGCCAGCGAGTACTTCGTGGAATTGAAAGTCGGAACGCCGCCGCAGAAGTGCATGTTGAT
CGCGGATACCGGAAGTGACCTAACGTGGGTAAAATGCAGATACCGGCGGTGCATGGGGAATTGCAGCGCCAATCCCGCTCATATGATTCGAAACGAAAATAAAGAGAGAT
TCACTAAGGCGTTTCATGCCAATAGATCTTCCTCTTTCCAGCCGATCACTTGCGGCAAGAAATGTATGGATGATTATGCCGGTCTCTTCGCCCTCCCGGATTGTAAGACC
CCTACCAGCCCCTGTCGCTATGATTACGGCTACTCAGGTGGATCAAGTGCAAGGGGAATATTCGCAATCGAGACCCTAACGGTGCGCCTAACAAACGGAAAAGAGAAGGA
ACTCCACGACACTTTAATCGGCTGCACCGAATCAGCGACTGGCAGAGTCTTCGAAGGAGCCGACGGCGTCCTTGGCTTAGGGACTACCATATACTCCTTCCCCTACAAAG
CCAGACAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTCGTCGACCACCTCAGCAACAACTTCTCCACCAGCTACCTCGTCCTCGGCCCCCCCGCCGTCGCCTCCCGT
GCCGCCACCTCCCTCCCCTCCGGCAACATGACCTTCACCGAGCTATTTATGGGAGAATTCTTCAGCAACTACTACGGCGTGGGCCTCATCGGCATCTCCGCCGACGGGGT
CATGCTCAACATCCCCCTCGCGTTTGGGACATCACCGACGGCGGCGGCACCATCGTCGACTCCGGCACCAGCCTCACCATGCTGGCGGCGCCGGCCTTCGACATGA
Protein sequenceShow/hide protein sequence
MLDYRKPLMSPVSKFSILFFSIFFFLSVHIAFGDGNDDQEEAVRLELLHRHHPQVSEKLHGEAKFLGVHERLKDIHEHDHNRHRFISESLNRTQTEKKLKAERQTTMEPS
SSPTPIGMKMFSGADYGASEYFVELKVGTPPQKCMLIADTGSDLTWVKCRYRRCMGNCSANPAHMIRNENKERFTKAFHANRSSSFQPITCGKKCMDDYAGLFALPDCKT
PTSPCRYDYGYSGGSSARGIFAIETLTVRLTNGKEKELHDTLIGCTESATGRVFEGADGVLGLGTTIYSFPYKARQNANGGGFSYCLVDHLSNNFSTSYLVLGPPAVASR
AATSLPSGNMTFTELFMGEFFSNYYGVGLIGISADGVMLNIPLAFGTSPTAAAPSSTPAPASPCWRRRPST