CuGenDBv2

Sgr029998 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029998
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: N(4)-(beta-N-acetylglucosaminyl)-L-asparaginase, putative
Genome location: tig00153554:1978991..1987288
RNA-Seq Expression: Sgr029998
Synteny: Sgr029998
Gene Ontology terms:
GO:0006517 - protein deglycosylation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016020 - membrane (cellular component)
GO:0003948 - N4-(beta-N-acetylglucosaminyl)-L-asparaginase activity (molecular function)
InterPro domains:
IPR000246 - Peptidase T2, asparaginase 2
IPR007599 - Derlin
IPR029055 - Nucleophile aminohydrolases, N-terminal
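
The genome location given above follows a `contig:start..end` pattern. The following is a minimal Python sketch for splitting such a string and computing the locus span. The names `Locus` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not confirm.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval on a named contig (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'contig:start..end' string into a Locus."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("tig00153554:1978991..1987288")
print(locus.contig, locus.span)  # tig00153554 8298
```

Under the inclusive-coordinate assumption, the locus for this gene would span 8298 bp on contig tig00153554.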


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6576819.1 putative isoaspartyl peptidase/L-asparaginase 3, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.8e-172 | identity: 88.96%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        VTGRVAGE  +YPLVVSTWPFLEAVERAWSAVNNG SAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVM+GVTMEVGAVAAMRYIKDGI+AA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHT+HTLLVGE+ASAFS+SMGLPGP +LSS ES  KW KWKENNCQPNFWKNVVPVNSCGPYHS DL+LVAE TC G D    V LRSNHFGLHSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFGHIAVGTSTNGATFKIPGRVGDGPI GSSAYAD DIGACGATGDGDIMMRFLPCYQVV+SMRLGMEPKDAAKDAISRIARKFPDF+GA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNKNGTHAGACHGWTFQYSVRSP M  AEV  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

OWM79436.1 hypothetical protein CDL15_Pgr022848 [Punica granatum] | e value: 1.7e-175 | identity: 62.11%
Query:  YPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTL
        YP+V+STWPF+EAV  AW+AV+ G SAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDA+    VTMEVGAVAAMRY+KDGIRAARLVM HT HTL
Subjt:  YPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTL

Query:  LVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDKF
        LVGE+ASAF++SMGLPGPANLSS ES  KW  WK N CQPNFWKNV+P + CGPY   D       TC   +  G +EL+S H   H+HDTISM VIDK 
Subjt:  LVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDKF

Query:  GHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHAG
        GHIAVGTSTNGATFKIPGRVGDGPI GSS+YAD ++GACGATGDGDIMMRFLPCYQVVESMRLGMEPK AAKDAISRIARK+P+FVGAVFA+N+ G HAG
Subjt:  GHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHAG

Query:  ACHGWTFQYSVRSPGMLRAEVSVISVMNETRELKVLYASVLNGRFFLLKFSKSTVGPVI---PFETT------FKVFSPNSSMLPKDKSMFINSFSSPVD
        ACHGWTFQYSV S          +S  +E          + NG   +L  +++  G  +   PF+          +F   S ++     + I S   PV 
Subjt:  ACHGWTFQYSVRSPGMLRAEVSVISVMNETRELKVLYASVLNGRFFLLKFSKSTVGPVI---PFETT------FKVFSPNSSMLPKDKSMFINSFSSPVD

Query:  PISPQRDSQTSSTVLANVYPFRVSPNLDYILVCRVCGFYYEAEYCDGCYSILLDSIHGTSLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDL
           P   S           PF                                    G SLVFMIVYIWGREFPNARIN+YGVVSL+GFYLPWA+LALDL
Subjt:  PISPQRDSQTSSTVLANVYPFRVSPNLDYILVCRVCGFYYEAEYCDGCYSILLDSIHGTSLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDL

Query:  IFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWM
        IFGDPLMPDILGMV GHLYYFL+VLHPL+GGKFILKTP W+
Subjt:  IFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWM

XP_022922742.1 probable isoaspartyl peptidase/L-asparaginase 3 [Cucurbita moschata] | e value: 6.8e-172 | identity: 88.96%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        VTGRVAGE  +YPLVVSTWPF EAVERAWSAVNNG SAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVM+GVTMEVGAVAAMRYIKDGI+AA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHT+HTLLVGE+ASAFS+SMGLPGP +LSS ES  KW KWKENNCQPNFWKNVVPVNSCGPYHS DL+LVAE TC G D    V LRSNHFGLHSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFGHIAVGTSTNGATFKIPGRVGDGPI GSSAYAD DIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDF+GA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNKNGTHAGACHGWTFQYSVRSP M  AEV  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

XP_023552142.1 probable isoaspartyl peptidase/L-asparaginase 3 [Cucurbita pepo subsp. pepo] | e value: 6.8e-172 | identity: 88.96%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        VTGRVAGE  +YPLVVSTWPFLEAVERAWSAVNNG SAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVM+GVTMEVGAVAAMRYIKDGI+AA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHT+HTLLVGE+ASAFS+SMGLPGP +LSS ES  KW KWKENNCQPNFWKNVVPVNSCGPYHS DL+LVAE TC G D    V LRSNHFG HSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFGHIAVGTSTNGATFKIPGRVGDGPI GSSAYAD DIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDF+GA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNKNGTHAGACHGWTFQYSVRSP M  AEV  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

XP_038900285.1 probable isoaspartyl peptidase/L-asparaginase 3 isoform X1 [Benincasa hispida] | e value: 3.8e-175 | identity: 89.85%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        +TGRVAGE   YPLVVSTWPFLEAVERAWSAVNNGYSAVD+VVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVM+GVTMEVGAVAAMRYIKDGI+AA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHTKHTLLVGE+ASAFS+SMGLPGP +LSSPES  KW KWKENNCQPNFWKNVVPVNSCGPYHS  LLLVAE TCPG D + AVELRSNHFGLHSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFG+IAVGTSTNGATFKIPGRVGDGPI GSSAYAD DIGACGATGDGDIMMRFLPCYQVVESMRLGM+PKDAAKDAISRIARKFPDFVGA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNKNGTHAGACHGWTFQYSVRSP M R E+  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BWS8 probable isoaspartyl peptidase/L-asparaginase 3 isoform X1 | e value: 2.6e-169 | identity: 87.76%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        VT RVAG    YPLVVSTWPFLEAVERAWSA NNGYSAVD+VVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVM+GVTMEVGAVAAMRYIKDGI+AA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHT HTLLVGE+ASAFS+SMGLPGP +LSSPES  KW KWKENNCQPNFWKNVVP NSCGPYHS  LLL AE TC G   + AVELRSNHFGLHSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFGHIAVGTSTNGATFKIPGRVGDGPI GSSAY D DIGACGATGDGDIMMRFLPCYQVVESMRLGM PKDAAKDAISRIARKFPDFVGA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNKNGTHAGACHGWTFQYSVRSP M  A+V  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

A0A218X444 Uncharacterized protein | e value: 8.3e-176 | identity: 62.11%
Query:  YPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTL
        YP+V+STWPF+EAV  AW+AV+ G SAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDA+    VTMEVGAVAAMRY+KDGIRAARLVM HT HTL
Subjt:  YPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTL

Query:  LVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDKF
        LVGE+ASAF++SMGLPGPANLSS ES  KW  WK N CQPNFWKNV+P + CGPY   D       TC   +  G +EL+S H   H+HDTISM VIDK 
Subjt:  LVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDKF

Query:  GHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHAG
        GHIAVGTSTNGATFKIPGRVGDGPI GSS+YAD ++GACGATGDGDIMMRFLPCYQVVESMRLGMEPK AAKDAISRIARK+P+FVGAVFA+N+ G HAG
Subjt:  GHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHAG

Query:  ACHGWTFQYSVRSPGMLRAEVSVISVMNETRELKVLYASVLNGRFFLLKFSKSTVGPVI---PFETT------FKVFSPNSSMLPKDKSMFINSFSSPVD
        ACHGWTFQYSV S          +S  +E          + NG   +L  +++  G  +   PF+          +F   S ++     + I S   PV 
Subjt:  ACHGWTFQYSVRSPGMLRAEVSVISVMNETRELKVLYASVLNGRFFLLKFSKSTVGPVI---PFETT------FKVFSPNSSMLPKDKSMFINSFSSPVD

Query:  PISPQRDSQTSSTVLANVYPFRVSPNLDYILVCRVCGFYYEAEYCDGCYSILLDSIHGTSLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDL
           P   S           PF                                    G SLVFMIVYIWGREFPNARIN+YGVVSL+GFYLPWA+LALDL
Subjt:  PISPQRDSQTSSTVLANVYPFRVSPNLDYILVCRVCGFYYEAEYCDGCYSILLDSIHGTSLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDL

Query:  IFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWM
        IFGDPLMPDILGMV GHLYYFL+VLHPL+GGKFILKTP W+
Subjt:  IFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWM

A0A6J1CI32 probable isoaspartyl peptidase/L-asparaginase 3 isoform X1 | e value: 6.2e-171 | identity: 88.06%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        VTGRV+GE  +YPLVVSTWPFLEAVERAW AVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVM+G+TMEVGAVAAMRYIKDGIRAA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHTKHTLLVGE+ASAFSLSMGLPGP NLSSPES  KW  WKENNCQPNFWKNVVP+NSCGPY+ +D L+V  K C G D +G VELRS+H GLHSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFGHIAVGTSTNGATFKIPGRVGDGPI GSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRI RKFPDFVGA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNK GTHAGACHGWTFQYSVR+  MLRAEV  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

A0A6J1E462 probable isoaspartyl peptidase/L-asparaginase 3 | e value: 3.3e-172 | identity: 88.96%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        VTGRVAGE  +YPLVVSTWPF EAVERAWSAVNNG SAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVM+GVTMEVGAVAAMRYIKDGI+AA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHT+HTLLVGE+ASAFS+SMGLPGP +LSS ES  KW KWKENNCQPNFWKNVVPVNSCGPYHS DL+LVAE TC G D    V LRSNHFGLHSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFGHIAVGTSTNGATFKIPGRVGDGPI GSSAYAD DIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDF+GA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNKNGTHAGACHGWTFQYSVRSP M  AEV  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

A0A6J1JDF2 probable isoaspartyl peptidase/L-asparaginase 3 | e value: 1.8e-170 | identity: 87.76%
Query:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA
        VTGRVAGE  +YPLVVSTWPFLEAVERAWSAVNNG SAVDAVV+GCSACEELRCDGTVGPGGSPDENGETTIDAMVM+GVTMEVGAVAAMRYIKDGI+AA
Subjt:  VTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAA

Query:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH
        RLVMRHT+HTLLVGE+ASAFS+SMGLPGP +LSS ES  KW KWKENNCQPNFWKNVVPVNSCGPY+S DL++VAE TC G D    V LRSNHFGLHSH
Subjt:  RLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSH

Query:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV
        DTISM VIDKFGHIAVGTSTNGATFKIPGRVGDGPI GSSAYAD DIGACGATGDGDIMMRFLPCYQVVESMRLGM+PKDAAKDAISRIARKFPDF+GA+
Subjt:  DTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAV

Query:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        FAVNKNGTHAGACHGWTF+YSVRSP M  AEV  +
Subjt:  FAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

SwissProt top hits (columns: e value, % identity, alignment)
B3N6Y7 Putative N(4)-(beta-N-acetylglucosaminyl)-L-asparaginase GG24090 | e value: 6.3e-80 | identity: 49.69%
Query:  LDTYPLVVSTWPFLEAVERAWSAVNNGYSAV----DAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVM
        ++  P+V++TW F  A   AW  +      +    +AVVEGCS CE+L+CD TVG GGSPDE GETT+DAMVM+G TM+VGAVA +R IKD I+ AR V+
Subjt:  LDTYPLVVSTWPFLEAVERAWSAVNNGYSAV----DAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVM

Query:  RHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNV--VPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDT
         HT+HT+LVG+ ASAF+ +MG     +L +PES   W++W   NCQPNFWKNV   P  SCGPY  +   L   K     +            G  +HDT
Subjt:  RHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNV--VPVNSCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDT

Query:  ISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFA
        I M  ID   +I  GTSTNGA  KIPGRVGD PI G+ AYAD ++GA  ATGDGD+MMRFLP    VE+MR G  P +AA++ + RI +   DF+GA+ A
Subjt:  ISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFA

Query:  VNKNGTHAGACHGW-TFQYSVRSP
        V++ G +A AC+G   F + V SP
Subjt:  VNKNGTHAGACHGW-TFQYSVRSP

B4NWI1 Putative N(4)-(beta-N-acetylglucosaminyl)-L-asparaginase GE19290 | e value: 1.5e-81 | identity: 47.4%
Query:  ALAGTLKQNPAPSFSYSMSSAICRVWCLNNVEFVTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAV----DAVVEGCSACEELRCDGTVGPGGS
        +LA T    P  + ++S  S    V    +      + A EL   P+V++TW F  A   AW  +      +    +AVVEGCS CE+L+CD TVG GGS
Subjt:  ALAGTLKQNPAPSFSYSMSSAICRVWCLNNVEFVTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAV----DAVVEGCSACEELRCDGTVGPGGS

Query:  PDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNV--VPVN
        PDE GETT+DAMVM+G TMEVGAVA +R IKD I+ AR V+ HT+HT+LVG+ ASAF+ +MG     +L +PES   W++W   NCQPNFWKNV   P  
Subjt:  PDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNV--VPVN

Query:  SCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMR
        SCGPY  +   L   K     +            G  +HDTI M  ID   +I  GTSTNGA  KIPGRVGD PI G+ AYAD ++GA  ATGDGD+MMR
Subjt:  SCGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMR

Query:  FLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHAGACHGW-TFQYSVRSP
        FLP    VE+MR G  P DAA++++ RI R   DF+GA+ AV++ G +  AC+G   F + V SP
Subjt:  FLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHAGACHGW-TFQYSVRSP

P20933 N(4)-(beta-N-acetylglucosaminyl)-L-asparaginase | e value: 2.6e-81 | identity: 53.29%
Query:  PLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTLL
        PLVV+TWPF  A E AW A+ +G SA+DAV  GC+ CE  +CDG+VG GGSPDE GETT+DAM+M+G TM+VGAV  +R IK+ I  AR V+ HT HTLL
Subjt:  PLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTLL

Query:  VGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNS--CGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDK
        VGE A+ F+ SMG     +LS+  S A    W   NCQPN+W+NV+P  S  CGPY             P G LK  + +         HDTI M VI K
Subjt:  VGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNS--CGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDK

Query:  FGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHA
         GHIA GTSTNG  FKI GRVGD PI G+ AYAD   GA  ATG+GDI+MRFLP YQ VE MR G +P  A +  ISRI + FP+F GAV   N  G++ 
Subjt:  FGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHA

Query:  GACH
         AC+
Subjt:  GACH

Q56W64 Probable isoaspartyl peptidase/L-asparaginase 3 | e value: 9.3e-140 | identity: 73.48%
Query:  DTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKH
        D +P+VVSTWPFLEAV  AW AV+NG SAV+AVVEGCSACEELRCDGTVGPGGSPDENGET IDA+VM+GVTMEVGAVAAMRY+KDGIRAA LVM++++H
Subjt:  DTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKH

Query:  TLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTV
        TLL GE ASAF++SMGLPGP NLSSPES  KW  WKEN CQPNF KNVVP N CGPY  ++  + +  +K+    ++ GA+E +    G H+HDTISM V
Subjt:  TLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTV

Query:  IDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNG
        ID+ GHIAVGTSTNGAT+KIPGRVGDGPIVGSSAYAD ++G CGATGDGD MMRFLPCYQVVESMR GM+P++AAKDAISRIARKFPDFVGAV AV+KNG
Subjt:  IDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNG

Query:  THAGACHGWTFQYSVRSPGMLRAEVSVI
        +HAGAC+GWTFQYSV++P M   +V  +
Subjt:  THAGACHGWTFQYSVRSPGMLRAEVSVI

Q64191 N(4)-(beta-N-acetylglucosaminyl)-L-asparaginase | e value: 1.5e-81 | identity: 51.97%
Query:  PLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTLL
        PLVV+TWPF  A E AW  + +G SA+DAV  GC+ CE+ +CDGTVG GGSPDE GETT+DAM+M+G  M+VGAV  +R IK+ I  AR V+ HT HTLL
Subjt:  PLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTLL

Query:  VGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNS--CGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDK
        VG+ A+ F+ SMG     +LS+  S      W   NCQPN+W+NV+P  S  CGPY             P G LK ++        +HSHDTI M VI K
Subjt:  VGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNS--CGPYHSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDK

Query:  FGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHA
         GH A GTSTNG  FKIPGRVGD PI G+ AYAD   GA  ATGDGD ++RFLP YQ VE MR G +P  A +  I RI + +P+F GAV   + NG++ 
Subjt:  FGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHA

Query:  GACH
         AC+
Subjt:  GACH

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G04860.1 DERLIN-2.2 | e value: 3.6e-14 | identity: 36.46%
Query:  SLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDLIFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWMLVAYWGEGIQVNSP
        SL FM+VY+W ++ P   ++  G+ +    YLPW LL   ++ G     D+LGM+AGH YYFL+ ++P    +  LKTP ++   +  E + V  P
Subjt:  SLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDLIFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWMLVAYWGEGIQVNSP

AT4G29330.1 DERLIN-1 | e value: 1.4e-34 | identity: 50%
Query:  GTSLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDLIFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWM--LVAYWGEGIQVNSP
        G SLVFM++Y+W REFPNA I++YG+V+L+ FYLPWA+LALD+IFG P+MPD+LG++AGHLYYFL+VLHPLA GK  LKTP W+  +VA W  G  V S 
Subjt:  GTSLVFMIVYIWGREFPNARINIYGVVSLRGFYLPWALLALDLIFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWM--LVAYWGEGIQVNSP

Query:  VQ------RDPSAGTSFRGRSYRLNATRTSTRERTRTRSSPSPPPQPGSNQAEGAAFSGRSYRL
         Q        P AG    G                   SS   PP     ++   AF GRSYRL
Subjt:  VQ------RDPSAGTSFRGRSYRLNATRTSTRERTRTRSSPSPPPQPGSNQAEGAAFSGRSYRL

AT5G61540.1 N-terminal nucleophile aminohydrolases (Ntn hydrolases) superfamily protein | e value: 6.6e-141 | identity: 73.48%
Query:  DTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKH
        D +P+VVSTWPFLEAV  AW AV+NG SAV+AVVEGCSACEELRCDGTVGPGGSPDENGET IDA+VM+GVTMEVGAVAAMRY+KDGIRAA LVM++++H
Subjt:  DTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDENGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKH

Query:  TLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTV
        TLL GE ASAF++SMGLPGP NLSSPES  KW  WKEN CQPNF KNVVP N CGPY  ++  + +  +K+    ++ GA+E +    G H+HDTISM V
Subjt:  TLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTV

Query:  IDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNG
        ID+ GHIAVGTSTNGAT+KIPGRVGDGPIVGSSAYAD ++G CGATGDGD MMRFLPCYQVVESMR GM+P++AAKDAISRIARKFPDFVGAV AV+KNG
Subjt:  IDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPKDAAKDAISRIARKFPDFVGAVFAVNKNG

Query:  THAGACHGWTFQYSVRSPGMLRAEVSVI
        +HAGAC+GWTFQYSV++P M   +V  +
Subjt:  THAGACHGWTFQYSVRSPGMLRAEVSVI

AT5G61540.2 N-terminal nucleophile aminohydrolases (Ntn hydrolases) superfamily protein | e value: 2.4e-103 | identity: 70.7%
Query:  MEVGAVAAMRYIKDGIRAARLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTC
        MEVGAVAAMRY+KDGIRAA LVM++++HTLL GE ASAF++SMGLPGP NLSSPES  KW  WKEN CQPNF KNVVP N CGPY  ++  + +  +K+ 
Subjt:  MEVGAVAAMRYIKDGIRAARLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTC

Query:  PGGDLKGAVELRSNHFGLHSHDTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPK
           ++ GA+E +    G H+HDTISM VID+ GHIAVGTSTNGAT+KIPGRVGDGPIVGSSAYAD ++G CGATGDGD MMRFLPCYQVVESMR GM+P+
Subjt:  PGGDLKGAVELRSNHFGLHSHDTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPK

Query:  DAAKDAISRIARKFPDFVGAVFAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        +AAKDAISRIARKFPDFVGAV AV+KNG+HAGAC+GWTFQYSV++P M   +V  +
Subjt:  DAAKDAISRIARKFPDFVGAVFAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI

AT5G61540.3 N-terminal nucleophile aminohydrolases (Ntn hydrolases) superfamily protein | e value: 2.4e-103 | identity: 70.7%
Query:  MEVGAVAAMRYIKDGIRAARLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTC
        MEVGAVAAMRY+KDGIRAA LVM++++HTLL GE ASAF++SMGLPGP NLSSPES  KW  WKEN CQPNF KNVVP N CGPY  ++  + +  +K+ 
Subjt:  MEVGAVAAMRYIKDGIRAARLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPY--HSKDLLLVAEKTC

Query:  PGGDLKGAVELRSNHFGLHSHDTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPK
           ++ GA+E +    G H+HDTISM VID+ GHIAVGTSTNGAT+KIPGRVGDGPIVGSSAYAD ++G CGATGDGD MMRFLPCYQVVESMR GM+P+
Subjt:  PGGDLKGAVELRSNHFGLHSHDTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGMEPK

Query:  DAAKDAISRIARKFPDFVGAVFAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
        +AAKDAISRIARKFPDFVGAV AV+KNG+HAGAC+GWTFQYSV++P M   +V  +
Subjt:  DAAKDAISRIARKFPDFVGAVFAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVI
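
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal Python sketch of how identities and positives could be tallied from one such line is shown here. The function name `midline_stats` is hypothetical, and the sketch assumes the midline has been copied with its spacing (including trailing spaces) intact.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST-style midline.

    Letters count as identities, '+' as conservative substitutions, and the
    string length is taken as the number of alignment columns in the block.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the final 35-column block shared by several hits above.
midline = "FAVNKNGTHAGACHGWTFQYSVRSP M  AEV  +"
ident, pos, cols = midline_stats(midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.2f}%), {pos}/{cols} positive")
# 29/35 identical (82.86%), 30/35 positive
```

The % identity reported for each hit is computed over the whole alignment, so the value for a single block need not match it.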


Sequences
CDS sequence
ATGTCTTTCTGCTGTGTTGCTCTTGCAGGTACTTTGAAACAGAACCCAGCACCGTCTTTTTCATATTCAATGTCATCTGCAATTTGCCGAGTTTGGTGCCTGAAC
AACGTAGAATTTGTTACTGGACGTGTAGCTGGAGAGTTAGACACGTACCCTTTGGTTGTAAGCACATGGCCCTTTTTGGAGGCTGTTGAAAGGGCATGGAGTGCT
GTTAATAATGGCTATTCAGCAGTTGATGCTGTTGTAGAAGGATGTTCCGCCTGTGAAGAACTGAGATGTGATGGTACAGTTGGACCTGGTGGAAGTCCAGATGAG
AATGGAGAAACAACCATTGATGCCATGGTCATGAATGGGGTGACTATGGAAGTTGGAGCTGTTGCTGCCATGAGGTATATTAAGGATGGTATTAGAGCTGCGAGA
TTAGTGATGAGACATACCAAACACACTTTACTTGTTGGAGAGCAGGCCTCTGCTTTCTCCCTTTCAATGGGTCTTCCTGGACCTGCAAATCTTAGCTCGCCGGAG
TCGACAGCAAAGTGGATCAAATGGAAAGAGAACAATTGCCAACCTAATTTTTGGAAAAATGTTGTGCCTGTCAATAGTTGTGGCCCTTACCACTCTAAGGACCTT
CTGCTTGTCGCTGAGAAGACATGTCCGGGAGGAGATCTGAAGGGAGCTGTTGAATTAAGATCAAATCATTTTGGTCTTCACAGTCACGATACCATATCGATGACA
GTAATCGATAAGTTTGGGCACATTGCTGTTGGAACGTCCACCAATGGAGCCACCTTCAAGATCCCAGGCAGGGTGGGTGATGGACCTATAGTAGGATCTTCAGCA
TATGCAGATGGTGATATTGGCGCATGTGGAGCAACCGGGGATGGCGACATCATGATGCGATTCCTGCCATGCTACCAAGTTGTCGAGAGTATGCGACTGGGGATG
GAGCCGAAGGATGCTGCTAAAGACGCAATCTCACGGATTGCAAGAAAGTTCCCGGACTTTGTTGGAGCTGTTTTTGCTGTCAACAAAAATGGCACTCATGCTGGT
GCTTGCCATGGATGGACATTCCAATACTCAGTCAGAAGCCCGGGAATGCTTCGCGCTGAGGTGTCTGTGATCAGTGTCATGAATGAAACACGTGAATTGAAAGTA
CTGTATGCGAGCGTATTGAATGGGAGATTTTTCCTTCTGAAGTTTTCAAAGTCCACGGTTGGACCAGTGATTCCATTTGAGACGACTTTCAAAGTCTTCTCTCCG
AATTCTTCAATGCTCCCCAAAGACAAATCCATGTTCATTAATAGCTTCAGTTCTCCAGTCGACCCAATTTCTCCACAGCGAGATTCACAGACCAGTTCCACAGTT
CTTGCAAATGTCTACCCCTTTAGAGTTAGCCCTAATTTGGACTACATCCTGGTTTGTAGGGTTTGCGGGTTCTACTATGAAGCTGAATATTGTGATGGCTGCTAT
TCCATACTGTTGGACTCCATTCATGGGACTTCTTTGGTTTTCATGATTGTCTACATCTGGGGCCGTGAGTTCCCAAATGCACGTATCAACATCTATGGGGTCGTT
TCATTGAGGGGATTTTATCTTCCTTGGGCGTTGCTGGCTCTGGATCTAATCTTTGGTGATCCCTTGATGCCAGACATTTTGGGAATGGTGGCAGGGCATCTTTAT
TACTTTTTGAGTGTTCTACATCCACTTGCTGGTGGGAAATTCATCCTCAAAACCCCTTTCTGGATGCTAGTAGCATACTGGGGTGAAGGGATACAAGTTAACTCT
CCTGTGCAGCGTGACCCTTCTGCCGGTACATCTTTTCGTGGAAGAAGCTACCGTCTCAATGCCACTCGAACGAGCACTCGGGAGCGGACACGAACACGCTCTTCT
CCCTCTCCACCACCACAGCCGGGCTCTAATCAGGCTGAAGGAGCCGCTTTCAGTGGCAGAAGTTATCGTCTTGGTAGCTAA
mRNA sequence
ATGTCTTTCTGCTGTGTTGCTCTTGCAGGTACTTTGAAACAGAACCCAGCACCGTCTTTTTCATATTCAATGTCATCTGCAATTTGCCGAGTTTGGTGCCTGAAC
AACGTAGAATTTGTTACTGGACGTGTAGCTGGAGAGTTAGACACGTACCCTTTGGTTGTAAGCACATGGCCCTTTTTGGAGGCTGTTGAAAGGGCATGGAGTGCT
GTTAATAATGGCTATTCAGCAGTTGATGCTGTTGTAGAAGGATGTTCCGCCTGTGAAGAACTGAGATGTGATGGTACAGTTGGACCTGGTGGAAGTCCAGATGAG
AATGGAGAAACAACCATTGATGCCATGGTCATGAATGGGGTGACTATGGAAGTTGGAGCTGTTGCTGCCATGAGGTATATTAAGGATGGTATTAGAGCTGCGAGA
TTAGTGATGAGACATACCAAACACACTTTACTTGTTGGAGAGCAGGCCTCTGCTTTCTCCCTTTCAATGGGTCTTCCTGGACCTGCAAATCTTAGCTCGCCGGAG
TCGACAGCAAAGTGGATCAAATGGAAAGAGAACAATTGCCAACCTAATTTTTGGAAAAATGTTGTGCCTGTCAATAGTTGTGGCCCTTACCACTCTAAGGACCTT
CTGCTTGTCGCTGAGAAGACATGTCCGGGAGGAGATCTGAAGGGAGCTGTTGAATTAAGATCAAATCATTTTGGTCTTCACAGTCACGATACCATATCGATGACA
GTAATCGATAAGTTTGGGCACATTGCTGTTGGAACGTCCACCAATGGAGCCACCTTCAAGATCCCAGGCAGGGTGGGTGATGGACCTATAGTAGGATCTTCAGCA
TATGCAGATGGTGATATTGGCGCATGTGGAGCAACCGGGGATGGCGACATCATGATGCGATTCCTGCCATGCTACCAAGTTGTCGAGAGTATGCGACTGGGGATG
GAGCCGAAGGATGCTGCTAAAGACGCAATCTCACGGATTGCAAGAAAGTTCCCGGACTTTGTTGGAGCTGTTTTTGCTGTCAACAAAAATGGCACTCATGCTGGT
GCTTGCCATGGATGGACATTCCAATACTCAGTCAGAAGCCCGGGAATGCTTCGCGCTGAGGTGTCTGTGATCAGTGTCATGAATGAAACACGTGAATTGAAAGTA
CTGTATGCGAGCGTATTGAATGGGAGATTTTTCCTTCTGAAGTTTTCAAAGTCCACGGTTGGACCAGTGATTCCATTTGAGACGACTTTCAAAGTCTTCTCTCCG
AATTCTTCAATGCTCCCCAAAGACAAATCCATGTTCATTAATAGCTTCAGTTCTCCAGTCGACCCAATTTCTCCACAGCGAGATTCACAGACCAGTTCCACAGTT
CTTGCAAATGTCTACCCCTTTAGAGTTAGCCCTAATTTGGACTACATCCTGGTTTGTAGGGTTTGCGGGTTCTACTATGAAGCTGAATATTGTGATGGCTGCTAT
TCCATACTGTTGGACTCCATTCATGGGACTTCTTTGGTTTTCATGATTGTCTACATCTGGGGCCGTGAGTTCCCAAATGCACGTATCAACATCTATGGGGTCGTT
TCATTGAGGGGATTTTATCTTCCTTGGGCGTTGCTGGCTCTGGATCTAATCTTTGGTGATCCCTTGATGCCAGACATTTTGGGAATGGTGGCAGGGCATCTTTAT
TACTTTTTGAGTGTTCTACATCCACTTGCTGGTGGGAAATTCATCCTCAAAACCCCTTTCTGGATGCTAGTAGCATACTGGGGTGAAGGGATACAAGTTAACTCT
CCTGTGCAGCGTGACCCTTCTGCCGGTACATCTTTTCGTGGAAGAAGCTACCGTCTCAATGCCACTCGAACGAGCACTCGGGAGCGGACACGAACACGCTCTTCT
CCCTCTCCACCACCACAGCCGGGCTCTAATCAGGCTGAAGGAGCCGCTTTCAGTGGCAGAAGTTATCGTCTTGGTAGCTAA
Protein sequence
MSFCCVALAGTLKQNPAPSFSYSMSSAICRVWCLNNVEFVTGRVAGELDTYPLVVSTWPFLEAVERAWSAVNNGYSAVDAVVEGCSACEELRCDGTVGPGGSPDE
NGETTIDAMVMNGVTMEVGAVAAMRYIKDGIRAARLVMRHTKHTLLVGEQASAFSLSMGLPGPANLSSPESTAKWIKWKENNCQPNFWKNVVPVNSCGPYHSKDL
LLVAEKTCPGGDLKGAVELRSNHFGLHSHDTISMTVIDKFGHIAVGTSTNGATFKIPGRVGDGPIVGSSAYADGDIGACGATGDGDIMMRFLPCYQVVESMRLGM
EPKDAAKDAISRIARKFPDFVGAVFAVNKNGTHAGACHGWTFQYSVRSPGMLRAEVSVISVMNETRELKVLYASVLNGRFFLLKFSKSTVGPVIPFETTFKVFSP
NSSMLPKDKSMFINSFSSPVDPISPQRDSQTSSTVLANVYPFRVSPNLDYILVCRVCGFYYEAEYCDGCYSILLDSIHGTSLVFMIVYIWGREFPNARINIYGVV
SLRGFYLPWALLALDLIFGDPLMPDILGMVAGHLYYFLSVLHPLAGGKFILKTPFWMLVAYWGEGIQVNSPVQRDPSAGTSFRGRSYRLNATRTSTRERTRTRSS
PSPPPQPGSNQAEGAAFSGRSYRLGS
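
As a quick consistency check between the CDS and protein sequences above, the final line of the CDS can be translated and compared with the final line of the protein. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1) applies to this gene and that the final CDS line begins in frame. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Final line of the CDS sequence listed above.
last_cds_line = (
    "CCCTCTCCACCACCACAGCCGGGCTCTAATCAGGCTGAAGGAGCCGCTTTCAGTGGCAGAAGTTATCGTCTTGGTAGCTAA"
)
print(translate(last_cds_line))  # PSPPPQPGSNQAEGAAFSGRSYRLGS*
```

The translated residues match the final line of the protein sequence, followed by the stop codon.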