CuGenDBv2

Tan0021930 (gene) of Snake gourd v1 genome

Gene ID: Tan0021930
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protease Do-like 1, chloroplastic
Genome location: LG11:5501119..5509037
RNA-Seq Expression: Tan0021930
Synteny: Tan0021930
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0043231 - intracellular membrane-bounded organelle (cellular component)
  GO:0004252 - serine-type endopeptidase activity (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR001940 - Peptidase S1C
  IPR009003 - Peptidase S1, PA clan
  IPR036034 - PDZ superfamily
  IPR043504 - Peptidase S1, PA clan, chymotrypsin-like fold
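
The genome location above is written as a sequence name followed by a start..end range. As a minimal sketch, the snippet below splits such a string and computes the length of the span. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split 'SEQID:START..END' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


seqid, start, end = parse_location("LG11:5501119..5509037")
print(seqid, start, end, span_length(start, end))  # LG11 5501119 5509037 7919
```

Under that assumption the gene spans 7,919 bp on LG11.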


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7033121.1 Protease Do-like 1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 1.8e-193; identity: 95.16%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+F HFSPLSRPPN RPTSFFLSKSI FH+FSNPS HFRYPIF+LLGHNKK +QISA+TD KL NPSNP+ASICESLLIF+TSV LSFSLFV
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDG+IVTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVL IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVR+GKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
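
In these BLAST-style alignments, the unlabelled middle line marks each column: a letter means the residues are identical, `+` means a conservative substitution, and a space means a mismatch or gap. As a rough sketch of how such a line can be tallied, the hypothetical helper `midline_stats` below counts identities and positives for one alignment segment. The example input is only the first 29 columns of the alignment above, so its percentage is not expected to reproduce the identity reported for the full hit.

```python
def midline_stats(midline, length):
    """Count identities and positives from a BLAST-style midline.

    `length` is the number of alignment columns (the width of the
    matching Query line); it is passed in because trailing spaces in
    the midline may have been stripped.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, 100.0 * identities / length


query = "MAAAFSLASSFSHFSPLSRPPNTRPTSFF"
midline = "MAAAFSLAS+F HFSPLSRPPN RPTSFF"
identities, positives, pct = midline_stats(midline, len(query))
print(identities, positives, round(pct, 2))  # 26 27 89.66
```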

XP_022922573.1 protease Do-like 1, chloroplastic [Cucurbita moschata] (e-value: 4.9e-194; identity: 95.43%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+F HFSPLSRPPN RPTSFFLSKSI FH+FSNPS HFRYPIF+LLGHNKK +QISA+TD KLTNPSNP+ASICESLLIF+TSV LSFSLFV
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDG+IVTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVL IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVR+GKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

XP_022990583.1 protease Do-like 1, chloroplastic [Cucurbita maxima] (e-value: 2.2e-194; identity: 95.43%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+F HFSPLSRPPN RPTSFFLSKSI FH+FSNPSCHFRYPIF+LLGH KK +QISA+TD KLTNPSNP ASICESLLIF+TSV LSFSLFV
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDG+IVTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVL IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVR+GKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

XP_023515810.1 protease Do-like 1, chloroplastic [Cucurbita pepo subsp. pepo] (e-value: 7.5e-195; identity: 95.43%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+F HFSPLSRPPN RPTSFFLSKSI FH+FSNPSCHFRYPIF++LGHNKK +QISA+TD KLTNPSNP ASICESLLIF+TSV LSFSLFV
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDG+IVTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVL IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVR+GKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

XP_023550169.1 protease Do-like 1, chloroplastic [Cucurbita pepo subsp. pepo] (e-value: 5.2e-188; identity: 93.28%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+FSHFSP SRPPN  PTSFFLSKSIS H+F+NPS H RYPI +LL HNK H+QISA+T  KL  PSNP ASI ESLLIFSTS LLSFSLF+
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPA AFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGH+VTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1BXV6 protease Do-like 1, chloroplastic (e-value: 5.8e-185; identity: 92.2%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAA+SLAS+F HFSPLSRPPN R T+F L KSIS H+FSNP+CH R+PIF+LL   KK+SQISA+  PKLT PSNP ASI ESLLIFSTSVLLSFSLFV
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTT+DAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVL I+APKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIV+QLV+FGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

A0A6J1E723 protease Do-like 1, chloroplastic (e-value: 2.4e-194; identity: 95.43%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+F HFSPLSRPPN RPTSFFLSKSI FH+FSNPS HFRYPIF+LLGHNKK +QISA+TD KLTNPSNP+ASICESLLIF+TSV LSFSLFV
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDG+IVTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVL IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVR+GKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

A0A6J1FEU2 protease Do-like 1, chloroplastic (e-value: 3.1e-186; identity: 92.47%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+FSHFSP SRPPN  PT FFLSKSIS  +F+NPS H RYPI +LL HNK H+QISA+T  KL  PSNP AS  ESLLIFSTS LLSFSLF+
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPA AFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGH+VTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

A0A6J1JTP9 protease Do-like 1, chloroplastic (e-value: 1.1e-194; identity: 95.43%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+F HFSPLSRPPN RPTSFFLSKSI FH+FSNPSCHFRYPIF+LLGH KK +QISA+TD KLTNPSNP ASICESLLIF+TSV LSFSLFV
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDG+IVTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVL IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVR+GKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

A0A6J1K2S4 protease Do-like 1, chloroplastic (e-value: 5.2e-186; identity: 92.2%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MAAAFSLAS+FSHFSP SRPPN  PTSFFL KSIS  +F++PS H RYPI +LL HNK H+QISA+T  KL  PSNP ASI ESLLIFSTS LLSFSLF+
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG
        TDV+PA AFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGH+VTNYHVIRGASDLRVTLADQTTFDAKVVG
Subjt:  TDVDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVG

Query:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
        FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN
Subjt:  FDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGIN

Query:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
Subjt:  TAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

SwissProt top hits (e-value, % identity, alignment)
O22609 Protease Do-like 1, chloroplastic (e-value: 9.0e-143; identity: 76.47%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MA   S +   S    L  PP++  + F LS S S      P    RY  F +L  +K     +   D   T    P +++ +   +  TSV LSFSLF 
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TD--VDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKV
            V+ A+AFVV+TP+KLQTDELATVRLFQENTPSVVYITNLA RQDAFTLDVLEVPQGSGSGFVWDK GHIVTNYHVIRGASDLRVTLADQTTFDAKV
Subjt:  TD--VDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKV

Query:  VGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIG
        VGFDQDKDVAVL IDAPK+KLRPIPVG+SADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSG LIG
Subjt:  VGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIG

Query:  INTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        INTAIYSPSGASSGVGFSIPVDTV GIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAP +GPAGKA
Subjt:  INTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

P0C114 Probable periplasmic serine endoprotease DegP-like (e-value: 1.9e-44; identity: 45.09%)
Query:  EVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGV
        E P   GSGFV  +DG++VTN HV+       V L D T  DAK++G D   D+AVL I+APK K   +  G    + VG  V A+GNPFGL  T+T+G+
Subjt:  EVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGV

Query:  ISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAP--DQSVE
        +S   R+I +     P  D IQ DAA+N GNSGGP  D SG +IGINTAI+SPSG S G+ F+IP  T   +VDQL++ G V R  +G++  P       
Subjt:  ISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAP--DQSVE

Query:  QLGVS---GVLVLDAPANGPAGKA
         LG++   G +V     +GPA KA
Subjt:  QLGVS---GVLVLDAPANGPAGKA

Q2YMX6 Probable periplasmic serine endoprotease DegP-like (e-value: 1.9e-44; identity: 45.09%)
Query:  EVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGV
        E P   GSGFV  +DG++VTN HV+       V L D T  DAK++G D   D+AVL I+APK K   +  G    + VG  V A+GNPFGL  T+T+G+
Subjt:  EVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGV

Query:  ISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAP--DQSVE
        +S   R+I +     P  D IQ DAA+N GNSGGP  D SG +IGINTAI+SPSG S G+ F+IP  T   +VDQL++ G V R  +G++  P       
Subjt:  ISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAP--DQSVE

Query:  QLGVS---GVLVLDAPANGPAGKA
         LG++   G +V     +GPA KA
Subjt:  QLGVS---GVLVLDAPANGPAGKA

Q9LU10 Protease Do-like 8, chloroplastic (e-value: 3.8e-64; identity: 53.82%)
Query:  VRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLAD-------------QTTFDAKVVGFDQDKDVAVLS
        V+LF++NT SVV I ++  R       V+E+P+G+GSG VWD  G+IVTNYHVI  A     +  D             Q  F+ K+VG D+ KD+AVL 
Subjt:  VRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLAD-------------QTTFDAKVVGFDQDKDVAVLS

Query:  IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASS
        +DAP+  L+PI VG S  L VGQ+  AIGNPFG DHTLT GVISGL R+I S  TG  I   IQTDAAINPGNSGGPLLDS GNLIGINTAI++ +G S+
Subjt:  IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASS

Query:  GVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGV-SGVLVLDAPANGPAGKA
        GVGF+IP  TV  IV QL++F KV R  + I+ APD    QL V +G LVL  P    A KA
Subjt:  GVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGV-SGVLVLDAPANGPAGKA

Q9SEL7 Protease Do-like 5, chloroplastic (e-value: 1.7e-48; identity: 45.49%)
Query:  LLIFSTSVLLSFSLFVTD-----VDPAAAF--VVTTPRKLQTDELATVRLFQENTPSVVYITNL----AARQDAFTLDVLEVPQGSGSGFVWDKDGHIVT
        ++IF +S+ L+ SL  ++     ++ A A         +L+ +E   V LFQ+ +PSVVYI  +     +  D  T +     +G+GSGFVWDK GHIVT
Subjt:  LLIFSTSVLLSFSLFVTD-----VDPAAAF--VVTTPRKLQTDELATVRLFQENTPSVVYITNL----AARQDAFTLDVLEVPQGSGSGFVWDKDGHIVT

Query:  NYHVIR-------GASDLRVTLADQ--TTF--DAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREIS
        NYHVI        G    +V+L D   T F  + K+VG D D D+AVL I+    +L P+ +G S DL VGQ  FAIGNP+G ++TLT GV+SGL REI 
Subjt:  NYHVIR-------GASDLRVTLADQ--TTF--DAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREIS

Query:  SAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYS--PSGASSGVGFSIPVDTVSGIVDQLVRFGKVTR
        S   G+ I + IQTDA IN GNSGGPLLDS G+ IG+NTA ++   SG SSGV F+IP+DTV   V  L+ +G   R
Subjt:  SAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYS--PSGASSGVGFSIPVDTVSGIVDQLVRFGKVTR

Arabidopsis top hits (e-value, % identity, alignment)
AT3G27925.1 DegP protease 1 (e-value: 6.4e-144; identity: 76.47%)
Query:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV
        MA   S +   S    L  PP++  + F LS S S      P    RY  F +L  +K     +   D   T    P +++ +   +  TSV LSFSLF 
Subjt:  MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFV

Query:  TD--VDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKV
            V+ A+AFVV+TP+KLQTDELATVRLFQENTPSVVYITNLA RQDAFTLDVLEVPQGSGSGFVWDK GHIVTNYHVIRGASDLRVTLADQTTFDAKV
Subjt:  TD--VDPAAAFVVTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKV

Query:  VGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIG
        VGFDQDKDVAVL IDAPK+KLRPIPVG+SADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSG LIG
Subjt:  VGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIG

Query:  INTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA
        INTAIYSPSGASSGVGFSIPVDTV GIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAP +GPAGKA
Subjt:  INTAIYSPSGASSGVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKA

AT4G18370.1 DEGP protease 5 (e-value: 1.2e-49; identity: 45.49%)
Query:  LLIFSTSVLLSFSLFVTD-----VDPAAAF--VVTTPRKLQTDELATVRLFQENTPSVVYITNL----AARQDAFTLDVLEVPQGSGSGFVWDKDGHIVT
        ++IF +S+ L+ SL  ++     ++ A A         +L+ +E   V LFQ+ +PSVVYI  +     +  D  T +     +G+GSGFVWDK GHIVT
Subjt:  LLIFSTSVLLSFSLFVTD-----VDPAAAF--VVTTPRKLQTDELATVRLFQENTPSVVYITNL----AARQDAFTLDVLEVPQGSGSGFVWDKDGHIVT

Query:  NYHVIR-------GASDLRVTLADQ--TTF--DAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREIS
        NYHVI        G    +V+L D   T F  + K+VG D D D+AVL I+    +L P+ +G S DL VGQ  FAIGNP+G ++TLT GV+SGL REI 
Subjt:  NYHVIR-------GASDLRVTLADQ--TTF--DAKVVGFDQDKDVAVLSIDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREIS

Query:  SAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYS--PSGASSGVGFSIPVDTVSGIVDQLVRFGKVTR
        S   G+ I + IQTDA IN GNSGGPLLDS G+ IG+NTA ++   SG SSGV F+IP+DTV   V  L+ +G   R
Subjt:  SAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYS--PSGASSGVGFSIPVDTVSGIVDQLVRFGKVTR

AT5G27660.1 Trypsin family protein with PDZ domain (e-value: 6.2e-30; identity: 37.22%)
Query:  ITNLAARQDAFTLDVLEVPQG---------SGSGFVWDKDGHIVTNYHVIRGASDLR--------VTLADQTTFDAKVVGFDQDKDVAVLSIDAPKDKLR
        I N AAR     ++ L VPQG          GSG + D DG I+T  HV+    ++R        VTL D  TF+  VV  D   D+A++ I + K  L 
Subjt:  ITNLAARQDAFTLDVLEVPQG---------SGSGFVWDKDGHIVTNYHVIRGASDLR--------VTLADQTTFDAKVVGFDQDKDVAVLSIDAPKDKLR

Query:  PIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASSGVGFSIPVD
           +G S+ L  G  V A+G P  L +T+T G++S + R+ S    G   ++ +QTD +IN GNSGGPL++  G +IG+N        A+ G+GFS+P+D
Subjt:  PIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASSGVGFSIPVD

Query:  TVSGIVDQLVRFGKVTRPILGIK
        +VS I++   + G+V RP +G+K
Subjt:  TVSGIVDQLVRFGKVTRPILGIK

AT5G39830.1 Trypsin family protein with PDZ domain (e-value: 2.7e-65; identity: 53.82%)
Query:  VRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLAD-------------QTTFDAKVVGFDQDKDVAVLS
        V+LF++NT SVV I ++  R       V+E+P+G+GSG VWD  G+IVTNYHVI  A     +  D             Q  F+ K+VG D+ KD+AVL 
Subjt:  VRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLAD-------------QTTFDAKVVGFDQDKDVAVLS

Query:  IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASS
        +DAP+  L+PI VG S  L VGQ+  AIGNPFG DHTLT GVISGL R+I S  TG  I   IQTDAAINPGNSGGPLLDS GNLIGINTAI++ +G S+
Subjt:  IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASS

Query:  GVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGV-SGVLVLDAPANGPAGKA
        GVGF+IP  TV  IV QL++F KV R  + I+ APD    QL V +G LVL  P    A KA
Subjt:  GVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGV-SGVLVLDAPANGPAGKA

AT5G39830.2 Trypsin family protein with PDZ domain (e-value: 9.2e-58; identity: 50.76%)
Query:  VRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLAD-------------QTTFDAKVVGFDQDKDVAVLS
        V+LF++NT SVV I ++  R       V+E+P+G+GSG VWD  G+IVTNYHVI  A     +  D             Q  F+ K+VG D+ KD+AVL 
Subjt:  VRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLAD-------------QTTFDAKVVGFDQDKDVAVLS

Query:  IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASS
        +DAP+  L+PI VG S  L VGQ+  AIGNPFG DHTLT GVISGL R+I S  TG  I   IQTDAAINPGNSGGPLLDS GNLIGINTAI++      
Subjt:  IDAPKDKLRPIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASS

Query:  GVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGV-SGVLVLDAPANGPAGKA
                 TV  IV QL++F KV R  + I+ APD    QL V +G LVL  P    A KA
Subjt:  GVGFSIPVDTVSGIVDQLVRFGKVTRPILGIKFAPDQSVEQLGV-SGVLVLDAPANGPAGKA


Sequences
CDS sequence
ATGGCTGCTGCATTTTCCCTCGCTTCTAGTTTCTCCCATTTCTCCCCACTTTCTCGGCCTCCAAACACAAGACCCACTTCCTTTTTTCTCTCTAAATCCATTTCTTTTCA
CAGTTTCTCGAACCCCAGTTGCCATTTTAGATATCCCATCTTCGCTCTTCTCGGCCACAACAAGAAGCACAGTCAAATTTCTGCTTCAACTGACCCGAAACTCACCAATC
CCTCGAACCCCGTTGCTTCAATTTGCGAGTCGCTGCTTATTTTCTCCACTTCGGTGCTCTTGTCCTTTTCTCTCTTTGTTACAGATGTTGATCCGGCGGCTGCTTTTGTT
GTCACCACGCCGAGAAAATTGCAGACGGATGAGCTCGCAACAGTTCGTCTTTTCCAGGAGAATACTCCTTCTGTGGTGTATATCACAAATCTTGCTGCAAGGCAGGATGC
TTTTACTTTAGACGTGTTGGAGGTCCCACAGGGTTCGGGTTCTGGATTTGTGTGGGATAAAGATGGTCATATTGTCACTAATTACCATGTAATTCGCGGTGCTTCTGATC
TCAGAGTCACTCTAGCTGACCAAACTACTTTCGATGCAAAAGTTGTTGGATTTGACCAAGACAAGGATGTTGCTGTCTTGAGTATTGATGCACCTAAAGACAAACTAAGA
CCTATACCTGTTGGTATCTCAGCAGATTTGCTTGTTGGTCAGAAAGTTTTTGCTATTGGAAACCCTTTTGGACTTGATCATACTCTTACAACTGGGGTAATCAGTGGTCT
TCGCAGAGAAATTAGTTCTGCTGCTACTGGTAGGCCCATTCAAGATGTCATACAGACAGATGCTGCCATTAATCCAGGAAACAGTGGAGGGCCGCTTCTCGATAGTTCCG
GGAACCTTATCGGAATAAATACAGCAATATATTCTCCATCTGGGGCATCCTCAGGTGTTGGATTTTCAATCCCAGTTGACACTGTGAGCGGCATTGTTGATCAGTTGGTG
CGATTTGGGAAGGTGACAAGACCAATCTTAGGAATCAAGTTTGCTCCCGATCAGTCTGTCGAGCAGTTGGGCGTAAGTGGAGTGCTCGTCTTAGATGCTCCTGCAAATGG
TCCTGCTGGGAAAGCTGTATTCATTTCCTCTCTTTTTCTCTTTGCATTTCTGTTCTTATACCTTTAG
mRNA sequence
CACAAAACTGGTTGAATTGGATTCAGCAATCCAATTTGCGGCAAAACTGTCCAAACAGAAAAGTAATGGCTGCTGCATTTTCCCTCGCTTCTAGTTTCTCCCATTTCTCC
CCACTTTCTCGGCCTCCAAACACAAGACCCACTTCCTTTTTTCTCTCTAAATCCATTTCTTTTCACAGTTTCTCGAACCCCAGTTGCCATTTTAGATATCCCATCTTCGC
TCTTCTCGGCCACAACAAGAAGCACAGTCAAATTTCTGCTTCAACTGACCCGAAACTCACCAATCCCTCGAACCCCGTTGCTTCAATTTGCGAGTCGCTGCTTATTTTCT
CCACTTCGGTGCTCTTGTCCTTTTCTCTCTTTGTTACAGATGTTGATCCGGCGGCTGCTTTTGTTGTCACCACGCCGAGAAAATTGCAGACGGATGAGCTCGCAACAGTT
CGTCTTTTCCAGGAGAATACTCCTTCTGTGGTGTATATCACAAATCTTGCTGCAAGGCAGGATGCTTTTACTTTAGACGTGTTGGAGGTCCCACAGGGTTCGGGTTCTGG
ATTTGTGTGGGATAAAGATGGTCATATTGTCACTAATTACCATGTAATTCGCGGTGCTTCTGATCTCAGAGTCACTCTAGCTGACCAAACTACTTTCGATGCAAAAGTTG
TTGGATTTGACCAAGACAAGGATGTTGCTGTCTTGAGTATTGATGCACCTAAAGACAAACTAAGACCTATACCTGTTGGTATCTCAGCAGATTTGCTTGTTGGTCAGAAA
GTTTTTGCTATTGGAAACCCTTTTGGACTTGATCATACTCTTACAACTGGGGTAATCAGTGGTCTTCGCAGAGAAATTAGTTCTGCTGCTACTGGTAGGCCCATTCAAGA
TGTCATACAGACAGATGCTGCCATTAATCCAGGAAACAGTGGAGGGCCGCTTCTCGATAGTTCCGGGAACCTTATCGGAATAAATACAGCAATATATTCTCCATCTGGGG
CATCCTCAGGTGTTGGATTTTCAATCCCAGTTGACACTGTGAGCGGCATTGTTGATCAGTTGGTGCGATTTGGGAAGGTGACAAGACCAATCTTAGGAATCAAGTTTGCT
CCCGATCAGTCTGTCGAGCAGTTGGGCGTAAGTGGAGTGCTCGTCTTAGATGCTCCTGCAAATGGTCCTGCTGGGAAAGCTGTATTCATTTCCTCTCTTTTTCTCTTTGC
ATTTCTGTTCTTATACCTTTAGGTTTCAATATTCCATTATGGTTCCAAGAACTTGAGAACACCACCAACCTCAATACTTCAGTTCAAGAAGATCTGATCTCTTCACATTA
TGCAGGGTCTGCAACCGACTAAGAGGGATGCTTATGGTAGACTTATATTGGGAGACATAATAACGTCAGTGAACGGGAAGAAGGTCACAAATGGAAGCGATTTGTATAGA
ATTCTTGACCAGTGTAAAGTTGGTGAAAAGGTGACAGTGGAGGTGCTACGTGGTGATCATATGGAGAAAATTCCAGTAATTCTGGAGCCAAAGCCTGACGAATCTTGAGC
CAAAAACAGTTGGGAATTGTTTGTATTTTTGTACATTACATTAATTAATATTATTAGACTCTTGTAAATAGTCCATTCTGACTTCTCACCTTTTTCTTTTTTAAAAAAAT
TAATCACACTTATACTTCCAGCTCCACTCCTGGCTTTTCATTTTTCAACTGGAGAATCTAATATCAAATCTAGTATGATTGAGATAAAA
Protein sequence
MAAAFSLASSFSHFSPLSRPPNTRPTSFFLSKSISFHSFSNPSCHFRYPIFALLGHNKKHSQISASTDPKLTNPSNPVASICESLLIFSTSVLLSFSLFVTDVDPAAAFV
VTTPRKLQTDELATVRLFQENTPSVVYITNLAARQDAFTLDVLEVPQGSGSGFVWDKDGHIVTNYHVIRGASDLRVTLADQTTFDAKVVGFDQDKDVAVLSIDAPKDKLR
PIPVGISADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGPLLDSSGNLIGINTAIYSPSGASSGVGFSIPVDTVSGIVDQLV
RFGKVTRPILGIKFAPDQSVEQLGVSGVLVLDAPANGPAGKAVFISSLFLFAFLFLYL
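
The protein sequence corresponds to the CDS read codon by codon. The sketch below shows one way to check this, assuming the standard genetic code (the record does not name a translation table). The function `translate` and the table construction are illustrative, not the database's own pipeline. The two inputs are the first 18 and the last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing characters other than A, C, G, T raise KeyError.
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCTGCTGCATTTTCC"))  # MAAAFS
print(translate("TTATACCTTTAG"))        # LYL
```

The two results match the start (MAAAFS) and end (LYL) of the protein sequence; passing the full CDS as one string should reproduce the whole protein in the same way.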