CuGenDBv2

MC03g0668 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0668
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protease Do-like 5, chloroplastic
Genome location: MC03:13621677..13625715
RNA-Seq Expression: MC03g0668
Synteny: MC03g0668
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0043231 - intracellular membrane-bounded organelle (cellular component)
  GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domains:
  IPR001940 - Peptidase S1C
  IPR009003 - Peptidase S1, PA clan
  IPR043504 - Peptidase S1, PA clan, chymotrypsin-like fold
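
The genome location string in the summary can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'CHROM:START..END' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span.

    Assumption: coordinates are 1-based and inclusive on both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("MC03:13621677..13625715")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 4039 bp on MC03, which covers introns as well as the transcribed exons.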


Homology
GenBank top hits (e value, % identity, alignment)
XP_008444456.1 PREDICTED: protease Do-like 5, chloroplastic [Cucumis melo] (e value: 9.69e-188; % identity: 87.87)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-L
        MAL  LGI  LPIPAPPNSS N LPFTSRRA++F+P+ALMASLLAFPLPT AALPQ+Q  + QEEDR+V+LFQ+ SPSVVYIKDLELAK PQN SEE  +
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-L

Query:  LVEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
        L+ED+NVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EA IVGFDPEYDLAVLKVEL G ELKPIV GTSRNLRVGQSC
Subjt:  LVEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
        YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDAAIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP

Query:  YSERF
        YSERF
Subjt:  YSERF

XP_022140015.1 protease Do-like 5, chloroplastic isoform X1 [Momordica charantia] (e value: 5.56e-201; % identity: 98.29)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
        MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL

Query:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
        VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
Subjt:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY

Query:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYL
        AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV  T P++
Subjt:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYL

XP_022140016.1 protease Do-like 5, chloroplastic isoform X2 [Momordica charantia] (e value: 5.96e-216; % identity: 100)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
        MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL

Query:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
        VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
Subjt:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY

Query:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
        AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Subjt:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY

Query:  SERF
        SERF
Subjt:  SERF

XP_022927238.1 protease Do-like 5, chloroplastic isoform X1 [Cucurbita moschata] (e value: 1.18e-188; % identity: 87.54)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
        MAL SLGI LLP+ +PPNSS  SLPFTSRRA+VFAP+ALMASLLAFP+P+ AALPQ+Q +VPQEEDR+V LFQ+ SPSVVYIK+LE+AKKPQN SEEA+L
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL

Query:  VED-ENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
        +ED EN KVKGTGSGF+WDKFGHIVTNYHVVSALATDNSGLQRCKVNLVD KGNGI R+AKIVGFDPEYDLAVLKVEL G ELKPIV GTSR+LRVGQSC
Subjt:  VED-ENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
        YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDAAIS+GNSGGPL+D YGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP

Query:  YSERF
        YSERF
Subjt:  YSERF

XP_038893976.1 protease Do-like 5, chloroplastic [Benincasa hispida] (e value: 7.78e-188; % identity: 87.5)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
        MAL SLGI LLPI +PPNS+ N LP TSRRA+VFAP+ALMASLLAFP+PT+AALPQ+Q  +PQEEDR+VALFQ+ SPSVVYIKDLE+AK PQN S E   
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL

Query:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
          DEN KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVD KGNGIY+EAKIVGFDPEYDLAVLKVEL G ELKPIV GTSRNLRVGQSCY
Subjt:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY

Query:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
        AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDAAIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV+RTVPYLIVYGTPY
Subjt:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY

Query:  SERF
        SERF
Subjt:  SERF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B9W5 protease Do-like 5, chloroplastic (e value: 4.69e-188; % identity: 87.87)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-L
        MAL  LGI  LPIPAPPNSS N LPFTSRRA++F+P+ALMASLLAFPLPT AALPQ+Q  + QEEDR+V+LFQ+ SPSVVYIKDLELAK PQN SEE  +
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-L

Query:  LVEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
        L+ED+NVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EA IVGFDPEYDLAVLKVEL G ELKPIV GTSRNLRVGQSC
Subjt:  LVEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
        YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDAAIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP

Query:  YSERF
        YSERF
Subjt:  YSERF

A0A5D3DAW0 Protease Do-like 5 (e value: 4.69e-188; % identity: 87.87)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-L
        MAL  LGI  LPIPAPPNSS N LPFTSRRA++F+P+ALMASLLAFPLPT AALPQ+Q  + QEEDR+V+LFQ+ SPSVVYIKDLELAK PQN SEE  +
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-L

Query:  LVEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
        L+ED+NVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EA IVGFDPEYDLAVLKVEL G ELKPIV GTSRNLRVGQSC
Subjt:  LVEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
        YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDAAIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP

Query:  YSERF
        YSERF
Subjt:  YSERF

A0A6J1CDW0 protease Do-like 5, chloroplastic isoform X1 (e value: 2.69e-201; % identity: 98.29)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
        MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL

Query:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
        VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
Subjt:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY

Query:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYL
        AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV  T P++
Subjt:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYL

A0A6J1CEE6 protease Do-like 5, chloroplastic isoform X2 (e value: 2.89e-216; % identity: 100)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
        MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL

Query:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
        VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY
Subjt:  VEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCY

Query:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
        AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Subjt:  AIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY

Query:  SERF
        SERF
Subjt:  SERF

A0A6J1EKF8 protease Do-like 5, chloroplastic isoform X1 (e value: 5.72e-189; % identity: 87.54)
Query:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL
        MAL SLGI LLP+ +PPNSS  SLPFTSRRA+VFAP+ALMASLLAFP+P+ AALPQ+Q +VPQEEDR+V LFQ+ SPSVVYIK+LE+AKKPQN SEEA+L
Subjt:  MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALL

Query:  VED-ENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
        +ED EN KVKGTGSGF+WDKFGHIVTNYHVVSALATDNSGLQRCKVNLVD KGNGI R+AKIVGFDPEYDLAVLKVEL G ELKPIV GTSR+LRVGQSC
Subjt:  VED-ENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
        YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDAAIS+GNSGGPL+D YGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP

Query:  YSERF
        YSERF
Subjt:  YSERF

SwissProt top hits (e value, % identity, alignment)
O22609 Protease Do-like 1, chloroplastic (e value: 1.6e-46; % identity: 43.21)
Query:  NSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQA----------QVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGT
        ++L FT   A+   P  L+ + +A      AA P +++          ++  +E   V LFQ+ +PSVVYI +L +        ++A  ++   V  +G+
Subjt:  NSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQA----------QVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGT

Query:  GSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKT
        GSGFVWDK GHIVTNYHV+        G    +V L D        +AK+VGFD + D+AVL+++    +L+PI +G S +L VGQ  +AIGNPFG + T
Subjt:  GSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKT

Query:  LTAGVISGLGREIPS-PNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG
        LT GVISGL REI S   GR I+  IQTDAAI+ GNSGGPL+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Subjt:  LTAGVISGLGREIPS-PNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG

Q2SL36 Probable periplasmic serine endoprotease DegP-like (e value: 2.1e-30; % identity: 44.75)
Query:  KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPF
        + + TGSGF+  K G+I+TN HVV       +G     V L+D +       AK++G D + DLAVLKVE    +L  + LG S  L+VG+   AIG+PF
Subjt:  KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPF

Query:  GYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVV
        G+E T+TAG++S  GR +P+ N       IQTD AI+ GNSGGPL +  G V+G+N+  +TR G  M  GV+FAIPID  +
Subjt:  GYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVV

Q4KGQ4 Probable periplasmic serine endoprotease DegP-like (e value: 4.4e-28; % identity: 39.46)
Query:  DENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAI
        D   + +  GSGF+    G+I+TN HV+       +      V L D        +AK++G DP  D+A+LK++  G +L  + LG S++L+ GQ   AI
Subjt:  DENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAI

Query:  GNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVV
        G+PFG++ T+T G++S +GR +P+ N       IQTD  I+ GNSGGPL +  G V+G+N+  +TR G  M  GV+FAIPID  +
Subjt:  GNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVV

Q9LU10 Protease Do-like 8, chloroplastic (e value: 2.0e-49; % identity: 45.12)
Query:  IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLV
        +H L + + P+++   L  +    L F PS  + S LA   P+ A +  +   V         E R+V LF+  + SVV I D+ L  +PQ      + +
Subjt:  IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLV

Query:  EDENVKVKGTGSGFVWDKFGHIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
         +      G GSG VWD  G+IVTNYHV+ +AL+ + S G    +VN++ + G     E K+VG D   DLAVLKV+     LKPI +G S +L+VGQ C
Subjt:  EDENVKVKGTGSGFVWDKFGHIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY
         AIGNPFG++ TLT GVISGL R+I S  G  I GGIQTDAAI+ GNSGGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY

Q9SEL7 Protease Do-like 5, chloroplastic (e value: 1.4e-98; % identity: 64.41)
Query:  SSHNSLPFTSRRALVFAPS-ALMASLLA-----FPLPTHAALPQI---QAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVK
        S+H  +    RR ++F  S AL +SLL       P+ +  AL Q    + ++ +EE+R V LFQ  SPSVVYI+ +EL K    +S   +L ++EN K++
Subjt:  SSHNSLPFTSRRALVFAPS-ALMASLLA-----FPLPTHAALPQI---QAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVK

Query:  GTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYE
        GTGSGFVWDK GHIVTNYHV++ LATD  GLQRCKV+LVDAKG    +E KIVG DP+ DLAVLK+E  G EL P+VLGTS +LRVGQSC+AIGNP+GYE
Subjt:  GTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYE

Query:  KTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF
         TLT GV+SGLGREIPSPNG++I   IQTDA I+SGNSGGPL+DSYGH IGVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Subjt:  KTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF

Arabidopsis top hits (e value, % identity, alignment)
AT3G27925.1 DegP protease 1 (e value: 1.2e-47; % identity: 43.21)
Query:  NSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQA----------QVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGT
        ++L FT   A+   P  L+ + +A      AA P +++          ++  +E   V LFQ+ +PSVVYI +L +        ++A  ++   V  +G+
Subjt:  NSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQA----------QVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGT

Query:  GSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKT
        GSGFVWDK GHIVTNYHV+        G    +V L D        +AK+VGFD + D+AVL+++    +L+PI +G S +L VGQ  +AIGNPFG + T
Subjt:  GSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKT

Query:  LTAGVISGLGREIPS-PNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG
        LT GVISGL REI S   GR I+  IQTDAAI+ GNSGGPL+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Subjt:  LTAGVISGLGREIPS-PNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG

AT4G18370.1 DEGP protease 5 (e value: 9.9e-100; % identity: 64.41)
Query:  SSHNSLPFTSRRALVFAPS-ALMASLLA-----FPLPTHAALPQI---QAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVK
        S+H  +    RR ++F  S AL +SLL       P+ +  AL Q    + ++ +EE+R V LFQ  SPSVVYI+ +EL K    +S   +L ++EN K++
Subjt:  SSHNSLPFTSRRALVFAPS-ALMASLLA-----FPLPTHAALPQI---QAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVK

Query:  GTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYE
        GTGSGFVWDK GHIVTNYHV++ LATD  GLQRCKV+LVDAKG    +E KIVG DP+ DLAVLK+E  G EL P+VLGTS +LRVGQSC+AIGNP+GYE
Subjt:  GTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYE

Query:  KTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF
         TLT GV+SGLGREIPSPNG++I   IQTDA I+SGNSGGPL+DSYGH IGVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Subjt:  KTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF

AT5G27660.1 Trypsin family protein with PDZ domain (e value: 6.2e-17; % identity: 33.33)
Query:  KGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAK-GNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFG
        K  GSG + D  G I+T  HVV     D   ++      VD    +G   E  +V  D + D+A++K++     L    LG S  LR G    A+G P  
Subjt:  KGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAK-GNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFG

Query:  YEKTLTAGVISGLGREIPSPN-GRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPY
         + T+TAG++S + R+      G   R  +QTD +I++GNSGGPL++  G VIGVN           + G+ F++PID+V + + +
Subjt:  YEKTLTAGVISGLGREIPSPN-GRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPY

AT5G39830.1 Trypsin family protein with PDZ domain (e value: 1.5e-50; % identity: 45.12)
Query:  IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLV
        +H L + + P+++   L  +    L F PS  + S LA   P+ A +  +   V         E R+V LF+  + SVV I D+ L  +PQ      + +
Subjt:  IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLV

Query:  EDENVKVKGTGSGFVWDKFGHIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
         +      G GSG VWD  G+IVTNYHV+ +AL+ + S G    +VN++ + G     E K+VG D   DLAVLKV+     LKPI +G S +L+VGQ C
Subjt:  EDENVKVKGTGSGFVWDKFGHIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY
         AIGNPFG++ TLT GVISGL R+I S  G  I GGIQTDAAI+ GNSGGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY

AT5G39830.2 Trypsin family protein with PDZ domain (e value: 7.7e-44; % identity: 42.09)
Query:  IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLV
        +H L + + P+++   L  +    L F PS  + S LA   P+ A +  +   V         E R+V LF+  + SVV I D+ L  +PQ      + +
Subjt:  IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLV

Query:  EDENVKVKGTGSGFVWDKFGHIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC
         +      G GSG VWD  G+IVTNYHV+ +AL+ + S G    +VN++ + G     E K+VG D   DLAVLKV+     LKPI +G S +L+VGQ C
Subjt:  EDENVKVKGTGSGFVWDKFGHIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSC

Query:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY
         AIGNPFG++ TLT GVISGL R+I S  G  I GGIQTDAAI+ GNSGGPL+DS G++IG+NTA FT+                TV++ VP LI +
Subjt:  YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY
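
The alignments above appear to follow the BLAST-style pairwise layout, in which the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. Assuming that convention holds for this page, identities and positives can be counted for one alignment block as in the sketch below; `identity_stats` is a hypothetical helper, not a database or BLAST function.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Count (identities, positives, columns) for one alignment block.

    Assumption: BLAST-style midline, where a letter marks an identical
    residue, '+' marks a conservative substitution, and a space marks a
    mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First eight columns of the Cucumis melo GenBank alignment shown above.
ident, pos, cols = identity_stats("MALASLGI", "MAL  LGI")
print(f"{ident}/{cols} identities ({100 * ident / cols:.1f}%)")
```

The percent identity values listed with each hit are computed over the whole alignment, so a single block or fragment like this one will generally not reproduce them.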


Sequences
CDS sequence
ATGGCGTTAGCCTCACTGGGAATTCATCTTCTTCCAATTCCAGCTCCCCCAAATTCTTCCCACAACTCTCTACCCTTCACTTCGCGAAGAGCCCTAGTTTTTGCCCCATC
TGCTTTGATGGCTTCCCTCCTCGCTTTCCCTCTTCCCACTCACGCCGCTCTCCCCCAAATACAGGCCCAGGTTCCACAAGAAGAAGATCGAGTCGTCGCTCTCTTTCAGG
ATGCTTCACCTTCTGTCGTTTACATTAAGGACCTTGAATTAGCTAAGAAACCCCAGAACTCCTCTGAAGAGGCCCTGCTCGTCGAGGATGAGAATGTCAAGGTCAAAGGG
ACTGGTTCGGGCTTTGTATGGGATAAATTTGGCCATATCGTAACTAATTACCATGTTGTTTCCGCATTGGCTACTGATAACAGTGGATTGCAGCGTTGTAAGGTAAATTT
AGTCGATGCTAAAGGAAATGGAATTTATAGGGAAGCAAAAATTGTAGGTTTTGATCCAGAGTATGATCTAGCTGTTCTCAAGGTGGAACTTGGAGGATGTGAACTAAAGC
CCATCGTTCTCGGTACCTCTCGAAATTTACGTGTTGGTCAGAGCTGCTATGCCATTGGCAACCCTTTTGGTTATGAGAAGACACTAACAGCAGGGGTGATCAGTGGATTG
GGTAGAGAAATTCCATCACCAAATGGAAGGGCCATTCGGGGGGGTATTCAGACAGATGCTGCTATTAGTTCAGGGAATTCAGGGGGGCCATTAATTGACTCCTACGGTCA
TGTAATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTTAATTTTGCAATTCCAATAGACACAGTTGTACGTACGGTTCCATACCTTA
TTGTATACGGAACACCTTACAGTGAGAGATTTTGA
mRNA sequence
CGAAAATCTATTCGAGAGTAAAAATTGCAATACTTTAGTAGTTCTACTCTCAAAACTAAACCATTTCAGATTAATTAACTTCTTAGATTTACCCTTCTGGATTAAACAAC
TTTCAGATAAGTTGTGGCGAAAGAGTTAGAGCATAGCTTTAATCTTTCATAGGGATAAGGGAACTTCAAGAACTTCAAAATTTATAATGTTCGTTTGTTTCTAAAGCATA
ATGTTCAATTGATTAAGGTTTGAATTGCAGTTGCTGAATTTAAAAAGGGAAAATCACGATAAGAATCAAGATCAGAGGAAGTGGAACACCATGGCGTTAGCCTCACTGGG
AATTCATCTTCTTCCAATTCCAGCTCCCCCAAATTCTTCCCACAACTCTCTACCCTTCACTTCGCGAAGAGCCCTAGTTTTTGCCCCATCTGCTTTGATGGCTTCCCTCC
TCGCTTTCCCTCTTCCCACTCACGCCGCTCTCCCCCAAATACAGGCCCAGGTTCCACAAGAAGAAGATCGAGTCGTCGCTCTCTTTCAGGATGCTTCACCTTCTGTCGTT
TACATTAAGGACCTTGAATTAGCTAAGAAACCCCAGAACTCCTCTGAAGAGGCCCTGCTCGTCGAGGATGAGAATGTCAAGGTCAAAGGGACTGGTTCGGGCTTTGTATG
GGATAAATTTGGCCATATCGTAACTAATTACCATGTTGTTTCCGCATTGGCTACTGATAACAGTGGATTGCAGCGTTGTAAGGTAAATTTAGTCGATGCTAAAGGAAATG
GAATTTATAGGGAAGCAAAAATTGTAGGTTTTGATCCAGAGTATGATCTAGCTGTTCTCAAGGTGGAACTTGGAGGATGTGAACTAAAGCCCATCGTTCTCGGTACCTCT
CGAAATTTACGTGTTGGTCAGAGCTGCTATGCCATTGGCAACCCTTTTGGTTATGAGAAGACACTAACAGCAGGGGTGATCAGTGGATTGGGTAGAGAAATTCCATCACC
AAATGGAAGGGCCATTCGGGGGGGTATTCAGACAGATGCTGCTATTAGTTCAGGGAATTCAGGGGGGCCATTAATTGACTCCTACGGTCATGTAATTGGAGTCAACACAG
CAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTTAATTTTGCAATTCCAATAGACACAGTTGTACGTACGGTTCCATACCTTATTGTATACGGAACACCTTAC
AGTGAGAGATTTTGATAAAGTTTTACAATCTTTTCACAAGGGAACACAACCATTCATGCAACTCCAACTCCTCTCCAAGACCGTGGACTGCACCGTAAAATGACAATTTG
CTAACATAAATGAATTTGCTTTCTTCACAAGTGTTATGATTCATATGAATTCTTTCTAGATCTTGGTCTGAGTTGTGAATAATTGTTTATTTATGGAATATTACTCTTCC
TTCCCTCTGTTGGGTGGTTCAATCATTGAAAATATTATTGTTGTACAGATTCAAAGAGTTTTTACTCATCTGTTAAAACTTTCTAGCCTTAATTTCTGGAGTGTAGCTTT
CCAAACTAGAAACAATTCAGTGTTTCTGTCTCCATCCAAATAGCATGCCATTGTTGAAGATTCCACTTTTATGCACTTGATTCAATGAATAAAGAAAAGGAGAATATATT
CTGGTTATCCACAACCTTGTAAGAGAAAATTTTTTTTATCTCCTTATCCCATATTTCTACTGAAAAGAAATAATCATAAACACTCTCTTCAAACAAGTGTGCTAGTCCTT
TTTTCTCCTTTCTTAACCCCAGGCTTAGTCTTGATTTCTAGACTGATAGCAGCGCTAATTGTTCCACAATGGTACTATCTACATGACAGGACACACGGCAAGAGTTGGGG
ATAAGAATAAGGCCGAGGCACCGTTGTTCCATCGGAGAAGGTGAGCTACGTCGAGTACTACGATACTCCACTGCATCTTGTAGAATGATATTTCCTTGCTTGTCAATGCA
GTGAAAAGTTCCCAGGAAAAATCTTCCATCTTTAATGCCTATGAGCATTCGGCGAAACAGTAGCTTTCTCACCTTTGCTATGCAATCTAAACTGCCCGGATTAGACTCGA
CATTGCTCCCAACCTGAACCCTGGGTCCCTCTGATTCTTGTTCCATGTATGGTTCGATTGATCCCTGTAAGTGGGGAAGGTTGTTGAATATAGTTGATGAACAGGACCAA
GTATACGATCAGAATTGCAACTTAAATCAGTATGACAACTCAATTAGTGGAAAACAAATGGGACGAAAACCGTCCTAAAGAAATATATGAAGATTTTTTGAAAAAGGGAA
AAAATGAAAGCCAGTAAAAGAGAGATTAATGTAAACCATGAACTCTTTACTAACATCCAATAGACACATAATACACTCCA
Protein sequence
MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKG
TGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGL
GREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF
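
As a quick consistency check between the CDS and protein sequences above, the CDS can be translated with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the standard (nuclear) codon table is an assumption about this gene, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last nine codons of the CDS listed above.
print(translate("ATGGCGTTAGCCTCACTGGGA"))        # start of the protein
print(translate("GGAACACCTTACAGTGAGAGATTTTGA"))  # end of the protein, up to the stop
```

These fragments translate to `MALASLG` and `GTPYSERF`, matching the start and end of the protein sequence listed above. Passing the full CDS, with line breaks removed, checks the whole sequence the same way.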