; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg25442 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg25442
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtease Do-like 5, chloroplastic
Genome locationCarg_Chr13:6638283..6644277
RNA-Seq ExpressionCarg25442
SyntenyCarg25442
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR009003 - Peptidase S1, PA clan
IPR043504 - Peptidase S1, PA clan, chymotrypsin-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583889.1 Protease Do-like 5, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.8e-3187.06Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGT------PYSNSIIEATIDEMQREEFGDCVFV
        GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLI   T      P+ NSIIEATIDEMQREEFGDCVFV
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGT------PYSNSIIEATIDEMQREEFGDCVFV

KAG7019506.1 Protease Do-like 5, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]3.4e-45100Show/hide
Query:  MLPKSSIIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATIDEMQREEFGDCVFV
        MLPKSSIIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATIDEMQREEFGDCVFV
Subjt:  MLPKSSIIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATIDEMQREEFGDCVFV

OVA07424.1 Peptidase S1C [Macleaya cordata]2.5e-2487.88Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATID
        GNSGGPL+D YGHVIGVNTATFTRKGTG SSGVNFAIPIDTVVRTVPYLIVYGTPYSNSI +  +D
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATID

RVW34933.1 Protease Do-like 5, chloroplastic [Vitis vinifera]2.5e-2485.07Show/hide
Query:  SFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEAT
        + TGNSGGPL++ YGHVIGVNTATFTRKGTG+SSGVNFAIPIDTVVRTVPYLIVYGTPYSN  +E T
Subjt:  SFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEAT

RYR36331.1 hypothetical protein Ahy_A10g051315 isoform F [Arachis hypogaea]4.3e-2485.07Show/hide
Query:  IIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNS
        + +H   TGNSGGPL+D YGHVIGVNTATFTRKG+GMSSGVNFAIPIDTV+RTVPYLIVYGTPYSNS
Subjt:  IIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNS

TrEMBL top hitse value%identityAlignment
A0A200QAE4 Peptidase S1C1.2e-2487.88Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATID
        GNSGGPL+D YGHVIGVNTATFTRKGTG SSGVNFAIPIDTVVRTVPYLIVYGTPYSNSI +  +D
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATID

A0A2C9U6L0 Uncharacterized protein2.1e-2487.69Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATI
        GNSGGPL+D YGHVIGVNTATFTRKGTG+SSGVNFAIPIDTVVRTVPYLIVYGTPYS+S +  TI
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATI

A0A438DHL2 Protease Do-like 5, chloroplastic1.2e-2485.07Show/hide
Query:  SFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEAT
        + TGNSGGPL++ YGHVIGVNTATFTRKGTG+SSGVNFAIPIDTVVRTVPYLIVYGTPYSN  +E T
Subjt:  SFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEAT

A0A438HZP4 Protease Do-like 5, chloroplastic4.7e-2486.15Show/hide
Query:  TGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEAT
        +GNSGGPL++ YGHVIGVNTATFTRKGTG+SSGVNFAIPIDTVVRTVPYLIVYGTPYSN  +E T
Subjt:  TGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEAT

A0A445BCH2 Uncharacterized protein2.1e-2485.07Show/hide
Query:  IIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNS
        + +H   TGNSGGPL+D YGHVIGVNTATFTRKG+GMSSGVNFAIPIDTV+RTVPYLIVYGTPYSNS
Subjt:  IIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNS

SwissProt top hitse value%identityAlignment
O22609 Protease Do-like 1, chloroplastic6.5e-0752.83Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG
        GNSGGPL+D  G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG

Q2YMX6 Probable periplasmic serine endoprotease DegP-like2.1e-0551.85Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGT
        GNSGGP  DL G VIG+NTA F+   +G S G+ FAIP  T  + V  LI  G+
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGT

Q4KGQ4 Probable periplasmic serine endoprotease DegP-like5.5e-0642.86Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATIDEMQR---EEFG
        GNSGGPL +L G V+G+N+  +TR G  M  GV+FAIPID V   V   +  G   S   +   I E+ +   E FG
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATIDEMQR---EEFG

Q9LU10 Protease Do-like 8, chloroplastic5.4e-0959.62Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY
        GNSGGPL+D  G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY

Q9SEL7 Protease Do-like 5, chloroplastic3.8e-2389.47Show/hide
Query:  TGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
        +GNSGGPL+D YGH IGVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y
Subjt:  TGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY

Arabidopsis top hitse value%identityAlignment
AT3G27925.1 DegP protease 14.7e-0852.83Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG
        GNSGGPL+D  G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG

AT4G18370.1 DEGP protease 52.7e-2489.47Show/hide
Query:  TGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
        +GNSGGPL+D YGH IGVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y
Subjt:  TGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY

AT5G27660.1 Trypsin family protein with PDZ domain5.3e-0443.75Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPY
        GNSGGPLV+L G VIGVN           + G+ F++PID+V + + +
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPY

AT5G39830.1 Trypsin family protein with PDZ domain3.8e-1059.62Show/hide
Query:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY
        GNSGGPL+D  G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Subjt:  GNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCCCAAGTCATCCATTATAATGCATAGAAGTTTTACAGGGAATTCAGGGGGGCCATTAGTTGACTTATACGGCCATGTAATTGGAGTCAACACAGCAACTTTCAC
TCGCAAAGGAACTGGAATGTCTTCTGGTGTTAATTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATATCTTATTGTATATGGAACGCCTTACAGTAATTCTA
TCATTGAAGCTACGATTGATGAAATGCAACGGGAAGAGTTTGGCGATTGTGTGTTTGTTTAA
mRNA sequenceShow/hide mRNA sequence
TGAGATCTCACCTCGTTTGGAGAGGGGAACGAAGGATTCCTTATAAGGATGTGAAAACCTCTCCCTAGTAGACGCGTTTTAAAATCTTGAGGGGAAGTCCAAAAGAAAAA
GCCCAAAAAAGACAATATCTATTAGCAATGAGTTTAGGCTGTTACATTTTTCGTCCTATATATTATATGTCTATTTAGTACTCATCAAATATTTAGTGCATGTCTCACAA
ATGATGATCCTATATCGACTTTGTGCAGCTAGTGTCAAACACATAAAAGTATGTGCTAACAAGTATTCTATGTATCCAATAAGTGTCGGAGTGTCCAAGTATTTGAAACG
TGTTGGACACGGACACACTGCCCAAACTAAAATGTTTGTGCTTCTTAGAAACCATGATCGGCTGATTCTGAAGTTTCCATGTAACATCCAGGTGGAACTTCAAGGACATG
AACTAAAGCCCATAGTCTTTGGTACCTCTCGAAGTTTACGTGTTGGTCAGAGCTGCTATGCCATTGGCAACCCTTTCGGTTATGAGAAGACACTCACAGCAGGAGTAAGG
CATCAAGAATGTTACTTCCTCCATTTAATATAGGGAAACCCACTTTGAGTTTTCCATTCCAAACCTAAAACTAATACTTCTCTCTTATAATGAACATATTAATCATAGAA
GTTTATAATATCTCTGATCTCACACACATTTATGCTTTCTTCAAGTGTCATATCAAGTTTCGTACTGGTGTTCTTTACCCAAGTAACACAGAGTCATATAGCAGATTGAC
TAAATAATTAAGAACCGTAAATCATTCAGGGTTTGGATAGCAGGTGATCAGCGGATTGGGTAGAGAAATTCCATCCCCAAATGGAAGGGCCATCAGGGGAGCTATTCAGA
CAGATGCTGCTATTAGTGCAGGTTCATGGTTCAACTTCCTGAAATAAGATTTTTCCATTTCAAACACTGTTCTGTTATCTTTTCATGCTCTTTCTTCTCCCTTTTCAAAG
CTATTCTCCTTTCATCACTCATATTACAAGTGTTTATTGATTCCACTGATCTTGAATCTTGAAACTATTGAACAGTTAAATTCCCAGCTTATCTTGTTCTTTCTATTCGG
TCTCTGGTTTATGCTGCCCAAGTCATCCATTATAATGCATAGAAGTTTTACAGGGAATTCAGGGGGGCCATTAGTTGACTTATACGGCCATGTAATTGGAGTCAACACAG
CAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTTAATTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATATCTTATTGTATATGGAACGCCTTAC
AGTAATTCTATCATTGAAGCTACGATTGATGAAATGCAACGGGAAGAGTTTGGCGATTGTGTGTTTGTTTAAATGTCTATAGTAAGCTTGTAACGTTAACACATTTGAGT
TCTAAACAAAATTTAAAAAATGCCCATATTTCCACTAAAACATCATATTGTTTTAGAATTTTTCTGGAGTAATAGAAACTGTCATAGACATTCTCTTCAAACAAGTGTGC
AGATTTCCCCCCTCCCCTCTTAGCCTAAGGCTTGATCTTATTAGCTAAACAGATAACAGCGCCAATTGTTCATCAATGGTACTATCTACATGACAAGATACACGGCAAGA
GTTAGGGATAAGAATAAGACCTAGGCACCGTTGTTCCATCGGAGAGGGTGAGCTACGTCGAGTGCTACGATACTCTACTGCATCTTGAAGGATGATATTTCCTTGCTTGT
CCATGCAGTAAAAGGAGCCCAAGAAAAACCTTCCATCTTTAATACCTATGAGCATTCGACGGAACAGAAGCTTTCTCACCTTTCCTACACGATCTAAACTGTCTGGATTA
GACTCAACATTGCTACCAACCTGAACCATGGATCCTCCTGATTCTTGTTCCATCTATGGTTACTTCAATTGGTCCCTGGAGTGGACGAAGAGAAAGAGACGGTGGGAGTC
GGCAGAACAGAGTCGATCTTGGCGGCCGATTGTAATTCTAAGCTTCCTGTCGCGGATGAGACCGGAGAAATAAAGGGTTA
Protein sequenceShow/hide protein sequence
MLPKSSIIMHRSFTGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSNSIIEATIDEMQREEFGDCVFV