; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000455 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000455
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionChymopapain
Genome locationchr4:7688742..7696005
RNA-Seq ExpressionLag0000455
SyntenyLag0000455
Gene Ontology termsGO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0097655 - serpin family protein binding (molecular function)
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
2BDZ_A Mexicain from Jacaratia mexicana [Jacaratia mexicana]8.8e-0546.03Show/hide
Query:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        EG  G   +H +TAVGY   Y++LK+ +G NWG+ GY+RI R S    S G  GVY  + +PI
Subjt:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

P84346.1 RecName: Full=Mexicain [Jacaratia mexicana]8.8e-0546.03Show/hide
Query:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        EG  G   +H +TAVGY   Y++LK+ +G NWG+ GY+RI R S    S G  GVY  + +PI
Subjt:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

XP_021905591.1 LOW QUALITY PROTEIN: papain-like [Carica papaya]3.3e-0441.27Show/hide
Query:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        EG  G   +H +TAVGY P YI++K+ +G  WG+ G++RI R +R+ +  G  G+Y  + YP+
Subjt:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

XP_021911524.1 papain-like [Carica papaya]3.3e-0441.27Show/hide
Query:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        EG  G   +H +TAVGY P YI++K+ +G  WG+ G++RI R +R+ +  G  G+Y  + YP+
Subjt:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

TrEMBL top hitse value%identityAlignment
Q9SMH8 Chymopapain isoform V (Fragment)6.1e-0437.36Show/hide
Query:  LENHEMIYLFD---DPSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP
        L N  + +L +    P +L  +G  +G  G   +H +TAVGY       YI++K+ +G NWG+ GYMR+ R+S   +S G  GVY  + YP
Subjt:  LENHEMIYLFD---DPSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP

Q9SMI0 Chymopapain isoform III6.1e-0437.36Show/hide
Query:  LENHEMIYLFD---DPSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP
        L N  + +L +    P +L  +G  +G  G   +H +TAVGY       YI++K+ +G NWG+ GYMR+ R+S   +S G  GVY  + YP
Subjt:  LENHEMIYLFD---DPSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP

Q9SMI1 Chymopapain isoform II6.1e-0437.36Show/hide
Query:  LENHEMIYLFD---DPSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP
        L N  + +L +    P +L  +G  +G  G   +H +TAVGY       YI++K+ +G NWG+ GYMR+ R+S   +S G  GVY  + YP
Subjt:  LENHEMIYLFD---DPSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP

SwissProt top hitse value%identityAlignment
P00784 Papain2.8e-0640.32Show/hide
Query:  GLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        G  G   +H + AVGY P YI++K+ +G  WG+ GY+RI R +   +S G  G+Y  + YP+
Subjt:  GLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

P05994 Papaya proteinase 41.4e-0543.28Show/hide
Query:  EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        EG  G   +H +TAVGY       YI++K+ +G  WG+ GY+RI R S   +S G  GVY  + YPI
Subjt:  EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

P14080 Chymopapain2.8e-0640.79Show/hide
Query:  PSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP
        P +L  +G  +G  G   +H +TAVGY       YI++K+ +G NWG+ GYMR+ R+S   +S G  GVY  + YP
Subjt:  PSKLADAG--EGLIGMTPNHRMTAVGYDPC----YIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYP

P84346 Mexicain1.1e-0746.03Show/hide
Query:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        EG  G   +H +TAVGY   Y++LK+ +G NWG+ GY+RI R S    S G  GVY  + +PI
Subjt:  EGLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

P84347 Chymomexicain2.6e-0438.71Show/hide
Query:  GLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        G  G   +H +TA+GY    ++ K+ +G NWG+ GY++I R S    S G  GVY  + +PI
Subjt:  GLIGMTPNHRMTAVGYDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI

Arabidopsis top hitse value%identityAlignment
AT1G06260.1 Cysteine proteinases superfamily protein7.9e-0434.92Show/hide
Query:  GMTPNHRMTAVGY----DPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI
        G   NH +T VGY    D  Y ++K+ +G  WG+ GY+R+ R   +D   G  G+     YP+
Subjt:  GMTPNHRMTAVGY----DPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGAAGACTATCCTAAAGGAGATCCATGGCTCAGCAGCTCCCGTCACGCGTACTGCACACCTTTACCTCTACCTGCAGTTGTATGTAAGGGAAAAGAATGAGTATTT
AAGAAAATACTCAGCGCGTAGCGCTCTAAGCTACCATTTCACTGGTGGAGCGTTCATGGGTTTAAGTCAGGAATTTACTTCCACACAGCGACGGCGCTTCAGGAGACGAC
GAATAGCGGTGGTTCACGGCGGCGAGACGACGGCAGCGTGCAGATCTGAAATCGGGGAGGCGGCGGCGCACGGATCGTTAGGTTGTAGACAGAGAGAGAGAGGGCTGCGA
GGAAGAGAGAGAGACGGAAACGAAGGAGAGAGAGAGAAAAAGATATTACCGGCAACGACTCGCGGAGGACGACGACGGGCGGTGGAGTGCGGCGGCGGCGTCGACGTCGG
CGGTGAGCTGCAGGCGGCGGCAAGCGGACCGGTGGAGTGTATTACAATGCAGTTTACCCTATTGTATAGATTTACTCGTTTGACGATTCATCTGAGCCCGCTGATGCTGG
AGAGATGGATGCAACTGGGAGATGGAGGATATATGCGAATTGCTAAAAAATCACGAGATGGCCATTCGTTAGGACCCGGTGGAGTGTATTACTATGCAATTTACCCTATT
ATACAGATTTACCTGTTTGACGATCCATCTGAGCCCGTTGATCTTGGAGAGATTTACCTATTTGACGATCCATTTCAGCCCGCTGATGCTGGAGAGAAACGAAACTTGAG
AATATTGTTTTATGTTTTTCTTCTAATGATTTACCTGTTTGACGATCCATCTGATCCCGCTGATGCTGGAGAGACGGATGCAACTGGGGAGATGGGGGATATATGCGAAT
TGCTAGAAAATCACGAGATGATTTACCTGTTTGATGATCCATCTAAGCTCGCTGATGCTGGAGAGGGCCTCATTGGAATGACACCGAACCATCGAATGACCGCAGTTGGG
TATGATCCTTGCTACATAGTTTTGAAACATATATACGGATGCAACTGGGGAGATGGAGGATATATGCGAATTGCTAGAAAATCACGAGATGACCATTCGTTAGGACCCGG
TGGAGTGTATTACTATACAATTTACCCTATTGTACAAATTTACCTGTTTGACGATCCATCCGAGCTCGCTGATGCTGGAGAGATTTATCTGCTGGACAATCCATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGGAAGACTATCCTAAAGGAGATCCATGGCTCAGCAGCTCCCGTCACGCGTACTGCACACCTTTACCTCTACCTGCAGTTGTATGTAAGGGAAAAGAATGAGTATTT
AAGAAAATACTCAGCGCGTAGCGCTCTAAGCTACCATTTCACTGGTGGAGCGTTCATGGGTTTAAGTCAGGAATTTACTTCCACACAGCGACGGCGCTTCAGGAGACGAC
GAATAGCGGTGGTTCACGGCGGCGAGACGACGGCAGCGTGCAGATCTGAAATCGGGGAGGCGGCGGCGCACGGATCGTTAGGTTGTAGACAGAGAGAGAGAGGGCTGCGA
GGAAGAGAGAGAGACGGAAACGAAGGAGAGAGAGAGAAAAAGATATTACCGGCAACGACTCGCGGAGGACGACGACGGGCGGTGGAGTGCGGCGGCGGCGTCGACGTCGG
CGGTGAGCTGCAGGCGGCGGCAAGCGGACCGGTGGAGTGTATTACAATGCAGTTTACCCTATTGTATAGATTTACTCGTTTGACGATTCATCTGAGCCCGCTGATGCTGG
AGAGATGGATGCAACTGGGAGATGGAGGATATATGCGAATTGCTAAAAAATCACGAGATGGCCATTCGTTAGGACCCGGTGGAGTGTATTACTATGCAATTTACCCTATT
ATACAGATTTACCTGTTTGACGATCCATCTGAGCCCGTTGATCTTGGAGAGATTTACCTATTTGACGATCCATTTCAGCCCGCTGATGCTGGAGAGAAACGAAACTTGAG
AATATTGTTTTATGTTTTTCTTCTAATGATTTACCTGTTTGACGATCCATCTGATCCCGCTGATGCTGGAGAGACGGATGCAACTGGGGAGATGGGGGATATATGCGAAT
TGCTAGAAAATCACGAGATGATTTACCTGTTTGATGATCCATCTAAGCTCGCTGATGCTGGAGAGGGCCTCATTGGAATGACACCGAACCATCGAATGACCGCAGTTGGG
TATGATCCTTGCTACATAGTTTTGAAACATATATACGGATGCAACTGGGGAGATGGAGGATATATGCGAATTGCTAGAAAATCACGAGATGACCATTCGTTAGGACCCGG
TGGAGTGTATTACTATACAATTTACCCTATTGTACAAATTTACCTGTTTGACGATCCATCCGAGCTCGCTGATGCTGGAGAGATTTATCTGCTGGACAATCCATCTTAG
Protein sequenceShow/hide protein sequence
MRKTILKEIHGSAAPVTRTAHLYLYLQLYVREKNEYLRKYSARSALSYHFTGGAFMGLSQEFTSTQRRRFRRRRIAVVHGGETTAACRSEIGEAAAHGSLGCRQRERGLR
GRERDGNEGEREKKILPATTRGGRRRAVECGGGVDVGGELQAAASGPVECITMQFTLLYRFTRLTIHLSPLMLERWMQLGDGGYMRIAKKSRDGHSLGPGGVYYYAIYPI
IQIYLFDDPSEPVDLGEIYLFDDPFQPADAGEKRNLRILFYVFLLMIYLFDDPSDPADAGETDATGEMGDICELLENHEMIYLFDDPSKLADAGEGLIGMTPNHRMTAVG
YDPCYIVLKHIYGCNWGDGGYMRIARKSRDDHSLGPGGVYYYTIYPIVQIYLFDDPSELADAGEIYLLDNPS