; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000456 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000456
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCysteine proteinases superfamily protein
Genome locationchr4:7697873..7713944
RNA-Seq ExpressionLag0000456
SyntenyLag0000456
Gene Ontology termsGO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0097655 - serpin family protein binding (molecular function)
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
2BDZ_A Mexicain from Jacaratia mexicana [Jacaratia mexicana]1.3e-0546.15Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +Y+GP G  ++H +TAVGYG  Y+LLKN++G NW + GY++I R S  G S    G+Y  + +PI
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

KAG6572278.1 Cysteine protease XCP2, partial [Cucurbita argyrosperma subsp. sororia]1.3e-0543.28Show/hide
Query:  LYKGPFGMTSNHQMTAVGYCPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMY-YYAIYPIV
        +Y GPFG + +H + AVGY P YI++KN +G +W D GYM ++RK     +L+  G++  +A YP+V
Subjt:  LYKGPFGMTSNHQMTAVGYCPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMY-YYAIYPIV

P84346.1 RecName: Full=Mexicain [Jacaratia mexicana]1.3e-0546.15Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +Y+GP G  ++H +TAVGYG  Y+LLKN++G NW + GY++I R S  G S    G+Y  + +PI
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

XP_021905591.1 LOW QUALITY PROTEIN: papain-like [Carica papaya]1.7e-0541.54Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +++GP G   +H +TAVGYGP YIL+KN++G  W + G+++I R +R+ Y +   G+Y  + YP+
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

XP_021911524.1 papain-like [Carica papaya]1.7e-0541.54Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +++GP G   +H +TAVGYGP YIL+KN++G  W + G+++I R +R+ Y +   G+Y  + YP+
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

TrEMBL top hitse value%identityAlignment
A0A0D9Z9X0 Uncharacterized protein1.2e-0442.86Show/hide
Query:  DDPSERVDAGESLYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMY
        DD ++ +D    +Y+G  G T NH M  VGYG  Y +LKN+YG  W D GY+ + R    G  L  GG Y
Subjt:  DDPSERVDAGESLYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMY

A0A2P6Q839 Putative fruit bromelain4.0e-0543.84Show/hide
Query:  GESLYKGPFGMTSNHQMTAVGY----GPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPIV
        G  ++KGP G T NH +T +GY    G  Y L+KN++G  W + GYM+I R   +G  L   G+  YA YPIV
Subjt:  GESLYKGPFGMTSNHQMTAVGY----GPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPIV

A0A6P5F3S2 fruit bromelain-like4.5e-0434.48Show/hide
Query:  LYKGPFGMTSNHQMTAVGY------GPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPIVQIYPFDDPSEPADSG
        +Y+GP     NH +T +GY      G  Y L+KN++G NW + GYM++AR++  G      G+  YA+YP +   P  + S+    G
Subjt:  LYKGPFGMTSNHQMTAVGY------GPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPIVQIYPFDDPSEPADSG

E5LBE8 Mitogenic proteinase (Fragment)3.1e-0546.15Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +YKGP G   +H +TA+GYG  YIL+KN++G NW + GY++I R S  G S    G+Y  + +PI
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

I1Z743 Mexicain-like cystein protease (Fragment)3.4e-0442.19Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYP
        ++ GP G   +H +TA+GYG  YIL+KN++G NW + GY++I R S  G S    G+Y  + +P
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYP

SwissProt top hitse value%identityAlignment
P00784 Papain1.1e-0740Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        ++ GP G   +H + AVGYGP YIL+KN++G  W + GY++I R + + Y +   G+Y  + YP+
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

P05994 Papaya proteinase 41.5e-0440.58Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPY----YILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +++G  G   +H +TAVGYG      YIL+KN++G  W + GY++I R S  G S    G+Y  + YPI
Subjt:  LYKGPFGMTSNHQMTAVGYGPY----YILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

P14080 Chymopapain6.6e-0539.71Show/hide
Query:  LYKGPFGMTLNHRMTAVGH----GPYYILLKYTYGFNWVDGGYMQIARKSRDGYSLRPGGMYNYAIYP
        ++ GP G  L+H +TAVG+    G  YI++K ++G NW + GYM++ R+S  G S    G+Y  + YP
Subjt:  LYKGPFGMTLNHRMTAVGH----GPYYILLKYTYGFNWVDGGYMQIARKSRDGYSLRPGGMYNYAIYP

P84346 Mexicain1.7e-0846.15Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +Y+GP G  ++H +TAVGYG  Y+LLKN++G NW + GY++I R S  G S    G+Y  + +PI
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

P84347 Chymomexicain1.3e-0540Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        ++ GP G  ++H +TA+GYG   +L KN++G NW + GY++I R S  G S    G+Y  + +PI
Subjt:  LYKGPFGMTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

Arabidopsis top hitse value%identityAlignment
AT2G34080.1 Cysteine proteinases superfamily protein5.2e-0538.57Show/hide
Query:  LYKGPFGMTSNHQMTAVGY-----GPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +Y GP G +SNH +T VGY     G  Y L KN++G  W + GY++I R     +     G+  YA YP+
Subjt:  LYKGPFGMTSNHQMTAVGY-----GPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI

AT3G48340.1 Cysteine proteinases superfamily protein7.5e-0434.85Show/hide
Query:  SERVDAG--------ESLYKGPFGMTSNHQMTAVGY----GPYYILLKNTYGCNWVDGGYMQIARK
        S  +DAG        E ++ G  G   NH + AVGY    G  Y +++N++G  W +GGY++I R+
Subjt:  SERVDAG--------ESLYKGPFGMTSNHQMTAVGY----GPYYILLKNTYGCNWVDGGYMQIARK

AT4G23520.1 Cysteine proteinases superfamily protein3.0e-0537.68Show/hide
Query:  LYKGPFGMTSNHQMTAVGYGPY----YILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI
        +Y GP G   +H +  VGYG      Y +++N++G  W D GY++IAR   D   L   G+   A YPI
Subjt:  LYKGPFGMTSNHQMTAVGYGPY----YILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTGAACCATCGAATGACCGCAGTTGTGTATGGTCTTTACTACATAATTTTGAAAAATACAGACGGATGCAACTGGGGAGATGGAGGATGTATGCGAATTTCTAG
AAAATCACAAGATGGTCATTCGTTAGGACCTGGTGGAGTGTATTACTATGCAATTTACCCTATTATTTACCTGTTTAACGATCCATTTGAGCCCGCTGATATCCATTTGA
GCCCGCTGATGTTGGAGAGGACCGGTGGAGTGTATTACTATGCAATTTACCCTATTGTACAGATTTACCTTTTTGACGATCCATCTGAGCGAGTTGATGATGGAGAGCAT
ATTTACCGGTTTGATGATCCATCTGAGCACGCTGATGCTGGAGAGATTTACCTGTTTGACGATCCATCTGAGCGAGTTGATGCTGGAGACGGTCCCTTTGGAATGACATT
GAACCATCGAATGACCGCAGTTGGGCATGATCCTTACTACATACTTTTGAAATATACATACGGATTTAACTGGGTAGATGGAGGGTATATGCAAATTGCTAGAAAATCAC
GAGATGGCTATTCGTTAAGACCCGGTGGAATGTATAACTATGCAATTTACCCTATTGTACAGCATATTTATTCGTTTGACGATCCATCTGAGCACGCTGATGCCGGAGAG
ATTTACCTGTTTGACGATCCATCTGAGCGAGTTGATGCTGGAGAGAGTTTATATAAGGGTCCCTTTGGAATGACATCGAACCATCAAATGACCGCAGTTGGGTATTGTCC
TTACTACATACTTTTGAAAAATACATACGGATGTAACTGGGTAGATGGAGGATATATGCAAATTGCTAGAAAATCACGAGATGGCTATTCGTTAAGACCCGGTGGAATGT
ATTACTATGCAATTTACCCTATTGTACAGTATATTTACCCGTTTGACGATCCATCTGAGCACGCTAATGCTGGAGAGATTTACCTGTTTGACGATCCATCTGAGCGAGTT
GATGCTGGAGACGGGTCGGGTTTTAGGCCCGACCCCCTGCTCGGCCTCGGCCCGCTCGTGCGGGCCGAGCCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGATGCCC
CGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTATTTATATCCCTCTTTGCCACTGAAGAGGGGATCCCGAATTCTAT
CCCTAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCTTACTTTTCCACGCCCTACCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACCACA
CCGGTGTGCAGTTTATATAAGGGTCCCTTTGGAATGACATTGAACCATCGAATGACCGCAGTTGGGCATGGTCCTTACTACATACTTTTGAAATATACATATGGATTTAA
CTGGGTAGATGGAGGATATATGCAAATTGCTAGAAAATCACGAGATGGCTATTCGTTAAGACCCGGTGGAATGTATAACTATGCAATTTACCCTATTGTACAGCATATTT
ATCCGTTTGACGATCCATCTGAGCACGCTGATGTCGGAGAGATTTACCTGTTTGATGATCCATCTGAGCGAGTTGATGCTGGAGAGAGTTTATATAAGGGTCCCTTTGGA
ATGACATCGAACCATCAAATGACCGCAGTTGGGTATGGTCCTTACTACATACTTTTGAAAAATACATACGGATGTAACTGGGTAGATGGAGGATATATGCAAATTGCTAG
AAAATCACGAGATGGCTATTCGTTAAGACCCGGTGGAATGTATTACTATGCAATTTACCCTATTGTACAAATTTACCCGTTTGACGATCCATCTGAGCCCGCTGATTCTG
GGAGGCCCAGCATTCGGAATCCGATTTCGGCCCTTAACGGCCCTGACATTCCCCTGCGACCTGGCCACGCCACCATACTTGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACACTGAACCATCGAATGACCGCAGTTGTGTATGGTCTTTACTACATAATTTTGAAAAATACAGACGGATGCAACTGGGGAGATGGAGGATGTATGCGAATTTCTAG
AAAATCACAAGATGGTCATTCGTTAGGACCTGGTGGAGTGTATTACTATGCAATTTACCCTATTATTTACCTGTTTAACGATCCATTTGAGCCCGCTGATATCCATTTGA
GCCCGCTGATGTTGGAGAGGACCGGTGGAGTGTATTACTATGCAATTTACCCTATTGTACAGATTTACCTTTTTGACGATCCATCTGAGCGAGTTGATGATGGAGAGCAT
ATTTACCGGTTTGATGATCCATCTGAGCACGCTGATGCTGGAGAGATTTACCTGTTTGACGATCCATCTGAGCGAGTTGATGCTGGAGACGGTCCCTTTGGAATGACATT
GAACCATCGAATGACCGCAGTTGGGCATGATCCTTACTACATACTTTTGAAATATACATACGGATTTAACTGGGTAGATGGAGGGTATATGCAAATTGCTAGAAAATCAC
GAGATGGCTATTCGTTAAGACCCGGTGGAATGTATAACTATGCAATTTACCCTATTGTACAGCATATTTATTCGTTTGACGATCCATCTGAGCACGCTGATGCCGGAGAG
ATTTACCTGTTTGACGATCCATCTGAGCGAGTTGATGCTGGAGAGAGTTTATATAAGGGTCCCTTTGGAATGACATCGAACCATCAAATGACCGCAGTTGGGTATTGTCC
TTACTACATACTTTTGAAAAATACATACGGATGTAACTGGGTAGATGGAGGATATATGCAAATTGCTAGAAAATCACGAGATGGCTATTCGTTAAGACCCGGTGGAATGT
ATTACTATGCAATTTACCCTATTGTACAGTATATTTACCCGTTTGACGATCCATCTGAGCACGCTAATGCTGGAGAGATTTACCTGTTTGACGATCCATCTGAGCGAGTT
GATGCTGGAGACGGGTCGGGTTTTAGGCCCGACCCCCTGCTCGGCCTCGGCCCGCTCGTGCGGGCCGAGCCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGATGCCC
CGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTATTTATATCCCTCTTTGCCACTGAAGAGGGGATCCCGAATTCTAT
CCCTAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCTTACTTTTCCACGCCCTACCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACCACA
CCGGTGTGCAGTTTATATAAGGGTCCCTTTGGAATGACATTGAACCATCGAATGACCGCAGTTGGGCATGGTCCTTACTACATACTTTTGAAATATACATATGGATTTAA
CTGGGTAGATGGAGGATATATGCAAATTGCTAGAAAATCACGAGATGGCTATTCGTTAAGACCCGGTGGAATGTATAACTATGCAATTTACCCTATTGTACAGCATATTT
ATCCGTTTGACGATCCATCTGAGCACGCTGATGTCGGAGAGATTTACCTGTTTGATGATCCATCTGAGCGAGTTGATGCTGGAGAGAGTTTATATAAGGGTCCCTTTGGA
ATGACATCGAACCATCAAATGACCGCAGTTGGGTATGGTCCTTACTACATACTTTTGAAAAATACATACGGATGTAACTGGGTAGATGGAGGATATATGCAAATTGCTAG
AAAATCACGAGATGGCTATTCGTTAAGACCCGGTGGAATGTATTACTATGCAATTTACCCTATTGTACAAATTTACCCGTTTGACGATCCATCTGAGCCCGCTGATTCTG
GGAGGCCCAGCATTCGGAATCCGATTTCGGCCCTTAACGGCCCTGACATTCCCCTGCGACCTGGCCACGCCACCATACTTGTCTGA
Protein sequenceShow/hide protein sequence
MTLNHRMTAVVYGLYYIILKNTDGCNWGDGGCMRISRKSQDGHSLGPGGVYYYAIYPIIYLFNDPFEPADIHLSPLMLERTGGVYYYAIYPIVQIYLFDDPSERVDDGEH
IYRFDDPSEHADAGEIYLFDDPSERVDAGDGPFGMTLNHRMTAVGHDPYYILLKYTYGFNWVDGGYMQIARKSRDGYSLRPGGMYNYAIYPIVQHIYSFDDPSEHADAGE
IYLFDDPSERVDAGESLYKGPFGMTSNHQMTAVGYCPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPIVQYIYPFDDPSEHANAGEIYLFDDPSERV
DAGDGSGFRPDPLLGLGPLVRAEPVRSRLVPTASGCPGFAWFDLKRLRNPKKARRMNRYLYPSLPLKRGSRILSLNSTLYSLLSPLALTFPRPTVLFADLSIGAGVASTT
PVCSLYKGPFGMTLNHRMTAVGHGPYYILLKYTYGFNWVDGGYMQIARKSRDGYSLRPGGMYNYAIYPIVQHIYPFDDPSEHADVGEIYLFDDPSERVDAGESLYKGPFG
MTSNHQMTAVGYGPYYILLKNTYGCNWVDGGYMQIARKSRDGYSLRPGGMYYYAIYPIVQIYPFDDPSEPADSGRPSIRNPISALNGPDIPLRPGHATILV