; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036570 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036570
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCysteine proteinases superfamily protein
Genome locationchr3:48694743..48695154
RNA-Seq ExpressionLag0036570
SyntenyLag0036570
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR025661 - Cysteine peptidase, asparagine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029224.1 hypothetical protein SDJN02_07562, partial [Cucurbita argyrosperma subsp. argyrosperma]8.5e-2665.59Show/hide
Query:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGRVFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTENT
        S P VTIDGYE VP  +ENALM+ VANQPV  ++A    VFDGYCG   NH +VAIGYGT E+GTDYW+++NSWGVGWGE GY R+KRG E +
Subjt:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGRVFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTENT

XP_004142960.1 vignain [Cucumis sativus]1.5e-2563.64Show/hide
Query:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE
        S P V IDGYE VP  +E+ALM+ VANQPV+  I + GR        VFDGYCG   NH +VAIGYGT EDGTDYWL++NSWGVGWGE GY R+KRG E
Subjt:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE

XP_008444390.1 PREDICTED: vignain-like [Cucumis melo]2.5e-2562.63Show/hide
Query:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE
        S P V IDGYE VP  +E+ALM+ VANQPV+  I + GR        VFDGYCG   NH +VAIGYGT EDGTDYW+++NSWGVGWGE GY R+KRG E
Subjt:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE

XP_023002122.1 vignain-like [Cucurbita maxima]6.1e-2460.2Show/hide
Query:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN
        P VTIDG+E VP  +ENALM+ VANQPV+ +I + GR        VFDGYCG   NH +V IGYGT + GTDYW ++NSWGVGWGE GY R+KRG E+
Subjt:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN

XP_038885064.1 LOW QUALITY PROTEIN: vignain-like [Benincasa hispida]4.2e-2561.62Show/hide
Query:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE
        S P V IDGYE +P  +E+ALM+ VANQPV+  I + GR        VFDGYCG   NH +VAIGYGT EDGTDYW+++NSWGVGWGE GY R+KRG E
Subjt:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE

TrEMBL top hitse value%identityAlignment
A0A0A0LMU4 Uncharacterized protein7.0e-2663.64Show/hide
Query:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE
        S P V IDGYE VP  +E+ALM+ VANQPV+  I + GR        VFDGYCG   NH +VAIGYGT EDGTDYWL++NSWGVGWGE GY R+KRG E
Subjt:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE

A0A1S3BA70 vignain-like1.2e-2562.63Show/hide
Query:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE
        S P V IDGYE VP  +E+ALM+ VANQPV+  I + GR        VFDGYCG   NH +VAIGYGT EDGTDYW+++NSWGVGWGE GY R+KRG E
Subjt:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTE

A0A6J1GHN5 vignain-like4.3e-2359.18Show/hide
Query:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN
        P VTIDG+E VP  +ENALM+ VANQPV+ +I + GR        VFDG CG   NH +V IGYGT + GTDYW ++NSWGVGWGE GY R+KRG E+
Subjt:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN

A0A6J1K7P4 vignain-like7.3e-2348.84Show/hide
Query:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------------------------------------VFDGYCGPSPNHQLVAIGYGTIE
        S P VTIDGYE VP  +ENALM+ VANQPV+  I + GR                                      VFDGYCG   NH +VAIGYGT E
Subjt:  SPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------------------------------------VFDGYCGPSPNHQLVAIGYGTIE

Query:  DGTDYWLMKNSWGVGWGEGGYGRLKRGTE
        +GTDYW+++NSWGVGWGE GY R+KRG E
Subjt:  DGTDYWLMKNSWGVGWGEGGYGRLKRGTE

A0A6J1KIL0 vignain-like3.0e-2460.2Show/hide
Query:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN
        P VTIDG+E VP  +ENALM+ VANQPV+ +I + GR        VFDGYCG   NH +V IGYGT + GTDYW ++NSWGVGWGE GY R+KRG E+
Subjt:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN

SwissProt top hitse value%identityAlignment
O65039 Vignain2.3e-2150.49Show/hide
Query:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENT
        P V+IDG+E VP  DENAL+K VANQPV+  I + G         VF G CG   +H +  +GYGT  DGT YW +KNSWG  WGE GY R++RG ++  
Subjt:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENT

Query:  GEC
        G C
Subjt:  GEC

P43156 Thiol protease SEN1023.8e-2146.6Show/hide
Query:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRG--------RVFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENT
        P V+IDG++ VP  +ENALM+ VANQP++ +I + G         VF G CG   +H +  +GYG   DGT YW++KNSWG  WGE GY R++RG ++  
Subjt:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRG--------RVFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENT

Query:  GEC
        G+C
Subjt:  GEC

P43297 Cysteine proteinase RD21A1.3e-2149.5Show/hide
Query:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE
        VTID YE VPTY E +L K VA+QP++  I + GR        +FDG CG   +H +VA+GYGT E+G DYW+++NSWG  WGE GY R+ R    ++G+
Subjt:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE

Query:  C
        C
Subjt:  C

Q9LT78 Probable cysteine protease RD21C2.9e-2150.5Show/hide
Query:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE
        VTIDGYE VP  DE +L K +ANQP++  I + GR        VF G CG S +H +VA+GYG+ E G DYW+++NSWG  WGE GY +L+R   E++G+
Subjt:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE

Query:  C
        C
Subjt:  C

Q9STL5 KDEL-tailed cysteine endopeptidase CEP31.7e-2149.5Show/hide
Query:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE
        VTIDG+E+VP  DE  L+K VA+QPV+  I +           VF G CG   NH +V +GYG  ++GT YW+++NSWG  WGEGGY R++RG +EN G 
Subjt:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE

Query:  C
        C
Subjt:  C

Arabidopsis top hitse value%identityAlignment
AT1G47128.1 Granulin repeat cysteine protease family protein9.4e-2349.5Show/hide
Query:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE
        VTID YE VPTY E +L K VA+QP++  I + GR        +FDG CG   +H +VA+GYGT E+G DYW+++NSWG  WGE GY R+ R    ++G+
Subjt:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE

Query:  C
        C
Subjt:  C

AT3G19390.1 Granulin repeat cysteine protease family protein2.1e-2250.5Show/hide
Query:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE
        VTIDGYE VP  DE +L K +ANQP++  I + GR        VF G CG S +H +VA+GYG+ E G DYW+++NSWG  WGE GY +L+R   E++G+
Subjt:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE

Query:  C
        C
Subjt:  C

AT3G48350.1 Cysteine proteinases superfamily protein1.2e-2249.5Show/hide
Query:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE
        VTIDG+E+VP  DE  L+K VA+QPV+  I +           VF G CG   NH +V +GYG  ++GT YW+++NSWG  WGEGGY R++RG +EN G 
Subjt:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG-TENTGE

Query:  C
        C
Subjt:  C

AT4G11310.1 Papain family cysteine protease1.0e-2153.12Show/hide
Query:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN
        V IDGYE +P  DE+ALMK VA+QPV   I S  R        VFDG CG + NH +V +GYGT E+G DYWL+KNS G+ WGE GY ++ R   N
Subjt:  VTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTEN

AT5G50260.1 Cysteine proteinases superfamily protein1.8e-2151.58Show/hide
Query:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG
        P V+IDG+E VP   E+ LMK VANQPV+  I + G         VF G CG   NH +  +GYGT  DGT YW++KNSWG  WGE GY R++RG
Subjt:  PKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGR--------VFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCGCCGCCAAAGGTAACAATTGATGGATACGAATATGTTCCGACATATGACGAGAATGCTCTGATGAAAGTTGTGGCGAACCAACCGGTCGCATGTACCATAGC
ATCAAGAGGAAGAGTGTTCGATGGATATTGTGGACCATCACCTAATCACCAACTTGTGGCGATTGGCTATGGAACAATTGAAGATGGAACAGATTATTGGCTCATGAAAA
ATTCATGGGGAGTTGGATGGGGAGAGGGAGGCTACGGAAGGTTGAAACGAGGAACCGAGAATACGGGGGAGTGTGTGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTCGCCGCCAAAGGTAACAATTGATGGATACGAATATGTTCCGACATATGACGAGAATGCTCTGATGAAAGTTGTGGCGAACCAACCGGTCGCATGTACCATAGC
ATCAAGAGGAAGAGTGTTCGATGGATATTGTGGACCATCACCTAATCACCAACTTGTGGCGATTGGCTATGGAACAATTGAAGATGGAACAGATTATTGGCTCATGAAAA
ATTCATGGGGAGTTGGATGGGGAGAGGGAGGCTACGGAAGGTTGAAACGAGGAACCGAGAATACGGGGGAGTGTGTGGATTAG
Protein sequenceShow/hide protein sequence
MNSPPKVTIDGYEYVPTYDENALMKVVANQPVACTIASRGRVFDGYCGPSPNHQLVAIGYGTIEDGTDYWLMKNSWGVGWGEGGYGRLKRGTENTGECVD