; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007874 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007874
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag protease polyprotein
Genome locationchr9:6877742..6878956
RNA-Seq ExpressionLag0007874
SyntenyLag0007874
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR032567 - LDOC1-related
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036197.1 gag protease polyprotein [Cucumis melo var. makuwa]2.4e-3641.6Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT
        G+ TVE++  EF  LSRFAP ++A E  + ++F+ GL+ +IQG V A  P  +A  +R+A  +  +    SS+      +SG KR+ +Q       R   
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT

Query:  VDRKHRISQ-------------------GLIRIGR----TCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVV
         D + R  Q                   G   +GR    T  CFKC +EGH A  C  + T  A      NQ +G     HQG++FAT R EAE    VV
Subjt:  VDRKHRISQ-------------------GLIRIGR----TCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVV

Query:  TGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
        TGT+P+LGHYA VLFDSGS+HSFIS++FVS A LE+EPL + LSVSTP+G
Subjt:  TGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

KAA0039626.1 gag protease polyprotein [Cucumis melo var. makuwa]7.6e-3844.05Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT
        G+ TVE++  EF  LSRFAP ++A E  + ++F+ GL+ +IQG V A  P  +A  +R+   +  +    SS+      +SG KR+  Q      V  R 
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT

Query:  VDRKHRISQGLIRIGRTCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVVTGTIPILGHYAFVLFDSGSTHSF
            H + + L    RT  CFKC +EGH A  C  + T      ++ NQ +G     HQG++FAT R EAE    VVTGT+P+LGHYA VLFDSGS+HSF
Subjt:  VDRKHRISQGLIRIGRTCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVVTGTIPILGHYAFVLFDSGSTHSF

Query:  ISASFVSQANLELEPLGYDLSVSTPAG
        IS++FVS A LE+EPL + LSVSTP+G
Subjt:  ISASFVSQANLELEPLGYDLSVSTPAG

KAA0061214.1 gag protease polyprotein [Cucumis melo var. makuwa]1.9e-3642.29Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSEPSS-----GHKRRYDQMSSLD---EVR
        GN TVE++  EF  LSRFAP +V  E  + E+F+ GL+ ++QG V A  P  +A  +R+A  +     V SS+ +S     G KR+ +   +L    ++R
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSEPSS-----GHKRRYDQMSSLD---EVR

Query:  PRTVDRKHRISQGLIRIGRTC----------------------VCFKCGREGHIAPNCNKKK-TNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLN
         + V ++HR  Q L   GRT                       VCF+C + GH A  C +K       QPS          T  QG++FATTRQEAE   
Subjt:  PRTVDRKHRISQGLIRIGRTC----------------------VCFKCGREGHIAPNCNKKK-TNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLN

Query:  VVVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
         VVTGT+PILGHYAFVLFDSGS+HSFIS+ FV    LE+EPLG  +SVSTP+G
Subjt:  VVVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

XP_022937437.1 uncharacterized protein LOC111443845 [Cucurbita moschata]5.6e-4142.06Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----------------------RRPSVISSEPSSGH
        GNR+VEE++ EF RLSRFAP +V++E  KVE F+ GL+ +I+G V+ H+P DYAT +++AE +D                      R  +  +    SGH
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----------------------RRPSVISSEPSSGH

Query:  --KRRYDQMSSLDEVRPRTVDR-KHRISQGLIRIGRTC-----VCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNV
          K   +Q     +  P+   R + R S       R C     VCF   +EGH+  +C + +    ++P  +  +S +RP   QG+++ATTRQEAEN N 
Subjt:  --KRRYDQMSSLDEVRPRTVDR-KHRISQGLIRIGRTC-----VCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNV

Query:  VVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
         VT T+ ILG+YA VLFDSGSTHSFIS + V  A +E+EPLGY LSV TPAG
Subjt:  VVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

XP_022962669.1 uncharacterized protein LOC111463090 [Cucurbita moschata]3.6e-4043.09Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----RRPSVISSEPSSGHKRRYDQMS--SLDEVRPR
        GNR+VEE++ EF RLSRF P +VA E +K + FI GL++EIQGSV AH  QD+ T    A  +D    R     SS+     KR++ Q+S  S  +  P+
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----RRPSVISSEPSSGHKRRYDQMS--SLDEVRPR

Query:  TVDRKHRISQGLIRIGR-------TC-----VCFKCGREGHIAPNCNKKKTNYADQPSSS------NQASGVRPTQHQGKIFATTRQEAENLNVVVTGTI
         VDR+    +G     +       +C     +C+ CG+ GH+A  C   K     +P  S       Q S     Q +GK FATT + A + + VVTGT+
Subjt:  TVDRKHRISQGLIRIGR-------TC-----VCFKCGREGHIAPNCNKKKTNYADQPSSS------NQASGVRPTQHQGKIFATTRQEAENLNVVVTGTI

Query:  PILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
        PI G++A+ +FDSGSTHSFIS SFV QA LE+EPLGY++ VSTP+G
Subjt:  PILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

TrEMBL top hitse value%identityAlignment
A0A5A7T3M7 Gag protease polyprotein1.2e-3641.6Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT
        G+ TVE++  EF  LSRFAP ++A E  + ++F+ GL+ +IQG V A  P  +A  +R+A  +  +    SS+      +SG KR+ +Q       R   
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT

Query:  VDRKHRISQ-------------------GLIRIGR----TCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVV
         D + R  Q                   G   +GR    T  CFKC +EGH A  C  + T  A      NQ +G     HQG++FAT R EAE    VV
Subjt:  VDRKHRISQ-------------------GLIRIGR----TCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVV

Query:  TGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
        TGT+P+LGHYA VLFDSGS+HSFIS++FVS A LE+EPL + LSVSTP+G
Subjt:  TGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

A0A5A7TC57 Gag protease polyprotein3.7e-3844.05Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT
        G+ TVE++  EF  LSRFAP ++A E  + ++F+ GL+ +IQG V A  P  +A  +R+   +  +    SS+      +SG KR+  Q      V  R 
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSE-----PSSGHKRRYDQMSSLDEVRPRT

Query:  VDRKHRISQGLIRIGRTCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVVTGTIPILGHYAFVLFDSGSTHSF
            H + + L    RT  CFKC +EGH A  C  + T      ++ NQ +G     HQG++FAT R EAE    VVTGT+P+LGHYA VLFDSGS+HSF
Subjt:  VDRKHRISQGLIRIGRTCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNVVVTGTIPILGHYAFVLFDSGSTHSF

Query:  ISASFVSQANLELEPLGYDLSVSTPAG
        IS++FVS A LE+EPL + LSVSTP+G
Subjt:  ISASFVSQANLELEPLGYDLSVSTPAG

A0A5A7V1M1 Gag protease polyprotein9.1e-3742.29Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSEPSS-----GHKRRYDQMSSLD---EVR
        GN TVE++  EF  LSRFAP +V  E  + E+F+ GL+ ++QG V A  P  +A  +R+A  +     V SS+ +S     G KR+ +   +L    ++R
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSEPSS-----GHKRRYDQMSSLD---EVR

Query:  PRTVDRKHRISQGLIRIGRTC----------------------VCFKCGREGHIAPNCNKKK-TNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLN
         + V ++HR  Q L   GRT                       VCF+C + GH A  C +K       QPS          T  QG++FATTRQEAE   
Subjt:  PRTVDRKHRISQGLIRIGRTC----------------------VCFKCGREGHIAPNCNKKK-TNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLN

Query:  VVVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
         VVTGT+PILGHYAFVLFDSGS+HSFIS+ FV    LE+EPLG  +SVSTP+G
Subjt:  VVVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

A0A6J1FB78 uncharacterized protein LOC1114438452.7e-4142.06Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----------------------RRPSVISSEPSSGH
        GNR+VEE++ EF RLSRFAP +V++E  KVE F+ GL+ +I+G V+ H+P DYAT +++AE +D                      R  +  +    SGH
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----------------------RRPSVISSEPSSGH

Query:  --KRRYDQMSSLDEVRPRTVDR-KHRISQGLIRIGRTC-----VCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNV
          K   +Q     +  P+   R + R S       R C     VCF   +EGH+  +C + +    ++P  +  +S +RP   QG+++ATTRQEAEN N 
Subjt:  --KRRYDQMSSLDEVRPRTVDR-KHRISQGLIRIGRTC-----VCFKCGREGHIAPNCNKKKTNYADQPSSSNQASGVRPTQHQGKIFATTRQEAENLNV

Query:  VVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
         VT T+ ILG+YA VLFDSGSTHSFIS + V  A +E+EPLGY LSV TPAG
Subjt:  VVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

A0A6J1HHR1 uncharacterized protein LOC1114630901.8e-4043.09Show/hide
Query:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----RRPSVISSEPSSGHKRRYDQMS--SLDEVRPR
        GNR+VEE++ EF RLSRF P +VA E +K + FI GL++EIQGSV AH  QD+ T    A  +D    R     SS+     KR++ Q+S  S  +  P+
Subjt:  GNRTVEEFKTEFARLSRFAPALVAVEVDKVERFITGLKEEIQGSVVAHEPQDYATTIRVAELID----RRPSVISSEPSSGHKRRYDQMS--SLDEVRPR

Query:  TVDRKHRISQGLIRIGR-------TC-----VCFKCGREGHIAPNCNKKKTNYADQPSSS------NQASGVRPTQHQGKIFATTRQEAENLNVVVTGTI
         VDR+    +G     +       +C     +C+ CG+ GH+A  C   K     +P  S       Q S     Q +GK FATT + A + + VVTGT+
Subjt:  TVDRKHRISQGLIRIGR-------TC-----VCFKCGREGHIAPNCNKKKTNYADQPSSS------NQASGVRPTQHQGKIFATTRQEAENLNVVVTGTI

Query:  PILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG
        PI G++A+ +FDSGSTHSFIS SFV QA LE+EPLGY++ VSTP+G
Subjt:  PILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCATGGACAAAATCTACTAGAAAACCCACAAGTGGAACAACCGGGGCCTGCGGCAGGACCTGTCCCAGTGTATATCCAGGCAATAGTTCAGACTGCAGTTCAGGC
TGTAGTTACGGGCGTGCTTGCAGGGCGACAGGCCCAAGCGCCTCAGAACAACGAAACCCTATCGCGGGAGGCTAGCCTCCCGATTCAAGAAGCAAGCAGAGTTTGTGGCT
CTTATGCAGGAAACCGAACCGTAGAGGAATTCAAGACAGAGTTCGCCAGACTTTCTCGGTTTGCTCCTGCCCTGGTAGCTGTAGAGGTTGATAAGGTAGAACGTTTCATC
ACAGGCTTAAAGGAAGAGATTCAAGGCAGTGTGGTAGCCCATGAACCCCAAGACTATGCCACAACAATCAGGGTGGCAGAGCTGATTGATCGACGACCGTCAGTTATCTC
TTCGGAACCTTCCTCAGGTCATAAGCGAAGGTATGATCAGATGAGCTCCCTAGACGAGGTCCGCCCCAGGACCGTCGACAGAAAGCACCGAATCAGCCAGGGCCTAATCA
GGATAGGCCGGACCTGTGTTTGTTTCAAGTGTGGCCGGGAAGGGCATATTGCTCCGAATTGTAATAAGAAGAAAACTAATTATGCCGACCAGCCGTCCAGTTCAAATCAG
GCATCCGGGGTAAGACCAACACAACATCAGGGCAAGATATTTGCCACCACCCGACAAGAGGCAGAGAACCTTAACGTAGTAGTGACAGGTACCATCCCTATTCTCGGCCA
TTATGCTTTTGTGTTGTTTGATTCGGGTTCAACGCATTCTTTTATTTCTGCATCGTTTGTAAGTCAAGCCAACCTAGAACTAGAGCCGTTAGGCTATGATTTGTCAGTCT
CAACCCCTGCGGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCATGGACAAAATCTACTAGAAAACCCACAAGTGGAACAACCGGGGCCTGCGGCAGGACCTGTCCCAGTGTATATCCAGGCAATAGTTCAGACTGCAGTTCAGGC
TGTAGTTACGGGCGTGCTTGCAGGGCGACAGGCCCAAGCGCCTCAGAACAACGAAACCCTATCGCGGGAGGCTAGCCTCCCGATTCAAGAAGCAAGCAGAGTTTGTGGCT
CTTATGCAGGAAACCGAACCGTAGAGGAATTCAAGACAGAGTTCGCCAGACTTTCTCGGTTTGCTCCTGCCCTGGTAGCTGTAGAGGTTGATAAGGTAGAACGTTTCATC
ACAGGCTTAAAGGAAGAGATTCAAGGCAGTGTGGTAGCCCATGAACCCCAAGACTATGCCACAACAATCAGGGTGGCAGAGCTGATTGATCGACGACCGTCAGTTATCTC
TTCGGAACCTTCCTCAGGTCATAAGCGAAGGTATGATCAGATGAGCTCCCTAGACGAGGTCCGCCCCAGGACCGTCGACAGAAAGCACCGAATCAGCCAGGGCCTAATCA
GGATAGGCCGGACCTGTGTTTGTTTCAAGTGTGGCCGGGAAGGGCATATTGCTCCGAATTGTAATAAGAAGAAAACTAATTATGCCGACCAGCCGTCCAGTTCAAATCAG
GCATCCGGGGTAAGACCAACACAACATCAGGGCAAGATATTTGCCACCACCCGACAAGAGGCAGAGAACCTTAACGTAGTAGTGACAGGTACCATCCCTATTCTCGGCCA
TTATGCTTTTGTGTTGTTTGATTCGGGTTCAACGCATTCTTTTATTTCTGCATCGTTTGTAAGTCAAGCCAACCTAGAACTAGAGCCGTTAGGCTATGATTTGTCAGTCT
CAACCCCTGCGGGGTAA
Protein sequenceShow/hide protein sequence
MGHGQNLLENPQVEQPGPAAGPVPVYIQAIVQTAVQAVVTGVLAGRQAQAPQNNETLSREASLPIQEASRVCGSYAGNRTVEEFKTEFARLSRFAPALVAVEVDKVERFI
TGLKEEIQGSVVAHEPQDYATTIRVAELIDRRPSVISSEPSSGHKRRYDQMSSLDEVRPRTVDRKHRISQGLIRIGRTCVCFKCGREGHIAPNCNKKKTNYADQPSSSNQ
ASGVRPTQHQGKIFATTRQEAENLNVVVTGTIPILGHYAFVLFDSGSTHSFISASFVSQANLELEPLGYDLSVSTPAG