; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015604 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015604
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag protease polyprotein
Genome locationchr12:17490927..17492051
RNA-Seq ExpressionLag0015604
SyntenyLag0015604
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia]3.9e-3138.83Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG
        P TV+++K  E L  TQG++ V QY+RKFTELSRF +      +   +      RR  +      E T Y    +R ALV+DK L +  Q+   +GS+SG
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG

Query:  LK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVED
        +K             G  H ++R        S  ++   P      +   C  E  +     + G                QGG Q+ARVFAL    VE 
Subjt:  LK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVED

Query:  -DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
         +AVVTGTILVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  -DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]2.6e-3541.7Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPP--RRNSRNYCP-ECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS
        PETVKD KE E LH  QGT+ V QYERKFTELSRF+L+L  +P E  ++ +     R+  R     +    +   +RGALV+DK+++  A    EVGS+S
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPP--RRNSRNYCP-ECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS

Query:  GLKGSPHPLKRDLCRRHLASSSRSRPRPFLVAMC-------------------------------AAETMWVN---ASQVPGQGGKQKARVFAL-NDEIV
        G+K        DL  R     ++ +  P +   C                               AA T  +       V  QG  Q+ARVFAL   E  
Subjt:  GLKGSPHPLKRDLCRRHLASSSRSRPRPFLVAMC-------------------------------AAETMWVN---ASQVPGQGGKQKARVFAL-NDEIV

Query:  EDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS
        + + VVTGT+LV  VPA+VLFDSGSSH+FISS FV QA L+LEPLGF+L VSTPSG +++A Q +R  E+S
Subjt:  EDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]3.6e-2938.55Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ--GPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGST
        P   +++K  E L  TQG++ V QYERKFTELSRF      +P E  ++ +     RR  +      E T Y    +R ALV+DK L +  Q+   +GS 
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ--GPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGST

Query:  SGLK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIV
        SG+K             G  H  +R        S  ++  RP      +   C  E  +     + G                QGG Q ARVFAL    V
Subjt:  SGLK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIV

Query:  ED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
        E  +AVVTGTIL+L +PA+ LFDSGSSHSFI+S FV  ADL+LE  GF L VSTPSG V++  QV++ G++S  G
Subjt:  ED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]4.6e-3239.92Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ---GPPRRNSRNYCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS
        P TV+++K  E L  TQG++ V QYERKFTELSRF +    +P E  ++ +   G           +    +   +R ALV+DK L +  Q+   +GS+S
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ---GPPRRNSRNYCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS

Query:  GLK-------------GSPHPLKRDLCRRHLASSSRSRPRPFLVAMCAAETMWVNASQVPGQGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLF
        G+K             G  H ++R        S  ++   P  +            +    QGG Q+ARVFAL    VE  +AVVTGTILV+ +PA+ LF
Subjt:  GLK-------------GSPHPLKRDLCRRHLASSSRSRPRPFLVAMCAAETMWVNASQVPGQGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLF

Query:  DSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
        DSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  DSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]6.2e-2938.1Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG
        P TV+++K  E L  TQG++ V +YERKFTELSRF +      +   +      RR  +      E T Y    +R ALV+DK L +  Q+   +GS+SG
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG

Query:  LKGS-------------PHPLKRDLCRRHLASSSRSRPRPFLVA-----MCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVE-
        +K                H ++R        S  +S   P  V       C  E  +     + G                QGG  +ARVFAL    VE 
Subjt:  LKGS-------------PHPLKRDLCRRHLASSSRSRPRPFLVA-----MCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVE-

Query:  DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
         +AVVT T+LVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase1.8e-2938.55Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ--GPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGST
        P   +++K  E L  TQG++ V QYERKFTELSRF      +P E  ++ +     RR  +      E T Y    +R ALV+DK L +  Q+   +GS 
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ--GPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGST

Query:  SGLK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIV
        SG+K             G  H  +R        S  ++  RP      +   C  E  +     + G                QGG Q ARVFAL    V
Subjt:  SGLK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIV

Query:  ED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
        E  +AVVTGTIL+L +PA+ LFDSGSSHSFI+S FV  ADL+LE  GF L VSTPSG V++  QV++ G++S  G
Subjt:  ED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

A0A6J1DR22 uncharacterized protein LOC1110230351.9e-3138.83Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG
        P TV+++K  E L  TQG++ V QY+RKFTELSRF +      +   +      RR  +      E T Y    +R ALV+DK L +  Q+   +GS+SG
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG

Query:  LK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVED
        +K             G  H ++R        S  ++   P      +   C  E  +     + G                QGG Q+ARVFAL    VE 
Subjt:  LK-------------GSPHPLKRDLCRRHLASSSRSRPRPF-----LVAMCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVED

Query:  -DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
         +AVVTGTILVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  -DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

A0A6J1DTA8 uncharacterized protein LOC1110241142.2e-3239.92Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ---GPPRRNSRNYCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS
        P TV+++K  E L  TQG++ V QYERKFTELSRF +    +P E  ++ +   G           +    +   +R ALV+DK L +  Q+   +GS+S
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQ---GPPRRNSRNYCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS

Query:  GLK-------------GSPHPLKRDLCRRHLASSSRSRPRPFLVAMCAAETMWVNASQVPGQGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLF
        G+K             G  H ++R        S  ++   P  +            +    QGG Q+ARVFAL    VE  +AVVTGTILV+ +PA+ LF
Subjt:  GLK-------------GSPHPLKRDLCRRHLASSSRSRPRPFLVAMCAAETMWVNASQVPGQGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLF

Query:  DSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
        DSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  DSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

A0A6J1DUM2 uncharacterized protein LOC1110232471.3e-3541.7Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPP--RRNSRNYCP-ECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS
        PETVKD KE E LH  QGT+ V QYERKFTELSRF+L+L  +P E  ++ +     R+  R     +    +   +RGALV+DK+++  A    EVGS+S
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPP--RRNSRNYCP-ECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTS

Query:  GLKGSPHPLKRDLCRRHLASSSRSRPRPFLVAMC-------------------------------AAETMWVN---ASQVPGQGGKQKARVFAL-NDEIV
        G+K        DL  R     ++ +  P +   C                               AA T  +       V  QG  Q+ARVFAL   E  
Subjt:  GLKGSPHPLKRDLCRRHLASSSRSRPRPFLVAMC-------------------------------AAETMWVN---ASQVPGQGGKQKARVFAL-NDEIV

Query:  EDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS
        + + VVTGT+LV  VPA+VLFDSGSSH+FISS FV QA L+LEPLGF+L VSTPSG +++A Q +R  E+S
Subjt:  EDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS

A0A6J1DWP4 uncharacterized protein LOC1110252153.0e-2938.1Show/hide
Query:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG
        P TV+++K  E L  TQG++ V +YERKFTELSRF +      +   +      RR  +      E T Y    +R ALV+DK L +  Q+   +GS+SG
Subjt:  PETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSRN--YCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSG

Query:  LKGS-------------PHPLKRDLCRRHLASSSRSRPRPFLVA-----MCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVE-
        +K                H ++R        S  +S   P  V       C  E  +     + G                QGG  +ARVFAL    VE 
Subjt:  LKGS-------------PHPLKRDLCRRHLASSSRSRPRPFLVA-----MCAAETMWVNASQVPG----------------QGGKQKARVFALNDEIVE-

Query:  DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
         +AVVT T+LVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTCCCTCTTTCGATGGGCGTTCTGAGAACCCGTTGGCAGCCGAGCGTTGGACGGCAACCTCGAAGCCATGTTCGACTATATGAACTGCGACGATCGCTTGAAAG
TATGAGGCGTAGTTTTCATGGATCTGCTTTACGATTACTACTTCCCGAGACCGTCAAGGACGACAAAGAGACAGAGCTCCTGCACTATACTCAGGGCACTATGTATGTAA
TCCAGTACGAGCGAAAGTTCACGGAGCTGTCGCGTTTTTCTCTGGATCTGTTTAGCATGCCGAAAGAAAATCAAGAGGTTCATCAAGGGCCTCCGAGAAGAAATTCGAGG
AACTATTGCCCTGAATGCACCTACTACTTTTGTTGCGGCCTCCGTGGGGCGTTGGTCTTGGATAAAAACCTGGCCAAGAATGCACAGACTCACTGGGAGGTCGGTTCGAC
CTCTGGGTTAAAAGGAAGCCCCCACCCACTCAAGCGAGACCTCTGCAGAAGGCACCTCGCCAGCAGTTCCAGAAGCAGACCTCGACCATTCCTTGTTGCAATGTGTGCAG
CAGAAACCATGTGGGTCAATGCGAGTCAGGTTCCTGGTCAAGGTGGCAAGCAAAAAGCTCGTGTTTTTGCCCTGAATGATGAAATTGTGGAGGATGATGCCGTGGTGACA
GGAACTATTCTTGTTTTGAAAGTCCCTGCTTTTGTGTTATTTGACTCGGGGTCGAGTCACTCTTTTATCTCATCGGCGTTTGTCGATCAAGCTGATCTGAAGTTAGAGCC
GCTAGGGTTTGTTCTCTTAGTGTCCACCCCCTCCGGATTTGTGATGCTTGCTCAGCAAGTCTTAAGGACGGGTGAAGTTTCAATCGCGGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCTCCCTCTTTCGATGGGCGTTCTGAGAACCCGTTGGCAGCCGAGCGTTGGACGGCAACCTCGAAGCCATGTTCGACTATATGAACTGCGACGATCGCTTGAAAG
TATGAGGCGTAGTTTTCATGGATCTGCTTTACGATTACTACTTCCCGAGACCGTCAAGGACGACAAAGAGACAGAGCTCCTGCACTATACTCAGGGCACTATGTATGTAA
TCCAGTACGAGCGAAAGTTCACGGAGCTGTCGCGTTTTTCTCTGGATCTGTTTAGCATGCCGAAAGAAAATCAAGAGGTTCATCAAGGGCCTCCGAGAAGAAATTCGAGG
AACTATTGCCCTGAATGCACCTACTACTTTTGTTGCGGCCTCCGTGGGGCGTTGGTCTTGGATAAAAACCTGGCCAAGAATGCACAGACTCACTGGGAGGTCGGTTCGAC
CTCTGGGTTAAAAGGAAGCCCCCACCCACTCAAGCGAGACCTCTGCAGAAGGCACCTCGCCAGCAGTTCCAGAAGCAGACCTCGACCATTCCTTGTTGCAATGTGTGCAG
CAGAAACCATGTGGGTCAATGCGAGTCAGGTTCCTGGTCAAGGTGGCAAGCAAAAAGCTCGTGTTTTTGCCCTGAATGATGAAATTGTGGAGGATGATGCCGTGGTGACA
GGAACTATTCTTGTTTTGAAAGTCCCTGCTTTTGTGTTATTTGACTCGGGGTCGAGTCACTCTTTTATCTCATCGGCGTTTGTCGATCAAGCTGATCTGAAGTTAGAGCC
GCTAGGGTTTGTTCTCTTAGTGTCCACCCCCTCCGGATTTGTGATGCTTGCTCAGCAAGTCTTAAGGACGGGTGAAGTTTCAATCGCGGGCTAG
Protein sequenceShow/hide protein sequence
MALPLSMGVLRTRWQPSVGRQPRSHVRLYELRRSLESMRRSFHGSALRLLLPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKENQEVHQGPPRRNSR
NYCPECTYYFCCGLRGALVLDKNLAKNAQTHWEVGSTSGLKGSPHPLKRDLCRRHLASSSRSRPRPFLVAMCAAETMWVNASQVPGQGGKQKARVFALNDEIVEDDAVVT
GTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG