; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008326 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008326
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag protease polyprotein
Genome locationchr9:17684438..17685101
RNA-Seq ExpressionLag0008326
SyntenyLag0008326
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025929.1 gag protease polyprotein [Cucumis melo var. makuwa]6.5e-0928.93Show/hide
Query:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSIDRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTANFQRSQHSS-------EG
        + CP+ Q+V CA+FML      WW + +R ++  V  +    ++ +  A  F      +     ++   N  Q  +      N Q   +SS       E 
Subjt:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSIDRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTANFQRSQHSS-------EG

Query:  SRQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS
         R + +  + E     KP C +CG+HH G+CL G    FKC +EGH A+ CP +  GG+
Subjt:  SRQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS

KAA0033018.1 gag protease polyprotein [Cucumis melo var. makuwa]1.4e-0831.74Show/hide
Query:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSI--DRVVKALGPVDYEVALRAATFMGMPIVNATPV------AKESEPNAR-----QKRKHEQTTANFQRS
        M CPE Q+V CAVFML      WW + +R +  D+    +   D E  + +   + M    A  V      A  S+   R     QKRK EQ      + 
Subjt:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSI--DRVVKALGPVDYEVALRAATFMGMPIVNATPV------AKESEPNAR-----QKRKHEQTTANFQRS

Query:  QHSSEGSRQKTQHGKQEGD--GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS
             G  ++ Q    +       KP C +CG+HH G+CL G    FKC +EGH A+ CP +  G +
Subjt:  QHSSEGSRQKTQHGKQEGD--GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS

KAA0047194.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]8.4e-0927.96Show/hide
Query:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKR--------------------------------SIDRVVKALGPVDYEVALRAATFMGM-PIVNATPVAKES
        M CPE Q+V CAVFML      WW + +R                                +I  +V+A  P  +  ALR A  + +    N++  A   
Subjt:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKR--------------------------------SIDRVVKALGPVDYEVALRAATFMGM-PIVNATPVAKES

Query:  EPNARQKRKHEQTTANFQRSQHSSEGSRQKTQHGKQEGD-GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS
          + ++++  +Q     QR+  S    R+  Q   + G+    KP C +CG+HH G+CL      FKC +EGH A+ CP +  G +
Subjt:  EPNARQKRKHEQTTANFQRSQHSSEGSRQKTQHGKQEGD-GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS

XP_022932138.1 uncharacterized protein LOC111438460 [Cucurbita moschata]8.4e-0933.53Show/hide
Query:  CPEAQQVSCAVFMLRGDTLLWWRSAKRSI-----------------------DRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQ
        CPEA +V CA FMLR D  LWW++ +  I                       DR    L   DY  A R    +G+     T + K  E  A        
Subjt:  CPEAQQVSCAVFMLRGDTLLWWRSAKRSI-----------------------DRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQ

Query:  TTANFQRSQHSSEGS-----RQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNC
          A       SS  +      QK +H + E   +DKPKCN CG++HWGQCL      F+C KE HMA +C
Subjt:  TTANFQRSQHSSEGS-----RQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNC

XP_022934554.1 uncharacterized protein LOC111441700 [Cucurbita moschata]1.3e-0927.96Show/hide
Query:  CPEAQQVSCAVFMLRGDTLLWWRSAK---------------------------------------------------------------RSIDRVVKALG
        CPEA +V CA+FML  D  LWW++ +                                                                 I   VKA+ 
Subjt:  CPEAQQVSCAVFMLRGDTLLWWRSAK---------------------------------------------------------------RSIDRVVKALG

Query:  PVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTAN---FQRSQHSSEGSR---QKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFK
        P  Y  ALRAA  M  P  +  P +       RQKR+ +QT  N   F ++Q  S+  R   ++ Q G  E     +PKC  C ++HWGQCL      F+
Subjt:  PVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTAN---FQRSQHSSEGSR---QKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFK

Query:  CHKEGHMANNC
        C K GH+A +C
Subjt:  CHKEGHMANNC

TrEMBL top hitse value%identityAlignment
A0A5A7SKZ4 Gag protease polyprotein3.1e-0928.93Show/hide
Query:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSIDRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTANFQRSQHSS-------EG
        + CP+ Q+V CA+FML      WW + +R ++  V  +    ++ +  A  F      +     ++   N  Q  +      N Q   +SS       E 
Subjt:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSIDRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTANFQRSQHSS-------EG

Query:  SRQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS
         R + +  + E     KP C +CG+HH G+CL G    FKC +EGH A+ CP +  GG+
Subjt:  SRQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS

A0A5A7TW65 DNA/RNA polymerases superfamily protein4.1e-0927.96Show/hide
Query:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKR--------------------------------SIDRVVKALGPVDYEVALRAATFMGM-PIVNATPVAKES
        M CPE Q+V CAVFML      WW + +R                                +I  +V+A  P  +  ALR A  + +    N++  A   
Subjt:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKR--------------------------------SIDRVVKALGPVDYEVALRAATFMGM-PIVNATPVAKES

Query:  EPNARQKRKHEQTTANFQRSQHSSEGSRQKTQHGKQEGD-GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS
          + ++++  +Q     QR+  S    R+  Q   + G+    KP C +CG+HH G+CL      FKC +EGH A+ CP +  G +
Subjt:  EPNARQKRKHEQTTANFQRSQHSSEGSRQKTQHGKQEGD-GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS

A0A5D3BWC8 Gag protease polyprotein7.0e-0931.74Show/hide
Query:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSI--DRVVKALGPVDYEVALRAATFMGMPIVNATPV------AKESEPNAR-----QKRKHEQTTANFQRS
        M CPE Q+V CAVFML      WW + +R +  D+    +   D E  + +   + M    A  V      A  S+   R     QKRK EQ      + 
Subjt:  MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSI--DRVVKALGPVDYEVALRAATFMGMPIVNATPV------AKESEPNAR-----QKRKHEQTTANFQRS

Query:  QHSSEGSRQKTQHGKQEGD--GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS
             G  ++ Q    +       KP C +CG+HH G+CL G    FKC +EGH A+ CP +  G +
Subjt:  QHSSEGSRQKTQHGKQEGD--GNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS

A0A6J1F0T4 uncharacterized protein LOC1114384604.1e-0933.53Show/hide
Query:  CPEAQQVSCAVFMLRGDTLLWWRSAKRSI-----------------------DRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQ
        CPEA +V CA FMLR D  LWW++ +  I                       DR    L   DY  A R    +G+     T + K  E  A        
Subjt:  CPEAQQVSCAVFMLRGDTLLWWRSAKRSI-----------------------DRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQ

Query:  TTANFQRSQHSSEGS-----RQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNC
          A       SS  +      QK +H + E   +DKPKCN CG++HWGQCL      F+C KE HMA +C
Subjt:  TTANFQRSQHSSEGS-----RQKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNC

A0A6J1F2Y2 uncharacterized protein LOC1114417006.3e-1027.96Show/hide
Query:  CPEAQQVSCAVFMLRGDTLLWWRSAK---------------------------------------------------------------RSIDRVVKALG
        CPEA +V CA+FML  D  LWW++ +                                                                 I   VKA+ 
Subjt:  CPEAQQVSCAVFMLRGDTLLWWRSAK---------------------------------------------------------------RSIDRVVKALG

Query:  PVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTAN---FQRSQHSSEGSR---QKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFK
        P  Y  ALRAA  M  P  +  P +       RQKR+ +QT  N   F ++Q  S+  R   ++ Q G  E     +PKC  C ++HWGQCL      F+
Subjt:  PVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTAN---FQRSQHSSEGSR---QKTQHGKQEGDGNDKPKCNSCGRHHWGQCLEGKDVRFK

Query:  CHKEGHMANNC
        C K GH+A +C
Subjt:  CHKEGHMANNC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATGCCCCGAGGCACAACAGGTGTCGTGTGCAGTATTTATGTTAAGGGGCGATACCTTGTTGTGGTGGAGGTCGGCTAAGAGATCCATCGATAGGGTTGTTAAAGC
TCTTGGCCCAGTAGATTACGAAGTGGCCCTTCGAGCGGCCACCTTTATGGGCATGCCAATTGTCAATGCAACTCCAGTAGCCAAAGAGTCGGAGCCCAACGCAAGACAGA
AGAGAAAACACGAGCAGACAACTGCTAACTTCCAACGATCTCAACACTCTTCTGAAGGTTCAAGACAGAAAACTCAGCATGGCAAACAAGAGGGTGATGGTAACGATAAA
CCAAAGTGCAACTCTTGTGGAAGACATCATTGGGGTCAGTGCTTGGAAGGGAAAGACGTGCGTTTCAAATGTCACAAGGAAGGGCATATGGCAAATAACTGCCCTCAGAA
GAAAGTGGGAGGCTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGATGCCCCGAGGCACAACAGGTGTCGTGTGCAGTATTTATGTTAAGGGGCGATACCTTGTTGTGGTGGAGGTCGGCTAAGAGATCCATCGATAGGGTTGTTAAAGC
TCTTGGCCCAGTAGATTACGAAGTGGCCCTTCGAGCGGCCACCTTTATGGGCATGCCAATTGTCAATGCAACTCCAGTAGCCAAAGAGTCGGAGCCCAACGCAAGACAGA
AGAGAAAACACGAGCAGACAACTGCTAACTTCCAACGATCTCAACACTCTTCTGAAGGTTCAAGACAGAAAACTCAGCATGGCAAACAAGAGGGTGATGGTAACGATAAA
CCAAAGTGCAACTCTTGTGGAAGACATCATTGGGGTCAGTGCTTGGAAGGGAAAGACGTGCGTTTCAAATGTCACAAGGAAGGGCATATGGCAAATAACTGCCCTCAGAA
GAAAGTGGGAGGCTCGTGA
Protein sequenceShow/hide protein sequence
MGCPEAQQVSCAVFMLRGDTLLWWRSAKRSIDRVVKALGPVDYEVALRAATFMGMPIVNATPVAKESEPNARQKRKHEQTTANFQRSQHSSEGSRQKTQHGKQEGDGNDK
PKCNSCGRHHWGQCLEGKDVRFKCHKEGHMANNCPQKKVGGS