CuGenDBv2

Lag0008242 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008242
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag protease polyprotein
Genome location: chr9:15474926..15475992
RNA-Seq Expression: Lag0008242
Synteny: Lag0008242
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033513.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 1.7e-04 | % identity: 53.06
Query:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS
        C RK+  FNPPS ASFKFKG   K   +V+ A++A K++ +G WG+LAS
Subjt:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS

KAA0033513.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 1.5e-32 | % identity: 40.34
Query:  LAEDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQ-NNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL-
        + E  +   P+  PAPA  PA V           PQ   + +S EA  LRDFRK++P  FDG+  DPT A++WLSS+ET+FR+M  PED KV C VF+L 
Subjt:  LAEDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQ-NNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL-

Query:  ---------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAH
                                   E F+ K++ A+ R  K+ EF+ L+QG  TVE+Y+ EF  LSRFAP ++ATEA    +F+ GL+ +IQG V A 
Subjt:  ---------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAH

Query:  RPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK
        RP  +V A+R+A    L +R     T     TSG KRK
Subjt:  RPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK

KAA0062520.1 pol protein [Cucumis melo var. makuwa] | e value: 8.7e-33 | % identity: 39.83
Query:  EDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----
        + P    P   PAPA  P  VA  +V          + +S EA  LRDFRK++P  FDG+  DPT A+LWLSS+ET+FR+M  PED KV C VF+L    
Subjt:  EDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----

Query:  ------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQ
                                E F+ K++ A+ R  K+ EF+ L+QG  TVE+Y+ EF  LSRFAP ++ATEA    +F+ GL+ +IQG V A RP 
Subjt:  ------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQ

Query:  DYVMAVRVA-----ELIDRRPMTVPPKPTSGHKRKP
         +  A+R+A     +  D          TSG KRKP
Subjt:  DYVMAVRVA-----ELIDRRPMTVPPKPTSGHKRKP

KAA0062520.1 pol protein [Cucumis melo var. makuwa] | e value: 3.8e-04 | % identity: 51.02
Query:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS
        C RK+  FNPPS ASFKFKG   +   +V+ A++A K++ +G WG+LAS
Subjt:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS

KAA0062520.1 pol protein [Cucumis melo var. makuwa] | e value: 1.1e-32 | % identity: 41.32
Query:  EDPQVEHPDEQPAPAAKPAAV-------AGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVV
        + P    P   PAPA+ PA V       A V+V  +    Q    +S EA  LRDFRK++P  FDG+  DPT A+LWLSS+ET+FR+M  PED KV C V
Subjt:  EDPQVEHPDEQPAPAAKPAAV-------AGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVV

Query:  FLL----------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGS
        F+L                            E F+ K++ A+ R  K+ EF+ L+QG  TVE+YE EF  LSRFAP ++ATEA    +F+ GL+ +IQG 
Subjt:  FLL----------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGS

Query:  VTAHRPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK
        V A RP  +  A+R+A    L +R     TV    TSG KRK
Subjt:  VTAHRPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK

KAA0066456.1 pol protein [Cucumis melo var. makuwa] | e value: 1.7e-04 | % identity: 53.06
Query:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS
        C RK+  FNPPS ASFKFKG   K   +V+ A++A K++ +G WG+LAS
Subjt:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS

XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia] | e value: 2.4e-35 | % identity: 46.15
Query:  VAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL------------------------
        V   +  Q TQ  QN  ++S EA  LRDF+K+DPR FDG S DP +AE WLS +ET+FR+M   E+ KV C VF+L                        
Subjt:  VAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL------------------------

Query:  ----EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAVRVAELIDRR
            E FF++YYPA + +RK+ EF+ LKQ +R+VEEY+ EF +LSRFAP LV TEA    RFI  LK+E +G V    P DY  A+R A LID R
Subjt:  ----EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAVRVAELIDRR

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida] | e value: 4.6e-34 | % identity: 43.81
Query:  KTQTPQNNETM---SQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----------------------------E
        +TQ    N++M   S EA  LRDF+K++P  F+G+ +DPT AELW+S IET+FR+M  PED KV C VF+L                            E
Subjt:  KTQTPQNNETM---SQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----------------------------E

Query:  KFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAVRVAELIDRR-----PMTVP
        +F+ KY+ A  R+ K+ EF+ L+QG R+VEEY+ EF  LSRFAP LVATEA    RFI GLKE I+G V A +P  +V A+R+A  +D +      +   
Subjt:  KFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAVRVAELIDRR-----PMTVP

Query:  PKPTSGHKRK
          P+SG KRK
Subjt:  PKPTSGHKRK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SSS7 Gag protease polyprotein | e value: 8.2e-05 | % identity: 53.06
Query:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS
        C RK+  FNPPS ASFKFKG   K   +V+ A++A K++ +G WG+LAS
Subjt:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS

A0A5A7SSS7 Gag protease polyprotein | e value: 7.1e-33 | % identity: 40.61
Query:  PDEQPAPAAKPAAVAGVIVGQKTQTPQ-NNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----------
        P   PAPA  PA V           PQ   + +S EA  LRDFRK++P  FDG+  DPT A++WLSS+ET+FR+M  PED KV C VF+L          
Subjt:  PDEQPAPAAKPAAVAGVIVGQKTQTPQ-NNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----------

Query:  ------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAV
                          E F+ K++ A+ R  K+ EF+ L+QG  TVE+Y+ EF  LSRFAP ++ATEA    +F+ GL+ +IQG V A RP  +  A+
Subjt:  ------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAV

Query:  RVAELIDRRPMTVPPK-----PTSGHKRK
        R+A  +  + M    K      TSG KRK
Subjt:  RVAELIDRRPMTVPPK-----PTSGHKRK

A0A5A7UAA8 Reverse transcriptase | e value: 1.4e-04 | % identity: 53.06
Query:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS
        C RK   FNPPS ASFKFKG   K   +V+ A++A K++ +G WG+LAS
Subjt:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS

A0A5A7UAA8 Reverse transcriptase | e value: 7.1e-33 | % identity: 40.34
Query:  LAEDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQ-NNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL-
        + E  +   P+  PAPA  PA V           PQ   + +S EA  LRDFRK++P  FDG+  DPT A++WLSS+ET+FR+M  PED KV C VF+L 
Subjt:  LAEDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQ-NNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL-

Query:  ---------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAH
                                   E F+ K++ A+ R  K+ EF+ L+QG  TVE+Y+ EF  LSRFAP ++ATEA    +F+ GL+ +IQG V A 
Subjt:  ---------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAH

Query:  RPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK
        RP  +V A+R+A    L +R     T     TSG KRK
Subjt:  RPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK

A0A5A7V5L6 Reverse transcriptase | e value: 4.2e-33 | % identity: 39.83
Query:  EDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----
        + P    P   PAPA  P  VA  +V          + +S EA  LRDFRK++P  FDG+  DPT A+LWLSS+ET+FR+M  PED KV C VF+L    
Subjt:  EDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL----

Query:  ------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQ
                                E F+ K++ A+ R  K+ EF+ L+QG  TVE+Y+ EF  LSRFAP ++ATEA    +F+ GL+ +IQG V A RP 
Subjt:  ------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQ

Query:  DYVMAVRVA-----ELIDRRPMTVPPKPTSGHKRKP
         +  A+R+A     +  D          TSG KRKP
Subjt:  DYVMAVRVA-----ELIDRRPMTVPPKPTSGHKRKP

A0A5A7V5L6 Reverse transcriptase | e value: 1.8e-04 | % identity: 51.02
Query:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS
        C RK+  FNPPS ASFKFKG   +   +V+ A++A K++ +G WG+LAS
Subjt:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS

A0A5A7V5L6 Reverse transcriptase | e value: 5.5e-33 | % identity: 41.32
Query:  EDPQVEHPDEQPAPAAKPAAV-------AGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVV
        + P    P   PAPA+ PA V       A V+V  +    Q    +S EA  LRDFRK++P  FDG+  DPT A+LWLSS+ET+FR+M  PED KV C V
Subjt:  EDPQVEHPDEQPAPAAKPAAV-------AGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVV

Query:  FLL----------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGS
        F+L                            E F+ K++ A+ R  K+ EF+ L+QG  TVE+YE EF  LSRFAP ++ATEA    +F+ GL+ +IQG 
Subjt:  FLL----------------------------EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGS

Query:  VTAHRPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK
        V A RP  +  A+R+A    L +R     TV    TSG KRK
Subjt:  VTAHRPQDYVMAVRVA---ELIDR--RPMTVPPKPTSGHKRK

A0A5A7VJE2 Reverse transcriptase | e value: 8.2e-05 | % identity: 53.06
Query:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS
        C RK+  FNPPS ASFKFKG   K   +V+ A++A K++ +G WG+LAS
Subjt:  CYRKKAVFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLAS

A0A6J1DSJ6 uncharacterized protein LOC111023512 | e value: 1.2e-35 | % identity: 46.15
Query:  VAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL------------------------
        V   +  Q TQ  QN  ++S EA  LRDF+K+DPR FDG S DP +AE WLS +ET+FR+M   E+ KV C VF+L                        
Subjt:  VAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLL------------------------

Query:  ----EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAVRVAELIDRR
            E FF++YYPA + +RK+ EF+ LKQ +R+VEEY+ EF +LSRFAP LV TEA    RFI  LK+E +G V    P DY  A+R A LID R
Subjt:  ----EKFFRKYYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAVRVAELIDRR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGGTCAGGAAAATAATCTGGCAGAAGACCCACAAGTTGAGCATCCGGATGAGCAACCGGCACCTGCAGCAAAACCTGCGGCAGTTGCAGGCGTGATTGTAGGCCAGAA
GACCCAAACGCCTCAGAACAATGAAACCATGTCGCAAGAGGCAAGTTGTTTAAGGGATTTTAGAAAGTGGGACCCCCGTCCGTTTGATGGAGCATCAAGGGACCCAACAG
TGGCAGAGTTGTGGTTGTCCTCCATTGAAACTGTCTTTCGCCACATGAACTATCCGGAAGACCATAAAGTTTATTGTGTCGTGTTCCTGTTGGAGAAGTTCTTTAGGAAG
TATTACCCTGCAGCCTCCCGTTTTAGAAAGAAAGCAGAGTTTGTGGCTCTTAAGCAGGGGAGCCGAACCGTAGAGGAATATGAGACGGAGTTTGCTAGACTATCTCGATT
TGCTCCTGTCCTAGTAGCTACAGAGGCTGATATGGTGGTACGATTTATCACGGGCTTAAAGGAAGAGATACAAGGCAGTGTGACAGCCCATCGACCACAAGATTATGTCA
TGGCAGTCAGGGTGGCAGAGTTGATTGATCGACGACCGATGACTGTCCCTCCAAAACCTACCTCAGGTCATAAGCGAAAGCCTGTCAAATATTGTTATCGTAAGAAGGCT
GTCTTCAACCCTCCATCAGAGGCGAGCTTCAAGTTTAAAGGATCAATGATCAAAGGTGCTTCTAAAGTAGTCTTTGCAATGAAGGCGAGAAAAATGATCGGCAGAGGAGC
TTGGGGTTTACTGGCTAGTGATAGCGCAATTTATTGA
mRNA sequence
ATGGGTCAGGAAAATAATCTGGCAGAAGACCCACAAGTTGAGCATCCGGATGAGCAACCGGCACCTGCAGCAAAACCTGCGGCAGTTGCAGGCGTGATTGTAGGCCAGAA
GACCCAAACGCCTCAGAACAATGAAACCATGTCGCAAGAGGCAAGTTGTTTAAGGGATTTTAGAAAGTGGGACCCCCGTCCGTTTGATGGAGCATCAAGGGACCCAACAG
TGGCAGAGTTGTGGTTGTCCTCCATTGAAACTGTCTTTCGCCACATGAACTATCCGGAAGACCATAAAGTTTATTGTGTCGTGTTCCTGTTGGAGAAGTTCTTTAGGAAG
TATTACCCTGCAGCCTCCCGTTTTAGAAAGAAAGCAGAGTTTGTGGCTCTTAAGCAGGGGAGCCGAACCGTAGAGGAATATGAGACGGAGTTTGCTAGACTATCTCGATT
TGCTCCTGTCCTAGTAGCTACAGAGGCTGATATGGTGGTACGATTTATCACGGGCTTAAAGGAAGAGATACAAGGCAGTGTGACAGCCCATCGACCACAAGATTATGTCA
TGGCAGTCAGGGTGGCAGAGTTGATTGATCGACGACCGATGACTGTCCCTCCAAAACCTACCTCAGGTCATAAGCGAAAGCCTGTCAAATATTGTTATCGTAAGAAGGCT
GTCTTCAACCCTCCATCAGAGGCGAGCTTCAAGTTTAAAGGATCAATGATCAAAGGTGCTTCTAAAGTAGTCTTTGCAATGAAGGCGAGAAAAATGATCGGCAGAGGAGC
TTGGGGTTTACTGGCTAGTGATAGCGCAATTTATTGA
Protein sequence
MGQENNLAEDPQVEHPDEQPAPAAKPAAVAGVIVGQKTQTPQNNETMSQEASCLRDFRKWDPRPFDGASRDPTVAELWLSSIETVFRHMNYPEDHKVYCVVFLLEKFFRK
YYPAASRFRKKAEFVALKQGSRTVEEYETEFARLSRFAPVLVATEADMVVRFITGLKEEIQGSVTAHRPQDYVMAVRVAELIDRRPMTVPPKPTSGHKRKPVKYCYRKKA
VFNPPSEASFKFKGSMIKGASKVVFAMKARKMIGRGAWGLLASDSAIY