CuGenDBv2

Lag0031738 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031738
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag protease polyprotein
Genome location: chr11:13271948..13272373
RNA-Seq Expression: Lag0031738
Synteny: Lag0031738
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
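
As a quick consistency check on the genome location, the sketch below parses the coordinate string and computes the span length. It assumes the coordinates are 1-based and inclusive, which is not stated on this page; the helper names `parse_location` and `span_length` are hypothetical. Under that assumption the span is 426 bp, which equals (141 + 1) x 3, the length expected for the 141-residue protein plus a stop codon.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:13271948..13272373")
length = span_length(start, end)
print(chrom, length)  # chr11 426
```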


Homology
GenBank top hits (e value, % identity, alignment)
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia] | e value: 2.0e-33 | identity: 60.5%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        FML+DDA LWW STE+ IDV+ GPVTWL FKE FFQ+YY  I  YRK+ EFL L Q  +SVEEY+ EFT+LS FAPE+VDT+A K +RFI+ LKD+ +  
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLRATTFM
        V  L+P DYAT LR    +
Subjt:  VGALAPIDYATTLRATTFM

XP_022937437.1 uncharacterized protein LOC111443845 [Cucurbita moschata] | e value: 6.6e-24 | identity: 46.49%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        ++LR+DA LWW+ST ++I  +   +TW  F++ F +K + +++RY+K+ EFL++ QG +SVEEYE EFTRLS FAP MV  +  K++ F++GL+ D++ V
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLR
        V    P DYAT L+
Subjt:  VGALAPIDYATTLR

XP_038880159.1 uncharacterized protein LOC120071839 [Benincasa hispida] | e value: 1.7e-27 | identity: 50.82%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        F+L D+A  WWR  E+ I+ + G  TW  FKE F++KY+S  +RY K+AEF+ L QG  +VEEYE +FTRLS FAP++V T+AK+ +RF+ GL+D+V+ +
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLRATTFMGMP
        V AL P +YAT  RA   +G P
Subjt:  VGALAPIDYATTLRATTFMGMP

XP_038882211.1 uncharacterized protein LOC120073433 [Benincasa hispida] | e value: 5.0e-24 | identity: 51.72%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        FML D+A +WW S EK ID N G  TW  FKE F++KY+ST  RY K+ +FL L QG   VEEYE EF +L+HFAP++V TKA +I+RF+ GL+  +  +
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLRAT
        V AL    Y   L+AT
Subjt:  VGALAPIDYATTLRAT

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida] | e value: 1.9e-23 | identity: 49.12%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        FML D A +WW+  E+ + V   PVTW  FKE F+ KY+S  +RY K+ EFL L QG +SVEEY+ EF  LS FAPE+V T+A + +RFI GLK+ ++ +
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLR
        V A  P  +   LR
Subjt:  VGALAPIDYATTLR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VDM7 Gag protease polyprotein | e value: 1.3e-20 | identity: 43.86%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        FML D    WW +TE+ +  + G +TW  FKE F+ K++S  +R  K+ EFL L QG+ +VE+Y+ EF  LS FAPEM+ T+A +  +F+ GL+ D+Q +
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLR
        V A  P  +A  LR
Subjt:  VGALAPIDYATTLR

A0A6J1DSJ6 uncharacterized protein LOC111023512 | e value: 9.9e-34 | identity: 60.5%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        FML+DDA LWW STE+ IDV+ GPVTWL FKE FFQ+YY  I  YRK+ EFL L Q  +SVEEY+ EFT+LS FAPE+VDT+A K +RFI+ LKD+ +  
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLRATTFM
        V  L+P DYAT LR    +
Subjt:  VGALAPIDYATTLRATTFM

A0A6J1EPR7 uncharacterized protein LOC111434529 | e value: 2.5e-21 | identity: 47.42%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDV
        ++LR+DA LWW+S  ++I  +   +TW+ F++ F +KY+  ++RY+K+ EFL++ Q  +SVEEYE EFTRLS FAP MV T+  K++ F++GL+ DV
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDV

A0A6J1EZJ9 uncharacterized protein LOC111437895 | e value: 3.3e-21 | identity: 41.13%
Query:  GCFMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQ
        G +ML+ +A  WW+  +++I    G ++W  FKE +  KYY  + R++ +  FL L QG+K+VE+Y+LEF +L+ F PE V  +  KI RFI GL+ ++Q
Subjt:  GCFMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQ

Query:  RVVGALAPIDYATTLRATTFMGMP
          V      DYA  LR  T M MP
Subjt:  RVVGALAPIDYATTLRATTFMGMP

A0A6J1FB78 uncharacterized protein LOC111443845 | e value: 3.2e-24 | identity: 46.49%
Query:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV
        ++LR+DA LWW+ST ++I  +   +TW  F++ F +K + +++RY+K+ EFL++ QG +SVEEYE EFTRLS FAP MV  +  K++ F++GL+ D++ V
Subjt:  FMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRV

Query:  VGALAPIDYATTLR
        V    P DYAT L+
Subjt:  VGALAPIDYATTLR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
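
The percent identity reported for each hit can in principle be recomputed from the middle line of an alignment. The sketch below assumes the usual BLAST-style convention, in which the middle line carries a letter at identical positions, `+` at conservative substitutions, and a space at mismatches; the function name `percent_identity` is hypothetical, and the input shown is a toy fragment rather than a full alignment from this page.

```python
def percent_identity(query_aligned: str, midline: str) -> float:
    """Percent identity from a BLAST-style midline (hypothetical helper).

    ASSUMPTION: letters in the midline mark identical residues, while '+'
    and spaces mark non-identical positions. The aligned query length is
    used as the denominator because trailing spaces in the midline are
    often stripped during text extraction.
    """
    if not query_aligned:
        raise ValueError("empty alignment")
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query_aligned)


# Toy fragment: five identical positions out of six aligned columns.
print(round(percent_identity("FMLRDD", "FML+DD"), 2))  # 83.33
```

For a hit whose alignment spans several blocks, the aligned query strings and midlines would need to be concatenated before calling the function.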

Sequences
CDS sequence
ATGGGATGCTTTATGTTAAGAGATGATGCTTTGTTGTGGTGGAGGTCGACAGAGAAATCCATCGATGTTAACGCTGGTCCGGTCACTTGGTTGCATTTCAAGGAGGTGTT
CTTCCAGAAATATTACTCGACCATCATCAGGTACAGAAAGGAGGCGGAGTTCCTAGCCTTGATGCAAGGAGAGAAGTCAGTAGAAGAGTACGAACTTGAGTTCACCCGAC
TATCCCATTTTGCCCCTGAAATGGTGGATACGAAGGCAAAGAAAATAAAGAGGTTCATCTTGGGCCTCAAAGACGATGTCCAGAGGGTTGTTGGAGCCCTTGCCCCAATA
GATTACGCAACGACCCTTCGAGCGACCACATTTATGGGCATGCCAATTCTCAATGCAACTCCAATAGCCAAGGAGTCAGAGTCTAACACAAGATAG
mRNA sequence
ATGGGATGCTTTATGTTAAGAGATGATGCTTTGTTGTGGTGGAGGTCGACAGAGAAATCCATCGATGTTAACGCTGGTCCGGTCACTTGGTTGCATTTCAAGGAGGTGTT
CTTCCAGAAATATTACTCGACCATCATCAGGTACAGAAAGGAGGCGGAGTTCCTAGCCTTGATGCAAGGAGAGAAGTCAGTAGAAGAGTACGAACTTGAGTTCACCCGAC
TATCCCATTTTGCCCCTGAAATGGTGGATACGAAGGCAAAGAAAATAAAGAGGTTCATCTTGGGCCTCAAAGACGATGTCCAGAGGGTTGTTGGAGCCCTTGCCCCAATA
GATTACGCAACGACCCTTCGAGCGACCACATTTATGGGCATGCCAATTCTCAATGCAACTCCAATAGCCAAGGAGTCAGAGTCTAACACAAGATAG
Protein sequence
MGCFMLRDDALLWWRSTEKSIDVNAGPVTWLHFKEVFFQKYYSTIIRYRKEAEFLALMQGEKSVEEYELEFTRLSHFAPEMVDTKAKKIKRFILGLKDDVQRVVGALAPI
DYATTLRATTFMGMPILNATPIAKESESNTR
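
The relationship between the CDS and the protein sequence can be checked by translating the nucleotides codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` is hypothetical, and only the first 30 nucleotides and the last 12 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt of the CDS sequence listed above.
print(translate("ATGGGATGCTTTATGTTAAGAGATGATGCT"))  # MGCFMLRDDA
# Last 12 nt of the CDS, ending in the TAG stop codon.
print(translate("AACACAAGATAG"))  # NTR
```

Passing the full 426-nt CDS (with line breaks removed) to the same function should give the 141-residue protein sequence shown above.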