; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy09g016880 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy09g016880
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionGag protease polyprotein
Genome locationChr09:30974895..30975266
RNA-Seq ExpressionLcy09g016880
SyntenyLcy09g016880
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK01089.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.9e-1541.18Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C E Q+V CA+FML +    WW + ER +   VG                       K  EFL L+QG+R VE+YD EF  LSRF PEM+ TEA +  
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVVGPLPQ
        +F+ GLR D+Q +V   PQ
Subjt:  RFILGLRDDVQRVVGPLPQ

XP_008456947.1 PREDICTED: uncharacterized protein LOC103496742 [Cucumis melo]2.0e-1442.11Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C + Q+V CA+F+LRE + +WW+SVER +   V                        K  EF+ LKQG+ +VEEYD EF  LS F PE+V+TEA + K
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVV
         F+ GLR D+Q  V
Subjt:  RFILGLRDDVQRVV

XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]1.9e-2048.72Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVGP----------------------KEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M CLE Q+V C +FML++D+ +WW S ER IDV+ GP                      K+ EFL LKQ  RSVEEYD EFT+LSRF PE+VDTEA K +
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVGP----------------------KEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVVGPL
        RFI+ L+D+ +  V  L
Subjt:  RFILGLRDDVQRVVGPL

XP_038880159.1 uncharacterized protein LOC120071839 [Benincasa hispida]8.9e-1541.03Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C E Q++ CA F+L +++  WWR  ER I+ + G                       K+AEF+ LKQG  +VEEY+ +FTRLS F P++V TEAK+ +
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVVGPL
        RF+ GLRD+V+ +V  L
Subjt:  RFILGLRDDVQRVVGPL

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]1.2e-1442.11Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVGP----------------------KEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C E Q+V CA+FML + + +WW+  ER + V   P                      K+ EFL L+QG RSVEEYD EF  LSRF PE+V TEA + +
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVGP----------------------KEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVV
        RFI GL++ ++ +V
Subjt:  RFILGLRDDVQRVV

TrEMBL top hitse value%identityAlignment
A0A1S3C5M7 uncharacterized protein LOC1034967429.6e-1542.11Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C + Q+V CA+F+LRE + +WW+SVER +   V                        K  EF+ LKQG+ +VEEYD EF  LS F PE+V+TEA + K
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVV
         F+ GLR D+Q  V
Subjt:  RFILGLRDDVQRVV

A0A5D3BCD2 Gag protease polyprotein9.6e-1542.11Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C + Q+V CA+F+LRE + +WW+SVER +   V                        K  EF+ LKQG+ +VEEYD EF  LS F PE+V+TEA + K
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVV
         F+ GLR D+Q  V
Subjt:  RFILGLRDDVQRVV

A0A5D3BSM2 Reverse transcriptase4.3e-1541.18Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C E Q+V CA+FML +    WW + ER +   VG                       K  EFL L+QG+R VE+YD EF  LSRF PEM+ TEA +  
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVVGPLPQ
        +F+ GLR D+Q +V   PQ
Subjt:  RFILGLRDDVQRVVGPLPQ

A0A6J1DSJ6 uncharacterized protein LOC1110235129.0e-2148.72Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVGP----------------------KEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M CLE Q+V C +FML++D+ +WW S ER IDV+ GP                      K+ EFL LKQ  RSVEEYD EFT+LSRF PE+VDTEA K +
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVGP----------------------KEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVVGPL
        RFI+ L+D+ +  V  L
Subjt:  RFILGLRDDVQRVVGPL

E5GB72 Ty3-gypsy retrotransposon protein9.6e-1542.11Show/hide
Query:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK
        M C + Q+V CA+F+LRE + +WW+SVER +   V                        K  EF+ LKQG+ +VEEYD EF  LS F PE+V+TEA + K
Subjt:  MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVG----------------------PKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTK

Query:  RFILGLRDDVQRVV
         F+ GLR D+Q  V
Subjt:  RFILGLRDDVQRVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTGCCTCGAGGCACAACAGGTCCCATGCGCTATGTTTATGCTGAGAGAGGATTCACTGATGTGGTGGCGGTCAGTAGAGAGATCCATTGATGTCACCGTAGGTCC
GAAGGAGGCAGAGTTCCTGGCCTTGAAGCAGGGAGAAAGGTCAGTGGAGGAGTATGATCTGGAGTTCACGCGATTATCTCGCTTTGTCCCAGAGATGGTGGACACTGAAG
CAAAGAAGACTAAAAGGTTCATCTTGGGCCTCAGAGATGACGTGCAGAGGGTTGTGGGGCCCTTGCCCCAACTGATTATCCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTGCCTCGAGGCACAACAGGTCCCATGCGCTATGTTTATGCTGAGAGAGGATTCACTGATGTGGTGGCGGTCAGTAGAGAGATCCATTGATGTCACCGTAGGTCC
GAAGGAGGCAGAGTTCCTGGCCTTGAAGCAGGGAGAAAGGTCAGTGGAGGAGTATGATCTGGAGTTCACGCGATTATCTCGCTTTGTCCCAGAGATGGTGGACACTGAAG
CAAAGAAGACTAAAAGGTTCATCTTGGGCCTCAGAGATGACGTGCAGAGGGTTGTGGGGCCCTTGCCCCAACTGATTATCCAGTGA
Protein sequenceShow/hide protein sequence
MGCLEAQQVPCAMFMLREDSLMWWRSVERSIDVTVGPKEAEFLALKQGERSVEEYDLEFTRLSRFVPEMVDTEAKKTKRFILGLRDDVQRVVGPLPQLIIQ