; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040615 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040615
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag-Pol polyprotein/retrotransposon
Genome locationchr13:6595411..6599701
RNA-Seq ExpressionLag0040615
SyntenyLag0040615
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia]1.4e-9785.07Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL
        MPTPLKLTNG I CV+FP  GSN+LQPKTI R SPPFR  KP N FV+RNPS+R CL NAEISANDPLKSENGFSNHE EGSM+KNEN Q+HPQKS EVL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +A
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA

Query:  IVGFCLGLALLLFVAVTLLSA
        IVGFCLGL+LLLF+AVTLLSA
Subjt:  IVGFCLGLALLLFVAVTLLSA

KAG7031890.1 hypothetical protein SDJN02_05931, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-9382.19Show/hide
Query:  MPTPLKLTNGNIRCVIFPGSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDK
        MPTPLKLTNG I CV+FPGSN+LQPKTI R SPPFR  KP N FV+RNPS+R CL+NAEISANDPLKSENGFSNHE EGSM+KNEN Q+HPQKS EVLDK
Subjt:  MPTPLKLTNGNIRCVIFPGSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDK

Query:  LRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIV
        LRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP P KMGY         IMATVWAGSQVTKLARAAGALA+APFVDR+LSWFTVKYNF+SQGKA MAIV
Subjt:  LRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIV

Query:  GFCLGLALLLFVAVTLLSA
        GFCLGL+LLLF+AVTLLSA
Subjt:  GFCLGLALLLFVAVTLLSA

XP_022956520.1 uncharacterized protein LOC111458238 [Cucurbita moschata]6.9e-9784.62Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL
        MPTPLKLTNG I CV+FP  GSN+LQPKTI   SPPFR  KP N FV+RNPS+R CL+NAEISANDPLKSENGFSNHE EGSM+KNEN Q+HPQKS EVL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR+LSWFTVKYNF+SQGKA +A
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA

Query:  IVGFCLGLALLLFVAVTLLSA
        IVGFCLGL+LLLF+AVTLLSA
Subjt:  IVGFCLGLALLLFVAVTLLSA

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]1.8e-9784.16Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL
        MPTPLKLTNG I CV+FP  GSNELQPKTI R SPPFR  KP N FV+RNPS+R CL+NAEISANDPLKSE+GFSNHE EGSM+KNEN ++HP+KS EVL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAP KMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +A
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA

Query:  IVGFCLGLALLLFVAVTLLSA
        IVGFCLGL+LLLF+AVTLLSA
Subjt:  IVGFCLGLALLLFVAVTLLSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]4.1e-9784.16Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL
        MPTPLKLTNG I CV+FP  GSN+LQPKTI R SPPFR  KP N FV+RNPS+R CL+NAEISANDPLKSENGFSNHE EGSM+KNEN ++HP+KS EVL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +A
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA

Query:  IVGFCLGLALLLFVAVTLLSA
        IVGFCLGL+LLLF+AVTLLSA
Subjt:  IVGFCLGLALLLFVAVTLLSA

TrEMBL top hitse value%identityAlignment
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X59.8e-8184.1Show/hide
Query:  PKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTF
        P+   R SPP        F V+RNPSVRLCLSNA+ISANDPLKSE+ FSNHE EGSM+KNEN Q+HPQKSNEVLDKLRRYG+SGILSYGLLNT YYL TF
Subjt:  PKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTF

Query:  LVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA
        LVVWFY+APAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDR LSWFTV YNFESQGKAFMAIVGFCLGLALLLF+ VTLLSA
Subjt:  LVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA

A0A5A7SW01 Uncharacterized protein1.4e-8277.63Show/hide
Query:  MPTPLKLTNGNIRCVIFPGSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDK
        M TP KLTNGNI CV FP S ELQ K+ISR SPP  +P   N                   ANDPLKSE+ FSNHE EGSM+KNEN Q+HPQKSNEVLDK
Subjt:  MPTPLKLTNGNIRCVIFPGSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDK

Query:  LRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIV
        LRRYG+SGILSYGLLNT YYL TFLVVWFY+APAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDR LSWFTV YNFESQGKAFMAIV
Subjt:  LRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIV

Query:  GFCLGLALLLFVAVTLLSA
        GFCLGLALLLF+ VTLLSA
Subjt:  GFCLGLALLLFVAVTLLSA

A0A6J1GY06 uncharacterized protein LOC1114582383.4e-9784.62Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL
        MPTPLKLTNG I CV+FP  GSN+LQPKTI   SPPFR  KP N FV+RNPS+R CL+NAEISANDPLKSENGFSNHE EGSM+KNEN Q+HPQKS EVL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR+LSWFTVKYNF+SQGKA +A
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA

Query:  IVGFCLGLALLLFVAVTLLSA
        IVGFCLGL+LLLF+AVTLLSA
Subjt:  IVGFCLGLALLLFVAVTLLSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X28.8e-9884.16Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL
        MPTPLKLTNG I CV+FP  GSNELQPKTI R SPPFR  KP N FV+RNPS+R CL+NAEISANDPLKSE+GFSNHE EGSM+KNEN ++HP+KS EVL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAP KMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +A
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMA

Query:  IVGFCLGLALLLFVAVTLLSA
        IVGFCLGL+LLLF+AVTLLSA
Subjt:  IVGFCLGLALLLFVAVTLLSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X11.1e-8784.18Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL
        MPTPLKLTNG I CV+FP  GSNELQPKTI R SPPFR  KP N FV+RNPS+R CL+NAEISANDPLKSE+GFSNHE EGSM+KNEN ++HP+KS EVL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGK
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAP KMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGK
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38695.1 unknown protein5.7e-4963.01Show/hide
Query:  NPSVRLCLS-NAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGR
        N S RL  S +  +S     ++E      EM   + KN   +++P  S E+L KL+RYG+SGILSYGLLNTVYY   FL+VWFYVAPAP KMGY+AAA R
Subjt:  NPSVRLCLS-NAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGR

Query:  FLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA
        FLK+MA VWAGSQVTKL R  GA+ALAP VDR LSWFTVK NFESQGKAF A+VG CLG+AL+LF+ VTLL A
Subjt:  FLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA

AT2G38695.2 unknown protein2.1e-2757.38Show/hide
Query:  NPSVRLCLS-NAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGR
        N S RL  S +  +S     ++E      EM   + KN   +++P  S E+L KL+RYG+SGILSYGLLNTVYY   FL+VWFYVAPAP KMGY+AAA R
Subjt:  NPSVRLCLS-NAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGR

Query:  FLKIMATVWAGSQVTKLARAAG
        FLK+MA VWAGSQVTKL R  G
Subjt:  FLKIMATVWAGSQVTKLARAAG

AT2G38695.3 unknown protein4.0e-4249.32Show/hide
Query:  NPSVRLCLS-NAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGR
        N S RL  S +  +S     ++E      EM   + KN   +++P  S E+L KL+RYG+SGILSYGLLNTVYY   FL+VWFYVAPAP KMGY+AAA R
Subjt:  NPSVRLCLS-NAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGR

Query:  FLKIMATVWAGSQVTKLARAAG------------------------------------------------ALALAPFVDRSLSWFTVKYNFESQGKAFMA
        FLK+MA VWAGSQVTKL R  G                                                A+ALAP VDR LSWFTVK NFESQGKAF A
Subjt:  FLKIMATVWAGSQVTKLARAAG------------------------------------------------ALALAPFVDRSLSWFTVKYNFESQGKAFMA

Query:  IVGFCLGLALLLFVAVTLLSA
        +VG CLG+AL+LF+ VTLL A
Subjt:  IVGFCLGLALLLFVAVTLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACTCCACTCAAGCTCACCAATGGCAACATCCGTTGCGTCATTTTCCCGGGTTCGAACGAATTGCAGCCGAAGACAATTTCCAGAGTTTCCCCGCCATTTCGCGC
CCCAAAGCCGAAGAACTTTTTTGTTAATCGAAACCCTAGCGTCCGGCTTTGCCTTAGCAATGCCGAAATTAGCGCCAATGATCCATTGAAATCTGAAAATGGATTTTCCA
ATCATGAAATGGAAGGTTCAATGAAAAAGAATGAAAATTGTCAAAGACATCCGCAAAAATCAAATGAGGTACTGGATAAATTGAGAAGATATGGAATTTCTGGAATATTG
TCTTACGGATTATTGAATACAGTCTACTATCTTGTAACATTTCTCGTTGTGTGGTTCTACGTTGCACCAGCCCCTGTGAAAATGGGTTATGTTGCGGCTGCTGGAAGATT
TCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCACCATTCGTTGACAGAAGCTTGTCGTGGTTCA
CGGTCAAGTACAACTTCGAGTCTCAGGGAAAGGCGTTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCGTTGCCGTTACTCTGCTTTCAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAACTCCACTCAAGCTCACCAATGGCAACATCCGTTGCGTCATTTTCCCGGGTTCGAACGAATTGCAGCCGAAGACAATTTCCAGAGTTTCCCCGCCATTTCGCGC
CCCAAAGCCGAAGAACTTTTTTGTTAATCGAAACCCTAGCGTCCGGCTTTGCCTTAGCAATGCCGAAATTAGCGCCAATGATCCATTGAAATCTGAAAATGGATTTTCCA
ATCATGAAATGGAAGGTTCAATGAAAAAGAATGAAAATTGTCAAAGACATCCGCAAAAATCAAATGAGGTACTGGATAAATTGAGAAGATATGGAATTTCTGGAATATTG
TCTTACGGATTATTGAATACAGTCTACTATCTTGTAACATTTCTCGTTGTGTGGTTCTACGTTGCACCAGCCCCTGTGAAAATGGGTTATGTTGCGGCTGCTGGAAGATT
TCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCACCATTCGTTGACAGAAGCTTGTCGTGGTTCA
CGGTCAAGTACAACTTCGAGTCTCAGGGAAAGGCGTTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCGTTGCCGTTACTCTGCTTTCAGCATAA
Protein sequenceShow/hide protein sequence
MPTPLKLTNGNIRCVIFPGSNELQPKTISRVSPPFRAPKPKNFFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGSMKKNENCQRHPQKSNEVLDKLRRYGISGIL
SYGLLNTVYYLVTFLVVWFYVAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRSLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA