; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg009865 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg009865
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionGag-Pol polyprotein/retrotransposon
Genome locationscaffold7:7223933..7228312
RNA-Seq ExpressionSpg009865
SyntenySpg009865
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648632.1 hypothetical protein Csa_009176 [Cucumis sativus]2.0e-8479.61Show/hide
Query:  VNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAA
        V+RNPSVR CLSNA+ISANDPLKSE+ FSNHEMEG +               VLDKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAP KMGYVAAA
Subjt:  VNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAA

Query:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRNETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYS
        GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGKV R ETSI+  C LFH+IDV LKVK   G+YGDCWVLLRIGSLVI C YS
Subjt:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRNETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYS

Query:  AFSIRQ
        AFS+RQ
Subjt:  AFSIRQ

XP_022983929.1 uncharacterized protein LOC111482401 isoform X1 [Cucurbita maxima]1.2e-10577.73Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL
        MPTPLKLTNG I CV+FP  GSNELQPKTI R  PPFR  KP NIFV+RNPS+R CL+NAEISANDPLKSE+GFSNHE EG +               VL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRN
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKVTRN
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRN

Query:  ETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYSAFSIRQVIPL
        ETSI+    LFH+IDV+LKV  SPG  GDCW+LLR+ SLVI C YSAFSIRQV+PL
Subjt:  ETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYSAFSIRQVIPL

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]1.5e-7979.59Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL
        MPTPLKLTNG I CV+FP  GSNELQPKTI R  PPFR  KP NIFV+RNPS+R CL+NAEISANDPLKSE+GFSNHE EG +               VL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGK
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

XP_023534249.1 uncharacterized protein LOC111795864 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-11378.95Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL
        MPTPLKLTNG I CV+FP  GSN+LQPKTI R  PPFR  KP NIFV+RNPS+R CL+NAEISANDPLKSENGFSNHE EG +               VL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRN
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKVTRN
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRN

Query:  ETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYSAFSIRQVIPLESKYFFSHII
        ETSI+P C LFH+ID++LK+  SPG  GDCW+LLRI SLVI C YSAFSIRQV+PLESKYFFSHII
Subjt:  ETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYSAFSIRQVIPLESKYFFSHII

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]3.4e-7979.59Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL
        MPTPLKLTNG I CV+FP  GSN+LQPKTI R  PPFR  KP NIFV+RNPS+R CL+NAEISANDPLKSENGFSNHE EG +               VL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGK
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

TrEMBL top hitse value%identityAlignment
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X54.2e-5978.26Show/hide
Query:  FRAPKPKNIF-VNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAP
        F  P+ +  F V+RNPSVRLCLSNA+ISANDPLKSE+ FSNHE EG +               VLDKLRRYG+SGILSYGLLNT YYL TFLVVWFY+AP
Subjt:  FRAPKPKNIF-VNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAP

Query:  APAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        APAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGK
Subjt:  APAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

A0A5A7SW01 Uncharacterized protein4.5e-6169.07Show/hide
Query:  MPTPLKLTNGNIRCVIFPGSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VLDK
        M TP KLTNGNI CV FP S ELQ K+ISR  PP  +P   N                   ANDPLKSE+ FSNHE EG +               VLDK
Subjt:  MPTPLKLTNGNIRCVIFPGSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VLDK

Query:  LRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        LRRYG+SGILSYGLLNT YYL TFLVVWFY+APAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGK
Subjt:  LRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

A0A6J1GY06 uncharacterized protein LOC1114582381.5e-7778.57Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL
        MPTPLKLTNG I CV+FP  GSN+LQPKTI    PPFR  KP NIFV+RNPS+R CL+NAEISANDPLKSENGFSNHE EG +               VL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+AP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGK
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X27.4e-8079.59Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL
        MPTPLKLTNG I CV+FP  GSNELQPKTI R  PPFR  KP NIFV+RNPS+R CL+NAEISANDPLKSE+GFSNHE EG +               VL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGK
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

A0A6J1J900 uncharacterized protein LOC111482401 isoform X16.0e-10677.73Show/hide
Query:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL
        MPTPLKLTNG I CV+FP  GSNELQPKTI R  PPFR  KP NIFV+RNPS+R CL+NAEISANDPLKSE+GFSNHE EG +               VL
Subjt:  MPTPLKLTNGNIRCVIFP--GSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGML---------------VL

Query:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRN
        DKLRRYG+SGILSYGLLNTVYYL TFLVVWFY+APAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKVTRN
Subjt:  DKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRN

Query:  ETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYSAFSIRQVIPL
        ETSI+    LFH+IDV+LKV  SPG  GDCW+LLR+ SLVI C YSAFSIRQV+PL
Subjt:  ETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIGSLVIRCRYSAFSIRQVIPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38695.1 unknown protein7.4e-4061.54Show/hide
Query:  EISANDPLKSENGFSNHEMEGMLVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALA
        E+   + +  +N F + E+     L KL+RYG+SGILSYGLLNTVYY   FL+VWFYVAPAP KMGY+AAA RFLK+MA VWAGSQVTKL R  GA+ALA
Subjt:  EISANDPLKSENGFSNHEMEGMLVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALA

Query:  PFVDRGLSWFTVKYNFESQGKVTRNETSIYPSCLLFHYIDVVL
        P VDRGLSWFTVK NFESQGK       I     L  +I V L
Subjt:  PFVDRGLSWFTVKYNFESQGKVTRNETSIYPSCLLFHYIDVVL

AT2G38695.2 unknown protein1.3e-2563.16Show/hide
Query:  EISANDPLKSENGFSNHEMEGMLVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG
        E+   + +  +N F + E+     L KL+RYG+SGILSYGLLNTVYY   FL+VWFYVAPAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G
Subjt:  EISANDPLKSENGFSNHEMEGMLVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG

AT2G38695.3 unknown protein5.1e-3346.07Show/hide
Query:  EISANDPLKSENGFSNHEMEGMLVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG-----
        E+   + +  +N F + E+     L KL+RYG+SGILSYGLLNTVYY   FL+VWFYVAPAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G     
Subjt:  EISANDPLKSENGFSNHEMEGMLVLDKLRRYGISGILSYGLLNTVYYLVTFLVVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG-----

Query:  -------------------------------------------ALALAPFVDRGLSWFTVKYNFESQGKVTRNETSIYPSCLLFHYIDVVL
                                                   A+ALAP VDRGLSWFTVK NFESQGK       I     L  +I V L
Subjt:  -------------------------------------------ALALAPFVDRGLSWFTVKYNFESQGKVTRNETSIYPSCLLFHYIDVVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACTCCACTCAAGCTCACCAATGGCAACATCCGTTGCGTCATTTTCCCGGGTTCGAACGAACTGCAGCCGAAGACAATTTCCAGAGTTTTCCCGCCATTTCGCGC
CCCAAAGCCGAAGAACATTTTTGTTAATCGGAACCCTAGCGTCCGGCTCTGCCTTAGCAATGCCGAAATTAGCGCCAATGATCCATTGAAATCTGAAAATGGCTTTTCCA
ATCATGAAATGGAAGGTATGCTGGTACTGGATAAATTGAGAAGATATGGAATTTCTGGAATATTGTCTTACGGATTATTGAATACAGTCTACTATCTTGTAACATTTCTC
GTTGTGTGGTTCTACGTTGCACCAGCCCCTGCAAAAATGGGTTATGTTGCGGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACTAA
GCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCACCATTCGTCGACAGAGGCTTGTCGTGGTTCACGGTCAAGTACAACTTCGAGTCTCAGGGGAAGGTTACTAGAAACG
AAACTTCTATATATCCATCCTGCTTATTGTTTCATTACATTGATGTTGTGCTCAAAGTTAAATCATCTCCAGGCGTTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGC
TCTCTTGTTATTCGTTGCCGTTACTCTGCTTTCAGCATAAGACAAGTTATTCCCTTGGAAAGTAAGTACTTTTTCTCTCACATCATTGGCCTTTTCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAACTCCACTCAAGCTCACCAATGGCAACATCCGTTGCGTCATTTTCCCGGGTTCGAACGAACTGCAGCCGAAGACAATTTCCAGAGTTTTCCCGCCATTTCGCGC
CCCAAAGCCGAAGAACATTTTTGTTAATCGGAACCCTAGCGTCCGGCTCTGCCTTAGCAATGCCGAAATTAGCGCCAATGATCCATTGAAATCTGAAAATGGCTTTTCCA
ATCATGAAATGGAAGGTATGCTGGTACTGGATAAATTGAGAAGATATGGAATTTCTGGAATATTGTCTTACGGATTATTGAATACAGTCTACTATCTTGTAACATTTCTC
GTTGTGTGGTTCTACGTTGCACCAGCCCCTGCAAAAATGGGTTATGTTGCGGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACTAA
GCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCACCATTCGTCGACAGAGGCTTGTCGTGGTTCACGGTCAAGTACAACTTCGAGTCTCAGGGGAAGGTTACTAGAAACG
AAACTTCTATATATCCATCCTGCTTATTGTTTCATTACATTGATGTTGTGCTCAAAGTTAAATCATCTCCAGGCGTTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGC
TCTCTTGTTATTCGTTGCCGTTACTCTGCTTTCAGCATAAGACAAGTTATTCCCTTGGAAAGTAAGTACTTTTTCTCTCACATCATTGGCCTTTTCCCTTAA
Protein sequenceShow/hide protein sequence
MPTPLKLTNGNIRCVIFPGSNELQPKTISRVFPPFRAPKPKNIFVNRNPSVRLCLSNAEISANDPLKSENGFSNHEMEGMLVLDKLRRYGISGILSYGLLNTVYYLVTFL
VVWFYVAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKVTRNETSIYPSCLLFHYIDVVLKVKSSPGVYGDCWVLLRIG
SLVIRCRYSAFSIRQVIPLESKYFFSHIIGLFP