; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg02031 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg02031
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionGag-Pol polyprotein/retrotransposon
Genome locationCarg_Chr04:7944864..7948937
RNA-Seq ExpressionCarg02031
SyntenyCarg02031
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia]3.0e-10593.67Show/hide
Query:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
        MPTPLKLTNGAIPCVVFP  GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCL+NAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY         IMATVWAGSQVTKLARAAGALAMAPFVDR LSWFTVKYNFKSQGKAV+A
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

KAG7031890.1 hypothetical protein SDJN02_05931, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-110100Show/hide
Query:  MPTPLKLTNGAIPCVVFPGSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDK
        MPTPLKLTNGAIPCVVFPGSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDK
Subjt:  MPTPLKLTNGAIPCVVFPGSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDK

Query:  LRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGYIMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLL
        LRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGYIMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLL
Subjt:  LRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGYIMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLL

Query:  LFIAVTLLSA
        LFIAVTLLSA
Subjt:  LFIAVTLLSA

XP_022956520.1 uncharacterized protein LOC111458238 [Cucurbita moschata]2.3e-10594.12Show/hide
Query:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
        MPTPLKLTNGAIPCVVFP  GSNKLQPKTIF ASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY         IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAV+A
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]5.3e-10290.95Show/hide
Query:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
        MPTPLKLTNGAIPCVVFP  GSN+LQPKTIFRASPPFRVSKP NIFVSRNPSIRQCLNNAEISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVL
Subjt:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAP PAKMGY         IMAT+WAGSQVTKLARAAGALAMAPFVDR LSWFTVKYNFKSQGKAV+A
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]4.3e-10492.76Show/hide
Query:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
        MPTPLKLTNGAIPCVVFP  GSNKLQPKTIFRASPPFRVSKP NIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENH+KHP+KSIEVL
Subjt:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY         IMATVWAGSQVTKLARAAGALAMAPFVDR LSWFTVKYNFKSQGKAV+A
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

TrEMBL top hitse value%identityAlignment
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X53.6e-7278.97Show/hide
Query:  PKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTF
        P+   R SPP      I   VSRNPS+R CL+NA+ISANDPLKSE+ FSNHETEGSMEKNEN QKHPQKS EVLDKLRRYG+SGILSYGLLNT YYLTTF
Subjt:  PKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTF

Query:  LVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLLLFIAVTLLSA
        LVVWFYIAP PAKMGY         IMATVWAGSQVTKLARAAGALA+APFVDR LSWFTV YNF+SQGKA MAIVGFCLGL+LLLFI VTLLSA
Subjt:  LVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLLLFIAVTLLSA

A0A5A7SW01 Uncharacterized protein7.2e-7373.97Show/hide
Query:  MPTPLKLTNGAIPCVVFPGSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDK
        M TP KLTNG IPCV FP S +LQ K+I RASPP  +S P        P+           ANDPLKSE+ FSNHETEGSMEKNEN QKHPQKS EVLDK
Subjt:  MPTPLKLTNGAIPCVVFPGSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDK

Query:  LRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIV
        LRRYG+SGILSYGLLNT YYLTTFLVVWFYIAP PAKMGY         IMATVWAGSQVTKLARAAGALA+APFVDR LSWFTV YNF+SQGKA MAIV
Subjt:  LRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIV

Query:  GFCLGLSLLLFIAVTLLSA
        GFCLGL+LLLFI VTLLSA
Subjt:  GFCLGLSLLLFIAVTLLSA

A0A6J1GY06 uncharacterized protein LOC1114582381.1e-10594.12Show/hide
Query:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
        MPTPLKLTNGAIPCVVFP  GSNKLQPKTIF ASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
Subjt:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY         IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAV+A
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X22.5e-10290.95Show/hide
Query:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
        MPTPLKLTNGAIPCVVFP  GSN+LQPKTIFRASPPFRVSKP NIFVSRNPSIRQCLNNAEISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVL
Subjt:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAP PAKMGY         IMAT+WAGSQVTKLARAAGALAMAPFVDR LSWFTVKYNFKSQGKAV+A
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMA

Query:  IVGFCLGLSLLLFIAVTLLSA
        IVGFCLGLSLLLFIAVTLLSA
Subjt:  IVGFCLGLSLLLFIAVTLLSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X14.5e-9190.31Show/hide
Query:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL
        MPTPLKLTNGAIPCVVFP  GSN+LQPKTIFRASPPFRVSKP NIFVSRNPSIRQCLNNAEISANDPLKSE+GFSNHETEGSMEKNENH+KHP+KSIEVL
Subjt:  MPTPLKLTNGAIPCVVFP--GSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVL

Query:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGK
        DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAP PAKMGY         IMAT+WAGSQVTKLARAAGALAMAPFVDR LSWFTVKYNFKSQGK
Subjt:  DKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38695.1 unknown protein2.5e-4160.25Show/hide
Query:  ISANDPLKSENGFSNHETEGSM-EKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGS
        +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNTVYY T FL+VWFY+AP P KMGY         +MA VWAGS
Subjt:  ISANDPLKSENGFSNHETEGSM-EKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGS

Query:  QVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLLLFIAVTLLSA
        QVTKL R  GA+A+AP VDR LSWFTVK NF+SQGKA  A+VG CLG++L+LFI VTLL A
Subjt:  QVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLLLFIAVTLLSA

AT2G38695.2 unknown protein1.3e-2155.45Show/hide
Query:  ISANDPLKSENGFSNHETEGSM-EKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGS
        +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNTVYY T FL+VWFY+AP P KMGY         +MA VWAGS
Subjt:  ISANDPLKSENGFSNHETEGSM-EKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGS

Query:  QVTKLARAAG
        QVTKL R  G
Subjt:  QVTKLARAAG

AT2G38695.3 unknown protein1.7e-3446.41Show/hide
Query:  ISANDPLKSENGFSNHETEGSM-EKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGS
        +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNTVYY T FL+VWFY+AP P KMGY         +MA VWAGS
Subjt:  ISANDPLKSENGFSNHETEGSM-EKNENHQKHPQKSIEVLDKLRRYGVSGILSYGLLNTVYYLTTFLVVWFYIAPPPAKMGY---------IMATVWAGS

Query:  QVTKLARAAG------------------------------------------------ALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLLL
        QVTKL R  G                                                A+A+AP VDR LSWFTVK NF+SQGKA  A+VG CLG++L+L
Subjt:  QVTKLARAAG------------------------------------------------ALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLLL

Query:  FIAVTLLSA
        FI VTLL A
Subjt:  FIAVTLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACTCCACTTAAGCTCACCAATGGCGCCATCCCTTGCGTCGTTTTCCCGGGTTCGAATAAATTGCAGCCGAAGACAATTTTCAGAGCTTCCCCGCCATTTCGCGT
CTCAAAGCCGATTAACATTTTTGTGAGTCGGAACCCTAGCATCCGACAATGCCTTAACAATGCCGAAATTAGCGCCAATGATCCATTGAAATCGGAAAATGGCTTTTCCA
ATCATGAAACTGAAGGTTCAATGGAAAAGAATGAAAATCATCAAAAACATCCGCAAAAATCGATCGAGGTGCTGGATAAATTGAGGAGATATGGAGTTTCTGGAATATTG
TCTTATGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTTGTTGTGTGGTTTTACATTGCACCACCACCTGCGAAAATGGGCTATATAATGGCTACAGTCTGGGC
TGGAAGCCAAGTTACTAAGCTGGCAAGAGCGGCAGGAGCTCTTGCTATGGCACCGTTCGTCGACAGAGCATTGTCGTGGTTCACGGTCAAATACAACTTCAAGTCTCAGG
GGAAGGCAGTTATGGCGATTGTTGGATTCTGCTTAGGATTGTCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAACTCCACTTAAGCTCACCAATGGCGCCATCCCTTGCGTCGTTTTCCCGGGTTCGAATAAATTGCAGCCGAAGACAATTTTCAGAGCTTCCCCGCCATTTCGCGT
CTCAAAGCCGATTAACATTTTTGTGAGTCGGAACCCTAGCATCCGACAATGCCTTAACAATGCCGAAATTAGCGCCAATGATCCATTGAAATCGGAAAATGGCTTTTCCA
ATCATGAAACTGAAGGTTCAATGGAAAAGAATGAAAATCATCAAAAACATCCGCAAAAATCGATCGAGGTGCTGGATAAATTGAGGAGATATGGAGTTTCTGGAATATTG
TCTTATGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTTGTTGTGTGGTTTTACATTGCACCACCACCTGCGAAAATGGGCTATATAATGGCTACAGTCTGGGC
TGGAAGCCAAGTTACTAAGCTGGCAAGAGCGGCAGGAGCTCTTGCTATGGCACCGTTCGTCGACAGAGCATTGTCGTGGTTCACGGTCAAATACAACTTCAAGTCTCAGG
GGAAGGCAGTTATGGCGATTGTTGGATTCTGCTTAGGATTGTCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCATAAGACAGGTTCTTCCCTTGGAAAGTAAGT
ACTTTTTCTCTCACATCATTCATTACCCTATTCCCAATGAGTGTTTCCTTGTCCTCAATATTGAAGGCTATCACCTTTTTTTCTTCCTTAAAAATTAAAATTTCCAATAT
TTTAAAACATAGGGTGGAGGTTTAACTAAAACGATACGTTTTATCAATGAGATTGTCGTTATTGTCGAGAATA
Protein sequenceShow/hide protein sequence
MPTPLKLTNGAIPCVVFPGSNKLQPKTIFRASPPFRVSKPINIFVSRNPSIRQCLNNAEISANDPLKSENGFSNHETEGSMEKNENHQKHPQKSIEVLDKLRRYGVSGIL
SYGLLNTVYYLTTFLVVWFYIAPPPAKMGYIMATVWAGSQVTKLARAAGALAMAPFVDRALSWFTVKYNFKSQGKAVMAIVGFCLGLSLLLFIAVTLLSA