; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy4G073630 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy4G073630
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchrH04:7378048..7379127
RNA-Seq ExpressionChy4G073630
SyntenyChy4G073630
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146827.2 uncharacterized protein LOC101206630 isoform X1 [Cucumis sativus]5.18e-18194.59Show/hide
Query:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD
        MSSFNFLV+NMRTLVDAFATL LADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWF+FSEIFPLAQYLKDFGY SFSFSIGRD
Subjt:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD

Query:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ
        PHHAQ+EFQGP+ LL+EVTLRLV+CHLPLRIHQFDLSVFVSMDSQ+FSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEV+
Subjt:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ

Query:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK
        F+ITLGPQEVFN IASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFA YRRSK
Subjt:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK

XP_008459734.1 PREDICTED: uncharacterized protein LOC103498776 [Cucumis melo]1.87e-14177.95Show/hide
Query:  SFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRDPH
        SFNF+V+NM+TL+DA ATL +AD   DATFSPEMFC+M DSNVSIHS IGLQLWPPFFDHYFC +L+ SWF+F+EIFPLAQ L+D GY SFSFSIG +P 
Subjt:  SFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRDPH

Query:  HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQFI
         AQ++FQGP+GLL E    LVY H PLRI  FDLSVFVSMDSQEFSN+ISQYHMFDDVHVTITSERVIFSYS MQETIL+ QNGQCIIGG++AP++VQFI
Subjt:  HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQFI

Query:  ITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYR
        +TLGP EVFN IASQTKRVWFFKQCNSN+GLITAPLGLN+RLVA F DVFA++R
Subjt:  ITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYR

XP_011652354.1 uncharacterized protein LOC101206630 isoform X2 [Cucumis sativus]2.47e-17693.44Show/hide
Query:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD
        MSSFNFLV+NMRTLVDAFATL LADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWF+FSEIFPLAQYLKDFGY SFSFSIGRD
Subjt:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD

Query:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ
        PHHAQ+EFQGP+ LL+EVTLRLV+CHLPLRIHQFDLSVFVSMDSQ+FSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQ   CIIGGLRAPDEV+
Subjt:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ

Query:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK
        F+ITLGPQEVFN IASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFA YRRSK
Subjt:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK

XP_011652355.1 uncharacterized protein LOC101206630 isoform X3 [Cucumis sativus]2.61e-9963.71Show/hide
Query:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD
        MSSFNFLV+NMRTLVDAFATL LADHK                                                                         
Subjt:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD

Query:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ
                    GLL+EVTLRLV+CHLPLRIHQFDLSVFVSMDSQ+FSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEV+
Subjt:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ

Query:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK
        F+ITLGPQEVFN IASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFA YRRSK
Subjt:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK

XP_038876014.1 uncharacterized protein LOC120068348 [Benincasa hispida]1.56e-11062.6Show/hide
Query:  FNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFC-RDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRDPH
        FNF+V++MRTL++A   + + D   D TFSPEM C+MADS VSI +AIG+QLWPPFFDHYFC  +L+ SWFY ++IFPL   L D GY SF+FSI  +P+
Subjt:  FNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFC-RDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRDPH

Query:  H--AQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ
           A+L+F+GP+GLL E+  +L Y H PL + +FDLSVFVS+DSQEFS+++ +YHMFD VHVTITS RV FSY+ +QETIL+PQ+GQC+IGG+RAP+++Q
Subjt:  H--AQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ

Query:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQ
        FIITL P EVF  IA + KR+WFFK  NS +G+ITAP+GLN RLVAFFCDVFA+
Subjt:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQ

TrEMBL top hitse value%identityAlignment
A0A0A0KA18 Uncharacterized protein9.9e-7256.35Show/hide
Query:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKN-SWFYFSEIFPLAQYLKDFGYNSFSFSIGR
        M SF   V +M+ L+D+  TL L D   DA FSP+MFC++ADS+VS+ SA GLQLWPPFFD ++  +L+   WF  + +FPLA  L + G  S +FSI R
Subjt:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKN-SWFYFSEIFPLAQYLKDFGYNSFSFSIGR

Query:  DPH-HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDE
          H +AQ +F+GP+GLL EV  RL     PLRI Q DLS FV+MDSQEFS +ISQY+MFD V V ITS RV FS ST+QET +S ++G+CI+GG+RAP +
Subjt:  DPH-HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDE

Query:  VQFIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCD
        VQFIIT+     F   ASQ+KR+W FK+ NS +G+ITAPLGL  RLV+FFCD
Subjt:  VQFIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCD

A0A0A0KFM5 Uncharacterized protein2.1e-7456.08Show/hide
Query:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKN-SWFYFSEIFPLAQYLKDFGYNSFSFSIGR
        M SF   V +M+ L+D+  TL L D   DA FSP+ FC+MADS+ SIHSA G+QLWPPFFD ++  D++   WF  + +F LA  L + GY S +FSI R
Subjt:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKN-SWFYFSEIFPLAQYLKDFGYNSFSFSIGR

Query:  DPH-HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDE
          + +AQ +F+GP+GLL +V   L     PLRI Q DLS FVSMDS+EFSN+IS+YHMFD V V ITS RV FSY+ +QETI++P++GQC+IGG+RAP++
Subjt:  DPH-HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDE

Query:  VQFIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFA
        VQFIIT+     F   ASQ+KR+W FK+ NS +G+ITAPLGL  RLV+FFCDV A
Subjt:  VQFIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFA

A0A0A0LIF2 Uncharacterized protein2.4e-14294.59Show/hide
Query:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD
        MSSFNFLV+NMRTLVDAFATL LADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWF+FSEIFPLAQYLKDFGY SFSFSIGRD
Subjt:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRD

Query:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ
        PHHAQ+EFQGP+ LL+EVTLRLV+CHLPLRIHQFDLSVFVSMDSQ+FSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEV+
Subjt:  PHHAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQ

Query:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK
        F+ITLGPQEVFN IASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFA YRRSK
Subjt:  FIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK

A0A1S3CBB9 uncharacterized protein LOC1034987763.4e-11277.95Show/hide
Query:  SFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRDPH
        SFNF+V+NM+TL+DA ATL +AD   DATFSPEMFC+M DSNVSIHS IGLQLWPPFFDHYFC +L+ SWF+F+EIFPLAQ L+D GY SFSFSIG +P 
Subjt:  SFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRDPH

Query:  HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQFI
         AQ++FQGP+GLL E    LVY H PLRI  FDLSVFVSMDSQEFSN+ISQYHMFDDVHVTITSERVIFSYS MQETIL+ QNGQCIIGG++AP++VQFI
Subjt:  HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQFI

Query:  ITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYR
        +TLGP EVFN IASQTKRVWFFKQCNSN+GLITAPLGLN+RLVA F DVFA++R
Subjt:  ITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYR

A0A5D3DMK7 Uncharacterized protein2.1e-7456.35Show/hide
Query:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKN-SWFYFSEIFPLAQYLKDFGYNSFSFSIGR
        M SF   V++M+ L+D   TL L D   D  FSP+MFC+MADS+VSIHSA G++L PPFFD ++  +++   WF  + +FPLA  L + GY SF+FSI R
Subjt:  MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKN-SWFYFSEIFPLAQYLKDFGYNSFSFSIGR

Query:  DPH-HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDE
          + HAQ  F+GP+GLL EV   L     PL I + DLS FV+MDSQEFSN+IS+YHMFD V V IT+ RV FSY+ +QETI++PQ+GQCIIGG+R P+E
Subjt:  DPH-HAQLEFQGPSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDE

Query:  VQFIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCD
        VQFIIT+   + F   ASQ+KR+W FK+ NS +G+ITAPLGL+ RLV+FFCD
Subjt:  VQFIITLGPQEVFNRIASQTKRVWFFKQCNSNRGLITAPLGLNARLVAFFCD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTTCAATTTCCTCGTCCACAATATGCGAACTCTCGTAGATGCTTTTGCCACATTGAGACTGGCGGATCATAAGGGTGATGCAACATTTTCACCGGAAATGTT
TTGCCTAATGGCGGATTCAAATGTTTCCATTCACAGCGCCATTGGCCTTCAACTCTGGCCTCCATTCTTCGACCATTACTTCTGCAGAGACCTTAAAAATTCTTGGTTCT
ACTTCAGTGAAATTTTCCCTCTTGCCCAATATTTGAAAGATTTCGGTTACAATTCTTTCTCCTTCTCTATCGGCCGTGACCCTCACCACGCCCAACTCGAATTCCAAGGC
CCCAGTGGGCTTCTTATTGAAGTTACTTTAAGGTTGGTTTATTGCCATCTTCCGTTGCGCATCCACCAATTTGATCTCTCTGTTTTTGTTTCAATGGATTCTCAAGAATT
CTCCAACCTTATTTCTCAGTATCATATGTTTGATGACGTTCATGTTACTATAACAAGTGAGCGAGTGATATTCTCTTATTCAACTATGCAAGAGACAATTCTTAGTCCAC
AGAATGGGCAGTGCATAATTGGAGGTTTAAGAGCACCAGATGAAGTTCAATTCATAATAACTTTAGGTCCACAGGAAGTTTTCAACCGTATAGCAAGTCAAACGAAGAGG
GTATGGTTTTTTAAGCAATGTAATTCCAATAGAGGTTTAATTACGGCCCCTCTTGGATTGAATGCTCGACTTGTTGCTTTTTTCTGTGATGTCTTTGCCCAATATCGACG
ATCTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTTCAATTTCCTCGTCCACAATATGCGAACTCTCGTAGATGCTTTTGCCACATTGAGACTGGCGGATCATAAGGGTGATGCAACATTTTCACCGGAAATGTT
TTGCCTAATGGCGGATTCAAATGTTTCCATTCACAGCGCCATTGGCCTTCAACTCTGGCCTCCATTCTTCGACCATTACTTCTGCAGAGACCTTAAAAATTCTTGGTTCT
ACTTCAGTGAAATTTTCCCTCTTGCCCAATATTTGAAAGATTTCGGTTACAATTCTTTCTCCTTCTCTATCGGCCGTGACCCTCACCACGCCCAACTCGAATTCCAAGGC
CCCAGTGGGCTTCTTATTGAAGTTACTTTAAGGTTGGTTTATTGCCATCTTCCGTTGCGCATCCACCAATTTGATCTCTCTGTTTTTGTTTCAATGGATTCTCAAGAATT
CTCCAACCTTATTTCTCAGTATCATATGTTTGATGACGTTCATGTTACTATAACAAGTGAGCGAGTGATATTCTCTTATTCAACTATGCAAGAGACAATTCTTAGTCCAC
AGAATGGGCAGTGCATAATTGGAGGTTTAAGAGCACCAGATGAAGTTCAATTCATAATAACTTTAGGTCCACAGGAAGTTTTCAACCGTATAGCAAGTCAAACGAAGAGG
GTATGGTTTTTTAAGCAATGTAATTCCAATAGAGGTTTAATTACGGCCCCTCTTGGATTGAATGCTCGACTTGTTGCTTTTTTCTGTGATGTCTTTGCCCAATATCGACG
ATCTAAATGA
Protein sequenceShow/hide protein sequence
MSSFNFLVHNMRTLVDAFATLRLADHKGDATFSPEMFCLMADSNVSIHSAIGLQLWPPFFDHYFCRDLKNSWFYFSEIFPLAQYLKDFGYNSFSFSIGRDPHHAQLEFQG
PSGLLIEVTLRLVYCHLPLRIHQFDLSVFVSMDSQEFSNLISQYHMFDDVHVTITSERVIFSYSTMQETILSPQNGQCIIGGLRAPDEVQFIITLGPQEVFNRIASQTKR
VWFFKQCNSNRGLITAPLGLNARLVAFFCDVFAQYRRSK