; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G5754 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G5754
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationctg1402:2177759..2179709
RNA-Seq ExpressionCucsat.G5754
SyntenyCucsat.G5754
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045262.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.31e-12391.51Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKL NFDPLLDATSF AQISFD+AD+KFTPSKFFII+SHRSPRFIATLQLSPQWFT+FSVD+DHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE
        ILRFDTPSSEIQPLHHEL LSPPQAEDNQIGQHELDE KYFIVKSKALRRIIK+LPIFQNDSI+ VDVTNSRVKFSIASKEI+L EG HCKIEGFEEEVE
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE

Query:  TQFQIILCPMMF
        TQFQIILCPM++
Subjt:  TQFQIILCPMMF

XP_008464344.1 PREDICTED: uncharacterized protein LOC103502250 [Cucumis melo]2.23e-16892.91Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKL NFDPLLDATSF AQISFD+AD+KFTPSKFFII+SHRSPRFIATLQLSPQWFT+FSVD+DHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE
        ILRFDTPSSEIQPLHHEL LSPPQAEDNQIGQHELDE KYFIVKSKALRRIIK+LPIFQNDSI+ VDVTNSRVKFSIASKEI+L EG HCKIEGFEEEVE
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE

Query:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFPSV
        TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVP+YGIFGQYVIYFP V
Subjt:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFPSV

XP_031743957.1 uncharacterized protein LOC116404737 [Cucumis sativus]6.56e-13775.49Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKLT F+PLLD+T F A  S ++A VKFTP K F+I S+R P FIATLQLSP+WFT+FSVDH HSSKVSLESFHDA+LDGG F++M+IHLLDKTNQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE
        ILRFDTPSS+IQPLH EL LSPPQ+ED++IGQHEL+  K+FIV SK+LRR+IKELPIFQNDSI+CV VT S++KFSIAS +IV +EG HC+IEGFEEEVE
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE

Query:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFPS
        TQFQI LCPM+FFLNFTY+A+RVWFYKTKNNAYT+M+VP++GI+GQY IYFP+
Subjt:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFPS

XP_031743987.1 uncharacterized protein LOC116404759 [Cucumis sativus]1.97e-13398.98Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEE
        ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIE  EE
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEE

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida]5.81e-12975.79Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKLTNF+PLLDATS+ AQIS + ADVKFTP +F++I+ + SPRF+ATLQLS + FT++SVDH+H+SKV LESFHDAILDGGSFASMTIHLL+K NQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE
        ILRF TPSSEI PLHHEL  SPPQ  DN IG  +L+EGK+FIVKS+ALRRIIKELPIFQ+DS+VCV VT+S++KFSIASKEIVL +  HC+I GFEEEVE
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE

Query:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFP
        TQFQIIL PM+FFLNFTYKAN+VWFYKTKNN+Y++M VP++GI GQYVIYFP
Subjt:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFP

TrEMBL top hitse value%identityAlignment
A0A0A0K902 Uncharacterized protein8.86e-110100Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDS
        ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDS
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDS

A0A1S3C8J1 uncharacterized protein LOC1034980101.99e-9166.37Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLV+L  F+PL+DATS  AQ++  DADVKFTP    II S+RSP+F+ATLQLS + FT+FSVDH+ SSKVSL+ FHDA+LDGGSF+SMTIHLLD TNQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLA-EGPHCKIEGFEEEV
        +LRF+TPS ++ PLHHELALSPPQAE+  +GQ E   G +F V S+ LRRIIKELP+F  D+ V V VT S+VKFSI SKEI+L  EG HCKI G+E EV
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLA-EGPHCKIEGFEEEV

Query:  ETQFQIILCPMMFFLNFTYKANR
        ET+ Q++L PMMFFLNFTY+AN+
Subjt:  ETQFQIILCPMMFFLNFTYKANR

A0A1S3CL88 uncharacterized protein LOC1035022501.08e-16892.91Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKL NFDPLLDATSF AQISFD+AD+KFTPSKFFII+SHRSPRFIATLQLSPQWFT+FSVD+DHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE
        ILRFDTPSSEIQPLHHEL LSPPQAEDNQIGQHELDE KYFIVKSKALRRIIK+LPIFQNDSI+ VDVTNSRVKFSIASKEI+L EG HCKIEGFEEEVE
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE

Query:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFPSV
        TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVP+YGIFGQYVIYFP V
Subjt:  TQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFPSV

A0A5D3BV00 Uncharacterized protein1.55e-9173.66Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKLT F+PLLD+TS+ AQ S D+AD+KF P K F+I  +RSPRF ATLQLSP+WF+++SVDH HSSKVSLESFHDA+LDGG F+SM+IHLLDKTNQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAE
         LRFDTPSS+IQP HHEL LSPPQ+ED QI +HELD  K+ IVKSK+LRR+IKELPIF NDSI+CV VT S++KFSIASK IVL+E
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAE

A0A5D3DZ07 LINE-1 retrotransposable element ORF2 protein6.34e-12491.51Show/hide
Query:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI
        MFLVKL NFDPLLDATSF AQISFD+AD+KFTPSKFFII+SHRSPRFIATLQLSPQWFT+FSVD+DHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQ+
Subjt:  MFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDGGSFASMTIHLLDKTNQI

Query:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE
        ILRFDTPSSEIQPLHHEL LSPPQAEDNQIGQHELDE KYFIVKSKALRRIIK+LPIFQNDSI+ VDVTNSRVKFSIASKEI+L EG HCKIEGFEEEVE
Subjt:  ILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIEGFEEEVE

Query:  TQFQIILCPMMF
        TQFQIILCPM++
Subjt:  TQFQIILCPMMF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCACAATGTATTAAGAGGTGTTTATAAAAGAAAAGGCAAAGCCATTCCCCTTCATTTTCACATTTACTACACCACCATGTTCTTGGTCAAGCTTACAAACTTTGA
TCCTCTTCTTGACGCAACCTCCTTTTTTGCTCAAATTTCCTTCGACGATGCTGATGTGAAATTCACGCCTTCGAAATTCTTCATAATTTCCTCTCACCGTTCCCCTCGCT
TCATCGCAACGCTACAATTGTCGCCACAATGGTTCACTAGTTTTTCCGTTGATCATGATCATAGTTCTAAGGTTTCCCTTGAATCCTTCCATGATGCTATATTGGATGGT
GGAAGTTTTGCTTCAATGACAATCCATCTTTTGGACAAAACAAACCAAATAATCCTTAGATTTGATACTCCTTCAAGCGAAATCCAACCTTTGCATCATGAATTGGCATT
GTCACCTCCCCAAGCAGAAGACAATCAAATTGGCCAACATGAACTTGACGAAGGAAAATATTTCATAGTTAAATCTAAGGCATTAAGACGAATTATTAAAGAGTTACCTA
TCTTCCAAAATGATTCAATCGTTTGTGTTGATGTAACAAATTCTCGAGTCAAATTCTCAATTGCATCTAAGGAGATTGTTCTTGCTGAGGGTCCACACTGTAAAATCGAA
GGTTTTGAAGAAGAAGTTGAAACTCAATTCCAAATCATTCTTTGTCCTATGATGTTTTTCCTCAACTTTACATATAAAGCAAATAGGGTTTGGTTTTATAAGACAAAAAA
TAATGCTTATACTCTAATGGTTGTCCCATCTTATGGAATTTTTGGTCAATATGTAATCTATTTCCCCTCCGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCACAATGTATTAAGAGGTGTTTATAAAAGAAAAGGCAAAGCCATTCCCCTTCATTTTCACATTTACTACACCACCATGTTCTTGGTCAAGCTTACAAACTTTGA
TCCTCTTCTTGACGCAACCTCCTTTTTTGCTCAAATTTCCTTCGACGATGCTGATGTGAAATTCACGCCTTCGAAATTCTTCATAATTTCCTCTCACCGTTCCCCTCGCT
TCATCGCAACGCTACAATTGTCGCCACAATGGTTCACTAGTTTTTCCGTTGATCATGATCATAGTTCTAAGGTTTCCCTTGAATCCTTCCATGATGCTATATTGGATGGT
GGAAGTTTTGCTTCAATGACAATCCATCTTTTGGACAAAACAAACCAAATAATCCTTAGATTTGATACTCCTTCAAGCGAAATCCAACCTTTGCATCATGAATTGGCATT
GTCACCTCCCCAAGCAGAAGACAATCAAATTGGCCAACATGAACTTGACGAAGGAAAATATTTCATAGTTAAATCTAAGGCATTAAGACGAATTATTAAAGAGTTACCTA
TCTTCCAAAATGATTCAATCGTTTGTGTTGATGTAACAAATTCTCGAGTCAAATTCTCAATTGCATCTAAGGAGATTGTTCTTGCTGAGGGTCCACACTGTAAAATCGAA
GGTTTTGAAGAAGAAGTTGAAACTCAATTCCAAATCATTCTTTGTCCTATGATGTTTTTCCTCAACTTTACATATAAAGCAAATAGGGTTTGGTTTTATAAGACAAAAAA
TAATGCTTATACTCTAATGGTTGTCCCATCTTATGGAATTTTTGGTCAATATGTAATCTATTTCCCCTCCGTATGA
Protein sequenceShow/hide protein sequence
MHHNVLRGVYKRKGKAIPLHFHIYYTTMFLVKLTNFDPLLDATSFFAQISFDDADVKFTPSKFFIISSHRSPRFIATLQLSPQWFTSFSVDHDHSSKVSLESFHDAILDG
GSFASMTIHLLDKTNQIILRFDTPSSEIQPLHHELALSPPQAEDNQIGQHELDEGKYFIVKSKALRRIIKELPIFQNDSIVCVDVTNSRVKFSIASKEIVLAEGPHCKIE
GFEEEVETQFQIILCPMMFFLNFTYKANRVWFYKTKNNAYTLMVVPSYGIFGQYVIYFPSV