; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04200 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04200
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationClcChr06:4372449..4373959
RNA-Seq ExpressionClc06G04200
SyntenyClc06G04200
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-2835.14Show/hide
Query:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP
        + +++L    P ++A S L+ IS  ADV+ + F  S+I +H    F ATL  S + F  Y+VD YYS  +SL+ FH+AM   G    +     +  +LE 
Subjt:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP

Query:  TKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLL----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYEE
         + +IL + +P+       +    L   P + + LG ++YGK+FT++S+ LR++I  L    D LV V  T++ V+F   S+ + VTK+ G C IVGYE 
Subjt:  TKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLL----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYEE

Query:  EIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP
        E    +  I  RP  FF   +Y +  RV FY+T  N+  V+  P F   +  QY +YFP
Subjt:  EIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP

KAG7013856.1 hypothetical protein SDJN02_24025, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-2736.64Show/hide
Query:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP
        + +I+LD+  PFM A+ Q+  IS  AD+KVSP SFS   +HP  GF   L+ SP+ FA Y+VD+++ + ISL  F +AM    + + L S+    + L+ 
Subjt:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP

Query:  TKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRR--VINLLD----VLVCVTVTNSEVRFH-TTSQMVVVTKKGGGCEIVG
           + L F++    R A+ +      S   +E + GA+D+  +F+I+S+DLRR  +IN        +  VTVTNS+V+      ++V+++KK G C+IVG
Subjt:  TKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRR--VINLLD----VLVCVTVTNSEVRFH-TTSQMVVVTKKGGGCEIVG

Query:  YEEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP
        YEE     QF+I L     F +++ +E E++ FY+T+ N+H  +  P F +SL+    IYFP
Subjt:  YEEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP

XP_008464344.1 PREDICTED: uncharacterized protein LOC103502250 [Cucumis melo]1.8e-2735.74Show/hide
Query:  VLVIKLDDCIPFMNALSQLSNIS-NKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLE
        + ++KL +  P ++A S L+ IS + AD+K +P  F +I +H    F ATL  SPQ F  ++VD  +S  +SL+ FH+A+   G    +     T  LL+
Subjt:  VLVIKLDDCIPFMNALSQLSNIS-NKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLE

Query:  PTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELG--AIDYGKYFTIHSEDLRRVINLL-----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIV
         T  +IL F +P+   +      T L+   +E+ ++G   +D  KYF + S+ LRR+I  L     D ++ V VTNS V+F   S+ +++T +G  C+I 
Subjt:  PTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELG--AIDYGKYFTIHSEDLRRVINLL-----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIV

Query:  GYEEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP
        G+EEE+   QF+IIL P  FF   +Y +  RV FY+T NN + +++ P +   +  QY IYFP
Subjt:  GYEEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP

XP_023000630.1 uncharacterized protein LOC111494874 [Cucurbita maxima]1.8e-3034.98Show/hide
Query:  TGVLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLL
        T + +++L +  P ++  S+L+ I+ ++D+  +P S  + T+ P   F ATL    Q F VY+V+  +   ISL+  H+A+   G      S  +T  LL
Subjt:  TGVLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLL

Query:  EPTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGY
        E T  ++L F +P    +           LP +E+ +  I+Y K   + S DLR+VI  L +     VCVTVT+S VRF   S+ ++  K+ G CEI+G+
Subjt:  EPTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGY

Query:  EEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFPVM
        + +    +F+I+L P  FF  L+Y + + V F++TI NTH V++FPVF   +  QY IYFP M
Subjt:  EEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFPVM

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida]8.7e-3036.15Show/hide
Query:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP
        + ++KL +  P ++A S L+ ISN ADVK +P  F +I  +P   F ATL  S + F  Y+VD  ++  + L+ FH+A+   G    +     T  LLE 
Subjt:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP

Query:  TKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLL-----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYE
           +IL F +P+          T+     ++    G ++ GK+F + SE LRR+I  L     D +VCV VT+S+++F   S+ +V+  K   C IVG+E
Subjt:  TKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLL-----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYE

Query:  EEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP
        EE+   QF+IILRP  FF   +Y +  +V FY+T NN++ V+  P F   +  QY IYFP
Subjt:  EEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP

TrEMBL top hitse value%identityAlignment
A0A1S3CL88 uncharacterized protein LOC1035022508.8e-2835.74Show/hide
Query:  VLVIKLDDCIPFMNALSQLSNIS-NKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLE
        + ++KL +  P ++A S L+ IS + AD+K +P  F +I +H    F ATL  SPQ F  ++VD  +S  +SL+ FH+A+   G    +     T  LL+
Subjt:  VLVIKLDDCIPFMNALSQLSNIS-NKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLE

Query:  PTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELG--AIDYGKYFTIHSEDLRRVINLL-----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIV
         T  +IL F +P+   +      T L+   +E+ ++G   +D  KYF + S+ LRR+I  L     D ++ V VTNS V+F   S+ +++T +G  C+I 
Subjt:  PTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELG--AIDYGKYFTIHSEDLRRVINLL-----DVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIV

Query:  GYEEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP
        G+EEE+   QF+IIL P  FF   +Y +  RV FY+T NN + +++ P +   +  QY IYFP
Subjt:  GYEEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP

A0A6J1DM64 uncharacterized protein LOC1110222792.2e-2637.25Show/hide
Query:  MNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEPTKTIILNFTSPT
        +NA S L+ +SN AD   S  SFS+IT +P   F ATL  S   F+ Y+VD+     + +  F  AM    + +  PS+  T         +IL F SP+
Subjt:  MNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEPTKTIILNFTSPT

Query:  RRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYEEEIFGNQFKIILR
                    LT   S+ ++LG +D  K+F+IHS D RR+I  L +     +  T+T+S+VRF+   + +V+TK+GG C I GY+   F  QF++ LR
Subjt:  RRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYEEEIFGNQFKIILR

Query:  PSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP
        P  FF  LSY ++ER+ F +T N+   V+  P FA     QY IYFP
Subjt:  PSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFP

A0A6J1H2Z8 uncharacterized protein LOC1114600113.7e-2637.45Show/hide
Query:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP
        + +++L    P M A S L+ ISN+AD+K S   FS+IT++P   F AT   S + FA Y VD+ +S  +SLQ F+NAM        +    +T    E 
Subjt:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP

Query:  TKTIILNFTSPTR-RREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYE
        T  ++L F S    R + + V    L   PS+E+ELG I + ++F+I S+D R +I  L       + V++T+S V+F   S+  ++TK+GG C IVGYE
Subjt:  TKTIILNFTSPTR-RREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYE

Query:  EEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYF
         +     F+I L P  FF  LSY    R+ FY+TI ++  V+  P F   L+ QY IYF
Subjt:  EEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYF

A0A6J1KIW5 uncharacterized protein LOC1114948748.5e-3134.98Show/hide
Query:  TGVLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLL
        T + +++L +  P ++  S+L+ I+ ++D+  +P S  + T+ P   F ATL    Q F VY+V+  +   ISL+  H+A+   G      S  +T  LL
Subjt:  TGVLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLL

Query:  EPTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGY
        E T  ++L F +P    +           LP +E+ +  I+Y K   + S DLR+VI  L +     VCVTVT+S VRF   S+ ++  K+ G CEI+G+
Subjt:  EPTKTIILNFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGY

Query:  EEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFPVM
        + +    +F+I+L P  FF  L+Y + + V F++TI NTH V++FPVF   +  QY IYFP M
Subjt:  EEEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFPVM

A0A6J1KZ05 uncharacterized protein LOC1114988873.1e-2536.29Show/hide
Query:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP
        + +++L    P   A S L+ ISN+AD+K S   FS+IT++P   F AT   S + FA Y+VD+ +S  +SLQ F++AM        +    +T    E 
Subjt:  VLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEP

Query:  TKTIILNFTSPTRRR-EANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYE
        T  ++L F S    + + + V    L   PS+E+ELG I + ++F+I S+D R +I  L       + V++T+S V+F   S+  ++TK+GG C I+GYE
Subjt:  TKTIILNFTSPTRRR-EANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVL----VCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYE

Query:  EEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYF
         E     F+I L P  FF  LSY    R+ FY+TI ++  V+  P F   L+ QY IYF
Subjt:  EEIFGNQFKIILRPSTFFCRLSYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGACGGGAGTGTTAGTGATTAAGCTTGATGACTGTATTCCTTTTATGAACGCACTTTCTCAACTCTCTAACATTTCTAACAAAGCCGACGTGAAAGTTTCGCCATT
CTCATTCTCAGTGATTACCACTCATCCTCATCTTGGCTTCACTGCTACACTCAATTTTTCTCCTCAACGCTTCGCTGTGTATGCCGTGGATCAATATTACAGTTTCAGTA
TTTCCCTTCAATTCTTCCATAATGCTATGTCCCAATTTGGCCAAGCCCAAGACTTGCCTTCATTGTGCATCACTGCTCTACTTCTTGAACCCACGAAGACAATCATCCTT
AATTTTACCTCTCCAACCAGGAGAAGAGAAGCAAATGGAGTAGCAACGACATGGTTGACATCGTTACCATCGGAAGAGAAGGAACTAGGAGCAATTGACTATGGAAAGTA
TTTCACAATTCATTCGGAAGATTTGAGGCGCGTTATAAACTTGCTAGATGTTTTAGTTTGTGTTACCGTAACCAATTCTGAAGTTAGATTCCATACTACATCTCAAATGG
TTGTTGTTACAAAAAAGGGTGGAGGATGTGAGATTGTAGGTTATGAAGAAGAAATATTTGGAAATCAATTCAAAATCATTCTTCGTCCCTCCACCTTTTTCTGTAGATTG
TCATATAATGAAACTGAAAGGGTAAGCTTTTATCAGACAATTAATAATACTCATCATGTCGTGCTTTTTCCTGTTTTTGCTGATTCTCTTTCTCCTCAATATGCAATCTA
CTTTCCCGTAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGACGGGAGTGTTAGTGATTAAGCTTGATGACTGTATTCCTTTTATGAACGCACTTTCTCAACTCTCTAACATTTCTAACAAAGCCGACGTGAAAGTTTCGCCATT
CTCATTCTCAGTGATTACCACTCATCCTCATCTTGGCTTCACTGCTACACTCAATTTTTCTCCTCAACGCTTCGCTGTGTATGCCGTGGATCAATATTACAGTTTCAGTA
TTTCCCTTCAATTCTTCCATAATGCTATGTCCCAATTTGGCCAAGCCCAAGACTTGCCTTCATTGTGCATCACTGCTCTACTTCTTGAACCCACGAAGACAATCATCCTT
AATTTTACCTCTCCAACCAGGAGAAGAGAAGCAAATGGAGTAGCAACGACATGGTTGACATCGTTACCATCGGAAGAGAAGGAACTAGGAGCAATTGACTATGGAAAGTA
TTTCACAATTCATTCGGAAGATTTGAGGCGCGTTATAAACTTGCTAGATGTTTTAGTTTGTGTTACCGTAACCAATTCTGAAGTTAGATTCCATACTACATCTCAAATGG
TTGTTGTTACAAAAAAGGGTGGAGGATGTGAGATTGTAGGTTATGAAGAAGAAATATTTGGAAATCAATTCAAAATCATTCTTCGTCCCTCCACCTTTTTCTGTAGATTG
TCATATAATGAAACTGAAAGGGTAAGCTTTTATCAGACAATTAATAATACTCATCATGTCGTGCTTTTTCCTGTTTTTGCTGATTCTCTTTCTCCTCAATATGCAATCTA
CTTTCCCGTAATGTGA
Protein sequenceShow/hide protein sequence
MGTGVLVIKLDDCIPFMNALSQLSNISNKADVKVSPFSFSVITTHPHLGFTATLNFSPQRFAVYAVDQYYSFSISLQFFHNAMSQFGQAQDLPSLCITALLLEPTKTIIL
NFTSPTRRREANGVATTWLTSLPSEEKELGAIDYGKYFTIHSEDLRRVINLLDVLVCVTVTNSEVRFHTTSQMVVVTKKGGGCEIVGYEEEIFGNQFKIILRPSTFFCRL
SYNETERVSFYQTINNTHHVVLFPVFADSLSPQYAIYFPVM