; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04210 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04210
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationClcChr06:4376338..4380308
RNA-Seq ExpressionClc06G04210
SyntenyClc06G04210
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047513.1 hypothetical protein E6C27_scaffold498G001420 [Cucumis melo var. makuwa]8.4e-3056.43Show/hide
Query:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS
        MM MP P FAEYSVDRNH S +SL   HNA+++G++F +++ I + E++N I L FE S PS LPL+ ELTF +PMEDWS G V    GKI SIESE+F+
Subjt:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS

Query:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE
        +I+  FS +NEAD+ILIT  G++ TFSV PF  E  +TEE
Subjt:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE

KGN45637.1 hypothetical protein Csa_005027 [Cucumis sativus]1.4e-4056.44Show/hide
Query:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN
        MFSIKID LDP LDA +LF +IVND+ICLK S STFSII+RYQHP FFAM+ +P P FAEY V R+H  ++SL + H A+  GQ++ SS+ IH+ E++N 
Subjt:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN

Query:  IHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADS
        I L FEPSR S +P+  ++ F +PMEDWS GE+     K  SIES++F +I+TTF  YNE D+
Subjt:  IHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADS

KGN61523.1 hypothetical protein Csa_006491 [Cucumis sativus]1.1e-4589.32Show/hide
Query:  RVDNLDPFLDATSLFTEFINDEICVKFSPSTFSIIARYQCPPFFAMMFMPHPLFAEYSVDRNHISRISLRCFHNALLEGQSYSSMSMHLQEPQNSILLKF
        +VDNLDPFLDATSLFTEFINDEIC+KFSPSTFS+IARYQCP FFAM+FMPHPLF EYSVDRNHISRISLRCF NALLEGQSYSSMS+HL+EPQN+IL KF
Subjt:  RVDNLDPFLDATSLFTEFINDEICVKFSPSTFSIIARYQCPPFFAMMFMPHPLFAEYSVDRNHISRISLRCFHNALLEGQSYSSMSMHLQEPQNSILLKF

Query:  EPS
        EPS
Subjt:  EPS

TYK26483.1 hypothetical protein E5676_scaffold313G00490 [Cucumis melo var. makuwa]8.4e-3056.43Show/hide
Query:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS
        MM MP P FAEYSVDRNH S +SL   HNA+++G++F +++ I + E++N I L FE S PS LPL+ ELTF +PMEDWS G V    GKI SIESE+F+
Subjt:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS

Query:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE
        +I+  FS +NEAD+ILIT  G++ TFSV PF  E  +TEE
Subjt:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE

XP_031736681.1 uncharacterized protein LOC116402062 [Cucumis sativus]8.3e-6257.52Show/hide
Query:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN
        MFSIKIDD+DPFLDAT LFAE+ +DEICLK  PST SII RY+ P+FF  M +P P F EYSVDRNH S+ISL +F++A+LE Q+F +++ IH+ E +N 
Subjt:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN

Query:  IHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEESGDCLVFLWDT
        I L FEPS PS LPL+ ELTF +PMEDWS  +V    GK+ SIES++F +I+  FS + EA++ILI   G++ +FS++PF  E  +TEESG CL+F  D 
Subjt:  IHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEESGDCLVFLWDT

Query:  EVEVHMHMPLHPSSFFSNCGIQARRV
        E+E  + MPL PSSFFSNC +Q++RV
Subjt:  EVEVHMHMPLHPSSFFSNCGIQARRV

TrEMBL top hitse value%identityAlignment
A0A0A0KB76 Uncharacterized protein6.7e-4156.44Show/hide
Query:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN
        MFSIKID LDP LDA +LF +IVND+ICLK S STFSII+RYQHP FFAM+ +P P FAEY V R+H  ++SL + H A+  GQ++ SS+ IH+ E++N 
Subjt:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN

Query:  IHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADS
        I L FEPSR S +P+  ++ F +PMEDWS GE+     K  SIES++F +I+TTF  YNE D+
Subjt:  IHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADS

A0A0A0LIE0 Uncharacterized protein5.3e-4689.32Show/hide
Query:  RVDNLDPFLDATSLFTEFINDEICVKFSPSTFSIIARYQCPPFFAMMFMPHPLFAEYSVDRNHISRISLRCFHNALLEGQSYSSMSMHLQEPQNSILLKF
        +VDNLDPFLDATSLFTEFINDEIC+KFSPSTFS+IARYQCP FFAM+FMPHPLF EYSVDRNHISRISLRCF NALLEGQSYSSMS+HL+EPQN+IL KF
Subjt:  RVDNLDPFLDATSLFTEFINDEICVKFSPSTFSIIARYQCPPFFAMMFMPHPLFAEYSVDRNHISRISLRCFHNALLEGQSYSSMSMHLQEPQNSILLKF

Query:  EPS
        EPS
Subjt:  EPS

A0A1S3CL88 uncharacterized protein LOC1035022504.2e-2735.68Show/hide
Query:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN
        MF +K+ + DP LDAT+  A+I  D   LK +PS F II+ ++ P F A + + P +F  +SVD +H+SK+SL +FH+A+L+G SF +SMTIH+ ++ N 
Subjt:  MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENN

Query:  IHLIFEPSRPSALPLHHELTFLP--MEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEESGDCLVFLWD
        + L F+       PLHHELT  P   ED  +G+  L   K   ++S+    I+    ++     I + +  ++  FS+     EI +T E   C +  ++
Subjt:  IHLIFEPSRPSALPLHHELTFLP--MEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEESGDCLVFLWD

Query:  TEVEVHMHMPLHPSSFFSNCGIQARRV
         EVE    + L P  FF N   +A RV
Subjt:  TEVEVHMHMPLHPSSFFSNCGIQARRV

A0A5A7TVD7 Uncharacterized protein4.0e-3056.43Show/hide
Query:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS
        MM MP P FAEYSVDRNH S +SL   HNA+++G++F +++ I + E++N I L FE S PS LPL+ ELTF +PMEDWS G V    GKI SIESE+F+
Subjt:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS

Query:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE
        +I+  FS +NEAD+ILIT  G++ TFSV PF  E  +TEE
Subjt:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE

A0A5D3DSC2 Uncharacterized protein4.0e-3056.43Show/hide
Query:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS
        MM MP P FAEYSVDRNH S +SL   HNA+++G++F +++ I + E++N I L FE S PS LPL+ ELTF +PMEDWS G V    GKI SIESE+F+
Subjt:  MMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRPSALPLHHELTF-LPMEDWSLGEVTLLSGKILSIESEMFS

Query:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE
        +I+  FS +NEAD+ILIT  G++ TFSV PF  E  +TEE
Subjt:  NILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTCTATCAAGATTGACGACCTTGATCCTTTCTTGGATGCAACCACTCTATTCGCCGAAATTGTGAACGACGAAATCTGCCTGAAAATCTCACCGTCGACGTTTTC
GATAATTTCTCGCTACCAGCACCCTCTTTTCTTTGCAATGATGATTATGCCGCCACCATTTTTCGCTGAGTATTCCGTTGATCGGAATCACAATTCAAAGATTTCCCTAT
CGGCCTTCCACAATGCTATGTTGGAAGGCCAAAGCTTTTGTTCTTCAATGACCATCCATGTTCATGAACAAGAAAATAATATACACCTTATATTTGAGCCTTCAAGGCCT
TCTGCACTACCATTGCATCATGAATTGACATTCCTGCCTATGGAAGACTGGTCGCTTGGCGAAGTTACCTTATTAAGTGGAAAAATTTTATCCATTGAGTCAGAAATGTT
TAGCAATATTTTGACAACATTTTCTGTGTACAATGAAGCTGATTCAATTTTGATTACTTTAGTGGGTACACAAGCCACCTTCTCTGTCATTCCATTTGATCACGAGATAA
CTGTTACTGAAGAGAGTGGAGATTGTTTGGTATTTCTATGGGATACCGAAGTTGAAGTTCATATGCATATGCCTCTCCATCCATCCTCATTTTTCAGTAATTGTGGAATT
CAAGCTCGAAGGGTTGACAACCTTGATCCTTTTCTCGATGCAACCTCTCTATTCACTGAATTTATTAACGACGAGATCTGCGTGAAATTCTCGCCATCGACGTTCTCGAT
AATTGCTCGGTACCAATGCCCTCCCTTCTTTGCAATGATGTTCATGCCCCATCCATTATTCGCCGAGTATTCCGTTGATCGGAATCACATTTCCAGGATTTCCCTCAGAT
GCTTCCACAATGCTTTGTTGGAAGGCCAAAGCTATTCCTCTATGAGTATGCATCTCCAGGAACCCCAAAATAGCATACTCCTAAAATTTGAGCCTTCAAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTCTATCAAGATTGACGACCTTGATCCTTTCTTGGATGCAACCACTCTATTCGCCGAAATTGTGAACGACGAAATCTGCCTGAAAATCTCACCGTCGACGTTTTC
GATAATTTCTCGCTACCAGCACCCTCTTTTCTTTGCAATGATGATTATGCCGCCACCATTTTTCGCTGAGTATTCCGTTGATCGGAATCACAATTCAAAGATTTCCCTAT
CGGCCTTCCACAATGCTATGTTGGAAGGCCAAAGCTTTTGTTCTTCAATGACCATCCATGTTCATGAACAAGAAAATAATATACACCTTATATTTGAGCCTTCAAGGCCT
TCTGCACTACCATTGCATCATGAATTGACATTCCTGCCTATGGAAGACTGGTCGCTTGGCGAAGTTACCTTATTAAGTGGAAAAATTTTATCCATTGAGTCAGAAATGTT
TAGCAATATTTTGACAACATTTTCTGTGTACAATGAAGCTGATTCAATTTTGATTACTTTAGTGGGTACACAAGCCACCTTCTCTGTCATTCCATTTGATCACGAGATAA
CTGTTACTGAAGAGAGTGGAGATTGTTTGGTATTTCTATGGGATACCGAAGTTGAAGTTCATATGCATATGCCTCTCCATCCATCCTCATTTTTCAGTAATTGTGGAATT
CAAGCTCGAAGGGTTGACAACCTTGATCCTTTTCTCGATGCAACCTCTCTATTCACTGAATTTATTAACGACGAGATCTGCGTGAAATTCTCGCCATCGACGTTCTCGAT
AATTGCTCGGTACCAATGCCCTCCCTTCTTTGCAATGATGTTCATGCCCCATCCATTATTCGCCGAGTATTCCGTTGATCGGAATCACATTTCCAGGATTTCCCTCAGAT
GCTTCCACAATGCTTTGTTGGAAGGCCAAAGCTATTCCTCTATGAGTATGCATCTCCAGGAACCCCAAAATAGCATACTCCTAAAATTTGAGCCTTCAAGTTAG
Protein sequenceShow/hide protein sequence
MFSIKIDDLDPFLDATTLFAEIVNDEICLKISPSTFSIISRYQHPLFFAMMIMPPPFFAEYSVDRNHNSKISLSAFHNAMLEGQSFCSSMTIHVHEQENNIHLIFEPSRP
SALPLHHELTFLPMEDWSLGEVTLLSGKILSIESEMFSNILTTFSVYNEADSILITLVGTQATFSVIPFDHEITVTEESGDCLVFLWDTEVEVHMHMPLHPSSFFSNCGI
QARRVDNLDPFLDATSLFTEFINDEICVKFSPSTFSIIARYQCPPFFAMMFMPHPLFAEYSVDRNHISRISLRCFHNALLEGQSYSSMSMHLQEPQNSILLKFEPSS