; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04180 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04180
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationClcChr06:4357096..4359874
RNA-Seq ExpressionClc06G04180
SyntenyClc06G04180
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458680.1 PREDICTED: uncharacterized protein LOC103498008 [Cucumis melo]6.4e-2441.58Show/hide
Query:  QLHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQ
        Q+H  +EAISVLTET +D FDLKFS + FSIM A SPSSRCI++LQLSP  F  Y C  L +K ++ + F  T    Q  GF+ L F+    +Q+P    
Subjt:  QLHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQ

Query:  KFTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDN
               H  Y +                  +A+  F N E G +    +    S++++ D  EFD   FVSIESQEFINII RF +FDN
Subjt:  KFTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDN

XP_022958855.1 uncharacterized protein LOC111460009 [Cucurbita moschata]7.7e-3846.01Show/hide
Query:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK
        +HDLV+A SVL ET+DD FD+KFS + FSIMAAT+PSS CI+ LQLSP FF+ Y C++L YK +YI++FY    N +R GF+ L F   + D+       
Subjt:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK

Query:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK
             +  R  +   I +        +L   A+  F +   G +E  +LP   S   M D   FD   FVSI+SQEFINI+T F++FD VLVTL +SQV 
Subjt:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK

Query:  FSYGSTQIFLTKE
        FSYG T+I LT+E
Subjt:  FSYGSTQIFLTKE

XP_023006011.1 uncharacterized protein LOC111498888 [Cucurbita maxima]3.1e-3946.95Show/hide
Query:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK
        +HDLV+A SVL ET+DD FD+KFS + FSIMAAT+PSSRCI+ALQLSP FF+ Y C++L YK +YI++FY    N +R GF+ L F   + D+       
Subjt:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK

Query:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK
             +  R  +   I +        +L   A+  F +   G +E  +LP   S+  M D   FD   FVSI+SQEFINI+T F++FD VLVTL++SQV 
Subjt:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK

Query:  FSYGSTQIFLTKE
        FSYG T+I LT+E
Subjt:  FSYGSTQIFLTKE

XP_023548336.1 uncharacterized protein LOC111807004 [Cucurbita pepo subsp. pepo]2.4e-3947.42Show/hide
Query:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK
        +HDLV+A SVL ETFDD FD+KFS + FSIMAAT+PSSRCI+ALQLSP FF+ Y C++L YK +YI++FY    N +R GF+ L F   + D+       
Subjt:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK

Query:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK
             +  R  +   I +        +L   A+  F +   G +E  +LP   S+  M D   FD   FVSI+SQEFINI+T F++FD VLVTL +SQV 
Subjt:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK

Query:  FSYGSTQIFLTKE
        FSYG T+I LT+E
Subjt:  FSYGSTQIFLTKE

XP_038875059.1 uncharacterized protein LOC120067585 [Benincasa hispida]8.0e-1979.41Show/hide
Query:  DLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVKFSYGSTQIFLTKEVN
        DLPFCAST  MDD  EFD A FVSIESQ+FINIIT FSNFDNVLVT+SSSQVKFSYG T I LT+E N
Subjt:  DLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVKFSYGSTQIFLTKEVN

TrEMBL top hitse value%identityAlignment
A0A1S3C9N3 uncharacterized protein LOC1034980083.1e-2441.58Show/hide
Query:  QLHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQ
        Q+H  +EAISVLTET +D FDLKFS + FSIM A SPSSRCI++LQLSP  F  Y C  L +K ++ + F  T    Q  GF+ L F+    +Q+P    
Subjt:  QLHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQ

Query:  KFTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDN
               H  Y +                  +A+  F N E G +    +    S++++ D  EFD   FVSIESQEFINII RF +FDN
Subjt:  KFTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDN

A0A6J1H4N0 uncharacterized protein LOC1114600093.7e-3846.01Show/hide
Query:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK
        +HDLV+A SVL ET+DD FD+KFS + FSIMAAT+PSS CI+ LQLSP FF+ Y C++L YK +YI++FY    N +R GF+ L F   + D+       
Subjt:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK

Query:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK
             +  R  +   I +        +L   A+  F +   G +E  +LP   S   M D   FD   FVSI+SQEFINI+T F++FD VLVTL +SQV 
Subjt:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK

Query:  FSYGSTQIFLTKE
        FSYG T+I LT+E
Subjt:  FSYGSTQIFLTKE

A0A6J1H801 uncharacterized protein LOC1114603978.4e-1430.37Show/hide
Query:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK
        + DLV+A SV+    DD  D+KF +  F+IMA + P+   ++AL L P FFD Y CN+L     +++N +    +++ +GF  L F     +        
Subjt:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK

Query:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK
                ++QA N +              N V     P     E  D               FD ++FVS+ES+EF+NI+T +  FD V VT++S++V 
Subjt:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK

Query:  FSYG-STQIFLTKE
        FSY    +  LT+E
Subjt:  FSYG-STQIFLTKE

A0A6J1KWL1 uncharacterized protein LOC1114988881.5e-3946.95Show/hide
Query:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK
        +HDLV+A SVL ET+DD FD+KFS + FSIMAAT+PSSRCI+ALQLSP FF+ Y C++L YK +YI++FY    N +R GF+ L F   + D+       
Subjt:  LHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK

Query:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK
             +  R  +   I +        +L   A+  F +   G +E  +LP   S+  M D   FD   FVSI+SQEFINI+T F++FD VLVTL++SQV 
Subjt:  FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVK

Query:  FSYGSTQIFLTKE
        FSYG T+I LT+E
Subjt:  FSYGSTQIFLTKE

A0A6J1L2R2 uncharacterized protein LOC1114993086.9e-1630.49Show/hide
Query:  LVFMVNSNQLHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDS
        ++F    NQ+ DLV+A SV+    DD  D+KFS+  F+IMA + P+   ++AL L P FFD Y CN+L     +++N +    +++ +GF  L F     
Subjt:  LVFMVNSNQLHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDS

Query:  DQEPGFFQKFTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVL
        +                ++QA N +              N V     P     E  D               FD ++FVS++S+EF+NI+T +  FD V 
Subjt:  DQEPGFFQKFTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVL

Query:  VTLSSSQVKFSYG-STQIFLTKE
        VT++S++V FSY    +  LT+E
Subjt:  VTLSSSQVKFSYG-STQIFLTKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGTTTTCATGGTTAATAGTAACCAACTCCATGATTTGGTAGAAGCCATCTCTGTCCTCACCGAGACTTTCGATGACACGTTTGACCTCAAATTCTCACAGTCAAG
GTTCTCTATAATGGCGGCTACATCACCTTCGTCTCGTTGCATTTTAGCACTACAATTATCCCCTAGTTTCTTCGATGTATATGCTTGCAACAAACTTGAGTATAAACTTG
TTTACATCCAAAACTTTTATCATACTAAGTTCAATCTCCAACGCAATGGTTTTACTGAGTTGGGTTTCGCTTGTTTTGATTCTGACCAAGAGCCTGGTTTTTTTCAAAAG
TTTACTTGGCTGATCTTGCATAAACGATACCAAGCTCGAAATGATATCCGGATGCTTCTTGATATTGATGAAGATGGATACCTAGATCGAAATGCGGTTCGAATCTTTGC
TAATCCTGAAGCTGGTACTTATGAGAGAAATGATCTACCATTTTGTGCTTCGACTGATAGGATGGACGATACTTTCGAATTTGATTGTGCAAATTTTGTCTCCATTGAAT
CACAAGAGTTCATAAACATTATCACACGGTTTAGTAATTTTGATAATGTTCTTGTTACTCTATCAAGTTCACAAGTCAAGTTCTCTTATGGAAGCACACAGATTTTTCTA
ACTAAAGAGGTAAATATTTGGGTCTCTTTTGATGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAGTTTTCATGGTTAATAGTAACCAACTCCATGATTTGGTAGAAGCCATCTCTGTCCTCACCGAGACTTTCGATGACACGTTTGACCTCAAATTCTCACAGTCAAG
GTTCTCTATAATGGCGGCTACATCACCTTCGTCTCGTTGCATTTTAGCACTACAATTATCCCCTAGTTTCTTCGATGTATATGCTTGCAACAAACTTGAGTATAAACTTG
TTTACATCCAAAACTTTTATCATACTAAGTTCAATCTCCAACGCAATGGTTTTACTGAGTTGGGTTTCGCTTGTTTTGATTCTGACCAAGAGCCTGGTTTTTTTCAAAAG
TTTACTTGGCTGATCTTGCATAAACGATACCAAGCTCGAAATGATATCCGGATGCTTCTTGATATTGATGAAGATGGATACCTAGATCGAAATGCGGTTCGAATCTTTGC
TAATCCTGAAGCTGGTACTTATGAGAGAAATGATCTACCATTTTGTGCTTCGACTGATAGGATGGACGATACTTTCGAATTTGATTGTGCAAATTTTGTCTCCATTGAAT
CACAAGAGTTCATAAACATTATCACACGGTTTAGTAATTTTGATAATGTTCTTGTTACTCTATCAAGTTCACAAGTCAAGTTCTCTTATGGAAGCACACAGATTTTTCTA
ACTAAAGAGGTAAATATTTGGGTCTCTTTTGATGTTTAG
Protein sequenceShow/hide protein sequence
MLVFMVNSNQLHDLVEAISVLTETFDDTFDLKFSQSRFSIMAATSPSSRCILALQLSPSFFDVYACNKLEYKLVYIQNFYHTKFNLQRNGFTELGFACFDSDQEPGFFQK
FTWLILHKRYQARNDIRMLLDIDEDGYLDRNAVRIFANPEAGTYERNDLPFCASTDRMDDTFEFDCANFVSIESQEFINIITRFSNFDNVLVTLSSSQVKFSYGSTQIFL
TKEVNIWVSFDV