; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G11150 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G11150
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon opus
Genome locationClcChr11:16598497..16598954
RNA-Seq ExpressionClc11G11150
SyntenyClc11G11150
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038888126.1 uncharacterized protein LOC120078022 [Benincasa hispida]4.6e-2051.33Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQMIEDIIEEK-ASIIDARAEDETMLDDMD
        MP+SIFKKL +  ARPTT+TLQ+ADRSI +PEG+   + V++ ELTIRVDDQQVKFNV +ALKYP D  +CQ +E+++EE     ++   E++  +  M 
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQMIEDIIEEK-ASIIDARAEDETMLDDMD

Query:  EGREAEIGVDAHF
        E   A I V+++F
Subjt:  EGREAEIGVDAHF

XP_038891032.1 uncharacterized protein LOC120080447 [Benincasa hispida]4.7e-2558.82Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV------HVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQMIEDIIEEK-ASIIDARAEDET
        MPLSIFKKL +  ARPTT+TLQLADRSI HPEGKIEDV       V++GELTIRVDDQQVKFNV NALKYP D  +CQ IE++ EE     ++   E++ 
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV------HVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQMIEDIIEEK-ASIIDARAEDET

Query:  MLDDMDEGREAEIGVDAHF
         +D M E   A I V+++F
Subjt:  MLDDMDEGREAEIGVDAHF

XP_038895928.1 uncharacterized protein LOC120084099 [Benincasa hispida]1.2e-2048.48Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVH----------------------------------------VKEGELTIRVDDQQVKFNVFN
        MPLSIF+KLG+GEARPTTVTLQLADRSI H EGKIEDV                                         V++GELTI+VDDQ+VKFN+FN
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVH----------------------------------------VKEGELTIRVDDQQVKFNVFN

Query:  ALKYPDDFHSCQMIEDIIEEKASIIDARAEDE
        A  Y DD  +CQ ++D+I EK   I  + EDE
Subjt:  ALKYPDDFHSCQMIEDIIEEKASIIDARAEDE

XP_038896647.1 uncharacterized protein LOC120084908 [Benincasa hispida]1.7e-1951.28Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVK----------------------------------------EGELTIRVDDQQVKFNVFN
        MPL IFKKLG+GEA+PTT+TLQLADRSI +PEGKIEDV VK                                        +GELTIRVDDQQVKFNVFN
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVK----------------------------------------EGELTIRVDDQQVKFNVFN

Query:  ALKYPDDFHSCQMIEDI
        ALK+ ++  +CQ IED+
Subjt:  ALKYPDDFHSCQMIEDI

XP_038899815.1 uncharacterized protein LOC120087043 [Benincasa hispida]2.1e-2050Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV----------------------------------------HVKEGELTIRVDDQQVKFNVFN
        MPLSIFKKL +G ARPTT+TLQLADRSI HPEGKIEDV                                        +V++GELTI+V+DQQVKFNV N
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV----------------------------------------HVKEGELTIRVDDQQVKFNVFN

Query:  ALKYPDDFHSCQMIEDIIEE
        ALKYP D  +CQ +E++ EE
Subjt:  ALKYPDDFHSCQMIEDIIEE

TrEMBL top hitse value%identityAlignment
A0A1U8HW62 uncharacterized protein LOC1078878716.3e-1538.51Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV----------------------------------------HVKEGELTIRVDDQQVKFNVFN
        MP+SIFKKLG+G+ RPTTVTLQL DRS  HPEGKIEDV                                         V++G+LTIRV+DQ+V FNVFN
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV----------------------------------------HVKEGELTIRVDDQQVKFNVFN

Query:  ALKYPDD---FHSCQMIEDIIEE--KASIIDARAEDETM-------LDDMDEGREAEIGVD
        ALK  D+    H+  +I+ +++E  K    ++ +ED+ M        +++DE  EA++  D
Subjt:  ALKYPDD---FHSCQMIEDIIEE--KASIIDARAEDETM-------LDDMDEGREAEIGVD

A0A1U8KID3 uncharacterized protein LOC1079174683.7e-1545.38Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV----------HVKEGELTIRVDDQQVKFNVFNALKYPDDFHSC---QMIEDIIEEK-ASIID
        MP+S+F+ LG+ +ARPTTVTLQLA RS  HPE KIEDV           V++GELTIRV+DQQ+ FN+F+ LK  ++   C     IE ++EE+ AS   
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDV----------HVKEGELTIRVDDQQVKFNVFNALKYPDDFHSC---QMIEDIIEEK-ASIID

Query:  ARAEDETMLDDMDEGREAE
          ++ +T   ++DE +  E
Subjt:  ARAEDETMLDDMDEGREAE

A0A5B6V914 Uncharacterized protein4.3e-1663.64Show/hide
Query:  KLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQ---MIEDIIEEK
        KLG+G+ARP TVTLQL DRS  HPEGKIEDV  ++GELTIRV+DQ + FNVF+ALKY  D   C    +IE  IE K
Subjt:  KLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQ---MIEDIIEEK

A0A6J1CR66 uncharacterized protein LOC1110134312.1e-1541.53Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIED-----------VHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQMIEDIIEEKASIIDARA
        MPLS+ K+LG+GE RPT VTLQLADRSI +PE KIED           V V +GE+T+RV DQ++KF++++++KYP D   C  +  + E   +++    
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIED-----------VHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQMIEDIIEEKASIIDARA

Query:  -------EDETMLDDMDE
               E  +M++  DE
Subjt:  -------EDETMLDDMDE

A0A6P4N1Y6 uncharacterized protein LOC1084659771.3e-1550Show/hide
Query:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQ---MIEDIIEEKASIIDARAEDETMLDD
        MP+SI +KLG+G+ RPTTVTLQ+ DRS  HP+GKIED  V +GELT+RV+D+QV FNVFN LK  D+   C    +IE  ++E        ++ E   D 
Subjt:  MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQ---MIEDIIEEKASIIDARAEDETMLDD

Query:  MDEG
        M++G
Subjt:  MDEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACTGTCTATATTCAAGAAGTTGGGTGTAGGGGAAGCACGGCCCACCACAGTCACACTTCAGCTCGCTGACCGCTCCATTAAGCATCCCGAAGGAAAAATTGAGGA
TGTTCATGTGAAGGAAGGGGAATTAACAATTCGTGTGGACGACCAGCAGGTAAAGTTTAATGTTTTTAATGCATTGAAATATCCTGATGATTTTCATTCTTGCCAGATGA
TCGAAGATATAATTGAAGAAAAGGCAAGCATCATAGATGCACGGGCAGAAGACGAAACAATGCTGGATGACATGGATGAGGGAAGAGAAGCGGAAATAGGTGTTGATGCG
CACTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCACTGTCTATATTCAAGAAGTTGGGTGTAGGGGAAGCACGGCCCACCACAGTCACACTTCAGCTCGCTGACCGCTCCATTAAGCATCCCGAAGGAAAAATTGAGGA
TGTTCATGTGAAGGAAGGGGAATTAACAATTCGTGTGGACGACCAGCAGGTAAAGTTTAATGTTTTTAATGCATTGAAATATCCTGATGATTTTCATTCTTGCCAGATGA
TCGAAGATATAATTGAAGAAAAGGCAAGCATCATAGATGCACGGGCAGAAGACGAAACAATGCTGGATGACATGGATGAGGGAAGAGAAGCGGAAATAGGTGTTGATGCG
CACTTTTAG
Protein sequenceShow/hide protein sequence
MPLSIFKKLGVGEARPTTVTLQLADRSIKHPEGKIEDVHVKEGELTIRVDDQQVKFNVFNALKYPDDFHSCQMIEDIIEEKASIIDARAEDETMLDDMDEGREAEIGVDA
HF