; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C04G070247 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C04G070247
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCla97Chr04:10769643..10770164
RNA-Seq ExpressionCla97C04G070247
SyntenyCla97C04G070247
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN82302.1 hypothetical protein VITISV_013932 [Vitis vinifera]1.6e-3252.85Show/hide
Query:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE
        P SG  S++P +  N++    IT HKL G NYLQWSQSV+ FIYG+G+++Y+ G+A  P++ +P F  W++EN+ +MSWLINSM  +I ENFLL+ T K+
Subjt:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE

Query:  IWDAARDTFSNQENTAKLFQIET
        IWDAA++T+S+ ENT++LFQ+E+
Subjt:  IWDAARDTFSNQENTAKLFQIET

KAF5450286.1 hypothetical protein F2P56_030651 [Juglans regia]3.3e-3359.32Show/hide
Query:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA
        S  E+     +SN   IT HKL G NYLQWS SVMMFI G+G++DY+ G A  P   D KF +W  ENN VMSWLINSMT +I ENFLLY TTKEIWDAA
Subjt:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA

Query:  RDTFSNQENTAKLFQIET
        ++ +SN ENT++LF++E+
Subjt:  RDTFSNQENTAKLFQIET

KAF5451907.1 hypothetical protein F2P56_026963 [Juglans regia]1.2e-3258.47Show/hide
Query:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA
        S  E+     +SN   IT HKL G NYLQWS SVMMFI G+G++DY+ G A  P   D KF +W  ENN VMSWLINSMT +I ENFLLY T KEIWDAA
Subjt:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA

Query:  RDTFSNQENTAKLFQIET
        ++ +SN ENT++LF++E+
Subjt:  RDTFSNQENTAKLFQIET

KAG5219081.1 Retrovirus-related polyprotein from transposon [Salix suchowensis]2.7e-3564.55Show/hide
Query:  PFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAARDTFSNQ
        P  E+++  ITCHKL G NYLQWS SVMMFI G+G++DY+ G+AT P  EDP+F  W+ EN+ +MSWLINSMT EI ENFLLY T KEIWDAAR+T+SN 
Subjt:  PFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAARDTFSNQ

Query:  ENTAKLFQIE
        +NT++LF+IE
Subjt:  ENTAKLFQIE

XP_035544136.1 uncharacterized protein LOC118347921 [Juglans regia]1.2e-3258.47Show/hide
Query:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA
        S  E+     +SN   IT HKL G NYLQWS SVMMFI G+G++DY+ G A  P   D KF +W  ENN VMSWLINSMT +I ENFLLY T KEIWDAA
Subjt:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA

Query:  RDTFSNQENTAKLFQIET
        ++ +SN ENT++LF++E+
Subjt:  RDTFSNQENTAKLFQIET

TrEMBL top hitse value%identityAlignment
A0A438IYA6 Retrotrans_gag domain-containing protein7.9e-3352.85Show/hide
Query:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE
        P SG  S++P +  N++    IT HKL G NYLQWSQSV+ FIYG+G+++Y+ G+A  P++ +P F  W++EN+ +MSWLINSM  +I ENFLL+ T K+
Subjt:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE

Query:  IWDAARDTFSNQENTAKLFQIET
        IWDAA++T+S+ ENT++LFQ+E+
Subjt:  IWDAARDTFSNQENTAKLFQIET

A0A438KNE1 Copia protein1.3e-3252.85Show/hide
Query:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE
        P SG  S++P +  N++    IT HKL G NYLQWSQSV++FI G+G+++Y+ G+A  P++ +P F  W++ENN +MSWLINSM  +I ENFLL+ T K+
Subjt:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE

Query:  IWDAARDTFSNQENTAKLFQIET
        IWDAA++T+S+ ENT++LFQ+E+
Subjt:  IWDAARDTFSNQENTAKLFQIET

A0A6P9E9X7 uncharacterized protein LOC1183479216.0e-3358.47Show/hide
Query:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA
        S  E+     +SN   IT HKL G NYLQWS SVMMFI G+G++DY+ G A  P   D KF +W  ENN VMSWLINSMT +I ENFLLY T KEIWDAA
Subjt:  SGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAA

Query:  RDTFSNQENTAKLFQIET
        ++ +SN ENT++LF++E+
Subjt:  RDTFSNQENTAKLFQIET

A5AG88 Integrase catalytic domain-containing protein3.9e-3262.04Show/hide
Query:  ESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAARDTFSNQENT
        +SN   IT HKL G NYLQWS SVMM I G+G++DY+ G A  P   D KF +W  ENN VMSWLINSMT +I ENFLLY T KEIWDAA++T+SN ENT
Subjt:  ESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAARDTFSNQENT

Query:  AKLFQIET
         +LF++E+
Subjt:  AKLFQIET

A5AL90 Retrotrans_gag domain-containing protein7.9e-3352.85Show/hide
Query:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE
        P SG  S++P +  N++    IT HKL G NYLQWSQSV+ FIYG+G+++Y+ G+A  P++ +P F  W++EN+ +MSWLINSM  +I ENFLL+ T K+
Subjt:  PISGAESQVPFIESNNTQ---ITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSEDPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKE

Query:  IWDAARDTFSNQENTAKLFQIET
        IWDAA++T+S+ ENT++LFQ+E+
Subjt:  IWDAARDTFSNQENTAKLFQIET

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTCGCTGAAGTATCTGTTCTTCTCTCTGATGTTGCTCGTTGATTGGCCGTCACTGTCTGAAGTCCTCACCACCGCTGCCATCATCAACTCCTCTGTCGCAGCCGC
CACCGCAGTAGTTTATAAACTAATTTTTTTCCCCTCTTTTGTTCCAATATCAGGAGCAGAATCTCAAGTTCCCTTCATCGAGAGTAACAATACTCAAATAACATGTCACA
AACTCAAAGGCCCTAATTATCTCCAATGGTCACAATCTGTGATGATGTTCATATATGGTCGTGGACAAGAAGACTATATCATGGGTAAGGCAACCTCACCTAAATCCGAA
GATCCTAAATTTTGCATATGGAGGGTTGAAAACAATCAAGTTATGAGTTGGTTAATCAACTCTATGACTACTGAGATCAGAGAGAATTTTCTCCTATATTCCACCACCAA
AGAAATTTGGGATGCTGCTCGAGATACTTTCTCAAATCAGGAGAACACTGCTAAACTTTTTCAGATCGAGACTACTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTCGCTGAAGTATCTGTTCTTCTCTCTGATGTTGCTCGTTGATTGGCCGTCACTGTCTGAAGTCCTCACCACCGCTGCCATCATCAACTCCTCTGTCGCAGCCGC
CACCGCAGTAGTTTATAAACTAATTTTTTTCCCCTCTTTTGTTCCAATATCAGGAGCAGAATCTCAAGTTCCCTTCATCGAGAGTAACAATACTCAAATAACATGTCACA
AACTCAAAGGCCCTAATTATCTCCAATGGTCACAATCTGTGATGATGTTCATATATGGTCGTGGACAAGAAGACTATATCATGGGTAAGGCAACCTCACCTAAATCCGAA
GATCCTAAATTTTGCATATGGAGGGTTGAAAACAATCAAGTTATGAGTTGGTTAATCAACTCTATGACTACTGAGATCAGAGAGAATTTTCTCCTATATTCCACCACCAA
AGAAATTTGGGATGCTGCTCGAGATACTTTCTCAAATCAGGAGAACACTGCTAAACTTTTTCAGATCGAGACTACTTTTTGA
Protein sequenceShow/hide protein sequence
MKSLKYLFFSLMLLVDWPSLSEVLTTAAIINSSVAAATAVVYKLIFFPSFVPISGAESQVPFIESNNTQITCHKLKGPNYLQWSQSVMMFIYGRGQEDYIMGKATSPKSE
DPKFCIWRVENNQVMSWLINSMTTEIRENFLLYSTTKEIWDAARDTFSNQENTAKLFQIETTF