; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G15865 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G15865
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr09:23765804..23766446
RNA-Seq ExpressionClc09G15865
SyntenyClc09G15865
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON64464.1 hypothetical protein TorRG33x02_273130 [Trema orientale]1.1e-3143.65Show/hide
Query:  KGDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMD
        KGDF +WK+KMKA+L+             LP TM   +K+E+ E  +S +IL+L+DNVLR+V + +   K+W+KL+ LYL KTL+NKIYLKE+ FGFKMD
Subjt:  KGDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMD

Query:  YSKSLEENLDDFTIICTELANTGENLNAE----------------VRSAIKYERETMTVDKILDSLRLKEIELKAEKK-DLEALFVKNKGKQNSKGK
         +KSLE+NLDDF  I   LAN  E +N E                ++S IKY RE++++D +L +LR  ++E+K EK+ + E L V+ + ++  K K
Subjt:  YSKSLEENLDDFTIICTELANTGENLNAE----------------VRSAIKYERETMTVDKILDSLRLKEIELKAEKK-DLEALFVKNKGKQNSKGK

TXG49237.1 hypothetical protein EZV62_025112 [Acer yangbiense]1.3e-3247.34Show/hide
Query:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY
        GDFG+W++K+KALL              LP ++T EQKD+M E+   +IIL+LSDNVLR ++  +    +W KLE LYL K+LTNKIYLKER FGFKMD 
Subjt:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY

Query:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVK
        SK L +NLDDF  +  ELAN GE+            LN+      +V++AIKY R ++++++ + +L+ KE+ELK EKKD  E LFV+
Subjt:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVK

TXG54059.1 hypothetical protein EZV62_019315 [Acer yangbiense]1.1e-3144.12Show/hide
Query:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY
        GDFG+W++K+KALL              LP ++T EQKD+M E+   +IIL+LSDNVLR ++  +    +W KLE LYL K+LTNKIYLKER FGFKMD 
Subjt:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY

Query:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQNSKGKSN
        SK L +NLDDF  +  +LAN GE+            LN+      +V++AIKY R ++++++ + +L+ KE+ELK EKKD  E LFV+ +    +   ++
Subjt:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQNSKGKSN

Query:  QNKS
         NK+
Subjt:  QNKS

TXG64595.1 hypothetical protein EZV62_011589 [Acer yangbiense]1.0e-3246.39Show/hide
Query:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY
        GDFG+W++K+KALL              LP ++T EQKD+M E+   +IIL+LSDNVLR ++  +    +W KLE LYL K+LTNKIYLKER FGFKMD 
Subjt:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY

Query:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQN
        SK L +NLDDF  +  ELAN GE+            LN+      +V++AIKY R ++++++ + +L+ KE+ELK EKKD  E LFV++  ++N
Subjt:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQN

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]2.8e-3041.29Show/hide
Query:  GDFGLWKQKMKALLL----------HMLPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDYS
        GDF LW++K++A+L+            LP  +T  +K +M E+ +S+I+L+LSD VLR V +     ++W KLE LYL K+L NKIY+KE+FFG+KMD S
Subjt:  GDFGLWKQKMKALLL----------HMLPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDYS

Query:  KSLEENLDDFTIICTELANTGENLN----------------AEVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDLEALFVKNKGKQNS-KGKSNQNK
        KSLEENLD+F  I  +L N GE ++                 EV++AIKY R+++T+  +LD+L+ + +E+K E+KD E L  + + ++ S KGK    +
Subjt:  KSLEENLDDFTIICTELANTGENLN----------------AEVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDLEALFVKNKGKQNS-KGKSNQNK

Query:  S
        S
Subjt:  S

TrEMBL top hitse value%identityAlignment
A0A2P5CTT8 Uncharacterized protein5.5e-3243.65Show/hide
Query:  KGDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMD
        KGDF +WK+KMKA+L+             LP TM   +K+E+ E  +S +IL+L+DNVLR+V + +   K+W+KL+ LYL KTL+NKIYLKE+ FGFKMD
Subjt:  KGDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMD

Query:  YSKSLEENLDDFTIICTELANTGENLNAE----------------VRSAIKYERETMTVDKILDSLRLKEIELKAEKK-DLEALFVKNKGKQNSKGK
         +KSLE+NLDDF  I   LAN  E +N E                ++S IKY RE++++D +L +LR  ++E+K EK+ + E L V+ + ++  K K
Subjt:  YSKSLEENLDDFTIICTELANTGENLNAE----------------VRSAIKYERETMTVDKILDSLRLKEIELKAEKK-DLEALFVKNKGKQNSKGK

A0A5C7GXL9 Sucrose-phosphate phosphatase6.5e-3347.34Show/hide
Query:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY
        GDFG+W++K+KALL              LP ++T EQKD+M E+   +IIL+LSDNVLR ++  +    +W KLE LYL K+LTNKIYLKER FGFKMD 
Subjt:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY

Query:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVK
        SK L +NLDDF  +  ELAN GE+            LN+      +V++AIKY R ++++++ + +L+ KE+ELK EKKD  E LFV+
Subjt:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVK

A0A5C7HB65 gag_pre-integrs domain-containing protein5.5e-3244.12Show/hide
Query:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY
        GDFG+W++K+KALL              LP ++T EQKD+M E+   +IIL+LSDNVLR ++  +    +W KLE LYL K+LTNKIYLKER FGFKMD 
Subjt:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY

Query:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQNSKGKSN
        SK L +NLDDF  +  +LAN GE+            LN+      +V++AIKY R ++++++ + +L+ KE+ELK EKKD  E LFV+ +    +   ++
Subjt:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQNSKGKSN

Query:  QNKS
         NK+
Subjt:  QNKS

A0A5C7I661 Uncharacterized protein5.0e-3346.39Show/hide
Query:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY
        GDFG+W++K+KALL              LP ++T EQKD+M E+   +IIL+LSDNVLR ++  +    +W KLE LYL K+LTNKIYLKER FGFKMD 
Subjt:  GDFGLWKQKMKALLLHM-----------LPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDY

Query:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQN
        SK L +NLDDF  +  ELAN GE+            LN+      +V++AIKY R ++++++ + +L+ KE+ELK EKKD  E LFV++  ++N
Subjt:  SKSLEENLDDFTIICTELANTGEN------------LNA------EVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDL-EALFVKNKGKQN

A0A5D3DNU1 Putative gag-pol polyprotein1.4e-3041.29Show/hide
Query:  GDFGLWKQKMKALLL----------HMLPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDYS
        GDF LW++K++A+L+            LP  +T  +K +M E+ +S+I+L+LSD VLR V +     ++W KLE LYL K+L NKIY+KE+FFG+KMD S
Subjt:  GDFGLWKQKMKALLL----------HMLPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDYS

Query:  KSLEENLDDFTIICTELANTGENLN----------------AEVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDLEALFVKNKGKQNS-KGKSNQNK
        KSLEENLD+F  I  +L N GE ++                 EV++AIKY R+++T+  +LD+L+ + +E+K E+KD E L  + + ++ S KGK    +
Subjt:  KSLEENLDDFTIICTELANTGENLN----------------AEVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDLEALFVKNKGKQNS-KGKSNQNK

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-1228.14Show/hide
Query:  FGLWKQKMKALL----LHML-------PTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDYSK
        F  W+++M+ LL    LH +       P TM AE   ++ E   S+I LHLSD+V+  +   +    IW +LE LY+ KTLTNK+YLK++ +   M    
Subjt:  FGLWKQKMKALL----LHML-------PTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDYSK

Query:  SLEENLDDFTIICTELANTGENLNAE----------------VRSAIKYERETMTVDKILDSLRLKEIELKAEKKDLEALFVKNKGKQNSKGKSNQNKS
        +   +L+ F  + T+LAN G  +  E                + + I + + T+ +  +  +L L E   K  +   +AL  + +G+   +  +N  +S
Subjt:  SLEENLDDFTIICTELANTGENLNAE----------------VRSAIKYERETMTVDKILDSLRLKEIELKAEKKDLEALFVKNKGKQNSKGKSNQNKS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCAAGCTTGAATTGGAGAGGTTTGATGGAAAAAGGCGATTTTGGATTGTGGAAGCAGAAGATGAAGGCTCTTCTTCTCCATATGTTACCAACTACTATGACTGC
TGAACAAAAGGATGAGATGTATGAAATTGTTTTTAGTTCAATTATTCTGCATCTCTCCGATAATGTTCTACGTAGAGTTAGTAAGATTGAAAAGGTTACTAAGATTTGGG
CAAAATTAGAAAGATTATATTTGCCGAAGACTCTTACTAACAAAATATATCTCAAGGAACGTTTCTTTGGATTCAAAATGGATTATTCGAAAAGTCTTGAAGAAAATCTT
GATGATTTTACCATTATTTGTACTGAACTTGCAAATACTGGTGAAAATCTAAATGCTGAAGTTAGATCAGCCATTAAGTATGAAAGAGAGACAATGACAGTGGATAAAAT
ACTTGACTCCTTAAGACTAAAGGAGATTGAATTGAAGGCTGAAAAGAAAGACTTAGAAGCTCTTTTTGTCAAGAATAAAGGAAAACAAAATTCAAAAGGCAAGTCAAATC
AGAATAAATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCAAGCTTGAATTGGAGAGGTTTGATGGAAAAAGGCGATTTTGGATTGTGGAAGCAGAAGATGAAGGCTCTTCTTCTCCATATGTTACCAACTACTATGACTGC
TGAACAAAAGGATGAGATGTATGAAATTGTTTTTAGTTCAATTATTCTGCATCTCTCCGATAATGTTCTACGTAGAGTTAGTAAGATTGAAAAGGTTACTAAGATTTGGG
CAAAATTAGAAAGATTATATTTGCCGAAGACTCTTACTAACAAAATATATCTCAAGGAACGTTTCTTTGGATTCAAAATGGATTATTCGAAAAGTCTTGAAGAAAATCTT
GATGATTTTACCATTATTTGTACTGAACTTGCAAATACTGGTGAAAATCTAAATGCTGAAGTTAGATCAGCCATTAAGTATGAAAGAGAGACAATGACAGTGGATAAAAT
ACTTGACTCCTTAAGACTAAAGGAGATTGAATTGAAGGCTGAAAAGAAAGACTTAGAAGCTCTTTTTGTCAAGAATAAAGGAAAACAAAATTCAAAAGGCAAGTCAAATC
AGAATAAATCTTAG
Protein sequenceShow/hide protein sequence
MSASLNWRGLMEKGDFGLWKQKMKALLLHMLPTTMTAEQKDEMYEIVFSSIILHLSDNVLRRVSKIEKVTKIWAKLERLYLPKTLTNKIYLKERFFGFKMDYSKSLEENL
DDFTIICTELANTGENLNAEVRSAIKYERETMTVDKILDSLRLKEIELKAEKKDLEALFVKNKGKQNSKGKSNQNKS