CuGenDBv2

Clc04G04295 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G04295
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr04:15077827..15079631
RNA-Seq Expression: Clc04G04295
Synteny: Clc04G04295
Gene Ontology terms: NA
InterPro domains: NA
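
The genome location above packs the chromosome name and both coordinates into one string. A minimal sketch of splitting it apart, assuming the coordinates are 1-based and inclusive (the page does not say); `parse_location` is a hypothetical helper written for this illustration, not part of any CuGenDB tool:

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

chrom, start, end = parse_location("ClcChr04:15077827..15079631")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 1,805 bp.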


Homology
GenBank top hits (e value, % identity, alignment)
TYK02285.1 putative gag protein [Cucumis melo var. makuwa] (e value: 1.3e-32; % identity: 39.45)
Query:  ISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSA
        + R+   P+L +K   K  KA++DP  L +T+T ++ E+ME   YG ++L LS++VL ++ID++TT  IW KL+ ++  +DL N+ +L EKFFT+ MDS 
Subjt:  ISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSA

Query:  KTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKE----------KEQPSAEGLFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIV-
        ++LT+ L EFKK+ TEF+SLG+++ ++NE + LLNSL + Y+E          K+Q S EG F K + K++       +  N  K+KK K  +PEA +  
Subjt:  KTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKE----------KEQPSAEGLFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIV-

Query:  EGFYFYSDALTSTRDKAN
        +G + + DALT++++  N
Subjt:  EGFYFYSDALTSTRDKAN

XP_038880370.1 uncharacterized protein LOC120072018 [Benincasa hispida] (e value: 3.8e-29; % identity: 50.41)
Query:  ALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVTEFRSL
        A+ DP    ETL+  +KE +EL AYG +ILN+++S+LRQ++DQ T   +W KL  ++  KDLPNK F RE+FFT+K+D+AK+LT+NL+EFK++ +EFRS+
Subjt:  ALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVTEFRSL

Query:  GEKLKDDNEAYVLLNSLLNAYKE
         + ++++NEA++LLNSL  ++K+
Subjt:  GEKLKDDNEAYVLLNSLLNAYKE

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida] (e value: 2.5e-28; % identity: 36.64)
Query:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT
        +KA  A+ DP    + L   EKE +E  AYG ++LN+ +SVLRQ++D+ T   +W KL  ++  KDLPNK FLRE+FFT+KMD AK+LT+NL+EFK + +
Subjt:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT

Query:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE--------------------------------KEQPSAEGLFVKSKDKS-------------NLSKGGKQQG
        +FRS+G+ + ++NEA++LLNSL   +K+                                K+QP  EG F K  +K+             +  K      
Subjt:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE--------------------------------KEQPSAEGLFVKSKDKS-------------NLSKGGKQQG

Query:  KNQEKEKKAKGKQPEASIVEGFYFYSDALTST
        K +  ++   GKQ EA++ E    YSDAL +T
Subjt:  KNQEKEKKAKGKQPEASIVEGFYFYSDALTST

XP_038887098.1 uncharacterized protein LOC120077280 [Benincasa hispida] (e value: 2.5e-28; % identity: 50.78)
Query:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT
        +KA  A+ DP    ETL+  EKE +E  AYG +ILN+++SVLRQ++DQ T   +W KL  ++  KD PNK FLRE+FFT+KMD  K+LT+NL+EFK++ +
Subjt:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT

Query:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE
        EFRS+G+ + ++NEA++L NSL   +K+
Subjt:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE

XP_038896323.1 uncharacterized protein LOC120084587 [Benincasa hispida] (e value: 1.3e-29; % identity: 59.38)
Query:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT
        +KA +ALL+P +L  T+TAQEKED EL  Y                DQ+T   IWTKL+ L+A KDLP+K++L EKFFTFKMDS+KTLT N DEFKKIV 
Subjt:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT

Query:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE
        EF++LGEKL D NEAYVL NSL  +YKE
Subjt:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE
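
In BLAST-style alignments like the ones above, the middle line shows the residue letter where query and subject are identical, `+` for a conservative substitution, and a space otherwise. A minimal sketch of deriving a percent identity from that middle line, assuming the % identity column is identical columns divided by total alignment columns (the page does not state its formula); `percent_identity` is a hypothetical helper:

```python
def percent_identity(midline, alignment_length):
    """Return identical columns as a percentage of all alignment columns.

    The length is passed explicitly because trailing spaces of the
    middle line are easily lost when the text is copied.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    # Letters in the middle line mark identical residues;
    # '+' and spaces are not identities.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / alignment_length

# Toy middle line: 3 identical columns, 1 conservative (+), 1 mismatch (space).
print(percent_identity("AC+ D", 5))
```

For a multi-block alignment, the middle lines of all blocks would be concatenated first and the length taken from the concatenated query lines, gap columns included.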

TrEMBL top hits (e value, % identity, alignment)
A0A2P5FE65 Uncharacterized protein (Fragment) (e value: 1.2e-25; % identity: 49.22)
Query:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT
        +K  KAL D   L ET+   EKE++  TAY +LILNL+++VLRQV +QDT   +W+KL  L+  K L NKI+L+E+ F FKMDS K+L +NLD+FK+I  
Subjt:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT

Query:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE
           ++ EK+ D+N+A ++LNSL  +YK+
Subjt:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE

A0A5A7TAZ3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.9e-26; % identity: 51.56)
Query:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT
        + A K + DP +L  T+   +K+ ME T Y ILILN++++VLRQVI+++TT  I  KL  L+  KDLP+K+++REK F+FKM+ +KTL ENLDEFKK+  
Subjt:  RKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVT

Query:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE
        EF  LGEKL+ ++EA + +NSL + YKE
Subjt:  EFRSLGEKLKDDNEAYVLLNSLLNAYKE

A0A5A7V269 Putative gag protein (e value: 3.8e-27; % identity: 44.76)
Query:  ISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSA
        + R+   P+L +K   K  KA++DP  L +T+T ++ E+ME   YG ++L LS++VL ++ID++TT  IW KL+ ++  +DL N+ +L EKFFT+ MDS 
Subjt:  ISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSA

Query:  KTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKE
        ++LT+ L EFKK+ TEF+SLG+++ ++NE + LLNSL + Y+E
Subjt:  KTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKE

A0A5D3BSZ6 Putative gag protein (e value: 6.1e-33; % identity: 39.45)
Query:  ISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSA
        + R+   P+L +K   K  KA++DP  L +T+T ++ E+ME   YG ++L LS++VL ++ID++TT  IW KL+ ++  +DL N+ +L EKFFT+ MDS 
Subjt:  ISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSA

Query:  KTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKE----------KEQPSAEGLFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIV-
        ++LT+ L EFKK+ TEF+SLG+++ ++NE + LLNSL + Y+E          K+Q S EG F K + K++       +  N  K+KK K  +PEA +  
Subjt:  KTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKE----------KEQPSAEGLFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIV-

Query:  EGFYFYSDALTSTRDKAN
        +G + + DALT++++  N
Subjt:  EGFYFYSDALTSTRDKAN

A0A5D3DNU1 Putative gag-pol polyprotein (e value: 8.0e-25; % identity: 49.18)
Query:  LLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVTEFRSLG
        +LD   L + +T  EK DM+  AY  ++L LS+ VLR V +  TT  +W KL+ L+  K LPNKI+++EKFF +KMD +K+L ENLDEF+KIV +  ++G
Subjt:  LLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVTEFRSLG

Query:  EKLKDDNEAYVLLNSLLNAYKE
        EK+ D+N+A +LLNSL   Y+E
Subjt:  EKLKDDNEAYVLLNSLLNAYKE

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.0e-12; % identity: 28.83)
Query:  ETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVTEFRSLGEKLKDDNE
        +T+ A++  D++  A   + L+LS+ V+  +ID+DT   IWT+L+ L+  K L NK++L+++ +   M        +L+ F  ++T+  +LG K++++++
Subjt:  ETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFKMDSAKTLTENLDEFKKIVTEFRSLGEKLKDDNE

Query:  AYVLLNSLLNAYKEKEQPSAEG-LFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIVEG
        A +LLNSL ++Y         G   ++ KD ++            EK +K    Q +A I EG
Subjt:  AYVLLNSLLNAYKEKEQPSAEG-LFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIVEG

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTACAATTCTAAACAAGGGGCTTCCAGTATTAGGAGAAATTTCTCGAAGAGCCATGGATCCCATTTTAGCATCCAAGTACGATCGAAAGGCTGACAAGGCCCTTTT
GGACCCATTATCTCTCCTAGAAACATTAACTGCACAAGAAAAGGAAGACATGGAGCTGACAGCCTATGGGATTCTAATTTTGAATCTAAGTAACAGTGTCCTAAGGCAGG
TTATAGACCAAGATACAACTCCCACAATATGGACAAAATTGAAAGGTCTCTTTGCCATAAAAGACCTCCCAAATAAGATATTTCTGAGAGAAAAGTTCTTCACATTTAAG
ATGGATTCGGCTAAAACACTCACTGAAAATTTAGATGAGTTTAAGAAGATAGTAACTGAATTCAGGAGTCTTGGTGAGAAACTAAAGGATGACAATGAAGCTTATGTGCT
ATTGAATTCCCTTCTAAATGCATACAAAGAGAAAGAGCAGCCTAGTGCAGAAGGGCTATTTGTCAAGAGCAAGGACAAATCGAATTTGTCTAAAGGTGGGAAACAACAAG
GAAAAAATCAGGAAAAAGAGAAAAAGGCTAAGGGGAAGCAGCCTGAAGCTTCTATAGTTGAAGGCTTCTATTTCTACTCCGATGCACTAACTTCTACAAGAGACAAAGCT
AACCTAATAAGCCCTTTTGGGAAGCATGATTGA
mRNA sequence
ATGGCTACAATTCTAAACAAGGGGCTTCCAGTATTAGGAGAAATTTCTCGAAGAGCCATGGATCCCATTTTAGCATCCAAGTACGATCGAAAGGCTGACAAGGCCCTTTT
GGACCCATTATCTCTCCTAGAAACATTAACTGCACAAGAAAAGGAAGACATGGAGCTGACAGCCTATGGGATTCTAATTTTGAATCTAAGTAACAGTGTCCTAAGGCAGG
TTATAGACCAAGATACAACTCCCACAATATGGACAAAATTGAAAGGTCTCTTTGCCATAAAAGACCTCCCAAATAAGATATTTCTGAGAGAAAAGTTCTTCACATTTAAG
ATGGATTCGGCTAAAACACTCACTGAAAATTTAGATGAGTTTAAGAAGATAGTAACTGAATTCAGGAGTCTTGGTGAGAAACTAAAGGATGACAATGAAGCTTATGTGCT
ATTGAATTCCCTTCTAAATGCATACAAAGAGAAAGAGCAGCCTAGTGCAGAAGGGCTATTTGTCAAGAGCAAGGACAAATCGAATTTGTCTAAAGGTGGGAAACAACAAG
GAAAAAATCAGGAAAAAGAGAAAAAGGCTAAGGGGAAGCAGCCTGAAGCTTCTATAGTTGAAGGCTTCTATTTCTACTCCGATGCACTAACTTCTACAAGAGACAAAGCT
AACCTAATAAGCCCTTTTGGGAAGCATGATTGA
Protein sequence
MATILNKGLPVLGEISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFK
MDSAKTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKEKEQPSAEGLFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIVEGFYFYSDALTSTRDKA
NLISPFGKHD
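
As a quick consistency check, the CDS listed above can be translated and compared with the listed protein. A minimal sketch in plain Python, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for this illustration:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)

# CDS and protein copied from the record above.
CDS = (
    "ATGGCTACAATTCTAAACAAGGGGCTTCCAGTATTAGGAGAAATTTCTCGAAGAGCCATGGATCCCATTTTAGCATCCAAGTACGATCGAAAGGCTGACAAGGCCCTTTT"
    "GGACCCATTATCTCTCCTAGAAACATTAACTGCACAAGAAAAGGAAGACATGGAGCTGACAGCCTATGGGATTCTAATTTTGAATCTAAGTAACAGTGTCCTAAGGCAGG"
    "TTATAGACCAAGATACAACTCCCACAATATGGACAAAATTGAAAGGTCTCTTTGCCATAAAAGACCTCCCAAATAAGATATTTCTGAGAGAAAAGTTCTTCACATTTAAG"
    "ATGGATTCGGCTAAAACACTCACTGAAAATTTAGATGAGTTTAAGAAGATAGTAACTGAATTCAGGAGTCTTGGTGAGAAACTAAAGGATGACAATGAAGCTTATGTGCT"
    "ATTGAATTCCCTTCTAAATGCATACAAAGAGAAAGAGCAGCCTAGTGCAGAAGGGCTATTTGTCAAGAGCAAGGACAAATCGAATTTGTCTAAAGGTGGGAAACAACAAG"
    "GAAAAAATCAGGAAAAAGAGAAAAAGGCTAAGGGGAAGCAGCCTGAAGCTTCTATAGTTGAAGGCTTCTATTTCTACTCCGATGCACTAACTTCTACAAGAGACAAAGCT"
    "AACCTAATAAGCCCTTTTGGGAAGCATGATTGA"
)
PROTEIN = (
    "MATILNKGLPVLGEISRRAMDPILASKYDRKADKALLDPLSLLETLTAQEKEDMELTAYGILILNLSNSVLRQVIDQDTTPTIWTKLKGLFAIKDLPNKIFLREKFFTFK"
    "MDSAKTLTENLDEFKKIVTEFRSLGEKLKDDNEAYVLLNSLLNAYKEKEQPSAEGLFVKSKDKSNLSKGGKQQGKNQEKEKKAKGKQPEASIVEGFYFYSDALTSTRDKA"
    "NLISPFGKHD"
)

translated = translate(CDS)
print(len(CDS), len(translated), translated == PROTEIN)
```

The listed CDS is 693 nt (230 codons plus a TGA stop), and its translation matches the 230-residue protein sequence shown above.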