; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04353 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04353
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationClcChr08:12964450..12965167
RNA-Seq ExpressionClc08G04353
SyntenyClc08G04353
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]2.4e-1339.62Show/hide
Query:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK
        F+D +   PPR+LDPQQ            YNR++MSW Y+S NE  +G+IVG+ +A +IWE+L  +Y ++S   +  LR+ LQ IKK+GL    Y+ + +
Subjt:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK

Query:  DVSDNL
         + ++L
Subjt:  DVSDNL

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]2.4e-1339.62Show/hide
Query:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK
        F+D +   PPR+LDPQQ            YNR++MSW Y+S NE  +G+IVG+ +A +IWE+L  +Y ++S   +  LR+ LQ IKK+GL    Y+ + +
Subjt:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK

Query:  DVSDNL
         + ++L
Subjt:  DVSDNL

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]1.5e-1543.4Show/hide
Query:  FIDDTIPSPPRYLDP------------QQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK
        FID + P PPR+ DP            Q++NR+IMSW Y+S  +  MG+IVG+ +AFEIWE+L  +Y SSS  +I  LR++LQ ++KDGL   +Y+ + K
Subjt:  FIDDTIPSPPRYLDP------------QQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK

Query:  DVSDNL
        ++ + L
Subjt:  DVSDNL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.5e-1845.71Show/hide
Query:  GFIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQI
        G++D TI  PP++LD  Q            YNR++M W YSS +E++MGE+V   T  +IW SL  +Y+S +T RIMGL+++LQ ++KDG   SQYLA+I
Subjt:  GFIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQI

Query:  KDVSD
        K+++D
Subjt:  KDVSD

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]1.7e-1476.27Show/hide
Query:  MGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIKDVSDN
        MGEIVG+ +AF+IWE+LRT+YESSS   IMG  SQLQKIKKDGL  SQYLAQIKDV DN
Subjt:  MGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIKDVSDN

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein7.3e-1643.4Show/hide
Query:  FIDDTIPSPPRYLDP------------QQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK
        FID + P PPR+ DP            Q++NR+IMSW Y+S  +  MG+IVG+ +AFEIWE+L  +Y SSS  +I  LR++LQ ++KDGL   +Y+ + K
Subjt:  FIDDTIPSPPRYLDP------------QQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK

Query:  DVSDNL
        ++ + L
Subjt:  DVSDNL

A0A438JZB9 Retrovirus-related Pol polyprotein from transposon RE11.3e-1244.68Show/hide
Query:  FIDDTIPSPPRYLDPQQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIKDVSDNL
        F+ D+    P Y   Q+ NR++M W YSS  E  M +I+G +TA EIW +L  ++ ++S  RIM LR QLQ  KK GL   +YL +IK + DNL
Subjt:  FIDDTIPSPPRYLDPQQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIKDVSDNL

A0A6J1DQX7 uncharacterized protein LOC1110223157.1e-1945.71Show/hide
Query:  GFIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQI
        G++D TI  PP++LD  Q            YNR++M W YSS +E++MGE+V   T  +IW SL  +Y+S +T RIMGL+++LQ ++KDG   SQYLA+I
Subjt:  GFIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQI

Query:  KDVSD
        K+++D
Subjt:  KDVSD

A0A7J0EGI5 Uncharacterized protein1.2e-1339.62Show/hide
Query:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK
        F+D +   PPR+LDPQQ            YNR++MSW Y+S NE  +G+IVG+ +A +IWE+L  +Y ++S   +  LR+ LQ IKK+GL    Y+ + +
Subjt:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK

Query:  DVSDNL
         + ++L
Subjt:  DVSDNL

A0A7J0GPN0 UBX domain-containing protein1.2e-1339.62Show/hide
Query:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK
        F+D +   PPR+LDPQQ            YNR++MSW Y+S NE  +G+IVG+ +A +IWE+L  +Y ++S   +  LR+ LQ IKK+GL    Y+ + +
Subjt:  FIDDTIPSPPRYLDPQQ------------YNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIK

Query:  DVSDNL
         + ++L
Subjt:  DVSDNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.7e-0729.47Show/hide
Query:  GFIDDTIPSP----PRYLDPQQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIKDV
        GFID T+P P    P Y   +Q N M+M W  +S  +  +  ++   TA ++WE LR ++      +I  LR +L  +++ G    +Y  ++  V
Subjt:  GFIDDTIPSP----PRYLDPQQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSSTTRIMGLRSQLQKIKKDGLFFSQYLAQIKDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCAAGTTTGGGTACATCTGCAGACCAAGTTGAGCTACCTATGCTATCTATCTCTAAGATGTCTTCTCTTCTATCGACGACAACTACGTCTACTTTACCACCTTG
GGTTCTTCCACTTACTGTTTCATCTCCTCAAACTTCTCTGGGTTTTATTGACGATACTATCCCTTCTCCTCCTCGCTACTTGGATCCTCAGCAATATAATCGTATGATCA
TGAGTTGGTTTTACTCCTCTCACAATGAAGACGAGATGGGTGAAATTGTTGGTTTCAACACAGCCTTCGAAATTTGGGAATCTCTTCGTACAATGTATGAATCTTCGTCT
ACAACTCGGATTATGGGACTTAGATCTCAGTTGCAGAAGATTAAGAAAGATGGATTATTTTTTTCTCAATATCTTGCTCAAATCAAAGATGTTAGTGATAACTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCATCAAGTTTGGGTACATCTGCAGACCAAGTTGAGCTACCTATGCTATCTATCTCTAAGATGTCTTCTCTTCTATCGACGACAACTACGTCTACTTTACCACCTTG
GGTTCTTCCACTTACTGTTTCATCTCCTCAAACTTCTCTGGGTTTTATTGACGATACTATCCCTTCTCCTCCTCGCTACTTGGATCCTCAGCAATATAATCGTATGATCA
TGAGTTGGTTTTACTCCTCTCACAATGAAGACGAGATGGGTGAAATTGTTGGTTTCAACACAGCCTTCGAAATTTGGGAATCTCTTCGTACAATGTATGAATCTTCGTCT
ACAACTCGGATTATGGGACTTAGATCTCAGTTGCAGAAGATTAAGAAAGATGGATTATTTTTTTCTCAATATCTTGCTCAAATCAAAGATGTTAGTGATAACTTGTAG
Protein sequenceShow/hide protein sequence
MASSLGTSADQVELPMLSISKMSSLLSTTTTSTLPPWVLPLTVSSPQTSLGFIDDTIPSPPRYLDPQQYNRMIMSWFYSSHNEDEMGEIVGFNTAFEIWESLRTMYESSS
TTRIMGLRSQLQKIKKDGLFFSQYLAQIKDVSDNL