CuGenDBv2

Clc10G16123 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc10G16123
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: ClcChr10:29914545..29915209
RNA-Seq Expression: Clc10G16123
Synteny: Clc10G16123
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
CAN68990.1 hypothetical protein VITISV_015170 [Vitis vinifera] (e value: 5.5e-14, %identity: 45.54)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE---------------HYDL-EPQNWDC
        N T + F      NN V+SWLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T+ELF+++S L D +QGE                 DL E  NW C
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE---------------HYDL-EPQNWDC

Query:  I
        +
Subjt:  I

RVW54694.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 2.7e-13, %identity: 53.42)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE
        N T + F    + NN V+ WLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T+ELF+++S L D +QGE
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE

RVX02119.1 hypothetical protein CK203_025357 [Vitis vinifera] (e value: 7.2e-14, %identity: 56.16)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE
        N T + F    + NN V+SWLI SM+ +I ENF LY TAK+IWD A++T+SN E+T ELFK+KS L D +QGE
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE

RVX20874.1 hypothetical protein CK203_002528 [Vitis vinifera] (e value: 2.7e-13, %identity: 53.42)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE
        N T + F    + NN V+ WLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T+ELF+++S L D +QGE
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE

XP_042976301.1 uncharacterized protein LOC122307467 [Carya illinoinensis] (e value: 2.1e-13, %identity: 46)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE---------------HYDL-EPQNWDC
        N T + F      NN V+SWLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T ELF+++S L D +QGE                 DL E  NW C
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE---------------HYDL-EPQNWDC
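
The %identity values listed with each hit appear to be consistent with the number of identical columns in the alignment midline divided by the alignment length. The sketch below checks this for the RVW54694.1 hit (listed as 53.42), using the query and midline lines as they appear in this record. The function name `percent_identity` is hypothetical, and the formula is an assumption about how the figure was derived, not a documented property of the database.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical columns divided by alignment length, as a percentage.

    Assumption: in a BLAST-style midline, a letter marks an identical
    residue, '+' marks a conservative substitution, and a space marks
    a mismatch or gap. Only letters count toward identity.
    """
    identical = sum(ch.isalpha() for ch in midline)
    # Alignment length is the gapped query length (gap characters included).
    return 100.0 * identical / len(query)


# Query and midline lines copied from the RVW54694.1 alignment in this record.
QUERY = "NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE"
MIDLINE = "N T + F    + NN V+ WLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T+ELF+++S L D +QGE"

print(round(percent_identity(QUERY, MIDLINE), 2))  # 39 identical of 73 columns -> 53.42
```

Leading whitespace in the midline does not affect the count, since only letters are tallied.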

TrEMBL top hits (e value, %identity, alignment)
A0A438F3U7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-13, %identity: 53.42)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE
        N T + F    + NN V+ WLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T+ELF+++S L D +QGE
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE

A0A438IZK0 Retrotran_gag_3 domain-containing protein (e value: 3.5e-14, %identity: 56.16)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE
        N T + F    + NN V+SWLI SM+ +I ENF LY TAK+IWD A++T+SN E+T ELFK+KS L D +QGE
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE

A0A438J025 Uncharacterized protein (e value: 2.9e-13, %identity: 37.5)
Query:  NNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQG---------------EHYDL-EPQNWDCIPNFPLISSFENS
        N+ V+SWLI SM+  I ENF LY T K+IWD A++T+SN E+T E+F+I++TL DL+QG               +H D+ E   W C  +  L       
Subjt:  NNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQG---------------EHYDL-EPQNWDCIPNFPLISSFENS

Query:  PIPSFEE--SPICNLSLNCKLTCKEQDN
        P+PS  E  S +C+     K+    Q++
Subjt:  PIPSFEE--SPICNLSLNCKLTCKEQDN

A0A438KI33 Retrotran_gag_3 domain-containing protein (e value: 1.3e-13, %identity: 53.42)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE
        N T + F    + NN V+ WLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T+ELF+++S L D +QGE
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE

A5AG88 Integrase catalytic domain-containing protein (e value: 2.7e-14, %identity: 45.54)
Query:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE---------------HYDL-EPQNWDC
        N T + F      NN V+SWLI SM+ +I ENF LY TAK+IWDAA++T+SN E+T+ELF+++S L D +QGE                 DL E  NW C
Subjt:  NSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGE---------------HYDL-EPQNWDC

Query:  I
        +
Subjt:  I

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTCCTAATTTGTGATGATGTTCATATAGTCGTAGACAAGAAGACTACATCACATGTAAAGCAACATCACCTGAACTCGACGATCCAAAATTTCCCAAATGGA
GGGATAGGGAACAATCAAGTTGTGAGTTGGTTGATCAAGTCCATGTCTATTGAGATCAGGGAGAATTTTTACCTATATTCAACTGCCAAACAAATTTGGGATGCT
GCTCAAGATACTTTCTCAAATAAGGAAAGCACTGTTGAACTTTTCAAGATCAAGAGTACTCTCCAAGATCTCAAACAAGGGGAACATTATGATCTTGAACCACAA
AATTGGGACTGTATTCCTAACTTTCCTTTAATCTCATCCTTCGAAAATTCTCCTATACCTTCTTTTGAGGAAAGTCCCATATGTAATTTAAGCCTGAACTGCAAG
CTTACTTGCAAAGAGCAAGACAACTTGAGTGGAAGGAAATACAATCTCCACAAGTTCAGCAAAGCCAAAATCTAA
mRNA sequence
ATGGTCCTAATTTGTGATGATGTTCATATAGTCGTAGACAAGAAGACTACATCACATGTAAAGCAACATCACCTGAACTCGACGATCCAAAATTTCCCAAATGGA
GGGATAGGGAACAATCAAGTTGTGAGTTGGTTGATCAAGTCCATGTCTATTGAGATCAGGGAGAATTTTTACCTATATTCAACTGCCAAACAAATTTGGGATGCT
GCTCAAGATACTTTCTCAAATAAGGAAAGCACTGTTGAACTTTTCAAGATCAAGAGTACTCTCCAAGATCTCAAACAAGGGGAACATTATGATCTTGAACCACAA
AATTGGGACTGTATTCCTAACTTTCCTTTAATCTCATCCTTCGAAAATTCTCCTATACCTTCTTTTGAGGAAAGTCCCATATGTAATTTAAGCCTGAACTGCAAG
CTTACTTGCAAAGAGCAAGACAACTTGAGTGGAAGGAAATACAATCTCCACAAGTTCAGCAAAGCCAAAATCTAA
Protein sequence
MVLICDDVHIVVDKKTTSHVKQHHLNSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGEHYDLEPQ
NWDCIPNFPLISSFENSPIPSFEESPICNLSLNCKLTCKEQDNLSGRKYNLHKFSKAKI
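
As a consistency check on the sequences above, the CDS can be translated and compared with the listed protein. The sketch below is a minimal illustration that assumes the standard nuclear genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and the sequences are copied from this record with line breaks removed.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), built from the
# conventional TCAG ordering of codons; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, keeping stop codons as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from this record.
CDS = (
    "ATGGTCCTAATTTGTGATGATGTTCATATAGTCGTAGACAAGAAGACTACATCACATGTAAAGCAACATCACCTGAACTCGACGATCCAAAATTTCCCAAATGGA"
    "GGGATAGGGAACAATCAAGTTGTGAGTTGGTTGATCAAGTCCATGTCTATTGAGATCAGGGAGAATTTTTACCTATATTCAACTGCCAAACAAATTTGGGATGCT"
    "GCTCAAGATACTTTCTCAAATAAGGAAAGCACTGTTGAACTTTTCAAGATCAAGAGTACTCTCCAAGATCTCAAACAAGGGGAACATTATGATCTTGAACCACAA"
    "AATTGGGACTGTATTCCTAACTTTCCTTTAATCTCATCCTTCGAAAATTCTCCTATACCTTCTTTTGAGGAAAGTCCCATATGTAATTTAAGCCTGAACTGCAAG"
    "CTTACTTGCAAAGAGCAAGACAACTTGAGTGGAAGGAAATACAATCTCCACAAGTTCAGCAAAGCCAAAATCTAA"
)
PROTEIN = (
    "MVLICDDVHIVVDKKTTSHVKQHHLNSTIQNFPNGGIGNNQVVSWLIKSMSIEIRENFYLYSTAKQIWDAAQDTFSNKESTVELFKIKSTLQDLKQGEHYDLEPQ"
    "NWDCIPNFPLISSFENSPIPSFEESPICNLSLNCKLTCKEQDNLSGRKYNLHKFSKAKI"
)

translated = translate(CDS)
# The terminal '*' is the stop codon, which the protein listing omits.
print(translated.rstrip("*") == PROTEIN)
```

The same check can be run on the mRNA sequence, which in this record is identical to the CDS.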