CuGenDBv2

Cla97C11G206545 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C11G206545
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cla97Chr11:318812..320334
RNA-Seq Expression: Cla97C11G206545
Synteny: Cla97C11G206545
Gene Ontology terms:
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA


Homology

GenBank top hits
CAN81130.1 hypothetical protein VITISV_003944 [Vitis vinifera] | e-value: 4.9e-25 | identity: 58.18%
Query:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM
        I+KFDGT+F YW+MQ  DYL+  G + H+PL G KPESMK E+W ++D++VLG+IRLTLSR+V  +V  EKTT  LMK LS +YEKP   NK++L+TKL 
Subjt:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM

Query:  DLKMVEGAHI
        +LKM E A +
Subjt:  DLKMVEGAHI

KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | e-value: 4.9e-25 | identity: 55.75%
Query:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT
        S+  I KFDGT+F +W+MQ  DYL+  G + H PL+ KPE M +E+W ++D++VLG+IRLTLS+NV  +VA EKTT GLMK LSD+YEKP   NK++L+ 
Subjt:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT

Query:  KLMDLKMVEGAHI
        KL  LKM EG  +
Subjt:  KLMDLKMVEGAHI

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | e-value: 4.9e-25 | identity: 55.75%
Query:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT
        S+  I KFDGT+F +W+MQ  DYL+  G + H PL+ KPE M +E+W ++D++VLG+IRLTLS+NV  +VA EKTT GLMK LSD+YEKP   NK++L+ 
Subjt:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT

Query:  KLMDLKMVEGAHI
        KL  LKM EG  +
Subjt:  KLMDLKMVEGAHI

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa] | e-value: 4.9e-25 | identity: 55.75%
Query:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT
        S+  I KFDGT+F +W+MQ  DYL+  G + H PL+ KPE M +E+W ++D++VLG+IRLTLS+NV  +VA EKTT GLMK LSD+YEKP   NK++L+ 
Subjt:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT

Query:  KLMDLKMVEGAHI
        KL  LKM EG  +
Subjt:  KLMDLKMVEGAHI

RVX10218.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value: 4.9e-25 | identity: 57.27%
Query:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM
        I+KFDGT+F YW+MQ  +YL+  G + H+PL G KPESMK E+W ++D++VLG+IRLTLSR+V  ++  EKTTI LMK LSD+YEKP   NK++L+ KL 
Subjt:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM

Query:  DLKMVEGAHI
        +LKM E A +
Subjt:  DLKMVEGAHI

TrEMBL top hits
A0A0D3AMG5 Uncharacterized protein | e-value: 1.8e-25 | identity: 41.48%
Query:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT
        S+ +I KFDGT++ +W+MQ  DYL+  G + H PL+ KPE M +++W ++D++VLG+IRLTLS+NV  +VA EKTT GLMK LSD+YEKP   NK++L+ 
Subjt:  SSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVT

Query:  KLMDLKMVEGAH--IPSLLMVLFSWSSLPFHMESSSTTHLPLS---DCIPSKVFPLSMFIHTCAREITLESMQFGN
        KL  LKM E       +L++ + S   +  H  +++TT + +S     + S V       HT      +E+   GN
Subjt:  KLMDLKMVEGAH--IPSLLMVLFSWSSLPFHMESSSTTHLPLS---DCIPSKVFPLSMFIHTCAREITLESMQFGN

A0A0D3CS45 Uncharacterized protein | e-value: 1.8e-25 | identity: 50%
Query:  AIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM
        +I KFDGTNF +W+MQ  DYL+  G + H PL+ KPE M +++W ++D++VLG+IRLTLS+NV  +VA EK T GLMK LSD+YEKP   NK++L+ KL 
Subjt:  AIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM

Query:  DLKMVEG----AHIPSLLMVLFSWSSLPFHME
         LKM EG    AH+     ++   SS+    E
Subjt:  DLKMVEG----AHIPSLLMVLFSWSSLPFHME

A0A438JMM5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.4e-25 | identity: 57.27%
Query:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM
        I+KFDGT+F YW+MQ  +YL+  G + H+PL G KPESMK E+W ++D++VLG+IRLTLSR+V  ++  EKTTI LMK LSD+YEKP   NK++L+ KL 
Subjt:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM

Query:  DLKMVEGAHI
        +LKM E A +
Subjt:  DLKMVEGAHI

A5BC72 Uncharacterized protein | e-value: 3.1e-25 | identity: 58.18%
Query:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM
        I+KFDGTNF YW+MQ  DYL+  G + H+PL G KPESMK E+W ++D++VLG+IRLTLSR+V  +V  EKTT  LMK LS +YEKP   NK++L+ KL 
Subjt:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM

Query:  DLKMVEGAHI
        +LKM E A +
Subjt:  DLKMVEGAHI

A5BGX3 Uncharacterized protein | e-value: 2.4e-25 | identity: 58.18%
Query:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM
        I+KFDGT+F YW+MQ  DYL+  G + H+PL G KPESMK E+W ++D++VLG+IRLTLSR+V  +V  EKTT  LMK LS +YEKP   NK++L+TKL 
Subjt:  IDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNG-KPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYF-NKLYLVTKLM

Query:  DLKMVEGAHI
        +LKM E A +
Subjt:  DLKMVEGAHI

SwissProt top hits
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 3.0e-09 | identity: 36.97%
Query:  IDKFDGTN-FDYWKMQKNDYLHIIGNEFHIPL---NGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIY-EKPYFNKLYLVT
        + KF+G N F  W+ +  D L  I    H  L   + KP++MK EDW  +D++    IRL LS +V +++ +E T  G+   L  +Y  K   NKLYL  
Subjt:  IDKFDGTN-FDYWKMQKNDYLHIIGNEFHIPL---NGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIY-EKPYFNKLYLVT

Query:  KLMDLKMVEGAHIPSLLMV
        +L  L M EG +  S L V
Subjt:  KLMDLKMVEGAHIPSLLMV

Arabidopsis top hits
AT3G29785.1 unknown protein | e-value: 7.4e-19 | identity: 48.86%
Query:  DKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYFN
        DK DGT++ + +M+  DYL+  G + H PL  K E+M ++DW I+ ++VL +IRLT+S+N+  +VA EK+  GLMK LSDIY+KP  N
Subjt:  DKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYFN


Sequences

CDS sequence
ATGTCTAGCTCAATTGCTATTGACAAGTTTGATGGGACAAATTTTGATTACTGGAAGATGCAAAAAAATGATTATCTGCATATCATAGGCAACGAGTTTCATATACCATT
GAATGGAAAACCAGAAAGTATGAAAGAAGAAGATTGGAGGATCATTGACAAAAAAGTACTAGGGATTATTCGCTTGACTTTGTCAAGGAATGTTTTTGACCATGTGGCAA
ATGAGAAGACGACAATTGGTCTGATGAAAACTTTGTCAGATATATATGAGAAGCCTTATTTCAACAAGTTATACCTTGTCACTAAACTCATGGATCTGAAAATGGTCGAA
GGTGCACATATACCTTCATTGCTCATGGTACTCTTTTCTTGGTCTTCCCTTCCTTTTCATATGGAATCATCATCAACTACACACTTACCCTTATCTGATTGTATCCCTTC
TAAAGTTTTTCCCCTCTCCATGTTCATCCATACTTGTGCTCGAGAAATTACCCTTGAATCAATGCAGTTTGGAAATCCATCTTCACTTCTAGTGCTACGCTCCTCCCCTA
GTTTTTTGCAAACTGAGAATCCATTTTTGCTCCCATCTTCGACTGCTTAA

mRNA sequence
ATGTCTAGCTCAATTGCTATTGACAAGTTTGATGGGACAAATTTTGATTACTGGAAGATGCAAAAAAATGATTATCTGCATATCATAGGCAACGAGTTTCATATACCATT
GAATGGAAAACCAGAAAGTATGAAAGAAGAAGATTGGAGGATCATTGACAAAAAAGTACTAGGGATTATTCGCTTGACTTTGTCAAGGAATGTTTTTGACCATGTGGCAA
ATGAGAAGACGACAATTGGTCTGATGAAAACTTTGTCAGATATATATGAGAAGCCTTATTTCAACAAGTTATACCTTGTCACTAAACTCATGGATCTGAAAATGGTCGAA
GGTGCACATATACCTTCATTGCTCATGGTACTCTTTTCTTGGTCTTCCCTTCCTTTTCATATGGAATCATCATCAACTACACACTTACCCTTATCTGATTGTATCCCTTC
TAAAGTTTTTCCCCTCTCCATGTTCATCCATACTTGTGCTCGAGAAATTACCCTTGAATCAATGCAGTTTGGAAATCCATCTTCACTTCTAGTGCTACGCTCCTCCCCTA
GTTTTTTGCAAACTGAGAATCCATTTTTGCTCCCATCTTCGACTGCTTAA

Protein sequence
MSSSIAIDKFDGTNFDYWKMQKNDYLHIIGNEFHIPLNGKPESMKEEDWRIIDKKVLGIIRLTLSRNVFDHVANEKTTIGLMKTLSDIYEKPYFNKLYLVTKLMDLKMVE
GAHIPSLLMVLFSWSSLPFHMESSSTTHLPLSDCIPSKVFPLSMFIHTCAREITLESMQFGNPSSLLVLRSSPSFLQTENPFLLPSSTA
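
The CDS and protein sequences above can be cross-checked with a few lines of Python. The following is a minimal sketch, not part of the database record: the function names `translate` and `span_length` are hypothetical, the standard genetic code (NCBI translation table 1) is assumed, and the genome location is assumed to use 1-based inclusive coordinates. Only short excerpts from the start and end of the CDS are used, to keep the example small.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Using this table for this gene is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length in bp of a 'chromosome:start..end' location (1-based inclusive assumed)."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


# Excerpts of the CDS listed in this record: the first 63 nt and the last 12 nt.
CDS_START = "ATGTCTAGCTCAATTGCTATTGACAAGTTTGATGGGACAAATTTTGATTACTGGAAGATGCAA"
CDS_END = "TCGACTGCTTAA"

print(translate(CDS_START))  # MSSSIAIDKFDGTNFDYWKMQ
print(translate(CDS_END))    # STA
print(span_length("Cla97Chr11:318812..320334"))  # 1523
```

The two translated excerpts match the first 21 and last 3 residues of the protein sequence in this record. The genomic span can be larger than the CDS length, since the span covers everything between the start and end coordinates.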