CuGenDBv2

Clc08G04750 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G04750
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Gag/pol protein
Genome location: ClcChr08:14296897..14297457
RNA-Seq Expression: Clc08G04750
Synteny: Clc08G04750
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
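
The genome location string above can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); the function name `parse_location` is hypothetical and used only for illustration.

```python
import re


def parse_location(location: str):
    """Split 'chrom:start..end' into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates (an assumption for this sketch).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Inclusive coordinates: both end points count towards the length.
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("ClcChr08:14296897..14297457")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption the span is 561 bp, which equals the length of the CDS sequence listed further down this record.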


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.7e-81 | % identity: 81.18
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK I KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK T GSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE  K+AVWLRKFL+DL++VP+M LPITLY+DNS AVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.1e-82 | % identity: 82.26
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK + KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.1e-82 | % identity: 82.8
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK I KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.1e-82 | % identity: 82.26
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK + KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.0e-81 | % identity: 81.72
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWT VK I KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+ G V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SIN2 Gag/pol protein | e value: 1.3e-81 | % identity: 81.18
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK I KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK T GSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE  K+AVWLRKFL+DL++VP+M LPITLY+DNS AVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

A0A5A7TKM4 Gag/pol protein | e value: 5.2e-83 | % identity: 82.8
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK I KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

A0A5A7TZD0 Gag/pol protein | e value: 5.2e-83 | % identity: 82.26
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK + KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

A0A5A7UYE8 Gag/pol protein | e value: 5.2e-83 | % identity: 82.26
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWTAVK + KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+GG V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

A0A5A7V1F5 Gag/pol protein | e value: 9.9e-82 | % identity: 81.72
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        MRRIP ASA+GSLMY ML T  +ICY VGI SRYQSNPGLDHWT VK I KYLRRTRDY LVYGAKDLILTGYTDSDFQT+KD RK TSGSVFTL+ G V
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
        VWRSIKQGCIADSTMEAEYVA CE AK+AVWLRKFL+DL++VP+M LPITLY DNSGAVANSKEP  HKRGKHI++KYHLI++IVQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein | e value: 1.0e-27 | % identity: 37.1
Query:  PNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYG---AKDLILTGYTDSDFQTNKDCRKFTSGSVFTL-SGGVV
        P  S +G LMY ML T  ++   V I SRY S    + W  +K + +YL+ T D KL++    A +  + GY DSD+  ++  RK T+G +F +    ++
Subjt:  PNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYG---AKDLILTGYTDSDFQTNKDCRKFTSGSVFTL-SGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
         W + +Q  +A S+ EAEY+A+ E  ++A+WL+  L  + I   +E PI +Y DN G ++ +  P  HKR KHID KYH  ++ VQ
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

P0CV72 Secreted RxLR effector protein 161 | e value: 1.7e-22 | % identity: 40.6
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVY-GAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGV
        M+ +P  SA+G++MY M+ T  ++   VG+ S++ S+P   HW A+K + +YL+ T+ Y L +  A    L GY+D+D+  + + R+ TSG +F L+GG 
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVY-GAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGV

Query:  VVWRSIKQGCIADSTMEAEYVAVCEVAKKAVWL
        V WRS KQ  +A S+ E EY+A+ E  ++AVWL
Subjt:  VVWRSIKQGCIADSTMEAEYVAVCEVAKKAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.4e-39 | % identity: 44.32
Query:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV
        M ++P +SA+GSLMY M+ T  +I + VG+ SR+  NPG +HW AVK I +YLR T    L +G  D IL GYTD+D   + D RK ++G +FT SGG +
Subjt:  MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVV

Query:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIV
         W+S  Q C+A ST EAEY+A  E  K+ +WL++FL +L +    +    +Y D+  A+  SK    H R KHID +YH I+++V
Subjt:  VWRSIKQGCIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.9e-18 | % identity: 33.15
Query:  MGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKL-VYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVVVWRSIKQG
        +GSL Y + +T  ++ Y V   S+Y   P  DHW A+K + +YL  T D+ + +     L L  Y+D+D+  + D    T+G +  L    + W S KQ 
Subjt:  MGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKL-VYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVVVWRSIKQG

Query:  CIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
         +  S+ EAEY +V   + +  W+   L +L I   +  P  +Y DN GA      P  H R KHI   YH I+  VQ
Subjt:  CIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 4.0e-19 | % identity: 30.86
Query:  MGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAK-DLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVVVWRSIKQG
        +G LMY +  T L+I + V   S++   P L H  AV  I  Y++ T    L Y ++ ++ L  ++D+ FQ+ KD R+ T+G    L   ++ W+S KQ 
Subjt:  MGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAK-DLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVVVWRSIKQG

Query:  CIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQK
         ++ S+ EAEY A+     + +WL +F  +L++   +  P  L+ DN+ A+  +     H+R KHI+   H +++
Subjt:  CIADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQK


Sequences
CDS sequence
ATGAGACGTATTCCCAATGCCTCAGCTATGGGCAGCTTAATGTATGATATGCTCTACACTAGTTTAGAAATTTGTTATGTAGTGGGAATAGATAGTAGGTACCAGTCCAA
TCCAGGATTAGACCACTGGACTGCGGTTAAGAGCATCTTTAAGTATCTTAGGAGAACGAGGGACTACAAGCTGGTGTATGGAGCTAAAGATTTGATCCTTACGGGATACA
CCGACTCTGATTTTCAAACTAACAAGGATTGTAGGAAATTCACGTCAGGATCAGTGTTCACCTTAAGCGGAGGAGTTGTAGTATGGCGTAGCATCAAGCAAGGATGCATT
GCAGACTCTACTATGGAGGCTGAGTATGTAGCTGTTTGTGAAGTAGCAAAGAAAGCAGTCTGGCTAAGAAAGTTCCTAAATGATTTAAAAATTGTTCCAGACATGGAATT
GCCCATCACGTTATACTTTGACAATAGTGGGGCAGTAGCCAACTCTAAAGAGCCTGGCGACCACAAACGAGGAAAGCATATTGACAAGAAGTATCACCTTATACAGAAGA
TTGTGCAATGA
mRNA sequence
ATGAGACGTATTCCCAATGCCTCAGCTATGGGCAGCTTAATGTATGATATGCTCTACACTAGTTTAGAAATTTGTTATGTAGTGGGAATAGATAGTAGGTACCAGTCCAA
TCCAGGATTAGACCACTGGACTGCGGTTAAGAGCATCTTTAAGTATCTTAGGAGAACGAGGGACTACAAGCTGGTGTATGGAGCTAAAGATTTGATCCTTACGGGATACA
CCGACTCTGATTTTCAAACTAACAAGGATTGTAGGAAATTCACGTCAGGATCAGTGTTCACCTTAAGCGGAGGAGTTGTAGTATGGCGTAGCATCAAGCAAGGATGCATT
GCAGACTCTACTATGGAGGCTGAGTATGTAGCTGTTTGTGAAGTAGCAAAGAAAGCAGTCTGGCTAAGAAAGTTCCTAAATGATTTAAAAATTGTTCCAGACATGGAATT
GCCCATCACGTTATACTTTGACAATAGTGGGGCAGTAGCCAACTCTAAAGAGCCTGGCGACCACAAACGAGGAAAGCATATTGACAAGAAGTATCACCTTATACAGAAGA
TTGTGCAATGA
Protein sequence
MRRIPNASAMGSLMYDMLYTSLEICYVVGIDSRYQSNPGLDHWTAVKSIFKYLRRTRDYKLVYGAKDLILTGYTDSDFQTNKDCRKFTSGSVFTLSGGVVVWRSIKQGCI
ADSTMEAEYVAVCEVAKKAVWLRKFLNDLKIVPDMELPITLYFDNSGAVANSKEPGDHKRGKHIDKKYHLIQKIVQ
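
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed in this record.
print(translate("ATGAGACGTATTCCCAATGCCTCAGCTATG"))
print(translate("ATTGTGCAATGA"))
```

The two fragments translate to the start (`MRRIPNASAM`) and end (`IVQ`) of the protein sequence shown above; passing the full 561-nt CDS to the same function would be expected to yield the complete 186-residue protein.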