; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G01050 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G01050
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr04:2967292..2971166
RNA-Seq ExpressionClc04G01050
SyntenyClc04G01050
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.5e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

A0A5A7TWB9 Gag/pol protein1.5e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

A0A5D3BHG7 Gag/pol protein1.5e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

A0A5D3CPJ6 Gag/pol protein1.5e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

A0A5D3CSZ6 Gag/pol protein1.5e-5848.62Show/hide
Query:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------
        V+  GCKWIYKRKRG DGKVQTFKARLVAKGYT                                                                   
Subjt:  VEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYT-------------------------------------------------------------------

Query:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN
                     Q S+SWNIRFDTAIK Y FDQ VD+PCVYK+IIN S+AFLVLY+DDILLIGND                              I R+
Subjt:  -------------QTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGND------------------------------IIRN

Query:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS
        RKNK  ALSQASYIDK++V+YSM NSKR LLPFRHG+ L+K+QC KT Q+V+EMR IPYA AV SLMY +LCTR DICYAV IVSRYQS+
Subjt:  RKNKTSALSQASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSS

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-0626.11Show/hide
Query:  GYTQTSQSWNIRFDTAIKVYDFDQNVDKPCVY---KKIINNSIAFLVLYMDDILLIGNDIIRNRKNKTSA----------------------------LS
        G  Q ++ W   F+ A+K  +F  +    C+Y   K  IN +I +++LY+DD+++   D+ R    K                               LS
Subjt:  GYTQTSQSWNIRFDTAIKVYDFDQNVDKPCVY---KKIINNSIAFLVLYMDDILLIGNDIIRNRKNKTSA----------------------------LS

Query:  QASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQS
        Q++Y+ K+L +++M N      P    I     + L + ++       P    +  LMY++LCTR D+  AV I+SRY S
Subjt:  QASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-2033.15Show/hide
Query:  GYTQTSQSWNIRFDTAIKVYDFDQNVDKPCVY-KKIINNSIAFLVLYMDDILLIGND------------------------------IIRNRKNKTSALS
        G  Q  + W ++FD+ +K   + +    PCVY K+   N+   L+LY+DD+L++G D                              I+R R ++   LS
Subjt:  GYTQTSQSWNIRFDTAIKVYDFDQNVDKPCVY-KKIINNSIAFLVLYMDDILLIGND------------------------------IIRNRKNKTSALS

Query:  QASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRY
        Q  YI+++L R++M N+K    P    + L+KK C  T +E   M  +PY+ AV SLMY ++CTR DI +AV +VSR+
Subjt:  QASYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.3e-0666.67Show/hide
Query:  GCKWIYKRKRGVDGKVQTFKARLVAKGYTQ
        GCKW+YK K   DG ++ +KARLVAKGYTQ
Subjt:  GCKWIYKRKRGVDGKVQTFKARLVAKGYTQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGAAACAAGGCAACGCAATCAAGAGCGTCACTGCGCTCCCCTCGACGCTCTCGACCCGAGAGCATCAAATTTTGGAGGTTGCATGAGCGTCACGTCACTGAAGTT
TAGCGTCATCGCATCACTGCGTTTGAGGTTAGGTTTTGATGACAGACAGCATTGCGACACTGCCTATATGATGATGGACAATGTCGTTGCGCTCTCCAAGAGCGCAGAGA
CAAGAGAGAGACACAAAGGAAACACACAAAAACACATCAAGAATATGAGAATTGGAGACCGAGAGCAAGAGTTTACTGCAAGGTCAATCGTGTTAAATGAACTTTCCAAT
GAAACTACTGAAACTTCAACAAGAGTTGTTGAAGAAGCTGGTTGCAAATGGATCTACAAGAGGAAAAGAGGTGTAGATGGTAAGGTGCAAACCTTTAAGGCTAGACTAGT
GGCAAAGGGTTATACCCAGACATCTCAATCTTGGAATATAAGATTTGATACTGCGATCAAGGTTTACGACTTTGACCAAAATGTTGATAAACCTTGTGTCTACAAGAAAA
TCATCAACAATTCAATAGCTTTCTTAGTGTTGTATATGGATGATATCCTTCTCATTGGGAATGATATCATAAGGAATCGTAAAAACAAAACGTCAGCCTTGTCTCAAGCA
TCGTATATTGACAAGATGCTTGTCAGGTATTCAATGCACAATTCCAAGAGGGACTTATTACCTTTTCGGCATGGAATTGTATTGACTAAGAAACAGTGTCTCAAGACTTC
TCAAGAGGTTAAAGAAATGAGATGGATTCCCTATGCATTGGCTGTAAGCAGCCTTATGTATGTTGTGTTGTGTACAAGGCTTGACATTTGCTATGCAGTTGAGATTGTTA
GTAGATATCAGTCTTCCAGGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAGAAACAAGGCAACGCAATCAAGAGCGTCACTGCGCTCCCCTCGACGCTCTCGACCCGAGAGCATCAAATTTTGGAGGTTGCATGAGCGTCACGTCACTGAAGTT
TAGCGTCATCGCATCACTGCGTTTGAGGTTAGGTTTTGATGACAGACAGCATTGCGACACTGCCTATATGATGATGGACAATGTCGTTGCGCTCTCCAAGAGCGCAGAGA
CAAGAGAGAGACACAAAGGAAACACACAAAAACACATCAAGAATATGAGAATTGGAGACCGAGAGCAAGAGTTTACTGCAAGGTCAATCGTGTTAAATGAACTTTCCAAT
GAAACTACTGAAACTTCAACAAGAGTTGTTGAAGAAGCTGGTTGCAAATGGATCTACAAGAGGAAAAGAGGTGTAGATGGTAAGGTGCAAACCTTTAAGGCTAGACTAGT
GGCAAAGGGTTATACCCAGACATCTCAATCTTGGAATATAAGATTTGATACTGCGATCAAGGTTTACGACTTTGACCAAAATGTTGATAAACCTTGTGTCTACAAGAAAA
TCATCAACAATTCAATAGCTTTCTTAGTGTTGTATATGGATGATATCCTTCTCATTGGGAATGATATCATAAGGAATCGTAAAAACAAAACGTCAGCCTTGTCTCAAGCA
TCGTATATTGACAAGATGCTTGTCAGGTATTCAATGCACAATTCCAAGAGGGACTTATTACCTTTTCGGCATGGAATTGTATTGACTAAGAAACAGTGTCTCAAGACTTC
TCAAGAGGTTAAAGAAATGAGATGGATTCCCTATGCATTGGCTGTAAGCAGCCTTATGTATGTTGTGTTGTGTACAAGGCTTGACATTTGCTATGCAGTTGAGATTGTTA
GTAGATATCAGTCTTCCAGGATTTGA
Protein sequenceShow/hide protein sequence
MLETRQRNQERHCAPLDALDPRASNFGGCMSVTSLKFSVIASLRLRLGFDDRQHCDTAYMMMDNVVALSKSAETRERHKGNTQKHIKNMRIGDREQEFTARSIVLNELSN
ETTETSTRVVEEAGCKWIYKRKRGVDGKVQTFKARLVAKGYTQTSQSWNIRFDTAIKVYDFDQNVDKPCVYKKIINNSIAFLVLYMDDILLIGNDIIRNRKNKTSALSQA
SYIDKMLVRYSMHNSKRDLLPFRHGIVLTKKQCLKTSQEVKEMRWIPYALAVSSLMYVVLCTRLDICYAVEIVSRYQSSRI