; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G07630 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G07630
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag-pol polyprotein
Genome locationClcChr02:8040520..8041634
RNA-Seq ExpressionClc02G07630
SyntenyClc02G07630
Gene Ontology termsGO:0019538 - protein metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0140096 - catalytic activity, acting on a protein (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH81412.1 gag-pol polyprotein [Trifolium medium]5.2e-2950Show/hide
Query:  TSLKSSTREDW--KSHFSRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENGS
        TSL++S+REDW   S  SR    G GKL    LP+LD+V+L++GLTANLISISQLCDQG+ V+FTK +C+V+++   ++M G RS  NCYLW P  EN  
Subjt:  TSLKSSTREDW--KSHFSRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENGS

Query:  LVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSLS
          C + ++DE  L H++LGH+ ++ ++KA+A+  I GLP L+
Subjt:  LVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSLS

PNX66734.1 serine/threonine protein kinase SRPK1, partial [Trifolium pratense]5.4e-2648.59Show/hide
Query:  TSLKSSTREDWKSHF---SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENG
        TSL++S+RE     F   ++    G GKL    LP LD+V+L+EGL ANLIS SQLCDQGM V+FTK +C+V+ND   ++M G RS  NCYLW P  E+ 
Subjt:  TSLKSSTREDWKSHF---SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENG

Query:  SLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
           C + ++DE  L  ++LGH+ +K ++KA+AK  I GLP L
Subjt:  SLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

PNX93845.1 gag-protease polyprotein, partial [Trifolium pratense]3.2e-2641.07Show/hide
Query:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN
        TSL++S+REDW                    KS+ +             +G GKLN   LP L++V+L++GLTANLISI+QLCDQGMNV+FTK +C+V+N
Subjt:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN

Query:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
        D   ++M G R+  NCYLW P  E     C + ++DE    H++LGH++ + ++KA+++  I GLP+L
Subjt:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

PNX99503.1 gag-protease polyprotein, partial [Trifolium pratense]8.3e-2741.67Show/hide
Query:  TSLKSSTREDW--------------------KSHF---------SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN
        TSL++S+REDW                    KS+          ++   +G G+L    LP L++V+L+ GLTANLISISQLCDQGM V+FTK +C+V++
Subjt:  TSLKSSTREDW--------------------KSHF---------SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN

Query:  DAEAIVMTGTRSSKNCYLWNPENGSLV--CRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
        +   I+M G RS  NCYLW P+ G+ V  C + ++DE  L H+RLGH++++ ++KA+++  I GLP+L
Subjt:  DAEAIVMTGTRSSKNCYLWNPENGSLV--CRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

PNY10358.1 retrotransposon-related protein, partial [Trifolium pratense]3.7e-2743.45Show/hide
Query:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN
        TSLK+S+R+DW                    KS+ S              G G+L    LP L++V+L++GLTANLISISQLCDQGM V+FTKE+C+VSN
Subjt:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN

Query:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
        D   ++M G RS  NCYLW P  E     C   ++DE  + H+RLGH++++ ++KA++K  I GLP+L
Subjt:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

TrEMBL top hitse value%identityAlignment
A0A2K3KKC5 Serine/threonine protein kinase SRPK1 (Fragment)2.6e-2648.59Show/hide
Query:  TSLKSSTREDWKSHF---SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENG
        TSL++S+RE     F   ++    G GKL    LP LD+V+L+EGL ANLIS SQLCDQGM V+FTK +C+V+ND   ++M G RS  NCYLW P  E+ 
Subjt:  TSLKSSTREDWKSHF---SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENG

Query:  SLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
           C + ++DE  L  ++LGH+ +K ++KA+AK  I GLP L
Subjt:  SLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

A0A2K3MSR1 Gag-protease polyprotein (Fragment)1.5e-2641.07Show/hide
Query:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN
        TSL++S+REDW                    KS+ +             +G GKLN   LP L++V+L++GLTANLISI+QLCDQGMNV+FTK +C+V+N
Subjt:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN

Query:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
        D   ++M G R+  NCYLW P  E     C + ++DE    H++LGH++ + ++KA+++  I GLP+L
Subjt:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

A0A2K3N8X7 Gag-protease polyprotein (Fragment)4.0e-2741.67Show/hide
Query:  TSLKSSTREDW--------------------KSHF---------SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN
        TSL++S+REDW                    KS+          ++   +G G+L    LP L++V+L+ GLTANLISISQLCDQGM V+FTK +C+V++
Subjt:  TSLKSSTREDW--------------------KSHF---------SRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN

Query:  DAEAIVMTGTRSSKNCYLWNPENGSLV--CRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
        +   I+M G RS  NCYLW P+ G+ V  C + ++DE  L H+RLGH++++ ++KA+++  I GLP+L
Subjt:  DAEAIVMTGTRSSKNCYLWNPENGSLV--CRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

A0A2K3P4Z0 Retrotransposon-related protein (Fragment)1.8e-2743.45Show/hide
Query:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN
        TSLK+S+R+DW                    KS+ S              G G+L    LP L++V+L++GLTANLISISQLCDQGM V+FTKE+C+VSN
Subjt:  TSLKSSTREDW--------------------KSHFSRL---------NFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSN

Query:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL
        D   ++M G RS  NCYLW P  E     C   ++DE  + H+RLGH++++ ++KA++K  I GLP+L
Subjt:  DAEAIVMTGTRSSKNCYLWNP--ENGSLVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSL

A0A392M256 Gag-pol polyprotein (Fragment)2.5e-2950Show/hide
Query:  TSLKSSTREDW--KSHFSRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENGS
        TSL++S+REDW   S  SR    G GKL    LP+LD+V+L++GLTANLISISQLCDQG+ V+FTK +C+V+++   ++M G RS  NCYLW P  EN  
Subjt:  TSLKSSTREDW--KSHFSRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNP--ENGS

Query:  LVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSLS
          C + ++DE  L H++LGH+ ++ ++KA+A+  I GLP L+
Subjt:  LVCRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATGCAATGTGGTTTTGACCTCTCTCAAATCCTCTACTAGAGAAGATTGGAAAAGTCACTTTTCGAGACTTAATTTCATCGGAAAAGGGAAGCTTAATTTC
CAATGCCTACCTTCTCTAGATGATGTCATGCTGATTGAAGGACTTACTGCCAATCTTATCAGCATAAGTCAACTTTGTGACCAGGGCATGAATGTAAGTTTTACC
AAAGAACAGTGTGTTGTTTCTAATGATGCTGAGGCTATTGTGATGACAGGCACTCGATCCTCGAAAAATTGTTACTTGTGGAATCCTGAAAATGGGTCTTTGGTC
TGTCGTCTCTTTAGACAAGACGAAGCTAGTCTTTCGCATAAGAGGCTTGGACACATAAGTATGAAGATTATTCAAAAGGCCCTTGCAAAGAATGTTATATCTGGT
CTCCCATCTCTGTCCTCTACTACAGTCAAAAAGCCTGGGAGGAAAGTGTTATGTCTTAGTAAGCGCTACGACTTCTACCCAGAGCGTAACTATGGCAAGCAAAGT
AGGCGGGTGGCTGAGAAAAGATGGCCAAGAGCCTTTGGCTTAAGTGGCTATGCGTTGGCTGGGTTGAGGTATGCGTCCATGAGAGTGAGCATGGGTGAGGGGGTT
GCGTTGGAGAAGCATGAGTTCAAGAGAGCCTATGGGACCGACGCATGGTGCTATGCGTTCAAGCCAGTGATCAAGGTTGCCAAGGGGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAATGCAATGTGGTTTTGACCTCTCTCAAATCCTCTACTAGAGAAGATTGGAAAAGTCACTTTTCGAGACTTAATTTCATCGGAAAAGGGAAGCTTAATTTC
CAATGCCTACCTTCTCTAGATGATGTCATGCTGATTGAAGGACTTACTGCCAATCTTATCAGCATAAGTCAACTTTGTGACCAGGGCATGAATGTAAGTTTTACC
AAAGAACAGTGTGTTGTTTCTAATGATGCTGAGGCTATTGTGATGACAGGCACTCGATCCTCGAAAAATTGTTACTTGTGGAATCCTGAAAATGGGTCTTTGGTC
TGTCGTCTCTTTAGACAAGACGAAGCTAGTCTTTCGCATAAGAGGCTTGGACACATAAGTATGAAGATTATTCAAAAGGCCCTTGCAAAGAATGTTATATCTGGT
CTCCCATCTCTGTCCTCTACTACAGTCAAAAAGCCTGGGAGGAAAGTGTTATGTCTTAGTAAGCGCTACGACTTCTACCCAGAGCGTAACTATGGCAAGCAAAGT
AGGCGGGTGGCTGAGAAAAGATGGCCAAGAGCCTTTGGCTTAAGTGGCTATGCGTTGGCTGGGTTGAGGTATGCGTCCATGAGAGTGAGCATGGGTGAGGGGGTT
GCGTTGGAGAAGCATGAGTTCAAGAGAGCCTATGGGACCGACGCATGGTGCTATGCGTTCAAGCCAGTGATCAAGGTTGCCAAGGGGAAATAA
Protein sequenceShow/hide protein sequence
MKCNVVLTSLKSSTREDWKSHFSRLNFIGKGKLNFQCLPSLDDVMLIEGLTANLISISQLCDQGMNVSFTKEQCVVSNDAEAIVMTGTRSSKNCYLWNPENGSLV
CRLFRQDEASLSHKRLGHISMKIIQKALAKNVISGLPSLSSTTVKKPGRKVLCLSKRYDFYPERNYGKQSRRVAEKRWPRAFGLSGYALAGLRYASMRVSMGEGV
ALEKHEFKRAYGTDAWCYAFKPVIKVAKGK