; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G04170 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G04170
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr04:14490997..14491812
RNA-Seq ExpressionClc04G04170
SyntenyClc04G04170
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037371.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-3058.04Show/hide
Query:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS-------------QTYESMHKGKGPNGKAKVASPSKK
        AREIMDSLQ+MFGQ+S QIKHDALK+IYNARMNEG+SVREHVLNMM+HFNVAEM   +I+E S             QT+ES+ K KG  G+A VA+ ++K
Subjt:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS-------------QTYESMHKGKGPNGKAKVASPSKK

Query:  FHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
        FHR    G K +  SS  KKWKKKK  + GNK +  AA+  K+
Subjt:  FHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

KAA0046415.1 gag/pol protein [Cucumis melo var. makuwa]7.2e-3053.01Show/hide
Query:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q
        AREIMDSLQ+MFGQ+S QIKHDALK+IYNARMNEG+SVREHVLNMMVHFN+AEMN  +I+E S                                    Q
Subjt:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q

Query:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
        T+ES+ K KG  G+A VA+ ++KFHR LT GTKS+  SS  KKWKKKK  + GNK +  AA+  K+
Subjt:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

KAA0050670.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-3053.94Show/hide
Query:  REIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------QT
        REIMDSLQ+MFGQ+S QIKHDAL +IYNARMNEG+SVREHVLNMMVHFNVAEMN  +I+E S                                    QT
Subjt:  REIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------QT

Query:  YESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
        +ES+ K KG  G+A VA+ ++KFHR LTFGTKS+  SS  KKWKKKK  + GNK + VAA+  K+
Subjt:  YESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

KAA0054432.1 gag/pol protein [Cucumis melo var. makuwa]4.1e-3346.4Show/hide
Query:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q
        AREIMDSLQ+MF Q+S QIKHDALK+IYNARMNEG+SVREHVLNMMVHFNV EMN  +I+E S                                    Q
Subjt:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q

Query:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ----------------------KFLV------AT
        T+ES+ K KG  G+A VA+ ++KFHR  T GTKS+  SS  KKWKKKKS + GNK +  AA+  K+                      K+L         
Subjt:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ----------------------KFLV------AT

Query:  WDWRDDLADWNRAGRLTACSGR
         DWRDD A WN   RL+ CSGR
Subjt:  WDWRDDLADWNRAGRLTACSGR

TYK28896.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-2953.01Show/hide
Query:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q
        AREIMDSLQ+MFGQ+S QIKHDALK+IYNARMNEG+SVREHVLNMMVHFNVAEMN  +I+E S                                    Q
Subjt:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q

Query:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
         +ES+ K KG  G+A VA+ ++KFHR  T GTKSV  SS  KKWKKKK  + GNK +  AA+  K+
Subjt:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

TrEMBL top hitse value%identityAlignment
A0A5A7T706 Gag/pol protein3.5e-3058.04Show/hide
Query:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS-------------QTYESMHKGKGPNGKAKVASPSKK
        AREIMDSLQ+MFGQ+S QIKHDALK+IYNARMNEG+SVREHVLNMM+HFNVAEM   +I+E S             QT+ES+ K KG  G+A VA+ ++K
Subjt:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS-------------QTYESMHKGKGPNGKAKVASPSKK

Query:  FHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
        FHR    G K +  SS  KKWKKKK  + GNK +  AA+  K+
Subjt:  FHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

A0A5A7TFX0 Gag/pol protein5.9e-3060Show/hide
Query:  IAREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETSQTYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV
        IAREIMDSLQ+MFGQ+S QIKH  LK+IYN  MNEG+SVRE+VLNMMVHFN  EMN  +I+E SQT+ES+ K K   G+  VA+ ++KFHRD T GTK V
Subjt:  IAREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETSQTYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV

Query:  FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
         S   KKWKKKK  + GNK +  A +  ++
Subjt:  FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

A0A5A7TYF5 Gag/pol protein3.5e-3053.01Show/hide
Query:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q
        AREIMDSLQ+MFGQ+S QIKHDALK+IYNARMNEG+SVREHVLNMMVHFN+AEMN  +I+E S                                    Q
Subjt:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q

Query:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
        T+ES+ K KG  G+A VA+ ++KFHR LT GTKS+  SS  KKWKKKK  + GNK +  AA+  K+
Subjt:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

A0A5A7U676 Gag/pol protein9.1e-3153.94Show/hide
Query:  REIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------QT
        REIMDSLQ+MFGQ+S QIKHDAL +IYNARMNEG+SVREHVLNMMVHFNVAEMN  +I+E S                                    QT
Subjt:  REIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------QT

Query:  YESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ
        +ES+ K KG  G+A VA+ ++KFHR LTFGTKS+  SS  KKWKKKK  + GNK + VAA+  K+
Subjt:  YESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ

A0A5A7UF91 Gag/pol protein2.0e-3346.4Show/hide
Query:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q
        AREIMDSLQ+MF Q+S QIKHDALK+IYNARMNEG+SVREHVLNMMVHFNV EMN  +I+E S                                    Q
Subjt:  AREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETS------------------------------------Q

Query:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ----------------------KFLV------AT
        T+ES+ K KG  G+A VA+ ++KFHR  T GTKS+  SS  KKWKKKKS + GNK +  AA+  K+                      K+L         
Subjt:  TYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSV-FSSAKKKWKKKKSNKGGNKTHQVAAQKGKQ----------------------KFLV------AT

Query:  WDWRDDLADWNRAGRLTACSGR
         DWRDD A WN   RL+ CSGR
Subjt:  WDWRDDLADWNRAGRLTACSGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACCATGGTCCATTGCTCGTGAGATCATGGATTCATTGCAAAAGATGTTTGGACAATCGTCCTCGCAGATCAAGCATGATGCTCTGAAATTCATTTATAATGCACG
TATGAATGAGGGGTCATCAGTGCGAGAACATGTTCTCAATATGATGGTCCACTTCAACGTGGCTGAAATGAATGTGACCATCATCAATGAGACCAGTCAAACTTATGAGT
CCATGCATAAAGGCAAAGGACCAAATGGGAAGGCAAAAGTTGCCAGTCCCTCTAAAAAGTTCCACAGAGATTTGACCTTTGGAACTAAGTCTGTTTTTTCCTCTGCCAAA
AAGAAATGGAAGAAGAAGAAGAGTAATAAGGGAGGTAATAAAACTCACCAAGTTGCAGCCCAGAAAGGCAAGCAAAAGTTCCTAGTAGCAACTTGGGACTGGCGAGATGA
CCTTGCAGATTGGAACAGGGCAGGTCGTCTCACCGCTTGCAGTGGGAGGCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGACCATGGTCCATTGCTCGTGAGATCATGGATTCATTGCAAAAGATGTTTGGACAATCGTCCTCGCAGATCAAGCATGATGCTCTGAAATTCATTTATAATGCACG
TATGAATGAGGGGTCATCAGTGCGAGAACATGTTCTCAATATGATGGTCCACTTCAACGTGGCTGAAATGAATGTGACCATCATCAATGAGACCAGTCAAACTTATGAGT
CCATGCATAAAGGCAAAGGACCAAATGGGAAGGCAAAAGTTGCCAGTCCCTCTAAAAAGTTCCACAGAGATTTGACCTTTGGAACTAAGTCTGTTTTTTCCTCTGCCAAA
AAGAAATGGAAGAAGAAGAAGAGTAATAAGGGAGGTAATAAAACTCACCAAGTTGCAGCCCAGAAAGGCAAGCAAAAGTTCCTAGTAGCAACTTGGGACTGGCGAGATGA
CCTTGCAGATTGGAACAGGGCAGGTCGTCTCACCGCTTGCAGTGGGAGGCGTTAA
Protein sequenceShow/hide protein sequence
MRPWSIAREIMDSLQKMFGQSSSQIKHDALKFIYNARMNEGSSVREHVLNMMVHFNVAEMNVTIINETSQTYESMHKGKGPNGKAKVASPSKKFHRDLTFGTKSVFSSAK
KKWKKKKSNKGGNKTHQVAAQKGKQKFLVATWDWRDDLADWNRAGRLTACSGRR