; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G06850 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G06850
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag protease polyprotein
Genome locationClcChr02:6897092..6898284
RNA-Seq ExpressionClc02G06850
SyntenyClc02G06850
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046048.1 gag protease polyprotein [Cucumis melo var. makuwa]1.9e-2155.66Show/hide
Query:  AQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCA
        A + +   D +M K ++  + A         P P VP  Q+ P   L AEAK+LRDFRKYNP TFDG L+DPT+A++WL S+ETIFRYMKC EDQK+QCA
Subjt:  AQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCA

Query:  VFMLTD
        VFMLTD
Subjt:  VFMLTD

KAA0047534.1 gag protease polyprotein [Cucumis melo var. makuwa]1.9e-2172Show/hide
Query:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD
        P P VP+Q       L AEAK+LRDFRKYNP TFDG LKDPT+A++WL S+ETIFRYMKC EDQK+QCAVFMLTD
Subjt:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD

TYK01089.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-2155.86Show/hide
Query:  AAQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQ--IPPY--HDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQ
        A  +   VT   +   ++  R  +     QQ PTP  P     +P      L  EAK+LRDFRKYNP TFDG LKDPTKA+MWL S+ETIFRYMKCSEDQ
Subjt:  AAQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQ--IPPY--HDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQ

Query:  KLQCAVFMLTD
        K+QCAVFMLTD
Subjt:  KLQCAVFMLTD

TYK06288.1 gag protease polyprotein [Cucumis melo var. makuwa]1.9e-2172Show/hide
Query:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD
        P P VP+Q       L AEAK+LRDFRKYNP TFDG LKDPT+A++WL S+ETIFRYMKC EDQK+QCAVFMLTD
Subjt:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD

XP_038883046.1 uncharacterized protein LOC120074107 [Benincasa hispida]3.8e-2254.17Show/hide
Query:  MAAQIGETVTDTLMDKTQEVTRVAVAGQL----AQQIPTPQVPEQQIPPYH----DLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMK
        M A+I E V  +LM K QE+ R A+  Q     AQ+ P P    QQ  P +     L  EAK+LRDFRKY+PR+FDG L DPTKAKMWL SIETIFR+M+
Subjt:  MAAQIGETVTDTLMDKTQEVTRVAVAGQL----AQQIPTPQVPEQQIPPYH----DLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMK

Query:  CSEDQKLQCAVFMLTDDAKI
        C E+ KLQC VFML  + +I
Subjt:  CSEDQKLQCAVFMLTDDAKI

TrEMBL top hitse value%identityAlignment
A0A5A7TT51 Gag protease polyprotein9.0e-2255.66Show/hide
Query:  AQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCA
        A + +   D +M K ++  + A         P P VP  Q+ P   L AEAK+LRDFRKYNP TFDG L+DPT+A++WL S+ETIFRYMKC EDQK+QCA
Subjt:  AQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCA

Query:  VFMLTD
        VFMLTD
Subjt:  VFMLTD

A0A5A7TX58 Gag protease polyprotein9.0e-2272Show/hide
Query:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD
        P P VP+Q       L AEAK+LRDFRKYNP TFDG LKDPT+A++WL S+ETIFRYMKC EDQK+QCAVFMLTD
Subjt:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD

A0A5A7V5X6 Gag-protease polyprotein1.2e-2170.51Show/hide
Query:  QQIPTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD
        QQ P    P  Q  P   L AEAK+LRDFRKYNP TFDG L+DPT+A+MWL S+ETIFRYMKC EDQK+QCAVFMLTD
Subjt:  QQIPTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD

A0A5D3BSM2 Reverse transcriptase9.0e-2255.86Show/hide
Query:  AAQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQ--IPPY--HDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQ
        A  +   VT   +   ++  R  +     QQ PTP  P     +P      L  EAK+LRDFRKYNP TFDG LKDPTKA+MWL S+ETIFRYMKCSEDQ
Subjt:  AAQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQ--IPPY--HDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQ

Query:  KLQCAVFMLTD
        K+QCAVFMLTD
Subjt:  KLQCAVFMLTD

A0A5D3C572 Gag protease polyprotein9.0e-2272Show/hide
Query:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD
        P P VP+Q       L AEAK+LRDFRKYNP TFDG LKDPT+A++WL S+ETIFRYMKC EDQK+QCAVFMLTD
Subjt:  PTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCACAGATAGGGGAAACTGTTACTGACACCCTAATGGACAAAACTCAAGAGGTAACCCGTGTAGCTGTGGCAGGACAATTAGCACAGCAGATCCCGACTCCCCA
AGTGCCAGAGCAACAAATTCCGCCTTATCATGACTTGTTTGCTGAAGCGAAGTATTTGCGAGACTTTAGGAAATATAACCCCCGCACCTTTGACGGATTGTTGAAGGACC
CTACCAAGGCGAAAATGTGGCTATTTTCCATTGAGACTATTTTTCGCTACATGAAATGTTCGGAGGACCAAAAACTTCAGTGTGCAGTGTTCATGTTGACTGATGATGCA
AAAATCTAG
mRNA sequenceShow/hide mRNA sequence
CGGCGAATGACAAGAAGAATCATTGAGGCGATATGGGACTTGCATGGTTTTTGCATAGTTTTGAATGAGATGGCTTGTGGAGGTAGAAGAAGAGGCAGGGGTAAGGGAAA
GAGGGTTACCAAACCCTCAAGTTCAGGATCCCGTGGAGTAGAACCCTCAGCTCCAGGACCCTCCCGCTCCGCGAGGACCCCCACTGCCACAGGCTTCTGTTGGGTCAATG
GAGCAGGGAACCAGGGCAAGGGGGCGAACCTGTAATGCGCCCGAGCCACCACAGATGCAGGCTCCACAAGGGGCCTAAACCTATTTCGCGACAATGGCTGCACAGATAGG
GGAAACTGTTACTGACACCCTAATGGACAAAACTCAAGAGGTAACCCGTGTAGCTGTGGCAGGACAATTAGCACAGCAGATCCCGACTCCCCAAGTGCCAGAGCAACAAA
TTCCGCCTTATCATGACTTGTTTGCTGAAGCGAAGTATTTGCGAGACTTTAGGAAATATAACCCCCGCACCTTTGACGGATTGTTGAAGGACCCTACCAAGGCGAAAATG
TGGCTATTTTCCATTGAGACTATTTTTCGCTACATGAAATGTTCGGAGGACCAAAAACTTCAGTGTGCAGTGTTCATGTTGACTGATGATGCAAAAATCTAG
Protein sequenceShow/hide protein sequence
MAAQIGETVTDTLMDKTQEVTRVAVAGQLAQQIPTPQVPEQQIPPYHDLFAEAKYLRDFRKYNPRTFDGLLKDPTKAKMWLFSIETIFRYMKCSEDQKLQCAVFMLTDDA
KI