CuGenDBv2

Clc08G04680 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G04680
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Gag-pol polyprotein
Genome location: ClcChr08:14254130..14255962
RNA-Seq Expression: Clc08G04680
Synteny: Clc08G04680
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
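
The genome location in the record above follows a `sequence:start..end` pattern. A minimal sketch of splitting such a string into its parts is shown here. The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed genome location."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("ClcChr08:14254130..14255962")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1833 bp on ClcChr08.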


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035264.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.7e-14; % identity: 31.16)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER--------KPEK-----------------------------
        + VA++ T++VK+SRLQL+TSKFE L+M +DESI+E+N  +L+IANES + G++I E +         PEK                             
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER--------KPEK-----------------------------

Query:  ---------KTESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTK
                  T  I   + +DE++S++   D +  F  +      D +   +QK  I+ L+ +N  LMS I+ LK  L   + E +  + +++MLNFGT 
Subjt:  ---------KTESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTK

Query:  LLENILSKGKTAGTR
         L+ IL  G+T  ++
Subjt:  LLENILSKGKTAGTR

KAA0056457.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 5.0e-14; % identity: 32.83)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE------------RKPEKK-----------------------T
        ++    GTSKVK+SRLQL+TSKFE LRM  DES++++N  +L+I NES +LGEKI +             RK EK                        T
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE------------RKPEKK-----------------------T

Query:  ESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKG
          I  ++ DD+++  E   +       + +    D +   IQKE ++ L+ +N RL+S I+ LK  L   + E + ++K+++ML  G + L+ IL  G
Subjt:  ESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKG

KAA0059847.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 9.8e-18; % identity: 32.07)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER-----------------------------------------
        + VA++GTSKVK+SRLQL+TSKFE L+M +DE+++E+N  +L+IAN+S +L EKI E +                                         
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER-----------------------------------------

Query:  ------KPEKKTESIALQSVDDE---TDSTEFEADVKALFSNISSD---TSSDMKVL--------DIQKEGIESLLADNHRLMSTIAELKRALVHSKVEK
              +  KK + IA +SV D+    + +E  ++  +  SNI+ D   T  ++K+L         IQKE I+ L+ +N RLM  I+ LK  L   +   
Subjt:  ------KPEKKTESIALQSVDDE---TDSTEFEADVKALFSNISSD---TSSDMKVL--------DIQKEGIESLLADNHRLMSTIAELKRALVHSKVEK

Query:  ESMVKNIRMLNFGTKLLENILSKGKTAGTRSSDNCYL
        +  +K+++MLN+GT  L++ILS G+   ++ S +  L
Subjt:  ESMVKNIRMLNFGTKLLENILSKGKTAGTRSSDNCYL

XP_024019486.1 uncharacterized protein LOC112091030 [Morus notabilis] (e value: 1.1e-13; % identity: 36)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER---KPEKKTESIALQSVDD------ETDSTEFEADVKALFS
        + VAH+GT+ V+ S+L +LT++FE LRM + E+I+EFN  L DIANESF LGEKISE +   KPE+        S  +      ET    +E+       
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER---KPEKKTESIALQSVDD------ETDSTEFEADVKALFS

Query:  NISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKGKTA
         +  + S + K+ D+ +E  +  +    R  S IAE+++ L  +  E E   K ++M+N GT  L++I S  K++
Subjt:  NISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKGKTA

XP_038896219.1 uncharacterized protein LOC120084497 [Benincasa hispida] (e value: 3.4e-18; % identity: 45.61)
Query:  MSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE----------------------------RKPEKK------------TESIALQS
        MSRLQLLTSKFENLRM++DE I +FNV +LDI+NES  LGEKIS+E                            RK +K+            +E   L  
Subjt:  MSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE----------------------------RKPEKK------------TESIALQS

Query:  VDDET-DSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESM
         D ET +S++ E DVKAL+         D K LDI KE I++L+ DNHRLMS IAELKR L   KVEK++M
Subjt:  VDDET-DSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESM

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SVG0 Gag-pol polyprotein (e value: 8.4e-15; % identity: 31.16)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER--------KPEK-----------------------------
        + VA++ T++VK+SRLQL+TSKFE L+M +DESI+E+N  +L+IANES + G++I E +         PEK                             
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER--------KPEK-----------------------------

Query:  ---------KTESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTK
                  T  I   + +DE++S++   D +  F  +      D +   +QK  I+ L+ +N  LMS I+ LK  L   + E +  + +++MLNFGT 
Subjt:  ---------KTESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTK

Query:  LLENILSKGKTAGTR
         L+ IL  G+T  ++
Subjt:  LLENILSKGKTAGTR

A0A5A7U4G8 Gag-proteinase polyprotein (e value: 7.8e-13; % identity: 49.45)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEERKPEKKTESI------ALQSVDDETDSTEFEAD
        + VA++GTSKVK+SRLQ+LTS+FE L+MNDDE+IA+FNV +LD+ANESF LGEKI++ +  +K   S+       + ++++  D T  + D
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEERKPEKKTESI------ALQSVDDETDSTEFEAD
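
As a rough check on the % identity reported for this hit, the value can be recomputed from the alignment itself. The sketch below assumes the usual BLAST-style convention that letters on the middle line mark identical residues, `+` marks conservative substitutions, and identity is the number of identical columns divided by the total number of alignment columns (gaps included). The function name `percent_identity` is hypothetical, and the formula is an inference from this one alignment, not something the record documents.

```python
def percent_identity(query_aln: str, midline: str) -> float:
    """Identical columns as a percentage of all alignment columns.

    Assumption: letters on the midline mark identities; '+' and spaces do not.
    The aligned query line (gaps included) sets the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query_aln)


# Query and middle lines of the A0A5A7U4G8 alignment, copied from the record.
query = ("MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEERKPEKKTESI"
         "------ALQSVDDETDSTEFEAD")
midline = ("+ VA++GTSKVK+SRLQ+LTS+FE L+MNDDE+IA+FNV +LD+ANESF LGEKI++ +  +K   S+"
           "       + ++++  D T  + D")

print(round(percent_identity(query, midline), 2))
```

With 45 identical columns out of 91, this gives 49.45, which agrees with the value listed for the hit.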

A0A5D3BVD8 Gag-pol polyprotein (e value: 2.7e-13; % identity: 29.44)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISE-------------------------------------------
        + VA++GTSKVK+SRLQL+TSKFE + M +DES++++N  +L+IANES +L EKI +                                           
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISE-------------------------------------------

Query:  -----ERKPE-----KKTESIALQSVDDET-DSTEFEADVKA----LFSNISSDTS---------------------SDMKVLDIQKEGIESLLADNHRL
             +RK E     +K ++  +   D+E+ DS + ++++ A    +   I+ D S                      D +  +IQKE I+ L+ +N  L
Subjt:  -----ERKPE-----KKTESIALQSVDDET-DSTEFEADVKA----LFSNISSDTS---------------------SDMKVLDIQKEGIESLLADNHRL

Query:  MSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKGKTAGTR
        MS I  LK  +   + E + + K+++MLN GTK LE+IL  G     R
Subjt:  MSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKGKTAGTR

A0A5D3DMG6 Gag-pol polyprotein (e value: 4.7e-18; % identity: 32.07)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER-----------------------------------------
        + VA++GTSKVK+SRLQL+TSKFE L+M +DE+++E+N  +L+IAN+S +L EKI E +                                         
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEER-----------------------------------------

Query:  ------KPEKKTESIALQSVDDE---TDSTEFEADVKALFSNISSD---TSSDMKVL--------DIQKEGIESLLADNHRLMSTIAELKRALVHSKVEK
              +  KK + IA +SV D+    + +E  ++  +  SNI+ D   T  ++K+L         IQKE I+ L+ +N RLM  I+ LK  L   +   
Subjt:  ------KPEKKTESIALQSVDDE---TDSTEFEADVKALFSNISSD---TSSDMKVL--------DIQKEGIESLLADNHRLMSTIAELKRALVHSKVEK

Query:  ESMVKNIRMLNFGTKLLENILSKGKTAGTRSSDNCYL
        +  +K+++MLN+GT  L++ILS G+   ++ S +  L
Subjt:  ESMVKNIRMLNFGTKLLENILSKGKTAGTRSSDNCYL

A0A5D3E0N2 Gag-pol polyprotein (e value: 2.4e-14; % identity: 32.83)
Query:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE------------RKPEKK-----------------------T
        ++    GTSKVK+SRLQL+TSKFE LRM  DES++++N  +L+I NES +LGEKI +             RK EK                        T
Subjt:  MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEE------------RKPEKK-----------------------T

Query:  ESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKG
          I  ++ DD+++  E   +       + +    D +   IQKE ++ L+ +N RL+S I+ LK  L   + E + ++K+++ML  G + L+ IL  G
Subjt:  ESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEGIESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCAGTCGCCCACAAAGGAACTTCCAAAGTCAAAATGTCGAGGCTCCAATTGTTGACTTCCAAATTTGAAAATTTGAGAATGAATGATGATGAATCAATTGCCGAGTT
TAATGTGTGTCTACTGGATATTGCCAATGAATCTTTTGTCCTTGGAGAAAAGATCTCAGAGGAAAGGAAACCAGAGAAGAAGACTGAAAGCATTGCTCTCCAATCTGTTG
ATGATGAGACAGACTCTACTGAGTTTGAGGCAGATGTTAAAGCTCTTTTCAGTAACATCTCCTCTGACACATCGTCAGATATGAAAGTGCTGGATATTCAGAAAGAAGGA
ATTGAGTCCTTGTTGGCTGACAACCATCGCTTAATGTCAACCATTGCAGAACTAAAACGTGCACTAGTCCATTCTAAGGTTGAAAAGGAATCCATGGTGAAGAACATAAG
GATGCTCAATTTTGGCACTAAATTGCTTGAAAACATTCTTTCTAAAGGAAAAACCGCAGGCACTAGATCATCTGACAACTGCTATTTGTGGAATCTGGAATCTACATCCC
CAGTTTGCCATTTAGCTAGGCAAGACGAAGCAGATCACTGA
mRNA sequence
ATGTCAGTCGCCCACAAAGGAACTTCCAAAGTCAAAATGTCGAGGCTCCAATTGTTGACTTCCAAATTTGAAAATTTGAGAATGAATGATGATGAATCAATTGCCGAGTT
TAATGTGTGTCTACTGGATATTGCCAATGAATCTTTTGTCCTTGGAGAAAAGATCTCAGAGGAAAGGAAACCAGAGAAGAAGACTGAAAGCATTGCTCTCCAATCTGTTG
ATGATGAGACAGACTCTACTGAGTTTGAGGCAGATGTTAAAGCTCTTTTCAGTAACATCTCCTCTGACACATCGTCAGATATGAAAGTGCTGGATATTCAGAAAGAAGGA
ATTGAGTCCTTGTTGGCTGACAACCATCGCTTAATGTCAACCATTGCAGAACTAAAACGTGCACTAGTCCATTCTAAGGTTGAAAAGGAATCCATGGTGAAGAACATAAG
GATGCTCAATTTTGGCACTAAATTGCTTGAAAACATTCTTTCTAAAGGAAAAACCGCAGGCACTAGATCATCTGACAACTGCTATTTGTGGAATCTGGAATCTACATCCC
CAGTTTGCCATTTAGCTAGGCAAGACGAAGCAGATCACTGA
Protein sequence
MSVAHKGTSKVKMSRLQLLTSKFENLRMNDDESIAEFNVCLLDIANESFVLGEKISEERKPEKKTESIALQSVDDETDSTEFEADVKALFSNISSDTSSDMKVLDIQKEG
IESLLADNHRLMSTIAELKRALVHSKVEKESMVKNIRMLNFGTKLLENILSKGKTAGTRSSDNCYLWNLESTSPVCHLARQDEADH
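
The protein sequence can be related back to the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene. The `translate` helper and the two test fragments (the first 30 and last 33 nucleotides of the CDS listed above) are chosen for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCAGTCGCCCACAAAGGAACTTCCAAA"      # first 30 nt of the CDS
cds_end = "CATTTAGCTAGGCAAGACGAAGCAGATCACTGA"     # last 33 nt, including TGA

print(translate(cds_start))  # MSVAHKGTSK
print(translate(cds_end))    # HLARQDEADH
```

Both fragments reproduce the corresponding ends of the protein sequence, which suggests the listed CDS and protein are consistent under the standard code.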