; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G15380 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G15380
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr09:20864327..20864716
RNA-Seq ExpressionClc09G15380
SyntenyClc09G15380
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.0e-3982.69Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLGEAQY+ GIQIVRNRKNK LAMSQASYIDK+LSRYK+ NSK+  LPFRHGIHLSK+QCPKTPQ VED R IPY+SAVGSLMY MLCTRP+ICY+ G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSR
        IVSR
Subjt:  IVSR

KAA0053385.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]3.9e-3981.55Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLGEAQYV GIQI+R+RKNKMLA+SQA+YIDKML RY + NSK+ LLPFRHG+HLSK+QCPKTPQ  ED RRIPYASAVG+LMYVMLCTRP+ICYA G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVS
        IVS
Subjt:  IVS

KAA0059556.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-3980.77Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKD+GEAQYV GIQI+R+RKNK LA+SQA+YIDKML RY + NSK+ LLPFRHG+HLSK+QCPKTPQ VED RRIPYASAVGSLMY MLCTRP+ICYA G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSR
        IVSR
Subjt:  IVSR

KAA0060794.1 putative Integrase core domain [Cucumis melo var. makuwa]5.1e-3978.7Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLGEAQYV GIQI+R+RKNK LA+SQA+YIDKML RY + NSK+ LLPF+HG+HLSK+QCPKTPQ VED RRIPYASAVGSLMY MLCTRP+ICYA G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSRCCKT
        IVSR  K+
Subjt:  IVSRCCKT

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-4183.81Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLG AQYV G+QIVRNRKNK LAMSQ SYIDKMLSRYK+HNSK+ LLP+R+GIHLSK+QCPKTPQ VED   IPYASAVGSLMYVMLCTRPNICY+ G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSRC
        IVSRC
Subjt:  IVSRC

TrEMBL top hitse value%identityAlignment
A0A5A7UI63 Putative gag-pol polyprotein1.9e-3981.55Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLGEAQYV GIQI+R+RKNKMLA+SQA+YIDKML RY + NSK+ LLPFRHG+HLSK+QCPKTPQ  ED RRIPYASAVG+LMYVMLCTRP+ICYA G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVS
        IVS
Subjt:  IVS

A0A5A7UZF3 Gag/pol protein1.9e-3980.77Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKD+GEAQYV GIQI+R+RKNK LA+SQA+YIDKML RY + NSK+ LLPFRHG+HLSK+QCPKTPQ VED RRIPYASAVGSLMY MLCTRP+ICYA G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSR
        IVSR
Subjt:  IVSR

A0A5A7V0F0 Putative Integrase core domain2.5e-3978.7Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLGEAQYV GIQI+R+RKNK LA+SQA+YIDKML RY + NSK+ LLPF+HG+HLSK+QCPKTPQ VED RRIPYASAVGSLMY MLCTRP+ICYA G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSRCCKT
        IVSR  K+
Subjt:  IVSRCCKT

A0A5D3BX45 Gag/pol protein1.2e-4183.81Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLG AQYV G+QIVRNRKNK LAMSQ SYIDKMLSRYK+HNSK+ LLP+R+GIHLSK+QCPKTPQ VED   IPYASAVGSLMYVMLCTRPNICY+ G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSRC
        IVSRC
Subjt:  IVSRC

E2GK51 Gag/pol protein (Fragment)5.0e-4082.69Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG
        MKDLGEAQY+ GIQIVRNRKNK LAMSQASYIDK+LSRYK+ NSK+  LPFRHGIHLSK+QCPKTPQ VED R IPY+SAVGSLMY MLCTRP+ICY+ G
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVED-RRIPYASAVGSLMYVMLCTRPNICYAFG

Query:  IVSR
        IVSR
Subjt:  IVSR

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.3e-0733.01Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVEDRRIPYASAVGSLMYVMLCTRPNICYAFGI
        M DL E ++  GI+I    +   + +SQ++Y+ K+LS++ + N      P    I+             ED   P  S +G LMY+MLCTRP++  A  I
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVEDRRIPYASAVGSLMYVMLCTRPNICYAFGI

Query:  VSR
        +SR
Subjt:  VSR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-2048.6Show/hide
Query:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVEDR----RIPYASAVGSLMYVMLCTRPNICY
        MKDLG AQ + G++IVR R ++ L +SQ  YI+++L R+ + N+K    P    + LSKK CP T   VE++    ++PY+SAVGSLMY M+CTRP+I +
Subjt:  MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVEDR----RIPYASAVGSLMYVMLCTRPNICY

Query:  AFGIVSR
        A G+VSR
Subjt:  AFGIVSR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGATTTGGGAGAGGCTCAGTATGTTTCTGGAATTCAGATAGTTCGGAATCGTAAGAACAAAATGTTAGCCATGTCTCAAGCATCATATATTGACAAAATGTTGTC
TAGATATAAGATACATAATTCCAAAAGAAGTCTACTACCGTTTAGACATGGAATTCATTTGTCAAAGAAACAGTGTCCTAAGACACCTCAAGTAGTTGAGGATAGACGTA
TTCCCTATGCATCAGCAGTCGGTAGTTTGATGTATGTCATGTTGTGTACACGACCTAACATATGCTATGCATTTGGGATTGTTAGTAGGTGTTGTAAGACAGATCTCAAG
TCCCCGGCAACGGCGCCAAAAACTTGTTGTGAAACGGATTGTGAATTGAAATTGTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGATTTGGGAGAGGCTCAGTATGTTTCTGGAATTCAGATAGTTCGGAATCGTAAGAACAAAATGTTAGCCATGTCTCAAGCATCATATATTGACAAAATGTTGTC
TAGATATAAGATACATAATTCCAAAAGAAGTCTACTACCGTTTAGACATGGAATTCATTTGTCAAAGAAACAGTGTCCTAAGACACCTCAAGTAGTTGAGGATAGACGTA
TTCCCTATGCATCAGCAGTCGGTAGTTTGATGTATGTCATGTTGTGTACACGACCTAACATATGCTATGCATTTGGGATTGTTAGTAGGTGTTGTAAGACAGATCTCAAG
TCCCCGGCAACGGCGCCAAAAACTTGTTGTGAAACGGATTGTGAATTGAAATTGTATTAA
Protein sequenceShow/hide protein sequence
MKDLGEAQYVSGIQIVRNRKNKMLAMSQASYIDKMLSRYKIHNSKRSLLPFRHGIHLSKKQCPKTPQVVEDRRIPYASAVGSLMYVMLCTRPNICYAFGIVSRCCKTDLK
SPATAPKTCCETDCELKLY