; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G014005 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G014005
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCG_Chr09:22547982..22551351
RNA-Seq ExpressionClCG09G014005
SyntenyClCG09G014005
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]8.3e-3281.4Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        MAQS+C+IVWIHQLLSE+GF++ +PAKLW DN+A  HIASN VFHERTKHIEVD HFIREKIQ+GLVSTGYVKTGEQLGDILTK +
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]8.3e-3281.4Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        MAQS+C+IVWIHQLLSE+GF++ +PAKLW DN+A  HIASN VFHERTKHIEVD HFIREKIQ+GLVSTGYVKTGEQLGDILTK +
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]8.3e-3281.4Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        MAQS+C+IVWIHQLLSE+GF++ +PAKLW DN+A  HIASN VFHERTKHIEVD HFIREKIQ+GLVSTGYVKTGEQLGDILTK +
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]8.3e-3281.4Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        MAQS+C+IVWIHQLLSE+GF++ +PAKLW DN+A  HIASN VFHERTKHIEVD HFIREKIQ+GLVSTGYVKTGEQLGDILTK +
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]8.3e-3281.4Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        MAQS+C+IVWIHQLLSE+GF++ +PAKLW DN+A  HIASN VFHERTKHIEVD HFIREKIQ+GLVSTGYVKTGEQLGDILTK +
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

TrEMBL top hitse value%identityAlignment
A0A438HK19 Retrovirus-related Pol polyprotein from transposon RE11.4e-2974.42Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        MAQ+ C+I+WIHQLL E+G    MPAKLW DN+   HIA+NLV+HERTKHIEVDYHFIREKI+E LVSTGYVKTGEQLGDI TK +
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

A0A5A7UHS1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-3078.57Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK
        MAQS+C+IVWIHQLLSE+GF++ +P KLW DN+   HIASN VFHE+TKHIEVD HFIREKIQ+GL+STGYVKTGEQLGDILTK
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK

A0A5A7VI02 Putative mitochondrial protein1.1e-2976.74Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        MAQS+C+IVWIHQLLSE+GF++ +PAKLW DN+   HI SNLVFHERTKHIEVD HFI  KIQ+GLVSTGYVK GEQLGD LTK I
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

A0A5D3CID2 Putative mitochondrial protein2.9e-3079.07Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        M QS+C+IVWIHQLLSE+GF++ +PAKL  DN+A  HIASN VFHERTKHIEVD HFIREKIQ+GLVSTGYVKTGEQLGDILTK +
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

A0A5D3DZU1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-3078.57Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK
        MAQS+C+IVWIHQLLSE+GF++ +P KLW DN+   HIASN VFHE+TKHIEVD HFIREKIQ+GL+STGYVKTGEQLGDILTK
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.3e-1035.37Show/hide
Query:  QSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK
        +++ + +W+  LL+ +   +  P K++ DN+    IA+N   H+R KHI++ YHF RE++Q  ++   Y+ T  QL DI TK
Subjt:  QSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-0631.25Show/hide
Query:  KIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI
        +++W+ + L E+G +      ++ D+++   ++ N ++H RTKHI+V YH+IRE + +  +    + T E   D+LTKV+
Subjt:  KIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.9e-1340.48Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK
        +A +  ++ WI  LL+E+G  +  P  ++ DN    ++ +N VFH R KHI +DYHFIR ++Q G +   +V T +QL D LTK
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1240.48Show/hide
Query:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK
        +A +  ++ WI  LL+E+G  +  P  ++ DN    ++ +N VFH R KHI +DYHFIR ++Q G +   +V T +QL D LTK
Subjt:  MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-0840.54Show/hide
Query:  KIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREK-IQEGLVSTGYVKTGEQLG
        +++W+ Q   E+   +  P  L+ DN A  HIA+N VFHERTKHIE D H +RE+ + +  +S  +    EQ G
Subjt:  KIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREK-IQEGLVSTGYVKTGEQLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACAATCTATGTGTAAAATAGTGTGGATACACCAACTGTTATCCGAGATGGGATTCAATGTTGTCATGCCAGCTAAATTATGGTGGGATAATAAAGCTACACGTCA
CATTGCATCTAACCTAGTGTTTCATGAACGGACTAAACATATTGAAGTGGATTATCATTTCATTCGCGAGAAAATACAAGAAGGGTTGGTGTCCACAGGATATGTGAAGA
CTGGAGAACAATTGGGAGATATTCTTACCAAAGTTATTTTAGTGGGTCCTAAGCTCAAACCAGCCCCTAAATTATTGTGGAGGTGGAACAAGGCGGTCAAAGCTTTGCTC
ACAGCATTATGGCTTGAAAGAGACTTGGAAGCCAACAAGATCGTTCTCTCAAAGTCGCAACTCAAGTGGTTCGTCGAAAATATTTTAGATTTGATTAAGGGCTCTACATC
ACGATTCTTTGAGAGAAGCTATCAAGACAATTCAGGTACCACAAAATTGTCTAAGTTCCTCAGCTCCACAAGTTGGATTATGAGATGTGTTGTTTGGCCATCCTTTGGGG
GAAGATACTTTATTCGTATTCCCTCCGAAGATTCAAAGCAAGGTGAAGACTTGGTGGACCTTGTGAAGACATCGGAAGCTCAAAAGAAACAGAGAATGAAATTGGCAGCT
CCTCCATATCTAAAACATGGCTCTATTTTAGTACCCGATAAACAGAAGATTTCAAACAAGAAGCATGACTCAATTTTTCTGAATTTTGGGGACATTGAATTAATTAATCC
TCCTTCAAGAATAAAAGATGATTTATTTGCTAAGGATTGTACAAACCCAATTGATTTAGTCCGTTTGAATCAAGTTTTAAAGGACAAAGGAGTAGAGGTTGAAGATCCAA
ATCCTGGACTAGCATTTTTGTTACCTTCTCGGCTTTCCCGGGGAAGTTTCTCGCCGGAAAAGCCATCGGAAGCTGGTTGTCCAGCCTCATCTGAACCCGATTCCAAGGTA
GGTGTTGATCGACATTTATTGAAGAATGAGAAAGAAGGTGAGGAGTCGGCCTCTGGCCAAGAAAAGACCCTTGGTATCCCCTTGAAAAAAGCCCAATCGGTCTTTGCCGT
CAAGGATAGAGATTCTTTAGCAGTTACACTTCCCAAGAGGTCCAGCCCCATTGAATTCTCACCCTCAAAAGTTAATGCTGCCAGTTTCCTATCCAATGAACAGGAATTAA
TGCTCAATGAAATAGCCCCACGCCTGCAGCAACTTTCTCAAAGCCTTTCGGCCCCCAACCATTCATCGCCGGAAAAGCCATCGAAAGCTGGTTGTCCGGCCTCATCTGAA
CCCGATTCCAAGGTAGGTGTTGATCGACATTTATTGAAGAATGAGAAAGAAGGTGAGGAGTCGGCCTCTAGCCAAGAAAAGACCCTTGGTATCCCTTTGAAAGAAGCCCA
ATCGGTCTTTGCCGCCAAGGATAGAGATTCTTCAACCGTTACACTTCCCAAGAGGTCCAGCCCCATTGAATTCTCACCCTCAAAAGTTAATGCTGCCGGTTTCCTATCCA
ATGAAAAGGAATCTAATGCTCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCACAATCTATGTGTAAAATAGTGTGGATACACCAACTGTTATCCGAGATGGGATTCAATGTTGTCATGCCAGCTAAATTATGGTGGGATAATAAAGCTACACGTCA
CATTGCATCTAACCTAGTGTTTCATGAACGGACTAAACATATTGAAGTGGATTATCATTTCATTCGCGAGAAAATACAAGAAGGGTTGGTGTCCACAGGATATGTGAAGA
CTGGAGAACAATTGGGAGATATTCTTACCAAAGTTATTTTAGTGGGTCCTAAGCTCAAACCAGCCCCTAAATTATTGTGGAGGTGGAACAAGGCGGTCAAAGCTTTGCTC
ACAGCATTATGGCTTGAAAGAGACTTGGAAGCCAACAAGATCGTTCTCTCAAAGTCGCAACTCAAGTGGTTCGTCGAAAATATTTTAGATTTGATTAAGGGCTCTACATC
ACGATTCTTTGAGAGAAGCTATCAAGACAATTCAGGTACCACAAAATTGTCTAAGTTCCTCAGCTCCACAAGTTGGATTATGAGATGTGTTGTTTGGCCATCCTTTGGGG
GAAGATACTTTATTCGTATTCCCTCCGAAGATTCAAAGCAAGGTGAAGACTTGGTGGACCTTGTGAAGACATCGGAAGCTCAAAAGAAACAGAGAATGAAATTGGCAGCT
CCTCCATATCTAAAACATGGCTCTATTTTAGTACCCGATAAACAGAAGATTTCAAACAAGAAGCATGACTCAATTTTTCTGAATTTTGGGGACATTGAATTAATTAATCC
TCCTTCAAGAATAAAAGATGATTTATTTGCTAAGGATTGTACAAACCCAATTGATTTAGTCCGTTTGAATCAAGTTTTAAAGGACAAAGGAGTAGAGGTTGAAGATCCAA
ATCCTGGACTAGCATTTTTGTTACCTTCTCGGCTTTCCCGGGGAAGTTTCTCGCCGGAAAAGCCATCGGAAGCTGGTTGTCCAGCCTCATCTGAACCCGATTCCAAGGTA
GGTGTTGATCGACATTTATTGAAGAATGAGAAAGAAGGTGAGGAGTCGGCCTCTGGCCAAGAAAAGACCCTTGGTATCCCCTTGAAAAAAGCCCAATCGGTCTTTGCCGT
CAAGGATAGAGATTCTTTAGCAGTTACACTTCCCAAGAGGTCCAGCCCCATTGAATTCTCACCCTCAAAAGTTAATGCTGCCAGTTTCCTATCCAATGAACAGGAATTAA
TGCTCAATGAAATAGCCCCACGCCTGCAGCAACTTTCTCAAAGCCTTTCGGCCCCCAACCATTCATCGCCGGAAAAGCCATCGAAAGCTGGTTGTCCGGCCTCATCTGAA
CCCGATTCCAAGGTAGGTGTTGATCGACATTTATTGAAGAATGAGAAAGAAGGTGAGGAGTCGGCCTCTAGCCAAGAAAAGACCCTTGGTATCCCTTTGAAAGAAGCCCA
ATCGGTCTTTGCCGCCAAGGATAGAGATTCTTCAACCGTTACACTTCCCAAGAGGTCCAGCCCCATTGAATTCTCACCCTCAAAAGTTAATGCTGCCGGTTTCCTATCCA
ATGAAAAGGAATCTAATGCTCGATGA
Protein sequenceShow/hide protein sequence
MAQSMCKIVWIHQLLSEMGFNVVMPAKLWWDNKATRHIASNLVFHERTKHIEVDYHFIREKIQEGLVSTGYVKTGEQLGDILTKVILVGPKLKPAPKLLWRWNKAVKALL
TALWLERDLEANKIVLSKSQLKWFVENILDLIKGSTSRFFERSYQDNSGTTKLSKFLSSTSWIMRCVVWPSFGGRYFIRIPSEDSKQGEDLVDLVKTSEAQKKQRMKLAA
PPYLKHGSILVPDKQKISNKKHDSIFLNFGDIELINPPSRIKDDLFAKDCTNPIDLVRLNQVLKDKGVEVEDPNPGLAFLLPSRLSRGSFSPEKPSEAGCPASSEPDSKV
GVDRHLLKNEKEGEESASGQEKTLGIPLKKAQSVFAVKDRDSLAVTLPKRSSPIEFSPSKVNAASFLSNEQELMLNEIAPRLQQLSQSLSAPNHSSPEKPSKAGCPASSE
PDSKVGVDRHLLKNEKEGEESASSQEKTLGIPLKEAQSVFAAKDRDSSTVTLPKRSSPIEFSPSKVNAAGFLSNEKESNAR