; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G03350 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G03350
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionCCHC-type domain-containing protein
Genome locationClcChr08:8356087..8357616
RNA-Seq ExpressionClc08G03350
SyntenyClc08G03350
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW94544.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.0e-7367.62Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL  TIKQGN  S+Q++AK +IFLR HLHEGLK EYLT+KD + LW NLK+RY+HQKTVILPKARY+WMHL  QDFK+VS+YNSALFKISS+L LCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN
        IT+ DMLEKTF+TFH  N+LLQQQY E+ F +YS+LISCLLVAEQNNELLM+NH+SRPTG+  FPEVNA++   RGRG GRG   G+GRN   + G +SN
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN

Query:  HLNFKRTTLN
        +    + +L+
Subjt:  HLNFKRTTLN

WP_217833168.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]4.2e-9486.67Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL ETIK+GNATSIQEKAK MIFLR HLHE LKMEYLTIKDL ILWKNLKQRY+HQKTVILPKARYEWMHL  QDFKSVSDYNSALFKISSKLLLCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNN--RGRGCGRGCDHGKGRNNYFFRGGH
        ITDA MLE TFSTFH  NML+QQQY EK FKQYS+LISCLLVAEQNNELLMKNHESRPTGTT FPEVNAVNFNN  RGRG GRG D G+GRN+Y+FRG H
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNN--RGRGCGRGCDHGKGRNNYFFRGGH

Query:  SNHLNFKRTT
        SNH NFKRTT
Subjt:  SNHLNFKRTT

XP_022144017.1 uncharacterized protein LOC111013806 [Momordica charantia]1.9e-8680Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL ETIK+ N T+ QE AK MIFLR HLHEGLKMEYLTIKDL  LW+NLK+RY+HQKTVILPKARYEWMHL  QDFKSVSDYNSALFKISSKL LCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNN--RGRGCGRGCDHGKGRNNYFFRGGH
        ITD+DMLEKT+STFH  N+LLQQQY EK FK+YS+LISCLLVA+QNNELLMKNHESRPTG T FPE NAVNFNN  RGRG GRG D G+GRNN+ FRGG 
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNN--RGRGCGRGCDHGKGRNNYFFRGGH

Query:  SNHLNFKRTT
         N  NFKRTT
Subjt:  SNHLNFKRTT

XP_028787995.1 uncharacterized protein LOC114743980 [Prosopis alba]1.5e-7268.84Show/hide
Query:  LRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEKIT
        L +TIK+GN  S Q+KAK MIFLR HLHEGLK+EYLTIKD  +LW NLK+RY+HQKTVILPKARY+WMHL  QDFKSV +YNSA+F+ISS+L LCGEKIT
Subjt:  LRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEKIT

Query:  DADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAV-NFNNRGRGCGRGCDHGKGRNNYFFRGGHSNH
        D DMLEKTFSTFH  N+LLQQQY EK FK+YS+LISCLLVAEQNNELLM+NHESRPTG+T FPEVN V   + RG G GRG   G+GR     RGG  +H
Subjt:  DADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAV-NFNNRGRGCGRGCDHGKGRNNYFFRGGHSNH

Query:  LNFKRTTLNDDHKGK
           ++   ND+H  K
Subjt:  LNFKRTTLNDDHKGK

XP_028800338.1 uncharacterized protein LOC114755637 [Prosopis alba]1.5e-7268.84Show/hide
Query:  LRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEKIT
        L +TIK+GN  S Q+KAK MIFLR HLHEGLK+EYLTIKD  +LW NLK+RY+HQKTVILPKARY+WMHL  QDFKSV +YNSA+F+ISS+L LCGEKIT
Subjt:  LRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEKIT

Query:  DADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAV-NFNNRGRGCGRGCDHGKGRNNYFFRGGHSNH
        D DMLEKTFSTFH  N+LLQQQY EK FK+YS+LISCLLVAEQNNELLM+NHESRPTG+T FPEVN V   + RG G GRG   G+GR     RGG  +H
Subjt:  DADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAV-NFNNRGRGCGRGCDHGKGRNNYFFRGGHSNH

Query:  LNFKRTTLNDDHKGK
           ++   ND+H  K
Subjt:  LNFKRTTLNDDHKGK

TrEMBL top hitse value%identityAlignment
A0A286QJ35 Gag4.9e-7266.82Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL  TIKQGN  S+Q++AK +IFLR HLHEGLK EYLT+KD + LW NLK+RY+HQKTVILPKARY+WMHL  QDFK+VS+YNSALFKISS+L LCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGR-NNYFFRGGHS
        IT+ DMLEKTF+TFH  N+LLQQQY E+ F +YS+LISCLLVAEQNNELLM+NH+SRPTG+  FPEVNA++   RGRG  RG   G+GR  N  + G +S
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGR-NNYFFRGGHS

Query:  NHLNFKRTTLN
        N+    + +L+
Subjt:  NHLNFKRTTLN

A0A438ID00 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-7367.62Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL  TIKQGN  S+Q++AK +IFLR HLHEGLK EYLT+KD + LW NLK+RY+HQKTVILPKARY+WMHL  QDFK+VS+YNSALFKISS+L LCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN
        IT+ DMLEKTF+TFH  N+LLQQQY E+ F +YS+LISCLLVAEQNNELLM+NH+SRPTG+  FPEVNA++   RGRG GRG   G+GRN   + G +SN
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN

Query:  HLNFKRTTLN
        +    + +L+
Subjt:  HLNFKRTTLN

A0A6J1CSH6 uncharacterized protein LOC1110138069.1e-8780Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL ETIK+ N T+ QE AK MIFLR HLHEGLKMEYLTIKDL  LW+NLK+RY+HQKTVILPKARYEWMHL  QDFKSVSDYNSALFKISSKL LCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNN--RGRGCGRGCDHGKGRNNYFFRGGH
        ITD+DMLEKT+STFH  N+LLQQQY EK FK+YS+LISCLLVA+QNNELLMKNHESRPTG T FPE NAVNFNN  RGRG GRG D G+GRNN+ FRGG 
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNN--RGRGCGRGCDHGKGRNNYFFRGGH

Query:  SNHLNFKRTT
         N  NFKRTT
Subjt:  SNHLNFKRTT

A5AJ43 Uncharacterized protein9.8e-7367.14Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL  TIKQGN  S+Q++AK +IFLR HLHEGLK EY T+KD + LW NLK+RY+HQKTVILPKARY+WMHL  QDFK+VS+YNSALFKISS+L LCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN
        IT+ DMLEKTF+TFH  N+LLQQQY E+ F +YS+LISCLLVAEQNNELLM+NH+SRPTG+  FPEVNA++   RGRG GRG   G+GRN   + G +SN
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN

Query:  HLNFKRTTLN
        +    + +L+
Subjt:  HLNFKRTTLN

A5B4M9 Uncharacterized protein2.2e-7267.14Show/hide
Query:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK
        MNL  TIKQGN  S+Q++AK +IFLR HLHEGLK EYLT+KD + LW NLK+RY+HQKTVILPKARY+WMHL  QDFK+VS+YNSALFKISS+L LCGEK
Subjt:  MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEK

Query:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN
        IT+ DMLEKTF+TFH  N+LLQQQY E+ F +YS+LISCLLVAEQNNELLM+NH+SRPTG+  FPEVNA++   RGRG GRG   G+G N   + G +SN
Subjt:  ITDADMLEKTFSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSN

Query:  HLNFKRTTLN
        +    + +L+
Subjt:  HLNFKRTTLN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCTGAGAGAAACAATTAAACAGGGAAATGCAACATCCATTCAGGAAAAAGCAAAAACTATGATTTTCCTTCGTCGTCATCTCCACGAGGGATTGAAGATGGAATA
TTTAACAATAAAAGATCTCTATATCTTGTGGAAGAATTTAAAACAGAGATATAATCATCAGAAAACAGTCATTCTTCCTAAAGCTCGTTATGAATGGATGCATTTGAGTT
TCCAAGATTTCAAATCAGTAAGTGACTACAACTCCGCATTATTTAAAATCAGTTCAAAATTGTTGTTGTGCGGAGAGAAAATTACCGATGCGGATATGTTGGAGAAGACA
TTTTCTACATTTCATACCTTGAATATGCTCCTGCAGCAGCAATATCCAGAGAAATGTTTTAAACAATATTCTAAATTAATTTCATGTCTTCTCGTAGCTGAACAAAATAA
TGAGCTATTGATGAAGAACCATGAATCTCGACCAACTGGAACAACACAATTCCCTGAAGTGAATGCTGTAAATTTTAATAATCGTGGTCGAGGTTGTGGTCGTGGTTGTG
ACCATGGCAAAGGAAGAAATAATTATTTTTTTCGTGGTGGTCATTCTAATCATCTAAATTTCAAAAGAACCACCTTAAATGATGATCATAAAGGAAAAGCTCCATAA
mRNA sequenceShow/hide mRNA sequence
TGGAGAGAAAAAAGAGAAAGGATTGGTGAAATTTGAGAAATTTAATTTTCCATAAAATCTCACTTTCTTTATTTTAGTTTATTTTAAGTTTACTTTATTTAATTTGGTTA
TTTTAATTCCTTTGATATTATTTTATTATTTATTATTTTAATATTTTAATATCTCTGTTTTTGTGCTCCTGTCTTTCTGTCATTTTGCTCCTCAATTTTACAATACGTTA
TTAGCATGAGCTCTTGTCGTTTCTCTGTTGAAGATAGGTTCTGAAGTGTTATCATGGCAAATCTTACAAAAATAGAATTTGCCACCCTTGACATCAATGGAAATAATTAT
TGTCATGGGTGCTTGATGCTGAAATACATCTAGATGCCATGAACCTGAGAGAAACAATTAAACAGGGAAATGCAACATCCATTCAGGAAAAAGCAAAAACTATGATTTTC
CTTCGTCGTCATCTCCACGAGGGATTGAAGATGGAATATTTAACAATAAAAGATCTCTATATCTTGTGGAAGAATTTAAAACAGAGATATAATCATCAGAAAACAGTCAT
TCTTCCTAAAGCTCGTTATGAATGGATGCATTTGAGTTTCCAAGATTTCAAATCAGTAAGTGACTACAACTCCGCATTATTTAAAATCAGTTCAAAATTGTTGTTGTGCG
GAGAGAAAATTACCGATGCGGATATGTTGGAGAAGACATTTTCTACATTTCATACCTTGAATATGCTCCTGCAGCAGCAATATCCAGAGAAATGTTTTAAACAATATTCT
AAATTAATTTCATGTCTTCTCGTAGCTGAACAAAATAATGAGCTATTGATGAAGAACCATGAATCTCGACCAACTGGAACAACACAATTCCCTGAAGTGAATGCTGTAAA
TTTTAATAATCGTGGTCGAGGTTGTGGTCGTGGTTGTGACCATGGCAAAGGAAGAAATAATTATTTTTTTCGTGGTGGTCATTCTAATCATCTAAATTTCAAAAGAACCA
CCTTAAATGATGATCATAAAGGAAAAGCTCCATAAGATAAAAATACAAAAGATGATGAACATAAATGCTACCGATGCAGGATGACTGAGCACTGATCTCGTGTTTGTCGT
ACATCAAAATACCTAGTTGATATCTATCAAGTTTCCTTGAAGGAAAAAGAGAAAAATGTGGAAACAAATTTTGCATACCAGAATAATGATATATTTGACCCATCCAATAT
GACAAATTTAGATGTGGCGGACTTATTTGAATCTTCTGAAGAGATAATCAGCACAGATGATAGCATAGCAAGTGTTTCTTTTAATTTTGAGAATATTTAGAATTAATGTT
GTTTTTTTTGTCCATCTATCTTTCATATTTTTTTTAAACTTTTGTATTTCAAGTTTTGTTTGTCTTATTGTAATTCTTTTGTTTAAATGAATAATCATAGACAATTTTCA
T
Protein sequenceShow/hide protein sequence
MNLRETIKQGNATSIQEKAKTMIFLRRHLHEGLKMEYLTIKDLYILWKNLKQRYNHQKTVILPKARYEWMHLSFQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKT
FSTFHTLNMLLQQQYPEKCFKQYSKLISCLLVAEQNNELLMKNHESRPTGTTQFPEVNAVNFNNRGRGCGRGCDHGKGRNNYFFRGGHSNHLNFKRTTLNDDHKGKAP