CuGenDBv2

Clc01G10310 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G10310
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrotransposon protein
Genome location: ClcChr01:13214778..13215339
RNA-Seq Expression: Clc01G10310
Synteny: Clc01G10310
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0031677.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] (e value 2.6e-41, identity 58.28%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHL--------------YENCPGALDDTYIKVNV
        ++ T   L   + +D+EEMVAMFLHIL HDVKNR++++QF RSGETVSR+FN+VL A L LH+ LL  L              +ENC GALDDTYIKVNV
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHL--------------YENCPGALDDTYIKVNV

Query:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +A D+PRY+ RKGE+ATNVL +C    +F+FV+ GWEG AA+SR+LRDAIS
Subjt:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

KAA0032395.1 retrotransposon protein [Cucumis melo var. makuwa] (e value 2.6e-41, identity 60.58%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHLYENCPGALDDTYIKVNVNAVDQPRYRIRKGE
        +++    L+ T+ VD+EEMVAMFLH+LAHDVKNRV++++F RS ETVSRYFN+VL   L L+E L+     NC GALD TYIK+NV A D+P +R RKGE
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHLYENCPGALDDTYIKVNVNAVDQPRYRIRKGE

Query:  IATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        IATNVL +C  NR+F++V+  WEGFAA+SR+LRDA+S
Subjt:  IATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

KAA0033290.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] (e value 4.4e-41, identity 56.95%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV
        +++T  RL  T+ +D+EEMVAMFLHILAHD+KNR+++++F RSGETVSR+FN+VL + L LH  LL                 +ENC GALDDTYIKVNV
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV

Query:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +A D+PRY  RKGE+A NVL +C    +F+FV+ GWEG AA+SR+LRDAIS
Subjt:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

KAA0050107.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] (e value 6.2e-43, identity 62.25%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV
        M++T G L  TQ VD+EEMVA+FLHI+AHDVKNRV R+ FARSGETVSR+FN VL A L LHE+LL                 ++NC GAL  T+IKVNV
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV

Query:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +  D+PRYR RKG+I TNVL +CS N EFIFVMPGWEG A++SRVLRDA+S
Subjt:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

XP_038875111.1 uncharacterized protein LOC120067643 [Benincasa hispida] (e value 5.0e-45, identity 65.56%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV
        M++T  R APTQC+D++EMVA+FLHIL HDVKNRVV ++FA SGETVSR+F  VLT  L LHE+LL                 +ENC GALDDTYIKVNV
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV

Query:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +AVD+  YR RKGEIATNVLAICSP  EFIFV+P WE   ANSRVLRDAIS
Subjt:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SQU2 Putative nuclease HARBI1 (e value 2.1e-41, identity 56.95%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV
        +++T  RL  T+ +D+EEMVAMFLHILAHD+KNR+++++F RSGETVSR+FN+VL + L LH  LL                 +ENC GALDDTYIKVNV
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV

Query:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +A D+PRY  RKGE+A NVL +C    +F+FV+ GWEG AA+SR+LRDAIS
Subjt:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

A0A5A7ST53 Retrotransposon protein (e value 1.3e-41, identity 60.58%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHLYENCPGALDDTYIKVNVNAVDQPRYRIRKGE
        +++    L+ T+ VD+EEMVAMFLH+LAHDVKNRV++++F RS ETVSRYFN+VL   L L+E L+     NC GALD TYIK+NV A D+P +R RKGE
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHLYENCPGALDDTYIKVNVNAVDQPRYRIRKGE

Query:  IATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        IATNVL +C  NR+F++V+  WEGFAA+SR+LRDA+S
Subjt:  IATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

A0A5A7T1V5 Retrotransposon protein (e value 2.8e-41, identity 61.31%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHLYENCPGALDDTYIKVNVNAVDQPRYRIRKGE
        +++    L+ T+ VD+EEMVAMFLH+LAHDVKNRV++++F RSGETVSR+FN+VL A L L+E L+     NC GALD TYIKVNV A D+P +R RKGE
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHLYENCPGALDDTYIKVNVNAVDQPRYRIRKGE

Query:  IATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        IATNVL +C    +F++V+ GWEG AA+SR+LRDAIS
Subjt:  IATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

A0A5A7U6W3 Putative nuclease HARBI1 (e value 3.0e-43, identity 62.25%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV
        M++T G L  TQ VD+EEMVA+FLHI+AHDVKNRV R+ FARSGETVSR+FN VL A L LHE+LL                 ++NC GAL  T+IKVNV
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLL--------------LHLYENCPGALDDTYIKVNV

Query:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +  D+PRYR RKG+I TNVL +CS N EFIFVMPGWEG A++SRVLRDA+S
Subjt:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

A0A5D3BXH4 Putative nuclease HARBI1 (e value 1.3e-41, identity 58.28%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHL--------------YENCPGALDDTYIKVNV
        ++ T   L   + +D+EEMVAMFLHIL HDVKNR++++QF RSGETVSR+FN+VL A L LH+ LL  L              +ENC GALDDTYIKVNV
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHL--------------YENCPGALDDTYIKVNV

Query:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +A D+PRY+ RKGE+ATNVL +C    +F+FV+ GWEG AA+SR+LRDAIS
Subjt:  NAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT5G28950.1 unknown protein (e value 1.1e-13, identity 45.59%)
Query:  YENCPGALDDTYIKVNVNAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
        +++C GA+DDT+I   V+    P +R RKG+I+ N+LA C+ + EF++V+ GWEG A +S+VL DA++
Subjt:  YENCPGALDDTYIKVNVNAVDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e value 4.3e-18, identity 33.56%)
Query:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLL------------HLYENCPGALDDTYIKVNVNA
        ++QT G L  T  + IE  +A+FL I+ H+++ R V++ F  SGET+SR+FN VL A + + +                  +++C G +D  +I V V  
Subjt:  MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLL------------HLYENCPGALDDTYIKVNVNA

Query:  VDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS
         +Q  +R   G +  NVLA  S +  F +V+ GWEG A++ +VL  A++
Subjt:  VDQPRYRIRKGEIATNVLAICSPNREFIFVMPGWEGFAANSRVLRDAIS


Sequences
CDS sequence
ATGGTTCAAACGTTTGGTCGTCTAGCACCCACTCAGTGTGTTGACATTGAGGAGATGGTGGCAATGTTCCTCCACATACTGGCTCACGACGTTAAAAACAGAGTTGTTCG
ACAACAATTTGCACGTTCTGGTGAAACTGTTTCTAGGTATTTCAACGTCGTTCTTACTGCAGAGCTCCACCTTCATGAGGTACTATTGCTCCACCTTTATGAGAATTGTC
CTGGTGCATTGGATGACACATACATTAAGGTCAATGTAAATGCAGTAGACCAGCCTCGATACCGTATAAGAAAGGGTGAGATAGCCACGAACGTGCTTGCTATTTGTTCC
CCAAATAGAGAGTTCATATTCGTGATGCCAGGGTGGGAAGGGTTTGCAGCCAACTCTAGAGTGCTTAGGGATGCTATTTCATAA
mRNA sequence
ATGGTTCAAACGTTTGGTCGTCTAGCACCCACTCAGTGTGTTGACATTGAGGAGATGGTGGCAATGTTCCTCCACATACTGGCTCACGACGTTAAAAACAGAGTTGTTCG
ACAACAATTTGCACGTTCTGGTGAAACTGTTTCTAGGTATTTCAACGTCGTTCTTACTGCAGAGCTCCACCTTCATGAGGTACTATTGCTCCACCTTTATGAGAATTGTC
CTGGTGCATTGGATGACACATACATTAAGGTCAATGTAAATGCAGTAGACCAGCCTCGATACCGTATAAGAAAGGGTGAGATAGCCACGAACGTGCTTGCTATTTGTTCC
CCAAATAGAGAGTTCATATTCGTGATGCCAGGGTGGGAAGGGTTTGCAGCCAACTCTAGAGTGCTTAGGGATGCTATTTCATAA
Protein sequence
MVQTFGRLAPTQCVDIEEMVAMFLHILAHDVKNRVVRQQFARSGETVSRYFNVVLTAELHLHEVLLLHLYENCPGALDDTYIKVNVNAVDQPRYRIRKGEIATNVLAICS
PNREFIFVMPGWEGFAANSRVLRDAIS
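
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 1 from the first base. The names `CODON_TABLE` and `translate` are hypothetical helpers and are not part of CuGenDB.

```python
# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# CDS of Clc01G10310, copied from the record above (line breaks removed).
cds = (
    "ATGGTTCAAACGTTTGGTCGTCTAGCACCCACTCAGTGTGTTGACATTGAGGAGATGGTGGCAATGTTCCTCCACATACTGGCTCACGACGTTAAAAACAGAGTTGTTCG"
    "ACAACAATTTGCACGTTCTGGTGAAACTGTTTCTAGGTATTTCAACGTCGTTCTTACTGCAGAGCTCCACCTTCATGAGGTACTATTGCTCCACCTTTATGAGAATTGTC"
    "CTGGTGCATTGGATGACACATACATTAAGGTCAATGTAAATGCAGTAGACCAGCCTCGATACCGTATAAGAAAGGGTGAGATAGCCACGAACGTGCTTGCTATTTGTTCC"
    "CCAAATAGAGAGTTCATATTCGTGATGCCAGGGTGGGAAGGGTTTGCAGCCAACTCTAGAGTGCTTAGGGATGCTATTTCATAA"
)

protein = translate(cds)
print(len(cds), len(protein))
print(protein)
```

The 414-nucleotide CDS translates to the 137-residue protein listed above, followed by a TAA stop codon.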