; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04260 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04260
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr08:12690989..12691318
RNA-Seq ExpressionClc08G04260
SyntenyClc08G04260
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN83929.1 hypothetical protein VITISV_025158 [Vitis vinifera]3.2e-1052.86Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGRGNHHKSRSK
        MSEEDKLFNF+SGLQ WAQTKLRRQ V+DLP+A+  AD L+++K   + ++ +K   ERG+   ++ ++K
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGRGNHHKSRSK

KAF5180438.1 hypothetical protein FRX31_029975 [Thalictrum thalictroides]4.2e-1060.66Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGR
        MSEEDKLFNFI GLQ WAQ++LRR  +KDLPSAI  AD L+DFK  TS+++ +  +K  G+
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGR

RVW15655.1 hypothetical protein CK203_075362 [Vitis vinifera]4.2e-1057.38Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGR
        MSEEDKLFNF+SGLQ WAQT+LRRQ V+DLP+A+  AD L+D+K   + ++ ++ R E G+
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGR

RVW30894.1 Transposon Tf2-12 polyprotein [Vitis vinifera]2.5e-1057.38Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGR
        MSEEDKLFNF+SGLQ WAQT+LRRQ V+DLP+A+  AD L+D+K   + ++ ++ R +RG+
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGR

XP_038979221.1 uncharacterized protein LOC120109561 [Phoenix dactylifera]1.9e-1055.71Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGRGNHHKSRSK
        MSEEDKLFNF+SGLQPWAQT+LRRQAVKD+PSA+  A+AL+DF+            KE+ +    K +S+
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGRGNHHKSRSK

TrEMBL top hitse value%identityAlignment
A0A7N2KL10 Uncharacterized protein1.8e-1143.27Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGRGNHHKSRSKCEVDRSDK---GKEKATTSLSNTAASYARD
        MSEEDKLFNF+SGLQPWAQ +L+RQ V+DLP+A+  ADAL+D+K +  +   EK RK + +G   + +   + D+  K    K K  +S S  +    + 
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGRGNHHKSRSKCEVDRSDK---GKEKATTSLSNTAASYARD

Query:  RIGC
         +GC
Subjt:  RIGC

A0A7N2LH98 Uncharacterized protein1.1e-1144.66Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR
        MSEEDKLFNF+SGLQPWAQ +L+RQAV+DLP+A+  ADAL+D+K +  +   EK + K++G+    K   K +  +   G K K  +S S  +    +  
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR

Query:  IGC
         GC
Subjt:  IGC

A0A7N2LMM6 Uncharacterized protein4.1e-1143.69Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR
        MSEE+KLFNF+SGLQPWAQ +L+RQAV+DLP+A+  ADAL+D+K +  +   EK + K++G+    K   K +  +   G K K  +S S  +    +  
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR

Query:  IGC
         GC
Subjt:  IGC

A0A7N2N6K8 Uncharacterized protein1.1e-1144.66Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR
        MSEEDKLFNF+SGLQPWAQ +L+RQAV+DLP+A+  ADAL+D+K +  +   EK + K++G+    K   K +  +   G K K  +S S  +    +  
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR

Query:  IGC
         GC
Subjt:  IGC

A0A7N2R543 Uncharacterized protein1.1e-1144.66Show/hide
Query:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR
        MSEEDKLFNF+SGLQPWAQ +L+RQAV+DLP+A+  ADAL+D+K +  +   EK + K++G+    K   K +  +   G K K  +S S  +    +  
Subjt:  MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMR-KERGRGNHHKSRSKCEVDRSDKG-KEKATTSLSNTAASYARDR

Query:  IGC
         GC
Subjt:  IGC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGAGGAGGACAAATTATTCAACTTCATCTCGGGGCTCCAACCTTGGGCTCAGACAAAGCTGCGGAGGCAAGCCGTGAAGGACCTTCCTTCCGCCATCGTCACTGC
AGATGCCCTACTGGATTTTAAGACCACAACCTCCACCACTTCTCCAGAGAAGATGCGGAAAGAAAGGGGGAGGGGCAACCATCATAAGAGCCGATCCAAATGTGAAGTGG
ACCGCAGCGACAAGGGGAAGGAGAAGGCTACCACGTCACTCTCCAATACAGCTGCTTCCTATGCAAGGGACCGCATTGGGTGCGAGCATGCCCGCGGCGAGAGAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGAGGAGGACAAATTATTCAACTTCATCTCGGGGCTCCAACCTTGGGCTCAGACAAAGCTGCGGAGGCAAGCCGTGAAGGACCTTCCTTCCGCCATCGTCACTGC
AGATGCCCTACTGGATTTTAAGACCACAACCTCCACCACTTCTCCAGAGAAGATGCGGAAAGAAAGGGGGAGGGGCAACCATCATAAGAGCCGATCCAAATGTGAAGTGG
ACCGCAGCGACAAGGGGAAGGAGAAGGCTACCACGTCACTCTCCAATACAGCTGCTTCCTATGCAAGGGACCGCATTGGGTGCGAGCATGCCCGCGGCGAGAGAAGTTGA
Protein sequenceShow/hide protein sequence
MSEEDKLFNFISGLQPWAQTKLRRQAVKDLPSAIVTADALLDFKTTTSTTSPEKMRKERGRGNHHKSRSKCEVDRSDKGKEKATTSLSNTAASYARDRIGCEHARGERS