; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G16980 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G16980
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr09:27100965..27101504
RNA-Seq ExpressionClc09G16980
SyntenyClc09G16980
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.7e-8587.15Show/hide
Query:  MSEGDNPTFDIDLEVERTFRRRIRQIKQKMSNQNTEENMDAQYNQPQAPQVGVRQNADAQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTF
        MSEGDNPTFDID E+ERTFRRRIRQIKQ+ SNQ  EENMDAQYNQPQAPQ GVRQNA+AQEHAN D NPI+L HDRNRPMREYASPNLYNFAPGILQPTF
Subjt:  MSEGDNPTFDIDLEVERTFRRRIRQIKQKMSNQNTEENMDAQYNQPQAPQVGVRQNADAQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTF

Query:  EGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI
        EG GRFEMKPVMLQMLQAAGQFGGA GEDPHAHLKSF+E+CSAFP  GV QD  RLTLFPFSL+DEA+QWAYSFEPGEI
Subjt:  EGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI

XP_038874908.1 uncharacterized protein LOC120067411 [Benincasa hispida]1.1e-3159.82Show/hide
Query:  AQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTL
        AQ+ A+   NPI++ +   RPMREYASP LY+F+P I+ PT +G  RFEMK VMLQMLQ AGQF GA+GEDPHAH+K F+E C++F  PG++ +  RL+L
Subjt:  AQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTL

Query:  FPFSLKDEAKQW
        FP+SL+D+AKQW
Subjt:  FPFSLKDEAKQW

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]9.2e-3142.46Show/hide
Query:  MSEGDNPTFDIDLEVERTFRRRIRQIKQKMSNQNTEENMDAQYNQPQAPQVGVRQNADAQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTF
        MS   +P F+ + E++ TFR R  Q +      N  +N     N  +AP+     N   Q+       P+ L  D N P+R YA+PNLY+F+PGI +P  
Subjt:  MSEGDNPTFDIDLEVERTFRRRIRQIKQKMSNQNTEENMDAQYNQPQAPQVGVRQNADAQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTF

Query:  EGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI
        E   RFE+KPVM+QM+Q   QF   + E+PHAHL  F+E+CS F  PG++  G RL LFP++L+D+AK+WA+S E  EI
Subjt:  EGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI

XP_038890241.1 uncharacterized protein LOC120079867 [Benincasa hispida]2.3e-2956Show/hide
Query:  QEHANNDI-----NPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGT
        +E AN+ +     NPIL+ +   R MREYAS  LY+F+P IL PT +G  RFEMK VMLQMLQ  GQFGG  GEDPH H+K F+E+C++F  P  + +  
Subjt:  QEHANNDI-----NPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGT

Query:  RLTLFPFSLKDEAKQWAYSFEPGEI
        RL+LFP+SL+D+AKQW  S EPGEI
Subjt:  RLTLFPFSLKDEAKQWAYSFEPGEI

XP_038890753.1 uncharacterized protein LOC120080236 [Benincasa hispida]7.8e-3059.26Show/hide
Query:  LTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWA
        + +   RPM+EYASP LY+F+PGI+ PT +G  RFEMK VMLQMLQ AGQF GA+GEDPHAH+K F+E C++F  P ++ +  RL+LFP+SL+D AK+  
Subjt:  LTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWA

Query:  YSFEPGEI
         S EPGEI
Subjt:  YSFEPGEI

TrEMBL top hitse value%identityAlignment
A0A6J1DW02 uncharacterized protein LOC1110248979.0e-2438.04Show/hide
Query:  IDLEVERTFR--RRIRQIKQKMSNQNTEENMDAQYNQ------------PQAPQVGVRQNADAQEHANND-INPILLTHDRNRPMREYASPNLYNFAPGI
        +D E+ERT R  R+ +++++++  Q   E   +  ++            P+ P      N + ++HA ND  N I +  +R+  MREYA+    NF  GI
Subjt:  IDLEVERTFR--RRIRQIKQKMSNQNTEENMDAQYNQ------------PQAPQVGVRQNADAQEHANND-INPILLTHDRNRPMREYASPNLYNFAPGI

Query:  LQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI
        + P       FE+KP+M QMLQ  G FGG + EDPH HLKSF+++ +AF  PG++ D   LTLFPFSLKD+A+    +F  G I
Subjt:  LQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI

A0A6J1E251 uncharacterized protein LOC1110253022.9e-2245.19Show/hide
Query:  PQAPQV-GVRQNADAQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAF
        PQ P V G     +A        N ILL  +R+  MR Y +   +N   GI  P    A +FE+KPVM Q+LQ  GQFGG   EDP++HLKSF+E+ +AF
Subjt:  PQAPQV-GVRQNADAQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAF

Query:  PTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI
          PG S+D  RL +FPFSL+D A+ W  + EP  I
Subjt:  PTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI

A0A6J1EQ90 uncharacterized protein LOC1114364115.3e-2442.31Show/hide
Query:  FDIDLEVERTFRRRIRQIKQKMSNQNTEE-NMDAQYNQPQAPQVGVRQNADAQEHANND---INPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAG
        F +D E+ERTFRRR+++ K+KM+ QN ++  + AQ N         R+  +    AN +    NPI L  DR R +R YA P +    P I++P  +G  
Subjt:  FDIDLEVERTFRRRIRQIKQKMSNQNTEE-NMDAQYNQPQAPQVGVRQNADAQEHANND---INPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAG

Query:  RFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEV-------CSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI
         FE+KPVM QMLQ  GQF G   EDPH HLKSF+ V         +F   GV +D  RL+LFP+ L+D AK W  +  PG I
Subjt:  RFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEV-------CSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI

U5CUI2 Retrotrans_gag domain-containing protein6.2e-2549.57Show/hide
Query:  ANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFS
        A   +NPI+L  DR R +REYA+P      PGI++P  + A +FE+KPVM QMLQ  GQF G   EDPH HL+SF+EV  +F   GVS++  RL LFPFS
Subjt:  ANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFS

Query:  LKDEAKQWAYSFEPGEI
        L+D A+ W  +  P  +
Subjt:  LKDEAKQWAYSFEPGEI

U5CUK7 Uncharacterized protein2.1e-2549.57Show/hide
Query:  ANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFS
        A+N+ NPI L  DR R +REYA+P       GI++P  + A  FE+KPVM QMLQ  GQFGG+  EDPH H++SF+EV  +F   GVS++  RL LFPFS
Subjt:  ANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKPVMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFS

Query:  LKDEAKQWAYSFEPGEI
        L+D A+ W  +  P  +
Subjt:  LKDEAKQWAYSFEPGEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGAAGGGGACAACCCAACATTTGATATTGACCTAGAGGTTGAGAGGACCTTCAGAAGAAGGATAAGACAGATTAAGCAAAAAATGAGCAATCAAAATACTGAAGA
AAATATGGATGCTCAATATAATCAACCTCAAGCTCCTCAAGTTGGGGTTAGGCAAAACGCAGATGCGCAAGAGCATGCAAACAACGATATAAACCCAATTCTCTTGACGC
ATGATAGAAATCGCCCAATGAGGGAGTATGCGTCGCCCAATCTATATAATTTCGCACCGGGGATTTTACAACCAACCTTTGAGGGTGCTGGAAGGTTTGAAATGAAGCCA
GTGATGCTACAGATGTTGCAAGCAGCAGGGCAATTTGGGGGTGCAAAGGGTGAAGATCCACATGCACACTTGAAAAGCTTTATGGAAGTATGTAGCGCATTCCCCACGCC
TGGAGTGTCGCAAGATGGCACTAGACTGACGCTATTTCCGTTTTCTCTCAAAGATGAGGCCAAACAATGGGCATATTCATTTGAACCTGGCGAGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGAAGGGGACAACCCAACATTTGATATTGACCTAGAGGTTGAGAGGACCTTCAGAAGAAGGATAAGACAGATTAAGCAAAAAATGAGCAATCAAAATACTGAAGA
AAATATGGATGCTCAATATAATCAACCTCAAGCTCCTCAAGTTGGGGTTAGGCAAAACGCAGATGCGCAAGAGCATGCAAACAACGATATAAACCCAATTCTCTTGACGC
ATGATAGAAATCGCCCAATGAGGGAGTATGCGTCGCCCAATCTATATAATTTCGCACCGGGGATTTTACAACCAACCTTTGAGGGTGCTGGAAGGTTTGAAATGAAGCCA
GTGATGCTACAGATGTTGCAAGCAGCAGGGCAATTTGGGGGTGCAAAGGGTGAAGATCCACATGCACACTTGAAAAGCTTTATGGAAGTATGTAGCGCATTCCCCACGCC
TGGAGTGTCGCAAGATGGCACTAGACTGACGCTATTTCCGTTTTCTCTCAAAGATGAGGCCAAACAATGGGCATATTCATTTGAACCTGGCGAGATTTGA
Protein sequenceShow/hide protein sequence
MSEGDNPTFDIDLEVERTFRRRIRQIKQKMSNQNTEENMDAQYNQPQAPQVGVRQNADAQEHANNDINPILLTHDRNRPMREYASPNLYNFAPGILQPTFEGAGRFEMKP
VMLQMLQAAGQFGGAKGEDPHAHLKSFMEVCSAFPTPGVSQDGTRLTLFPFSLKDEAKQWAYSFEPGEI