; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G004890 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G004890
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrotrans_gag domain-containing protein
Genome locationCG_Chr08:15248103..15248649
RNA-Seq ExpressionClCG08G004890
SyntenyClCG08G004890
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.5e-1533.93Show/hide
Query:  NILPLDPWIERTCRRNLRVQQNQPEEMAEE----IPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEICEI
        +I+P+DP IERT R    +++N+   MAEE    +P+ +++Y +P +  N   I+  PIN NNFELK  LI M ++  F G   +DP+ +L  FLEIC+ 
Subjt:  NILPLDPWIERTCRRNLRVQQNQPEEMAEE----IPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEICEI

Query:  -----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQ
                                           I +W+ +A+ FL K+F PAK+ +LR +IG F Q
Subjt:  -----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQ

PNX93836.1 hypothetical protein L195_g016998, partial [Trifolium pratense]3.4e-1533.72Show/hide
Query:  MPRDNTNILPLDPWIERTCRRNLRVQQNQPEEMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEIC
        + R    +L  D  IERT RRN    + +   MAE   + +R+YF P+       I+N P+  NNFELK GLI+M +   FRG   EDP+ +L++F+ + 
Subjt:  MPRDNTNILPLDPWIERTCRRNLRVQQNQPEEMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEIC

Query:  -----------------------------------EIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
                                           + ITTWE L+Q FLNKYF P K+ ++R +I  F Q E
Subjt:  -----------------------------------EIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]7.7e-5266.28Show/hide
Query:  MPRDNTNILPLDPWIERTCRRNLRVQQNQPEEMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEIC
        MPRDNTN+LPLDP I+RT RRNLR   NQ  EMAEEIPKAIR+YFQPTLPA++P I+NVPINVNNFELK GLI+MARELAFRGRTNEDPHK+LRSFLEIC
Subjt:  MPRDNTNILPLDPWIERTCRRNLRVQQNQPEEMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEIC

Query:  -----------------------------------EIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
                                           + ITTWEILAQAFLNKYF PAKSQRLR +IGTF QLE
Subjt:  -----------------------------------EIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]2.0e-1536.16Show/hide
Query:  NTNILPLDPWIERTCRRNLRVQQNQPEEMAEE---------IPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRS
        N ++L +DP  ERT R    +Q+N+ E MAE+           +AIR+Y +P +  N   I    I   NFELK GLI M ++  F G   EDP+ +L S
Subjt:  NTNILPLDPWIERTCRRNLRVQQNQPEEMAEE---------IPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRS

Query:  FLEICEI-----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
        FLEIC+                                    ITTW+ LAQ FL KYF P+KS +LR +I  F QL+
Subjt:  FLEICEI-----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

XP_022883666.1 uncharacterized protein LOC111400483 [Olea europaea var. sylvestris]7.6e-1540Show/hide
Query:  NILPLDPWIERTCRRNLRVQQNQPEEMAEE---------IPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFL
        ++LP+DP  ERT R    +Q+N+ E M E+           KAI +Y +P +  N   I    I  NNFELK GLI M ++  F G   EDP+ +L SFL
Subjt:  NILPLDPWIERTCRRNLRVQQNQPEEMAEE---------IPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFL

Query:  EICEIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
        EIC+ +     +   FL KYF P+KS +L  +I  F QL+
Subjt:  EICEIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

TrEMBL top hitse value%identityAlignment
A0A2K3MSV7 Uncharacterized protein (Fragment)1.6e-1533.72Show/hide
Query:  MPRDNTNILPLDPWIERTCRRNLRVQQNQPEEMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEIC
        + R    +L  D  IERT RRN    + +   MAE   + +R+YF P+       I+N P+  NNFELK GLI+M +   FRG   EDP+ +L++F+ + 
Subjt:  MPRDNTNILPLDPWIERTCRRNLRVQQNQPEEMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEIC

Query:  -----------------------------------EIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
                                           + ITTWE L+Q FLNKYF P K+ ++R +I  F Q E
Subjt:  -----------------------------------EIITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

A0A3S3N117 Retrotrans_gag domain-containing protein1.2e-1331.61Show/hide
Query:  NTNILPLDPWIERTCRRNLRVQQNQPE----EMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKM-ARELAFRGRTNEDPHKYLRSFLEI
        N N++PLDP IERT RR  + ++ Q E    EM E+  +++ +Y  P +      I    I  NNFE+K  +I+M A  + F G  ++DP+ ++ +FLE+
Subjt:  NTNILPLDPWIERTCRRNLRVQQNQPE----EMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKM-ARELAFRGRTNEDPHKYLRSFLEI

Query:  CE-----------------------------------IITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLEL
        C+                                    ITTW+ LA+ FL K+F P K+ ++R  I TF Q E+
Subjt:  CE-----------------------------------IITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLEL

A0A4Q3EK46 Retrotrans_gag domain-containing protein (Fragment)2.6e-1331.95Show/hide
Query:  TNILPLDPWIERTCRRNLRVQQNQPEEMAEEIP-KAIREYFQPTLPANRPRILNVPINVN-NFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEICEI-
        +++LP+DP IE+TC++N R ++ Q   MAE  P + + EY QPT+   R  I N  +  N +FE+K G+I M ++  F G  NEDP++++ +F E+C+  
Subjt:  TNILPLDPWIERTCRRNLRVQQNQPEEMAEEIP-KAIREYFQPTLPANRPRILNVPINVN-NFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEICEI-

Query:  ----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
                                          I +WE L   F  K+F   K+ RLR +I +F Q E
Subjt:  ----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129452.6e-1331.38Show/hide
Query:  RDNTNILPLDPWIERTCRRNLR-------VQQNQPEE----------MAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMAR-ELAFRGR
        R+N N++P DP IERT RR+ R       + Q   E+          +  E  +A+R+Y  P +      I    IN NNFE+K   I+M +  + F G 
Subjt:  RDNTNILPLDPWIERTCRRNLR-------VQQNQPEE----------MAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMAR-ELAFRGR

Query:  TNEDPHKYLRSFLEICEI-----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
         ++DP+ +L +FLEIC+                                    ITTWE LAQ FL K+F PAK+ ++R  I +F Q +
Subjt:  TNEDPHKYLRSFLEICEI-----------------------------------ITTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

A0A6P6W382 uncharacterized protein LOC1137297695.3e-1431.25Show/hide
Query:  MPRDNTNILPLDPWIERTCRRNLRVQQNQ------------------PEEMAEEIP--KAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELA
        M R +  + P DP IER  RR  R   +Q                   EE+AE  P  + +R++  P    ++  I    +N NNFE+K  LI+M ++  
Subjt:  MPRDNTNILPLDPWIERTCRRNLRVQQNQ------------------PEEMAEEIP--KAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELA

Query:  FRGRTNEDPHKYLRSFLEICEII-----------------------------------TTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE
        + G   EDP+ +L +FLEIC+ I                                   TTW  LA+AFLNK+F P K+ RLR  I +F Q E
Subjt:  FRGRTNEDPHKYLRSFLEICEII-----------------------------------TTWEILAQAFLNKYFLPAKSQRLRKKIGTFCQLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGTGATAATACTAACATCCTTCCTCTTGATCCCTGGATTGAAAGGACGTGCAGAAGAAACCTAAGGGTTCAACAAAATCAACCCGAGGAGATGGCAGAGGAGAT
ACCAAAGGCAATTCGGGAGTATTTTCAACCGACATTACCAGCAAATCGACCTAGAATATTGAATGTGCCCATCAATGTCAACAACTTTGAGTTAAAACAGGGGTTGATCA
AAATGGCTAGAGAGCTAGCCTTTAGAGGAAGAACCAATGAAGATCCTCACAAGTACTTACGGTCTTTCTTGGAAATATGCGAAATTATCACTACATGGGAGATTTTGGCT
CAAGCTTTCTTGAACAAATATTTTCTACCGGCTAAATCTCAAAGGCTAAGAAAGAAGATTGGAACATTCTGCCAACTTGAGCTAGCAAAGGGACCTTATGAACCGTTGTG
A
mRNA sequenceShow/hide mRNA sequence
ATGCCTCGTGATAATACTAACATCCTTCCTCTTGATCCCTGGATTGAAAGGACGTGCAGAAGAAACCTAAGGGTTCAACAAAATCAACCCGAGGAGATGGCAGAGGAGAT
ACCAAAGGCAATTCGGGAGTATTTTCAACCGACATTACCAGCAAATCGACCTAGAATATTGAATGTGCCCATCAATGTCAACAACTTTGAGTTAAAACAGGGGTTGATCA
AAATGGCTAGAGAGCTAGCCTTTAGAGGAAGAACCAATGAAGATCCTCACAAGTACTTACGGTCTTTCTTGGAAATATGCGAAATTATCACTACATGGGAGATTTTGGCT
CAAGCTTTCTTGAACAAATATTTTCTACCGGCTAAATCTCAAAGGCTAAGAAAGAAGATTGGAACATTCTGCCAACTTGAGCTAGCAAAGGGACCTTATGAACCGTTGTG
A
Protein sequenceShow/hide protein sequence
MPRDNTNILPLDPWIERTCRRNLRVQQNQPEEMAEEIPKAIREYFQPTLPANRPRILNVPINVNNFELKQGLIKMARELAFRGRTNEDPHKYLRSFLEICEIITTWEILA
QAFLNKYFLPAKSQRLRKKIGTFCQLELAKGPYEPL