CuGenDBv2

ClCG01G012020 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G012020
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotrans_gag domain-containing protein
Genome location: CG_Chr01:21677185..21678296
RNA-Seq Expression: ClCG01G012020
Synteny: ClCG01G012020
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022156327.1 uncharacterized protein LOC111023248 [Momordica charantia] (e value: 1.1e-08; % identity: 46.51)
Query:  MNLRAFPFLLKDDAKD-----------------------WFISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQ
        +NLRAF F LKD AKD                       +F +  A +IRK+IY ITQI+ +TL E+WE FK LCAS+P HQI DQ
Subjt:  MNLRAFPFLLKDDAKD-----------------------WFISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQ

XP_027368228.1 uncharacterized protein LOC113874203 [Abrus precatorius] (e value: 4.1e-08; % identity: 44.83)
Query:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQM
        + LRAFPF L   AKDW                       F++  A SIRK+I  I QI  +TLDE+WE FK LCAS P HQI DQ+
Subjt:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQM

XP_031278099.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116136556 [Pistacia vera] (e value: 6.3e-09; % identity: 35.95)
Query:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS
        + LRAFPFLL+DDAKDW                       F    A +IRK+I  I Q   +TL E+WE FK LCAS P HQI DQ+      E L  M 
Subjt:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS

Query:  RRQLKRLS---LLRHKPK-SPLLRCTMLGINQDAIGGHDKEPSRIYHVDQAEL
        R  +   S   L+   PK +  L   M   +Q     HD  P R+  V  + +
Subjt:  RRQLKRLS---LLRHKPK-SPLLRCTMLGINQDAIGGHDKEPSRIYHVDQAEL

XP_038876529.1 uncharacterized protein LOC120068960 [Benincasa hispida] (e value: 4.4e-10; % identity: 43.68)
Query:  MNLRAFPFLLKDDAKDWFI-----------------------SFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQM
        +NLRAFPF L+DD KDW                         +F A SIRK IY I Q   ++L E+WEC+K+LCA+ P HQI DQ+
Subjt:  MNLRAFPFLLKDDAKDWFI-----------------------SFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQM

XP_041011356.1 uncharacterized protein LOC121255143 [Juglans microcarpa x Juglans regia] (e value: 3.1e-08; % identity: 29.95)
Query:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS
        + LRAFPF LKD AKDW                       F +  A +IRK+I  I Q   ++L E+WECFK LCASYP HQI +Q+      E L    
Subjt:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS

Query:  RRQLKRL---SLLRHKPKSPLLRCTMLGINQDAIGGHDKEPSRIYHVDQAELGAYRKLPSQPETNVRNI---NAISAMSCMESSQTSVPKPVI-----EP
        R  +      SL+   P+        +  N    G     PS+  HV++  + +  +  +   + VR +   N  +  +C   S    P  +      +P
Subjt:  RRQLKRL---SLLRHKPKSPLLRCTMLGINQDAIGGHDKEPSRIYHVDQAELGAYRKLPSQPETNVRNI---NAISAMSCMESSQTSVPKPVI-----EP

Query:  VKVVNAK
        +K VNAK
Subjt:  VKVVNAK

TrEMBL top hits (e value, % identity, alignment)
A0A1U7ZEK1 uncharacterized protein LOC104589313 (e value: 9.8e-08; % identity: 43.68)
Query:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQM
        + LRAFPF L D AKDW                       F +  A +IRK+I  I Q   +TL E+WE FK LCASYP HQI DQ+
Subjt:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQM

A0A2I4EMQ0 LOW QUALITY PROTEIN: uncharacterized protein LOC108990986 (e value: 1.7e-07; % identity: 30.26)
Query:  MNLRAFPFLLKDDAKDWFI-----------------------SFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS
        + LRAFPF LKD AKDWF                        +  A +IRK+I +I Q   ++L E+WE FK  CAS P HQI +Q+      E L    
Subjt:  MNLRAFPFLLKDDAKDWFI-----------------------SFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS

Query:  RRQLKRLS---LLRHKPKSPLLRCTMLGINQDAIGGHDKEPSR--------------------IYHVDQAELGAYRKLPSQPETNVR-NINAISAMSCME
        R  +   S   L+   P++       + +N    G     PS+                       + Q E  + RKLPSQ   N R N +AI   S   
Subjt:  RRQLKRLS---LLRHKPKSPLLRCTMLGINQDAIGGHDKEPSR--------------------IYHVDQAELGAYRKLPSQPETNVR-NINAISAMSCME

Query:  SSQTSVPKPVIEPVKVVNAKDSEEKMAN
               K V  PVKV  A   +EK  N
Subjt:  SSQTSVPKPVIEPVKVVNAKDSEEKMAN

A0A6J1DRS5 uncharacterized protein LOC111023248 (e value: 5.2e-09; % identity: 46.51)
Query:  MNLRAFPFLLKDDAKD-----------------------WFISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQ
        +NLRAF F LKD AKD                       +F +  A +IRK+IY ITQI+ +TL E+WE FK LCAS+P HQI DQ
Subjt:  MNLRAFPFLLKDDAKD-----------------------WFISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQ

A0A6P6T172 uncharacterized protein LOC113696454 (e value: 5.8e-08; % identity: 44.19)
Query:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQ
        + LRAFPF L D AKDW                       F +  A +IRK+I  + Q   KTL E+WECFK LCAS P HQI DQ
Subjt:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQ

A0A6P6X8T1 Reverse transcriptase (e value: 2.2e-07; % identity: 33.33)
Query:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS
        + LRAFPF L D AKDW                       F +  A SIRK I  I Q   +TL E+WE FK LCAS P HQI DQ+      E L +  
Subjt:  MNLRAFPFLLKDDAKDW-----------------------FISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCE-LPEMS

Query:  RRQLKRL---SLLRHKPKSPLLRCTMLGINQDAIGG-HDKEPSRIYHVDQAEL
        RR +      SL+   P       + +  N    G  HD    R+  V  + +
Subjt:  RRQLKRL---SLLRHKPKSPLLRCTMLGINQDAIGG-HDKEPSRIYHVDQAEL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAACCTCAGAGCTTTTCCGTTCTTACTGAAGGATGATGCCAAGGATTGGTTTATTTCCTTCCTGGCGGTTTCAATTAGGAAGAAAATATACAACATCACTCAGATCGC
CAGTAAGACCCTAGATGAATTTTGGGAGTGCTTTAAGCATTTGTGCGCCAGCTACCCCGACCACCAAATCTTAGATCAAATGCCCAGCAGTTCGAGATGTGAGCTCCCAG
AAATGTCACGTCGACAACTTAAGAGGCTTTCTTTGCTAAGGCACAAACCTAAGAGCCCGCTGCTGAGGTGCACTATGCTGGGAATCAACCAGGATGCCATTGGAGGACAT
GATAAAGAGCCTAGTAGAATCTACCATGTTGATCAAGCAGAGCTAGGAGCTTATCGGAAGCTTCCTTCCCAACCTGAGACCAATGTGCGTAACATCAACGCCATATCTGC
TATGAGTTGCATGGAGAGCTCTCAAACGTCTGTTCCTAAACCAGTGATTGAGCCTGTTAAAGTTGTGAATGCTAAGGACTCTGAAGAAAAGATGGCCAATTCACCAAGAA
GACAATGGGACATGGTAGATATGAAGAAAAGTTAA
mRNA sequence
ATGAACCTCAGAGCTTTTCCGTTCTTACTGAAGGATGATGCCAAGGATTGGTTTATTTCCTTCCTGGCGGTTTCAATTAGGAAGAAAATATACAACATCACTCAGATCGC
CAGTAAGACCCTAGATGAATTTTGGGAGTGCTTTAAGCATTTGTGCGCCAGCTACCCCGACCACCAAATCTTAGATCAAATGCCCAGCAGTTCGAGATGTGAGCTCCCAG
AAATGTCACGTCGACAACTTAAGAGGCTTTCTTTGCTAAGGCACAAACCTAAGAGCCCGCTGCTGAGGTGCACTATGCTGGGAATCAACCAGGATGCCATTGGAGGACAT
GATAAAGAGCCTAGTAGAATCTACCATGTTGATCAAGCAGAGCTAGGAGCTTATCGGAAGCTTCCTTCCCAACCTGAGACCAATGTGCGTAACATCAACGCCATATCTGC
TATGAGTTGCATGGAGAGCTCTCAAACGTCTGTTCCTAAACCAGTGATTGAGCCTGTTAAAGTTGTGAATGCTAAGGACTCTGAAGAAAAGATGGCCAATTCACCAAGAA
GACAATGGGACATGGTAGATATGAAGAAAAGTTAA
Protein sequence
MNLRAFPFLLKDDAKDWFISFLAVSIRKKIYNITQIASKTLDEFWECFKHLCASYPDHQILDQMPSSSRCELPEMSRRQLKRLSLLRHKPKSPLLRCTMLGINQDAIGGH
DKEPSRIYHVDQAELGAYRKLPSQPETNVRNINAISAMSCMESSQTSVPKPVIEPVKVVNAKDSEEKMANSPRRQWDMVDMKKS
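
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. The helper name `translate` and the variable `cds_start` are hypothetical and not part of CuGenDB. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence pasted from
    the record as several lines can be passed in directly.
    Codons containing unknown characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed for ClCG01G012020.
cds_start = "ATGAACCTCAGAGCTTTTCCGTTCTTACTG"
print(translate(cds_start))  # MNLRAFPFLL
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two entries agree.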