; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G013745 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G013745
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationCG_Chr09:21334419..21335135
RNA-Seq ExpressionClCG09G013745
SyntenyClCG09G013745
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5456934.1 hypothetical protein F2P56_021082 [Juglans regia]1.1e-6660.79Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MY DLGNQSQ+F+L LK G++ QG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D  H+ K VED RI+KFLVGLN++F +VRGRI+G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +  DKP VWCD+CNKPRH +ETCWK+HGK ANWKSSK  +R+S  + +      S   
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQ++ +L LLKSN SSG PSVSLAQT
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

KAF5480722.1 hypothetical protein F2P56_001446 [Juglans regia]1.3e-6460.44Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQ F+L LKLG++RQG + VT+YF+SLKR+ Q+LDLF+TYEWKS +D  H+ K VED  I+KFL GLN+EF +VRGRI+G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALVI-ENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESR N M+GKK    +VESSALV+ +    KA     +  DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +R+S  + +  +   S   
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALVI-ENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLA
        KEQ++ +L LLKSN SSG PSVSLA
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLA

KAG6468480.1 hypothetical protein ZIOFF_073168 [Zingiber officinale]1.3e-6961.79Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQV +L LKLG++RQG +SVT+YF+SLK +WQ+LDLF TYEWKSTDD  H+ KT+EDG IYKFL GLNVEF +VRGRI+G+  LP+I +VF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALVIENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL
        +V REESRR+VM+ K++ N SVE+SAL          N   +  DKP VWCD CNKPRHTRETCWK+HGK ANWK+SKQ E+N  +    AN  D+    
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALVIENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSP------QALSCLNSS
        K+Q+DQ+LKL+KSN SS  PSVSLAQT ++P      +ALSCLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSP------QALSCLNSS

XP_006471430.1 uncharacterized protein LOC102629445 [Citrus sinensis]1.2e-7063.88Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQVF+L L+LG+IRQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS DD  H+ KTVED RIYKFL GLNVEF +VRGRI+G+  LP I +VF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKK-AINSVESSALVIENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL
        +V REESRR+VM+ K+ A  S+E+SAL+ +    K +N   ++ DK   WCDHC+KPRHTRE CWKLHGK  NWKSSKQ E+N  +   +AN  +S    
Subjt:  KVHREESRRNVMIGKK-AINSVESSALVIENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQIDQ+LKL+KSN SSG PSVSLA+T
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

XP_028105813.1 uncharacterized protein LOC114304864 [Camellia sinensis]4.6e-7063.44Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQVF+L LKL +IRQG +SVT+YF+SLKR+WQ+LD+F+TYEWKS DD  H+ KTVED RIYKFL GLN+EF +VRGRI+G+  LP+I +VF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALVIENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL
        +V RE+SR +VM+ KK  + S+E S+L+ +    KA+N   +  DKP VWCD CNKPRHTRETCWK+HGK  NWKSSKQ E+N  +    AN  DS    
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALVIENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQIDQ+LKL+KSN SSG PSV LAQT
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

TrEMBL top hitse value%identityAlignment
A0A2N9EE05 Uncharacterized protein4.3e-7462.24Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+EF +VRGR++G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKP HTRETCWK+HGK ANWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS
        KEQ++ +L LLKSN SSG PSVS+AQT N P ALS CLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS

A0A2N9FDL4 Reverse transcriptase Ty1/copia-type domain-containing protein6.5e-7059.91Show/hide
Query:  SDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFLKV
        +DLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VE+ RI+KFL GLN+EF +VRGR++G+  LP+I DVF +V
Subjt:  SDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFLKV

Query:  HREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLLKE
         REESRRNVM+GKK    +VESSALV  +    KA     +T+DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +++   + +      +S  KE
Subjt:  HREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLLKE

Query:  QIDQILKLLKSN-SSGNPSVSLAQTSNSPQAL
        Q++ +L LLKSN SSG PSVS+AQT+N P+ L
Subjt:  QIDQILKLLKSN-SSGNPSVSLAQTSNSPQAL

A0A2N9G4V4 Reverse transcriptase Ty1/copia-type domain-containing protein1.1e-6960.79Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+EF +VRGR++G+   P+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKP HTRETCWK+HGK  NWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQ++ +L LLKSN SSG PSVS+AQT
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

A0A2N9GQ49 Uncharacterized protein1.3e-7361.83Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+E  +VRGR++G+  +P I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS
        KEQ++ +L LLKSN SSG PSVS+AQT N P ALS CLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS

A0A2N9I543 Uncharacterized protein5.1e-7562.66Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+EF +VRGR++G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS
        KEQ++ +L LLKSN SSG PSVS+AQT N P ALS CLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.5e-0631.43Show/hide
Query:  QVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTY-EWKSTDDQKHYLKTVEDGR----IYKFLVG--LNVEFYKVRGRILGKSTLPNINDVFLKV
        +++ L  +L  +RQGG+SV +YF  L ++W EL  +    E K         K  E+ R     Y+FL+G  LN  F  V  +I+ +   P++++ F  V
Subjt:  QVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTY-EWKSTDDQKHYLKTVEDGR----IYKFLVG--LNVEFYKVRGRILGKSTLPNINDVFLKV

Query:  HREES
           ES
Subjt:  HREES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTCTGATCTGGGTAACCAGTCACAAGTGTTCGACCTGAATCTTAAGTTGGGTGATATACGGCAAGGAGGCAACTCAGTTACGCAATATTTTCACTCTCTGAAGAG
GATGTGGCAAGAACTTGATCTGTTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCTGAAAACTGTTGAAGATGGTCGCATTTACAAATTCCTTGTTG
GTCTAAATGTTGAGTTTTATAAGGTTAGAGGCAGGATACTTGGGAAAAGTACTCTTCCCAATATTAACGATGTCTTTTTAAAAGTTCACAGGGAGGAAAGTCGCAGGAAT
GTTATGATTGGAAAGAAAGCAATTAACTCAGTTGAAAGTTCCGCGTTAGTGATTGAAAATACTATAATGAAAGCTTCCAATCAATCCAATAAAACTCATGACAAGCCTCA
TGTCTGGTGTGATCACTGCAACAAACCCCGTCATACGAGGGAAACTTGTTGGAAACTACACGGCAAATCTGCAAATTGGAAGAGCTCTAAGCAATTTGAGAGAAATTCCC
ATCAGTATGCCTCCAATGCAAATATTGTTGATTCCAGTCTACTCAAAGAGCAAATTGATCAAATCCTGAAGCTGCTAAAATCCAATTCATCGGGTAATCCTAGTGTTTCC
TTAGCACAAACAAGTAATTCCCCTCAAGCCCTCTCGTGTCTAAATTCCTCCCCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATTCTGATCTGGGTAACCAGTCACAAGTGTTCGACCTGAATCTTAAGTTGGGTGATATACGGCAAGGAGGCAACTCAGTTACGCAATATTTTCACTCTCTGAAGAG
GATGTGGCAAGAACTTGATCTGTTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCTGAAAACTGTTGAAGATGGTCGCATTTACAAATTCCTTGTTG
GTCTAAATGTTGAGTTTTATAAGGTTAGAGGCAGGATACTTGGGAAAAGTACTCTTCCCAATATTAACGATGTCTTTTTAAAAGTTCACAGGGAGGAAAGTCGCAGGAAT
GTTATGATTGGAAAGAAAGCAATTAACTCAGTTGAAAGTTCCGCGTTAGTGATTGAAAATACTATAATGAAAGCTTCCAATCAATCCAATAAAACTCATGACAAGCCTCA
TGTCTGGTGTGATCACTGCAACAAACCCCGTCATACGAGGGAAACTTGTTGGAAACTACACGGCAAATCTGCAAATTGGAAGAGCTCTAAGCAATTTGAGAGAAATTCCC
ATCAGTATGCCTCCAATGCAAATATTGTTGATTCCAGTCTACTCAAAGAGCAAATTGATCAAATCCTGAAGCTGCTAAAATCCAATTCATCGGGTAATCCTAGTGTTTCC
TTAGCACAAACAAGTAATTCCCCTCAAGCCCTCTCGTGTCTAAATTCCTCCCCGTAG
Protein sequenceShow/hide protein sequence
MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRILGKSTLPNINDVFLKVHREESRRN
VMIGKKAINSVESSALVIENTIMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLLKEQIDQILKLLKSNSSGNPSVS
LAQTSNSPQALSCLNSSP