; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G15010 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G15010
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationClcChr09:20444180..20444896
RNA-Seq ExpressionClc09G15010
SyntenyClc09G15010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5456934.1 hypothetical protein F2P56_021082 [Juglans regia]4.0e-6660.35Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MY DLGNQSQ+F+L LK G++ QG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D  H+ K VED RI+KFLVGLN++F +VRGR +G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +  DKP VWCD+CNKPRH +ETCWK+HGK ANWKSSK  +R+S  + +      S   
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQ++ +L LLKSN SSG PSVSLAQT
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

KAF5480722.1 hypothetical protein F2P56_001446 [Juglans regia]4.9e-6460Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQ F+L LKLG++RQG + VT+YF+SLKR+ Q+LDLF+TYEWKS +D  H+ K VED  I+KFL GLN+EF +VRGR +G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALVI-ENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESR N M+GKK    +VESSALV+ +    KA     +  DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +R+S  + +  +   S   
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALVI-ENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLA
        KEQ++ +L LLKSN SSG PSVSLA
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLA

KAG6468480.1 hypothetical protein ZIOFF_073168 [Zingiber officinale]5.1e-6962.2Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQV +L LKLG++RQG +SVT+YF+SLK +WQ+LDLF TYEWKSTDD  H+ KT+EDG IYKFL GLNVEF +VRGR +G+  LP+I +VF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALVIENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL
        +V REESRR+VM+ K++ N SVE+SAL   N V    N   +  DKP VWCD CNKPRHTRETCWK+HGK ANWK+SKQ E+N  +    AN  D+    
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALVIENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSP------QALSCLNSS
        K+Q+DQ+LKL+KSN SS  PSVSLAQT ++P      +ALSCLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSP------QALSCLNSS

XP_006471430.1 uncharacterized protein LOC102629445 [Citrus sinensis]3.5e-7063.44Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQVF+L L+LG+IRQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS DD  H+ KTVED RIYKFL GLNVEF +VRGR +G+  LP I +VF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKK-AINSVESSALVIENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL
        +V REESRR+VM+ K+ A  S+E+SAL+ +    K +N   ++ DK   WCDHC+KPRHTRE CWKLHGK  NWKSSKQ E+N  +   +AN  +S    
Subjt:  KVHREESRRNVMIGKK-AINSVESSALVIENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQIDQ+LKL+KSN SSG PSVSLA+T
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

XP_028105813.1 uncharacterized protein LOC114304864 [Camellia sinensis]1.3e-6963Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQVF+L LKL +IRQG +SVT+YF+SLKR+WQ+LD+F+TYEWKS DD  H+ KTVED RIYKFL GLN+EF +VRGR +G+  LP+I +VF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALVIENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL
        +V RE+SR +VM+ KK  + S+E S+L+ +    KA+N   +  DKP VWCD CNKPRHTRETCWK+HGK  NWKSSKQ E+N  +    AN  DS    
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALVIENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSS-LL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQIDQ+LKL+KSN SSG PSV LAQT
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

TrEMBL top hitse value%identityAlignment
A0A2N9EE05 Uncharacterized protein9.7e-7462.24Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+EF +VRGR +G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKP HTRETCWK+HGK ANWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS
        KEQ++ +L LLKSN SSG PSVS+AQT N P ALS CLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS

A0A2N9FDL4 Reverse transcriptase Ty1/copia-type domain-containing protein1.4e-6959.91Show/hide
Query:  SDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFLKV
        +DLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VE+ RI+KFL GLN+EF +VRGR +G+  LP+I DVF +V
Subjt:  SDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFLKV

Query:  HREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLLKE
         REESRRNVM+GKK    +VESSALV  +    KA     +T+DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +++   + +      +S  KE
Subjt:  HREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLLKE

Query:  QIDQILKLLKSN-SSGNPSVSLAQTSNSPQAL
        Q++ +L LLKSN SSG PSVS+AQT+N P+ L
Subjt:  QIDQILKLLKSN-SSGNPSVSLAQTSNSPQAL

A0A2N9G4V4 Reverse transcriptase Ty1/copia-type domain-containing protein2.5e-6960.79Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+EF +VRGR +G+   P+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKP HTRETCWK+HGK  NWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQT
        KEQ++ +L LLKSN SSG PSVS+AQT
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQT

A0A2N9GQ49 Uncharacterized protein2.8e-7361.83Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+E  +VRGR +G+  +P I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS
        KEQ++ +L LLKSN SSG PSVS+AQT N P ALS CLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS

A0A2N9I543 Uncharacterized protein1.1e-7462.66Show/hide
Query:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL
        MYSDLGNQSQ+F+L LKLG++RQG +SVT+YF+SLKR+WQ+LDLF+TYEWKS +D +H+ K VED RI+KFL GLN+EF +VRGR +G+  LP+I DVF 
Subjt:  MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFL

Query:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL
        +V REESRRNVM+GKK    +VESSALV  +    KA     +T DKP VWCD+CNKPRHTRETCWK+HGK ANWKSSK  +R+   + +      +S  
Subjt:  KVHREESRRNVMIGKKAIN-SVESSALV-IENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLL

Query:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS
        KEQ++ +L LLKSN SSG PSVS+AQT N P ALS CLNSS
Subjt:  KEQIDQILKLLKSN-SSGNPSVSLAQTSNSPQALS-CLNSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.2e-0530.48Show/hide
Query:  QVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTY-EWKSTDDQKHYLKTVEDGR----IYKFLVG--LNVEFYKVRGRTLGKSTLPNINDVFLKV
        +++ L  +L  +RQGG+SV +YF  L ++W EL  +    E K         K  E+ R     Y+FL+G  LN  F  V  + + +   P++++ F  V
Subjt:  QVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTY-EWKSTDDQKHYLKTVEDGR----IYKFLVG--LNVEFYKVRGRTLGKSTLPNINDVFLKV

Query:  HREES
           ES
Subjt:  HREES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTCTGATCTGGGTAACCAGTCACAAGTGTTCGACCTGAATCTTAAGTTGGGTGATATACGGCAAGGAGGCAACTCAGTTACGCAATATTTTCACTCTCTGAAGAG
GATGTGGCAAGAACTTGATCTGTTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCTGAAAACTGTTGAAGATGGTCGCATTTACAAATTCCTTGTTG
GTCTAAATGTTGAGTTTTATAAGGTTAGAGGCAGGACACTTGGGAAAAGTACTCTTCCCAATATTAACGATGTTTTTTTAAAAGTTCACAGGGAGGAAAGTCGCAGGAAT
GTTATGATTGGAAAGAAAGCAATTAACTCAGTTGAAAGTTCCGCGTTAGTGATTGAAAATACTGTAATGAAAGCTTCCAATCAATCCAATAAAACTCATGACAAGCCTCA
TGTCTGGTGTGATCACTGCAACAAACCCCGTCATACGAGGGAAACTTGTTGGAAACTACACGGCAAATCTGCAAATTGGAAGAGCTCTAAGCAATTTGAGAGAAATTCCC
ATCAGTATGCCTCCAATGCAAATATTGTTGATTCCAGTCTACTCAAAGAGCAAATTGATCAAATCCTGAAGCTGCTAAAATCCAATTCATCGGGTAATCCTAGTGTTTCC
TTAGCACAAACAAGTAATTCCCCTCAAGCCCTCTCGTGTCTAAATTCCTCCCCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATTCTGATCTGGGTAACCAGTCACAAGTGTTCGACCTGAATCTTAAGTTGGGTGATATACGGCAAGGAGGCAACTCAGTTACGCAATATTTTCACTCTCTGAAGAG
GATGTGGCAAGAACTTGATCTGTTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCTGAAAACTGTTGAAGATGGTCGCATTTACAAATTCCTTGTTG
GTCTAAATGTTGAGTTTTATAAGGTTAGAGGCAGGACACTTGGGAAAAGTACTCTTCCCAATATTAACGATGTTTTTTTAAAAGTTCACAGGGAGGAAAGTCGCAGGAAT
GTTATGATTGGAAAGAAAGCAATTAACTCAGTTGAAAGTTCCGCGTTAGTGATTGAAAATACTGTAATGAAAGCTTCCAATCAATCCAATAAAACTCATGACAAGCCTCA
TGTCTGGTGTGATCACTGCAACAAACCCCGTCATACGAGGGAAACTTGTTGGAAACTACACGGCAAATCTGCAAATTGGAAGAGCTCTAAGCAATTTGAGAGAAATTCCC
ATCAGTATGCCTCCAATGCAAATATTGTTGATTCCAGTCTACTCAAAGAGCAAATTGATCAAATCCTGAAGCTGCTAAAATCCAATTCATCGGGTAATCCTAGTGTTTCC
TTAGCACAAACAAGTAATTCCCCTCAAGCCCTCTCGTGTCTAAATTCCTCCCCGTAG
Protein sequenceShow/hide protein sequence
MYSDLGNQSQVFDLNLKLGDIRQGGNSVTQYFHSLKRMWQELDLFDTYEWKSTDDQKHYLKTVEDGRIYKFLVGLNVEFYKVRGRTLGKSTLPNINDVFLKVHREESRRN
VMIGKKAINSVESSALVIENTVMKASNQSNKTHDKPHVWCDHCNKPRHTRETCWKLHGKSANWKSSKQFERNSHQYASNANIVDSSLLKEQIDQILKLLKSNSSGNPSVS
LAQTSNSPQALSCLNSSP