CuGenDBv2

ClCG06G002145 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG06G002145
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotran_gag_3 domain-containing protein
Genome location: CG_Chr06:2351971..2355740
RNA-Seq Expression: ClCG06G002145
Synteny: ClCG06G002145
Gene Ontology terms: GO:0090304 - nucleic acid metabolic process (biological process)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAF5480722.1 hypothetical protein F2P56_001446 [Juglans regia]
e-value: 7.1e-47; identity: 41.78%
Query:  FLVLPLSGSFLETKVSATKVFDNRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDI
        F V P S S   +K + T       +S + +VQITTIRLNGDNFLRWSQSVRMYI  +GK+G+LT EK AP+ DDP +  WD +NSMVM WLVNSM EDI
Subjt:  FLVLPLSGSFLETKVSATKVFDNRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDI

Query:  SSNYMCYITTKKLWDSVTQMYSDL----------------------------------------------------------------------------
        SSNYMCY T ++LW++V QMYSDL                                                                            
Subjt:  SSNYMCYITTKKLWDSVTQMYSDL----------------------------------------------------------------------------

Query:  --------------------VRREESRRNVMIGKKAVD-SVESSALVI-ENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA
                            VRREESR N M+GKK    +VESSALV+ +  + KA     +  DKPRVWCD+CNKP HTRETCWK+H KPA
Subjt:  --------------------VRREESRRNVMIGKKAVD-SVESSALVI-ENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA

RVW16202.1 hypothetical protein CK203_074282 [Vitis vinifera]
e-value: 3.0e-45; identity: 42.39%
Query:  ATKVFDNRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDS
        ++K+      SH  +VQITTIRLNGDNFLRWSQSVRMYI G+GK+G+LT EK AP+ DDP + +WD +NSMVM WLVNSM EDISSNYMCY TT++LW++
Subjt:  ATKVFDNRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDS

Query:  VTQMYSDL--------------------------------------------------------------------------------------------
        V QMYSDL                                                                                            
Subjt:  VTQMYSDL--------------------------------------------------------------------------------------------

Query:  ----VRREESRRNVMIGKKAVD-SVESSALVIENTAM-KAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA
            VRREESRRNVM+GKK    ++E SALV       KA     K+ ++PRVWCD CNKP HTRE CWK+H KPA
Subjt:  ----VRREESRRNVMIGKKAVD-SVESSALVIENTAM-KAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]
e-value: 1.2e-54; identity: 55.19%
Query:  LVLIQETKKEAFKVEAIKKLWSSKDIGWSFVEAYGRSGGLLLIMWDESKISVIETLKGGYTLSVKCKTLCKKVCWVTNVYGPTDYKERKHIWPELQALAA
        LV+    + +   +  IK LWSSKDIGW  VE++GR GG +L MWD SKI V+ETLKGGY+LS+   T CKK CW+TNVYGP DY+ER+ +W  L +L+ 
Subjt:  LVLIQETKKEAFKVEAIKKLWSSKDIGWSFVEAYGRSGGLLLIMWDESKISVIETLKGGYTLSVKCKTLCKKVCWVTNVYGPTDYKERKHIWPELQALAA

Query:  YCTNAWCLGGDFNITRAIHERVPTGRLTRGMKKFNKFIEKAHLMEIPLSNGRFTWSREGIRISRTLLDRFLVTNEWDEAFEGT
        YCT AWC+GG  NITR  HE  P  + TRGM++FN  I+  ++ E+PL NGR TWSREG  ISR+LLD F +  EWDE  E +
Subjt:  YCTNAWCLGGDFNITRAIHERVPTGRLTRGMKKFNKFIEKAHLMEIPLSNGRFTWSREGIRISRTLLDRFLVTNEWDEAFEGT

XP_006471430.1 uncharacterized protein LOC102629445 [Citrus sinensis]
e-value: 1.9e-47; identity: 42.7%
Query:  RIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYSD
        R HS++ +VQITTIRLNGDNFLRWSQSVRMYI GQGKIG++T +K  P+ DDPL+  WD +NSMVM WLVNSM EDISSNYMCY T K+LWD+V+QMYSD
Subjt:  RIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYSD

Query:  L------------------------------------------------------------------------------------------------VRR
        L                                                                                                VRR
Subjt:  L------------------------------------------------------------------------------------------------VRR

Query:  EESRRNVMIGKK-AVDSVESSALVIENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKP
        EESRR+VM+ K+ A +S+E+SAL+ +  A K  +   ++ DK + WCDHC+KP HTRE CWKLH KP
Subjt:  EESRRNVMIGKK-AVDSVESSALVIENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKP

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]
e-value: 1.9e-55; identity: 55.91%
Query:  LGDYSKPLAVKHLNMKINPELVLIQETKKEAFKVEAIKKLWSSKDIGWSFVEAYGRSGGLLLIMWDESKISVIETLKGGYTLSVKCKTLCKKVCWVTNVY
        LGD SK L +K    K+NP++VLIQETKK+  +   IK LWSSK++G +FVEA G+SGGLL + WD+SKI V    K  ++LS+KC+T+ KK+CW+TNVY
Subjt:  LGDYSKPLAVKHLNMKINPELVLIQETKKEAFKVEAIKKLWSSKDIGWSFVEAYGRSGGLLLIMWDESKISVIETLKGGYTLSVKCKTLCKKVCWVTNVY

Query:  GPTDYKERKHIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRLTRGMKKFNKFIEKAHLMEIPLSNGRFTWSREGIRISRTL
        GP DY+ER+ +W EL +LA    + WC+GGDFN  R  HER P G+ TR M  FNKFI   +L+EIPLSNG+FTWS+EG  +S++L
Subjt:  GPTDYKERKHIWPELQALAAYCTNAWCLGGDFNITRAIHERVPTGRLTRGMKKFNKFIEKAHLMEIPLSNGRFTWSREGIRISRTL

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9EE05 Uncharacterized protein
e-value: 7.0e-48; identity: 44.07%
Query:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS
        N  +S + +VQITTIRLNGDNFLRWSQSVRMYI G+GK+G+LT EK AP+  DP +  WD +NSMVM WLVNSM EDISSNYMCY T ++LW++V QMYS
Subjt:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS

Query:  DL------------------------------------------------------------------------------------------------VR
        DL                                                                                                VR
Subjt:  DL------------------------------------------------------------------------------------------------VR

Query:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA
        REESRRNVM+GKK    +VESSALV  +  + KA     +T DKPRVWCD+CNKP HTRETCWK+H KPA
Subjt:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA

A0A2N9GKJ5 Uncharacterized protein
e-value: 3.5e-47; identity: 43.7%
Query:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS
        N  +S + +VQITTIRLN DNFLRWSQSVRMYI G+GK+G+LT EK AP+  DP +  WD +NSMVM WLVNSM EDISSNYMCY T ++LW++V QMYS
Subjt:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS

Query:  DL------------------------------------------------------------------------------------------------VR
        DL                                                                                                VR
Subjt:  DL------------------------------------------------------------------------------------------------VR

Query:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA
        REESRRNVM+GKK    +VESSALV  +  + KA     +T DKPRVWCD+CNKP HTRETCWK+H KPA
Subjt:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA

A0A2N9GQ49 Uncharacterized protein
e-value: 9.1e-48; identity: 43.7%
Query:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS
        N  +S + +VQITTIRLNGDNFLRWSQSVRMYI G+GK+G+LT EK AP+  DP +  WD +NSMVM WLVNSM EDISSNYMCY T ++LW++V QMYS
Subjt:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS

Query:  DL------------------------------------------------------------------------------------------------VR
        DL                                                                                                VR
Subjt:  DL------------------------------------------------------------------------------------------------VR

Query:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA
        REESRRNVM+GKK    +VESSALV  +  + KA     +T DKP+VWCD+CNKP HTRETCWK+H KPA
Subjt:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA

A0A2N9I543 Uncharacterized protein
e-value: 3.1e-48; identity: 44.07%
Query:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS
        N  +S + +VQITTIRLNGDNFLRWSQSVRMYI G+GK+G+LT EK AP+  DP +  WD +NSMVM WLVNSM EDISSNYMCY T ++LW++V QMYS
Subjt:  NRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYS

Query:  DL------------------------------------------------------------------------------------------------VR
        DL                                                                                                VR
Subjt:  DL------------------------------------------------------------------------------------------------VR

Query:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA
        REESRRNVM+GKK    +VESSALV  +  + KA     +T DKPRVWCD+CNKP HTRETCWK+H KPA
Subjt:  REESRRNVMIGKKAVD-SVESSALV-IENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPA

A0A5D3BHE3 Uncharacterized protein
e-value: 5.9e-55; identity: 55.19%
Query:  LVLIQETKKEAFKVEAIKKLWSSKDIGWSFVEAYGRSGGLLLIMWDESKISVIETLKGGYTLSVKCKTLCKKVCWVTNVYGPTDYKERKHIWPELQALAA
        LV+    + +   +  IK LWSSKDIGW  VE++GR GG +L MWD SKI V+ETLKGGY+LS+   T CKK CW+TNVYGP DY+ER+ +W  L +L+ 
Subjt:  LVLIQETKKEAFKVEAIKKLWSSKDIGWSFVEAYGRSGGLLLIMWDESKISVIETLKGGYTLSVKCKTLCKKVCWVTNVYGPTDYKERKHIWPELQALAA

Query:  YCTNAWCLGGDFNITRAIHERVPTGRLTRGMKKFNKFIEKAHLMEIPLSNGRFTWSREGIRISRTLLDRFLVTNEWDEAFEGT
        YCT AWC+GG  NITR  HE  P  + TRGM++FN  I+  ++ E+PL NGR TWSREG  ISR+LLD F +  EWDE  E +
Subjt:  YCTNAWCLGGDFNITRAIHERVPTGRLTRGMKKFNKFIEKAHLMEIPLSNGRFTWSREGIRISRTLLDRFLVTNEWDEAFEGT

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
e-value: 1.8e-08; identity: 27.5%
Query:  HTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYSDLV--
        H     I  +  + DN++ W    R ++    K G +      P P  PL+  W+  N+MVM WL+NSM + +  + M   T  K+W+ + +++   V  
Subjt:  HTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDISSNYMCYITTKKLWDSVTQMYSDLV--

Query:  RREESRRNVMIGKKAVDSVE
        +  + RR +   ++  DSVE
Subjt:  RREESRRNVMIGKKAVDSVE


Sequences
CDS sequence
ATGTTTTTCCTGTCTTATTGGCATTTCTCAGAAGATAATGATGATCTTGGAGTTACTCGTTTATCAAAATCCGAATCTTCCTCTAGTTGGAAGATTGAATATGTGGTTTG
GCCACCCACCGAGGACATCTTCGCCGGACGTCTCACCTCAGTCACCGGACAACGCCCAGCCTCCCAGTTGACGCCAGACCGCTGGTCGTTCGTGGATCCTGTGAGTGCAG
AAGAAAGTCGCAAATCGCGTGCCCTAACCAACGCCGATGCCGACGCTGTTGTGACTTGGGTTCACTCCTTCTCCGTCGGCCGACCAAGTCTACGCCATTGTCCGAGCGAT
TTCTGGCCAGTTCCGACGATTCTCCGATGGTTCCTTGTTTTGCCGCTGAGTGGAAGTTTTTTGGAGACTAAGGTATCTGCCACCAAAGTCTTCGACAATCGGATCCATTC
CCACACTCTCACTGTCCAAATCACCACCATTCGACTTAATGGGGATAACTTTCTTCGTTGGTCCCAGAGTGTTCGGATGTATATTTGTGGCCAAGGGAAGATAGGGCATC
TCACCAGAGAAAAAATCGCTCCAAGTCCAGATGACCCTTTATTTGTTGTGTGGGACGTGAAAAACTCCATGGTTATGATATGGCTCGTCAACTCTATGGTGGAAGACATC
AGTAGTAACTACATGTGCTACATTACGACCAAGAAATTATGGGACAGTGTGACTCAAATGTATTCTGATTTGGTTCGCAGAGAGGAAAGTCGCAGGAATGTTATGATTGG
AAAGAAGGCAGTTGACTCAGTTGAAAGTTCCGCATTAGTGATTGAAAATACTGCAATGAAAGCTTTTGATCAATCCAACAAAACTCATGACAAGCCTCGTGTCTGGTGTG
ATCACTGCAACAAACCCCATCATACGAGAGAAACTTGTTGGAAACTACATGCCAAACCTGCAAAATTGGAAGAGCTCCCATCAGCATGCCTCCAATGCCTTGGAGACTAT
TCGAAACCGCTAGCAGTTAAGCACCTTAATATGAAGATAAATCCAGAATTGGTTTTAATTCAAGAAACAAAGAAAGAGGCATTTAAAGTCGAAGCAATCAAGAAACTTTG
GAGTTCAAAAGACATCGGTTGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTATTGTTGATCATGTGGGATGAAAGTAAAATATCAGTCATCGAAACACTCAAAG
GAGGCTACACTCTTTCCGTTAAATGTAAGACCTTATGCAAAAAAGTTTGTTGGGTAACAAATGTATACGGACCAACCGATTATAAAGAAAGAAAACACATCTGGCCGGAG
CTACAAGCTTTGGCAGCTTATTGCACAAATGCCTGGTGCCTGGGTGGGGACTTCAACATCACTAGAGCAATCCATGAAAGAGTTCCAACTGGAAGATTAACTAGAGGAAT
GAAGAAATTCAACAAATTCATAGAAAAGGCACACTTAATGGAAATCCCTTTGAGCAATGGGCGGTTCACATGGTCAAGAGAAGGAATCAGAATATCAAGAACCTTGTTAG
ACAGATTTCTAGTGACAAACGAATGGGATGAAGCTTTTGAAGGCACTTGA
mRNA sequence
ATGTTTTTCCTGTCTTATTGGCATTTCTCAGAAGATAATGATGATCTTGGAGTTACTCGTTTATCAAAATCCGAATCTTCCTCTAGTTGGAAGATTGAATATGTGGTTTG
GCCACCCACCGAGGACATCTTCGCCGGACGTCTCACCTCAGTCACCGGACAACGCCCAGCCTCCCAGTTGACGCCAGACCGCTGGTCGTTCGTGGATCCTGTGAGTGCAG
AAGAAAGTCGCAAATCGCGTGCCCTAACCAACGCCGATGCCGACGCTGTTGTGACTTGGGTTCACTCCTTCTCCGTCGGCCGACCAAGTCTACGCCATTGTCCGAGCGAT
TTCTGGCCAGTTCCGACGATTCTCCGATGGTTCCTTGTTTTGCCGCTGAGTGGAAGTTTTTTGGAGACTAAGGTATCTGCCACCAAAGTCTTCGACAATCGGATCCATTC
CCACACTCTCACTGTCCAAATCACCACCATTCGACTTAATGGGGATAACTTTCTTCGTTGGTCCCAGAGTGTTCGGATGTATATTTGTGGCCAAGGGAAGATAGGGCATC
TCACCAGAGAAAAAATCGCTCCAAGTCCAGATGACCCTTTATTTGTTGTGTGGGACGTGAAAAACTCCATGGTTATGATATGGCTCGTCAACTCTATGGTGGAAGACATC
AGTAGTAACTACATGTGCTACATTACGACCAAGAAATTATGGGACAGTGTGACTCAAATGTATTCTGATTTGGTTCGCAGAGAGGAAAGTCGCAGGAATGTTATGATTGG
AAAGAAGGCAGTTGACTCAGTTGAAAGTTCCGCATTAGTGATTGAAAATACTGCAATGAAAGCTTTTGATCAATCCAACAAAACTCATGACAAGCCTCGTGTCTGGTGTG
ATCACTGCAACAAACCCCATCATACGAGAGAAACTTGTTGGAAACTACATGCCAAACCTGCAAAATTGGAAGAGCTCCCATCAGCATGCCTCCAATGCCTTGGAGACTAT
TCGAAACCGCTAGCAGTTAAGCACCTTAATATGAAGATAAATCCAGAATTGGTTTTAATTCAAGAAACAAAGAAAGAGGCATTTAAAGTCGAAGCAATCAAGAAACTTTG
GAGTTCAAAAGACATCGGTTGGTCATTTGTGGAAGCCTATGGCAGATCAGGAGGGTTATTGTTGATCATGTGGGATGAAAGTAAAATATCAGTCATCGAAACACTCAAAG
GAGGCTACACTCTTTCCGTTAAATGTAAGACCTTATGCAAAAAAGTTTGTTGGGTAACAAATGTATACGGACCAACCGATTATAAAGAAAGAAAACACATCTGGCCGGAG
CTACAAGCTTTGGCAGCTTATTGCACAAATGCCTGGTGCCTGGGTGGGGACTTCAACATCACTAGAGCAATCCATGAAAGAGTTCCAACTGGAAGATTAACTAGAGGAAT
GAAGAAATTCAACAAATTCATAGAAAAGGCACACTTAATGGAAATCCCTTTGAGCAATGGGCGGTTCACATGGTCAAGAGAAGGAATCAGAATATCAAGAACCTTGTTAG
ACAGATTTCTAGTGACAAACGAATGGGATGAAGCTTTTGAAGGCACTTGA
Protein sequence
MFFLSYWHFSEDNDDLGVTRLSKSESSSSWKIEYVVWPPTEDIFAGRLTSVTGQRPASQLTPDRWSFVDPVSAEESRKSRALTNADADAVVTWVHSFSVGRPSLRHCPSD
FWPVPTILRWFLVLPLSGSFLETKVSATKVFDNRIHSHTLTVQITTIRLNGDNFLRWSQSVRMYICGQGKIGHLTREKIAPSPDDPLFVVWDVKNSMVMIWLVNSMVEDI
SSNYMCYITTKKLWDSVTQMYSDLVRREESRRNVMIGKKAVDSVESSALVIENTAMKAFDQSNKTHDKPRVWCDHCNKPHHTRETCWKLHAKPAKLEELPSACLQCLGDY
SKPLAVKHLNMKINPELVLIQETKKEAFKVEAIKKLWSSKDIGWSFVEAYGRSGGLLLIMWDESKISVIETLKGGYTLSVKCKTLCKKVCWVTNVYGPTDYKERKHIWPE
LQALAAYCTNAWCLGGDFNITRAIHERVPTGRLTRGMKKFNKFIEKAHLMEIPLSNGRFTWSREGIRISRTLLDRFLVTNEWDEAFEGT