; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G008570 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G008570
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationCG_Chr02:11363536..11364274
RNA-Seq ExpressionClCG02G008570
SyntenyClCG02G008570
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]5.2e-4959.34Show/hide
Query:  QDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKD
        Q SPS  S  +    NPL+E+WIAKDQ  MT+INATLS +ALAY+VG TS+ +   +L K YSS  R+N+VNLKS LQT+ KKP E+ID+Y+KRIKEIKD
Subjt:  QDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKD

Query:  KLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC
        KL+N+S+ IN EDL+IY +NGLP EYN FRTS+RTRS  ++FEELHVL++++E+A+  Q+K DD   QPTVLL++  S   C
Subjt:  KLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]4.4e-4860.61Show/hide
Query:  VPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKDKLSNISSVINVED
        + NP +++W AKDQ  MT+INATLS +ALAY+VG T++ +   +L K YSS+ R+N+VNLKS LQT+ KK  E+ID+Y+KRIKEIKDKL+N+S+V+N ED
Subjt:  VPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKDKLSNISSVINVED

Query:  LVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCS
        L+IY +NGLP EYN FRTS+RTRS  ++FEELHVL+K++E+A+  Q+KRDDL  QPT LLA+  S
Subjt:  LVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCS

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-4860.61Show/hide
Query:  VPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKDKLSNISSVINVED
        + NP +++W AKDQ  MT+INATLS +ALAY+VG T++ +   +L K YSS+ R+N+VNLKS LQT+ KK  E+ID+Y+KRIKEIKDKL+N+S+V+N ED
Subjt:  VPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKDKLSNISSVINVED

Query:  LVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCS
        L+IY +NGLP EYN FRTS+RTRS  ++FEELHVL+K++E+A+  Q+KRDDL  QPT LLA+  S
Subjt:  LVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]7.5e-4855.85Show/hide
Query:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR
        TN    + + S+ +  +    NP +E+WIAKDQ  MT+INATLS +ALAY+VG TS+ +   +L K YSS  R+N+VNLKS LQT+ KKP E+ID+Y+KR
Subjt:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR

Query:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC
        IKEIKDKL+N+S+ IN EDL+IY +NGLP EYN FRTS+RTRS  ++FEELHVL++++E+A+  Q+K DD   QPTVLL++  S   C
Subjt:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]5.2e-4959.34Show/hide
Query:  QDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKD
        Q SPS  S  +    NPL+E+WIAKDQ  MT+INATLS +ALAY+VG TS+ +   +L K YSS  R+N+VNLKS LQT+ KKP E+ID+Y+KRIKEIKD
Subjt:  QDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKRIKEIKD

Query:  KLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC
        KL+N+S+ IN EDL+IY +NGLP EYN FRTS+RTRS  ++FEELHVL++++E+A+  Q+K DD   QPTVLL++  S   C
Subjt:  KLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X23.6e-4855.85Show/hide
Query:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR
        TN    + + S+ +  +    NP +E+WIAKDQ  MT+INATLS +ALAY+VG TS+ +   +L K YSS  R+N+VNLKS LQT+ KKP E+ID+Y+KR
Subjt:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR

Query:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC
        IKEIKDKL+N+S+ IN EDL+IY +NGLP EYN FRTS+RTRS  ++FEELHVL++++E+A+  Q+K DD   QPTVLL++  S   C
Subjt:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X33.6e-4855.85Show/hide
Query:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR
        TN    + + S+ +  +    NP +E+WIAKDQ  MT+INATLS +ALAY+VG TS+ +   +L K YSS  R+N+VNLKS LQT+ KKP E+ID+Y+KR
Subjt:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR

Query:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC
        IKEIKDKL+N+S+ IN EDL+IY +NGLP EYN FRTS+RTRS  ++FEELHVL++++E+A+  Q+K DD   QPTVLL++  S   C
Subjt:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X13.6e-4855.85Show/hide
Query:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR
        TN    + + S+ +  +    NP +E+WIAKDQ  MT+INATLS +ALAY+VG TS+ +   +L K YSS  R+N+VNLKS LQT+ KKP E+ID+Y+KR
Subjt:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR

Query:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC
        IKEIKDKL+N+S+ IN EDL+IY +NGLP EYN FRTS+RTRS  ++FEELHVL++++E+A+  Q+K DD   QPTVLL++  S   C
Subjt:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC

A0A5D3CLI6 T4.53.6e-4855.85Show/hide
Query:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR
        TN    + + S+ +  +    NP +E+WIAKDQ  MT+INATLS +ALAY+VG TS+ +   +L K YSS  R+N+VNLKS LQT+ KKP E+ID+Y+KR
Subjt:  TNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPGETIDSYVKR

Query:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC
        IKEIKDKL+N+S+ IN EDL+IY +NGLP EYN FRTS+RTRS  ++FEELHVL++++E+A+  Q+K DD   QPTVLL++  S   C
Subjt:  IKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQC

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein7.6e-4653Show/hide
Query:  ASSATTPSSSSSTKDPMHTNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSA
        +SSATT ++SSST +                      + NP +E+W AKDQ  M LINATLS +AL Y+VGC S+S+    LE++YSSN RTNIVNLKS 
Subjt:  ASSATTPSSSSSTKDPMHTNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSA

Query:  LQTMLKKPGETIDSYVKRIKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQ
        LQ + KKP E I  Y+K+IKE+KDKL+N ++++  EDLVIY +NGLP EYNAFRTS++TRS  +SF ELH+L+KS+E+A+  Q KR+D+  QPT +LA Q
Subjt:  LQTMLKKPGETIDSYVKRIKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCATCTGCTACCACTCCATCTTCTTCATCTTCAACAAAGGACCCTATGCACACAAACTCTTTGGTTTCTCAAGACTCGCCTTCAGCACCTTCCCCTGCTATAAT
TTCTGTTCCTAATCCATTATTTGAAGAATGGATTGCAAAAGACCAAGTATCGATGACCTTGATTAATGCTACCCTTTCCACTAAAGCACTTGCTTATATTGTTGGTTGCA
CCTCAGCAAGTGAAACTTTGACGATTCTTGAGAAAAATTATTCATCCAATTTGCGAACAAATATTGTCAATCTGAAGTCTGCACTTCAGACTATGCTCAAGAAACCTGGC
GAAACAATTGACTCTTATGTCAAACGGATCAAGGAGATCAAAGACAAGTTATCTAATATCTCTTCCGTCATTAATGTTGAAGATCTTGTCATCTACACTATGAATGGTCT
CCCTGCAGAGTATAATGCCTTTCGAACTTCCCTGAGGACCCGATCTCATGCTATCTCATTTGAAGAACTTCATGTTCTTATGAAATCAAAGGAAGCTGCTATTGTCGTTC
AAGCTAAGCGTGATGATCTTCTCACTCAACCTACTGTGCTTCTTGCTACCCAATGTTCTCAAAATCAATGTCAATTTTGTCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCATCTGCTACCACTCCATCTTCTTCATCTTCAACAAAGGACCCTATGCACACAAACTCTTTGGTTTCTCAAGACTCGCCTTCAGCACCTTCCCCTGCTATAAT
TTCTGTTCCTAATCCATTATTTGAAGAATGGATTGCAAAAGACCAAGTATCGATGACCTTGATTAATGCTACCCTTTCCACTAAAGCACTTGCTTATATTGTTGGTTGCA
CCTCAGCAAGTGAAACTTTGACGATTCTTGAGAAAAATTATTCATCCAATTTGCGAACAAATATTGTCAATCTGAAGTCTGCACTTCAGACTATGCTCAAGAAACCTGGC
GAAACAATTGACTCTTATGTCAAACGGATCAAGGAGATCAAAGACAAGTTATCTAATATCTCTTCCGTCATTAATGTTGAAGATCTTGTCATCTACACTATGAATGGTCT
CCCTGCAGAGTATAATGCCTTTCGAACTTCCCTGAGGACCCGATCTCATGCTATCTCATTTGAAGAACTTCATGTTCTTATGAAATCAAAGGAAGCTGCTATTGTCGTTC
AAGCTAAGCGTGATGATCTTCTCACTCAACCTACTGTGCTTCTTGCTACCCAATGTTCTCAAAATCAATGTCAATTTTGTCCCTAG
Protein sequenceShow/hide protein sequence
MASSATTPSSSSSTKDPMHTNSLVSQDSPSAPSPAIISVPNPLFEEWIAKDQVSMTLINATLSTKALAYIVGCTSASETLTILEKNYSSNLRTNIVNLKSALQTMLKKPG
ETIDSYVKRIKEIKDKLSNISSVINVEDLVIYTMNGLPAEYNAFRTSLRTRSHAISFEELHVLMKSKEAAIVVQAKRDDLLTQPTVLLATQCSQNQCQFCP