; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G033500 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G033500
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchrH02:7612373..7612873
RNA-Seq ExpressionChy2G033500
SyntenyChy2G033500
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055915.1 copia protein [Cucumis melo var. makuwa]1.01e-5565.87Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLL SQDLWDLVE GY DPDDEG L+ENR+KD KALVI+QQAVHD+VFSRIAAA TSKQAWLILQKAFQGD+RVLVVKLQSL+RDFETLMMK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                 +      S  Q +        +   V+        RS+TPKFDH+VAAIEE KDLSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

KAA0058478.1 UBN2 domain-containing protein [Cucumis melo var. makuwa]5.79e-6266.47Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLLRSQDLWDLVE GYADPDDEG LRENRKKDS+ALVIIQQAVH+ +FS IAA  TSKQ WLILQKAFQGD+RVLVVKLQSLRRDFETLMMK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
              +  +    T             +    ++V+ A     RS+TPKFDH+VAAIEE KDLSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

TYK00351.1 UBN2 domain-containing protein [Cucumis melo var. makuwa]2.16e-5964.67Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLLRSQDLWDLVE GYADPDDE  L EN+KKDSKALVIIQQ VHDSVFSRI AA +SKQ+WLILQKAFQGD+RVLVVKLQSLRRDFETL MK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                 +      S  + +     +  +   V+        RS+TPKFDH+VAAIEE K+LSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

TYK27735.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]9.23e-5767.07Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        +KTLLRSQDLWDLVE GY DPDDEG LRENRKKDSKALVIIQQAVHDSVFSRIA A TSKQAWLILQKAFQGD+RVL+VKLQSLRRDFETLMMK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                 +      S  Q +        +   V+        RS+TPKFDH+VAAIEE K+L TF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

XP_008463459.1 PREDICTED: uncharacterized protein LOC103501626 [Cucumis melo]1.91e-5764.67Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLLRSQDLWDLVE GYADPDDE  L EN+KKDSKALVIIQQ VHDSVFSRI AA +SKQAWLILQKAFQGD+RVL+VKLQSLRRDFETL MK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                 +    T   R        +  +   V+        RS+T KFDH+VAAIEE K+LSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

TrEMBL top hitse value%identityAlignment
A0A1S3CJ95 uncharacterized protein LOC1035016266.6e-4564.67Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLLRSQDLWDLVE GYADPDDE  L EN+KKDSKALVIIQQ VHDSVFSRI AA +SKQAWLILQKAFQGD+RVL+VKLQSLRRDFETL MK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                 +    T   R        +  +   V+        RS+T KFDH+VAAIEE K+LSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

A0A5A7UQM0 Copia protein1.9e-4765.87Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLL SQDLWDLVE GY DPDDEG L+ENR+KD KALVI+QQAVHD+VFSRIAAA TSKQAWLILQKAFQGD+RVLVVKLQSL+RDFETLMMK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                     +  S  Q +        +   V+        RS+TPKFDH+VAAIEE KDLSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

A0A5D3BKL0 UBN2 domain-containing protein1.0e-4564.67Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLLRSQDLWDLVE GYADPDDE  L EN+KKDSKALVIIQQ VHDSVFSRI AA +SKQ+WLILQKAFQGD+RVLVVKLQSLRRDFETL MK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                     +  S  + +     +  +   V+        RS+TPKFDH+VAAIEE K+LSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

A0A5D3CAD5 UBN2 domain-containing protein1.4e-4766.47Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        MKTLLRSQDLWDLVE GYADPDDEG LRENRKKDS+ALVIIQQAVH+ +FS IAA  TSKQ WLILQKAFQGD+RVLVVKLQSLRRDFETLMMK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
              +  +    T             +    ++V+ A     RS+TPKFDH+VAAIEE KDLSTF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

A0A5D3DWP2 Putative gag-pol polyprotein, identical6.4e-4867.07Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA
        +KTLLRSQDLWDLVE GY DPDDEG LRENRKKDSKALVIIQQAVHDSVFSRIA A TSKQAWLILQKAFQGD+RVL+VKLQSLRRDFETLMMK+GESIA
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIA

Query:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF
                     +  S  Q +        +   V+        RS+TPKFDH+VAAIEE K+L TF
Subjt:  GEQEERLEGVGDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein6.4e-0835.29Show/hide
Query:  MKTLLRSQDLWDLVEHGYADPDDEGM--------LRENRKKDSKALVIIQQAVHDSVFSRIAAAATSK
        MK +L + D+W++VE G+ +P++EG         LR++RK+D KAL +I Q + +  F ++  A ++K
Subjt:  MKTLLRSQDLWDLVEHGYADPDDEGM--------LRENRKKDSKALVIIQQAVHDSVFSRIAAAATSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACTCTTCTCAGATCTCAAGACTTATGGGACTTAGTAGAACACGGCTATGCGGATCCTGACGACGAAGGCATGTTGCGAGAGAACAGGAAGAAAGACTCGAAGGC
GTTGGTGATCATTCAACAAGCAGTCCACGACAGTGTTTTTTCGCGGATTGCTGCAGCAGCAACGTCAAAGCAAGCGTGGTTGATTTTGCAAAAGGCATTTCAAGGAGATG
CAAGAGTACTTGTGGTAAAATTGCAATCACTTAGGCGAGACTTTGAGACCTTGATGATGAAAGATGGAGAATCGATTGCGGGAGAACAGGAAGAAAGACTCGAAGGCGTT
GGTGATCATTCAACAAGCAGTCCACGACAGTGTTTTTTCGCGGATTGCTGCAGCAGCAACGTCAAAGCAAGCGTGGTTGATTTTGCAAAAGGCATTTCAAGGAGAAGCAT
GACTCCAAAGTTTGATCATATTGTGGCTGCAATAGAAGAACCAAAGGATCTATCCACTTTT
mRNA sequenceShow/hide mRNA sequence
ATGAAGACTCTTCTCAGATCTCAAGACTTATGGGACTTAGTAGAACACGGCTATGCGGATCCTGACGACGAAGGCATGTTGCGAGAGAACAGGAAGAAAGACTCGAAGGC
GTTGGTGATCATTCAACAAGCAGTCCACGACAGTGTTTTTTCGCGGATTGCTGCAGCAGCAACGTCAAAGCAAGCGTGGTTGATTTTGCAAAAGGCATTTCAAGGAGATG
CAAGAGTACTTGTGGTAAAATTGCAATCACTTAGGCGAGACTTTGAGACCTTGATGATGAAAGATGGAGAATCGATTGCGGGAGAACAGGAAGAAAGACTCGAAGGCGTT
GGTGATCATTCAACAAGCAGTCCACGACAGTGTTTTTTCGCGGATTGCTGCAGCAGCAACGTCAAAGCAAGCGTGGTTGATTTTGCAAAAGGCATTTCAAGGAGAAGCAT
GACTCCAAAGTTTGATCATATTGTGGCTGCAATAGAAGAACCAAAGGATCTATCCACTTTT
Protein sequenceShow/hide protein sequence
MKTLLRSQDLWDLVEHGYADPDDEGMLRENRKKDSKALVIIQQAVHDSVFSRIAAAATSKQAWLILQKAFQGDARVLVVKLQSLRRDFETLMMKDGESIAGEQEERLEGV
GDHSTSSPRQCFFADCCSSNVKASVVDFAKGISRRSMTPKFDHIVAAIEEPKDLSTF