CuGenDBv2

Cmc04g0098071 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0098071
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Beta-galactosidase
Genome location: CMiso1.1chr04:13344315..13344746
RNA-Seq Expression: Cmc04g0098071
Synteny: Cmc04g0098071
Gene Ontology terms:
GO:0003824 - catalytic activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039144.1 Beta-galactosidase [Cucumis melo var. makuwa], e value 3.8e-43, identity 45.54%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++C +PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S + KL+TV+  LSV VNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  ------------------------------------------------------------------------------VARSKERISISQRKYTIDLLKE
                                                                                      VARSKE IS+SQRKYT+DLL E
Subjt:  ------------------------------------------------------------------------------VARSKERISISQRKYTIDLLKE

Query:  TSMTGYRPANTPIEFNVKLEDSVD
        T M G RPA+TPIEFN KL +S D
Subjt:  TSMTGYRPANTPIEFNVKLEDSVD

KAA0042053.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa], e value 4.8e-46, identity 50.48%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++CA+PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S V KL+TV+  LSVAVNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  --------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFN
                                                                      VARSKE IS+SQRKYT+DLL ET M G RPA+TPIEFN
Subjt:  --------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFN

Query:  VKLEDSVD
         KL +S D
Subjt:  VKLEDSVD

KAA0042054.1 myosin-9 isoform X2 [Cucumis melo var. makuwa], e value 2.2e-43, identity 47.49%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++CA+PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S V KL+TV+  LSVAVNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG
                                                                                 VARSKE IS+SQRKYT+DLL ET M G
Subjt:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG

Query:  YRPANTPIEFNVKLEDSVD
         RPA+T IEFN KL +S D
Subjt:  YRPANTPIEFNVKLEDSVD

KAA0043254.1 DNA polymerase theta [Cucumis melo var. makuwa], e value 4.8e-46, identity 62.99%
Query:  DLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA----
        ++CA+PKGH  +GC W+F LKYK++GTLD++KARLV +GFTQTY +DYSET+S V KL+TV+  LSVAVNKDW + QLDVKNAFLN +L E+VYM+    
Subjt:  DLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA----

Query:  --------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFNVKLEDSVD
                VARSKE IS+SQRKYT+DLL ET M G RP +TPIEFN KL +S D
Subjt:  --------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFNVKLEDSVD

TYK17997.1 myosin-9 isoform X2 [Cucumis melo var. makuwa], e value 2.2e-43, identity 47.49%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++CA+PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S V KL+TV+  LSVAVNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG
                                                                                 VARSKE IS+SQRKYT+DLL ET M G
Subjt:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG

Query:  YRPANTPIEFNVKLEDSVD
         RPA+T IEFN KL +S D
Subjt:  YRPANTPIEFNVKLEDSVD

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T6N2 Beta-galactosidase, e value 1.8e-43, identity 45.54%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++C +PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S + KL+TV+  LSV VNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  ------------------------------------------------------------------------------VARSKERISISQRKYTIDLLKE
                                                                                      VARSKE IS+SQRKYT+DLL E
Subjt:  ------------------------------------------------------------------------------VARSKERISISQRKYTIDLLKE

Query:  TSMTGYRPANTPIEFNVKLEDSVD
        T M G RPA+TPIEFN KL +S D
Subjt:  TSMTGYRPANTPIEFNVKLEDSVD

A0A5A7TF39 Myosin-9 isoform X2, e value 1.1e-43, identity 47.49%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++CA+PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S V KL+TV+  LSVAVNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG
                                                                                 VARSKE IS+SQRKYT+DLL ET M G
Subjt:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG

Query:  YRPANTPIEFNVKLEDSVD
         RPA+T IEFN KL +S D
Subjt:  YRPANTPIEFNVKLEDSVD

A0A5A7TFR5 Retrovirus-related Pol polyprotein from transposon TNT 1-94, e value 2.3e-46, identity 50.48%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++CA+PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S V KL+TV+  LSVAVNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  --------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFN
                                                                      VARSKE IS+SQRKYT+DLL ET M G RPA+TPIEFN
Subjt:  --------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFN

Query:  VKLEDSVD
         KL +S D
Subjt:  VKLEDSVD

A0A5A7TIW0 DNA polymerase theta, e value 2.3e-46, identity 62.99%
Query:  DLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA----
        ++CA+PKGH  +GC W+F LKYK++GTLD++KARLV +GFTQTY +DYSET+S V KL+TV+  LSVAVNKDW + QLDVKNAFLN +L E+VYM+    
Subjt:  DLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA----

Query:  --------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFNVKLEDSVD
                VARSKE IS+SQRKYT+DLL ET M G RP +TPIEFN KL +S D
Subjt:  --------VARSKERISISQRKYTIDLLKETSMTGYRPANTPIEFNVKLEDSVD

A0A5D3D271 Myosin-9 isoform X2, e value 1.1e-43, identity 47.49%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---
        W++CA+PKGH T+GC W+F LKYK++GTLD++KARLV +GFTQTYGIDYSET+S V KL+TV+  LSVAVNKDWP++QLDVKNAFLNG+L EEVYM+   
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA---

Query:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG
                                                                                 VARSKE IS+SQRKYT+DLL ET M G
Subjt:  -------------------------------------------------------------------------VARSKERISISQRKYTIDLLKETSMTG

Query:  YRPANTPIEFNVKLEDSVD
         RPA+T IEFN KL +S D
Subjt:  YRPANTPIEFNVKLEDSVD

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein, e value 4.4e-18, identity 43.75%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYM
        W +   P+    +   W+F +KY   G   +YKARLV  GFTQ Y IDY ET++ V ++S+ +  LS+ +  +  +HQ+DVK AFLNG L EE+YM
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94, e value 1.1e-18, identity 39.84%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMAVAR
        + L  +PKG   + C W+F LK   +  L +YKARLVV+GF Q  GID+ E +S V K+++++  LS+A + D  + QLDVK AFL+G+L EE+YM    
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMAVAR

Query:  SKERISISQRKYTIDLLKETSMTGYRPA
          E   ++ +K+ +  L + S+ G + A
Subjt:  SKERISISQRKYTIDLLKETSMTGYRPA

P92520 Uncharacterized mitochondrial protein AtMg00820, e value 4.0e-11, identity 46.38%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVA
        W L   P     +GC W+F  K  S+GTLD+ KARLV +GF Q  GI + ETYS V + +T++  L+VA
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1, e value 6.9e-24, identity 53.06%
Query:  WDLCAIPKGHTTI-GCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA
        WDL   P  H TI GC W+F  KY S+G+L++YKARLV +G+ Q  G+DY+ET+S V K ++++  L VAV++ WPI QLDV NAFL G L ++VYM+
Subjt:  WDLCAIPKGHTTI-GCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2, e value 1.7e-22, identity 52.04%
Query:  WDLCAIPKGHTTI-GCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA
        WDL   P    TI GC W+F  K+ S+G+L++YKARLV +G+ Q  G+DY+ET+S V K ++++  L VAV++ WPI QLDV NAFL G L +EVYM+
Subjt:  WDLCAIPKGHTTI-GCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYMA

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8, e value 9.9e-26, identity 50%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYM
        W++C +P     IGC W++ +KY S+GT+++YKARLV +G+TQ  GID+ ET+S V KL++VK  L+++   ++ +HQLD+ NAFLNG+L EE+YM
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGEL-EEVYM

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8, e value 2.2e-04, identity 29.32%
Query:  LFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAV---------NKDWPIHQL--DVKNAFLNGELEEVY----MAVARSKE
        ++ LK  S     K+   L+  GF Q++      TY       T   FL V V         N D  + +L   +K+ F   +L  +     + +ARS  
Subjt:  LFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAV---------NKDWPIHQL--DVKNAFLNGELEEVY----MAVARSKE

Query:  RISISQRKYTIDLLKETSMTGYRPANTPIEFNV
         I+I QRKY +DLL ET + G +P++ P++ +V
Subjt:  RISISQRKYTIDLLKETSMTGYRPANTPIEFNV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase), e value 2.8e-12, identity 46.38%
Query:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVA
        W L   P     +GC W+F  K  S+GTLD+ KARLV +GF Q  GI + ETYS V + +T++  L+VA
Subjt:  WDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVA


Sequences
CDS sequence
ATGTGGGATCTTTGTGCTATTCCTAAAGGGCATACAACAATTGGGTGCAATTGGTTGTTCCCTCTAAAGTATAAATCAAATGGAACCCTAGACAAGTACAAGGCTAGACT
TGTAGTGGAAGGGTTTACTCAAACTTATGGGATAGATTATTCTGAGACATACTCCTCTGTAGAAAAGTTAAGCACAGTCAAGGGCTTTCTTTCTGTTGCAGTTAATAAAG
ACTGGCCTATCCATCAGCTTGATGTGAAAAATGCATTTCTGAATGGTGAATTAGAAGAAGTTTATATGGCAGTGGCAAGATCAAAGGAACGAATCTCTATCTCACAACGG
AAATACACCATTGACCTATTAAAAGAAACAAGTATGACGGGATATAGACCTGCTAACACTCCTATTGAATTCAATGTGAAACTGGAAGATTCTGTTGATTAA
mRNA sequence
ATGTGGGATCTTTGTGCTATTCCTAAAGGGCATACAACAATTGGGTGCAATTGGTTGTTCCCTCTAAAGTATAAATCAAATGGAACCCTAGACAAGTACAAGGCTAGACT
TGTAGTGGAAGGGTTTACTCAAACTTATGGGATAGATTATTCTGAGACATACTCCTCTGTAGAAAAGTTAAGCACAGTCAAGGGCTTTCTTTCTGTTGCAGTTAATAAAG
ACTGGCCTATCCATCAGCTTGATGTGAAAAATGCATTTCTGAATGGTGAATTAGAAGAAGTTTATATGGCAGTGGCAAGATCAAAGGAACGAATCTCTATCTCACAACGG
AAATACACCATTGACCTATTAAAAGAAACAAGTATGACGGGATATAGACCTGCTAACACTCCTATTGAATTCAATGTGAAACTGGAAGATTCTGTTGATTAA
Protein sequence
MWDLCAIPKGHTTIGCNWLFPLKYKSNGTLDKYKARLVVEGFTQTYGIDYSETYSSVEKLSTVKGFLSVAVNKDWPIHQLDVKNAFLNGELEEVYMAVARSKERISISQR
KYTIDLLKETSMTGYRPANTPIEFNVKLEDSVD
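
Consistency check (illustrative)
The three sequences and the genome location in this record can be cross-checked with a few lines of Python. The sketch below is not part of the CuGenDB record. It assumes the standard genetic code (NCBI translation table 1), and the helper names translate and span_length are hypothetical. For brevity it translates only the first 30 and last 18 nucleotides of the CDS rather than the full sequence.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def span_length(location):
    """Length in bp of a 'seqid:start..end' location (1-based, inclusive)."""
    start, end = location.rsplit(":", 1)[1].split("..")
    return int(end) - int(start) + 1


cds_start = "ATGTGGGATCTTTGTGCTATTCCTAAAGGG"  # first 30 nt of the CDS in this record
cds_end = "GAAGATTCTGTTGATTAA"                # last 18 nt of the CDS in this record

print(translate(cds_start))  # MWDLCAIPKG
print(translate(cds_end))    # EDSVD*
print(span_length("CMiso1.1chr04:13344315..13344746"))  # 432
```

The translated ends agree with the start (MWDLCAIPKG) and end (EDSVD) of the protein sequence, and the 432 bp span corresponds to 144 codons, that is, the 143 residues of the protein plus the stop codon.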