; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0073171 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0073171
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr03:20604574..20605219
RNA-Seq ExpressionCmc03g0073171
SyntenyCmc03g0073171
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAT38758.1 Putative gag-pol polyprotein, identical [Solanum demissum]8.1e-5154.63Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++TRKSFP+GK+WRA+ CLEL++                                        FE FK FKA VE QSG  IKSL +DRGGEFLSN+FNL
Subjt:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ
        FC+E+GI RELT PYT EQNGVA RK+R V+EM RS L+ KGL + F  EAV+T +Y LNISPTK V NTT  EA  GKK  VSHLR+FGCI++ALV   
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ

Query:  VCKKLDKKSVKCIFVG
           KLD+KS KCIFVG
Subjt:  VCKKLDKKSVKCIFVG

KAG7559774.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]2.2e-4850.93Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIYK-------------------------------------VAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++TRKSFP+G+A RA++CLE+++                                       AFE FK FKA VEKQSG  +K L +DRGGEF S  FN 
Subjt:  EKTRKSFPIGKAWRASKCLELIYK-------------------------------------VAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ
        FC+  GI+ ELTT YT EQNGVA RK+R V+EM RSML+ K L N F AE+V T++YLLNISPTK V+N T +EA CG+K  VSHLRVFG ++++LV S 
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ

Query:  VCKKLDKKSVKCIFVG
          KKLD+KS KC+F+G
Subjt:  VCKKLDKKSVKCIFVG

KAG7566530.1 F-box associated domain type 3 [Arabidopsis suecica]1.7e-4851.85Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY------------------------------------KV-AFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++TRKSFP+G+A RA++CLE+++                                    KV AFE FK FKA VEKQSG  +K L +DRGGEF S  FN 
Subjt:  EKTRKSFPIGKAWRASKCLELIY------------------------------------KV-AFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ
        FC+  GI+ ELTT YT EQNGVA RK+R V+EM RSML+ K L N F AE+V T++YLLNISPTK V+N T +EA CG+K  VSHLRVFG ++++LV S 
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ

Query:  VCKKLDKKSVKCIFVG
          KKLD+KS KC+F+G
Subjt:  VCKKLDKKSVKCIFVG

KAG7597740.1 Ribonuclease H-like superfamily [Arabidopsis suecica]2.2e-4850.93Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIYK-------------------------------------VAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++TRKSFP+G+A RA++CLE+++                                       AFE FK FKA VEKQSG  +K L +DRGGEF S  FN 
Subjt:  EKTRKSFPIGKAWRASKCLELIYK-------------------------------------VAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ
        FC+  GI+ ELTT YT EQNGVA RK+R V+EM RSML+ K L N F AE+V T++YLLNISPTK V+N T +EA CG+K  VSHLRVFG ++++LV S 
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ

Query:  VCKKLDKKSVKCIFVG
          KKLD+KS KC+F+G
Subjt:  VCKKLDKKSVKCIFVG

TYK00906.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]1.9e-6375.56Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY-KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRS
        ++TRKSFPIGKA RASKCLELI+  +          K +KQSGMFIKSL SDRGGEFLSNNFN FCK+HGI+RELTTPYT EQNGVA RK+R V+EM RS
Subjt:  EKTRKSFPIGKAWRASKCLELIY-KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRS

Query:  MLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG
        MLQ+KGLSN F AEAVSTSIYLLNISPTK VMN T FE   GKK NV+HLRVFGCIS+ALVPSQV +KLDKKS KCIFVG
Subjt:  MLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG

TrEMBL top hitse value%identityAlignment
A0A0V0IV83 Putative ovule protein (Fragment)2.4e-5657.87Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++TRKSFP+GKAWRASKCLELIY                                        FEKF+ FKA VE QS   IK L  DRGGEF+SN FNL
Subjt:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ
        FC+ +GI+RELTTPYT EQNGVA RK+R V+EM RSMLQ K L+N F AEAV+ SIYLLN+SPTKVVMN T +EA   +K NVSHLRVFGC+++ALV SQ
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ

Query:  VCKKLDKKSVKCIFVG
          +KLD+KS KCIF+G
Subjt:  VCKKLDKKSVKCIFVG

A0A5D3BRM6 Putative gag-pol polyprotein, identical9.0e-6475.56Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY-KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRS
        ++TRKSFPIGKA RASKCLELI+  +          K +KQSGMFIKSL SDRGGEFLSNNFN FCK+HGI+RELTTPYT EQNGVA RK+R V+EM RS
Subjt:  EKTRKSFPIGKAWRASKCLELIY-KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRS

Query:  MLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG
        MLQ+KGLSN F AEAVSTSIYLLNISPTK VMN T FE   GKK NV+HLRVFGCIS+ALVPSQV +KLDKKS KCIFVG
Subjt:  MLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG

A0A5D3DWP2 Putative gag-pol polyprotein, identical9.0e-4864.37Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++TRKSFPIGKAWRASKCLELI+                                        FEKFKHFKAKVEKQSGMFIKSL SDRG EFLSNNFN 
Subjt:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFE
        FCKEHGI+RELTTPYT EQNGVA RK++ V+EM RSMLQ+KGL N F AEAVS SIYLLNISPTK VMN T FE
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFE

A0A7C8Z9K0 Integrase catalytic domain-containing protein (Fragment)1.3e-4952.31Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++ R SFP+GKAWRAS CLELI+                                       AFE FK FK  VEKQS   IK+L +DRGGEF S  F++
Subjt:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ
        FC+EHGI+RELT PYT EQNGVA RK+R ++EM RSM+  + +   F AEAV+T++YLLNISPTK V   T +EA  G K  VSHLRVFGCI++ALV SQ
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ

Query:  VCKKLDKKSVKCIFVG
        +  KLD+KS KCIFVG
Subjt:  VCKKLDKKSVKCIFVG

Q6L3N8 Putative gag-pol polyprotein, identical3.9e-5154.63Show/hide
Query:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL
        ++TRKSFP+GK+WRA+ CLEL++                                        FE FK FKA VE QSG  IKSL +DRGGEFLSN+FNL
Subjt:  EKTRKSFPIGKAWRASKCLELIY-------------------------------------KVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNL

Query:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ
        FC+E+GI RELT PYT EQNGVA RK+R V+EM RS L+ KGL + F  EAV+T +Y LNISPTK V NTT  EA  GKK  VSHLR+FGCI++ALV   
Subjt:  FCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQ

Query:  VCKKLDKKSVKCIFVG
           KLD+KS KCIFVG
Subjt:  VCKKLDKKSVKCIFVG

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.3e-2944.44Show/hide
Query:  FEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISP
        F+ F+ F A VE+++G  +K L SD GGE+ S  F  +C  HGI  E T P T + NGVA R +R ++E VRSML++  L   F  EAV T+ YL+N SP
Subjt:  FEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISP

Query:  TKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG
        +  +           K+ + SHL+VFGC +FA VP +   KLD KS+ CIF+G
Subjt:  TKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.3e-1532.24Show/hide
Query:  EKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPT
        E F  FK  +E +    I + +SD GGEF++     +  +HGI+   + P+T E NG++ RK R ++E   ++L    +   +   A + ++YL+N  PT
Subjt:  EKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPT

Query:  KVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG
         ++   + F+ L G   N   LRVFGC  +  +      KLD KS +C+F+G
Subjt:  KVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.8e-1632.89Show/hide
Query:  EKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPT
        + F  FK+ VE +    I +L+SD GGEF+      +  +HGI+   + P+T E NG++ RK R ++EM  ++L    +   +   A S ++YL+N  PT
Subjt:  EKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVKGLSNIFCAEAVSTSIYLLNISPT

Query:  KVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG
         ++   + F+ L G+  N   L+VFGC  +  +      KL+ KS +C F+G
Subjt:  KVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAAACTCGAAAGTCTTTTCCTATTGGGAAAGCTTGGAGAGCCTCGAAGTGTCTTGAGCTAATTTATAAAGTGGCATTTGAGAAGTTCAAGCACTTCAAG
GCAAAGGTGGAAAAGCAAAGTGGCATGTTCATCAAATCTCTTCACAGTGATAGAGGTGGAGAATTCTTGTCCAACAACTTCAACCTTTTTTGCAAAGAACATGGC
ATTAATAGGGAGTTGACAACACCTTACACTCTGGAGCAAAATGGGGTAGCCGGAAGGAAGAGTCGAATCGTGCTGGAAATGGTGAGAAGCATGTTGCAAGTAAAA
GGCCTTTCAAATATTTTTTGTGCTGAAGCAGTCTCGACTTCCATCTACTTACTGAACATCTCACCAACGAAGGTAGTCATGAATACGACTACTTTTGAAGCTTTG
TGTGGCAAGAAATCCAATGTAAGTCATTTACGAGTTTTTGGTTGTATTTCTTTTGCTTTGGTACCCTCTCAAGTTTGTAAAAAACTTGATAAAAAATCTGTAAAA
TGCATTTTTGTTGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAAACTCGAAAGTCTTTTCCTATTGGGAAAGCTTGGAGAGCCTCGAAGTGTCTTGAGCTAATTTATAAAGTGGCATTTGAGAAGTTCAAGCACTTCAAG
GCAAAGGTGGAAAAGCAAAGTGGCATGTTCATCAAATCTCTTCACAGTGATAGAGGTGGAGAATTCTTGTCCAACAACTTCAACCTTTTTTGCAAAGAACATGGC
ATTAATAGGGAGTTGACAACACCTTACACTCTGGAGCAAAATGGGGTAGCCGGAAGGAAGAGTCGAATCGTGCTGGAAATGGTGAGAAGCATGTTGCAAGTAAAA
GGCCTTTCAAATATTTTTTGTGCTGAAGCAGTCTCGACTTCCATCTACTTACTGAACATCTCACCAACGAAGGTAGTCATGAATACGACTACTTTTGAAGCTTTG
TGTGGCAAGAAATCCAATGTAAGTCATTTACGAGTTTTTGGTTGTATTTCTTTTGCTTTGGTACCCTCTCAAGTTTGTAAAAAACTTGATAAAAAATCTGTAAAA
TGCATTTTTGTTGGTTAG
Protein sequenceShow/hide protein sequence
MEKTRKSFPIGKAWRASKCLELIYKVAFEKFKHFKAKVEKQSGMFIKSLHSDRGGEFLSNNFNLFCKEHGINRELTTPYTLEQNGVAGRKSRIVLEMVRSMLQVK
GLSNIFCAEAVSTSIYLLNISPTKVVMNTTTFEALCGKKSNVSHLRVFGCISFALVPSQVCKKLDKKSVKCIFVG