; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0072971 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0072971
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTy1-copia retrotransposon protein
Genome locationCMiso1.1chr03:20305349..20305912
RNA-Seq ExpressionCmc03g0072971
SyntenyCmc03g0072971
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054462.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]2.0e-9290.91Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNANASSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTSE+HETGKCPICV+SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRGGKNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA  MFLKFKAESENQLGKKIKRLRSDRG EYS KTLKEFCESNGIIHEFT PYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

KAA0058501.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]4.4e-9290.37Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNAN SSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPIC++SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRG KNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMFLKFKAESENQLGK+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

KAA0063949.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]1.7e-9190.37Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNANASSSAY+IESANLWHGRL HVNFASIRKLKDLRLINTSE+HETGKCPICV+SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRGGKNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMFLKFKA+SENQL K+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QNDIAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

TYK07244.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]4.4e-9290.37Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNAN SSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPIC++SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRG KNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMFLKFKAESENQLGK+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

TYK23593.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]5.7e-9289.3Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNANASSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTS++HETGKCP+C++SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRGGKNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMF+KFKAESENQLGK+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

TrEMBL top hitse value%identityAlignment
A0A5A7UFB4 Ty1-copia retrotransposon protein9.6e-9390.91Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNANASSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTSE+HETGKCPICV+SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRGGKNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA  MFLKFKAESENQLGKKIKRLRSDRG EYS KTLKEFCESNGIIHEFT PYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

A0A5A7UYD5 Ty1-copia retrotransposon protein2.1e-9290.37Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNAN SSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPIC++SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRG KNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMFLKFKAESENQLGK+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

A0A5A7VEJ0 Ty1-copia retrotransposon protein8.1e-9290.37Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNANASSSAY+IESANLWHGRL HVNFASIRKLKDLRLINTSE+HETGKCPICV+SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRGGKNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMFLKFKA+SENQL K+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QNDIAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

A0A5D3C7N0 Ty1-copia retrotransposon protein2.1e-9290.37Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNAN SSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPIC++SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRG KNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMFLKFKAESENQLGK+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

A0A5D3DJ22 Ty1-copia retrotransposon protein2.8e-9289.3Show/hide
Query:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI
        MNANASSSAY+IESANLWHGRLGHVNFASIRKLKDLRLINTS++HETGKCP+C++SKFHKKPFKPVEYRTT++LELIHSDLADFRTTTSRGGKNYYVSF+
Subjt:  MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFI

Query:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK
        DDYSRFTKIYLIKTKNEA SMF+KFKAESENQLGK+IKRLRSDRG EYS KTLKEFCESNGIIHEFTAPYSP+QN IAERK   LK+
Subjt:  DDYSRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-2033.14Show/hide
Query:  LWHGRLGHVNFASIRKLK------DLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRT--TDMLELIHSDLADFRTTTSRGGKNYYVSFIDDYSRFTK
        LWH R GH++   + ++K      D  L+N  E      C  C+  K  + PFK ++ +T     L ++HSD+    T  +   KNY+V F+D ++ +  
Subjt:  LWHGRLGHVNFASIRKLK------DLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRT--TDMLELIHSDLADFRTTTSRGGKNYYVSFIDDYSRFTK

Query:  IYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAER
         YLIK K++  SMF  F A+SE     K+  L  D G EY    +++FC   GI +  T P++P+ N ++ER
Subjt:  IYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAER

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-3038.92Show/hide
Query:  SANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFIDDYSRFTKIYLIK
        S +LWH R+GH++   ++ L    LI+ ++      C  C+  K H+  F+    R  ++L+L++SD+       S GG  Y+V+FIDD SR   +Y++K
Subjt:  SANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFIDDYSRFTKIYLIK

Query:  TKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAER
        TK++   +F KF A  E + G+K+KRLRSD G EY+ +  +E+C S+GI HE T P +P+ N +AER
Subjt:  TKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAER

Q12491 Transposon Ty2-B Gag-Pol polyprotein2.1e-1228.95Show/hide
Query:  LWHGRLGHVNFASIRK-LKDLRLINTSESH------ETGKCPICV--KSKFHK----------KPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYV
        L H  LGH NF SI+K LK   +    ES        T +CP C+  KS  H+          + ++P +Y  TD+   +H           +   +Y++
Subjt:  LWHGRLGHVNFASIRK-LKDLRLINTSESH------ETGKCPICV--KSKFHK----------KPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYV

Query:  SFIDDYSRFTKIYLIKTKNEAG--SMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELL
        SF D+ +RF  +Y +  + E    ++F    A  +NQ   ++  ++ DRGSEY+ KTL +F  + GI   +T     + + +AER    L
Subjt:  SFIDDYSRFTKIYLIKTKNEAG--SMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-1632.02Show/hide
Query:  ASSSAYVIESANLWHGRLGHVNFASIRK-LKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFIDDY
        AS S+    S+  WH RLGH   + +   + +  L   + SH+   C  C+ +K +K PF      +T  LE I+SD+       S     YYV F+D +
Subjt:  ASSSAYVIESANLWHGRLGHVNFASIRK-LKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFIDDY

Query:  SRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERK
        +R+T +Y +K K++    F+ FK   EN+   +I    SD G E+    L E+   +GI H  + P++P+ N ++ERK
Subjt:  SRFTKIYLIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.3e-1832.73Show/hide
Query:  WHGRLGHVNFASIRK-LKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFIDDYSRFTKIYLIKTKN
        WH RLGH + A +   + +  L   + SH+   C  C  +K HK PF      ++  LE I+SD+       S     YYV F+D ++R+T +Y +K K+
Subjt:  WHGRLGHVNFASIRK-LKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFIDDYSRFTKIYLIKTKN

Query:  EAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERK
        +    F+ FK+  EN+   +I  L SD G E+    L+++   +GI H  + P++P+ N ++ERK
Subjt:  EAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCAAATGCTTCTAGTTCTGCTTACGTGATTGAATCTGCTAACTTATGGCATGGTAGACTAGGACATGTGAACTTTGCATCAATTAGGAAACTTAAAGACTTGAG
ACTTATTAATACTTCTGAGTCGCATGAAACTGGCAAATGCCCCATTTGTGTAAAAAGTAAATTCCATAAGAAACCTTTCAAACCAGTTGAATATAGAACTACTGATATGT
TAGAATTAATTCACTCGGATCTAGCCGATTTTAGAACCACTACTAGTAGAGGTGGTAAAAACTACTATGTATCCTTTATTGATGATTACTCTAGATTCACTAAGATATAC
CTGATAAAAACAAAAAATGAAGCTGGTAGTATGTTTTTAAAATTCAAGGCAGAATCTGAGAATCAGTTAGGAAAGAAGATAAAAAGATTAAGATCAGATAGAGGTAGTGA
GTATTCTGGTAAAACTCTTAAAGAATTTTGTGAGTCAAATGGTATCATCCATGAATTTACTGCTCCTTACTCACCAAAACAAAATGACATAGCAGAACGAAAAACAGAAC
TATTAAAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGCAAATGCTTCTAGTTCTGCTTACGTGATTGAATCTGCTAACTTATGGCATGGTAGACTAGGACATGTGAACTTTGCATCAATTAGGAAACTTAAAGACTTGAG
ACTTATTAATACTTCTGAGTCGCATGAAACTGGCAAATGCCCCATTTGTGTAAAAAGTAAATTCCATAAGAAACCTTTCAAACCAGTTGAATATAGAACTACTGATATGT
TAGAATTAATTCACTCGGATCTAGCCGATTTTAGAACCACTACTAGTAGAGGTGGTAAAAACTACTATGTATCCTTTATTGATGATTACTCTAGATTCACTAAGATATAC
CTGATAAAAACAAAAAATGAAGCTGGTAGTATGTTTTTAAAATTCAAGGCAGAATCTGAGAATCAGTTAGGAAAGAAGATAAAAAGATTAAGATCAGATAGAGGTAGTGA
GTATTCTGGTAAAACTCTTAAAGAATTTTGTGAGTCAAATGGTATCATCCATGAATTTACTGCTCCTTACTCACCAAAACAAAATGACATAGCAGAACGAAAAACAGAAC
TATTAAAGAAATGA
Protein sequenceShow/hide protein sequence
MNANASSSAYVIESANLWHGRLGHVNFASIRKLKDLRLINTSESHETGKCPICVKSKFHKKPFKPVEYRTTDMLELIHSDLADFRTTTSRGGKNYYVSFIDDYSRFTKIY
LIKTKNEAGSMFLKFKAESENQLGKKIKRLRSDRGSEYSGKTLKEFCESNGIIHEFTAPYSPKQNDIAERKTELLKK