; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C030877 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C030877
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr04:19790978..19792913
RNA-Seq ExpressionMELO3C030877
SyntenyMELO3C030877
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033051.1 hypothetical protein E6C27_scaffold269G001580 [Cucumis melo var. makuwa]1.4e-2363.06Show/hide
Query:  FRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLLLGLVEQIS-----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIV
        F SF ++ LGRLDQLELE GISR KGNYCNLDLG SLLGKH F  +      +             ITTYL  RSPTGR+SSMDIDMIRVIR DPR+ IV
Subjt:  FRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLLLGLVEQIS-----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIV

Query:  LVFPSDSLQTS
        LV P  SLQTS
Subjt:  LVFPSDSLQTS

KAA0034824.1 hypothetical protein E6C27_scaffold213G00580 [Cucumis melo var. makuwa]2.2e-2446.31Show/hide
Query:  PRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRH
        PR+++ PP        A   H    P  A     +SRL A    P AA    +C    P      +P       +F RSF Y+ LGRLDQ++LERGISR 
Subjt:  PRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRH

Query:  KGNYCNLDLGASLLGKHIFFLL-----------------LGLVEQIS----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSL
        KGNYCNLDLGASLLGK I  LL                 +   + +            ITT LG RSPTGRQSSMDIDMIRVIR DPRS IVLVFP  SL
Subjt:  KGNYCNLDLGASLLGKHIFFLL-----------------LGLVEQIS----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSL

Query:  QTS
        QTS
Subjt:  QTS

TYK00324.1 uncharacterized protein E5676_scaffold1923G00020 [Cucumis melo var. makuwa]9.9e-2554.94Show/hide
Query:  IEAHPSR-HSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLL---
        +EA P+   +SRL A    P AA    +C    P      +P       +F  SF Y+ LGRLDQ+ELERGISR KGNYCNLDLGASLLGK I  LL   
Subjt:  IEAHPSR-HSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLL---

Query:  ---LGLVEQIS-----RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS
           L  ++ +      RITT L  R+PTGRQSSMDIDMIRVIR DPRS IVLVFP  SLQTS
Subjt:  ---LGLVEQIS-----RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS

TYK04896.1 hypothetical protein E5676_scaffold143G00970 [Cucumis melo var. makuwa]3.8e-2446.56Show/hide
Query:  QNPRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQ-----KSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGIS
        + PR +       PL  V  R   +VKP   HP    ++ +    A  AAR Q     ++   P+ +R+     +H AF       LGRL+Q ELERGIS
Subjt:  QNPRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQ-----KSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGIS

Query:  RHKGNYCNLDLGASLLGKHIFFLL------LGLVEQISR-----ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS
        R KGNYCNLDLG SLLGKH    +      L  +  +S      ITTYLG RSPTGRQSSMDI++IRVIR DPR+ IVLV P  SLQTS
Subjt:  RHKGNYCNLDLGASLLGKHIFFLL------LGLVEQISR-----ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS

TYK04917.1 pol protein [Cucumis melo var. makuwa]1.1e-1844.92Show/hide
Query:  PLVTVADRCHRTVKPIE-AHPSRHSSRLQAVRPAPPAARIQKSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGISRHKGNYCNLDLGASLLG
        P ++  +R H  V P+  A P    SR   +  A    ++ +S +     +A    LN   F +        LDQL    G  +      NLDLG S LG
Subjt:  PLVTVADRCHRTVKPIE-AHPSRHSSRLQAVRPAPPAARIQKSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGISRHKGNYCNLDLGASLLG

Query:  KHIFFLLLGLVEQISR---------ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTSFQVEAGVKARASWRATR
        KHIF+  L LV++ISR         ITTYLG RSPTG  SSMDIDMIRVI+ DPRS IVLV PS SLQT    +A V ARASWRA R
Subjt:  KHIFFLLLGLVEQISR---------ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTSFQVEAGVKARASWRATR

TrEMBL top hitse value%identityAlignment
A0A5A7SUA9 Uncharacterized protein1.1e-2446.31Show/hide
Query:  PRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRH
        PR+++ PP        A   H    P  A     +SRL A    P AA    +C    P      +P       +F RSF Y+ LGRLDQ++LERGISR 
Subjt:  PRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRH

Query:  KGNYCNLDLGASLLGKHIFFLL-----------------LGLVEQIS----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSL
        KGNYCNLDLGASLLGK I  LL                 +   + +            ITT LG RSPTGRQSSMDIDMIRVIR DPRS IVLVFP  SL
Subjt:  KGNYCNLDLGASLLGKHIFFLL-----------------LGLVEQIS----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSL

Query:  QTS
        QTS
Subjt:  QTS

A0A5D3BM25 Uncharacterized protein4.8e-2554.94Show/hide
Query:  IEAHPSR-HSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLL---
        +EA P+   +SRL A    P AA    +C    P      +P       +F  SF Y+ LGRLDQ+ELERGISR KGNYCNLDLGASLLGK I  LL   
Subjt:  IEAHPSR-HSSRLQAVRPAPPAARIQKSC--VVPRHARAAKP--LLNHLAFFRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLL---

Query:  ---LGLVEQIS-----RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS
           L  ++ +      RITT L  R+PTGRQSSMDIDMIRVIR DPRS IVLVFP  SLQTS
Subjt:  ---LGLVEQIS-----RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS

A0A5D3BWF9 Uncharacterized protein6.9e-2463.06Show/hide
Query:  FRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLLLGLVEQIS-----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIV
        F SF ++ LGRLDQLELE GISR KGNYCNLDLG SLLGKH F  +      +             ITTYL  RSPTGR+SSMDIDMIRVIR DPR+ IV
Subjt:  FRSFAYI-LGRLDQLELERGISRHKGNYCNLDLGASLLGKHIFFLLLGLVEQIS-----------RITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIV

Query:  LVFPSDSLQTS
        LV P  SLQTS
Subjt:  LVFPSDSLQTS

A0A5D3BZE1 Pol protein5.1e-1944.92Show/hide
Query:  PLVTVADRCHRTVKPIE-AHPSRHSSRLQAVRPAPPAARIQKSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGISRHKGNYCNLDLGASLLG
        P ++  +R H  V P+  A P    SR   +  A    ++ +S +     +A    LN   F +        LDQL    G  +      NLDLG S LG
Subjt:  PLVTVADRCHRTVKPIE-AHPSRHSSRLQAVRPAPPAARIQKSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGISRHKGNYCNLDLGASLLG

Query:  KHIFFLLLGLVEQISR---------ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTSFQVEAGVKARASWRATR
        KHIF+  L LV++ISR         ITTYLG RSPTG  SSMDIDMIRVI+ DPRS IVLV PS SLQT    +A V ARASWRA R
Subjt:  KHIFFLLLGLVEQISR---------ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTSFQVEAGVKARASWRATR

A0A5D3C386 Uncharacterized protein1.8e-2446.56Show/hide
Query:  QNPRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQ-----KSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGIS
        + PR +       PL  V  R   +VKP   HP    ++ +    A  AAR Q     ++   P+ +R+     +H AF       LGRL+Q ELERGIS
Subjt:  QNPRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQ-----KSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLELERGIS

Query:  RHKGNYCNLDLGASLLGKHIFFLL------LGLVEQISR-----ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS
        R KGNYCNLDLG SLLGKH    +      L  +  +S      ITTYLG RSPTGRQSSMDI++IRVIR DPR+ IVLV P  SLQTS
Subjt:  RHKGNYCNLDLGASLLGKHIFFLL------LGLVEQISR-----ITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCATTGGGATGTGCAATTAATGCACCATCTTCATCTTCACCAAGAAGGAGAAAAACAAAACCCTAGATCTAAAGCGTCGCCGCCCTTCCTTTTTCCGCTAGTCAC
CGTCGCCGACCGTTGTCACAGAACAGTCAAGCCGATCGAAGCCCATCCGAGTCGCCACTCTTCGCGTCTCCAAGCCGTCAGACCCGCGCCGCCAGCCGCCCGAATCCAGA
AGTCGTGCGTCGTGCCTCGTCACGCCCGAGCCGCCAAACCTCTCTTGAACCACCTAGCCTTCTTCCGGTCCTTTGCTTATATTTTGGGCCGGTTGGACCAATTAGAGTTG
GAGCGTGGGATTTCTCGACATAAAGGAAATTACTGCAACCTTGACCTTGGGGCATCGCTGCTTGGAAAACACATTTTTTTCTTGCTGTTAGGACTCGTCGAGCAAATCTC
TAGGATCACCACCTATTTAGGACCGCGTAGTCCGACGGGACGCCAGTCTAGCATGGATATAGATATGATTCGAGTGATTCGATGGGATCCTCGCAGCCTGATTGTCTTAG
TGTTTCCTTCGGATTCGCTACAGACCAGTTTTCAGGTAGAGGCAGGGGTAAAGGCAAGGGCAAGCTGGCGGGCGACCAGAATAATGATTTCAACTCAGTATAAGGAGTTG
AGTTGTTACATTGAGGAAGAAAGAAGAAGAAGATTGGAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCATTGGGATGTGCAATTAATGCACCATCTTCATCTTCACCAAGAAGGAGAAAAACAAAACCCTAGATCTAAAGCGTCGCCGCCCTTCCTTTTTCCGCTAGTCAC
CGTCGCCGACCGTTGTCACAGAACAGTCAAGCCGATCGAAGCCCATCCGAGTCGCCACTCTTCGCGTCTCCAAGCCGTCAGACCCGCGCCGCCAGCCGCCCGAATCCAGA
AGTCGTGCGTCGTGCCTCGTCACGCCCGAGCCGCCAAACCTCTCTTGAACCACCTAGCCTTCTTCCGGTCCTTTGCTTATATTTTGGGCCGGTTGGACCAATTAGAGTTG
GAGCGTGGGATTTCTCGACATAAAGGAAATTACTGCAACCTTGACCTTGGGGCATCGCTGCTTGGAAAACACATTTTTTTCTTGCTGTTAGGACTCGTCGAGCAAATCTC
TAGGATCACCACCTATTTAGGACCGCGTAGTCCGACGGGACGCCAGTCTAGCATGGATATAGATATGATTCGAGTGATTCGATGGGATCCTCGCAGCCTGATTGTCTTAG
TGTTTCCTTCGGATTCGCTACAGACCAGTTTTCAGGTAGAGGCAGGGGTAAAGGCAAGGGCAAGCTGGCGGGCGACCAGAATAATGATTTCAACTCAGTATAAGGAGTTG
AGTTGTTACATTGAGGAAGAAAGAAGAAGAAGATTGGAAGTTTGA
Protein sequenceShow/hide protein sequence
MGHWDVQLMHHLHLHQEGEKQNPRSKASPPFLFPLVTVADRCHRTVKPIEAHPSRHSSRLQAVRPAPPAARIQKSCVVPRHARAAKPLLNHLAFFRSFAYILGRLDQLEL
ERGISRHKGNYCNLDLGASLLGKHIFFLLLGLVEQISRITTYLGPRSPTGRQSSMDIDMIRVIRWDPRSLIVLVFPSDSLQTSFQVEAGVKARASWRATRIMISTQYKEL
SCYIEEERRRRLEV