CuGenDBv2

Pay0019663 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0019663
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Retrotransposon protein
Genome location: chr02:14245813..14247087
RNA-Seq Expression: Pay0019663
Synteny: Pay0019663
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
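
The genome location above is written as `chromosome:start..end`. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr02:14245813..14247087")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1275 bp on chr02.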


Homology
GenBank top hits | e value | % identity
KAA0048191.1 retrotransposon protein [Cucumis melo var. makuwa] | 2.4e-67 | 70.26
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQ---------NCLGALAGT
        T+GGLEATQYVDVEEM AIFLHIVAHDVKNRVARRHFARS  TVSRHFN VLNAVLR+HEILLKQPD VTHSCSHEK+RWFQ           + AL GT
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQ---------NCLGALAGT

Query:  HIKVN-----------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
        HIKVN                             FIFVMPGWEGSASDSRVLRD VSRP  LKVPKGYYYLCDA Y N EGFLAPYRGQRYHL E
Subjt:  HIKVN-----------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

KAA0050107.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] | 1.2e-82 | 84.41
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
                                 FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

KAA0065306.1 retrotransposon protein [Cucumis melo var. makuwa] | 7.8e-74 | 77.96
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        TKGGLEATQYVDVEEM+AIFLHIVAHDVKNRV RRHFARSGETVSRHF    NAVLRLHEILLKQPDPVT+SCSHEK+RWFQ CLGAL GTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
                                 FIFVMPGWEGSASDSRVLRDAVSR T LKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

XP_008452358.1 PREDICTED: uncharacterized protein LOC103493418 [Cucumis melo] | 3.5e-66 | 72.04
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        T+GGLE TQYVDVEEMVAIF HIVAHDVKNRVARRHFARSGET+SRHFNAV NAVLRLHEI LKQPDPVTHSCSHEK++WFQNCL AL GTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
                                 FIFVMP W GS S SRVLRD V    SLKV KGYYYLCDAGYPNAEGFL  YRGQRY+LTE
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

XP_008463037.1 PREDICTED: uncharacterized protein LOC103501276 [Cucumis melo] | 1.4e-67 | 75.57
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        T+GGLEATQYVDVEEMVAI LHIVAHDVKN+VARR+FARS ETVSRH N VLNAVLRLHEILLKQPDPVTHSCSHEK+RWFQNCLGAL GTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAP
                                 +IFVMPGWEGSASDSRVLRD VSRPT LKVPKGYYYLCDAGY NAEGF AP
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAP

TrEMBL top hits | e value | % identity
A0A1S3BTL5 uncharacterized protein LOC103493418 | 1.7e-66 | 72.04
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        T+GGLE TQYVDVEEMVAIF HIVAHDVKNRVARRHFARSGET+SRHFNAV NAVLRLHEI LKQPDPVTHSCSHEK++WFQNCL AL GTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
                                 FIFVMP W GS S SRVLRD V    SLKV KGYYYLCDAGYPNAEGFL  YRGQRY+LTE
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

A0A1S3CJV8 uncharacterized protein LOC103501276 | 6.9e-68 | 75.57
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        T+GGLEATQYVDVEEMVAI LHIVAHDVKN+VARR+FARS ETVSRH N VLNAVLRLHEILLKQPDPVTHSCSHEK+RWFQNCLGAL GTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAP
                                 +IFVMPGWEGSASDSRVLRD VSRPT LKVPKGYYYLCDAGY NAEGF AP
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAP

A0A5A7U3R2 Retrotransposon protein | 1.2e-67 | 70.26
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQ---------NCLGALAGT
        T+GGLEATQYVDVEEM AIFLHIVAHDVKNRVARRHFARS  TVSRHFN VLNAVLR+HEILLKQPD VTHSCSHEK+RWFQ           + AL GT
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQ---------NCLGALAGT

Query:  HIKVN-----------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
        HIKVN                             FIFVMPGWEGSASDSRVLRD VSRP  LKVPKGYYYLCDA Y N EGFLAPYRGQRYHL E
Subjt:  HIKVN-----------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

A0A5A7U6W3 Putative nuclease HARBI1 | 5.8e-83 | 84.41
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
                                 FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

A0A5A7VG45 Retrotransposon protein | 3.8e-74 | 77.96
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----
        TKGGLEATQYVDVEEM+AIFLHIVAHDVKNRV RRHFARSGETVSRHF    NAVLRLHEILLKQPDPVT+SCSHEK+RWFQ CLGAL GTHIKVN    
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVN----

Query:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
                                 FIFVMPGWEGSASDSRVLRDAVSR T LKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
Subjt:  -------------------------FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G43722.1 unknown protein | 1.3e-15 | 29.15
Query:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPD-------PVTHSCSHEKYRWFQNCLGALAGTHI
        T   L+ T  + +EE VA+FL I  H+   R     F R+ ETV R F  VL A   L    ++ P        P         + +F   +GA+ GTH+
Subjt:  TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPD-------PVTHSCSHEKYRWFQNCLGALAGTHI

Query:  -----------------------------KVNFIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKG-YYYLCDAGYPNAEGFLAPYRGQ-----RYHLTE
                                     K+ F ++  G  GS  D+ VL+ A    +   +P    YYL D+GYPN +G LAPYR       RYH+++
Subjt:  -----------------------------KVNFIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKG-YYYLCDAGYPNAEGFLAPYRGQ-----RYHLTE

AT5G28730.1 unknown protein | 6.2e-13 | 32.3
Query:  KGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRL-----HEILLKQPDPVTHSCSHEKYRW--FQNCLG-----ALA
        K GL+++  + ++E VAIFL I A +   R     F  + ET+ R F+ VL A+ RL         +++   +++    +   W    + LG      LA
Subjt:  KGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRL-----HEILLKQPDPVTHSCSHEKYRW--FQNCLG-----ALA

Query:  GTHIKVNFIFVMPGWEGSASDSRVLRDAVSRPTSLKV-PKGYYYLCDAGYPNAEGFLAPYR
           + + F +   G  GS  D+RVL  A+S      V P   YYL D+GY N  G+LAPYR
Subjt:  GTHIKVNFIFVMPGWEGSASDSRVLRDAVSRPTSLKV-PKGYYYLCDAGYPNAEGFLAPYR

AT5G28950.1 unknown protein | 1.6e-05 | 33.33
Query:  YRWFQNCLGALAGTHI-----------------------------KVNFIFVMPGWEGSASDSRVLRDAVSRPTS-LKVPK
        Y +F++C+GA+  THI                              V F++V+ GWEGSA DS+VL DA++R ++ L VP+
Subjt:  YRWFQNCLGALAGTHI-----------------------------KVNFIFVMPGWEGSASDSRVLRDAVSRPTS-LKVPK

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | 2.6e-11 | 52.46
Query:  FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
        FI+V+ GWEGSA DSRVL DA+ +          +YL D G+ N   FLAP+RG RYHL E
Subjt:  FIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | 1.5e-27 | 33.97
Query:  YIRLNDRQDDLFNGFSNELMIVEDTHNL--TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVT
        Y  LN   +  F  F  +  +     +L  T+G L  T  + +E  +AIFL I+ H+++ R  +  F  SGET+SRHFN VLNAV+ + +    QP+  +
Subjt:  YIRLNDRQDDLFNGFSNELMIVEDTHNL--TKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEILLKQPDPVT

Query:  HSCSHEKYRWFQNCLGALAGTHIKV-----------------------------NFIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNA
         +  ++   +F++C+G +   HI V                              F +V+ GWEGSASD +VL  A++R   L+VP+G YY+ D  YPN 
Subjt:  HSCSHEKYRWFQNCLGALAGTHIKV-----------------------------NFIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNA

Query:  EGFLAPYRG
         GF+APY G
Subjt:  EGFLAPYRG


Sequences
CDS sequence
ATGCCTCCTTACCCGATCAAGACACAAAAATATATACCGATAGCATGTTGCACAGTTCACAATTATATTAGATTGAATGATCGTCAAGATGATCTTTTCAATGGCTTTAG
CAATGAATTAATGATCGTTGAAGATACACATAATTTAACAAAAGGGGGATTGGAAGCCACTCAATATGTGGATGTTGAAGAGATGGTTGCGATTTTTTTGCATATTGTAG
CACACGATGTTAAGAATAGGGTAGCTCGACGACATTTTGCAAGGTCTGGTGAAACAGTGTCAAGACACTTCAATGCTGTTCTTAACGCAGTTCTTAGACTCCATGAAATC
CTTCTTAAGCAACCTGATCCAGTGACCCATTCATGTTCGCATGAGAAGTATCGATGGTTTCAGAATTGTCTAGGTGCATTAGCTGGTACACACATCAAAGTGAATTTCAT
ATTTGTCATGCCTGGATGGGAAGGGTCTGCATCTGATTCGAGAGTACTTAGAGATGCAGTGTCACGACCTACTAGTCTAAAAGTCCCAAAGGGTTACTACTACTTGTGTG
ATGCTGGTTATCCAAATGCTGAAGGATTCCTCGCTCCATACCGAGGTCAACGTTACCATCTAACTGAATGA
mRNA sequence
ATGCCTCCTTACCCGATCAAGACACAAAAATATATACCGATAGCATGTTGCACAGTTCACAATTATATTAGATTGAATGATCGTCAAGATGATCTTTTCAATGGCTTTAG
CAATGAATTAATGATCGTTGAAGATACACATAATTTAACAAAAGGGGGATTGGAAGCCACTCAATATGTGGATGTTGAAGAGATGGTTGCGATTTTTTTGCATATTGTAG
CACACGATGTTAAGAATAGGGTAGCTCGACGACATTTTGCAAGGTCTGGTGAAACAGTGTCAAGACACTTCAATGCTGTTCTTAACGCAGTTCTTAGACTCCATGAAATC
CTTCTTAAGCAACCTGATCCAGTGACCCATTCATGTTCGCATGAGAAGTATCGATGGTTTCAGAATTGTCTAGGTGCATTAGCTGGTACACACATCAAAGTGAATTTCAT
ATTTGTCATGCCTGGATGGGAAGGGTCTGCATCTGATTCGAGAGTACTTAGAGATGCAGTGTCACGACCTACTAGTCTAAAAGTCCCAAAGGGTTACTACTACTTGTGTG
ATGCTGGTTATCCAAATGCTGAAGGATTCCTCGCTCCATACCGAGGTCAACGTTACCATCTAACTGAATGA
Protein sequence
MPPYPIKTQKYIPIACCTVHNYIRLNDRQDDLFNGFSNELMIVEDTHNLTKGGLEATQYVDVEEMVAIFLHIVAHDVKNRVARRHFARSGETVSRHFNAVLNAVLRLHEI
LLKQPDPVTHSCSHEKYRWFQNCLGALAGTHIKVNFIFVMPGWEGSASDSRVLRDAVSRPTSLKVPKGYYYLCDAGYPNAEGFLAPYRGQRYHLTE
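
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1). The page does not say which table or tool produced the protein, and `translate` is a hypothetical helper, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G at
# each position and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nucleotides of the CDS listed above.
print(translate("ATGCCTCCTTACCCGATCAAGACA"))  # MPPYPIKT
```

Pasting the full CDS into `translate` (line breaks are ignored) and comparing the result with the protein sequence is a quick consistency check for the record.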