; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0008558 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0008558
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr09:14702156..14703138
RNA-Seq ExpressionPay0008558
SyntenyPay0008558
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042872.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.5e-8266.67Show/hide
Query:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +S  IFN+LM L+           +YEG ERIKGM              KDS+SITEYSDKLI I NK RALG DLSDSRL+QKILVSVP+RY+ATIA L
Subjt:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------
        ENTKDLSK KVIEVV+ LQ QEQ RLMRQEGSIEG LKARMQQ EGGKEKKWK NKG GK S ESLAKDV SACK  GKQNHP+FR WR PDVK      
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------

Query:  ------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL
                        CT HMTS K+LF+DLDTSFKS++KIGN AYLEVK KG VSIES   TKLITEVLFVPEIDQNL
Subjt:  ------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL

XP_008450210.1 PREDICTED: uncharacterized protein LOC103491872 [Cucumis melo]8.5e-8363.33Show/hide
Query:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +SP IFNRLM L+           +YEG ERIKGM              KDS+SITEYSDKLI I NK RALG DLSDSRL+QKILVSVP+RY+ATIA L
Subjt:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------
        ENTKDLSK KVIEVV+ALQ QEQ RLMRQEGSIEG LKARMQQ EGGKEKKWK NKG GK SMESLAKDV SACK  GKQNHP+FR WR PDVK      
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------

Query:  ---------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL
                                          D  CT HMTS K+LF++LDTSFKS++KIGN AYLEVK KG VSIES   TKLITEVLFVPEIDQNL
Subjt:  ---------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL

XP_016669911.2 uncharacterized protein LOC107889873 [Gossypium hirsutum]4.8e-7056.12Show/hide
Query:  MSPTIFNRLMTLD-----------DYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +SP IFNR+M              +Y+GDERIK               MK+SESI EYSDKLI+I NK RA G DLSDSRL+QKILVSVPE+Y+ATIA  
Subjt:  MSPTIFNRLMTLD-----------DYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEG----SIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVD-------SACKECGKQNHPYFRYWR
        ENTKDL++ KV+E+++ALQAQEQ RLM QEG    SIEG LKA+MQQ E  +E+KW   K       E++AK          S+CK CGK NHP+FR WR
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEG----SIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVD-------SACKECGKQNHPYFRYWR

Query:  RPDVK--------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE
        RPDVK               D  CT H+T D++LF+DLD S KSKV IGN  YLEVK +GIV+IESY  TKLI++VLFVPEIDQNLLSV QLVE
Subjt:  RPDVK--------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE

XP_022148138.1 uncharacterized protein LOC111016891 [Momordica charantia]2.0e-7657.01Show/hide
Query:  MSPTIFNRLMTL-----------DDYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +SP IFNR+M L           ++YEG+ERIKG              MKDSESI EYSDKLI I NKARALG DLS +RL+QKILVSVPERY+ATIA L
Subjt:  MSPTIFNRLMTL-----------DDYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------
        ENTKDLSK KVIEVV+ LQAQEQ RL+ QEGS+EG LKARMQ  EGG+E KWK  K  G  S E  +KDV SACK CGK NHP+FR WRRP VK      
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------

Query:  ----------------------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLI
                                                       D  CT HMTSDKELF+DLD SFKS+VKI N  YLEVK KG VSIES V TKLI
Subjt:  ----------------------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLI

Query:  TEVLFVPEIDQNLLSVSQLVE
         EVLFVPEIDQNLLSV QLV+
Subjt:  TEVLFVPEIDQNLLSVSQLVE

XP_038888301.1 LOW QUALITY PROTEIN: subtilisin-like protease SBT3.10 [Benincasa hispida]4.5e-6060.16Show/hide
Query:  DYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFLENTKDLSKRKVIEVVNALQAQEQM
        +Y+GDERIKG              MK+ ESI EYS KLI I NKARALG++L D+R  QKILVSVP++Y+ATI +LENT+DLS+ KVIEVV+ALQAQEQ 
Subjt:  DYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFLENTKDLSKRKVIEVVNALQAQEQM

Query:  RLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYF-RYWRRPDVKFDIRCTYHMTSDKELFEDLDTSFKSKVKI
        RLMRQEGSIEG LK R+QQ EGG+EK     KG G  S ES+AKD   A     +++  +    W       D  CT HMTSDK+LF+DLD SFKS+VKI
Subjt:  RLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYF-RYWRRPDVKFDIRCTYHMTSDKELFEDLDTSFKSKVKI

Query:  GNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE
        GN  YLEVK K  VSIES   TKLITEVLF+PEI+QNLL++ QLVE
Subjt:  GNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE

TrEMBL top hitse value%identityAlignment
A0A1S3BNQ3 uncharacterized protein LOC1034918724.1e-8363.33Show/hide
Query:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +SP IFNRLM L+           +YEG ERIKGM              KDS+SITEYSDKLI I NK RALG DLSDSRL+QKILVSVP+RY+ATIA L
Subjt:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------
        ENTKDLSK KVIEVV+ALQ QEQ RLMRQEGSIEG LKARMQQ EGGKEKKWK NKG GK SMESLAKDV SACK  GKQNHP+FR WR PDVK      
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------

Query:  ---------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL
                                          D  CT HMTS K+LF++LDTSFKS++KIGN AYLEVK KG VSIES   TKLITEVLFVPEIDQNL
Subjt:  ---------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL

A0A1U8HV73 uncharacterized protein LOC1078898732.8e-7156.46Show/hide
Query:  MSPTIFNRLMTLD-----------DYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +SP IFNR+M              +Y+GDERIK               MK+SESI EYSDKLI+I NK RA G DLSDSRL+QKILVSVPE+Y+ATIA  
Subjt:  MSPTIFNRLMTLD-----------DYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEG----SIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVD-------SACKECGKQNHPYFRYWR
        ENTKDL++ KV+E+++ALQAQEQ RLM QEG    SIEG LKA+MQQ E G+E+KW   K       E++AK          S+CK CGK NHP+FR WR
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEG----SIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVD-------SACKECGKQNHPYFRYWR

Query:  RPDVK--------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE
        RPDVK               D  CT H+T D++LF+DLD S KSKV IGN  YLEVK +GIV+IESY  TKLI++VLFVPEIDQNLLSV QLVE
Subjt:  RPDVK--------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE

A0A5D3C1N2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-8266.67Show/hide
Query:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +S  IFN+LM L+           +YEG ERIKGM              KDS+SITEYSDKLI I NK RALG DLSDSRL+QKILVSVP+RY+ATIA L
Subjt:  MSPTIFNRLMTLD-----------DYEGDERIKGM--------------KDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------
        ENTKDLSK KVIEVV+ LQ QEQ RLMRQEGSIEG LKARMQQ EGGKEKKWK NKG GK S ESLAKDV SACK  GKQNHP+FR WR PDVK      
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------

Query:  ------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL
                        CT HMTS K+LF+DLDTSFKS++KIGN AYLEVK KG VSIES   TKLITEVLFVPEIDQNL
Subjt:  ------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNL

A0A6J1D394 uncharacterized protein LOC1110168919.8e-7757.01Show/hide
Query:  MSPTIFNRLMTL-----------DDYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL
        +SP IFNR+M L           ++YEG+ERIKG              MKDSESI EYSDKLI I NKARALG DLS +RL+QKILVSVPERY+ATIA L
Subjt:  MSPTIFNRLMTL-----------DDYEGDERIKG--------------MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFL

Query:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------
        ENTKDLSK KVIEVV+ LQAQEQ RL+ QEGS+EG LKARMQ  EGG+E KWK  K  G  S E  +KDV SACK CGK NHP+FR WRRP VK      
Subjt:  ENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVK------

Query:  ----------------------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLI
                                                       D  CT HMTSDKELF+DLD SFKS+VKI N  YLEVK KG VSIES V TKLI
Subjt:  ----------------------------------------------FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLI

Query:  TEVLFVPEIDQNLLSVSQLVE
         EVLFVPEIDQNLLSV QLV+
Subjt:  TEVLFVPEIDQNLLSVSQLVE

A0A6P4PSP3 uncharacterized protein LOC1084815771.3e-5751.43Show/hide
Query:  MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFLENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGK
        MK+ ESI EYSDKLIEI NK R LG +LSDSRL+QKILVSV E+Y+ATI   E TKDL++ +V+E+++AL AQEQ RLMRQEGSIEG LKA+MQ  E  K
Subjt:  MKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFLENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEGVLKARMQQEEGGK

Query:  EKKWKENKGRGKRSMESLAK-------DVDSACKECGKQNHPYFRYWRRPDVK-----------------------------------------------
          K +  K     S+++ AK       + +S+CK CGKQNHPYFR WRRP+VK                                               
Subjt:  EKKWKENKGRGKRSMESLAK-------DVDSACKECGKQNHPYFRYWRRPDVK-----------------------------------------------

Query:  -----FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE
              D  CT HMT D++LF+DLD S KSKV+IGN  YLEVK +G V+IES   TKLI++VLFVPEIDQNLLSV QLVE
Subjt:  -----FDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRTKLITEVLFVPEIDQNLLSVSQLVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCTACTATATTCAATAGACTTATGACCTTAGATGATTATGAAGGTGATGAAAGGATTAAAGGCATGAAGGATTCTGAGTCCATCACGGAATACTCAGATAAGTT
GATTGAGATTCCTAATAAGGCAAGAGCATTAGGAATCGATTTATCTGACAGTAGATTGATTCAGAAGATCCTGGTTTCAGTACCTGAGCGATATAAAGCAACTATTGCTT
TCCTAGAAAATACTAAAGATCTCTCAAAACGCAAAGTGATAGAAGTAGTTAATGCTTTGCAAGCACAAGAGCAAATGAGGTTGATGAGGCAAGAAGGAAGCATTGAGGGA
GTATTGAAAGCTAGAATGCAGCAGGAAGAAGGTGGAAAAGAGAAGAAGTGGAAAGAGAATAAGGGAAGGGGCAAAAGGAGCATGGAGTCTCTTGCGAAGGATGTTGATAG
TGCATGCAAGGAGTGTGGGAAGCAGAATCATCCATATTTTAGATATTGGAGAAGGCCAGATGTGAAGTTTGATATCCGATGTACCTATCACATGACAAGTGACAAAGAGT
TGTTTGAGGACCTTGACACGTCATTTAAGTCAAAGGTGAAAATAGGAAACGATGCTTATTTAGAAGTAAAGGAGAAGGGCATAGTGTCGATAGAGAGTTATGTTAGAACC
AAGTTGATTACTGAAGTGTTGTTTGTCCCCGAGATTGATCAAAACTTGTTAAGTGTTAGTCAACTAGTAGAAGGGATTCAAAGTGTTGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCTACTATATTCAATAGACTTATGACCTTAGATGATTATGAAGGTGATGAAAGGATTAAAGGCATGAAGGATTCTGAGTCCATCACGGAATACTCAGATAAGTT
GATTGAGATTCCTAATAAGGCAAGAGCATTAGGAATCGATTTATCTGACAGTAGATTGATTCAGAAGATCCTGGTTTCAGTACCTGAGCGATATAAAGCAACTATTGCTT
TCCTAGAAAATACTAAAGATCTCTCAAAACGCAAAGTGATAGAAGTAGTTAATGCTTTGCAAGCACAAGAGCAAATGAGGTTGATGAGGCAAGAAGGAAGCATTGAGGGA
GTATTGAAAGCTAGAATGCAGCAGGAAGAAGGTGGAAAAGAGAAGAAGTGGAAAGAGAATAAGGGAAGGGGCAAAAGGAGCATGGAGTCTCTTGCGAAGGATGTTGATAG
TGCATGCAAGGAGTGTGGGAAGCAGAATCATCCATATTTTAGATATTGGAGAAGGCCAGATGTGAAGTTTGATATCCGATGTACCTATCACATGACAAGTGACAAAGAGT
TGTTTGAGGACCTTGACACGTCATTTAAGTCAAAGGTGAAAATAGGAAACGATGCTTATTTAGAAGTAAAGGAGAAGGGCATAGTGTCGATAGAGAGTTATGTTAGAACC
AAGTTGATTACTGAAGTGTTGTTTGTCCCCGAGATTGATCAAAACTTGTTAAGTGTTAGTCAACTAGTAGAAGGGATTCAAAGTGTTGTTTGA
Protein sequenceShow/hide protein sequence
MSPTIFNRLMTLDDYEGDERIKGMKDSESITEYSDKLIEIPNKARALGIDLSDSRLIQKILVSVPERYKATIAFLENTKDLSKRKVIEVVNALQAQEQMRLMRQEGSIEG
VLKARMQQEEGGKEKKWKENKGRGKRSMESLAKDVDSACKECGKQNHPYFRYWRRPDVKFDIRCTYHMTSDKELFEDLDTSFKSKVKIGNDAYLEVKEKGIVSIESYVRT
KLITEVLFVPEIDQNLLSVSQLVEGIQSVV