; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0017413 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0017413
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionPol polyprotein
Genome locationchr03:1863630..1865432
RNA-Seq ExpressionPay0017413
SyntenyPay0017413
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU47094.1 hypothetical protein TSUD_369270 [Trifolium subterraneum]1.6e-6050.39Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------
        +DFMGPFP SFG+LYILLAVDYVSKWVEAIPTRTND+ VV+ F+ SNIF RFGIPR IISDQGTHFCNRT                              
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------

Query:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK
                                                   LV+GK CHLPV+IEH+AYWA++ CN+ + EAG+++ L LQ+L+E RLEAYE+SRIYK
Subjt:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK

Query:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK
        EKTK  HDK I RKEF +GQ+VLL+N S+KLM GKL+SKW  PF V ++   G + IK
Subjt:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK

KYP31222.1 Pol polyprotein [Cajanus cajan]1.3e-6256.58Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRTL----------------VYGKPCHLPVEIE
        +DFMGPFP SFG+ YILLAVDYVSKWVEA  TRTND+ VV  F+ SNIF RFG+PR I+SDQGTHFCNR++                V+GK CHLPVE+E
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRTL----------------VYGKPCHLPVEIE

Query:  HKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSI
        H+AYWA++ CN S+ +AGE++ L L EL E RLEAYENS+ YKEKTK  HD  I RK+F +GQKV LYN  ++LM GKL+SKW  PFVV ++  +G V I
Subjt:  HKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSI

Query:  KKRL--NSFVVTSKERSIIEPNRLNPRL
        K      SF V       ++P   NP L
Subjt:  KKRL--NSFVVTSKERSIIEPNRLNPRL

RDY02854.1 Retrovirus-related Pol polyprotein, partial [Mucuna pruriens]7.3e-6157.82Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT---------------------------LVY
        +DFMGPFP S  Y YILLAVDYVS+WVEAI TRTND+ VV  FL SNIF RFG+P+ +ISDQG+HFCNR                            +V+
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT---------------------------LVY

Query:  GKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVV
        GK CHLPVE+EHKAYWA++QCNL+  +AGE++   LQEL E RLEAYENSRIYK+K K  HD++ILRKEF +GQKVLL+N  +KL+ GKL+S+W RPFV+
Subjt:  GKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVV

Query:  IDISTFGVVSI
         +I   G V +
Subjt:  IDISTFGVVSI

XP_028798815.1 LOW QUALITY PROTEIN: uncharacterized protein LOC114754213 [Prosopis alba]9.6e-6149.81Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------
        +DFMGPFP+SFGY+YILLAVDYVSKWVEAIPTRTNDS VV  F+ S+IF RFGIPR IISDQGTHFCNR+                              
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------

Query:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK
                                                   +V+GK CHLPVE+EHKAYWAI+ C+L L  AG ++ L LQEL+E R EAYEN+RIYK
Subjt:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK

Query:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIKK
        E+TK +HDK ILR+EF +GQKVLLY   +K MPGKL+S+W  PF+V ++  +G V IK+
Subjt:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIKK

XP_028802716.1 uncharacterized protein LOC114757797 [Prosopis alba]3.3e-6150.19Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------
        +DFMGPFP+SFGY+YILLAVDYVSKWVEAIPTRTNDS VV  F+ S+IF RFGIPR IISDQGTHFCNR+                              
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------

Query:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK
                                                   +V+GK CHLPVE+EHKAYWAI+ C+L L  AG ++ L LQEL+E R EAYEN+RIYK
Subjt:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK

Query:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIKK
        E+TK +HDK ILR+EF +GQKVLLY   +KLMPGKL+S+W  PF+V ++  +G V IK+
Subjt:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIKK

TrEMBL top hitse value%identityAlignment
A0A151QLT1 Pol polyprotein6.5e-6356.58Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRTL----------------VYGKPCHLPVEIE
        +DFMGPFP SFG+ YILLAVDYVSKWVEA  TRTND+ VV  F+ SNIF RFG+PR I+SDQGTHFCNR++                V+GK CHLPVE+E
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRTL----------------VYGKPCHLPVEIE

Query:  HKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSI
        H+AYWA++ CN S+ +AGE++ L L EL E RLEAYENS+ YKEKTK  HD  I RK+F +GQKV LYN  ++LM GKL+SKW  PFVV ++  +G V I
Subjt:  HKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSI

Query:  KKRL--NSFVVTSKERSIIEPNRLNPRL
        K      SF V       ++P   NP L
Subjt:  KKRL--NSFVVTSKERSIIEPNRLNPRL

A0A2Z6NS99 Integrase catalytic domain-containing protein7.9e-6150.39Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------
        +DFMGPFP SFG+LYILLAVDYVSKWVEAIPTRTND+ VV+ F+ SNIF RFGIPR IISDQGTHFCNRT                              
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------

Query:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK
                                                   LV+GK CHLPV+IEH+AYWA++ CN+ + EAG+++ L LQ+L+E RLEAYE+SRIYK
Subjt:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK

Query:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK
        EKTK  HDK I RKEF +GQ+VLL+N S+KLM GKL+SKW  PF V ++   G + IK
Subjt:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK

A0A371HJF2 Retrovirus-related Pol polyprotein (Fragment)3.6e-6157.82Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT---------------------------LVY
        +DFMGPFP S  Y YILLAVDYVS+WVEAI TRTND+ VV  FL SNIF RFG+P+ +ISDQG+HFCNR                            +V+
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT---------------------------LVY

Query:  GKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVV
        GK CHLPVE+EHKAYWA++QCNL+  +AGE++   LQEL E RLEAYENSRIYK+K K  HD++ILRKEF +GQKVLL+N  +KL+ GKL+S+W RPFV+
Subjt:  GKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVV

Query:  IDISTFGVVSI
         +I   G V +
Subjt:  IDISTFGVVSI

A0A6P6TWR2 uncharacterized protein LOC1137046581.5e-5948.51Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------
        +DFMGPFP+SFG+LYILLAVDYVSKWVEA  TRTNDS VV+ F+ SNIF RFG+PR I+SD+GTHFCNRT                              
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------

Query:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK
                                                   LV+GKPCHLPVE EHKA+WAI+QCN++L EAG ++ L+LQEL+E R EAYEN+ IYK
Subjt:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK

Query:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK--KRLNSFVV
        EK++  HD++I RK FE+GQKVLLY   +KL PGKL+S+W  PF+V  +  +G V I+  K  N FVV
Subjt:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK--KRLNSFVV

A0A6P6V9E5 uncharacterized protein LOC1137185141.5e-5948.51Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------
        +DFMGPFP+SFG+LYILLAVDYVSKWVEA  TRTNDS VV+ F+ SNIF RFG+PR I+SD+GTHFCNRT                              
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRT------------------------------

Query:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK
                                                   LV+GKPCHLPVE EHKA+WAI+QCN++L EAG ++ L+LQEL+E R EAYEN+ IYK
Subjt:  -------------------------------------------LVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQELKEFRLEAYENSRIYK

Query:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK--KRLNSFVV
        EK++  HD++I RK FE+GQKVLLY   +KL PGKL+S+W  PF+V  +  +G V I+  K  N FVV
Subjt:  EKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIK--KRLNSFVV

SwissProt top hitse value%identityAlignment
A1Z651 Gag-Pol polyprotein4.0e-0943.48Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR
        +DF    P  +GY Y+L+ VD  S WVEA PT+   + VVS+ L+ +IF RFG+P+ + SD G  F ++
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR

P03356 Gag-Pol polyprotein6.8e-0943.48Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR
        +DF    P  +GY Y+L+ VD  S WVEA PT+   + VVS+ L+  IF RFG+P+ + SD G  F ++
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR

Q2F7J0 Gag-Pol polyprotein4.0e-0943.48Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR
        +DF    P  +GY Y+L+ VD  S WVEA PT+   + VVS+ L+ +IF RFG+P+ + SD G  F ++
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR

Q2F7J3 Gag-Pol polyprotein8.8e-0942.03Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR
        +DF    P  +GY Y+L+ VD  S WVEA PT+   + VV++ L+ +IF RFG+P+ + SD G  F ++
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR

Q7SVK7 Gag-pol polyprotein6.8e-0943.48Show/hide
Query:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR
        +DF    P  +GY Y+L+ VD  S WVEA PT+   + VVS+ L+  IF RFG+P+ + SD G  F ++
Subjt:  MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTATGGGTCCCTTCCCTTCTTCTTTTGGATATCTTTACATTTTATTAGCCGTTGACTATGTATCGAAGTGGGTTGAAGCAATCCCCACTAGGACTAATGATTC
TTTCGTTGTCTCAAGATTTCTAGTTTCTAATATATTTTCTAGATTTGGTATCCCAAGGGAAATCATTAGCGACCAAGGAACACACTTTTGCAATCGAACCCTTGTGTACG
GTAAGCCTTGTCATCTCCCTGTAGAAATAGAACATAAAGCATATTGGGCAATTAGACAATGCAATTTATCTCTCCTAGAAGCCGGGGAGAAAAAATTCCTCAATTTGCAA
GAATTAAAAGAATTTAGATTAGAAGCATATGAAAATTCTAGGATTTACAAAGAAAAGACTAAACTTTTGCATGATAAAAAAATCCTAAGAAAAGAATTTGAAATAGGGCA
AAAAGTTCTTTTATATAATTTTTCTATTAAACTCATGCCCGGAAAGTTAAAATCTAAATGGTTTCGTCCTTTTGTTGTGATTGATATCTCTACTTTTGGTGTAGTTTCCA
TAAAAAAACGCTTAAATTCGTTTGTTGTAACCTCGAAAGAACGGTCAATCATAGAACCTAATAGATTAAATCCAAGATTAGAGTTAGCCAAGCCGGAACTAAGCAAAAAG
CTTGCTACCAAAACTAGAATCCTTAAAAAAAAAAAATGTAGCAAAATGCATACTTGTAGCAAAATACTTTTATATAAAATAAAAAGTACGATTAAGTTGAACGTTAGGAT
TTTTTTCGTGATCAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTATGGGTCCCTTCCCTTCTTCTTTTGGATATCTTTACATTTTATTAGCCGTTGACTATGTATCGAAGTGGGTTGAAGCAATCCCCACTAGGACTAATGATTC
TTTCGTTGTCTCAAGATTTCTAGTTTCTAATATATTTTCTAGATTTGGTATCCCAAGGGAAATCATTAGCGACCAAGGAACACACTTTTGCAATCGAACCCTTGTGTACG
GTAAGCCTTGTCATCTCCCTGTAGAAATAGAACATAAAGCATATTGGGCAATTAGACAATGCAATTTATCTCTCCTAGAAGCCGGGGAGAAAAAATTCCTCAATTTGCAA
GAATTAAAAGAATTTAGATTAGAAGCATATGAAAATTCTAGGATTTACAAAGAAAAGACTAAACTTTTGCATGATAAAAAAATCCTAAGAAAAGAATTTGAAATAGGGCA
AAAAGTTCTTTTATATAATTTTTCTATTAAACTCATGCCCGGAAAGTTAAAATCTAAATGGTTTCGTCCTTTTGTTGTGATTGATATCTCTACTTTTGGTGTAGTTTCCA
TAAAAAAACGCTTAAATTCGTTTGTTGTAACCTCGAAAGAACGGTCAATCATAGAACCTAATAGATTAAATCCAAGATTAGAGTTAGCCAAGCCGGAACTAAGCAAAAAG
CTTGCTACCAAAACTAGAATCCTTAAAAAAAAAAAATGTAGCAAAATGCATACTTGTAGCAAAATACTTTTATATAAAATAAAAAGTACGATTAAGTTGAACGTTAGGAT
TTTTTTCGTGATCAGTTAA
Protein sequenceShow/hide protein sequence
MDFMGPFPSSFGYLYILLAVDYVSKWVEAIPTRTNDSFVVSRFLVSNIFSRFGIPREIISDQGTHFCNRTLVYGKPCHLPVEIEHKAYWAIRQCNLSLLEAGEKKFLNLQ
ELKEFRLEAYENSRIYKEKTKLLHDKKILRKEFEIGQKVLLYNFSIKLMPGKLKSKWFRPFVVIDISTFGVVSIKKRLNSFVVTSKERSIIEPNRLNPRLELAKPELSKK
LATKTRILKKKKCSKMHTCSKILLYKIKSTIKLNVRIFFVIS