; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0014081 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0014081
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGag-proteinase polyprotein
Genome locationchr03:16690049..16695073
RNA-Seq ExpressionPI0014081
SyntenyPI0014081
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042403.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.0e-4339.23Show/hide
Query:  LERD------GWLFEQGFVKMDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID--------------------------------------------
        +ERD      G   EQG   M++I EG ST+R  VL   +Y+YWK   ++F+K++D                                            
Subjt:  LERD------GWLFEQGFVKMDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID--------------------------------------------

Query:  -------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV
                     +LR++   F++KVT I+EAHD   LKLDELFGSL TFE+ IS++ED K K IAF++ +EE+  D +  S  NVN+SIA+LT+QF KV
Subjt:  -------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV

Query:  VSNRDQNNYRRKDSDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLS
        V   D  N++R+D +       R+FKC EC GFGHYQAECPT+L++QKK F AT S+ E++DS  +     AFI  ++    +SI K E S    E++LS
Subjt:  VSNRDQNNYRRKDSDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLS

Query:  YDQLHKQWQED
        Y QL   W+ED
Subjt:  YDQLHKQWQED

KAA0042420.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.0e-3541.45Show/hide
Query:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV-------------
        +LR+L ++ +MKVTAI+EA DI +LKLDELFGSL TFE+++SD+E  K K IAF++ ++++  + +  +  N ++SIA+LT+QF K+             
Subjt:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV-------------

Query:  --VSNRDQNNYRRKDSD------KAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKE
              D  N  RK +D        +GK    + RSF+CRECEGFGHYQAECPTYL+RQKKI+CAT S+ + +D   + ++  AF + ++    E+   E
Subjt:  --VSNRDQNNYRRKDSD------KAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKE

Query:  PSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ
         S + E++ L+ ++L    +ED     IQK+RIQ
Subjt:  PSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ

TYK00987.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.2e-4239.52Show/hide
Query:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID---------------------------------------------------------MLRTLSR
        M++I EG ST+R  VL   +Y+YWK   ++F+K++D                                                         +LR++  
Subjt:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID---------------------------------------------------------MLRTLSR

Query:  RFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKDSDKAFGK
         F++KVT I+EAHD   LKLDELFGSL TFE+ IS++ED K K IAF++ +EE+  D +  S  NVN+SIA+LT+QF KVV   D  N++R+D +     
Subjt:  RFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKDSDKAFGK

Query:  IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLSYDQLHKQWQED
          R+FKC EC GFGHYQAECPT+L++QKK F AT S+ E++DS  +     AFI  ++    +SI K E S    E++LSY QL   W+ED
Subjt:  IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLSYDQLHKQWQED

TYK14780.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]3.5e-3634.35Show/hide
Query:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID----------------------------------------------------------------
        M++I+E  S +RP VLDG +Y+YWK R+I FIK++D                                                                
Subjt:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID----------------------------------------------------------------

Query:  ---------------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAM
                             +L +LSR+F+MKVTAI EAHDI +LKLDELFGSL T E++IS++E+ K K I F++I+EE+ +  +  +  N+++SIA+
Subjt:  ---------------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAM

Query:  LTRQFPKVV-----------SNRDQNNYRRKDS--------------DKAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDS
        LT+QF KVV           + ++ N YRRKD               D  +GK      R F+CREC G G YQAECP +L+RQKK F AT S +E  D+
Subjt:  LTRQFPKVV-----------SNRDQNNYRRKDS--------------DKAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDS

Query:  GTETNNFRAF-ISILSND-GFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ
          + N  +AF + I+  D G ES   E +    ++ L++++L   W+ED     IQKERIQ
Subjt:  GTETNNFRAF-ISILSND-GFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ

XP_016903608.1 PREDICTED: uncharacterized protein LOC107992254 [Cucumis melo]1.1e-3744.5Show/hide
Query:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKD
        +LR+L R+F+MKVTAI+EA DI +LKLDELFGSL TFE++ISD+E  K K IAF+++++++    +  +  N ++S+A+LT+QF K+   R+ ++ ++K+
Subjt:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKD

Query:  SDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSV
               +  SF+CRECEG GHYQAECP YL RQKK +CAT S+ ES D+  + +   AF + ++    E+   E S + E++ L+ ++L    +ED   
Subjt:  SDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSV

Query:  LVIQKERIQ
          IQKERIQ
Subjt:  LVIQKERIQ

TrEMBL top hitse value%identityAlignment
A0A1S4E5V5 uncharacterized protein LOC1079922545.2e-3844.5Show/hide
Query:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKD
        +LR+L R+F+MKVTAI+EA DI +LKLDELFGSL TFE++ISD+E  K K IAF+++++++    +  +  N ++S+A+LT+QF K+   R+ ++ ++K+
Subjt:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKD

Query:  SDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSV
               +  SF+CRECEG GHYQAECP YL RQKK +CAT S+ ES D+  + +   AF + ++    E+   E S + E++ L+ ++L    +ED   
Subjt:  SDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSV

Query:  LVIQKERIQ
          IQKERIQ
Subjt:  LVIQKERIQ

A0A5A7TGG3 Gag-proteinase polyprotein4.9e-3641.45Show/hide
Query:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV-------------
        +LR+L ++ +MKVTAI+EA DI +LKLDELFGSL TFE+++SD+E  K K IAF++ ++++  + +  +  N ++SIA+LT+QF K+             
Subjt:  MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV-------------

Query:  --VSNRDQNNYRRKDSD------KAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKE
              D  N  RK +D        +GK    + RSF+CRECEGFGHYQAECPTYL+RQKKI+CAT S+ + +D   + ++  AF + ++    E+   E
Subjt:  --VSNRDQNNYRRKDSD------KAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKE

Query:  PSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ
         S + E++ L+ ++L    +ED     IQK+RIQ
Subjt:  PSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ

A0A5A7TGP5 Gag-pol polyprotein4.9e-4439.23Show/hide
Query:  LERD------GWLFEQGFVKMDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID--------------------------------------------
        +ERD      G   EQG   M++I EG ST+R  VL   +Y+YWK   ++F+K++D                                            
Subjt:  LERD------GWLFEQGFVKMDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID--------------------------------------------

Query:  -------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV
                     +LR++   F++KVT I+EAHD   LKLDELFGSL TFE+ IS++ED K K IAF++ +EE+  D +  S  NVN+SIA+LT+QF KV
Subjt:  -------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKV

Query:  VSNRDQNNYRRKDSDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLS
        V   D  N++R+D +       R+FKC EC GFGHYQAECPT+L++QKK F AT S+ E++DS  +     AFI  ++    +SI K E S    E++LS
Subjt:  VSNRDQNNYRRKDSDKAFGKIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLS

Query:  YDQLHKQWQED
        Y QL   W+ED
Subjt:  YDQLHKQWQED

A0A5D3BNW5 Gag-pol polyprotein1.6e-4239.52Show/hide
Query:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID---------------------------------------------------------MLRTLSR
        M++I EG ST+R  VL   +Y+YWK   ++F+K++D                                                         +LR++  
Subjt:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID---------------------------------------------------------MLRTLSR

Query:  RFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKDSDKAFGK
         F++KVT I+EAHD   LKLDELFGSL TFE+ IS++ED K K IAF++ +EE+  D +  S  NVN+SIA+LT+QF KVV   D  N++R+D +     
Subjt:  RFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKDSDKAFGK

Query:  IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLSYDQLHKQWQED
          R+FKC EC GFGHYQAECPT+L++QKK F AT S+ E++DS  +     AFI  ++    +SI K E S    E++LSY QL   W+ED
Subjt:  IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIK-EPSLVVEEDTLSYDQLHKQWQED

A0A5D3CTL4 Gag-proteinase polyprotein1.7e-3634.35Show/hide
Query:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID----------------------------------------------------------------
        M++I+E  S +RP VLDG +Y+YWK R+I FIK++D                                                                
Subjt:  MDMIKEGGSTTRPLVLDGTDYAYWKARIIAFIKSID----------------------------------------------------------------

Query:  ---------------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAM
                             +L +LSR+F+MKVTAI EAHDI +LKLDELFGSL T E++IS++E+ K K I F++I+EE+ +  +  +  N+++SIA+
Subjt:  ---------------------MLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAM

Query:  LTRQFPKVV-----------SNRDQNNYRRKDS--------------DKAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDS
        LT+QF KVV           + ++ N YRRKD               D  +GK      R F+CREC G G YQAECP +L+RQKK F AT S +E  D+
Subjt:  LTRQFPKVV-----------SNRDQNNYRRKDS--------------DKAFGK----IDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDS

Query:  GTETNNFRAF-ISILSND-GFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ
          + N  +AF + I+  D G ES   E +    ++ L++++L   W+ED     IQKERIQ
Subjt:  GTETNNFRAF-ISILSND-GFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAGTTTTCGGTGAACTCACCCGCAACGAGAACTCTACGGGGCAAAGCTTGGCGAAAGGGAGGCCTTACGCCGCACGGCTAGACTATGCGAACACAACCCAGGT
AACCCAACAAGCAGCAAGGACTTCCGGCGGTACGGGCAGACGGAGAGAGCTCTGGACGGCAAGACAACGCGCAACACTAGGGCTTGAGAGAGACGGCTGGCTTTTTGAGC
AAGGTTTTGTCAAAATGGATATGATCAAAGAAGGAGGATCTACAACTCGACCACTAGTTCTTGATGGAACCGACTATGCTTATTGGAAAGCTCGTATAATTGCCTTTATC
AAATCGATTGACATGTTAAGAACTTTGTCCAGAAGATTTAATATGAAAGTTACTGCCATTGATGAAGCTCATGATATTGAAAGTTTGAAACTAGATGAGTTATTTGGTTC
CTTACGAACGTTTGAAATTTCTATATCTGATAAAGAAGATATTAAGGACAAGGAAATAGCTTTTCAAACTATTCATGAAGAAGATCTTGTTGATAAGGAAAAATATTCAA
CTGATAATGTGAATGATTCTATTGCTATGTTGACAAGACAGTTTCCTAAAGTTGTTAGCAATAGGGATCAGAACAACTATAGAAGGAAAGATTCTGATAAAGCATTTGGG
AAAATAGACAGATCCTTCAAGTGTCGAGAATGTGAAGGCTTTGGGCATTATCAAGCAGAATGTCCAACTTATTTAAAAAGGCAGAAGAAAATTTTTTGTGCAACTTTCTC
AAATAATGAGTCAGAAGATAGTGGAACTGAAACTAACAACTTTCGTGCTTTTATCAGTATTCTCTCAAATGACGGTTTTGAATCAATTATCAAAGAACCATCTTTAGTTG
TTGAGGAAGATACTTTGTCTTATGATCAGCTTCATAAACAATGGCAGGAGGATTTGTCAGTTCTAGTCATTCAGAAAGAAAGAATACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAGTTTTCGGTGAACTCACCCGCAACGAGAACTCTACGGGGCAAAGCTTGGCGAAAGGGAGGCCTTACGCCGCACGGCTAGACTATGCGAACACAACCCAGGT
AACCCAACAAGCAGCAAGGACTTCCGGCGGTACGGGCAGACGGAGAGAGCTCTGGACGGCAAGACAACGCGCAACACTAGGGCTTGAGAGAGACGGCTGGCTTTTTGAGC
AAGGTTTTGTCAAAATGGATATGATCAAAGAAGGAGGATCTACAACTCGACCACTAGTTCTTGATGGAACCGACTATGCTTATTGGAAAGCTCGTATAATTGCCTTTATC
AAATCGATTGACATGTTAAGAACTTTGTCCAGAAGATTTAATATGAAAGTTACTGCCATTGATGAAGCTCATGATATTGAAAGTTTGAAACTAGATGAGTTATTTGGTTC
CTTACGAACGTTTGAAATTTCTATATCTGATAAAGAAGATATTAAGGACAAGGAAATAGCTTTTCAAACTATTCATGAAGAAGATCTTGTTGATAAGGAAAAATATTCAA
CTGATAATGTGAATGATTCTATTGCTATGTTGACAAGACAGTTTCCTAAAGTTGTTAGCAATAGGGATCAGAACAACTATAGAAGGAAAGATTCTGATAAAGCATTTGGG
AAAATAGACAGATCCTTCAAGTGTCGAGAATGTGAAGGCTTTGGGCATTATCAAGCAGAATGTCCAACTTATTTAAAAAGGCAGAAGAAAATTTTTTGTGCAACTTTCTC
AAATAATGAGTCAGAAGATAGTGGAACTGAAACTAACAACTTTCGTGCTTTTATCAGTATTCTCTCAAATGACGGTTTTGAATCAATTATCAAAGAACCATCTTTAGTTG
TTGAGGAAGATACTTTGTCTTATGATCAGCTTCATAAACAATGGCAGGAGGATTTGTCAGTTCTAGTCATTCAGAAAGAAAGAATACAATAA
Protein sequenceShow/hide protein sequence
MGKVFGELTRNENSTGQSLAKGRPYAARLDYANTTQVTQQAARTSGGTGRRRELWTARQRATLGLERDGWLFEQGFVKMDMIKEGGSTTRPLVLDGTDYAYWKARIIAFI
KSIDMLRTLSRRFNMKVTAIDEAHDIESLKLDELFGSLRTFEISISDKEDIKDKEIAFQTIHEEDLVDKEKYSTDNVNDSIAMLTRQFPKVVSNRDQNNYRRKDSDKAFG
KIDRSFKCRECEGFGHYQAECPTYLKRQKKIFCATFSNNESEDSGTETNNFRAFISILSNDGFESIIKEPSLVVEEDTLSYDQLHKQWQEDLSVLVIQKERIQ