; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0006841 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0006841
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGag-pol polyprotein
Genome locationchr11:25398491..25399462
RNA-Seq ExpressionPI0006841
SyntenyPI0006841
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043382.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.0e-2542.05Show/hide
Query:  NKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQ--LLNNWTED
        +K ++C ECEGFGH Q+EC  +L+ KKKS+  TL DEE+ L+SDD+E   ALI  I  ++  ++   +   D  A       + +K  ++  L   W ED
Subjt:  NKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQ--LLNNWTED

Query:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSE
        Q  +  Q++RIQYL EEN    S I +LK+ELK      E L+KSV+M++  T  LD LL  GK  +++ GLGFSE
Subjt:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSE

KAA0045252.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]3.4e-2436.82Show/hide
Query:  GKRRCSALDWKCRDSTNSSCQINKN------------------LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESK
        G    + ++++ +D  N++ +IN+N                   +CRECEG GHYQAECP FLRR+KK+   TL D+++  D ++     A   CI E+ 
Subjt:  GKRRCSALDWKCRDSTNSSCQINKN------------------LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESK

Query:  ITLDKHCTIPKDIYAEEIIEDQILHKE--FDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLL
           +  C+             QI  K   F++L   W ED      QK+RIQ L EEN RL  +ISSLK +LK V C+ ++  KSV+ML+S T++LD +L
Subjt:  ITLDKHCTIPKDIYAEEIIEDQILHKE--FDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLL

Query:  EIGKAGSNRSGLGFSENNKQ
          G+ G NR GLGF  + ++
Subjt:  EIGKAGSNRSGLGFSENNKQ

KAA0051793.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.1e-3044Show/hide
Query:  KRRCSALDWKCRDSTNSSCQINKN---LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAE
        +RR   L +K ++   S  Q  KN    KC ECEGF HYQAEC  FL+RK KS+ VTL DEES  DSD +     LI C+ E++  + K     + +   
Subjt:  KRRCSALDWKCRDSTNSSCQINKN---LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAE

Query:  EIIEDQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENN
        +  +      ++D+LL+ W EDQ V K QK+RIQ L E+NH L SMI +LK +LK V  + + L+KSV++L   TQSLD+LL  GK  SN+  LG+   N
Subjt:  EIIEDQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENN

KAA0059883.1 uncharacterized protein E6C27_scaffold108G001750 [Cucumis melo var. makuwa]1.0e-2540Show/hide
Query:  QINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQLLNNWTED
        +I+++LKCRECEG+GHYQ ECPNF RR+KKS +VTL + ++    ++ E  +A I  + + ++             +EE  ED+     F QL   W ED
Subjt:  QINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQLLNNWTED

Query:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENNKQ
          V   QK++I  L EEN RL S+ISSLK +L+ +  + ++  KS++ L+S+T++LD +L  G++ SN  GLGF+ + K+
Subjt:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENNKQ

KAA0066403.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.2e-2442.19Show/hide
Query:  RRCSALDWKCRDSTNSSCQINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIE
        RR +  D   R++ NS  QI+  LK +ECEG  HYQAECP FLR++KK+  V+L DEES+   DD  +  A    I +     D  C+I       E   
Subjt:  RRCSALDWKCRDSTNSSCQINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIE

Query:  DQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGF
        D++  ++ + L   W ED      QK+RIQ L +EN RL S+ISSLKS+L+ V  + +++ K V+ML+S T++LD +L+ G  GS+R GLGF
Subjt:  DQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGF

TrEMBL top hitse value%identityAlignment
A0A5A7TJC1 Gag-pol polyprotein1.9e-2542.05Show/hide
Query:  NKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQ--LLNNWTED
        +K ++C ECEGFGH Q+EC  +L+ KKKS+  TL DEE+ L+SDD+E   ALI  I  ++  ++   +   D  A       + +K  ++  L   W ED
Subjt:  NKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQ--LLNNWTED

Query:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSE
        Q  +  Q++RIQYL EEN    S I +LK+ELK      E L+KSV+M++  T  LD LL  GK  +++ GLGFSE
Subjt:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSE

A0A5A7TPF7 Gag-proteinase polyprotein1.6e-2436.82Show/hide
Query:  GKRRCSALDWKCRDSTNSSCQINKN------------------LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESK
        G    + ++++ +D  N++ +IN+N                   +CRECEG GHYQAECP FLRR+KK+   TL D+++  D ++     A   CI E+ 
Subjt:  GKRRCSALDWKCRDSTNSSCQINKN------------------LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESK

Query:  ITLDKHCTIPKDIYAEEIIEDQILHKE--FDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLL
           +  C+             QI  K   F++L   W ED      QK+RIQ L EEN RL  +ISSLK +LK V C+ ++  KSV+ML+S T++LD +L
Subjt:  ITLDKHCTIPKDIYAEEIIEDQILHKE--FDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLL

Query:  EIGKAGSNRSGLGFSENNKQ
          G+ G NR GLGF  + ++
Subjt:  EIGKAGSNRSGLGFSENNKQ

A0A5A7VE51 Gag-proteinase polyprotein5.6e-2542.19Show/hide
Query:  RRCSALDWKCRDSTNSSCQINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIE
        RR +  D   R++ NS  QI+  LK +ECEG  HYQAECP FLR++KK+  V+L DEES+   DD  +  A    I +     D  C+I       E   
Subjt:  RRCSALDWKCRDSTNSSCQINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIE

Query:  DQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGF
        D++  ++ + L   W ED      QK+RIQ L +EN RL S+ISSLKS+L+ V  + +++ K V+ML+S T++LD +L+ G  GS+R GLGF
Subjt:  DQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGF

A0A5D3DD57 Gag-pol polyprotein5.2e-3144Show/hide
Query:  KRRCSALDWKCRDSTNSSCQINKN---LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAE
        +RR   L +K ++   S  Q  KN    KC ECEGF HYQAEC  FL+RK KS+ VTL DEES  DSD +     LI C+ E++  + K     + +   
Subjt:  KRRCSALDWKCRDSTNSSCQINKN---LKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAE

Query:  EIIEDQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENN
        +  +      ++D+LL+ W EDQ V K QK+RIQ L E+NH L SMI +LK +LK V  + + L+KSV++L   TQSLD+LL  GK  SN+  LG+   N
Subjt:  EIIEDQILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENN

A0A5D3DMG4 Uncharacterized protein5.1e-2640Show/hide
Query:  QINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQLLNNWTED
        +I+++LKCRECEG+GHYQ ECPNF RR+KKS +VTL + ++    ++ E  +A I  + + ++             +EE  ED+     F QL   W ED
Subjt:  QINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQILHKEFDQLLNNWTED

Query:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENNKQ
          V   QK++I  L EEN RL S+ISSLK +L+ +  + ++  KS++ L+S+T++LD +L  G++ SN  GLGF+ + K+
Subjt:  QIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENNKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G05360.1 Zinc knuckle (CCHC-type) family protein4.0e-0725.12Show/hide
Query:  NKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALID-CIYESKI---------------TLDKHCTIPKDIYAEEIIEDQIL
        +K  +C EC+GF H  +EC N ++ K+K   ++    +S +DSDD E+ + L+    +ES I               +     T P      +  +D  L
Subjt:  NKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALID-CIYESKI---------------TLDKHCTIPKDIYAEEIIEDQIL

Query:  HKEFDQLLNNW----------TEDQIVLKQQKDRIQYLFEENHRLFSMISSLK--SELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGF
            ++   N+           E+  VL ++K +++           ++ +LK  +E +     LE   K+++ML++ T+ L H+L IGK  +++ GLGF
Subjt:  HKEFDQLLNNW----------TEDQIVLKQQKDRIQYLFEENHRLFSMISSLK--SELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGF

Query:  SEN
          N
Subjt:  SEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTTGGCGAAAAAAGGGAAAAGGCGTTGCTCTGCACTCGATTGGAAATGTAGAGACAGTACAAACTCATCCTGTCAGATAAATAAAAACTTAAAATGTCGAGAATG
TGAAGGTTTCGGCCATTACCAAGCAGAATGTCCAAATTTCCTGCGAAGAAAGAAGAAGAGCGTGACAGTCACTCTGTTTGATGAGGAATCTTTATTAGACAGTGATGACA
AAGAAGATGAAAGAGCTCTAATCGATTGTATATATGAAAGTAAAATTACTTTAGATAAACACTGTACAATCCCTAAAGATATCTATGCCGAAGAGATTATTGAGGATCAA
ATACTACACAAGGAGTTTGATCAATTACTAAATAACTGGACAGAAGATCAGATTGTTCTAAAACAGCAAAAGGATCGGATTCAATACCTGTTCGAGGAGAATCATCGGTT
GTTCTCAATGATCTCCTCTCTCAAAAGTGAGTTAAAAGGAGTTTGCTGTGATCTTGAAAGACTAACAAAATCAGTGCAAATGTTAAGTTCTGATACTCAAAGCCTAGATC
ATCTTCTTGAGATTGGGAAGGCTGGATCAAACAGAAGTGGATTAGGATTCTCTGAGAACAACAAACAAGGAACTCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCTTGGCGAAAAAAGGGAAAAGGCGTTGCTCTGCACTCGATTGGAAATGTAGAGACAGTACAAACTCATCCTGTCAGATAAATAAAAACTTAAAATGTCGAGAATG
TGAAGGTTTCGGCCATTACCAAGCAGAATGTCCAAATTTCCTGCGAAGAAAGAAGAAGAGCGTGACAGTCACTCTGTTTGATGAGGAATCTTTATTAGACAGTGATGACA
AAGAAGATGAAAGAGCTCTAATCGATTGTATATATGAAAGTAAAATTACTTTAGATAAACACTGTACAATCCCTAAAGATATCTATGCCGAAGAGATTATTGAGGATCAA
ATACTACACAAGGAGTTTGATCAATTACTAAATAACTGGACAGAAGATCAGATTGTTCTAAAACAGCAAAAGGATCGGATTCAATACCTGTTCGAGGAGAATCATCGGTT
GTTCTCAATGATCTCCTCTCTCAAAAGTGAGTTAAAAGGAGTTTGCTGTGATCTTGAAAGACTAACAAAATCAGTGCAAATGTTAAGTTCTGATACTCAAAGCCTAGATC
ATCTTCTTGAGATTGGGAAGGCTGGATCAAACAGAAGTGGATTAGGATTCTCTGAGAACAACAAACAAGGAACTCATTAGAATGCCTTAAAAATTGTGTTTGTTCGTGCG
AAGGATATCAACCATGATGAATCTAACATACAACAGTCATCTCAAAGAATCTGTGAAGAGAATGCAGCTAAAGCAACAAGACAGAGGAGGAAATGGGTGTGTCATTTTTT
GTGGTAGGCCTG
Protein sequenceShow/hide protein sequence
MTLAKKGKRRCSALDWKCRDSTNSSCQINKNLKCRECEGFGHYQAECPNFLRRKKKSVTVTLFDEESLLDSDDKEDERALIDCIYESKITLDKHCTIPKDIYAEEIIEDQ
ILHKEFDQLLNNWTEDQIVLKQQKDRIQYLFEENHRLFSMISSLKSELKGVCCDLERLTKSVQMLSSDTQSLDHLLEIGKAGSNRSGLGFSENNKQGTH