; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0018826 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0018826
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag-pol polyprotein
Genome locationchr01:24326362..24330141
RNA-Seq ExpressionIVF0018826
SyntenyIVF0018826
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043006.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]3.97e-6944.34Show/hide
Query:  KPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQILSSKTKSLKIAEDEMIAEYNIRVLDLANESFALR
        K E+  +K+ED+ ++GNS+ALNAL+N V+ N+ KLINTC  AK AWDILEVA+EGTSK KIS LQIL+S  ++L++ EDE IAE+N+RVL++AN+S ALR
Subjt:  KPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQILSSKTKSLKIAEDEMIAEYNIRVLDLANESFALR

Query:  E----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF-----------------------------------
        E          +LRSL S+FNMKV AI+EA+D++ M+LDEL GSLRTFE+     A  +KS +                                     
Subjt:  E----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF-----------------------------------

Query:  ---------QEQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNN----QSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENF
                 Q  N++  + + S    +SS    R+K  +RG      +S K  + I CHECEGFGH Q E ATYLKRKKKG   T SDEE  S+SD E+ 
Subjt:  ---------QEQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNN----QSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENF

Query:  GRVLISSIIEIAEENVSDMSEKSSSLL
        G  LIS      EENV    +  S  L
Subjt:  GRVLISSIIEIAEENVSDMSEKSSSLL

KAA0046626.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.19e-7265.53Show/hide
Query:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        MISF KSIDS+                   KVTLKPEI+ SKEEDEESL NSQALNALYNGVDQNV KLINTCT  K+AWDIL+VA EGTSK KISRLQ+
Subjt:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIR-VLDLANESFALRE----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF
        LSSK K+LKI EDE++ E+N+  V+DLANESFALRE          +LR+L SRFN+KV AI+EA+DITTM+LDEL GSLRTFELSF +NA KKKS I F
Subjt:  LSSKTKSLKIAEDEMIAEYNIR-VLDLANESFALRE----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF

Query:  QEQNKD
        Q ++ D
Subjt:  QEQNKD

KAA0058905.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]4.85e-8351Show/hide
Query:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        MI+FLKSID++                   KV+LKPE+T S  EDE +LGNS+ LNA +NGVDQNV +LINTCT  KEAW+IL+VAYEGTSK K+ RLQI
Subjt:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQ
        L+S  ++LK+++DE I ++N+RVLD+ANESFAL          R++LRSL  RFNMKV AIKEA+D+TT +LD L GSLRTFEL+  +   K KS I F 
Subjt:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQ

Query:  EQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYA----TYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSIIE
          +   NN+ S + R       + ++  + G +++ + DRSIRC+EC+GF HYQVEY     T+LK+K+K   +TL DE TS+ S+ E FGR LIS+  E
Subjt:  EQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYA----TYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSIIE

TYJ98064.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.36e-6948.57Show/hide
Query:  MISFLKSIDSRK-------------------VTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        MI+FLKS+DSR                    +TLKP+I  +KEEDEES GNS+ LNAL NGVDQNVLKLINTCT AK+ W ILEVAY+G +K K+SRL I
Subjt:  MISFLKSIDSRK-------------------VTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFALREILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQEQNKDFNNSS
        L+S+ ++L++  DE IAE+              +++LRSL S+      AI+EA+DIT M+LDEL GSL TFE SF D A    S    + +NK +  S 
Subjt:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFALREILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQEQNKDFNNSS

Query:  SSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVL
           S+++  G            ++S K D+S+RCHECEG+ H+Q EY TYLK KKK   ITL DE+T SDS+CE++GR L
Subjt:  SSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVL

XP_008464095.1 PREDICTED: uncharacterized protein LOC103502061 [Cucumis melo]3.95e-7650.65Show/hide
Query:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        +I+F+KSIDS+                   K TLK EIT SK++DE + GNS+ALNA++NGVDQN+ K INTC  AKEAW+ILE AYEGTSK KIS+LQI
Subjt:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSF-----YDNAPKKKS
        LSSK +++K+ EDE I E+N+RVLDLANESFAL          R++ RSL  RFNMKV  I+EA+DI  +RLDEL  SLRTFELS         + +K+ 
Subjt:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSF-----YDNAPKKKS

Query:  SIVFQEQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSII
        S+                 RS SS   SR+K   +  +Q   ID+S+RCHECEGF H+Q EYA YLKRKKKG AITLS             GR LIS   
Subjt:  SIVFQEQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSII

Query:  EIAEEN
        EI+ +N
Subjt:  EIAEEN

TrEMBL top hitse value%identityAlignment
A0A1S3CKP5 uncharacterized protein LOC1035020612.0e-6051.5Show/hide
Query:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        +I+F+KSIDS+                   K TLK EIT SK++DE + GNS+ALNA++NGVDQN+ K INTC  AKEAW+ILE AYEGTSK KIS+LQI
Subjt:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQ
        LSSK +++K+ EDE I E+N+RVLDLANESFAL          R++ RSL  RFNMKV  I+EA+DI  +RLDEL  SLRTFELS           ++ +
Subjt:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQ

Query:  EQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSIIEIAEE
        E         S   RS SS   SR+K   +  +Q   ID+S+RCHECEGF H+Q EYA YLKRKKKG AITLS             GR LIS   EI+ +
Subjt:  EQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSIIEIAEE

Query:  N
        N
Subjt:  N

A0A5A7TLW7 Gag-proteinase polyprotein1.7e-5444.34Show/hide
Query:  KPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQILSSKTKSLKIAEDEMIAEYNIRVLDLANESFALR
        K E+  +K+ED+ ++GNS+ALNAL+N V+ N+ KLINTC  AK AWDILEVA+EGTSK KIS LQIL+S  ++L++ EDE IAE+N+RVL++AN+S ALR
Subjt:  KPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQILSSKTKSLKIAEDEMIAEYNIRVLDLANESFALR

Query:  E----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF-----------------------------------
        E          +LRSL S+FNMKV AI+EA+D++ M+LDEL GSLRTFE+     A  +KS +                                     
Subjt:  E----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF-----------------------------------

Query:  ---------QEQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNN----QSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENF
                 Q  N++  + + S    +SS    R+K  +RG      +S K  + I CHECEGFGH Q E ATYLKRKKKG   T SDEE  S+SD E+ 
Subjt:  ---------QEQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNN----QSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENF

Query:  GRVLISSIIEIAEENVSDMSEKSSSLL
        G  LIS      EENV    +  S  L
Subjt:  GRVLISSIIEIAEENVSDMSEKSSSLL

A0A5A7TXB8 Gag-proteinase polyprotein6.2e-5764.45Show/hide
Query:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        MISF KSIDS+                   KVTLKPEI+ SKEEDEESL NSQALNALYNGVDQNV KLINTCT  K+AWDIL+VA EGTSK KISRLQ+
Subjt:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIR-VLDLANESFALRE----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF
        LSSK K+LKI EDE++ E+N+  V+DLANESFALRE          +LR+L SRFN+KV AI+EA+DITTM+LDEL GSLRTFELSF +NA KKKS I F
Subjt:  LSSKTKSLKIAEDEMIAEYNIR-VLDLANESFALRE----------ILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVF

Query:  QEQNKDFNNSS
        Q ++ D   S+
Subjt:  QEQNKDFNNSS

A0A5A7UZ92 Gag-proteinase polyprotein1.0e-6451.33Show/hide
Query:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        MI+FLKSID++                   KV+LKPE+T S  EDE +LGNS+ LNA +NGVDQNV +LINTCT  KEAW+IL+VAYEGTSK K+ RLQI
Subjt:  MISFLKSIDSR-------------------KVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQ
        L+S  ++LK+++DE I ++N+RVLD+ANESFAL          R++LRSL  RFNMKV AIKEA+D+TT +LD L GSLRTFEL+  +   K KS I F 
Subjt:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFAL----------REILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQ

Query:  EQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYA----TYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSIIE
              N+SSS+++ S +    + ++  + G +++ + DRSIRC+EC+GF HYQVEY     T+LK+K+K   +TL DE T S+S+ E FGR LIS+  E
Subjt:  EQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYA----TYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSIIE

A0A5D3BG63 Gag-proteinase polyprotein1.7e-5448.57Show/hide
Query:  MISFLKSIDSRK-------------------VTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI
        MI+FLKS+DSR                    +TLKP+I  +KEEDEES GNS+ LNAL NGVDQNVLKLINTCT AK+ W ILEVAY+G +K K+SRL I
Subjt:  MISFLKSIDSRK-------------------VTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQI

Query:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFALREILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQEQNKDFNNSS
        L+S+ ++L++  DE IAE+              +++LRSL S+      AI+EA+DIT M+LDEL GSL TFE SF D A    S    + +NK +  S 
Subjt:  LSSKTKSLKIAEDEMIAEYNIRVLDLANESFALREILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQEQNKDFNNSS

Query:  SSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVL
           S+++  G            ++S K D+S+RCHECEG+ H+Q EY TYLK KKK   ITL DE+T SDS+CE++GR L
Subjt:  SSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEGFGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATCATTTCTAAAATCTATTGACAGTAGAAAAGTTACTTTGAAGCCTGAAATCACCTTGTCCAAAGAAGAAGATGAAGAATCTCTTGGAAATTCCCAAGCACTAAA
TGCATTATATAATGGAGTTGATCAGAATGTTCTTAAGCTAATTAATACGTGCACATTAGCTAAAGAAGCATGGGATATTTTAGAGGTTGCTTATGAAGGGACTTCCAAAT
TTAAAATATCAAGACTTCAAATTCTATCTTCAAAAACTAAATCTCTCAAAATAGCTGAAGATGAAATGATAGCTGAATACAATATTCGTGTTCTCGACTTGGCGAATGAA
TCTTTTGCACTTAGGGAAATCTTAAGATCTCTTTCTTCTCGGTTCAACATGAAAGTCAATGCAATAAAGGAAGCTGATGACATCACCACCATGAGACTTGATGAGTTACT
CGGGTCTTTAAGAACTTTTGAACTATCTTTTTATGATAATGCACCTAAGAAGAAAAGTAGTATTGTGTTCCAAGAGCAAAATAAAGATTTTAACAACTCCAGCAGCTCCA
GTTCAAGATCCTCTAGCTCTGGTACTTCATCAAGACAAAAACATGGTGATCGAGGTAATAATCAATCTGTCAAGATTGATAGAAGCATCAGGTGTCATGAATGTGAAGGT
TTTGGACATTACCAAGTCGAATATGCAACCTATTTGAAACGTAAGAAAAAGGGGTTTGCTATTACCTTATCTGATGAGGAAACATCCTCAGATAGTGATTGTGAAAACTT
TGGAAGAGTTTTAATTAGTTCTATTATAGAGATTGCTGAGGAAAATGTATCTGATATGTCTGAAAAGAGCTCAAGTCTACTCCATTACAATTTGAAGAAGTTATGTCTAT
GTGGGAAGAGGATCAGGAAGTTCTTAAATAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGATATCATTTCTAAAATCTATTGACAGTAGAAAAGTTACTTTGAAGCCTGAAATCACCTTGTCCAAAGAAGAAGATGAAGAATCTCTTGGAAATTCCCAAGCACTAAA
TGCATTATATAATGGAGTTGATCAGAATGTTCTTAAGCTAATTAATACGTGCACATTAGCTAAAGAAGCATGGGATATTTTAGAGGTTGCTTATGAAGGGACTTCCAAAT
TTAAAATATCAAGACTTCAAATTCTATCTTCAAAAACTAAATCTCTCAAAATAGCTGAAGATGAAATGATAGCTGAATACAATATTCGTGTTCTCGACTTGGCGAATGAA
TCTTTTGCACTTAGGGAAATCTTAAGATCTCTTTCTTCTCGGTTCAACATGAAAGTCAATGCAATAAAGGAAGCTGATGACATCACCACCATGAGACTTGATGAGTTACT
CGGGTCTTTAAGAACTTTTGAACTATCTTTTTATGATAATGCACCTAAGAAGAAAAGTAGTATTGTGTTCCAAGAGCAAAATAAAGATTTTAACAACTCCAGCAGCTCCA
GTTCAAGATCCTCTAGCTCTGGTACTTCATCAAGACAAAAACATGGTGATCGAGGTAATAATCAATCTGTCAAGATTGATAGAAGCATCAGGTGTCATGAATGTGAAGGT
TTTGGACATTACCAAGTCGAATATGCAACCTATTTGAAACGTAAGAAAAAGGGGTTTGCTATTACCTTATCTGATGAGGAAACATCCTCAGATAGTGATTGTGAAAACTT
TGGAAGAGTTTTAATTAGTTCTATTATAGAGATTGCTGAGGAAAATGTATCTGATATGTCTGAAAAGAGCTCAAGTCTACTCCATTACAATTTGAAGAAGTTATGTCTAT
GTGGGAAGAGGATCAGGAAGTTCTTAAATAATTAA
Protein sequenceShow/hide protein sequence
MISFLKSIDSRKVTLKPEITLSKEEDEESLGNSQALNALYNGVDQNVLKLINTCTLAKEAWDILEVAYEGTSKFKISRLQILSSKTKSLKIAEDEMIAEYNIRVLDLANE
SFALREILRSLSSRFNMKVNAIKEADDITTMRLDELLGSLRTFELSFYDNAPKKKSSIVFQEQNKDFNNSSSSSSRSSSSGTSSRQKHGDRGNNQSVKIDRSIRCHECEG
FGHYQVEYATYLKRKKKGFAITLSDEETSSDSDCENFGRVLISSIIEIAEENVSDMSEKSSSLLHYNLKKLCLCGKRIRKFLNN