; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G007050 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G007050
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionEnzymatic polyprotein
Genome locationCmo_Chr12:5361976..5366460
RNA-Seq ExpressionCmoCh12G007050
SyntenyCmoCh12G007050
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.3e-4844.98Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GAN N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ  +VQSKI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.3e-4844.98Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GAN N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ  +VQSKI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

KAG8647418.1 hypothetical protein MANES_09G074619v8 [Manihot esculenta]2.9e-4946.9Show/hide
Query:  EEDFQNSFVGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQV----------TYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKD
        EE+ Q  F G IT  ++QKWY  +   I DFK+   A++D+GAD NCI +          T + L  AN+ K+ I YK+ K H+CN+ ICF  SF+L+K+
Subjt:  EEDFQNSFVGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQV----------TYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKD

Query:  LGQELILGTPFITQLYPFKITEKGLESKALRKKIKFNFLSP-IRVSEINNLQKSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTIPNA
        L  ++ILG PF+  LYPFK+TE G+ES  L + I F F++P   +S+INNL+K   IQ + +++ L +IEEQ++   VQ +IK +Q + E ++CS +P+A
Subjt:  LGQELILGTPFITQLYPFKITEKGLESKALRKKIKFNFLSP-IRVSEINNLQKSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTIPNA

Query:  FWNRKQHIVDLPYVKNFEEKNIPTKA
        FW+RKQHIV LPY  NF E+NIPTKA
Subjt:  FWNRKQHIVDLPYVKNFEEKNIPTKA

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]4.3e-4844.98Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GAN N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ  +VQSKI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

XP_033139453.1 uncharacterized protein LOC117131412 isoform X1 [Brassica rapa]2.8e-5245.42Show/hide
Query:  TAQPNLDEEDFQ-----NSFVGAITTSQYQKWYALVTFKI-YDFKITLKALIDTGADQNCI----------QVTYKGLRGANNNKLKINYKLSKVHVCND
        T+  NL+ ++ +       ++  I    YQKWY  +T  + +DFKI + AL+DTGAD NCI          + T + L GAN + LK+ YKLS   +CN 
Subjt:  TAQPNLDEEDFQ-----NSFVGAITTSQYQKWYALVTFKI-YDFKITLKALIDTGADQNCI----------QVTYKGLRGANNNKLKINYKLSKVHVCND

Query:  GICFVNSFLLVKDLGQELILGTPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQKSTI-------------IQLINEEISLKRIEEQIQN
        G CF N F+LVK+L QE+ILGTPF TQ+YPFK+TE G+ +K +  K+ F FLSP++  EI +LQ+++I             IQ +  EI+ K+IEEQ++ 
Subjt:  GICFVNSFLLVKDLGQELILGTPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQKSTI-------------IQLINEEISLKRIEEQIQN

Query:  KMVQSKIKTLQTQIETEICSTIPNAFWNRKQHIVDLPYVKNFEEKNIPTKA
          + SKIK ++  I  +ICS +PNAFW RKQH V+LPY+K F E+NIPTKA
Subjt:  KMVQSKIKTLQTQIETEICSTIPNAFWNRKQHIVDLPYVKNFEEKNIPTKA

TrEMBL top hitse value%identityAlignment
A0A5A7UR29 Enzymatic polyprotein2.1e-4844.98Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GAN N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ  +VQSKI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

A0A5A7UX67 Enzymatic polyprotein2.1e-4844.98Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GAN N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ  +VQSKI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

A0A5A7VRE0 Reverse transcriptase1.5e-4644.1Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GA  N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ   VQ KI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

A0A5D3BEY3 Enzymatic polyprotein2.1e-4844.98Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GAN N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ  +VQSKI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

A0A5D3DBS1 Reverse transcriptase1.5e-4644.1Show/hide
Query:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG
        + +I+  Q QKW + + FK+ DF++   ALID+GADQN IQ           T + L GA  N L I +KLSKVH+C   +C VN+F+LVK+L + +ILG
Subjt:  VGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQ----------VTYKGLRGANNNKLKINYKLSKVHVCNDGICFVNSFLLVKDLGQELILG

Query:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI
        TPF+TQLYPF +T+KG+ SK   K+I F F  P+    I+N++            K   I+ + ++I   ++  +IQ   VQ KI+  Q Q+E E+CST+
Subjt:  TPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQ------------KSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIETEICSTI

Query:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA
        PNAFW+RK+H+V LPY   F+E  IPTKA
Subjt:  PNAFWNRKQHIVDLPYVKNFEEKNIPTKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAGGAGTCTAATATGAATCCAACAACTGATCATGTGCATGTACCCGAAGGACCGATTACAAGAAGCAAGGTTAAGAAGATTCAAGAGGCCTATACATTG
CATCTTCAAAAGCTAGCTAGTGTACCGGTTGAAACAAAGACTTTTGAGCCCAAAAATCTTTATAGCATTAACATATTAAATCAAGAAGATAATGGAGTGGCTGAT
ATTGGAAAGCGTTTATCAGAGTCTTTCCTTAAAGTTGAAGGAGAAAGAATAGATGAGAGACTGAGTGAAAGAGAAGAGCTACTGAATCTATTTACCTTGTTCAAG
AGTGAAAGAAGAACCTTTACAGGAGAGGAAATTAAGAAGACGCTTTACCTCCGAGTTCTGGTTCAGGTTCCGAAGGAATATGTTCTTCTTGCACTGTTGCATGTA
CAATCATTAAAGCTTGACATGATCAATCTTGCTTGTGGAGTGATTCGAATCTCAAACAAGTCTTCTTGCGTTGAGATATTTGATCAACAAGAATCATTCAAGCTA
GACATAATCGATCTTGCTTGTGGAGTGATTCGAATCTCAAACAATATTCTTGCCTTGAGATATTCGATCAACAAGAATTCAAACGAACTGTCTATTCTTATAAGA
ACTTTGCTGAAATACATGTCTTTGTACCACATGAAATCTCCTAAGGAAAGGCTGCCCGAACCAAGGGGAACGGCGCAACCGAACCTCGACGAAGAAGACTTTCAA
AATTCCTTCGTTGGAGCCATCACCACTTCTCAATATCAAAAATGGTACGCTCTCGTTACCTTCAAAATTTATGATTTCAAAATCACACTAAAAGCTCTCATAGAT
ACTGGAGCCGATCAAAATTGCATTCAAGTAACCTACAAAGGCCTTAGAGGTGCAAACAACAACAAACTCAAAATTAATTACAAACTATCAAAAGTTCATGTTTGC
AATGATGGAATTTGCTTCGTAAATTCCTTCCTTTTAGTAAAGGACTTAGGACAAGAGTTAATCCTAGGTACTCCTTTCATTACTCAATTATATCCTTTTAAAATA
ACTGAAAAAGGGTTAGAGTCAAAGGCCTTAAGAAAGAAGATAAAATTTAATTTCCTTTCACCCATAAGGGTGAGTGAAATAAATAACCTTCAAAAGAGTACAATT
ATTCAATTAATAAATGAAGAAATTTCATTAAAAAGAATAGAAGAACAGATTCAAAATAAAATGGTTCAGTCTAAAATCAAAACTTTACAAACTCAAATAGAAACG
GAGATTTGTTCAACTATCCCTAATGCCTTTTGGAATAGGAAGCAACACATAGTTGATCTTCCTTATGTCAAAAACTTCGAGGAGAAAAATATTCCTACGAAGGCC
TGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAAGGAGTCTAATATGAATCCAACAACTGATCATGTGCATGTACCCGAAGGACCGATTACAAGAAGCAAGGTTAAGAAGATTCAAGAGGCCTATACATTG
CATCTTCAAAAGCTAGCTAGTGTACCGGTTGAAACAAAGACTTTTGAGCCCAAAAATCTTTATAGCATTAACATATTAAATCAAGAAGATAATGGAGTGGCTGAT
ATTGGAAAGCGTTTATCAGAGTCTTTCCTTAAAGTTGAAGGAGAAAGAATAGATGAGAGACTGAGTGAAAGAGAAGAGCTACTGAATCTATTTACCTTGTTCAAG
AGTGAAAGAAGAACCTTTACAGGAGAGGAAATTAAGAAGACGCTTTACCTCCGAGTTCTGGTTCAGGTTCCGAAGGAATATGTTCTTCTTGCACTGTTGCATGTA
CAATCATTAAAGCTTGACATGATCAATCTTGCTTGTGGAGTGATTCGAATCTCAAACAAGTCTTCTTGCGTTGAGATATTTGATCAACAAGAATCATTCAAGCTA
GACATAATCGATCTTGCTTGTGGAGTGATTCGAATCTCAAACAATATTCTTGCCTTGAGATATTCGATCAACAAGAATTCAAACGAACTGTCTATTCTTATAAGA
ACTTTGCTGAAATACATGTCTTTGTACCACATGAAATCTCCTAAGGAAAGGCTGCCCGAACCAAGGGGAACGGCGCAACCGAACCTCGACGAAGAAGACTTTCAA
AATTCCTTCGTTGGAGCCATCACCACTTCTCAATATCAAAAATGGTACGCTCTCGTTACCTTCAAAATTTATGATTTCAAAATCACACTAAAAGCTCTCATAGAT
ACTGGAGCCGATCAAAATTGCATTCAAGTAACCTACAAAGGCCTTAGAGGTGCAAACAACAACAAACTCAAAATTAATTACAAACTATCAAAAGTTCATGTTTGC
AATGATGGAATTTGCTTCGTAAATTCCTTCCTTTTAGTAAAGGACTTAGGACAAGAGTTAATCCTAGGTACTCCTTTCATTACTCAATTATATCCTTTTAAAATA
ACTGAAAAAGGGTTAGAGTCAAAGGCCTTAAGAAAGAAGATAAAATTTAATTTCCTTTCACCCATAAGGGTGAGTGAAATAAATAACCTTCAAAAGAGTACAATT
ATTCAATTAATAAATGAAGAAATTTCATTAAAAAGAATAGAAGAACAGATTCAAAATAAAATGGTTCAGTCTAAAATCAAAACTTTACAAACTCAAATAGAAACG
GAGATTTGTTCAACTATCCCTAATGCCTTTTGGAATAGGAAGCAACACATAGTTGATCTTCCTTATGTCAAAAACTTCGAGGAGAAAAATATTCCTACGAAGGCC
TGA
Protein sequenceShow/hide protein sequence
MNKESNMNPTTDHVHVPEGPITRSKVKKIQEAYTLHLQKLASVPVETKTFEPKNLYSINILNQEDNGVADIGKRLSESFLKVEGERIDERLSEREELLNLFTLFK
SERRTFTGEEIKKTLYLRVLVQVPKEYVLLALLHVQSLKLDMINLACGVIRISNKSSCVEIFDQQESFKLDIIDLACGVIRISNNILALRYSINKNSNELSILIR
TLLKYMSLYHMKSPKERLPEPRGTAQPNLDEEDFQNSFVGAITTSQYQKWYALVTFKIYDFKITLKALIDTGADQNCIQVTYKGLRGANNNKLKINYKLSKVHVC
NDGICFVNSFLLVKDLGQELILGTPFITQLYPFKITEKGLESKALRKKIKFNFLSPIRVSEINNLQKSTIIQLINEEISLKRIEEQIQNKMVQSKIKTLQTQIET
EICSTIPNAFWNRKQHIVDLPYVKNFEEKNIPTKA