; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20670
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr6:16098002..16107365
RNA-Seq ExpressionMoc06g20670
SyntenyMoc06g20670
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054278.1 gag/pol protein [Cucumis melo var. makuwa]5.3e-4248.31Show/hide
Query:  VCNVYDKWITANDKANVYILASISIVIAKKHE--DTTYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTF-KKKNATGKGLRPDPTTTTTK
        V + YD+W  ANDKA +YILA +S +++KKHE   T  Q +   +   G+ ++   ++ +   + R  F  SSSGSK   K+K   GKG    PT     
Subjt:  VCNVYDKWITANDKANVYILASISIVIAKKHE--DTTYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTF-KKKNATGKGLRPDPTTTTTK

Query:  KDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKR------RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYK
        K KAKVA KGKCFHCN++ HWKRN P   +K+      R   +LE  EM LKVG G+VIS  A                      VVIPDDGVEDPL+YK
Subjt:  KDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKR------RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYK

Query:  KAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
        +A+ D DKD+WVKAM+LEMESMYFNSVWELVD PDG
Subjt:  KAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

KAA0065357.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-3245.96Show/hide
Query:  QSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKA-KVAEKGKCFHCNIDGHWKRNYPNTWLKR-----
        ++L K KG E EANVAT+K KF RGS+SR K    +  SK  +K    GKG        T+K+ K  K  EKGKC+HC  +GHW RN P    ++     
Subjt:  QSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKA-KVAEKGKCFHCNIDGHWKRNYPNTWLKR-----

Query:  ---RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQ
           R S+  +  + T  +G       +     +GR+V QP RY+GL + Q++IPDDG+EDPLTYK+AM D D D+W+KAM+L+MESMY N VW LVDQ
Subjt:  ---RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQ

KAA0066192.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-3539.86Show/hide
Query:  NKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS---------
        N   EGEAN A S+          +F  SS GSK  K +N  G G R  PT     K KAKVA KGKCFHCN+DGHWKRN P    K+++          
Subjt:  NKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS---------

Query:  --------TKLEAGEMTLKVGTGEVISVVAVGEL------------------------------------------------------------------
                 +LE GEMTLKVGTG+VI+  A  E                                                                   
Subjt:  --------TKLEAGEMTLKVGTGEVISVVAVGEL------------------------------------------------------------------

Query:  ------------NGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
                    +GRIV QP RY+GLT+TQV IPDDGVEDPL+YK+A  D DKD+W+KAMDLEME +YFNSVWELVD P+G
Subjt:  ------------NGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

TYJ96859.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-3549.75Show/hide
Query:  TYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS--
        T+QS+   K  +GE N+A S++F   +TS  +F+      K  KKK   GKG     T     K KAKVA K KCFH N+D HWKRN P   +K+++   
Subjt:  TYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS--

Query:  ----TKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
             +LE GEMTLKVGTG+V+S  A+        L P + +     Q+VIPDDGV+DPL+ K+ M D DKD+WVKAMDLEMESMYFNSVWELVD P+G
Subjt:  ----TKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

TYK23781.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]5.0e-3239.45Show/hide
Query:  FNIEVNGDSIHSSSCKRIVVVCNVYDKWITANDKANVYILASISIVIAKKHEDTTYQSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKK
        FN+    ++I   + + I ++ ++ + ++    ++NV +      +    +E  T++SL K KG +GEANVATS  KFHRG TS  K + SSSG+K +KK
Subjt:  FNIEVNGDSIHSSSCKRIVVVCNVYDKWITANDKANVYILASISIVIAKKHEDTTYQSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKK

Query:  KNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVI
        K   G+G + +P    T K KAK A K  CFHCN +GHWK+N P  +L  +K  K       LK+   E          N  +     R++ L + + ++
Subjt:  KNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVI

Query:  --------PDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
                 ++ + DPLT+KKAM+D DKD+W+KAM+LE+ESMYFNSVW+LVDQPDG
Subjt:  --------PDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

TrEMBL top hitse value%identityAlignment
A0A5A7UH21 Gag/pol protein2.6e-4248.31Show/hide
Query:  VCNVYDKWITANDKANVYILASISIVIAKKHE--DTTYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTF-KKKNATGKGLRPDPTTTTTK
        V + YD+W  ANDKA +YILA +S +++KKHE   T  Q +   +   G+ ++   ++ +   + R  F  SSSGSK   K+K   GKG    PT     
Subjt:  VCNVYDKWITANDKANVYILASISIVIAKKHE--DTTYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTF-KKKNATGKGLRPDPTTTTTK

Query:  KDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKR------RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYK
        K KAKVA KGKCFHCN++ HWKRN P   +K+      R   +LE  EM LKVG G+VIS  A                      VVIPDDGVEDPL+YK
Subjt:  KDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKR------RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYK

Query:  KAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
        +A+ D DKD+WVKAM+LEMESMYFNSVWELVD PDG
Subjt:  KAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

A0A5A7VE43 Gag/pol protein6.3e-3345.96Show/hide
Query:  QSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKA-KVAEKGKCFHCNIDGHWKRNYPNTWLKR-----
        ++L K KG E EANVAT+K KF RGS+SR K    +  SK  +K    GKG        T+K+ K  K  EKGKC+HC  +GHW RN P    ++     
Subjt:  QSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKA-KVAEKGKCFHCNIDGHWKRNYPNTWLKR-----

Query:  ---RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQ
           R S+  +  + T  +G       +     +GR+V QP RY+GL + Q++IPDDG+EDPLTYK+AM D D D+W+KAM+L+MESMY N VW LVDQ
Subjt:  ---RKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQ

A0A5D3BCS4 Gag/pol protein8.0e-3649.75Show/hide
Query:  TYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS--
        T+QS+   K  +GE N+A S++F   +TS  +F+      K  KKK   GKG     T     K KAKVA K KCFH N+D HWKRN P   +K+++   
Subjt:  TYQSLTKNKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS--

Query:  ----TKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
             +LE GEMTLKVGTG+V+S  A+        L P + +     Q+VIPDDGV+DPL+ K+ M D DKD+WVKAMDLEMESMYFNSVWELVD P+G
Subjt:  ----TKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

A0A5D3BR24 Gag/pol protein1.4e-3539.86Show/hide
Query:  NKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS---------
        N   EGEAN A S+          +F  SS GSK  K +N  G G R  PT     K KAKVA KGKCFHCN+DGHWKRN P    K+++          
Subjt:  NKGHEGEANVATSKKFHRGSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKS---------

Query:  --------TKLEAGEMTLKVGTGEVISVVAVGEL------------------------------------------------------------------
                 +LE GEMTLKVGTG+VI+  A  E                                                                   
Subjt:  --------TKLEAGEMTLKVGTGEVISVVAVGEL------------------------------------------------------------------

Query:  ------------NGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
                    +GRIV QP RY+GLT+TQV IPDDGVEDPL+YK+A  D DKD+W+KAMDLEME +YFNSVWELVD P+G
Subjt:  ------------NGRIVLQPGRYVGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

A0A5D3DJD2 Retrovirus-related pol polyprotein from transposon tnt 1-942.4e-3239.45Show/hide
Query:  FNIEVNGDSIHSSSCKRIVVVCNVYDKWITANDKANVYILASISIVIAKKHEDTTYQSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKK
        FN+    ++I   + + I ++ ++ + ++    ++NV +      +    +E  T++SL K KG +GEANVATS  KFHRG TS  K + SSSG+K +KK
Subjt:  FNIEVNGDSIHSSSCKRIVVVCNVYDKWITANDKANVYILASISIVIAKKHEDTTYQSLTKNKGHEGEANVATSK-KFHRGSTSRGKFLSSSSGSKTFKK

Query:  KNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVI
        K   G+G + +P    T K KAK A K  CFHCN +GHWK+N P  +L  +K  K       LK+   E          N  +     R++ L + + ++
Subjt:  KNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRYVGLTKTQVVI

Query:  --------PDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG
                 ++ + DPLT+KKAM+D DKD+W+KAM+LE+ESMYFNSVW+LVDQPDG
Subjt:  --------PDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55140.1 Ribonuclease III family protein4.7e-0465.52Show/hide
Query:  LGHEQWLSNLLKVQKPRSMFNAASLAYIG
        +G+E+W  +  KV+KPRS+FNAASLAYIG
Subjt:  LGHEQWLSNLLKVQKPRSMFNAASLAYIG

AT1G55140.2 Ribonuclease III family protein4.7e-0465.52Show/hide
Query:  LGHEQWLSNLLKVQKPRSMFNAASLAYIG
        +G+E+W  +  KV+KPRS+FNAASLAYIG
Subjt:  LGHEQWLSNLLKVQKPRSMFNAASLAYIG

AT3G13740.1 Ribonuclease III family protein4.7e-0450Show/hide
Query:  VELESLYLGHEQWLSNLLKVQKPRSMFNAASLAYIG
        V+++  Y+G+E W  +  K++KPRS+FN ASLA+IG
Subjt:  VELESLYLGHEQWLSNLLKVQKPRSMFNAASLAYIG

AT3G13740.2 Ribonuclease III family protein4.7e-0450Show/hide
Query:  VELESLYLGHEQWLSNLLKVQKPRSMFNAASLAYIG
        V+++  Y+G+E W  +  K++KPRS+FN ASLA+IG
Subjt:  VELESLYLGHEQWLSNLLKVQKPRSMFNAASLAYIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTCGTGCCTTAGCGCCGCGGCGCTGCCCTTAGGCACTAAGCCTCGTGTCGCCCTGGGCGTGGCCTCCCTACGGAAGGTGTTTGCATGGTTCAATATCGAGGTGAA
TGGAGATAGTATTCATAGTTCATCTTGCAAGAGGATTGTGGTGGTATGCAACGTCTATGACAAATGGATCACGGCCAATGACAAGGCCAATGTCTACATCTTGGCGAGCA
TATCTATTGTGATTGCTAAGAAGCACGAGGACACGACTTACCAGTCTCTTACGAAAAATAAGGGACATGAAGGGGAGGCAAACGTTGCCACCTCGAAAAAGTTCCACCGA
GGTTCAACCTCTAGAGGCAAGTTTTTGTCATCCTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAATGCCACTGGTAAGGGGCTTAGACCTGACCCTACTACTACCACTAC
CAAGAAAGACAAGGCGAAGGTTGCAGAGAAAGGAAAATGTTTCCACTGCAATATAGACGGGCATTGGAAGCGCAACTACCCAAATACTTGGCTGAAAAGAAGAAAGTCAA
CGAAGCTTGAAGCCGGAGAGATGACTCTCAAGGTCGGAACAGGAGAGGTCATCTCAGTTGTGGCAGTAGGGGAGCTCAATGGGAGGATTGTGTTACAACCTGGCCGCTAC
GTGGGGTTAACAAAAACCCAAGTTGTCATACCTGATGACGGCGTAGAGGATCCATTAACCTATAAAAAGGCAATGGAAGATGCTGACAAGGACAAATGGGTCAAAGCCAT
GGACTTGGAAATGGAGTCTATGTACTTCAATTCCGTTTGGGAACTTGTAGACCAACCTGACGGGCAACGAAAGTCCGCTTCCGCTCCGGGATTTACATCCCTTCAAATAA
ACCAGCTCTATTGTCGTGCGAATCGAACTTTACTGCTTGCGAACGGCGACGATTCGGCGGCGACAACTTCCATCTCTTCGAACGACAACAACGTTCATCAATTCACCGGC
GGTGACCTCCATTGTTCGGAGCGGCTCAAGCAACGGCGAACCCACGAGCAATCTGGAGATTCTTCAAGCAAACTAGTGGAGCTGGAGAGTCTTTATTTGGGACATGAGCA
GTGGTTGTCAAATCTACTTAAGGTGCAGAAACCTCGATCCATGTTCAATGCAGCATCATTAGCATATATTGGTGTCAAGTTTACTCCTGCATGCCTTCAATCAGCTGTGA
GTAATTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGTCGTGCCTTAGCGCCGCGGCGCTGCCCTTAGGCACTAAGCCTCGTGTCGCCCTGGGCGTGGCCTCCCTACGGAAGGTGTTTGCATGGTTCAATATCGAGGTGAA
TGGAGATAGTATTCATAGTTCATCTTGCAAGAGGATTGTGGTGGTATGCAACGTCTATGACAAATGGATCACGGCCAATGACAAGGCCAATGTCTACATCTTGGCGAGCA
TATCTATTGTGATTGCTAAGAAGCACGAGGACACGACTTACCAGTCTCTTACGAAAAATAAGGGACATGAAGGGGAGGCAAACGTTGCCACCTCGAAAAAGTTCCACCGA
GGTTCAACCTCTAGAGGCAAGTTTTTGTCATCCTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAATGCCACTGGTAAGGGGCTTAGACCTGACCCTACTACTACCACTAC
CAAGAAAGACAAGGCGAAGGTTGCAGAGAAAGGAAAATGTTTCCACTGCAATATAGACGGGCATTGGAAGCGCAACTACCCAAATACTTGGCTGAAAAGAAGAAAGTCAA
CGAAGCTTGAAGCCGGAGAGATGACTCTCAAGGTCGGAACAGGAGAGGTCATCTCAGTTGTGGCAGTAGGGGAGCTCAATGGGAGGATTGTGTTACAACCTGGCCGCTAC
GTGGGGTTAACAAAAACCCAAGTTGTCATACCTGATGACGGCGTAGAGGATCCATTAACCTATAAAAAGGCAATGGAAGATGCTGACAAGGACAAATGGGTCAAAGCCAT
GGACTTGGAAATGGAGTCTATGTACTTCAATTCCGTTTGGGAACTTGTAGACCAACCTGACGGGCAACGAAAGTCCGCTTCCGCTCCGGGATTTACATCCCTTCAAATAA
ACCAGCTCTATTGTCGTGCGAATCGAACTTTACTGCTTGCGAACGGCGACGATTCGGCGGCGACAACTTCCATCTCTTCGAACGACAACAACGTTCATCAATTCACCGGC
GGTGACCTCCATTGTTCGGAGCGGCTCAAGCAACGGCGAACCCACGAGCAATCTGGAGATTCTTCAAGCAAACTAGTGGAGCTGGAGAGTCTTTATTTGGGACATGAGCA
GTGGTTGTCAAATCTACTTAAGGTGCAGAAACCTCGATCCATGTTCAATGCAGCATCATTAGCATATATTGGTGTCAAGTTTACTCCTGCATGCCTTCAATCAGCTGTGA
GTAATTCATGA
Protein sequenceShow/hide protein sequence
MLSCLSAAALPLGTKPRVALGVASLRKVFAWFNIEVNGDSIHSSSCKRIVVVCNVYDKWITANDKANVYILASISIVIAKKHEDTTYQSLTKNKGHEGEANVATSKKFHR
GSTSRGKFLSSSSGSKTFKKKNATGKGLRPDPTTTTTKKDKAKVAEKGKCFHCNIDGHWKRNYPNTWLKRRKSTKLEAGEMTLKVGTGEVISVVAVGELNGRIVLQPGRY
VGLTKTQVVIPDDGVEDPLTYKKAMEDADKDKWVKAMDLEMESMYFNSVWELVDQPDGQRKSASAPGFTSLQINQLYCRANRTLLLANGDDSAATTSISSNDNNVHQFTG
GDLHCSERLKQRRTHEQSGDSSSKLVELESLYLGHEQWLSNLLKVQKPRSMFNAASLAYIGVKFTPACLQSAVSNS