; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g33150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g33150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:24749534..24753186
RNA-Seq ExpressionMoc05g33150
SyntenyMoc05g33150
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8691424.1 Callose synthase 12 [Hibiscus syriacus]5.8e-1534.44Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------
        G M AL++ YEKPS+++KV+ + R FN+ M EG  V  H+NE+  +  +L S++I F +EV+ + LLSSLP +W    T V  +S ++            
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------

Query:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV
                 GE +  SAL T SRG+   + S +G+ +   +KS +  +D  C+ C KK H K    +L++  GA  S  V
Subjt:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV

KAE8714488.1 hypothetical protein F3Y22_tig00110195pilonHSYRG00090 [Hibiscus syriacus]5.8e-1534.44Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS-------------
        G M AL++ YEKPS+++KV+ + R FN+ M EG  V  H+NE+  +  +L S++I F +EV+ + LLSSLP +W    T V  +S +             
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS-------------

Query:  ------TSGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV
                 GE +  SAL T SRG+   + S +G+ +   +KS +  +D  C+ C KK H K    +L++  GA  S  V
Subjt:  ------TSGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV

KAE8728571.1 hypothetical protein F3Y22_tig00004205pilonHSYRG00041 [Hibiscus syriacus]1.3e-1434.44Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------
        G M AL++ YEKPS+++KV+ + R FN+ M EG  V  H+NE+  +  +L S++I F +EV+ + LLSSLP +W    T V  +S ++            
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------

Query:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV
                 GE +  SAL T SRG+   + S +G+ +    KS + ++D  C+ C KK H K    +L++  GA  S  V
Subjt:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV

KAE8735860.1 Cytochrome P450 90B1 [Hibiscus syriacus]3.4e-1534.44Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------
        G M AL++ YEKPS+++KV+ + R FN+ M EG  V  H+NE+  +  +L S++I F +EV+ + LLSSLP +W    T V  +S ++            
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------

Query:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV
                 GE +  SAL T SRG+   + S +G+ +   +KS +  +D  C+ C KK H K    +L++  GA  S  V
Subjt:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]1.3e-1426.58Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS-------------
        G MK L++ YEKPS+N+KV+ + + F++ MEEG  V +H+NE   ++N+L S++I F +EV+ + LL+SLP +WE M+  V  +  +             
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS-------------

Query:  ------TSGGENNLESALVTHSRGKG---KISYKGKQQYGSQKSGSSSR-DVKCFYCHKKRHIKSVATSLRRGASSSGIVVNRYVLERGRC-------SG
                 GE +  SA    +RG+    +   +G+ +  + K  S SR  V+C+ C K  H K+V T+ +   S     ++ +VL+ G         + 
Subjt:  ------TSGGENNLESALVTHSRGKG---KISYKGKQQYGSQKSGSSSR-DVKCFYCHKKRHIKSVATSLRRGASSSGIVVNRYVLERGRC-------SG

Query:  IADHDLATYKGRYTDRGVLL-----------------------------------------TNCETEFAEGSWKLLRGSEVVAVGNKEALVRRSASRKKT
        + ++ +  Y   Y   GV L                                         T     F +G+WK+ +GS VVA G+K   +  + S ++T
Subjt:  IADHDLATYKGRYTDRGVLL-----------------------------------------TNCETEFAEGSWKLLRGSEVVAVGNKEALVRRSASRKKT

Query:  I
        I
Subjt:  I

TrEMBL top hitse value%identityAlignment
A0A2N9EGP1 Uncharacterized protein1.1e-1629.96Show/hide
Query:  MKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSA----------------
        M AL++ YEKPS+N+KV+ + + FN+ M EGT V  H+NE   + N+L S++I F +EV+ + +L+SLP NWE M+ + V NSA                
Subjt:  MKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSA----------------

Query:  -----------STSGGENNLESALVTHSRGKGKISYKGKQQYGSQKSGS-SSRDVKCFYCHKKRHIK-SVATSLRRGASSSGIVVNRYVLERGRCSG---
                   STSG   NLE    T  R +   SY+G+ +   ++S S S R + C+ C K  HIK +     ++G +    V+   +++         
Subjt:  -----------STSGGENNLESALVTHSRGKGKISYKGKQQYGSQKSGS-SSRDVKCFYCHKKRHIK-SVATSLRRGASSSGIVVNRYVLERGRCSG---

Query:  --IADHDLATYKGRYTDRGVLLTNCETEFAEGSWKLLRGSEVVAVGNKEALVRRSAS
          + D +     G+  +    +      F  G+WK+ +G  VVA G K + +  + S
Subjt:  --IADHDLATYKGRYTDRGVLLTNCETEFAEGSWKLLRGSEVVAVGNKEALVRRSAS

A0A2N9FNY4 Uncharacterized protein2.8e-1529.14Show/hide
Query:  MKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS---------------
        M AL   YEKPS+N+KV+ + + FN+ M EGT V  H+NE   + N+L S++I F +E++ + +L+SLP +WE M+ + V NSA                
Subjt:  MKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS---------------

Query:  ------TSGGENNLESALVTHSRGKGKISYKGKQQYGSQKSGSSS---RDVKCFYCHKKRHIKSVATSLRR--GASSSGIV---------------VNRY
               +G  ++  SAL   +RG+GK     + +  S+K  S S   R ++C+ C K  HI+     L++     S+ +V               +  +
Subjt:  ------TSGGENNLESALVTHSRGKGKISYKGKQQYGSQKSGSSS---RDVKCFYCHKKRHIKSVATSLRR--GASSSGIV---------------VNRY

Query:  VLERGRCSGIADHDLATYKGRYTDRG-VLLTNCET--------EFAEGSWKLLRGSEVVAVGNKEALVRRSASRKKTI
        VL+ G       H          D G V L + E          F  G+WK+ +G+ VVA G K   +  + S + TI
Subjt:  VLERGRCSGIADHDLATYKGRYTDRG-VLLTNCET--------EFAEGSWKLLRGSEVVAVGNKEALVRRSASRKKTI

A0A6A2ZHE4 Callose synthase 122.8e-1534.44Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------
        G M AL++ YEKPS+++KV+ + R FN+ M EG  V  H+NE+  +  +L S++I F +EV+ + LLSSLP +W    T V  +S ++            
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------

Query:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV
                 GE +  SAL T SRG+   + S +G+ +   +KS +  +D  C+ C KK H K    +L++  GA  S  V
Subjt:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV

A0A6A3BGE7 Uncharacterized protein2.8e-1534.44Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS-------------
        G M AL++ YEKPS+++KV+ + R FN+ M EG  V  H+NE+  +  +L S++I F +EV+ + LLSSLP +W    T V  +S +             
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS-------------

Query:  ------TSGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV
                 GE +  SAL T SRG+   + S +G+ +   +KS +  +D  C+ C KK H K    +L++  GA  S  V
Subjt:  ------TSGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV

A0A6A3D2M2 Cytochrome P450 90B11.7e-1534.44Show/hide
Query:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------
        G M AL++ YEKPS+++KV+ + R FN+ M EG  V  H+NE+  +  +L S++I F +EV+ + LLSSLP +W    T V  +S ++            
Subjt:  GPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAST------------

Query:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV
                 GE +  SAL T SRG+   + S +G+ +   +KS +  +D  C+ C KK H K    +L++  GA  S  V
Subjt:  -------SGGENNLESALVTHSRGK--GKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRR--GASSSGIV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-0725.24Show/hide
Query:  LTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS------TSG---------
        L + Y   +  +K+Y   + + +HM EGT   SH+N    ++ +L ++ +   EE K I LL+SLP +++ + T ++    +      TS          
Subjt:  LTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSAS------TSG---------

Query:  GENNLESALVTHSRG----KGKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRRGASSSGIVVNRYVLERGRCSGIADHDLATYKGRYTDR
           N   AL+T  RG    +   +Y      G  K+ S SR   C+ C++  H K    + R+G              +G  SG  + D      +  D 
Subjt:  GENNLESALVTHSRG----KGKISYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRRGASSSGIVVNRYVLERGRCSGIADHDLATYKGRYTDR

Query:  GVLLTNCETE
         VL  N E E
Subjt:  GVLLTNCETE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACCGATGAAGGCTTTGACTAACAGGTATGAAAAACCTTCATCCAACAGCAAGGTGTATTTCATCACGAGATACTTCAACATCCACATGGAGGAGGGTACCTTAGT
GAACTCCCACATAAATGAGGTTACTGATATGTTGAACAAGTTGGAATCGATGAAGATTACTTTTTCAGAGGAGGTGAAGACAATAAAACTACTGTCCTCTTTGCCATACA
ATTGGGAGACGATGAAGACAGTAGTTGTGAAGAATTCCGCATCCACTTCTGGAGGGGAGAATAATTTAGAATCAGCATTGGTAACTCACAGCAGAGGTAAAGGGAAGATT
AGCTATAAAGGAAAGCAGCAGTATGGTAGCCAGAAGAGTGGGAGTAGCAGTAGAGATGTGAAGTGTTTTTACTGCCACAAGAAGAGACACATTAAGAGTGTTGCTACAAG
TTTGCGACGTGGAGCGTCAAGTAGTGGTATTGTCGTGAATCGGTATGTGCTAGAACGGGGTCGCTGTTCGGGAATTGCCGATCATGATTTAGCAACTTATAAAGGCAGAT
ATACTGATAGAGGGGTGCTACTCACCAATTGTGAGACTGAGTTCGCAGAGGGTAGCTGGAAACTTTTGAGGGGATCCGAGGTAGTGGCAGTGGGCAACAAAGAAGCTTTA
GTGCGAAGGAGTGCATCGAGGAAGAAGACTATAGTGGGTTCTAAAGTCAAGGATAATGTCTTTAGAGTGGAAATTGACTTGAATGGGAGTGTCAAATCATCAGAGGGGAG
ATCTACCTTCAAAGTCTTGGAGCAATCATGGACAGAGCTTGGTTTGGGCCGAGCAGTTGTTGATCAACTACTATTGGTGCAGTCAAAACTGAGCCTTGGTGGGGTTGCCA
AGATGATTCGACATTGTTGGTGTAGTGGGAGGCAAGATCGATGTTTTGTCTCAAAGTGGGAGATTCTTGGGATAATGGTGCCAAAACCGGAAACTCACGTCCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGACCGATGAAGGCTTTGACTAACAGGTATGAAAAACCTTCATCCAACAGCAAGGTGTATTTCATCACGAGATACTTCAACATCCACATGGAGGAGGGTACCTTAGT
GAACTCCCACATAAATGAGGTTACTGATATGTTGAACAAGTTGGAATCGATGAAGATTACTTTTTCAGAGGAGGTGAAGACAATAAAACTACTGTCCTCTTTGCCATACA
ATTGGGAGACGATGAAGACAGTAGTTGTGAAGAATTCCGCATCCACTTCTGGAGGGGAGAATAATTTAGAATCAGCATTGGTAACTCACAGCAGAGGTAAAGGGAAGATT
AGCTATAAAGGAAAGCAGCAGTATGGTAGCCAGAAGAGTGGGAGTAGCAGTAGAGATGTGAAGTGTTTTTACTGCCACAAGAAGAGACACATTAAGAGTGTTGCTACAAG
TTTGCGACGTGGAGCGTCAAGTAGTGGTATTGTCGTGAATCGGTATGTGCTAGAACGGGGTCGCTGTTCGGGAATTGCCGATCATGATTTAGCAACTTATAAAGGCAGAT
ATACTGATAGAGGGGTGCTACTCACCAATTGTGAGACTGAGTTCGCAGAGGGTAGCTGGAAACTTTTGAGGGGATCCGAGGTAGTGGCAGTGGGCAACAAAGAAGCTTTA
GTGCGAAGGAGTGCATCGAGGAAGAAGACTATAGTGGGTTCTAAAGTCAAGGATAATGTCTTTAGAGTGGAAATTGACTTGAATGGGAGTGTCAAATCATCAGAGGGGAG
ATCTACCTTCAAAGTCTTGGAGCAATCATGGACAGAGCTTGGTTTGGGCCGAGCAGTTGTTGATCAACTACTATTGGTGCAGTCAAAACTGAGCCTTGGTGGGGTTGCCA
AGATGATTCGACATTGTTGGTGTAGTGGGAGGCAAGATCGATGTTTTGTCTCAAAGTGGGAGATTCTTGGGATAATGGTGCCAAAACCGGAAACTCACGTCCTATGA
Protein sequenceShow/hide protein sequence
MGPMKALTNRYEKPSSNSKVYFITRYFNIHMEEGTLVNSHINEVTDMLNKLESMKITFSEEVKTIKLLSSLPYNWETMKTVVVKNSASTSGGENNLESALVTHSRGKGKI
SYKGKQQYGSQKSGSSSRDVKCFYCHKKRHIKSVATSLRRGASSSGIVVNRYVLERGRCSGIADHDLATYKGRYTDRGVLLTNCETEFAEGSWKLLRGSEVVAVGNKEAL
VRRSASRKKTIVGSKVKDNVFRVEIDLNGSVKSSEGRSTFKVLEQSWTELGLGRAVVDQLLLVQSKLSLGGVAKMIRHCWCSGRQDRCFVSKWEILGIMVPKPETHVL