; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g21870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g21870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag-Pol
Genome locationchr3:15104962..15106216
RNA-Seq ExpressionMoc03g21870
SyntenyMoc03g21870
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEW78970.1 Gag-Pol polyprotein [Tanacetum cinerariifolium]2.1e-5948.88Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFPDSYDQIVINLT
        MA K+EIEK NG NFSLWK+K+  ILR D CL+ +SER AE+ D+ KW+EM+GNAIANLHLALAD VLSSIEEKK AK+IWDHL +  PDSYD +VINLT
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFPDSYDQIVINLT

Query:  NYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDLSVTRGKTMK---------------------------------------------------
        N VL D L F+D+ A+I+E+ENR  N+ D+  SS+Q E L VT+G++M+                                                   
Subjt:  NYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDLSVTRGKTMK---------------------------------------------------

Query:  --GDILCCEAATTVEGRKSLADMCYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVGE
          G+ LCCEA    EGRK  AD+C +D+ LKI GI ++ +K HD  V  I+ V+HVEGL KNLLS+G+
Subjt:  --GDILCCEAATTVEGRKSLADMCYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVGE

GEW95159.1 Gag-Pol polyprotein [Tanacetum cinerariifolium]2.1e-5948.88Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFPDSYDQIVINLT
        MA K+EIEK NG NFSLWK+K+  ILR D CL+ +SER AE+ D+ KW+EM+GNAIANLHLALAD VLSSIEEKK AK+IWDHL +  PDSYD +VINLT
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFPDSYDQIVINLT

Query:  NYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDLSVTRGKTMK---------------------------------------------------
        N VL D L F+D+ A+I+E+ENR  N+ D+  SS+Q E L VT+G++M+                                                   
Subjt:  NYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDLSVTRGKTMK---------------------------------------------------

Query:  --GDILCCEAATTVEGRKSLADMCYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVGE
          G+ LCCEA    EGRK  AD+C +D+ LKI GI ++ +K HD  V  I+ V+HVEGL KNLLS+G+
Subjt:  --GDILCCEAATTVEGRKSLADMCYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVGE

KAA0026163.1 Gag-Pol [Cucumis melo var. makuwa]1.4e-5540.05Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFP-----------
        MAAKFEIEKFNGTNFSLW +K+ V+LR DNCL  + +  AEI D+ KWNEM+GNA+ N+HLALAD VLSSI+EKKIAKEIWDHL K +            
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFP-----------

Query:  -------------------------------------------------DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL
                                                         DSYDQ+VINL N +LIDYL+F+D+ +A++E+ENR KNK DKL + QQAE L
Subjt:  -------------------------------------------------DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL

Query:  SVTRGKTM---------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-
        +VTRG+ +                           +G  L CEA TT EG+K +AD  +                              +D+ALKI  I 
Subjt:  SVTRGKTM---------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-

Query:  NIKLKFHDNKVGTIQQVQHVEGLTKNLLSV----------------------------------------GETLQEGEASVASRSPSEKLLM
         IKLK HDN V TIQQV+HVE L KNLL +                                        GETLQEGEASVAS S  E LLM
Subjt:  NIKLKFHDNKVGTIQQVQHVEGLTKNLLSV----------------------------------------GETLQEGEASVASRSPSEKLLM

KAA0061179.1 Gag-Pol [Cucumis melo var. makuwa]3.1e-5042.95Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAE-IPDEKWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------
        MA  FEIEKFN TNFSLWK+K+ V+LRKDNCL  + +R AE I D KWNEM+GNA AN+HLALAD VLSSIEEKK AKEIWDHL K +            
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAE-IPDEKWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------

Query:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL
                                                        PDSYDQ+VINLTN +L DYL+F+D+ +A++E+ENR KNK DKL SSQQAE L
Subjt:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL

Query:  SVTRGKTM------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-NIK
         VTRG++                         +G  L CEA TT+EG+K +AD  +                              +D+ALKI GI  IK
Subjt:  SVTRGKTM------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-NIK

Query:  LKFHDNKVGTIQ
        LK HDN V TIQ
Subjt:  LKFHDNKVGTIQ

KAE8677740.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Hibiscus syriacus]7.5e-4951.07Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKF-----------FP
        MA KF+IEKFNG NFSLWK+K+  ILRKD  L+ +SER  +  D+ KW EM+ NA+AN HLALAD+VLSSIEEKK AKEIWDHL K             P
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKF-----------FP

Query:  DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQ----AEDLSVTRGKTM----KGDILCCEAATTVEGRKSLADMCYDDYALKIFG
        DSYDQ++INLTN   +  L F+D+ AA++++ENR KNK D+    ++        S  +G T+     GD LCCEA+TTVEG    +    +D+AL+I G
Subjt:  DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQ----AEDLSVTRGKTM----KGDILCCEAATTVEGRKSLADMCYDDYALKIFG

Query:  I-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVG
        I  IKLK +D  +  ++ VQHV+G+ KNLLS G
Subjt:  I-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVG

TrEMBL top hitse value%identityAlignment
A0A5A7SNG9 Gag-Pol6.9e-5640.05Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFP-----------
        MAAKFEIEKFNGTNFSLW +K+ V+LR DNCL  + +  AEI D+ KWNEM+GNA+ N+HLALAD VLSSI+EKKIAKEIWDHL K +            
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFP-----------

Query:  -------------------------------------------------DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL
                                                         DSYDQ+VINL N +LIDYL+F+D+ +A++E+ENR KNK DKL + QQAE L
Subjt:  -------------------------------------------------DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL

Query:  SVTRGKTM---------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-
        +VTRG+ +                           +G  L CEA TT EG+K +AD  +                              +D+ALKI  I 
Subjt:  SVTRGKTM---------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-

Query:  NIKLKFHDNKVGTIQQVQHVEGLTKNLLSV----------------------------------------GETLQEGEASVASRSPSEKLLM
         IKLK HDN V TIQQV+HVE L KNLL +                                        GETLQEGEASVAS S  E LLM
Subjt:  NIKLKFHDNKVGTIQQVQHVEGLTKNLLSV----------------------------------------GETLQEGEASVASRSPSEKLLM

A0A5A7V644 Gag-Pol1.5e-5042.95Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAE-IPDEKWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------
        MA  FEIEKFN TNFSLWK+K+ V+LRKDNCL  + +R AE I D KWNEM+GNA AN+HLALAD VLSSIEEKK AKEIWDHL K +            
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAE-IPDEKWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------

Query:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL
                                                        PDSYDQ+VINLTN +L DYL+F+D+ +A++E+ENR KNK DKL SSQQAE L
Subjt:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL

Query:  SVTRGKTM------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-NIK
         VTRG++                         +G  L CEA TT+EG+K +AD  +                              +D+ALKI GI  IK
Subjt:  SVTRGKTM------------------------KGDILCCEAATTVEGRKSLADMCY------------------------------DDYALKIFGI-NIK

Query:  LKFHDNKVGTIQ
        LK HDN V TIQ
Subjt:  LKFHDNKVGTIQ

A0A6A2Y2I6 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-4951.07Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKF-----------FP
        MA KF+IEKFNG NFSLWK+K+  ILRKD  L+ +SER  +  D+ KW EM+ NA+AN HLALAD+VLSSIEEKK AKEIWDHL K             P
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKF-----------FP

Query:  DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQ----AEDLSVTRGKTM----KGDILCCEAATTVEGRKSLADMCYDDYALKIFG
        DSYDQ++INLTN   +  L F+D+ AA++++ENR KNK D+    ++        S  +G T+     GD LCCEA+TTVEG    +    +D+AL+I G
Subjt:  DSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQ----AEDLSVTRGKTM----KGDILCCEAATTVEGRKSLADMCYDDYALKIFG

Query:  I-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVG
        I  IKLK +D  +  ++ VQHV+G+ KNLLS G
Subjt:  I-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVG

A0A6A3AGK4 Integrase catalytic domain-containing protein9.0e-4841.78Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAE-IPDEKWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------
        MA KF+IEKFN  NFSLWK+K+  ILRKD CL+ +SER  + I D KWNEM+GNA++N HLALAD+VLSSIEEKK AKEIWDHL K +            
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAE-IPDEKWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------

Query:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL
                                                        PDSYDQ++INLTN  +   + F+D+ AA++++ENR KNK D+  + QQAE L
Subjt:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL

Query:  SVTRGKTMK------------------GDILCCEAATTVEGRKSLADMCYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVG
        +  RG++ +                  GD L CEA+TTVEG    +    +D+AL+I G+  IKLK +D  +  ++ V+HV+GL KNLLS G
Subjt:  SVTRGKTMK------------------GDILCCEAATTVEGRKSLADMCYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSVG

Q6BCY1 Gag-Pol2.2e-4634.46Show/hide
Query:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------
        MAAKFEIEKFNG NFSLWK+K+  ILRKDNCL+ +SER  +  D+ KW+EM  +A+A+L+L++AD VLSSIEEKK A EIWDHL + +            
Subjt:  MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDE-KWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFF------------

Query:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL
                                                        PDSYDQ++INLTN +L DYL F+D+ AA++E+E+R KNK D+  + QQAE L
Subjt:  ------------------------------------------------PDSYDQIVINLTNYVLIDYLNFEDIGAAIVEKENRGKNKVDKLASSQQAEDL

Query:  SVTRGKTMK---------------------------------------------------GDILCCEAATTVEGRKSLADM-------------------
        +V RG++ +                                                   G  LCCEA+   EGRK  AD+                   
Subjt:  SVTRGKTMK---------------------------------------------------GDILCCEAATTVEGRKSLADM-------------------

Query:  -----------CYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSV----------------------------------------GETLQE
                     DD+AL+I GI  IKLK +D  V T+Q V+HV+GL KNLLS                                         GETLQE
Subjt:  -----------CYDDYALKIFGI-NIKLKFHDNKVGTIQQVQHVEGLTKNLLSV----------------------------------------GETLQE

Query:  GEASVASRSPSEKLL
         EASVA+ SP   LL
Subjt:  GEASVASRSPSEKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCAAAGTTCGAGATTGAGAAGTTCAACGGGACTAATTTCTCGTTGTGGAAGATGAAGATCAATGTTATCTTGAGAAAAGATAATTGCCTTTCAACCATGAGTGA
GAGGTCGGCTGAAATCCCAGATGAGAAGTGGAACGAGATGGAGGGGAATGCTATTGCAAATCTTCATCTGGCACTAGCAGATAAAGTGTTATCAAGCATAGAAGAGAAGA
AAATTGCAAAGGAAATTTGGGATCATCTCATAAAGTTTTTTCCTGATTCGTATGATCAAATTGTCATCAACCTGACAAATTATGTTCTCATCGACTATCTGAACTTTGAG
GATATTGGAGCTGCTATCGTTGAAAAGGAAAACCGTGGCAAGAACAAAGTAGATAAGTTGGCGAGTTCACAACAAGCAGAGGATCTATCAGTGACAAGAGGCAAAACAAT
GAAAGGTGATATTTTATGTTGTGAAGCAGCAACAACTGTTGAAGGCAGAAAGAGTTTAGCTGACATGTGCTATGATGATTATGCCTTGAAGATTTTCGGTATTAATATCA
AGTTGAAGTTCCATGACAATAAGGTTGGCACAATTCAACAAGTACAACATGTAGAAGGCCTGACAAAGAACTTGCTCTCAGTAGGGGAGACTTTGCAAGAAGGAGAAGCA
TCAGTTGCCTCAAGAAGTCCAAGTGAAAAGCTCTTGATGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCAAAGTTCGAGATTGAGAAGTTCAACGGGACTAATTTCTCGTTGTGGAAGATGAAGATCAATGTTATCTTGAGAAAAGATAATTGCCTTTCAACCATGAGTGA
GAGGTCGGCTGAAATCCCAGATGAGAAGTGGAACGAGATGGAGGGGAATGCTATTGCAAATCTTCATCTGGCACTAGCAGATAAAGTGTTATCAAGCATAGAAGAGAAGA
AAATTGCAAAGGAAATTTGGGATCATCTCATAAAGTTTTTTCCTGATTCGTATGATCAAATTGTCATCAACCTGACAAATTATGTTCTCATCGACTATCTGAACTTTGAG
GATATTGGAGCTGCTATCGTTGAAAAGGAAAACCGTGGCAAGAACAAAGTAGATAAGTTGGCGAGTTCACAACAAGCAGAGGATCTATCAGTGACAAGAGGCAAAACAAT
GAAAGGTGATATTTTATGTTGTGAAGCAGCAACAACTGTTGAAGGCAGAAAGAGTTTAGCTGACATGTGCTATGATGATTATGCCTTGAAGATTTTCGGTATTAATATCA
AGTTGAAGTTCCATGACAATAAGGTTGGCACAATTCAACAAGTACAACATGTAGAAGGCCTGACAAAGAACTTGCTCTCAGTAGGGGAGACTTTGCAAGAAGGAGAAGCA
TCAGTTGCCTCAAGAAGTCCAAGTGAAAAGCTCTTGATGACCTGA
Protein sequenceShow/hide protein sequence
MAAKFEIEKFNGTNFSLWKMKINVILRKDNCLSTMSERSAEIPDEKWNEMEGNAIANLHLALADKVLSSIEEKKIAKEIWDHLIKFFPDSYDQIVINLTNYVLIDYLNFE
DIGAAIVEKENRGKNKVDKLASSQQAEDLSVTRGKTMKGDILCCEAATTVEGRKSLADMCYDDYALKIFGINIKLKFHDNKVGTIQQVQHVEGLTKNLLSVGETLQEGEA
SVASRSPSEKLLMT