; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g15030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g15030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr10:11409626..11411449
RNA-Seq ExpressionMoc10g15030
SyntenyMoc10g15030
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041279.1 gag/pol protein [Cucumis melo var. makuwa]6.1e-4160.12Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKLES-APSSSGSKTFKKKKVAGKG
        MFGQ S+Q + EA+K++YN+ M EG SVREHV+N++V+FNVA++NGAV DE++QVS+IL+SLPK FL F SN  MNK+E  APSSSGSK  +K+K  GKG
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKLES-APSSSGSKTFKKKKVAGKG

Query:  SKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLV---WELVLGGSLK
          P  T AA  K KA V  +GK + YNVDG WK NCPKY+  KKK  E KYDLLV   W+LVL  SL+
Subjt:  SKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLV---WELVLGGSLK

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-4051.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-4051.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-4051.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-4051.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.6e-4151.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

A0A5A7TIU9 Gag/pol protein3.0e-4160.12Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKLES-APSSSGSKTFKKKKVAGKG
        MFGQ S+Q + EA+K++YN+ M EG SVREHV+N++V+FNVA++NGAV DE++QVS+IL+SLPK FL F SN  MNK+E  APSSSGSK  +K+K  GKG
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKLES-APSSSGSKTFKKKKVAGKG

Query:  SKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLV---WELVLGGSLK
          P  T AA  K KA V  +GK + YNVDG WK NCPKY+  KKK  E KYDLLV   W+LVL  SL+
Subjt:  SKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLV---WELVLGGSLK

A0A5A7TU93 Gag/pol protein6.6e-4151.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

A0A5A7V4M1 Gag/pol protein6.6e-4151.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

A0A5D3CPJ6 Gag/pol protein6.6e-4151.72Show/hide
Query:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------
        MFGQ S Q +H+ALK+IYN+ MNEG+SVREHV+N+MVHFNVAE+NGAVIDE +QVSFILESLP++FL FRSNAVMNK+                      
Subjt:  MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKL----------------------

Query:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE
                              +S PSSSG+K +KKKK  G+G+K  +  AAAK  K     +G CF  N +G WK NCPKY+ EKKKA +GKYDLLV E
Subjt:  ----------------------ESAPSSSGSKTFKKKKVAGKGSKPDSTAAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWE

Query:  LVL
          L
Subjt:  LVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.2e-0732.32Show/hide
Query:  IVRMTPDGRMKVKGTIHEMEVLIPIDCGATHNFISQALIEELRLPITKMTNYDVVVGNGASITGKGKCKRVFLLIQGLTIVEDFLPFDF--ENLNIILG
        ++ +T +  M+  G I + +V++ ID GAT NFI   L   L+LP +      V++G    I   G C  + L +Q + I E+FL  D    ++++ILG
Subjt:  IVRMTPDGRMKVKGTIHEMEVLIPIDCGATHNFISQALIEELRLPITKMTNYDVVVGNGASITGKGKCKRVFLLIQGLTIVEDFLPFDF--ENLNIILG

AT3G30770.1 Eukaryotic aspartyl protease family protein3.3e-0834.62Show/hide
Query:  KSIVRMTPDGRMKVKGTIHEMEVLIPIDCGATHNFISQALIEELRLPITKMTNYDVVVGNGASITGKGKCKRVFLLIQGLTIVEDFLPFDF--ENLNIIL
        +S    T    M+  G I   +V++ ID GAT+NFIS  L   L+LP +      V++G    I   G C  + LL+Q + I E+FL  D    ++++IL
Subjt:  KSIVRMTPDGRMKVKGTIHEMEVLIPIDCGATHNFISQALIEELRLPITKMTNYDVVVGNGASITGKGKCKRVFLLIQGLTIVEDFLPFDF--ENLNIIL

Query:  GLRG
        G  G
Subjt:  GLRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGGACAACCGTCCTTACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCTCATGAATGAGGGTTCCTCAGTGCGAGAACACGTTGTCAACCTA
ATGGTCCACTTCAACGTGGCAGAGTTGAACGGGGCTGTCATAGACGAGCAGAATCAGGTCAGCTTTATTCTAGAATCTCTTCCGAAGACTTTCCTAGCATTCCGT
AGCAATGCAGTTATGAATAAGCTGGAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAAGGTTGCTGGTAAGGGGTCTAAACCTGACTCCACT
GCTGCTGCTGCCAAGAAAGGCAAGGCCAAGGTTGTAGACGAAGGAAAATGTTTCCAATACAACGTGGACGGGAAATGGAAGCTCAATTGTCCAAAATACGTGACC
GAGAAGAAGAAAGCCAATGAAGGTAAATATGATTTACTTGTTTGGGAATTAGTTCTTGGAGGCAGCTTAAAGCCGGGAGATGACTCTCAAGGTCAGAATGGGAGA
AGTCGTCTCAACTGTGGCAATAGGAGAGTTAAGACTCTGTTCCTCTGTCCATATATGCCAACAAATACGGAGGAGGATGGCGTGATTGGTCTTTCATTAAAATCA
ATCGTCAGGATGACACCGGATGGAAGGATGAAGGTCAAGGGCACTATTCATGAAATGGAAGTACTCATACCGATTGATTGTGGGGCTACCCACAATTTTATTTCA
CAGGCACTTATTGAAGAATTGCGGCTGCCAATTACAAAAATGACAAATTATGATGTGGTGGTTGGAAACGGGGCTTCCATCACTGGAAAAGGGAAGTGCAAACGT
GTTTTTCTTCTTATTCAAGGACTTACTATAGTTGAAGATTTTCTTCCATTTGACTTTGAGAATCTTAACATTATTTTGGGATTACGTGGCTACGTAGCTTGGGAG
ATGTACACGTCAATTGGGCAAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGGACAACCGTCCTTACAGGCTCGACATGAAGCCCTTAAGTTCATTTACAACTCCCTCATGAATGAGGGTTCCTCAGTGCGAGAACACGTTGTCAACCTA
ATGGTCCACTTCAACGTGGCAGAGTTGAACGGGGCTGTCATAGACGAGCAGAATCAGGTCAGCTTTATTCTAGAATCTCTTCCGAAGACTTTCCTAGCATTCCGT
AGCAATGCAGTTATGAATAAGCTGGAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTCAAGAAGAAGAAGGTTGCTGGTAAGGGGTCTAAACCTGACTCCACT
GCTGCTGCTGCCAAGAAAGGCAAGGCCAAGGTTGTAGACGAAGGAAAATGTTTCCAATACAACGTGGACGGGAAATGGAAGCTCAATTGTCCAAAATACGTGACC
GAGAAGAAGAAAGCCAATGAAGGTAAATATGATTTACTTGTTTGGGAATTAGTTCTTGGAGGCAGCTTAAAGCCGGGAGATGACTCTCAAGGTCAGAATGGGAGA
AGTCGTCTCAACTGTGGCAATAGGAGAGTTAAGACTCTGTTCCTCTGTCCATATATGCCAACAAATACGGAGGAGGATGGCGTGATTGGTCTTTCATTAAAATCA
ATCGTCAGGATGACACCGGATGGAAGGATGAAGGTCAAGGGCACTATTCATGAAATGGAAGTACTCATACCGATTGATTGTGGGGCTACCCACAATTTTATTTCA
CAGGCACTTATTGAAGAATTGCGGCTGCCAATTACAAAAATGACAAATTATGATGTGGTGGTTGGAAACGGGGCTTCCATCACTGGAAAAGGGAAGTGCAAACGT
GTTTTTCTTCTTATTCAAGGACTTACTATAGTTGAAGATTTTCTTCCATTTGACTTTGAGAATCTTAACATTATTTTGGGATTACGTGGCTACGTAGCTTGGGAG
ATGTACACGTCAATTGGGCAAAATTAG
Protein sequenceShow/hide protein sequence
MFGQPSLQARHEALKFIYNSLMNEGSSVREHVVNLMVHFNVAELNGAVIDEQNQVSFILESLPKTFLAFRSNAVMNKLESAPSSSGSKTFKKKKVAGKGSKPDST
AAAAKKGKAKVVDEGKCFQYNVDGKWKLNCPKYVTEKKKANEGKYDLLVWELVLGGSLKPGDDSQGQNGRSRLNCGNRRVKTLFLCPYMPTNTEEDGVIGLSLKS
IVRMTPDGRMKVKGTIHEMEVLIPIDCGATHNFISQALIEELRLPITKMTNYDVVVGNGASITGKGKCKRVFLLIQGLTIVEDFLPFDFENLNIILGLRGYVAWE
MYTSIGQN