; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g00480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g00480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr1:254059..256156
RNA-Seq ExpressionMoc01g00480
SyntenyMoc01g00480
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]3.8e-5547.04Show/hide
Query:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM
        N ++LADD  R IR Y AP F   NP I  PEI AP+FELKPVMFQMLQTVGQF G PTED H HL+ F+ V +SFK +G ++EVLRLKLFP+SL+D A 
Subjt:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM

Query:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK
        +WL +LP + +T+ +DLAEKF  KYFPP++NAK                                     CIQ+ET+YN L+  +R+V+DASANG + +K
Subjt:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK

Query:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLV
         Y+EAF ILE I+ NN+ WS+  A   R   G+ E ++  AL +++ ++T+++
Subjt:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLV

XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]7.2e-6255.91Show/hide
Query:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM
        N VLLA  IDREIRAY AP FYNFNPVITE EI APKFELK                                    DEG NKEVLRLKLF +SL+DEA 
Subjt:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM

Query:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK
        TWL SLPSE ITS DDLAE F MKYFPPSKNAK                                     CI IE YYN LDD TRLV   S N  L AK
Subjt:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK

Query:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS
        PY+EAFNILE IS N HS SD  AIQGRG K LNES+SY   NSKIEN+ DLV RSMTQQSTV A TG  N SHSQG S
Subjt:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]4.8e-5864.82Show/hide
Query:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAKCIQIETYYNR
        MFQMLQTVG+FHGH TED H HLKF MGVCNSFKDEG +K+V+RLKLFP+SL+DEA TWL+SLPSE ITS DDLAEKF MKYFPP+KNAK      Y N 
Subjt:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAKCIQIETYYNR

Query:  LDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS
        +++  +   D  +         +EAFNILE IS NNHSW D  A+QG+  K L ESESY  LNSKIENLTDLVMRS+TQQS   AS G  NV+  QGIS
Subjt:  LDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]1.3e-5564.58Show/hide
Query:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKN--AKC-------
        MFQM+  VGQFHGH TE  H HLKFFMGV NSFKDEG +K VLRLKLF YSL+ EA TWL+SL SE+ITS DDL EKF MKYF PSK    +C       
Subjt:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKN--AKC-------

Query:  -IQIETYYNRLDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQST
         IQIETYY  LD+ TRLVIDAS NG L  KPY++A NILE IS +NHSWSD  AI+G+  K L ESESY  LNSKIE LTDL  R+ +  +T
Subjt:  -IQIETYYNRLDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQST

XP_030483210.1 uncharacterized protein LOC115699807 [Cannabis sativa]2.5e-5447.47Show/hide
Query:  LADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLK
        +ADD D+ IR Y AP F   NP I  PEI AP+FELKPVMFQMLQTVGQF G PTED H HL+ FM V +SFK  G  ++ LRLKLFPYSL+D+A  WL 
Subjt:  LADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLK

Query:  SLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAKPYSE
        SLPS  +T+  +LAE+F MKYFPP+KNAK                                     CIQ+ET+YN L+  TR+V+DASANG L AK Y+E
Subjt:  SLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAKPYSE

Query:  AFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMR-SMTQQ
        A++I+E IS NN+ W       G+   G+ E ++  AL++++ ++++++   SM QQ
Subjt:  AFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMR-SMTQQ

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189103.5e-6255.91Show/hide
Query:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM
        N VLLA  IDREIRAY AP FYNFNPVITE EI APKFELK                                    DEG NKEVLRLKLF +SL+DEA 
Subjt:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM

Query:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK
        TWL SLPSE ITS DDLAE F MKYFPPSKNAK                                     CI IE YYN LDD TRLV   S N  L AK
Subjt:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK

Query:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS
        PY+EAFNILE IS N HS SD  AIQGRG K LNES+SY   NSKIEN+ DLV RSMTQQSTV A TG  N SHSQG S
Subjt:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS

A0A6J1DTD1 uncharacterized protein LOC1110241362.3e-5864.82Show/hide
Query:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAKCIQIETYYNR
        MFQMLQTVG+FHGH TED H HLKF MGVCNSFKDEG +K+V+RLKLFP+SL+DEA TWL+SLPSE ITS DDLAEKF MKYFPP+KNAK      Y N 
Subjt:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAKCIQIETYYNR

Query:  LDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS
        +++  +   D  +         +EAFNILE IS NNHSW D  A+QG+  K L ESESY  LNSKIENLTDLVMRS+TQQS   AS G  NV+  QGIS
Subjt:  LDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTVAASTG--NVSHSQGIS

A0A6J1DWK1 uncharacterized protein LOC1110250536.4e-5664.58Show/hide
Query:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKN--AKC-------
        MFQM+  VGQFHGH TE  H HLKFFMGV NSFKDEG +K VLRLKLF YSL+ EA TWL+SL SE+ITS DDL EKF MKYF PSK    +C       
Subjt:  MFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKN--AKC-------

Query:  -IQIETYYNRLDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQST
         IQIETYY  LD+ TRLVIDAS NG L  KPY++A NILE IS +NHSWSD  AI+G+  K L ESESY  LNSKIE LTDL  R+ +  +T
Subjt:  -IQIETYYNRLDDVTRLVIDASANGTLPAKPYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQST

A0A6J1H7E4 uncharacterized protein LOC1114611686.6e-5343.35Show/hide
Query:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM
        NA+ LADD +R IRAY  PA    NP I  PE+ A  FELKPVMFQMLQT+GQFHG P+ED H HLK F+GV +SF+ +G +K+V+RL LFPYSL+D A 
Subjt:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM

Query:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK
        +WL +L    I S + LAEKF +KYFPP++NA+                                     CIQ+ET+YN L+  T+ V+DASANG + +K
Subjt:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK

Query:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTV
         Y+EA+ ILE I+ NN  W+D+ +  G+  +G+ E ++  ++N+++ ++T+++      Q T+
Subjt:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLVMRSMTQQSTV

U5CUI2 Retrotrans_gag domain-containing protein1.9e-5547.04Show/hide
Query:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM
        N ++LADD  R IR Y AP F   NP I  PEI AP+FELKPVMFQMLQTVGQF G PTED H HL+ F+ V +SFK +G ++EVLRLKLFP+SL+D A 
Subjt:  NAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRLKLFPYSLKDEAM

Query:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK
        +WL +LP + +T+ +DLAEKF  KYFPP++NAK                                     CIQ+ET+YN L+  +R+V+DASANG + +K
Subjt:  TWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAK-------------------------------------CIQIETYYNRLDDVTRLVIDASANGTLPAK

Query:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLV
         Y+EAF ILE I+ NN+ WS+  A   R   G+ E ++  AL +++ ++T+++
Subjt:  PYSEAFNILE-ISLNNHSWSDLTAIQGRGGKGLNESESYFALNSKIENLTDLV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGCAGGTCGTCCTCATCGGCATGAGAGGGAGACCTGTTACAGAAGGCGTCGCCCAACCCCACCTCCTCAACGCACCCTACCGCTAGGTGAGGCGACGAGAGAGGA
CTCTCCTAAGGGCGACCTAAAGCTCTCCTCTGAGCTCGCCCTCCTAGACCTAGAACTACCGCTACACCTATCAGACATTGCTAAGTTAAGGCGAATTTACAGAGAAGAAG
ATGAGCTACTAACCTCGACGGAGAGTTGGAAAAGTCTTGAAAATGTCGAACGGCTCGGATCAGAAACGGGACGGTGGTTAAGTGCTCAAGAGCACCTCGAAAGTCGTGAC
GAGCGTAAAGTTCTTAAGTGGCCATTTAATGTTGCTCTCGGAGCCCGTGATTGCAGTGCCCCCCCCAATGCTGTATTACTAGCAGATGACATCGACAGAGAGATCAGGGC
ATATGTAGCTCCAGCATTTTATAATTTCAACCCAGTAATCACGGAGCCTGAAATTGCAGCCCCAAAGTTTGAACTAAAACCAGTAATGTTTCAGATGCTCCAGACAGTGG
GTCAGTTTCACGGGCATCCTACTGAAGACCTACATTCACATCTGAAGTTTTTTATGGGGGTATGCAATTCGTTTAAGGATGAAGGATGCAACAAAGAAGTGTTGCGGCTT
AAGTTGTTCCCCTATTCACTTAAAGATGAAGCCATGACTTGGTTAAAGTCGCTACCTTCAGAATTCATTACAAGTGGGGATGACTTGGCCGAGAAATTCTTCATGAAGTA
CTTCCCACCCAGCAAGAACGCTAAGTGCATCCAGATTGAAACGTATTACAATCGATTGGACGACGTTACACGTCTGGTCATCGATGCCTCAGCAAATGGCACATTGCCAG
CAAAACCTTATTCTGAAGCATTCAATATCTTGGAAATATCATTGAACAACCATTCATGGTCAGACCTTACAGCTATTCAAGGTAGAGGAGGTAAGGGACTTAATGAATCT
GAATCATACTTTGCTTTGAACTCAAAAATTGAAAACTTGACAGATTTAGTGATGAGAAGCATGACACAGCAGAGCACAGTGGCAGCATCTACTGGTAATGTTAGCCACAG
TCAGGGGATTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATGCAGGTCGTCCTCATCGGCATGAGAGGGAGACCTGTTACAGAAGGCGTCGCCCAACCCCACCTCCTCAACGCACCCTACCGCTAGGTGAGGCGACGAGAGAGGA
CTCTCCTAAGGGCGACCTAAAGCTCTCCTCTGAGCTCGCCCTCCTAGACCTAGAACTACCGCTACACCTATCAGACATTGCTAAGTTAAGGCGAATTTACAGAGAAGAAG
ATGAGCTACTAACCTCGACGGAGAGTTGGAAAAGTCTTGAAAATGTCGAACGGCTCGGATCAGAAACGGGACGGTGGTTAAGTGCTCAAGAGCACCTCGAAAGTCGTGAC
GAGCGTAAAGTTCTTAAGTGGCCATTTAATGTTGCTCTCGGAGCCCGTGATTGCAGTGCCCCCCCCAATGCTGTATTACTAGCAGATGACATCGACAGAGAGATCAGGGC
ATATGTAGCTCCAGCATTTTATAATTTCAACCCAGTAATCACGGAGCCTGAAATTGCAGCCCCAAAGTTTGAACTAAAACCAGTAATGTTTCAGATGCTCCAGACAGTGG
GTCAGTTTCACGGGCATCCTACTGAAGACCTACATTCACATCTGAAGTTTTTTATGGGGGTATGCAATTCGTTTAAGGATGAAGGATGCAACAAAGAAGTGTTGCGGCTT
AAGTTGTTCCCCTATTCACTTAAAGATGAAGCCATGACTTGGTTAAAGTCGCTACCTTCAGAATTCATTACAAGTGGGGATGACTTGGCCGAGAAATTCTTCATGAAGTA
CTTCCCACCCAGCAAGAACGCTAAGTGCATCCAGATTGAAACGTATTACAATCGATTGGACGACGTTACACGTCTGGTCATCGATGCCTCAGCAAATGGCACATTGCCAG
CAAAACCTTATTCTGAAGCATTCAATATCTTGGAAATATCATTGAACAACCATTCATGGTCAGACCTTACAGCTATTCAAGGTAGAGGAGGTAAGGGACTTAATGAATCT
GAATCATACTTTGCTTTGAACTCAAAAATTGAAAACTTGACAGATTTAGTGATGAGAAGCATGACACAGCAGAGCACAGTGGCAGCATCTACTGGTAATGTTAGCCACAG
TCAGGGGATTTCTTAA
Protein sequenceShow/hide protein sequence
MHAGRPHRHERETCYRRRRPTPPPQRTLPLGEATREDSPKGDLKLSSELALLDLELPLHLSDIAKLRRIYREEDELLTSTESWKSLENVERLGSETGRWLSAQEHLESRD
ERKVLKWPFNVALGARDCSAPPNAVLLADDIDREIRAYVAPAFYNFNPVITEPEIAAPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLKFFMGVCNSFKDEGCNKEVLRL
KLFPYSLKDEAMTWLKSLPSEFITSGDDLAEKFFMKYFPPSKNAKCIQIETYYNRLDDVTRLVIDASANGTLPAKPYSEAFNILEISLNNHSWSDLTAIQGRGGKGLNES
ESYFALNSKIENLTDLVMRSMTQQSTVAASTGNVSHSQGIS