; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01430
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr1:983847..998214
RNA-Seq ExpressionMoc01g01430
SyntenyMoc01g01430
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]1.4e-3460.8Show/hide
Query:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN
        AAP F   NP I +PEI AP+FELKPVMFQML+ V QF G P ED H  L+ F+ V +SFK +G S++VLRLK FP+SLRD AR+WL +LP DS+ +W++
Subjt:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN

Query:  LAENFLMKYFPPSKNAKYISEINNF
        LAE FL KYFPP++NAK+ SEI +F
Subjt:  LAENFLMKYFPPSKNAKYISEINNF

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]7.2e-3677.55Show/hide
Query:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF
        MFQML+ V +FHGH  ED H  LKF MGVCNSFKDEG SK+V+RLK FP+SLRDEARTWLESLPS+SI SWD+LAE FLMKYFPP+KNAKY +EINNF
Subjt:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]8.5e-3780.61Show/hide
Query:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF
        MFQML+ V QFHGH  ED H  LKFFMGVCNSFK+EG S +VLRLK FPYSLRDEARTWLESLP +SI SWD+LAE FLMKYFPPSKNAKY SEINNF
Subjt:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF

XP_030483210.1 uncharacterized protein LOC115699807 [Cannabis sativa]3.0e-3462.4Show/hide
Query:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN
        AAP F   NP I +PEI AP+FELKPVMFQML+ V QF G P ED H  L+ FM V +SFK  G ++  LRLK FPYSLRD+AR WL SLPS S+ +W  
Subjt:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN

Query:  LAENFLMKYFPPSKNAKYISEINNF
        LAE FLMKYFPP+KNAK   EI +F
Subjt:  LAENFLMKYFPPSKNAKYISEINNF

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]2.0e-3359.2Show/hide
Query:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN
        AAP F   NP I +PEI AP FELKPVMFQML+ V QF G P ED H  ++ F+ V +SFK +G S++ LRLK FP+SLRD AR WL +LP DS+ +W++
Subjt:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN

Query:  LAENFLMKYFPPSKNAKYISEINNF
        LAE FL KYFPP++NAK+ SEI +F
Subjt:  LAENFLMKYFPPSKNAKYISEINNF

TrEMBL top hitse value%identityAlignment
A0A6J1DTD1 uncharacterized protein LOC1110241363.5e-3677.55Show/hide
Query:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF
        MFQML+ V +FHGH  ED H  LKF MGVCNSFKDEG SK+V+RLK FP+SLRDEARTWLESLPS+SI SWD+LAE FLMKYFPP+KNAKY +EINNF
Subjt:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF

A0A6J1E1F3 uncharacterized protein LOC1110250654.1e-3780.61Show/hide
Query:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF
        MFQML+ V QFHGH  ED H  LKFFMGVCNSFK+EG S +VLRLK FPYSLRDEARTWLESLP +SI SWD+LAE FLMKYFPPSKNAKY SEINNF
Subjt:  MFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKNAKYISEINNF

A0A6J1H7E4 uncharacterized protein LOC1114611686.4e-3054.4Show/hide
Query:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN
        A P     NP I +PE+ A  FELKPVMFQML+ + QFHG P ED H  LK F+GV +SF+ +G  K V+RL  FPYSLRD A++WL +L   +I SW++
Subjt:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN

Query:  LAENFLMKYFPPSKNAKYISEINNF
        LAE FL+KYFPP++NA++ +EI  F
Subjt:  LAENFLMKYFPPSKNAKYISEINNF

U5CUI2 Retrotrans_gag domain-containing protein6.6e-3560.8Show/hide
Query:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN
        AAP F   NP I +PEI AP+FELKPVMFQML+ V QF G P ED H  L+ F+ V +SFK +G S++VLRLK FP+SLRD AR+WL +LP DS+ +W++
Subjt:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN

Query:  LAENFLMKYFPPSKNAKYISEINNF
        LAE FL KYFPP++NAK+ SEI +F
Subjt:  LAENFLMKYFPPSKNAKYISEINNF

W9RHB3 Putative disease resistance RPP13-like protein 11.2e-2857.26Show/hide
Query:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN
        A P     NPVI +P I AP FELKPVMFQML+IV QF G   +D H  L+ FM V + FK  G   + LRL  FP+SLRD AR W  SLP+DSI +W++
Subjt:  AAPTFYNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDN

Query:  LAENFLMKYFPPSKNAK
        +AE FLMKYFP +KNAK
Subjt:  LAENFLMKYFPPSKNAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTATGCAAAGGTCTGCACAACGATGCTTTACAAACCAAGCTCGAACCCGGTCCCCGGCTCGACCTGAACACAAGAGTGGACCTGTACAAGAGGGTAAACACTTCGA
CGCTCAAGTAGTGGATCCGACAGTACTCACGACCGGCGGTTACATGTTTTTTCTCATGTCGGACCTGTCGGGTTCCAAGCAGATCGAACCCCAGTCAGATTGGACTTCGA
CATGCATACTTAGCCTTTTCCCAATGGGCGAACCCGACCTCTTCGGCAGGGCTAAGTCTTTCAACCCAAGCACCTTTAGGCCCTCTACTTCTCAAAGTCCGCCTTTAGAA
AATCCACCTTCAAAGAGGAGAAAAACTCGTCGTCGCAAGCGAAATCAGTCTTCAAAGGGTAGTGGAGATAATTTCGATTGTACCCATTCTAACTGTCTTGAGGAAGTGTT
GGGAATCCTTAGGTATAACTACTCAATTCCGAAGGACATTGAATTGAGAATTCCTATGGAAGGTGAGTCGATTAACAATCCTCTCGTTGGCTGGATCATCGCTTCCGCTA
CGGGATTAACATCCCATCAAGTGGTTGATGACTACCATCGAATGAGCTCCTTAATGCGATGTCCTTATGGAATTCACATTCTGTCAGCTTTGATTGACCCTCCTCCTGAA
CTGTCAAAAGAAACCAAAGCTGTTTTGACTGGTTGCGCCACCTTATCACCCAGATACTGGTATAGTCCTGACCTATTAACAAACAGAAACTTAAGAGATTTTGGTCTTGC
TGCTGAACTTACCGAAGAAGAGATGCCTCCTCAACCTTCCAAGTTTAGCAGTAGTCGTCATCGTGACATTCCTATTGGAAGCGGTGCAACGAAGTCCCTCAACAAGGGGA
ATAAACCTATGAACAACTCTCTACCGGGCAATAGAGGAAGGACCAACGATCAAAGCTTGCCTCCGAGAGATGACATAAAAACGAAGATCGTCCTTCGGACTGTGCTTCAG
ATAGTAAAGAGACATAAAGGTGGTGGCGGTCAAAGGGAGATCGTCGCCAAGGAAGGATTACAAGATATAAAAACCAATCAAGGTGCAGCAATCGTTAGGAACGATTTGCG
TCGGATGGGGGTAGCTCTGGGAGCTTTAGGTCGCCTCTGGGACTCCCCCCACAAGATAGATCCCCTAGCAGTACCTTCTCAAATTGAAGCTCAATCCGCCCCCTCCGAAA
AGTATGAAGGGAGTCCCTTCGATCTGGCCGATGGAACAGTGGCCTTCATTAGGGATAGGTATGAGGTCCCTACGTCCCTTTGGATGAGGTTTCTCATGAAAGACAAGATC
ATTGTGAATCTACTGGAGGGGCATGTAGCCTTTTACGAAAAAATGTTCGAGTTCGTGATTAGGGTTCCCTTGCACCGTTTTGCCCAAGAGTTCTTGGCCGAGATGAACAT
CGCCCCTCACGACTCGCTCCCAACGACAAGGGCATCTTCCTTGGTTGGTGGATGCTATGGTGGGGGCTTGGTGTGGACACTGTTGGTGCTTCTACCGCACCCCGAGCGGC
TCTTGTCCTTCTATACCGTAAGAAACCTACCTAAGAACCATGGTTGGTTCTACCTCACTACTCGGCAGATGACATCGCTAGAGAAATCAAGGCGTGCAGCTCCGACATTT
TATAATTTCAACCCAGTAATCACGAAGCCAGAAATTGCAGCCCCAAAATTTGAACTCAAGCCAGTGATGTTTCAGATGCTCAAGATAGTGGTCCAATTTCACGGACATCC
TATGGAGGACACGCATTCGCCTCTGAAGTTTTTTATGGGAGTTTGCAATTCGTTCAAGGATGAAGGATGTAGCAAACAAGTGTTGCGGCTTAAGTCGTTCCCTTATTCAC
TCAGAGATGAAGCGAGAACATGGTTGGAGTCACTTCCTTCAGATTCAATTAAAAGTTGGGACAACTTGGCCGAAAATTTTTTGATGAAGTACTTTCCACCTAGCAAAAAT
GCTAAGTACATAAGCGAAATCAACAACTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTATGCAAAGGTCTGCACAACGATGCTTTACAAACCAAGCTCGAACCCGGTCCCCGGCTCGACCTGAACACAAGAGTGGACCTGTACAAGAGGGTAAACACTTCGA
CGCTCAAGTAGTGGATCCGACAGTACTCACGACCGGCGGTTACATGTTTTTTCTCATGTCGGACCTGTCGGGTTCCAAGCAGATCGAACCCCAGTCAGATTGGACTTCGA
CATGCATACTTAGCCTTTTCCCAATGGGCGAACCCGACCTCTTCGGCAGGGCTAAGTCTTTCAACCCAAGCACCTTTAGGCCCTCTACTTCTCAAAGTCCGCCTTTAGAA
AATCCACCTTCAAAGAGGAGAAAAACTCGTCGTCGCAAGCGAAATCAGTCTTCAAAGGGTAGTGGAGATAATTTCGATTGTACCCATTCTAACTGTCTTGAGGAAGTGTT
GGGAATCCTTAGGTATAACTACTCAATTCCGAAGGACATTGAATTGAGAATTCCTATGGAAGGTGAGTCGATTAACAATCCTCTCGTTGGCTGGATCATCGCTTCCGCTA
CGGGATTAACATCCCATCAAGTGGTTGATGACTACCATCGAATGAGCTCCTTAATGCGATGTCCTTATGGAATTCACATTCTGTCAGCTTTGATTGACCCTCCTCCTGAA
CTGTCAAAAGAAACCAAAGCTGTTTTGACTGGTTGCGCCACCTTATCACCCAGATACTGGTATAGTCCTGACCTATTAACAAACAGAAACTTAAGAGATTTTGGTCTTGC
TGCTGAACTTACCGAAGAAGAGATGCCTCCTCAACCTTCCAAGTTTAGCAGTAGTCGTCATCGTGACATTCCTATTGGAAGCGGTGCAACGAAGTCCCTCAACAAGGGGA
ATAAACCTATGAACAACTCTCTACCGGGCAATAGAGGAAGGACCAACGATCAAAGCTTGCCTCCGAGAGATGACATAAAAACGAAGATCGTCCTTCGGACTGTGCTTCAG
ATAGTAAAGAGACATAAAGGTGGTGGCGGTCAAAGGGAGATCGTCGCCAAGGAAGGATTACAAGATATAAAAACCAATCAAGGTGCAGCAATCGTTAGGAACGATTTGCG
TCGGATGGGGGTAGCTCTGGGAGCTTTAGGTCGCCTCTGGGACTCCCCCCACAAGATAGATCCCCTAGCAGTACCTTCTCAAATTGAAGCTCAATCCGCCCCCTCCGAAA
AGTATGAAGGGAGTCCCTTCGATCTGGCCGATGGAACAGTGGCCTTCATTAGGGATAGGTATGAGGTCCCTACGTCCCTTTGGATGAGGTTTCTCATGAAAGACAAGATC
ATTGTGAATCTACTGGAGGGGCATGTAGCCTTTTACGAAAAAATGTTCGAGTTCGTGATTAGGGTTCCCTTGCACCGTTTTGCCCAAGAGTTCTTGGCCGAGATGAACAT
CGCCCCTCACGACTCGCTCCCAACGACAAGGGCATCTTCCTTGGTTGGTGGATGCTATGGTGGGGGCTTGGTGTGGACACTGTTGGTGCTTCTACCGCACCCCGAGCGGC
TCTTGTCCTTCTATACCGTAAGAAACCTACCTAAGAACCATGGTTGGTTCTACCTCACTACTCGGCAGATGACATCGCTAGAGAAATCAAGGCGTGCAGCTCCGACATTT
TATAATTTCAACCCAGTAATCACGAAGCCAGAAATTGCAGCCCCAAAATTTGAACTCAAGCCAGTGATGTTTCAGATGCTCAAGATAGTGGTCCAATTTCACGGACATCC
TATGGAGGACACGCATTCGCCTCTGAAGTTTTTTATGGGAGTTTGCAATTCGTTCAAGGATGAAGGATGTAGCAAACAAGTGTTGCGGCTTAAGTCGTTCCCTTATTCAC
TCAGAGATGAAGCGAGAACATGGTTGGAGTCACTTCCTTCAGATTCAATTAAAAGTTGGGACAACTTGGCCGAAAATTTTTTGATGAAGTACTTTCCACCTAGCAAAAAT
GCTAAGTACATAAGCGAAATCAACAACTTTTAG
Protein sequenceShow/hide protein sequence
MFMQRSAQRCFTNQARTRSPARPEHKSGPVQEGKHFDAQVVDPTVLTTGGYMFFLMSDLSGSKQIEPQSDWTSTCILSLFPMGEPDLFGRAKSFNPSTFRPSTSQSPPLE
NPPSKRRKTRRRKRNQSSKGSGDNFDCTHSNCLEEVLGILRYNYSIPKDIELRIPMEGESINNPLVGWIIASATGLTSHQVVDDYHRMSSLMRCPYGIHILSALIDPPPE
LSKETKAVLTGCATLSPRYWYSPDLLTNRNLRDFGLAAELTEEEMPPQPSKFSSSRHRDIPIGSGATKSLNKGNKPMNNSLPGNRGRTNDQSLPPRDDIKTKIVLRTVLQ
IVKRHKGGGGQREIVAKEGLQDIKTNQGAAIVRNDLRRMGVALGALGRLWDSPHKIDPLAVPSQIEAQSAPSEKYEGSPFDLADGTVAFIRDRYEVPTSLWMRFLMKDKI
IVNLLEGHVAFYEKMFEFVIRVPLHRFAQEFLAEMNIAPHDSLPTTRASSLVGGCYGGGLVWTLLVLLPHPERLLSFYTVRNLPKNHGWFYLTTRQMTSLEKSRRAAPTF
YNFNPVITKPEIAAPKFELKPVMFQMLKIVVQFHGHPMEDTHSPLKFFMGVCNSFKDEGCSKQVLRLKSFPYSLRDEARTWLESLPSDSIKSWDNLAENFLMKYFPPSKN
AKYISEINNF