; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g14980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g14980
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:11184080..11188126
RNA-Seq ExpressionMoc11g14980
SyntenyMoc11g14980
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAG1857269.1 unnamed protein product, partial [Musa acuminata subsp. malaccensis]8.5e-1467.69Show/hide
Query:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        S+ K S+ DE RL EKWDRSN + LM+IK GI EAFRGA+SEGIT  KD L EIEKRF +NDKA+
Subjt:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE

XP_009410235.1 PREDICTED: uncharacterized protein LOC103992314 [Musa acuminata subsp. malaccensis]8.5e-1467.69Show/hide
Query:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        S+ K S+ DE RL EKWDRSN + LM+IK GI EAFRGA+SEGIT  KD L EIEKRF +NDKA+
Subjt:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE

XP_022871195.1 uncharacterized protein LOC111390390 [Olea europaea var. sylvestris]4.2e-1365.62Show/hide
Query:  MKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        + ++SSPDE R  E+W RSN + LM+IK GI EAFRGA+SEGITN K+ L EIEKRF +NDKAE
Subjt:  MKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE

XP_022874595.1 uncharacterized protein LOC111393333 [Olea europaea var. sylvestris]8.5e-1461.97Show/hide
Query:  WDYVQPSMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        W      + ++SS DE R  E+WDRSN + LM+IK GILEAFRGA+SEGITN K+ L EIEKRF +NDKAE
Subjt:  WDYVQPSMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE

XP_022878635.1 uncharacterized protein LOC111396430 [Olea europaea var. sylvestris]6.5e-1467.19Show/hide
Query:  MKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        + K+SSPDE    EKWDRSN + LM+IK GIL+AFRGA +E ++NTK+ LAEIEKRFA+NDKAE
Subjt:  MKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE

TrEMBL top hitse value%identityAlignment
A0A0K9QKP0 Uncharacterized protein1.0e-1262.12Show/hide
Query:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAEM
        S+   SS DE +  EKWDRSN + LM+IK GI E FRGA+S+ +TN KD LAEIEKRFA++DKAE+
Subjt:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAEM

A0A0K9QSI7 Uncharacterized protein2.0e-1362.69Show/hide
Query:  PSMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAEM
        PS+   SS DE +  EKWDRSN + LM+IK GI E FRGA+S+ +TN KD LAEIEKRFA++DKAE+
Subjt:  PSMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAEM

A0A3Q0F8I9 uncharacterized protein LOC106767994 isoform X18.5e-1262.32Show/hide
Query:  PSMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAEMKA
        P +  +S+ DEM   EKWDRSN + LM+IK GI E FRGAISE IT+ KD L EIEK FA NDKAE  A
Subjt:  PSMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAEMKA

A0A443PLH5 Uncharacterized protein2.2e-1263.08Show/hide
Query:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        S+   SSPD+ +  EKWDRSN + LM+IK GI EAFRGA+SE +T  K+ LAEIEKRF +NDKAE
Subjt:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE

A0A4Q3EHJ0 gag_pre-integrs domain-containing protein (Fragment)2.7e-1366.15Show/hide
Query:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        S+   SS D+ R+ EKWDRSN + LM+IK GI EAFRGA+SE ITN KD LAEIEKRF ++DKAE
Subjt:  SMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53670.1 unknown protein4.3e-0841.27Show/hide
Query:  KKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE
        ++ SSP E++    WDRSN + +M++K  I + FRG + + +T  KD LA +E  FA+N++AE
Subjt:  KKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCCCATCAGACGCGAATTTCAACAGAGCTGGTAGAGTTGAAGTTCATGGTGGCAGCATTAGCCGAGCAAATCACTACTCGGGATGCTGAAATAAAAAGCCTCAT
GGACCGTCTAACCAAGTCGGATGCCCTAGCCTCCTCTGCTCAACACCGCGCATTCGACTTTTTGCTGGACCATAGCTCCAGTTCAGCAGAGGAAGCAGTGCAGAAGGATA
GTGGCTCTGAAGACGAGGAGAGCGGAAGCATAGCATCTCTGCCAAAGCATCATGAGTTTCACTCTAGTTTTGAGTTAGCAGCATCTTTCAGGCGGCAGGATTATATTGGT
TCGCCCATTTCTTGGGATTACGTCCAGCCTTCTATGAAAAAGCAAAGTTCTCCTGATGAAATGAGACTTCGTGAGAAGTGGGATCGCTCAAACCCCATATATCTAATGGT
CATAAAGAGTGGCATTCTAGAGGCATTTAGGGGTGCAATATCTGAAGGAATAACTAATACGAAAGATTCCCTTGCTGAAATAGAAAAGCGTTTTGCCAGAAACGATAAGG
CTGAAATGAAAGCATGCTCTTACAACGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGCCCATCAGACGCGAATTTCAACAGAGCTGGTAGAGTTGAAGTTCATGGTGGCAGCATTAGCCGAGCAAATCACTACTCGGGATGCTGAAATAAAAAGCCTCAT
GGACCGTCTAACCAAGTCGGATGCCCTAGCCTCCTCTGCTCAACACCGCGCATTCGACTTTTTGCTGGACCATAGCTCCAGTTCAGCAGAGGAAGCAGTGCAGAAGGATA
GTGGCTCTGAAGACGAGGAGAGCGGAAGCATAGCATCTCTGCCAAAGCATCATGAGTTTCACTCTAGTTTTGAGTTAGCAGCATCTTTCAGGCGGCAGGATTATATTGGT
TCGCCCATTTCTTGGGATTACGTCCAGCCTTCTATGAAAAAGCAAAGTTCTCCTGATGAAATGAGACTTCGTGAGAAGTGGGATCGCTCAAACCCCATATATCTAATGGT
CATAAAGAGTGGCATTCTAGAGGCATTTAGGGGTGCAATATCTGAAGGAATAACTAATACGAAAGATTCCCTTGCTGAAATAGAAAAGCGTTTTGCCAGAAACGATAAGG
CTGAAATGAAAGCATGCTCTTACAACGCTTGA
Protein sequenceShow/hide protein sequence
MMAHQTRISTELVELKFMVAALAEQITTRDAEIKSLMDRLTKSDALASSAQHRAFDFLLDHSSSSAEEAVQKDSGSEDEESGSIASLPKHHEFHSSFELAASFRRQDYIG
SPISWDYVQPSMKKQSSPDEMRLREKWDRSNPIYLMVIKSGILEAFRGAISEGITNTKDSLAEIEKRFARNDKAEMKACSYNA