; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g07920 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g07920
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr5:5802305..5802750
RNA-Seq ExpressionMoc05g07920
SyntenyMoc05g07920
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU44375.1 hypothetical protein TSUD_243070 [Trifolium subterraneum]2.9e-2545.62Show/hide
Query:  EIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-GPPNAEITNN---------------DNGLLTSWLLGIVSEEVLAMIEGLES
        E  LTIQSFHQCSSLISIKLST N+LLWKSQ+LPL+RSLG+++H+  D   P+ EIT++               ++GLLTSWLLG + EE ++MI G ++
Subjt:  EIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-GPPNAEITNN---------------DNGLLTSWLLGIVSEEVLAMIEGLES

Query:  AH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
        AH                             +GN+SL++YI+KFK LCDKL A+ KP+ +
Subjt:  AH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

KAF7827659.1 Retrovirus-related Pol polyprotein from transposon RE1 [Senna tora]2.6e-2141.98Show/hide
Query:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLINDGPPNAEITNN---------------DNGLLTSWLLGIVSEEVLAMIEG
        + +TE  L+IQSFHQCSS ISIKLST N LLW++Q+ PLVRSLGV +HL +   P  EI  +               ++GLLTSWLLG + EEVL MI G
Subjt:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLINDGPPNAEITNN---------------DNGLLTSWLLGIVSEEVLAMIEG

Query:  ------------------LESAH----------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
                          +E A           +G+ SLE+Y+++FK++CD LAA+K+ + +
Subjt:  ------------------LESAH----------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

PNY16899.1 copia-like polyprotein, partial [Trifolium pratense]2.5e-2445.4Show/hide
Query:  AKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHL-INDGPPNAEI-------TNNDN--------GLLTSWLLGIVSEEVLAMIEG
        A  E  LTIQSFHQCSSL+S+KLST N+LLWKSQ+LPL+RSLG+++H+  N   P+ EI       TNN N        GLLTSWLLG + EE L+MI G
Subjt:  AKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHL-INDGPPNAEI-------TNNDN--------GLLTSWLLGIVSEEVLAMIEG

Query:  LESAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
         ++A+                             +GN+SL++YI+KFK LC+KL+A+ KP+ +
Subjt:  LESAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

RVW35662.1 hypothetical protein CK203_108171 [Vitis vinifera]2.0e-2139.13Show/hide
Query:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-------------GPPNAEITNNDNGLLTSWLLGIVSEEVLAMIEGLE
        MA  E  L+IQ+FHQCSSL+SIKL+  N LLW+SQVLPLVRSLG+ +HL  +                + E  ++++GLLTSWLLG+++EEV+ +++G E
Subjt:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-------------GPPNAEITNNDNGLLTSWLLGIVSEEVLAMIEGLE

Query:  SAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
        +A+                             +G  SL++Y+++FK +CD LAA++KP+ +
Subjt:  SAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

XP_022154021.1 uncharacterized protein LOC111021379 [Momordica charantia]3.2e-2744.79Show/hide
Query:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLINDGP----------PNAEITN-----NDNGLLTSWLLGIVSEEVLAMIEG
        MA  E  LT+QSFHQCSSLIS+KL++ NYLLWKSQVLPL+R+LG+++HL  + P           +A  T      N++GLLTSWLLGI++E+VL ++EG
Subjt:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLINDGP----------PNAEITN-----NDNGLLTSWLLGIVSEEVLAMIEG

Query:  LESAHQ-----------------------------GNMSLEDYIKKFKALCDKLAAMKKPMDE
         E+A +                             G++S+++YI+KFK LCD+L AMKKP+D+
Subjt:  LESAHQ-----------------------------GNMSLEDYIKKFKALCDKLAAMKKPMDE

TrEMBL top hitse value%identityAlignment
A0A2K3PNP5 Copia-like polyprotein (Fragment)1.2e-2445.4Show/hide
Query:  AKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHL-INDGPPNAEI-------TNNDN--------GLLTSWLLGIVSEEVLAMIEG
        A  E  LTIQSFHQCSSL+S+KLST N+LLWKSQ+LPL+RSLG+++H+  N   P+ EI       TNN N        GLLTSWLLG + EE L+MI G
Subjt:  AKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHL-INDGPPNAEI-------TNNDN--------GLLTSWLLGIVSEEVLAMIEG

Query:  LESAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
         ++A+                             +GN+SL++YI+KFK LC+KL+A+ KP+ +
Subjt:  LESAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

A0A2Z6P7T0 Reverse transcriptase Ty1/copia-type domain-containing protein1.4e-2545.62Show/hide
Query:  EIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-GPPNAEITNN---------------DNGLLTSWLLGIVSEEVLAMIEGLES
        E  LTIQSFHQCSSLISIKLST N+LLWKSQ+LPL+RSLG+++H+  D   P+ EIT++               ++GLLTSWLLG + EE ++MI G ++
Subjt:  EIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-GPPNAEITNN---------------DNGLLTSWLLGIVSEEVLAMIEGLES

Query:  AH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
        AH                             +GN+SL++YI+KFK LCDKL A+ KP+ +
Subjt:  AH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

A0A438DJK4 Uncharacterized protein9.6e-2239.13Show/hide
Query:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-------------GPPNAEITNNDNGLLTSWLLGIVSEEVLAMIEGLE
        MA  E  L+IQ+FHQCSSL+SIKL+  N LLW+SQVLPLVRSLG+ +HL  +                + E  ++++GLLTSWLLG+++EEV+ +++G E
Subjt:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND-------------GPPNAEITNNDNGLLTSWLLGIVSEEVLAMIEGLE

Query:  SAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
        +A+                             +G  SL++Y+++FK +CD LAA++KP+ +
Subjt:  SAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

A0A6J1DMG5 uncharacterized protein LOC1110213791.5e-2744.79Show/hide
Query:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLINDGP----------PNAEITN-----NDNGLLTSWLLGIVSEEVLAMIEG
        MA  E  LT+QSFHQCSSLIS+KL++ NYLLWKSQVLPL+R+LG+++HL  + P           +A  T      N++GLLTSWLLGI++E+VL ++EG
Subjt:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLINDGP----------PNAEITN-----NDNGLLTSWLLGIVSEEVLAMIEG

Query:  LESAHQ-----------------------------GNMSLEDYIKKFKALCDKLAAMKKPMDE
         E+A +                             G++S+++YI+KFK LCD+L AMKKP+D+
Subjt:  LESAHQ-----------------------------GNMSLEDYIKKFKALCDKLAAMKKPMDE

A5BC00 Uncharacterized protein1.6e-2138.65Show/hide
Query:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND---------------GPPNAEITNNDNGLLTSWLLGIVSEEVLAMIEG
        MA  E  L+IQ+FHQCSSL+SIKL+  N LLW+SQVLPLVRSLG+ +HL  +                  + E  ++++GLLTSWLLG+++EEV+ +++G
Subjt:  MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLIND---------------GPPNAEITNNDNGLLTSWLLGIVSEEVLAMIEG

Query:  LESAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE
         E+A+                             +G  SL++Y+++FK +CD LAA++KP+ +
Subjt:  LESAH-----------------------------QGNMSLEDYIKKFKALCDKLAAMKKPMDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAAACTGAAATTGCTCTTACAATTCAATCTTTCCACCAATGTTCTAGCCTAATTTCTATCAAACTTAGCACCAGAAACTATTTGTTGTGGAAATCTCAAGTCCT
TCCACTTGTGAGAAGTCTTGGAGTTGATAATCACTTGATCAATGATGGTCCACCTAATGCAGAAATAACCAACAATGATAATGGATTGCTAACCTCTTGGTTACTTGGAA
TCGTTTCAGAGGAAGTCCTTGCAATGATTGAAGGTTTGGAATCAGCCCACCAGGGAAATATGTCTCTTGAAGATTATATCAAGAAATTTAAAGCCCTGTGTGATAAGTTA
GCTGCAATGAAGAAACCAATGGATGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAAACTGAAATTGCTCTTACAATTCAATCTTTCCACCAATGTTCTAGCCTAATTTCTATCAAACTTAGCACCAGAAACTATTTGTTGTGGAAATCTCAAGTCCT
TCCACTTGTGAGAAGTCTTGGAGTTGATAATCACTTGATCAATGATGGTCCACCTAATGCAGAAATAACCAACAATGATAATGGATTGCTAACCTCTTGGTTACTTGGAA
TCGTTTCAGAGGAAGTCCTTGCAATGATTGAAGGTTTGGAATCAGCCCACCAGGGAAATATGTCTCTTGAAGATTATATCAAGAAATTTAAAGCCCTGTGTGATAAGTTA
GCTGCAATGAAGAAACCAATGGATGAGTAA
Protein sequenceShow/hide protein sequence
MAKTEIALTIQSFHQCSSLISIKLSTRNYLLWKSQVLPLVRSLGVDNHLINDGPPNAEITNNDNGLLTSWLLGIVSEEVLAMIEGLESAHQGNMSLEDYIKKFKALCDKL
AAMKKPMDE