; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g17890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g17890
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr8:13565743..13566453
RNA-Seq ExpressionMoc08g17890
SyntenyMoc08g17890
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065392.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-4152.94Show/hide
Query:  GATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTS
        G   +   +GE+S +K   PD+ T          DR+KFKKVEMP+FTG DP+SWLFRAERYF+IHKLTE EK +VS I FDG AL WYR  E+R +F S
Subjt:  GATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTS

Query:  WENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE
        W NL+ R+L RF+  +EG  C R L I+QE TV E+   F+ L APL  L D V+E  F+ GL P +RAE
Subjt:  WENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE

KAA0066078.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]2.6e-4145.65Show/hide
Query:  TVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQL-GESSLQKNK-QPDDPTR-----IIQIHEPGYDRNKFK
        TV  L  + +  +++   L   +  L  K D QQ    +   ++  +M +V    + T S L G SS  K     DD T+     +        +R+KFK
Subjt:  TVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQL-GESSLQKNK-QPDDPTR-----IIQIHEPGYDRNKFK

Query:  KVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETF
        KVEMP+FTG DPDSWLFRA+RYF+IHKLT+ EK  VSVISFDG AL W+R  E+RN+FT W NL+ R+LERFR  +EG    +  A KQ  TV E+   F
Subjt:  KVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETF

Query:  EALAAPLPHLSDEVLECAFLNGLDPVVRAE
        + L APL  LSD+VLE  F+NGL P +RAE
Subjt:  EALAAPLPHLSDEVLECAFLNGLDPVVRAE

XP_022148929.1 uncharacterized protein LOC111017476 [Momordica charantia]1.1e-6056.52Show/hide
Query:  MTVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPI
        M V+ELE+RC A EKE+ D+KG I  +  K++EQ+ +S R Q MLESFM+++ GG T ++SQLGES +QKNK  ++ TR++QI++  YDRNKFKKVEMPI
Subjt:  MTVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPI

Query:  FTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAP
        F                              VISFD VALAWYR+HE R++FT W+NLR R L+RFRK+KEGRQCAR+LAIKQEG+VAE+ E FEAL+AP
Subjt:  FTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAP

Query:  LPHLSDEVLECAFLNGLDPVVRAESWPQNP
        LP LSDEVLEC FLNGLD VVR++     P
Subjt:  LPHLSDEVLECAFLNGLDPVVRAESWPQNP

XP_022154744.1 uncharacterized protein LOC111021922 [Momordica charantia]8.1e-4343.56Show/hide
Query:  RELETRCEATEKEIVDLKGTIAALCGKMDEQQKIS-----------TRTQMMLESFMADVTGGAT--LTMSQLGESSL-----QKNKQPDDPTRIIQIHE
        ++LE RCE+TEKE+  +K  +  +  +++EQ+K S             TQ  L+ FM +V+ G +     ++ G S+L     ++N +    T   +   
Subjt:  RELETRCEATEKEIVDLKGTIAALCGKMDEQQKIS-----------TRTQMMLESFMADVTGGAT--LTMSQLGESSL-----QKNKQPDDPTRIIQIHE

Query:  PGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEG
        PG +RNKFKKVEMP+FTG DP+SWLFRAERYFEI+ L+EEEK  V++ISF+G A+ WY + E R  F  W NL+ RM ERF  +K+    +R L+IK+EG
Subjt:  PGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEG

Query:  TVAEFIETFEALAAPLPHLSDEVLE
        TV E+ +TFEA +A LP ++++V+E
Subjt:  TVAEFIETFEALAAPLPHLSDEVLE

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus]2.6e-4148.45Show/hide
Query:  MDEQQKISTRTQMMLESFMADVTGGATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIV
        MD   K  T T   L  + A  TGG     S  GESS  +  + +   R  +  E   +R+KFKKVEMP+F G DPDSWLFRA+RYF+IHKL++ EK +V
Subjt:  MDEQQKISTRTQMMLESFMADVTGGATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIV

Query:  SVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE
        + ISF+G AL WYR  E+R++FT W NL+ R+L RFR ++EG  C + L IKQ+ TV E+   F+ L APL  L D V+E  F+NGL P ++AE
Subjt:  SVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE

TrEMBL top hitse value%identityAlignment
A0A5A7SKP9 Ty3/gypsy retrotransposon protein3.7e-4141.25Show/hide
Query:  MTVRELETRCEATEKEIVDLK----------GTIAALCGKMDEQQKISTRTQMMLESFM-------ADVTGGAT----------------LTMSQLGESS
        M    +E R E  ++EI  +K          G +A +   +D  +  S + Q +L + +       + ++G AT                 + S++ ES 
Subjt:  MTVRELETRCEATEKEIVDLK----------GTIAALCGKMDEQQKISTRTQMMLESFM-------ADVTGGAT----------------LTMSQLGESS

Query:  LQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFR
          +N + D   R I   +   DRNKFKK+EMP+FTG DPDSWLFRAERYF+IH+LTE EK +VS ISFDG AL WYR  E+RNRF SW N++ R+L RFR
Subjt:  LQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFR

Query:  KAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE
         +K+G    + L IKQEG+V E+I  F+ + AP+  L + V+E  F+NGL P VR+E
Subjt:  KAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE

A0A5A7VAR4 Ty3/gypsy retrotransposon protein1.3e-4152.94Show/hide
Query:  GATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTS
        G   +   +GE+S +K   PD+ T          DR+KFKKVEMP+FTG DP+SWLFRAERYF+IHKLTE EK +VS I FDG AL WYR  E+R +F S
Subjt:  GATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTS

Query:  WENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE
        W NL+ R+L RF+  +EG  C R L I+QE TV E+   F+ L APL  L D V+E  F+ GL P +RAE
Subjt:  WENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPVVRAE

A0A5D3BLG2 Transposon Tf2-1 polyprotein isoform X11.3e-4145.65Show/hide
Query:  TVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQL-GESSLQKNK-QPDDPTR-----IIQIHEPGYDRNKFK
        TV  L  + +  +++   L   +  L  K D QQ    +   ++  +M +V    + T S L G SS  K     DD T+     +        +R+KFK
Subjt:  TVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQL-GESSLQKNK-QPDDPTR-----IIQIHEPGYDRNKFK

Query:  KVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETF
        KVEMP+FTG DPDSWLFRA+RYF+IHKLT+ EK  VSVISFDG AL W+R  E+RN+FT W NL+ R+LERFR  +EG    +  A KQ  TV E+   F
Subjt:  KVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETF

Query:  EALAAPLPHLSDEVLECAFLNGLDPVVRAE
        + L APL  LSD+VLE  F+NGL P +RAE
Subjt:  EALAAPLPHLSDEVLECAFLNGLDPVVRAE

A0A6J1D5H9 uncharacterized protein LOC1110174765.5e-6156.52Show/hide
Query:  MTVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPI
        M V+ELE+RC A EKE+ D+KG I  +  K++EQ+ +S R Q MLESFM+++ GG T ++SQLGES +QKNK  ++ TR++QI++  YDRNKFKKVEMPI
Subjt:  MTVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPI

Query:  FTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAP
        F                              VISFD VALAWYR+HE R++FT W+NLR R L+RFRK+KEGRQCAR+LAIKQEG+VAE+ E FEAL+AP
Subjt:  FTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAP

Query:  LPHLSDEVLECAFLNGLDPVVRAESWPQNP
        LP LSDEVLEC FLNGLD VVR++     P
Subjt:  LPHLSDEVLECAFLNGLDPVVRAESWPQNP

A0A6J1DN22 Reverse transcriptase3.9e-4343.56Show/hide
Query:  RELETRCEATEKEIVDLKGTIAALCGKMDEQQKIS-----------TRTQMMLESFMADVTGGAT--LTMSQLGESSL-----QKNKQPDDPTRIIQIHE
        ++LE RCE+TEKE+  +K  +  +  +++EQ+K S             TQ  L+ FM +V+ G +     ++ G S+L     ++N +    T   +   
Subjt:  RELETRCEATEKEIVDLKGTIAALCGKMDEQQKIS-----------TRTQMMLESFMADVTGGAT--LTMSQLGESSL-----QKNKQPDDPTRIIQIHE

Query:  PGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEG
        PG +RNKFKKVEMP+FTG DP+SWLFRAERYFEI+ L+EEEK  V++ISF+G A+ WY + E R  F  W NL+ RM ERF  +K+    +R L+IK+EG
Subjt:  PGYDRNKFKKVEMPIFTGTDPDSWLFRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEG

Query:  TVAEFIETFEALAAPLPHLSDEVLE
        TV E+ +TFEA +A LP ++++V+E
Subjt:  TVAEFIETFEALAAPLPHLSDEVLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding3.4e-0730.19Show/hide
Query:  ERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAF
        E YF  + + E+E+  +   + +G    W ++  K+N  TSW+  +  M    +   +         I+QEG+V E+ E FEAL      L  + LE  F
Subjt:  ERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAF

Query:  LNGLDP
        L GL P
Subjt:  LNGLDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTGAGAGAACTAGAGACAAGGTGCGAGGCAACGGAGAAGGAGATCGTCGATCTCAAAGGAACTATCGCTGCTCTGTGCGGGAAGATGGATGAACAACAAAAAAT
CAGTACCAGGACCCAGATGATGTTAGAGAGTTTTATGGCAGACGTCACCGGCGGGGCGACTTTGACGATGTCGCAGTTGGGAGAGTCTTCATTACAGAAGAACAAGCAAC
CAGACGATCCGACAAGGATCATCCAAATCCATGAACCAGGGTATGACCGCAACAAATTCAAGAAGGTGGAGATGCCCATCTTCACAGGTACGGATCCCGACTCATGGCTC
TTTCGTGCTGAACGTTATTTCGAAATTCACAAACTAACAGAGGAAGAGAAGCATATTGTGTCGGTAATAAGTTTTGATGGGGTTGCTTTGGCGTGGTATCGTTACCATGA
GAAGCGGAACAGGTTTACCAGTTGGGAAAATCTCAGGGGTCGAATGCTCGAGCGTTTCCGCAAGGCGAAAGAGGGTCGGCAATGCGCGCGGGTTTTGGCAATCAAACAAG
AGGGAACCGTTGCAGAGTTTATAGAGACCTTTGAAGCTCTGGCGGCGCCCCTTCCACACCTTTCCGATGAGGTGTTAGAGTGCGCCTTCTTGAATGGGCTTGACCCGGTG
GTCCGAGCGGAGAGTTGGCCACAGAACCCGTGGGCCTGGAACAGATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGTGAGAGAACTAGAGACAAGGTGCGAGGCAACGGAGAAGGAGATCGTCGATCTCAAAGGAACTATCGCTGCTCTGTGCGGGAAGATGGATGAACAACAAAAAAT
CAGTACCAGGACCCAGATGATGTTAGAGAGTTTTATGGCAGACGTCACCGGCGGGGCGACTTTGACGATGTCGCAGTTGGGAGAGTCTTCATTACAGAAGAACAAGCAAC
CAGACGATCCGACAAGGATCATCCAAATCCATGAACCAGGGTATGACCGCAACAAATTCAAGAAGGTGGAGATGCCCATCTTCACAGGTACGGATCCCGACTCATGGCTC
TTTCGTGCTGAACGTTATTTCGAAATTCACAAACTAACAGAGGAAGAGAAGCATATTGTGTCGGTAATAAGTTTTGATGGGGTTGCTTTGGCGTGGTATCGTTACCATGA
GAAGCGGAACAGGTTTACCAGTTGGGAAAATCTCAGGGGTCGAATGCTCGAGCGTTTCCGCAAGGCGAAAGAGGGTCGGCAATGCGCGCGGGTTTTGGCAATCAAACAAG
AGGGAACCGTTGCAGAGTTTATAGAGACCTTTGAAGCTCTGGCGGCGCCCCTTCCACACCTTTCCGATGAGGTGTTAGAGTGCGCCTTCTTGAATGGGCTTGACCCGGTG
GTCCGAGCGGAGAGTTGGCCACAGAACCCGTGGGCCTGGAACAGATCATGA
Protein sequenceShow/hide protein sequence
MTVRELETRCEATEKEIVDLKGTIAALCGKMDEQQKISTRTQMMLESFMADVTGGATLTMSQLGESSLQKNKQPDDPTRIIQIHEPGYDRNKFKKVEMPIFTGTDPDSWL
FRAERYFEIHKLTEEEKHIVSVISFDGVALAWYRYHEKRNRFTSWENLRGRMLERFRKAKEGRQCARVLAIKQEGTVAEFIETFEALAAPLPHLSDEVLECAFLNGLDPV
VRAESWPQNPWAWNRS