; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g17060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g17060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr2:12808384..12810751
RNA-Seq ExpressionMoc02g17060
SyntenyMoc02g17060
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]3.7e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]3.7e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]3.7e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]3.7e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

XP_022159298.1 uncharacterized protein LOC111025709 [Momordica charantia]2.3e-4461.29Show/hide
Query:  KPYAILASSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYI-KC
        KP   L SSS ++ S     NPA  DWIAKDHA MTL+NATLSP+AL+Y+VGC+S +++W  LVK+YSS+SRTN+V LK+ L+SI+KKP ESI  Y+ + 
Subjt:  KPYAILASSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYI-KC

Query:  KKFKDKLANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFV
        K+ KDKLAN+SV +D+EDL+IYTLNGLP E+N F TSM TR+  + FEELYVL V
Subjt:  KKFKDKLANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFV

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.8e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X31.8e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.8e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

A0A5D3CLI6 T4.51.8e-4260.39Show/hide
Query:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL
        ++S +  +   QSNP+ EDWIAKD A MT+INATLSP AL+YVVG  S K++WD+L K YSS SR+N+V LK+ L++I KKP ESI  YIK  K+ KDKL
Subjt:  SSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIK-CKKFKDKL

Query:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL
        AN+S  I++EDL+IY LNGLP EYNTFRTSM TR+ P+ FEEL+VL  A   +L
Subjt:  ANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSL

A0A6J1DYF1 uncharacterized protein LOC1110257091.1e-4461.29Show/hide
Query:  KPYAILASSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYI-KC
        KP   L SSS ++ S     NPA  DWIAKDHA MTL+NATLSP+AL+Y+VGC+S +++W  LVK+YSS+SRTN+V LK+ L+SI+KKP ESI  Y+ + 
Subjt:  KPYAILASSSPTVGSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYI-KC

Query:  KKFKDKLANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFV
        K+ KDKLAN+SV +D+EDL+IYTLNGLP E+N F TSM TR+  + FEELYVL V
Subjt:  KKFKDKLANISVTIDDEDLVIYTLNGLPAEYNTFRTSMHTRASPIQFEELYVLFV

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-0526.19Show/hide
Query:  EDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESI-AKKPTESIHQYIKCKKFKDKLANISVTIDDEDLVIYTL
        EDW   D    + I   LS   ++ ++  ++ + IW  L   Y S + TN + LK  L ++   + T  +           +LAN+ V I++ED  I  L
Subjt:  EDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESI-AKKPTESIHQYIKCKKFKDKLANISVTIDDEDLVIYTL

Query:  NGLPAEYNTFRTSMHTRASPIQFEEL
        N LP+ Y+   T++    + I+ +++
Subjt:  NGLPAEYNTFRTSMHTRASPIQFEEL

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.0e-0625.37Show/hide
Query:  VGSAVVQSNPADEDWIAKDHAFMTLINATLSPTAL--SYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIKCKKFKDKLANIS
        +   ++ +N  D +W  +D      +  TL+P     S+V    S ++IW  +   + +N     + L + L +           Y K KK  D L N+ 
Subjt:  VGSAVVQSNPADEDWIAKDHAFMTLINATLSPTAL--SYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIKCKKFKDKLANIS

Query:  VTIDDEDLVIYTLNGLPAEY-NTFRTSMHTRASP
        V + D +LV+Y LNGL  ++ N      H +  P
Subjt:  VTIDDEDLVIYTLNGLPAEY-NTFRTSMHTRASP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGGAAAGGCAAGTTCGAGGCATGCAAGAGATTGGCAGCGGAGCTTCTCGACACAACGTGACCTCAACGGGGGCAGTATTGATCAATCGGTCAAGAAGGAGCATTT
CCCGAGCAGGTTTCAAGTCGAAGTGGAGAACTTGGCATTACTCAAACGCCCTGGCCGTAAGAGACAGCCCCCACACAAGTGCGACCGGATTCGGTACCTTTGCTTCCACA
AGAATCATAGCCACAACACCACGGATTGCAACAACTTAAAGCAACAAGTCCAATCCCTTAGATATGTAGTTAAGCCCTATGCAATTTTGGCGTCTTCTTCGCCAACAGTT
GGATCTGCAGTCGTCCAATCCAATCCAGCCGATGAAGATTGGATCGCCAAGGATCATGCTTTCATGACCTTGATTAATGCTACTCTATCACCAACCGCTCTTTCTTATGT
TGTTGGCTGTGAATCCTTAAAGGAAATTTGGGACATACTTGTCAAACATTATTCTTCGAATTCAAGGACGAATATTGTGACGCTTAAAACTTATTTGGAATCAATTGCTA
AGAAGCCTACTGAATCAATCCATCAGTATATTAAATGTAAAAAATTCAAGGACAAGTTGGCTAACATATCTGTCACAATTGATGATGAAGATCTGGTTATCTACACATTA
AATGGTTTGCCTGCTGAGTACAACACTTTTCGGACTTCTATGCACACCCGCGCTTCTCCGATTCAGTTTGAAGAACTTTATGTTCTTTTTGTTGCGAGGAATTTGTCATT
GATAAACAAGCTAGGAGCGATGAATCCTTCTCTTCACCTACTTCACTTCTTACCTCACGAAATTATTCTTGGAATCAAAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGGAAAGGCAAGTTCGAGGCATGCAAGAGATTGGCAGCGGAGCTTCTCGACACAACGTGACCTCAACGGGGGCAGTATTGATCAATCGGTCAAGAAGGAGCATTT
CCCGAGCAGGTTTCAAGTCGAAGTGGAGAACTTGGCATTACTCAAACGCCCTGGCCGTAAGAGACAGCCCCCACACAAGTGCGACCGGATTCGGTACCTTTGCTTCCACA
AGAATCATAGCCACAACACCACGGATTGCAACAACTTAAAGCAACAAGTCCAATCCCTTAGATATGTAGTTAAGCCCTATGCAATTTTGGCGTCTTCTTCGCCAACAGTT
GGATCTGCAGTCGTCCAATCCAATCCAGCCGATGAAGATTGGATCGCCAAGGATCATGCTTTCATGACCTTGATTAATGCTACTCTATCACCAACCGCTCTTTCTTATGT
TGTTGGCTGTGAATCCTTAAAGGAAATTTGGGACATACTTGTCAAACATTATTCTTCGAATTCAAGGACGAATATTGTGACGCTTAAAACTTATTTGGAATCAATTGCTA
AGAAGCCTACTGAATCAATCCATCAGTATATTAAATGTAAAAAATTCAAGGACAAGTTGGCTAACATATCTGTCACAATTGATGATGAAGATCTGGTTATCTACACATTA
AATGGTTTGCCTGCTGAGTACAACACTTTTCGGACTTCTATGCACACCCGCGCTTCTCCGATTCAGTTTGAAGAACTTTATGTTCTTTTTGTTGCGAGGAATTTGTCATT
GATAAACAAGCTAGGAGCGATGAATCCTTCTCTTCACCTACTTCACTTCTTACCTCACGAAATTATTCTTGGAATCAAAATGTAA
Protein sequenceShow/hide protein sequence
MGGKASSRHARDWQRSFSTQRDLNGGSIDQSVKKEHFPSRFQVEVENLALLKRPGRKRQPPHKCDRIRYLCFHKNHSHNTTDCNNLKQQVQSLRYVVKPYAILASSSPTV
GSAVVQSNPADEDWIAKDHAFMTLINATLSPTALSYVVGCESLKEIWDILVKHYSSNSRTNIVTLKTYLESIAKKPTESIHQYIKCKKFKDKLANISVTIDDEDLVIYTL
NGLPAEYNTFRTSMHTRASPIQFEELYVLFVARNLSLINKLGAMNPSLHLLHFLPHEIILGIKM