; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:12941554..12942075
RNA-Seq ExpressionMoc08g16940
SyntenyMoc08g16940
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8542446.1 hypothetical protein F0562_023418 [Nyssa sinensis]1.7e-5056.82Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY
        M+IALS+ NKLGF +GS+ +P GT   LI SW RNN ++I+ ILN VSK ISAS+IF+ SA  IWLDL++ FQ++NGPRIF+LK++L  + Q+Q SVS+Y
Subjt:  MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY

Query:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        FT+LK++W+E   YRP  SCGKCSC G +++ +  Q EY+MSFL+GL++SF+  R Q+LL+DP P IN+ FSL+ Q
Subjt:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]6.6e-5057.8Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR
        ++IAL++ NK+GF +GS+S+PT   + SW   N V+I+ I N +SK ISAS++FSDSAH IWLDLKE FQR+N PRIF+L+++L+ +TQDQ SV+ YFTR
Subjt:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR

Query:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        LK++W E   YRPA SCG+CS  G +SIE   Q EY+M+FL+GLN SF+  RAQ+LL++P P+IN+AF+LV+Q
Subjt:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia]6.6e-5865.32Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR
        M IALSI NKLGF NGSL KP G L+  W RN  V+I   LN VSK ISASLIF++S H IWLDLK+ FQ +NGP+IF+L++DLAT+TQDQ SV+MY+T+
Subjt:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR

Query:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        LK++WDEY++YRP  +CG CSC G + +E+FVQ+E+LM FL+GLNESF   RAQILL+DP PSI KAFSL+SQ
Subjt:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

XP_022856063.1 uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]1.0e-5056.57Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGT--LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYF
        M+I L + NK+GF +GS++KP  +   +++W RNN ++I+ ILN VSK ISAS+I+S+SAH IW+DLKE FQ++NGPRIF+L+++L  +TQ Q SV +YF
Subjt:  MIIALSINNKLGFTNGSLSKPTGT--LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYF

Query:  TRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        T+LK++W+E   YRP  SCGKCSC  N+ + E  Q EY+MSFL+GLN++F   R Q+LL+DP PSINK FSLVSQ
Subjt:  TRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida]1.8e-5257.23Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR
        M++ L I NKLGF +GSL +PTG L+  W  NN V+++ IL  VSKSIS+S++F++SA AIWLDL++ FQR+NGPRIF LK++L+++ QDQ SV+MYFT+
Subjt:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR

Query:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        +KS  DEY++YRP  +CG+C+C G +S+E+F+Q+EYL+ F +GLN+SF  TR+Q+LL+DP P +NKAFS V Q
Subjt:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

TrEMBL top hitse value%identityAlignment
A0A5J5A1K4 Retrotrans_gag domain-containing protein1.3e-4855.11Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY
        M+IAL + NKLGF +GS+ +P GT   L  SW RNN ++I+ ILN VSK ISAS+IF+ SA  IWLDL++ FQ++NGPRIF+LK++L  + Q+Q SVS+Y
Subjt:  MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY

Query:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        FT+LK++W+E    RP  SCGKCSC G +++ +  Q EY+MSFL+GL++SF+  R Q+LL+DP P IN+ FSL+ Q
Subjt:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

A0A5J5BIH5 Uncharacterized protein8.4e-5156.82Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY
        M+IALS+ NKLGF +GS+ +P GT   LI SW RNN ++I+ ILN VSK ISAS+IF+ SA  IWLDL++ FQ++NGPRIF+LK++L  + Q+Q SVS+Y
Subjt:  MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY

Query:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        FT+LK++W+E   YRP  SCGKCSC G +++ +  Q EY+MSFL+GL++SF+  R Q+LL+DP P IN+ FSL+ Q
Subjt:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

A0A6J1DLQ9 uncharacterized protein LOC1110221173.2e-5865.32Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR
        M IALSI NKLGF NGSL KP G L+  W RN  V+I   LN VSK ISASLIF++S H IWLDLK+ FQ +NGP+IF+L++DLAT+TQDQ SV+MY+T+
Subjt:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR

Query:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        LK++WDEY++YRP  +CG CSC G + +E+FVQ+E+LM FL+GLNESF   RAQILL+DP PSI KAFSL+SQ
Subjt:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

A0A6J1DNP7 uncharacterized protein LOC1110220653.2e-5057.8Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR
        ++IAL++ NK+GF +GS+S+PT   + SW   N V+I+ I N +SK ISAS++FSDSAH IWLDLKE FQR+N PRIF+L+++L+ +TQDQ SV+ YFTR
Subjt:  MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTR

Query:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        LK++W E   YRPA SCG+CS  G +SIE   Q EY+M+FL+GLN SF+  RAQ+LL++P P+IN+AF+LV+Q
Subjt:  LKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein3.5e-4957.39Show/hide
Query:  MIIALSINNKLGFTNGSLSKPTG---TLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY
        MIIALS+ NKLGF +GS++KP G    L+ SW RNN V+I+ ILN VSK ISAS+IFS SA+ IW+DLK+ FQ+ NGPRIF+L+++L    QDQ  VS+Y
Subjt:  MIIALSINNKLGFTNGSLSKPTG---TLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMY

Query:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
        FT+LK++W+E   YRPA SCG C+C G + +    Q EY+MSFL+ L+ SF   R Q+LL+DP P INK FSL+SQ
Subjt:  FTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.4e-2132.37Show/hide
Query:  LSINNKLGFTNGSLSKPT--GTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLK
        L +  K GF +G+L KP     L   W + N +++  ++N ++  +  S++++++AH +W DL+  F      +I++L++ LAT+ Q   SV  YF +L 
Subjt:  LSINNKLGFTNGSLSKPT--GTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLK

Query:  SVWDEYMTYR--PAWSCGKCSCEGNQSIEEFVQYEYLMSFLIG--LNESFTSTRAQILLIDPTPSINKAFSLV
         VW E   Y   P   CG C+CE  +  EE  + E    FL+G  LN+ F +   +I+   P PS+++AF++V
Subjt:  SVWDEYMTYR--PAWSCGKCSCEGNQSIEEFVQYEYLMSFLIG--LNESFTSTRAQILLIDPTPSINKAFSLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTATTGCGCTTTCCATTAATAATAAGCTTGGATTTACCAATGGATCTTTATCGAAGCCTACTGGTACTCTTATTGCTTCTTGGACTCGTAATAATTGTGTTTTAAT
TACCTGTATTTTGAACTTTGTTTCAAAATCAATTTCTGCTAGCCTCATCTTCTCCGATTCGGCACACGCTATTTGGCTTGATCTGAAGGAGGGATTTCAGCGTAAGAATG
GCCCTAGAATTTTTAAACTTAAGCAAGATTTGGCAACGATAACGCAAGATCAACAATCTGTTTCCATGTATTTTACTCGGCTTAAAAGTGTTTGGGATGAATACATGACT
TATCGACCTGCTTGGTCATGTGGCAAATGCTCTTGTGAAGGAAATCAATCTATTGAAGAATTTGTTCAATATGAATATCTCATGAGTTTTCTCATAGGTTTAAATGAGTC
TTTCACTTCTACCAGGGCTCAAATTTTGTTGATTGATCCGACTCCTAGCATCAACAAGGCTTTTTCTCTCGTATCTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTATTGCGCTTTCCATTAATAATAAGCTTGGATTTACCAATGGATCTTTATCGAAGCCTACTGGTACTCTTATTGCTTCTTGGACTCGTAATAATTGTGTTTTAAT
TACCTGTATTTTGAACTTTGTTTCAAAATCAATTTCTGCTAGCCTCATCTTCTCCGATTCGGCACACGCTATTTGGCTTGATCTGAAGGAGGGATTTCAGCGTAAGAATG
GCCCTAGAATTTTTAAACTTAAGCAAGATTTGGCAACGATAACGCAAGATCAACAATCTGTTTCCATGTATTTTACTCGGCTTAAAAGTGTTTGGGATGAATACATGACT
TATCGACCTGCTTGGTCATGTGGCAAATGCTCTTGTGAAGGAAATCAATCTATTGAAGAATTTGTTCAATATGAATATCTCATGAGTTTTCTCATAGGTTTAAATGAGTC
TTTCACTTCTACCAGGGCTCAAATTTTGTTGATTGATCCGACTCCTAGCATCAACAAGGCTTTTTCTCTCGTATCTCAGTAG
Protein sequenceShow/hide protein sequence
MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMT
YRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ