; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g10380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g10380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:7708399..7709016
RNA-Seq ExpressionMoc08g10380
SyntenyMoc08g10380
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0030247 - polysaccharide binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON99369.1 hypothetical protein TorRG33x02_049120 [Trema orientale]5.0e-4968.03Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDP +KL  + GD+L D + Y+C IG L+YLT+S  DITF VH LSQF+S+PR PHL AA+HLLRYLK   GQGLF S+ SS Q++AF DADW SC D+R
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRL
        KS TGFCIFLG+SLVSWKAKKQ+T+SRSSA+AEY ALAATTS+I+ L
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRL

XP_022132680.1 uncharacterized protein LOC111005480 [Momordica charantia]1.5e-4869.8Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDP +KL +S G +L+DPT Y+ +IG LIYLTISR DITFVVH LSQ+++     HL AA+HLL+YLKG  GQG+FL S +SF +RAF D D ASCLDSR
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLTY
        KSTTGFCIFLG+SLVSWKAKKQTTVSRSSA+AEY ALAATTS+I+ +T+
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLTY

XP_022153428.1 uncharacterized protein LOC111020937 [Momordica charantia]1.6e-79100Show/hide
Query:  QFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR
        QFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR
Subjt:  QFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR

Query:  LTYCFVIYIFSFNHQSCFSVIIMLLFTLLRTRLFMNVPSTLSLIVILFVKRSLKASSSF
        LTYCFVIYIFSFNHQSCFSVIIMLLFTLLRTRLFMNVPSTLSLIVILFVKRSLKASSSF
Subjt:  LTYCFVIYIFSFNHQSCFSVIIMLLFTLLRTRLFMNVPSTLSLIVILFVKRSLKASSSF

XP_022155863.1 uncharacterized protein LOC111022877 isoform X4 [Momordica charantia]7.3e-4867.12Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDPNIKL AS G +L+DP++Y+ +IG L+YLTISR DITF VH LSQF++ P   HL A + L+RYLKGC GQ + L+   SFQ+RAF DADW SC DSR
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR
        KSTTGFCIFLG+SLVSWKAKKQ+T+ RSS +AEY ALAAT S+I R
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR

XP_022899321.1 uncharacterized protein LOC111412620 [Olea europaea var. sylvestris]1.9e-4866.22Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDP +KL +  GD++ D +MY+ + G L+YLTISR DITF VH LSQFVSQPR PHLDAA+HLL+Y+K   GQG+  S+ SS Q+RAF DADW SCLD+R
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLT
        KS  GFC+FLG+SL+SWKAKKQTTVSRSSA+AEY ALA+T S++  LT
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLT

TrEMBL top hitse value%identityAlignment
A0A2P5FNK4 Uncharacterized protein2.4e-4968.03Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDP +KL  + GD+L D + Y+C IG L+YLT+S  DITF VH LSQF+S+PR PHL AA+HLLRYLK   GQGLF S+ SS Q++AF DADW SC D+R
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRL
        KS TGFCIFLG+SLVSWKAKKQ+T+SRSSA+AEY ALAATTS+I+ L
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRL

A0A6J1BT54 uncharacterized protein LOC1110054807.1e-4969.8Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDP +KL +S G +L+DPT Y+ +IG LIYLTISR DITFVVH LSQ+++     HL AA+HLL+YLKG  GQG+FL S +SF +RAF D D ASCLDSR
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLTY
        KSTTGFCIFLG+SLVSWKAKKQTTVSRSSA+AEY ALAATTS+I+ +T+
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLTY

A0A6J1DHF4 uncharacterized protein LOC1110209377.8e-80100Show/hide
Query:  QFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR
        QFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR
Subjt:  QFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR

Query:  LTYCFVIYIFSFNHQSCFSVIIMLLFTLLRTRLFMNVPSTLSLIVILFVKRSLKASSSF
        LTYCFVIYIFSFNHQSCFSVIIMLLFTLLRTRLFMNVPSTLSLIVILFVKRSLKASSSF
Subjt:  LTYCFVIYIFSFNHQSCFSVIIMLLFTLLRTRLFMNVPSTLSLIVILFVKRSLKASSSF

A0A6J1DP23 uncharacterized protein LOC111022877 isoform X43.5e-4867.12Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDPNIKL AS G +L+DP++Y+ +IG L+YLTISR DITF VH LSQF++ P   HL A + L+RYLKGC GQ + L+   SFQ+RAF DADW SC DSR
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR
        KSTTGFCIFLG+SLVSWKAKKQ+T+ RSS +AEY ALAAT S+I R
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR

A0A6J1DQI6 uncharacterized protein LOC111022877 isoform X33.5e-4867.12Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDPNIKL AS G +L+DP++Y+ +IG L+YLTISR DITF VH LSQF++ P   HL A + L+RYLKGC GQ + L+   SFQ+RAF DADW SC DSR
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR
        KSTTGFCIFLG+SLVSWKAKKQ+T+ RSS +AEY ALAAT S+I R
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIR

SwissProt top hitse value%identityAlignment
P0CV72 Secreted RxLR effector protein 1616.9e-1737.98Show/hide
Query:  YQCIIGCLIYL-TISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKA
        Y   +G ++YL  ++R D+   V  LSQF S P   H  A   +LRYL+     GL  +   + ++  + DADWA  ++SR+ST+G+   L    VSW++
Subjt:  YQCIIGCLIYL-TISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVSWKA

Query:  KKQTTVSRSSADAEYHALAATTSKIIRLT
        KKQ TV+ SS + EY AL+  T + + LT
Subjt:  KKQTTVSRSSADAEYHALAATTSKIIRLT

P92519 Uncharacterized mitochondrial protein AtMg008102.3e-2845.31Show/hide
Query:  DPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVS
        DP+ ++ I+G L YLT++R DI++ V+ + Q + +P     D    +LRY+KG    GL++   S   ++AFCD+DWA C  +R+STTGFC FLG +++S
Subjt:  DPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVS

Query:  WKAKKQTTVSRSSADAEYHALAATTSKI
        W AK+Q TVSRSS + EY ALA T +++
Subjt:  WKAKKQTTVSRSSADAEYHALAATTSKI

P93290 Uncharacterized mitochondrial protein AtMg002407.4e-1955.7Show/hide
Query:  IYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFC
        +YLTI+R D+TF V+ LSQF S  R   + A Y +L Y+KG  GQGLF S+ S  Q++AF D+DWASC D+R+S TGFC
Subjt:  IYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFC

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.1e-2641.82Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        M P+ KL   +G  L DPT Y+ I+G L YL  +R DI++ V+ LSQF+  P   HL A   +LRYL G    G+FL   ++  + A+ DADWA   D  
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKI-----------IRLTYCFVIY
         ST G+ ++LG   +SW +KKQ  V RSS +AEY ++A T+S++           IRLT   VIY
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKI-----------IRLTYCFVIY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-2441.01Show/hide
Query:  KLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTG
        KL   +G  L DPT Y+ I+G L YL  +R D+++ V+ LSQ++  P   H +A   +LRYL G    G+FL   ++  + A+ DADWA   D   ST G
Subjt:  KLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTG

Query:  FCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKI
        + ++LG   +SW +KKQ  V RSS +AEY ++A T+S++
Subjt:  FCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-3448.34Show/hide
Query:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR
        MDP++   A +G    D   Y+ +IG L+YL I+R DI+F V+ LSQF   PR  H  A   +L Y+KG  GQGLF SS +  Q++ F DA + SC D+R
Subjt:  MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSR

Query:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLTYCF
        +ST G+C+FLG SL+SWK+KKQ  VS+SSA+AEY AL+  T +++ L   F
Subjt:  KSTTGFCIFLGESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLTYCF

ATMG00240.1 Gag-Pol-related retrotransposon family protein5.2e-2055.7Show/hide
Query:  IYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFC
        +YLTI+R D+TF V+ LSQF S  R   + A Y +L Y+KG  GQGLF S+ S  Q++AF D+DWASC D+R+S TGFC
Subjt:  IYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFC

ATMG00810.1 DNA/RNA polymerases superfamily protein1.6e-2945.31Show/hide
Query:  DPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVS
        DP+ ++ I+G L YLT++R DI++ V+ + Q + +P     D    +LRY+KG    GL++   S   ++AFCD+DWA C  +R+STTGFC FLG +++S
Subjt:  DPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFLGESLVS

Query:  WKAKKQTTVSRSSADAEYHALAATTSKI
        W AK+Q TVSRSS + EY ALA T +++
Subjt:  WKAKKQTTVSRSSADAEYHALAATTSKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCTAACATTAAGCTTCTTGCTTCTGCTGGTGATATCTTAGAGGATCCTACTATGTATCAGTGTATTATTGGGTGTCTCATTTACCTCACTATTTCTCGGTCTGA
CATTACCTTTGTTGTACATTACCTCAGTCAATTTGTATCACAGCCTCGTTGCCCACATCTTGATGCTGCTTATCATCTTCTGCGCTACTTGAAAGGTTGTTCTGGGCAAG
GTTTATTTCTTTCCAGTTTGTCATCTTTTCAAATTCGGGCTTTCTGTGATGCTGATTGGGCATCTTGCTTGGATTCGAGAAAGTCTACAACCGGGTTTTGCATCTTCCTT
GGTGAGTCTCTTGTGTCTTGGAAGGCCAAAAAGCAAACCACGGTCTCTCGCTCCTCTGCCGATGCTGAATATCATGCGCTCGCTGCTACAACGAGCAAAATCATAAGGCT
CACTTATTGCTTCGTGATTTACATATTCAGTTTCAACCACCAGAGCTGCTTTTCTGTGATAATAATGCTACTATTCACATTGCTTCGAACCCGTCTTTTCATGAACGTAC
CAAGCACATTGAGCTTGATTGTCATTTTGTTCGTGAAAAGATCGTTGAAGGCATCGTCAAGCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCTAACATTAAGCTTCTTGCTTCTGCTGGTGATATCTTAGAGGATCCTACTATGTATCAGTGTATTATTGGGTGTCTCATTTACCTCACTATTTCTCGGTCTGA
CATTACCTTTGTTGTACATTACCTCAGTCAATTTGTATCACAGCCTCGTTGCCCACATCTTGATGCTGCTTATCATCTTCTGCGCTACTTGAAAGGTTGTTCTGGGCAAG
GTTTATTTCTTTCCAGTTTGTCATCTTTTCAAATTCGGGCTTTCTGTGATGCTGATTGGGCATCTTGCTTGGATTCGAGAAAGTCTACAACCGGGTTTTGCATCTTCCTT
GGTGAGTCTCTTGTGTCTTGGAAGGCCAAAAAGCAAACCACGGTCTCTCGCTCCTCTGCCGATGCTGAATATCATGCGCTCGCTGCTACAACGAGCAAAATCATAAGGCT
CACTTATTGCTTCGTGATTTACATATTCAGTTTCAACCACCAGAGCTGCTTTTCTGTGATAATAATGCTACTATTCACATTGCTTCGAACCCGTCTTTTCATGAACGTAC
CAAGCACATTGAGCTTGATTGTCATTTTGTTCGTGAAAAGATCGTTGAAGGCATCGTCAAGCTTCTGA
Protein sequenceShow/hide protein sequence
MDPNIKLLASAGDILEDPTMYQCIIGCLIYLTISRSDITFVVHYLSQFVSQPRCPHLDAAYHLLRYLKGCSGQGLFLSSLSSFQIRAFCDADWASCLDSRKSTTGFCIFL
GESLVSWKAKKQTTVSRSSADAEYHALAATTSKIIRLTYCFVIYIFSFNHQSCFSVIIMLLFTLLRTRLFMNVPSTLSLIVILFVKRSLKASSSF