CuGenDBv2

Moc04g15940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g15940
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retroelement pol polyprotein-like
Genome location: chr4:11918308..11919137
RNA-Seq Expression: Moc04g15940
Synteny: Moc04g15940
Gene Ontology terms: NA
InterPro domains: NA
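The genome location above can be split into its parts with a short script. This is a minimal sketch, not part of the database: `parse_location` is a hypothetical helper, and the span length assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr4:11918308..11919137")
print(chrom, start, end, span)
```

Under that assumption the gene spans 830 bases on chr4.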


Homology
GenBank top hits | e value | % identity
XP_030478190.1 uncharacterized protein LOC115695250 [Cannabis sativa] | 7.2e-10 | 42.06
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSITK---REGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS
        +E LMR+YMAKND++IQSQA +LRNLE+++ Q+ N+LK+  QGTLPS TK   R+GKE   A TLR  K L     +   K+    QKE   +K     +
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSITK---REGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS

Query:  ENIPVNESRVNDAAVSSEKVAQQEPV
          IP       D   S +  A ++P+
Subjt:  ENIPVNESRVNDAAVSSEKVAQQEPV

XP_030495102.1 uncharacterized protein LOC115710889 [Cannabis sativa] | 1.2e-09 | 42.4
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS
        +E+LMR+YM KND++IQSQA +LRNLEV++ Q+ N+LK+  QGTLPS T   +R+GKE   A TLR  K L     +   K+    QKE   +K    IS
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS

Query:  ENIPVNESRVNDAAVSSEKVAQQEP
         ++   +   +    S+EK  Q+ P
Subjt:  ENIPVNESRVNDAAVSSEKVAQQEP

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa] | 2.1e-09 | 59.15
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKAL
        +E+LMR+YMAKND++IQSQA +LRNLEV++ Q+ N+LK+  QGTLPS T   +R+GKE   A TLR  K L
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKAL

XP_030509134.1 uncharacterized protein LOC115723804 [Cannabis sativa] | 3.2e-10 | 37.02
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSITK---REGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS
        +E+LMR+YMAKND++IQSQA +LRNLEV++ Q+ N+LK+  QGTLPS TK   R+GKE   A TLR  K +     +   K+    QKE   +K  +  S
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSITK---REGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS

Query:  ENIPVNESRVNDAAVSSEKVAQQEPV-------AQQEMGRPFLSMDYVEECSL-LRLVDDLLSEEIQTKELLDQLSLLQQL
         N+   +S  +    S EK  Q+ P+        QQ+ G+    +D +++  + + LV+ L       K L D L+  ++L
Subjt:  ENIPVNESRVNDAAVSSEKVAQQEPV-------AQQEMGRPFLSMDYVEECSL-LRLVDDLLSEEIQTKELLDQLSLLQQL

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa] | 5.5e-10 | 46.15
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS
        +E+LMR+YMAKND++IQSQA +LRNLEV++ Q+ N+LK+  QGTLPS T   +R+GKE   A TLR  K +     +   K+    QKE   +K     +
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS

Query:  ENIP
          IP
Subjt:  ENIP

TrEMBL top hits | e value | % identity
A0A5B6VKQ8 Retrovirus-related Pol polyprotein from transposon 17.6 | 3.0e-06 | 54.17
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSITKR---EGKEQRLASTLRLCKALP
        MEAL++EYMAKND +IQSQAT+LR LE ++ Q+++ L S  QG LPS TK    +GKE   A T R    LP
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSITKR---EGKEQRLASTLRLCKALP

A0A6J1DWK1 uncharacterized protein LOC111025053 | 5.5e-08 | 41.75
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPS---ITKREGKEQRLASTLRLCKALPHPYPS----------VMQKDTEEQQKEL
        +E +M++YMA ND+ +QSQA +LRNLE+++ Q+  +LKS   G LPS   + KR+ KEQ  A TLR  KALP  +P+          ++Q + + +Q   
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPS---ITKREGKEQRLASTLRLCKALPHPYPS----------VMQKDTEEQQKEL

Query:  PAE
        PAE
Subjt:  PAE

A0A6J1DXK5 uncharacterized protein LOC111025500 | 1.8e-06 | 45.88
Query:  MREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPS---ITKREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKE
        M +YM  ND+ +QSQA +LRNLE+++ Q+  +LKS  +G LPS   + KR+GKEQ  A TLR  K LP  +P+      E  Q E
Subjt:  MREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPS---ITKREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKE

A0A6J1DYG0 uncharacterized protein LOC111025764 | 6.1e-07 | 39.64
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLP---SITKREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS
        +E +M+EYMA+ D++IQSQA ++RN E ++ Q+ NELK+  QG+ P    + KREGKEQ  A TLR    L +  P++   D      ++P+   T +I 
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLP---SITKREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS

Query:  ENIPVNESRVN
        EN P    + N
Subjt:  ENIPVNESRVN

A0A6J1H7K8 uncharacterized protein LOC111461167 | 2.3e-06 | 38.66
Query:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS
        +E+L++EYMAKND +IQSQ  +LRNLEV++ Q+ NEL++   G LPS T   KREG EQ  A  LR  K +      + +      Q+    +   +R  
Subjt:  MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSIT---KREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRIS

Query:  ENIPVNESRVNDAAVSSEK
        E +   E   NDA    +K
Subjt:  ENIPVNESRVNDAAVSSEK

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGAAGCATTGATGAGAGAGTATATGGCTAAAAATGATTCCATGATTCAAAGCCAGGCTACCACCCTAAGAAATCTGGAAGTGGAAATGAGACAAATGACCAATGAACT
GAAAAGCATATCACAAGGCACTTTGCCAAGTATTACCAAACGAGAAGGGAAAGAGCAACGTCTAGCATCCACTCTTCGACTTTGCAAAGCATTACCACACCCCTATCCTT
CTGTGATGCAAAAGGATACGGAGGAGCAGCAAAAGGAGCTTCCAGCAGAGAAGTTGACTAAGAGAATTTCAGAAAATATACCTGTTAATGAGAGCAGAGTGAATGATGCT
GCAGTATCATCGGAGAAAGTAGCTCAACAAGAGCCGGTTGCACAACAGGAAATGGGAAGGCCTTTCCTGTCCATGGATTATGTGGAGGAATGCTCTTTGTTAAGACTTGT
AGATGATCTACTGAGTGAAGAAATACAAACAAAAGAACTATTAGACCAGCTTAGTCTATTGCAGCAGCTGAGCTCCGGGACAAGGGTTTTAGCAGCAATTGAAAGAAGGC
AGAAGACCTACCTTAGCTATTCGAAGGAACAAGATATTGTGATCCGGAAGGCGATGATGAGGAATTTCTCTCACGCATTTCTCCCTTTCCCCAAATTTTCAGAAGACAGG
CCTGATGAAGATGAAGAAGTGGATGATGCTGGAGCATCTCAGCCTATGGGCGACTCCAGTGAAAAGGATGAATAA
mRNA sequence
ATGGAAGCATTGATGAGAGAGTATATGGCTAAAAATGATTCCATGATTCAAAGCCAGGCTACCACCCTAAGAAATCTGGAAGTGGAAATGAGACAAATGACCAATGAACT
GAAAAGCATATCACAAGGCACTTTGCCAAGTATTACCAAACGAGAAGGGAAAGAGCAACGTCTAGCATCCACTCTTCGACTTTGCAAAGCATTACCACACCCCTATCCTT
CTGTGATGCAAAAGGATACGGAGGAGCAGCAAAAGGAGCTTCCAGCAGAGAAGTTGACTAAGAGAATTTCAGAAAATATACCTGTTAATGAGAGCAGAGTGAATGATGCT
GCAGTATCATCGGAGAAAGTAGCTCAACAAGAGCCGGTTGCACAACAGGAAATGGGAAGGCCTTTCCTGTCCATGGATTATGTGGAGGAATGCTCTTTGTTAAGACTTGT
AGATGATCTACTGAGTGAAGAAATACAAACAAAAGAACTATTAGACCAGCTTAGTCTATTGCAGCAGCTGAGCTCCGGGACAAGGGTTTTAGCAGCAATTGAAAGAAGGC
AGAAGACCTACCTTAGCTATTCGAAGGAACAAGATATTGTGATCCGGAAGGCGATGATGAGGAATTTCTCTCACGCATTTCTCCCTTTCCCCAAATTTTCAGAAGACAGG
CCTGATGAAGATGAAGAAGTGGATGATGCTGGAGCATCTCAGCCTATGGGCGACTCCAGTGAAAAGGATGAATAA
Protein sequence
MEALMREYMAKNDSMIQSQATTLRNLEVEMRQMTNELKSISQGTLPSITKREGKEQRLASTLRLCKALPHPYPSVMQKDTEEQQKELPAEKLTKRISENIPVNESRVNDA
AVSSEKVAQQEPVAQQEMGRPFLSMDYVEECSLLRLVDDLLSEEIQTKELLDQLSLLQQLSSGTRVLAAIERRQKTYLSYSKEQDIVIRKAMMRNFSHAFLPFPKFSEDR
PDEDEEVDDAGASQPMGDSSEKDE
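
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, not the database's own pipeline. It assumes the standard genetic code (NCBI translation table 1) and reading frame 0. The `translate` function is a hypothetical helper. The two test fragments are the first 42 and the last 75 bases of the CDS as listed.

```python
# Standard genetic code (NCBI translation table 1), with codon positions
# enumerated in the order T, C, A, G. Assumed here, not stated by the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a multi-line sequence can be
    pasted in directly. Codons with unexpected characters become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 42 bases and last 75 bases of the CDS listed in this record.
cds_start = "ATGGAAGCATTGATGAGAGAGTATATGGCTAAAAATGATTCC"
cds_end = ("CCTGATGAAGATGAAGAAGTGGATGATGCTGGAGCATCTCAGCCTATGGGCGACTCC"
           "AGTGAAAAGGATGAATAA")

print(translate(cds_start))  # MEALMREYMAKNDS
print(translate(cds_end))    # PDEDEEVDDAGASQPMGDSSEKDE
```

Both fragments translate to the matching start and end of the protein sequence above. Passing the full CDS to `translate` allows the same comparison over the whole sequence.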