; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g16550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g16550
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr2:12440158..12444297
RNA-Seq ExpressionMoc02g16550
SyntenyMoc02g16550
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151602.1 uncharacterized protein LOC111019514 [Momordica charantia]2.1e-2582.89Show/hide
Query:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR
        DRLPGNLP TTEVNPV+HCK ITL SGKELVEPEQNKKPTK VQQTEI P+ S KIVED  TP L SSLAIPFPQR
Subjt:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR

XP_022151637.1 uncharacterized protein LOC111019545 [Momordica charantia]1.6e-2075Show/hide
Query:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR
        DRLPGNLP TT VNPV++ KAIT  SGKELV+PEQNKKPTK+VQQTEI  + S KIVED  TP LASS  IPFPQR
Subjt:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR

XP_024022201.1 uncharacterized protein LOC112091842 [Morus notabilis]1.2e-1740.56Show/hide
Query:  GGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQSSQSAVGTIILTSHGGNI-RGTSWS-ACQHD---------
        GGA M KTE  AY+LLE+M T NYQW SERS  K+  G++E+DAI  LT Q+ASL+KQLQ +Q     I ++S       G   S  CQ D         
Subjt:  GGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQSSQSAVGTIILTSHGGNI-RGTSWS-ACQHD---------

Query:  ----------------GDRLPGNLPRTTEVNP----VQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVED
                         +R  GNLP T+EVNP     ++CKA+TL SGKEL  P + +KP K+ Q+  +   S +    D
Subjt:  ----------------GDRLPGNLPRTTEVNP----VQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVED

XP_030497826.1 LOW QUALITY PROTEIN: uncharacterized protein LOC115713483 [Cannabis sativa]1.2e-1739.06Show/hide
Query:  RTTIDAVSGGAFMGKTESGAYDLLEEMTF-NYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQ------SSQSAVGTIILTSHGGNIRGTSWSAC
        RT IDA +GGAFM K+ + A+DLLEEM   N QW +ER   K+  G++EVDAI  LT Q      Q+Q      S  S + T +L       R +     
Subjt:  RTTIDAVSGGAFMGKTESGAYDLLEEMTF-NYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQ------SSQSAVGTIILTSHGGNIRGTSWSAC

Query:  QHDGD-------RLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIP---KSSIKIVEDITTPHLA--SSLAIPFPQR
           G        R  GNLP TTEVNP ++CKAITL SGK    P Q K    E +Q    P   K++  + +  T+P ++    + IP+PQR
Subjt:  QHDGD-------RLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIP---KSSIKIVEDITTPHLA--SSLAIPFPQR

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]5.5e-1829.66Show/hide
Query:  IANIGSRFDDGIRTTIDAVSGGAFMGKTESGAYDLLEEMTF-NYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQSSQSAVGTIILT---SHGG-
        + N  +  +   RT IDA SGGAFM K++  AY+LLEEM   NYQW +ERS QK+ +GV+E+DAI ALT QV SLT+QL+++Q +   I  T    HG  
Subjt:  IANIGSRFDDGIRTTIDAVSGGAFMGKTESGAYDLLEEMTF-NYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQSSQSAVGTIILT---SHGG-

Query:  ---------------------------------------NIRGTSW------------SACQHDGD----------------------------------
                                               N    SW            S+ +H  +                                  
Subjt:  ---------------------------------------NIRGTSW------------SACQHDGD----------------------------------

Query:  ---------------RLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR
                       R  GNLP TTE+NP + CKAITL SGKE+ +   NK   ++ ++  + P  ++K  + +  P       IPFPQR
Subjt:  ---------------RLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR

TrEMBL top hitse value%identityAlignment
A0A061E3H0 RT_RNaseH domain-containing protein3.5e-1039.13Show/hide
Query:  RTTIDAVSGGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQS-SQSAVGTIILTS--HGGNIRGTS---WSAC
        RTTIDA +GGA M K+    YDLL+EM + NYQW SER + ++   V+ +D +  L+ Q+A LTK++     + V    +T   HGG+          +C
Subjt:  RTTIDAVSGGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQS-SQSAVGTIILTS--HGGNIRGTS---WSAC

Query:  QHDGDRLPGNLPRTTEVNP----VQHCKAITLWSGKEL
        Q  G+R    LP  TE NP     +H KAI L  GK++
Subjt:  QHDGDRLPGNLPRTTEVNP----VQHCKAITLWSGKEL

A0A3S3N117 Retrotrans_gag domain-containing protein2.7e-1056.52Show/hide
Query:  RTTIDAVSGGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQS
        RT+IDA +GG  M K+   AY+L+EEM T NYQW S+   QK+  GV+E+D+I+ALT QVA+L+KQ+QS
Subjt:  RTTIDAVSGGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQS

A0A438HF95 Retrotrans_gag domain-containing protein1.0e-0934.55Show/hide
Query:  RTTIDAVSGGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQSSQSAVGTIILTSHGGNIRGTSWSACQHD---
        +T +DA SG AF+ KT    Y L+E M + N+   ++++AQKR  GV ++DA   L  QV  L   ++         +  + G +IR       +     
Subjt:  RTTIDAVSGGAFMGKTESGAYDLLEEM-TFNYQWHSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQSSQSAVGTIILTSHGGNIRGTSWSACQHD---

Query:  GDRLPGNLPRTTEVNPVQHCKAITLWSGKEL-----VEPEQNKK----PTKEVQQTEIIPKSSIK
         +R  G LP  TE NP +H KAITL SGKEL      E + NKK    P ++   T I+P   +K
Subjt:  GDRLPGNLPRTTEVNPVQHCKAITLWSGKEL-----VEPEQNKK----PTKEVQQTEIIPKSSIK

A0A6J1DDY6 uncharacterized protein LOC1110195141.0e-2582.89Show/hide
Query:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR
        DRLPGNLP TTEVNPV+HCK ITL SGKELVEPEQNKKPTK VQQTEI P+ S KIVED  TP L SSLAIPFPQR
Subjt:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR

A0A6J1DE18 uncharacterized protein LOC1110195457.5e-2175Show/hide
Query:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR
        DRLPGNLP TT VNPV++ KAIT  SGKELV+PEQNKKPTK+VQQTEI  + S KIVED  TP LASS  IPFPQR
Subjt:  DRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTKEVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAGACACTCCTGGATCGCCTCATGGCTAATGACAACGCTCTTCTGCTTAGCGCTGCACAATGGAAGATCTGCTGGAGCACACCTCCAGTTTGGAAGAGG
CGGAGGCCCAAGAGGATAGTGGCTCTGAAGAAGAAGAGAGCAAAAGCAATTCAATATAGCCTGTGTTCTTATGCTATAGCGAATATTGGCAGTAGGTTTGATGAT
GGAATAAGAACTACAATTGATGCAGTATCTGGAGGGGCTTTCATGGGTAAAACTGAAAGTGGAGCATATGATTTGTTGGAGGAAATGACATTCAACTACCAGTGG
CATAGTGAGAGGTCAGCTCAGAAAAGGCCGATGGGAGTAAATGAGGTGGATGCTATTGCTGCATTGACCGTGCAGGTTGCTTCGCTTACCAAGCAACTTCAATCA
AGTCAGTCTGCGGTTGGCACAATCATCCTAACTTCTCATGGAGGCAACATCAGAGGTACAAGTTGGTCAGCTTGCCAACATGATGGTGATAGGCTTCCAGGTAAT
TTGCCGAGGACAACAGAAGTCAATCCAGTGCAACACTGTAAGGCAATTACCTTGTGGAGTGGTAAGGAGTTGGTTGAACCAGAACAAAATAAAAAGCCTACAAAA
GAAGTTCAACAAACAGAGATTATACCAAAATCATCCATTAAGATAGTTGAGGACATTACGACACCACATTTAGCCAGTTCGTTAGCCATTCCATTTCCTCAACGC
TTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACAAGACACTCCTGGATCGCCTCATGGCTAATGACAACGCTCTTCTGCTTAGCGCTGCACAATGGAAGATCTGCTGGAGCACACCTCCAGTTTGGAAGAGG
CGGAGGCCCAAGAGGATAGTGGCTCTGAAGAAGAAGAGAGCAAAAGCAATTCAATATAGCCTGTGTTCTTATGCTATAGCGAATATTGGCAGTAGGTTTGATGAT
GGAATAAGAACTACAATTGATGCAGTATCTGGAGGGGCTTTCATGGGTAAAACTGAAAGTGGAGCATATGATTTGTTGGAGGAAATGACATTCAACTACCAGTGG
CATAGTGAGAGGTCAGCTCAGAAAAGGCCGATGGGAGTAAATGAGGTGGATGCTATTGCTGCATTGACCGTGCAGGTTGCTTCGCTTACCAAGCAACTTCAATCA
AGTCAGTCTGCGGTTGGCACAATCATCCTAACTTCTCATGGAGGCAACATCAGAGGTACAAGTTGGTCAGCTTGCCAACATGATGGTGATAGGCTTCCAGGTAAT
TTGCCGAGGACAACAGAAGTCAATCCAGTGCAACACTGTAAGGCAATTACCTTGTGGAGTGGTAAGGAGTTGGTTGAACCAGAACAAAATAAAAAGCCTACAAAA
GAAGTTCAACAAACAGAGATTATACCAAAATCATCCATTAAGATAGTTGAGGACATTACGACACCACATTTAGCCAGTTCGTTAGCCATTCCATTTCCTCAACGC
TTCTAG
Protein sequenceShow/hide protein sequence
MDKTLLDRLMANDNALLLSAAQWKICWSTPPVWKRRRPKRIVALKKKRAKAIQYSLCSYAIANIGSRFDDGIRTTIDAVSGGAFMGKTESGAYDLLEEMTFNYQW
HSERSAQKRPMGVNEVDAIAALTVQVASLTKQLQSSQSAVGTIILTSHGGNIRGTSWSACQHDGDRLPGNLPRTTEVNPVQHCKAITLWSGKELVEPEQNKKPTK
EVQQTEIIPKSSIKIVEDITTPHLASSLAIPFPQRF