; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g11730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g11730
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr2:8460470..8480805
RNA-Seq ExpressionMoc02g11730
SyntenyMoc02g11730
Gene Ontology termsGO:0045087 - innate immune response (biological process)
GO:0050793 - regulation of developmental process (biological process)
InterPro domainsIPR044700 - PIP2/PIPL1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034017.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.7e-1953.7Show/hide
Query:  QPAFVQGRQITDAILMANEIVYHWRCTNTQSIMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGARPLSHFLFVLAMDYLSQLLDHAF
        Q AF++ RQITDAILMANE + +W+       +  K+F  +W  WI GCIS+      VNGKP+GRI A +G+RQG  PLS FLFV+AMDYLS+LL H  
Subjt:  QPAFVQGRQITDAILMANEIVYHWRCTNTQSIMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGARPLSHFLFVLAMDYLSQLLDHAF

Query:  RKGLIKDV
          G IK V
Subjt:  RKGLIKDV

KAG6595008.1 PAMP-induced secreted peptide 2, partial [Cucurbita argyrosperma subsp. sororia]1.0e-1867.44Show/hide
Query:  ILSLILVFSLLS-NGSLLGIMVEARPLN-MVGTAVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSPGVGHSQVS
        ++ LILV +LL+ NG L  +  EARPLN M   A+AEDFF GLSLGA KQSGPSPGGDGH F N DTLGGIK+SGPSPG GHS ++
Subjt:  ILSLILVFSLLS-NGSLLGIMVEARPLN-MVGTAVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSPGVGHSQVS

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.9e-1846.51Show/hide
Query:  QPAFVQGRQITDAILMANEIVYHWRCTNTQ---------------------SIMSHKSFNKNWLDWIRGCISSV------NGKPRGRINATKGIRQGARP
        Q AFV+ RQITDAILMANE V +W+    +                     +++  K+F   W  WIRGCIS+V      NG+P+GRI A +G+RQG  P
Subjt:  QPAFVQGRQITDAILMANEIVYHWRCTNTQ---------------------SIMSHKSFNKNWLDWIRGCISSV------NGKPRGRINATKGIRQGARP

Query:  LSHFLFVLAMDYLSQLLDHAFRKGLIKDV
        LS FLFV+AMDYLS+LL H    G IK V
Subjt:  LSHFLFVLAMDYLSQLLDHAFRKGLIKDV

TYK12857.1 hypothetical protein E5676_scaffold255G004490 [Cucumis melo var. makuwa]5.4e-2063.74Show/hide
Query:  LSLILVFSLLSNGSLLGIMVEARPLNMVGT----AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSPGVGHSQVSGQHH
        + LILV +LL NG L+ I+ EARPL ++G+    A+AEDFF GLSLGA KQSGPS GGDGH F N DT GG+KDSGP+PG GHS V+  HH
Subjt:  LSLILVFSLLSNGSLLGIMVEARPLNMVGT----AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSPGVGHSQVSGQHH

XP_008446987.1 PREDICTED: uncharacterized protein LOC103489530 [Cucumis melo]2.9e-1846.15Show/hide
Query:  KQPAFVQGRQITDAILMANEIVYHWRCTNTQS---------------------IMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGAR
        KQ AFV+ RQITDAILMANE V +W+    +                      ++  K+F   W  WI+GCIS+      VNG+P+GRI A +G+RQG  
Subjt:  KQPAFVQGRQITDAILMANEIVYHWRCTNTQS---------------------IMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGAR

Query:  PLSHFLFVLAMDYLSQLLDHAFRKGLIKDV
        PLS FLFV+AMDYLS+LL H    G IK V
Subjt:  PLSHFLFVLAMDYLSQLLDHAFRKGLIKDV

TrEMBL top hitse value%identityAlignment
A0A1S3BGD2 uncharacterized protein LOC1034895301.4e-1846.15Show/hide
Query:  KQPAFVQGRQITDAILMANEIVYHWRCTNTQS---------------------IMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGAR
        KQ AFV+ RQITDAILMANE V +W+    +                      ++  K+F   W  WI+GCIS+      VNG+P+GRI A +G+RQG  
Subjt:  KQPAFVQGRQITDAILMANEIVYHWRCTNTQS---------------------IMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGAR

Query:  PLSHFLFVLAMDYLSQLLDHAFRKGLIKDV
        PLS FLFV+AMDYLS+LL H    G IK V
Subjt:  PLSHFLFVLAMDYLSQLLDHAFRKGLIKDV

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein1.4e-1846.51Show/hide
Query:  QPAFVQGRQITDAILMANEIVYHWRCTNTQ---------------------SIMSHKSFNKNWLDWIRGCISSV------NGKPRGRINATKGIRQGARP
        Q AFV+ RQITDAILMANE V +W+    +                     +++  K+F   W  WIRGCIS+V      NG+P+GRI A +G+RQG  P
Subjt:  QPAFVQGRQITDAILMANEIVYHWRCTNTQ---------------------SIMSHKSFNKNWLDWIRGCISSV------NGKPRGRINATKGIRQGARP

Query:  LSHFLFVLAMDYLSQLLDHAFRKGLIKDV
        LS FLFV+AMDYLS+LL H    G IK V
Subjt:  LSHFLFVLAMDYLSQLLDHAFRKGLIKDV

A0A5D3BT10 LINE-1 retrotransposable element ORF2 protein3.7e-1953.7Show/hide
Query:  QPAFVQGRQITDAILMANEIVYHWRCTNTQSIMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGARPLSHFLFVLAMDYLSQLLDHAF
        Q AF++ RQITDAILMANE + +W+       +  K+F  +W  WI GCIS+      VNGKP+GRI A +G+RQG  PLS FLFV+AMDYLS+LL H  
Subjt:  QPAFVQGRQITDAILMANEIVYHWRCTNTQSIMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGARPLSHFLFVLAMDYLSQLLDHAF

Query:  RKGLIKDV
          G IK V
Subjt:  RKGLIKDV

A0A5D3C104 LINE-1 retrotransposable element ORF2 protein1.4e-1846.15Show/hide
Query:  KQPAFVQGRQITDAILMANEIVYHWRCTNTQS---------------------IMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGAR
        KQ AFV+ RQITDAILMANE V +W+    +                      ++  K+F   W  WI+GCIS+      VNG+P+GRI A +G+RQG  
Subjt:  KQPAFVQGRQITDAILMANEIVYHWRCTNTQS---------------------IMSHKSFNKNWLDWIRGCISS------VNGKPRGRINATKGIRQGAR

Query:  PLSHFLFVLAMDYLSQLLDHAFRKGLIKDV
        PLS FLFV+AMDYLS+LL H    G IK V
Subjt:  PLSHFLFVLAMDYLSQLLDHAFRKGLIKDV

A0A5D3CLN9 Uncharacterized protein2.6e-2063.74Show/hide
Query:  LSLILVFSLLSNGSLLGIMVEARPLNMVGT----AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSPGVGHSQVSGQHH
        + LILV +LL NG L+ I+ EARPL ++G+    A+AEDFF GLSLGA KQSGPS GGDGH F N DT GG+KDSGP+PG GHS V+  HH
Subjt:  LSLILVFSLLSNGSLLGIMVEARPLNMVGT----AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSPGVGHSQVSGQHH

SwissProt top hitse value%identityAlignment
F4JRC5 PAMP-induced secreted peptide 22.3e-0543.37Show/hide
Query:  ILSLILVFSLLSNGSLLGIMVEARPLNMVGT--AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSP-GVGH
        +LS IL F L+ +     ++VE+RPL +  T        F GLSLG+ K SGPSPG         DT   +K SGPSP G GH
Subjt:  ILSLILVFSLLSNGSLLGIMVEARPLNMVGT--AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSP-GVGH

Arabidopsis top hitse value%identityAlignment
AT2G23270.1 unknown protein4.2e-0746.43Show/hide
Query:  ILSLILVFSLLSNGSLLGIMVEARPLNMVGTA--VAEDFFAGLSLGASKQSGPSPGGDGHAFTN-VDTLGGIKDSGPS-PGVGH
        +L   L F L+S+      +VEARPL +      +   FF GLSLGA K+SGPS GG+GH F +  +TL   K SGPS  G GH
Subjt:  ILSLILVFSLLSNGSLLGIMVEARPLNMVGTA--VAEDFFAGLSLGASKQSGPSPGGDGHAFTN-VDTLGGIKDSGPS-PGVGH

AT4G37290.1 unknown protein1.6e-0643.37Show/hide
Query:  ILSLILVFSLLSNGSLLGIMVEARPLNMVGT--AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSP-GVGH
        +LS IL F L+ +     ++VE+RPL +  T        F GLSLG+ K SGPSPG         DT   +K SGPSP G GH
Subjt:  ILSLILVFSLLSNGSLLGIMVEARPLNMVGT--AVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSP-GVGH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGCAGAGTGCCAAAACTTATGCTTTAGAAGAAGATATGGAAATGGAGCATGCATTGGAGACCAAACGGGAAATGCACTTTGTTGCTCTCTTCAAACAG
CCTGCGTTCGTCCAAGGCAGACAGATCACAGATGCGATTCTAATGGCTAATGAAATTGTCTACCACTGGCGCTGCACCAATACACAAAGTATCATGTCTCACAAG
AGCTTCAACAAAAACTGGCTTGATTGGATACGTGGCTGTATCTCTTCGGTGAATGGCAAACCAAGGGGGAGAATAAATGCAACAAAAGGTATACGTCAAGGAGCT
AGACCGTTGTCTCATTTTCTCTTTGTTCTGGCAATGGATTATCTTAGTCAGCTCCTTGACCATGCTTTCAGAAAAGGCCTCATCAAAGATGTGCAAGAGAAGGAG
GGACATTTAATTCCATTAGTTTGTGATTCAGTTTGCTCAAATGATAGTGCAGCTGAAGATCAGCAGTGCAAAAATTCATGCTTTCGAAGAAGATTTGGACATTGG
AGTCGCCAGGTCGTCTTGGTGGTTATGGGGGTTTTGGAAGTAGTAGTTAGGAGGCTACTGAGGTTGAATCATCCATTGGTTATTCTAGTCGTGGACCTTACAGTG
ATGGGTAGCGCAATATTTCCATGGATTGTAATACTACAAGCCTCAGAGTCTTTCATATGTGATGCTCCAAAGCAAACACGAGCACAAGTGGAAATTAGAAATGCA
AAAATGGATGCTTTGGAAGAGGATATGGACATGGAACCTCTTTGCGAAAGAGAGCAGTCTCACTGGATCCGTCTCACATCTTCCTTCACCTGCCCATCTGAGATT
GCCGGTCTGGACAAACTTATTTGGATTCGCACCCGAGATGGCTCTTTTGTTAAGGACAATGGTAGTCTTAGTATTCCGAACGCCATATGGGCCCTCCTCCTTGAT
GCTTTTGACCTCGGTACTGTCTTTTCTTGTCATATTGACAGTTTCATCAAAGAAAGTCTTTCCAGAAAGCTCTTAGGTAGATACCACACCTTATGGAGCAATGCG
GTCGATTCCACTCTTTGGCATGTTTTGTTGGAAAAAAAGCAATTGACTTTTGATGATTTGGACGAGAAGATTGGACATTCAGTTCCATTAACTTGTGATACTCGT
TGCTCAAACGCTAGCCCAGCTGAAGATGAGAGCTGCCGACGTTCGTGCTTTCGAAACGGATTTAGAAATGGAATTTTGAGCCTCATTCTTGTTTTTTCGCTACTG
AGTAATGGAAGTTTGCTTGGAATAATGGTAGAAGCCAGGCCTCTCAATATGGTCGGCACCGCAGTTGCCGAGGATTTCTTTGCTGGATTATCCCTCGGAGCCAGC
AAGCAGTCCGGTCCAAGCCCGGGCGGGGATGGCCACGCATTCACCAACGTCGATACTCTCGGCGGTATCAAGGACTCTGGCCCGAGCCCCGGGGTTGGGCACAGC
CAAGTCTCCGGCCAGCACCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGCAGAGTGCCAAAACTTATGCTTTAGAAGAAGATATGGAAATGGAGCATGCATTGGAGACCAAACGGGAAATGCACTTTGTTGCTCTCTTCAAACAG
CCTGCGTTCGTCCAAGGCAGACAGATCACAGATGCGATTCTAATGGCTAATGAAATTGTCTACCACTGGCGCTGCACCAATACACAAAGTATCATGTCTCACAAG
AGCTTCAACAAAAACTGGCTTGATTGGATACGTGGCTGTATCTCTTCGGTGAATGGCAAACCAAGGGGGAGAATAAATGCAACAAAAGGTATACGTCAAGGAGCT
AGACCGTTGTCTCATTTTCTCTTTGTTCTGGCAATGGATTATCTTAGTCAGCTCCTTGACCATGCTTTCAGAAAAGGCCTCATCAAAGATGTGCAAGAGAAGGAG
GGACATTTAATTCCATTAGTTTGTGATTCAGTTTGCTCAAATGATAGTGCAGCTGAAGATCAGCAGTGCAAAAATTCATGCTTTCGAAGAAGATTTGGACATTGG
AGTCGCCAGGTCGTCTTGGTGGTTATGGGGGTTTTGGAAGTAGTAGTTAGGAGGCTACTGAGGTTGAATCATCCATTGGTTATTCTAGTCGTGGACCTTACAGTG
ATGGGTAGCGCAATATTTCCATGGATTGTAATACTACAAGCCTCAGAGTCTTTCATATGTGATGCTCCAAAGCAAACACGAGCACAAGTGGAAATTAGAAATGCA
AAAATGGATGCTTTGGAAGAGGATATGGACATGGAACCTCTTTGCGAAAGAGAGCAGTCTCACTGGATCCGTCTCACATCTTCCTTCACCTGCCCATCTGAGATT
GCCGGTCTGGACAAACTTATTTGGATTCGCACCCGAGATGGCTCTTTTGTTAAGGACAATGGTAGTCTTAGTATTCCGAACGCCATATGGGCCCTCCTCCTTGAT
GCTTTTGACCTCGGTACTGTCTTTTCTTGTCATATTGACAGTTTCATCAAAGAAAGTCTTTCCAGAAAGCTCTTAGGTAGATACCACACCTTATGGAGCAATGCG
GTCGATTCCACTCTTTGGCATGTTTTGTTGGAAAAAAAGCAATTGACTTTTGATGATTTGGACGAGAAGATTGGACATTCAGTTCCATTAACTTGTGATACTCGT
TGCTCAAACGCTAGCCCAGCTGAAGATGAGAGCTGCCGACGTTCGTGCTTTCGAAACGGATTTAGAAATGGAATTTTGAGCCTCATTCTTGTTTTTTCGCTACTG
AGTAATGGAAGTTTGCTTGGAATAATGGTAGAAGCCAGGCCTCTCAATATGGTCGGCACCGCAGTTGCCGAGGATTTCTTTGCTGGATTATCCCTCGGAGCCAGC
AAGCAGTCCGGTCCAAGCCCGGGCGGGGATGGCCACGCATTCACCAACGTCGATACTCTCGGCGGTATCAAGGACTCTGGCCCGAGCCCCGGGGTTGGGCACAGC
CAAGTCTCCGGCCAGCACCACTGA
Protein sequenceShow/hide protein sequence
MKMQSAKTYALEEDMEMEHALETKREMHFVALFKQPAFVQGRQITDAILMANEIVYHWRCTNTQSIMSHKSFNKNWLDWIRGCISSVNGKPRGRINATKGIRQGA
RPLSHFLFVLAMDYLSQLLDHAFRKGLIKDVQEKEGHLIPLVCDSVCSNDSAAEDQQCKNSCFRRRFGHWSRQVVLVVMGVLEVVVRRLLRLNHPLVILVVDLTV
MGSAIFPWIVILQASESFICDAPKQTRAQVEIRNAKMDALEEDMDMEPLCEREQSHWIRLTSSFTCPSEIAGLDKLIWIRTRDGSFVKDNGSLSIPNAIWALLLD
AFDLGTVFSCHIDSFIKESLSRKLLGRYHTLWSNAVDSTLWHVLLEKKQLTFDDLDEKIGHSVPLTCDTRCSNASPAEDESCRRSCFRNGFRNGILSLILVFSLL
SNGSLLGIMVEARPLNMVGTAVAEDFFAGLSLGASKQSGPSPGGDGHAFTNVDTLGGIKDSGPSPGVGHSQVSGQHH