; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g25770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g25770
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description101 kDa malaria antigen-like
Genome locationchr3:18539308..18539868
RNA-Seq ExpressionMoc03g25770
SyntenyMoc03g25770
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008480 - Protein of unknown function DUF761, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022140144.1 uncharacterized protein LOC111010873 [Momordica charantia]2.2e-91100Show/hide
Query:  MANNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYRET
        MANNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYRET
Subjt:  MANNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYRET

Query:  RRGNSAKTYRRSKSEKAVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDADGA
        RRGNSAKTYRRSKSEKAVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDADGA
Subjt:  RRGNSAKTYRRSKSEKAVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDADGA

XP_022927241.1 uncharacterized protein LOC111434146 [Cucurbita moschata]1.1e-2147.78Show/hide
Query:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSY-------
        K   ++L+L AE A  FFA FLLS+F Y S  NLS L TLF    FWFC+SNTLIFIIA    AFS PP    +  +  S+V I    PN          
Subjt:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSY-------

Query:  ---------RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMT----IKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYG
                  E    N  K+Y RSKSEKA   V KE KIAMRRSKTMT        +E+EE     NE  EM++EELN++VEEFIERFNR++RLQ +  G
Subjt:  ---------RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMT----IKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYG

Query:  DAD
        D++
Subjt:  DAD

XP_023001471.1 uncharacterized protein LOC111495597 [Cucurbita maxima]1.4e-2150.53Show/hide
Query:  LNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPP-------------EQIKSKQSENSAVKIVRERPNKSY
        L+L AE A  F A FLLS+F Y S  NLS L TLF    FWFC+SNTL+ IIA    AFSPPP                    +EN   +I    P +  
Subjt:  LNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPP-------------EQIKSKQSENSAVKIVRERPNKSY

Query:  RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDAD
              NS K+Y RSKSEKA   V KE KIAMRRSKTMT    +EEEE     NE  EM++EELN+RVEEFIERFNR+IRLQ +  GD++
Subjt:  RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDAD

XP_023520395.1 uncharacterized protein LOC111783708 isoform X1 [Cucurbita pepo subsp. pepo]8.1e-2246.19Show/hide
Query:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYR------
        K   ++L+L AE A  F A FLLS+F Y S  NLS L TLF    FWFC+SNTLIFIIA    AFSPPP       + NSA  I+   P  ++       
Subjt:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYR------

Query:  ---------------------ETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIR
                             E    NS K+Y  SKSEKA   V KE K+AMRRSKTMT    +E+EE     NE  EM++EELN+RVEEFIERFNR++R
Subjt:  ---------------------ETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIR

Query:  LQDLEYGDAD
        LQ +  GD++
Subjt:  LQDLEYGDAD

XP_023520396.1 uncharacterized protein LOC111783708 isoform X2 [Cucurbita pepo subsp. pepo]5.6e-2348.5Show/hide
Query:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYR------
        K   ++L+L AE A  F A FLLS+F Y S  NLS L TLF    FWFC+SNTLIFIIA    AFSPPP       + NSA  I+   P  ++       
Subjt:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYR------

Query:  -----------ETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDAD
                   E    NS K+Y  SKSEKA   V KE K+AMRRSKTMT    +E+EE     NE  EM++EELN+RVEEFIERFNR++RLQ +  GD++
Subjt:  -----------ETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDAD

TrEMBL top hitse value%identityAlignment
A0A0A0LUF0 Uncharacterized protein1.1e-1941.46Show/hide
Query:  NNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVK-------IVRERPN-
        N KL +++L L A+GA    +FF    F Y S+ +      LF+   FWF +SNTLIF+IA+D GAFS P   + + +   S+ +       +V + PN 
Subjt:  NNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVK-------IVRERPN-

Query:  -----KSYRET------------RRGNSAKTYRRSKSEK----AVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRI
                 ET            +  N  K Y+RSKSEK     VEK +K+ MRRSKTM  +     +E  +E++EF +MTDEELNRRVEEFIERFNR+I
Subjt:  -----KSYRET------------RRGNSAKTYRRSKSEK----AVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRI

Query:  RLQDL
        RLQ++
Subjt:  RLQDL

A0A1S3B9V3 101 kDa malaria antigen-like3.7e-2040.64Show/hide
Query:  NNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPP--------PEQIKSKQSENSAVKIVRERPN
        N KL +++L+L A+ A     FF    F Y S+ +       F+   FWF +SNTLIFIIA+D GAFS P        P     + + N+ + +V E PN
Subjt:  NNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPP--------PEQIKSKQSENSAVKIVRERPN

Query:  ------KSYRETRR------------------GNSAKTYRRSKSEKAVE----KERKIAMRRSKTM-------TIKLHDEEEEASKEDNEFEEMTDEELN
              K   E                      N  K Y+RSKSEK ++    K +KI M+RSKTM       T +  +EEEE  +E NEF +MTDEELN
Subjt:  ------KSYRETRR------------------GNSAKTYRRSKSEKAVE----KERKIAMRRSKTM-------TIKLHDEEEEASKEDNEFEEMTDEELN

Query:  RRVEEFIERFNRRIRLQDL
        RRVEEFIERFNR+IRLQ +
Subjt:  RRVEEFIERFNRRIRLQDL

A0A6J1CEX1 uncharacterized protein LOC1110108731.1e-91100Show/hide
Query:  MANNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYRET
        MANNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYRET
Subjt:  MANNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYRET

Query:  RRGNSAKTYRRSKSEKAVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDADGA
        RRGNSAKTYRRSKSEKAVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDADGA
Subjt:  RRGNSAKTYRRSKSEKAVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDADGA

A0A6J1EH50 uncharacterized protein LOC1114341465.1e-2247.78Show/hide
Query:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSY-------
        K   ++L+L AE A  FFA FLLS+F Y S  NLS L TLF    FWFC+SNTLIFIIA    AFS PP    +  +  S+V I    PN          
Subjt:  KLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSY-------

Query:  ---------RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMT----IKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYG
                  E    N  K+Y RSKSEKA   V KE KIAMRRSKTMT        +E+EE     NE  EM++EELN++VEEFIERFNR++RLQ +  G
Subjt:  ---------RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMT----IKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYG

Query:  DAD
        D++
Subjt:  DAD

A0A6J1KIQ0 uncharacterized protein LOC1114955976.7e-2250.53Show/hide
Query:  LNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPP-------------EQIKSKQSENSAVKIVRERPNKSY
        L+L AE A  F A FLLS+F Y S  NLS L TLF    FWFC+SNTL+ IIA    AFSPPP                    +EN   +I    P +  
Subjt:  LNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPP-------------EQIKSKQSENSAVKIVRERPNKSY

Query:  RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDAD
              NS K+Y RSKSEKA   V KE KIAMRRSKTMT    +EEEE     NE  EM++EELN+RVEEFIERFNR+IRLQ +  GD++
Subjt:  RETRRGNSAKTYRRSKSEKA---VEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDAD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30190.1 unknown protein3.3e-0526.56Show/hide
Query:  LSIFTYVSVLNLSD--LPTLFSDAKFWFCISNTLIFIIAVDFGAFS--------------------------------------------PPPEQIKSKQ
        L IFTY+ + ++ +  L ++F D K  F ISNTLI IIA D+G+FS                                            P   + ++ +
Subjt:  LSIFTYVSVLNLSD--LPTLFSDAKFWFCISNTLIFIIAVDFGAFS--------------------------------------------PPPEQIKSKQ

Query:  SENSAV-----------KIVRERPNKSYRE----------------------TRRG-NSAKTYRRSKSEKAVEKERKIAMRRSKTMT-----------IK
         E+  +           KIVR    K  R+                      TR   N  K Y RSKS+K   K   +    +K  +           + 
Subjt:  SENSAV-----------KIVRERPNKSYRE----------------------TRRG-NSAKTYRRSKSEKAVEKERKIAMRRSKTMT-----------IK

Query:  LHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQ
        + ++ E   +E  EF ++++EELN+RVEEFI+RFNR+IR Q
Subjt:  LHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQ

AT2G34610.1 unknown protein1.8e-0629.57Show/hide
Query:  LLSIFTYVSVLNLSDLP--TLFSDAKFWFCISNTLIFIIAVDFGAF----------------------SPPPEQI------------KSKQSENSAVKIV
        +LSIFTY+ + ++ D+   ++F+D K  F ISN LI IIA D+GAF                      +P PE+             + KQ      + +
Subjt:  LLSIFTYVSVLNLSDLP--TLFSDAKFWFCISNTLIFIIAVDFGAF----------------------SPPPEQI------------KSKQSENSAVKIV

Query:  RER--PNKSYRETRR--------------------------------------GN--SAKTYRRSKSEKA----VEKERK---IAMRR----------SK
        +++  PNK  + T R                                       N  ++K Y RSKS+KA    + KER+   I  R           SK
Subjt:  RER--PNKSYRETRR--------------------------------------GN--SAKTYRRSKSEKA----VEKERK---IAMRR----------SK

Query:  TMTI----KLHDEEEEASK-------EDNEFEEMTDEELNRRVEEFIERFNRRIRLQ
         M +    K  +E EEA+K       E  EF +M++EELNRRVE+FI+RFNR I+ Q
Subjt:  TMTI----KLHDEEEEASK-------EDNEFEEMTDEELNRRVEEFIERFNRRIRLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACAACAAATTAGCCGAAGAATACCTCAATTTAATCGCAGAAGGAGCTCCTTCTTTCTTCGCCTTTTTCCTTCTCTCCATCTTCACTTACGTCTCCGTCCTCAA
CCTCTCCGATCTCCCGACGCTCTTCAGCGACGCCAAATTCTGGTTCTGCATCTCCAATACTCTTATTTTCATAATCGCCGTCGATTTCGGAGCCTTCTCTCCGCCGCCCG
AGCAGATCAAATCGAAGCAATCGGAAAACAGCGCAGTGAAAATCGTTCGGGAGAGACCTAATAAAAGTTACCGAGAGACGAGGAGAGGTAATTCGGCGAAAACTTACCGG
CGAAGCAAGTCGGAGAAAGCGGTGGAGAAGGAGAGGAAGATCGCGATGAGGAGGTCGAAAACGATGACGATAAAATTACACGACGAAGAAGAAGAAGCGTCGAAGGAAGA
TAATGAATTTGAAGAGATGACAGATGAAGAACTGAATAGAAGAGTTGAAGAGTTTATTGAAAGATTCAACAGAAGGATTCGACTTCAAGATCTCGAATATGGGGATGCCG
ATGGAGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAACAACAAATTAGCCGAAGAATACCTCAATTTAATCGCAGAAGGAGCTCCTTCTTTCTTCGCCTTTTTCCTTCTCTCCATCTTCACTTACGTCTCCGTCCTCAA
CCTCTCCGATCTCCCGACGCTCTTCAGCGACGCCAAATTCTGGTTCTGCATCTCCAATACTCTTATTTTCATAATCGCCGTCGATTTCGGAGCCTTCTCTCCGCCGCCCG
AGCAGATCAAATCGAAGCAATCGGAAAACAGCGCAGTGAAAATCGTTCGGGAGAGACCTAATAAAAGTTACCGAGAGACGAGGAGAGGTAATTCGGCGAAAACTTACCGG
CGAAGCAAGTCGGAGAAAGCGGTGGAGAAGGAGAGGAAGATCGCGATGAGGAGGTCGAAAACGATGACGATAAAATTACACGACGAAGAAGAAGAAGCGTCGAAGGAAGA
TAATGAATTTGAAGAGATGACAGATGAAGAACTGAATAGAAGAGTTGAAGAGTTTATTGAAAGATTCAACAGAAGGATTCGACTTCAAGATCTCGAATATGGGGATGCCG
ATGGAGCGTAG
Protein sequenceShow/hide protein sequence
MANNKLAEEYLNLIAEGAPSFFAFFLLSIFTYVSVLNLSDLPTLFSDAKFWFCISNTLIFIIAVDFGAFSPPPEQIKSKQSENSAVKIVRERPNKSYRETRRGNSAKTYR
RSKSEKAVEKERKIAMRRSKTMTIKLHDEEEEASKEDNEFEEMTDEELNRRVEEFIERFNRRIRLQDLEYGDADGA