; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g30670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g30670
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRicin B-type lectin domain-containing protein
Genome locationchr4:22991977..22992543
RNA-Seq ExpressionMoc04g30670
SyntenyMoc04g30670
Gene Ontology termsNA
InterPro domainsIPR021410 - The fantastic four family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016540.1 hypothetical protein SDJN02_21649 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-4256.59Show/hide
Query:  PSSTRESDIDDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVRHH
        PSS  + + DDYIG+ESCVDLK+N TAAD  P    P   + KR+ K      + ++ YPPPIPLLVRTENL SH+PWV+KR YTGDGRLILTEEKV+HH
Subjt:  PSSTRESDIDDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVRHH

Query:  EYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFEKGGKMK----CLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR
        E+FRAHRSDGRLMLQLV +DD +  E G ++E ++E  E GG  +    C    +  R RD A MFGV V   +AG+LR+VR
Subjt:  EYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFEKGGKMK----CLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR

XP_022141391.1 uncharacterized protein LOC111011804, partial [Momordica charantia]8.0e-102100Show/hide
Query:  SQSQSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLIL
        SQSQSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLIL
Subjt:  SQSQSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLIL

Query:  TEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVRS
        TEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVRS
Subjt:  TEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVRS

XP_022939970.1 uncharacterized protein LOC111445672 [Cucurbita moschata]2.1e-4158.15Show/hide
Query:  SSTRESDI--DDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVR
        SS   SDI  DDYIG+ESCVDLK+N TAAD  P    P   + KR+ K  W G  + ++ YPPPIPLLVRTENL SH+PWV+KR YTGDGRLILTEEKV+
Subjt:  SSTRESDI--DDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVR

Query:  HHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFEKGGKMK----CLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR
        HHE+FRAHRSDGRLMLQLV +DD +  E G ++E ++E  E GG  +    C    +  R RD A MFGV V    AG+LR+VR
Subjt:  HHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFEKGGKMK----CLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR

XP_022992861.1 uncharacterized protein LOC111489066 [Cucurbita maxima]5.5e-4255.73Show/hide
Query:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILT
        +S +V      + + DDYIG+ESCVDLK + TAAD  P    P   + KR+NK  W G  + ++ YPPPIPLLVRTENLASH+PWV+KR YTGDGRLILT
Subjt:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILT

Query:  EEKVRHHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFE-------KGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR
        EEKV+HHE+FRAHRSDGRLMLQLV +DD +  E G ++E+E+E+ E       +GGK    +N    R RD A MFGV V   +AG+LR+VR
Subjt:  EEKVRHHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFE-------KGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR

XP_023551237.1 uncharacterized protein LOC111809117 [Cucurbita pepo subsp. pepo]5.0e-4361.33Show/hide
Query:  SSTRESDI--DDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVR
        SS   SDI  DDYIG+ESCVDLK++ TA DT P    P   + KR+ K  W G  + ++ YPPPIPLLVRTENLASH+PWV+KR YTGDGRLILTEEKV+
Subjt:  SSTRESDI--DDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVR

Query:  HHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKM--KCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR
        HHE+FRAHRSDGRLMLQLV +DD D +   D+E E+E  E GG    K  K   V R RD A MFGV V   SAG+LR+VR
Subjt:  HHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKM--KCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR

TrEMBL top hitse value%identityAlignment
A0A6J1CJ12 uncharacterized protein LOC1110118043.9e-102100Show/hide
Query:  SQSQSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLIL
        SQSQSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLIL
Subjt:  SQSQSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLIL

Query:  TEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVRS
        TEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVRS
Subjt:  TEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVRS

A0A6J1FP95 uncharacterized protein LOC1114456721.0e-4158.15Show/hide
Query:  SSTRESDI--DDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVR
        SS   SDI  DDYIG+ESCVDLK+N TAAD  P    P   + KR+ K  W G  + ++ YPPPIPLLVRTENL SH+PWV+KR YTGDGRLILTEEKV+
Subjt:  SSTRESDI--DDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVR

Query:  HHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFEKGGKMK----CLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR
        HHE+FRAHRSDGRLMLQLV +DD +  E G ++E ++E  E GG  +    C    +  R RD A MFGV V    AG+LR+VR
Subjt:  HHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFEKGGKMK----CLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR

A0A6J1GVY4 uncharacterized protein LOC1114580317.5e-3752.97Show/hide
Query:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEE
        QS+ + P S R+ D DDYIG+ESCVDL EN T++D          + +R+NKMWG   +   EYPPPI LLVRTENLAS MPWVLKRHYT DGRLILTEE
Subjt:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEE

Query:  KVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGK-MKCLKNGIVR-RRRDPALMFGVAVPALSAGTLRTVRS
        ++R++E+FRAHRS+GRLMLQLVA DD     G D+ + +   E GG   +C      R       L  G   P    G LRTVRS
Subjt:  KVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGK-MKCLKNGIVR-RRRDPALMFGVAVPALSAGTLRTVRS

A0A6J1JTI2 uncharacterized protein LOC1114873722.0e-3765Show/hide
Query:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEE
        QS+ + P S R+ DIDDYIGMESCVDL+EN TAAD          + KR+NKM G   +   EYPPPI LLVRTENLAS MPWVLKR YT DGRLILT+E
Subjt:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEE

Query:  KVRHHEYFRAHRSDGRLMLQLVALDD-GDGESGQDQEDED
        ++RH+E+FRAHRSDGRLMLQLVA DD G  ES  +  DED
Subjt:  KVRHHEYFRAHRSDGRLMLQLVALDD-GDGESGQDQEDED

A0A6J1JUP6 uncharacterized protein LOC1114890662.7e-4255.73Show/hide
Query:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILT
        +S +V      + + DDYIG+ESCVDLK + TAAD  P    P   + KR+NK  W G  + ++ YPPPIPLLVRTENLASH+PWV+KR YTGDGRLILT
Subjt:  QSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPP-PMKFSDKRENK-MWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILT

Query:  EEKVRHHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFE-------KGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR
        EEKV+HHE+FRAHRSDGRLMLQLV +DD +  E G ++E+E+E+ E       +GGK    +N    R RD A MFGV V   +AG+LR+VR
Subjt:  EEKVRHHEYFRAHRSDGRLMLQLVALDDGD-GESGQDQEDEDEDFE-------KGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22110.1 structural constituent of ribosome8.2e-2846.54Show/hide
Query:  SSTRESDIDDYIGMESCVDL---KENQTAADTMPPPPPMKFSDKRENKMWGGTGRKE--------REYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLI
        SS+  S + DYIG ESC D+    E          P   +F        +GG  R+E        RE+PPPIPLL +T NL  HMPWVLKR  T DGRLI
Subjt:  SSTRESDIDDYIGMESCVDL---KENQTAADTMPPPPPMKFSDKRENKMWGGTGRKE--------REYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLI

Query:  LTEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQD-----QEDEDEDFEKGGKMKC
        L EEKVRHHEYFRA+RS+GRL L LV LDD   +  Q+      +DE++D E   + +C
Subjt:  LTEEKVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQD-----QEDEDEDFEKGGKMKC

AT1G77932.1 Protein of unknown function (DUF3049)1.0e-1746.85Show/hide
Query:  YIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVRHHEYFRAHRSDGRL
        YIG ESC +   ++T    +P        ++RE K    T  +E   PP +P         SH+P VLKR YT DGRL+L EEKV  +EYFRAHRS+GRL
Subjt:  YIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVRHHEYFRAHRSDGRL

Query:  MLQLVALDDGD
        M+QLV+LD+ D
Subjt:  MLQLVALDDGD

AT5G22390.1 Protein of unknown function (DUF3049)2.6e-0527.13Show/hide
Query:  IGMESCVDLKENQTAADTMPPPPPMKFSDK----RENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVRHHEYFRAHRSD
        +G ES   L++ +   D       ++   K     + + W    ++ +EYPP              M  +  + Y  +GRL+L E ++   E+ RA R D
Subjt:  IGMESCVDLKENQTAADTMPPPPPMKFSDK----RENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEEKVRHHEYFRAHRSD

Query:  GRLMLQLVALDDGDGESGQDQEDE--DED
        GRL L+LV  +D   E  ++++D+  DED
Subjt:  GRLMLQLVALDDGDGESGQDQEDE--DED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGCCAGAGCCAGAGCCAGAGGGTTTTCCCTTCCAGTACTCGAGAGTCCGATATTGATGATTACATTGGCATGGAGAGTTGTGTTGATTTGAAAGAGAAT
CAAACGGCGGCCGACACGATGCCACCGCCGCCGCCGATGAAATTCTCCGATAAGAGGGAGAATAAAATGTGGGGGGGAACGGGGAGGAAGGAGAGGGAGTATCCT
CCGCCGATCCCATTGCTGGTTCGCACCGAGAATCTGGCTTCCCACATGCCGTGGGTGTTGAAACGGCATTACACCGGCGACGGGCGGCTGATACTGACGGAGGAG
AAAGTGAGGCACCACGAATACTTCCGGGCCCACAGATCCGACGGCCGTCTGATGCTGCAGCTGGTGGCCCTCGACGACGGCGACGGCGAATCCGGGCAAGATCAG
GAGGATGAGGATGAGGATTTTGAGAAAGGGGGGAAAATGAAATGCTTAAAGAATGGGATTGTGAGGAGGCGCAGAGATCCAGCGCTTATGTTCGGAGTGGCAGTG
CCTGCACTTTCTGCAGGGACCCTGAGGACCGTTCGAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGCCAGAGCCAGAGCCAGAGGGTTTTCCCTTCCAGTACTCGAGAGTCCGATATTGATGATTACATTGGCATGGAGAGTTGTGTTGATTTGAAAGAGAAT
CAAACGGCGGCCGACACGATGCCACCGCCGCCGCCGATGAAATTCTCCGATAAGAGGGAGAATAAAATGTGGGGGGGAACGGGGAGGAAGGAGAGGGAGTATCCT
CCGCCGATCCCATTGCTGGTTCGCACCGAGAATCTGGCTTCCCACATGCCGTGGGTGTTGAAACGGCATTACACCGGCGACGGGCGGCTGATACTGACGGAGGAG
AAAGTGAGGCACCACGAATACTTCCGGGCCCACAGATCCGACGGCCGTCTGATGCTGCAGCTGGTGGCCCTCGACGACGGCGACGGCGAATCCGGGCAAGATCAG
GAGGATGAGGATGAGGATTTTGAGAAAGGGGGGAAAATGAAATGCTTAAAGAATGGGATTGTGAGGAGGCGCAGAGATCCAGCGCTTATGTTCGGAGTGGCAGTG
CCTGCACTTTCTGCAGGGACCCTGAGGACCGTTCGAAGCTAA
Protein sequenceShow/hide protein sequence
MQSQSQSQRVFPSSTRESDIDDYIGMESCVDLKENQTAADTMPPPPPMKFSDKRENKMWGGTGRKEREYPPPIPLLVRTENLASHMPWVLKRHYTGDGRLILTEE
KVRHHEYFRAHRSDGRLMLQLVALDDGDGESGQDQEDEDEDFEKGGKMKCLKNGIVRRRRDPALMFGVAVPALSAGTLRTVRS