; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS018679 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS018679
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionnucleolin
Genome locationscaffold313:988196..988756
RNA-Seq ExpressionMS018679
SyntenyMS018679
Gene Ontology termsGO:0006413 - translational initiation (biological process)
GO:0003743 - translation initiation factor activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579085.1 hypothetical protein SDJN03_23533, partial [Cucurbita argyrosperma subsp. sororia]1.4e-6980.81Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ
        MDFRK   GSGN+DEQVEQLLQAAQDDLMLKL++DSHMSRVSPNYLDSDLDRRFQALRSRPSS+AAAA RNPRPS    +SD  S+PP IENL VDGESQ
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ

Query:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD-------DDGDDESSSSDEDADDRRKERRKKKA
        SILGDDLA RFAALKASLPSS  PPPSS+PNDVDS+DEEDEVEKLIQWAKDAARLDPSPPS++DD       DD D+E  SSDED DDR+KERRKKKA
Subjt:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD-------DDGDDESSSSDEDADDRRKERRKKKA

KAG7016610.1 hypothetical protein SDJN02_21720, partial [Cucurbita argyrosperma subsp. argyrosperma]4.7e-7082.05Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ
        MDFRK   GSGN+DEQVEQLLQAAQDDLMLKL++DSHMSRVSPNYLDSDLDRRFQALRSRPSS+AAAA RNPRPS    +SD  S+PP IENL VDGESQ
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ

Query:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD----DDGDDESSSSDEDADDRRKERRKKKA
        SILGDDLA RFAALKASLPSS  PPPSS+PNDVDS+DEEDEVEKLIQWAKDAARLDPSPPS++DD    DD D+E +SSDED DDR+KERRKKKA
Subjt:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD----DDGDDESSSSDEDADDRRKERRKKKA

XP_022141565.1 uncharacterized protein LOC111011897 [Momordica charantia]1.4e-90100Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILG
        MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILG
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILG

Query:  DDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA
        DDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA
Subjt:  DDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA

XP_022939254.1 nucleolin [Cucurbita moschata]8.9e-6982.81Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ
        MDFRK    SGN+DEQVEQLLQAAQDDLMLKL++DSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAA+ RNPRPS    +SD  S+PP IENL VDGESQ
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ

Query:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDES-SSSDEDADDRRKERRKKKA
        SILGDDLA RFAALKASLPSS  PPPSS+PNDVDS+DEEDEVEKLIQWAKDAARLDPSPPS++DD+D DDE  +SSDED DDR+KERRKKKA
Subjt:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDES-SSSDEDADDRRKERRKKKA

XP_023550367.1 nucleolin [Cucurbita pepo subsp. pepo]1.7e-6781.03Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ
        MDFRK   GSGN+DEQVEQLLQAAQDDLMLK ++DSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAA+ RNPRPS    +SD  S+PP IENL VDGESQ
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ

Query:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD----DDGDDESSSSDEDADDRRKERRKKKA
        SILGDDLA RFAALKASLPSS  PPPSS+ NDVDS+DEEDEVEKLIQWAKDAARLDPSPPS++DD    DD D+E +SSDED DDR+KERRKKKA
Subjt:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD----DDGDDESSSSDEDADDRRKERRKKKA

TrEMBL top hitse value%identityAlignment
A0A1S4E6L5 uncharacterized protein LOC1035028662.1e-6379.38Show/hide
Query:  MDFRKKAGGSGNDD--EQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSK-PPAIENLPVDG
        MD RKK  G GNDD  EQ+EQLLQAAQDDL+LKLTLDSHMSRVSPNYL SDLDRRFQALRSRPSS AAA PRNPR S    +SD  S+ PP IENL VDG
Subjt:  MDFRKKAGGSGNDD--EQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSK-PPAIENLPVDG

Query:  ESQSILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA
        ESQSILGDDLA RFAALKASLPSST PP SS+PNDVDS DEEDEVEKLIQWAKDAARLDPSPPS++DD+  ++E  SSDED +DR KERRKKKA
Subjt:  ESQSILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA

A0A5D3CTC3 Transcription initiation factor TFIID subunit 11 isoform X31.2e-6379.9Show/hide
Query:  MDFRKKAGGSGNDD--EQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSK-PPAIENLPVDG
        MD RKK  G GNDD  EQ+EQLLQAAQDDLMLKLTLDSHMSRVSPNYL SDLDRRFQALRSRPSS AAA PRNPR S    +SD  S+ PP IENL VDG
Subjt:  MDFRKKAGGSGNDD--EQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSK-PPAIENLPVDG

Query:  ESQSILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA
        ESQSILGDDLA RFAALKASLPSST PP SS+PNDVDS DEEDEVEKLIQWAKDAARLDPSPPS++DD+  ++E  SSDED +DR KERRKKKA
Subjt:  ESQSILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA

A0A6J1CIG2 uncharacterized protein LOC1110118976.8e-91100Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILG
        MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILG
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILG

Query:  DDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA
        DDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA
Subjt:  DDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA

A0A6J1FGM0 nucleolin4.3e-6982.81Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ
        MDFRK    SGN+DEQVEQLLQAAQDDLMLKL++DSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAA+ RNPRPS    +SD  S+PP IENL VDGESQ
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ

Query:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDES-SSSDEDADDRRKERRKKKA
        SILGDDLA RFAALKASLPSS  PPPSS+PNDVDS+DEEDEVEKLIQWAKDAARLDPSPPS++DD+D DDE  +SSDED DDR+KERRKKKA
Subjt:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDES-SSSDEDADDRRKERRKKKA

A0A6J1JTW4 glucosidase 2 subunit beta3.1e-6780.51Show/hide
Query:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ
        MDFRK   GSGN+DEQVEQLLQAAQDDLMLKL++DSHMSRVSPNYLDSDLDRRFQALRSRPSSAAA   RNPR S    +SD  S+PP IENL VDGESQ
Subjt:  MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPS----RSDLPSKPPAIENLPVDGESQ

Query:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD----DDGDDESSSSDEDADDRRKERRKKKA
        SILGDDLA RFAALKASLPSS  PPPSS+PNDVDS+DEEDEVEKLIQWAKDA RLDPSPPS++DD    DD D+E +SSDED DDR+KERRKKKA
Subjt:  SILGDDLATRFAALKASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDD----DDGDDESSSSDEDADDRRKERRKKKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G24370.1 unknown protein7.4e-2143.45Show/hide
Query:  GNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILGDDLATRFAAL
        G ++++VEQLLQAAQD+++LKL++DSH SR S +YLD DL  RF AL+S+         +  RP     P     +E  P          +DL  RFAAL
Subjt:  GNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILGDDLATRFAAL

Query:  KASLPSSTPPPPSSMPNDV----DSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDD---DGDDESSSS
        K SLPS++      + +++    D   E+ EV+KLIQWA DAARLDPSP SD +     D DDE+ S+
Subjt:  KASLPSSTPPPPSSMPNDV----DSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDD---DGDDESSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCAGAAAGAAGGCTGGGGGAAGCGGCAACGACGACGAACAAGTGGAGCAATTGCTGCAAGCTGCTCAGGACGATCTCATGCTCAAATTGACTCTCGATTCCCA
CATGTCTCGCGTATCCCCAAACTATCTCGATTCCGATCTCGATCGCCGTTTCCAAGCCCTCAGATCGCGTCCTTCCTCCGCCGCCGCCGCCGCTCCACGCAATCCACGGC
CTTCTCGATCGGATTTGCCGTCGAAGCCACCGGCGATTGAAAATCTTCCTGTCGATGGCGAATCTCAGTCGATTCTCGGTGACGATCTCGCTACTAGATTCGCTGCTCTG
AAGGCGTCTTTGCCTTCCTCGACTCCTCCGCCTCCGTCGTCGATGCCTAACGACGTTGATAGCGAGGATGAAGAGGATGAGGTCGAGAAGTTGATTCAGTGGGCCAAGGA
TGCTGCTCGGCTCGATCCTTCACCGCCATCTGATCAAGACGACGACGACGGCGACGACGAGTCTAGTAGTTCCGATGAAGATGCCGACGATCGGAGGAAAGAACGTCGGA
AAAAGAAGGCG
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCAGAAAGAAGGCTGGGGGAAGCGGCAACGACGACGAACAAGTGGAGCAATTGCTGCAAGCTGCTCAGGACGATCTCATGCTCAAATTGACTCTCGATTCCCA
CATGTCTCGCGTATCCCCAAACTATCTCGATTCCGATCTCGATCGCCGTTTCCAAGCCCTCAGATCGCGTCCTTCCTCCGCCGCCGCCGCCGCTCCACGCAATCCACGGC
CTTCTCGATCGGATTTGCCGTCGAAGCCACCGGCGATTGAAAATCTTCCTGTCGATGGCGAATCTCAGTCGATTCTCGGTGACGATCTCGCTACTAGATTCGCTGCTCTG
AAGGCGTCTTTGCCTTCCTCGACTCCTCCGCCTCCGTCGTCGATGCCTAACGACGTTGATAGCGAGGATGAAGAGGATGAGGTCGAGAAGTTGATTCAGTGGGCCAAGGA
TGCTGCTCGGCTCGATCCTTCACCGCCATCTGATCAAGACGACGACGACGGCGACGACGAGTCTAGTAGTTCCGATGAAGATGCCGACGATCGGAGGAAAGAACGTCGGA
AAAAGAAGGCG
Protein sequenceShow/hide protein sequence
MDFRKKAGGSGNDDEQVEQLLQAAQDDLMLKLTLDSHMSRVSPNYLDSDLDRRFQALRSRPSSAAAAAPRNPRPSRSDLPSKPPAIENLPVDGESQSILGDDLATRFAAL
KASLPSSTPPPPSSMPNDVDSEDEEDEVEKLIQWAKDAARLDPSPPSDQDDDDGDDESSSSDEDADDRRKERRKKKA