; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0667 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0667
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSmall nuclear ribonucleoprotein family protein
Genome locationMC03:13620527..13625806
RNA-Seq ExpressionMC03g0667
SyntenyMC03g0667
Gene Ontology termsGO:0031417 - NatC complex (cellular component)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR034110 - LSM domain containing 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444477.1 PREDICTED: uncharacterized protein LOC103487784 [Cucumis melo]4.04e-6390.18Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQ GSN ESN  SLDCI KVRKLLFRRMLIGIKDGRFFLG F+CIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

XP_011649337.1 uncharacterized protein LOC101206200 [Cucumis sativus]2.84e-6391.07Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQ GSNVESNP SLD I KVRKLLFRRMLIGIKDGRFFLG F+CIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

XP_022140017.1 uncharacterized protein LOC111010777 [Momordica charantia]5.97e-74100Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STIVEQLALLSV
Subjt:  STIVEQLALLSV

XP_022927243.1 uncharacterized protein LOC111434149 [Cucurbita moschata]2.84e-6389.29Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQVG NVESNP SLD + KVRKLLFRRMLIGIKDGRFFLG+F+C+DKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

XP_023520042.1 uncharacterized protein LOC111783348 [Cucurbita pepo subsp. pepo]4.91e-6490.18Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQVGSNVESNP SLD + KVRKLLFRRMLIGIKDGRFFLG+F+C+DKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

TrEMBL top hitse value%identityAlignment
A0A0A0LSZ5 Sm domain-containing protein1.38e-6391.07Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQ GSNVESNP SLD I KVRKLLFRRMLIGIKDGRFFLG F+CIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

A0A1S3BAH4 uncharacterized protein LOC1034877841.96e-6390.18Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQ GSN ESN  SLDCI KVRKLLFRRMLIGIKDGRFFLG F+CIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

A0A5D3DAS2 Small nuclear ribonucleoprotein family protein isoform 21.96e-6390.18Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQ GSN ESN  SLDCI KVRKLLFRRMLIGIKDGRFFLG F+CIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

A0A6J1CFL8 uncharacterized protein LOC1110107772.89e-74100Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STIVEQLALLSV
Subjt:  STIVEQLALLSV

A0A6J1EKG3 uncharacterized protein LOC1114341491.38e-6389.29Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQES G  VQVG NVESNP SLD + KVRKLLFRRMLIGIKDGRFFLG+F+C+DKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
        STI EQLALLSV
Subjt:  STIVEQLALLSV

SwissProt top hitse value%identityAlignment
A4IGZ4 N-alpha-acetyltransferase 38, NatC auxiliary subunit3.0e-0636.46Show/hide
Query:  GSNVESNPGSLDCIA--------KVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        G + +S+PG+ D  A        K+  LL R M I + DGR  +G F C D+  N+IL  A E+     S P   E R LGL ++P    VS  V+
Subjt:  GSNVESNPGSLDCIA--------KVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

P63162 Small nuclear ribonucleoprotein-associated protein N3.3e-0539.71Show/hide
Query:  RMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTR---RSSPSPMEQRCLGLILIPNSCRVSCHVD
        RM   ++DGR F+GTF   DK  N+IL D  E+R  +      P   E+R LGL+L+     VS  V+
Subjt:  RMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTR---RSSPSPMEQRCLGLILIPNSCRVSCHVD

Q17QN3 Small nuclear ribonucleoprotein-associated protein N3.3e-0539.71Show/hide
Query:  RMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTR---RSSPSPMEQRCLGLILIPNSCRVSCHVD
        RM   ++DGR F+GTF   DK  N+IL D  E+R  +      P   E+R LGL+L+     VS  V+
Subjt:  RMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTR---RSSPSPMEQRCLGLILIPNSCRVSCHVD

Q55A45 Small nuclear ribonucleoprotein-associated protein B8.7e-0634.52Show/hide
Query:  AKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQ-RCLGLILIPNSCRVSCHVDSTIVEQLAL
        +K+ + +  RM + I+DGR  +G F   DK  N+++ DA E+R  R+      E+ R LG+ILI     VS  V++   E+  L
Subjt:  AKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQ-RCLGLILIPNSCRVSCHVDSTIVEQLAL

Q6GQ67 N-alpha-acetyltransferase 38-A, NatC auxiliary subunit1.5e-0535.42Show/hide
Query:  GSNVESNPGSLDCIA--------KVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        G + +S+P + D  A        K+  LL R M I + DGR  +G F C D+  N+IL  A E+     S P   E R LGL ++P    VS  V+
Subjt:  GSNVESNPGSLDCIA--------KVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Arabidopsis top hitse value%identityAlignment
AT4G18372.1 Small nuclear ribonucleoprotein family protein6.8e-3867.86Show/hide
Query:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD
        MEQ +E     V S  E +    D I+++RKLLFR+ML+GIKDGRFFLG FHCIDKQGNIILQD VEYRS RRSSPSP EQRCLG+ILIP+SCR SCHVD
Subjt:  MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVD

Query:  STIVEQLALLSV
         +I EQL+L+ +
Subjt:  STIVEQLALLSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAGAATCAGAGGGACCCAGGGTTCAGGTTGGGAGCAATGTCGAGTCTAATCCGGGCAGTTTAGATTGCATAGCAAAGGTGAGAAAGCTACTGTTTCGCCGAAT
GCTCATAGGCATTAAAGATGGAAGATTTTTCCTGGGAACTTTTCACTGCATTGACAAGCAAGGAAATATCATTCTACAAGATGCAGTGGAGTATCGTAGTACTCGACGTA
GCTCACCTTCTCCGATGGAACAACGGTGCCTCGGCCTTATTCTTATCCCCAACTCTTGCCGTGTGTCCTGTCATGTAGATAGTACCATTGTGGAACAATTAGCGCTGCTA
TCAGTCTAG
mRNA sequenceShow/hide mRNA sequence
CCCCCTTAAAAATAGTTTTTTTTTAATAGTAAATTTTCATTTATTTGGATCGGCCTCCACTCTCATCAGCGGCGACTACGAGAAGCCAAGAATTGCAATCGCCCGCCAAT
TTAGCTCTTAGCCTCTCTTTCTACTGCGGGAAGGGATCAATCGAACCATACATGGAACAAGAATCAGAGGGACCCAGGGTTCAGGTTGGGAGCAATGTCGAGTCTAATCC
GGGCAGTTTAGATTGCATAGCAAAGGTGAGAAAGCTACTGTTTCGCCGAATGCTCATAGGCATTAAAGATGGAAGATTTTTCCTGGGAACTTTTCACTGCATTGACAAGC
AAGGAAATATCATTCTACAAGATGCAGTGGAGTATCGTAGTACTCGACGTAGCTCACCTTCTCCGATGGAACAACGGTGCCTCGGCCTTATTCTTATCCCCAACTCTTGC
CGTGTGTCCTGTCATGTAGATAGTACCATTGTGGAACAATTAGCGCTGCTATCAGTCTAGAAATCAAGACTAAGCCTGGGGTTAAGAAAGGAGAAAAAAGGACTAGCACA
CTTGTTTGAAGAGAGTGTTTATGATTATTTCTTTTCAGTAGAAATATGGGATAAGGAGATAAAAAAAATTTTCTCTTACAAGGTTGTGGATAACCAGAATATATTCTCCT
TTTCTTTATTCATTGAATCAAGTGCATAAAAGTGGAATCTTCAACAATGGCATGCTATTTGGATGGAGACAGAAACACTGAATTGTTTCTAGTTTGGAAAGCTACACTCC
AGAAATTAAGGCTAGAAAGTTTTAACAGATGAGTAAAAACTCTTTGAATCTGTACAACAATAATATTTTCAATGATTGAACCACCCAACAGAGGGAAGGAAGAGTAATAT
TCCATAAATAAACAATTATTCACAACTCAGACCAAGATCTAGAAAGAATTCATATGAATCATAACACTTGTGAAGAAAGCAAATTCATTTATGTTAGCAAATTGTCATTT
TACGGTGCAGTCCACGGTCTTGGAGAGGAGTTGGAGTTGCATGAATGGTTGTGTTCCCTTGTGAAAAGATTGTAAAACTTTATCAAAATCTCTCACTGTAAGGTGTTCCG
TATACAATAAGGTATGGAACCGTACGTACAACTGTGTCTATTGGAATTGCAAAATTAACACCAGAAGACATTCCAGTTCCTGCAATGCATAAACAAAGATGAATAATATA
ACTTCGAATAGAATTGGCACAAAAAAACCGCAGTCCTTTCTTTCTTTCTTTACTATTTTATGTTCTTTCAAGTAGCAAATAAAAGAGAGAGAGAGAGAGATGGAGTTAGC
TTTGATAATTACTAAGCTTTAAGGCAAAGAAATCAGCCATATTTTTCTCTGACAAACTAAGCAGATGAAGAAATTGAGGCCCTTTACCTTTGCGAGTGAAAGTTGCTGTG
TTGACTCCAATTACATGACCGTAGGAGTCAATTAATGGCCCCCCTGAATTCCCTGTAAAACTCTAATGCATTAAAATGTGGGAAAGTTAAAATATGATGATGATTATTAC
GGGTGACTGGGGCAGTAATGAAATCAAGCCAGATTAGAAAGGAACAAGATAAGCTGCCATATATGTACAAGGAGCTTAACAGCGCTTTAGTGTCAAGAATAAAAAAGGCT
CACTCTTGCAATATACATGATGGGAAGTAGAAAATCTTTAGAAAGCGGCAGAAGAAAAGGCATGAAAAAACAAGTTTGAGAAATGGCGGTATCTGATTTCAGGTAGTTGA
ACCATGAACCTGAACTAATAGCAGCATCTGTCTGAATACCCCCCCGAATGGCCCTTCCATTTGGTGATGGAATTTCTCTACCCAATCCACTGATCACCTGCTATTCAAAC
CCTGAATGATTTACAGTTCTCGATTATTTAGTCAATCCACTATGCTGACTCTCAGCAGGTGTTTCTTGGTAAAGAACACCATCAATCTATGAGTAGGAAACTTGACAGTG
ACACTTAGAAAAACTCTTACTTCATCCACAGTCTTTGATAAATAAAACATAAATGTGTGTGAGATGAGAGATATCATATCTCCTATGAGAAATATGTTTATTATAAGAGA
GAAGTATTTTCACTATCCAATTTAGTTTTAGCTTTGAAATAGAAAATCCAAAGAGGGTTTCCTCTGAATTAAATGGAGAAAGTAACATTATTGATGTCTTACCCCTGCTG
TTAGTGTCTTCTCATAACCAAAAGGGTTGCCAATGGCATAGCAGCTCTGACCAACACGTAAATTTCGAGAGGTACCGAGAACGATGGGCTTTAGTTCACATCCTCCAAGT
TCCACCTGGATGTTACATGGCAACTTCAAACTCAGCAAATCATGGTATAGTATAAGAGATTGACAGTACAGATTGTCCAAACAGTGAAGCCTGTTTTATGGAATAGAAAA
CAAATGACATGTTATCATCATCACTCCATAGCTCACAGGGTTCACTTTCCACTCCTCCCAATCTGCTGCTTTCACACATCACATAAGGAACGGAAAACAGGATGAGAATC
ACCGAGACACAATTCACTCAATCTGGTATTCCCCTTGAGTCTCGAGCTAAATTAGAACAAAACGAAACAGAATATATATGGGCACCATCTCAAGAGGGATTTTGTAAATG
TCAAAATCATTTCAACTTCTATATTGAGTAGACAGCACCCAGTTTGAACACGACAGAGCAGACAAGAAGCTTCAAAGAAAATTAGGAAAGAAAAAAAAAAGTTTGGTTCT
ATACCTTGAGAACAGCTAGATCATACTCTGGATCAAAACCTACAATTTTTGCTTCCCTATAAATTCCATTTCCTTTAGCATCGACTAAATTTACCTACAGAAGCAAGCCA
AATGGAGTGGCTAGAGACTGATAATATATCTTCCATTATGAAGACATAAATAACAATTCAAAAGACAGTAACAAATAAACAAAACCTTACAACGCTGCAATCCACTGTTA
TCAGTAGCCAATGCGGAAACAACATGGTAATTAGTTACCTGCCACAAGTAATCACAAAGCCCTAATGAAATTCCTCTGTGAAATGAACAGAAAAAGAAATGTTTCCGATA
CCCAAAATCAATAAAATGAAAATGACAGTCACAAAGAATGAAGGCAAGTAAGAATTAAAGGAGAGAAGAGCATACGATATGGCCAAATTTATCCCATACAAAGCCCGAAC
CAGTCCCTTTGACCTTGACATTCTCATCCTCGACGAGCAGGGCCTCTTCAGAGGAGTTCTGGGGTTTCTTAGCTAATTCAAGGTCCTTAATGTAAACGACAGAAGGTGAA
GCATCCTACATTTGGAATTGGGACCCAACAGAGCAAAACAGAAAACAAAGAAGCAAGATTAAATTTCAATTGAAACTGAAGAATTCTGCAGTTATGGTGAGTGAGAAAGA
GAGGGACCTGAAAGAGAGCGACGACTCGATCTTCTTCTTGTGGAACCTGGGCCTGTATTTGGGGGAGAGCGGCGTGAGTGGGAAGAGGGAAAGCGAGGAGGGAAGCCATC
AAAGCAGATGGGGCAAAAACTAGGGCTCTTCGCGAAGTGAAGGGTAGAGAGTTGTGGGAAGAATTTGGGGGAGCTGGAATTGGAAGAAGATGAATTCCCAGTGAGGCTAA
CGCCATGGTGTTCCACTTCCTCTGATCTTGATTCTTATCGTGATTTTCCCTTTTTAAATTCAGCAACTGCAATTCAAACCTTAATCAATTGAACATTATGCTTTAGAAAC
AAACGAACATTATAAATTTTGAAGTTCTTGAAGTTCCCTTATCCCTATGAAAGATTAAAGCTATGCTCTAACTCTTTCGCCACAACTTATCTGAAAGTTGTTTAATCCAG
AAGGGTAAATCTAAGAAGTTAATTAATCTGAAATGGTTTAGTTTTGAGAGTAGAACTACTAAAGTATTGCAATTTTTACTCTCGAATAGATTTTCGTTTGAAAGATTATT
GAAAAAGTATTATTACGTGGAATGATATAAAACATATATTCGTATGGAATTGATAGGACTCAAACAAAAAATGAAAT
Protein sequenceShow/hide protein sequence
MEQESEGPRVQVGSNVESNPGSLDCIAKVRKLLFRRMLIGIKDGRFFLGTFHCIDKQGNIILQDAVEYRSTRRSSPSPMEQRCLGLILIPNSCRVSCHVDSTIVEQLALL
SV