; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi08G001673 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi08G001673
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionSmall nuclear ribonucleoprotein-associated protein B
Genome locationchr8:59226867..59229134
RNA-Seq ExpressionBhi08G001673
SyntenyBhi08G001673
Gene Ontology termsGO:0031417 - NatC complex (cellular component)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR034110 - LSM domain containing 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444477.1 PREDICTED: uncharacterized protein LOC103487784 [Cucumis melo]9.5e-5091.07Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQDGSN ESN ESLD +GKVR+LLFRRMLIGIKDGRFFLG+FYCIDKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

XP_011649337.1 uncharacterized protein LOC101206200 [Cucumis sativus]1.3e-5192.86Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQDGSNVESNPESLD +GKVR+LLFRRMLIGIKDGRFFLG+FYCIDKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

XP_022927243.1 uncharacterized protein LOC111434149 [Cucurbita moschata]2.5e-5091.96Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQ G NVESNP+SLDRVGKVR+LLFRRMLIGIKDGRFFLGSFYC+DKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

XP_023520042.1 uncharacterized protein LOC111783348 [Cucurbita pepo subsp. pepo]6.6e-5192.86Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQ GSNVESNP+SLDRVGKVR+LLFRRMLIGIKDGRFFLGSFYC+DKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

XP_038893823.1 uncharacterized protein LOC120082644 [Benincasa hispida]2.6e-55100Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDEKLALLSV
Subjt:  STIDEKLALLSV

TrEMBL top hitse value%identityAlignment
A0A0A0LSZ5 Sm domain-containing protein6.4e-5292.86Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQDGSNVESNPESLD +GKVR+LLFRRMLIGIKDGRFFLG+FYCIDKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

A0A1S3BAH4 uncharacterized protein LOC1034877844.6e-5091.07Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQDGSN ESN ESLD +GKVR+LLFRRMLIGIKDGRFFLG+FYCIDKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

A0A5D3DAS2 Small nuclear ribonucleoprotein family protein isoform 24.6e-5091.07Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQDGSN ESN ESLD +GKVR+LLFRRMLIGIKDGRFFLG+FYCIDKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

A0A6J1EKG3 uncharacterized protein LOC1114341491.2e-5091.96Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQES GSMVQ G NVESNP+SLDRVGKVR+LLFRRMLIGIKDGRFFLGSFYC+DKQGNIILQDAVEYRSTR SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

A0A6J1KMW6 uncharacterized protein LOC1114956141.0e-4990.18Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        M+QES GSMVQ GSNVESNP+SLDRVGKVR+LLFRRML+GIKDGRFFLGSFYC+DKQGNIILQDAVEYRST  SSPSPMEQRCLGLILIPNSCRVSCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
        STIDE+LALLSV
Subjt:  STIDEKLALLSV

SwissProt top hitse value%identityAlignment
A4IGZ4 N-alpha-acetyltransferase 38, NatC auxiliary subunit2.5e-0538.36Show/hide
Query:  KVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        K+  LL R M I + DGR  +G F C D+  N+IL  A E+     S P   E R LGL ++P    VS  ++
Subjt:  KVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

P63162 Small nuclear ribonucleoprotein-associated protein N7.4e-0535.8Show/hide
Query:  VGKVRRLLFR---RMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSS---PSPMEQRCLGLILIPNSCRVSCHID
        VGK  ++L     RM   ++DGR F+G+F   DK  N+IL D  E+R  +  +   P   E+R LGL+L+     VS  ++
Subjt:  VGKVRRLLFR---RMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSS---PSPMEQRCLGLILIPNSCRVSCHID

Q17QN3 Small nuclear ribonucleoprotein-associated protein N7.4e-0535.8Show/hide
Query:  VGKVRRLLFR---RMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSS---PSPMEQRCLGLILIPNSCRVSCHID
        VGK  ++L     RM   ++DGR F+G+F   DK  N+IL D  E+R  +  +   P   E+R LGL+L+     VS  ++
Subjt:  VGKVRRLLFR---RMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSS---PSPMEQRCLGLILIPNSCRVSCHID

Q55A45 Small nuclear ribonucleoprotein-associated protein B6.7e-0631.82Show/hide
Query:  LDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQ-RCLGLILIPNSCRVSCHIDSTIDEKLAL
        + +  K+ + +  RM + I+DGR  +G F   DK  N+++ DA E+R  R       E+ R LG+ILI     VS  +++   E+  L
Subjt:  LDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQ-RCLGLILIPNSCRVSCHIDSTIDEKLAL

Q6GQ67 N-alpha-acetyltransferase 38-A, NatC auxiliary subunit2.5e-0538.36Show/hide
Query:  KVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        K+  LL R M I + DGR  +G F C D+  N+IL  A E+     S P   E R LGL ++P    VS  ++
Subjt:  KVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Arabidopsis top hitse value%identityAlignment
AT4G18372.1 Small nuclear ribonucleoprotein family protein4.9e-3663.39Show/hide
Query:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID
        MEQ +E S     S  E +    D + ++R+LLFR+ML+GIKDGRFFLG+F+CIDKQGNIILQD VEYRS R SSPSP EQRCLG+ILIP+SCR SCH+D
Subjt:  MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHID

Query:  STIDEKLALLSV
         +IDE+L+L+ +
Subjt:  STIDEKLALLSV

AT4G20440.2 small nuclear ribonucleoprotein associated protein B4.9e-0428.77Show/hide
Query:  SLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRST-----RHSSPSPMEQRCLGLILI
        S+ +  K+ + +  RM + I+DGR  +G F   D+  N++L D  E+R       +  +    ++R LGL+L+
Subjt:  SLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRST-----RHSSPSPMEQRCLGLILI

AT4G20440.3 small nuclear ribonucleoprotein associated protein B4.9e-0428.77Show/hide
Query:  SLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRST-----RHSSPSPMEQRCLGLILI
        S+ +  K+ + +  RM + I+DGR  +G F   D+  N++L D  E+R       +  +    ++R LGL+L+
Subjt:  SLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRST-----RHSSPSPMEQRCLGLILI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAGAATCAGAAGGATCCATGGTTCAGGATGGGAGCAATGTTGAGTCTAACCCAGAGAGTTTAGATCGTGTAGGAAAGGTGAGAAGGCTGTTGTTCCGT
CGAATGCTCATAGGTATTAAAGATGGAAGGTTTTTCTTGGGCAGCTTTTACTGCATTGACAAGCAAGGAAATATTATCCTCCAAGATGCAGTAGAGTATCGTAGC
ACTCGGCATAGCTCGCCTTCTCCAATGGAACAACGGTGCCTTGGTCTTATTCTTATACCCAACTCTTGCCGTGTATCTTGTCATATAGATAGTACCATTGATGAA
AAATTGGCACTGCTATCAGTTTAG
mRNA sequenceShow/hide mRNA sequence
CAAAAAAATGAGAAGAAAAATTGATTTTTTTTTCCTAAGAATTTGAGTTTTTATTTATTTATTTATGGGATGTTACTCCACTCTCTTTCATCGTCGACGAGAAGC
TCAGAAACAGAATCGGCCGCCAAGTTCGCCTCTGTTCTGCCGTCTACCACCATCTTCTTTCTCCTTTCCTGCTGTGGCTTATTGAACAAGTACAACCCTAATCAG
TCGGCCAGTTTCGAGGATGGTTGCATGGATGGGGATCAGTTGAAGTAACCATAGATGGAACAAGAATCAGAAGGATCCATGGTTCAGGATGGGAGCAATGTTGAG
TCTAACCCAGAGAGTTTAGATCGTGTAGGAAAGGTGAGAAGGCTGTTGTTCCGTCGAATGCTCATAGGTATTAAAGATGGAAGGTTTTTCTTGGGCAGCTTTTAC
TGCATTGACAAGCAAGGAAATATTATCCTCCAAGATGCAGTAGAGTATCGTAGCACTCGGCATAGCTCGCCTTCTCCAATGGAACAACGGTGCCTTGGTCTTATT
CTTATACCCAACTCTTGCCGTGTATCTTGTCATATAGATAGTACCATTGATGAAAAATTGGCACTGCTATCAGTTTAGCAAATAAGATCAAACCTGAGTTTATGA
GGGGATGAAAAAAAATCTGCATACTTGTTTGATGAGAATGTTTATGACTGTTTCTATTACTACAGAAAAATAGTAAAACAATATGATGTTTTTGGTAGAAACATG
GGATTAGGAAATACGTTTTATTTCCCTTGATAGACTCTGAATAATCAGTGCACATTCTCACTTTATTAAGTTGAAGATTGCAATGCAAGATAGAACCTTTAAATA
GATTGCCG
Protein sequenceShow/hide protein sequence
MEQESEGSMVQDGSNVESNPESLDRVGKVRRLLFRRMLIGIKDGRFFLGSFYCIDKQGNIILQDAVEYRSTRHSSPSPMEQRCLGLILIPNSCRVSCHIDSTIDE
KLALLSV