; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010971 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010971
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMSC domain-containing protein
Genome locationChr01:1062149..1067681
RNA-Seq ExpressionHG10010971
SyntenyHG10010971
Gene Ontology termsGO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR018996 - Man1/Src1, C-terminal
IPR041885 - MAN1, winged-helix domain
IPR044780 - Heh2/Src1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038534.1 MSC domain-containing protein [Cucumis melo var. makuwa]5.3e-20289.11Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGE
        ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIENGE
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGE

XP_004148518.1 uncharacterized protein LOC101208017 isoform X1 [Cucumis sativus]9.0e-20288.66Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD
        MSSTPKKRTK KRNPNSDVGSG     DSS SSS++LLKSIKEPPRDFFPSKDDLAAL TVL IAC VFV+CNFFVSRLSSRHP PFCDTDADS DF+SD
Subjt:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD

Query:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL
        VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKAL
Subjt:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL

Query:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCE
        ETIG L QTRQNSLGIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQYLTNRAEDLYNQVCEILEENAL STRNSGQCE
Subjt:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCE

Query:  SWVVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
        SWVVASRLRDHLLLPRER+NPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS +KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  SWVVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

XP_008465930.1 PREDICTED: uncharacterized protein LOC103503505 isoform X3 [Cucumis melo]9.9e-20189.06Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
        ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

XP_023533380.1 uncharacterized protein LOC111795284 isoform X2 [Cucurbita pepo subsp. pepo]4.6e-19085.05Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPK+RTKFK N NSDV S  DS  SSS VLL S+K PPRDFFPSKDDL  L TVLFIA LVFV+CNFFVSRL +R PRPFCD+DADS D LSD CEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CP HGEC +GKLEC HGYR+HGRLCIEDGVIN+AV KL EWLES LCEANAKFLCDGIGIVWV+ED IWDDLDGK LV++I SDNTT+MYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         LFQ RQN+LGIKELKCPD LAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL +RQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIENG
        ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEK+LASKSSSR AMGV++D +Y K+ENG
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIENG

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida]3.5e-20691.21Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRN NSDVGS GDSS SSST+LLKSIKEPPRDFFPSKDDLAAL TVLFIACL+FV+C+FFVSRL+SR PRPFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKL+CLHGYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKED+IWDDLDGKELV+SIGSDNTTL YAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         LFQTRQNSLGIKELKCPDLLAESYKPF CRI HWVLQHAF VLPV LLLVGCTWLLWKL++RQY+TNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIEN
        ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEK+LA+KS+S KAMGVSTD+M+ K+EN
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIEN

TrEMBL top hitse value%identityAlignment
A0A0A0LI89 MSC domain-containing protein4.3e-20288.66Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD
        MSSTPKKRTK KRNPNSDVGSG     DSS SSS++LLKSIKEPPRDFFPSKDDLAAL TVL IAC VFV+CNFFVSRLSSRHP PFCDTDADS DF+SD
Subjt:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD

Query:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL
        VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKAL
Subjt:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL

Query:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCE
        ETIG L QTRQNSLGIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQYLTNRAEDLYNQVCEILEENAL STRNSGQCE
Subjt:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCE

Query:  SWVVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
        SWVVASRLRDHLLLPRER+NPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS +KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  SWVVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

A0A1S3CQ15 uncharacterized protein LOC103503505 isoform X34.8e-20189.06Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
        ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

A0A5A7T509 MSC domain-containing protein2.5e-20289.11Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGE
        ASRLRDHLLLPRERKNPLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIENGE
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGE

A0A6J1E026 uncharacterized protein LOC111026156 isoform X11.6e-18883.16Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPK+R K K NP+SD GS GDSSASSSTVLLKS+K+PPRDFFPS++DL  L TVLFIACLVF++CNFFVSRL+SR P PFCDTDADSLD LSD C+P
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CP HGECR G+LEC+ GYRKHGRLCIEDGVINEAV KL EWLES LCEANAKF+CDG+G VWVKED+IWDDLDG+ LV++IGSDNTT MYAK KALETI 
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         LFQT+QNSLGI+ELKCPDLLAESYKPF CRIHHWVL+HAFVVLPV LLLVGCTWLLWKL++RQ+LTNRAE+LYNQVCEILEENALMS R SGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIE
        ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEK+LASK SSR AM V++DR+Y K++
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIE

A0A6J1H2A7 uncharacterized protein LOC111459381 isoform X32.5e-18985.01Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPK+RTKFK N NSDV S  DS  SSS VLL SIK PPRDFFPSKDDL  L TVLFIA LVFV+CNFFVSRL +R PRPFCD+DADS D LSD CEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CP HGEC +GKLEC HGYR+HGRLCIEDGVIN+AV KL EWLES LCEANAKFLCDGIGIVWV+ED IWDDLDGK LV++I SDNTT+MYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
         LFQ RQN+LGIKELKCPD LAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL +RQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIEN
        ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEK+LASKSSSR  MGV++D +Y K+EN
Subjt:  ASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIEN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).5.4e-9645.79Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        M S P+KR      P S+  +G    +SSS+  ++S+ EPP+  FPSK +   L  VL +AC V  TCNF    LSS   + FCD++ + +D   D+CEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CP +GEC  GKL+C  GY+    LC+EDG INE+  KLV + E ++CE+ A   C G G +WV E+++W +L     + ++  D +   + K KA+E + 
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMS-TRNSGQCESWV
         L + R NS GI ELKCP+ +A+SYKP  CR+H W+L+H  ++     +LVG   L  ++ ++Q  + R E+LY+QVC+ LEENA+ S +  +  CE WV
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMS-TRNSGQCESWV

Query:  VASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK-EKKLASKSSSRKAMGVST
        +AS LRD+LLLPRER++PLLW KVEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS SK +K+  ++   RK++  ST
Subjt:  VASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK-EKKLASKSSSRKAMGVST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGTTCTGGAGGCGATTCCTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTAT
CAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTATTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGACTTGTAACTTCTTCGTATCTA
GACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTTTGGATTTTCTTTCTGATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGT
AAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAGCAGTTAATAAACTTGTAGAATGGCTAGAATCTCGCCTCTG
TGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGACAATATATGGGATGATCTAGATGGTAAAGAACTGGTGGACAGTATTGGCTCTG
ACAATACCACTCTTATGTATGCAAAGAGCAAGGCGTTGGAAACTATTGGTAGGTTATTTCAGACGCGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTG
CTAGCTGAAAGTTACAAGCCTTTTCGTTGCCGTATTCATCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTACCAGTTCTCTTACTGCTTGTGGGATGCACATGGTTACT
ATGGAAACTTTTCCAAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTCGAGGAAAATGCTTTGATGTCAACGAGAAACAGTGGTC
AATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAGAATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAGGAAGAC
TCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTTCTAAGGAAAAGAAACTGGCCAGCAA
ATCCAGTTCCAGGAAGGCAATGGGAGTAAGTACTGATCGAATGTATCATAAAATAGAGAACGGTGAGTTGTGTAATGATCTTTTACACAAACATATGCTTGATCAAGTTA
CAGAATCTGTTCTCATTCTATTCTCATGTGATCAACTTTTTGACTTTGAAATATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGTTCTGGAGGCGATTCCTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTAT
CAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTATTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGACTTGTAACTTCTTCGTATCTA
GACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTTTGGATTTTCTTTCTGATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGT
AAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAGCAGTTAATAAACTTGTAGAATGGCTAGAATCTCGCCTCTG
TGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGACAATATATGGGATGATCTAGATGGTAAAGAACTGGTGGACAGTATTGGCTCTG
ACAATACCACTCTTATGTATGCAAAGAGCAAGGCGTTGGAAACTATTGGTAGGTTATTTCAGACGCGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTG
CTAGCTGAAAGTTACAAGCCTTTTCGTTGCCGTATTCATCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTACCAGTTCTCTTACTGCTTGTGGGATGCACATGGTTACT
ATGGAAACTTTTCCAAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTCGAGGAAAATGCTTTGATGTCAACGAGAAACAGTGGTC
AATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAGAATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAGGAAGAC
TCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTTCTAAGGAAAAGAAACTGGCCAGCAA
ATCCAGTTCCAGGAAGGCAATGGGAGTAAGTACTGATCGAATGTATCATAAAATAGAGAACGGTGAGTTGTGTAATGATCTTTTACACAAACATATGCTTGATCAAGTTA
CAGAATCTGTTCTCATTCTATTCTCATGTGATCAACTTTTTGACTTTGAAATATAA
Protein sequenceShow/hide protein sequence
MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDG
KLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDL
LAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVVASRLRDHLLLPRERKNPLLWRKVEELVQED
SRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIENGELCNDLLHKHMLDQVTESVLILFSCDQLFDFEI