CuGenDBv2

Spg037979 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037979
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: MSC domain-containing protein
Genome location: scaffold12:40383518..40389663
RNA-Seq Expression: Spg037979
Synteny: Spg037979
Gene Ontology terms:
GO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domains:
IPR018996 - Man1/Src1, C-terminal
IPR041885 - MAN1, winged-helix domain
IPR044780 - Heh2/Src1-like
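
The genome location listed above follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the genomic span of the gene could be computed as follows. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

scaffold, start, end = parse_location("scaffold12:40383518..40389663")
print(scaffold, span_length(start, end))  # scaffold12 6146
```

Under that assumption the locus spans 6146 bp on scaffold12.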


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606225.1 hypothetical protein SDJN03_03542, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.3e-194; % identity: 87.86
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK PPRDFFPSK+DL RLITVLFIA LVFVSCNF VSR E RRPRPFCD+DADS DLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CPSHGEC EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MY KSKAL+TIG
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GLFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNS  +  K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN

KAG7036172.1 hypothetical protein SDJN02_02973 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.7e-194; % identity: 87.4
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK PPRDFFPSK+DL RL+TVLFIA LVFVSCNF VSR E RRPRPFCD+DADS DLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CPSHGEC EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MY KSKAL+TIG
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GLFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGNAA
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNS  +  K+ N A
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGNAA

XP_022958030.1 uncharacterized protein LOC111459381 isoform X3 [Cucurbita moschata]; e value: 3.8e-194; % identity: 87.6
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK PPRDFFPSK+DL RLITVLFIA LVFVSCNF VSR E RRPRPFCD+DADS DLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CPSHGEC EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MY KSKAL+TIG
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GLFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRM MGVNS  +  K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN

XP_023533380.1 uncharacterized protein LOC111795284 isoform X2 [Cucurbita pepo subsp. pepo]; e value: 1.0e-194; % identity: 87.18
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL S+K PPRDFFPSK+DL RLITVLFIA LVFVSCNF VSR E RRPRPFCD+DADS DLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CPSHGEC EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MY KSKAL+TIG
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GLFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGNAAS
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNS  +  K+ N  S
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGNAAS

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida]; e value: 2.0e-195; % identity: 86.82
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPK+RTK K+N NSDV S+GDSS S ST LLKSIKEPPRDFFPSK+DLA LITVLFIACL+FVSC+F VSR  +R+PRPFCDTDADSLDLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CP HGECR+GKL+CLHGYRKHGRLCIEDGVINEAV KLSEWLESHLCEANAKFLCDGI IVWVKEDDIWDDLDG+ LVE+IGSDNTT  Y KSKAL+TIG
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAF VLPV LLLVGC WLLWKLYRRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN
        ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLA+KS+S  AMGV++  +  K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T509 MSC domain-containing protein; e value: 4.0e-189; % identity: 83.59
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPK+RTK K+NPNSDV S  DSS S S+ LLKS+KEPPRDFFPSK+DLA LITVL IA LVFVSCNF VSR  +R P PFCDTDADSLDLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CP HGECR+GKLECLHGYRKHGRLCIEDGVINEAV KLSEWLESHLCE+NAKFLCDGI IVWVKE+DIWDDLDG+ LVE+IGSDNTT MY KSKAL+TIG
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GL QTRQNS GIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPV LLLVGC WLLWKLYRRQ  TNRAEDLYNQVC+ILEENA+ STRNS QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSR------MAMGVNSAPLCHKIGNAAS
        ASRLRDHLLLPRERK+PLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+K+LASKS+S        A+GVN  P+ HKI N  S
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSR------MAMGVNSAPLCHKIGNAAS

A0A6J1E026 uncharacterized protein LOC111026156 isoform X1; e value: 5.6e-191; % identity: 83.97
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRR K K NP+SD  SKGDSSAS ST LLKS+K+PPRDFFPS+ DL RLITVLFIACLVF+SCNF VSR  +RRP PFCDTDADSLDLLSD C+P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CPSHGECR G+LEC+ GYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKF+CDG+  VWVKEDDIWDDLDGQALVENIGSDNTTFMY K KAL+TI 
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GLFQT+QNSLGI+ELKCPDLLAESYKPFTCRI HWVL+HAFVVLPV LLLVGC WLLWKLYRRQ+ TNRAE+LYNQVC+ILEENA+MS R SGQCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGNAASVAA
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRLASK SSR+AM VNS  +  K+     V +
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGNAASVAA

A0A6J1H1Y9 uncharacterized protein LOC111459381 isoform X1; e value: 6.0e-193; % identity: 87.15
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK PPRDFFPSK+DL RLITVLFIA LVFVSCNF VSR E RRPRPFCD+DADS DLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQT
        CPSHGEC EGKLEC HGYR+HGRLCIEDGVIN+AVKKL  SEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MY KSKAL+T
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQT

Query:  IGGLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESW
        IGGLFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESW
Subjt:  IGGLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRM MGVNS  +  K+ N
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN

A0A6J1H2A7 uncharacterized protein LOC111459381 isoform X3; e value: 1.9e-194; % identity: 87.6
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK PPRDFFPSK+DL RLITVLFIA LVFVSCNF VSR E RRPRPFCD+DADS DLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CPSHGEC EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MY KSKAL+TIG
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV
        GLFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESWVV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRM MGVNS  +  K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN

A0A6J1H3U4 uncharacterized protein LOC111459381 isoform X2; e value: 5.6e-191; % identity: 86.89
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK PPRDFFPSK+DL RLITVLFIA LVFVSCNF VSR E RRPRPFCD+DADS DLLSD CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQT
        CPSHGEC EGKLEC HGYR+HGRLCIEDGVIN+AVKKL  SEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MY KSKAL+T
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQT

Query:  IGGLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESW
        IGGLFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQY TNRAEDLYNQVC+ILEENA+MSTRNSGQCESW
Subjt:  IGGLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQ EGSLSSSKEKRLASKSSSRM MGVNS  +  K+ N
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). e value: 2.4e-93; % identity: 46.15
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP
        M S P++R   K    +    K  SS+SP    ++S+ EPP+  FPSK +   L+ VL +AC V  +CNF+     +   + FCD++ + +D   D CEP
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEP

Query:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG
        CP +GEC +GKL+C  GY+    LC+EDG INE+ KKL  + E  +CE+ A   C G   +WV E+D+W +L   + + N+  D + + ++K KA++ + 
Subjt:  CPSHGECREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIG

Query:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMS-TRNSGQCESWV
         L + R NS GI ELKCP+ +A+SYKP TCR+  W+L+H  ++     +LVG A L  ++ R+Q  + R E+LY+QVC  LEENA+ S +  +  CE WV
Subjt:  GLFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMS-TRNSGQCESWV

Query:  VASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKR
        +AS LRD+LLLPRER+DPLLW KVEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS SK K+
Subjt:  VASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKR
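
In the alignments above, the line between Query and Subjt marks identical residues with the residue letter, conservative substitutions with `+`, and other positions with a space. A rough sketch of how a percent identity could be recomputed from one such segment is given here. The function name is hypothetical, and the calculation treats the query segment length as the alignment length, so the value for a single segment need not match the whole-alignment figure reported for a hit.

```python
def percent_identity(query_segment, midline):
    """Percent of alignment columns whose midline character is a residue letter.

    Letters in the midline mark identical residues; '+' and spaces do not count.
    The query segment (including any gap characters) defines the column count,
    because trailing spaces of the midline may have been stripped.
    """
    if not query_segment:
        raise ValueError("empty alignment segment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query_segment), 2)

# First 20 columns of the first GenBank alignment on this page.
query = "MSSTPKRRTKFKQNPNSDVV"
midline = "MSSTPKRRTKFK N NSDV "
print(percent_identity(query, midline))
```

To reproduce a whole-alignment value, the segments of one hit would be concatenated before calling the function.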


Sequences
CDS sequence
ATGTCTTCAACTCCGAAGAGGCGAACGAAATTCAAGCAAAATCCGAACTCCGATGTCGTTTCTAAAGGTGATTCTTCTGCTTCACCTTCTACAGCGTTGCTGAAATCCAT
CAAGGAACCGCCTCGCGATTTCTTTCCCTCCAAGGAAGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTCTTTGTGAGTTGTAACTTCGTCGTATCTA
GATTTGAAAATCGTCGCCCGAGGCCTTTCTGCGACACAGACGCCGATTCCTTGGATTTGCTTTCTGATGATTGTGAGCCTTGTCCAAGTCATGGAGAATGCCGTGAAGGT
AAGTTGGAATGCCTTCATGGCTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGGGTAATTAATGAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTG
TGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGAGATAGTTTGGGTTAAAGAGGATGATATTTGGGATGATCTAGATGGTCAGGCGCTGGTGGAAAATATTGGCTCTG
ACAACACCACTTTTATGTATGTAAAGAGCAAGGCATTGCAAACTATTGGTGGGTTATTTCAGACACGGCAGAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTT
CTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTGCCAGTTTCTTTACTGCTTGTGGGATGCGCATGGTTACT
ATGGAAACTTTACCGGAGACAATATCGAACAAATAGGGCTGAAGATCTGTACAACCAGGTTTGCAAAATACTCGAGGAAAATGCTATGATGTCTACGAGAAACAGTGGTC
AATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAAGATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAAGAAGAC
TCACGAATAGATCGTTACCCGAGACTGGTTAAGGGTGATGGAAAAGAAGTATGGGAGTGGCAAGTAGAAGGCTCTTTGAGCTCTTCAAAGGAAAAGAGACTGGCCAGTAA
ATCCAGTTCCAGGATGGCAATGGGAGTAAATTCTGCCCCATTATGCCATAAAATTGGGAATGCTGCTTCTGTTGCGGCAACTGGGGCCTGTAGGGGCATG
mRNA sequence
ATGTCTTCAACTCCGAAGAGGCGAACGAAATTCAAGCAAAATCCGAACTCCGATGTCGTTTCTAAAGGTGATTCTTCTGCTTCACCTTCTACAGCGTTGCTGAAATCCAT
CAAGGAACCGCCTCGCGATTTCTTTCCCTCCAAGGAAGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTCTTTGTGAGTTGTAACTTCGTCGTATCTA
GATTTGAAAATCGTCGCCCGAGGCCTTTCTGCGACACAGACGCCGATTCCTTGGATTTGCTTTCTGATGATTGTGAGCCTTGTCCAAGTCATGGAGAATGCCGTGAAGGT
AAGTTGGAATGCCTTCATGGCTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGGGTAATTAATGAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTG
TGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGAGATAGTTTGGGTTAAAGAGGATGATATTTGGGATGATCTAGATGGTCAGGCGCTGGTGGAAAATATTGGCTCTG
ACAACACCACTTTTATGTATGTAAAGAGCAAGGCATTGCAAACTATTGGTGGGTTATTTCAGACACGGCAGAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTT
CTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTGCCAGTTTCTTTACTGCTTGTGGGATGCGCATGGTTACT
ATGGAAACTTTACCGGAGACAATATCGAACAAATAGGGCTGAAGATCTGTACAACCAGGTTTGCAAAATACTCGAGGAAAATGCTATGATGTCTACGAGAAACAGTGGTC
AATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAAGATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAAGAAGAC
TCACGAATAGATCGTTACCCGAGACTGGTTAAGGGTGATGGAAAAGAAGTATGGGAGTGGCAAGTAGAAGGCTCTTTGAGCTCTTCAAAGGAAAAGAGACTGGCCAGTAA
ATCCAGTTCCAGGATGGCAATGGGAGTAAATTCTGCCCCATTATGCCATAAAATTGGGAATGCTGCTTCTGTTGCGGCAACTGGGGCCTGTAGGGGCATG
Protein sequence
MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPPRDFFPSKEDLARLITVLFIACLVFVSCNFVVSRFENRRPRPFCDTDADSLDLLSDDCEPCPSHGECREG
KLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYVKSKALQTIGGLFQTRQNSLGIKELKCPDL
LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYRTNRAEDLYNQVCKILEENAMMSTRNSGQCESWVVASRLRDHLLLPRERKDPLLWRKVEELVQED
SRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLCHKIGNAASVAATGACRGM
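
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (the page does not say which table was used). `translate` is an illustrative helper, and only the first 21 nucleotides of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing one or two bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCTTCAACTCCGAAGAGG"))  # MSSTPKR
```

The output matches the first seven residues of the protein sequence listed above.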