; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g35560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g35560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMSC domain-containing protein
Genome locationchr6:27286541..27293364
RNA-Seq ExpressionMoc06g35560
SyntenyMoc06g35560
Gene Ontology termsGO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR018996 - Man1/Src1, C-terminal
IPR041885 - MAN1, winged-helix domain
IPR044780 - Heh2/Src1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606225.1 hypothetical protein SDJN03_03542, partial [Cucurbita argyrosperma subsp. sororia]4.1e-18483.29Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRR K K N +SD  SK DS  SSS VLL S+K PPRDFFPS++DL RLITVLFIA LVF+SCNFFVSRL +RRP PFCD+DADS DLLSDAC+P
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CPSHGEC  G+LEC  GYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKF+CDG+G VWV+ED IWDDLDG+ALVENI SDNTT MYAK KALETI 
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQ +QN+LGI+ELKCPD LAESYKPFTCRI HWVL+HAFVVLPV LLLVGCTWLLWKL RRQ+LTNRAE+LYNQVCEILEENALMS R SGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVDASL
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRLASK SSR+AM VNSD IY K++  L
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVDASL

KAG7036172.1 hypothetical protein SDJN02_02973 [Cucurbita argyrosperma subsp. argyrosperma]2.4e-18482.21Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRR K K N +SD  SK DS  SSS VLL S+K PPRDFFPS++DL RL+TVLFIA LVF+SCNFFVSRL +RRP PFCD+DADS DLLSDAC+P
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CPSHGEC  G+LEC  GYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKF+CDG+G VWV+ED IWDDLDG+ALVENI SDNTT MYAK KALETI 
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQ +QN+LGI+ELKCPD LAESYKPFTCRI HWVL+HAFVVLPV LLLVGCTWLLWKL RRQ+LTNRAE+LYNQVCEILEENALMS R SGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKV--DASLAGESEQEA
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRLASK SSR+AM VNSD IY K+  DA     SE EA
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKV--DASLAGESEQEA

XP_022159868.1 uncharacterized protein LOC111026156 isoform X1 [Momordica charantia]1.5e-223100Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD

XP_022159870.1 uncharacterized protein LOC111026156 isoform X2 [Momordica charantia]1.4e-22199.74Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQ EGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida]3.6e-18882.9Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPK+R K+K N +SD GS+GDSS SSST+LLKS+K+PPRDFFPS++DL  LITVLFIACL+F+SC+FFVSRLASR+P PFCDTDADSLDLLSD C+P
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CP HGECR G+L+C+ GYRKHGRLCIEDGVINEAV KLSEWLESHLCEANAKF+CDG+G VWVKEDDIWDDLDG+ LVE+IGSDNTT  YAK KALETI 
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQT+QNSLGI+ELKCPDLLAESYKPFTCRI HWVL+HAF VLPVFLLLVGCTWLLWKLYRRQ++TNRAE+LYNQVCEILEENALMS R SGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
        ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRLA+K +S  AM V++D+++ K++
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD

TrEMBL top hitse value%identityAlignment
A0A5A7T509 MSC domain-containing protein1.6e-18180.36Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPK+R K+K NP+SD GS  DSS SSS++LLKS+K+PPRDFFPS++DL  LITVL IA LVF+SCNFFVSRL+SR P PFCDTDADSLDLLSD C+P
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CP HGECR G+LEC+ GYRKHGRLCIEDGVINEAV KLSEWLESHLCE+NAKF+CDG+G VWVKE+DIWDDLDG+ LVE+IGSDNTT MYAK KALETI 
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GL QT+QNS GI+ELKCPDLLAESYKPFTCRI HWVL+HAFVVLPVFLLLVGCTWLLWKLYRRQ+LTNRAE+LYNQVCEILEENAL S R S QCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSR------VAMEVNSDRIYRKVD
        ASRLRDHLLLPRERK+PLLW+KVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSK+K+LASK +S        A+ VN D +Y K++
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSR------VAMEVNSDRIYRKVD

A0A6J1E002 uncharacterized protein LOC111026156 isoform X27.0e-22299.74Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQ EGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD

A0A6J1E026 uncharacterized protein LOC111026156 isoform X17.5e-224100Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVD

A0A6J1H1Y9 uncharacterized protein LOC111459381 isoform X11.9e-18282.61Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRR K K N +SD  SK DS  SSS VLL S+K PPRDFFPS++DL RLITVLFIA LVF+SCNFFVSRL +RRP PFCD+DADS DLLSDAC+P
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALET
        CPSHGEC  G+LEC  GYR+HGRLCIEDGVIN+AVKKL  SEWLESHLCEANAKF+CDG+G VWV+ED IWDDLDG+ALVENI SDNTT MYAK KALET
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALET

Query:  IIGLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESW
        I GLFQ +QN+LGI+ELKCPD LAESYKPFTCRI HWVL+HAFVVLPV LLLVGCTWLLWKL RRQ+LTNRAE+LYNQVCEILEENALMS R SGQCESW
Subjt:  IIGLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVDASL
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRLASK SSR+ M VNSD IY K++  L
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVDASL

A0A6J1H2A7 uncharacterized protein LOC111459381 isoform X35.8e-18483.03Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        MSSTPKRR K K N +SD  SK DS  SSS VLL S+K PPRDFFPS++DL RLITVLFIA LVF+SCNFFVSRL +RRP PFCD+DADS DLLSDAC+P
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CPSHGEC  G+LEC  GYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKF+CDG+G VWV+ED IWDDLDG+ALVENI SDNTT MYAK KALETI 
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV
        GLFQ +QN+LGI+ELKCPD LAESYKPFTCRI HWVL+HAFVVLPV LLLVGCTWLLWKL RRQ+LTNRAE+LYNQVCEILEENALMS R SGQCESWVV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVDASL
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRLASK SSR+ M VNSD IY K++  L
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVDASL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).3.8e-9546.7Show/hide
Query:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP
        M S P++R    P  ++  G    SS+SSS +  +S+ +PP+  FPS+ +   L+ VL +AC V  +CNF    L+S   + FCD++ + +D   D C+P
Subjt:  MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKP

Query:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII
        CP +GEC  G+L+C  GY+    LC+EDG INE+ KKL  + E  +CE+ A   C G GT+WV E+D+W +L   + + N+  D + + + K KA+E + 
Subjt:  CPSHGECRGGELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETII

Query:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRI-SGQCESWV
         L + + NS GI+ELKCP+ +A+SYKP TCR+H W+L+H  ++     +LVG   L  ++ R+Q  + R E LY+QVC+ LEENA+ S    +  CE WV
Subjt:  GLFQTQQNSLGIEELKCPDLLAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRI-SGQCESWV

Query:  VASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKR
        +AS LRD+LLLPRER+DPLLW KVEEL++EDSRIDRY +L+KGE K VWEWQVEGSLS SK K+
Subjt:  VASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCAACTCCGAAGAGGCGAAAGAAACTCAAGCCAAATCCGGACTCCGATGCCGGTTCTAAAGGCGATTCTTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTCT
CAAGCAACCGCCTCGCGATTTCTTTCCCTCCGAGAACGATCTCATTAGGCTAATTACTGTACTTTTCATCGCCTGCTTGGTTTTTCTGAGCTGTAACTTCTTCGTATCTA
GACTCGCGAGTCGCCGCCCGGAGCCTTTCTGCGACACTGACGCCGATTCCTTGGACTTGCTTTCTGATGCTTGCAAGCCTTGTCCAAGTCATGGAGAATGCCGTGGAGGT
GAGTTGGAATGTGTTCGTGGTTATAGAAAGCACGGAAGGTTATGCATAGAAGATGGAGTAATCAATGAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTG
TGAAGCAAATGCCAAGTTCATGTGCGATGGAGTTGGGACAGTCTGGGTTAAAGAGGATGATATATGGGATGATTTAGATGGTCAAGCACTGGTGGAAAACATTGGCTCTG
ACAACACCACTTTTATGTATGCGAAGAGAAAGGCATTGGAAACTATTATTGGGTTATTTCAGACACAGCAAAATTCTCTTGGGATCGAGGAATTGAAATGTCCAGATCTG
CTGGCTGAAAGTTACAAGCCATTTACTTGCCGTATTCATCATTGGGTTTTGAAGCATGCTTTTGTTGTTTTGCCAGTTTTCTTACTGCTTGTGGGATGCACTTGGTTACT
ATGGAAACTTTACCGGAGACAACATCTAACAAATAGAGCTGAAAATCTGTACAACCAGGTCTGCGAAATACTTGAGGAAAATGCTTTGATGTCAAAGAGAATAAGTGGTC
AATGTGAATCATGGGTTGTTGCATCCAGGTTACGCGACCATCTTCTTTTGCCACGAGAGAGGAAGGATCCTTTGTTATGGAGGAAGGTAGAGGAATTGGTTCAGGAAGAC
TCACGAATAGATCGCTACCCAAGACTGGTCAAGGGTGAAGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGTTCTTTGAGTTCTTCAAAGGAAAAGAGACTGGCAAGCAA
ATTAAGTTCCAGGGTGGCGATGGAAGTAAATTCTGACCGAATATACCGTAAAGTGGATGCTAGCTTAGCTGGAGAATCAGAACAGGAAGCAGTTTCTGCTGGGGCAACTG
GGGCAACTGGGGCTTGTAGTGTACAGTCAATAGATGAGGAGCAAGCTTTAGACAAGAAAGTTGGATTTGGTGTGAGCCAAAAGTTGGATGGTTTGAGCATTTTGCAATCA
GCAGAAGAGTTTGAGAAGGAAGATGACAAATGCTTCATCACCGGACACTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCAACTCCGAAGAGGCGAAAGAAACTCAAGCCAAATCCGGACTCCGATGCCGGTTCTAAAGGCGATTCTTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTCT
CAAGCAACCGCCTCGCGATTTCTTTCCCTCCGAGAACGATCTCATTAGGCTAATTACTGTACTTTTCATCGCCTGCTTGGTTTTTCTGAGCTGTAACTTCTTCGTATCTA
GACTCGCGAGTCGCCGCCCGGAGCCTTTCTGCGACACTGACGCCGATTCCTTGGACTTGCTTTCTGATGCTTGCAAGCCTTGTCCAAGTCATGGAGAATGCCGTGGAGGT
GAGTTGGAATGTGTTCGTGGTTATAGAAAGCACGGAAGGTTATGCATAGAAGATGGAGTAATCAATGAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTG
TGAAGCAAATGCCAAGTTCATGTGCGATGGAGTTGGGACAGTCTGGGTTAAAGAGGATGATATATGGGATGATTTAGATGGTCAAGCACTGGTGGAAAACATTGGCTCTG
ACAACACCACTTTTATGTATGCGAAGAGAAAGGCATTGGAAACTATTATTGGGTTATTTCAGACACAGCAAAATTCTCTTGGGATCGAGGAATTGAAATGTCCAGATCTG
CTGGCTGAAAGTTACAAGCCATTTACTTGCCGTATTCATCATTGGGTTTTGAAGCATGCTTTTGTTGTTTTGCCAGTTTTCTTACTGCTTGTGGGATGCACTTGGTTACT
ATGGAAACTTTACCGGAGACAACATCTAACAAATAGAGCTGAAAATCTGTACAACCAGGTCTGCGAAATACTTGAGGAAAATGCTTTGATGTCAAAGAGAATAAGTGGTC
AATGTGAATCATGGGTTGTTGCATCCAGGTTACGCGACCATCTTCTTTTGCCACGAGAGAGGAAGGATCCTTTGTTATGGAGGAAGGTAGAGGAATTGGTTCAGGAAGAC
TCACGAATAGATCGCTACCCAAGACTGGTCAAGGGTGAAGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGTTCTTTGAGTTCTTCAAAGGAAAAGAGACTGGCAAGCAA
ATTAAGTTCCAGGGTGGCGATGGAAGTAAATTCTGACCGAATATACCGTAAAGTGGATGCTAGCTTAGCTGGAGAATCAGAACAGGAAGCAGTTTCTGCTGGGGCAACTG
GGGCAACTGGGGCTTGTAGTGTACAGTCAATAGATGAGGAGCAAGCTTTAGACAAGAAAGTTGGATTTGGTGTGAGCCAAAAGTTGGATGGTTTGAGCATTTTGCAATCA
GCAGAAGAGTTTGAGAAGGAAGATGACAAATGCTTCATCACCGGACACTGTTGA
Protein sequenceShow/hide protein sequence
MSSTPKRRKKLKPNPDSDAGSKGDSSASSSTVLLKSLKQPPRDFFPSENDLIRLITVLFIACLVFLSCNFFVSRLASRRPEPFCDTDADSLDLLSDACKPCPSHGECRGG
ELECVRGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFMCDGVGTVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKRKALETIIGLFQTQQNSLGIEELKCPDL
LAESYKPFTCRIHHWVLKHAFVVLPVFLLLVGCTWLLWKLYRRQHLTNRAENLYNQVCEILEENALMSKRISGQCESWVVASRLRDHLLLPRERKDPLLWRKVEELVQED
SRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEKRLASKLSSRVAMEVNSDRIYRKVDASLAGESEQEAVSAGATGATGACSVQSIDEEQALDKKVGFGVSQKLDGLSILQS
AEEFEKEDDKCFITGHC