; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033012 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033012
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMSC domain-containing protein
Genome locationchr11:39933041..39937622
RNA-Seq ExpressionLag0033012
SyntenyLag0033012
Gene Ontology termsGO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR018996 - Man1/Src1, C-terminal
IPR041885 - MAN1, winged-helix domain
IPR044780 - Heh2/Src1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606225.1 hypothetical protein SDJN03_03542, partial [Cucurbita argyrosperma subsp. sororia]1.8e-19688.63Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK P RDFFPSK+DL RLITVLFIA LVFVSCNFFVSR E RRPRPFCD+DADS DLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CPSHG C EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MYAKSKAL+TIG
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
        G FQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNS  +Y K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN

KAG7036172.1 hypothetical protein SDJN02_02973 [Cucurbita argyrosperma subsp. argyrosperma]2.4e-19688.37Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK P RDFFPSK+DL RL+TVLFIA LVFVSCNFFVSR E RRPRPFCD+DADS DLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CPSHG C EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MYAKSKAL+TIG
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
        G FQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNS  +Y K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN

XP_023533379.1 uncharacterized protein LOC111795284 isoform X1 [Cucurbita pepo subsp. pepo]7.4e-19887.66Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL S+K P RDFFPSK+DL RLITVLFIA LVFVSCNFFVSR E RRPRPFCD+DADS DLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQT
        CPSHG C EGKLEC HGYR+HGRLCIEDGVIN+AVKKL  SEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MYAKSKAL+T
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQT

Query:  IGGFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
        IGG FQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Subjt:  IGGFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGNGESCICLI
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNS  +Y K+ NG S ICLI
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGNGESCICLI

XP_023533380.1 uncharacterized protein LOC111795284 isoform X2 [Cucurbita pepo subsp. pepo]2.3e-19988.1Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL S+K P RDFFPSK+DL RLITVLFIA LVFVSCNFFVSR E RRPRPFCD+DADS DLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CPSHG C EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MYAKSKAL+TIG
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
        G FQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGNGESCICLI
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNS  +Y K+ NG S ICLI
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGNGESCICLI

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida]2.4e-19686.82Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPK+RTK K+N NSDV S+GDSS S ST LLKSIKEP RDFFPSK+DLA LITVLFIACL+FVSC+FFVSR  +R+PRPFCDTDADSLDLLSD C P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CP HG CR+GKL+CLHGYRKHGRLCIEDGVINEAV KLSEWLESHLCEANAKFLCDGI IVWVKEDDIWDDLDG+ LVE+IGSDNTT  YAKSKAL+TIG
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
        G FQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAF VLPV LLLVGC WLLWKLYRRQY+TNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN
        ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLA+KS+S  AMGV++  ++ K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN

TrEMBL top hitse value%identityAlignment
A0A5A7T509 MSC domain-containing protein2.9e-19284.6Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPK+RTK K+NPNSDV S  DSS S S+ LLKS+KEP RDFFPSK+DLA LITVL IA LVFVSCNFFVSR  +R P PFCDTDADSLDLLSD C P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CP HG CR+GKLECLHGYRKHGRLCIEDGVINEAV KLSEWLESHLCE+NAKFLCDGI IVWVKE+DIWDDLDG+ LVE+IGSDNTT MYAKSKAL+TIG
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
        G  QTRQNS GIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPV LLLVGC WLLWKLYRRQ LTNRAEDLYNQVCEILEENAL STRNS QCESWVV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSR------MAMGVNSAPLYHKIGNGES
        ASRLRDHLLLPRERK+PLLW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+K+LASKS+S        A+GVN  P+YHKI NGES
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSR------MAMGVNSAPLYHKIGNGES

A0A6J1E026 uncharacterized protein LOC111026156 isoform X11.2e-19386.49Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRR K K NP+SD  SKGDSSAS ST LLKS+K+P RDFFPS+ DL RLITVLFIACLVF+SCNFFVSR  +RRP PFCDTDADSLDLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CPSHG CR G+LEC+ GYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKF+CDG+  VWVKEDDIWDDLDGQALVENIGSDNTTFMYAK KAL+TI 
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
        G FQT+QNSLGI+ELKCPDLLAESYKPFTCRI HWVL+HAFVVLPV LLLVGC WLLWKLYRRQ+LTNRAE+LYNQVCEILEENALMS R SGQCESWVV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKI
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRLASK SSR+AM VNS  +Y K+
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKI

A0A6J1H1Y9 uncharacterized protein LOC111459381 isoform X18.2e-19587.92Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK P RDFFPSK+DL RLITVLFIA LVFVSCNFFVSR E RRPRPFCD+DADS DLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQT
        CPSHG C EGKLEC HGYR+HGRLCIEDGVIN+AVKKL  SEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MYAKSKAL+T
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQT

Query:  IGGFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
        IGG FQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Subjt:  IGGFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRM MGVNS  +Y K+ N
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN

A0A6J1H2A7 uncharacterized protein LOC111459381 isoform X32.6e-19688.37Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK P RDFFPSK+DL RLITVLFIA LVFVSCNFFVSR E RRPRPFCD+DADS DLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CPSHG C EGKLEC HGYR+HGRLCIEDGVIN+AVKKLSEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MYAKSKAL+TIG
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
        G FQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRM MGVNS  +Y K+ N
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN

A0A6J1H3U4 uncharacterized protein LOC111459381 isoform X27.7e-19387.66Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        MSSTPKRRTKFK N NSDV SK DS  S S  LL SIK P RDFFPSK+DL RLITVLFIA LVFVSCNFFVSR E RRPRPFCD+DADS DLLSDAC P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQT
        CPSHG C EGKLEC HGYR+HGRLCIEDGVIN+AVKKL  SEWLESHLCEANAKFLCDGI IVWV+ED IWDDLDG+ALVENI SDNTT MYAKSKAL+T
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKL--SEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQT

Query:  IGGFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
        IGG FQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGC WLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Subjt:  IGGFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQ EGSLSSSKEKRLASKSSSRM MGVNS  +Y K+ N
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).6.4e-9145.05Show/hide
Query:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP
        M S P++R   K    +    K  SS+SP    ++S+ EP +  FPSK +   L+ VL +AC V  +CNF      +   + FCD++ + +D   D C P
Subjt:  MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGP

Query:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG
        CP +G C +GKL+C  GY+    LC+EDG INE+ KKL  + E  +CE+ A   C G   +WV E+D+W +L   + + N+  D + + + K KA++ + 
Subjt:  CPSHGVCREGKLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIG

Query:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMS-TRNSGQCESWV
           + R NS GI ELKCP+ +A+SYKP TCR+  W+L+H  ++     +LVG A L  ++ R+Q  + R E+LY+QVC+ LEENA+ S +  +  CE WV
Subjt:  GFFQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMS-TRNSGQCESWV

Query:  VASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKR
        +AS LRD+LLLPRER+DPLLW KVEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS SK K+
Subjt:  VASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCAACTCCGAAGAGGCGAACGAAATTCAAGCAAAATCCGAACTCCGATGTCGTTTCTAAAGGTGATTCTTCTGCTTCACCTTCTACAGCGTTGCTGAAATCCAT
CAAGGAACCGCTTCGCGATTTCTTTCCCTCCAAGGAAGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTCTTTGTGAGTTGCAACTTCTTCGTATCTA
GATTTGAAAATCGTCGCCCGAGGCCTTTCTGCGACACAGACGCCGATTCCTTGGATTTGCTTTCTGATGCTTGTGGGCCTTGTCCAAGTCATGGAGTATGCCGTGAAGGT
AAGTTGGAATGCCTTCATGGCTATAGAAAGCATGGAAGATTATGTATAGAAGATGGGGTAATTAATGAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTG
TGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGAGATAGTTTGGGTTAAAGAGGATGATATTTGGGATGATCTAGATGGTCAAGCGCTGGTGGAAAATATTGGCTCTG
ACAACACCACTTTTATGTATGCAAAGAGCAAGGCATTGCAAACTATTGGTGGGTTTTTTCAGACACGGCAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCGGATCTC
CTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTGCCAGTTTCTTTACTGCTTGTGGGATGCGCATGGTTACT
ATGGAAACTTTACCGGAGACAATATCTAACAAATAGGGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCTACGAGAAACAGTGGTC
AATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAAGATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAAGAAGAC
TCACGAATAGATCGTTACCCGAGACTGGTTAAGGGTGATGGAAAGGAAGTATGGGAGTGGCAAGTGGAAGGCTCTTTGAGCTCTTCAAAGGAAAAGAGACTGGCCAGTAA
ATCCAGTTCCAGGATGGCAATGGGAGTAAATTCTGCCCCATTATACCATAAAATTGGGAATGGTGAGTCGTGTATATGCTTGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCAACTCCGAAGAGGCGAACGAAATTCAAGCAAAATCCGAACTCCGATGTCGTTTCTAAAGGTGATTCTTCTGCTTCACCTTCTACAGCGTTGCTGAAATCCAT
CAAGGAACCGCTTCGCGATTTCTTTCCCTCCAAGGAAGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTCTTTGTGAGTTGCAACTTCTTCGTATCTA
GATTTGAAAATCGTCGCCCGAGGCCTTTCTGCGACACAGACGCCGATTCCTTGGATTTGCTTTCTGATGCTTGTGGGCCTTGTCCAAGTCATGGAGTATGCCGTGAAGGT
AAGTTGGAATGCCTTCATGGCTATAGAAAGCATGGAAGATTATGTATAGAAGATGGGGTAATTAATGAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTG
TGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGAGATAGTTTGGGTTAAAGAGGATGATATTTGGGATGATCTAGATGGTCAAGCGCTGGTGGAAAATATTGGCTCTG
ACAACACCACTTTTATGTATGCAAAGAGCAAGGCATTGCAAACTATTGGTGGGTTTTTTCAGACACGGCAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCGGATCTC
CTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTGCCAGTTTCTTTACTGCTTGTGGGATGCGCATGGTTACT
ATGGAAACTTTACCGGAGACAATATCTAACAAATAGGGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCTACGAGAAACAGTGGTC
AATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGGGAGAGGAAAGATCCTTTGTTATGGAGGAAGGTAGAGGAGTTGGTTCAAGAAGAC
TCACGAATAGATCGTTACCCGAGACTGGTTAAGGGTGATGGAAAGGAAGTATGGGAGTGGCAAGTGGAAGGCTCTTTGAGCTCTTCAAAGGAAAAGAGACTGGCCAGTAA
ATCCAGTTCCAGGATGGCAATGGGAGTAAATTCTGCCCCATTATACCATAAAATTGGGAATGGTGAGTCGTGTATATGCTTGATCTGA
Protein sequenceShow/hide protein sequence
MSSTPKRRTKFKQNPNSDVVSKGDSSASPSTALLKSIKEPLRDFFPSKEDLARLITVLFIACLVFVSCNFFVSRFENRRPRPFCDTDADSLDLLSDACGPCPSHGVCREG
KLECLHGYRKHGRLCIEDGVINEAVKKLSEWLESHLCEANAKFLCDGIEIVWVKEDDIWDDLDGQALVENIGSDNTTFMYAKSKALQTIGGFFQTRQNSLGIKELKCPDL
LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCAWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVVASRLRDHLLLPRERKDPLLWRKVEELVQED
SRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMAMGVNSAPLYHKIGNGESCICLI