CuGenDBv2

Tan0004245 (gene) of Snake gourd v1 genome

Gene ID: Tan0004245
Organism: Trichosanthes anguina (Snake gourd v1)
Description: MSC domain-containing protein
Genome location: LG07:71071257..71077263
RNA-Seq Expression: Tan0004245
Synteny: Tan0004245
Gene Ontology terms:
GO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domains:
IPR018996 - Man1/Src1, C-terminal
IPR041885 - MAN1, winged-helix domain
IPR044780 - Heh2/Src1-like
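
The genome location above uses a "scaffold:start..end" notation. As a minimal sketch, the helper below splits such a string into its parts and computes the span length. The function name parse_location is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(loc: str):
    """Split 'LG07:71071257..71077263' into (scaffold, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    scaffold = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return scaffold, start, end, end - start + 1

scaffold, start, end, length = parse_location("LG07:71071257..71077263")
print(scaffold, start, end, length)
```

Under that assumption the gene spans 6007 bp on LG07, which covers more than the mRNA sequence listed further down.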


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606225.1 hypothetical protein SDJN03_03542, partial [Cucurbita argyrosperma subsp. sororia]; e value 7.8e-200; % identity 89.59
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHN NSDVASK DS  SSSAVLLNSIK PPRDFFPSKDDL RLITVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG
        CPSHGEC EGKLEC HGYR+ GRLCIEDGVIN+ VKKLSEWLESHLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+ENI SDN+T+MYAKSKALETIG
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG

Query:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV
        GLFQ RQN+LGIKELKCP+ LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL SKSS RMAMGV+SD IY KM NE K VVS
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

KAG7036172.1 hypothetical protein SDJN02_02973 [Cucurbita argyrosperma subsp. argyrosperma]; e value 5.6e-198; % identity 88.3
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHN NSDVASK DS  SSSAVLLNSIK PPRDFFPSKDDL RL+TVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG
        CPSHGEC EGKLEC HGYR+ GRLCIEDGVIN+ VKKLSEWLESHLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+ENI SDN+T+MYAKSKALETIG
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG

Query:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV
        GLFQ RQN+LGIKELKCP+ LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVV
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL SKSS RMAMGV+SD IY KM N+ +  +
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVV

XP_022958028.1 uncharacterized protein LOC111459381 isoform X1 [Cucurbita moschata]; e value 7.3e-198; % identity 88.89
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHN NSDVASK DS  SSSAVLLNSIK PPRDFFPSKDDL RLITVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKL--SEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALET
        CPSHGEC EGKLEC HGYR+ GRLCIEDGVIN+ VKKL  SEWLESHLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+ENI SDN+T+MYAKSKALET
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKL--SEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALET

Query:  IGGLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESW
        IGGLFQ RQN+LGIKELKCP+ LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESW
Subjt:  IGGLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL SKSS RM MGV+SD IY KM NE K VVS
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

XP_022958030.1 uncharacterized protein LOC111459381 isoform X3 [Cucurbita moschata]; e value 2.3e-199; % identity 89.34
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHN NSDVASK DS  SSSAVLLNSIK PPRDFFPSKDDL RLITVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG
        CPSHGEC EGKLEC HGYR+ GRLCIEDGVIN+ VKKLSEWLESHLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+ENI SDN+T+MYAKSKALETIG
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG

Query:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV
        GLFQ RQN+LGIKELKCP+ LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL SKSS RM MGV+SD IY KM NE K VVS
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida]; e value 3.9e-199; % identity 86.77
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST K+RTKVK N NSDV S+GDSS SSS +LL SIKEPPRDFFPSKDDLA LITVLFIACL+FVSC+FFVSRLA+R+PRPFCDT ADSLDLLSD CEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG
        CP HGECR+GKL+C+HGYRK GRLCIEDGVINE V KLSEWLESHLCEANAKFLCDGIGIVWVKED++WDDLDG+ L+E+IGSDN+TL YAKSKALETIG
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG

Query:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV
        GLFQTRQNSLGIKELKCP+LLAESYKPFTCRIRHWVLQHAF VLPV LLLVGCTWLLWKLYRRQY+TNRAEDLYNQVCEILEENALMSTRNS QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVV
        ASRLRDHLLLPRERK+PLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL +KS+   AMGV +D+++ KM NEPKP+V
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1E026 uncharacterized protein LOC111026156 isoform X1; e value 1.1e-194; % identity 85.79
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRR K+K NP+SD  SKGDSSASSS VLL S+K+PPRDFFPS++DL RLITVLFIACLVF+SCNFFVSRLA+RRP PFCDT ADSLDLLSDAC+P
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG
        CPSHGECR G+LEC+ GYRK GRLCIEDGVINE VKKLSEWLESHLCEANAKF+CDG+G VWVKED++WDDLDGQAL+ENIGSDN+T MYAK KALETI 
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG

Query:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV
        GLFQT+QNSLGI+ELKCP+LLAESYKPFTCRI HWVL+HAFVVLPV LLLVGCTWLLWKLYRRQ+LTNRAE+LYNQVCEILEENALMS R S QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEKRL SK S R+AM V+SDRIYRK+ +EPKPVVS
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

A0A6J1H1Y9 uncharacterized protein LOC111459381 isoform X1; e value 3.5e-198; % identity 88.89
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHN NSDVASK DS  SSSAVLLNSIK PPRDFFPSKDDL RLITVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKL--SEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALET
        CPSHGEC EGKLEC HGYR+ GRLCIEDGVIN+ VKKL  SEWLESHLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+ENI SDN+T+MYAKSKALET
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKL--SEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALET

Query:  IGGLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESW
        IGGLFQ RQN+LGIKELKCP+ LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESW
Subjt:  IGGLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL SKSS RM MGV+SD IY KM NE K VVS
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

A0A6J1H2A7 uncharacterized protein LOC111459381 isoform X3; e value 1.1e-199; % identity 89.34
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHN NSDVASK DS  SSSAVLLNSIK PPRDFFPSKDDL RLITVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG
        CPSHGEC EGKLEC HGYR+ GRLCIEDGVIN+ VKKLSEWLESHLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+ENI SDN+T+MYAKSKALETIG
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG

Query:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV
        GLFQ RQN+LGIKELKCP+ LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL SKSS RM MGV+SD IY KM NE K VVS
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

A0A6J1H3U4 uncharacterized protein LOC111459381 isoform X2; e value 3.3e-196; % identity 88.64
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHN NSDVASK DS  SSSAVLLNSIK PPRDFFPSKDDL RLITVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKL--SEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALET
        CPSHGEC EGKLEC HGYR+ GRLCIEDGVIN+ VKKL  SEWLESHLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+ENI SDN+T+MYAKSKALET
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKL--SEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALET

Query:  IGGLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESW
        IGGLFQ RQN+LGIKELKCP+ LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESW
Subjt:  IGGLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESW

Query:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQ EGSLSSSKEKRL SKSS RM MGV+SD IY KM NE K VVS
Subjt:  VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

A0A6J1K9D6 uncharacterized protein LOC111491297 isoform X3; e value 8.2e-195; % identity 87.56
Query:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP
        MSST KRRTK KHNPNSDV  K DS  SSSAVLLNSIK PPRDFFPSKDDL RLITVLFIA LVFVSCNFFVSRL TRRPRPFCD+ ADS DLLSDACEP
Subjt:  MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEP

Query:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG
        CPSHGEC EG LEC+HGYR+ GRLCIEDGVIN+ VKKLSEWLE HLCEANAKFLCDGIGIVWV+ED +WDDLDG+AL+EN  SDN+T+MYAKSKALETIG
Subjt:  CPSHGECREGKLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIG

Query:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV
        GLFQ R+N+LGIKELKCP+ LAESYKP TCRIRHWVLQHAF+VLPVSLLLVGCT LLWKL RRQYLTNRAEDLYNQVCEILEENALMSTRNS QCESWVV
Subjt:  GLFQTRQNSLGIKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS
        ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK KRL SKSS RMAMGV+SD IY KM NEPK VVS
Subjt:  ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL-SKSSPRMAMGVDSDRIYRKMGNEPKPVVS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink); e value 8.6e-96; % identity 46.05
Query:  KHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEPCPSHGECREGK
        +  P S+  +     +SSS+  + S+ EPP+  FPSK +   L+ VL +AC V  +CNF    L++   + FCD+  + +D   D CEPCP +GEC +GK
Subjt:  KHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEPCPSHGECREGK

Query:  LECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIGGLFQTRQNSLG
        L+C  GY+ +  LC+EDG INE+ KKL  + E  +CE+ A   C G G +WV E++VW +L   + L N+  D S   + K KA+E +  L + R NS G
Subjt:  LECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIGGLFQTRQNSLG

Query:  IKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMS-TRNSEQCESWVVASRLRDHLLL
        I ELKCP  +A+SYKP TCR+  W+L+H  ++     +LVG   L  ++ R+Q  + R E+LY+QVC+ LEENA+ S +  +  CE WV+AS LRD+LLL
Subjt:  IKELKCPNLLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMS-TRNSEQCESWVVASRLRDHLLL

Query:  PRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLSKSSPRMAMGVDS
        PRER+DPLLW KVEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS SK K+  ++  ++   +DS
Subjt:  PRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLSKSSPRMAMGVDS


Sequences
CDS sequence
ATGTCTTCAACTCAGAAGAGGCGAACGAAAGTGAAGCATAATCCGAACTCCGATGTCGCTTCTAAAGGCGATTCTTCTGCTTCATCTTCTGCAGTGTTGCTAAATTCTAT
CAAGGAACCGCCTCGCGATTTCTTTCCCTCGAAGGATGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGAGTTGCAACTTCTTCGTATCTA
GACTTGCAACTCGCCGCCCGAGGCCTTTCTGCGATACCGGCGCCGATTCTTTGGATTTGCTTTCTGATGCTTGTGAGCCTTGTCCAAGTCATGGAGAATGCCGTGAAGGT
AAGTTGGAATGCATTCATGGTTATAGAAAGCGTGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAACAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTG
TGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGATAATGTATGGGATGATCTAGATGGTCAAGCGCTACTGGAAAATATTGGTTCTG
ACAACAGCACTTTGATGTATGCGAAGAGCAAGGCTTTGGAAACTATTGGTGGGTTATTTCAGACGCGGCAAAATTCTCTTGGAATCAAGGAATTGAAATGCCCAAATCTC
CTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCCTTTGTTGTTTTGCCAGTTTCTTTACTGCTTGTGGGATGCACATGGTTACT
ATGGAAACTATATCGGAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCAACGAGGAACAGTGAAC
AATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGAGAGAGAAAGGATCCTTTATTATGGAGGAAGGTAGAGGAGTTGGTTCAGGAAGAT
TCACGAATAGATCGTTACCCGAGACTGGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTTCAAAGGAAAAGAGACTCAGCAAATC
CAGTCCCAGAATGGCAATGGGAGTAGATTCTGACCGAATATACCGTAAAATGGGGAACGAGCCGAAGCCAGTAGTTTCGTGA
mRNA sequence
CCGCGAAGAACGACGTTGCGGATCGATGTCTTCAACTCAGAAGAGGCGAACGAAAGTGAAGCATAATCCGAACTCCGATGTCGCTTCTAAAGGCGATTCTTCTGCTTCAT
CTTCTGCAGTGTTGCTAAATTCTATCAAGGAACCGCCTCGCGATTTCTTTCCCTCGAAGGATGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTTTTT
GTGAGTTGCAACTTCTTCGTATCTAGACTTGCAACTCGCCGCCCGAGGCCTTTCTGCGATACCGGCGCCGATTCTTTGGATTTGCTTTCTGATGCTTGTGAGCCTTGTCC
AAGTCATGGAGAATGCCGTGAAGGTAAGTTGGAATGCATTCATGGTTATAGAAAGCGTGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAACAGTTAAGAAACTTT
CAGAATGGCTAGAATCTCACCTCTGTGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGATAATGTATGGGATGATCTAGATGGTCAA
GCGCTACTGGAAAATATTGGTTCTGACAACAGCACTTTGATGTATGCGAAGAGCAAGGCTTTGGAAACTATTGGTGGGTTATTTCAGACGCGGCAAAATTCTCTTGGAAT
CAAGGAATTGAAATGCCCAAATCTCCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCCTTTGTTGTTTTGCCAGTTTCTTTAC
TGCTTGTGGGATGCACATGGTTACTATGGAAACTATATCGGAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTTGAGGAAAATGCT
TTGATGTCAACGAGGAACAGTGAACAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGAGAGAGAAAGGATCCTTTATTATGGAGGAA
GGTAGAGGAGTTGGTTCAGGAAGATTCACGAATAGATCGTTACCCGAGACTGGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTT
CAAAGGAAAAGAGACTCAGCAAATCCAGTCCCAGAATGGCAATGGGAGTAGATTCTGACCGAATATACCGTAAAATGGGGAACGAGCCGAAGCCAGTAGTTTCGTGACAC
GTGCTCGTGCGTGAGAATACTGTAGCTTAACTCAGCTGGTTAACGGGAAAGAATCAAAACAACAGGTACAAGATGTTTTTTATTTTTTTTTGAAAAGAATAGGTACAAGA
TTTAAAGTCCATAATCATTATGTTTCATCTGTTGAGTTAAGTACTTGTTAAGGGGTATGAGGTATTGTAATATTATTCCCTTTTACTTTTCT
Protein sequence
MSSTQKRRTKVKHNPNSDVASKGDSSASSSAVLLNSIKEPPRDFFPSKDDLARLITVLFIACLVFVSCNFFVSRLATRRPRPFCDTGADSLDLLSDACEPCPSHGECREG
KLECIHGYRKRGRLCIEDGVINETVKKLSEWLESHLCEANAKFLCDGIGIVWVKEDNVWDDLDGQALLENIGSDNSTLMYAKSKALETIGGLFQTRQNSLGIKELKCPNL
LAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSEQCESWVVASRLRDHLLLPRERKDPLLWRKVEELVQED
SRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLSKSSPRMAMGVDSDRIYRKMGNEPKPVVS