CuGenDBv2

Sgr014854 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr014854
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: MSC domain-containing protein
Genome location: tig00001291:755404..760322
RNA-Seq Expression: Sgr014854
Synteny: Sgr014854
Gene Ontology terms:
    GO:0005637 - nuclear inner membrane (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0003682 - chromatin binding (molecular function)
InterPro domains:
    IPR018996 - Man1/Src1, C-terminal
    IPR041885 - MAN1, winged-helix domain
    IPR044780 - Heh2/Src1-like
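
The Genome location field above follows a contig:start..end pattern. The sketch below shows one way to split such a string and compute the span it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00001291:755404..760322")
print(contig, start, end, span_length(start, end))
# prints: tig00001291 755404 760322 4919
```

Under that assumption the gene spans 4919 bp on the contig.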


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606225.1 hypothetical protein SDJN03_03542, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.3e-155; % identity: 78.73)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRRTK K N  SD  SK  S  SSS VLL S+K PPRDFFPSKDDL RLITVLFIA LVFV CNFFVSRL + RPRPFCDSDA+SFDLLSDACEP
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CPSHGEC EGKLEC  GYR+HGRLC+EDGVIN+AVKKL                        V+ D IWDD + +ALVE+I SDNTT MYAK KA ETIG
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVL+HAFVVLPV LLLVGCTWLLWKL RRQYLTNRAE LYNQVCEILEENALM  R SGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

XP_022159868.1 uncharacterized protein LOC111026156 isoform X1 [Momordica charantia] (e value: 3.1e-163; % identity: 81.77)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRR KLK N  SD GSKG SSASSSTVLLKSLK PPRDFFPS++DL RLITVLFIACLVF+ CNFFVSRLAS RP PFCD+DA+S DLLSDAC+P
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CPSHGECR G+LEC+RGYRKHGRLC+EDGVINEAVKKL                        VK DDIWDD + QALVE+IGSDNTTFMYAK KA ETI 
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQT+QNSLGI+ELKCPD LAESYKPFTCRI HWVLKHAFVVLPV LLLVGCTWLLWKL+RRQ+LTNRAE LYNQVCEILEENALM KRISGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

XP_022159870.1 uncharacterized protein LOC111026156 isoform X2 [Momordica charantia] (e value: 2.9e-161; % identity: 81.49)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRR KLK N  SD GSKG SSASSSTVLLKSLK PPRDFFPS++DL RLITVLFIACLVF+ CNFFVSRLAS RP PFCD+DA+S DLLSDAC+P
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CPSHGECR G+LEC+RGYRKHGRLC+EDGVINEAVKKL                        VK DDIWDD + QALVE+IGSDNTTFMYAK KA ETI 
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQT+QNSLGI+ELKCPD LAESYKPFTCRI HWVLKHAFVVLPV LLLVGCTWLLWKL+RRQ+LTNRAE LYNQVCEILEENALM KRISGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKGEGKEVWEWQ EGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

XP_022958030.1 uncharacterized protein LOC111459381 isoform X3 [Cucurbita moschata] (e value: 5.3e-155; % identity: 78.73)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRRTK K N  SD  SK  S  SSS VLL S+K PPRDFFPSKDDL RLITVLFIA LVFV CNFFVSRL + RPRPFCDSDA+SFDLLSDACEP
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CPSHGEC EGKLEC  GYR+HGRLC+EDGVIN+AVKKL                        V+ D IWDD + +ALVE+I SDNTT MYAK KA ETIG
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVL+HAFVVLPV LLLVGCTWLLWKL RRQYLTNRAE LYNQVCEILEENALM  R SGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida] (e value: 2.1e-159; % identity: 78.73)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPK+RTK+K N  SD GS+G SS SSST+LLKS+K+PPRDFFPSKDDLA LITVLFIACL+FV C+FFVSRLAS +PRPFCD+DA+S DLLSD CEP
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CP HGECR+GKL+CL GYRKHGRLC+EDGVINEAV KL                        VK DDIWDD + + LVESIGSDNTT  YAK KA ETIG
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQTRQNSLGIKELKCPD LAESYKPFTCRIRHWVL+HAF VLPV LLLVGCTWLLWKL+RRQY+TNRAE LYNQVCEILEENALM  R SGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERK+PLLW KVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1E002 uncharacterized protein LOC111026156 isoform X2 (e value: 1.4e-161; % identity: 81.49)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRR KLK N  SD GSKG SSASSSTVLLKSLK PPRDFFPS++DL RLITVLFIACLVF+ CNFFVSRLAS RP PFCD+DA+S DLLSDAC+P
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CPSHGECR G+LEC+RGYRKHGRLC+EDGVINEAVKKL                        VK DDIWDD + QALVE+IGSDNTTFMYAK KA ETI 
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQT+QNSLGI+ELKCPD LAESYKPFTCRI HWVLKHAFVVLPV LLLVGCTWLLWKL+RRQ+LTNRAE LYNQVCEILEENALM KRISGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKGEGKEVWEWQ EGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

A0A6J1E026 uncharacterized protein LOC111026156 isoform X1 (e value: 1.5e-163; % identity: 81.77)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRR KLK N  SD GSKG SSASSSTVLLKSLK PPRDFFPS++DL RLITVLFIACLVF+ CNFFVSRLAS RP PFCD+DA+S DLLSDAC+P
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CPSHGECR G+LEC+RGYRKHGRLC+EDGVINEAVKKL                        VK DDIWDD + QALVE+IGSDNTTFMYAK KA ETI 
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQT+QNSLGI+ELKCPD LAESYKPFTCRI HWVLKHAFVVLPV LLLVGCTWLLWKL+RRQ+LTNRAE LYNQVCEILEENALM KRISGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

A0A6J1H1Y9 uncharacterized protein LOC111459381 isoform X1 (e value: 4.4e-155; % identity: 78.3)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRRTK K N  SD  SK  S  SSS VLL S+K PPRDFFPSKDDL RLITVLFIA LVFV CNFFVSRL + RPRPFCDSDA+SFDLLSDACEP
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL--------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASET
        CPSHGEC EGKLEC  GYR+HGRLC+EDGVIN+AVKKL                          V+ D IWDD + +ALVE+I SDNTT MYAK KA ET
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL--------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASET

Query:  IGRLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESW
        IG LFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVL+HAFVVLPV LLLVGCTWLLWKL RRQYLTNRAE LYNQVCEILEENALM  R SGQCESW
Subjt:  IGRLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        VVASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKE+
Subjt:  VVASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

A0A6J1H2A7 uncharacterized protein LOC111459381 isoform X3 (e value: 2.6e-155; % identity: 78.73)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRRTK K N  SD  SK  S  SSS VLL S+K PPRDFFPSKDDL RLITVLFIA LVFV CNFFVSRL + RPRPFCDSDA+SFDLLSDACEP
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CPSHGEC EGKLEC  GYR+HGRLC+EDGVIN+AVKKL                        V+ D IWDD + +ALVE+I SDNTT MYAK KA ETIG
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV
         LFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVL+HAFVVLPV LLLVGCTWLLWKL RRQYLTNRAE LYNQVCEILEENALM  R SGQCESWVV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVV

Query:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        ASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKE+
Subjt:  ASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

A0A6J1H3U4 uncharacterized protein LOC111459381 isoform X2 (e value: 4.1e-153; % identity: 78.02)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        MSSTPKRRTK K N  SD  SK  S  SSS VLL S+K PPRDFFPSKDDL RLITVLFIA LVFV CNFFVSRL + RPRPFCDSDA+SFDLLSDACEP
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL--------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASET
        CPSHGEC EGKLEC  GYR+HGRLC+EDGVIN+AVKKL                          V+ D IWDD + +ALVE+I SDNTT MYAK KA ET
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL--------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASET

Query:  IGRLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESW
        IG LFQ RQN+LGIKELKCPD LAESYKPFTCRIRHWVL+HAFVVLPV LLLVGCTWLLWKL RRQYLTNRAE LYNQVCEILEENALM  R SGQCESW
Subjt:  IGRLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESW

Query:  VVASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE
        VVASRLRDHLLLPRERKDPLLW KVEELVQEDSRIDRYPRLVKG+GKEVWEWQ EGSLSSSKE+
Subjt:  VVASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). (e value: 3.6e-85; % identity: 44.72)
Query:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP
        M S P++R K ++      G    SS+SSS +  +S+ +PP+  FPSK +   L+ VL +AC V   CNF    L+S+  + FCDS+ N  D   D CEP
Subjt:  MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEP

Query:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG
        CP +GEC +GKL+C  GY+    LC+EDG INE+ KKL                        V  +D+W +  S + + ++  D + + + KGKA E + 
Subjt:  CPSHGECREGKLECLRGYRKHGRLCLEDGVINEAVKKL------------------------VKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIG

Query:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRI-SGQCESWV
         L + R NS GI ELKCP+ +A+SYKP TCR+  W+L+H  ++   C +LVG   L  ++ R+Q  + R E LY+QVC+ LEENA+      +  CE WV
Subjt:  RLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVLPVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRI-SGQCESWV

Query:  VASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEETGKQIQ
        +AS LRD+LLLPRER+DPLLW+KVEEL++EDSRIDRY +L+KGE K VWEWQVEGSLS SK +  ++ Q
Subjt:  VASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEGSLSSSKEETGKQIQ


Sequences
CDS sequence
ATGTCTTCCACTCCGAAGAGGCGAACGAAACTCAAGTCTAATCGGGTCTCCGATGGCGGTTCTAAAGGTGCTTCTTCTGCTTCATCTTCTACAGTGCTGCTAAAGTCTCT
CAAGGATCCGCCTCGTGATTTCTTCCCCTCCAAGGATGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTTTTCGTGGGTTGCAACTTCTTCGTATCTA
GACTCGCGAGTCACCGCCCGAGGCCTTTCTGCGACAGCGACGCCAATTCCTTTGATTTGCTTTCTGATGCTTGTGAGCCTTGTCCAAGTCATGGAGAATGCCGTGAAGGT
AAGTTGGAATGTCTTCGTGGTTATAGAAAGCATGGAAGGTTATGCCTAGAAGATGGAGTAATCAATGAAGCAGTTAAGAAACTTGTTAAAGGGGATGATATATGGGATGA
TCCAGAAAGTCAAGCGCTGGTGGAAAGTATTGGCTCCGACAACACCACTTTTATGTATGCAAAGGGAAAGGCATCGGAAACCATTGGTCGGTTATTTCAGACGCGGCAAA
ATTCTCTTGGGATCAAGGAATTGAAATGTCCAGATTTTCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCATTGGGTTTTGAAGCATGCTTTTGTTGTTTTG
CCAGTTTGCTTACTGCTTGTGGGATGCACTTGGTTACTATGGAAACTTTTCCGGAGACAATATCTAACAAATAGAGCTGAAGCTCTGTACAATCAGGTTTGCGAAATACT
TGAGGAAAATGCTTTGATGTTAAAGAGAATAAGTGGTCAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGAGAGAGGAAGGATCCTT
TGTTATGGAGTAAGGTAGAGGAGTTGGTTCAGGAAGACTCAAGAATAGATCGTTACCCACGACTGGTTAAGGGTGAAGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGC
TCTTTGAGCTCTTCAAAGGAAGAGACTGGCAAGCAAATTCAGTTCCAGGGTGGCAATGGAAGTAAAATCTGA
mRNA sequence
ATGTCTTCCACTCCGAAGAGGCGAACGAAACTCAAGTCTAATCGGGTCTCCGATGGCGGTTCTAAAGGTGCTTCTTCTGCTTCATCTTCTACAGTGCTGCTAAAGTCTCT
CAAGGATCCGCCTCGTGATTTCTTCCCCTCCAAGGATGATCTTGCTAGGCTAATCACTGTACTTTTCATCGCCTGCTTGGTTTTCGTGGGTTGCAACTTCTTCGTATCTA
GACTCGCGAGTCACCGCCCGAGGCCTTTCTGCGACAGCGACGCCAATTCCTTTGATTTGCTTTCTGATGCTTGTGAGCCTTGTCCAAGTCATGGAGAATGCCGTGAAGGT
AAGTTGGAATGTCTTCGTGGTTATAGAAAGCATGGAAGGTTATGCCTAGAAGATGGAGTAATCAATGAAGCAGTTAAGAAACTTGTTAAAGGGGATGATATATGGGATGA
TCCAGAAAGTCAAGCGCTGGTGGAAAGTATTGGCTCCGACAACACCACTTTTATGTATGCAAAGGGAAAGGCATCGGAAACCATTGGTCGGTTATTTCAGACGCGGCAAA
ATTCTCTTGGGATCAAGGAATTGAAATGTCCAGATTTTCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCATTGGGTTTTGAAGCATGCTTTTGTTGTTTTG
CCAGTTTGCTTACTGCTTGTGGGATGCACTTGGTTACTATGGAAACTTTTCCGGAGACAATATCTAACAAATAGAGCTGAAGCTCTGTACAATCAGGTTTGCGAAATACT
TGAGGAAAATGCTTTGATGTTAAAGAGAATAAGTGGTCAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGAGAGAGGAAGGATCCTT
TGTTATGGAGTAAGGTAGAGGAGTTGGTTCAGGAAGACTCAAGAATAGATCGTTACCCACGACTGGTTAAGGGTGAAGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGC
TCTTTGAGCTCTTCAAAGGAAGAGACTGGCAAGCAAATTCAGTTCCAGGGTGGCAATGGAAGTAAAATCTGA
Protein sequence
MSSTPKRRTKLKSNRVSDGGSKGASSASSSTVLLKSLKDPPRDFFPSKDDLARLITVLFIACLVFVGCNFFVSRLASHRPRPFCDSDANSFDLLSDACEPCPSHGECREG
KLECLRGYRKHGRLCLEDGVINEAVKKLVKGDDIWDDPESQALVESIGSDNTTFMYAKGKASETIGRLFQTRQNSLGIKELKCPDFLAESYKPFTCRIRHWVLKHAFVVL
PVCLLLVGCTWLLWKLFRRQYLTNRAEALYNQVCEILEENALMLKRISGQCESWVVASRLRDHLLLPRERKDPLLWSKVEELVQEDSRIDRYPRLVKGEGKEVWEWQVEG
SLSSSKEETGKQIQFQGGNGSKI
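
The protein sequence above should be the conceptual translation of the CDS sequence. The sketch below shows one way to check short stretches of it by hand. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear gene but is not stated on this page, and the function name translate is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 15 nt of the CDS sequence listed above.
print(translate("ATGTCTTCCACTCCGAAGAGGCGA"))  # prints: MSSTPKRR
print(translate("GGAAGTAAAATCTGA"))           # prints: GSKI
```

Both fragments agree with the start (MSSTPKRR) and end (GSKI) of the listed protein sequence. Passing the full CDS to the same function would allow a complete comparison.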