; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G000520 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G000520
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionMSC domain-containing protein
Genome locationchr05:1049753..1056432
RNA-Seq ExpressionLsi05G000520
SyntenyLsi05G000520
Gene Ontology termsGO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR044780 - Heh2/Src1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038534.1 MSC domain-containing protein [Cucumis melo var. makuwa]3.2e-16878.03Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGGS
                              VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIENG S
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGGS

XP_004148518.1 uncharacterized protein LOC101208017 isoform X1 [Cucumis sativus]5.4e-16877.83Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD
        MSSTPKKRTK KRNPNSDVGSG     DSS SSS++LLKSIKEPPRDFFPSKDDLAAL TVL IAC VFV+CNFFVSRLSSRHP PFCDTDADS DF+SD
Subjt:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD

Query:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL
        VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKAL
Subjt:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL

Query:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ--------------------
        ETIG L QTRQNSLGIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQYLTNRAEDLYNQ                    
Subjt:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ--------------------

Query:  --------------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
                                  VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS +KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  --------------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

XP_008465930.1 PREDICTED: uncharacterized protein LOC103503505 isoform X3 [Cucumis melo]2.1e-16778.12Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
                              VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

XP_023533380.1 uncharacterized protein LOC111795284 isoform X2 [Cucurbita pepo subsp. pepo]1.6e-15673.59Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPK+RTKFK N NSDV S  DS  SSS VLL S+K PPRDFFPSKDDL  L TVLFIA LVFV+CNFFVSRL +R PRPFCD+DADS D LSD CEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CP HGEC +GKLEC HGYR+HGRLCIEDGVIN+AV KL EWLES LCEANAKFLCDGIGIVWV+ED IWDDLDGK LV++I SDNTT+MYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         LFQ RQN+LGIKELKCPD LAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL +RQYLTNRAEDLYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIENGGS
                              VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEK+LASKSSSR AMGV++D +Y K+ENGGS
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIENGGS

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida]5.2e-17179.33Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRN NSDVGS GDSS SSST+LLKSIKEPPRDFFPSKDDLAAL TVLFIACL+FV+C+FFVSRL+SR PRPFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKL+CLHGYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKED+IWDDLDGKELV+SIGSDNTTL YAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         LFQTRQNSLGIKELKCPDLLAESYKPF CRI HWVLQHAF VLPV LLLVGCTWLLWKL++RQY+TNRAEDLYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIEN
                              VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEK+LA+KS+S KAMGVSTD+M+ K+EN
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIEN

TrEMBL top hitse value%identityAlignment
A0A0A0LI89 MSC domain-containing protein2.6e-16877.83Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD
        MSSTPKKRTK KRNPNSDVGSG     DSS SSS++LLKSIKEPPRDFFPSKDDLAAL TVL IAC VFV+CNFFVSRLSSRHP PFCDTDADS DF+SD
Subjt:  MSSTPKKRTKFKRNPNSDVGSGG----DSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSD

Query:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL
        VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCEANAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKAL
Subjt:  VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKAL

Query:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ--------------------
        ETIG L QTRQNSLGIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQYLTNRAEDLYNQ                    
Subjt:  ETIGRLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ--------------------

Query:  --------------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
                                  VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS +KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  --------------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

A0A1S3CQ15 uncharacterized protein LOC103503505 isoform X31.0e-16778.12Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN
                              VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIEN
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIEN

A0A1S3CRG1 uncharacterized protein LOC103503505 isoform X17.4e-15579.04Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVE
                              VEELVQEDSRIDRYPRLVKGDGKEVWEWQ E
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVE

A0A5A7T509 MSC domain-containing protein1.5e-16878.03Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPKKRTK KRNPNSDVGSG DSS SSS++LLKS+KEPPRDFFPSKDDLAAL TVL IA LVFV+CNFFVSRLSSRHP PFCDTDADSLD LSDVCEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKL EWLES LCE+NAKFLCDGIGIVWVKE++IWDDLDGKELV+SIGSDNTTLMYAKSKALETIG
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         L QTRQNS GIKELKCPDLLAESYKPF CRI HWVLQHAFVVLPV LLLVGCTWLLWKL++RQ LTNRAEDLYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGGS
                              VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+KKLASKS+S       KA+GV+ D MYHKIENG S
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSR------KAMGVSTDRMYHKIENGGS

A0A6J1E026 uncharacterized protein LOC111026156 isoform X18.8e-15672.02Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        MSSTPK+R K K NP+SD GS GDSSASSSTVLLKS+K+PPRDFFPS++DL  L TVLFIACLVF++CNFFVSRL+SR P PFCDTDADSLD LSD C+P
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CP HGECR G+LEC+ GYRKHGRLCIEDGVINEAV KL EWLES LCEANAKF+CDG+G VWVKED+IWDDLDG+ LV++IGSDNTT MYAK KALETI 
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         LFQT+QNSLGI+ELKCPDLLAESYKPF CRIHHWVL+HAFVVLPV LLLVGCTWLLWKL++RQ+LTNRAE+LYNQ                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIE
                              VEELVQEDSRIDRYPRLVKG+GKEVWEWQVEGSLSSSKEK+LASK SSR AM V++DR+Y K++
Subjt:  ----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGVSTDRMYHKIE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).2.9e-7138.16Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP
        M S P+KR      P S+  +G    +SSS+  ++S+ EPP+  FPSK +   L  VL +AC V  TCNF    LSS   + FCD++ + +D   D+CEP
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEP

Query:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG
        CP +GEC  GKL+C  GY+    LC+EDG INE+  KLV + E ++CE+ A   C G G +WV E+++W +L     + ++  D +   + K KA+E + 
Subjt:  CPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIG

Query:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------
         L + R NS GI ELKCP+ +A+SYKP  CR+H W+L+H  ++     +LVG   L  ++ ++Q  + R E+LY+Q                        
Subjt:  RLFQTRQNSLGIKELKCPDLLAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQ------------------------

Query:  -----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK-EKKLASKSSSRKAMGVST
                               VEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS SK +K+  ++   RK++  ST
Subjt:  -----------------------VEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK-EKKLASKSSSRKAMGVST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGTTCTGGAGGCGATTCCTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTAT
CAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTATTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGACTTGTAACTTCTTCGTATCTA
GACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTTTGGATTTTCTTTCTGATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGT
AAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGAGTAATCAATGAAGCAGTTAATAAACTTGTAGAATGGCTAGAATCTCGCCTCTG
TGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGACAATATATGGGATGATCTAGATGGTAAAGAACTGGTGGACAGTATTGGCTCTG
ACAATACCACTCTTATGTATGCAAAGAGCAAGGCGTTGGAAACTATTGGTAGGTTATTTCAGACGCGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTG
CTAGCTGAAAGTTACAAGCCTTTTCGTTGCCGTATTCATCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTACCAGTTCTCTTACTGCTTGTGGGATGCACATGGTTACT
ATGGAAACTTTTCCAAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAGGTAGAGGAGTTGGTTCAGGAAGACTCACGAATAGATCGTTACCCGAGACTAG
TTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTTCTAAGGAAAAGAAACTGGCCAGCAAATCCAGTTCCAGGAAGGCAATGGGAGTA
AGTACTGATCGAATGTATCATAAAATAGAGAACGGTGGATCCTGTAGATTAGCTGGATCATCTGTCTAG
mRNA sequenceShow/hide mRNA sequence
AAAGAAGCTGTAAGAGAAGGCAGGCGCAGGCCGTGAAGAACGACGTTAATCGATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCG
GTTCTGGAGGCGATTCCTCTGCTTCATCTTCTACAGTGTTGCTGAAGTCTATCAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTATTTACT
GTACTTTTCATCGCCTGCTTGGTTTTTGTGACTTGTAACTTCTTCGTATCTAGACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTTTGGATTT
TCTTTCTGATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGTAAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGATGGAG
TAATCAATGAAGCAGTTAATAAACTTGTAGAATGGCTAGAATCTCGCCTCTGTGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTAAAGAGGAC
AATATATGGGATGATCTAGATGGTAAAGAACTGGTGGACAGTATTGGCTCTGACAATACCACTCTTATGTATGCAAAGAGCAAGGCGTTGGAAACTATTGGTAGGTTATT
TCAGACGCGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGATCTGCTAGCTGAAAGTTACAAGCCTTTTCGTTGCCGTATTCATCACTGGGTTTTGCAGCATG
CTTTTGTTGTTTTACCAGTTCTCTTACTGCTTGTGGGATGCACATGGTTACTATGGAAACTTTTCCAAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACAACCAG
GTAGAGGAGTTGGTTCAGGAAGACTCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTGAGCTCTTC
TAAGGAAAAGAAACTGGCCAGCAAATCCAGTTCCAGGAAGGCAATGGGAGTAAGTACTGATCGAATGTATCATAAAATAGAGAACGGTGGATCCTGTAGATTAGCTGGAT
CATCTGTCTAG
Protein sequenceShow/hide protein sequence
MSSTPKKRTKFKRNPNSDVGSGGDSSASSSTVLLKSIKEPPRDFFPSKDDLAALFTVLFIACLVFVTCNFFVSRLSSRHPRPFCDTDADSLDFLSDVCEPCPRHGECRDG
KLECLHGYRKHGRLCIEDGVINEAVNKLVEWLESRLCEANAKFLCDGIGIVWVKEDNIWDDLDGKELVDSIGSDNTTLMYAKSKALETIGRLFQTRQNSLGIKELKCPDL
LAESYKPFRCRIHHWVLQHAFVVLPVLLLLVGCTWLLWKLFQRQYLTNRAEDLYNQVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKKLASKSSSRKAMGV
STDRMYHKIENGGSCRLAGSSV