; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC09G169960 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC09G169960
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionMSC domain-containing protein
Genome locationCiama_Chr09:14537035..14541425
RNA-Seq ExpressionCaUC09G169960
SyntenyCaUC09G169960
Gene Ontology termsGO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR018996 - Man1/Src1, C-terminal
IPR041885 - MAN1, winged-helix domain
IPR044780 - Heh2/Src1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038534.1 MSC domain-containing protein [Cucumis melo var. makuwa]2.7e-19884.75Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPKKRTK KRNPNSDVGSG DSS  SS++LLKS+KEPPRDFFPSKDDLAAL+TVL IA LVFVSCNFFVSRLSSRHP PFCDTDADSLDLLS     
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                       VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCE+NAKFLCDGIGIVWVKE +IWDDLDGKEL+ESI
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
        GSDNTTL+YAKSK LETIGGL  TRQNS GIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQ LTNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG
        EENAL STRNS Q ESWVVASRLRDHLLLPRERKNP LW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+K+L      ASKS+F KA+G
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG

Query:  VNSDQMYRNIENG
        VN D MY  IENG
Subjt:  VNSDQMYRNIENG

XP_004148518.1 uncharacterized protein LOC101208017 isoform X1 [Cucumis sativus]1.3e-19783.89Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGG----DSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSV
        MSSTPKKRTK KRNPNSDVGSG     DSS  SS++LLKSIKEPPRDFFPSKDDLAAL+TVL IAC VFVSCNFFVSRLSSRHP PFCDTDADS D +S 
Subjt:  MSSTPKKRTKFKRNPNSDVGSGG----DSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSV

Query:  ANNNFQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKEL
                           VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCEANAKFLCDGIGIVWVKE +IWDDLDGKEL
Subjt:  ANNNFQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKEL

Query:  MESIGSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQV
        +ESIGSDNTTL+YAKSK LETIGGL  TRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQV
Subjt:  MESIGSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQV

Query:  CEILEENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFR
        CEILEENAL STRNSGQ ESWVVASRLRDHLLLPRER+NP LW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS +K+L      ASKS+F 
Subjt:  CEILEENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFR

Query:  KAMGVNSDQMYRNIEN
        KA+GVN D MY  IEN
Subjt:  KAMGVNSDQMYRNIEN

XP_008465930.1 PREDICTED: uncharacterized protein LOC103503505 isoform X3 [Cucumis melo]1.0e-19784.71Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPKKRTK KRNPNSDVGSG DSS  SS++LLKS+KEPPRDFFPSKDDLAAL+TVL IA LVFVSCNFFVSRLSSRHP PFCDTDADSLDLLS     
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                       VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCE+NAKFLCDGIGIVWVKE +IWDDLDGKEL+ESI
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
        GSDNTTL+YAKSK LETIGGL  TRQNS GIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQ LTNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG
        EENAL STRNS Q ESWVVASRLRDHLLLPRERKNP LW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+K+L      ASKS+F KA+G
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG

Query:  VNSDQMYRNIEN
        VN D MY  IEN
Subjt:  VNSDQMYRNIEN

XP_023533380.1 uncharacterized protein LOC111795284 isoform X2 [Cucurbita pepo subsp. pepo]2.8e-18781.13Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPK+RTKFK N NSDV S  DS   SS VLL S+K PPRDFFPSKDDL  L+TVLFIA LVFVSCNFFVSRL +R PRPFCD+DADS DLLS A   
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                        CEPCP HGEC +GKLEC HGYR+HGRLCIEDGVIN+AV +L EWLESHLCEANAKFLCDGIGIVWV+E  IWDDLDGK L+E+I
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
         SDNTT++YAKSK LETIGGLF  RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPV LLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSFRKAMGVNSDQM
        EENALMSTRNSGQ ESWVVASRLRDHLLLPRERK+P LWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSS R AMGVNSD +
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSFRKAMGVNSDQM

Query:  YRNIENGG
        Y  +ENGG
Subjt:  YRNIENGG

XP_038888162.1 uncharacterized protein LOC120078048 [Benincasa hispida]8.0e-20386.7Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPKKRTK KRN NSDVGS GDSS  SST+LLKSIKEPPRDFFPSKDDLAAL+TVLFIACL+FVSC+FFVSRL+SR PRPFCDTDADSLDLLS     
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                       VCEPCPRHGECRDGKL+CLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCEANAKFLCDGIGIVWVKE +IWDDLDGKEL+ESI
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
        GSDNTTL YAKSK LETIGGLF TRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAF VLPVFLLLVGCTWLLWKLYRRQY+TNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSFRKAMGVNSDQM
        EENALMSTRNSGQ ESWVVASRLRDHLLLPRERKNP LWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLA+KS+  KAMGV++DQM
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSFRKAMGVNSDQM

Query:  YRNIEN
        +  +EN
Subjt:  YRNIEN

TrEMBL top hitse value%identityAlignment
A0A0A0LI89 MSC domain-containing protein6.4e-19883.89Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGG----DSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSV
        MSSTPKKRTK KRNPNSDVGSG     DSS  SS++LLKSIKEPPRDFFPSKDDLAAL+TVL IAC VFVSCNFFVSRLSSRHP PFCDTDADS D +S 
Subjt:  MSSTPKKRTKFKRNPNSDVGSGG----DSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSV

Query:  ANNNFQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKEL
                           VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCEANAKFLCDGIGIVWVKE +IWDDLDGKEL
Subjt:  ANNNFQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKEL

Query:  MESIGSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQV
        +ESIGSDNTTL+YAKSK LETIGGL  TRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQV
Subjt:  MESIGSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQV

Query:  CEILEENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFR
        CEILEENAL STRNSGQ ESWVVASRLRDHLLLPRER+NP LW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS +K+L      ASKS+F 
Subjt:  CEILEENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFR

Query:  KAMGVNSDQMYRNIEN
        KA+GVN D MY  IEN
Subjt:  KAMGVNSDQMYRNIEN

A0A1S3CQ15 uncharacterized protein LOC103503505 isoform X34.9e-19884.71Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPKKRTK KRNPNSDVGSG DSS  SS++LLKS+KEPPRDFFPSKDDLAAL+TVL IA LVFVSCNFFVSRLSSRHP PFCDTDADSLDLLS     
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                       VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCE+NAKFLCDGIGIVWVKE +IWDDLDGKEL+ESI
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
        GSDNTTL+YAKSK LETIGGL  TRQNS GIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQ LTNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG
        EENAL STRNS Q ESWVVASRLRDHLLLPRERKNP LW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+K+L      ASKS+F KA+G
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG

Query:  VNSDQMYRNIEN
        VN D MY  IEN
Subjt:  VNSDQMYRNIEN

A0A1S3CRG1 uncharacterized protein LOC103503505 isoform X17.3e-18686.83Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPKKRTK KRNPNSDVGSG DSS  SS++LLKS+KEPPRDFFPSKDDLAAL+TVL IA LVFVSCNFFVSRLSSRHP PFCDTDADSLDLLS     
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                       VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCE+NAKFLCDGIGIVWVKE +IWDDLDGKEL+ESI
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
        GSDNTTL+YAKSK LETIGGL  TRQNS GIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQ LTNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVE
        EENAL STRNS Q ESWVVASRLRDHLLLPRERKNP LW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQ E
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVE

A0A5A7T509 MSC domain-containing protein1.3e-19884.75Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPKKRTK KRNPNSDVGSG DSS  SS++LLKS+KEPPRDFFPSKDDLAAL+TVL IA LVFVSCNFFVSRLSSRHP PFCDTDADSLDLLS     
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                       VCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVN+L EWLESHLCE+NAKFLCDGIGIVWVKE +IWDDLDGKEL+ESI
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
        GSDNTTL+YAKSK LETIGGL  TRQNS GIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQ LTNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG
        EENAL STRNS Q ESWVVASRLRDHLLLPRERKNP LW+KVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK+K+L      ASKS+F KA+G
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRL------ASKSSFRKAMG

Query:  VNSDQMYRNIENG
        VN D MY  IENG
Subjt:  VNSDQMYRNIENG

A0A6J1H2A7 uncharacterized protein LOC111459381 isoform X35.6e-18681.03Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        MSSTPK+RTKFK N NSDV S  DS   SS VLL SIK PPRDFFPSKDDL  L+TVLFIA LVFVSCNFFVSRL +R PRPFCD+DADS DLLS A   
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
                        CEPCP HGEC +GKLEC HGYR+HGRLCIEDGVIN+AV +L EWLESHLCEANAKFLCDGIGIVWV+E  IWDDLDGK L+E+I
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
         SDNTT++YAKSK LETIGGLF  RQN+LGIKELKCPD LAESYKPFTCRIRHWVLQHAFVVLPV LLLVGCTWLLWKL RRQYLTNRAEDLYNQVCEIL
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSFRKAMGVNSDQM
        EENALMSTRNSGQ ESWVVASRLRDHLLLPRERK+P LWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSS R  MGVNSD +
Subjt:  EENALMSTRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSFRKAMGVNSDQM

Query:  YRNIEN
        Y  +EN
Subjt:  YRNIEN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G46560.1 CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).3.7e-8942.12Show/hide
Query:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN
        M S P+KR      P S+  +G    + SS+  ++S+ EPP+  FPSK +   LL VL +AC V  +CNF    LSS   + FCD             +N
Subjt:  MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNN

Query:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI
        F  ++  +       +CEPCP +GEC  GKL+C  GY+    LC+EDG INE+  +LV + E  +CE+ A   C G G +WV E ++W +L     + ++
Subjt:  FQSLEMTISSAMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESI

Query:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL
          D +   + K K +E +  L   R NS GI ELKCP+ +A+SYKP TCR+  W+L+H  ++     +LVG   L  ++ R+Q  + R E+LY+QVC+ L
Subjt:  GSDNTTLIYAKSKVLETIGGLFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEIL

Query:  EENALMS-TRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK-EKRLASKSSFRKAMGVNSD
        EENA+ S +  +   E WV+AS LRD+LLLPRER++P LW KVEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS SK +K+  ++   RK++  ++ 
Subjt:  EENALMS-TRNSGQSESWVVASRLRDHLLLPRERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK-EKRLASKSSFRKAMGVNSD

Query:  -QMYRN
         Q Y N
Subjt:  -QMYRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGGTCTGGAGGCGATTCCTCTGCTCCATCTTCTACAGTGTTGCTGAAGTCTAT
CAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTACTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGAGTTGTAACTTCTTCGTATCTA
GACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTCTGGATTTGCTTTCTGTTGCCAACAATAATTTCCAGTCGCTTGAAATGACTATTTCAAGC
GCAATGGCGCCACATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGTAAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGA
TGGAGTAATCAATGAAGCAGTTAATCAACTTGTAGAATGGCTAGAATCTCACCTCTGTGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGGGATAGTTTGGGTTAAAG
AGGGCAATATATGGGATGATCTAGATGGTAAAGAGCTGATGGAAAGTATTGGCTCTGACAACACCACTCTTATTTATGCAAAGAGCAAGGTGTTGGAAACTATTGGTGGG
TTATTTCATACACGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGACCTGCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCA
GCATGCTTTTGTTGTTTTGCCAGTTTTCTTACTGCTTGTGGGATGCACATGGTTACTATGGAAACTTTACCGAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACA
ACCAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCAACGAGAAACAGTGGTCAATCTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCA
CGAGAGAGGAAGAATCCTTTCTTATGGAGGAAGGTAGAGGAGTTGGTTCAGGAAGACTCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATG
GGAATGGCAAGTAGAAGGATCTTTGAGCTCTTCAAAGGAAAAGAGACTGGCCAGCAAATCCAGTTTCAGGAAGGCAATGGGAGTAAATTCTGACCAAATGTATCGTAACA
TAGAGAACGGTGGATACTGTAGATTAGCTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCAACTCCGAAGAAGCGAACGAAATTCAAGCGTAATCCGAACTCCGATGTCGGGTCTGGAGGCGATTCCTCTGCTCCATCTTCTACAGTGTTGCTGAAGTCTAT
CAAGGAACCGCCTCGCGATTTCTTCCCCTCGAAGGATGATCTTGCTGCGCTACTTACTGTACTTTTCATCGCCTGCTTGGTTTTTGTGAGTTGTAACTTCTTCGTATCTA
GACTTTCAAGTCGCCACCCGAGGCCTTTCTGTGATACCGACGCCGATTCTCTGGATTTGCTTTCTGTTGCCAACAATAATTTCCAGTCGCTTGAAATGACTATTTCAAGC
GCAATGGCGCCACATGTTTGTGAGCCTTGTCCAAGGCATGGAGAATGTCGTGATGGTAAGTTGGAATGCCTTCATGGTTATAGAAAGCATGGAAGGTTATGTATAGAAGA
TGGAGTAATCAATGAAGCAGTTAATCAACTTGTAGAATGGCTAGAATCTCACCTCTGTGAAGCAAATGCCAAGTTCTTATGCGATGGAATTGGGATAGTTTGGGTTAAAG
AGGGCAATATATGGGATGATCTAGATGGTAAAGAGCTGATGGAAAGTATTGGCTCTGACAACACCACTCTTATTTATGCAAAGAGCAAGGTGTTGGAAACTATTGGTGGG
TTATTTCATACACGACAAAATTCTCTTGGGATCAAGGAATTGAAATGCCCAGACCTGCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCA
GCATGCTTTTGTTGTTTTGCCAGTTTTCTTACTGCTTGTGGGATGCACATGGTTACTATGGAAACTTTACCGAAGACAATATCTAACAAATAGAGCTGAAGATCTGTACA
ACCAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCAACGAGAAACAGTGGTCAATCTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCA
CGAGAGAGGAAGAATCCTTTCTTATGGAGGAAGGTAGAGGAGTTGGTTCAGGAAGACTCACGAATAGATCGTTACCCGAGACTAGTTAAGGGTGATGGAAAAGAAGTATG
GGAATGGCAAGTAGAAGGATCTTTGAGCTCTTCAAAGGAAAAGAGACTGGCCAGCAAATCCAGTTTCAGGAAGGCAATGGGAGTAAATTCTGACCAAATGTATCGTAACA
TAGAGAACGGTGGATACTGTAGATTAGCTGGTTGA
Protein sequenceShow/hide protein sequence
MSSTPKKRTKFKRNPNSDVGSGGDSSAPSSTVLLKSIKEPPRDFFPSKDDLAALLTVLFIACLVFVSCNFFVSRLSSRHPRPFCDTDADSLDLLSVANNNFQSLEMTISS
AMAPHVCEPCPRHGECRDGKLECLHGYRKHGRLCIEDGVINEAVNQLVEWLESHLCEANAKFLCDGIGIVWVKEGNIWDDLDGKELMESIGSDNTTLIYAKSKVLETIGG
LFHTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQHAFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQSESWVVASRLRDHLLLP
RERKNPFLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSFRKAMGVNSDQMYRNIENGGYCRLAG