; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G010090 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G010090
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionnitrate regulatory gene2 protein-like
Genome locationCmo_Chr05:8113638..8115643
RNA-Seq ExpressionCmoCh05G010090
SyntenyCmoCh05G010090
Gene Ontology termsNA
InterPro domainsIPR006867 - Domain of unknown function DUF632
IPR006868 - Domain of unknown function DUF630


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599109.1 Protein ALTERED PHOSPHATE STARVATION RESPONSE 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0097.42Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGCASSKLDNLPAV LCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
        EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
Subjt:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES

Query:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD
        KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCK         SRTSSRGLLSTLKKLCVWEQKLYLEVK LERLRMILEKKCRQLKNLVEKNAD
Subjt:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD

Query:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
         RKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQT+ALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
Subjt:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN

Query:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
        WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTA+KFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
Subjt:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA

Query:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
        LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQV VGLTRFCGDSITAYEELCSSAIAQSQPGPP
Subjt:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP

KAG7030049.1 hypothetical protein SDJN02_08395, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0097.25Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGCASSKLDNLPAV LCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSE DDE
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
        EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
Subjt:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES

Query:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD
        KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCK         SRTSSRGLLSTLKKLCVWEQKLYLEVK LERLRMILEKKCRQLKNLVEKNAD
Subjt:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD

Query:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
         RKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNEL HGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
Subjt:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN

Query:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
        WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTA+KFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
Subjt:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA

Query:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
        LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQV VGLTRFCGDSITAYEELCSSAIAQSQPGPP
Subjt:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP

XP_022946043.1 nitrate regulatory gene2 protein-like [Cucurbita moschata]0.0e+0098.45Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
        EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
Subjt:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES

Query:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD
        KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCK         SRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD
Subjt:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD

Query:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
        DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
Subjt:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN

Query:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
        WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
Subjt:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA

Query:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
        LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
Subjt:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP

XP_022999660.1 nitrate regulatory gene2 protein-like [Cucurbita maxima]2.1e-30293.69Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--
        MGCASSKLDNLPAVALCRDRCKFLDQAL FTHSLIDSHSAYADSLNKVASALRRLFDQDG++  GGG DLK PPPP AA S ERSDSDSDSDSDS+SD  
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--

Query:  --DEEDG-CFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEK
          DEEDG C T EKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKK KKPPVVAPPEKLNQPEK
Subjt:  --DEEDG-CFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEK

Query:  KIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLV
        KIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCK         SRTSSRGLLSTLKKLC+WEQKLYLEVK LERLR+ILEKKCRQLKNLV
Subjt:  KIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLV

Query:  EKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLK
        EKNAD+RKIDSVR SIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLK
Subjt:  EKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLK

Query:  LELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWD
        LELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWD
Subjt:  LELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWD

Query:  LQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
        LQRL+LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQV VGLTRFCGDSI AYEELCSSAIAQSQ GPP
Subjt:  LQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP

XP_023545702.1 nitrate regulatory gene2 protein-like [Cucurbita pepo subsp. pepo]0.0e+0095.55Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVAS+LRRLFDQDGESCGGGGGDLKCPPPPSAA SSER   DSDSDSDSESDDE
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKI
        EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD    DQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKI
Subjt:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKI

Query:  DESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEK
        DESKMGVVDLM EIK SFEKASESSNSISKLLSYGQRMSFCK         SRTSSRGLLSTLKKLCVWEQKLYLEVK LERLRMILEKKCRQLKNLVEK
Subjt:  DESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEK

Query:  NADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLE
        NADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLE
Subjt:  NADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLE

Query:  LENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQ
        LENWRSNFVNLI TQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTA+KFSDKDVSEAIQGVVSKLDEALEQQSWDLQ
Subjt:  LENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQ

Query:  RLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
        RL+LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQV VGLT FCGDSITAYEELCSSAIAQSQPGPP
Subjt:  RLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP

TrEMBL top hitse value%identityAlignment
A0A0A0KGR7 Uncharacterized protein5.0e-24977.3Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--
        MGCASSKLDNLPAVALCRDRCKFLDQAL+FTHSLIDSHSAYADSLNK ASALRRLFDQDGE+   GGGDLK PPPP AA  +ERSDSDSDSDSDS+SD  
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--

Query:  --DEEDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKK------QKKPPVVAPP
          +EEDGCF QEKP S  +G+ MFS+YD ARGQPPPPPP GSSWDFFNFF++YERYEQP FNWD    D+K  TT+VV  KKKK       KK  V    
Subjt:  --DEEDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKK------QKKPPVVAPP

Query:  EKLNQPEKKIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKK
        EK NQPEKKID+ KM V+DLM EIKGSFEKASESSNSISKLLSYGQRMS CKG            SRGLLSTLKKLCVWEQKLYLEVK  ER+RM+LEKK
Subjt:  EKLNQPEKKIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKK

Query:  CRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAH
        CRQLKNL+EK AD RKID +R+SIRNLSIKL +SIQVVDRISITISKLRDEEF AEMNELI GLQSMWK+M E HKQQTQALTD KPFES+LNGALDD H
Subjt:  CRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAH

Query:  LEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDE
        LEAAMDLKLELENWR+NF+ LIATQKDCIKALNGWLLRCLL+EP+PEETPNGGCPPPFSP RIGAPPVFVI +LWSDTA+KFS+KDVSEA+QG+V KLD+
Subjt:  LEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDE

Query:  ALEQQSWDLQRLALGNRDLEKKIKAKK---MNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIA----------
        ALEQQS DLQRLAL N+DLEKKIK KK   MNQ  EEK T AVAAGTVHKVGKF+GCEIQ G+RQ+ VGLTRFCGDSI AYEELC SAI           
Subjt:  ALEQQSWDLQRLALGNRDLEKKIKAKK---MNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIA----------

Query:  -QSQPGPP
         QSQ  PP
Subjt:  -QSQPGPP

A0A1S3CP29 uncharacterized protein LOC1035026347.7e-25077.89Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGCASSKLDNLPAVALCRDRCKFLDQALV THSLIDSHSAYADSL+K ASALRRLFDQDGE+   GGGDLKC PPP AA   +RSDSDSDS+     ++E
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKK------QKKPPVVAPPEKLN
        EDGCF QEKP SP +G+ MFSNYDSARGQPPPPPPTGSSWDFFNFF++YERYEQP FNWD    D+K  TT+VV  KKKK       KK  V    EK N
Subjt:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKK------QKKPPVVAPPEKLN

Query:  QPEKKIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQL
        QPEKKID+ KM V+DLM EIKGSFEKASESSNSISKLLSYGQRMSFCK         +RT SRGLLSTLKKL VWEQKLYLEVK  ERLRM+LEKKCRQL
Subjt:  QPEKKIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQL

Query:  KNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAA
        KNL+EK AD RKID +R+SIRNLSIKL +SIQVVDRISITISKLRDEEF AEMNELI GLQSMWK+M E HKQQTQALTD KPFES+LNGALDD HLEAA
Subjt:  KNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAA

Query:  MDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQ
        MDLKLELENWR+NF+ LIATQKDCIKALNGWLLRCLL+EP+PEETPNGGCPPPFSP RIGAPPVFVI +LWSDTA+ FSDKDVSEAIQG+V KLD+ALEQ
Subjt:  MDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQ

Query:  QSWDLQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAI-------AQSQPGPP
        QS DLQRLAL N+DLEKKIK KK      E+ T AVAAGTVHKVGKF+GCEIQ G+RQ+ VGLTRFCGDSI AYEELC SAI       +QSQ GPP
Subjt:  QSWDLQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAI-------AQSQPGPP

A0A5A7T6A4 Uncharacterized protein2.6e-25378.67Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--
        MGCASSKLDNLPAVALCRDRCKFLDQALV THSLIDSHSAYADSL+K ASALRRLFDQDGE+   GGGDLKC PPP AA   +RSDSDSDSDSDS+SD  
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--

Query:  -DEEDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKK------QKKPPVVAPPE
         +EEDGCF QEKP SP +G+ MFSNYDSARGQPPPPPPTGSSWDFFNFF++YERYEQP FNWD    D+K  TT+VV  KKKK       KK  V    E
Subjt:  -DEEDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWD---PDQKPGTTRVVTNKKKK------QKKPPVVAPPE

Query:  KLNQPEKKIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKC
        K NQPEKKID+ KM V+DLM EIKGSFEKASESSNSISKLLSYGQRMSFCK         +RT SRGLLSTLKKL VWEQKLYLEVK  ERLRM+LEKKC
Subjt:  KLNQPEKKIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKC

Query:  RQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHL
        RQLKNL+EK AD RKID +R+SIRNLSIKL +SIQVVDRISITISKLRDEEF AEMNELI GLQSMWK+M E HKQQTQALTD KPFES+LNGALDD HL
Subjt:  RQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHL

Query:  EAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEA
        EAAMDLKLELENWR+NF+ LIATQKDCIKALNGWLLRCLL+EP+PEETPNGGCPPPFSP RIGAPPVFVI +LWSDTA+KFSDKDVSEAIQG+V KLD+A
Subjt:  EAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEA

Query:  LEQQSWDLQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAI-------AQSQPGPP
        LEQQS DLQRLAL N+DLEKKIK KK      E+ T AVAAGTVHKVGKF+GCEIQ G+RQ+ VGLTRFCGDSI AYEELC SAI       +QSQ GPP
Subjt:  LEQQSWDLQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAI-------AQSQPGPP

A0A6J1G2P6 nitrate regulatory gene2 protein-like0.0e+0098.45Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
        EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES
Subjt:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDES

Query:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD
        KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCK         SRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD
Subjt:  KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNAD

Query:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
        DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN
Subjt:  DRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELEN

Query:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
        WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA
Subjt:  WRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLA

Query:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
        LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
Subjt:  LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP

A0A6J1KG49 nitrate regulatory gene2 protein-like1.0e-30293.69Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--
        MGCASSKLDNLPAVALCRDRCKFLDQAL FTHSLIDSHSAYADSLNKVASALRRLFDQDG++  GGG DLK PPPP AA S ERSDSDSDSDSDS+SD  
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESD--

Query:  --DEEDG-CFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEK
          DEEDG C T EKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKK KKPPVVAPPEKLNQPEK
Subjt:  --DEEDG-CFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEK

Query:  KIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLV
        KIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCK         SRTSSRGLLSTLKKLC+WEQKLYLEVK LERLR+ILEKKCRQLKNLV
Subjt:  KIDESKMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLV

Query:  EKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLK
        EKNAD+RKIDSVR SIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLK
Subjt:  EKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLK

Query:  LELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWD
        LELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWD
Subjt:  LELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWD

Query:  LQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP
        LQRL+LGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQV VGLTRFCGDSI AYEELCSSAIAQSQ GPP
Subjt:  LQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQVLVGLTRFCGDSITAYEELCSSAIAQSQPGPP

SwissProt top hitse value%identityAlignment
A0A178VBJ0 Protein ALTERED PHOSPHATE STARVATION RESPONSE 12.8e-2323.38Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGC  S++D+   V+ C+ R ++L   +    +L  SH+ Y  SL  V S+L     ++             PPPP          S       SE+   
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGT-TRVVTNKKKKQKKPPVVAPPE-----------
             +   P  P                PPPPPP  S+WDF++ F            W+ +    T T   T         P  A P+           
Subjt:  EDGCFTQEKPRSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGT-TRVVTNKKKKQKKPPVVAPPE-----------

Query:  ----KLNQPEKKIDESKMG--VVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTS-----------SRGLL--------------
                 E  +  S+ G  +++++ E+   F KA++S   +S LL     ++   G +    +YS ++           +RG                
Subjt:  ----KLNQPEKKIDESKMG--VVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLLLYSRTS-----------SRGLL--------------

Query:  ----------STLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNEL
                  ST+ +L  WE+KLY EVK  E ++M  EKK  Q++ L  K A+  K +  +  +  L  +L+VS Q +   S  I KLR+ E   ++ EL
Subjt:  ----------STLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNEL

Query:  IHGLQSMWKAMSEAHKQQTQALTDAKPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFS
        + GL  MW++M E+H+ QT  +   K   ++       + H ++ + L+LE++ W  +F NL+  Q+D I++L GWL   L    +     +        
Subjt:  IHGLQSMWKAMSEAHKQQTQALTDAKPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFS

Query:  PARIGAPPVFVIAYLWSDTAEKFSDKDVSE-------AIQGVVS------KLDEALEQQSWDLQRLALGNRDLEKKI------KAKKMNQEIEEKTTAAV
                ++     W    ++  DK  SE       A+ G+V+      K  +  E    D ++ +   R LE K       +++K N  IE++    +
Subjt:  PARIGAPPVFVIAYLWSDTAEKFSDKDVSE-------AIQGVVS------KLDEALEQQSWDLQRLALGNRDLEKKI------KAKKMNQEIEEKTTAAV

Query:  AAG
          G
Subjt:  AAG

Q9AQW1 Protein ROLLING AND ERECT LEAF 21.0e-2028.87Show/hide
Query:  STLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKA
        STL++L  WE+KLY EVK  E +++  EKK   L++L  +  D  K+D  ++SI  L   + V+ Q     S  I ++RD E   ++ EL   L SMW++
Subjt:  STLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKA

Query:  MSEAHK------QQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPE--PEETPNGGCPPPFSPAR
        M+  H+      QQ + L D    ES       D H  A  DL+  +  W SNF  LI  Q+D I+AL GWL   L       P+E          +   
Subjt:  MSEAHK------QQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPE--PEETPNGGCPPPFSPAR

Query:  IGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDL---QRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAG
        + +  +      W    ++  D   SEAI+  V+ +     +Q+ ++   +R    +++LEKK  + +  ++   ++ + V  G
Subjt:  IGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDL---QRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAG

Arabidopsis top hitse value%identityAlignment
AT1G20530.1 Protein of unknown function (DUF630 and DUF632)1.8e-7034.87Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQ------DGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSD
        MGC+ SKLD LPAV+LCRDRC  L+  L  +++L D+HSAY  SLN V  AL R F        D ES      +   P   S+A S   SDSD     D
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQ------DGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSD

Query:  SESDDEED---GCFTQEKPRSPQIGNLMF--SNYDSARGQPPPPPPTGS--SWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPP
        S+ ++ E       +  KP S  + +  F    Y+S    PPPPPP  S  +WDF NFFE+YE     + N   D++  TT      K K+KK  V    
Subjt:  SESDDEED---GCFTQEKPRSPQIGNLMF--SNYDSARGQPPPPPPTGS--SWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPP

Query:  EKLNQPEKK-------------IDESK-----MGVVDLMTEIKGSFEKASESSNSISKLLS-----YGQRMSFCK-GIAFLL----LLYSR---------
        EK+   E+K               ESK       + ++  +++  F+KASES N +SK+       Y Q+ S  +  +  LL    +LY++         
Subjt:  EKLNQPEKK-------------IDESK-----MGVVDLMTEIKGSFEKASESSNSISKLLS-----YGQRMSFCK-GIAFLL----LLYSR---------

Query:  ----TSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNE
            ++   L STLKKL +WE+KLY EVK  E+LR    K  + L+ L  K+AD  KI+++RSSI+ LS ++ VSI  ++ I +TI+KLRDEE   +M E
Subjt:  ----TSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNE

Query:  LIHGLQSMWKAMSEAHKQQTQALTDAKPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPF
        LIH L  MW +M E H +Q++ + +AK  + + +   LD + LE AM+LKLEL NW  +  N I  Q   +KALN WL+RCL  EP+ E TP+       
Subjt:  LIHGLQSMWKAMSEAHKQQTQALTDAKPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPF

Query:  SPARIGAPPVFVIAYLWSDTAE-KFSDKDVSEAIQGVVSKLDEALEQQSWDL--QRLALGN-RDLEKKI-KAKKMNQEIEEKTTAAVAAGTVHKVGKFNG
               PP+F     WS   E    +K+ +EA+  ++  ++  +E+Q  +L  QR   G+ +D+E+K+   +K  Q+++ K        TV  V     
Subjt:  SPARIGAPPVFVIAYLWSDTAE-KFSDKDVSEAIQGVVSKLDEALEQQSWDL--QRLALGN-RDLEKKI-KAKKMNQEIEEKTTAAVAAGTVHKVGKFNG

Query:  CEIQQGVRQVLVGLTRFCGDSITAYEEL
          ++  + Q+   + +   +S   YEEL
Subjt:  CEIQQGVRQVLVGLTRFCGDSITAYEEL

AT1G21740.1 Protein of unknown function (DUF630 and DUF632)1.5e-4841.42Show/hide
Query:  LLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMW
        L +TL++L  WE+KLY EVK  E+LR++ E+KCR LK L    A+  KID+ R++IR L  KL+V I+ VD IS  I KLRDEE   ++ +LIHGL  MW
Subjt:  LLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMW

Query:  KAMSEAHKQQTQALTDAKPFESVLNGALD-DAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPP
        ++M + H++Q QA+ ++K      N  L  D+ L+A +DL++EL  W  +F + + TQK  +++LNGWL RCL +EPE  E        PFSP+R+GAP 
Subjt:  KAMSEAHKQQTQALTDAKPFESVLNGALD-DAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPP

Query:  VFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIKAKKMNQEIEEK
        VFVI   W +   + S ++VS A+QG  S L E  E+Q          + +  +++KA+ ++ + E++
Subjt:  VFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIKAKKMNQEIEEK

AT1G21740.1 Protein of unknown function (DUF630 and DUF632)6.3e-0232.14Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSER---------SDSDSDS
        MGC  SK+D+ P V LCR+R + +  A     +L  +H +Y  SL  V  +++R  D+  E    G      P  P     S+          S S S S
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSER---------SDSDSDS

Query:  DSDSESDDEEDG
         S  E DDE +G
Subjt:  DSDSESDDEEDG

AT2G17110.1 Protein of unknown function (DUF630 and DUF632)1.2e-6629.82Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE
        MGC++SKLD+LPAVALCRDRC FL+ A+   ++L ++H +Y  SL  ++ +L +  +           D   P        S     D DSDSDS+ DD+
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDE

Query:  EDGCFT----------------------------------QEKPRSPQ--------------IGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERY
         D   +                                  +++P SPQ                  + SNY S    PPP PP    WDF + F+TY   
Subjt:  EDGCFT----------------------------------QEKPRSPQ--------------IGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERY

Query:  EQPNFNW----------DPDQKPGTTRVVTNKKK------------------------------KQKKPPVVAPPEKLNQP----EKKIDES--------
          P+ +           D ++     + V  K+K                               Q +P V    E++       EKKI E         
Subjt:  EQPNFNW----------DPDQKPGTTRVVTNKKK------------------------------KQKKPPVVAPPEKLNQP----EKKIDES--------

Query:  ------------KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLL------------LYSRTS------------------------
                    + GV ++  EI+  F +A+ES N I+ +L  G+     K ++   L              S TS                        
Subjt:  ------------KMGVVDLMTEIKGSFEKASESSNSISKLLSYGQRMSFCKGIAFLLL------------LYSRTS------------------------

Query:  --SRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHG
          SR L STL KL +WE+KLY EVK  E++R+  EKK R+LK + E+ A+++K+DS R  +R+LS K+ ++IQVVD+IS+TI+K+RDEE   ++NELI G
Subjt:  --SRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHG

Query:  LQSMWKAMSEAHKQQTQALTDAK---PFESVLNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSP
        L  MWK+M E HK Q +A+ +A+   P  +  N      HLE    L  EL NW   F + ++ QK  ++ LN WL++CL +  EPEETP+G    PFSP
Subjt:  LQSMWKAMSEAHKQQTQALTDAK---PFESVLNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSP

Query:  ARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIK-AKKMNQEIEEKTTAAVAAG-----TVHKVGKFNGC
         RIGAP +FVI   W    ++ S+K+V EAI+   + +    EQ     +   +G+ D     +  +++ +EI+E     V  G      V++    N  
Subjt:  ARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIK-AKKMNQEIEEKTTAAVAAG-----TVHKVGKFNGC

Query:  EIQQGVRQVLVGLTRFCGDSITAYEELCSSA
         +Q  ++++   + RF  +S+ AY +L   A
Subjt:  EIQQGVRQVLVGLTRFCGDSITAYEELCSSA

AT4G35240.1 Protein of unknown function (DUF630 and DUF632)8.1e-5835.93Show/hide
Query:  VVDLMTEIKGSFEKASESSNSISKLLS-----YGQRMSFCKGIAFLLLLYSRTS--------------------------SRGLLSTLKKLCVWEQKLYL
        V ++  EI+  F KA+ES + I+KLL      YG++ +  K +  +      TS                          SR L STL KL +WE+KLY 
Subjt:  VVDLMTEIKGSFEKASESSNSISKLLS-----YGQRMSFCKGIAFLLLLYSRTS--------------------------SRGLLSTLKKLCVWEQKLYL

Query:  EVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDA
        EVK  E+LR+  EKK R+LK L ++ A+  K+D  R  +R++S K+ ++IQVVD+IS+TI+K+RDE+   ++N LI GL  MWK M E H+ Q QA+ +A
Subjt:  EVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDA

Query:  KPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSD
        +    +  +  L D HLEA   L  EL NW   F + ++ QK  +K LN WL++CLL+  EPEETP+G    PFSP RIGAPP+FVI   WS   ++ S+
Subjt:  KPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSD

Query:  KDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIK-----AKKMNQEIE--EKTTAAVA---------AGTVHKVGKFNGCEIQQGVRQVLVGLT
        K+V EA++   + + +  EQ    L  +  G+ D EKK++      +++ +EI+  EK    VA         +G V      +   +Q  ++++   + 
Subjt:  KDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIK-----AKKMNQEIE--EKTTAAVA---------AGTVHKVGKFNGCEIQQGVRQVLVGLT

Query:  RFCGDSITAYEELCSSAIAQSQP
        RF  +S+ AYE+L      ++ P
Subjt:  RFCGDSITAYEELCSSAIAQSQP

AT4G35240.1 Protein of unknown function (DUF630 and DUF632)4.8e-1040.86Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGG---GD---LKCPPPPSAATSSERSDS
        MGC SSKLD+LPAVALCR+RC FL+ A+   ++L +SH AY  SL ++  +L    +        GG   GD   L  PP        E ++S
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGG---GD---LKCPPPPSAATSSERSDS

AT4G35240.2 Protein of unknown function (DUF630 and DUF632)8.1e-5835.93Show/hide
Query:  VVDLMTEIKGSFEKASESSNSISKLLS-----YGQRMSFCKGIAFLLLLYSRTS--------------------------SRGLLSTLKKLCVWEQKLYL
        V ++  EI+  F KA+ES + I+KLL      YG++ +  K +  +      TS                          SR L STL KL +WE+KLY 
Subjt:  VVDLMTEIKGSFEKASESSNSISKLLS-----YGQRMSFCKGIAFLLLLYSRTS--------------------------SRGLLSTLKKLCVWEQKLYL

Query:  EVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDA
        EVK  E+LR+  EKK R+LK L ++ A+  K+D  R  +R++S K+ ++IQVVD+IS+TI+K+RDE+   ++N LI GL  MWK M E H+ Q QA+ +A
Subjt:  EVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISITISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDA

Query:  KPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSD
        +    +  +  L D HLEA   L  EL NW   F + ++ QK  +K LN WL++CLL+  EPEETP+G    PFSP RIGAPP+FVI   WS   ++ S+
Subjt:  KPFESV-LNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGGCPPPFSPARIGAPPVFVIAYLWSDTAEKFSD

Query:  KDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIK-----AKKMNQEIE--EKTTAAVA---------AGTVHKVGKFNGCEIQQGVRQVLVGLT
        K+V EA++   + + +  EQ    L  +  G+ D EKK++      +++ +EI+  EK    VA         +G V      +   +Q  ++++   + 
Subjt:  KDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIK-----AKKMNQEIE--EKTTAAVA---------AGTVHKVGKFNGCEIQQGVRQVLVGLT

Query:  RFCGDSITAYEELCSSAIAQSQP
        RF  +S+ AYE+L      ++ P
Subjt:  RFCGDSITAYEELCSSAIAQSQP

AT4G35240.2 Protein of unknown function (DUF630 and DUF632)4.8e-1040.86Show/hide
Query:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGG---GD---LKCPPPPSAATSSERSDS
        MGC SSKLD+LPAVALCR+RC FL+ A+   ++L +SH AY  SL ++  +L    +        GG   GD   L  PP        E ++S
Subjt:  MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGG---GD---LKCPPPPSAATSSERSDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTGCGCTTCCTCCAAGCTCGACAATCTTCCGGCGGTGGCTCTTTGCCGTGACCGCTGCAAATTCCTTGACCAAGCTCTCGTTTTTACCCATTCCCTTATTGATTC
CCACTCTGCTTATGCTGATTCACTCAACAAAGTCGCCTCAGCTCTCCGCCGTTTGTTTGACCAAGATGGTGAAAGCTGCGGCGGCGGCGGTGGCGATTTGAAATGTCCTC
CTCCGCCGTCTGCCGCTACTTCATCGGAGCGATCTGATTCTGATTCTGATTCCGACTCCGATTCGGAATCGGACGATGAAGAAGATGGGTGTTTTACACAAGAGAAACCA
CGATCTCCCCAGATCGGAAATTTAATGTTCTCGAATTACGATTCGGCGAGGGGACAACCACCGCCGCCGCCGCCGACTGGCTCATCGTGGGATTTCTTCAATTTCTTCGA
AACCTACGAGAGATACGAACAACCCAATTTCAATTGGGACCCAGATCAAAAACCAGGAACAACAAGAGTTGTAACGAACAAGAAGAAGAAGCAGAAGAAACCGCCGGTGG
TGGCTCCGCCGGAGAAATTAAACCAACCGGAAAAGAAAATTGATGAATCAAAAATGGGTGTCGTGGATTTAATGACAGAAATCAAGGGTTCGTTTGAAAAAGCGTCTGAA
TCAAGCAATTCCATTTCCAAGCTGCTTTCCTATGGCCAAAGAATGAGTTTCTGCAAAGGTATCGCCTTTCTTCTTCTCCTCTATTCAAGAACAAGCTCTCGAGGGCTGCT
ATCTACCTTGAAGAAGCTCTGTGTTTGGGAGCAGAAACTGTACCTGGAAGTGAAGGGTTTAGAGAGATTGCGTATGATTCTTGAAAAGAAATGCCGGCAGCTGAAGAATT
TGGTGGAGAAGAACGCCGATGATCGGAAAATTGATTCTGTTCGTAGCTCGATCAGAAATTTATCGATCAAATTGAATGTTTCGATTCAGGTGGTTGATCGGATTTCGATC
ACCATTAGTAAGCTGAGGGATGAAGAATTTTTGGCAGAAATGAATGAATTGATTCATGGATTACAGAGCATGTGGAAAGCCATGTCGGAAGCTCACAAGCAGCAAACCCA
GGCATTAACGGACGCCAAACCCTTCGAATCAGTCCTCAACGGCGCCTTGGACGACGCCCATCTGGAAGCCGCCATGGATTTAAAGCTCGAACTCGAGAATTGGAGATCAA
ATTTCGTGAACTTAATCGCCACGCAAAAGGATTGCATCAAAGCGCTCAACGGTTGGCTACTCCGGTGTCTATTACACGAACCCGAGCCGGAAGAAACCCCGAATGGCGGC
TGTCCGCCGCCATTCTCTCCGGCGAGAATCGGTGCTCCTCCTGTATTCGTAATAGCGTACCTCTGGTCCGACACGGCCGAGAAATTCTCCGACAAGGATGTCTCTGAGGC
TATACAGGGGGTGGTGTCGAAGTTGGACGAGGCGCTGGAGCAGCAGAGTTGGGATTTGCAGAGATTGGCGTTGGGGAACAGAGATTTGGAGAAGAAGATTAAGGCGAAGA
AGATGAATCAAGAAATTGAGGAGAAGACAACGGCGGCGGTGGCGGCGGGGACTGTGCATAAAGTGGGTAAGTTTAATGGGTGTGAGATTCAACAGGGGGTGAGGCAGGTT
CTTGTGGGGTTGACGAGGTTTTGTGGGGACTCCATTACAGCATACGAGGAGCTCTGTTCTTCAGCCATAGCTCAGAGCCAGCCAGGTCCGCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCTGCGCTTCCTCCAAGCTCGACAATCTTCCGGCGGTGGCTCTTTGCCGTGACCGCTGCAAATTCCTTGACCAAGCTCTCGTTTTTACCCATTCCCTTATTGATTC
CCACTCTGCTTATGCTGATTCACTCAACAAAGTCGCCTCAGCTCTCCGCCGTTTGTTTGACCAAGATGGTGAAAGCTGCGGCGGCGGCGGTGGCGATTTGAAATGTCCTC
CTCCGCCGTCTGCCGCTACTTCATCGGAGCGATCTGATTCTGATTCTGATTCCGACTCCGATTCGGAATCGGACGATGAAGAAGATGGGTGTTTTACACAAGAGAAACCA
CGATCTCCCCAGATCGGAAATTTAATGTTCTCGAATTACGATTCGGCGAGGGGACAACCACCGCCGCCGCCGCCGACTGGCTCATCGTGGGATTTCTTCAATTTCTTCGA
AACCTACGAGAGATACGAACAACCCAATTTCAATTGGGACCCAGATCAAAAACCAGGAACAACAAGAGTTGTAACGAACAAGAAGAAGAAGCAGAAGAAACCGCCGGTGG
TGGCTCCGCCGGAGAAATTAAACCAACCGGAAAAGAAAATTGATGAATCAAAAATGGGTGTCGTGGATTTAATGACAGAAATCAAGGGTTCGTTTGAAAAAGCGTCTGAA
TCAAGCAATTCCATTTCCAAGCTGCTTTCCTATGGCCAAAGAATGAGTTTCTGCAAAGGTATCGCCTTTCTTCTTCTCCTCTATTCAAGAACAAGCTCTCGAGGGCTGCT
ATCTACCTTGAAGAAGCTCTGTGTTTGGGAGCAGAAACTGTACCTGGAAGTGAAGGGTTTAGAGAGATTGCGTATGATTCTTGAAAAGAAATGCCGGCAGCTGAAGAATT
TGGTGGAGAAGAACGCCGATGATCGGAAAATTGATTCTGTTCGTAGCTCGATCAGAAATTTATCGATCAAATTGAATGTTTCGATTCAGGTGGTTGATCGGATTTCGATC
ACCATTAGTAAGCTGAGGGATGAAGAATTTTTGGCAGAAATGAATGAATTGATTCATGGATTACAGAGCATGTGGAAAGCCATGTCGGAAGCTCACAAGCAGCAAACCCA
GGCATTAACGGACGCCAAACCCTTCGAATCAGTCCTCAACGGCGCCTTGGACGACGCCCATCTGGAAGCCGCCATGGATTTAAAGCTCGAACTCGAGAATTGGAGATCAA
ATTTCGTGAACTTAATCGCCACGCAAAAGGATTGCATCAAAGCGCTCAACGGTTGGCTACTCCGGTGTCTATTACACGAACCCGAGCCGGAAGAAACCCCGAATGGCGGC
TGTCCGCCGCCATTCTCTCCGGCGAGAATCGGTGCTCCTCCTGTATTCGTAATAGCGTACCTCTGGTCCGACACGGCCGAGAAATTCTCCGACAAGGATGTCTCTGAGGC
TATACAGGGGGTGGTGTCGAAGTTGGACGAGGCGCTGGAGCAGCAGAGTTGGGATTTGCAGAGATTGGCGTTGGGGAACAGAGATTTGGAGAAGAAGATTAAGGCGAAGA
AGATGAATCAAGAAATTGAGGAGAAGACAACGGCGGCGGTGGCGGCGGGGACTGTGCATAAAGTGGGTAAGTTTAATGGGTGTGAGATTCAACAGGGGGTGAGGCAGGTT
CTTGTGGGGTTGACGAGGTTTTGTGGGGACTCCATTACAGCATACGAGGAGCTCTGTTCTTCAGCCATAGCTCAGAGCCAGCCAGGTCCGCCATAGCTTCAAGCTTCTTC
AAGCATTCAATTCATTCTTCACCG
Protein sequenceShow/hide protein sequence
MGCASSKLDNLPAVALCRDRCKFLDQALVFTHSLIDSHSAYADSLNKVASALRRLFDQDGESCGGGGGDLKCPPPPSAATSSERSDSDSDSDSDSESDDEEDGCFTQEKP
RSPQIGNLMFSNYDSARGQPPPPPPTGSSWDFFNFFETYERYEQPNFNWDPDQKPGTTRVVTNKKKKQKKPPVVAPPEKLNQPEKKIDESKMGVVDLMTEIKGSFEKASE
SSNSISKLLSYGQRMSFCKGIAFLLLLYSRTSSRGLLSTLKKLCVWEQKLYLEVKGLERLRMILEKKCRQLKNLVEKNADDRKIDSVRSSIRNLSIKLNVSIQVVDRISI
TISKLRDEEFLAEMNELIHGLQSMWKAMSEAHKQQTQALTDAKPFESVLNGALDDAHLEAAMDLKLELENWRSNFVNLIATQKDCIKALNGWLLRCLLHEPEPEETPNGG
CPPPFSPARIGAPPVFVIAYLWSDTAEKFSDKDVSEAIQGVVSKLDEALEQQSWDLQRLALGNRDLEKKIKAKKMNQEIEEKTTAAVAAGTVHKVGKFNGCEIQQGVRQV
LVGLTRFCGDSITAYEELCSSAIAQSQPGPP