; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003993 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003993
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSerine-aspartate repeat-containing protein F isoform X1
Genome locationtig00002539:24103..27308
RNA-Seq ExpressionSgr003993
SyntenySgr003993
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137785.1 dentin matrix acidic phosphoprotein 1 isoform X1 [Momordica charantia]3.1e-26770.4Show/hide
Query:  GFVELNMGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKE
        GFVELNMGNEMGNNNTSE REEE+ K EAPEKSLLE GGNEV A  VADF +KEARSGS+DADRVNG HHVTE+EEGKN    TA+FQV+S KSTDRT+ 
Subjt:  GFVELNMGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKE

Query:  DNEVQTKFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLS
        DNEVQT+FK+DET  F+ KENG DGN+  HEKQTSN+KEE   +KGSNLNTT L LDEPELEKISDFKSIQNEL DTK ESLV DS+K  + CK+ LGLS
Subjt:  DNEVQTKFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLS

Query:  LNYTYHSADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSP
        LN+T HSADC M+DSE +TNMNQ  +T RD DKE D +RG+   S  N       +P            K E ILD+ES+Q E P TK E MVGSSD SP
Subjt:  LNYTYHSADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSP

Query:  QESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSM
        QESKY L S  SL+H+E S   I T KSTNVD IGAET  +VEK+KD LIEPR CHE I PAAK V+ K+E+T TDL SH SS SQ +ESLV+ESPK +M
Subjt:  QESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSM

Query:  QAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAIS
        Q PEV+DRCTVL+EE QM+EEPSGTGT+VQDN PTQFKISNG+QEE +TIGSHSEN+A+ETE+SPDSV   KNQNEAPGED E SDGEYLEI EQG+AI 
Subjt:  QAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAIS

Query:  TLS-VGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATF
        TLS VG+C  KN+E GE  +IS + G EAERRESE++LS+PVLGFQPQIHLQETSI FQ+ + TDESI A RQETET +EKS++N   S      ASAT 
Subjt:  TLS-VGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATF

Query:  TETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVV
        TETKPSTN ID+QS  TLPFSTF E++QE+PGRTSNESNSD+SIG+IEMRKSPSFNIDIQ EGRAG+T+K PLLYQIKTIE+LPNL         EKRVV
Subjt:  TETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVV

Query:  TLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI
        TLGRSDS+KSRPSFPGFAKEKEE  ME+KAINQ+N   AKKAAED  P   T PIRKGKRRTKSLIFGTCICCATAI
Subjt:  TLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI

XP_022137787.1 dentin matrix acidic phosphoprotein 1 isoform X2 [Momordica charantia]1.2e-26370.17Show/hide
Query:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT
        MGNEMGNNNTSE REEE+ K EAPEKSLLE GGNEV A  VADF +KEARSGS+DADRVNG HHVTE+EEGKN    TA+FQV+S KSTDRT+ DNEVQT
Subjt:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT

Query:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYH
        +FK+DET  F+ KENG DGN+  HEKQTSN+KEE   +KGSNLNTT L LDEPELEKISDFKSIQNEL DTK ESLV DS+K  + CK+ LGLSLN+T H
Subjt:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYH

Query:  SADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYA
        SADC M+DSE +TNMNQ  +T RD DKE D +RG+   S  N       +P            K E ILD+ES+Q E P TK E MVGSSD SPQESKY 
Subjt:  SADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYA

Query:  LGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVE
        L S  SL+H+E S   I T KSTNVD IGAET  +VEK+KD LIEPR CHE I PAAK V+ K+E+T TDL SH SS SQ +ESLV+ESPK +MQ PEV+
Subjt:  LGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVE

Query:  DRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VG
        DRCTVL+EE QM+EEPSGTGT+VQDN PTQFKISNG+QEE +TIGSHSEN+A+ETE+SPDSV   KNQNEAPGED E SDGEYLEI EQG+AI TLS VG
Subjt:  DRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VG

Query:  NCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPS
        +C  KN+E GE  +IS + G EAERRESE++LS+PVLGFQPQIHLQETSI FQ+ + TDESI A RQETET +EKS++N   S      ASAT TETKPS
Subjt:  NCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPS

Query:  TNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSD
        TN ID+QS  TLPFSTF E++QE+PGRTSNESNSD+SIG+IEMRKSPSFNIDIQ EGRAG+T+K PLLYQIKTIE+LPNL         EKRVVTLGRSD
Subjt:  TNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSD

Query:  SDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI
        S+KSRPSFPGFAKEKEE  ME+KAINQ+N   AKKAAED  P   T PIRKGKRRTKSLIFGTCICCATAI
Subjt:  SDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI

XP_022137789.1 dentin matrix acidic phosphoprotein 1 isoform X3 [Momordica charantia]6.5e-25769.71Show/hide
Query:  EEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQTKFKEDETTEFNLKEN
        EE+ K EAPEKSLLE GGNEV A  VADF +KEARSGS+DADRVNG HHVTE+EEGKN    TA+FQV+S KSTDRT+ DNEVQT+FK+DET  F+ KEN
Subjt:  EEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQTKFKEDETTEFNLKEN

Query:  GLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHSADCTMADSENTTNM
        G DGN+  HEKQTSN+KEE   +KGSNLNTT L LDEPELEKISDFKSIQNEL DTK ESLV DS+K  + CK+ LGLSLN+T HSADC M+DSE +TNM
Subjt:  GLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHSADCTMADSENTTNM

Query:  NQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYALGSSFSLKHSEVSGR
        NQ  +T RD DKE D +RG+   S  N       +P            K E ILD+ES+Q E P TK E MVGSSD SPQESKY L S  SL+H+E S  
Subjt:  NQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYALGSSFSLKHSEVSGR

Query:  AIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREE
         I T KSTNVD IGAET  +VEK+KD LIEPR CHE I PAAK V+ K+E+T TDL SH SS SQ +ESLV+ESPK +MQ PEV+DRCTVL+EE QM+EE
Subjt:  AIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREE

Query:  PSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VGNCKHKNDEKGETLKI
        PSGTGT+VQDN PTQFKISNG+QEE +TIGSHSEN+A+ETE+SPDSV   KNQNEAPGED E SDGEYLEI EQG+AI TLS VG+C  KN+E GE  +I
Subjt:  PSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VGNCKHKNDEKGETLKI

Query:  SIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFS
        S + G EAERRESE++LS+PVLGFQPQIHLQETSI FQ+ + TDESI A RQETET +EKS++N   S      ASAT TETKPSTN ID+QS  TLPFS
Subjt:  SIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFS

Query:  TFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEK
        TF E++QE+PGRTSNESNSD+SIG+IEMRKSPSFNIDIQ EGRAG+T+K PLLYQIKTIE+LPNL         EKRVVTLGRSDS+KSRPSFPGFAKEK
Subjt:  TFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEK

Query:  EEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI
        EE  ME+KAINQ+N   AKKAAED  P   T PIRKGKRRTKSLIFGTCICCATAI
Subjt:  EEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI

XP_022975417.1 uncharacterized protein LOC111474732 [Cucurbita maxima]1.2e-21549.8Show/hide
Query:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT
        MGNEMGNNNTSE REEE+A+TE PE+SLLEGG NEV A EVADF +KEAR GSD AD++N +HHV EKEE KN  C TA+F V+S K  DRTKEDNEVQ 
Subjt:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT

Query:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNR-KEEGEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS
          KED+T EF+ +EN  DGN+ EHEKQ SN+ +EEGE  SNLNTT LALDEP+ EK SDFKS Q+ELL+ KAES +EDSNK P+ CK+ LGLS N TYHS
Subjt:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNR-KEEGEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS

Query:  ADCTMADSENTTNM-------------------------------------------NQISETVRDTDKENDGDRGR-----------------------
        ADC MADSE TTNM                                           +Q+ +T RD DKE+D +RG+                       
Subjt:  ADCTMADSENTTNM-------------------------------------------NQISETVRDTDKENDGDRGR-----------------------

Query:  -------DLESFQ----NEFPPTKAEP---------------------------------MVGSSDGSPQEN----------------------------
               D+E  +    N       +P                                 M  +S+    EN                            
Subjt:  -------DLESFQ----NEFPPTKAEP---------------------------------MVGSSDGSPQEN----------------------------

Query:  ---------------------------------------------------------------------------------------------------K
                                                                                                           K
Subjt:  ---------------------------------------------------------------------------------------------------K

Query:  YEKILDVESSQNELPPTKVEPMVGSS-DGSPQESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKK
         E   D++S  +ELP TK E + GSS DGS QESKY  GS  SL+H+E +G  + T+K T+VD  GAE +TD+EKKK ++IEPR+CHEY MPAAK V+ K
Subjt:  YEKILDVESSQNELPPTKVEPMVGSS-DGSPQESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKK

Query:  EEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA
        +E+   DLI H +S SQPEES+V+E PKS MQ PE E+RCTVL+E  Q+REE  GT TIVQDN PTQ KISN VQEE NTI SHSE NA     + +SV 
Subjt:  EEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA

Query:  KNQNEAPGEDCESSDGEYLEIAEQGMAISTLSVGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALR
        +NQNE PGEDCE SDGEYLEI+EQ M +   S+G+CK KN   GET +IS + GHE ERRE +ENL +P+LG QPQ H +E SI FQT +STDESIS LR
Subjt:  KNQNEAPGEDCESSDGEYLEIAEQGMAISTLSVGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALR

Query:  QETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTP
        Q TE A EKS+KNSS SPH I+TASATFTETKPSTN ID+Q+  TLP STF EENQESPGRTSNESNSDNS+G+IEMRKSPSFNIDIQ EG+  +T+K P
Subjt:  QETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTP

Query:  LLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCIC
        LLYQIKTIE+L NL         EKRVV LGRS+S+KSRPSFPGFAKEKE++ ME  AINQ+   A K A +DL P    SPIRKGKRR+KSLIFGTCIC
Subjt:  LLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

XP_038881451.1 uncharacterized protein LOC120072973 [Benincasa hispida]3.1e-21961.03Show/hide
Query:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVISKS-TDRTKEDNEVQT
        MGNEMGNNNTSE REEEKAK+E PEKSL EGG NEV A EVA   +KEAR  S +ADR NGD HVTEKEE KN  C   +FQ++ K   DRTKEDNEV  
Subjt:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVISKS-TDRTKEDNEVQT

Query:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEE-GEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS
        K KE +  EF+ +EN  +GN+ EHEKQ SN+KEE GE+GSNLNT  L+L EP LEKI+DFK  Q+ELL   AESL+EDSNK P+ CK+ L LSLN   HS
Subjt:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEE-GEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS

Query:  ADCTMADSENTTNMNQISETVRDTDKENDGDRGR----DLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQES
        ADC MADSE  TNM+Q+ +T RDTD+ENDG++ +    D+ ++ ++ P         +S+ SP         D++S Q+E   T  + + GSSD      
Subjt:  ADCTMADSENTTNMNQISETVRDTDKENDGDRGR----DLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQES

Query:  KYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAP
                                         + DTD+EKKK +L+EPR CH Y +PAAKNV+ K+E T  DLI H SS S  EESLV+ESPKS MQ P
Subjt:  KYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAP

Query:  EVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS
        EVE+RCTVL+EE Q+REE  GT T+VQDN PTQ KISNGVQ E NTIGSHSEN+AE+TE+ P+ VA  KN+ EAPGEDCE SDGEYLEI+EQG  IS LS
Subjt:  EVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS

Query:  VGNCKHKNDEKGETLKISIDNGHEAERRESE-ENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTET
        +G+ KHK+++KGET ++S +NGHE ERRE + E+L +PVLGFQ QI  +ETSI FQT +ST +SISA RQET T  EKS+ N S SP  I+TASAT TET
Subjt:  VGNCKHKNDEKGETLKISIDNGHEAERRESE-ENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTET

Query:  KPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLG
        KPSTN +D+QS   LPFSTFG E+QESPGRTSNES S+NSIG+IEMRKSPSFNIDIQ EGR G+T+KTPLLYQIKTIE+LPNL         EKRVV LG
Subjt:  KPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLG

Query:  RSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAIN
        RSDS+ SRPSFPGFAKE+EE  ME KAI+QNN A   KAA+DL P    SPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  RSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAIN

TrEMBL top hitse value%identityAlignment
A0A6J1C7N3 dentin matrix acidic phosphoprotein 1 isoform X11.5e-26770.4Show/hide
Query:  GFVELNMGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKE
        GFVELNMGNEMGNNNTSE REEE+ K EAPEKSLLE GGNEV A  VADF +KEARSGS+DADRVNG HHVTE+EEGKN    TA+FQV+S KSTDRT+ 
Subjt:  GFVELNMGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKE

Query:  DNEVQTKFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLS
        DNEVQT+FK+DET  F+ KENG DGN+  HEKQTSN+KEE   +KGSNLNTT L LDEPELEKISDFKSIQNEL DTK ESLV DS+K  + CK+ LGLS
Subjt:  DNEVQTKFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLS

Query:  LNYTYHSADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSP
        LN+T HSADC M+DSE +TNMNQ  +T RD DKE D +RG+   S  N       +P            K E ILD+ES+Q E P TK E MVGSSD SP
Subjt:  LNYTYHSADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSP

Query:  QESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSM
        QESKY L S  SL+H+E S   I T KSTNVD IGAET  +VEK+KD LIEPR CHE I PAAK V+ K+E+T TDL SH SS SQ +ESLV+ESPK +M
Subjt:  QESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSM

Query:  QAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAIS
        Q PEV+DRCTVL+EE QM+EEPSGTGT+VQDN PTQFKISNG+QEE +TIGSHSEN+A+ETE+SPDSV   KNQNEAPGED E SDGEYLEI EQG+AI 
Subjt:  QAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAIS

Query:  TLS-VGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATF
        TLS VG+C  KN+E GE  +IS + G EAERRESE++LS+PVLGFQPQIHLQETSI FQ+ + TDESI A RQETET +EKS++N   S      ASAT 
Subjt:  TLS-VGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATF

Query:  TETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVV
        TETKPSTN ID+QS  TLPFSTF E++QE+PGRTSNESNSD+SIG+IEMRKSPSFNIDIQ EGRAG+T+K PLLYQIKTIE+LPNL         EKRVV
Subjt:  TETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVV

Query:  TLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI
        TLGRSDS+KSRPSFPGFAKEKEE  ME+KAINQ+N   AKKAAED  P   T PIRKGKRRTKSLIFGTCICCATAI
Subjt:  TLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI

A0A6J1C898 dentin matrix acidic phosphoprotein 1 isoform X33.1e-25769.71Show/hide
Query:  EEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQTKFKEDETTEFNLKEN
        EE+ K EAPEKSLLE GGNEV A  VADF +KEARSGS+DADRVNG HHVTE+EEGKN    TA+FQV+S KSTDRT+ DNEVQT+FK+DET  F+ KEN
Subjt:  EEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQTKFKEDETTEFNLKEN

Query:  GLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHSADCTMADSENTTNM
        G DGN+  HEKQTSN+KEE   +KGSNLNTT L LDEPELEKISDFKSIQNEL DTK ESLV DS+K  + CK+ LGLSLN+T HSADC M+DSE +TNM
Subjt:  GLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHSADCTMADSENTTNM

Query:  NQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYALGSSFSLKHSEVSGR
        NQ  +T RD DKE D +RG+   S  N       +P            K E ILD+ES+Q E P TK E MVGSSD SPQESKY L S  SL+H+E S  
Subjt:  NQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYALGSSFSLKHSEVSGR

Query:  AIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREE
         I T KSTNVD IGAET  +VEK+KD LIEPR CHE I PAAK V+ K+E+T TDL SH SS SQ +ESLV+ESPK +MQ PEV+DRCTVL+EE QM+EE
Subjt:  AIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREE

Query:  PSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VGNCKHKNDEKGETLKI
        PSGTGT+VQDN PTQFKISNG+QEE +TIGSHSEN+A+ETE+SPDSV   KNQNEAPGED E SDGEYLEI EQG+AI TLS VG+C  KN+E GE  +I
Subjt:  PSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VGNCKHKNDEKGETLKI

Query:  SIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFS
        S + G EAERRESE++LS+PVLGFQPQIHLQETSI FQ+ + TDESI A RQETET +EKS++N   S      ASAT TETKPSTN ID+QS  TLPFS
Subjt:  SIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFS

Query:  TFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEK
        TF E++QE+PGRTSNESNSD+SIG+IEMRKSPSFNIDIQ EGRAG+T+K PLLYQIKTIE+LPNL         EKRVVTLGRSDS+KSRPSFPGFAKEK
Subjt:  TFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEK

Query:  EEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI
        EE  ME+KAINQ+N   AKKAAED  P   T PIRKGKRRTKSLIFGTCICCATAI
Subjt:  EEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI

A0A6J1CBC9 dentin matrix acidic phosphoprotein 1 isoform X25.9e-26470.17Show/hide
Query:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT
        MGNEMGNNNTSE REEE+ K EAPEKSLLE GGNEV A  VADF +KEARSGS+DADRVNG HHVTE+EEGKN    TA+FQV+S KSTDRT+ DNEVQT
Subjt:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT

Query:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYH
        +FK+DET  F+ KENG DGN+  HEKQTSN+KEE   +KGSNLNTT L LDEPELEKISDFKSIQNEL DTK ESLV DS+K  + CK+ LGLSLN+T H
Subjt:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEG--EKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYH

Query:  SADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYA
        SADC M+DSE +TNMNQ  +T RD DKE D +RG+   S  N       +P            K E ILD+ES+Q E P TK E MVGSSD SPQESKY 
Subjt:  SADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNELPPTKVEPMVGSSDGSPQESKYA

Query:  LGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVE
        L S  SL+H+E S   I T KSTNVD IGAET  +VEK+KD LIEPR CHE I PAAK V+ K+E+T TDL SH SS SQ +ESLV+ESPK +MQ PEV+
Subjt:  LGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVE

Query:  DRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VG
        DRCTVL+EE QM+EEPSGTGT+VQDN PTQFKISNG+QEE +TIGSHSEN+A+ETE+SPDSV   KNQNEAPGED E SDGEYLEI EQG+AI TLS VG
Subjt:  DRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA--KNQNEAPGEDCESSDGEYLEIAEQGMAISTLS-VG

Query:  NCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPS
        +C  KN+E GE  +IS + G EAERRESE++LS+PVLGFQPQIHLQETSI FQ+ + TDESI A RQETET +EKS++N   S      ASAT TETKPS
Subjt:  NCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPS

Query:  TNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSD
        TN ID+QS  TLPFSTF E++QE+PGRTSNESNSD+SIG+IEMRKSPSFNIDIQ EGRAG+T+K PLLYQIKTIE+LPNL         EKRVVTLGRSD
Subjt:  TNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSD

Query:  SDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI
        S+KSRPSFPGFAKEKEE  ME+KAINQ+N   AKKAAED  P   T PIRKGKRRTKSLIFGTCICCATAI
Subjt:  SDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAI

A0A6J1IBE1 uncharacterized protein LOC111473109 isoform X12.2e-21047.76Show/hide
Query:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT
        MGNEMGNNNTSE REEE+A+TE PE+SLLEGG NEV A EVADF +KEAR GSD AD++N +HHV EKEE KN  C TA+F V+S K  DRTKEDNEVQ 
Subjt:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT

Query:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNR-KEEGEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS
          KED+T EF+ +EN  DGN+ EHEKQ SN+ +EEGE  SNLNTT LALDEP+ EK SDFKS Q+ELL+ KAES +EDSNK P+ CK+ +GLS N TYHS
Subjt:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNR-KEEGEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS

Query:  ADCTMADSENTTNM-------------------------------------------------------------------------------NQISETV
        ADC MADSE TTNM                                                                               +Q+ +T 
Subjt:  ADCTMADSENTTNM-------------------------------------------------------------------------------NQISETV

Query:  RDTDK-----------------------------------------------------------------------------------------------
        RDT+K                                                                                               
Subjt:  RDTDK-----------------------------------------------------------------------------------------------

Query:  -------------------------------ENDGDRGR----DLESFQNEFPPTK--------------------------------------------
                                       ENDG RG+    D     +E P ++                                            
Subjt:  -------------------------------ENDGDRGR----DLESFQNEFPPTK--------------------------------------------

Query:  ------AEPMVGSSDGSPQEN-------------------KYEKILDVESSQNELPPTKVEPMVGSS-DGSPQESKYALGSSFSLKHSEVSGRAIVTDKS
              A+ ++ +   + +EN                   K E   D++S  +ELP TK E + GSS DGS QESKY  GS  SL+H+E +G  + T+K 
Subjt:  ------AEPMVGSSDGSPQEN-------------------KYEKILDVESSQNELPPTKVEPMVGSS-DGSPQESKYALGSSFSLKHSEVSGRAIVTDKS

Query:  TNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREEPSGTGTI
        T+VD  GAE +TD+EKKK ++IEPR+CHEY MPAAK V+ K+E+   DLI H +S SQPEES+V+E PKS MQ PE E+RCTVL+E  Q+REE  GT TI
Subjt:  TNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREEPSGTGTI

Query:  VQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVAKNQNEAPGEDCESSDGEYLEIAEQGMAISTLSVGNCKHKNDEKGETLKISIDNGHEAER
        VQDN PTQ KISN VQEE NTI SHSE NA     + +SV +NQNE PGEDCE SDGEYLEI+EQ M +   S+G+CK KN   GET +IS + GHE ER
Subjt:  VQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVAKNQNEAPGEDCESSDGEYLEIAEQGMAISTLSVGNCKHKNDEKGETLKISIDNGHEAER

Query:  RESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFSTFGEENQESP
        RE +ENL +P+LG QPQ H +E SI FQT +STDESIS LRQ TE A EKS+KNSS SPH I+TASATFTETKPSTN ID+Q+  TLP STF EENQESP
Subjt:  RESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFSTFGEENQESP

Query:  GRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEKEEAHMEMKAI
        GRTSNESNSDNS+G+IEMRKSPSFNIDIQ EG+  +T+K PLLYQIKTIE+L NL         EKRVV LGRS+S+KSRPSFPGFAKEKE++ ME  AI
Subjt:  GRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEKEEAHMEMKAI

Query:  NQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAIN
        NQ+   A K A +DL P    SPIRKGKRR+KSLIFGTCICCATAIN
Subjt:  NQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAIN

A0A6J1IJ54 uncharacterized protein LOC1114747326.0e-21649.8Show/hide
Query:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT
        MGNEMGNNNTSE REEE+A+TE PE+SLLEGG NEV A EVADF +KEAR GSD AD++N +HHV EKEE KN  C TA+F V+S K  DRTKEDNEVQ 
Subjt:  MGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSDDADRVNGDHHVTEKEEGKNGGCYTAKFQVIS-KSTDRTKEDNEVQT

Query:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNR-KEEGEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS
          KED+T EF+ +EN  DGN+ EHEKQ SN+ +EEGE  SNLNTT LALDEP+ EK SDFKS Q+ELL+ KAES +EDSNK P+ CK+ LGLS N TYHS
Subjt:  KFKEDETTEFNLKENGLDGNKQEHEKQTSNR-KEEGEKGSNLNTTKLALDEPELEKISDFKSIQNELLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHS

Query:  ADCTMADSENTTNM-------------------------------------------NQISETVRDTDKENDGDRGR-----------------------
        ADC MADSE TTNM                                           +Q+ +T RD DKE+D +RG+                       
Subjt:  ADCTMADSENTTNM-------------------------------------------NQISETVRDTDKENDGDRGR-----------------------

Query:  -------DLESFQ----NEFPPTKAEP---------------------------------MVGSSDGSPQEN----------------------------
               D+E  +    N       +P                                 M  +S+    EN                            
Subjt:  -------DLESFQ----NEFPPTKAEP---------------------------------MVGSSDGSPQEN----------------------------

Query:  ---------------------------------------------------------------------------------------------------K
                                                                                                           K
Subjt:  ---------------------------------------------------------------------------------------------------K

Query:  YEKILDVESSQNELPPTKVEPMVGSS-DGSPQESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKK
         E   D++S  +ELP TK E + GSS DGS QESKY  GS  SL+H+E +G  + T+K T+VD  GAE +TD+EKKK ++IEPR+CHEY MPAAK V+ K
Subjt:  YEKILDVESSQNELPPTKVEPMVGSS-DGSPQESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKK

Query:  EEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA
        +E+   DLI H +S SQPEES+V+E PKS MQ PE E+RCTVL+E  Q+REE  GT TIVQDN PTQ KISN VQEE NTI SHSE NA     + +SV 
Subjt:  EEQTTTDLISHKSSGSQPEESLVVESPKSSMQAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVA

Query:  KNQNEAPGEDCESSDGEYLEIAEQGMAISTLSVGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALR
        +NQNE PGEDCE SDGEYLEI+EQ M +   S+G+CK KN   GET +IS + GHE ERRE +ENL +P+LG QPQ H +E SI FQT +STDESIS LR
Subjt:  KNQNEAPGEDCESSDGEYLEIAEQGMAISTLSVGNCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALR

Query:  QETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTP
        Q TE A EKS+KNSS SPH I+TASATFTETKPSTN ID+Q+  TLP STF EENQESPGRTSNESNSDNS+G+IEMRKSPSFNIDIQ EG+  +T+K P
Subjt:  QETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSARTLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTP

Query:  LLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCIC
        LLYQIKTIE+L NL         EKRVV LGRS+S+KSRPSFPGFAKEKE++ ME  AINQ+   A K A +DL P    SPIRKGKRR+KSLIFGTCIC
Subjt:  LLYQIKTIENLPNLH--------EKRVVTLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G14650.1 unknown protein7.7e-0637.4Show/hide
Query:  SPSFNIDIQIEGRAGDT-DKTPLLYQIKT-IENLPNLHEKRVVTLGRSDSDKSRPS--FPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPI
        +PSF+  ++IE R  ++ + TP+L + KT I       E++ V L RS+S KSR S    G  K+  ++  E K    N     KKA    SP S    +
Subjt:  SPSFNIDIQIEGRAGDT-DKTPLLYQIKT-IENLPNLHEKRVVTLGRSDSDKSRPS--FPGFAKEKEEAHMEMKAINQNNSAAAKKAAEDLSPTSQTSPI

Query:  RKGKRRTKSLIFGTCICCATAIN
        RK   R+KS + GTC+CC TA+N
Subjt:  RKGKRRTKSLIFGTCICCATAIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAAAGTACATATGGGGTATTGAATTTGTTTATGGAAAACGCAGCAAGAAGCCGCTTTACCCCAGAAATGGCAGCTCCAGTTAAGCCCATCTGGCTGTCTACAAC
AGATCAAGCTGTCAAATATCATCATCATCATCACAATGCAGGATTTGTTGAGTTGAATATGGGAAATGAGATGGGAAACAATAACACATCTGAGTTAAGAGAGGAAGAGA
AGGCAAAAACTGAAGCTCCAGAAAAATCTTTACTAGAAGGTGGGGGCAATGAAGTCAATGCATATGAAGTTGCAGACTTTTGCAAAAAAGAAGCAAGGTCGGGTTCTGAT
GATGCAGATAGAGTTAATGGAGATCATCATGTCACAGAGAAAGAGGAAGGAAAAAATGGAGGATGTTATACAGCAAAATTTCAAGTGATTTCAAAATCAACTGACAGAAC
TAAAGAGGACAATGAAGTCCAGACCAAATTTAAAGAAGATGAAACAACGGAGTTCAACTTGAAGGAGAATGGATTGGATGGAAATAAGCAAGAGCATGAAAAGCAGACTT
CGAACCGCAAAGAGGAAGGAGAGAAAGGTTCCAATTTGAACACGACAAAACTTGCATTAGATGAGCCAGAACTTGAAAAGATTTCAGATTTCAAGAGTATTCAGAATGAA
TTGCTTGATACTAAAGCCGAATCATTAGTAGAAGATAGCAACAAATTCCCCCAATTATGCAAGGAGGATTTGGGATTGAGTTTGAACTACACTTATCATTCGGCAGATTG
TACCATGGCAGATTCAGAAAACACCACCAATATGAATCAAATTAGTGAAACTGTTCGAGATACGGATAAGGAAAATGATGGAGATAGGGGAAGAGATCTCGAGTCTTTTC
AGAATGAATTTCCTCCAACCAAGGCTGAACCAATGGTAGGATCAAGTGATGGATCCCCACAAGAAAACAAGTATGAAAAGATTTTAGATGTTGAGTCTTCTCAGAATGAA
TTGCCTCCGACCAAGGTTGAACCAATGGTAGGATCAAGTGATGGATCCCCACAAGAAAGCAAGTATGCTTTAGGATCTAGTTTTAGTTTGAAACACTCTGAGGTCAGTGG
TCGTGCTATAGTTACAGACAAATCGACTAATGTGGATCGTATCGGTGCAGAGACAGACACAGATGTGGAGAAGAAAAAAGATGAGTTAATAGAGCCAAGGTCATGCCATG
AATATATAATGCCAGCTGCCAAAAATGTCAATAAAAAAGAAGAACAAACTACGACGGATTTGATCAGTCATAAGTCCTCTGGTTCACAACCAGAAGAGAGTCTGGTGGTA
GAATCACCCAAGTCATCTATGCAAGCTCCTGAGGTTGAAGACAGATGCACGGTTTTGGAAGAAGAAACCCAAATGAGAGAAGAACCTTCGGGAACTGGAACAATCGTTCA
AGACAATTGTCCAACTCAATTTAAGATTTCAAATGGGGTCCAAGAGGAATTGAATACGATTGGGTCACACTCCGAGAACAATGCAGAAGAAACTGAGATTTCTCCTGATT
CTGTTGCCAAGAACCAGAACGAGGCTCCTGGAGAGGATTGCGAGAGTTCAGATGGAGAATATCTGGAAATTGCTGAACAAGGTATGGCTATTTCAACTTTATCTGTTGGA
AATTGCAAGCATAAGAATGATGAGAAGGGAGAGACGCTGAAAATTTCAATAGACAATGGGCATGAAGCGGAAAGAAGAGAATCGGAGGAAAATCTATCTAAACCTGTTTT
AGGGTTTCAACCTCAAATCCATCTACAGGAAACTTCAATCAAATTTCAAACTACTAAAAGTACAGATGAATCAATCTCAGCACTGAGACAAGAAACCGAGACAGCAATTG
AAAAATCCGAGAAAAATTCCTCAGGCTCTCCACATTGTATCCGAACAGCTTCAGCAACATTCACAGAAACTAAACCATCGACAAATCTGATCGACAAACAGAGTGCAAGA
ACACTTCCATTCTCGACATTTGGAGAAGAAAACCAAGAATCTCCGGGAAGGACAAGCAACGAATCGAATTCCGACAATTCAATCGGTAATATTGAGATGCGTAAATCGCC
TAGCTTTAACATAGATATCCAAATCGAAGGAAGAGCAGGAGATACGGATAAAACTCCATTGCTATACCAAATTAAGACAATCGAAAACTTACCAAATCTGCACGAGAAGC
GAGTTGTGACGTTGGGAAGAAGCGATTCCGACAAGTCGAGACCTTCTTTCCCAGGGTTTGCGAAAGAAAAAGAAGAAGCCCATATGGAAATGAAAGCAATTAATCAAAAT
AACTCAGCCGCCGCTAAGAAGGCAGCGGAAGACCTGTCACCGACGTCGCAGACTTCGCCGATACGCAAAGGGAAGCGCAGAACCAAATCTTTAATCTTTGGAACCTGCAT
CTGCTGTGCTACTGCAATCAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAAAGTACATATGGGGTATTGAATTTGTTTATGGAAAACGCAGCAAGAAGCCGCTTTACCCCAGAAATGGCAGCTCCAGTTAAGCCCATCTGGCTGTCTACAAC
AGATCAAGCTGTCAAATATCATCATCATCATCACAATGCAGGATTTGTTGAGTTGAATATGGGAAATGAGATGGGAAACAATAACACATCTGAGTTAAGAGAGGAAGAGA
AGGCAAAAACTGAAGCTCCAGAAAAATCTTTACTAGAAGGTGGGGGCAATGAAGTCAATGCATATGAAGTTGCAGACTTTTGCAAAAAAGAAGCAAGGTCGGGTTCTGAT
GATGCAGATAGAGTTAATGGAGATCATCATGTCACAGAGAAAGAGGAAGGAAAAAATGGAGGATGTTATACAGCAAAATTTCAAGTGATTTCAAAATCAACTGACAGAAC
TAAAGAGGACAATGAAGTCCAGACCAAATTTAAAGAAGATGAAACAACGGAGTTCAACTTGAAGGAGAATGGATTGGATGGAAATAAGCAAGAGCATGAAAAGCAGACTT
CGAACCGCAAAGAGGAAGGAGAGAAAGGTTCCAATTTGAACACGACAAAACTTGCATTAGATGAGCCAGAACTTGAAAAGATTTCAGATTTCAAGAGTATTCAGAATGAA
TTGCTTGATACTAAAGCCGAATCATTAGTAGAAGATAGCAACAAATTCCCCCAATTATGCAAGGAGGATTTGGGATTGAGTTTGAACTACACTTATCATTCGGCAGATTG
TACCATGGCAGATTCAGAAAACACCACCAATATGAATCAAATTAGTGAAACTGTTCGAGATACGGATAAGGAAAATGATGGAGATAGGGGAAGAGATCTCGAGTCTTTTC
AGAATGAATTTCCTCCAACCAAGGCTGAACCAATGGTAGGATCAAGTGATGGATCCCCACAAGAAAACAAGTATGAAAAGATTTTAGATGTTGAGTCTTCTCAGAATGAA
TTGCCTCCGACCAAGGTTGAACCAATGGTAGGATCAAGTGATGGATCCCCACAAGAAAGCAAGTATGCTTTAGGATCTAGTTTTAGTTTGAAACACTCTGAGGTCAGTGG
TCGTGCTATAGTTACAGACAAATCGACTAATGTGGATCGTATCGGTGCAGAGACAGACACAGATGTGGAGAAGAAAAAAGATGAGTTAATAGAGCCAAGGTCATGCCATG
AATATATAATGCCAGCTGCCAAAAATGTCAATAAAAAAGAAGAACAAACTACGACGGATTTGATCAGTCATAAGTCCTCTGGTTCACAACCAGAAGAGAGTCTGGTGGTA
GAATCACCCAAGTCATCTATGCAAGCTCCTGAGGTTGAAGACAGATGCACGGTTTTGGAAGAAGAAACCCAAATGAGAGAAGAACCTTCGGGAACTGGAACAATCGTTCA
AGACAATTGTCCAACTCAATTTAAGATTTCAAATGGGGTCCAAGAGGAATTGAATACGATTGGGTCACACTCCGAGAACAATGCAGAAGAAACTGAGATTTCTCCTGATT
CTGTTGCCAAGAACCAGAACGAGGCTCCTGGAGAGGATTGCGAGAGTTCAGATGGAGAATATCTGGAAATTGCTGAACAAGGTATGGCTATTTCAACTTTATCTGTTGGA
AATTGCAAGCATAAGAATGATGAGAAGGGAGAGACGCTGAAAATTTCAATAGACAATGGGCATGAAGCGGAAAGAAGAGAATCGGAGGAAAATCTATCTAAACCTGTTTT
AGGGTTTCAACCTCAAATCCATCTACAGGAAACTTCAATCAAATTTCAAACTACTAAAAGTACAGATGAATCAATCTCAGCACTGAGACAAGAAACCGAGACAGCAATTG
AAAAATCCGAGAAAAATTCCTCAGGCTCTCCACATTGTATCCGAACAGCTTCAGCAACATTCACAGAAACTAAACCATCGACAAATCTGATCGACAAACAGAGTGCAAGA
ACACTTCCATTCTCGACATTTGGAGAAGAAAACCAAGAATCTCCGGGAAGGACAAGCAACGAATCGAATTCCGACAATTCAATCGGTAATATTGAGATGCGTAAATCGCC
TAGCTTTAACATAGATATCCAAATCGAAGGAAGAGCAGGAGATACGGATAAAACTCCATTGCTATACCAAATTAAGACAATCGAAAACTTACCAAATCTGCACGAGAAGC
GAGTTGTGACGTTGGGAAGAAGCGATTCCGACAAGTCGAGACCTTCTTTCCCAGGGTTTGCGAAAGAAAAAGAAGAAGCCCATATGGAAATGAAAGCAATTAATCAAAAT
AACTCAGCCGCCGCTAAGAAGGCAGCGGAAGACCTGTCACCGACGTCGCAGACTTCGCCGATACGCAAAGGGAAGCGCAGAACCAAATCTTTAATCTTTGGAACCTGCAT
CTGCTGTGCTACTGCAATCAATTGA
Protein sequenceShow/hide protein sequence
MMKSTYGVLNLFMENAARSRFTPEMAAPVKPIWLSTTDQAVKYHHHHHNAGFVELNMGNEMGNNNTSELREEEKAKTEAPEKSLLEGGGNEVNAYEVADFCKKEARSGSD
DADRVNGDHHVTEKEEGKNGGCYTAKFQVISKSTDRTKEDNEVQTKFKEDETTEFNLKENGLDGNKQEHEKQTSNRKEEGEKGSNLNTTKLALDEPELEKISDFKSIQNE
LLDTKAESLVEDSNKFPQLCKEDLGLSLNYTYHSADCTMADSENTTNMNQISETVRDTDKENDGDRGRDLESFQNEFPPTKAEPMVGSSDGSPQENKYEKILDVESSQNE
LPPTKVEPMVGSSDGSPQESKYALGSSFSLKHSEVSGRAIVTDKSTNVDRIGAETDTDVEKKKDELIEPRSCHEYIMPAAKNVNKKEEQTTTDLISHKSSGSQPEESLVV
ESPKSSMQAPEVEDRCTVLEEETQMREEPSGTGTIVQDNCPTQFKISNGVQEELNTIGSHSENNAEETEISPDSVAKNQNEAPGEDCESSDGEYLEIAEQGMAISTLSVG
NCKHKNDEKGETLKISIDNGHEAERRESEENLSKPVLGFQPQIHLQETSIKFQTTKSTDESISALRQETETAIEKSEKNSSGSPHCIRTASATFTETKPSTNLIDKQSAR
TLPFSTFGEENQESPGRTSNESNSDNSIGNIEMRKSPSFNIDIQIEGRAGDTDKTPLLYQIKTIENLPNLHEKRVVTLGRSDSDKSRPSFPGFAKEKEEAHMEMKAINQN
NSAAAKKAAEDLSPTSQTSPIRKGKRRTKSLIFGTCICCATAIN