; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G011430 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G011430
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptioncwf21 domain-containing protein
Genome locationchr08:20066962..20079933
RNA-Seq ExpressionLsi08G011430
SyntenyLsi08G011430
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR007581 - Endonuclease V
IPR013170 - mRNA splicing factor Cwf21 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064984.1 dentin sialophosphoprotein-like [Cucumis melo var. makuwa]0.0e+0074.93Show/hide
Query:  KKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGSCQPGDPTLGLVGEAKKMYNGIGLQTPRGSGTNGYIQTNKFF
        ++H  +D+   D+SS +D   E+ K ++ +R N   + + + +Y  +         SG              ++ MYNGIGLQTPRGSGTNGYIQTNKFF
Subjt:  KKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGSCQPGDPTLGLVGEAKKMYNGIGLQTPRGSGTNGYIQTNKFF

Query:  VRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEAA--SEEKDGASAIVLADKRYSSSPL
        VRPKTGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKLREARE LEAA  SEEKDG+SAIVLADK       
Subjt:  VRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEAA--SEEKDGASAIVLADKRYSSSPL

Query:  NDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDD
                         RVSDTQTHQIAARKEEQMKT RAALGLGS DD EQVKEEISDPSR+RREGQN+DIKRHEKSEHSFLDRELNWK+RGTEDQ+DD
Subjt:  NDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDD

Query:  KDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRS
        KD +K  SKELKGHQKD+KRRPKDDSS TDSGE KGTKKNLRDSRR DSES+ D DV NKYVASRKSKKNR+HDSDDSS TDSGGE K TKKH R+KR+ 
Subjt:  KDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRS

Query:  DPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGS
        DPESD DSD DQKY+TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+TRHD  DSDSFTDGD+ GM SH+KGS
Subjt:  DPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGS

Query:  GRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRR
        GRH+SQKVKK RS+KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR GKS VDSESD EKSRK+PKKDV RR
Subjt:  GRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRR

Query:  RHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSS
        RHDIDDEKSG      DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG+D+HKRAKKYSSGDGF+LEKG K S
Subjt:  RHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSS

Query:  SGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGK
        SGA  RGKGNLNH EGRRHNTDDKS EEEGEY GRS KIATK K+DAKRQHDD++NSDDSLAV      KHKRAKKYSSSDDSD+E GVKS+ GARERGK
Subjt:  SGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGK

Query:  RNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKI
           N  DGLDKFKKDSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DSESSRR+ SGRYDETRD RYR D KI
Subjt:  RNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKI

Query:  DSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD
        DSESNTRSRYSAH+EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKD
Subjt:  DSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD

Query:  YESRESTRDRDDSRKRAKYDSRSSRRDNY
        Y   ESTRDR+DSRKRAKY+SRSSR DN+
Subjt:  YESRESTRDRDDSRKRAKYDSRSSRRDNY

KAG6598821.1 Endonuclease V, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0056.85Show/hide
Query:  IEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPRCFGSSGP
        IEAQDLLKKKLIKEDE EG  D LKY+GGVDISFLKEDSSVACGTLVVL+LQTLQVVY+DFSLVTVQVPYVPGFLAFRE                     
Subjt:  IEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPRCFGSSGP

Query:  SSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAGFGLASHL
                                                 P+ +     +K  +   +  L++ +   I+                     GFGLASHL
Subjt:  SSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAGFGLASHL

Query:  GVLANLPTIGIGKNLHHVDGLTQSSVRQLLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQADIRS
        GVLANLPTIGIGKNLHHVDGLT S VR+LLSE KNNDS++TL+G SGCIWGVAMRST DSLKPIY+SIGHRVSL+TAIRIVK TCK+RVPEPIRQADI  
Subjt:  GVLANLPTIGIGKNLHHVDGLTQSSVRQLLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQADIRS

Query:  REYLRKLQMGIELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGSCQPGDPTLGLVGEAKKMYNGIGLQTP
                 G+     K GT                    T +++R                       S+ G  +  +P   LVGEA +          
Subjt:  REYLRKLQMGIELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGSCQPGDPTLGLVGEAKKMYNGIGLQTP

Query:  RGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEAA--SEEKDGA
                           TGKVAE++RGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYT DEIS+KL+EARETLEAA  SEEKDG 
Subjt:  RGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEAA--SEEKDGA

Query:  SAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDR
        SAIVLADK                        +VSDTQ+HQIAARKEEQMKT RAALGL SS+D+EQV E ISDP+RNRREGQN+DIKRHEKSEHSFLDR
Subjt:  SAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDR

Query:  ELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDS
        ELNWKK G+ED  DDK D+KRVSKELKGH KDR RRPKDDSS  DS GE  KGTKKNLRD+RRNDSESD ++D  +KY  SRKSKKNR+HDSD SSDTDS
Subjt:  ELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDS

Query:  GGERKGTKKHMRDKRRSDPESDP-----------------------------------------------------------------------------
        GGERKGTKKH+RD RR  P+ DP                                                                             
Subjt:  GGERKGTKKHMRDKRRSDPESDP-----------------------------------------------------------------------------

Query:  -----------------------------DSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTR
                                     DS+FDQK+ITSRKHKKNRRHDSD SS+TDSGGEHK+TKK++KNN+RD  SD D+D+DKKY TSKKQ KN  
Subjt:  -----------------------------DSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTR

Query:  HDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRI
         D  DSDS  D  EFGMGSH+KGSGR KSQKV KK RSRKQES+DESNSDSGIDDK RQLKHKNQHGKRYGV+SDSSD DSSDSDV R KS  RY SKR 
Subjt:  HDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRI

Query:  GKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGN
        GKS VDSESDSEK RKHPK DVGRRRHD D+++SG      DEI KR R +RHN+DD+SEEGEYFG+SG+ ATKG IAAKR+HDDSD SDDS  VDR+GN
Subjt:  GKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGN

Query:  DKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKK
        DK KRAKK+SSGDG D +KGVKSS GA  RGKG+ NH++G                                                    D+   A K
Subjt:  DKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKK

Query:  YSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSE
         +S                               K + DS++EFN A+Q   TMKSKRK DE GE+EQQ E++ R     S RESDFHGDPKKDFKNDSE
Subjt:  YSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSE

Query:  SSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHER
        SSRRA SGRY+E RD RYR DPKIDSESN RSRYSAH+EDDDRKSTRTGSRYTEETEHGSRH+ KANESHHRSRTDQD EE KRH   RYEE RGRKHER
Subjt:  SSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHER

Query:  DDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRD
        D+G+KSSRE ERGEYQPSSRLRSEKDYE++ESTRDRDD RKRAKYDSRSSRRD
Subjt:  DDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRD

XP_004138875.1 dentin sialophosphoprotein [Cucumis sativus]0.0e+0077.91Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYT  EISEKLREARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA

Query:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH
        A  SEEKDG+SAIVLADK                        RVSDTQTHQIAARKEEQMKT RAALGLGS DD+EQVK+EISDPSRNRREGQN+D+KRH
Subjt:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH

Query:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS
        EKSEHSFLDR+LNWKKRGTEDQYDDKD +K  SKE+K  QKD+KRR KDDSS TDSGERKGTKKNLRDSRRNDSESD D DV NKYVASR SKKNR+HDS
Subjt:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS

Query:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT
        DDSS+TDSGGE K TKKH R+KR+ + E+D DSD DQKY+TSRKHKKNRRHDSDDSS+TDS GEHKKTKK+++NNQR HGSD D+DVDKK+TSKKQ K+T
Subjt:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT

Query:  RHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKR
        RHD   SDSFTDGD+ GM SH KKGSGRH+S KVKK RSRKQ+S+DE+NSDSGI+DKHRQLKHK+QHGKRYG ESDSSDHDSSDSDV R KS  RY SK 
Subjt:  RHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKR

Query:  IGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRK
         GKS V+SESDSEKSRK+P KD  RRRHDIDDEKSG      DE+ KR RG+RHN DD S EEGEYFGRSG+ ATKGKI AKRQH DS+NSDDSL V RK
Subjt:  IGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRK

Query:  GNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRA
        G+D HK+AKKY SGDGF+LEKG K SSGA  RGKGNL+H+EGRRHNTDDKS EEEGEY GRS KIATK KID KRQHDD++NSDDSLAV      KHKRA
Subjt:  GNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRA

Query:  KKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKND
        KKY SSDDSD+E GVKS+ GARERGK   NH DGL KFKKDSINE NHASQ  D M  KRK DEG E EQ+ ES+ R            + DPKKD K+D
Subjt:  KKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKND

Query:  SESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHER
        SESSRR+ SGRYD+TRD RYR D KIDSESNTRSRYSA  EDDDRKS RTGSRY+EETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHER
Subjt:  SESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHER

Query:  DDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY
        D+GLKSSREVERGEYQPSSR RSEKDYE+RESTRDR+DSRKR KY+SRSSRRDN+
Subjt:  DDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY

XP_008445109.1 PREDICTED: dentin sialophosphoprotein-like [Cucumis melo]0.0e+0079.04Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKLREARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA

Query:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH
        A  SEEKDG+SAIVLADK                        RVSDTQTHQIAARKEEQMKT RAALGLGS  D EQVKEEISDPSR+RREGQN+DIKRH
Subjt:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH

Query:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS
        EKSEHSFLDRELNWK+RGTEDQ+DDKD +K  SKELKGHQKD+KRRPKDD S  DSGE KGTKKNLRDSRR DSESD D DV NKYVASRKSKKNR+HDS
Subjt:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS

Query:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT
        DDSS TDSGGE K TKKH R+KR+ DPESD DSD DQKY+TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+T
Subjt:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT

Query:  RHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRI
        RHD  DSDSFTDGD+ GM SH+KGSGRH+SQKVKK RS+KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR 
Subjt:  RHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRI

Query:  GKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKG
        GKS VDSESD EKSRK+PKKD  RRRHDIDDEKSG      DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG
Subjt:  GKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKG

Query:  NDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAK
        +D+HKRAKKYSSGDGF+LEKG K SSGA  RGKGNLNH EGRRHNTDDKS EEEGEY GRS K+ATK K+DAKRQHDD++NSDDSLAV      KHKRAK
Subjt:  NDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAK

Query:  KYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDS
        KYSSSDDSD+E GVKS+ GARERGK   N  DGLDKFKKDSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DS
Subjt:  KYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDS

Query:  ESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERD
        ESSRR+ SGRYDETRD RYR D KIDSESNTRSRYSAH+EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD
Subjt:  ESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERD

Query:  DGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY
        +GLKSSREVERGEYQPSSR RSEKDY   ESTRDR+DSRKRAKY+SRSSR DN+
Subjt:  DGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY

XP_038884695.1 dentin sialophosphoprotein-like [Benincasa hispida]0.0e+0072.03Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT DEISEKLREARETLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA

Query:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH
        A  SEEKDG SAIVLADK                        RVSDTQTHQIAARKEEQMKT RAALGLGSS DTEQVKEEISDPSR RREGQN+DIKRH
Subjt:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH

Query:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS
        EKSEHSFLDRELNWKK G EDQYDDKDD+KR+SKELKGHQK RKRRPKDDSS TDS +R     NLRDSRRNDSESD D+DVG+KYVASR   KNR+HDS
Subjt:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS

Query:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYT-SKKQAKN
        DDSSDTDSGGERKGTKKH+RDKRR DPESDPDSDFDQKYITSRKHKKNRRHD D+SS+TDSGGEHKKTKKNM+NN+R HGSDP +D+DKKYT SKK  KN
Subjt:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYT-SKKQAKN

Query:  TRHDRYDSDSFTDGDEFGM-GSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSK
         RHD  DSDS TDGDEFGM GSHKKGS RHKSQKVK  RSRKQES+DESNSDSGID+K RQLKH+NQHGKRYGVESDSSDHDSSDSDV  KKS  RYDSK
Subjt:  TRHDRYDSDSFTDGDEFGM-GSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSK

Query:  RIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRK
        R GKS VDSES+SEKSRKH KKD GR RHDID+EKSG     G EI KR RG+ +N DD S                                       
Subjt:  RIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRK

Query:  GNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRA
                                                            EEEGEY GRS KIATKGKIDAKRQHDDN+NSDDSLAV RKGNDKHKRA
Subjt:  GNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRA

Query:  KKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLES--------------------------
        KK SS DDSD+E GVK+SGGARERGK +LNH DGL+KFKKDSINEFNHASQ  DTM SKRKFDE G+NEQQLES                          
Subjt:  KKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLES--------------------------

Query:  -------------RDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHG
                     RD R+RED K +SDFHG+PKK F+NDSESSRRA SGRYDETRD RYR DPKIDSESN RSRYS  DEDDDRK+T+TGSR+TEETEHG
Subjt:  -------------RDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHG

Query:  SRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY
        SRHHRKANESHHRSRT +DTEEEKRH RYEEPRGRKHER++GLKS REVERGEYQPSSRLRSEKDYE+RESTRDRDDSRKRAKY+SRSSRRDN+
Subjt:  SRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY

TrEMBL top hitse value%identityAlignment
A0A0A0LQ00 cwf21 domain-containing protein0.0e+0077.91Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYT  EISEKLREARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA

Query:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH
        A  SEEKDG+SAIVLADK                        RVSDTQTHQIAARKEEQMKT RAALGLGS DD+EQVK+EISDPSRNRREGQN+D+KRH
Subjt:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH

Query:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS
        EKSEHSFLDR+LNWKKRGTEDQYDDKD +K  SKE+K  QKD+KRR KDDSS TDSGERKGTKKNLRDSRRNDSESD D DV NKYVASR SKKNR+HDS
Subjt:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS

Query:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT
        DDSS+TDSGGE K TKKH R+KR+ + E+D DSD DQKY+TSRKHKKNRRHDSDDSS+TDS GEHKKTKK+++NNQR HGSD D+DVDKK+TSKKQ K+T
Subjt:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT

Query:  RHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKR
        RHD   SDSFTDGD+ GM SH KKGSGRH+S KVKK RSRKQ+S+DE+NSDSGI+DKHRQLKHK+QHGKRYG ESDSSDHDSSDSDV R KS  RY SK 
Subjt:  RHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKR

Query:  IGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRK
         GKS V+SESDSEKSRK+P KD  RRRHDIDDEKSG      DE+ KR RG+RHN DD S EEGEYFGRSG+ ATKGKI AKRQH DS+NSDDSL V RK
Subjt:  IGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRK

Query:  GNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRA
        G+D HK+AKKY SGDGF+LEKG K SSGA  RGKGNL+H+EGRRHNTDDKS EEEGEY GRS KIATK KID KRQHDD++NSDDSLAV      KHKRA
Subjt:  GNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRA

Query:  KKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKND
        KKY SSDDSD+E GVKS+ GARERGK   NH DGL KFKKDSINE NHASQ  D M  KRK DEG E EQ+ ES+ R            + DPKKD K+D
Subjt:  KKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKND

Query:  SESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHER
        SESSRR+ SGRYD+TRD RYR D KIDSESNTRSRYSA  EDDDRKS RTGSRY+EETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHER
Subjt:  SESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHER

Query:  DDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY
        D+GLKSSREVERGEYQPSSR RSEKDYE+RESTRDR+DSRKR KY+SRSSRRDN+
Subjt:  DDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY

A0A1S3BBX0 dentin sialophosphoprotein-like0.0e+0079.04Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKLREARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA

Query:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH
        A  SEEKDG+SAIVLADK                        RVSDTQTHQIAARKEEQMKT RAALGLGS  D EQVKEEISDPSR+RREGQN+DIKRH
Subjt:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH

Query:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS
        EKSEHSFLDRELNWK+RGTEDQ+DDKD +K  SKELKGHQKD+KRRPKDD S  DSGE KGTKKNLRDSRR DSESD D DV NKYVASRKSKKNR+HDS
Subjt:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDS

Query:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT
        DDSS TDSGGE K TKKH R+KR+ DPESD DSD DQKY+TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+T
Subjt:  DDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNT

Query:  RHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRI
        RHD  DSDSFTDGD+ GM SH+KGSGRH+SQKVKK RS+KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR 
Subjt:  RHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRI

Query:  GKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKG
        GKS VDSESD EKSRK+PKKD  RRRHDIDDEKSG      DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG
Subjt:  GKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKG

Query:  NDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAK
        +D+HKRAKKYSSGDGF+LEKG K SSGA  RGKGNLNH EGRRHNTDDKS EEEGEY GRS K+ATK K+DAKRQHDD++NSDDSLAV      KHKRAK
Subjt:  NDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAK

Query:  KYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDS
        KYSSSDDSD+E GVKS+ GARERGK   N  DGLDKFKKDSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DS
Subjt:  KYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDS

Query:  ESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERD
        ESSRR+ SGRYDETRD RYR D KIDSESNTRSRYSAH+EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD
Subjt:  ESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERD

Query:  DGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY
        +GLKSSREVERGEYQPSSR RSEKDY   ESTRDR+DSRKRAKY+SRSSR DN+
Subjt:  DGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY

A0A5A7VCH8 Dentin sialophosphoprotein-like0.0e+0074.93Show/hide
Query:  KKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGSCQPGDPTLGLVGEAKKMYNGIGLQTPRGSGTNGYIQTNKFF
        ++H  +D+   D+SS +D   E+ K ++ +R N   + + + +Y  +         SG              ++ MYNGIGLQTPRGSGTNGYIQTNKFF
Subjt:  KKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGSCQPGDPTLGLVGEAKKMYNGIGLQTPRGSGTNGYIQTNKFF

Query:  VRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEAA--SEEKDGASAIVLADKRYSSSPL
        VRPKTGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKLREARE LEAA  SEEKDG+SAIVLADK       
Subjt:  VRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEAA--SEEKDGASAIVLADKRYSSSPL

Query:  NDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDD
                         RVSDTQTHQIAARKEEQMKT RAALGLGS DD EQVKEEISDPSR+RREGQN+DIKRHEKSEHSFLDRELNWK+RGTEDQ+DD
Subjt:  NDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDD

Query:  KDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRS
        KD +K  SKELKGHQKD+KRRPKDDSS TDSGE KGTKKNLRDSRR DSES+ D DV NKYVASRKSKKNR+HDSDDSS TDSGGE K TKKH R+KR+ 
Subjt:  KDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRS

Query:  DPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGS
        DPESD DSD DQKY+TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+TRHD  DSDSFTDGD+ GM SH+KGS
Subjt:  DPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGS

Query:  GRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRR
        GRH+SQKVKK RS+KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR GKS VDSESD EKSRK+PKKDV RR
Subjt:  GRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRR

Query:  RHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSS
        RHDIDDEKSG      DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG+D+HKRAKKYSSGDGF+LEKG K S
Subjt:  RHDIDDEKSG-----GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSS

Query:  SGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGK
        SGA  RGKGNLNH EGRRHNTDDKS EEEGEY GRS KIATK K+DAKRQHDD++NSDDSLAV      KHKRAKKYSSSDDSD+E GVKS+ GARERGK
Subjt:  SGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGK

Query:  RNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKI
           N  DGLDKFKKDSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DSESSRR+ SGRYDETRD RYR D KI
Subjt:  RNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKI

Query:  DSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD
        DSESNTRSRYSAH+EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKD
Subjt:  DSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD

Query:  YESRESTRDRDDSRKRAKYDSRSSRRDNY
        Y   ESTRDR+DSRKRAKY+SRSSR DN+
Subjt:  YESRESTRDRDDSRKRAKYDSRSSRRDNY

A0A6J1ESM6 dentin sialophosphoprotein-like5.5e-29763.05Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE++RGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYT DEIS+KL+EARETLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA

Query:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH
        A  SEEKDG SAIVLADK                        +VSDTQ+HQIAARKEEQMKT RAALGL SS+D+EQV E ISDP+RNRREGQN+DIKRH
Subjt:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH

Query:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKH
        EKSEHSFLDRELNWKK G+ED  DDK D+KRVSKELKGH KDR RRPKDDSS  DS GE  KGTKKNLRD+RRNDSESD ++D  +KY  SRKSKKNR+H
Subjt:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKH

Query:  DSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDD-------------------------------------------
        DSD SSDTDSGGERKGTKKH+RD RR  P+ DPDS+FDQKY TSRKHKKNRRHDSDD                                           
Subjt:  DSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDD-------------------------------------------

Query:  ---------------------------------------------------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-
                                                                       SS+TDSGGEHK+TKK++KNN+RD  SD D+D+DKKY 
Subjt:  ---------------------------------------------------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-

Query:  TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKK
        TSKKQ KN      DSDS  D  EFGMGSH+KGSGR KSQKV KK R RKQES+DESNSDSGIDDK RQLKHKNQHGKRYGV+SDSSD DSSDSDV R K
Subjt:  TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKK

Query:  SMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSGGDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTV
        S  RY SKR GKS VDSESDSEK RKHPKKDVGRRRHD D+++SG                                           D+S +SD+ +  
Subjt:  SMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSGGDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTV

Query:  DRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKH
              K +R                                   RRHN+DDKS EEEGEYFG+S KIATKG I AKR+HDD+D SDDS AVDRKGNDK 
Subjt:  DRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKH

Query:  KRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLD-----------KFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRE
        KRAKK+SS D SD + GVKSSGGARERGK + NH DGLD           K + D ++EFN A+Q   TMKSKRK DE GE+EQQ E++ R     S RE
Subjt:  KRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLD-----------KFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRE

Query:  SDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKR
        SDFHGDPKKDFKNDSESSRRA SGRY+ETRD RYR DPKIDSESN RSRYSAH+ED+DRKSTRTGSRYTEETEHGSRH+ KANESHHRSRTDQD EE KR
Subjt:  SDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKR

Query:  H--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRD
        H   RYEE RGRKHERD+G+KSSRE ERGEYQPSSRLRSEKDYE++ESTRDRDD RKRAKYDSRSSRRD
Subjt:  H--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRD

A0A6J1K7B6 dentin sialophosphoprotein-like6.3e-29362.56Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE++RGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYT DEIS+KL+EARETLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEA

Query:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH
        A  SEEKDG SAIVLADK                        +VSDTQ+HQIAARKEEQMKT RAALGL SS+D+EQV E ISDP+RNRREGQN+DIKR 
Subjt:  A--SEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRH

Query:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKH
        EKSEHSFLDRELNWK+ G+ED  DDK D+KRVSKELKGH KDR RRPKDDSS  DS GE  KGTKKNLRD+RR DSESD ++D  +KY  SRKSKKNR+H
Subjt:  EKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKH

Query:  DSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY-----------------------------------------------------ITSRKH
        DSD SSDTDSGGERKGTKKH+RD RR  P+ DPDS+FDQKY                                                     ITSRKH
Subjt:  DSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY-----------------------------------------------------ITSRKH

Query:  KKNRRHDSDD-----------------------------------------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-
        KKNRRHDSDD                                                     SS+TDSGGEHK+TKK++KNN+RD  SD D+D+DKKY 
Subjt:  KKNRRHDSDD-----------------------------------------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-

Query:  TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKK
        TSKKQ KN   D  DSDS  D  EFGMGSH+KGSGR KSQKV KK RSRKQES+DESNSDSGIDDK RQLK+KNQHGKRYGV+SDSSD DSSDSDV R K
Subjt:  TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKK

Query:  SMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSD
        S  RY SKR GKS VDSESDSEK RKHPKKDVGRRRHD D+++SG      DEI KR R +RHN+DD+SEEGEYFG+SG+ ATKG IAAKR+H+DSD SD
Subjt:  SMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSD

Query:  DSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRK
        DS  VDR+GNDK KRAKK+S GDG D +KGVKSS GA  RGKG+ NH++G                                                  
Subjt:  DSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRK

Query:  GNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGD
          D+   A K +S                               K + DS++EFN A+Q   TMKSKRK DE GE+EQQ E++ +     S RESDFHGD
Subjt:  GNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGD

Query:  PKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRY
        PKKDFKNDSESSRRA SGR+ ETRD RYR DPKIDSESN RSRYSAH+EDDDRKS RTGSRYTEETEHGSRH+ KANESHHRSRTDQD EE KR    RY
Subjt:  PKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRY

Query:  EEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRD
        EE RGRKHERD+G+KSSRE ERGEYQPSSRLRSEKDYE++ESTRDRDD RKRAKYDSRSSR D
Subjt:  EEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRD

SwissProt top hitse value%identityAlignment
Q8C9A2 Endonuclease V6.3e-3231.21Show/hide
Query:  QDLLKKKLIKED----ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPRCFGSSG
        Q  LK +++  D    + +     L+ +GGVD+SF+K DS  AC +LVVL    L+VVY+D  +V ++ PYV GFLAFRE                    
Subjt:  QDLLKKKLIKED----ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPRCFGSSG

Query:  PSSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAGFGLASH
                                                 +P  V     L+  E      +++ +   ++                   + GFG+A H
Subjt:  PSSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAGFGLASH

Query:  LGVLANLPTIGIGKNLHHVDGLTQSSVRQ----LLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ
        LGVL  LP IG+ K L  VDGL  +++ +    LL  G +   +I   G SG + G+A+RS   S KP+YVS+GHR+SL  A+R+    C++R+PEPIRQ
Subjt:  LGVLANLPTIGIGKNLHHVDGLTQSSVRQ----LLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ

Query:  ADIRSREYLRKL--QMGIELKWKKHGTEDQ
        ADIRSREY+R+   Q+G+    +K  ++ +
Subjt:  ADIRSREYLRKL--QMGIELKWKKHGTEDQ

Q8N8Q3 Endonuclease V2.3e-2930.9Show/hide
Query:  LKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPRCFGSSGPSSLGAFGEDEKESSSTVSTGECF
        L+ +GGVD+SF+K DS  AC +LVVL    L+VVY++  +V++  PYV GFLAFRE                                            
Subjt:  LKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPRCFGSSGPSSLGAFGEDEKESSSTVSTGECF

Query:  IACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQ
                         +P  +     L+  E      +++ +   ++                     GFG+A HLGVL +LP +G+ K L  VDGL  
Subjt:  IACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQ

Query:  SS-----VRQLLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQADIRSREYLRK
        ++     +R L + G +      L G SG + G+A+RS   S +P+Y+S+GHR+SL  A+R+    C++R+PEP+RQADI SRE++RK
Subjt:  SS-----VRQLLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQADIRSREYLRK

Arabidopsis top hitse value%identityAlignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).2.5e-4734.11Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLE
        MYNGIGLQT RGSGTNGY+QTNKFFVRP+  GK  +  +GFE+D+GTAG+SKKPNK ILEHDRKRQI LKL ILEDKL DQGY+  EI++KL EAR +LE
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLE

Query:  AASEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHE
        AA+           A++  S S                   +VS+TQTHQ+AARKE+QM+ FRAALGL    D +QV EE        REG    +K  E
Subjt:  AASEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHE

Query:  KSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKG------------HQKDRKRRPKDDSSGTD--SGERKGTKKNLRDSRRNDSESDPDNDVGNKYV
        + EHSFLDR+   KK   ++  D+KD + + SK+ +G             +K+ K+R  DDSS +D    +R+   K     R+ +SESD  +       
Subjt:  KSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKG------------HQKDRKRRPKDDSSGTD--SGERKGTKKNLRDSRRNDSESDPDNDVGNKYV

Query:  ASRKSKKNRKHDSDDSSDTDSGGER------KGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGS
                   DS+  SD+D G +R      K TKK  R KR    ES+     D K +  + HKK+    S+ S + +   +H +  +  +       S
Subjt:  ASRKSKKNRKHDSDDSSDTDSGGER------KGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGS

Query:  DPDNDVDKKYTSKKQAK---NTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGK-RYGVESD--
        +P+++ +K+   KK+       +  R D D   D  +       K + R       +++++KQ  S      +G+  K ++ +   +HGK +Y  +S   
Subjt:  DPDNDVDKKYTSKKQAK---NTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGK-RYGVESD--

Query:  --SSDHDSSDSDVARKKSM--DRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDI---DDEKSG------GDEIAKRCRGKRHNTDDESEEGEYFGR
          + D D S+++   +K +  + Y   R  K   D ++D+    ++   D  +R   I   DD   G      GD+   R R +R +  D+ EE ++ GR
Subjt:  --SSDHDSSDSDVARKKSM--DRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDI---DDEKSG------GDEIAKRCRGKRHNTDDESEEGEYFGR

Query:  SGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSG
          R    G+ A  ++ DD D          +G  ++  ++  SSG
Subjt:  SGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSG

AT4G31150.1 endonuclease V family protein1.2e-5442.77Show/hide
Query:  EAQDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHT
        E QD LKKKLI  D          EL    + LKY+GGVD+SF KEDSSVAC  LVVL+L +L+VV+ DFSL+ + VPYVPGFLAFRE            
Subjt:  EAQDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHT

Query:  PRCFGSSGPSSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEER
                                                          P+ +     ++  +   +  +++ +   I+                    
Subjt:  PRCFGSSGPSSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEER

Query:  AGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLS-EGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVP
         GFGLA HLGVLA+LPTIG+GKNLHHVDGL QS V+Q L  +   ++  ITL G SG  WGV  R T+ SLKPIYVS+GHR+SL++A+ +VKITCKYRVP
Subjt:  AGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLS-EGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVP

Query:  EPIRQADIRSREYLRKLQ
        EPIRQADIRSR YL+K Q
Subjt:  EPIRQADIRSREYLRKLQ

AT4G31150.2 endonuclease V family protein3.5e-5442.72Show/hide
Query:  QDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPR
        QD LKKKLI  D          EL    + LKY+GGVD+SF KEDSSVAC  LVVL+L +L+VV+ DFSL+ + VPYVPGFLAFRE              
Subjt:  QDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPR

Query:  CFGSSGPSSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAG
                                                        P+ +     ++  +   +  +++ +   I+                     G
Subjt:  CFGSSGPSSLGAFGEDEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAG

Query:  FGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLS-EGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEP
        FGLA HLGVLA+LPTIG+GKNLHHVDGL QS V+Q L  +   ++  ITL G SG  WGV  R T+ SLKPIYVS+GHR+SL++A+ +VKITCKYRVPEP
Subjt:  FGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLS-EGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEP

Query:  IRQADIRSREYLRKLQ
        IRQADIRSR YL+K Q
Subjt:  IRQADIRSREYLRKLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTCAAACTGGGTCGTCTGATTTGGGCCTGGACGCGCCCATCTGACCTTAACCCGGCCCGTTTCGAGACCGAGCCCACATCGGCCCATTTGTTTTTTGCTTGGCC
GGCCGGCAAGTATCTGGTAGGAACTAAACAATTCGCCGTTAATCAATTCGCCGGCACCGACCGGTTCCAACGTGTCGGGGGTATTCCAGTAATCGGATTGGTCGCAGAAT
CTTCAGGTATAGATGATGTCCGAAGAAGAGAAACAAGCGAAATCGTATTCCGAGGCGTCAACGACTTCCTCGGTCGAAATTCAGAACTGGATAGAGTAATCGAAGAAATT
TCCATTATAGAAGCGCAGGACTTGCTGAAGAAGAAACTAATCAAAGAGGATGAGTTGGAAGGGGAAGTGGATGATTTGAAGTATATCGGCGGCGTTGATATAAGCTTCTT
AAAGGAAGATTCATCAGTTGCGTGTGGTACGCTCGTGGTTTTGGATCTCCAAACTCTTCAAGTTGTCTATGATGATTTCTCTCTTGTTACCGTTCAAGTTCCTTATGTCC
CTGGCTTTCTAGCATTTAGAGAGAATTCATATATTCCCATGAATGTTAATGATCATACACCTCGTTGCTTTGGTTCTTCAGGCCCCAGTTCTCTTGGAGCTTTTGGAGAG
GATGAGAAAGAGAGCTCCTCAACTGTATCCACAGGTGAATGCTTCATAGCCTGTGCTGACTTTATTTATGCCACTCAAAAACAATATTTTCTCGCTATTCCCATGTCTGT
TTCTTCGCATGTTATGCTGAAATCATCTGAGGAATTGTCTTGGAATTGGCTGATGGTGTTCAACTGTCGATCTATTGTCTGTAGTAGTATCAACATTATATGCTATCTTA
GAACACTGACCATTCCATGGGAGGAACGTGCAGGTTTTGGTCTGGCCAGTCATCTTGGCGTTCTTGCTAATTTGCCTACGATTGGAATTGGCAAGAATCTGCATCATGTT
GATGGTCTTACCCAGTCCAGTGTGAGGCAACTTCTTTCCGAGGGCAAGAATAATGATTCTATAATAACTTTGAAGGGTATCTCTGGATGCATTTGGGGTGTGGCTATGAG
ATCTACTGTTGATTCATTGAAGCCCATATACGTTTCAATTGGTCATCGTGTTTCACTCAACACTGCCATCAGGATTGTTAAAATCACATGCAAATATCGTGTTCCAGAAC
CTATCAGACAGGCTGACATTAGGTCAAGAGAATACCTCCGGAAATTACAAATGGGAATAGAATTGAAGTGGAAAAAGCATGGCACTGAAGATCAGTACGATAGTGATGAT
TCTTCTGATACTGATTCTGGTGGAGAGGTCAAGAAAACCAAGAAGAATAAGAGAACTAATCAAAGAACAGCCAGTGATGTTGACAAGAAATACAGCTCAAAGAAGCAGAA
GAAAAACACAAATTCTGATTCAGGAAGTTGTCAACCAGGTGACCCAACCCTCGGGCTGGTTGGTGAAGCGAAGAAGATGTATAATGGTATTGGTTTACAAACTCCGAGAG
GGTCTGGCACTAATGGATATATTCAGACAAACAAGTTCTTTGTGAGGCCGAAGACCGGAAAGGTTGCTGAAAGCTCTAGAGGATTCGAAGAAGATCAGGGCACTGCTGGA
GTTTCAAAGAAACCTAATAAAGACATTCTCGAACACGACCGCAAGCGTCAGATCGAACTCAAACTTGTCATACTTGAGGACAAGCTTACTGACCAAGGTTATACCGTGGA
TGAAATTTCTGAGAAGTTAAGGGAGGCTCGCGAAACTTTGGAAGCTGCTTCAGAGGAAAAAGATGGAGCTTCTGCCATCGTACTTGCAGATAAGAGGTACTCCTCTTCTC
CTCTTAATGATATGAATGATGATGTAATTTATTTCTTGAGAAGTAATAACCCTATCAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATG
AAAACATTTAGAGCTGCTCTTGGGTTGGGTTCATCGGACGATACTGAACAGGTTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGAGAGGGTCAGAATTCTGATAT
TAAGCGTCATGAGAAGTCTGAACATTCTTTTTTGGACAGAGAATTGAACTGGAAAAAGCGTGGCACTGAAGATCAGTATGATGATAAGGATGACCAAAAAAGGGTTTCGA
AAGAGTTGAAAGGTCATCAGAAGGATAGAAAAAGAAGGCCCAAGGATGATTCTTCTGGCACCGATTCTGGTGAGCGTAAGGGAACCAAGAAGAACTTGAGAGACAGTAGA
AGGAATGATTCTGAAAGTGACCCTGACAATGATGTTGGCAACAAATATGTCGCCTCAAGGAAGTCTAAAAAAAATAGAAAGCATGATAGTGATGATTCTTCTGATACTGA
TTCTGGTGGTGAGCGCAAGGGAACGAAGAAGCACATGAGAGATAAACGGAGATCTGATCCTGAAAGTGACCCAGACAGTGATTTTGACCAGAAATATATCACCTCGAGGA
AGCATAAGAAGAACAGAAGGCATGATAGTGATGATTCTTCTAATACTGATTCTGGTGGAGAGCACAAGAAAACCAAGAAGAATATGAAAAATAATCAAAGAGATCATGGA
AGTGATCCCGACAATGATGTTGATAAGAAATACACCTCAAAGAAGCAGGCGAAAAACACAAGGCATGATAGGTATGATTCTGATTCATTTACAGACGGTGATGAGTTTGG
GATGGGCAGCCACAAGAAAGGATCGGGTAGACATAAAAGTCAAAAGGTGAAGAAGCATAGAAGCCGGAAACAGGAGTCTTCTGATGAATCCAATTCTGACAGTGGGATTG
ATGATAAACACAGGCAACTGAAGCACAAAAACCAGCATGGTAAAAGATATGGGGTAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGCTCGCAAGAAG
AGTATGGATAGGTATGACAGCAAACGTATAGGAAAGAGCATGGTAGATAGTGAATCTGATTCTGAGAAGTCAAGAAAGCATCCTAAGAAAGATGTTGGGAGACGCAGACA
TGATATTGATGATGAAAAAAGTGGTGGTGATGAAATAGCGAAGAGGTGCAGAGGTAAGAGGCACAATACTGATGATGAATCTGAAGAAGGTGAATATTTTGGTAGAAGTG
GTAGGACAGCCACAAAAGGAAAAATAGCTGCTAAAAGGCAACATGATGACAGTGATAATTCTGATGATAGCCTAACAGTTGATAGAAAGGGCAATGATAAACACAAGAGA
GCTAAGAAATATTCGTCGGGTGACGGTTTTGATCTAGAGAAGGGAGTAAAATCAAGCAGTGGAGCTCATGGAAGAGGAAAAGGGAACCTAAATCATTCAGAAGGTAGGAG
GCACAATACTGATGATAAATCTGAAGAAGAAGAAGGTGAATATTTTGGTAGAAGTGATAAGATAGCTACAAAAGGAAAAATAGATGCTAAAAGGCAACATGACGACAATG
ATAATTCTGATGATAGCCTAGCAGTTGATAGAAAGGGCAATGATAAACACAAGAGAGCTAAAAAATATTCATCGAGTGACGATTCTGATATAGAGAATGGAGTAAAATCA
AGTGGTGGAGCTCGTGAAAGGGGAAAAAGGAACTTAAATCATACAGATGGTTTGGACAAGTTTAAGAAAGATTCTATCAATGAGTTCAACCATGCAAGTCAACATATAGA
TACAATGAAAAGCAAGAGAAAGTTTGATGAAGGTGAAAATGAGCAGCAGCTAGAGTCAAGGGATCGACGGCACAGGGAAGACTCCAAAAGAGAGTCAGATTTCCATGGTG
ACCCCAAGAAAGATTTCAAAAATGATTCTGAATCAAGCAGAAGAGCACACAGTGGTAGGTACGATGAGACAAGGGATCGACGATACAGGGTAGACCCCAAAATTGACTCT
GAATCAAACACTAGATCACGCTATAGTGCACACGACGAGGATGATGACAGAAAGTCAACTCGAACAGGAAGCAGATATACTGAAGAAACAGAGCATGGAAGTAGACATCA
TCGCAAGGCTAACGAGTCTCATCATCGCAGTAGGACTGATCAAGATACTGAAGAGGAAAAAAGGCACATCAGATATGAGGAGCCTAGAGGGAGAAAGCATGAAAGAGATG
ATGGTCTAAAATCGAGCAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAGCAGGCTGAGATCTGAGAAAGATTATGAAAGTAGAGAATCTACAAGAGATAGGGATGAT
TCCAGAAAGAGGGCCAAATATGATTCTCGATCAAGCAGACGTGACAATTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAATTCAAACTGGGTCGTCTGATTTGGGCCTGGACGCGCCCATCTGACCTTAACCCGGCCCGTTTCGAGACCGAGCCCACATCGGCCCATTTGTTTTTTGCTTGGCC
GGCCGGCAAGTATCTGGTAGGAACTAAACAATTCGCCGTTAATCAATTCGCCGGCACCGACCGGTTCCAACGTGTCGGGGGTATTCCAGTAATCGGATTGGTCGCAGAAT
CTTCAGGTATAGATGATGTCCGAAGAAGAGAAACAAGCGAAATCGTATTCCGAGGCGTCAACGACTTCCTCGGTCGAAATTCAGAACTGGATAGAGTAATCGAAGAAATT
TCCATTATAGAAGCGCAGGACTTGCTGAAGAAGAAACTAATCAAAGAGGATGAGTTGGAAGGGGAAGTGGATGATTTGAAGTATATCGGCGGCGTTGATATAAGCTTCTT
AAAGGAAGATTCATCAGTTGCGTGTGGTACGCTCGTGGTTTTGGATCTCCAAACTCTTCAAGTTGTCTATGATGATTTCTCTCTTGTTACCGTTCAAGTTCCTTATGTCC
CTGGCTTTCTAGCATTTAGAGAGAATTCATATATTCCCATGAATGTTAATGATCATACACCTCGTTGCTTTGGTTCTTCAGGCCCCAGTTCTCTTGGAGCTTTTGGAGAG
GATGAGAAAGAGAGCTCCTCAACTGTATCCACAGGTGAATGCTTCATAGCCTGTGCTGACTTTATTTATGCCACTCAAAAACAATATTTTCTCGCTATTCCCATGTCTGT
TTCTTCGCATGTTATGCTGAAATCATCTGAGGAATTGTCTTGGAATTGGCTGATGGTGTTCAACTGTCGATCTATTGTCTGTAGTAGTATCAACATTATATGCTATCTTA
GAACACTGACCATTCCATGGGAGGAACGTGCAGGTTTTGGTCTGGCCAGTCATCTTGGCGTTCTTGCTAATTTGCCTACGATTGGAATTGGCAAGAATCTGCATCATGTT
GATGGTCTTACCCAGTCCAGTGTGAGGCAACTTCTTTCCGAGGGCAAGAATAATGATTCTATAATAACTTTGAAGGGTATCTCTGGATGCATTTGGGGTGTGGCTATGAG
ATCTACTGTTGATTCATTGAAGCCCATATACGTTTCAATTGGTCATCGTGTTTCACTCAACACTGCCATCAGGATTGTTAAAATCACATGCAAATATCGTGTTCCAGAAC
CTATCAGACAGGCTGACATTAGGTCAAGAGAATACCTCCGGAAATTACAAATGGGAATAGAATTGAAGTGGAAAAAGCATGGCACTGAAGATCAGTACGATAGTGATGAT
TCTTCTGATACTGATTCTGGTGGAGAGGTCAAGAAAACCAAGAAGAATAAGAGAACTAATCAAAGAACAGCCAGTGATGTTGACAAGAAATACAGCTCAAAGAAGCAGAA
GAAAAACACAAATTCTGATTCAGGAAGTTGTCAACCAGGTGACCCAACCCTCGGGCTGGTTGGTGAAGCGAAGAAGATGTATAATGGTATTGGTTTACAAACTCCGAGAG
GGTCTGGCACTAATGGATATATTCAGACAAACAAGTTCTTTGTGAGGCCGAAGACCGGAAAGGTTGCTGAAAGCTCTAGAGGATTCGAAGAAGATCAGGGCACTGCTGGA
GTTTCAAAGAAACCTAATAAAGACATTCTCGAACACGACCGCAAGCGTCAGATCGAACTCAAACTTGTCATACTTGAGGACAAGCTTACTGACCAAGGTTATACCGTGGA
TGAAATTTCTGAGAAGTTAAGGGAGGCTCGCGAAACTTTGGAAGCTGCTTCAGAGGAAAAAGATGGAGCTTCTGCCATCGTACTTGCAGATAAGAGGTACTCCTCTTCTC
CTCTTAATGATATGAATGATGATGTAATTTATTTCTTGAGAAGTAATAACCCTATCAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATG
AAAACATTTAGAGCTGCTCTTGGGTTGGGTTCATCGGACGATACTGAACAGGTTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGAGAGGGTCAGAATTCTGATAT
TAAGCGTCATGAGAAGTCTGAACATTCTTTTTTGGACAGAGAATTGAACTGGAAAAAGCGTGGCACTGAAGATCAGTATGATGATAAGGATGACCAAAAAAGGGTTTCGA
AAGAGTTGAAAGGTCATCAGAAGGATAGAAAAAGAAGGCCCAAGGATGATTCTTCTGGCACCGATTCTGGTGAGCGTAAGGGAACCAAGAAGAACTTGAGAGACAGTAGA
AGGAATGATTCTGAAAGTGACCCTGACAATGATGTTGGCAACAAATATGTCGCCTCAAGGAAGTCTAAAAAAAATAGAAAGCATGATAGTGATGATTCTTCTGATACTGA
TTCTGGTGGTGAGCGCAAGGGAACGAAGAAGCACATGAGAGATAAACGGAGATCTGATCCTGAAAGTGACCCAGACAGTGATTTTGACCAGAAATATATCACCTCGAGGA
AGCATAAGAAGAACAGAAGGCATGATAGTGATGATTCTTCTAATACTGATTCTGGTGGAGAGCACAAGAAAACCAAGAAGAATATGAAAAATAATCAAAGAGATCATGGA
AGTGATCCCGACAATGATGTTGATAAGAAATACACCTCAAAGAAGCAGGCGAAAAACACAAGGCATGATAGGTATGATTCTGATTCATTTACAGACGGTGATGAGTTTGG
GATGGGCAGCCACAAGAAAGGATCGGGTAGACATAAAAGTCAAAAGGTGAAGAAGCATAGAAGCCGGAAACAGGAGTCTTCTGATGAATCCAATTCTGACAGTGGGATTG
ATGATAAACACAGGCAACTGAAGCACAAAAACCAGCATGGTAAAAGATATGGGGTAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGCTCGCAAGAAG
AGTATGGATAGGTATGACAGCAAACGTATAGGAAAGAGCATGGTAGATAGTGAATCTGATTCTGAGAAGTCAAGAAAGCATCCTAAGAAAGATGTTGGGAGACGCAGACA
TGATATTGATGATGAAAAAAGTGGTGGTGATGAAATAGCGAAGAGGTGCAGAGGTAAGAGGCACAATACTGATGATGAATCTGAAGAAGGTGAATATTTTGGTAGAAGTG
GTAGGACAGCCACAAAAGGAAAAATAGCTGCTAAAAGGCAACATGATGACAGTGATAATTCTGATGATAGCCTAACAGTTGATAGAAAGGGCAATGATAAACACAAGAGA
GCTAAGAAATATTCGTCGGGTGACGGTTTTGATCTAGAGAAGGGAGTAAAATCAAGCAGTGGAGCTCATGGAAGAGGAAAAGGGAACCTAAATCATTCAGAAGGTAGGAG
GCACAATACTGATGATAAATCTGAAGAAGAAGAAGGTGAATATTTTGGTAGAAGTGATAAGATAGCTACAAAAGGAAAAATAGATGCTAAAAGGCAACATGACGACAATG
ATAATTCTGATGATAGCCTAGCAGTTGATAGAAAGGGCAATGATAAACACAAGAGAGCTAAAAAATATTCATCGAGTGACGATTCTGATATAGAGAATGGAGTAAAATCA
AGTGGTGGAGCTCGTGAAAGGGGAAAAAGGAACTTAAATCATACAGATGGTTTGGACAAGTTTAAGAAAGATTCTATCAATGAGTTCAACCATGCAAGTCAACATATAGA
TACAATGAAAAGCAAGAGAAAGTTTGATGAAGGTGAAAATGAGCAGCAGCTAGAGTCAAGGGATCGACGGCACAGGGAAGACTCCAAAAGAGAGTCAGATTTCCATGGTG
ACCCCAAGAAAGATTTCAAAAATGATTCTGAATCAAGCAGAAGAGCACACAGTGGTAGGTACGATGAGACAAGGGATCGACGATACAGGGTAGACCCCAAAATTGACTCT
GAATCAAACACTAGATCACGCTATAGTGCACACGACGAGGATGATGACAGAAAGTCAACTCGAACAGGAAGCAGATATACTGAAGAAACAGAGCATGGAAGTAGACATCA
TCGCAAGGCTAACGAGTCTCATCATCGCAGTAGGACTGATCAAGATACTGAAGAGGAAAAAAGGCACATCAGATATGAGGAGCCTAGAGGGAGAAAGCATGAAAGAGATG
ATGGTCTAAAATCGAGCAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAGCAGGCTGAGATCTGAGAAAGATTATGAAAGTAGAGAATCTACAAGAGATAGGGATGAT
TCCAGAAAGAGGGCCAAATATGATTCTCGATCAAGCAGACGTGACAATTATTAACTCTAGAGCTTGAATCATGATGATGCCTGGTTCGGAAAGCAGTTGTATCTTTCCTT
TTCCCTGTAATGAAAAATTATGTGAAATGTTGTTTGATTTCAACTTGCAGTCATAGTTGCTACAGAAAAGCTTGCTACGGACGGGACCGATAGATATGGAATTGCTTCTT
CTTGTAATGTTAATTGTTGAATTGTATTCTTCATCAGGTTTTAGAAAGGCGATCCAATATTTGATCTCATTTGTGCTAGTAAGTTTGTTTGTATAGGCAGAAGACTTGAT
TCCGGAGTAATTTTGAAGACTACTGAAATTGTTATAATGTCTCAATTTTGTATGTCTTTTTCTTTACTATATTACCAAATACAAATCTCTAAATTGATATGTACCTCATT
TTTTTTATAAAAATCAGAAAGTAACTTGTACTAGAGTTTGGAATCTATTACTAAACATATTAACAATAAAAACGTTAAATACAAACTAAAACCAAACAAAGTCTCCAAAC
Protein sequenceShow/hide protein sequence
MKFKLGRLIWAWTRPSDLNPARFETEPTSAHLFFAWPAGKYLVGTKQFAVNQFAGTDRFQRVGGIPVIGLVAESSGIDDVRRRETSEIVFRGVNDFLGRNSELDRVIEEI
SIIEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFRENSYIPMNVNDHTPRCFGSSGPSSLGAFGE
DEKESSSTVSTGECFIACADFIYATQKQYFLAIPMSVSSHVMLKSSEELSWNWLMVFNCRSIVCSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHV
DGLTQSSVRQLLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQADIRSREYLRKLQMGIELKWKKHGTEDQYDSDD
SSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGSCQPGDPTLGLVGEAKKMYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESSRGFEEDQGTAG
VSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTVDEISEKLREARETLEAASEEKDGASAIVLADKRYSSSPLNDMNDDVIYFLRSNNPIRVSDTQTHQIAARKEEQM
KTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSR
RNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHG
SDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKK
SMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSGGDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKR
AKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKS
SGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEGENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDS
ESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDD
SRKRAKYDSRSSRRDNY