; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004619 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004619
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEndonuclease V isoform X1
Genome locationChr08:18944190..18955035
RNA-Seq ExpressionHG10004619
SyntenyHG10004619
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003727 - single-stranded RNA binding (molecular function)
GO:0016891 - endoribonuclease activity, producing 5'-phosphomonoesters (molecular function)
InterPro domainsIPR007581 - Endonuclease V


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064984.1 dentin sialophosphoprotein-like [Cucumis melo var. makuwa]0.0e+0073.01Show/hide
Query:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS
        +V E  R  +E+      +++   N D   H++       R+++ K    ED+   +D   T+   E+ +  +  R N   AS  ++K  S         
Subjt:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS

Query:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH
           RVSDTQTHQIAARKEEQMKT RAALGLGS DD EQVKEEISDPSR+RREGQN+DIKRHEKSEHSFLDRELNWK+RGTEDQ+DDKD +K  SKELKGH
Subjt:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH

Query:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY
        QKD+KRRPKDDSS TDSGE KGTKKNLRDSRR DSES+ D DV NKYVASRKSKKNR+HDSDDSS TDSGGE K TKKH R+KR+ DPESD DSD DQKY
Subjt:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY

Query:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR
        +TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+TRHD  DSDSFTDGD+ GM SH+KGSGRH+SQKVKK RS+
Subjt:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR

Query:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----
        KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR GKS VDSESD EKSRK+PKKDV RRRHDIDDEKSG    
Subjt:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----

Query:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS
          DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG+D+HKRAKKYSSGDGF+LEKG K SSGA  RGKGNLNH 
Subjt:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS

Query:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK
        EGRRHNTDDKS EEEGEY GRS KIATK K+DAKRQHDD++NSDDSLAV      KHKRAKKYSSSDDSD+E GVKS+ GARERGK   N  DGLDKFKK
Subjt:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK

Query:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD
        DSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DSESSRR+ SGRYDETRD RYR D KIDSESNTRSRYSAH+
Subjt:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD

Query:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR
        EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKDY   ESTRDR+DSR
Subjt:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR

Query:  KRAKYDSRSSRRDNY
        KRAKY+SRSSR DN+
Subjt:  KRAKYDSRSSRRDNY

KAG6598821.1 Endonuclease V, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0059.98Show/hide
Query:  MMSEEEKQAKSYSEASTTSSVEIQNWIEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGF
        M+SEE KQ KSY EA+TTSSVEIQNWIEAQDLLKKKLIKEDE EG  D LKY+GGVDISFLKEDSSVACGTLVVL+LQTLQVVY+DFSLVTVQVPYVPGF
Subjt:  MMSEEEKQAKSYSEASTTSSVEIQNWIEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGF

Query:  LAFREAPVLLELLERMRKRAPQLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLT
        LAFREAPVLLELLERM+KR P LYPQLL+VDGNGILHPR                               GFGLASHLGVLANLPTIGIGKNLHHVDGLT
Subjt:  LAFREAPVLLELLERMRKRAPQLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLT

Query:  QSSVRQLLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQVKEEISDPSRNR--REGRNADTKRHEK
         S VR+LLSE KNNDS++TL+G SGCIWGVAMRST DSLKPIY+SIGHRVSL+TAIRIVK TCK+RVPEPIRQ     +D SR    ++G    ++R E 
Subjt:  QSSVRQLLSEGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQVKEEISDPSRNR--REGRNADTKRHEK

Query:  SEHSF---LDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKK-------------------NKRTNQ-RTASDVDKKYSSKKQKKNTNSDS--------
         + +F   +    +    G   +       D  + G  KK  K                   +K T+Q  T  ++ +K    ++     S S        
Subjt:  SEHSF---LDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKK-------------------NKRTNQ-RTASDVDKKYSSKKQKKNTNSDS--------

Query:  -----GRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKEL
              +VSDTQ+HQIAARKEEQMKT RAALGL SS+D+EQV E ISDP+RNRREGQN+DIKRHEKSEHSFLDRELNWKK G+ED  DDK D+KRVSKEL
Subjt:  -----GRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKEL

Query:  KGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDP---
        KGH KDR RRPKDDSS  DS GE  KGTKKNLRD+RRNDSESD ++D  +KY  SRKSKKNR+HDSD SSDTDSGGERKGTKKH+RD RR  P+ DP   
Subjt:  KGHQKDRKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDP---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---DSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGR
           DS+FDQK+ITSRKHKKNRRHDSD SS+TDSGGEHK+TKK++KNN+RD  SD D+D+DKKY TSKKQ KN   D  DSDS  D  EFGMGSH+KGSGR
Subjt:  ---DSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGR

Query:  HKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRR
         KSQKV KK RSRKQES+DESNSDSGIDDK RQLKHKNQHGKRYGV+SDSSD DSSDSDV R KS  RY SKR GKS VDSESDSEK RKHPK DVGRRR
Subjt:  HKSQKV-KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRR

Query:  HDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSG
        HD D+++SG      DEI KR R +RHN+DD+SEEGEYFG+SG+ ATKG IAAKR+HDDSD SDDS  VDR+GNDK KRAKK+SSGDG D +KGVKSS G
Subjt:  HDIDDEKSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSG

Query:  AHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRN
        A  RGKG+ NH++G                                                    D+   A K +S                       
Subjt:  AHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRN

Query:  LNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDS
                K + DS++EFN A+Q   TMKSKRK DE GE+EQQ E++ R     S RESDFHGDPKKDFKNDSESSRRA SGRY+E RD RYR DPKIDS
Subjt:  LNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDS

Query:  ESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD
        ESN RSRYSAH+EDDDRKSTRTGSRYTEETEHGSRH+ KANESHHRSRTDQD EE KRH   RYEE RGRKHERD+G+KSSRE ERGEYQPSSRLRSEKD
Subjt:  ESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD

Query:  YESRESTRDRDDSRKRAKYDSRSSRRD
        YE++ESTRDRDD RKRAKYDSRSSRRD
Subjt:  YESRESTRDRDDSRKRAKYDSRSSRRD

XP_004138875.1 dentin sialophosphoprotein [Cucumis sativus]9.5e-30271.29Show/hide
Query:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS
        +V E  R  +E+      +++   N D   H++       R+++ K    ED+ +    ++     E+ +  +  R N   AS  ++K  S         
Subjt:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS

Query:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH
           RVSDTQTHQIAARKEEQMKT RAALGLGS DD+EQVK+EISDPSRNRREGQN+D+KRHEKSEHSFLDR+LNWKKRGTEDQYDDKD +K  SKE+K  
Subjt:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH

Query:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY
        QKD+KRR KDDSS TDSGERKGTKKNLRDSRRNDSESD D DV NKYVASR SKKNR+HDSDDSS+TDSGGE K TKKH R+KR+ + E+D DSD DQKY
Subjt:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY

Query:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRS
        +TSRKHKKNRRHDSDDSS+TDS GEHKKTKK+++NNQR HGSD D+DVDKK+TSKKQ K+TRHD   SDSFTDGD+ GM SH KKGSGRH+S KVKK RS
Subjt:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRS

Query:  RKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG---
        RKQ+S+DE+NSDSGI+DKHRQLKHK+QHGKRYG ESDSSDHDSSDSDV R KS  RY SK  GKS V+SESDSEKSRK+P KD  RRRHDIDDEKSG   
Subjt:  RKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG---

Query:  --GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNH
           DE+ KR RG+RHN DD S EEGEYFGRSG+ ATKGKI AKRQH DS+NSDDSL V RKG+D HK+AKKY SGDGF+LEKG K SSGA  RGKGNL+H
Subjt:  --GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNH

Query:  SEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFK
        +EGRRHNTDDKS EEEGEY GRS KIATK KID KRQHDD++NSDDSLAV      KHKRAKKY SSDDSD+E GVKS+ GARERGK   NH DGL KFK
Subjt:  SEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFK

Query:  KDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAH
        KDSINE NHASQ  D M  KRK DEG E EQ+ ES+ R            + DPKKD K+DSESSRR+ SGRYD+TRD RYR D KIDSESNTRSRYSA 
Subjt:  KDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAH

Query:  DEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDS
         EDDDRKS RTGSRY+EETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKDYE+RESTRDR+DS
Subjt:  DEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDS

Query:  RKRAKYDSRSSRRDNY
        RKR KY+SRSSRRDN+
Subjt:  RKRAKYDSRSSRRDNY

XP_008445109.1 PREDICTED: dentin sialophosphoprotein-like [Cucumis melo]0.0e+0072.57Show/hide
Query:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS
        +V E  R  +E+      +++   N D   H++       R+++ K    ED+   +D   T+   E+ +  +  R N   AS  ++K  S         
Subjt:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS

Query:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH
           RVSDTQTHQIAARKEEQMKT RAALGLGS  D EQVKEEISDPSR+RREGQN+DIKRHEKSEHSFLDRELNWK+RGTEDQ+DDKD +K  SKELKGH
Subjt:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH

Query:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY
        QKD+KRRPKDD S  DSGE KGTKKNLRDSRR DSESD D DV NKYVASRKSKKNR+HDSDDSS TDSGGE K TKKH R+KR+ DPESD DSD DQKY
Subjt:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY

Query:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR
        +TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+TRHD  DSDSFTDGD+ GM SH+KGSGRH+SQKVKK RS+
Subjt:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR

Query:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----
        KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR GKS VDSESD EKSRK+PKKD  RRRHDIDDEKSG    
Subjt:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----

Query:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS
          DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG+D+HKRAKKYSSGDGF+LEKG K SSGA  RGKGNLNH 
Subjt:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS

Query:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK
        EGRRHNTDDKS EEEGEY GRS K+ATK K+DAKRQHDD++NSDDSLAV      KHKRAKKYSSSDDSD+E GVKS+ GARERGK   N  DGLDKFKK
Subjt:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK

Query:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD
        DSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DSESSRR+ SGRYDETRD RYR D KIDSESNTRSRYSAH+
Subjt:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD

Query:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR
        EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKDY   ESTRDR+DSR
Subjt:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR

Query:  KRAKYDSRSSRRDNY
        KRAKY+SRSSR DN+
Subjt:  KRAKYDSRSSRRDNY

XP_038884695.1 dentin sialophosphoprotein-like [Benincasa hispida]4.8e-28570.89Show/hide
Query:  RVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKD
        RVSDTQTHQIAARKEEQMKT RAALGLGSS DTEQVKEEISDPSR RREGQN+DIKRHEKSEHSFLDRELNWKK G EDQYDDKDD+KR+SKELKGHQK 
Subjt:  RVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKD

Query:  RKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITS
        RKRRPKDDSS TDS +R     NLRDSRRNDSESD D+DVG+KYVASR   KNR+HDSDDSSDTDSGGERKGTKKH+RDKRR DPESDPDSDFDQKYITS
Subjt:  RKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYITS

Query:  RKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYT-SKKQAKNTRHDRYDSDSFTDGDEFGM-GSHKKGSGRHKSQKVKKHRSRK
        RKHKKNRRHD D+SS+TDSGGEHKKTKKNM+NN+R HGSDP +D+DKKYT SKK  KN RHD  DSDS TDGDEFGM GSHKKGS RHKSQKVK  RSRK
Subjt:  RKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYT-SKKQAKNTRHDRYDSDSFTDGDEFGM-GSHKKGSGRHKSQKVKKHRSRK

Query:  QESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----
        QES+DESNSDSGID+K RQLKH+NQHGKRYGVESDSSDHDSSDSDV  KKS  RYDSKR GKS VDSES+SEKSRKH KKD GR RHDID+EKSG     
Subjt:  QESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG-----

Query:  GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEG
        G EI KR RG+ +N DD S                                                                                 
Subjt:  GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEG

Query:  RRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDS
                  EEEGEY GRS KIATKGKIDAKRQHDDN+NSDDSLAV RKGNDKHKRAKK SS DDSD+E GVK+SGGARERGK +LNH DGL+KFKKDS
Subjt:  RRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDS

Query:  INEFNHASQHIDTMKSKRKFDE-GENEQQLES---------------------------------------RDRRHREDSKRESDFHGDPKKDFKNDSES
        INEFNHASQ  DTM SKRKFDE G+NEQQLES                                       RD R+RED K +SDFHG+PKK F+NDSES
Subjt:  INEFNHASQHIDTMKSKRKFDE-GENEQQLES---------------------------------------RDRRHREDSKRESDFHGDPKKDFKNDSES

Query:  SRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDG
        SRRA SGRYDETRD RYR DPKIDSESN RSRYS  DEDDDRK+T+TGSR+TEETEHGSRHHRKANESHHRSRT +DTEEEKRH RYEEPRGRKHER++G
Subjt:  SRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDG

Query:  LKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY
        LKS REVERGEYQPSSRLRSEKDYE+RESTRDRDDSRKRAKY+SRSSRRDN+
Subjt:  LKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY

TrEMBL top hitse value%identityAlignment
A0A0A0LQ00 cwf21 domain-containing protein4.6e-30271.29Show/hide
Query:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS
        +V E  R  +E+      +++   N D   H++       R+++ K    ED+ +    ++     E+ +  +  R N   AS  ++K  S         
Subjt:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS

Query:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH
           RVSDTQTHQIAARKEEQMKT RAALGLGS DD+EQVK+EISDPSRNRREGQN+D+KRHEKSEHSFLDR+LNWKKRGTEDQYDDKD +K  SKE+K  
Subjt:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH

Query:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY
        QKD+KRR KDDSS TDSGERKGTKKNLRDSRRNDSESD D DV NKYVASR SKKNR+HDSDDSS+TDSGGE K TKKH R+KR+ + E+D DSD DQKY
Subjt:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY

Query:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRS
        +TSRKHKKNRRHDSDDSS+TDS GEHKKTKK+++NNQR HGSD D+DVDKK+TSKKQ K+TRHD   SDSFTDGD+ GM SH KKGSGRH+S KVKK RS
Subjt:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSH-KKGSGRHKSQKVKKHRS

Query:  RKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG---
        RKQ+S+DE+NSDSGI+DKHRQLKHK+QHGKRYG ESDSSDHDSSDSDV R KS  RY SK  GKS V+SESDSEKSRK+P KD  RRRHDIDDEKSG   
Subjt:  RKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG---

Query:  --GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNH
           DE+ KR RG+RHN DD S EEGEYFGRSG+ ATKGKI AKRQH DS+NSDDSL V RKG+D HK+AKKY SGDGF+LEKG K SSGA  RGKGNL+H
Subjt:  --GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNH

Query:  SEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFK
        +EGRRHNTDDKS EEEGEY GRS KIATK KID KRQHDD++NSDDSLAV      KHKRAKKY SSDDSD+E GVKS+ GARERGK   NH DGL KFK
Subjt:  SEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFK

Query:  KDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAH
        KDSINE NHASQ  D M  KRK DEG E EQ+ ES+ R            + DPKKD K+DSESSRR+ SGRYD+TRD RYR D KIDSESNTRSRYSA 
Subjt:  KDSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAH

Query:  DEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDS
         EDDDRKS RTGSRY+EETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKDYE+RESTRDR+DS
Subjt:  DEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDS

Query:  RKRAKYDSRSSRRDNY
        RKR KY+SRSSRRDN+
Subjt:  RKRAKYDSRSSRRDNY

A0A1S3BBX0 dentin sialophosphoprotein-like0.0e+0072.57Show/hide
Query:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS
        +V E  R  +E+      +++   N D   H++       R+++ K    ED+   +D   T+   E+ +  +  R N   AS  ++K  S         
Subjt:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS

Query:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH
           RVSDTQTHQIAARKEEQMKT RAALGLGS  D EQVKEEISDPSR+RREGQN+DIKRHEKSEHSFLDRELNWK+RGTEDQ+DDKD +K  SKELKGH
Subjt:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH

Query:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY
        QKD+KRRPKDD S  DSGE KGTKKNLRDSRR DSESD D DV NKYVASRKSKKNR+HDSDDSS TDSGGE K TKKH R+KR+ DPESD DSD DQKY
Subjt:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY

Query:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR
        +TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+TRHD  DSDSFTDGD+ GM SH+KGSGRH+SQKVKK RS+
Subjt:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR

Query:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----
        KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR GKS VDSESD EKSRK+PKKD  RRRHDIDDEKSG    
Subjt:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----

Query:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS
          DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG+D+HKRAKKYSSGDGF+LEKG K SSGA  RGKGNLNH 
Subjt:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS

Query:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK
        EGRRHNTDDKS EEEGEY GRS K+ATK K+DAKRQHDD++NSDDSLAV      KHKRAKKYSSSDDSD+E GVKS+ GARERGK   N  DGLDKFKK
Subjt:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK

Query:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD
        DSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DSESSRR+ SGRYDETRD RYR D KIDSESNTRSRYSAH+
Subjt:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD

Query:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR
        EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKDY   ESTRDR+DSR
Subjt:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR

Query:  KRAKYDSRSSRRDNY
        KRAKY+SRSSR DN+
Subjt:  KRAKYDSRSSRRDNY

A0A5A7VCH8 Dentin sialophosphoprotein-like0.0e+0073.01Show/hide
Query:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS
        +V E  R  +E+      +++   N D   H++       R+++ K    ED+   +D   T+   E+ +  +  R N   AS  ++K  S         
Subjt:  RVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSGGEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNS

Query:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH
           RVSDTQTHQIAARKEEQMKT RAALGLGS DD EQVKEEISDPSR+RREGQN+DIKRHEKSEHSFLDRELNWK+RGTEDQ+DDKD +K  SKELKGH
Subjt:  DSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGH

Query:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY
        QKD+KRRPKDDSS TDSGE KGTKKNLRDSRR DSES+ D DV NKYVASRKSKKNR+HDSDDSS TDSGGE K TKKH R+KR+ DPESD DSD DQKY
Subjt:  QKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY

Query:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR
        +TSRKHKKNRRHDSDDSS++DSGGEHKKTK+++++NQR HGSDPD+DVDKK+TSKKQ K+TRHD  DSDSFTDGD+ GM SH+KGSGRH+SQKVKK RS+
Subjt:  ITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVKKHRSR

Query:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----
        KQ+S+DE+NSDS ++DKHRQLKHKNQHGKRYG ESDSSDHDSSDSDV RKKS  R+ SKR GKS VDSESD EKSRK+PKKDV RRRHDIDDEKSG    
Subjt:  KQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSG----

Query:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS
          DE+ KR RG+RH+TDD S EEGEYFGRSG+  TKGKI AKRQ D S+NSD SL VDRKG+D+HKRAKKYSSGDGF+LEKG K SSGA  RGKGNLNH 
Subjt:  -GDEIAKRCRGKRHNTDDES-EEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHS

Query:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK
        EGRRHNTDDKS EEEGEY GRS KIATK K+DAKRQHDD++NSDDSLAV      KHKRAKKYSSSDDSD+E GVKS+ GARERGK   N  DGLDKFKK
Subjt:  EGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKK

Query:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD
        DSI+EFNHASQ  D M SKRK DEG ENEQ+ ES+ R            + DPKKDFK+DSESSRR+ SGRYDETRD RYR D KIDSESNTRSRYSAH+
Subjt:  DSINEFNHASQHIDTMKSKRKFDEG-ENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHD

Query:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR
        EDDDRKSTRTGSRYTEETEHGSRHHRKANESHH  RTDQDTEEEKRH RYEEPRGRKHERD+GLKSSREVERGEYQPSSR RSEKDY   ESTRDR+DSR
Subjt:  EDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSR

Query:  KRAKYDSRSSRRDNY
        KRAKY+SRSSR DN+
Subjt:  KRAKYDSRSSRRDNY

A0A6J1ESM6 dentin sialophosphoprotein-like1.9e-23960.95Show/hide
Query:  RVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKD
        +VSDTQ+HQIAARKEEQMKT RAALGL SS+D+EQV E ISDP+RNRREGQN+DIKRHEKSEHSFLDRELNWKK G+ED  DDK D+KRVSKELKGH KD
Subjt:  RVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKD

Query:  RKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYI
        R RRPKDDSS  DS GE  KGTKKNLRD+RRNDSESD ++D  +KY  SRKSKKNR+HDSD SSDTDSGGERKGTKKH+RD RR  P+ DPDS+FDQKY 
Subjt:  RKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKYI

Query:  TSRKHKKNRRHDSDD-------------------------------------------------------------------------------------
        TSRKHKKNRRHDSDD                                                                                     
Subjt:  TSRKHKKNRRHDSDD-------------------------------------------------------------------------------------

Query:  ---------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV
                             SS+TDSGGEHK+TKK++KNN+RD  SD D+D+DKKY TSKKQ KN      DSDS  D  EFGMGSH+KGSGR KSQKV
Subjt:  ---------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV

Query:  -KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDE
         KK R RKQES+DESNSDSGIDDK RQLKHKNQHGKRYGV+SDSSD DSSDSDV R KS  RY SKR GKS VDSESDSEK RKHPKKDVGRRRHD D++
Subjt:  -KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDE

Query:  KSGGDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNH
        +SG                                           D+S +SD+ +        K +R                                
Subjt:  KSGGDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNH

Query:  SEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLD---
           RRHN+DDKS EEEGEYFG+S KIATKG I AKR+HDD+D SDDS AVDRKGNDK KRAKK+SS D SD + GVKSSGGARERGK + NH DGLD   
Subjt:  SEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLD---

Query:  --------KFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDS
                K + D ++EFN A+Q   TMKSKRK DE GE+EQQ E++ R     S RESDFHGDPKKDFKNDSESSRRA SGRY+ETRD RYR DPKIDS
Subjt:  --------KFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDS

Query:  ESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD
        ESN RSRYSAH+ED+DRKSTRTGSRYTEETEHGSRH+ KANESHHRSRTDQD EE KRH   RYEE RGRKHERD+G+KSSRE ERGEYQPSSRLRSEKD
Subjt:  ESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKD

Query:  YESRESTRDRDDSRKRAKYDSRSSRRD
        YE++ESTRDRDD RKRAKYDSRSSRRD
Subjt:  YESRESTRDRDDSRKRAKYDSRSSRRD

A0A6J1K7B6 dentin sialophosphoprotein-like2.9e-23560.37Show/hide
Query:  RVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKD
        +VSDTQ+HQIAARKEEQMKT RAALGL SS+D+EQV E ISDP+RNRREGQN+DIKR EKSEHSFLDRELNWK+ G+ED  DDK D+KRVSKELKGH KD
Subjt:  RVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKELKGHQKD

Query:  RKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY-
        R RRPKDDSS  DS GE  KGTKKNLRD+RR DSESD ++D  +KY  SRKSKKNR+HDSD SSDTDSGGERKGTKKH+RD RR  P+ DPDS+FDQKY 
Subjt:  RKRRPKDDSSGTDS-GE-RKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRRSDPESDPDSDFDQKY-

Query:  ----------------------------------------------------ITSRKHKKNRRHDSDD--------------------------------
                                                            ITSRKHKKNRRHDSDD                                
Subjt:  ----------------------------------------------------ITSRKHKKNRRHDSDD--------------------------------

Query:  ---------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV
                             SS+TDSGGEHK+TKK++KNN+RD  SD D+D+DKKY TSKKQ KN   D  DSDS  D  EFGMGSH+KGSGR KSQKV
Subjt:  ---------------------SSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKY-TSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKV

Query:  -KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDE
         KK RSRKQES+DESNSDSGIDDK RQLK+KNQHGKRYGV+SDSSD DSSDSDV R KS  RY SKR GKS VDSESDSEK RKHPKKDVGRRRHD D++
Subjt:  -KKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDE

Query:  KSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGK
        +SG      DEI KR R +RHN+DD+SEEGEYFG+SG+ ATKG IAAKR+H+DSD SDDS  VDR+GNDK KRAKK+S GDG D +KGVKSS GA  RGK
Subjt:  KSG-----GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGK

Query:  GNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDG
        G+ NH++G                                                    D+   A K +S                             
Subjt:  GNLNHSEGRRHNTDDKSEEEEGEYFGRSDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDG

Query:  LDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRS
          K + DS++EFN A+Q   TMKSKRK DE GE+EQQ E++ +     S RESDFHGDPKKDFKNDSESSRRA SGR+ ETRD RYR DPKIDSESN RS
Subjt:  LDKFKKDSINEFNHASQHIDTMKSKRKFDE-GENEQQLESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRS

Query:  RYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRES
        RYSAH+EDDDRKS RTGSRYTEETEHGSRH+ KANESHHRSRTDQD EE KR    RYEE RGRKHERD+G+KSSRE ERGEYQPSSRLRSEKDYE++ES
Subjt:  RYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTEEEKRH--IRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRES

Query:  TRDRDDSRKRAKYDSRSSRRD
        TRDRDD RKRAKYDSRSSR D
Subjt:  TRDRDDSRKRAKYDSRSSRRD

SwissProt top hitse value%identityAlignment
B5YD80 Endonuclease V7.9e-2531.58Show/hide
Query:  IEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAPQLYPQ
        +  Q  + +K+IKED  +    +L+Y+GGVD S +++      G +VVL+  TL+V+     +  V  PY+PGFL+FRE P++L+  E ++     + P 
Subjt:  IEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAPQLYPQ

Query:  LLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLSEGKNNDSIITLKGISGC
        LL+ DG GI HPR +                               G+ASH+G + ++P+IG  K +          +  +  E            I   
Subjt:  LLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLSEGKNNDSIITLKGISGC

Query:  IWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIV-KITCKYRVPEPIR
        I G  +R T D++KP++VS+GH++SL+T+I I+ K + KYR+PEP+R
Subjt:  IWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIV-KITCKYRVPEPIR

B8DZX0 Endonuclease V2.9e-2735.63Show/hide
Query:  IEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAPQLYPQ
        ++ Q+ L KK+I ED+ +    +L+YIGGVD S L E      G + +L  +TL++V    +L  V  PY+PGFL+FRE PV+L   E+++     + P 
Subjt:  IEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAPQLYPQ

Query:  LLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLSEGKNNDSIITLKGISGC
        LL+ DG GI HPR +                               G+ASH+G + ++P+IG  KN+  + G  +   ++     K +   I  KG    
Subjt:  LLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLSEGKNNDSIITLKGISGC

Query:  IWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIV-KITCKYRVPEPIR
        I G A+R T D++KP++VS+GH++SLNT+I I+ K + KYR+PEP+R
Subjt:  IWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIV-KITCKYRVPEPIR

Q10348 Putative endonuclease C1F12.06c3.6e-2535.04Show/hide
Query:  VDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLV-TVQVPYVPGFLAFREAPVLLELLERMRKRAPQLYPQLLMVDGNGILHPRGIFFLQ
        +++++Y+ G+DISF K  S  A   LV+ DL+   ++Y D+  +  ++  YVPGFL+FRE    L LL  +     Q    +++VDGNG+LHP       
Subjt:  VDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLV-TVQVPYVPGFLAFREAPVLLELLERMRKRAPQLYPQLLMVDGNGILHPRGIFFLQ

Query:  NLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQS--SVRQLLSE---GKNNDSIITLKGI--SGCIWGVAMRSTVDS
                                 GFGLA HLGVL NLP +G+ KN  H  GLT+S  + R+ L +    K  D  I +  I  S  I G A+ ++ +S
Subjt:  NLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQS--SVRQLLSE---GKNNDSIITLKGI--SGCIWGVAMRSTVDS

Query:  LKPIYVSIGHRVSLNTAIRIVK--ITCKYRVPEPIRQ----VKEEISDPSRNRR
         +P+YVSIG++++L  +I++V+   +   RVPEPIRQ     K  +S   RN++
Subjt:  LKPIYVSIGHRVSLNTAIRIVK--ITCKYRVPEPIRQ----VKEEISDPSRNRR

Q8C9A2 Endonuclease V5.3e-4541.02Show/hide
Query:  WIEAQDLLKKKLIKED----ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAP
        W   Q  LK +++  D    + +     L+ +GGVD+SF+K DS  AC +LVVL    L+VVY+D  +V ++ PYV GFLAFRE P L+EL++R++++ P
Subjt:  WIEAQDLLKKKLIKED----ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAP

Query:  QLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQ----LLSEGKNNDSI
         L PQ+++VDGNG+LH R                               GFG+A HLGVL  LP IG+ K L  VDGL  +++ +    LL  G +   +
Subjt:  QLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQ----LLSEGKNNDSI

Query:  ITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ
        I   G SG + G+A+RS   S KP+YVS+GHR+SL  A+R+    C++R+PEPIRQ
Subjt:  ITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ

Q8N8Q3 Endonuclease V1.0e-4339.69Show/hide
Query:  WIEAQDLLKKKLIKED----ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAP
        W   Q  LK  ++  D    + +     L+ +GGVD+SF+K DS  AC +LVVL    L+VVY++  +V++  PYV GFLAFRE P LLEL++++R++ P
Subjt:  WIEAQDLLKKKLIKED----ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKRAP

Query:  QLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSS-----VRQLLSEGKNNDS
         L PQ+L+VDGNG+LH R                               GFG+A HLGVL +LP +G+ K L  VDGL  ++     +R L + G +   
Subjt:  QLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSS-----VRQLLSEGKNNDS

Query:  IITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ
           L G SG + G+A+RS   S +P+Y+S+GHR+SL  A+R+    C++R+PEP+RQ
Subjt:  IITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ

Arabidopsis top hitse value%identityAlignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).2.4e-0827.79Show/hide
Query:  NTNSDSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKE
        N      +VS+TQTHQ+AARKE+QM+ FRAALGL    D +QV EE        REG    +K  E+ EHSFLDR+   KK   ++  D+KD + + SK+
Subjt:  NTNSDSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNWKKRGTEDQYDDKDDQKRVSKE

Query:  LKG------------HQKDRKRRPKDDSSGTD--SGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGER------KGT
         +G             +K+ K+R  DDSS +D    +R+   K     R+ +SESD  +                  DS+  SD+D G +R      K T
Subjt:  LKG------------HQKDRKRRPKDDSSGTD--SGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGER------KGT

Query:  KKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAK---NTRHDRYDSDSFTD
        KK  R KR    ES+     D K +  + HKK+    S+ S + +   +H +  +  +       S+P+++ +K+   KK+       +  R D D   D
Subjt:  KKHMRDKRRSDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAK---NTRHDRYDSDSFTD

Query:  GDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGK-RYGVESD----SSDHDSSDSDVARKKSM--DRYDSKRIGKSMV
          +       K + R       +++++KQ  S      +G+  K ++ +   +HGK +Y  +S     + D D S+++   +K +  + Y   R  K   
Subjt:  GDEFGMGSHKKGSGRHKSQKVKKHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGK-RYGVESD----SSDHDSSDSDVARKKSM--DRYDSKRIGKSMV

Query:  DSESDSEKSRKHPKKDVGRRRHDI---DDEKSG------GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGND
        D ++D+    ++   D  +R   I   DD   G      GD+   R R +R +  D+ EE ++ GR  R    G+ A  ++ DD D          +G  
Subjt:  DSESDSEKSRKHPKKDVGRRRHDI---DDEKSG------GDEIAKRCRGKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGND

Query:  KHKRAKKYSSG
        ++  ++  SSG
Subjt:  KHKRAKKYSSG

AT4G31150.1 endonuclease V family protein2.8e-7355.51Show/hide
Query:  SEASTTSSVEIQNWIEAQDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLA
        S  S++S  +++ W E QD LKKKLI  D          EL    + LKY+GGVD+SF KEDSSVAC  LVVL+L +L+VV+ DFSL+ + VPYVPGFLA
Subjt:  SEASTTSSVEIQNWIEAQDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLA

Query:  FREAPVLLELLERMRKRAPQLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQS
        FREAPVLL++L++MR      YPQ+LMVDGNGILHPR                               GFGLA HLGVLA+LPTIG+GKNLHHVDGL QS
Subjt:  FREAPVLLELLERMRKRAPQLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQS

Query:  SVRQLLS-EGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ
         V+Q L  +   ++  ITL G SG  WGV  R T+ SLKPIYVS+GHR+SL++A+ +VKITCKYRVPEPIRQ
Subjt:  SVRQLLS-EGKNNDSIITLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ

AT4G31150.2 endonuclease V family protein2.9e-7057.25Show/hide
Query:  QDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKR
        QD LKKKLI  D          EL    + LKY+GGVD+SF KEDSSVAC  LVVL+L +L+VV+ DFSL+ + VPYVPGFLAFREAPVLL++L++MR  
Subjt:  QDLLKKKLIKED----------ELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLLELLERMRKR

Query:  APQLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLS-EGKNNDSII
            YPQ+LMVDGNGILHPR                               GFGLA HLGVLA+LPTIG+GKNLHHVDGL QS V+Q L  +   ++  I
Subjt:  APQLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLS-EGKNNDSII

Query:  TLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ
        TL G SG  WGV  R T+ SLKPIYVS+GHR+SL++A+ +VKITCKYRVPEPIRQ
Subjt:  TLKGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTCCGAAGAAGAGAAACAAGCGAAATCGTATTCCGAGGCGTCAACGACTTCCTCGGTCGAAATTCAGAACTGGATAGAAGCGCAGGACTTGCTGAAGAAGAAACT
AATCAAAGAGGATGAGTTGGAAGGGGAAGTGGATGATTTGAAGTATATCGGCGGCGTTGATATAAGCTTCTTAAAGGAAGATTCATCAGTTGCGTGTGGTACGCTCGTGG
TTTTGGATCTCCAAACTCTTCAAGTTGTCTATGATGATTTCTCTCTTGTTACCGTTCAAGTTCCTTATGTCCCTGGCTTTCTAGCATTTAGAGAGGCCCCAGTTCTCTTG
GAGCTTTTGGAGAGGATGAGAAAGAGAGCTCCTCAACTGTATCCACAGCTATTGATGGTTGATGGAAACGGAATACTCCATCCCAGAGGTATTTTTTTCCTTCAAAATCT
TTACTGGTGGAGTAGTATCAACATTATATGCTATCTTAGAACACTGACCATTCCATGGGAGGAACGTGCAGGTTTTGGTCTGGCCAGTCATCTTGGCGTTCTTGCTAATT
TGCCTACGATTGGAATTGGCAAGAATCTGCATCATGTTGATGGTCTTACCCAGTCCAGTGTGAGGCAACTTCTTTCCGAGGGCAAGAATAATGATTCTATAATAACTTTG
AAGGGTATCTCTGGATGCATTTGGGGTGTGGCTATGAGATCTACTGTTGATTCATTGAAGCCCATATACGTTTCAATTGGTCATCGTGTTTCACTCAACACTGCCATCAG
GATTGTTAAAATCACATGCAAATATCGTGTTCCAGAACCTATCAGACAGGTTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGGGAGGGTCGGAATGCTGATACTA
AGCGTCATGAGAAGTCTGAACATTCTTTTTTGGACAGAGAATTGAAGTGGAAAAAGCATGGCACTGAAGATCAGTACGATAGTGATGATTCTTCTGATACTGATTCTGGT
GGAGAGGTCAAGAAAACCAAGAAGAATAAGAGAACTAATCAAAGAACAGCCAGTGATGTTGACAAGAAATACAGCTCAAAGAAGCAGAAGAAAAACACAAATTCTGATTC
AGGAAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATGAAAACATTTAGAGCTGCTCTTGGGTTGGGTTCATCGGACGATACTGAACAGG
TTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGAGAGGGTCAGAATTCTGATATTAAGCGTCATGAGAAGTCTGAACATTCTTTTTTGGACAGAGAATTGAACTGG
AAAAAGCGTGGCACTGAAGATCAGTATGATGATAAGGATGACCAAAAAAGGGTTTCGAAAGAGTTGAAAGGTCATCAGAAGGATAGAAAAAGAAGGCCCAAGGATGATTC
TTCTGGCACCGATTCTGGTGAGCGTAAGGGAACCAAGAAGAACTTGAGAGACAGTAGAAGGAATGATTCTGAAAGTGACCCTGACAATGATGTTGGCAACAAATATGTCG
CCTCAAGGAAGTCTAAAAAAAATAGAAAGCATGATAGTGATGATTCTTCTGATACTGATTCTGGTGGTGAGCGCAAGGGAACGAAGAAGCACATGAGAGATAAACGGAGA
TCTGATCCTGAAAGTGACCCAGACAGTGATTTTGACCAGAAATATATCACCTCGAGGAAGCATAAGAAGAACAGAAGGCATGATAGTGATGATTCTTCTAATACTGATTC
TGGTGGAGAGCACAAGAAAACCAAGAAGAATATGAAAAATAATCAAAGAGATCATGGAAGTGATCCCGACAATGATGTTGATAAGAAATACACCTCAAAGAAGCAGGCGA
AAAACACAAGGCATGATAGGTATGATTCTGATTCATTTACAGACGGTGATGAGTTTGGGATGGGCAGCCACAAGAAAGGATCGGGTAGACATAAAAGTCAAAAGGTGAAG
AAGCATAGAAGCCGGAAACAGGAGTCTTCTGATGAATCCAATTCTGACAGTGGGATTGATGATAAACACAGGCAACTGAAGCACAAAAACCAGCATGGTAAAAGATATGG
GGTAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGCTCGCAAGAAGAGTATGGATAGGTATGACAGCAAACGTATAGGAAAGAGCATGGTAGATAGTG
AATCTGATTCTGAGAAGTCAAGAAAGCATCCTAAGAAAGATGTTGGGAGACGCAGACATGATATTGATGATGAAAAAAGTGGTGGTGATGAAATAGCGAAGAGGTGCAGA
GGTAAGAGGCACAATACTGATGATGAATCTGAAGAAGGTGAATATTTTGGTAGAAGTGGTAGGACAGCCACAAAAGGAAAAATAGCTGCTAAAAGGCAACATGATGACAG
TGATAATTCTGATGATAGCCTAACAGTTGATAGAAAGGGCAATGATAAACACAAGAGAGCTAAGAAATATTCGTCGGGTGACGGTTTTGATCTAGAGAAGGGAGTAAAAT
CAAGCAGTGGAGCTCATGGAAGAGGAAAAGGGAACCTAAATCATTCAGAAGGTAGGAGGCACAATACTGATGATAAATCTGAAGAAGAAGAAGGTGAATATTTTGGTAGA
AGTGATAAGATAGCTACAAAAGGAAAAATAGATGCTAAAAGGCAACATGACGACAATGATAATTCTGATGATAGCCTAGCAGTTGATAGAAAGGGCAATGATAAACACAA
GAGAGCTAAAAAATATTCATCGAGTGACGATTCTGATATAGAGAATGGAGTAAAATCAAGTGGTGGAGCTCGTGAAAGGGGAAAAAGGAACTTAAATCATACAGATGGTT
TGGACAAGTTTAAGAAAGATTCTATCAATGAGTTCAACCATGCAAGTCAACATATAGATACAATGAAAAGCAAGAGAAAGTTTGATGAAGGTGAAAATGAGCAGCAGCTA
GAGTCAAGGGATCGACGGCACAGGGAAGACTCCAAAAGAGAGTCAGATTTCCATGGTGACCCCAAGAAAGATTTCAAAAATGATTCTGAATCAAGCAGAAGAGCACACAG
TGGTAGGTACGATGAGACAAGGGATCGACGATACAGGGTAGACCCCAAAATTGACTCTGAATCAAACACTAGATCACGCTATAGTGCACACGACGAGGATGATGACAGAA
AGTCAACTCGAACAGGAAGCAGATATACTGAAGAAACAGAGCATGGAAGTAGACATCATCGCAAGGCTAACGAGTCTCATCATCGCAGTAGGACTGATCAAGATACTGAA
GAGGAAAAAAGGCACATCAGATATGAGGAGCCTAGAGGGAGAAAGCATGAAAGAGATGATGGTCTAAAATCGAGCAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAG
CAGGCTGAGATCTGAGAAAGATTATGAAAGTAGAGAATCTACAAGAGATAGGGATGATTCCAGAAAGAGGGCCAAATATGATTCTCGATCAAGCAGACGTGACAATTATT
AA
mRNA sequenceShow/hide mRNA sequence
ATGATGTCCGAAGAAGAGAAACAAGCGAAATCGTATTCCGAGGCGTCAACGACTTCCTCGGTCGAAATTCAGAACTGGATAGAAGCGCAGGACTTGCTGAAGAAGAAACT
AATCAAAGAGGATGAGTTGGAAGGGGAAGTGGATGATTTGAAGTATATCGGCGGCGTTGATATAAGCTTCTTAAAGGAAGATTCATCAGTTGCGTGTGGTACGCTCGTGG
TTTTGGATCTCCAAACTCTTCAAGTTGTCTATGATGATTTCTCTCTTGTTACCGTTCAAGTTCCTTATGTCCCTGGCTTTCTAGCATTTAGAGAGGCCCCAGTTCTCTTG
GAGCTTTTGGAGAGGATGAGAAAGAGAGCTCCTCAACTGTATCCACAGCTATTGATGGTTGATGGAAACGGAATACTCCATCCCAGAGGTATTTTTTTCCTTCAAAATCT
TTACTGGTGGAGTAGTATCAACATTATATGCTATCTTAGAACACTGACCATTCCATGGGAGGAACGTGCAGGTTTTGGTCTGGCCAGTCATCTTGGCGTTCTTGCTAATT
TGCCTACGATTGGAATTGGCAAGAATCTGCATCATGTTGATGGTCTTACCCAGTCCAGTGTGAGGCAACTTCTTTCCGAGGGCAAGAATAATGATTCTATAATAACTTTG
AAGGGTATCTCTGGATGCATTTGGGGTGTGGCTATGAGATCTACTGTTGATTCATTGAAGCCCATATACGTTTCAATTGGTCATCGTGTTTCACTCAACACTGCCATCAG
GATTGTTAAAATCACATGCAAATATCGTGTTCCAGAACCTATCAGACAGGTTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGGGAGGGTCGGAATGCTGATACTA
AGCGTCATGAGAAGTCTGAACATTCTTTTTTGGACAGAGAATTGAAGTGGAAAAAGCATGGCACTGAAGATCAGTACGATAGTGATGATTCTTCTGATACTGATTCTGGT
GGAGAGGTCAAGAAAACCAAGAAGAATAAGAGAACTAATCAAAGAACAGCCAGTGATGTTGACAAGAAATACAGCTCAAAGAAGCAGAAGAAAAACACAAATTCTGATTC
AGGAAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATGAAAACATTTAGAGCTGCTCTTGGGTTGGGTTCATCGGACGATACTGAACAGG
TTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGAGAGGGTCAGAATTCTGATATTAAGCGTCATGAGAAGTCTGAACATTCTTTTTTGGACAGAGAATTGAACTGG
AAAAAGCGTGGCACTGAAGATCAGTATGATGATAAGGATGACCAAAAAAGGGTTTCGAAAGAGTTGAAAGGTCATCAGAAGGATAGAAAAAGAAGGCCCAAGGATGATTC
TTCTGGCACCGATTCTGGTGAGCGTAAGGGAACCAAGAAGAACTTGAGAGACAGTAGAAGGAATGATTCTGAAAGTGACCCTGACAATGATGTTGGCAACAAATATGTCG
CCTCAAGGAAGTCTAAAAAAAATAGAAAGCATGATAGTGATGATTCTTCTGATACTGATTCTGGTGGTGAGCGCAAGGGAACGAAGAAGCACATGAGAGATAAACGGAGA
TCTGATCCTGAAAGTGACCCAGACAGTGATTTTGACCAGAAATATATCACCTCGAGGAAGCATAAGAAGAACAGAAGGCATGATAGTGATGATTCTTCTAATACTGATTC
TGGTGGAGAGCACAAGAAAACCAAGAAGAATATGAAAAATAATCAAAGAGATCATGGAAGTGATCCCGACAATGATGTTGATAAGAAATACACCTCAAAGAAGCAGGCGA
AAAACACAAGGCATGATAGGTATGATTCTGATTCATTTACAGACGGTGATGAGTTTGGGATGGGCAGCCACAAGAAAGGATCGGGTAGACATAAAAGTCAAAAGGTGAAG
AAGCATAGAAGCCGGAAACAGGAGTCTTCTGATGAATCCAATTCTGACAGTGGGATTGATGATAAACACAGGCAACTGAAGCACAAAAACCAGCATGGTAAAAGATATGG
GGTAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGCTCGCAAGAAGAGTATGGATAGGTATGACAGCAAACGTATAGGAAAGAGCATGGTAGATAGTG
AATCTGATTCTGAGAAGTCAAGAAAGCATCCTAAGAAAGATGTTGGGAGACGCAGACATGATATTGATGATGAAAAAAGTGGTGGTGATGAAATAGCGAAGAGGTGCAGA
GGTAAGAGGCACAATACTGATGATGAATCTGAAGAAGGTGAATATTTTGGTAGAAGTGGTAGGACAGCCACAAAAGGAAAAATAGCTGCTAAAAGGCAACATGATGACAG
TGATAATTCTGATGATAGCCTAACAGTTGATAGAAAGGGCAATGATAAACACAAGAGAGCTAAGAAATATTCGTCGGGTGACGGTTTTGATCTAGAGAAGGGAGTAAAAT
CAAGCAGTGGAGCTCATGGAAGAGGAAAAGGGAACCTAAATCATTCAGAAGGTAGGAGGCACAATACTGATGATAAATCTGAAGAAGAAGAAGGTGAATATTTTGGTAGA
AGTGATAAGATAGCTACAAAAGGAAAAATAGATGCTAAAAGGCAACATGACGACAATGATAATTCTGATGATAGCCTAGCAGTTGATAGAAAGGGCAATGATAAACACAA
GAGAGCTAAAAAATATTCATCGAGTGACGATTCTGATATAGAGAATGGAGTAAAATCAAGTGGTGGAGCTCGTGAAAGGGGAAAAAGGAACTTAAATCATACAGATGGTT
TGGACAAGTTTAAGAAAGATTCTATCAATGAGTTCAACCATGCAAGTCAACATATAGATACAATGAAAAGCAAGAGAAAGTTTGATGAAGGTGAAAATGAGCAGCAGCTA
GAGTCAAGGGATCGACGGCACAGGGAAGACTCCAAAAGAGAGTCAGATTTCCATGGTGACCCCAAGAAAGATTTCAAAAATGATTCTGAATCAAGCAGAAGAGCACACAG
TGGTAGGTACGATGAGACAAGGGATCGACGATACAGGGTAGACCCCAAAATTGACTCTGAATCAAACACTAGATCACGCTATAGTGCACACGACGAGGATGATGACAGAA
AGTCAACTCGAACAGGAAGCAGATATACTGAAGAAACAGAGCATGGAAGTAGACATCATCGCAAGGCTAACGAGTCTCATCATCGCAGTAGGACTGATCAAGATACTGAA
GAGGAAAAAAGGCACATCAGATATGAGGAGCCTAGAGGGAGAAAGCATGAAAGAGATGATGGTCTAAAATCGAGCAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAG
CAGGCTGAGATCTGAGAAAGATTATGAAAGTAGAGAATCTACAAGAGATAGGGATGATTCCAGAAAGAGGGCCAAATATGATTCTCGATCAAGCAGACGTGACAATTATT
AA
Protein sequenceShow/hide protein sequence
MMSEEEKQAKSYSEASTTSSVEIQNWIEAQDLLKKKLIKEDELEGEVDDLKYIGGVDISFLKEDSSVACGTLVVLDLQTLQVVYDDFSLVTVQVPYVPGFLAFREAPVLL
ELLERMRKRAPQLYPQLLMVDGNGILHPRGIFFLQNLYWWSSINIICYLRTLTIPWEERAGFGLASHLGVLANLPTIGIGKNLHHVDGLTQSSVRQLLSEGKNNDSIITL
KGISGCIWGVAMRSTVDSLKPIYVSIGHRVSLNTAIRIVKITCKYRVPEPIRQVKEEISDPSRNRREGRNADTKRHEKSEHSFLDRELKWKKHGTEDQYDSDDSSDTDSG
GEVKKTKKNKRTNQRTASDVDKKYSSKKQKKNTNSDSGRVSDTQTHQIAARKEEQMKTFRAALGLGSSDDTEQVKEEISDPSRNRREGQNSDIKRHEKSEHSFLDRELNW
KKRGTEDQYDDKDDQKRVSKELKGHQKDRKRRPKDDSSGTDSGERKGTKKNLRDSRRNDSESDPDNDVGNKYVASRKSKKNRKHDSDDSSDTDSGGERKGTKKHMRDKRR
SDPESDPDSDFDQKYITSRKHKKNRRHDSDDSSNTDSGGEHKKTKKNMKNNQRDHGSDPDNDVDKKYTSKKQAKNTRHDRYDSDSFTDGDEFGMGSHKKGSGRHKSQKVK
KHRSRKQESSDESNSDSGIDDKHRQLKHKNQHGKRYGVESDSSDHDSSDSDVARKKSMDRYDSKRIGKSMVDSESDSEKSRKHPKKDVGRRRHDIDDEKSGGDEIAKRCR
GKRHNTDDESEEGEYFGRSGRTATKGKIAAKRQHDDSDNSDDSLTVDRKGNDKHKRAKKYSSGDGFDLEKGVKSSSGAHGRGKGNLNHSEGRRHNTDDKSEEEEGEYFGR
SDKIATKGKIDAKRQHDDNDNSDDSLAVDRKGNDKHKRAKKYSSSDDSDIENGVKSSGGARERGKRNLNHTDGLDKFKKDSINEFNHASQHIDTMKSKRKFDEGENEQQL
ESRDRRHREDSKRESDFHGDPKKDFKNDSESSRRAHSGRYDETRDRRYRVDPKIDSESNTRSRYSAHDEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHRSRTDQDTE
EEKRHIRYEEPRGRKHERDDGLKSSREVERGEYQPSSRLRSEKDYESRESTRDRDDSRKRAKYDSRSSRRDNY