; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg015260 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg015260
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNpa1 domain-containing protein
Genome locationscaffold3:39356369..39368566
RNA-Seq ExpressionSpg015260
SyntenySpg015260
Gene Ontology termsGO:0000463 - maturation of LSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (biological process)
GO:0000466 - maturation of 5.8S rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021714 - Nucleolar pre-ribosomal-associated protein 1, N-terminal
IPR026960 - Reverse transcriptase zinc-binding domain
IPR039844 - Nucleolar pre-ribosomal-associated protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593037.1 TSET complex member tstF, partial [Cucurbita argyrosperma subsp. sororia]1.0e-30584.78Show/hide
Query:  TSLDNTHSLSPQSP--LATTSRCSRQIWLFRLTHRVGKG----------KFMDTRLNLGLMDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKG
        +S+ N+ S +  +P   A  S   +QIW FRLT ++ +G          + +DT  NLGLMD INANLEAKLKELLFKINS EIKICSDATKEFIKLLKG
Subjt:  TSLDNTHSLSPQSP--LATTSRCSRQIWLFRLTHRVGKG----------KFMDTRLNLGLMDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKG

Query:  EIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERISTSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMS
        EIGCKLLHLYAKTSPKC ELLDAWKLQRGKAGM YIFSLVSAILSHPDGIY LNDLERISTSRVLDM+ARSL EECLGDINSELGSQE+K +NAALLLMS
Subjt:  EIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERISTSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMS

Query:  SIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVL
        SIVRRGSRLA  VAKNFDFKLR FSKLTEFRQK NQKGSK SSRKLFIGFAMSFLEVGKPELLRW+LQQREMYSGVLRGLANDDE TIIYVLSTLRDKVL
Subjt:  SIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVL

Query:  VDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPT
        V+ESLVPPGLRSVLFGSVTLEQLATICGRENGGPA E AYQVLTMVCTDPCNGLMPDLKR PNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIR QPT
Subjt:  VDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPT

Query:  FCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALK
        FCSTYLEEFPYNLEDFLSPTWFS+VSLTVKLVSSV +GL +GSIDSQ DD TSLDN  +K+I+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLL+ALK
Subjt:  FCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALK

Query:  LLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDI
        ++NSFF VLN+ SS NKQ M +WLSLKQELQNEVQTLLPD QVLLTLLSS ASQSRVQAV LKRASGLE SFHGVK+LKTTS DHDTDIVV G+VSAPDI
Subjt:  LLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDI

Query:  DEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY
        DEKM+D+C+VETSEKERELMIS+AELWDLDPLST+VEVNDVEMYF SK+LDALTIY+
Subjt:  DEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY

KAG7025444.1 Nucleolar pre-ribosomal-associated protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]2.9e-30582.38Show/hide
Query:  PPVPLSPPSQRSLFIATSLDNTHSLSPQSPLATTSRCSRQIWLFRLTHRVGKG--------------------KFMDTRLNLGLMDAINANLEAKLKELL
        P    +PP+  S    + L     L P  PL  +    +QIW FRLT ++ +G                     F DT  NLGLMD INANLEAKLKELL
Subjt:  PPVPLSPPSQRSLFIATSLDNTHSLSPQSPLATTSRCSRQIWLFRLTHRVGKG--------------------KFMDTRLNLGLMDAINANLEAKLKELL

Query:  FKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERISTSRVLDMLARSLFEEC
        FKINS EIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLDAWKLQRGKAGM YIFSLVSAILSHPDGIY LNDLERISTSRVLDM+ARSL EEC
Subjt:  FKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERISTSRVLDMLARSLFEEC

Query:  LGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGV
        LGDINSELGSQE+K +NAALLLMSSIVRRGSRLA  VAKNFDFKLR FSKLTEFRQK NQKGSK SSRKLFIGFAMSFLEVGKPELLRW+LQQREMYSGV
Subjt:  LGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGV

Query:  LRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLM
        LRGLANDDE TIIYVLSTLRDKVLV+ESLVPPGLRSVLFGSVTLEQLATICGRENGGPA E AYQVLTMVCTDPCNGLMPDLKR PNPLKGNPKRLMDLM
Subjt:  LRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLM

Query:  KKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVI
        KKLKATGVIYHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFS+VSLTVKLVSSV +GL +GSIDSQ DD TSLDN  +K+I+RCLSSRPFSRSVI
Subjt:  KKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVI

Query:  NKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVK
        NKGLLHSNILVKHGTLRLLL+ALK++NSFF VLN+ SS NKQ M +WLSLKQELQNEVQTLLPD QVLLTLLSS ASQSRVQAV LKRASGLE SFHGVK
Subjt:  NKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVK

Query:  KLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY
        +LKTTS DHDTDIVV G+VSAPDIDEKM+D+C+VETSEKERELMIS+AELWDLDPLST+VEVNDVEMYF SK+LDALTIY+
Subjt:  KLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY

XP_022156773.1 uncharacterized protein LOC111023605 isoform X1 [Momordica charantia]2.6e-30190.6Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLD WKLQRGKAG+PYIFSLVSAILSHPDG Y LNDLERIS
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDMLARSL EECLGDIN+ELGSQEVKRQNAALLLMSSIVRRGSR+A EVAKNFDFK R FSKL E+RQK NQKGSKHSSRKLF+GFAMSFLEV KP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQ+EMYSGVLRGLANDDEETIIY+LSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICG ENGGP  EAAYQVL +VCTDPCNGLMPDLKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFSMVSLT+KLVS V +GL +GS DSQ DDITSL+N YMK
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        SI+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKL+NSF G LNK  SVNKQMMS+WLSL QE+QNEVQTLLPDPQVLLTLLSSLASQSRVQAV
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY
         LKRASGLE SFHGVKKLKTTS DHDTDIVV GVVSAP   +K+VDICTVETSEKERELMIS+AELWDLDPLST+VEVNDVEMYFLSKLLDALTIY
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY

XP_022156774.1 uncharacterized protein LOC111023605 isoform X2 [Momordica charantia]2.6e-30190.6Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLD WKLQRGKAG+PYIFSLVSAILSHPDG Y LNDLERIS
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDMLARSL EECLGDIN+ELGSQEVKRQNAALLLMSSIVRRGSR+A EVAKNFDFK R FSKL E+RQK NQKGSKHSSRKLF+GFAMSFLEV KP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQ+EMYSGVLRGLANDDEETIIY+LSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICG ENGGP  EAAYQVL +VCTDPCNGLMPDLKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFSMVSLT+KLVS V +GL +GS DSQ DDITSL+N YMK
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        SI+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKL+NSF G LNK  SVNKQMMS+WLSL QE+QNEVQTLLPDPQVLLTLLSSLASQSRVQAV
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY
         LKRASGLE SFHGVKKLKTTS DHDTDIVV GVVSAP   +K+VDICTVETSEKERELMIS+AELWDLDPLST+VEVNDVEMYFLSKLLDALTIY
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY

XP_022959834.1 uncharacterized protein LOC111460777 [Cucurbita moschata]1.5e-29889.95Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MD INANLEAKLKELLFKINS EIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLDAWKLQRGKAGM YIFSLVSAILSHPDGIY LNDLERIS
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDM+ARSL EECLGDINSELGSQE+K +NAALLLMSSIVRRGSRLA  VAKNFDFKLR FSKLTEFRQK NQKGSK SSRKLFIGFAMSFLEVGKP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQREMYSGVLRGLANDDE TIIYVLSTLRDKVLV+ESLVPPGLRSVLFGSVTLEQLATICGRENGGPA E AYQVLTMVCTDPCNGLMPDLKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFS+VSLTVKLVSSV +GL +GSIDSQ DD TSLDN  +K
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        +I+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLL+ALK++NSFF VLN+ SS NKQ M +WLSLKQELQNEVQTLLPD QVLLTLLSS ASQSRVQAV
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY
         LKRASGLE SFHGVK+LKTTS DHDTDIVV G+VSAPDIDEKM+D+C+VETSEKERELMIS+AELWDLDPLST+VEVNDVEMYF SK+LDALTIY+
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY

TrEMBL top hitse value%identityAlignment
A0A0A0K5A8 Uncharacterized protein2.3e-28486.43Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MDA NANLEAKLKELLFKINS E+KICSDATKEFIKLL G+ GCKLL+LYAKTSPKC ELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIY +NDLER+S
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDMLARSL EECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLA +VAKNFDFKLR FSKLTEFRQK +QK SKHSSRKLF+GFAMSFLEVGKP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQRE+Y+GVLRGLANDDEETI YVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATIC RENGG A E AYQVLTMVCTDPCNGLMP LKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRL+DLMKKLKATGVIYHRDLLLAIIR QP FCSTYLEEFPYNLEDFLS  WFS+VSL VKLVSSV SGL   SI SQ DD T  D+TY+K
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        SI+RCLSSRPF+RS INKGLLHSNILVKHGTLRLLLEALKL++S F VLNK SS+N + M +WLSLKQEL+NEVQ LLPDPQVLLTLLSSLASQSRVQ V
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY
         LKR SGLERSFHGVKKLKTTS D DTDI+V GVVS PDIDEKM DICTVETSE ERELMISVAELWDLDPLS +VEV D EMYF+SKLL+ LTIY+
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY

A0A6J1DRJ3 uncharacterized protein LOC111023605 isoform X11.2e-30190.6Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLD WKLQRGKAG+PYIFSLVSAILSHPDG Y LNDLERIS
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDMLARSL EECLGDIN+ELGSQEVKRQNAALLLMSSIVRRGSR+A EVAKNFDFK R FSKL E+RQK NQKGSKHSSRKLF+GFAMSFLEV KP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQ+EMYSGVLRGLANDDEETIIY+LSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICG ENGGP  EAAYQVL +VCTDPCNGLMPDLKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFSMVSLT+KLVS V +GL +GS DSQ DDITSL+N YMK
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        SI+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKL+NSF G LNK  SVNKQMMS+WLSL QE+QNEVQTLLPDPQVLLTLLSSLASQSRVQAV
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY
         LKRASGLE SFHGVKKLKTTS DHDTDIVV GVVSAP   +K+VDICTVETSEKERELMIS+AELWDLDPLST+VEVNDVEMYFLSKLLDALTIY
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY

A0A6J1DUK8 uncharacterized protein LOC111023605 isoform X21.2e-30190.6Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLD WKLQRGKAG+PYIFSLVSAILSHPDG Y LNDLERIS
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDMLARSL EECLGDIN+ELGSQEVKRQNAALLLMSSIVRRGSR+A EVAKNFDFK R FSKL E+RQK NQKGSKHSSRKLF+GFAMSFLEV KP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQ+EMYSGVLRGLANDDEETIIY+LSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICG ENGGP  EAAYQVL +VCTDPCNGLMPDLKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFSMVSLT+KLVS V +GL +GS DSQ DDITSL+N YMK
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        SI+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKL+NSF G LNK  SVNKQMMS+WLSL QE+QNEVQTLLPDPQVLLTLLSSLASQSRVQAV
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY
         LKRASGLE SFHGVKKLKTTS DHDTDIVV GVVSAP   +K+VDICTVETSEKERELMIS+AELWDLDPLST+VEVNDVEMYFLSKLLDALTIY
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIY

A0A6J1H985 uncharacterized protein LOC1114607777.5e-29989.95Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MD INANLEAKLKELLFKINS EIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLDAWKLQRGKAGM YIFSLVSAILSHPDGIY LNDLERIS
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDM+ARSL EECLGDINSELGSQE+K +NAALLLMSSIVRRGSRLA  VAKNFDFKLR FSKLTEFRQK NQKGSK SSRKLFIGFAMSFLEVGKP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQREMYSGVLRGLANDDE TIIYVLSTLRDKVLV+ESLVPPGLRSVLFGSVTLEQLATICGRENGGPA E AYQVLTMVCTDPCNGLMPDLKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFS+VSLTVKLVSSV +GL +GSIDSQ DD TSLDN  +K
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        +I+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLL+ALK++NSFF VLN+ SS NKQ M +WLSLKQELQNEVQTLLPD QVLLTLLSS ASQSRVQAV
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY
         LKRASGLE SFHGVK+LKTTS DHDTDIVV G+VSAPDIDEKM+D+C+VETSEKERELMIS+AELWDLDPLST+VEVNDVEMYF SK+LDALTIY+
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY

A0A6J1KZ52 uncharacterized protein LOC1114976772.4e-29789.61Show/hide
Query:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS
        MD INANLEAKLKELLFKINS EIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKC ELLDAWKLQRGKAGM YIFSLVSAILSHPDGIY LNDLERIS
Subjt:  MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERIS

Query:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP
        TSRVLDM+ARSL EECLGDINSELGSQE+K +NAALLLMSSIVRRGSRLA  VAKNFDFKLR FSKLTEFRQK NQKGSK SSRKLFIGFAMSFLEVGKP
Subjt:  TSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKP

Query:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR
        ELLRW+LQQREMYSGVLRGLANDDE TIIYVLSTLRDKVLV+ESLVPPGLRSVLFGSVTLEQLATICGRENGGPA E AYQVLTMVCTDPCNGLMPDLKR
Subjt:  ELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKR

Query:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK
         PNPLKGNPKRLMDLMKKLKATGV YHRDLLLAIIR QPTFCSTYLEEFPYNLEDFLSPTWFS+VSLTVKLVSSV + L +GSIDSQ DD TSLDN  +K
Subjt:  FPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMK

Query:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV
        +I+RCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALK++NSFF VLN+ SS NKQ M +WLSLKQELQNEVQTLLPD QVLLTLLSS ASQSRVQAV
Subjt:  SIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAV

Query:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY
         LKRASGLE  FHGVK+LKTTS DHDTDIVV G+VSAPDIDEKM+D+C+VETSEKERELMIS+AELWDLDPLST+VEVNDVEMYF SK+LDALTIY+
Subjt:  RLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKMVDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYY

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.2e-0526.06Show/hide
Query:  TTRKDIRVWSPDPSLGFSCRFFFRCLL---DPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNC
        T  +D   W       FS R  +  L     P  +  S F+ LWKV+VP++V+ F+W V +  V T +   R+   L     C +C+   E + H+L +C
Subjt:  TTRKDIRVWSPDPSLGFSCRFFFRCLL---DPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNC

Query:  AFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTF
             +W         Q    + L E + + L      E     W      IIW  W  R    F
Subjt:  AFARTVWDEFFGSFGLQFARHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTF

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)4.1e-11644.98Show/hide
Query:  KFMDTRLNLGL---MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSH
        K+ D  LN  L   + A   +LEAKL++LL  I   E K+CSD  K+F+KLLKGE G  LL LY ++SP   ELL+AW L  GK G+ YIFSL+  ILSH
Subjt:  KFMDTRLNLGL---MDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSH

Query:  PDGIYCLNDLERISTSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKL
        P+G       +     R LD   R L E+ L DI   L S   + QNAAL L++SIVRRG  +A E+A+ FDF   GF               K + R+ 
Subjt:  PDGIYCLNDLERISTSRVLDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKL

Query:  FIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMV
        F+ FA+SFL+VGKP LL+ IL+++++YS +L+GL  DD++T+  VLSTL+DK+LV ES + P L S LFG  TLEQL  I  RE+GG   E AY VL  V
Subjt:  FIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMV

Query:  CTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDS
        CTDP NGLMPD  R     KGN KRL+ LMK LKAT   Y RDLLLAIIR +P+  S + +EFPYN+EDF SP WFS +SL   LVSSVR      S D 
Subjt:  CTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDS

Query:  QLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLT
           D      + + +IM+C+  RPFS+S+I +G+ HS  LVKHGTLR L E L+L +SF     K  SV++       SL++++  EV +  PD QVL T
Subjt:  QLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLT

Query:  LLSSLASQSRVQAVRLKRASGLERSFHGVKK-LKTTS----SDHDTDIVVGGVVSAPDI--DEKMVDICTVETSEKERELMISVAELWDLDPLS-TIVEV
        +L         Q + LKR + L+      KK LKT+      +  +D+V+GG+ S  +I  +E   D    +  + E E +  V+E+W  +  S  I  V
Subjt:  LLSSLASQSRVQAVRLKRASGLERSFHGVKK-LKTTS----SDHDTDIVVGGVVSAPDI--DEKMVDICTVETSEKERELMISVAELWDLDPLS-TIVEV

Query:  NDVEMYFLSKLLDALTIY
        ++ EM+F  KLLD L IY
Subjt:  NDVEMYFLSKLLDALTIY

AT2G02650.1 Ribonuclease H-like superfamily protein2.4e-0725.36Show/hide
Query:  LDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRL-SRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFARHRGLRETI
        + P      +   +WK+ V  K++ F+W+ + G + T  RL SR I +   P C   C + EE + H+++NC + ++VW       G Q+       + +
Subjt:  LDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRL-SRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFARHRGLRETI

Query:  EEFLPHPPFREQG---KFL--WQAGICAIIWGLWGERN
           +     +      +FL  W      I+W LW  RN
Subjt:  EEFLPHPPFREQG---KFL--WQAGICAIIWGLWGERN

AT4G27010.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)6.2e-10448.46Show/hide
Query:  LAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPP
        +A E+AK FDFK  GF+KL E+  +  +K  KHS+RK F+GFA+SFLEVGKP LL  +L ++EMYS VL GL  DD++T+  VLSTL+DK+LV+ESL+ P
Subjt:  LAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPP

Query:  GLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEE
        GLRSVLFG VTL+ LA+I  RE+ G   E A+ VL  VCTDP NGLMPD KR    L+GN  RL+ LMK L+A  + YHRDLLLAI+R +P+  S +L+E
Subjt:  GLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEE

Query:  FPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGV
        FPYN+EDF SP+WFS +SL   LVSSVR+      ++           + +++IM+C+  RPFSRS+I KG+LHS+ LVKHGTLR LLE L+LL+SF   
Subjt:  FPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGV

Query:  LNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVKKL-----KTTSSDHDTDIVVGGVVSAPDI--D
         N  SS    +    +SL++ +  EV +  PD QVLL +L SL   S  Q + LKR + L+    G KK      K    +   DIV+GGV S  DI   
Subjt:  LNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVKKL-----KTTSSDHDTDIVVGGVVSAPDI--D

Query:  EKMVDICTVETSEKERELMISVAELWDLDPLSTIVE-VNDVEMYFLSKLLDALTIY
        E  +D    +  + E+E +  V+++W  +  S  ++ V + EM F  KLLDAL IY
Subjt:  EKMVDICTVETSEKERELMISVAELWDLDPLSTIVE-VNDVEMYFLSKLLDALTIY

AT4G27010.2 INVOLVED IN: biological_process unknown6.2e-10448.46Show/hide
Query:  LAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPP
        +A E+AK FDFK  GF+KL E+  +  +K  KHS+RK F+GFA+SFLEVGKP LL  +L ++EMYS VL GL  DD++T+  VLSTL+DK+LV+ESL+ P
Subjt:  LAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYSGVLRGLANDDEETIIYVLSTLRDKVLVDESLVPP

Query:  GLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEE
        GLRSVLFG VTL+ LA+I  RE+ G   E A+ VL  VCTDP NGLMPD KR    L+GN  RL+ LMK L+A  + YHRDLLLAI+R +P+  S +L+E
Subjt:  GLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGVIYHRDLLLAIIREQPTFCSTYLEE

Query:  FPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGV
        FPYN+EDF SP+WFS +SL   LVSSVR+      ++           + +++IM+C+  RPFSRS+I KG+LHS+ LVKHGTLR LLE L+LL+SF   
Subjt:  FPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRLLLEALKLLNSFFGV

Query:  LNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVKKL-----KTTSSDHDTDIVVGGVVSAPDI--D
         N  SS    +    +SL++ +  EV +  PD QVLL +L SL   S  Q + LKR + L+    G KK      K    +   DIV+GGV S  DI   
Subjt:  LNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVKKL-----KTTSSDHDTDIVVGGVVSAPDI--D

Query:  EKMVDICTVETSEKERELMISVAELWDLDPLSTIVE-VNDVEMYFLSKLLDALTIY
        E  +D    +  + E+E +  V+++W  +  S  ++ V + EM F  KLLDAL IY
Subjt:  EKMVDICTVETSEKERELMISVAELWDLDPLSTIVE-VNDVEMYFLSKLLDALTIY

AT4G29090.1 Ribonuclease H-like superfamily protein6.4e-0823.53Show/hide
Query:  SQPS---IFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVW--------------DEFFGSFGLQ
        S+PS   I+  +WK +   K+Q F+W+ +   +     L+ +   L     CI C   +E ++H+L+ C FAR  W              D  + +    
Subjt:  SQPS---IFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVW--------------DEFFGSFGLQ

Query:  FARHRGLRETIEEFLPHPPFREQGKFL-WQAGICAIIWGLWGERNNRTFRGLERDPIEVWSLVKYNVSLW
        F    G          +P + +  + + W      ++W LW  RN   FRG E +  EV    + ++  W
Subjt:  FARHRGLRETIEEFLPHPPFREQGKFL-WQAGICAIIWGLWGERNNRTFRGLERDPIEVWSLVKYNVSLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTCTCCTTCGTTTCTGAGAAAAATCGTGCCGCCGTGGGTTTGCGTCTGACGCTTGTCGCCGGAGAAGGAGAAGACCGAGAGAGGGAGAGATTCGTGTGGGTTTC
GTGTTGTCGCGCGACGTGGATCTTGCTGGTTCACGAAAGAGGGCTTGCCGGTTGGAGGAGAAAAAGAGATGACAATAAATCGAGAAATAAAAGGGAGGCTTGCGGGCTGG
TTCACGGTAAAGGAGAAAAACGCAGGCTTGCCGACGTGGAAGAGAAAAGAGATGAGGGAAGAGAGAGAAGAGATGGGGGGCTTGCTGGTGTGGAGGAGAGAGAAAAAAGA
AAAACCCTAAACCCTTCAATCTTCCTCTCACTCTCACGTGCTCCACCTTCTTCTCCTTCACGCCGGCTCCTCCCATTTCAGCAGCTCGGTGTCGCCGTCGCCGCCGTCGA
CGCCGTCTCGTCTCTCCCGTTCGGTGCTGCTGACGCCGTCATTTTTCCACCAGTTCCGCTTTCGCCTCCTTCCCAGCGCTCTCTCTTCATAGCCACATCACTGGACAACA
CTCACTCTCTCTCTCCTCAGTCACCCCTCGCCACTACAAGCCGCTGCAGCAGGCAGATTTGGCTCTTTAGACTTACCCACAGGGTAGGCAAGGGAAAGTTTATGGACACT
CGTTTAAACTTGGGACTTATGGATGCAATAAATGCAAACCTTGAAGCTAAACTCAAAGAGCTACTATTCAAAATTAACTCTTTTGAGATTAAGATATGCTCAGATGCTAC
AAAAGAGTTCATAAAGTTATTAAAAGGGGAGATTGGATGTAAGCTGCTCCATTTATATGCAAAGACTTCTCCCAAGTGCTTAGAACTTTTAGATGCCTGGAAGCTCCAAA
GGGGGAAGGCTGGAATGCCTTATATATTTTCATTAGTTTCTGCCATTTTGAGTCATCCTGATGGAATTTACTGTCTTAATGACTTGGAGAGGATATCAACTAGTCGCGTT
CTTGACATGTTGGCCCGATCACTTTTTGAAGAATGCTTGGGAGACATAAATAGTGAATTAGGTTCCCAAGAGGTAAAACGCCAAAATGCAGCTTTGTTGTTGATGTCTTC
AATTGTTCGACGTGGTTCACGTTTGGCTTTTGAAGTTGCTAAGAACTTTGATTTTAAACTTCGTGGATTTTCCAAGCTAACGGAGTTTAGACAAAAGTCAAATCAGAAAG
GATCAAAACACTCATCAAGAAAGTTGTTTATTGGGTTTGCTATGTCATTTTTGGAGGTGGGGAAGCCTGAATTATTGAGGTGGATCTTGCAACAAAGGGAAATGTATTCT
GGTGTGCTTCGTGGACTTGCAAATGACGATGAAGAGACTATTATTTATGTTTTATCCACATTAAGGGACAAGGTCCTTGTTGATGAGTCATTGGTGCCTCCAGGTCTTCG
AAGTGTGCTTTTTGGAAGTGTTACTTTGGAACAATTGGCCACCATATGTGGAAGAGAGAATGGTGGTCCTGCTGTAGAGGCTGCTTACCAGGTTTTAACTATGGTTTGTA
CTGACCCTTGTAATGGGTTGATGCCAGATCTGAAAAGATTCCCAAATCCTTTGAAAGGCAATCCGAAGCGACTAATGGACCTAATGAAGAAGTTAAAAGCTACTGGAGTT
ATTTATCACAGAGACTTGCTTTTGGCAATTATCAGGGAGCAACCAACTTTCTGTTCAACATACTTGGAGGAATTTCCTTATAACCTTGAAGACTTTTTATCACCTACCTG
GTTTTCTATGGTTTCTTTGACAGTCAAGTTGGTATCTTCTGTGAGGAGTGGCTTGCCTATGGGATCTATTGATTCTCAATTAGATGATATTACTTCATTGGACAATACTT
ATATGAAAAGCATTATGAGGTGCCTCTCCTCTCGGCCGTTTAGTCGATCAGTAATCAACAAAGGTTTGCTTCACTCAAATATTCTTGTAAAGCATGGAACTCTACGACTT
CTACTTGAGGCATTGAAGTTGCTCAATTCTTTTTTTGGTGTTTTAAACAAAACATCATCCGTCAATAAGCAAATGATGTCACATTGGTTGTCTCTCAAGCAGGAATTGCA
GAATGAAGTCCAAACTTTGCTCCCTGATCCACAAGTGCTACTTACTCTACTTTCTTCATTGGCTAGCCAATCTAGAGTTCAAGCAGTTCGTTTGAAAAGGGCCTCTGGTC
TGGAGCGTAGCTTCCATGGTGTTAAAAAATTAAAAACAACTTCATCGGATCATGACACAGATATTGTTGTTGGTGGAGTTGTGTCAGCTCCAGATATTGATGAAAAAATG
GTGGATATATGTACAGTAGAGACATCAGAGAAGGAAAGGGAACTCATGATTTCTGTGGCTGAACTTTGGGATTTGGATCCATTGTCTACTATTGTCGAAGTGAATGATGT
AGAGATGTACTTCCTTTCGAAGTTGTTGGATGCTCTTACAATTTATTATCCCTTAGGGGAACTCTTAAGTTGTGGAACAAAGAGGTCTTCGGTGATATTAAGCTCTCTTT
CTTATTGCTTTGGCTTTGCGCGTCCGTTGTCTGATCGTGAAACAACGGACCTCGTGACTCTCCTTTCTTTGATTGGGGAAATCGCTTTTAGTACCACTAGGAAAGACATT
CGAGTCTGGAGTCCCGACCCTTCCCTCGGGTTTTCTTGTCGATTCTTCTTCCGATGCCTTTTGGACCCTGTTTCCTCTCAGCCGTCCATTTTCTCTTTATTGTGGAAGGT
GAAAGTTCCAAAGAAGGTGCAGTTTTTTATTTGGCAGGTGATCCATGGAAGAGTTAATACTCTTGATCGGCTGTCCAGAAAGATTCCTAGTCTATTTGGGCCTTTTTGTT
GCATTCTTTGTCGGATGGCGGAGGAAGACCTCGATCATATGTTATGGAACTGCGCTTTTGCTAGGACAGTGTGGGATGAGTTCTTTGGCTCGTTCGGGTTGCAGTTTGCC
AGACACAGAGGCCTCAGAGAGACGATCGAGGAGTTCCTTCCCCATCCTCCCTTTAGGGAGCAAGGAAAGTTTTTGTGGCAAGCTGGGATTTGTGCTATTATTTGGGGGTT
GTGGGGTGAGAGAAACAATAGAACTTTTAGAGGGTTAGAGAGGGATCCTATTGAGGTTTGGTCCCTTGTTAAATATAATGTTTCTCTTTGGGCGTCGGTGACGCGTTTAT
TTTGTAATTATTCCTTAGGTCTCATCATTTTGGATTGGAGTCCCTCCATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTCTCCTTCGTTTCTGAGAAAAATCGTGCCGCCGTGGGTTTGCGTCTGACGCTTGTCGCCGGAGAAGGAGAAGACCGAGAGAGGGAGAGATTCGTGTGGGTTTC
GTGTTGTCGCGCGACGTGGATCTTGCTGGTTCACGAAAGAGGGCTTGCCGGTTGGAGGAGAAAAAGAGATGACAATAAATCGAGAAATAAAAGGGAGGCTTGCGGGCTGG
TTCACGGTAAAGGAGAAAAACGCAGGCTTGCCGACGTGGAAGAGAAAAGAGATGAGGGAAGAGAGAGAAGAGATGGGGGGCTTGCTGGTGTGGAGGAGAGAGAAAAAAGA
AAAACCCTAAACCCTTCAATCTTCCTCTCACTCTCACGTGCTCCACCTTCTTCTCCTTCACGCCGGCTCCTCCCATTTCAGCAGCTCGGTGTCGCCGTCGCCGCCGTCGA
CGCCGTCTCGTCTCTCCCGTTCGGTGCTGCTGACGCCGTCATTTTTCCACCAGTTCCGCTTTCGCCTCCTTCCCAGCGCTCTCTCTTCATAGCCACATCACTGGACAACA
CTCACTCTCTCTCTCCTCAGTCACCCCTCGCCACTACAAGCCGCTGCAGCAGGCAGATTTGGCTCTTTAGACTTACCCACAGGGTAGGCAAGGGAAAGTTTATGGACACT
CGTTTAAACTTGGGACTTATGGATGCAATAAATGCAAACCTTGAAGCTAAACTCAAAGAGCTACTATTCAAAATTAACTCTTTTGAGATTAAGATATGCTCAGATGCTAC
AAAAGAGTTCATAAAGTTATTAAAAGGGGAGATTGGATGTAAGCTGCTCCATTTATATGCAAAGACTTCTCCCAAGTGCTTAGAACTTTTAGATGCCTGGAAGCTCCAAA
GGGGGAAGGCTGGAATGCCTTATATATTTTCATTAGTTTCTGCCATTTTGAGTCATCCTGATGGAATTTACTGTCTTAATGACTTGGAGAGGATATCAACTAGTCGCGTT
CTTGACATGTTGGCCCGATCACTTTTTGAAGAATGCTTGGGAGACATAAATAGTGAATTAGGTTCCCAAGAGGTAAAACGCCAAAATGCAGCTTTGTTGTTGATGTCTTC
AATTGTTCGACGTGGTTCACGTTTGGCTTTTGAAGTTGCTAAGAACTTTGATTTTAAACTTCGTGGATTTTCCAAGCTAACGGAGTTTAGACAAAAGTCAAATCAGAAAG
GATCAAAACACTCATCAAGAAAGTTGTTTATTGGGTTTGCTATGTCATTTTTGGAGGTGGGGAAGCCTGAATTATTGAGGTGGATCTTGCAACAAAGGGAAATGTATTCT
GGTGTGCTTCGTGGACTTGCAAATGACGATGAAGAGACTATTATTTATGTTTTATCCACATTAAGGGACAAGGTCCTTGTTGATGAGTCATTGGTGCCTCCAGGTCTTCG
AAGTGTGCTTTTTGGAAGTGTTACTTTGGAACAATTGGCCACCATATGTGGAAGAGAGAATGGTGGTCCTGCTGTAGAGGCTGCTTACCAGGTTTTAACTATGGTTTGTA
CTGACCCTTGTAATGGGTTGATGCCAGATCTGAAAAGATTCCCAAATCCTTTGAAAGGCAATCCGAAGCGACTAATGGACCTAATGAAGAAGTTAAAAGCTACTGGAGTT
ATTTATCACAGAGACTTGCTTTTGGCAATTATCAGGGAGCAACCAACTTTCTGTTCAACATACTTGGAGGAATTTCCTTATAACCTTGAAGACTTTTTATCACCTACCTG
GTTTTCTATGGTTTCTTTGACAGTCAAGTTGGTATCTTCTGTGAGGAGTGGCTTGCCTATGGGATCTATTGATTCTCAATTAGATGATATTACTTCATTGGACAATACTT
ATATGAAAAGCATTATGAGGTGCCTCTCCTCTCGGCCGTTTAGTCGATCAGTAATCAACAAAGGTTTGCTTCACTCAAATATTCTTGTAAAGCATGGAACTCTACGACTT
CTACTTGAGGCATTGAAGTTGCTCAATTCTTTTTTTGGTGTTTTAAACAAAACATCATCCGTCAATAAGCAAATGATGTCACATTGGTTGTCTCTCAAGCAGGAATTGCA
GAATGAAGTCCAAACTTTGCTCCCTGATCCACAAGTGCTACTTACTCTACTTTCTTCATTGGCTAGCCAATCTAGAGTTCAAGCAGTTCGTTTGAAAAGGGCCTCTGGTC
TGGAGCGTAGCTTCCATGGTGTTAAAAAATTAAAAACAACTTCATCGGATCATGACACAGATATTGTTGTTGGTGGAGTTGTGTCAGCTCCAGATATTGATGAAAAAATG
GTGGATATATGTACAGTAGAGACATCAGAGAAGGAAAGGGAACTCATGATTTCTGTGGCTGAACTTTGGGATTTGGATCCATTGTCTACTATTGTCGAAGTGAATGATGT
AGAGATGTACTTCCTTTCGAAGTTGTTGGATGCTCTTACAATTTATTATCCCTTAGGGGAACTCTTAAGTTGTGGAACAAAGAGGTCTTCGGTGATATTAAGCTCTCTTT
CTTATTGCTTTGGCTTTGCGCGTCCGTTGTCTGATCGTGAAACAACGGACCTCGTGACTCTCCTTTCTTTGATTGGGGAAATCGCTTTTAGTACCACTAGGAAAGACATT
CGAGTCTGGAGTCCCGACCCTTCCCTCGGGTTTTCTTGTCGATTCTTCTTCCGATGCCTTTTGGACCCTGTTTCCTCTCAGCCGTCCATTTTCTCTTTATTGTGGAAGGT
GAAAGTTCCAAAGAAGGTGCAGTTTTTTATTTGGCAGGTGATCCATGGAAGAGTTAATACTCTTGATCGGCTGTCCAGAAAGATTCCTAGTCTATTTGGGCCTTTTTGTT
GCATTCTTTGTCGGATGGCGGAGGAAGACCTCGATCATATGTTATGGAACTGCGCTTTTGCTAGGACAGTGTGGGATGAGTTCTTTGGCTCGTTCGGGTTGCAGTTTGCC
AGACACAGAGGCCTCAGAGAGACGATCGAGGAGTTCCTTCCCCATCCTCCCTTTAGGGAGCAAGGAAAGTTTTTGTGGCAAGCTGGGATTTGTGCTATTATTTGGGGGTT
GTGGGGTGAGAGAAACAATAGAACTTTTAGAGGGTTAGAGAGGGATCCTATTGAGGTTTGGTCCCTTGTTAAATATAATGTTTCTCTTTGGGCGTCGGTGACGCGTTTAT
TTTGTAATTATTCCTTAGGTCTCATCATTTTGGATTGGAGTCCCTCCATTTAA
Protein sequenceShow/hide protein sequence
MSFSFVSEKNRAAVGLRLTLVAGEGEDRERERFVWVSCCRATWILLVHERGLAGWRRKRDDNKSRNKREACGLVHGKGEKRRLADVEEKRDEGRERRDGGLAGVEEREKR
KTLNPSIFLSLSRAPPSSPSRRLLPFQQLGVAVAAVDAVSSLPFGAADAVIFPPVPLSPPSQRSLFIATSLDNTHSLSPQSPLATTSRCSRQIWLFRLTHRVGKGKFMDT
RLNLGLMDAINANLEAKLKELLFKINSFEIKICSDATKEFIKLLKGEIGCKLLHLYAKTSPKCLELLDAWKLQRGKAGMPYIFSLVSAILSHPDGIYCLNDLERISTSRV
LDMLARSLFEECLGDINSELGSQEVKRQNAALLLMSSIVRRGSRLAFEVAKNFDFKLRGFSKLTEFRQKSNQKGSKHSSRKLFIGFAMSFLEVGKPELLRWILQQREMYS
GVLRGLANDDEETIIYVLSTLRDKVLVDESLVPPGLRSVLFGSVTLEQLATICGRENGGPAVEAAYQVLTMVCTDPCNGLMPDLKRFPNPLKGNPKRLMDLMKKLKATGV
IYHRDLLLAIIREQPTFCSTYLEEFPYNLEDFLSPTWFSMVSLTVKLVSSVRSGLPMGSIDSQLDDITSLDNTYMKSIMRCLSSRPFSRSVINKGLLHSNILVKHGTLRL
LLEALKLLNSFFGVLNKTSSVNKQMMSHWLSLKQELQNEVQTLLPDPQVLLTLLSSLASQSRVQAVRLKRASGLERSFHGVKKLKTTSSDHDTDIVVGGVVSAPDIDEKM
VDICTVETSEKERELMISVAELWDLDPLSTIVEVNDVEMYFLSKLLDALTIYYPLGELLSCGTKRSSVILSSLSYCFGFARPLSDRETTDLVTLLSLIGEIAFSTTRKDI
RVWSPDPSLGFSCRFFFRCLLDPVSSQPSIFSLLWKVKVPKKVQFFIWQVIHGRVNTLDRLSRKIPSLFGPFCCILCRMAEEDLDHMLWNCAFARTVWDEFFGSFGLQFA
RHRGLRETIEEFLPHPPFREQGKFLWQAGICAIIWGLWGERNNRTFRGLERDPIEVWSLVKYNVSLWASVTRLFCNYSLGLIILDWSPSI