; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000842 (gene) of Chayote v1 genome

Gene IDSed0000842
OrganismSechium edule (Chayote v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG11:3604118..3610137
RNA-Seq ExpressionSed0000842
SyntenySed0000842
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]5.6e-8625.76Show/hide
Query:  DQEAQGVFELEDNDISA-YESELENVVAC---KLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLLFEKL
        +++  GV E    DI+A    E + V  C    L TE   + + F+  + ++WN+   V I  +G N ++  F +   R R+L+  PW +D  VL  +++
Subjt:  DQEAQGVFELEDNDISA-YESELENVVAC---KLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLLFEKL

Query:  TGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEG-----DQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWVRISYEK-
        TGD+  S ++     FWVR+HDVP    T ++AK +G  +G F+ WD          R  G +LR      +++   ++     G  A +   + + E  
Subjt:  TGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEG-----DQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWVRISYEK-

Query:  ------------------LPDFCKGC----GRIGHQANGCSYVESI----------------FWARSGTEKEEQCRVDLNQIN----------DMAEKGH
                            D   G     G +     G S V+++                F      E E        QI           +MA+  +
Subjt:  ------------------LPDFCKGC----GRIGHQANGCSYVESI----------------FWARSGTEKEEQCRVDLNQIN----------DMAEKGH

Query:  V----VTFKKEKVIEK---------------GKGKVISEGSN--------------DFLFGLFDIHMKAHQHLDNGGFRGPAKRVVFKELDLMPNYNKGS
        V    V  K E    K                +GK I+E                 DF+  LFD+H+ A   LD G  R   K+  + E+          
Subjt:  V----VTFKKEKVIEK---------------GKGKVISEGSN--------------DFLFGLFDIHMKAHQHLDNGGFRGPAKRVVFKELDLMPNYNKGS

Query:  VQLNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIG----LSGGLLLFWSSSL
            L   +  A     P  M  +CWN RGL  PRA+R L+       P ++F+ E+K   + + +LK  L FDC   V   G     +GGL LFW ++L
Subjt:  VQLNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIG----LSGGLLLFWSSSL

Query:  DVKVVSYSQGHIDILIND--FFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGF
        D+ + S+S  HID++++D     +WR TG +G P +  +  +W+LL+ LA  S+LPW+  GDFNEI+   E+  G+ ++   M+ FR+A N C   D+GF
Subjt:  DVKVVSYSQGHIDILIND--FFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGF

Query:  SGDRFTWIKNKKSISATRVRLDRYFANPLFISK---SEGSMIFNYS---------------------KKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNS
         G  FTW   ++     + RLDR FA   ++ K   +E + I ++S                     +++ RFE++W     CK+V+   W+        
Subjt:  SGDRFTWIKNKKSISATRVRLDRYFANPLFISK---SEGSMIFNYS---------------------KKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNS

Query:  VCNLIFRIKGSLFEMANWNKDRLKGSIRQAIL--NKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQRSLR------WDWKNRSVAELMARD
        + ++   +                 S  +A L  N   +   +M     ++      + N  +  + E+        LR       D     VA+L+  +
Subjt:  VCNLIFRIKGSLFEMANWNKDRLKGSIRQAIL--NKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQRSLR------WDWKNRSVAELMARD

Query:  G-GWNKELIYNIFCKDDANIIVNTPLSKFGTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSDSKFTSNCWKKFWKINISSKIKLGAWSVLSNAIP
           W ++LI + F   +A II++ PLS     D+ +W   K   +SVKSAYH+     +    STS +K     W + WK+ ++ KI++  W +  NA+P
Subjt:  G-GWNKELIYNIFCKDDANIIVNTPLSKFGTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSDSKFTSNCWKKFWKINISSKIKLGAWSVLSNAIP

Query:  TKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEELFKGIVVIWSLWNFRNQGLNEDNSF
        T  N+  RGV I   CP CR   E  +H    C  A   W L+      F   L + +  ID+     +    E L     V W++WN RN  L      
Subjt:  TKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEELFKGIVVIWSLWNFRNQGLNEDNSF

Query:  EFHKLQAMMENAINAFNLLNAGPLKKPFRVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIKCL------EAIA
           +    + + +  ++L +   L     V +   KW  P     K+NVDA+ R S  E  +G V+RD  G +             I+CL      EA+A
Subjt:  EFHKLQAMMENAINAFNLLNAGPLKKPFRVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIKCL------EAIA

Query:  LREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVADVLAKQSDMDYFCWRDPILSFEDRPRFW
        +R GL+ +++L    +  + VE+D     K+  +D  D S    I+ D   L        +    R  N VA  +AK   +   C           P  W
Subjt:  LREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVADVLAKQSDMDYFCWRDPILSFEDRPRFW

Query:  RGNLPPNVGSMVEKD
          N P  V S+   D
Subjt:  RGNLPPNVGSMVEKD

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]7.8e-7227.64Show/hide
Query:  MKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILINDFFGR--
        MK L WN +GL NP  IR L + V K  P  VF+ E++  +  ++K+K+SL F   + V ++G SGGL L W   LD+ + SYS+ HID+++     R  
Subjt:  MKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILINDFFGR--

Query:  WRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRY
        WR TG YG+P  +K+ ++W L++ L+   ++PW+  GDFNEI   EE+    E+    M  F+EAI +C LI LGF G+ FTW   +      R RLDR 
Subjt:  WRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRY

Query:  FA-----------------------NPLFISKSEGSMIFNYSKKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKG
         A                       +PL ++  + +      K+  RFE  W     C E++   W        + C+                      
Subjt:  FA-----------------------NPLFISKSEGSMIFNYSKKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKG

Query:  SIRQAILNKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQRSLRWDWKNRSVAELMARDGGWNKELIYNIFCKDDANIIVNTPLSKFGTNDE
         +   +L ++  VSLL              I+ D             Q++                   WN  L+  +F   +A +I + PLS+    D+
Subjt:  SIRQAILNKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQRSLRWDWKNRSVAELMARDGGWNKELIYNIFCKDDANIIVNTPLSKFGTNDE

Query:  VMWNDCKKGIFSVKSAYHVGINLASKEKESTS-------DSKFTSNCWKKFWKINISSKIKLGAWSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNM
         +W+   KG FSV+SAYH+   L  +E  ++S       +   +   W + W++ I  K+K+  W V  N +P ++N+  R + +  +C +C +  E+ +
Subjt:  VMWNDCKKGIFSVKSAYHVGINLASKEKESTS-------DSKFTSNCWKKFWKINISSKIKLGAWSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNM

Query:  HIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEE-LFKGIVVIWSLWNFRNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPLKK
        H++K+C  A + W L+       +    + D  +  W+    K   EE L    ++ WS+W  RN+       + F  ++    N +   N L A     
Subjt:  HIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEE-LFKGIVVIWSLWNFRNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPLKK

Query:  PFRVDNENI----KWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIKCLEAIALREGLIHILELQNLLLPPILVESDASEI
          R   E+I     W+APP  ++K+N+D ++       G+G V+RD  G +  A  +      S   +EAIA REGL   LEL    +  +++ESD+   
Subjt:  PFRVDNENI----KWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIKCLEAIALREGLIHILELQNLLLPPILVESDASEI

Query:  IKAIANDSLDISEFSNIIVD
        I+A+ +   + S F N+++D
Subjt:  IKAIANDSLDISEFSNIIVD

KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis]3.0e-7126.07Show/hide
Query:  MMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLL
        M  NL L ++E   +  +++N  S  + + E  +  K+ +++    +V +    KIW + + V++  V  NTF+  F     +DRV    PW +D  + +
Subjt:  MMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLL

Query:  FEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWVRISYEKLP
             G   +SEM+F  A+FWV+ H++P     +     LG+ +G+    + + D    G+ LR+K+ +D+R PL R   +    +  + W  + YEK+P
Subjt:  FEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWVRISYEKLP

Query:  DFCKGCGRIGHQANGCSYVESI-----FWARS--------------GTEK--EEQCRVDLNQIND---------MAEKGHVVTFK---------------
         FC  CGRI H + GC   + +      W R+               T+K    +  ++ N + D         M   G+V++                 
Subjt:  DFCKGCGRIGHQANGCSYVESI-----FWARS--------------GTEK--EEQCRVDLNQIND---------MAEKGHVVTFK---------------

Query:  ----KEKVI-----EKGKGKVISEGS---NDFLFGLFDIHMKAHQHLDNG---GFRGPA-KRVVFKELDLMPN---------------------------
             +K +     E+G  +V++E     +D   G  +IH         G   G  GP   +    E  L+P                            
Subjt:  ----KEKVI-----EKGKGKVISEGS---NDFLFGLFDIHMKAHQHLDNG---GFRGPA-KRVVFKELDLMPN---------------------------

Query:  ----------YNKGSVQ------LNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCV
                    K +V+      L+L      A     P  M  + WN+RGL NP  ++AL   V    P I+F+ E+K    V++++K  L F  CL V
Subjt:  ----------YNKGSVQ------LNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCV

Query:  PAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI---NDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVG
           G SGG+ LFW++   V++ ++S+ HI   +    +    W  TGFYGN   SKR +SWNLL+ L   S+  W++ GDFNEIL   E+  G ++++  
Subjt:  PAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI---NDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVG

Query:  MEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFAN-----------------------PLFISKSEGSMIFNYSKKIIRFEDSWAKFNGC
        M+ FR+ I+EC L DLGF+G+ FTW   ++       RLDR+ +N                       P+ +  + GS +    KK+ RFE  W     C
Subjt:  MEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFAN-----------------------PLFISKSEGSMIFNYSKKIIRFEDSWAKFNGC

Query:  KEVLRKNWEMGGTLGNSVCNLIFR-IKGSLFEMANWNKDRLKGSIRQAILNKENSVSLL-MANSGGIRSDEIINIENDIERLLEEEEIFWKQRS
        K+V++  W   G  G    +++ R I+    ++  WNK    G +++ +   ++ + L+  A+       ++    N++++ L   EI WKQRS
Subjt:  KEVLRKNWEMGGTLGNSVCNLIFR-IKGSLFEMANWNKDRLKGSIRQAILNKENSVSLL-MANSGGIRSDEIINIENDIERLLEEEEIFWKQRS

PNY16372.1 ribonuclease H, partial [Trifolium pratense]1.8e-6827.61Show/hide
Query:  MKNLELVDQEAQGV-FELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLL
        +  L L ++E  G  FE E+ + +  + +L   +  + L +K    +  K  M  +W     V I+      F+  F + +  + VL  SPWI+D  +L+
Subjt:  MKNLELVDQEAQGV-FELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLL

Query:  FEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKW--VRISYEK
         E++    +I ++     + WV++H++P     E     L N +G+F+ +D   +     +F+RI+V ID+RLPL++  KVK+      KW  V I YEK
Subjt:  FEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKW--VRISYEK

Query:  LPDFCKGCGRIGHQANGCSYVESIFWARSGTEKEEQCRVDLNQINDMAEKGHVVTFKKEKVIEKGKGKVISE------GSNDFLFGLFDIHMKAHQHLDN
        L  FC  CG +GH                    E +C+V     ND   +G     + E    +  G+V S       GS +   G         Q   +
Subjt:  LPDFCKGCGRIGHQANGCSYVESIFWARSGTEKEEQCRVDLNQINDMAEKGHVVTFKKEKVIEKGKGKVISE------GSNDFLFGLFDIHMKAHQHLDN

Query:  GGFRGPAKRVVFKELDLMPNYNKGSVQLNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDC
         G  GPA    + ++   P                       P  MK L WN RGL  P AI  LR+      P I+F+ E+      ++++++ L +  
Subjt:  GGFRGPAKRVVFKELDLMPNYNKGSVQLNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDC

Query:  CLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILIND-FFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNK
        CL V   G SGGL + W  + + ++++YS+  I+++++D   G WR T +YG P +S+RR +W+LL+ L   S+LPW + GDFN++L  E++        
Subjt:  CLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILIND-FFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNK

Query:  VGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANP-------------LFISKSEGSMIF-----------NYSKKIIRFEDSWAKF
             FR A+++C LID+   G  FTW+K++ +      RLDR   N              L  S S+ S I            NYS    RFE+ W K 
Subjt:  VGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANP-------------LFISKSEGSMIF-----------NYSKKIIRFEDSWAKF

Query:  NGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQRS-LRW
        +  +EV+++ W +GG  G  + +   RI     ++  W + R +   +Q I+     +  L  N   + S     +E    +LL +EE +W+QR+ + W
Subjt:  NGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQRS-LRW

XP_018826186.1 uncharacterized protein LOC108995141 [Juglans regia]9.9e-7532.01Show/hide
Query:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW
        MD+ +  +  K L L++ E   + EL+++  +    + +  V  K+ + +  S +V    M K+W I +      V  NTF   F N   + RV E  PW
Subjt:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW

Query:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV
        ++DN VL+ ++  G K +  + F    FW+R+H++P +C T+   + +G  +GK    D + D    G FLR++V +D+  P+ R   +      N  W 
Subjt:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV

Query:  RISYEKLPDFCKGCGRIGHQANGCSYVE------SIFWARSGTEKEE-----QCRVDLNQINDMAEKGHVVTFKKEKVIEKGK-GKVISEGSNDFLFGLF
          SYEKLP  C  CG + H  NGCS  +         W R   +KE        +   N +    ++G   T KKEK  E+ + GK + E + D      
Subjt:  RISYEKLPDFCKGCGRIGHQANGCSYVE------SIFWARSGTEKEE-----QCRVDLNQINDMAEKGHVVTFKKEKVIEKGK-GKVISEGSNDFLFGLF

Query:  DIHMKAHQHLDNGGFRGPAKRVVFKELD-------LMPNYNKGSVQLN---------LSCINDKAKTVVS-----PDAMKTLCWNARGLKNPRAIRALRH
         +  +  +  D       ++ +V K L+        + +  KG + +                K K VVS     P+ MK L WN RGL NPR ++ L +
Subjt:  DIHMKAHQHLDNGGFRGPAKRVVFKELD-------LMPNYNKGSVQLN---------LSCINDKAKTVVS-----PDAMKTLCWNARGLKNPRAIRALRH

Query:  EVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI-NDFFGRWRFTGFYGNPVQSKRRDSWNLLK
         V K  P +VFV E+K  +   D L+  L  + C  V A+G  GGL+L W+S +  ++V+YSQ HI++ I  +   +W  T FYG P  +KR++SW+LL 
Subjt:  EVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI-NDFFGRWRFTGFYGNPVQSKRRDSWNLLK

Query:  RLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANP
         L   +N+ W + GDFNEIL  +E+  G  R +  ME FRE +NE +L DLG+ GD+FTW  +    + T+ RLDR  A P
Subjt:  RLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANP

TrEMBL top hitse value%identityAlignment
A0A2I4F3F9 uncharacterized protein LOC1089951414.8e-7532.01Show/hide
Query:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW
        MD+ +  +  K L L++ E   + EL+++  +    + +  V  K+ + +  S +V    M K+W I +      V  NTF   F N   + RV E  PW
Subjt:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW

Query:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV
        ++DN VL+ ++  G K +  + F    FW+R+H++P +C T+   + +G  +GK    D + D    G FLR++V +D+  P+ R   +      N  W 
Subjt:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV

Query:  RISYEKLPDFCKGCGRIGHQANGCSYVE------SIFWARSGTEKEE-----QCRVDLNQINDMAEKGHVVTFKKEKVIEKGK-GKVISEGSNDFLFGLF
          SYEKLP  C  CG + H  NGCS  +         W R   +KE        +   N +    ++G   T KKEK  E+ + GK + E + D      
Subjt:  RISYEKLPDFCKGCGRIGHQANGCSYVE------SIFWARSGTEKEE-----QCRVDLNQINDMAEKGHVVTFKKEKVIEKGK-GKVISEGSNDFLFGLF

Query:  DIHMKAHQHLDNGGFRGPAKRVVFKELD-------LMPNYNKGSVQLN---------LSCINDKAKTVVS-----PDAMKTLCWNARGLKNPRAIRALRH
         +  +  +  D       ++ +V K L+        + +  KG + +                K K VVS     P+ MK L WN RGL NPR ++ L +
Subjt:  DIHMKAHQHLDNGGFRGPAKRVVFKELD-------LMPNYNKGSVQLN---------LSCINDKAKTVVS-----PDAMKTLCWNARGLKNPRAIRALRH

Query:  EVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI-NDFFGRWRFTGFYGNPVQSKRRDSWNLLK
         V K  P +VFV E+K  +   D L+  L  + C  V A+G  GGL+L W+S +  ++V+YSQ HI++ I  +   +W  T FYG P  +KR++SW+LL 
Subjt:  EVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI-NDFFGRWRFTGFYGNPVQSKRRDSWNLLK

Query:  RLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANP
         L   +N+ W + GDFNEIL  +E+  G  R +  ME FRE +NE +L DLG+ GD+FTW  +    + T+ RLDR  A P
Subjt:  RLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANP

A0A2N9EEA9 CCHC-type domain-containing protein5.6e-7627.33Show/hide
Query:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW
        MD+       +   L + EA G F++E++ +          +  KL+T K+ +    K  M ++W   + +   S+  N FV  F NA  R RVL+ SPW
Subjt:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW

Query:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV
        ++DN +L+   + G    S+++   + FWV+LH +P    T  + + +G++ G+ +  D        G  LRI+V +DI  P  R   V   ++  + WV
Subjt:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV

Query:  RISYEKLPDFCKGCGRIGHQANGC------------SYVESIFWAR------------SGTEK---EEQCRVD-------------LNQINDMAEKGHVV
           YE+LP  C  CG IGH    C            S+ +   W R            +GT +    E  RVD             L Q++D+ E     
Subjt:  RISYEKLPDFCKGCGRIGHQANGC------------SYVESIFWAR------------SGTEK---EEQCRVD-------------LNQINDMAEKGHVV

Query:  TFKKEK----VIEKG------------------KGKVISEGSNDFL---------FGLFD-----IHMKAH----------------------------Q
            +K    V+E G                   G  +S  +   L          GL       +H + H                            +
Subjt:  TFKKEK----VIEKG------------------KGKVISEGSNDFL---------FGLFD-----IHMKAH----------------------------Q

Query:  HLDNGGFRGPAKRV------VFKELDLMPNYNKG-----------SVQLNLSCINDKAKTVVSPDA-MKTLCWNARGLKNPRAIRALRHEVLKVTPQIVF
        +   G   G A+ V        K+    P  NK            S  +N   +    +   SP A M+ L WN RGL NP+ +R L   + +  P I+F
Subjt:  HLDNGGFRGPAKRV------VFKELDLMPNYNKG-----------SVQLNLSCINDKAKTVVSPDA-MKTLCWNARGLKNPRAIRALRHEVLKVTPQIVF

Query:  VCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHID--ILINDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPW
        + E++     V+ L++S  F+   CVP +   GGL + WS  +D+++ S+S+ HID  ++I D    +R TGFYGNP   KR+++W LLK L+   +LPW
Subjt:  VCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHID--ILINDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPW

Query:  MLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANPLFISKSEGSMIFNYSK-------------
        +  GDFNE+L   E    + R    + DFREA+    L DLGF G  FTWIK +K  +    RLDR  A+  +I+  EG+ + + +K             
Subjt:  MLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANPLFISKSEGSMIFNYSK-------------

Query:  ---------KIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIEND
                 ++ RFE  W K    K+V+++ W +    G+ +  +  ++K     +  W+K+R  G +   I  K   +  L   +    S EI+ ++++
Subjt:  ---------KIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIEND

Query:  IERLLEEEEIFWKQRS
        I  LLE+EEIFW+QRS
Subjt:  IERLLEEEEIFWKQRS

A0A2N9GJ35 Uncharacterized protein1.3e-7728.71Show/hide
Query:  QFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDN
        +  ++ + ++L ++E   +  L  + I   + E ++ +  KLLT K    + FK  +  +W+    V I S+ GN F+  F      +R+   SPW +D 
Subjt:  QFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDN

Query:  AVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAM--ANEKWVRI
         ++   +  GD + +E+ F   AFW+R+ ++P         + +G  +G+ +  D   +    G++LRI+V IDI  PL R   ++S         WV  
Subjt:  AVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAM--ANEKWVRI

Query:  SYEKLPDFCKGCGRIGHQANGC------------SYVESIFWARSGTEKEEQCRVDLNQINDMAEKGHV-VTFKKEKVIEKGKGKVISEG-----SNDFL
         YE LP FC  CGR+GH ++ C            S  +   W R+   +  Q R     +    E+G   + F +E   E      +S G        +L
Subjt:  SYEKLPDFCKGCGRIGHQANGC------------SYVESIFWARSGTEKEEQCRVDLNQINDMAEKGHV-VTFKKEKVIEKGKGKVISEG-----SNDFL

Query:  FGLFDIHMKAHQHLDNGGFRGPAKRVVFKELDLMPNYNKGSVQLNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKC
          L +  M+   H++      PA                        C + K      P  M+ L  N RGL NP+ +  L + V K  P IVF+ E++ 
Subjt:  FGLFDIHMKAHQHLDNGGFRGPAKRVVFKELDLMPNYNKGSVQLNLSCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKC

Query:  GENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHID--ILINDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDF
            ++ L++ L    CL V   G  GGL L W SS+ + + SYS+ HID  ++ ND   RWR TGFYG P    R  SW+LL+ L   S++PWM+ GDF
Subjt:  GENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHID--ILINDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDF

Query:  NEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFA-------------NPLFISKSE-------------GS
        NEI   EE+    +RN   M  FREA+ +CSL D+GF+G  FTW  N+++    RVRLDR  A             N L ++ S+              +
Subjt:  NEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFA-------------NPLFISKSE-------------GS

Query:  MIFNYSKKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIENDIE
         +    +++ RFE SW K +GC+EV++  WE+   +G ++  +  +IK    ++  W++  ++ + +      +    L +       S +I  I+ D+ 
Subjt:  MIFNYSKKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIENDIE

Query:  RLLEEEEIFWKQRS
         L E+ EI W+QRS
Subjt:  RLLEEEEIFWKQRS

A0A2N9HND5 CCHC-type domain-containing protein2.1e-7527.21Show/hide
Query:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW
        MD+       +   L + EA G F++E++ +          +  KL+T K+ +    K  M ++W   + +   S+  N FV  F NA  R RVL+ SPW
Subjt:  MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPW

Query:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV
        ++DN +L+   + G    S+++   + FWV+LH +P    T  + + +G++ G+ +  D        G  LRI+V +DI  P  R   V   ++  + WV
Subjt:  IYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWV

Query:  RISYEKLPDFCKGCGRIGHQANGC------------SYVESIFWAR------------SGTEK---EEQCRVD-------------LNQINDMAEKGHVV
           YE+LP  C  CG IGH    C            S+ +   W R            +GT +    E  RVD             L Q++D+ E     
Subjt:  RISYEKLPDFCKGCGRIGHQANGC------------SYVESIFWAR------------SGTEK---EEQCRVD-------------LNQINDMAEKGHVV

Query:  TFKKEK----VIEKG------------------KGKVISEGSNDFL---------FGLFD-----IHMKAH----------------------------Q
            +K    V+E G                   G  +S  +   L          GL       +H + H                            +
Subjt:  TFKKEK----VIEKG------------------KGKVISEGSNDFL---------FGLFD-----IHMKAH----------------------------Q

Query:  HLDNGGFRGPAKRV------VFKELDLMPNYNKG-----------SVQLNLSCINDKAKTVVSPDA-MKTLCWNARGLKNPRAIRALRHEVLKVTPQIVF
        +   G   G A+ V        K+    P  NK            S  +N   +    +   SP A M+ L WN RGL +P+ +R L   + +  P I+F
Subjt:  HLDNGGFRGPAKRV------VFKELDLMPNYNKG-----------SVQLNLSCINDKAKTVVSPDA-MKTLCWNARGLKNPRAIRALRHEVLKVTPQIVF

Query:  VCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHID--ILINDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPW
        + E++     V+ L++S  F+   CVP +   GGL + WS  +D+++ S+S+ HID  ++I D    +R TGFYGNP   KR+++W LLK L+   +LPW
Subjt:  VCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHID--ILINDFFGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPW

Query:  MLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANPLFISKSEGSMIFNYSK-------------
        +  GDFNE+L   E    + R    + DFREA+    L DLGF G  FTWIK +K  +    RLDR  A+  +I+  EG+ + + +K             
Subjt:  MLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANPLFISKSEGSMIFNYSK-------------

Query:  ---------KIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIEND
                 ++ RFE  W K    K+V+++ W +    G+ +  +  ++K     +  W+K+R  G +   I  K   +  L   +    S EI+ ++++
Subjt:  ---------KIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIEND

Query:  IERLLEEEEIFWKQRS
        I  LLE+EEIFW+QRS
Subjt:  IERLLEEEEIFWKQRS

A0A803QI56 Uncharacterized protein2.2e-9624.64Show/hide
Query:  NLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLLFEK
        N+ + D+E   +  + D D    E +    +  + LT K       +  M  +W   + + ++ +  N F+  F + V  DRV+E SPW +D   L+FE+
Subjt:  NLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLLFEK

Query:  LTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKV-KSGAMANEKWVRISYEKLPDF
        L   +    +      FWV++H++      E   + LGN +G FI  D          +LR++V I+I  PL+R  KV K G +    +    YE L  F
Subjt:  LTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKV-KSGAMANEKWVRISYEKLPDF

Query:  CKGCGRIGHQANGC--------------------------SYVESIFWARSG--------------TEKEEQCRVDLNQ-INDMAEK--------GHVVT
        C   G +GH    C                          +Y     W RSG              T  E    ++ NQ IN+   +         ++ +
Subjt:  CKGCGRIGHQANGC--------------------------SYVESIFWARSG--------------TEKEEQCRVDLNQ-INDMAEK--------GHVVT

Query:  FKKEKVIEKGKGKVISEGSNDFLFGLFDIHMKAHQHLDNGGFRGPAKRV-----------------VFKELDLMPNY-NKGSVQLNLSCINDKAKTVVSP
           + + E+G   +++        G   I   + +    G     +  +                 +   L L  N  N   V++      D       P
Subjt:  FKKEKVIEKGKGKVISEGSNDFLFGLFDIHMKAHQHLDNGGFRGPAKRV-----------------VFKELDLMPNY-NKGSVQLNLSCINDKAKTVVSP

Query:  DAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI-NDFFG
          M  L WN RGL N RAI+ L+  V +  P  +F+CE+KC +  ++ +   L F+    V A G SGGL L W +  D  V+ YS  HID L+ +   G
Subjt:  DAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILI-NDFFG

Query:  RWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDR
         W+ TGFYG P ++ R  SW LL+ LA  S LPW + GD N I+  E++  G    +  ++ F++A+N+C+LID+   G  FTW K + S +   VRLDR
Subjt:  RWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDR

Query:  YFANPLFISKSEGSMIFNY---------------------SKKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGS
           N  +    +G+ + N                      + +  +FE++W K   C E++R  WE  G       NL  ++     +++ W K+ + G+
Subjt:  YFANPLFISKSEGSMIFNY---------------------SKKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGS

Query:  IRQAILNKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQR-------------SLRW-------------DW----------------KNRS
         +  I   +N +  L              ++ ++  +L++ E FWKQR             S RW              W                    
Subjt:  IRQAILNKENSVSLLMANSGGIRSDEIINIENDIERLLEEEEIFWKQR-------------SLRW-------------DW----------------KNRS

Query:  VAELMARDGG-WNKELIYNIFCKDDANIIVNTPLSKFGTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSD-SKFTSNCWKKFWKINISSKIKLGA
        V  LM  +GG W+ +++ ++F + D  +I++ PL+    ND++ W+    GI+SVKS Y++   +  +   +  D +KF    W KFWK  I  K+K   
Subjt:  VAELMARDGG-WNKELIYNIFCKDDANIIVNTPLSKFGTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSD-SKFTSNCWKKFWKINISSKIKLGA

Query:  WSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEELFKGIVVIWSLWNFRN
        W      +PT   + ++ VD+   CP+C    ES +H +  C    + W+      T      +N    +D+++        E+     V+ W++W+ RN
Subjt:  WSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEELFKGIVVIWSLWNFRN

Query:  QGLNEDNSFEFHKLQAMMENAINAFNLLNAGPLKKPF---RVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIK
          + +  S     +  +    +  +       ++  +   + D+    W  P +   K+NVDA++  S  + G G V RD  G +     +    ++S +
Subjt:  QGLNEDNSFEFHKLQAMMENAINAFNLLNAGPLKKPF---RVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIK

Query:  CLEAIALREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVADVLAKQS
          E+I +RE L  I   +      + +E+D   +++AI ++   IS F  II + K+L+  L  + +    R  NMVA   A+ S
Subjt:  CLEAIALREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVADVLAKQS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.0e-1825.57Show/hide
Query:  GTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSDSKFTSNCWKKFWKINISSKIKLGAWSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHI
        G  D + W   + G FSV+SAY +   L   E    + + F  NC    WK+ +  ++K   W V + A+ T+   + R +  + +C +C+   ES +H+
Subjt:  GTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSDSKFTSNCWKKFWKINISSKIKLGAWSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHI

Query:  IKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMG---LCDKVPKEELFKGIVVIWSLWNFRNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPL--
        ++DC   +  W    P       F K++ + +   +G    C+ +P   +F   V+IW  W +R   +  +N+    +++ + E A+  +   +   L  
Subjt:  IKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMG---LCDKVPKEELFKGIVVIWSLWNFRNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPL--

Query:  -KKPFRVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIKCLEAIALREGLIHILELQNLLLPPILVESDASEII
          +P RV+   I WV+P +G  K+N D + R + G    G V+RD  G     G+  N    S    E   +  GL    E +   +P + +E D SE+I
Subjt:  -KKPFRVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIKCLEAIALREGLIHILELQNLLLPPILVESDASEII

Query:  KAIANDSLDISEFSNIIVDIKH-LMALLGIVKVAHCNRLCNMVADVLA
               +  S   + +V + H  +    +V++ H  R  N +AD LA
Subjt:  KAIANDSLDISEFSNIIVDIKH-LMALLGIVKVAHCNRLCNMVADVLA

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein7.2e-0730.39Show/hide
Query:  QSKRRDSWNLLKRLAGCS---NLPWMLGGDFNEILLFEE--QLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANPLF
        +++RR  W+ + RL+  S   N PW++ GDFN+I    E   L  S  +  G+ED +  + +  L+DL   G  +TW  +++     R +LDR   N  +
Subjt:  QSKRRDSWNLLKRLAGCS---NLPWMLGGDFNEILLFEE--QLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANPLF

Query:  IS
        ++
Subjt:  IS

AT3G25270.1 Ribonuclease H-like superfamily protein1.7e-1121.71Show/hide
Query:  KFWKINISSKIKLGAWSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEEL
        K WK+  + KIK   W +LS A+ T  N+  R +  +  C  C +  E++ H+  DC  A + W  +       +     ++  ++  +  C    + +L
Subjt:  KFWKINISSKIKLGAWSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEEL

Query:  FK-GIVVIWSLWNFRNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPLKKPFRVDNE--------NIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIR
        F   I ++W LW  RNQ + +  S  +        N +  +   N        +V +           KW  PP    K N D +    +     GW++R
Subjt:  FK-GIVVIWSLWNFRNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPLKKPFRVDNE--------NIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIR

Query:  DWKGQVKMAGWRANFQNMSIKCLEAIALREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVAD
        D  G    +G             E  AL   +I +    +     ++ E D+ ++ + + N+ L+   F N I + +               R  N  AD
Subjt:  DWKGQVKMAGWRANFQNMSIKCLEAIALREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVAD

Query:  VLAK
        +LAK
Subjt:  VLAK

AT3G42140.1 zinc ion binding;nucleic acid binding3.6e-0620.75Show/hide
Query:  NIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQR
        N+D++V+   +  +     F +      +L   PW +++ + + ++ T  K  S+ EF    FW+++  +P    T      +G  MG F+       + 
Subjt:  NIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLLFEKLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQR

Query:  NRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWVRISYEKLPDFCKGCGRIGHQANGC
        N G+ + +                          ++  YEKL +FC  CG + H A+ C
Subjt:  NRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWVRISYEKLPDFCKGCGRIGHQANGC

AT4G29090.1 Ribonuclease H-like superfamily protein7.7e-2525.13Show/hide
Query:  VAELMARDG-GWNKELIYNIFCKDDANIIVNTPLSKFGTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSDSKFTSN-CWKKFWKINISSKIKLGA
        V++L+   G  W K++I  +F + +  +I           D   W+    G ++VKS Y V   + +K       S+ + N  ++K WK   S KI+   
Subjt:  VAELMARDG-GWNKELIYNIFCKDDANIIVNTPLSKFGTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSDSKFTSN-CWKKFWKINISSKIKLGA

Query:  WSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEELFKGIV--VIWSLWNF
        W  LSN++P    +  R +     C  C  CKE+  H++  C  A   W ++        ++  +I   + +   L +  P+ E    +V  ++W LW  
Subjt:  WSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEELFKGIV--VIWSLWNF

Query:  RNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPL--KKPFRVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSI
        RN+ +     F   ++    E+ +  + +         KP    +   +W  PP    K N DA+    +   GIGWV+R+ KG+VK  G RA  +  S+
Subjt:  RNQGLNEDNSFEFHKLQAMMENAINAFNLLNAGPL--KKPFRVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSI

Query:  KCLEAIALREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVADVLAKQS
           E  A+R     +L L       ++ ESD+  +I+ + ND +  S     I D++ L++    VK     R  N +A+ +A++S
Subjt:  KCLEAIALREGLIHILELQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVADVLAKQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCTTTTCAGTTTGCCCAAATGATGAAGAATTTGGAGCTAGTGGATCAGGAAGCTCAAGGAGTATTTGAACTTGAAGACAACGATATTTCAGCGTATGAAAGTGA
GCTCGAAAACGTGGTTGCCTGCAAATTACTTACTGAGAAACATTCTAGTCCAGATGTGTTCAAAAAATTAATGCCCAAAATCTGGAACATAGATCAGAAGGTGTTGATTG
AATCAGTGGGAGGTAACACATTTGTTTGTACCTTCGTTAATGCTGTTGTTAGAGATAGAGTTCTTGAGGCGAGTCCATGGATTTATGATAACGCGGTGTTGCTGTTTGAA
AAGCTGACTGGAGATAAAAGGATTTCGGAGATGGAGTTCGGATTTGCAGCATTTTGGGTTCGTCTTCACGACGTTCCACCAGCCTGCCATACAGAAAATAGTGCAAAAGG
CCTAGGTAATTTGATGGGAAAATTTATTATCTGGGATTATGAGGGGGATCAACGAAATAGGGGAAAATTTCTAAGGATCAAGGTGTATATTGACATTCGTTTACCTTTGA
GAAGAGCTTTAAAGGTAAAAAGTGGAGCAATGGCAAATGAAAAATGGGTCCGAATTTCATATGAAAAGTTACCTGATTTTTGCAAAGGGTGTGGAAGAATAGGCCATCAG
GCAAATGGATGCAGCTATGTTGAGTCCATTTTTTGGGCTAGGTCAGGTACGGAGAAAGAAGAGCAGTGCAGAGTTGATTTAAATCAAATTAATGATATGGCAGAAAAGGG
ACATGTGGTGACATTTAAGAAGGAAAAAGTAATTGAAAAAGGGAAAGGTAAGGTCATCTCAGAAGGTTCAAATGATTTTCTTTTTGGGCTATTTGATATTCATATGAAAG
CCCATCAGCATTTGGATAATGGGGGTTTTCGTGGACCAGCCAAAAGAGTGGTTTTTAAAGAATTGGATTTAATGCCTAATTATAATAAGGGAAGTGTGCAGTTAAATCTT
TCATGTATTAATGATAAAGCAAAAACTGTTGTCTCGCCGGATGCAATGAAAACTTTATGCTGGAACGCTCGGGGGTTGAAAAACCCCCGAGCCATTAGAGCGCTTCGGCA
TGAAGTGCTTAAAGTGACACCTCAAATTGTTTTTGTGTGCGAATCAAAATGCGGTGAGAATGTGGTGGATAAGTTAAAAATTAGCTTGAATTTTGATTGTTGTTTGTGTG
TTCCTGCTATTGGGTTGAGCGGGGGCTTATTGTTATTTTGGAGTTCGAGTTTAGATGTGAAAGTTGTCTCATACTCTCAGGGTCATATTGATATCTTAATCAATGACTTT
TTTGGGAGATGGAGATTTACGGGTTTTTATGGAAATCCAGTTCAAAGCAAGCGTAGAGATTCCTGGAATCTTCTAAAAAGACTTGCAGGCTGTTCGAACTTGCCTTGGAT
GTTGGGTGGTGATTTTAATGAAATTCTTTTGTTTGAGGAGCAGTTAAATGGTTCTGAAAGAAATAAGGTTGGTATGGAGGATTTTAGGGAGGCTATCAATGAGTGTTCAC
TCATTGATCTGGGCTTTTCTGGAGACAGATTTACTTGGATTAAAAATAAAAAGTCAATTTCTGCTACTAGAGTTCGGCTTGACAGGTATTTTGCTAACCCTCTATTTATA
TCAAAGTCTGAGGGGTCTATGATTTTTAATTATTCTAAAAAGATTATTAGATTTGAAGATAGTTGGGCTAAGTTCAATGGTTGCAAAGAGGTGCTAAGGAAAAATTGGGA
GATGGGTGGGACGTTAGGGAATAGTGTTTGTAATCTTATTTTTCGCATTAAAGGTAGCTTATTTGAGATGGCAAATTGGAACAAAGACAGATTAAAAGGATCTATTAGAC
AAGCTATTCTTAATAAAGAAAATTCTGTTTCTCTGCTAATGGCTAATTCTGGGGGGATAAGATCTGATGAAATTATCAATATTGAAAATGACATTGAAAGGCTCCTTGAG
GAGGAGGAAATTTTTTGGAAACAAAGATCTCTTAGGTGGGATTGGAAAAATAGGAGTGTGGCGGAGCTGATGGCCAGAGATGGTGGTTGGAACAAGGAGCTCATTTACAA
CATTTTCTGCAAAGATGATGCTAATATCATTGTTAATACCCCACTCAGCAAGTTTGGTACCAATGATGAAGTTATGTGGAATGATTGTAAAAAGGGTATTTTTTCCGTTA
AAAGCGCTTATCATGTTGGTATTAATTTGGCTTCTAAAGAGAAGGAGTCCACATCTGACTCTAAGTTTACTAGCAATTGTTGGAAAAAGTTTTGGAAGATCAATATTTCT
TCTAAAATTAAATTGGGAGCTTGGAGTGTTCTTTCTAATGCAATTCCGACTAAGAGCAACATTTACTTAAGAGGAGTGGATATTAATGTTATTTGTCCTATGTGTAGGAA
ATGTAAAGAGTCGAATATGCATATTATTAAAGATTGCAAAATCGCTATTCGTGCTTGGAATCTCACTTTCCCTTGGTTTACTAATTTTAAAGATTTTTTAAAAAATATAG
ATAAAGGAATTGATTTTTGGATGGGTTTATGCGATAAAGTTCCAAAGGAGGAGCTCTTTAAAGGAATTGTGGTTATTTGGAGTTTATGGAACTTTCGTAACCAAGGATTA
AATGAAGATAATTCCTTTGAGTTCCATAAGCTTCAAGCTATGATGGAAAATGCTATTAATGCTTTTAATTTGCTTAATGCAGGTCCTCTAAAGAAGCCTTTTCGTGTTGA
TAATGAGAACATCAAATGGGTAGCTCCTCCTCTTGGTGTGTGGAAGATGAATGTGGATGCGTCAATGAGACCAAGTTCAGGTGAAGGTGGCATTGGTTGGGTCATAAGAG
ATTGGAAAGGTCAGGTGAAGATGGCTGGTTGGAGAGCAAACTTCCAAAACATGTCAATTAAGTGTTTGGAGGCCATAGCTTTGAGGGAGGGTCTTATTCATATCTTAGAG
TTGCAGAATCTGCTCCTTCCTCCTATCTTGGTGGAGTCCGATGCTTCAGAGATTATCAAAGCAATTGCTAATGACTCCTTAGACATTTCTGAATTCAGCAATATTATTGT
TGATATTAAGCATTTAATGGCTTTGTTGGGTATTGTCAAAGTGGCCCACTGTAATAGGTTATGTAACATGGTTGCTGATGTTTTAGCTAAACAGTCTGATATGGACTATT
TTTGTTGGAGGGATCCAATTCTCTCTTTTGAGGATAGACCTCGGTTTTGGAGGGGGAATCTCCCCCCTAATGTTGGCTCCATGGTGGAGAAGGATTGTGTTGGTTTAGTT
TGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCTTTTCAGTTTGCCCAAATGATGAAGAATTTGGAGCTAGTGGATCAGGAAGCTCAAGGAGTATTTGAACTTGAAGACAACGATATTTCAGCGTATGAAAGTGA
GCTCGAAAACGTGGTTGCCTGCAAATTACTTACTGAGAAACATTCTAGTCCAGATGTGTTCAAAAAATTAATGCCCAAAATCTGGAACATAGATCAGAAGGTGTTGATTG
AATCAGTGGGAGGTAACACATTTGTTTGTACCTTCGTTAATGCTGTTGTTAGAGATAGAGTTCTTGAGGCGAGTCCATGGATTTATGATAACGCGGTGTTGCTGTTTGAA
AAGCTGACTGGAGATAAAAGGATTTCGGAGATGGAGTTCGGATTTGCAGCATTTTGGGTTCGTCTTCACGACGTTCCACCAGCCTGCCATACAGAAAATAGTGCAAAAGG
CCTAGGTAATTTGATGGGAAAATTTATTATCTGGGATTATGAGGGGGATCAACGAAATAGGGGAAAATTTCTAAGGATCAAGGTGTATATTGACATTCGTTTACCTTTGA
GAAGAGCTTTAAAGGTAAAAAGTGGAGCAATGGCAAATGAAAAATGGGTCCGAATTTCATATGAAAAGTTACCTGATTTTTGCAAAGGGTGTGGAAGAATAGGCCATCAG
GCAAATGGATGCAGCTATGTTGAGTCCATTTTTTGGGCTAGGTCAGGTACGGAGAAAGAAGAGCAGTGCAGAGTTGATTTAAATCAAATTAATGATATGGCAGAAAAGGG
ACATGTGGTGACATTTAAGAAGGAAAAAGTAATTGAAAAAGGGAAAGGTAAGGTCATCTCAGAAGGTTCAAATGATTTTCTTTTTGGGCTATTTGATATTCATATGAAAG
CCCATCAGCATTTGGATAATGGGGGTTTTCGTGGACCAGCCAAAAGAGTGGTTTTTAAAGAATTGGATTTAATGCCTAATTATAATAAGGGAAGTGTGCAGTTAAATCTT
TCATGTATTAATGATAAAGCAAAAACTGTTGTCTCGCCGGATGCAATGAAAACTTTATGCTGGAACGCTCGGGGGTTGAAAAACCCCCGAGCCATTAGAGCGCTTCGGCA
TGAAGTGCTTAAAGTGACACCTCAAATTGTTTTTGTGTGCGAATCAAAATGCGGTGAGAATGTGGTGGATAAGTTAAAAATTAGCTTGAATTTTGATTGTTGTTTGTGTG
TTCCTGCTATTGGGTTGAGCGGGGGCTTATTGTTATTTTGGAGTTCGAGTTTAGATGTGAAAGTTGTCTCATACTCTCAGGGTCATATTGATATCTTAATCAATGACTTT
TTTGGGAGATGGAGATTTACGGGTTTTTATGGAAATCCAGTTCAAAGCAAGCGTAGAGATTCCTGGAATCTTCTAAAAAGACTTGCAGGCTGTTCGAACTTGCCTTGGAT
GTTGGGTGGTGATTTTAATGAAATTCTTTTGTTTGAGGAGCAGTTAAATGGTTCTGAAAGAAATAAGGTTGGTATGGAGGATTTTAGGGAGGCTATCAATGAGTGTTCAC
TCATTGATCTGGGCTTTTCTGGAGACAGATTTACTTGGATTAAAAATAAAAAGTCAATTTCTGCTACTAGAGTTCGGCTTGACAGGTATTTTGCTAACCCTCTATTTATA
TCAAAGTCTGAGGGGTCTATGATTTTTAATTATTCTAAAAAGATTATTAGATTTGAAGATAGTTGGGCTAAGTTCAATGGTTGCAAAGAGGTGCTAAGGAAAAATTGGGA
GATGGGTGGGACGTTAGGGAATAGTGTTTGTAATCTTATTTTTCGCATTAAAGGTAGCTTATTTGAGATGGCAAATTGGAACAAAGACAGATTAAAAGGATCTATTAGAC
AAGCTATTCTTAATAAAGAAAATTCTGTTTCTCTGCTAATGGCTAATTCTGGGGGGATAAGATCTGATGAAATTATCAATATTGAAAATGACATTGAAAGGCTCCTTGAG
GAGGAGGAAATTTTTTGGAAACAAAGATCTCTTAGGTGGGATTGGAAAAATAGGAGTGTGGCGGAGCTGATGGCCAGAGATGGTGGTTGGAACAAGGAGCTCATTTACAA
CATTTTCTGCAAAGATGATGCTAATATCATTGTTAATACCCCACTCAGCAAGTTTGGTACCAATGATGAAGTTATGTGGAATGATTGTAAAAAGGGTATTTTTTCCGTTA
AAAGCGCTTATCATGTTGGTATTAATTTGGCTTCTAAAGAGAAGGAGTCCACATCTGACTCTAAGTTTACTAGCAATTGTTGGAAAAAGTTTTGGAAGATCAATATTTCT
TCTAAAATTAAATTGGGAGCTTGGAGTGTTCTTTCTAATGCAATTCCGACTAAGAGCAACATTTACTTAAGAGGAGTGGATATTAATGTTATTTGTCCTATGTGTAGGAA
ATGTAAAGAGTCGAATATGCATATTATTAAAGATTGCAAAATCGCTATTCGTGCTTGGAATCTCACTTTCCCTTGGTTTACTAATTTTAAAGATTTTTTAAAAAATATAG
ATAAAGGAATTGATTTTTGGATGGGTTTATGCGATAAAGTTCCAAAGGAGGAGCTCTTTAAAGGAATTGTGGTTATTTGGAGTTTATGGAACTTTCGTAACCAAGGATTA
AATGAAGATAATTCCTTTGAGTTCCATAAGCTTCAAGCTATGATGGAAAATGCTATTAATGCTTTTAATTTGCTTAATGCAGGTCCTCTAAAGAAGCCTTTTCGTGTTGA
TAATGAGAACATCAAATGGGTAGCTCCTCCTCTTGGTGTGTGGAAGATGAATGTGGATGCGTCAATGAGACCAAGTTCAGGTGAAGGTGGCATTGGTTGGGTCATAAGAG
ATTGGAAAGGTCAGGTGAAGATGGCTGGTTGGAGAGCAAACTTCCAAAACATGTCAATTAAGTGTTTGGAGGCCATAGCTTTGAGGGAGGGTCTTATTCATATCTTAGAG
TTGCAGAATCTGCTCCTTCCTCCTATCTTGGTGGAGTCCGATGCTTCAGAGATTATCAAAGCAATTGCTAATGACTCCTTAGACATTTCTGAATTCAGCAATATTATTGT
TGATATTAAGCATTTAATGGCTTTGTTGGGTATTGTCAAAGTGGCCCACTGTAATAGGTTATGTAACATGGTTGCTGATGTTTTAGCTAAACAGTCTGATATGGACTATT
TTTGTTGGAGGGATCCAATTCTCTCTTTTGAGGATAGACCTCGGTTTTGGAGGGGGAATCTCCCCCCTAATGTTGGCTCCATGGTGGAGAAGGATTGTGTTGGTTTAGTT
TGTTGA
Protein sequenceShow/hide protein sequence
MDAFQFAQMMKNLELVDQEAQGVFELEDNDISAYESELENVVACKLLTEKHSSPDVFKKLMPKIWNIDQKVLIESVGGNTFVCTFVNAVVRDRVLEASPWIYDNAVLLFE
KLTGDKRISEMEFGFAAFWVRLHDVPPACHTENSAKGLGNLMGKFIIWDYEGDQRNRGKFLRIKVYIDIRLPLRRALKVKSGAMANEKWVRISYEKLPDFCKGCGRIGHQ
ANGCSYVESIFWARSGTEKEEQCRVDLNQINDMAEKGHVVTFKKEKVIEKGKGKVISEGSNDFLFGLFDIHMKAHQHLDNGGFRGPAKRVVFKELDLMPNYNKGSVQLNL
SCINDKAKTVVSPDAMKTLCWNARGLKNPRAIRALRHEVLKVTPQIVFVCESKCGENVVDKLKISLNFDCCLCVPAIGLSGGLLLFWSSSLDVKVVSYSQGHIDILINDF
FGRWRFTGFYGNPVQSKRRDSWNLLKRLAGCSNLPWMLGGDFNEILLFEEQLNGSERNKVGMEDFREAINECSLIDLGFSGDRFTWIKNKKSISATRVRLDRYFANPLFI
SKSEGSMIFNYSKKIIRFEDSWAKFNGCKEVLRKNWEMGGTLGNSVCNLIFRIKGSLFEMANWNKDRLKGSIRQAILNKENSVSLLMANSGGIRSDEIINIENDIERLLE
EEEIFWKQRSLRWDWKNRSVAELMARDGGWNKELIYNIFCKDDANIIVNTPLSKFGTNDEVMWNDCKKGIFSVKSAYHVGINLASKEKESTSDSKFTSNCWKKFWKINIS
SKIKLGAWSVLSNAIPTKSNIYLRGVDINVICPMCRKCKESNMHIIKDCKIAIRAWNLTFPWFTNFKDFLKNIDKGIDFWMGLCDKVPKEELFKGIVVIWSLWNFRNQGL
NEDNSFEFHKLQAMMENAINAFNLLNAGPLKKPFRVDNENIKWVAPPLGVWKMNVDASMRPSSGEGGIGWVIRDWKGQVKMAGWRANFQNMSIKCLEAIALREGLIHILE
LQNLLLPPILVESDASEIIKAIANDSLDISEFSNIIVDIKHLMALLGIVKVAHCNRLCNMVADVLAKQSDMDYFCWRDPILSFEDRPRFWRGNLPPNVGSMVEKDCVGLV
C