; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g31090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g31090
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:23385865..23393491
RNA-Seq ExpressionMoc06g31090
SyntenyMoc06g31090
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR004808 - AP endonuclease 1
IPR012337 - Ribonuclease H-like superfamily
IPR020847 - AP endonuclease 1, binding site
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH80348.1 hypothetical protein [Trifolium medium]1.2e-7227.06Show/hide
Query:  ADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENPDTMV
        AD+E+ A+ V+   +   ++  +  L+GK+      +    + TM +AW+  N + ++ + +NL LF F    + + V++ GPWSFD+ LL+L       
Subjt:  ADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENPDTMV

Query:  RPDEMDFDKASFW--------------------------EDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPEFCSFCGK
        +P E+  D  SFW                          E++D  ++    G+ L++R+  D+ K L+RG K+N  G    +W+  +YERLP FC  CG+
Subjt:  RPDEMDFDKASFW--------------------------EDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPEFCSFCGK

Query:  IGHNAKDCESFFAKDD------NLVQVGYGTLLRFDGTVKKNPKMLRKGFG-----DLMPEETDGYDRRGG---EADRRVNHQMESPD--FAQKI--GNR
        IGH  +DCE     D+         Q  +G  LR     K + ++ ++         L P  ++   +  G   E D  V  Q  S D    QKI  GN 
Subjt:  IGHNAKDCESFFAKDD------NLVQVGYGTLLRFDGTVKKNPKMLRKGFG-----DLMPEETDGYDRRGG---EADRRVNHQMESPD--FAQKI--GNR

Query:  IPEITEDLLEENEISKESKASMEK----------------ETNNPRKILSWKRR---------------AHGKQNLVDGEEGPSVLE----GSRKRKGVE
          +   D  ++  + KE +   E                 +T    K   W R+                 GK++LVD       +E    G +K +G  
Subjt:  IPEITEDLLEENEISKESKASMEK----------------ETNNPRKILSWKRR---------------AHGKQNLVDGEEGPSVLE----GSRKRKGVE

Query:  EVNETVKKAKQEIEQNGV----RVGLSELSAEAVTKCNVTVMNKLNATFNYYGCFPVSSSG----AKGGICLFWKENVQVKIRSFSFAHI-------DAM
         + + V           +    RV   +L     T+  VT +  + +   +  C  V  +G      GG+ L W E++ V I SFS  HI       ++ 
Subjt:  EVNETVKKAKQEIEQNGV----RVGLSELSAEAVTKCNVTVMNKLNATFNYYGCFPVSSSG----AKGGICLFWKENVQVKIRSFSFAHI-------DAM

Query:  VSW-------------EGVTWRFI---------------------------GGAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLD
         SW             +  TW  I                           GG  R        R  + D +L DLG+ G P+TW      G ++  RLD
Subjt:  VSW-------------EGVTWRFI---------------------------GGAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLD

Query:  RFVGNEAFIDLFKEATVQHLDWYCSDHRPIIID-TVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSW------------SANIQQNVSLSKSLALC
        R +GN  F++ F    V HL  + SDH  ++I      P T  + +R +  +F E+W    +C+E+I  +W            S +   N     +L   
Subjt:  RFVGNEAFIDLFKEATVQHLDWYCSDHRPIIID-TVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSW------------SANIQQNVSLSKSLALC

Query:  SKRLGRWVIDNLKEKL-------------------NKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFT
         K L R + D LK++                    N+LL+ +ET WRQRSR  WL+ GDRNT +FH +AS R K N I  I + DGVW      +E++F 
Subjt:  SKRLGRWVIDNLKEKL-------------------NKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFT

Query:  DYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD
         YF E+FS+SNP+  N+    + V  K+S +  +     Y R E+  A+ QM P KAPGPD
Subjt:  DYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD

XP_012847426.1 PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]1.8e-6826.91Show/hide
Query:  LRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENP
        L+LT D+EE        A    E   ++ L+G++L  + I+ E +  TM K W   +G+ V+ +G    +F F   +DR R  + GPW FDK L+VL+  
Subjt:  LRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENP

Query:  DTMVRPDEMDFDKASFWEDV--------------------------DCSRDSFCAGESLKVRIRYDIMKSLRRGIKI-NIDGSMGGVWIPLEYERLPEFC
        +    P  +  D   F+  V                           C+ D    G+ L++R   ++ K LRR  ++ N  G +  V + L+YERLP FC
Subjt:  DTMVRPDEMDFDKASFWEDV--------------------------DCSRDSFCAGESLKVRIRYDIMKSLRRGIKI-NIDGSMGGVWIPLEYERLPEFC

Query:  SFCGKIGHNAKDCESFF--AKDDNLVQVGYGTLLRFDGTVKKNPKMLRK--GFGDLMP----EETDGYD-------RRGGEADRRVNHQMESPDFAQKIG
         FCG + H +  C   +  + ++      YG  L+     K    +L      GD+         +G +         G E D  V+ + +S  F+Q  G
Subjt:  SFCGKIGHNAKDCESFF--AKDDNLVQVGYGTLLRFDGTVKKNPKMLRK--GFGDLMP----EETDGYD-------RRGGEADRRVNHQMESPDFAQKIG

Query:  N-RIPEITE-DLLEENEISKESKASMEKETNNPRKIL--SWKRRAH------------------GKQNLVDGEEGPSVLEG-------------------
        + +I E  + D+ + N+          ++ +NP   +  SW    +                      L+DG  GP  L G                   
Subjt:  N-RIPEITE-DLLEENEISKESKASMEKETNNPRKIL--SWKRRAH------------------GKQNLVDGEEGPSVLEG-------------------

Query:  ---SRKRKGVEEV---------NETVKKAKQEIEQNGVR----------VGLSELSAEAVTKCNVTVMNKLNATFNYYGCFPVSSSGAKGGICLFWKENV
           SR R    +V           TVK+ + + + +  R          V +   S E        V+   N+T  +       ++G  GG+ L W++++
Subjt:  ---SRKRKGVEEV---------NETVKKAKQEIEQNGVR----------VGLSELSAEAVTKCNVTVMNKLNATFNYYGCFPVSSSGAKGGICLFWKENV

Query:  QVKIRSFSFAHIDAMVSWEGV--TWRFIGG----------------------------AARDGNLM-----------------ENFRNTLEDYNLTDLGY
         V + +FS  HIDA +    +  TWRF G                              A D N M                 + F + L D  L DLG+
Subjt:  QVKIRSFSFAHIDAMVSWEGV--TWRFIGG----------------------------AARDGNLM-----------------ENFRNTLEDYNLTDLGY

Query:  FGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQNV
         G P+TW     +     ERLDR  GN  +++LF    V+HLD   SDH P++I+  R      +  R+ G KF   W    EC++II ++W AN+ Q  
Subjt:  FGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQNV

Query:  SLSK--SLALCSKRLGRWV----------IDNLKEK-----------------------LNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKR
        SL +  +L  C   L RW           I  LKEK                       L++LL++EE  WRQR++  W+R GD+NT +FH +AS R+++
Subjt:  SLSK--SLALCSKRLGRWV----------IDNLKEK-----------------------LNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKR

Query:  NEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPDEF
        N I+G+ N +GVW     ++E+I +DYF +IF+S +  T  ++ +L  +  +VS ++N +LL  Y   E+  AL  MQP K+PGPD F
Subjt:  NEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPDEF

XP_012847426.1 PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]4.3e-2228.37Show/hide
Query:  LKVSDFITS-SKQWDIQKLRQYVIEEDLEAIGRILLSWSEAEDASVCHYDRRGVYSVKSGYKLGMNL---KDLPSSSKSTN-------------------
        +KVS  I S + QWD   L Q  +EED+  I  I L  S  ED  + HY+R G++SV+S Y + + +   KD  +S+ S++                   
Subjt:  LKVSDFITS-SKQWDIQKLRQYVIEEDLEAIGRILLSWSEAEDASVCHYDRRGVYSVKSGYKLGMNL---KDLPSSSKSTN-------------------

Query:  ---HALFQCKRAREVWNLVLPQVRFIIGHSP---SVQDKFLTLQEALSAKDFELACVSCWAIWNDRNAIRAQAQVPDGNCRSEWIRAYMKEFEGCWQ-VK
           H L  C  AR+VW   L  V ++I H P   SV +  L +++   +  FE   V CWAIWN RN    +    D +  +  I  + K+F    + + 
Subjt:  ---HALFQCKRAREVWNLVLPQVRFIIGHSP---SVQDKFLTLQEALSAKDFELACVSCWAIWNDRNAIRAQAQVPDGNCRSEWIRAYMKEFEGCWQ-VK

Query:  HVSETNLQLSSFRTFVSVWVPPPAEWVKLNVDA----------------------------ACKSEISRTVAELSAIREGLYLASQLNFSLVQIESDCLQ
         V  +   L S +     W  PP   VK+N DA                            +CK       AE  A  + L  A   +F  V +E D   
Subjt:  HVSETNLQLSSFRTFVSVWVPPPAEWVKLNVDA----------------------------ACKSEISRTVAELSAIREGLYLASQLNFSLVQIESDCLQ

Query:  AIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLA
         +  I G  + +   G L++DI+ +A+ F +    H+ REGN  AH++A
Subjt:  AIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLA

XP_012847426.1 PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]1.5e-6726.03Show/hide
Query:  LATSMVSLLGCYGEEELLEGWKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVI
        L  S+ +L   +     ++ WK+  LT +EEE  V  D  A E  ++     L+GKL      +  I +  + ++W+++N + ++ + +NL LF F    
Subjt:  LATSMVSLLGCYGEEELLEGWKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVI

Query:  DRNRVFKTGPWSFDKYLLVLENPDTMVRPDEMDFDKASFWEDV-------------------------DCSRDSFCAGESLKVRIRYDIMKSLRRGIKIN
        D   V + GPWSFD+ L++L+      +P +++   A FW  +                           S+D    G+ L+V++  D+ K L+RG  +N
Subjt:  DRNRVFKTGPWSFDKYLLVLENPDTMVRPDEMDFDKASFWEDV-------------------------DCSRDSFCAGESLKVRIRYDIMKSLRRGIKIN

Query:  IDGSMGGVWIPLEYERLPEFCSFCGKIGHNAKDCESFFAKDDN------LVQVGYGTLL------RFDGTVKK---NPKMLRKGFGDLMPEETDGYDRRG
          G    V+   +YERLP FC  CG+IGH  KDCE     +++        ++ YG+ L      +  G +KK   +    +  FG+    +    +  G
Subjt:  IDGSMGGVWIPLEYERLPEFCSFCGKIGHNAKDCESFFAKDDN------LVQVGYGTLL------RFDGTVKK---NPKMLRKGFGDLMPEETDGYDRRG

Query:  GEADRRVNHQMESPDFAQKIGNRIPEITEDLLEENEISKESK------ASMEKETNNPRKILSWKRRAHGKQNL-VDGEEGPSVLEGSRKRKGVEEVNET
           +  V   +       K  N + EI E  +EE    K+ K       S+     + +K+ ++   +H KQ   V      S  +G+RK K  ++    
Subjt:  GEADRRVNHQMESPDFAQKIGNRIPEITEDLLEENEISKESK------ASMEKETNNPRKILSWKRRAHGKQNL-VDGEEGPSVLEGSRKRKGVEEVNET

Query:  VKKAKQEIEQNGVRVGLSELSAEAV----------------------TKCNVTVMNKLNATFNYYGCFPVSSSGA----KGGICLFWKENVQVKIRSFSF
            K+++    V V +SE   EA+                      T+  V  M ++     +  C  V+ +G+     GGI L W++ V + + +FS 
Subjt:  VKKAKQEIEQNGVRVGLSELSAEAV----------------------TKCNVTVMNKLNATFNYYGCFPVSSSGA----KGGICLFWKENVQVKIRSFSF

Query:  AHIDAMV-------SW-------------EGVTWRFI---------------------------GGAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFR
         HI   +       +W             +  TW  I                           GG  R    M   R  +E   L D+G+ G PYTW  
Subjt:  AHIDAMV-------SW-------------EGVTWRFI---------------------------GGAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFR

Query:  LWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQNVSLSKSLALC
              +I  RLDR +  E F++ F    V HL  + SDH  I+I    E Q  S+ KR +  +F E W     C++ + +SW     Q V+  +++   
Subjt:  LWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQNVSLSKSLALC

Query:  SK-------RLGRWVIDNLKEKLN-----------------------KLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWM
         +       R  R  I+ ++EKLN                       +LL  EE  WRQRSR  WL+ GDRNT +F  +AS RKK NEI  + + +G+W 
Subjt:  SK-------RLGRWVIDNLKEKLN-----------------------KLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWM

Query:  SNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD
        +    +E++  DY+ ++F++S+P+  N++  +Q V  K+ +         + R EI+ A+ QM P KAPGPD
Subjt:  SNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD

XP_022154822.1 uncharacterized protein LOC111021983 [Momordica charantia]5.2e-8477.1Show/hide
Query:  MKILSWNVRGINGYKKRLKVKKTIMKSNPDVVLLQETKLQQIDRIIIKSLWSSKDVGWACLNSKGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNL
        MKILSWNVRGINGYKKRLKVKKTIMKSNPDVVLLQETKLQQIDRIIIKSLWSSKDVGWACLNSK                                    
Subjt:  MKILSWNVRGINGYKKRLKVKKTIMKSNPDVVLLQETKLQQIDRIIIKSLWSSKDVGWACLNSKGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNL

Query:  NNVVITNVYGPTDYRNRKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDKSSGGRITRSMRKFNCIIGELDLTEVPLSNGKFTWSRMGNDSIHSLLDKFL
                    DYRNRKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDKSSGGRITRSMRKFNCIIGELDLTEVPLSNGKFTWSRMGNDSIHSLLDKFL
Subjt:  NNVVITNVYGPTDYRNRKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDKSSGGRITRSMRKFNCIIGELDLTEVPLSNGKFTWSRMGNDSIHSLLDKFL

Query:  LSKEWDNLFNNSRI
        LSKEWDNLFNNSR+
Subjt:  LSKEWDNLFNNSRI

XP_035540109.1 uncharacterized protein LOC118344190 [Juglans regia]1.1e-7027.19Show/hide
Query:  GEEELLEGWKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWS
        G + L   W+ LRLT +E E    +D A  EE  +  ++ ++GK+   RSI  ++I  TM K W++        +G NL + TF   +D+ RV    PW 
Subjt:  GEEELLEGWKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWS

Query:  FDKYLLVLENPDTMVRPDEMDFDKASFW--------------------------EDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIP
        FD +L  L+  D   +P++  FDK  FW                           +VD   D    G+ L+VR+   +MK++ RG  I ++G    VWI 
Subjt:  FDKYLLVLENPDTMVRPDEMDFDKASFW--------------------------EDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIP

Query:  LEYERLPEFCSFCGKIGHNAKDCESFF--AKDDNLVQVGYGTLLRFDGTVKK----------NPKMLRK----------------GFGDLMPEET-----
        L YE+LP  C  CG+I H  K CE        +  +   +G  LR D   ++            +  RK                G G+L  E+      
Subjt:  LEYERLPEFCSFCGKIGHNAKDCESFF--AKDDNLVQVGYGTLLRFDGTVKK----------NPKMLRK----------------GFGDLMPEET-----

Query:  -DGYDRRGGEADRRVNHQM-------ESPDFAQKIGNRIPEITE---------DLLEENEISKESKASME---KETNNPRKILSWKRRAHGKQNLVDGEE
         +GYD   G+    VN ++       E  D   K   +I +I E         + + E EI K   A  E   K+ N       WKRRA G     +G +
Subjt:  -DGYDRRGGEADRRVNHQM-------ESPDFAQKIGNRIPEITE---------DLLEENEISKESKASME---KETNNPRKILSWKRRAHGKQNLVDGEE

Query:  GPSVLEGSRKRKGVEEVN----ETVKKAKQEIEQNGVRVGLSELSAEA----------------VTKCNVTVMNKLNATFNYYGCFPVSSSGAKGGICLF
        G S+ E  R   G +EV+      VKK+K+E E     V +S   A +                 TK     +  +     + GC  V   G  GG+ + 
Subjt:  GPSVLEGSRKRKGVEEVN----ETVKKAKQEIEQNGVRVGLSELSAEA----------------VTKCNVTVMNKLNATFNYYGCFPVSSSGAKGGICLF

Query:  WKENVQVKIRSFSFAHIDAMV-------------------------SWE---------GVTWRFI-------------GGAARDGNLMENFRNTLEDYNL
        WK+  +V++ ++S  HI   V                         SW+          + W  I             GG  R    ME FRN L+D  L
Subjt:  WKENVQVKIRSFSFAHIDAMV-------------------------SWE---------GVTWRFI-------------GGAARDGNLMENFRNTLEDYNL

Query:  TDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSWSAN
         DLG+ G  YTW         I ERLDR V N  +   F E  V+ +  +CSDH PI++   +      + KR +  ++   W    EC E+I  +W   
Subjt:  TDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSWSAN

Query:  IQQNVSLSKSLALCSKRLGRW--------------VIDNLK------------------EKLNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRK
          Q+  +   L  C + L  W              + D LK                   +L+ LL++E  +W+QR++  WL+ GDRNT +FH  A+ R+
Subjt:  IQQNVSLSKSLALCSKRLGRW--------------VIDNLK------------------EKLNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRK

Query:  KRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPDEF
        K+N I  I N  G       ++E+ F  YF E+F++++P+T+ ++  +     ++++ M   L   ++  E+  ALKQM   K+PGPD F
Subjt:  KRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPDEF

TrEMBL top hitse value%identityAlignment
A0A2N9F9I5 Uncharacterized protein4.0e-7429.02Show/hide
Query:  RAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENPDTMVRPDEMDFDKAS
        R  + ++++V+   L  K L  R I+ E +  T+K  W+  +G +   MG N  +F F    D  RV   GPWSFDKYL++L+  +      ++ FD  S
Subjt:  RAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENPDTMVRPDEMDFDKAS

Query:  FWEDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPEFCSFCGKIGHNAKDCESFFAKDDNLV--QVGYGTLLRFDGTVKK
        FW  V+   +    G  ++VR++ DI + L RG KI   G     W+  ++ERLP FC +CG+I H+ +DC  +      L   +  YG  LR  G + +
Subjt:  FWEDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPEFCSFCGKIGHNAKDCESFFAKDDNLV--QVGYGTLLRFDGTVKK

Query:  NPKMLRKGFGDLMP--EETDGYDRRGGEADR------RVNHQMESPDFAQ---------KIGNRIPEITEDL-LEENEISKESKASMEKETNNPRKI--L
          +  R+G G  +P  E +    RR     R           M +P+F               ++ EI  +L L+  E+++  +  ++ ++ +  +I   
Subjt:  NPKMLRKGFGDLMP--EETDGYDRRGGEADR------RVNHQMESPDFAQ---------KIGNRIPEITEDL-LEENEISKESKASMEKETNNPRKI--L

Query:  SWKRRAHGKQNLVDGE-EGPSVLE-----------GSRKRKGVEEVNETVKKAKQEI------------EQNGVRVGL--------------SELSAEAV
           R       LV GE  GP V +           GS  ++ V + N  +  A+Q I            E+  VR GL               ++SAE V
Subjt:  SWKRRAHGKQNLVDGE-EGPSVLE-----------GSRKRKGVEEVNETVKKAKQEI------------EQNGVRVGL--------------SELSAEAV

Query:  TKCNVTVMNK-------------------LNATFNYYGCFPVSSSGAKGGICLFWKENVQVKIRSFSFAHIDAMVSWEGVTWRFIG--------GAARDG
         +  + V  K                   L   + + G F V S G  GG+  FW + V V I S+S  HIDA+++++   WRF G        G     
Subjt:  TKCNVTVMNK-------------------LNATFNYYGCFPVSSSGAKGGICLFWKENVQVKIRSFSFAHIDAMVSWEGVTWRFIG--------GAARDG

Query:  NLMENFR-------------NTL----EDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTV
        +L+   R             N L    E +   DLG+ G P+TW+        + ERLDR +    ++  F +  V HL    SDHRP+ ++       V
Subjt:  NLMENFR-------------NTL----EDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTV

Query:  SKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQNVSLSKSLALCSKRLGRWVIDNLKEKLNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRK
           K+ +  +F E W  H  C++ I  +W           +      +     +I  L  +L  L  +EET WRQRSR  WL+ GDRNT +FH +A+ RK
Subjt:  SKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQNVSLSKSLALCSKRLGRWVIDNLKEKLNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRK

Query:  KRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPDE
        +RN I GI ++ G W +   ++E I  +Y++ +F++S P     D IL  V + V+  MN  L A +  +E+  ALKQM P KAPGPDE
Subjt:  KRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPDE

A0A2N9HFT1 Uncharacterized protein4.7e-7528.5Show/hide
Query:  WKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVL
        WK   L  D+EE   D     +E+ E+        K +  R ++ E +  T K  W+ + G SV+ MG N  LF F    D  RV    PW++DKY+++ 
Subjt:  WKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVL

Query:  ENPDTMVRPDEMDFDKASFWEDVD-------------------------CSRDSFCAGESL-KVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPE
        +  +     + + F     W  +                           S +   +GE+  ++++R DI + L RG ++ +     G W+  +YERLP 
Subjt:  ENPDTMVRPDEMDFDKASFWEDVD-------------------------CSRDSFCAGESL-KVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPE

Query:  FCSFCGKIGHNAKDCESFFAK--DDNLVQVGYGTLLRFDGTVKKNPKMLRKGFGDLMPEETDGYDRRGGEADRRVNHQMESPDFAQKIGNRIPEITEDLL
        FC  CG + H  KDC     +    +  Q  YG  LR D       K  RK +  +  E      +R    D   N   +SP  A  +  R P  TED  
Subjt:  FCSFCGKIGHNAKDCESFFAK--DDNLVQVGYGTLLRFDGTVKKNPKMLRKGFGDLMPEETDGYDRRGGEADRRVNHQMESPDFAQKIGNRIPEITEDLL

Query:  EENEISKESKASMEKETNNPRKILSWKRRAHGKQNLVDGEEGPSVLEGSRKRKGVEEVNETVKKAKQEIEQNGVRVGLSELSAEAV--TKCNVTVMNKLN
                  + ME E N    ++  +  +   QN    +E    ++ +     V+E+   V++               +LSA  +  T  +   +  L 
Subjt:  EENEISKESKASMEKETNNPRKILSWKRRAHGKQNLVDGEEGPSVLEGSRKRKGVEEVNETVKKAKQEIEQNGVRVGLSELSAEAV--TKCNVTVMNKLN

Query:  ATFNYYGCFPVSSSGAKGGICLFWKENVQVKIRSFSFAHIDAMVSWEGVTWRFI---------------------------------------------G
           ++     V      GG+ LFWK+ + ++I S+S++HI    S  GV WRFI                                             G
Subjt:  ATFNYYGCFPVSSSGAKGGICLFWKENVQVKIRSFSFAHIDAMVSWEGVTWRFI---------------------------------------------G

Query:  GAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPI-IIDTVREPQTVSKLKRSYGCK
          ++  + M+ FR  L+D    DLGY G+P+TW     SG  ++E+LDR V + A++ LF +A V HLD+  SDH+P+ +  TV   + V+K  R     
Subjt:  GAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPI-IIDTVREPQTVSKLKRSYGCK

Query:  FNENWAAHPECKEIISDSWSAN-IQQNVSLSKSLALCSKRLGRWVIDNLKEKLNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGIN
        F E W +   C E I+++W  + +++   L  +     +     +I +L  +++ LL +EE  WRQRSR  WL+ GDRNT +FH RA+ R++RN I G+ 
Subjt:  FNENWAAHPECKEIISDSWSAN-IQQNVSLSKSLALCSKRLGRWVIDNLKEKLNKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGIN

Query:  NRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD
        + DG W  +  +++ +   YFQ IF SSNP+  ++D +LQC+P  ++ SMN  L  PY  +E+ TAL+QM P  APGPD
Subjt:  NRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD

A0A2N9J109 Uncharacterized protein4.0e-7428.03Show/hide
Query:  WKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVL
        WK   L  D+E+   D     +E+ E+     L  K +  R ++ E +  T K  W+ + G S++ MG N  LF F    D  RV    PW++DKY+++ 
Subjt:  WKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVL

Query:  ENPDTMVRPDEMDFDKASFWEDVD------CSRD-SFCAGESL-------------------KVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPE
        +  +  V  + + F     W  +        SR+ +   G SL                   ++++R DI +SL RG ++       G WI  +YERLP 
Subjt:  ENPDTMVRPDEMDFDKASFWEDVD------CSRD-SFCAGESL-------------------KVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPE

Query:  FCSFCGKIGHNAKDCESF--FAKDDNLVQVGYGTLLRFDGTVKKNPKMLRKGFGDL-----------MPEETDGYDRRGGEADRRVNHQMESPDFAQKIG
        FC  CG + H  KDC       K+ +  Q  +G  LR      +  K  RK +  +            P +T      GG+  R+       P       
Subjt:  FCSFCGKIGHNAKDCESF--FAKDDNLVQVGYGTLLRFDGTVKKNPKMLRKGFGDL-----------MPEETDGYDRRGGEADRRVNHQMESPDFAQKIG

Query:  NRIPEITEDLLEENEISKESKASMEKETNNPRKILSWKRRAHGKQNLVDGE--EGPSVLEGSRKRKGVEEVNETVKKAKQEIEQNGVRVGLSELSAEAVT
           PE T+   EEN    ++   +  +TN  +  +   R      N   G   + P V+E     K V+E+ + V+      E++   + L E      T
Subjt:  NRIPEITEDLLEENEISKESKASMEKETNNPRKILSWKRRAHGKQNLVDGE--EGPSVLEGSRKRKGVEEVNETVKKAKQEIEQNGVRVGLSELSAEAVT

Query:  KCNVTVMNKLNATFNYYGCFPVSSSGAKGGICLFWKENVQVKIRSFSFAHIDAMV-SWEGVTWRFIG--GA-----------------------------
          +   +  L   F+++    V      GG+ LFWK+++ ++I S+S++HID +V +     W F G  GA                             
Subjt:  KCNVTVMNKLNATFNYYGCFPVSSSGAKGGICLFWKENVQVKIRSFSFAHIDAMV-SWEGVTWRFIG--GA-----------------------------

Query:  --------------ARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQT
                      ++  + M+ FR+ L+     DLGY G+P+TW     SG  ++ERLD+ V    ++ +F +A V HLD+  SDH+P+ +     P++
Subjt:  --------------ARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQT

Query:  VSKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQN--VSLSKSLALCSKRLGRW---------------------------------VIDNLKEKLNK
           L+ +    F E W +   C E I++SW A+   N    +   L  C K L  W                                  I +L++++ +
Subjt:  VSKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQN--VSLSKSLALCSKRLGRW---------------------------------VIDNLKEKLNK

Query:  LLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLL
        LL +EE  WRQRSR  WL++GDRNT++FH RA+ R++RN I G+ + DGVW +   +++ + T YFQ IF +SNP+  ++D +LQCVP  V+ SMN  L 
Subjt:  LLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLL

Query:  APYNRSEIVTALKQMQPSKAPGPD
         PY  SE+  AL+QM P  APGPD
Subjt:  APYNRSEIVTALKQMQPSKAPGPD

A0A392M033 CCHC-type domain-containing protein (Fragment)5.8e-7327.06Show/hide
Query:  ADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENPDTMV
        AD+E+ A+ V+   +   ++  +  L+GK+      +    + TM +AW+  N + ++ + +NL LF F    + + V++ GPWSFD+ LL+L       
Subjt:  ADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTMKKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENPDTMV

Query:  RPDEMDFDKASFW--------------------------EDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPEFCSFCGK
        +P E+  D  SFW                          E++D  ++    G+ L++R+  D+ K L+RG K+N  G    +W+  +YERLP FC  CG+
Subjt:  RPDEMDFDKASFW--------------------------EDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGGVWIPLEYERLPEFCSFCGK

Query:  IGHNAKDCESFFAKDD------NLVQVGYGTLLRFDGTVKKNPKMLRKGFG-----DLMPEETDGYDRRGG---EADRRVNHQMESPD--FAQKI--GNR
        IGH  +DCE     D+         Q  +G  LR     K + ++ ++         L P  ++   +  G   E D  V  Q  S D    QKI  GN 
Subjt:  IGHNAKDCESFFAKDD------NLVQVGYGTLLRFDGTVKKNPKMLRKGFG-----DLMPEETDGYDRRGG---EADRRVNHQMESPD--FAQKI--GNR

Query:  IPEITEDLLEENEISKESKASMEK----------------ETNNPRKILSWKRR---------------AHGKQNLVDGEEGPSVLE----GSRKRKGVE
          +   D  ++  + KE +   E                 +T    K   W R+                 GK++LVD       +E    G +K +G  
Subjt:  IPEITEDLLEENEISKESKASMEK----------------ETNNPRKILSWKRR---------------AHGKQNLVDGEEGPSVLE----GSRKRKGVE

Query:  EVNETVKKAKQEIEQNGV----RVGLSELSAEAVTKCNVTVMNKLNATFNYYGCFPVSSSG----AKGGICLFWKENVQVKIRSFSFAHI-------DAM
         + + V           +    RV   +L     T+  VT +  + +   +  C  V  +G      GG+ L W E++ V I SFS  HI       ++ 
Subjt:  EVNETVKKAKQEIEQNGV----RVGLSELSAEAVTKCNVTVMNKLNATFNYYGCFPVSSSG----AKGGICLFWKENVQVKIRSFSFAHI-------DAM

Query:  VSW-------------EGVTWRFI---------------------------GGAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLD
         SW             +  TW  I                           GG  R        R  + D +L DLG+ G P+TW      G ++  RLD
Subjt:  VSW-------------EGVTWRFI---------------------------GGAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLD

Query:  RFVGNEAFIDLFKEATVQHLDWYCSDHRPIIID-TVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSW------------SANIQQNVSLSKSLALC
        R +GN  F++ F    V HL  + SDH  ++I      P T  + +R +  +F E+W    +C+E+I  +W            S +   N     +L   
Subjt:  RFVGNEAFIDLFKEATVQHLDWYCSDHRPIIID-TVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSW------------SANIQQNVSLSKSLALC

Query:  SKRLGRWVIDNLKEKL-------------------NKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFT
         K L R + D LK++                    N+LL+ +ET WRQRSR  WL+ GDRNT +FH +AS R K N I  I + DGVW      +E++F 
Subjt:  SKRLGRWVIDNLKEKL-------------------NKLLEEEETYWRQRSRECWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFT

Query:  DYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD
         YF E+FS+SNP+  N+    + V  K+S +  +     Y R E+  A+ QM P KAPGPD
Subjt:  DYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPD

A0A6J1DMQ9 uncharacterized protein LOC1110219832.5e-8477.1Show/hide
Query:  MKILSWNVRGINGYKKRLKVKKTIMKSNPDVVLLQETKLQQIDRIIIKSLWSSKDVGWACLNSKGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNL
        MKILSWNVRGINGYKKRLKVKKTIMKSNPDVVLLQETKLQQIDRIIIKSLWSSKDVGWACLNSK                                    
Subjt:  MKILSWNVRGINGYKKRLKVKKTIMKSNPDVVLLQETKLQQIDRIIIKSLWSSKDVGWACLNSKGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNL

Query:  NNVVITNVYGPTDYRNRKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDKSSGGRITRSMRKFNCIIGELDLTEVPLSNGKFTWSRMGNDSIHSLLDKFL
                    DYRNRKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDKSSGGRITRSMRKFNCIIGELDLTEVPLSNGKFTWSRMGNDSIHSLLDKFL
Subjt:  NNVVITNVYGPTDYRNRKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDKSSGGRITRSMRKFNCIIGELDLTEVPLSNGKFTWSRMGNDSIHSLLDKFL

Query:  LSKEWDNLFNNSRI
        LSKEWDNLFNNSR+
Subjt:  LSKEWDNLFNNSRI

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog9.7e-0922.15Show/hide
Query:  NFRNTLEDYNLTDL-GYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIID--TVREPQTVSKLKRSYGCKFNENWAAH
        +  +T++  +LTD+   F    T +  +SS    + ++D  +G+++ +  FK+  ++ +    SDH  I ++    R   T +K  +       + W   
Subjt:  NFRNTLEDYNLTDL-GYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIID--TVREPQTVSKLKRSYGCKFNENWAAH

Query:  PECKEII-----SDSWSANIQ------QNVSLSKSLALCS--KRLGRWVIDNLKEKLNKLLEEEETYWRQRSR---------------ECWLRWGDRNTT
           KEI      +++   N Q      + V   K +AL +  K+  R  ++NL   L +L +EE +  +   R               +  ++  +++ +
Subjt:  PECKEII-----SDSWSANIQ------QNVSLSKSLALCS--KRLGRWVIDNLKEKLNKLLEEEETYWRQRSR---------------ECWLRWGDRNTT

Query:  WFHKRAS---------VRKKR--NEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQ-CVPQKVSQSMNSVLLAPYNRSEIVTALKQ
        WF ++ +          RKKR  + IS I N +    ++ +E+++I  +Y+++++S      K +D  L+ C   ++SQ    +L  P + SEI + ++ 
Subjt:  WFHKRAS---------VRKKR--NEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQ-CVPQKVSQSMNSVLLAPYNRSEIVTALKQ

Query:  MQPSKAPGPDEFYLKVSDFITSSKQ
        +   K+PGPD F    S+F  + K+
Subjt:  MQPSKAPGPDEFYLKVSDFITSSKQ

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.8e-1725.56Show/hide
Query:  MENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGC-KFNENWAAH
        +E F+N L D +L D+   G  YTW         I  +LDR + N  +   F  A         SDH P II     P      KRS  C ++    + H
Subjt:  MENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQHLDWYCSDHRPIIIDTVREPQTVSKLKRSYGC-KFNENWAAH

Query:  PECKEIISDSWSANIQQNVSL------SKSLALCSKRLGRWVIDNLKEKLNKLLEE---------------------------------EETYWRQRSRE
        P     ++ +W   I     +       K+   C K L R    N++ K  + L+                                   E+++RQ+SR 
Subjt:  PECKEIISDSWSANIQQNVSL------SKSLALCSKRLGRWVIDNLKEKLNKLLEE---------------------------------EETYWRQRSRE

Query:  CWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNP--TTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALK
         WL+ GD NT +FHK     + +N I  +   D V + N T+++++   Y+  +  S +   T  ++  I    P + + ++ S L A  +  EI  A+ 
Subjt:  CWLRWGDRNTTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNP--TTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALK

Query:  QMQPSKAPGPDEF
         M  +KAPGPD F
Subjt:  QMQPSKAPGPDEF

AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.3e-0732.31Show/hide
Query:  AEWVKLNVDAACKS----EISRTVAELSAIREGLYLASQ----LNFSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNR
        A W+  N D   +     ++  T   L A  + L  A Q      +  V +E DC     L+SG+S        L+DDIR  A  FS V F  VRR GN+
Subjt:  AEWVKLNVDAACKS----EISRTVAELSAIREGLYLASQ----LNFSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNR

Query:  VAHQLATKAFVGEVAGVRFSNFPIWILNEY
        VAH+LA                PIW+   Y
Subjt:  VAHQLATKAFVGEVAGVRFSNFPIWILNEY

AT3G09510.1 Ribonuclease H-like superfamily protein2.1e-1122.31Show/hide
Query:  WDIQKLRQYVIEEDLEAIGRILLSWSEAEDASVCHYDRRGVYSVKSGYKL-----------------GMNLK----DLP---------------------
        WD  K+ Q+V + D   I RI L+ S+  D  + +Y+  G Y+V+SGY L                  ++LK    +LP                     
Subjt:  WDIQKLRQYVIEEDLEAIGRILLSWSEAEDASVCHYDRRGVYSVKSGYKL-----------------GMNLK----DLP---------------------

Query:  -----------------SSSKSTNHALFQCKRAREVWNL---VLPQVRFIIGHSPSVQDKFLTLQEALSAKDFE--LACVSCWAIWNDRNAI--RAQAQV
                           ++S NHALF C  A   W L    L + + +           L   +  +  DF   L     W IW  RN +      + 
Subjt:  -----------------SSSKSTNHALFQCKRAREVWNL---VLPQVRFIIGHSPSVQDKFLTLQEALSAKDFE--LACVSCWAIWNDRNAI--RAQAQV

Query:  PDGNCRS------EWIRAYMKEFEGCWQVKHVSETNLQLSSFRTFVSVWVPPPAEWVKLNVDAAC------------------------KSEISRTVAEL
        P     S      +W+ A     +     + ++E  ++          W  PPA +VK N DA                            +++ T   L
Subjt:  PDGNCRS------EWIRAYMKEFEGCWQVKHVSETNLQLSSFRTFVSVWVPPPAEWVKLNVDAAC------------------------KSEISRTVAEL

Query:  SAIREGLYLASQ----LNFSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLATKAFVGEVAGVRFSNFPIWI
         A  + L  A Q      ++ V +E DC   I LI+G S         ++DI   A+ F+ + F  +RR+GN++AH LA              + PIW+
Subjt:  SAIREGLYLASQ----LNFSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLATKAFVGEVAGVRFSNFPIWI

AT4G29090.1 Ribonuclease H-like superfamily protein1.9e-1225Show/hide
Query:  PSSSKSTNHALFQCKRAREVWNLVLPQVRFIIGHSPSVQDKFLTLQEALS--------AKDFELACVSCWAIWNDRNAIRAQAQVPDGNCRSEWIRAYMK
        PS  ++ NH LF+C  AR  W   +  +   +G        ++ L    +         K  +L     W +W +RN +  + +  +     E +R    
Subjt:  PSSSKSTNHALFQCKRAREVWNLVLPQVRFIIGHSPSVQDKFLTLQEALS--------AKDFELACVSCWAIWNDRNAIRAQAQVPDGNCRSEWIRAYMK

Query:  EFEGCWQVKHVSE---TNLQLSSFRTFVSVWVPPPAEWVKLNVDAA-------C---------KSEI------------SRTVAELSAIREGLYLASQLN
        + E  W+++  +E   T  Q++  R+    W PPP +WVK N DA        C         K E+            S   AEL A+R  +   S+  
Subjt:  EFEGCWQVKHVSE---TNLQLSSFRTFVSVWVPPPAEWVKLNVDAA-------C---------KSEI------------SRTVAELSAIREGLYLASQLN

Query:  FSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLATKA
        ++ V  ESD    IE+++ N E W      + D++ + S F++V F+ + REGN +A ++A ++
Subjt:  FSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLATKA

AT5G42965.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-0532.5Show/hide
Query:  ELSAIREGLYLASQLNFSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLATKA
        EL A+R      S+ N+  V  ESD    + L+ G+   +      + DIRH+   F +V  +H++REGN VA ++A ++
Subjt:  ELSAIREGLYLASQLNFSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLATKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATACTTTCCTGGAACGTCCGTGGAATAAATGGTTATAAGAAGAGACTAAAAGTCAAGAAGACTATTATGAAATCCAATCCAGATGTGGTATTATTGCAGGAAAC
CAAACTTCAACAAATTGATCGGATAATCATCAAATCACTTTGGAGCTCCAAGGATGTAGGTTGGGCATGTTTGAATTCGAAAGGTAAATCAGGAGGTATTTTATCTTTAT
GGGATGAAAGCAGAATCGCAGTATCTGAAGTTCTAGAAGGCACTTGCTCAATTAGTATAGTAGTCTCTCTTTCCAATTTGAATAATGTGGTGATAACAAATGTTTATGGC
CCAACAGATTACAGAAATAGGAAGCGATTATGGTCAGAACTCAGGGATATTTCAGGCTTTAGTGAAAAATTCTGGTGCTTGGGTGGTGATTTTAACGTATCAAGATGGCC
TTCAGACAAATCTTCCGGGGGACGTATTACTAGGAGCATGAGAAAATTTAATTGTATCATTGGAGAGCTTGACCTTACGGAGGTTCCCTTATCTAATGGCAAATTCACAT
GGTCAAGAATGGGAAACGACTCTATACATTCACTTTTGGACAAATTTTTGCTATCCAAAGAATGGGATAACTTGTTTAACAACTCACGGATTGTGTCTGAGATCAACGTC
ATTGATGCCAAAGATGAGTTACTTGGACTCTCTCCAACAGAAGTTGACAAGAGGTGCTCATTGAAGCATGACTTGCTGAGCTTGTATTTGTCGGAAGAACGAGTGTTGCT
GCAAAAATGTAAACTCCATTGGCTTAAAGAAGGTGATGAAAACAGTAAGTTCTTTCATAGATACTTGGCTGCCCGTAAAAGAAAAGCACAGATTTGTGAAATTAAAGATG
AGCGGAATACCTCGCTGGTGAATAAAAGGGAAATTGAAACTGAAATCATGAGGTTTTTCGAGAATTTATACAATAGTGAGGGGATTCAACGATTTTCCTTGAAGGGTATT
AACTGGAGACCAATTCCTATCCAAAACAGTAGGTGGTTAGAGAGACCTTTCGAAGAAGCAGAGATTTTGAGTGCTGTAAAGATCCTTGGGAGGAATAAGGCTCCGTATCC
TTTTTTGGCAACATCGATGGTTAGCCTGCTAGGGTGCTATGGCGAGGAGGAGCTTTTGGAGGGATGGAAGAAATTGAGGTTAACAGCGGATGAGGAAGAGGGGGCAGTGG
ATGTTGACAGGGCGGCTATGGAGGAAACAGAGAAAGTTCTGAATGTGTGCCTAATGGGGAAACTACTGGCGGGGAGGTCGATCTCTTGTGAGATTATCAGAAATACAATG
AAGAAAGCGTGGAAAATTGAGAACGGTCTATCTGTGGAAAGCATGGGGAGGAATCTCTGCCTCTTTACCTTTCCAAGGGTAATTGATAGGAACCGTGTCTTCAAGACAGG
ACCTTGGAGTTTTGACAAGTATTTGCTTGTTCTTGAGAATCCAGATACTATGGTTAGACCAGATGAGATGGATTTTGACAAGGCTTCCTTTTGGGAGGACGTTGATTGTA
GTAGGGATAGCTTTTGCGCGGGTGAAAGTCTCAAGGTTAGGATAAGATATGATATCATGAAGTCTCTAAGGAGAGGGATCAAAATAAACATTGATGGATCCATGGGTGGC
GTTTGGATCCCTTTGGAGTATGAGCGACTGCCGGAATTCTGTTCCTTTTGTGGAAAAATTGGGCACAACGCCAAAGATTGTGAATCCTTTTTTGCAAAGGATGACAATCT
GGTTCAGGTGGGGTATGGAACGTTGCTGCGATTCGATGGGACGGTGAAGAAGAACCCGAAGATGTTGAGGAAGGGTTTTGGGGATCTGATGCCGGAGGAGACGGATGGCT
ATGATAGGAGGGGCGGCGAGGCTGATCGCCGTGTGAACCACCAAATGGAAAGTCCAGATTTTGCCCAAAAGATCGGAAATAGAATCCCTGAAATCACGGAAGATTTGTTG
GAGGAAAACGAAATATCCAAAGAAAGCAAAGCTAGCATGGAAAAGGAAACCAACAATCCAAGGAAAATATTGTCCTGGAAAAGAAGAGCACATGGAAAACAAAATCTGGT
GGACGGTGAGGAGGGGCCGTCAGTTTTGGAAGGATCAAGGAAGAGGAAAGGTGTGGAGGAAGTCAATGAAACGGTGAAGAAAGCTAAGCAAGAGATTGAACAGAATGGAG
TTCGTGTGGGACTCTCAGAATTATCGGCGGAGGCTGTTACTAAATGTAATGTGACCGTTATGAATAAATTGAACGCCACTTTCAATTATTATGGGTGTTTTCCGGTGAGC
AGTTCGGGTGCTAAAGGAGGTATTTGCCTCTTTTGGAAGGAGAATGTGCAAGTTAAAATTCGGTCTTTTTCTTTCGCTCACATCGATGCAATGGTGTCGTGGGAGGGAGT
GACGTGGAGGTTTATAGGTGGTGCAGCCCGAGATGGAAATCTAATGGAAAATTTCAGAAATACGCTGGAGGATTACAATCTTACAGACCTTGGGTATTTTGGAAGCCCTT
ACACTTGGTTTAGGTTGTGGAGTTCGGGATGCCACATTTTTGAAAGGCTTGATAGGTTTGTTGGCAATGAAGCCTTCATCGATCTGTTTAAAGAAGCTACAGTCCAGCAC
TTGGACTGGTACTGTTCTGATCATAGGCCTATAATTATTGATACTGTTCGGGAGCCTCAGACGGTTTCTAAATTAAAGAGGTCATATGGATGCAAATTTAATGAAAACTG
GGCAGCTCACCCAGAATGCAAAGAAATTATCTCTGATAGTTGGAGTGCTAATATCCAGCAGAATGTTTCTCTTTCTAAGAGCTTGGCATTGTGTTCAAAGAGGCTGGGCA
GATGGGTAATCGACAATTTGAAGGAGAAGCTTAATAAACTTCTTGAGGAGGAAGAAACTTACTGGAGACAGAGGTCTCGGGAGTGTTGGCTAAGATGGGGTGATAGAAAT
ACAACTTGGTTTCATAAAAGAGCCTCAGTAAGGAAGAAAAGGAACGAGATTAGCGGGATAAATAACAGAGATGGTGTTTGGATGTCTAACGCCACTGAGATGGAGCAAAT
ATTCACAGATTACTTCCAGGAGATTTTCTCTTCGTCGAATCCTACTACGAAAAATTTGGACTGGATCCTTCAATGTGTTCCCCAAAAGGTCAGTCAAAGTATGAATTCTG
TTTTATTAGCTCCTTACAATAGATCGGAAATTGTTACAGCTTTAAAGCAAATGCAACCGTCCAAAGCCCCGGGGCCTGATGAATTCTATTTGAAAGTCTCGGATTTTATA
ACTTCTTCCAAACAGTGGGATATCCAGAAGTTGAGACAGTATGTTATTGAGGAAGATCTGGAGGCTATTGGACGAATACTCCTAAGCTGGTCTGAGGCTGAGGACGCTTC
GGTGTGCCACTATGATCGTAGAGGGGTGTATTCGGTGAAAAGTGGCTATAAACTGGGCATGAACTTGAAGGATCTTCCTTCCTCTTCGAAATCTACGAACCATGCTTTAT
TTCAATGTAAACGTGCAAGGGAGGTTTGGAATTTGGTCTTACCTCAGGTGAGATTCATCATAGGCCACAGTCCCTCTGTCCAGGACAAGTTCCTCACGCTTCAAGAAGCT
CTTTCTGCTAAAGATTTTGAGTTAGCCTGTGTAAGCTGCTGGGCCATATGGAATGACAGAAATGCGATTAGGGCACAAGCGCAAGTACCTGATGGCAATTGTAGAAGCGA
ATGGATTCGGGCGTATATGAAAGAATTTGAAGGTTGCTGGCAGGTGAAGCATGTGTCTGAAACAAATTTGCAGTTATCCAGTTTTCGAACTTTTGTCTCAGTTTGGGTTC
CTCCCCCCGCTGAATGGGTTAAGCTCAATGTTGATGCGGCGTGCAAATCTGAAATTTCTCGTACCGTAGCAGAACTTTCAGCCATTCGAGAAGGTCTTTATCTTGCTTCT
CAATTGAATTTCAGTTTGGTACAGATAGAATCGGATTGCTTACAAGCAATTGAGCTAATTTCTGGAAACAGTGAATGTTGGCTGGAAACAGGAACTTTGGTTGACGATAT
TCGTCATGTTGCTTCTAATTTCTCTCAAGTTTGCTTTCTGCATGTGAGGAGGGAAGGAAATCGGGTGGCTCATCAGCTAGCGACGAAGGCTTTTGTAGGTGAGGTTGCCG
GCGTTCGGTTTTCTAATTTTCCCATCTGGATTTTAAATGAATATGGAAGGGGCTCTTGTAATGTTCGGGCTTTAGCTGGTCCGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATACTTTCCTGGAACGTCCGTGGAATAAATGGTTATAAGAAGAGACTAAAAGTCAAGAAGACTATTATGAAATCCAATCCAGATGTGGTATTATTGCAGGAAAC
CAAACTTCAACAAATTGATCGGATAATCATCAAATCACTTTGGAGCTCCAAGGATGTAGGTTGGGCATGTTTGAATTCGAAAGGTAAATCAGGAGGTATTTTATCTTTAT
GGGATGAAAGCAGAATCGCAGTATCTGAAGTTCTAGAAGGCACTTGCTCAATTAGTATAGTAGTCTCTCTTTCCAATTTGAATAATGTGGTGATAACAAATGTTTATGGC
CCAACAGATTACAGAAATAGGAAGCGATTATGGTCAGAACTCAGGGATATTTCAGGCTTTAGTGAAAAATTCTGGTGCTTGGGTGGTGATTTTAACGTATCAAGATGGCC
TTCAGACAAATCTTCCGGGGGACGTATTACTAGGAGCATGAGAAAATTTAATTGTATCATTGGAGAGCTTGACCTTACGGAGGTTCCCTTATCTAATGGCAAATTCACAT
GGTCAAGAATGGGAAACGACTCTATACATTCACTTTTGGACAAATTTTTGCTATCCAAAGAATGGGATAACTTGTTTAACAACTCACGGATTGTGTCTGAGATCAACGTC
ATTGATGCCAAAGATGAGTTACTTGGACTCTCTCCAACAGAAGTTGACAAGAGGTGCTCATTGAAGCATGACTTGCTGAGCTTGTATTTGTCGGAAGAACGAGTGTTGCT
GCAAAAATGTAAACTCCATTGGCTTAAAGAAGGTGATGAAAACAGTAAGTTCTTTCATAGATACTTGGCTGCCCGTAAAAGAAAAGCACAGATTTGTGAAATTAAAGATG
AGCGGAATACCTCGCTGGTGAATAAAAGGGAAATTGAAACTGAAATCATGAGGTTTTTCGAGAATTTATACAATAGTGAGGGGATTCAACGATTTTCCTTGAAGGGTATT
AACTGGAGACCAATTCCTATCCAAAACAGTAGGTGGTTAGAGAGACCTTTCGAAGAAGCAGAGATTTTGAGTGCTGTAAAGATCCTTGGGAGGAATAAGGCTCCGTATCC
TTTTTTGGCAACATCGATGGTTAGCCTGCTAGGGTGCTATGGCGAGGAGGAGCTTTTGGAGGGATGGAAGAAATTGAGGTTAACAGCGGATGAGGAAGAGGGGGCAGTGG
ATGTTGACAGGGCGGCTATGGAGGAAACAGAGAAAGTTCTGAATGTGTGCCTAATGGGGAAACTACTGGCGGGGAGGTCGATCTCTTGTGAGATTATCAGAAATACAATG
AAGAAAGCGTGGAAAATTGAGAACGGTCTATCTGTGGAAAGCATGGGGAGGAATCTCTGCCTCTTTACCTTTCCAAGGGTAATTGATAGGAACCGTGTCTTCAAGACAGG
ACCTTGGAGTTTTGACAAGTATTTGCTTGTTCTTGAGAATCCAGATACTATGGTTAGACCAGATGAGATGGATTTTGACAAGGCTTCCTTTTGGGAGGACGTTGATTGTA
GTAGGGATAGCTTTTGCGCGGGTGAAAGTCTCAAGGTTAGGATAAGATATGATATCATGAAGTCTCTAAGGAGAGGGATCAAAATAAACATTGATGGATCCATGGGTGGC
GTTTGGATCCCTTTGGAGTATGAGCGACTGCCGGAATTCTGTTCCTTTTGTGGAAAAATTGGGCACAACGCCAAAGATTGTGAATCCTTTTTTGCAAAGGATGACAATCT
GGTTCAGGTGGGGTATGGAACGTTGCTGCGATTCGATGGGACGGTGAAGAAGAACCCGAAGATGTTGAGGAAGGGTTTTGGGGATCTGATGCCGGAGGAGACGGATGGCT
ATGATAGGAGGGGCGGCGAGGCTGATCGCCGTGTGAACCACCAAATGGAAAGTCCAGATTTTGCCCAAAAGATCGGAAATAGAATCCCTGAAATCACGGAAGATTTGTTG
GAGGAAAACGAAATATCCAAAGAAAGCAAAGCTAGCATGGAAAAGGAAACCAACAATCCAAGGAAAATATTGTCCTGGAAAAGAAGAGCACATGGAAAACAAAATCTGGT
GGACGGTGAGGAGGGGCCGTCAGTTTTGGAAGGATCAAGGAAGAGGAAAGGTGTGGAGGAAGTCAATGAAACGGTGAAGAAAGCTAAGCAAGAGATTGAACAGAATGGAG
TTCGTGTGGGACTCTCAGAATTATCGGCGGAGGCTGTTACTAAATGTAATGTGACCGTTATGAATAAATTGAACGCCACTTTCAATTATTATGGGTGTTTTCCGGTGAGC
AGTTCGGGTGCTAAAGGAGGTATTTGCCTCTTTTGGAAGGAGAATGTGCAAGTTAAAATTCGGTCTTTTTCTTTCGCTCACATCGATGCAATGGTGTCGTGGGAGGGAGT
GACGTGGAGGTTTATAGGTGGTGCAGCCCGAGATGGAAATCTAATGGAAAATTTCAGAAATACGCTGGAGGATTACAATCTTACAGACCTTGGGTATTTTGGAAGCCCTT
ACACTTGGTTTAGGTTGTGGAGTTCGGGATGCCACATTTTTGAAAGGCTTGATAGGTTTGTTGGCAATGAAGCCTTCATCGATCTGTTTAAAGAAGCTACAGTCCAGCAC
TTGGACTGGTACTGTTCTGATCATAGGCCTATAATTATTGATACTGTTCGGGAGCCTCAGACGGTTTCTAAATTAAAGAGGTCATATGGATGCAAATTTAATGAAAACTG
GGCAGCTCACCCAGAATGCAAAGAAATTATCTCTGATAGTTGGAGTGCTAATATCCAGCAGAATGTTTCTCTTTCTAAGAGCTTGGCATTGTGTTCAAAGAGGCTGGGCA
GATGGGTAATCGACAATTTGAAGGAGAAGCTTAATAAACTTCTTGAGGAGGAAGAAACTTACTGGAGACAGAGGTCTCGGGAGTGTTGGCTAAGATGGGGTGATAGAAAT
ACAACTTGGTTTCATAAAAGAGCCTCAGTAAGGAAGAAAAGGAACGAGATTAGCGGGATAAATAACAGAGATGGTGTTTGGATGTCTAACGCCACTGAGATGGAGCAAAT
ATTCACAGATTACTTCCAGGAGATTTTCTCTTCGTCGAATCCTACTACGAAAAATTTGGACTGGATCCTTCAATGTGTTCCCCAAAAGGTCAGTCAAAGTATGAATTCTG
TTTTATTAGCTCCTTACAATAGATCGGAAATTGTTACAGCTTTAAAGCAAATGCAACCGTCCAAAGCCCCGGGGCCTGATGAATTCTATTTGAAAGTCTCGGATTTTATA
ACTTCTTCCAAACAGTGGGATATCCAGAAGTTGAGACAGTATGTTATTGAGGAAGATCTGGAGGCTATTGGACGAATACTCCTAAGCTGGTCTGAGGCTGAGGACGCTTC
GGTGTGCCACTATGATCGTAGAGGGGTGTATTCGGTGAAAAGTGGCTATAAACTGGGCATGAACTTGAAGGATCTTCCTTCCTCTTCGAAATCTACGAACCATGCTTTAT
TTCAATGTAAACGTGCAAGGGAGGTTTGGAATTTGGTCTTACCTCAGGTGAGATTCATCATAGGCCACAGTCCCTCTGTCCAGGACAAGTTCCTCACGCTTCAAGAAGCT
CTTTCTGCTAAAGATTTTGAGTTAGCCTGTGTAAGCTGCTGGGCCATATGGAATGACAGAAATGCGATTAGGGCACAAGCGCAAGTACCTGATGGCAATTGTAGAAGCGA
ATGGATTCGGGCGTATATGAAAGAATTTGAAGGTTGCTGGCAGGTGAAGCATGTGTCTGAAACAAATTTGCAGTTATCCAGTTTTCGAACTTTTGTCTCAGTTTGGGTTC
CTCCCCCCGCTGAATGGGTTAAGCTCAATGTTGATGCGGCGTGCAAATCTGAAATTTCTCGTACCGTAGCAGAACTTTCAGCCATTCGAGAAGGTCTTTATCTTGCTTCT
CAATTGAATTTCAGTTTGGTACAGATAGAATCGGATTGCTTACAAGCAATTGAGCTAATTTCTGGAAACAGTGAATGTTGGCTGGAAACAGGAACTTTGGTTGACGATAT
TCGTCATGTTGCTTCTAATTTCTCTCAAGTTTGCTTTCTGCATGTGAGGAGGGAAGGAAATCGGGTGGCTCATCAGCTAGCGACGAAGGCTTTTGTAGGTGAGGTTGCCG
GCGTTCGGTTTTCTAATTTTCCCATCTGGATTTTAAATGAATATGGAAGGGGCTCTTGTAATGTTCGGGCTTTAGCTGGTCCGTTTTGA
Protein sequenceShow/hide protein sequence
MKILSWNVRGINGYKKRLKVKKTIMKSNPDVVLLQETKLQQIDRIIIKSLWSSKDVGWACLNSKGKSGGILSLWDESRIAVSEVLEGTCSISIVVSLSNLNNVVITNVYG
PTDYRNRKRLWSELRDISGFSEKFWCLGGDFNVSRWPSDKSSGGRITRSMRKFNCIIGELDLTEVPLSNGKFTWSRMGNDSIHSLLDKFLLSKEWDNLFNNSRIVSEINV
IDAKDELLGLSPTEVDKRCSLKHDLLSLYLSEERVLLQKCKLHWLKEGDENSKFFHRYLAARKRKAQICEIKDERNTSLVNKREIETEIMRFFENLYNSEGIQRFSLKGI
NWRPIPIQNSRWLERPFEEAEILSAVKILGRNKAPYPFLATSMVSLLGCYGEEELLEGWKKLRLTADEEEGAVDVDRAAMEETEKVLNVCLMGKLLAGRSISCEIIRNTM
KKAWKIENGLSVESMGRNLCLFTFPRVIDRNRVFKTGPWSFDKYLLVLENPDTMVRPDEMDFDKASFWEDVDCSRDSFCAGESLKVRIRYDIMKSLRRGIKINIDGSMGG
VWIPLEYERLPEFCSFCGKIGHNAKDCESFFAKDDNLVQVGYGTLLRFDGTVKKNPKMLRKGFGDLMPEETDGYDRRGGEADRRVNHQMESPDFAQKIGNRIPEITEDLL
EENEISKESKASMEKETNNPRKILSWKRRAHGKQNLVDGEEGPSVLEGSRKRKGVEEVNETVKKAKQEIEQNGVRVGLSELSAEAVTKCNVTVMNKLNATFNYYGCFPVS
SSGAKGGICLFWKENVQVKIRSFSFAHIDAMVSWEGVTWRFIGGAARDGNLMENFRNTLEDYNLTDLGYFGSPYTWFRLWSSGCHIFERLDRFVGNEAFIDLFKEATVQH
LDWYCSDHRPIIIDTVREPQTVSKLKRSYGCKFNENWAAHPECKEIISDSWSANIQQNVSLSKSLALCSKRLGRWVIDNLKEKLNKLLEEEETYWRQRSRECWLRWGDRN
TTWFHKRASVRKKRNEISGINNRDGVWMSNATEMEQIFTDYFQEIFSSSNPTTKNLDWILQCVPQKVSQSMNSVLLAPYNRSEIVTALKQMQPSKAPGPDEFYLKVSDFI
TSSKQWDIQKLRQYVIEEDLEAIGRILLSWSEAEDASVCHYDRRGVYSVKSGYKLGMNLKDLPSSSKSTNHALFQCKRAREVWNLVLPQVRFIIGHSPSVQDKFLTLQEA
LSAKDFELACVSCWAIWNDRNAIRAQAQVPDGNCRSEWIRAYMKEFEGCWQVKHVSETNLQLSSFRTFVSVWVPPPAEWVKLNVDAACKSEISRTVAELSAIREGLYLAS
QLNFSLVQIESDCLQAIELISGNSECWLETGTLVDDIRHVASNFSQVCFLHVRREGNRVAHQLATKAFVGEVAGVRFSNFPIWILNEYGRGSCNVRALAGPF