CuGenDBv2

Lag0016652 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0016652
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr12:39930266..39933042
RNA-Seq Expression: Lag0016652
Synteny: Lag0016652
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (hit | e value | % identity, followed by the alignment)
KAF4383622.1 hypothetical protein F8388_014122 [Cannabis sativa] | e value: 2.9e-42 | identity: 25.15%
Query:  IKELSKMKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKK-SGKNIYVCKFKSKKAKKRVIGGGPWIYDKA
        +  L+++ + ++  G ++ +    +E   K ++     K++ P+    +  +R +  +W +  + Q+++ S KNI+   F  ++ ++RV GGGPW  DK 
Subjt:  IKELSKMKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKK-SGKNIYVCKFKSKKAKKRVIGGGPWIYDKA

Query:  VILFDELQGNKGIKKYGMALA------NSIGTFVKMEEEAEEGKVWGE-------------TLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFP
        +I F +  G   I K            N++      E  A E   WGE             T++V+VRM + +PL+RG  + +    +EV +    +  P
Subjt:  VILFDELQGNKGIKKYGMALA------NSIGTFVKMEEEAEEGKVWGE-------------TLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFP

Query:  NFCYQCGRLGHVMNDCEEED------------------------------------------------------------------------------TE
        +FCY CG +GH   DC  +D                                                                                
Subjt:  NFCYQCGRLGHVMNDCEEED------------------------------------------------------------------------------TE

Query:  EDDEEG----------------------RKEGTKSDLR-----WGNEERKVDQWAI---KEDQS--QPSTKREQVSP--------------NKEEKN---
        E DE+G                      R  G   D R      GN ERK     +    ED +      K ++V P              N EE N   
Subjt:  EDDEEG----------------------RKEGTKSDLR-----WGNEERKVDQWAI---KEDQS--QPSTKREQVSP--------------NKEEKN---

Query:  --TSRKKENSAENGM--------------EKHNAKP--NAMSSKKWKR----LARCDPMQLDEHEHPVSQK-------GKKHGRGEHTEEEATKKPKVNY
            R KE +   G+               +H  +P   A  SK WKR      R    +      P+S K          +     +  ++ +K  V Y
Subjt:  --TSRKKENSAENGM--------------EKHNAKP--NAMSSKKWKR----LARCDPMQLDEHEHPVSQK-------GKKHGRGEHTEEEATKKPKVNY

Query:  SEELGG-----------ISAEAAEQPY--------KTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEM
        + E  G           +S ++    +            WRFTGFYGNP+   R  SW LL RL D    PW+ GGDFNEI S++EKKGG+ R+   M  
Subjt:  SEELGG-----------ISAEAAEQPY--------KTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEM

Query:  FSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAR
        F +  +RC L D GF G  FTW   ++ G+ ++ERLDR+  N     L P +KV + +F  SDHRPI+A+ E  +   R  +K +  RFE  W K  E +
Subjt:  FSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAR

Query:  EVIVKSWRV--GSVGGPRRLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVG-EDILKAESELEDLLEEEEEYWQNGSWEEDATGI
        E+I +SW      +     L      C  +L  WNK +  GS+ + + E ++++  L  V++  V  E++ + E +L DLL  EE YW+  S   +    
Subjt:  EVIVKSWRV--GSVGGPRRLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVG-EDILKAESELEDLLEEEEEYWQNGSWEEDATGI

Query:  GIIATEYFKSLFKSSRPEDRAIEEITD
        G   T+YF +     + ++  +E +T+
Subjt:  GIIATEYFKSLFKSSRPEDRAIEEITD

XP_010686122.1 PREDICTED: uncharacterized protein LOC104900404 [Beta vulgaris subsp. vulgaris] | e value: 9.7e-46 | identity: 25.25%
Query:  EKAIKELSKMKLTEEEKGGLVEVEDNDIEVTDK--DIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWI
        ++    +S +++TEEE      V  +D E T+K  D+E T   K+L  ++   +  +R + +IW ++     +     ++V +F  ++ K++V+ G PW 
Subjt:  EKAIKELSKMKLTEEEKGGLVEVEDNDIEVTDK--DIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWI

Query:  YDKAVILFDELQGNK------------GIKKYGMALANSIGTFVK--------MEEEAEEGKVWGETLRVKVRMEVSKPLRR--GTNLKVGSMADEVWIP
        +D+ +++  E++ +              ++ Y + +     ++V+        + E   +G  W  + RV++ +++ KPLRR    +LK GS    V + 
Subjt:  YDKAVILFDELQGNK------------GIKKYGMALANSIGTFVK--------MEEEAEEGKVWGETLRVKVRMEVSKPLRR--GTNLKVGSMADEVWIP

Query:  ITIEKFPNFCYQCGRLGHVMNDCEEEDTEEDDEEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPSTKREQVSPNKEEKNTSRKKENSAENGMEKHNAKP
        +  E+ P FCY CG +GH+  DC   + EED  EG+        +WG+       W              + SP K                        
Subjt:  ITIEKFPNFCYQCGRLGHVMNDCEEEDTEEDDEEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPSTKREQVSPNKEEKNTSRKKENSAENGMEKHNAKP

Query:  NAMSSKKWKRLARCDPMQLDEHEHPVSQKGKKHGRGEHTEEEATKKPKVNYSE-ELGGISAEAAEQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDN
           SSK+ + + R D              GKK         EA     V++S+  + G      E+      WRF G YG PE   +H +W L+  L   
Subjt:  NAMSSKKWKRLARCDPMQLDEHEHPVSQKGKKHGRGEHTEEEATKKPKVNYSE-ELGGISAEAAEQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDN

Query:  DSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPI
           P ++GGDFNEI S DEK+GGA R +R M  F +V + C L D    G  +TW RG    ++I+ERLDRF ++     L P   V HL  Y SDH  I
Subjt:  DSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPI

Query:  IASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPRR--------------------LSTKIHRCLQKLHKWNKDRLRGSIQQAIIE
        +    +   P   Q   +  +FE  W   +     + ++W  GSVG P +                    L+ KI R  ++LH   K+ +  +  +   E
Subjt:  IASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPRR--------------------LSTKIHRCLQKLHKWNKDRLRGSIQQAIIE

Query:  KEQEIQRLN-----------EVASDQVGE----------DILKAESELEDLLEEEEEYWQNGSWEEDATGIGIIATEYFKSLFKSSRPEDRAIEEITDGI
         E+E+  LN            VA  + G+             K  + ++ L +E      +G W E+   +  +  +YF+ +F SS P   A++E+   +
Subjt:  KEQEIQRLN-----------EVASDQVGE----------DILKAESELEDLLEEEEEYWQNGSWEEDATGIGIIATEYFKSLFKSSRPEDRAIEEITDGI

Query:  QPRIPQAFN
        +  +   FN
Subjt:  QPRIPQAFN

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia] | e value: 8.3e-45 | identity: 41.02%
Query:  SWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDR
        +WRFTG YG+   D R  +W L+ RL      PW++GGDFNEI    EK  G PR +  M+ F D  + C L+D GF GD FTW  G K    I ERLDR
Subjt:  SWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDR

Query:  FCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQL-LRFERSWTKYKEAREVIVKSWRVGSVGGPRRLSTKIHRCLQKLHKWNKDRL
        F +N  +  +   L++ HL F  SDHRPI+A W         +RKG+   RFE  W  ++E +E++ + W V           KI+ CL++L KWN  RL
Subjt:  FCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQL-LRFERSWTKYKEAREVIVKSWRVGSVGGPRRLSTKIHRCLQKLHKWNKDRL

Query:  RGSIQQAIIEKEQEIQRLNEVASDQVGEDILKAESELEDLLEEEEEYWQNGSWEED
         GS++ AI+ KE EIQR+ +  +    +++ +A+ +LE LLEEEE YW+     +D
Subjt:  RGSIQQAIIEKEQEIQRLNEVASDQVGEDILKAESELEDLLEEEEEYWQNGSWEED

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata] | e value: 1.3e-42 | identity: 33.14%
Query:  SWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDR
        +WR TGFYG PET +RH  W++L  LS      W   GDFNE+  + +K GG PR+  QM+ F D  + C  +D GFSG +FTW  G+++G +I ERLDR
Subjt:  SWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDR

Query:  FCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPRR------LSTKIHRCLQKLHKW
           N+      P  +V HLN Y SDHRP++ S +      R +RK    RFE  W      +  + ++W      GPRR       +TKI  C ++L +W
Subjt:  FCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPRR------LSTKIHRCLQKLHKW

Query:  NKDRLRGSIQQAIIEKEQ----EIQRLNEVASDQVGEDILKAESELEDLLEEEEEYW-------------------------------------QNGSWE
        +K+      +Q  + KE+    EI+ +     DQ   D LKA  EL  LLE+EE+ W                                     +NG W+
Subjt:  NKDRLRGSIQQAIIEKEQ----EIQRLNEVASDQVGEDILKAESELEDLLEEEEEYW-------------------------------------QNGSWE

Query:  EDATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFN
         +      + T++++ LFKSS P++  I+ + DG+Q  +  + N
Subjt:  EDATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFN

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis] | e value: 5.0e-42 | identity: 25.63%
Query:  MKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDEL
        +KLTEEE+  +V  E+  +    K  E     KI   ++   E F+  + KIW  EG +  K    N+Y+ +F+    K++V+ G PW +D+ ++   E 
Subjt:  MKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDEL

Query:  QGNKGI-----------------------KKYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFC
        +G   +                       K+ G+ + + IG  +++E    EG  WG  LR+K  + V+K L RG  LK GS   + W+    E+ P FC
Subjt:  QGNKGI-----------------------KKYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFC

Query:  YQCGRLGHVMNDCEEE--DTEEDDEEGR--------------------KEGTKSDLRWGNEERKVDQWA-------------------------------
        ++CGR  H    C+E   D    D+ G+                    KE   S   W   + + D ++                               
Subjt:  YQCGRLGHVMNDCEEE--DTEEDDEEGR--------------------KEGTKSDLRWGNEERKVDQWA-------------------------------

Query:  ----IKEDQ--------------------SQPSTKREQVSPNKEEKN-------------------------TSRKKENS----AENGMEK-HNAKPNAM
             KEDQ                     +P++ +E++S +    +                         TS +KEN      E   E   N+  + +
Subjt:  ----IKEDQ--------------------SQPSTKREQVSPNKEEKN-------------------------TSRKKENS----AENGMEK-HNAKPNAM

Query:  SSKKWKRLARCDPMQLDEHEHPVSQ------------------------KGKKHGRGEHTEE----EATKKPKVNYSEE--------------LG-----
          K WKR AR  P  L +  + + Q                        K K     E  +E       K+P + +  E              LG     
Subjt:  SSKKWKRLARCDPMQLDEHEHPVSQ------------------------KGKKHGRGEHTEE----EATKKPKVNYSEE--------------LG-----

Query:  -----GISAEAA-----------------------EQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKR
             G S E A                         P     W+ TGFYG+P + KR  SW LL  L    + PWL  GDFNEIT   EK G A R  R
Subjt:  -----GISAEAA-----------------------EQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKR

Query:  QMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKY
        QM  F +  + CKL D GF GDKFTW   ++     KERLDR C N+    L  N  V+HL+   SDH+ ++    + A    + +KG++ RFE +WTK 
Subjt:  QMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKY

Query:  KEAREVIVKSWRVGSVGGP---RRLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVGEDILKAESELEDLLEEE
         E  E+I K WR+ S  GP    R    +++C  KL  W++++ R   ++A+  K + ++ L E    ++ E+I K    +  +++ E
Subjt:  KEAREVIVKSWRVGSVGGP---RRLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVGEDILKAESELEDLLEEE

TrEMBL top hits (hit | e value | % identity, followed by the alignment)
A0A2N9FFZ2 Reverse transcriptase domain-containing protein | e value: 3.3e-47 | identity: 26.16%
Query:  EKGGLVEVEDNDIEV--TDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDELQGNK
        EK  L E E   +++  T    ++  A K L  +++ VE   R    +W  +    ++   +N  V  F  +  ++RV+ G PW YDK +++   ++ ++
Subjt:  EKGGLVEVEDNDIEV--TDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDELQGNK

Query:  GIK-----------------------KYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFCYQCG
         I+                       +    L +S+G   ++    E     G+ +R++V ++++KPL RG   K+     E WI    E+ PNFCY CG
Subjt:  GIK-----------------------KYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFCYQCG

Query:  RLGHVMNDC------EEEDTEEDDEEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPS-TKREQVSP---------NKEEKNTSRKKENSAENGMEKHNA
         + H   DC      +E    ED + G      ++  W   E K+D         QP+      VSP              NT      S +  +  H  
Subjt:  RLGHVMNDC------EEEDTEEDDEEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPS-TKREQVSP---------NKEEKNTSRKKENSAENGMEKHNA

Query:  KPNAMSSKKWKR------LARCDPMQLDEHEHPVSQKGKKHGRGEHTEE------EATKKPKVNYSEELGGI------------------SAEAAEQPYK
         P  +++           +A   P   D   H   ++  +  + E   E      +   K  VN   + GG+                    +A     +
Subjt:  KPNAMSSKKWKR------LARCDPMQLDEHEHPVSQKGKKHGRGEHTEE------EATKKPKVNYSEELGGI------------------SAEAAEQPYK

Query:  TRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERL
          +WRFTGFYG PET KR  SW LL RL+     PW   GDFNE+  ++EK+G   R++ QM++F DV + C  +D GF+G KFTW    + G    ERL
Subjt:  TRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERL

Query:  DRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPR-RLSTKIHRCLQKLHKWNKD
        DR        +  P+ +V+HL    SDH+PI  S E A  P     K +  RFE  WT  +    VI  SW+    G P   +  KIH C + L  W++ 
Subjt:  DRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPR-RLSTKIHRCLQKLHKWNKD

Query:  RLRGSIQQAIIEKEQEIQRLNEVASD--QVGED---ILKAESELEDLLEEEEEYW-------------------------------------QNGSWEED
           G+I   I    +E++RL ++A +    G D   + + + EL  LL +EE  W                                     Q+G W   
Subjt:  RLRGSIQQAIIEKEQEIQRLNEVASD--QVGED---ILKAESELEDLLEEEEEYW-------------------------------------QNGSWEED

Query:  ATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQ
           +  +  EY+KSLF+++ P+   +E++ + IQ
Subjt:  ATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQ

A0A2N9FI47 CCHC-type domain-containing protein | e value: 1.5e-47 | identity: 25.64%
Query:  KMKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDE
        +MKL + EK   + +  + +  + ++ + +   K+   +    E F+  +  IW V G + + +   N+++  F ++ A  R+    PW +DK +IL   
Subjt:  KMKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDE

Query:  LQG-------------------NKGIK----KYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNF
        L+G                   N  IK    + G  +   +G  + ++   E G  WG  LR++V +E++KPL RG  ++V      VW+    E  P F
Subjt:  LQG-------------------NKGIK----KYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNF

Query:  CYQCGRLGHVMNDC-------------------------------EEEDTEEDDEEGRKEGTKSDLRWGNE-----------ERKVDQWAIKEDQSQPST
        CY+CG LGH  +DC                                   +E  D  G +  + S    G++           E +++   + ++Q+  S 
Subjt:  CYQCGRLGHVMNDC-------------------------------EEEDTEEDDEEGRKEGTKSDLRWGNE-----------ERKVDQWAIKEDQSQPST

Query:  KREQVSPNKEEKNTSRKKENSAENGMEKHNAKPNAMSSKKWKRLARCDPMQLDEHEHPVSQKGKKHGRG------EHTEEEATKKPKVNYS---------
             +P+  E   + K+     + M+   A PN M +        C  +   E    +    KK           H E    +  +V            
Subjt:  KREQVSPNKEEKNTSRKKENSAENGMEKHNAKPNAMSSKKWKRLARCDPMQLDEHEHPVSQKGKKHGRG------EHTEEEATKKPKVNYS---------

Query:  EELGG-------ISAEAAEQPYK------------TRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMF
           GG        S     Q Y                WR TGFYG+PE   R  SW+LL +L D  S PWL+ GDFNE+ S++E+ G   R+  QM  F
Subjt:  EELGG-------ISAEAAEQPYK------------TRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMF

Query:  SDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEARE
         D    C L D G+ G  F+W   ++ G+ ++ +LDR   N+    L P+ +V+H+ F  SDH  ++           I RK +L RF+ SW +     E
Subjt:  SDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEARE

Query:  VIVKSWRVGSVGGPR-RLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVGE----DILKAESELEDLLEEEEEYWQ----NGSWEE
         I  +W     G P  R++ KI  C  +L +WNK ++R  I   +IE E+   RL ++ S  + E    ++     E+  L+E+EE +W+    +  W+ 
Subjt:  VIVKSWRVGSVGGPR-RLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVGE----DILKAESELEDLLEEEEEYWQ----NGSWEE

Query:  DATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFNRA
        +A  I  +A  YF  LF SS P    I E+ D +   +  A N A
Subjt:  DATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFNRA

A0A2N9GJ35 Uncharacterized protein | e value: 9.5e-47 | identity: 26.13%
Query:  KMKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDE
        +MKL+E+E    + +  + I  + K+ +++   K+L  K    E F+  I  +W   G V ++    N+++  F  +   +R+    PW +DK +I    
Subjt:  KMKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDE

Query:  LQGN-----------------------KGIKKYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADE------VWIPITI
         +G+                         I++ G  +   IG  +++ +  E G  WGE LR++V +++++PL RG  L+    +DE       W+    
Subjt:  LQGN-----------------------KGIKKYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADE------VWIPITI

Query:  EKFPNFCYQCGRLGHVMNDCEEEDTEEDDEEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPSTKREQV-SPNKE-EKNTSRKKENSAENG---------
        E  P FCY+CGRLGH  ++C           GR     S  +WG   R     A+    +QP   RE V  P++E E N    +E + EN          
Subjt:  EKFPNFCYQCGRLGHVMNDCEEEDTEEDDEEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPSTKREQV-SPNKE-EKNTSRKKENSAENG---------

Query:  ------------------MEKHNAKPNAMSSKKWKRLA------RC----DPMQLDEHEHPVSQKG-------------------------------KKH
                          +E H  +P   S K    +        C    +P  ++E  + V ++G                               ++H
Subjt:  ------------------MEKHNAKPNAMSSKKWKRLA------RC----DPMQLDEHEHPVSQKG-------------------------------KKH

Query:  GRGEHTEEEATKKPKVN---YSEELGGISAEAAEQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQM
        G+G            +N   YSE    I  E  +       WR TGFYG PE   RH SWSLL  L      PW+I GDFNEIT ++EK G   RN  QM
Subjt:  GRGEHTEEEATKKPKVN---YSEELGGISAEAAEQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQM

Query:  EMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIA-SWEEAATPFRIQRKGQLLRFERSWTKYK
          F +    C L D GF+G +FTW   ++ G  ++ RLDR   +     L P+  +NHL    SDH  ++  S  +       QRK ++ RFE+SW K  
Subjt:  EMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIA-SWEEAATPFRIQRKGQLLRFERSWTKYK

Query:  EAREVIVKSWRVGSVG-GPRRLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLN-EVASDQVGEDILKAESELEDLLEEEEEYW----------
           EVI  +W V  +G    +++ KI +C  KL +W++  +R +  + I  K +++Q L  +   D     I   + +L  L E+ E  W          
Subjt:  EAREVIVKSWRVGSVG-GPRRLSTKIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLN-EVASDQVGEDILKAESELEDLLEEEEEYW----------

Query:  ---------------------------QNGSWEEDATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFN
                                   Q  +W  +   +  IA +YF SLF SS P  RAI+E+   ++  +    N
Subjt:  ---------------------------QNGSWEEDATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFN

A0A2N9GU75 RNase H domain-containing protein | e value: 6.2e-46 | identity: 28.09%
Query:  KLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDELQ
        K +  +K GL    + D+  T +  EN  A K L  + + ++   R    +W       ++  G N     F+     +RV+   PW YDK +++F ++Q
Subjt:  KLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDELQ

Query:  GNKGIKKYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRG--TNLKVGSMADEVWIPITIEKFPNFCYQCGRLGHVMNDCEEEDTEEDD
        G +   +   A+  SIG   KM    +E +     + V+VR+EV+ PL RG   NL+ G  +   WI    E+ PNFCY CG L H   DC+        
Subjt:  GNKGIKKYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRG--TNLKVGSMADEVWIPITIEKFPNFCYQCGRLGHVMNDCEEEDTEEDD

Query:  EEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPSTKREQVSPNKEEKNTSRKKENSAENGMEKHNAKPNAMSSKKWKRLARCDPMQLDEHEHPVSQKGKK
              G +       EE +   W ++    +P        P K        + N  +    KH   PN  ++     L  CD           S+KGK 
Subjt:  EEGRKEGTKSDLRWGNEERKVDQWAIKEDQSQPSTKREQVSPNKEEKNTSRKKENSAENGMEKHNAKPNAMSSKKWKRLARCDPMQLDEHEHPVSQKGKK

Query:  HGRGEHTEEEATKKPKVNYSEELGGISAEAAEQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEM
            E+T  +     ++   E+  G   +A  Q    + WR T FYG PET  R HSW+LL  L    S PW   GDFNEI    EK G   +++RQM+ 
Subjt:  HGRGEHTEEEATKKPKVNYSEELGGISAEAAEQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEM

Query:  FSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAR
        F  V + C  +D GF G  FTW   ++  +    RLDRF   +   +   +  V+HL    SDH+PI  S +    P   + + +L RFE  W       
Subjt:  FSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAR

Query:  EVIVKSWRVGSVGGPRRLST-KIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVGEDILKAESELEDLLEEEEEYWQNGSWEE-------
          I K+W   + G P   +T KIH C   L  W++ +  G I + + EK ++++R    ++   G D    ES L++     + +    S  +       
Subjt:  EVIVKSWRVGSVGGPRRLST-KIHRCLQKLHKWNKDRLRGSIQQAIIEKEQEIQRLNEVASDQVGEDILKAESELEDLLEEEEEYWQNGSWEE-------

Query:  ----------DATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFNRA
                  D   IG   TEY+++LF ++  ED  +E + D IQ  + Q  N++
Subjt:  ----------DATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFNRA

A0A2N9IXK4 RNase H domain-containing protein | e value: 3.3e-47 | identity: 27.09%
Query:  ENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDELQ------------GNKGIKKYGM---
        ++  A + L  + + ++   R    +W  +   Q++  G NI + +F      +RVI  GPW YDK++ILF   +             +  ++ +G+   
Subjt:  ENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFDELQ------------GNKGIKKYGM---

Query:  ----ALANSI-GTFVKMEEEAEEGKV--WGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFCYQCGRLGHVMNDCEEEDTEEDDEEGRK
            A A+ I GT  K+ +E++E +   WGE +RV+V ++V KPL RG  + +G    E+ +    EK PNFCY CG + H   DC       D  E  K
Subjt:  ----ALANSI-GTFVKMEEEAEEGKV--WGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFCYQCGRLGHVMNDCEEEDTEEDDEEGRK

Query:  EGTKSDLRW--GNEERK----VDQWAIKEDQSQPSTKREQVSPNKEEKNTSRKKENSAENGMEKHNAKPNAM----------------------------
        +   + LR   G   R+    V     +  ++  S+     +   +   +  +KE       E H+  PN+                             
Subjt:  EGTKSDLRW--GNEERK----VDQWAIKEDQSQPSTKREQVSPNKEEKNTSRKKENSAENGMEKHNAKPNAM----------------------------

Query:  --------------SSKKWKRLARCDPMQL---------DEHEHPVSQKGKKHGRGEHTEEEATKKPKVNYSEELGGISAEAAEQPYKTRSWRFTGFYGN
                       +++W   +   P +          D  +  V ++ +  G     ++EA    K      +  I  E      +  SWRFTGFYG 
Subjt:  --------------SSKKWKRLARCDPMQL---------DEHEHPVSQKGKKHGRGEHTEEEATKKPKVNYSEELGGISAEAAEQPYKTRSWRFTGFYGN

Query:  PETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAML
        PET +RH SWSLL  L    S PW   GDFNE+ S +EK+GG  R+ RQM+ F D  + C   D GF+G  FTW   +     + ERLDR         L
Subjt:  PETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKGSQIKERLDRFCLNHPMAML

Query:  APNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPR-RLSTKIHRCLQKLHKWNKDRLRGSIQQAIIE
         P  +V HL+   SDH PI  S + + +P    R  ++ RFE  W  +   +E I  +W+    G    ++  K+  C   L +W++D    S      E
Subjt:  APNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPR-RLSTKIHRCLQKLHKWNKDRLRGSIQQAIIE

Query:  KEQEIQRLNEVASDQV-GEDILKAES---ELEDLLEEEEEYWQNGS
         +++ Q L E  S+ + G+   KA +   E+  LL  EE  W+  S
Subjt:  KEQEIQRLNEVASDQV-GEDILKAES---ELEDLLEEEEEYWQNGS

SwissProt top hits (hit | e value | % identity, followed by the alignment)
No hits found

Arabidopsis top hits (hit | e value | % identity, followed by the alignment)
AT5G36228.1 nucleic acid binding; zinc ion binding | e value: 2.0e-04 | identity: 23.83%
Query:  KILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVI---------------LFDELQGNKGI------KKYGM
        +IL P++  VE     +P  WG+  +V  +      +  +F+S+      +   PW++++  I                 D     +GI      ++   
Subjt:  KILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVI---------------LFDELQGNKGI------KKYGM

Query:  ALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFCYQCGRLGHVMNDC------EEEDTEED
         +A+++G  V M+   EE       +RVKVRM+ ++PLR    ++  S  +   I    EK    C  C R+ H ++ C      EE D E D
Subjt:  ALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFCYQCGRLGHVMNDC------EEEDTEED


Sequences
CDS sequence
ATGGAAGAGGAGAAAGCTATCAAAGAACTAAGTAAGATGAAACTGACCGAGGAAGAGAAAGGAGGGCTGGTGGAGGTGGAAGATAACGACATTGAAGTTACGGACAAGGA
TATTGAAAACACAGCAGCATGCAAAATCCTAGCACCAAAGAGTATTATAGTCGAACACTTCCAACGCCATATACCAAAAATATGGGGTGTAGAAGGGAAAGTTCAAATGA
AGAAGTCAGGAAAGAATATATACGTATGCAAATTTAAAAGTAAAAAGGCGAAAAAAAGAGTAATAGGAGGGGGACCATGGATATATGATAAAGCAGTCATCCTCTTCGAT
GAGCTGCAAGGGAACAAAGGAATCAAGAAATACGGCATGGCCCTCGCAAACTCGATAGGAACTTTTGTGAAAATGGAGGAGGAAGCAGAGGAAGGTAAAGTGTGGGGTGA
AACCTTAAGAGTGAAAGTGCGAATGGAGGTCAGTAAACCGCTAAGGCGAGGCACAAATCTAAAGGTTGGTTCTATGGCAGACGAAGTATGGATCCCAATAACCATAGAAA
AGTTTCCGAATTTCTGCTACCAGTGTGGGCGTTTGGGCCACGTGATGAACGACTGTGAGGAGGAGGATACTGAAGAAGACGACGAGGAGGGAAGAAAAGAGGGTACAAAG
TCTGACTTGCGATGGGGAAATGAAGAAAGGAAAGTTGATCAGTGGGCCATAAAAGAAGACCAAAGTCAGCCCAGTACCAAAAGGGAGCAGGTTAGCCCAAATAAAGAGGA
AAAAAATACTAGTCGAAAAAAAGAGAATAGTGCTGAAAATGGTATGGAAAAACATAATGCAAAACCCAACGCCATGAGTTCAAAAAAATGGAAGAGGTTGGCTCGGTGTG
ATCCCATGCAACTGGATGAACATGAGCACCCTGTCAGCCAAAAAGGAAAGAAGCATGGGAGAGGAGAACATACTGAAGAGGAAGCAACAAAAAAACCGAAAGTAAATTAC
TCTGAGGAACTCGGAGGGATATCGGCGGAGGCTGCGGAGCAGCCCTACAAGACTAGATCGTGGAGATTTACTGGATTCTACGGTAACCCGGAGACGGACAAGAGACATCA
TTCATGGAGTTTGCTGGAAAGGCTGAGTGATAATGACTCACACCCTTGGCTCATCGGAGGAGACTTCAATGAGATTACCTCCATGGATGAGAAGAAAGGGGGTGCCCCTA
GAAATAAACGACAGATGGAGATGTTTTCAGACGTAGGAAACAGATGCAAGCTCATGGATGCGGGCTTTTCGGGAGACAAGTTTACCTGGCGTAGGGGCAAGAAAAAAGGA
AGCCAGATAAAAGAAAGATTGGATCGGTTCTGTTTAAACCACCCTATGGCAATGTTAGCTCCAAATCTCAAAGTTAATCACCTTAACTTCTATGGTTCGGATCACAGACC
TATCATTGCCAGTTGGGAAGAGGCCGCCACTCCTTTCAGAATTCAGAGGAAGGGTCAGCTACTTAGGTTCGAGAGGAGTTGGACAAAATATAAAGAAGCTCGTGAAGTAA
TTGTTAAGAGCTGGAGGGTCGGCAGTGTGGGAGGCCCACGTAGGTTGAGCACCAAGATCCACAGGTGCCTTCAGAAGCTTCATAAATGGAACAAAGATAGGTTAAGGGGC
AGTATCCAACAAGCGATCATTGAGAAAGAACAAGAAATTCAAAGGCTAAATGAAGTAGCAAGTGACCAGGTAGGGGAAGACATCCTAAAAGCCGAATCTGAATTGGAAGA
TTTGTTGGAAGAAGAGGAAGAATATTGGCAAAACGGGTCATGGGAGGAAGATGCTACCGGCATTGGGATAATAGCAACAGAGTACTTCAAGTCCTTGTTCAAATCTTCAA
GACCAGAGGACAGAGCTATTGAAGAAATAACAGATGGTATACAACCAAGAATTCCCCAAGCTTTCAACAGAGCTAGACCAACCATATTCTAG
mRNA sequence
ATGGAAGAGGAGAAAGCTATCAAAGAACTAAGTAAGATGAAACTGACCGAGGAAGAGAAAGGAGGGCTGGTGGAGGTGGAAGATAACGACATTGAAGTTACGGACAAGGA
TATTGAAAACACAGCAGCATGCAAAATCCTAGCACCAAAGAGTATTATAGTCGAACACTTCCAACGCCATATACCAAAAATATGGGGTGTAGAAGGGAAAGTTCAAATGA
AGAAGTCAGGAAAGAATATATACGTATGCAAATTTAAAAGTAAAAAGGCGAAAAAAAGAGTAATAGGAGGGGGACCATGGATATATGATAAAGCAGTCATCCTCTTCGAT
GAGCTGCAAGGGAACAAAGGAATCAAGAAATACGGCATGGCCCTCGCAAACTCGATAGGAACTTTTGTGAAAATGGAGGAGGAAGCAGAGGAAGGTAAAGTGTGGGGTGA
AACCTTAAGAGTGAAAGTGCGAATGGAGGTCAGTAAACCGCTAAGGCGAGGCACAAATCTAAAGGTTGGTTCTATGGCAGACGAAGTATGGATCCCAATAACCATAGAAA
AGTTTCCGAATTTCTGCTACCAGTGTGGGCGTTTGGGCCACGTGATGAACGACTGTGAGGAGGAGGATACTGAAGAAGACGACGAGGAGGGAAGAAAAGAGGGTACAAAG
TCTGACTTGCGATGGGGAAATGAAGAAAGGAAAGTTGATCAGTGGGCCATAAAAGAAGACCAAAGTCAGCCCAGTACCAAAAGGGAGCAGGTTAGCCCAAATAAAGAGGA
AAAAAATACTAGTCGAAAAAAAGAGAATAGTGCTGAAAATGGTATGGAAAAACATAATGCAAAACCCAACGCCATGAGTTCAAAAAAATGGAAGAGGTTGGCTCGGTGTG
ATCCCATGCAACTGGATGAACATGAGCACCCTGTCAGCCAAAAAGGAAAGAAGCATGGGAGAGGAGAACATACTGAAGAGGAAGCAACAAAAAAACCGAAAGTAAATTAC
TCTGAGGAACTCGGAGGGATATCGGCGGAGGCTGCGGAGCAGCCCTACAAGACTAGATCGTGGAGATTTACTGGATTCTACGGTAACCCGGAGACGGACAAGAGACATCA
TTCATGGAGTTTGCTGGAAAGGCTGAGTGATAATGACTCACACCCTTGGCTCATCGGAGGAGACTTCAATGAGATTACCTCCATGGATGAGAAGAAAGGGGGTGCCCCTA
GAAATAAACGACAGATGGAGATGTTTTCAGACGTAGGAAACAGATGCAAGCTCATGGATGCGGGCTTTTCGGGAGACAAGTTTACCTGGCGTAGGGGCAAGAAAAAAGGA
AGCCAGATAAAAGAAAGATTGGATCGGTTCTGTTTAAACCACCCTATGGCAATGTTAGCTCCAAATCTCAAAGTTAATCACCTTAACTTCTATGGTTCGGATCACAGACC
TATCATTGCCAGTTGGGAAGAGGCCGCCACTCCTTTCAGAATTCAGAGGAAGGGTCAGCTACTTAGGTTCGAGAGGAGTTGGACAAAATATAAAGAAGCTCGTGAAGTAA
TTGTTAAGAGCTGGAGGGTCGGCAGTGTGGGAGGCCCACGTAGGTTGAGCACCAAGATCCACAGGTGCCTTCAGAAGCTTCATAAATGGAACAAAGATAGGTTAAGGGGC
AGTATCCAACAAGCGATCATTGAGAAAGAACAAGAAATTCAAAGGCTAAATGAAGTAGCAAGTGACCAGGTAGGGGAAGACATCCTAAAAGCCGAATCTGAATTGGAAGA
TTTGTTGGAAGAAGAGGAAGAATATTGGCAAAACGGGTCATGGGAGGAAGATGCTACCGGCATTGGGATAATAGCAACAGAGTACTTCAAGTCCTTGTTCAAATCTTCAA
GACCAGAGGACAGAGCTATTGAAGAAATAACAGATGGTATACAACCAAGAATTCCCCAAGCTTTCAACAGAGCTAGACCAACCATATTCTAG
Protein sequence
MEEEKAIKELSKMKLTEEEKGGLVEVEDNDIEVTDKDIENTAACKILAPKSIIVEHFQRHIPKIWGVEGKVQMKKSGKNIYVCKFKSKKAKKRVIGGGPWIYDKAVILFD
ELQGNKGIKKYGMALANSIGTFVKMEEEAEEGKVWGETLRVKVRMEVSKPLRRGTNLKVGSMADEVWIPITIEKFPNFCYQCGRLGHVMNDCEEEDTEEDDEEGRKEGTK
SDLRWGNEERKVDQWAIKEDQSQPSTKREQVSPNKEEKNTSRKKENSAENGMEKHNAKPNAMSSKKWKRLARCDPMQLDEHEHPVSQKGKKHGRGEHTEEEATKKPKVNY
SEELGGISAEAAEQPYKTRSWRFTGFYGNPETDKRHHSWSLLERLSDNDSHPWLIGGDFNEITSMDEKKGGAPRNKRQMEMFSDVGNRCKLMDAGFSGDKFTWRRGKKKG
SQIKERLDRFCLNHPMAMLAPNLKVNHLNFYGSDHRPIIASWEEAATPFRIQRKGQLLRFERSWTKYKEAREVIVKSWRVGSVGGPRRLSTKIHRCLQKLHKWNKDRLRG
SIQQAIIEKEQEIQRLNEVASDQVGEDILKAESELEDLLEEEEEYWQNGSWEEDATGIGIIATEYFKSLFKSSRPEDRAIEEITDGIQPRIPQAFNRARPTIF
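
As a rough consistency check, the CDS sequence above can be translated and compared with the protein sequence. The following is a minimal Python sketch and not part of CuGenDB: the helper name `translate` is hypothetical, and it assumes the standard nuclear genetic code and an in-frame CDS that starts at its first base.

```python
# Illustrative sketch (not part of CuGenDB): translate a CDS using the
# standard genetic code. `translate` is a hypothetical helper name.
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, then normalise case.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing unexpected characters are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the CDS listed above.
print(translate("ATGGAAGAGGAGAAAGCTATCAAA"))  # MEEEKAIK
print(translate("AGACCAACCATATTCTAG"))        # RPTIF
```

The two fragments translate to MEEEKAIK and RPTIF, which agree with the start and end of the protein sequence listed above. Passing the full CDS (line breaks included) to the same function should reproduce the whole protein under the stated assumptions.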