CuGenDBv2

Spg031748 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg031748
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold11:38897132..38902062
RNA-Seq Expression: Spg031748
Synteny: Spg031748
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016788 - hydrolase activity, acting on ester bonds (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
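
The genome location listed above follows a `seqid:start..end` pattern. As a minimal sketch (the helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state), the string can be split into its parts and the span length computed:

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("scaffold11:38897132..38902062")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans 4931 bases on scaffold11.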


Homology
GenBank top hits | e value | % identity
CAD5314530.1 unnamed protein product [Arabidopsis thaliana] | 4.4e-51 | 28.57
Query:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRM
        GDFN++  + EK GGP R+    + FR  LN   L +++     FTW G+R    I  ++DR +   +++ LF  A  + + W  S HRP+   +D    
Subjt:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRM

Query:  VWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKN----LNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELD
         WR  +   F+++  W    A +   K H  W      + + N    L  C  +LS+W S    + + +I+E K  L  AY+  P +++  ++ LK +L 
Subjt:  VWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKN----LNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELD

Query:  KLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEG--MWTEDPA--LIEDTFITYFCESGA------------------------
        K    EE +W+ +SR  W++ GDKN+ +FH K   RR  N ++ + D +G     ED    L+E  F T F   G+                        
Subjt:  KLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEG--MWTEDPA--LIEDTFITYFCESGA------------------------

Query:  ---------------WNESFLNEAVGIDDLDIIRRI-PIDLRKSDSFMWHYDKYGKYTVKNEYRLFMKNRIER-ASSSSNTMGKVWDIVWKLKVLSKVKF
                       WNE  +++ + ++D ++I+ I P  +  SDS  W Y K G Y+VK+ Y    +  IE+ A+ +S      +  +W + V  K+K 
Subjt:  ---------------WNESFLNEAVGIDDLDIIRRI-PIDLRKSDSFMWHYDKYGKYTVKNEYRLFMKNRIER-ASSSSNTMGKVWDIVWKLKVLSKVKF

Query:  FCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFN-HCFEERWIALCEILSR--DELRIITVTCWAIWGD
        F W++L   LP   NL  RG+ +   C +C    E+++H LF     KEIW LT          N H   +    L  I     +EL+++ +  W IW  
Subjt:  FCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFN-HCFEERWIALCEILSR--DELRIITVTCWAIWGD

Query:  RNNHI-HGIKIPPPDIRSQWV--LKYLDEFDRANERRNIDNRPCSSS--VRLP
        RN  I    ++  PD+ SQ +  L   +E  + N+     N+PCS S  V+LP
Subjt:  RNNHI-HGIKIPPPDIRSQWV--LKYLDEFDRANERRNIDNRPCSSS--VRLP

KAF2302199.1 hypothetical protein GH714_033720 [Hevea brasiliensis] | 2.0e-43 | 26.43
Query:  FTWKGNRRGVQIW--ERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAK
        +TW  N R    W   +LDRF+ N +++  FA + +  LD++ SDH PI   V   R+   +   R F+FE  W     C +I++    W   S   + +
Subjt:  FTWKGNRRGVQIW--ERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAK

Query:  NLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIR
         L  CS  L +WG ++R   +  I +CK+ +             S    K +   LL  +E +W+QR++E W++ GD+N+ +FHRKA+IR++ N I  ++
Subjt:  NLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIR

Query:  DAEGMWTE------------------------------DPALIED---------------------TF--------ITYFCESGAWNESFLNEAVGIDDL
        D  G W +                               PAL+ D                     TF        ++     GAWN   +N      D 
Subjt:  DAEGMWTE------------------------------DPALIED---------------------TF--------ITYFCESGAWNESFLNEAVGIDDL

Query:  DIIRRIPIDLRKSDSF---MWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMG--KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGC
         +I  IP  LR+S S     W +DK G Y+V + Y+L       +   S   +G  K W  +W +    K++ F W+A+ G LPTR  L  R +     C
Subjt:  DIIRRIPIDLRKSDSF---MWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMG--KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGC

Query:  SMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRA
         +C++  ES+ H L  CS A+ +W  +  H        H F++  +    I + ++   +   CW+IW +RN+ +     P  +      L+   E+D A
Subjt:  SMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRA

Query:  NERRNIDNRPCSSSVRLPKPHHWSLP
                 P +    LP P  W  P
Subjt:  NERRNIDNRPCSSSVRLPKPHHWSLP

KAF2317147.1 hypothetical protein GH714_012179 [Hevea brasiliensis] | 1.4e-49 | 26.39
Query:  KRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIW--ERLDRFICNLEFESLF
        +RRA    + E  +  Q S      GDFN +L   EK GG    ++L+++FR  +    L ++  +   +TW  N R    W   +LDRF+ N +++  F
Subjt:  KRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIW--ERLDRFICNLEFESLF

Query:  AFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQAL
        A + +  LD++ SDH PI   V   R+   +   R F+FE  W     C +I++    W   S   + + L  CS  L +WG ++R   +  I +CK+ +
Subjt:  AFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQAL

Query:  KAAYDNAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTE-----------------------
                     S    K +   LL  +E +W+QR++E W++ GD+N+ +FHRKA+IR++ N I  ++D  G W +                       
Subjt:  KAAYDNAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTE-----------------------

Query:  -------DPALIED---------------------TF--------ITYFCESGAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSF---MWHYDKYGKYTV
                PAL+ D                     TF        ++     GAWN   +N      D  +I  IP  LR+S S     W +DK G Y+V
Subjt:  -------DPALIED---------------------TF--------ITYFCESGAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSF---MWHYDKYGKYTV

Query:  KNEYRLFMKNRIERASSSSNTMG--KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHD
         + Y+L       +   S   +G  K W  +W +    K++ F W+A+ G LPTR  L  R +     C +C++  ES+ H L  CS A+ +W  +  H 
Subjt:  KNEYRLFMKNRIERASSSSNTMG--KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHD

Query:  LLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERRNIDNRPCSSSVRLPKPHHWSLP
               H F++  +    + + ++   +   CW+IW +RN+ +     P  +      L+   E+D A         P +    LP P  W  P
Subjt:  LLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERRNIDNRPCSSSVRLPKPHHWSLP

KAF7825238.1 ribonuclease H [Senna tora] | 3.6e-45 | 25
Query:  GFVVSTRKKLNWKRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGV-QIWERLDR
        GF     KKL+WK    +  +N +S    +       GDFNE++ + EK GG  +    + EFR  L+ C L+++ F+  P+TW   R G   I ERLD+
Subjt:  GFVVSTRKKLNWKRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGV-QIWERLDR

Query:  FICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHV-LAKNLNSCSEALSKWGSDVRNS
           + E+ SLF F    +   +FS H  +  + D       +  +R F+FEE W + ++C +++     W+  S  V   K + SC +  +   +    S
Subjt:  FICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHV-LAKNLNSCSEALSKWGSDVRNS

Query:  MRTRIKECKQALKAAYD-NAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFI
        +R +IK  ++++              ++   + ELD+LL+ EEI W+QRSR  W++ GD N+ +FHRKA+ RR  N I  IRD +     D A I +T  
Subjt:  MRTRIKECKQALKAAYD-NAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFI

Query:  TYFC----------------------------------------------------------------------------ESGA------------WNES
        +YFC                                                                              GA            W++ 
Subjt:  TYFC----------------------------------------------------------------------------ESGA------------WNES

Query:  FLNEAVG-------------------------------IDDL------DIIRRIPIDLRK-SDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMG
        +++   G                               ID+L       +I+ IP+  R   D  MW  +K+G Y+V++ Y  F+ N     SSSS+   
Subjt:  FLNEAVG-------------------------------IDDL------DIIRRIPIDLRK-SDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMG

Query:  KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRDE
          W ++W L +  KVK F W+   G + + +N+H RG+     C  C   VES  H    C KA+E+W  +        D  + F + W    + L++D 
Subjt:  KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRDE

Query:  LRI---ITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERRNIDNRPCSSSVRLPKPH
          I   I + CW+IW  RN  +   K+ P +       + + +F+  N R +    P ++    P P+
Subjt:  LRI---ITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERRNIDNRPCSSSVRLPKPH

PWA36168.1 hypothetical protein CTI12_AA602590 [Artemisia annua] | 2.2e-50 | 26.02
Query:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQ-IWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRR
        GDFNE+++ +EK G     +  +  FR   + C L++        TW   RRG + + +RLDRF+ N  +  L+  A   NL    SDH PI     CR 
Subjt:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQ-IWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRR

Query:  MVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSA--HVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDK
            K + R F+FE  W   ++   +++    + +++   H     ++ C+  LS W       ++  IK  +++L+               +L+ ++ +
Subjt:  MVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSA--HVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDK

Query:  LLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF-----------CES--------------------
        LL  EE+ WKQRSR +W+R GDKN+ +FH +AS R++ N I  ++  +G W E+   +     +YF           CES                    
Subjt:  LLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF-----------CES--------------------

Query:  --------------GAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSFMWHYDKYGKYTVKNEYRLFMK--NRIERASSSSNTMGKVWDIVWKLKVLSKVK
                        WN   +      +    I    I   ++D   WH +  G+++ K+ Y L ++    + R ++ SN++   W +VWK +V SKVK
Subjt:  --------------GAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSFMWHYDKYGKYTVKNEYRLFMK--NRIERASSSSNTMGKVWDIVWKLKVLSKVK

Query:  FFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRD---ELRIITVTCWAIWG
         F W+A   ++PT  NL  RG++    C+ C    E++ H LF CS AK++W         +      F++     C+++      E     +  W +W 
Subjt:  FFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRD---ELRIITVTCWAIWG

Query:  DRNNHIHGIKIPPPDIRSQWVLK-YLDEFDRANERRNI
         RN H HG ++   +   + + K  L ++ +AN+R NI
Subjt:  DRNNHIHGIKIPPPDIRSQWVLK-YLDEFDRANERRNI

TrEMBL top hits | e value | % identity
A0A2N9HDU2 Uncharacterized protein | 2.6e-49 | 28.42
Query:  KDLINKWQKFSLKEREKEPAISFKPEERSNIEGNLGHNLIGKLLSSRIISSLAIKNAMVGDWKTKHKFSIKN----------------------------
        +DL   WQ+ SL E E++  I  +P ++S       H L  K  + R+I+  A+       W++++ FS ++                            
Subjt:  KDLINKWQKFSLKEREKEPAISFKPEERSNIEGNLGHNLIGKLLSSRIISSLAIKNAMVGDWKTKHKFSIKN----------------------------

Query:  ------KCAAEKFGNLIGEFLEMENEEEDLAWSDSIRIRVKIDISKALLRGFMLKSGEAGG-ERWITIRYERLLDFCFKCGCIGHGAKECKTDQKDGGS-
              K  A   G  IGE L   + +E+L     +RIRVK++I+K L RG   K G A G + W   RYERL +FC+ CG + HG K+C    ++  + 
Subjt:  ------KCAAEKFGNLIGEFLEMENEEEDLAWSDSIRIRVKIDISKALLRGFMLKSGEAGG-ERWITIRYERLLDFCFKCGCIGHGAKECKTDQKDGGS-

Query:  NKNNFEFDAWLKFQG---YFR------------GTRKQT-PPANEKSPDLNSSRHPEN--SEDIVHNTVYIPDL--ADEGGAVDFNLEKDSEGEQKNE--
         + +  + AWL+  G   Y +            GT K +    +E   DL + + P N        N    PDL  +D G   +   E D E    NE  
Subjt:  NKNNFEFDAWLKFQG---YFR------------GTRKQT-PPANEKSPDLNSSRHPEN--SEDIVHNTVYIPDL--ADEGGAVDFNLEKDSEGEQKNE--

Query:  -QVLSNDSVFEMAMEEDGIQIIESSRLKEVNNVSGTLSQSEGSS--GGFVVSTRKKLNWKRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGP
         ++ +   +       + +  +E S   E    S T     G+S  GG ++ T     W    R        +  Q        GDFNE++ + E +G  
Subjt:  -QVLSNDSVFEMAMEEDGIQIIESSRLKEVNNVSGTLSQSEGSS--GGFVVSTRKKLNWKRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGP

Query:  RRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNR-RGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEF
        RR    ++ FR  +++C+L ++ +  +PFTW  NR      W RLDR +  L +   F  A   +LD   SDH+ +  + + R    +  +R+PF+F+E 
Subjt:  RRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNR-RGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEF

Query:  WTHYEACEDIIKTHGDWQV--SSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLS-VHSLKFELDKLLEEEEIFWKQRSRE
        WT  E CE+ I+      +  S+   +A+ L +C + L  W      S+  +++E KQ L  A + A     L  +  LK E++ LLE EE  W+QRSR 
Subjt:  WTHYEACEDIIKTHGDWQV--SSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLS-VHSLKFELDKLLEEEEIFWKQRSRE

Query:  DWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF
         W+  GD+N+ +FH +AS RR+ N+I G++D +G+W ED   +    + YF
Subjt:  DWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF

A0A2U1KHJ0 CCHC-type domain-containing protein | 1.1e-50 | 26.02
Query:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQ-IWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRR
        GDFNE+++ +EK G     +  +  FR   + C L++        TW   RRG + + +RLDRF+ N  +  L+  A   NL    SDH PI     CR 
Subjt:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQ-IWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRR

Query:  MVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSA--HVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDK
            K + R F+FE  W   ++   +++    + +++   H     ++ C+  LS W       ++  IK  +++L+               +L+ ++ +
Subjt:  MVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSA--HVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDK

Query:  LLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF-----------CES--------------------
        LL  EE+ WKQRSR +W+R GDKN+ +FH +AS R++ N I  ++  +G W E+   +     +YF           CES                    
Subjt:  LLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF-----------CES--------------------

Query:  --------------GAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSFMWHYDKYGKYTVKNEYRLFMK--NRIERASSSSNTMGKVWDIVWKLKVLSKVK
                        WN   +      +    I    I   ++D   WH +  G+++ K+ Y L ++    + R ++ SN++   W +VWK +V SKVK
Subjt:  --------------GAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSFMWHYDKYGKYTVKNEYRLFMK--NRIERASSSSNTMGKVWDIVWKLKVLSKVK

Query:  FFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRD---ELRIITVTCWAIWG
         F W+A   ++PT  NL  RG++    C+ C    E++ H LF CS AK++W         +      F++     C+++      E     +  W +W 
Subjt:  FFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRD---ELRIITVTCWAIWG

Query:  DRNNHIHGIKIPPPDIRSQWVLK-YLDEFDRANERRNI
         RN H HG ++   +   + + K  L ++ +AN+R NI
Subjt:  DRNNHIHGIKIPPPDIRSQWVLK-YLDEFDRANERRNI

A0A6A6MVN1 Protein-serine/threonine phosphatase | 6.9e-50 | 26.39
Query:  KRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIW--ERLDRFICNLEFESLF
        +RRA    + E  +  Q S      GDFN +L   EK GG    ++L+++FR  +    L ++  +   +TW  N R    W   +LDRF+ N +++  F
Subjt:  KRRARMGHINESSIQDQMSKKRKSRGDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIW--ERLDRFICNLEFESLF

Query:  AFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQAL
        A + +  LD++ SDH PI   V   R+   +   R F+FE  W     C +I++    W   S   + + L  CS  L +WG ++R   +  I +CK+ +
Subjt:  AFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQAL

Query:  KAAYDNAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTE-----------------------
                     S    K +   LL  +E +W+QR++E W++ GD+N+ +FHRKA+IR++ N I  ++D  G W +                       
Subjt:  KAAYDNAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTE-----------------------

Query:  -------DPALIED---------------------TF--------ITYFCESGAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSF---MWHYDKYGKYTV
                PAL+ D                     TF        ++     GAWN   +N      D  +I  IP  LR+S S     W +DK G Y+V
Subjt:  -------DPALIED---------------------TF--------ITYFCESGAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSF---MWHYDKYGKYTV

Query:  KNEYRLFMKNRIERASSSSNTMG--KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHD
         + Y+L       +   S   +G  K W  +W +    K++ F W+A+ G LPTR  L  R +     C +C++  ES+ H L  CS A+ +W  +  H 
Subjt:  KNEYRLFMKNRIERASSSSNTMG--KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHD

Query:  LLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERRNIDNRPCSSSVRLPKPHHWSLP
               H F++  +    + + ++   +   CW+IW +RN+ +     P  +      L+   E+D A         P +    LP P  W  P
Subjt:  LLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERRNIDNRPCSSSVRLPKPHHWSLP

A0A7G2DZH3 (thale cress) hypothetical protein | 2.1e-51 | 28.57
Query:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRM
        GDFN++  + EK GGP R+    + FR  LN   L +++     FTW G+R    I  ++DR +   +++ LF  A  + + W  S HRP+   +D    
Subjt:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRM

Query:  VWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKN----LNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELD
         WR  +   F+++  W    A +   K H  W      + + N    L  C  +LS+W S    + + +I+E K  L  AY+  P +++  ++ LK +L 
Subjt:  VWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKN----LNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELD

Query:  KLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEG--MWTEDPA--LIEDTFITYFCESGA------------------------
        K    EE +W+ +SR  W++ GDKN+ +FH K   RR  N ++ + D +G     ED    L+E  F T F   G+                        
Subjt:  KLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEG--MWTEDPA--LIEDTFITYFCESGA------------------------

Query:  ---------------WNESFLNEAVGIDDLDIIRRI-PIDLRKSDSFMWHYDKYGKYTVKNEYRLFMKNRIER-ASSSSNTMGKVWDIVWKLKVLSKVKF
                       WNE  +++ + ++D ++I+ I P  +  SDS  W Y K G Y+VK+ Y    +  IE+ A+ +S      +  +W + V  K+K 
Subjt:  ---------------WNESFLNEAVGIDDLDIIRRI-PIDLRKSDSFMWHYDKYGKYTVKNEYRLFMKNRIER-ASSSSNTMGKVWDIVWKLKVLSKVKF

Query:  FCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFN-HCFEERWIALCEILSR--DELRIITVTCWAIWGD
        F W++L   LP   NL  RG+ +   C +C    E+++H LF     KEIW LT          N H   +    L  I     +EL+++ +  W IW  
Subjt:  FCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFN-HCFEERWIALCEILSR--DELRIITVTCWAIWGD

Query:  RNNHI-HGIKIPPPDIRSQWV--LKYLDEFDRANERRNIDNRPCSSS--VRLP
        RN  I    ++  PD+ SQ +  L   +E  + N+     N+PCS S  V+LP
Subjt:  RNNHI-HGIKIPPPDIRSQWV--LKYLDEFDRANERRNIDNRPCSSS--VRLP

A0A803QI56 Uncharacterized protein | 4.0e-50 | 29.61
Query:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTW---KGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDC
        GD N ++   +K GG      L++ F+  LNDC L +M    +PFTW   +G+R  +++  RLDR + N  +  +F  A   NL+ + SDH PI    + 
Subjt:  GDFNELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTW---KGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDC

Query:  RRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDK
                  R FKFE  W     C +I++    W+      L   L  C++ LS WG +V  + + RI  CK  +K   +            +K EL  
Subjt:  RRMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVSSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDK

Query:  LLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMW--TED--------PALIEDTFITYF-CESGAWNESFLNEAVGIDDLDII
        +L++ E FWKQRS++ W++ GD NS       S R  + +   I   +  W  +ED        P+L E         E G+W+   LN+     D  +I
Subjt:  LLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMW--TED--------PALIEDTFITYF-CESGAWNESFLNEAVGIDDLDII

Query:  RRIPIDL-RKSDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMGKVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAV
          IP+++   +D   W Y+  G Y+VK+ Y L  K    R   + + + K W   WK K+  KVK   W+A R  LPT   L I+ +D+   C +C    
Subjt:  RRIPIDL-RKSDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMGKVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAV

Query:  ESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNN
        ES+ H L +C+K K++W                F + ++A       ++  ++ V CWAIW  RN+
Subjt:  ESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRDELRIITVTCWAIWGDRNN

SwissProt top hits | e value | % identity
P0C2F6 Putative ribonuclease H protein At1g65750 | 2.2e-05 | 25.21
Query:  IRRIPIDL--RKSDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMGKVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSM
        +R + +DL     D   W + + G+++V++ Y +   + + R +     M   ++ +WK++V  +VK F W      + T    H R +     C +C  
Subjt:  IRRIPIDL--RKSDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMGKVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSM

Query:  AVESIDHCLFSCSKAKEIW
         VES+ H L  C     IW
Subjt:  AVESIDHCLFSCSKAKEIW

Arabidopsis top hits | e value | % identity
AT1G43760.1 DNAse I-like superfamily protein | 2.4e-07 | 19.85
Query:  GDFNELLWDYEKYGGPRRTSHL--LEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCR
        GDF+++    + Y   + +  +  LEEF++ L D +L ++      +TW  ++    I  +LDR I N ++ S F  A +       SDH P    ++  
Subjt:  GDFNELLWDYEKYGGPRRTSHL--LEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCR

Query:  RMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVS-SAHVLA-----KNLNSCSEALSKWG-SDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSL
             K  ++ F++  F + +      +    + Q+   +H+ +     K    C + L++ G  ++++  +  +   +        N     F   H  
Subjt:  RMVWRKSRRRPFKFEEFWTHYEACEDIIKTHGDWQVS-SAHVLA-----KNLNSCSEALSKWG-SDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSL

Query:  KFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF
        + + +      E F++Q+SR  W++ GD N+ +FH+     +  N I  +R  + +  E+   +++  + Y+
Subjt:  KFELDKLLEEEEIFWKQRSREDWMRWGDKNSGWFHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYF

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 5.4e-07 | 31.3
Query:  VWKLKVLSKVKFFCWKALRGFLPTRVNL----HIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEER---WIALCEILSR
        VW    + K  F  W +    LPTR  L    HI+  D    C +C++  ES DH LFSC  A ++WRL +   L  R    C       W+      + 
Subjt:  VWKLKVLSKVKFFCWKALRGFLPTRVNL----HIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEER---WIALCEILSR

Query:  DELRIIT--VTCWAIWGDRNNHIH-GIKIPP
          LR ++     + IW  RNN +H  ++I P
Subjt:  DELRIIT--VTCWAIWGDRNNHIH-GIKIPP

AT2G02650.1 Ribonuclease H-like superfamily protein | 4.0e-10 | 26.52
Query:  KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEE---RWIALCEILS
        +V   +WKL V  K+K F W+ + G L T   L  R +D    C  C +  E+I H +F+C   + +WR        +      FE+   R I L +  +
Subjt:  KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEE---RWIALCEILS

Query:  RDEL-RIITV-TCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERR-----NIDNRPCSSSVRLPKPHHWSLPP
         + L R +     W +W  RN  +   K   PD  ++  ++   E+  ANE       ++   P  +S R      W+ PP
Subjt:  RDEL-RIITV-TCWAIWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERR-----NIDNRPCSSSVRLPKPHHWSLPP

AT3G09510.1 Ribonuclease H-like superfamily protein | 3.4e-17 | 30.21
Query:  WNESFLNEAVGIDDLDIIRRIPI-DLRKSDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMGKVWDI---VWKLKVLSKVKFFCWKALRGFLPTR
        W++S +++ V   D   I RI +   +K D  +W+Y+  G+YTV++ Y L   +      + +   G + D+   +W L ++ K+K F W+AL   L T 
Subjt:  WNESFLNEAVGIDDLDIIRRIPI-DLRKSDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMGKVWDI---VWKLKVLSKVKFFCWKALRGFLPTR

Query:  VNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLT----YCHDLLERDFNHCFEERWIALCEILSRDELRIITV-TCWAIWGDRNN
          L  RGM I   C  C    ESI+H LF+C  A   WRL+      + L+  DF          + +    D  +++ V   W IW  RNN
Subjt:  VNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLT----YCHDLLERDFNHCFEERWIALCEILSRDELRIITV-TCWAIWGDRNN

AT3G25270.1 Ribonuclease H-like superfamily protein | 1.4e-07 | 28.7
Query:  VWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERW-IALCEILSRDELRII
        +WKLK   K+K F WK L G L T  NL  R +     C  C    E+  H  F C  A+++WR +       R      E +  + L   L+  + ++ 
Subjt:  VWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERW-IALCEILSRDELRII

Query:  TVTCWAIW
         +  W +W
Subjt:  TVTCWAIW


Sequences
CDS sequence
ATGGAAACGAAGGATTTAATCAACAAATGGCAAAAGTTCAGTCTGAAGGAGCGTGAAAAAGAGCCAGCCATTTCGTTTAAACCTGAAGAGAGATCGAATATTGAAGGTAA
TCTTGGTCATAATCTAATCGGGAAGTTACTATCTAGTAGAATCATTTCAAGTCTGGCGATCAAGAATGCAATGGTTGGAGATTGGAAGACCAAACATAAATTCAGCATTA
AGAATAAATGTGCGGCGGAAAAATTTGGAAACCTCATTGGTGAATTCTTGGAGATGGAAAATGAGGAAGAGGATCTTGCTTGGAGTGACAGCATCCGAATCAGAGTTAAA
ATTGATATTTCCAAAGCCTTGCTTAGAGGCTTTATGCTGAAATCAGGTGAAGCCGGGGGAGAACGATGGATTACTATCAGGTATGAAAGACTACTTGACTTTTGCTTTAA
ATGTGGATGTATCGGTCACGGTGCGAAAGAATGCAAGACTGACCAAAAAGATGGGGGATCCAATAAGAATAACTTTGAATTCGACGCATGGTTAAAGTTCCAGGGTTATT
TTAGAGGAACGAGGAAACAAACTCCTCCAGCTAATGAGAAGTCTCCTGACCTGAACAGTTCTAGACATCCAGAGAATTCTGAGGACATTGTTCATAATACTGTATATATT
CCTGACCTGGCTGATGAGGGGGGAGCAGTAGACTTTAATCTTGAAAAGGATTCTGAGGGAGAACAAAAGAATGAGCAAGTCCTCTCGAATGATTCTGTATTCGAGATGGC
GATGGAGGAGGATGGAATTCAGATCATTGAGAGCAGTAGACTGAAGGAGGTGAATAATGTTTCGGGTACTCTGAGTCAAAGTGAAGGTTCTAGTGGAGGATTTGTTGTTT
CAACTAGAAAGAAGCTAAACTGGAAGAGGCGTGCAAGGATGGGACATATTAATGAGTCTTCAATTCAAGATCAGATGTCTAAGAAAAGGAAGTCCAGAGGTGATTTCAAT
GAATTGTTGTGGGATTATGAGAAGTATGGAGGCCCAAGAAGAACGAGTCATCTGTTAGAGGAATTTCGAAGTACTCTGAATGACTGTGAGTTAAAGGAGATGCGTTTCTC
TGACAACCCTTTTACTTGGAAAGGAAATCGAAGGGGAGTTCAAATCTGGGAACGGTTGGACAGGTTTATATGTAACCTTGAATTCGAGTCTCTGTTTGCTTTTGCAGGGT
CGCGTAACTTGGATTGGACGTTCTCGGATCACAGGCCAATTGAGGCCTCAGTTGACTGTCGTCGTATGGTTTGGAGGAAATCAAGGAGGCGCCCCTTTAAATTCGAGGAG
TTCTGGACCCATTATGAGGCATGCGAGGATATCATTAAGACACATGGAGACTGGCAGGTTTCTTCGGCACATGTATTAGCAAAAAACTTGAATTCCTGCTCTGAAGCTTT
AAGCAAATGGGGCAGTGATGTGAGAAATTCTATGCGGACCAGAATTAAAGAATGTAAACAAGCCCTGAAGGCAGCCTATGACAATGCTCCCCATATGGATTTTCTCTCTG
TTCATAGCCTGAAGTTCGAATTAGATAAATTGCTGGAAGAAGAAGAGATCTTTTGGAAGCAAAGATCTCGTGAAGACTGGATGCGGTGGGGAGACAAAAACTCGGGTTGG
TTTCACAGGAAGGCATCTATCCGAAGGCAAATTAATGAGATTTCTGGAATTCGTGATGCAGAAGGTATGTGGACTGAAGATCCTGCCTTAATTGAGGACACTTTTATAAC
ATACTTCTGTGAATCCGGAGCTTGGAATGAAAGTTTTCTAAATGAAGCAGTGGGTATTGATGACCTTGATATTATCAGGCGAATCCCTATAGATCTAAGGAAGTCTGACA
GTTTTATGTGGCATTATGACAAATATGGAAAATACACAGTGAAGAATGAATACCGCCTGTTTATGAAAAACAGAATTGAAAGGGCTTCCTCGAGTAGTAATACTATGGGG
AAAGTCTGGGACATTGTTTGGAAGCTCAAAGTTCTGTCGAAGGTCAAATTTTTCTGCTGGAAAGCTCTGAGGGGTTTCCTTCCTACCAGAGTCAATTTACATATCCGGGG
AATGGATATTTTTATTGGCTGCTCTATGTGTTCTATGGCGGTTGAATCTATTGATCACTGTTTATTTTCTTGTTCAAAAGCTAAAGAGATTTGGAGGCTTACCTATTGCC
ATGATCTTCTGGAAAGGGACTTTAATCACTGCTTTGAAGAGAGGTGGATCGCCCTTTGTGAAATCCTTTCTAGAGATGAGCTCAGAATAATCACAGTGACATGCTGGGCT
ATCTGGGGAGATAGAAATAATCATATTCATGGTATAAAAATTCCCCCTCCAGACATTCGCAGTCAATGGGTTTTGAAATACCTGGATGAATTTGATCGTGCGAATGAGAG
ACGGAATATTGATAATCGGCCTTGTTCTTCCTCGGTGCGTCTCCCCAAACCTCATCACTGGTCTCTTCCCCCTGCTTAA
mRNA sequence
ATGGAAACGAAGGATTTAATCAACAAATGGCAAAAGTTCAGTCTGAAGGAGCGTGAAAAAGAGCCAGCCATTTCGTTTAAACCTGAAGAGAGATCGAATATTGAAGGTAA
TCTTGGTCATAATCTAATCGGGAAGTTACTATCTAGTAGAATCATTTCAAGTCTGGCGATCAAGAATGCAATGGTTGGAGATTGGAAGACCAAACATAAATTCAGCATTA
AGAATAAATGTGCGGCGGAAAAATTTGGAAACCTCATTGGTGAATTCTTGGAGATGGAAAATGAGGAAGAGGATCTTGCTTGGAGTGACAGCATCCGAATCAGAGTTAAA
ATTGATATTTCCAAAGCCTTGCTTAGAGGCTTTATGCTGAAATCAGGTGAAGCCGGGGGAGAACGATGGATTACTATCAGGTATGAAAGACTACTTGACTTTTGCTTTAA
ATGTGGATGTATCGGTCACGGTGCGAAAGAATGCAAGACTGACCAAAAAGATGGGGGATCCAATAAGAATAACTTTGAATTCGACGCATGGTTAAAGTTCCAGGGTTATT
TTAGAGGAACGAGGAAACAAACTCCTCCAGCTAATGAGAAGTCTCCTGACCTGAACAGTTCTAGACATCCAGAGAATTCTGAGGACATTGTTCATAATACTGTATATATT
CCTGACCTGGCTGATGAGGGGGGAGCAGTAGACTTTAATCTTGAAAAGGATTCTGAGGGAGAACAAAAGAATGAGCAAGTCCTCTCGAATGATTCTGTATTCGAGATGGC
GATGGAGGAGGATGGAATTCAGATCATTGAGAGCAGTAGACTGAAGGAGGTGAATAATGTTTCGGGTACTCTGAGTCAAAGTGAAGGTTCTAGTGGAGGATTTGTTGTTT
CAACTAGAAAGAAGCTAAACTGGAAGAGGCGTGCAAGGATGGGACATATTAATGAGTCTTCAATTCAAGATCAGATGTCTAAGAAAAGGAAGTCCAGAGGTGATTTCAAT
GAATTGTTGTGGGATTATGAGAAGTATGGAGGCCCAAGAAGAACGAGTCATCTGTTAGAGGAATTTCGAAGTACTCTGAATGACTGTGAGTTAAAGGAGATGCGTTTCTC
TGACAACCCTTTTACTTGGAAAGGAAATCGAAGGGGAGTTCAAATCTGGGAACGGTTGGACAGGTTTATATGTAACCTTGAATTCGAGTCTCTGTTTGCTTTTGCAGGGT
CGCGTAACTTGGATTGGACGTTCTCGGATCACAGGCCAATTGAGGCCTCAGTTGACTGTCGTCGTATGGTTTGGAGGAAATCAAGGAGGCGCCCCTTTAAATTCGAGGAG
TTCTGGACCCATTATGAGGCATGCGAGGATATCATTAAGACACATGGAGACTGGCAGGTTTCTTCGGCACATGTATTAGCAAAAAACTTGAATTCCTGCTCTGAAGCTTT
AAGCAAATGGGGCAGTGATGTGAGAAATTCTATGCGGACCAGAATTAAAGAATGTAAACAAGCCCTGAAGGCAGCCTATGACAATGCTCCCCATATGGATTTTCTCTCTG
TTCATAGCCTGAAGTTCGAATTAGATAAATTGCTGGAAGAAGAAGAGATCTTTTGGAAGCAAAGATCTCGTGAAGACTGGATGCGGTGGGGAGACAAAAACTCGGGTTGG
TTTCACAGGAAGGCATCTATCCGAAGGCAAATTAATGAGATTTCTGGAATTCGTGATGCAGAAGGTATGTGGACTGAAGATCCTGCCTTAATTGAGGACACTTTTATAAC
ATACTTCTGTGAATCCGGAGCTTGGAATGAAAGTTTTCTAAATGAAGCAGTGGGTATTGATGACCTTGATATTATCAGGCGAATCCCTATAGATCTAAGGAAGTCTGACA
GTTTTATGTGGCATTATGACAAATATGGAAAATACACAGTGAAGAATGAATACCGCCTGTTTATGAAAAACAGAATTGAAAGGGCTTCCTCGAGTAGTAATACTATGGGG
AAAGTCTGGGACATTGTTTGGAAGCTCAAAGTTCTGTCGAAGGTCAAATTTTTCTGCTGGAAAGCTCTGAGGGGTTTCCTTCCTACCAGAGTCAATTTACATATCCGGGG
AATGGATATTTTTATTGGCTGCTCTATGTGTTCTATGGCGGTTGAATCTATTGATCACTGTTTATTTTCTTGTTCAAAAGCTAAAGAGATTTGGAGGCTTACCTATTGCC
ATGATCTTCTGGAAAGGGACTTTAATCACTGCTTTGAAGAGAGGTGGATCGCCCTTTGTGAAATCCTTTCTAGAGATGAGCTCAGAATAATCACAGTGACATGCTGGGCT
ATCTGGGGAGATAGAAATAATCATATTCATGGTATAAAAATTCCCCCTCCAGACATTCGCAGTCAATGGGTTTTGAAATACCTGGATGAATTTGATCGTGCGAATGAGAG
ACGGAATATTGATAATCGGCCTTGTTCTTCCTCGGTGCGTCTCCCCAAACCTCATCACTGGTCTCTTCCCCCTGCTTAA
Protein sequence
METKDLINKWQKFSLKEREKEPAISFKPEERSNIEGNLGHNLIGKLLSSRIISSLAIKNAMVGDWKTKHKFSIKNKCAAEKFGNLIGEFLEMENEEEDLAWSDSIRIRVK
IDISKALLRGFMLKSGEAGGERWITIRYERLLDFCFKCGCIGHGAKECKTDQKDGGSNKNNFEFDAWLKFQGYFRGTRKQTPPANEKSPDLNSSRHPENSEDIVHNTVYI
PDLADEGGAVDFNLEKDSEGEQKNEQVLSNDSVFEMAMEEDGIQIIESSRLKEVNNVSGTLSQSEGSSGGFVVSTRKKLNWKRRARMGHINESSIQDQMSKKRKSRGDFN
ELLWDYEKYGGPRRTSHLLEEFRSTLNDCELKEMRFSDNPFTWKGNRRGVQIWERLDRFICNLEFESLFAFAGSRNLDWTFSDHRPIEASVDCRRMVWRKSRRRPFKFEE
FWTHYEACEDIIKTHGDWQVSSAHVLAKNLNSCSEALSKWGSDVRNSMRTRIKECKQALKAAYDNAPHMDFLSVHSLKFELDKLLEEEEIFWKQRSREDWMRWGDKNSGW
FHRKASIRRQINEISGIRDAEGMWTEDPALIEDTFITYFCESGAWNESFLNEAVGIDDLDIIRRIPIDLRKSDSFMWHYDKYGKYTVKNEYRLFMKNRIERASSSSNTMG
KVWDIVWKLKVLSKVKFFCWKALRGFLPTRVNLHIRGMDIFIGCSMCSMAVESIDHCLFSCSKAKEIWRLTYCHDLLERDFNHCFEERWIALCEILSRDELRIITVTCWA
IWGDRNNHIHGIKIPPPDIRSQWVLKYLDEFDRANERRNIDNRPCSSSVRLPKPHHWSLPPA
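
The CDS and protein sequences above can be cross-checked by translating a few codons. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is not part of CuGenDB:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven and last three codons of the CDS listed above.
first_residues = translate("ATGGAAACGAAGGATTTAATC")
last_residues = translate("CCTGCTTAA")
print(first_residues, last_residues)
```

The first seven codons of the CDS translate to METKDLI and the final three to P, A and a stop codon, which agrees with the start and end of the protein sequence listed above.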