CuGenDBv2

Spg028183 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg028183
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold2:45643115..45651344
RNA-Seq Expression: Spg028183
Synteny: Spg028183
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type
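
The genome location of this record is given as a scaffold name followed by a start..end pair. The sketch below shows one way to split such a string and compute the span of the gene. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


# Location string copied from the record.
seqid, start, end = parse_location("scaffold2:45643115..45651344")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 8230 bp on scaffold2, including any introns.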


Homology
GenBank top hits (e value, % identity, alignment)
GAU21788.1 hypothetical protein TSUD_329120, partial [Trifolium subterraneum] (e value: 2.2e-58; % identity: 28.57)
Query:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY
        ++++ I +E+E++   D+    T+V KI TE   N+ +FK  M + W+    + I+ +  NLF   F TR+    V + GPW+FDR+LLI   I G+E+ 
Subjt:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY

Query:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG
        + L     SFW+   +LP    S   A  LGN VG FE +D+ E  +  G+ LR+RV +++ +PLKR  K+      ++ W+  KYE+LP++C+ CGRIG
Subjt:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG

Query:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK
        H MRDCE++E        E +E  Q FGPWL+  SP+   S + +K    +       P     RG N GT        AKE +EE E  ++ + +    
Subjt:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK

Query:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDK-GKKVSGIFERNRHVGLAEEDDK---KSESMRKGALNPSQSLDVGLIDGPKE
        H+    +     +    +G+   ++N +   +    +     +S + K G+++  + E      + +   K   +  +   G       +DV + DG  E
Subjt:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDK-GKKVSGIFERNRHVGLAEEDDK---KSESMRKGALNPSQSLDVGLIDGPKE

Query:  KEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSK---GMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----
                            +G ++KV  +VV K+        ++R    E+ +    MET         R    + + I+ +L F+N   V C G    
Subjt:  KEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSK---GMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----

Query:  KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFR
        + GGL L+W   + V+I SY   H+      +++ G W  +G Y  PE   +  +W L+  L+  N   W+  GDFN+I+   EK+GG  R+ SQ S  R
Subjt:  KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFR

Query:  ETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK
        + +  C  +D                     E ++R+             +VNHL    SDH  +V S +     S TR   RL RFEE WTK
Subjt:  ETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK

KAF4372682.1 hypothetical protein F8388_000849, partial [Cannabis sativa] (e value: 3.5e-56; % identity: 26.51)
Query:  ENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGT-NLFECTFKTRKGMRKVTEGGPWNF
        E  S+  E L++   +   V+ +    +EE  + +  ++V K++  K  N +  ++ M  +WK+     ++++ + N+F   F +R+  ++V  GGPW F
Subjt:  ENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGT-NLFECTFKTRKGMRKVTEGGPWNF

Query:  DRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQV
        D+ L+ F +  G    + +NF + SFWI   N+P  C +  +A   G  +G  E +      K    T++VRV++ + EPLKR +++ V     +  +  
Subjt:  DRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQV

Query:  KYEKLPDYCYGCGRIGHHMRDCEEVEKESD----ETIQFGPWL-KYDSPIETR-----SKDNQKIGKGTPRRGRG--------RGVNRGTGRTHE-----
        +YE LPD+C+ CG IGH   DC   +   D    +  ++G W+    SP+  R      +DN    +     G          +G  R +G +HE     
Subjt:  KYEKLPDYCYGCGRIGHHMRDCEEVEKESD----ETIQFGPWL-KYDSPIETR-----SKDNQKIGKGTPRRGRG--------RGVNRGTGRTHE-----

Query:  --------------------HSGAKESDEEDE---------------ASEEESEEPDRKHRPELPEVDGE-AVEARGNGENTLE--------------RN
                            H G+  +  + E                S+  SE        EL  ++G  A +  G G+N  E              + 
Subjt:  --------------------HSGAKESDEEDE---------------ASEEESEEPDRKHRPELPEVDGE-AVEARGNGENTLE--------------RN

Query:  PRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASN--ERVEDQNKKGSKEKVG
            E E +     + V G++    V  +   N H       +  S    KG ++ S  ++V      +E  +  +  + +   +R+     +GSK K  
Subjt:  PRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASN--ERVEDQNKKGSKEKVG

Query:  SEVVHKEIMIKQGTTMVRTTEPESSKG----------METLGKERQTGQRQ--------NERK-----CDKIKRELNFENWFSVQCKGKSGGLLLLWKVD
        S  +  ++        +    P+   G           E + K R   Q +        +E K      + I+R+++F N F V C GKSGGLLLLW  D
Subjt:  SEVVHKEIMIKQGTTMVRTTEPESSKG----------METLGKERQTGQRQ--------NERK-----CDKIKRELNFENWFSVQCKGKSGGLLLLWKVD

Query:  IDVSIQSYYEGHVDAIIK-KTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGY
         +VS++S+  GH+DA++K      WRF+GFY NP+   R  SW L+ +L  + D  WI GGDFNE++S  EKKGG  R+ S  S F++ +++C L+D G+
Subjt:  IDVSIQSYYEGHVDAIIK-KTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGY

Query:  SRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTKFKETKGNGDYTYIDQD
            FTW   +     + ERLDRY  N D      + +V +  F+ SDHRPI A  +     S+   + +  RFE  W K  E +     T++  D
Subjt:  SRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTKFKETKGNGDYTYIDQD

MCH80348.1 hypothetical protein [Trifolium medium] (e value: 1.8e-68; % identity: 29.68)
Query:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY
        ++++ I +EDE++   D+    T+V KI TE   N+ +FK  M + W+    + I+ +  NL+   F T++    V   GPW+FDR+LLI   I G+E+ 
Subjt:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY

Query:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG
        + L   + SFW+   +LP    S   A  LGN VG FE +D  E  +  G+ LR+RV +++ +PLKR  K+      ++ W+  KYE+LP++C+ CGRIG
Subjt:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG

Query:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK
        H MRDCE++E        E +E  Q FGPWL+  SP+   S + +K    +       P     +G N GT        AKE DEE E  +  S++   +
Subjt:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK

Query:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQR
         +     +  +A       +N  +     AE   T T + +T  G+       G   R     + +   +K+ +   G +     +DV + DG  E    
Subjt:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQR

Query:  NVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGT--------TMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG---
                        KG ++K+  +VV K+ +   G+         + R   P+    MET         R    + + I+ +L F+N  +V C G   
Subjt:  NVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGT--------TMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG---

Query:  -KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLF
         ++GGL L+W   + V+I S+   H+      +++   W  +G Y  PE   +  +W L+  L+  N   W+  GDFN+I+   EK+GG  R+ +Q S+ 
Subjt:  -KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLF

Query:  RETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK
        R+ +    L+D G+    FTW  G+ +   +  RLDR + N +   +    +VNHL    SDH  +V   +  +  S TR   RL RFEE WTK
Subjt:  RETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK

MCH84017.1 zinc CCHC-type-like protein [Trifolium medium] (e value: 9.1e-57; % identity: 29.74)
Query:  KILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKW
        KI TE   N+ +FK  M + W+    V+I+ +  NLF   F T+K    V + GPW+FDR+LLI   I G+E+ + L     SFW+   +LP    S   
Subjt:  KILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKW

Query:  ATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIGHHMRDCEEVE-------KESDETIQ-
        A  LGN +G+FE +D+ +  +  G+ LR+RV +++ +PLKR  K+      +  W+  KYE+LP++C+ CGRIGH MRDCEEVE       +E +E  Q 
Subjt:  ATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIGHHMRDCEEVE-------KESDETIQ-

Query:  FGPWLKYDSPIETRSKDNQKIGKGT------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEE--SEEPDRKHRPELPEVDGEAVEARGNGENTLER
        FGPWL+     +   +  ++   GT      P     +G + GT + +E    +E D++   SE E   +  D+ H   +  V    VE  G+ +  +E 
Subjt:  FGPWLKYDSPIETRSKDNQKIGKGT------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEE--SEEPDRKHRPELPEVDGEAVEARGNGENTLER

Query:  NPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASNERVEDQNKKGSKEKVGS
            AE     T +   +S +DK                 +    + + +R     P       L++  K  ++  V  RA+   + D  ++  K+K   
Subjt:  NPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASNERVEDQNKKGSKEKVGS

Query:  EV----VHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----KSGGLLLLWKVDIDVSIQSYYEGHVDAI
        E+    V   + ++    + R   P+    MET         R    + D I+ +L F+N  SV C+G    ++GG+ L+W   + ++I SY   H+   
Subjt:  EV----VHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----KSGGLLLLWKVDIDVSIQSYYEGHVDAI

Query:  I--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGYSRNKFTWRRGKNKRN
           ++T   W  +G Y  PE   +  +W L+  L+      W+  GD N+I+  +EK+GG  R+  Q +L R T+  C L D G+    FTW  G+    
Subjt:  I--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGYSRNKFTWRRGKNKRN

Query:  QIYERLDRYLIN
         I  RLDR L N
Subjt:  QIYERLDRYLIN

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] (e value: 1.1e-70; % identity: 31.9)
Query:  ENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFD
        ++ S++ EKL +   +   +  I+    E  ++ +  +++ K +T K IN E+FKS +  IW+ + EV ++ +G N+F+  F+     +++ EGGPW FD
Subjt:  ENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFD

Query:  RSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVK
        + LL+  E  G E+ T L FRY  FWI   NLP  C +R+    LG  VG  + +D  E G+C G+ +R+RV I+V  PLKR +++ +G   +   + + 
Subjt:  RSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVK

Query:  YEKLPDYCYGCGRIGHHMRDCEEVEKE--SDETIQFGPWLKYDSPIETRSKDNQKIGKGTPRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPD
        YE+LP++CY CG+IGH +RDC    KE  S  + +FGPW++  S   TRSK                    GTG             E + S E S E  
Subjt:  YEKLPDYCYGCGRIGHHMRDCEEVEKE--SDETIQFGPWLKYDSPIETRSKDNQKIGKGTPRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPD

Query:  RKHRPELPEVDGEAVEARGNGENTLERNPR----AAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGP
             E   V G      G   + L R+        E +   T   KT   +      + +  +   + LA    K  E + + +   S+ ++  +    
Subjt:  RKHRPELPEVDGEAVEARGNGENTLERNPR----AAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGP

Query:  KEKEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKGKSGGL
               V +    E V +Q      E  G ++ ++    K+   + R      ++G ++LGK  + G    E   D+ K  +     F+V   G+ GGL
Subjt:  KEKEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKGKSGGL

Query:  LLLWKVDIDVSIQSYYEGHVDAIIKKTHGF-WRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRC
         LLWK DI+VSI+S+ +GH+DA+IK +    WRF+GFY  P    R  SW L+ +L  M++  WI+ GDFNEI+  +EKKGG  R+ +  S FRE ++ C
Subjt:  LLLWKVDIDVSIQSYYEGHVDAIIKKTHGF-WRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRC

Query:  KLMDGGYSRNKFTWRRGKNKRNQIYERLDR
         LMD GY  NK+TW   + K   I ER+DR
Subjt:  KLMDGGYSRNKFTWRRGKNKRNQIYERLDR

TrEMBL top hits (e value, % identity, alignment)
A0A2Z6LV25 Uncharacterized protein (Fragment) (e value: 1.1e-58; % identity: 28.57)
Query:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY
        ++++ I +E+E++   D+    T+V KI TE   N+ +FK  M + W+    + I+ +  NLF   F TR+    V + GPW+FDR+LLI   I G+E+ 
Subjt:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY

Query:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG
        + L     SFW+   +LP    S   A  LGN VG FE +D+ E  +  G+ LR+RV +++ +PLKR  K+      ++ W+  KYE+LP++C+ CGRIG
Subjt:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG

Query:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK
        H MRDCE++E        E +E  Q FGPWL+  SP+   S + +K    +       P     RG N GT        AKE +EE E  ++ + +    
Subjt:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK

Query:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDK-GKKVSGIFERNRHVGLAEEDDK---KSESMRKGALNPSQSLDVGLIDGPKE
        H+    +     +    +G+   ++N +   +    +     +S + K G+++  + E      + +   K   +  +   G       +DV + DG  E
Subjt:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDK-GKKVSGIFERNRHVGLAEEDDK---KSESMRKGALNPSQSLDVGLIDGPKE

Query:  KEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSK---GMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----
                            +G ++KV  +VV K+        ++R    E+ +    MET         R    + + I+ +L F+N   V C G    
Subjt:  KEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSK---GMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----

Query:  KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFR
        + GGL L+W   + V+I SY   H+      +++ G W  +G Y  PE   +  +W L+  L+  N   W+  GDFN+I+   EK+GG  R+ SQ S  R
Subjt:  KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFR

Query:  ETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK
        + +  C  +D                     E ++R+             +VNHL    SDH  +V S +     S TR   RL RFEE WTK
Subjt:  ETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK

A0A392M033 CCHC-type domain-containing protein (Fragment) (e value: 8.6e-69; % identity: 29.68)
Query:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY
        ++++ I +EDE++   D+    T+V KI TE   N+ +FK  M + W+    + I+ +  NL+   F T++    V   GPW+FDR+LLI   I G+E+ 
Subjt:  EKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERY

Query:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG
        + L   + SFW+   +LP    S   A  LGN VG FE +D  E  +  G+ LR+RV +++ +PLKR  K+      ++ W+  KYE+LP++C+ CGRIG
Subjt:  TTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIG

Query:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK
        H MRDCE++E        E +E  Q FGPWL+  SP+   S + +K    +       P     +G N GT        AKE DEE E  +  S++   +
Subjt:  HHMRDCEEVE-------KESDETIQ-FGPWLKYDSPIETRSKDNQKIGKGT-------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPDRK

Query:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQR
         +     +  +A       +N  +     AE   T T + +T  G+       G   R     + +   +K+ +   G +     +DV + DG  E    
Subjt:  HRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQR

Query:  NVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGT--------TMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG---
                        KG ++K+  +VV K+ +   G+         + R   P+    MET         R    + + I+ +L F+N  +V C G   
Subjt:  NVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGT--------TMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG---

Query:  -KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLF
         ++GGL L+W   + V+I S+   H+      +++   W  +G Y  PE   +  +W L+  L+  N   W+  GDFN+I+   EK+GG  R+ +Q S+ 
Subjt:  -KSGGLLLLWKVDIDVSIQSYYEGHVDAII--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLF

Query:  RETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK
        R+ +    L+D G+    FTW  G+ +   +  RLDR + N +   +    +VNHL    SDH  +V   +  +  S TR   RL RFEE WTK
Subjt:  RETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK

A0A392M948 Zinc CCHC-type-like protein (Fragment) (e value: 4.4e-57; % identity: 29.74)
Query:  KILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKW
        KI TE   N+ +FK  M + W+    V+I+ +  NLF   F T+K    V + GPW+FDR+LLI   I G+E+ + L     SFW+   +LP    S   
Subjt:  KILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKW

Query:  ATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIGHHMRDCEEVE-------KESDETIQ-
        A  LGN +G+FE +D+ +  +  G+ LR+RV +++ +PLKR  K+      +  W+  KYE+LP++C+ CGRIGH MRDCEEVE       +E +E  Q 
Subjt:  ATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIGHHMRDCEEVE-------KESDETIQ-

Query:  FGPWLKYDSPIETRSKDNQKIGKGT------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEE--SEEPDRKHRPELPEVDGEAVEARGNGENTLER
        FGPWL+     +   +  ++   GT      P     +G + GT + +E    +E D++   SE E   +  D+ H   +  V    VE  G+ +  +E 
Subjt:  FGPWLKYDSPIETRSKDNQKIGKGT------PRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEE--SEEPDRKHRPELPEVDGEAVEARGNGENTLER

Query:  NPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASNERVEDQNKKGSKEKVGS
            AE     T +   +S +DK                 +    + + +R     P       L++  K  ++  V  RA+   + D  ++  K+K   
Subjt:  NPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASNERVEDQNKKGSKEKVGS

Query:  EV----VHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----KSGGLLLLWKVDIDVSIQSYYEGHVDAI
        E+    V   + ++    + R   P+    MET         R    + D I+ +L F+N  SV C+G    ++GG+ L+W   + ++I SY   H+   
Subjt:  EV----VHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKG----KSGGLLLLWKVDIDVSIQSYYEGHVDAI

Query:  I--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGYSRNKFTWRRGKNKRN
           ++T   W  +G Y  PE   +  +W L+  L+      W+  GD N+I+  +EK+GG  R+  Q +L R T+  C L D G+    FTW  G+    
Subjt:  I--KKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGYSRNKFTWRRGKNKRN

Query:  QIYERLDRYLIN
         I  RLDR L N
Subjt:  QIYERLDRYLIN

A0A5C7H9Y2 CCHC-type domain-containing protein (e value: 5.4e-71; % identity: 31.9)
Query:  ENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFD
        ++ S++ EKL +   +   +  I+    E  ++ +  +++ K +T K IN E+FKS +  IW+ + EV ++ +G N+F+  F+     +++ EGGPW FD
Subjt:  ENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFD

Query:  RSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVK
        + LL+  E  G E+ T L FRY  FWI   NLP  C +R+    LG  VG  + +D  E G+C G+ +R+RV I+V  PLKR +++ +G   +   + + 
Subjt:  RSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVK

Query:  YEKLPDYCYGCGRIGHHMRDCEEVEKE--SDETIQFGPWLKYDSPIETRSKDNQKIGKGTPRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPD
        YE+LP++CY CG+IGH +RDC    KE  S  + +FGPW++  S   TRSK                    GTG             E + S E S E  
Subjt:  YEKLPDYCYGCGRIGHHMRDCEEVEKE--SDETIQFGPWLKYDSPIETRSKDNQKIGKGTPRRGRGRGVNRGTGRTHEHSGAKESDEEDEASEEESEEPD

Query:  RKHRPELPEVDGEAVEARGNGENTLERNPR----AAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGP
             E   V G      G   + L R+        E +   T   KT   +      + +  +   + LA    K  E + + +   S+ ++  +    
Subjt:  RKHRPELPEVDGEAVEARGNGENTLERNPR----AAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGALNPSQSLDVGLIDGP

Query:  KEKEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKGKSGGL
               V +    E V +Q      E  G ++ ++    K+   + R      ++G ++LGK  + G    E   D+ K  +     F+V   G+ GGL
Subjt:  KEKEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKGKSGGL

Query:  LLLWKVDIDVSIQSYYEGHVDAIIKKTHGF-WRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRC
         LLWK DI+VSI+S+ +GH+DA+IK +    WRF+GFY  P    R  SW L+ +L  M++  WI+ GDFNEI+  +EKKGG  R+ +  S FRE ++ C
Subjt:  LLLWKVDIDVSIQSYYEGHVDAIIKKTHGF-WRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRC

Query:  KLMDGGYSRNKFTWRRGKNKRNQIYERLDR
         LMD GY  NK+TW   + K   I ER+DR
Subjt:  KLMDGGYSRNKFTWRRGKNKRNQIYERLDR

A0A7N2R0C3 Reverse transcriptase domain-containing protein (e value: 2.0e-57; % identity: 27.06)
Query:  MEENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWN
        M E      ++LK+++ E+E ++ + DE +    +   + V  K+++ K + VE+ +  +  +WK    + +  +G  LF   F+  +  R+V +  PW+
Subjt:  MEENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKINVESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWN

Query:  FDRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQ
        +++ L++F+E  G E    +  +++ FW+   NLP    +++    +G ++G F  VDV+E G   G  LRVRV+I+V   L R  KI +    E  W+ 
Subjt:  FDRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEHGKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQ

Query:  VKYEKLPDYCYGCGRIGHHMRDC-EEVEKE---SDETIQFGPWLKYDSPI----------------ETRSKDNQKIGKGTPRRGRGRGVNR---------
         KYE+LP++CY CG + H ++DC EE  K+    +  +Q+G WL+   PI                E ++K+N K  +   R     GV R         
Subjt:  VKYEKLPDYCYGCGRIGHHMRDC-EEVEKE---SDETIQFGPWLKYDSPI----------------ETRSKDNQKIGKGTPRRGRGRGVNR---------

Query:  -GTGRTHEHSGAKESDEEDEASEEE---SEEPDRKHRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLA
         G  R   H       E     + E   +E   R H  EL EV GE     GNG    E    A +K +    N K +   + G     + + +  VGL 
Subjt:  -GTGRTHEHSGAKESDEEDEASEEE---SEEPDRKHRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLA

Query:  EEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPE-------------SSKGM
           DK  +       +P +      + GP     + ++ RA  +    ++    + K   ++  +EI      +  R T P              S+  +
Subjt:  EEDDKKSESMRKGALNPSQSLDVGLIDGPKEKEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPE-------------SSKGM

Query:  ETLGKERQTGQ---------RQNERKCDKIKRELNFENWFSVQCKGKSGGLLLLWKVDIDVSIQSYYEGHVDAIIKKTHGF--WRFSGFYRNPETEKRHF
          L  E + G          + ++R+   ++R+L      +V   G+SGGL +LW+  +DVS++S    H+D ++  ++G   WR +GFY +P+   R  
Subjt:  ETLGKERQTGQ---------RQNERKCDKIKRELNFENWFSVQCKGKSGGLLLLWKVDIDVSIQSYYEGHVDAIIKKTHGF--WRFSGFYRNPETEKRHF

Query:  SWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNH
        SW L+E LS   +  W++ GDFNEI++ +EK G  +R+  Q   FRE ++ C L+D G+   +FTW  G+    +   RLDR + N +        +V H
Subjt:  SWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGYSRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNH

Query:  LSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK
         S  +SDH  +  S    +     R   R   FEE WT+
Subjt:  LSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.4e-10; % identity: 27.14)
Query:  PRVTHCGPRNFEEQSKRQNPNLDLLSLESLRNQGE--PPPPANCIKLNFDASWNEEQKMGGIGWIFCDSSGSPIGLGCKPIKRNWSIKCLEMMAIKEGLE
        P V      +FEE S R+         +  RN       PP   +K N DA+W  E    GIGWI  + SG  + +G + + R  ++   E+ A++  + 
Subjt:  PRVTHCGPRNFEEQSKRQNPNLDLLSLESLRNQGE--PPPPANCIKLNFDASWNEEQKMGGIGWIFCDSSGSPIGLGCKPIKRNWSIKCLEMMAIKEGLE

Query:  AYRRENRSNPYPIIVEADASKVIRALNHEVEDLSESKLMMNEIENLAPFCNVVAFVKCPRAGNLIAHHLARMAAGFPSARSSSSEVVDGECLFWVSSTL
           R N      II E+DA  ++  LN + +     +  + +I+ L      V F   PR GN +A  +AR +  F +       +V      W+ STL
Subjt:  AYRRENRSNPYPIIVEADASKVIRALNHEVEDLSESKLMMNEIENLAPFCNVVAFVKCPRAGNLIAHHLARMAAGFPSARSSSSEVVDGECLFWVSSTL

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 7.8e-06; % identity: 25.29)
Query:  PPANCIKLNFDASWNEEQKMGGIGWIFCDSSGSPIGLGCKPIKRNWSIKCLEMMAIKEGLEAYRRENRSNPYPIIVEADASKVIRALNHEVEDLSESKLM
        PPA  +K NFDA ++ ++     GWI  +  G+PI  G   +    +    E  A+   L A ++        + +E D   +I  +N     +S    +
Subjt:  PPANCIKLNFDASWNEEQKMGGIGWIFCDSSGSPIGLGCKPIKRNWSIKCLEMMAIKEGLEAYRRENRSNPYPIIVEADASKVIRALNHEVEDLSESKLM

Query:  MNEIENLAPFCNVVAFVK---CPRAGNLIAHHLARMAAGFPSARSSSSEVVDGECLFWVSSTLEDDDSFCRDVN
         N +E+++ + N  A ++     R GN +AH LA+    + +  S S     G    W+      D  FC D N
Subjt:  MNEIENLAPFCNVVAFVK---CPRAGNLIAHHLARMAAGFPSARSSSSEVVDGECLFWVSSTLEDDDSFCRDVN

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 4.0e-10; % identity: 28.26)
Query:  PPPANCIKLNFDASWNEEQKMGGIGWIFCDSSGSPIGLGCKPIKRNWSIKCLEMMAIKEGLEAYRRENRSNPYPIIVEADASKVIRALNHEVEDLSESKL
        PPP   +K N DA+WN + +  GIGW+  +  G    +G + + +  S+   E+ A++  + +  R   +    +I E+D+  +I  LN++ E     K 
Subjt:  PPPANCIKLNFDASWNEEQKMGGIGWIFCDSSGSPIGLGCKPIKRNWSIKCLEMMAIKEGLEAYRRENRSNPYPIIVEADASKVIRALNHEVEDLSESKL

Query:  MMNEIENLAPFCNVVAFVKCPRAGNLIAHHLARMAAGF
         + +++ L      V FV  PR GN +A  +AR +  F
Subjt:  MMNEIENLAPFCNVVAFVKCPRAGNLIAHHLARMAAGF


Sequences
CDS sequence
ATGAATGGGCGGTGGTTTAAATGTGAAATGGGTTTTGAAGACTTTGGAATTCTCCCGTTGTGCTGTGGATCTTTAATGGCGGAGGAACCGATCGTTGACAAATGGAGGAT
TGCATCTGTGCTTGATCTTTTAAGTGCTTTTGGCATTTACGGAAAATTGGGAGTCAAGGAAACCATGGAAGAAAACTTCAGTAAGCAAATGGAGAAACTAAAAATTTCAA
AAGGAGAAAAGGAACAAGTGATAGGGATAGAAGATGAGGACTTGGAAGAGCATGACAAAGAGATAGGAGAGACAGTTGTGTGCAAGATACTAACAGAGAAGAAAATTAAC
GTTGAGAGCTTCAAATCAATGATGCCAAAAATCTGGAAGATGGAAGGAGAAGTGAATATAAAAAAAGTGGGAACAAACCTCTTTGAATGCACCTTCAAAACCAGAAAAGG
AATGAGGAAAGTGACGGAAGGGGGACCATGGAATTTTGACAGAAGTCTTCTTATCTTTGAGGAAATCAGAGGACATGAGAGATACACAACCCTCAACTTCAGGTATGCAT
CATTTTGGATTCATTTTATTAACTTACCAAAAGTGTGTTTCTCCAGGAAATGGGCAACAATGCTAGGAAACGCAGTGGGCATTTTTGAAAGAGTGGACGTAGATGAACAT
GGTAAATGTGGAGGAGAGACCCTTAGAGTCAGAGTCAAAATTGAAGTCCATGAACCGCTAAAAAGGGCAGTCAAGATCAAAGTGGGAACCATGGTAGAGAAGGAATGGAT
ACAAGTCAAATACGAAAAGTTACCAGACTATTGTTATGGGTGCGGGAGGATTGGCCATCATATGAGAGACTGTGAAGAGGTGGAAAAAGAAAGCGATGAAACTATTCAGT
TCGGGCCATGGCTAAAATATGATTCACCCATTGAAACAAGGAGTAAAGATAACCAAAAGATAGGAAAAGGAACACCCAGACGGGGTAGGGGACGCGGAGTTAACAGAGGG
ACGGGCAGAACTCACGAGCATAGTGGTGCCAAGGAAAGCGATGAGGAAGATGAAGCTTCTGAGGAAGAAAGCGAAGAACCTGACCGGAAACACAGGCCTGAGCTGCCGGA
AGTCGACGGAGAAGCGGTGGAAGCAAGAGGGAACGGTGAGAACACGTTGGAACGGAACCCGAGAGCAGCAGAAAAAGAACAAACGACTACTTGCAACGAAAAGACAGTGT
CAGGAAAGGACAAAGGGAAAAAGGTCAGTGGAATTTTCGAGAGAAATAGGCATGTGGGACTCGCAGAGGAAGACGACAAAAAAAGTGAAAGTATGAGAAAAGGGGCTCTG
AATCCGAGCCAATCATTAGATGTGGGCCTCATTGATGGGCCAAAAGAAAAAGAGCAAAGGAACGTAATGGACCGGGCAAGTAATGAACGTGTAGAAGACCAGAATAAAAA
AGGAAGCAAAGAGAAGGTAGGAAGTGAAGTTGTTCATAAGGAAATAATGATCAAGCAAGGAACCACTATGGTTAGAACTACAGAACCAGAATCCTCAAAGGGTATGGAAA
CGCTAGGAAAAGAAAGGCAAACAGGCCAGAGGCAGAATGAGAGGAAGTGTGATAAGATAAAAAGAGAGTTGAATTTTGAGAATTGGTTCAGTGTTCAGTGTAAGGGAAAG
AGCGGCGGTCTCCTGTTGTTGTGGAAGGTTGACATTGATGTTAGCATTCAGTCTTACTACGAGGGTCATGTGGATGCCATTATTAAAAAAACTCATGGCTTTTGGAGATT
CTCAGGTTTTTACAGAAACCCAGAAACAGAAAAACGACACTTTTCTTGGATCCTAATGGAAAAATTGAGTGAAATGAATGATACAAGTTGGATAATAGGTGGAGATTTCA
ATGAGATTGTATCAGATGAGGAAAAGAAAGGCGGTGCAAAAAGAAATCCAAGTCAAAGGAGTCTCTTCAGAGAGACCATAAACCGTTGCAAGTTAATGGATGGGGGCTAC
TCTAGAAATAAGTTTACGTGGAGAAGGGGGAAAAACAAGAGAAATCAAATCTATGAAAGACTCGATAGATATCTTATCAATCATGACATGGCTATGAAAGTGGTGAACTT
TAGAGTTAACCACCTTAGTTTTATGAGCTCAGACCATAGGCCTATTGTGGCCAGTTGGGATTTTGCTGATGAAATCTCAAAAACAAGGAGCGAGGGCAGATTGTTAAGAT
TTGAAGAAGGGTGGACCAAATTTAAAGAGACAAAAGGCAATGGAGATTATACTTATATTGATCAGGACCCGTGGCTAATTCGTCAGGGCAATAGGTCTCCCTTGTGGGTG
CTGGAGGAGCTGAGAGGCAGAAGAGTCAAGGACATCATCAGGGAGGATGGGACTTGGGACACAGAACGGATCAAAAGAGAGTTTATGCCCATGGATGCCGAAGACATACT
GGCAATCCCCCTAGGCAACAGAGAGGAAAAAGACGAAATCATATGGAACCTTGACTCCAAAGGGATATTCAGCGTTAAGAGTGCCTACCAACTCGCCCAAAGAAGGCTAA
ATGCTTCGGCTGCCTCAGGAAAGCTAGGGAGACCACGAGTCACGCACTGTGGACCTCGTAATTTCGAGGAGCAGAGCAAGAGGCAGAATCCGAACCTTGACTTGCTGAGC
TTGGAGAGCCTGCGGAATCAGGGGGAGCCCCCCCCTCCCGCCAACTGCATCAAGCTCAACTTTGATGCCTCGTGGAATGAGGAGCAGAAAATGGGAGGAATAGGGTGGAT
TTTTTGTGATTCTTCAGGGTCTCCCATAGGCTTGGGCTGTAAACCAATTAAAAGAAATTGGTCGATCAAGTGTCTTGAGATGATGGCAATCAAAGAAGGTCTCGAGGCTT
ACCGAAGGGAGAATCGGAGCAATCCCTATCCAATTATCGTAGAGGCAGACGCCTCTAAGGTGATTAGAGCTCTGAATCACGAAGTAGAGGACCTCTCGGAATCGAAGCTG
ATGATGAACGAGATAGAGAATTTGGCCCCTTTTTGCAATGTAGTCGCTTTCGTCAAATGCCCGAGGGCTGGAAACCTGATAGCGCATCACCTTGCGCGCATGGCGGCGGG
ATTCCCGTCGGCGAGAAGCTCATCGTCTGAAGTCGTCGATGGGGAATGTTTGTTTTGGGTCTCTTCCACGCTGGAAGATGACGATTCTTTTTGTAGGGACGTTAATGTCC
CAACCTGGCTTCCCTCCTTTATTTTTGAGGAAGTAGTTGTGAACGATTATATTTCTCTTTAA
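
As a quick consistency check, the start of the CDS above can be translated and compared with the start of the protein sequence listed in this record. The sketch below is not part of the database's own tooling. It assumes the standard genetic code, and the names `CODON_TABLE`, `translate` and `CDS_PREFIX` are hypothetical. `CDS_PREFIX` holds the first 45 nucleotides copied from the CDS sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 45 nucleotides of the CDS sequence in this record.
CDS_PREFIX = "ATGAATGGGCGGTGGTTTAAATGTGAAATGGGTTTTGAAGACTTT"
print(translate(CDS_PREFIX))
```

The translated prefix reads MNGRWFKCEMGFEDF, which agrees with the first 15 residues of the protein sequence given for Spg028183.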
mRNA sequence
ATGAATGGGCGGTGGTTTAAATGTGAAATGGGTTTTGAAGACTTTGGAATTCTCCCGTTGTGCTGTGGATCTTTAATGGCGGAGGAACCGATCGTTGACAAATGGAGGAT
TGCATCTGTGCTTGATCTTTTAAGTGCTTTTGGCATTTACGGAAAATTGGGAGTCAAGGAAACCATGGAAGAAAACTTCAGTAAGCAAATGGAGAAACTAAAAATTTCAA
AAGGAGAAAAGGAACAAGTGATAGGGATAGAAGATGAGGACTTGGAAGAGCATGACAAAGAGATAGGAGAGACAGTTGTGTGCAAGATACTAACAGAGAAGAAAATTAAC
GTTGAGAGCTTCAAATCAATGATGCCAAAAATCTGGAAGATGGAAGGAGAAGTGAATATAAAAAAAGTGGGAACAAACCTCTTTGAATGCACCTTCAAAACCAGAAAAGG
AATGAGGAAAGTGACGGAAGGGGGACCATGGAATTTTGACAGAAGTCTTCTTATCTTTGAGGAAATCAGAGGACATGAGAGATACACAACCCTCAACTTCAGGTATGCAT
CATTTTGGATTCATTTTATTAACTTACCAAAAGTGTGTTTCTCCAGGAAATGGGCAACAATGCTAGGAAACGCAGTGGGCATTTTTGAAAGAGTGGACGTAGATGAACAT
GGTAAATGTGGAGGAGAGACCCTTAGAGTCAGAGTCAAAATTGAAGTCCATGAACCGCTAAAAAGGGCAGTCAAGATCAAAGTGGGAACCATGGTAGAGAAGGAATGGAT
ACAAGTCAAATACGAAAAGTTACCAGACTATTGTTATGGGTGCGGGAGGATTGGCCATCATATGAGAGACTGTGAAGAGGTGGAAAAAGAAAGCGATGAAACTATTCAGT
TCGGGCCATGGCTAAAATATGATTCACCCATTGAAACAAGGAGTAAAGATAACCAAAAGATAGGAAAAGGAACACCCAGACGGGGTAGGGGACGCGGAGTTAACAGAGGG
ACGGGCAGAACTCACGAGCATAGTGGTGCCAAGGAAAGCGATGAGGAAGATGAAGCTTCTGAGGAAGAAAGCGAAGAACCTGACCGGAAACACAGGCCTGAGCTGCCGGA
AGTCGACGGAGAAGCGGTGGAAGCAAGAGGGAACGGTGAGAACACGTTGGAACGGAACCCGAGAGCAGCAGAAAAAGAACAAACGACTACTTGCAACGAAAAGACAGTGT
CAGGAAAGGACAAAGGGAAAAAGGTCAGTGGAATTTTCGAGAGAAATAGGCATGTGGGACTCGCAGAGGAAGACGACAAAAAAAGTGAAAGTATGAGAAAAGGGGCTCTG
AATCCGAGCCAATCATTAGATGTGGGCCTCATTGATGGGCCAAAAGAAAAAGAGCAAAGGAACGTAATGGACCGGGCAAGTAATGAACGTGTAGAAGACCAGAATAAAAA
AGGAAGCAAAGAGAAGGTAGGAAGTGAAGTTGTTCATAAGGAAATAATGATCAAGCAAGGAACCACTATGGTTAGAACTACAGAACCAGAATCCTCAAAGGGTATGGAAA
CGCTAGGAAAAGAAAGGCAAACAGGCCAGAGGCAGAATGAGAGGAAGTGTGATAAGATAAAAAGAGAGTTGAATTTTGAGAATTGGTTCAGTGTTCAGTGTAAGGGAAAG
AGCGGCGGTCTCCTGTTGTTGTGGAAGGTTGACATTGATGTTAGCATTCAGTCTTACTACGAGGGTCATGTGGATGCCATTATTAAAAAAACTCATGGCTTTTGGAGATT
CTCAGGTTTTTACAGAAACCCAGAAACAGAAAAACGACACTTTTCTTGGATCCTAATGGAAAAATTGAGTGAAATGAATGATACAAGTTGGATAATAGGTGGAGATTTCA
ATGAGATTGTATCAGATGAGGAAAAGAAAGGCGGTGCAAAAAGAAATCCAAGTCAAAGGAGTCTCTTCAGAGAGACCATAAACCGTTGCAAGTTAATGGATGGGGGCTAC
TCTAGAAATAAGTTTACGTGGAGAAGGGGGAAAAACAAGAGAAATCAAATCTATGAAAGACTCGATAGATATCTTATCAATCATGACATGGCTATGAAAGTGGTGAACTT
TAGAGTTAACCACCTTAGTTTTATGAGCTCAGACCATAGGCCTATTGTGGCCAGTTGGGATTTTGCTGATGAAATCTCAAAAACAAGGAGCGAGGGCAGATTGTTAAGAT
TTGAAGAAGGGTGGACCAAATTTAAAGAGACAAAAGGCAATGGAGATTATACTTATATTGATCAGGACCCGTGGCTAATTCGTCAGGGCAATAGGTCTCCCTTGTGGGTG
CTGGAGGAGCTGAGAGGCAGAAGAGTCAAGGACATCATCAGGGAGGATGGGACTTGGGACACAGAACGGATCAAAAGAGAGTTTATGCCCATGGATGCCGAAGACATACT
GGCAATCCCCCTAGGCAACAGAGAGGAAAAAGACGAAATCATATGGAACCTTGACTCCAAAGGGATATTCAGCGTTAAGAGTGCCTACCAACTCGCCCAAAGAAGGCTAA
ATGCTTCGGCTGCCTCAGGAAAGCTAGGGAGACCACGAGTCACGCACTGTGGACCTCGTAATTTCGAGGAGCAGAGCAAGAGGCAGAATCCGAACCTTGACTTGCTGAGC
TTGGAGAGCCTGCGGAATCAGGGGGAGCCCCCCCCTCCCGCCAACTGCATCAAGCTCAACTTTGATGCCTCGTGGAATGAGGAGCAGAAAATGGGAGGAATAGGGTGGAT
TTTTTGTGATTCTTCAGGGTCTCCCATAGGCTTGGGCTGTAAACCAATTAAAAGAAATTGGTCGATCAAGTGTCTTGAGATGATGGCAATCAAAGAAGGTCTCGAGGCTT
ACCGAAGGGAGAATCGGAGCAATCCCTATCCAATTATCGTAGAGGCAGACGCCTCTAAGGTGATTAGAGCTCTGAATCACGAAGTAGAGGACCTCTCGGAATCGAAGCTG
ATGATGAACGAGATAGAGAATTTGGCCCCTTTTTGCAATGTAGTCGCTTTCGTCAAATGCCCGAGGGCTGGAAACCTGATAGCGCATCACCTTGCGCGCATGGCGGCGGG
ATTCCCGTCGGCGAGAAGCTCATCGTCTGAAGTCGTCGATGGGGAATGTTTGTTTTGGGTCTCTTCCACGCTGGAAGATGACGATTCTTTTTGTAGGGACGTTAATGTCC
CAACCTGGCTTCCCTCCTTTATTTTTGAGGAAGTAGTTGTGAACGATTATATTTCTCTTTAA
Protein sequence
MNGRWFKCEMGFEDFGILPLCCGSLMAEEPIVDKWRIASVLDLLSAFGIYGKLGVKETMEENFSKQMEKLKISKGEKEQVIGIEDEDLEEHDKEIGETVVCKILTEKKIN
VESFKSMMPKIWKMEGEVNIKKVGTNLFECTFKTRKGMRKVTEGGPWNFDRSLLIFEEIRGHERYTTLNFRYASFWIHFINLPKVCFSRKWATMLGNAVGIFERVDVDEH
GKCGGETLRVRVKIEVHEPLKRAVKIKVGTMVEKEWIQVKYEKLPDYCYGCGRIGHHMRDCEEVEKESDETIQFGPWLKYDSPIETRSKDNQKIGKGTPRRGRGRGVNRG
TGRTHEHSGAKESDEEDEASEEESEEPDRKHRPELPEVDGEAVEARGNGENTLERNPRAAEKEQTTTCNEKTVSGKDKGKKVSGIFERNRHVGLAEEDDKKSESMRKGAL
NPSQSLDVGLIDGPKEKEQRNVMDRASNERVEDQNKKGSKEKVGSEVVHKEIMIKQGTTMVRTTEPESSKGMETLGKERQTGQRQNERKCDKIKRELNFENWFSVQCKGK
SGGLLLLWKVDIDVSIQSYYEGHVDAIIKKTHGFWRFSGFYRNPETEKRHFSWILMEKLSEMNDTSWIIGGDFNEIVSDEEKKGGAKRNPSQRSLFRETINRCKLMDGGY
SRNKFTWRRGKNKRNQIYERLDRYLINHDMAMKVVNFRVNHLSFMSSDHRPIVASWDFADEISKTRSEGRLLRFEEGWTKFKETKGNGDYTYIDQDPWLIRQGNRSPLWV
LEELRGRRVKDIIREDGTWDTERIKREFMPMDAEDILAIPLGNREEKDEIIWNLDSKGIFSVKSAYQLAQRRLNASAASGKLGRPRVTHCGPRNFEEQSKRQNPNLDLLS
LESLRNQGEPPPPANCIKLNFDASWNEEQKMGGIGWIFCDSSGSPIGLGCKPIKRNWSIKCLEMMAIKEGLEAYRRENRSNPYPIIVEADASKVIRALNHEVEDLSESKL
MMNEIENLAPFCNVVAFVKCPRAGNLIAHHLARMAAGFPSARSSSSEVVDGECLFWVSSTLEDDDSFCRDVNVPTWLPSFIFEEVVVNDYISL
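
The InterPro annotation IPR025836 names the zinc knuckle pattern CX2CX4HX4C. The sketch below shows one way to look for that pattern in a stretch of the protein sequence above using a regular expression. It is only an illustration: InterPro assigns domains with its own profile-based methods, not with this regex, and the names `ZINC_KNUCKLE`, `FRAGMENT` and `find_zinc_knuckles` are hypothetical. `FRAGMENT` is copied from the protein sequence in this record.

```python
import re

# CX2CX4HX4C written as a regular expression: C, any 2 residues, C,
# any 4 residues, H, any 4 residues, C.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

# A 47-residue stretch copied from the protein sequence of Spg028183.
FRAGMENT = "VKYEKLPDYCYGCGRIGHHMRDCEEVEKESDETIQFGPWLKYDSPIE"


def find_zinc_knuckles(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


hits = find_zinc_knuckles(FRAGMENT)
print(hits)
```

Within this fragment the pattern matches once, at CYGCGRIGHHMRDC, which lies in the region that aligns with the CCHC-containing segments of the homologs listed under Homology.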