; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g015950 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g015950
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionRNase H domain-containing protein
Genome locationChr05:21019037..21022124
RNA-Seq ExpressionLcy05g015950
SyntenyLcy05g015950
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMP06477.1 hypothetical protein CCACVL1_01552 [Corchorus capsularis]2.1e-7731.19Show/hide
Query:  LDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTYEMNQ
        LD LL +EE+ W+QR + +WLK  D NT++FH  AS R+++ +I+ I++      +++  I   F NYFK +FT+S P    +  VL H+  +VT +M  
Subjt:  LDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTYEMNQ

Query:  VLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDIVLLAYVMFLIRLSLSLIF
         L   +T  E+  A      ++APGPDG   LF+QK W VV    V+  L  LN+   + D NHTN+VLIPK   P L  D   ++    + R+    + 
Subjt:  VLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDIVLLAYVMFLIRLSLSLIF

Query:  LKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGG
         ++      L    L  FE ASGQ +N +KS V+FS N     +  L N L ++     D YLGLP    RSK R+F+FL DR+   +  W   LFS+ G
Subjt:  LKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGG

Query:  KEVLIKSIVTAIPTYAMGCFRIPKDVEVIKSLPIS--------STSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKV
        K V+I+++  A P Y M  F  PK      +  I+            D+ IW+    GE++V S Y ++     G     +   +  W+ +W   +  K+
Subjt:  KEVLIKSIVTAIPTYAMGCFRIPKDVEVIKSLPIS--------STSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKV

Query:  KVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIP--DPMIRCDWI---
        + F+W+   N +PT  NL    + I+G C VC  E     H  F C  ++ +W    P ++  + +Q D+   ++E   E  + I   D +    W+   
Subjt:  KVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIP--DPMIRCDWI---

Query:  --NNYLSEFWLANPKGGSVVQSKEDIVDII----SNGEEIIM------------------HTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSS
          N  L E + + P   ++V++   I+D +    S   EI+M                  +TDAS   ++  AG+G+V+RD  G ++A  +        S
Subjt:  --NNYLSEFWLANPKGGSVVQSKEDIVDII----SNGEEIIM------------------HTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSS

Query:  LEAKAMAVVEGLRLARNLGVDRLTILSDSLSLIKTINE
        L A+  A++ G  +A  LG+DR  + SDSL  I+ IN+
Subjt:  LEAKAMAVVEGLRLARNLGVDRLTILSDSLSLIKTINE

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber]6.5e-6625.94Show/hide
Query:  KRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTYEM
        K+LD LLL++E+YW QRSR  WLK GD NTK+FH KAS RR+RN I GI  + G W  + ++I      YF  +F +   D   M   LN VPRKVT EM
Subjt:  KRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTYEM

Query:  NQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPS---------------------
        +  L + +  EEV  A+    PT+APGPDG  ALFYQK+W VV +T V   LD LN+     D NHT IVLIPK ++P                      
Subjt:  NQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPS---------------------

Query:  ------------------------LVSDIVLLAY------------------------------------------------------------------
                                L++D VLLAY                                                                  
Subjt:  ------------------------LVSDIVLLAY------------------------------------------------------------------

Query:  -------------------------------------------------------VMFLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSM
                                                               +  L+    SLIF + +  E      IL+ + +ASGQS+N  KS 
Subjt:  -------------------------------------------------------VMFLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSM

Query:  VMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIPKDV------
        V FS N     +Q    IL ++     +SYLGLP+   RSK   F ++ DRVW  LQGWKG + S+ GKEVLIK++  AIPTY M  F++P  +      
Subjt:  VMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIPKDV------

Query:  ----------------------EVIKSLPISSTS--------------PDKWI-----------------------------------------------
                               ++ +LPI  +                D+WI                                               
Subjt:  ----------------------EVIKSLPISSTS--------------PDKWI-----------------------------------------------

Query:  -----------------WHYDAKGEYSVKSGYK----LSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCP
                         W ++  G ++VKS YK    LS      ES S  G +   W  +WK+R+P+K+KVF W++ H+ +PT  NL    + +   CP
Subjt:  -----------------WHYDAKGEYSVKSGYK----LSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCP

Query:  VCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIPDPMIRCDWINNYLSEFWLA-NPK-----GGSVV-----------
        +C    E+T HAL++C  A++IW      + +    Q+D            L+ + D + + + +  +L + WL  N +     GG ++           
Subjt:  VCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIPDPMIRCDWINNYLSEFWLA-NPK-----GGSVV-----------

Query:  ------QSKEDI-VDIISNGEEIIMHT-----------------DASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNL
              QS+  + VD        +M T                 DA+V    +++G G V+R+  G +MA  T        S  A+ +A  + L  A + 
Subjt:  ------QSKEDI-VDIISNGEEIIMHT-----------------DASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNL

Query:  GVDRLTILSDSLSLIKTINEDLQGEACIAAQIG
        G   L +  DS++  K I      ++ I   +G
Subjt:  GVDRLTILSDSLSLIKTINEDLQGEACIAAQIG

XP_024033484.1 uncharacterized protein LOC112095607 [Citrus clementina]3.2e-7329.4Show/hide
Query:  IHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRK
        I  LE ++  +L+ EE +WKQRSR  WLK GD NTK+FH KAS R+K+NRI GIE+ +G W    ++++  F  YF ++FT+SQP  ++++  L  +  +
Subjt:  IHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRK

Query:  VTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDI-----VLLAYV
        V+  MN+ L   +T E++T A+    PT+APGPDG PA FYQK+W+VV    ++ CL ILN +G +   NHT I L+PK+    L+  +     + ++++
Subjt:  VTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDI-----VLLAYV

Query:  MFLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVL
        +F      SLIF++A+  + G  + I   +  ASGQ  N  KS + FSKN   +    + NI  + V    + YLGLPS   R +S  F+ +  +V   +
Subjt:  MFLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVL

Query:  QGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIP-----------------------------------------------------------------
          W+   FS GGKEVLIK++  A+P YAM  F++P                                                                 
Subjt:  QGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIP-----------------------------------------------------------------

Query:  --------------------------------------------------------------KDVEVIKSLPI-SSTSPDKWIWHYDAKGEYSVKSGYKL
                                                                       D + I S+P+      D+ +WHYD +G YSVKSGY++
Subjt:  --------------------------------------------------------------KDVEVIKSLPI-SSTSPDKWIWHYDAKGEYSVKSGYKL

Query:  SMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW
        ++        S    T + W+ +  + +  K+++F+W++  N +P+M NL    V     C +C   +E+  HAL  C  AR++W
Subjt:  SMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]6.9e-7627.54Show/hide
Query:  DFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNH
        + S I   E++++ +LL+EE+YWKQRSR +WLK GD NTK+FH KAS R+++NRI G+ D +  W  D++ +++ F  YF  +FT+S P  K +   L  
Subjt:  DFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNH

Query:  VPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDIVLLAYVM
        +  +VT EMN  L + +T EEV+ A+    PT+APGPDG PA F+QK+WD V    +S CL ILN      +     +V   + RH   +      A V 
Subjt:  VPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDIVLLAYVM

Query:  FLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQ
         L+    SLIF +AA  +    + + + + +ASGQ  NF KS + FSK    D    +  I  ++V    + YLGLPS   R     F  +  RV + + 
Subjt:  FLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQ

Query:  GWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRI-------------------------------------------------------------------
         W+   F+ GGKEVLIK++  AIPTYAM  F+I                                                                   
Subjt:  GWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRI-------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---PKDVEVIKSLPI-SSTSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPIS
           P+D E I  +P+      D+ IWHYD KG YSVKSGY+++M     E  S      + W+ +WK+ +P KVK+F+W++ H+ +PT  NL    V   
Subjt:  ---PKDVEVIKSLPI-SSTSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPIS

Query:  GYCPVCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIPDPMIRCDWINNYLSEFWLANPKGGSVVQSKEDIVDIISNG
          C  CH  +ET  HAL +C RAR+IW +   ++   LR        W  L   P Q      +    +   L   W A  K       KE+ + +++N 
Subjt:  GYCPVCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIPDPMIRCDWINNYLSEFWLANPKGGSVVQSKEDIVDIISNG

Query:  EEII----------------------------------MHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVD
        E I+                                  ++ DA+V  ++  AG+G+V+RD +G   A    S  +  S   A+A A+  GL++A    + 
Subjt:  EEII----------------------------------MHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVD

Query:  RLTILSDSLSLIKTINE
             SDSL +I  IN+
Subjt:  RLTILSDSLSLIKTINE

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]6.2e-7725.79Show/hide
Query:  FSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHV
        + +I  +E +L+ L+ ++E YW+QRSR  WL+WGD NTK+FH KAS RRK+N I G++DS G WQ DK+ + +  E+Y+  +FTSS+ +      VL  +
Subjt:  FSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHV

Query:  PRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPS-------------
          KVT  MN+ L+A +T EEV  A+K  +PT+APG DG PALFYQK+W  +    ++ CL++LN    ++  N T + LIPK   P              
Subjt:  PRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPS-------------

Query:  --------------------------------LVSDIVLLAY----------------------------------------------------------
                                        L+ D  ++ Y                                                          
Subjt:  --------------------------------LVSDIVLLAY----------------------------------------------------------

Query:  ------------------------------------------------------VMF---------LIRLSLSLIFLKAAAREFGLFRIILKDFERASGQ
                                                              VMF         L     SL+FL+AA  E   F+ +L+ +  ASGQ
Subjt:  ------------------------------------------------------VMF---------LIRLSLSLIFLKAAAREFGLFRIILKDFERASGQ

Query:  SVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIPK
         VNF KS + F +NV    +  L  ++ ++V D+   YLGLPS   R+K + F+F+ +RVW+ L+GWKG+ FS   KEVLIK+IV AIPTY M CFR+PK
Subjt:  SVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIPK

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------DVEVIKSLPISSTS-PDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKV
                                     D ++I S+P S     DK +WHY   GEYSV+SGY+++      +  SD   T  WWK++WK+++P KVK 
Subjt:  -----------------------------DVEVIKSLPISSTS-PDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKV

Query:  FVWKSFHNSIPTMVNLCNHHVPISGYCPVC-HDEMETTDHALFQCPRAREIW------------------SFLHPHIMRNLRDQMDI-----KDRWY---
        FVWK  H+ IPT   L + ++ +  YC  C + E ET  H L+ C   RE+W                  +FL        ++++++      + WY   
Subjt:  FVWKSFHNSIPTMVNLCNHHVPISGYCPVC-HDEMETTDHALFQCPRAREIW------------------SFLHPHIMRNLRDQMDI-----KDRWY---

Query:  ELSHEPLQQIPDPMIRCDWINNYLSEFWLAN-PKGGSVVQSKEDIVDIISNGEEIIMHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEA
         ++H   +  P      +W + YLSEF  +N PKG      KE    +    EE  ++ D  V  +   +   +V+RD  G + +   +  M   + L+A
Subjt:  ELSHEPLQQIPDPMIRCDWINNYLSEFWLAN-PKGGSVVQSKEDIVDIISNGEEIIMHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEA

Query:  KAMAVVEGLRLARNLGVDRLTILSDSLSLIKTINEDLQGEACIAAQ
        +  A+          GV ++ IL     L+  I     GE   A +
Subjt:  KAMAVVEGLRLARNLGVDRLTILSDSLSLIKTINEDLQGEACIAAQ

TrEMBL top hitse value%identityAlignment
A0A1R3KHA2 Uncharacterized protein1.0e-7731.19Show/hide
Query:  LDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTYEMNQ
        LD LL +EE+ W+QR + +WLK  D NT++FH  AS R+++ +I+ I++      +++  I   F NYFK +FT+S P    +  VL H+  +VT +M  
Subjt:  LDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTYEMNQ

Query:  VLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDIVLLAYVMFLIRLSLSLIF
         L   +T  E+  A      ++APGPDG   LF+QK W VV    V+  L  LN+   + D NHTN+VLIPK   P L  D   ++    + R+    + 
Subjt:  VLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDIVLLAYVMFLIRLSLSLIF

Query:  LKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGG
         ++      L    L  FE ASGQ +N +KS V+FS N     +  L N L ++     D YLGLP    RSK R+F+FL DR+   +  W   LFS+ G
Subjt:  LKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGG

Query:  KEVLIKSIVTAIPTYAMGCFRIPKDVEVIKSLPIS--------STSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKV
        K V+I+++  A P Y M  F  PK      +  I+            D+ IW+    GE++V S Y ++     G     +   +  W+ +W   +  K+
Subjt:  KEVLIKSIVTAIPTYAMGCFRIPKDVEVIKSLPIS--------STSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKV

Query:  KVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIP--DPMIRCDWI---
        + F+W+   N +PT  NL    + I+G C VC  E     H  F C  ++ +W    P ++  + +Q D+   ++E   E  + I   D +    W+   
Subjt:  KVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIP--DPMIRCDWI---

Query:  --NNYLSEFWLANPKGGSVVQSKEDIVDII----SNGEEIIM------------------HTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSS
          N  L E + + P   ++V++   I+D +    S   EI+M                  +TDAS   ++  AG+G+V+RD  G ++A  +        S
Subjt:  --NNYLSEFWLANPKGGSVVQSKEDIVDII----SNGEEIIM------------------HTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSS

Query:  LEAKAMAVVEGLRLARNLGVDRLTILSDSLSLIKTINE
        L A+  A++ G  +A  LG+DR  + SDSL  I+ IN+
Subjt:  LEAKAMAVVEGLRLARNLGVDRLTILSDSLSLIKTINE

A0A2N9EHR8 F-box domain-containing protein7.2e-7930.96Show/hide
Query:  SSRSSIDFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDM
        +S    D   +  L++ L  LL +EE  W+QRSR  WLK GDSNT++FH +A+ R+  N I  +++++G W  ++D++ + F  Y+  +FT+  P   + 
Subjt:  SSRSSIDFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDM

Query:  ACVLNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDI-
          V+ ++   VT EMN+ L   +T +EV  A+K   P +APG DG P LFYQKYW +         L  LNS   +Q  NHT+I  IPK ++P  VSD  
Subjt:  ACVLNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDI-

Query:  -VLLAYVMFLI-----------RLSLSLI-------FLKAAAREFGL-------------------------------------FRIILKDFERASGQSV
         + L  V++ I           +L +S         FL+   ++ G                                       + IL  +E+ASGQ V
Subjt:  -VLLAYVMFLI-----------RLSLSLI-------FLKAAAREFGL-------------------------------------FRIILKDFERASGQSV

Query:  NFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIPK--
        N +K+ + FSK+  Q  +Q +   L + V      YLGLPS   R+K   F  + +RVWS L+GWK  L SQ G+EVLIKS+  AIP++AM CFR+P   
Subjt:  NFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIPK--

Query:  --DVEVIKSLPISSTSPDKWIWHYDAKGEYSVKSGYKLSMLNS--QGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGY
          ++EV+        + DK        G YSV+SGY+L M  S     S S+  +    W  +W ++VPSKV+ F+W S H+S+PT  NL   H+     
Subjt:  --DVEVIKSLPISSTSPDKWIWHYDAKGEYSVKSGYKLSMLNS--QGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGY

Query:  CPVCHDEMETTDHALFQCPRAREIWSFL---HPHIMRNLRDQMDIK--------------------DRWYELSHEPLQQIPDPMIR-CDWINNYLSEFWL
        C +C   +ETT HAL+ CP    +W  L      +  N  + +D+                       WY  + + L Q  +P  R        L+EF  
Subjt:  CPVCHDEMETTDHALFQCPRAREIWSFL---HPHIMRNLRDQMDIK--------------------DRWYELSHEPLQQIPDPMIR-CDWINNYLSEFWL

Query:  ANPKGGSVVQSKEDIVDIISNGEEI---IMHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVDRLTILSDSL
        A+ +     Q   + + I  N  E+    ++ D +V  +   AGIG+++R+ NG  MA         +S    +A A     +LA ++G+  + I  DS 
Subjt:  ANPKGGSVVQSKEDIVDIISNGEEI---IMHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVDRLTILSDSL

Query:  SLIKTINEDLQGEACIA
         ++  +   L    C+A
Subjt:  SLIKTINEDLQGEACIA

A0A2N9G219 RNase H domain-containing protein3.0e-6926.14Show/hide
Query:  LEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTY
        + K +  LLL+EE  WKQRSRE+WLK GD NTK+FH +AS R++RN I  +  +NG+  +D++ I   F  Y++ +FT +QP  +D   VL+ +   VT 
Subjt:  LEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVPRKVTY

Query:  EMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPS-------------------
        EMNQ L   +T EEV  A+K   P +APGPDG P +FYQ YW VV     +  L  L S   +   NHT + LIPK++ P                    
Subjt:  EMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPS-------------------

Query:  --------------------------LVSDIVLLAYVMF----------LIRLSLSLIFLKAAAR-EFGLFR-IILKDFERASGQSVNFSKSMVMFSKNV
                                  L++D +L+A+             +  ++L L   KA  R E+G  +  IL  +E+ASGQ +N +K+ + FS+N 
Subjt:  --------------------------LVSDIVLLAYVMF----------LIRLSLSLIFLKAAAR-EFGLFR-IILKDFERASGQSVNFSKSMVMFSKNV

Query:  LQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFR------------------
         Q +++ +  IL +      + YLGLPS   + K   F  + +RVWS ++GWK  L SQ G+E+LIK++V AIPTY M CF+                  
Subjt:  LQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFR------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------IPKDVEVIKSLPISS-TSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGE--SLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCN
              +P DVE I  +P+S     D+  W     G+YSV+SGYKL   + +      S     +  WK +W+ RVP+K++ F+W++ H+S+PT + L  
Subjt:  ------IPKDVEVIKSLPISS-TSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGE--SLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCN

Query:  HHVPISGYCPVCHDEMETTDHALFQCPRAREIWSF-----------------LHPHIMRNLRDQMDIKDR------WYELSHEPLQQIPDPMIR-CDWIN
          V  +  C  C  + E + HAL++CP    +WS                  L   I+++  D +  K        W++ + + L+   DP  +     +
Subjt:  HHVPISGYCPVCHDEMETTDHALFQCPRAREIWSF-----------------LHPHIMRNLRDQMDIKDR------WYELSHEPLQQIPDPMIR-CDWIN

Query:  NYLSEFWLANPKGGSVVQSKEDIVDIISNGEEIIMHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVDRLTI
          LSE+     +          +     +     ++ D ++  + +  G+G+V+RDK G+++A  +     N+++   +A+A    +R A  +GV     
Subjt:  NYLSEFWLANPKGGSVVQSKEDIVDIISNGEEIIMHTDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVDRLTI

Query:  LSDSLSLIKTI
          D+ ++I+ +
Subjt:  LSDSLSLIKTI

A0A2N9H1N4 RNase H domain-containing protein6.9e-7426.84Show/hide
Query:  RSSRSSIDFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKD
        R+S   +D   +  L+  L +LL +EE  W+QRS+ +WL+  D NT++FH +A+ R++RN +  +++  GQW +   ++   F  Y+  +F +  P+  +
Subjt:  RSSRSSIDFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKD

Query:  MACVLNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSD-
           V+  +   V  EMN  L+  +T EEV  A+K   P +APGPDG P +FYQKYW ++     +  L  LNS   ++  NHT + LIPK ++P  V + 
Subjt:  MACVLNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSD-

Query:  --IVLLAYVMFLIRLSL----------------------------------------------------------------------------SLIFLKA
          I L   +  LI  +L                                                                            SL+F +A
Subjt:  --IVLLAYVMFLIRLSL----------------------------------------------------------------------------SLIFLKA

Query:  AAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEV
           +    + IL  +E+ASGQ +N  K+ + FSK+    ++  + ++L +      + YLGLPS   R K   F  + +RVWS L+GWK  L SQ G+E 
Subjt:  AAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEV

Query:  LIKSIVTAIPTYAMGCFRIP----KDVEV---------------------------------------------IKSLPISSTS-PDKWIWHYDAKGEYS
        LIKS+  AIP+YAM CFR+P    K++EV                                             I  +P+S  +  D  +W     G YS
Subjt:  LIKSIVTAIPTYAMGCFRIP----KDVEV---------------------------------------------IKSLPISSTS-PDKWIWHYDAKGEYS

Query:  VKSGYKLSMLNSQGESL--SDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW---SFL
        VKSGY L + +S  E    SD  + +  WK VW + VP K + F+W++ HNS+PT  NL + H+     C +C  ++E+T HAL+QC + + +W   S+ 
Subjt:  VKSGYKLSMLNSQGESL--SDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW---SFL

Query:  HPHIMRNLRDQMDIKDR--------------------WYELSHEPLQQIPDPMIR-CDWINNYLSEFWLANPKGGSVVQSKEDIVDIISNGE---EIIMH
                 D +D+  +                    WY  +   LQQ  D  ++      + +SEF  A  +   + Q       +  N        ++
Subjt:  HPHIMRNLRDQMDIKDR--------------------WYELSHEPLQQIPDPMIR-CDWINNYLSEFWLANPKGGSVVQSKEDIVDIISNGE---EIIMH

Query:  TDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVDRLTILSDSLSLIKTI
         D ++    + AGIG+++R+  G +M   +      +S    +A A    ++ AR+LG  ++ +  DS ++++ +
Subjt:  TDASVMGKHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVDRLTILSDSLSLIKTI

A0A803QF94 Uncharacterized protein3.0e-6933.33Show/hide
Query:  RSSIDFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMAC
        R+S   + +   E+ L+ LL +EE+YW+QRSR +WL  GD NTK+FH KAS R+  N+I  + +  GQ       I    ++YF EIF++S  D+  +  
Subjt:  RSSIDFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMAC

Query:  VLNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSD----
         L+ +P  VT   N  L+  +T  EV  A+++  P ++PG DG  A+FYQK W +V +      L ILN        N T I LIPK + P  V D    
Subjt:  VLNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSD----

Query:  -----IVLLAYVMFLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQ-YLGNILSMRVTDSLDSYLGLPSTFQRSKSR
             I  L   + + R  L L  + +  R F L R+I+ +        V F     + +K   +++ Q +    LSM + +  + YLGLPS   R K  
Subjt:  -----IVLLAYVMFLIRLSLSLIFLKAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQ-YLGNILSMRVTDSLDSYLGLPSTFQRSKSR

Query:  DFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIP----KDVEVIK-------------------SLPISSTSP------DKWIWHY
         F  + +++W ++  W   +FS GG+EVL+K++V +IPTYAM CFR+P      VE +                    +L   S S       D  IWH+
Subjt:  DFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTAIPTYAMGCFRIP----KDVEVIK-------------------SLPISSTSP------DKWIWHY

Query:  DAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIWS
           G Y+V +GY L+      +  S     +SWWK  W M++P KVK+F WK  H+++P   +L    V     C VC    E+  HALF C  AR +W 
Subjt:  DAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIWS

Query:  F
        +
Subjt:  F

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.1e-0727.01Show/hide
Query:  RRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLN-HVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQK
        +R++N+I  I++  G   +D  +IQ     Y+K ++ +   + ++M   L+ +   ++  E  + L    T  E+ A I S    ++PGPDGF A FYQ+
Subjt:  RRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLN-HVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQK

Query:  YWDVVANTTVSNCLDILNSEGQI-QDWNHTNIVLIPK
        Y + +    +      +  EG +   +   +I+LIPK
Subjt:  YWDVVANTTVSNCLDILNSEGQI-QDWNHTNIVLIPK

P0C2F6 Putative ribonuclease H protein At1g657505.8e-0928.45Show/hide
Query:  LPISSTSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTN--SWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEME
        L + + + D+  W +   G++SV+S Y++        ++ ++ R N  S++  +WK+RVP +VK F+W   + ++ T       H+  S  C VC   +E
Subjt:  LPISSTSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTN--SWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEME

Query:  TTDHALFQCPRAREIW
        +  H L  CP    IW
Subjt:  TTDHALFQCPRAREIW

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.8e-1326.34Show/hide
Query:  FSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQ----PDNKDMACV
        F + HV  K+ +      E +++Q+SR  WL+ GD+NT++FH+     + +N I  +   +     +  +++E    Y+  +  S      PD+  +  +
Subjt:  FSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQ----PDNKDMACV

Query:  LNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPK
         +  P +    +   L A  + +E+TAA+ +    +APGPD F A F+ + W VV ++T++   +   +   ++ +N T I LIPK
Subjt:  LNHVPRKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPK

AT2G02650.1 Ribonuclease H-like superfamily protein4.7e-0630.16Show/hide
Query:  VWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW
        +WK+ V  K+K F+W+    ++ T   L + ++     C  C  E ET  H +F CP  + +W
Subjt:  VWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW

AT3G09510.1 Ribonuclease H-like superfamily protein4.7e-1431.71Show/hide
Query:  STSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKM---VWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTD
        S  PDK IW+Y+  GEY+V+SGY L + +    ++  +   +    +   +W + +  K+K F+W++   ++ T   L    + I   CP CH E E+ +
Subjt:  STSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKM---VWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTD

Query:  HALFQCPRAREIWSFLHPHIMRN
        HALF CP A   W      ++RN
Subjt:  HALFQCPRAREIWSFLHPHIMRN

AT3G25270.1 Ribonuclease H-like superfamily protein2.1e-0633.33Show/hide
Query:  VWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW
        +WK++   K+K F+WK    ++ T  NL   H+     C  C  E ET+ H  F C  A+++W
Subjt:  VWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAREIW

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.9e-0934.33Show/hide
Query:  TNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAR
        TN+W   +W +++  K+K+ +WK+ +N++P    L + ++ I  +C  C D  ET  H LF CP A+
Subjt:  TNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYCPVCHDEMETTDHALFQCPRAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGATCGAGTAGGTCTTCCATTGATTTCTCCATCATCCACGTGTTAGAAAAACGCCTGGATGCATTACTGTTAGAGGAGGAAATGTACTGGAAGCAACGATCAAGAGA
AAACTGGCTGAAATGGGGGGACAGTAATACAAAATGGTTCCACCAAAAGGCCTCGCTGAGGAGGAAGCGAAATAGGATTGTTGGTATTGAGGATTCTAATGGGCAGTGGC
AATCTGACAAGGATAAAATTCAGGAAGCTTTTGAAAACTATTTTAAGGAGATATTCACTTCCTCCCAGCCTGATAACAAGGATATGGCTTGTGTGCTAAATCATGTCCCT
CGGAAAGTTACTTATGAGATGAACCAAGTTTTATTAGCCTCTTACACTAGGGAGGAGGTTACGGCTGCGATTAAGAGCTTCCATCCCACTAGGGCGCCAGGTCCAGATGG
TTTTCCTGCGCTGTTCTACCAGAAGTACTGGGATGTTGTTGCAAATACGACTGTTTCCAATTGTTTGGATATTCTAAACTCGGAGGGGCAGATCCAAGATTGGAATCATA
CCAATATTGTGCTAATTCCTAAGAGTCGTCATCCGAGTTTAGTATCTGATATCGTCCTATTAGCTTATGTAATGTTTCTTATAAGATTGTCACTAAGTCTGATATTCCTT
AAAGCTGCAGCTCGGGAGTTTGGACTTTTTCGCATCATTTTGAAGGACTTTGAAAGAGCATCTGGACAATCGGTTAATTTTTCCAAATCCATGGTCATGTTCTCTAAAAA
TGTCCTGCAAGACTCACGGCAATATCTGGGAAATATTTTATCAATGCGAGTTACTGATTCTCTAGATTCTTACCTAGGGCTGCCATCTACCTTTCAGAGAAGTAAATCTA
GAGATTTCAAATTTCTTCTTGATAGAGTCTGGTCTGTCCTGCAAGGATGGAAGGGTAATTTATTTTCTCAAGGTGGTAAGGAAGTGCTTATTAAGAGCATAGTTACGGCT
ATCCCTACGTATGCTATGGGGTGTTTCCGGATTCCAAAAGATGTGGAGGTGATTAAGAGTTTGCCGATTAGTAGTACCTCACCAGACAAATGGATATGGCATTACGATGC
TAAAGGTGAGTACTCTGTTAAGAGTGGGTATAAACTGTCAATGCTAAATTCCCAGGGGGAATCTTTGTCTGATATGGGACGGACTAACTCTTGGTGGAAGATGGTGTGGA
AGATGAGAGTTCCTAGCAAAGTAAAAGTTTTTGTATGGAAATCATTTCATAATTCAATTCCCACTATGGTCAACCTTTGTAATCATCATGTGCCTATTTCGGGGTACTGT
CCGGTGTGCCATGACGAGATGGAAACTACAGATCATGCTCTTTTTCAATGTCCGAGGGCTCGAGAGATTTGGTCTTTTCTTCATCCGCATATAATGAGGAATCTAAGGGA
TCAAATGGATATAAAAGATCGATGGTATGAGCTCTCTCATGAGCCGTTGCAGCAAATCCCCGATCCTATGATTAGGTGTGACTGGATCAATAATTATCTGTCGGAGTTCT
GGTTGGCTAATCCAAAAGGTGGTTCTGTTGTCCAGTCGAAGGAGGATATCGTTGATATTATATCAAATGGTGAAGAGATAATTATGCATACGGATGCTTCTGTAATGGGC
AAACACTCCAACGCGGGCATTGGCATTGTCATGCGTGACAAAAATGGTGTGTTAATGGCGGTACAGACGTCATCAACAATGGTCAATAATTCATCCTTGGAGGCGAAGGC
GATGGCGGTCGTTGAAGGGTTACGATTGGCTCGGAATCTGGGTGTGGATCGTCTTACTATTTTGTCGGATTCATTATCATTGATAAAGACCATTAATGAGGATTTGCAGG
GGGAGGCCTGCATTGCTGCGCAAATTGGCCCATTTATGGCTTTCGCATTTTCCTTGTTTGTGGCTAGAAAACTATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGATCGAGTAGGTCTTCCATTGATTTCTCCATCATCCACGTGTTAGAAAAACGCCTGGATGCATTACTGTTAGAGGAGGAAATGTACTGGAAGCAACGATCAAGAGA
AAACTGGCTGAAATGGGGGGACAGTAATACAAAATGGTTCCACCAAAAGGCCTCGCTGAGGAGGAAGCGAAATAGGATTGTTGGTATTGAGGATTCTAATGGGCAGTGGC
AATCTGACAAGGATAAAATTCAGGAAGCTTTTGAAAACTATTTTAAGGAGATATTCACTTCCTCCCAGCCTGATAACAAGGATATGGCTTGTGTGCTAAATCATGTCCCT
CGGAAAGTTACTTATGAGATGAACCAAGTTTTATTAGCCTCTTACACTAGGGAGGAGGTTACGGCTGCGATTAAGAGCTTCCATCCCACTAGGGCGCCAGGTCCAGATGG
TTTTCCTGCGCTGTTCTACCAGAAGTACTGGGATGTTGTTGCAAATACGACTGTTTCCAATTGTTTGGATATTCTAAACTCGGAGGGGCAGATCCAAGATTGGAATCATA
CCAATATTGTGCTAATTCCTAAGAGTCGTCATCCGAGTTTAGTATCTGATATCGTCCTATTAGCTTATGTAATGTTTCTTATAAGATTGTCACTAAGTCTGATATTCCTT
AAAGCTGCAGCTCGGGAGTTTGGACTTTTTCGCATCATTTTGAAGGACTTTGAAAGAGCATCTGGACAATCGGTTAATTTTTCCAAATCCATGGTCATGTTCTCTAAAAA
TGTCCTGCAAGACTCACGGCAATATCTGGGAAATATTTTATCAATGCGAGTTACTGATTCTCTAGATTCTTACCTAGGGCTGCCATCTACCTTTCAGAGAAGTAAATCTA
GAGATTTCAAATTTCTTCTTGATAGAGTCTGGTCTGTCCTGCAAGGATGGAAGGGTAATTTATTTTCTCAAGGTGGTAAGGAAGTGCTTATTAAGAGCATAGTTACGGCT
ATCCCTACGTATGCTATGGGGTGTTTCCGGATTCCAAAAGATGTGGAGGTGATTAAGAGTTTGCCGATTAGTAGTACCTCACCAGACAAATGGATATGGCATTACGATGC
TAAAGGTGAGTACTCTGTTAAGAGTGGGTATAAACTGTCAATGCTAAATTCCCAGGGGGAATCTTTGTCTGATATGGGACGGACTAACTCTTGGTGGAAGATGGTGTGGA
AGATGAGAGTTCCTAGCAAAGTAAAAGTTTTTGTATGGAAATCATTTCATAATTCAATTCCCACTATGGTCAACCTTTGTAATCATCATGTGCCTATTTCGGGGTACTGT
CCGGTGTGCCATGACGAGATGGAAACTACAGATCATGCTCTTTTTCAATGTCCGAGGGCTCGAGAGATTTGGTCTTTTCTTCATCCGCATATAATGAGGAATCTAAGGGA
TCAAATGGATATAAAAGATCGATGGTATGAGCTCTCTCATGAGCCGTTGCAGCAAATCCCCGATCCTATGATTAGGTGTGACTGGATCAATAATTATCTGTCGGAGTTCT
GGTTGGCTAATCCAAAAGGTGGTTCTGTTGTCCAGTCGAAGGAGGATATCGTTGATATTATATCAAATGGTGAAGAGATAATTATGCATACGGATGCTTCTGTAATGGGC
AAACACTCCAACGCGGGCATTGGCATTGTCATGCGTGACAAAAATGGTGTGTTAATGGCGGTACAGACGTCATCAACAATGGTCAATAATTCATCCTTGGAGGCGAAGGC
GATGGCGGTCGTTGAAGGGTTACGATTGGCTCGGAATCTGGGTGTGGATCGTCTTACTATTTTGTCGGATTCATTATCATTGATAAAGACCATTAATGAGGATTTGCAGG
GGGAGGCCTGCATTGCTGCGCAAATTGGCCCATTTATGGCTTTCGCATTTTCCTTGTTTGTGGCTAGAAAACTATCCTGA
Protein sequenceShow/hide protein sequence
MRSSRSSIDFSIIHVLEKRLDALLLEEEMYWKQRSRENWLKWGDSNTKWFHQKASLRRKRNRIVGIEDSNGQWQSDKDKIQEAFENYFKEIFTSSQPDNKDMACVLNHVP
RKVTYEMNQVLLASYTREEVTAAIKSFHPTRAPGPDGFPALFYQKYWDVVANTTVSNCLDILNSEGQIQDWNHTNIVLIPKSRHPSLVSDIVLLAYVMFLIRLSLSLIFL
KAAAREFGLFRIILKDFERASGQSVNFSKSMVMFSKNVLQDSRQYLGNILSMRVTDSLDSYLGLPSTFQRSKSRDFKFLLDRVWSVLQGWKGNLFSQGGKEVLIKSIVTA
IPTYAMGCFRIPKDVEVIKSLPISSTSPDKWIWHYDAKGEYSVKSGYKLSMLNSQGESLSDMGRTNSWWKMVWKMRVPSKVKVFVWKSFHNSIPTMVNLCNHHVPISGYC
PVCHDEMETTDHALFQCPRAREIWSFLHPHIMRNLRDQMDIKDRWYELSHEPLQQIPDPMIRCDWINNYLSEFWLANPKGGSVVQSKEDIVDIISNGEEIIMHTDASVMG
KHSNAGIGIVMRDKNGVLMAVQTSSTMVNNSSLEAKAMAVVEGLRLARNLGVDRLTILSDSLSLIKTINEDLQGEACIAAQIGPFMAFAFSLFVARKLS