; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022113 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022113
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr7:18646013..18652199
RNA-Seq ExpressionLag0022113
SyntenyLag0022113
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023908468.1 uncharacterized protein LOC112020138 [Quercus suber]6.3e-6225.12Show/hide
Query:  GGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDIIHNCADWSAKG----
        GG FTWCN       +  R+DR +   ++LD F   KV + +   S+H+PIL    +CL+     R + +RFE  W     C+D +     ++A G    
Subjt:  GGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDIIHNCADWSAKG----

Query:  -----------------------------------------------------------------------------------TTVWFHLQANKRRQQNK
                                                                                            T +FH +A  R ++NK
Subjt:  -----------------------------------------------------------------------------------TTVWFHLQANKRRQQNK

Query:  IMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGD
        I  + +  G+   + + I +  V++++++F+S  P+  E  + +   P+ V +EMN +L+ PF + E++  +    P K+PGPDG P +F+Q+ W ++G+
Subjt:  IMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGD

Query:  QTISDILAVLNQGHQVREWNHTHIALIPKVRDARFV---------------------------LNLRYTGV-------QDQSSYLSSILQMKVVKNLDSY
        +    +L  LN G      N T I LIPKV+    V                           LN   T +       +D  + LS  LQ+  VK  + Y
Subjt:  QTISDILAVLNQGHQVREWNHTHIALIPKVRDARFV---------------------------LNLRYTGV-------QDQSSYLSSILQMKVVKNLDSY

Query:  LGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------
        LGLP+   R++    + I +RVWS LQGWK K+ S  G+EVL+K++VQAIP +AM CF+LP  L  +I +L  +F+WG                      
Subjt:  LGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------

Query:  -----------NDGV--------------------------------------AYIQQPQLD---------------------------PGKGPITRSKA
                   ND +                                      +Y  Q  +                            PGKG   R  +
Subjt:  -----------NDGV--------------------------------------AYIQQPQLD---------------------------PGKGPITRSKA

Query:  KKIQEAFTLHVQKLANAQRDAEN----------FESCIIPSIPDKW-------IWHYRKSGKYSVRSGYKAYMLKKD----EASPSDSGYVKSWWKKLWS
         ++       V  L N    + N          FE   I +IP  W       IW     G+Y V+SGY+  ML K+     AS SDS +  ++WK+LW 
Subjt:  KKIQEAFTLHVQKLANAQRDAEN----------FESCIIPSIPDKW-------IWHYRKSGKYSVRSGYKAYMLKKD----EASPSDSGYVKSWWKKLWS

Query:  LKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYH--SEQQDMLAWICM
        L +P+K+K F+WRV +N +PT  NL +R ++  A C  C    ET+ HAL+ C   K+ W      +         +QD  LE+ +   ++ + L    +
Subjt:  LKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYH--SEQQDMLAWICM

Query:  GAWALWNDHNAL
         AW +W   N L
Subjt:  GAWALWNDHNAL

XP_024033484.1 uncharacterized protein LOC112095607 [Citrus clementina]4.6e-6528.41Show/hide
Query:  EEFWTRSNDCKDIIHNCADWSAKG--TTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNI
        E+FW +         + A+W   G   T +FH +A+ R+++N+I GI + +G W    + +   F  YF  +F++  PS+ E+S  +  I  +V + MN 
Subjt:  EEFWTRSNDCKDIIHNCADWSAKG--TTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNI

Query:  DLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRDARFV--------LNLRYTGVQDQS-
         L  PFT E+I   +    P+KAPGPDG PA FYQK+W +V  + I+  L +LN+   +   NHT+IAL+PK    + +        +N+ +    D S 
Subjt:  DLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRDARFV--------LNLRYTGVQDQS-

Query:  ----------------------------------------------SYLSSILQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSG
                                                      S + +I Q+ VV   + YLGLPS   R +S  F  +  +V   +  W+ K FS 
Subjt:  ----------------------------------------------SYLSSILQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSG

Query:  GGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGSNDGVAY----------------------------------------IQQPQLDPGKG
        GGKEVLIK++ QA+P YAMS F++P  +   I    A FWWGS     Y                                        +   +   GKG
Subjt:  GGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGSNDGVAY----------------------------------------IQQPQLDPGKG

Query:  -PITRSKAKKIQEAFTL------------HVQKLANAQR--------------DAENFESCIIPSIP--DKWIWHYRKSGKYSVRSGYKAYMLKKDEASP
          +   KA  I    T             +V  L N                 DA+   S  +P  P  D+ +WHY K G+YSV+SGY+  +  K  A P
Subjt:  -PITRSKAKKIQEAFTL------------HVQKLANAQR--------------DAENFESCIIPSIP--DKWIWHYRKSGKYSVRSGYKAYMLKKDEASP

Query:  SDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIY
        S S    + W+ + +L +  K++ F+WR   N +P+M NL++R V  + +C ICK G E+  HAL  C  A+K WR       +R+     +        
Subjt:  SDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIY

Query:  HSEQQ---DMLAWICMGAWALWN
         S ++   ++L  IC G W+  N
Subjt:  HSEQQ---DMLAWICMGAWALWN

XP_030923509.1 uncharacterized protein LOC115950456 [Quercus lobata]1.5e-6026.17Show/hide
Query:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII-------------
        +G  FTWCN +E + +VL R+DR L   +++D +++ KV +   + S+H  +LL      +       RRF FE  WT+  +CKDII             
Subjt:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII-------------

Query:  -------HNCA---------------------------------------------------------DWSAKGTTVW----------FHLQANKRRQQN
                NCA                                                          W  +    W          FH +A+ RR++N
Subjt:  -------HNCA---------------------------------------------------------DWSAKGTTVW----------FHLQANKRRQQN

Query:  KIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGP-DGFPAIFYQ--KNWE
         I  I+D  G W  +PD I    V+YFK+++S+  P+   + +++  IP KV +EMN  L+  FT+EEIE  +   HP+KAPGP DGF ++ +   +N++
Subjt:  KIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGP-DGFPAIFYQ--KNWE

Query:  IVGDQTISDILAVLNQGHQVREWNHTHIALI---PKVRDARFVLNL--RYTGVQDQ-----------SSYLSSILQMKVVKNL--------DSYLGLPSS
        I G       +++     ++        +L+      ++ + ++N+   Y     Q           SS  S   + +V+  L          YLGLPS 
Subjt:  IVGDQTISDILAVLNQGHQVREWNHTHIALI---PKVRDARFVLNL--RYTGVQDQ-----------SSYLSSILQMKVVKNL--------DSYLGLPSS

Query:  FSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------------
          +SK   F +I +RV   L GWK KM S GG+E+LIK++ QAIP YAMSCF++PK L  ++ ++  +FWWG                            
Subjt:  FSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------------

Query:  ---------------------NDGVAYIQQPQLDPGKGPITRSKAKKIQEAFT-------LHVQK------LANAQR-----------------------
                             N  VA I + +  P  G +  SK      +FT       L V K      + N +R                       
Subjt:  ---------------------NDGVAYIQQPQLDPGKGPITRSKAKKIQEAFT-------LHVQK------LANAQR-----------------------

Query:  -----------DAEN--------------FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKA---YMLKKDEASPSDSGYVKSWWKKLWSLKIPS
                   D E               FE+  I +IP       D+ IW   K G++SV+S Y      +   DE   S+     + W+KLW L IP 
Subjt:  -----------DAEN--------------FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKA---YMLKKDEASPSDSGYVKSWWKKLWSLKIPS

Query:  KVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWN
        KV+ F WR+  N +PT+ NL  + VV   +CP C   PET  H   +C  AK+ WR +     +    +  I D  LEI  S   + L +    AWA+W+
Subjt:  KVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWN

Query:  DHNALVHDAA
        + N ++++++
Subjt:  DHNALVHDAA

XP_030970416.1 uncharacterized protein LOC115990765 [Quercus lobata]3.4e-6026.05Show/hide
Query:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII-------------
        +G  FTWCN +E + +VL R+DR L   +++D +++ KV +   + S+H  +LL      +       RRF FE  WT+  +CKDII             
Subjt:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII-------------

Query:  -------HNCA---------------------------------------------------------DWSAKGTTVW----------FHLQANKRRQQN
                NCA                                                          W  +    W          FH +A+ RR++N
Subjt:  -------HNCA---------------------------------------------------------DWSAKGTTVW----------FHLQANKRRQQN

Query:  KIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGP-DGFPAIFYQ--KNWE
         I  I+D  G W  +PD I    V+YFK+++S+  P+   + +++  IP KV ++MN  L+  FT+EEIE  +   HP+KAPGP DGF ++ +   +N++
Subjt:  KIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGP-DGFPAIFYQ--KNWE

Query:  IVGDQTISDILAVLNQGHQVREWNHTHIALI---PKVRDARFVLNL--RYTGVQDQ-----------SSYLSSILQMKVVKNL--------DSYLGLPSS
        I G       +++     ++        +L+      ++ + ++N+   Y     Q           SS  S   + +V+  L          YLGLPS 
Subjt:  IVGDQTISDILAVLNQGHQVREWNHTHIALI---PKVRDARFVLNL--RYTGVQDQ-----------SSYLSSILQMKVVKNL--------DSYLGLPSS

Query:  FSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------------
          +SK   F +I +RV   L GWK KM S GG+E+LIK++ QAIP YAMSCF++PK L  ++ ++  +FWWG                            
Subjt:  FSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------------

Query:  ---------------------NDGVAYIQQPQLDPGKGPITRSKAKKIQEAFT-------LHVQK------LANAQR-----------------------
                             N  VA I + +  P  G +  SK      +FT       L V K      + N +R                       
Subjt:  ---------------------NDGVAYIQQPQLDPGKGPITRSKAKKIQEAFT-------LHVQK------LANAQR-----------------------

Query:  -----------DAEN--------------FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKA---YMLKKDEASPSDSGYVKSWWKKLWSLKIPS
                   D E               FE+  I +IP       D+ IW   K G++SV+S Y      +   DE   S+     + W+KLW L IP 
Subjt:  -----------DAEN--------------FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKA---YMLKKDEASPSDSGYVKSWWKKLWSLKIPS

Query:  KVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWN
        KV+ F WR+  N +PT+ NL  + VV   +CP C   PET  H   +C  AK+ WR +     +    +  I D  LEI  S   + L +    AWA+W+
Subjt:  KVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWN

Query:  DHNALVHDAA
        + N ++++++
Subjt:  DHNALVHDAA

XP_030970552.1 uncharacterized protein LOC115990926 [Quercus lobata]5.9e-6026.05Show/hide
Query:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII-------------
        +G  FTWCN +E + +VL R+DR L   +++D +++ KV +   + S+H  +LL      +       RRF FE  WT+  +CKDII             
Subjt:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII-------------

Query:  -------HNCA---------------------------------------------------------DWSAKGTTVW----------FHLQANKRRQQN
                NCA                                                          W  +    W          FH +A+ RR++N
Subjt:  -------HNCA---------------------------------------------------------DWSAKGTTVW----------FHLQANKRRQQN

Query:  KIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGP-DGFPAIFYQ--KNWE
         I  I+D  G W  +PD I    V+YFK+++S+  P+   + +++  IP KV ++MN  L+  FT+EEIE  +   HP+KAPGP DGF ++ +   +N++
Subjt:  KIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGP-DGFPAIFYQ--KNWE

Query:  IVGDQTISDILAVLNQGHQVREWNHTHIALI---PKVRDARFVLNL--RYTGVQDQ-----------SSYLSSILQMKVVKNL--------DSYLGLPSS
        I G       +++     ++        +L+      ++ + ++N+   Y     Q           SS  S   + +V+  L          YLGLPS 
Subjt:  IVGDQTISDILAVLNQGHQVREWNHTHIALI---PKVRDARFVLNL--RYTGVQDQ-----------SSYLSSILQMKVVKNL--------DSYLGLPSS

Query:  FSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------------
          +SK   F +I +RV   L GWK KM S GG+E+LIK++ QAIP YAMSCF++PK L  ++ ++  +FWWG                            
Subjt:  FSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS---------------------------

Query:  ---------------------NDGVAYIQQPQLDPGKGPITRSKAKKIQEAFT-------LHVQK------LANAQR-----------------------
                             N  VA I + +  P  G +  SK      +FT       L V K      + N +R                       
Subjt:  ---------------------NDGVAYIQQPQLDPGKGPITRSKAKKIQEAFT-------LHVQK------LANAQR-----------------------

Query:  -----------DAEN--------------FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKA---YMLKKDEASPSDSGYVKSWWKKLWSLKIPS
                   D E               FE+  I +IP       D+ IW   K G++SV+S Y      +   DE   S+     + W+KLW L IP 
Subjt:  -----------DAEN--------------FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKA---YMLKKDEASPSDSGYVKSWWKKLWSLKIPS

Query:  KVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWN
        KV+ F WR+  N +PT+ NL  + VV   +CP C   PET  H   +C  AK+ WR +     +    +  I D  LEI  S   + L +    AWA+W+
Subjt:  KVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWN

Query:  DHNALVHDAA
        + N ++++++
Subjt:  DHNALVHDAA

TrEMBL top hitse value%identityAlignment
A0A2N9EHR8 F-box domain-containing protein2.5e-6427.27Show/hide
Query:  QFLDLFQDCKVFNKDWAKSNHRPILLQ---TGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII---------HNCADWSAKG--TTVWFHLQANKRRQQNK
        Q  D    C+   + W+K N   I +Q   T   L+       R F            ++++          +  +W   G   T +FH +A +R+  N 
Subjt:  QFLDLFQDCKVFNKDWAKSNHRPILLQ---TGLCLEYFRCGRSRRFRFEEFWTRSNDCKDII---------HNCADWSAKG--TTVWFHLQANKRRQQNK

Query:  IMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGD
        I  + +++G W  N D +   F+ Y+ ++F+++ PS+ E  ++V  I   V  EMN  L   FT +E+EN +K   P KAPG DG P +FYQK W + G 
Subjt:  IMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGD

Query:  QTISDILAVLNQGHQVREWNHTHIALIPKVRDARFVLNLR-----------------------------YTGV---------------------------
             +L  LN G  ++  NHTHI  IPK+++   V + R                             Y  V                           
Subjt:  QTISDILAVLNQGHQVREWNHTHIALIPKVRDARFVLNLR-----------------------------YTGV---------------------------

Query:  ------------------------------------------------------QDQSSYLSSILQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWS
                                                              Q     + + L++ V+K+   YLGLPS   R+K   F KI +RVWS
Subjt:  ------------------------------------------------------QDQSSYLSSILQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWS

Query:  YLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGSNDGVAYIQQPQLDPGKGPITRSKAKKIQEAFTLHVQKLANAQRDA
         L+GWK K+ S  G+EVLIKS+ QAIP +AMSCFRLP  L  +I  L  +FWWG N           D G                              
Subjt:  YLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGSNDGVAYIQQPQLDPGKGPITRSKAKKIQEAFTLHVQKLANAQRDA

Query:  ENFESCIIPSIPDKWIWHYRKSGKYSVRSGYKAYMLK--KDEASPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICK
                            + G+YSVRSGY+  M +  KD+ S S+S  +   W  +WSL++PSKV+HF+W   ++ +PT  NL RRH++ DA C ICK
Subjt:  ENFESCIIPSIPDKWIWHYRKSGKYSVRSGYKAYMLK--KDEASPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICK

Query:  NGPETSDHALFRCCRAKKFWREF---HSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWNDHN
           ET+ HAL+ C   +  WR      S +     N   +    +  + S + +  A +C   W+LW   N
Subjt:  NGPETSDHALFRCCRAKKFWREF---HSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWNDHN

A0A2N9GKZ7 Uncharacterized protein2.7e-6324.13Show/hide
Query:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDIIH------------
        +G  FTW NRR+ E  +  R+DR L N  +LD F  C V +   + S+H P+LL       + +  R R  +FE+ W+   +C+ II             
Subjt:  QGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRCGRSRRFRFEEFWTRSNDCKDIIH------------

Query:  ---------NCAD-----------------------------------------------------------------WSAKG--TTVWFHLQANKRRQQ
                 +C +                                                                 W A G   T +FH QAN+RR+ 
Subjt:  ---------NCAD-----------------------------------------------------------------WSAKG--TTVWFHLQANKRRQQ

Query:  NKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIV
        N + G+ +S+  W ++   I    ++YF  IF +  P   E +  ++ +  +V  E N  LL PFT +E+   +   HPSKAP PDG  + F+QK W IV
Subjt:  NKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIV

Query:  GDQTISDILAVLNQGHQVREWNHTHIALIPKVRD------------------------------------------------------------------
        G   ++ +L+VLN G  +R+ N THI+LIPK ++                                                                  
Subjt:  GDQTISDILAVLNQGHQVREWNHTHIALIPKVRD------------------------------------------------------------------

Query:  -----ARFVLNLRYTGVQDQSSYLSSILQMKV----------------------------------------------------VKNLDSYLGLPSSFSR
                 + L  +   D+S+   S+++ +                                                       N D YLGLP+   R
Subjt:  -----ARFVLNLRYTGVQDQSSYLSSILQMKV----------------------------------------------------VKNLDSYLGLPSSFSR

Query:  SKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGSNDGVAYIQQPQLDPGKGPITRSKAKKIQEA
        SK   F+ + +R+   LQGWK K  S  G+EVLIK++ QAIP YAM+CFRLPK    ++  L AR+WWG       +   + D     +  +KA+     
Subjt:  SKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGSNDGVAYIQQPQLDPGKGPITRSKAKKIQEA

Query:  FTLH-----------------VQKLANAQRDAENFESCI-------IPSIPDKWI------------WHYR------------KSGKYSVRSGYKAYMLK
          LH                  Q L      A  F SC        +PS   + I            WH++            K+G +SV+S Y+    +
Subjt:  FTLH-----------------VQKLANAQRDAENFESCI-------IPSIPDKWI------------WHYR------------KSGKYSVRSGYKAYMLK

Query:  KDEASPSDSGY---VKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGS
        +  +   +  Y   ++  W+K W L IP K+KHF+WR Y+  +PT   L RR +   +LC +C    ET+ HA+++C  A+  W   H ++       G 
Subjt:  KDEASPSDSGY---VKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGS

Query:  IQDRFLEIYHSEQQDMLAWICMGAWALWNDHNALVHD
               I+ +  ++ +    + AW++WN  N  VH+
Subjt:  IQDRFLEIYHSEQQDMLAWICMGAWALWNDHNALVHD

A0A2N9H1N4 RNase H domain-containing protein2.4e-6728.11Show/hide
Query:  ADW--SAKGTTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKG
        ADW  +    T +FH +A +R+++N +  + +  GQW +    +   F+ Y+ S+F +  P   +V ++V  I   V +EMN  L+  FT EE+   +K 
Subjt:  ADW--SAKGTTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKG

Query:  FHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRDARFVLNLR----------------------------------
          P KAPGPDG P IFYQK W ++G    + +L  LN G  ++  NHT++ LIPKV++   V+  R                                  
Subjt:  FHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRDARFVLNLR----------------------------------

Query:  ---------------YTGVQDQ-------------------------------------------------------SSY--------------------
                         G QD+                                                       S Y                    
Subjt:  ---------------YTGVQDQ-------------------------------------------------------SSY--------------------

Query:  --------LSSILQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFW
                + S+L +  +K  + YLGLPS   R K   F +I +RVWS L+GWK K+ S  G+E LIKS+ QAIP YAMSCFRLP  L+ +I  L  RFW
Subjt:  --------LSSILQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFW

Query:  WGSNDGVAYIQQPQLDPGKGPITRSKAKKIQEAFTLH-VQKLANAQRDAENFESCIIPSIPDKWIWHYRKSGKYSVRSGYKAYMLK--KDEASPSDSGYV
        WG  +    +Q   L      I   KA  +QE F  H    +        N   C+        +W   K+G YSV+SGY   +    ++E+ PSD   +
Subjt:  WGSNDGVAYIQQPQLDPGKGPITRSKAKKIQEAFTLH-VQKLANAQRDAENFESCIIPSIPDKWIWHYRKSGKYSVRSGYKAYMLK--KDEASPSDSGYV

Query:  KSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQD
           WK +WSL +P K +HF+WR  +N +PT +NL  RH++ D  C IC    E++ HAL++C + +  W+       + +  +    D   + +      
Subjt:  KSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQD

Query:  MLAWICMGAWALWNDHNAL
         L    M  W++W   N L
Subjt:  MLAWICMGAWALWNDHNAL

A0A2N9I4J7 Protein kinase domain-containing protein4.5e-7429.7Show/hide
Query:  ADWSAKG--TTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKG
        ++W   G   T +FH +A +RR++N+I+ + DS G W ++   +   F+N++  +F+S  PS  +V Q+V +IP+ V  EMN  L   F   E+   +K 
Subjt:  ADWSAKG--TTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKG

Query:  FHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRDARFVLNLRYTGV----------------------QDQSSYLS
          P K+PGPDG P +FYQK W I+G+   + +L  LN G  ++  NHTHI LIPKV++   V   R   +                        +S  ++
Subjt:  FHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRDARFVLNLRYTGV----------------------QDQSSYLS

Query:  SILQMKVVKNL------------------------------------------------------------DSYLGLPSSFSRSKSRDFHKIMDRVWSYL
          L MK + +L                                                            + YLGLPS   R+K   F +I +RVWS L
Subjt:  SILQMKVVKNL------------------------------------------------------------DSYLGLPSSFSRSKSRDFHKIMDRVWSYL

Query:  QGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWW---GSNDGVAYIQQPQLDPG--KGPITRSKAKKIQEAF----TLHVQKL
        +GWK K+ S  G+E+LIK + QAIP YAMSCFRLP  L+ +I  L  +FWW   G  D + +I    L     KG I   +     EA     ++   K 
Subjt:  QGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWW---GSNDGVAYIQQPQLDPG--KGPITRSKAKKIQEAF----TLHVQKL

Query:  ANAQRDAEN---FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKAYMLKKDEASP--SDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMT
         N+     N   FE   I  IP       D  IW   ++  Y+VRSGY   + +K   +P  SD+      WK +WSL +P+KV+HF+WR  ++ +PT  
Subjt:  ANAQRDAEN---FESCIIPSIP-------DKWIWHYRKSGKYSVRSGYKAYMLKKDEASP--SDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMT

Query:  NLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWNDHNALVHDAA--------
        NL  RHV+ D  CP C N  ET+ HAL  C   ++ W+   +    R     S  D F+E+Y+   Q +             DH+               
Subjt:  NLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWNDHNALVHDAA--------

Query:  -------------YDLQRGKCGFGAI----IRTLWKWEIDIYFDFLGLITMLNGLTDEFTEAHAFLWDIKLMSNSFSSIQFIHASRACNTIAHNLAQ
                     Y ++  +     I    +  L   E+DI  D L +IT L   T  +T     + D   ++++  S QF+H  R  N +AH+LA+
Subjt:  -------------YDLQRGKCGFGAI----IRTLWKWEIDIYFDFLGLITMLNGLTDEFTEAHAFLWDIKLMSNSFSSIQFIHASRACNTIAHNLAQ

A0A803QF94 Uncharacterized protein1.4e-7031.43Show/hide
Query:  DWSAKG--TTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGF
        DW A G   T +FH +A+ R+  NKI  + +  GQ   +P  I +   +YF  IFS+       ++  +  IP  V D  N  L+ PFT  E+ N ++  
Subjt:  DWSAKG--TTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLWPFTKEEIENVIKGF

Query:  HPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRD-------------------------ARFVLNLRYTGVQDQSSY
         P K+PG DG  A+FYQKNW IVGD     +L++LN G      N T I LIPKV+                          +RF L L     ++++ +
Subjt:  HPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRD-------------------------ARFVLNLRYTGVQDQSSY

Query:  LSSI--------------------------------LQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPV
        L  +                                L M + +  + YLGLPS   R K   F  I +++W  +  W  K+FS GG+EVL+K++VQ+IP 
Subjt:  LSSI--------------------------------LQMKVVKNLDSYLGLPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPV

Query:  YAMSCFRLPKVLLSKITSLYARFWWGSNDGVAYIQQPQLDPGKGPITRSKAKKIQ-EAFTLHVQKLANAQRDAENFESCIIPSIPDKWIWHYRKSGKYSV
        YAMSCFRLP    +++ S+ A FWWGSN                    +   KI   ++ L    L  ++ D             D  IWH+  SG Y+V
Subjt:  YAMSCFRLPKVLLSKITSLYARFWWGSNDGVAYIQQPQLDPGKGPITRSKAKKIQ-EAFTLHVQKLANAQRDAENFESCIIPSIPDKWIWHYRKSGKYSV

Query:  RSGYKAYMLKKDEASPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISI
         +GY      +D+   S S    SWWK  WS+++P KVK F W+V ++ +P  T+L +R V+ DA C +CK   E+  HALF C  A+  WR        
Subjt:  RSGYKAYMLKKDEASPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREFHSKISI

Query:  RDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWNDHNALVHD
        R  +  S  D    +     +  +  I    WA+WN+ N +VH+
Subjt:  RDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWNDHNALVHD

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.1e-0724.86Show/hide
Query:  KRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVS-----RIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPA
        K+R++N+I  I +  G   ++P  I +    Y+K ++++++ +  E+   +      R+ Q+ ++ +N     P T  EI  +I      K+PGPDGF A
Subjt:  KRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVS-----RIPQKVIDEMNIDLLWPFTKEEIENVIKGFHPSKAPGPDGFPA

Query:  IFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKV-RDARFVLNLRYTGVQD-QSSYLSSILQMKVVKNL
         FYQ+  E +    +    ++  +G     +    I LIPK  RD     N R   + +  +  L+ IL  ++ +++
Subjt:  IFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKV-RDARFVLNLRYTGVQD-QSSYLSSILQMKVVKNL

P0C2F6 Putative ribonuclease H protein At1g657502.6e-1031.25Show/hide
Query:  IIPSIPDKWIWHYRKSGKYSVRSGYKAYMLKKDEASPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDH
        ++    D+  W + + G++SVRS Y+  ML  DE    +   + S++  LW +++P +VK F+W V N  + T     RRH+    +C +CK G E+  H
Subjt:  IIPSIPDKWIWHYRKSGKYSVRSGYKAYMLKKDEASPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDH

Query:  ALFRCCRAKKFW
         L  C      W
Subjt:  ALFRCCRAKKFW

P0C2F6 Putative ribonuclease H protein At1g657503.9e-0633.77Show/hide
Query:  LPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS
        +P    R     F +I++RV S + GW+ K  S  G+  L K+++ ++PV++MS   LP+ +L+++  L   F WGS
Subjt:  LPSSFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGS

P11369 LINE-1 retrotransposable element ORF2 protein1.3e-0623.79Show/hide
Query:  TTVWFHLQANK-----------RRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNID-LLWPFTKEEIENV
        T  WF  + NK            R +  I  I +  G   ++P+ I +   +++K ++S+++ +  E+ + + R     +++  +D L  P + +EIE V
Subjt:  TTVWFHLQANK-----------RRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNID-LLWPFTKEEIENV

Query:  IKGFHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLN--------QGHQVREWNHTHIALIPK-VRDARFVLNLRYTGVQD-QSSYLSSILQMKVVK
        I      K+PGPDGF A FYQ   E        D++ +L+        +G     +    I LIPK  +D   + N R   + +  +  L+ IL  ++ +
Subjt:  IKGFHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLN--------QGHQVREWNHTHIALIPK-VRDARFVLNLRYTGVQD-QSSYLSSILQMKVVK

Query:  NLDSYL
        ++ + +
Subjt:  NLDSYL

P93295 Uncharacterized mitochondrial protein AtMg003102.8e-0464.52Show/hide
Query:  AIPVYAMSCFRLPKVLLSKITSLYARFWWGS
        A+PVYAMSCFRL K+L  K+TS    FWW S
Subjt:  AIPVYAMSCFRLPKVLLSKITSLYARFWWGS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.6e-0840Show/hide
Query:  EEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKV
        +EI   +     +KAPGPD F A F+ ++W +V D TI+ +      GH ++ +N T I LIPKV
Subjt:  EEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKV

AT2G02650.1 Ribonuclease H-like superfamily protein1.1e-0828.3Show/hide
Query:  KYSVRSGYKAYM---LKKDEA--SPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFW
        K  +RSGY       L ++EA   P  S  VK   + +W L +  K+KHF+WR     + T T L  R++  D +C  C    ET  H +F C   +  W
Subjt:  KYSVRSGYKAYM---LKKDEA--SPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFW

Query:  REFHSKISIRDYNHGSIQD---RFLEIYHSEQQDML-----AWICMGAWALWNDHNALV
        R  +  I  +     S +D   R +++  ++  + L      WI    W LW   N  +
Subjt:  REFHSKISIRDYNHGSIQD---RFLEIYHSEQQDML-----AWICMGAWALWNDHNALV

AT3G09510.1 Ribonuclease H-like superfamily protein2.7e-1529.48Show/hide
Query:  PDKWIWHYRKSGKYSVRSGYKAYMLKKDEAS-------PSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETS
        PDK IW+Y  +G+Y+VRSGY  ++L  D ++       P  S  +K+   ++W+L I  K+KHF+WR  +  + T   L  R +  D  CP C    E+ 
Subjt:  PDKWIWHYRKSGKYSVRSGYKAYMLKKDEAS-------PSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETS

Query:  DHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLA--------WICMGAWALWNDHNALV
        +HALF C  A   WR   S +          ++    I +  Q   ++        W+    W +W   N +V
Subjt:  DHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLA--------WICMGAWALWNDHNALV

AT3G25270.1 Ribonuclease H-like superfamily protein7.0e-1131.15Show/hide
Query:  KLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREF---HSKISIRDYNHGSIQDRFLEIYHSEQQDML
        K+W LK   K+KHF+W++ +  + T  NL RRH+     C  C    ETS H  F C  A++ WR     H ++        +  +  L    + +Q  L
Subjt:  KLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKKFWREF---HSKISIRDYNHGSIQDRFLEIYHSEQQDML

Query:  ----AWICMGAWALWNDHNALV
             WI    W LW   N LV
Subjt:  ----AWICMGAWALWNDHNALV

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.8e-0636.36Show/hide
Query:  SWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKK
        +W   +WSLKI  K+K  IW+  NN +P    L  R++  +  C  C++  ET  H LF C  A++
Subjt:  SWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGPETSDHALFRCCRAKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAAAAAACAGAAACTTTATGATGATGAATGTAATGAGGCGGGACTGGTGAACAGCCCCGCTCGAAAGTATGAATGTTTTGTGTTGGAATGTTCGTGGGTTGGGGA
TCCCACGAGCATTCCAGAATCTTGTCAGGGTGGTTCGTTCACATGGTGTAATAGGAGGGAGTTGGAAGATCAGGTGCTTACTAGAATAGATCGTTTCCTCGTCAATCAAC
AGTTCTTGGATTTATTTCAGGATTGTAAGGTCTTCAACAAGGATTGGGCAAAGTCCAATCACCGTCCTATCCTGTTACAAACGGGTTTATGCCTAGAGTATTTTAGGTGT
GGTCGTTCCAGAAGATTCCGGTTTGAAGAGTTTTGGACGCGTAGCAACGATTGTAAAGATATCATTCACAATTGTGCTGACTGGAGTGCCAAAGGTACAACAGTATGGTT
CCACCTTCAAGCAAATAAAAGGCGGCAACAGAATAAAATTATGGGCATTCTTGATTCCAATGGTCAGTGGCAATCCAATCCAGACGTGATTGGATCAAATTTTGTGAATT
ATTTCAAATCAATTTTCAGCTCACAGATCCCATCAGAGTTCGAGGTGTCCCAAATTGTTTCTAGAATTCCTCAAAAAGTCATTGATGAGATGAATATTGATCTTTTGTGG
CCTTTCACTAAGGAGGAAATAGAGAACGTCATCAAAGGATTTCATCCATCTAAGGCACCTGGTCCAGATGGCTTTCCAGCGATTTTTTATCAGAAAAATTGGGAGATTGT
AGGTGACCAAACTATTTCTGACATTTTGGCGGTGCTGAACCAGGGTCATCAAGTCCGGGAGTGGAACCATACCCATATTGCTCTGATTCCAAAGGTTCGCGATGCAAGGT
TTGTCTTAAATCTCAGGTATACAGGTGTCCAGGACCAGTCCTCCTATTTGAGTTCAATTCTTCAGATGAAGGTCGTAAAAAACTTGGATTCATACTTAGGCCTACCATCA
TCCTTTTCTCGAAGCAAAAGTAGAGACTTCCACAAGATTATGGATAGAGTATGGTCTTATCTACAAGGGTGGAAAGGCAAAATGTTTTCTGGTGGAGGAAAAGAGGTGTT
GATTAAAAGCATGGTTCAGGCTATTCCGGTGTATGCCATGAGTTGCTTCCGGTTACCAAAAGTGTTACTATCTAAAATTACGTCCCTCTATGCAAGATTTTGGTGGGGTA
GCAATGATGGTGTGGCGTATATTCAACAACCCCAACTCGACCCTGGCAAAGGACCAATCACAAGGAGCAAGGCAAAGAAGATACAAGAGGCTTTCACACTGCATGTTCAA
AAGCTAGCAAATGCACAACGAGATGCCGAGAATTTTGAATCTTGTATAATCCCATCCATCCCGGATAAATGGATTTGGCACTATCGTAAGTCAGGTAAGTATTCCGTTCG
TAGTGGGTACAAGGCCTACATGTTGAAGAAGGATGAGGCATCCCCTTCGGATTCTGGATATGTTAAGAGTTGGTGGAAGAAGTTGTGGTCTCTGAAGATACCCTCCAAAG
TTAAACATTTCATTTGGCGGGTTTATAATAATTGTATTCCAACAATGACAAATTTGTTTCGACGCCATGTGGTTTATGATGCACTATGTCCAATCTGCAAAAATGGACCG
GAAACCTCTGATCATGCACTTTTTAGATGTTGTCGAGCTAAAAAGTTTTGGAGAGAATTTCACAGCAAGATTAGCATCCGTGATTATAATCATGGATCAATTCAAGATAG
ATTTTTAGAAATTTATCATTCAGAGCAACAGGACATGTTGGCTTGGATTTGTATGGGTGCTTGGGCACTATGGAATGACCATAACGCTCTAGTTCACGATGCTGCTTATG
ATTTGCAAAGAGGCAAGTGCGGTTTTGGCGCCATCATCAGGACACTGTGGAAATGGGAGATTGATATCTACTTTGATTTCCTAGGACTGATTACCATGCTCAATGGTTTA
ACAGACGAATTCACGGAAGCACATGCTTTTCTTTGGGATATAAAGTTGATGAGTAATTCATTTTCATCAATTCAGTTCATTCATGCTAGTAGAGCATGCAATACAATTGC
TCATAACTTAGCTCAGATCGGACTTAGCTCTGATTCCATGTTATGGCTTAGGATTTTCCTTCCTGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAAAAAAACAGAAACTTTATGATGATGAATGTAATGAGGCGGGACTGGTGAACAGCCCCGCTCGAAAGTATGAATGTTTTGTGTTGGAATGTTCGTGGGTTGGGGA
TCCCACGAGCATTCCAGAATCTTGTCAGGGTGGTTCGTTCACATGGTGTAATAGGAGGGAGTTGGAAGATCAGGTGCTTACTAGAATAGATCGTTTCCTCGTCAATCAAC
AGTTCTTGGATTTATTTCAGGATTGTAAGGTCTTCAACAAGGATTGGGCAAAGTCCAATCACCGTCCTATCCTGTTACAAACGGGTTTATGCCTAGAGTATTTTAGGTGT
GGTCGTTCCAGAAGATTCCGGTTTGAAGAGTTTTGGACGCGTAGCAACGATTGTAAAGATATCATTCACAATTGTGCTGACTGGAGTGCCAAAGGTACAACAGTATGGTT
CCACCTTCAAGCAAATAAAAGGCGGCAACAGAATAAAATTATGGGCATTCTTGATTCCAATGGTCAGTGGCAATCCAATCCAGACGTGATTGGATCAAATTTTGTGAATT
ATTTCAAATCAATTTTCAGCTCACAGATCCCATCAGAGTTCGAGGTGTCCCAAATTGTTTCTAGAATTCCTCAAAAAGTCATTGATGAGATGAATATTGATCTTTTGTGG
CCTTTCACTAAGGAGGAAATAGAGAACGTCATCAAAGGATTTCATCCATCTAAGGCACCTGGTCCAGATGGCTTTCCAGCGATTTTTTATCAGAAAAATTGGGAGATTGT
AGGTGACCAAACTATTTCTGACATTTTGGCGGTGCTGAACCAGGGTCATCAAGTCCGGGAGTGGAACCATACCCATATTGCTCTGATTCCAAAGGTTCGCGATGCAAGGT
TTGTCTTAAATCTCAGGTATACAGGTGTCCAGGACCAGTCCTCCTATTTGAGTTCAATTCTTCAGATGAAGGTCGTAAAAAACTTGGATTCATACTTAGGCCTACCATCA
TCCTTTTCTCGAAGCAAAAGTAGAGACTTCCACAAGATTATGGATAGAGTATGGTCTTATCTACAAGGGTGGAAAGGCAAAATGTTTTCTGGTGGAGGAAAAGAGGTGTT
GATTAAAAGCATGGTTCAGGCTATTCCGGTGTATGCCATGAGTTGCTTCCGGTTACCAAAAGTGTTACTATCTAAAATTACGTCCCTCTATGCAAGATTTTGGTGGGGTA
GCAATGATGGTGTGGCGTATATTCAACAACCCCAACTCGACCCTGGCAAAGGACCAATCACAAGGAGCAAGGCAAAGAAGATACAAGAGGCTTTCACACTGCATGTTCAA
AAGCTAGCAAATGCACAACGAGATGCCGAGAATTTTGAATCTTGTATAATCCCATCCATCCCGGATAAATGGATTTGGCACTATCGTAAGTCAGGTAAGTATTCCGTTCG
TAGTGGGTACAAGGCCTACATGTTGAAGAAGGATGAGGCATCCCCTTCGGATTCTGGATATGTTAAGAGTTGGTGGAAGAAGTTGTGGTCTCTGAAGATACCCTCCAAAG
TTAAACATTTCATTTGGCGGGTTTATAATAATTGTATTCCAACAATGACAAATTTGTTTCGACGCCATGTGGTTTATGATGCACTATGTCCAATCTGCAAAAATGGACCG
GAAACCTCTGATCATGCACTTTTTAGATGTTGTCGAGCTAAAAAGTTTTGGAGAGAATTTCACAGCAAGATTAGCATCCGTGATTATAATCATGGATCAATTCAAGATAG
ATTTTTAGAAATTTATCATTCAGAGCAACAGGACATGTTGGCTTGGATTTGTATGGGTGCTTGGGCACTATGGAATGACCATAACGCTCTAGTTCACGATGCTGCTTATG
ATTTGCAAAGAGGCAAGTGCGGTTTTGGCGCCATCATCAGGACACTGTGGAAATGGGAGATTGATATCTACTTTGATTTCCTAGGACTGATTACCATGCTCAATGGTTTA
ACAGACGAATTCACGGAAGCACATGCTTTTCTTTGGGATATAAAGTTGATGAGTAATTCATTTTCATCAATTCAGTTCATTCATGCTAGTAGAGCATGCAATACAATTGC
TCATAACTTAGCTCAGATCGGACTTAGCTCTGATTCCATGTTATGGCTTAGGATTTTCCTTCCTGGCTAG
Protein sequenceShow/hide protein sequence
MQKKQKLYDDECNEAGLVNSPARKYECFVLECSWVGDPTSIPESCQGGSFTWCNRRELEDQVLTRIDRFLVNQQFLDLFQDCKVFNKDWAKSNHRPILLQTGLCLEYFRC
GRSRRFRFEEFWTRSNDCKDIIHNCADWSAKGTTVWFHLQANKRRQQNKIMGILDSNGQWQSNPDVIGSNFVNYFKSIFSSQIPSEFEVSQIVSRIPQKVIDEMNIDLLW
PFTKEEIENVIKGFHPSKAPGPDGFPAIFYQKNWEIVGDQTISDILAVLNQGHQVREWNHTHIALIPKVRDARFVLNLRYTGVQDQSSYLSSILQMKVVKNLDSYLGLPS
SFSRSKSRDFHKIMDRVWSYLQGWKGKMFSGGGKEVLIKSMVQAIPVYAMSCFRLPKVLLSKITSLYARFWWGSNDGVAYIQQPQLDPGKGPITRSKAKKIQEAFTLHVQ
KLANAQRDAENFESCIIPSIPDKWIWHYRKSGKYSVRSGYKAYMLKKDEASPSDSGYVKSWWKKLWSLKIPSKVKHFIWRVYNNCIPTMTNLFRRHVVYDALCPICKNGP
ETSDHALFRCCRAKKFWREFHSKISIRDYNHGSIQDRFLEIYHSEQQDMLAWICMGAWALWNDHNALVHDAAYDLQRGKCGFGAIIRTLWKWEIDIYFDFLGLITMLNGL
TDEFTEAHAFLWDIKLMSNSFSSIQFIHASRACNTIAHNLAQIGLSSDSMLWLRIFLPG