; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012419 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012419
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:40911228..40918491
RNA-Seq ExpressionLag0012419
SyntenyLag0012419
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]1.7e-11832.44Show/hide
Query:  GNYGKRISLARQQ---VQRALNSQGDVGGAR--KQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIV
        GN  K+++ A ++   +Q  L +   V  A+  + + +LL ++EI WRQRSRV WL+ GD+N+ +FH RA+ R +RNR+ G+ D    W TEE+ +G + 
Subjt:  GNYGKRISLARQQ---VQRALNSQGDVGGAR--KQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIV

Query:  ETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------RKQGAVG----
          YF+ LF+S+  Q+   ++  L  V   +    N+ L + FTREE+  +L QM  +KAPG D +PALF+Q++W +VG++           +G+V     
Subjt:  ETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------RKQGAVG----

Query:  -----------------------------------------------------------------------------------WVALKLDMSKAYDRVEW
                                                                                            +ALKLDM+KAYDRVEW
Subjt:  -----------------------------------------------------------------------------------WVALKLDMSKAYDRVEW

Query:  FFLEKFILALGLDGPGAEASMIARCLKAYSML---TGVGFPDSQNG--------------------------------------------------NEND
         FL   +L LG         M       +S+L   T VG    Q G                                                  +++ 
Subjt:  FFLEKFILALGLDGPGAEASMIARCLKAYSML---TGVGFPDSQNG--------------------------------------------------NEND

Query:  LFPKSGMNLTWAAISFYRVIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFS
        LF K+      A  + ++   E+ TGQ++NY KS + +SPN        I   L V +V  H+ YLGLP I    +    + +KD++W +I  WK    S
Subjt:  LFPKSGMNLTWAAISFYRVIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFS

Query:  AGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVL
          GKE+LIK VLQ +PTY MSCFQ+   L KE N IMARFWW   ++ R +HW  W+ LC  K  GGLGFRDL+ FN+ LLAKQ W+++ +P+SL+ R+ 
Subjt:  AGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVL

Query:  KGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLLCPDNSWNVGLVQSLFQGVDAE
        + +Y     F+E E   N SF+W S+ WG+ LL +G+RWRV  G  + V  D+W+   S  +I+    +     V  L      WNV L++ +F   + +
Subjt:  KGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLLCPDNSWNVGLVQSLFQGVDAE

Query:  AILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRAS---SSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPR
        AIL+IP    +  D L+WHYE++GMY+VKSGY L  +   + S   S+  D   ++WK +W+ +IP+KIK F WR  +D+LP    L  R +    +CP+
Subjt:  AILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRAS---SSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPR

Query:  CKLAEETTFHALWEYATVCKATK
        C    E+  HA+W    +C+A K
Subjt:  CKLAEETTFHALWEYATVCKATK

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.9e-13036.85Show/hide
Query:  RSKRGNYGKRISLARQQVQRALN------SQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEE
        R+K GN+  R+ +A   +Q A++      ++     A   +  LL EEEI+WRQRSR  W + GDRN+KWFH +A+ RRR N ++GL D  G W   + +
Subjt:  RSKRGNYGKRISLARQQVQRALN------SQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEE

Query:  MGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------------
        +  ++E+YF  LF+S+    S  +++    +   V    N  L +PF  EE++ +L Q+   KA G D   + FY+ FW +VG +               
Subjt:  MGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------------

Query:  -----------------RKQGA------------VGWVALKLDMSKAYDRVEWFFLEKFILALGLD------GPGAEASMIARCLKAYSMLTGVGFPDSQ
                         + Q A            +G+ +LKLDMSKAYDRVEW FLE  +L +G D              I R L+ Y    G       
Subjt:  -----------------RKQGA------------VGWVALKLDMSKAYDRVEWFFLEKFILALGLD------GPGAEASMIARCLKAYSMLTGVGFPDSQ

Query:  NGNENDLFPKSGMNLTWAAISFYRVIGELSTGQEVNYGK---SGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYI
                     +L      F R++G   +  E+ Y K     +   P   +     I ++L V +V    +YLGLP   P  + M   ++KDR+W ++
Subjt:  NGNENDLFPKSGMNLTWAAISFYRVIGELSTGQEVNYGK---SGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYI

Query:  HKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVES
          WK   FS GGKEVLIK V Q +P Y MSCF+L   L++E + I ARFWWGS +E +K+HW +W  L +PK  GG+GFRDL+LFNK LLAKQ W+++  
Subjt:  HKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVES

Query:  PDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLL-CPDNSWNVGLV
        P+S+L RVLKG+YF++ SFME +  GN S++WRS++WGR LLK+GLRWR+ +G+ V +  D W+    TL+I+    +     VS L+   +  W   +V
Subjt:  PDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLL-CPDNSWNVGLV

Query:  QSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCM---LMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKR
        +  F   +A+ IL IP    +  D+L+W+YEK G+Y+V+SGY +  +    +   SSS+ + VR WW   W   IP+KIK F WRL  D LPT  NL KR
Subjt:  QSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCM---LMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKR

Query:  GVEVPTVCPRCKLAEETTFHALWEYATVCK
        GVE+   C  C    E + H  W    +CK
Subjt:  GVEVPTVCPRCKLAEETTFHALWEYATVCK

XP_030496634.1 uncharacterized protein LOC115712492 [Cannabis sativa]3.4e-11930.5Show/hide
Query:  KRGKKKKEKKKSTAASGGR-----WSPVAGHRKKNK---RKEKDVVEKGWCQPFRGTRIIN--------ESLVEVWHEKVRNCASELSSWGRSKRGNCGK
        K   K KEKK  TA   G+        +  H   N+   R+ +   EK W Q    T +I         E+   ++   ++ C+  L  W R K GN  K
Subjt:  KRGKKKKEKKKSTAASGGR-----WSPVAGHRKKNK---RKEKDVVEKGWCQPFRGTRIIN--------ESLVEVWHEKVRNCASELSSWGRSKRGNCGK

Query:  KISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRGNYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVD
        KIS A++QV    N  GD                                                R+L +  ++  +   L++LL +EE YW+QRSRVD
Subjt:  KISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRGNYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVD

Query:  WLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQM
        WLQ GD+N+K+FH  A+ R++ N ++ L D  G  +T +  M  ++ T+F  LFT+++      L   L+ +P  V  + NE L RPFT ++V  +LK +
Subjt:  WLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQM

Query:  GQSKAPGEDELPALFYQRFWSVVGN---------------------------------------------------------------------------
           K+PG D + A+FY ++WS+VGN                                                                           
Subjt:  GQSKAPGEDELPALFYQRFWSVVGN---------------------------------------------------------------------------

Query:  ---------------------ERKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLKAYS---MLTG--VGFPDSQNG------
                              + +G  G+ ALKLDMSKA+DRVEW ++++ +  +G        +MI  CL + S   ML G  VG+     G      
Subjt:  ---------------------ERKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLKAYS---MLTG--VGFPDSQNG------

Query:  ------------------NENDLFPKSGMNLT-----------------------WAAISFYRVIG--ELSTGQEVNYGKSGICVSPNVDEHLKQSIASM
                           E  +    G+ LT                         A++  RV+     ++GQ +N  KS +  SPN  +  K      
Subjt:  ------------------NENDLFPKSGMNLT-----------------------WAAISFYRVIG--ELSTGQEVNYGKSGICVSPNVDEHLKQSIASM

Query:  LGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHW
        LG+ +   H+RYLGLPA     K      VK+R+W  +H W    FS GGKEVL+K V+Q +PTY MSCF+L  +   +   +MA FWWGS ++G K+HW
Subjt:  LGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHW

Query:  ASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDR
         SWK LC  K  GG+GFR    FNK LLAKQ W+++E P+SLL R+LK +YF  ++F+E     + S  W+ + WGR LL EGLR+++ +G  V+   DR
Subjt:  ASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDR

Query:  WISRSSTLRIIDGNGIDRDMSVSMLLCPDNSWNVGLVQSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW
        WI   +  + +  NG D  M+VS  +     WN+ L+   FQ +D + I+ IP   F T D+L+WH+   G YTV SG++L   L     +    S   W
Subjt:  WISRSSTLRIIDGNGIDRDMSVSMLLCPDNSWNVGLVQSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW

Query:  WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHALWEYATVCK
        WK  W+  +PSK+K F WR+  + LP    L +R V     C  C  A E+  HAL+   T  K
Subjt:  WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHALWEYATVCK

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]9.9e-11929.95Show/hide
Query:  KQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGRE
        K++++LL ++EIYW QRSR++WL+ GDRN+K+FH +A+QRRR+N + G+ +  G+WV   EE+GQ+   YF  LF +        ++E L  V   V  +
Subjt:  KQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGRE

Query:  ENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE-----------------------------------------------------
          E L   FT EEV  +L QMG +KAPG D + ALFYQ+FW +VG+                                                      
Subjt:  ENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE-----------------------------------------------------

Query:  -------------------------------------------RKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLKAYS---
                                                   RK+G  G VALKLD+SKAYDRVEW FL+  +  +G      E  M      ++S   
Subjt:  -------------------------------------------RKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLKAYS---

Query:  -------------------------------------------MLTGVGFPDSQNGNENDLFPKSGMNLTWAAISFYRVIGEL------STGQEVNYGKS
                                                   M+TGV          N +F    +    A  +    I E+      ++GQ +N  KS
Subjt:  -------------------------------------------MLTGVGFPDSQNGNENDLFPKSGMNLTWAAISFYRVIGEL------STGQEVNYGKS

Query:  GICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECN
            S N  E  K  I  +LGV+ V    +YLGLP +    K  +   +KDR+W  +  WK    S  GKE+LIK V Q +PTY MS FQ+ + L  E  
Subjt:  GICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECN

Query:  RIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLK
         + ARFWWG     RK+HW SW KL  PK  GG+GFRDL+ FN  +LAKQGW+LV+  DSLL R  K +YF  SSF+E +   N SFVWRS+M  + +L+
Subjt:  RIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLK

Query:  EGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRD---MSVSMLLCPD-NSWNVGLVQSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKS
         G  WRV +G  +N  +DRW+    T +++  N + RD   M V+ L+ P+ N WN   ++++F   +AEAI +IP      PD + W Y   G+++VKS
Subjt:  EGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRD---MSVSMLLCPD-NSWNVGLVQSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKS

Query:  GYNLVCMLM---GRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHALWEYATV------------
         Y++   ++    R  +S     +  W  +W  R+P+K+K F WR  ++ LPT  NL  R +     C  C    E+T HALW+ A V            
Subjt:  GYNLVCMLM---GRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHALWEYATV------------

Query:  -----------------------------------------------CKATKSFNL--------------------------------------------
                                                        K   S NL                                            
Subjt:  -----------------------------------------------CKATKSFNL--------------------------------------------

Query:  ----------GAIIRDAEGRVMLTLMKLVHYVLDVDALEAMAIRDSLLVAREAGLLRLEVESDSARVVAIIRSKQRDYSEVGVLVQEIIQILKGLQYRSV
                  GAIIR+ +G VM  +      V + D  E +A R +L    +AG  RL VE D+  V   I S   + S  G ++ +I  +L+ LQ  SV
Subjt:  ----------GAIIRDAEGRVMLTLMKLVHYVLDVDALEAMAIRDSLLVAREAGLLRLEVESDSARVVAIIRSKQRDYSEVGVLVQEIIQILKGLQYRSV

Query:  SWCRREANLVAYTAA
           RR  N VA+  A
Subjt:  SWCRREANLVAYTAA

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]4.5e-11930.79Show/hide
Query:  RKEKDVVEKGWCQPFRGTRIINESLVEVWHEKVRNCA-SELS---SWGRSKRGNC-GKKISLARQQVQRALNSRGDVGGARKQLENLLG-EGMFEVVQGN
        R ++ V    W   F  +RI +       H+ +  C  SEL     WGR  R      K     Q +Q       D  G  +   ++ G     E +QG+
Subjt:  RKEKDVVEKGWCQPFRGTRIINESLVEVWHEKVRNCA-SELS---SWGRSKRGNC-GKKISLARQQVQRALNSRGDVGGARKQLENLLG-EGMFEVVQGN

Query:  FVSVLRRSKRGNYGKRISLARQQVQRALNSQGDVGG------ARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRW
         +    R   G+    ++   ++++    S G V         R ++E L  +EE  W+QRSR  WL+ GDRN+ +FH RATQR +RN + GLED  G+W
Subjt:  FVSVLRRSKRGNYGKRISLARQQVQRALNSQGDVGG------ARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRW

Query:  VTEEEEMGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------
        V +EE++G++VE YFQ +FTS+N  +    +E LA +   +  E +  L R +  EEVL +LKQM    APG D +  +FY+ +W +VG +         
Subjt:  VTEEEEMGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------

Query:  ---------------------------------------------------------------------------------------RKQGAVGWVALKL
                                                                                               + QG +G++ALKL
Subjt:  ---------------------------------------------------------------------------------------RKQGAVGWVALKL

Query:  DMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSMLTG---------------VGFPDSQNG---------NENDLFPKSGMNLTWAAISF
        DMSKAYD+VEW FL K +  LG   P    S+I+ C+   +YS+L                  G    +NG         +++ LF ++ +      +  
Subjt:  DMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSMLTG---------------VGFPDSQNG---------NENDLFPKSGMNLTWAAISF

Query:  YRVIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVP
          V  E  +GQ++N  K+ I  S N    ++  I  +LGV  +  +++YLGLPA+    K  S  ++K+R+W  +  WK    S  G+EVLIK V+Q +P
Subjt:  YRVIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVP

Query:  TYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETEHK
        TY MSCF+L   L+KE   ++ +FWW    E RKVHW  W++LC  K +GG+GF+D++ FN  LLAKQ W+++ +PDSL  RV K ++F + S +E +  
Subjt:  TYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETEHK

Query:  GNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLLCPD-NSWNVGLVQSLFQGVDAEAILEIPRPHFSTPD
          GS+ W+S++  R +++EG+ WR+ +G+ V +  D+W+   S   +I     +     VS L+  D   WN  LV  +F    A  I  +P      PD
Subjt:  GNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLLCPD-NSWNVGLVQSLFQGVDAEAILEIPRPHFSTPD

Query:  KLVWHYEKHGMYTVKSGYNLVCM--LMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHALWEY
        +++W     GM+T KS Y L+        A +S+  + R++W+ LW  R+P+K+K F WR   D LPT++NL +R +    +C  C+   E T HALW  
Subjt:  KLVWHYEKHGMYTVKSGYNLVCM--LMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHALWEY

Query:  ATV
        +T+
Subjt:  ATV

TrEMBL top hitse value%identityAlignment
A0A2N9F7C1 Uncharacterized protein1.0e-12433.52Show/hide
Query:  VVEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRG
        VV + W    RG+      L +V   K++ C  ELS W R++ GN  K +    + +++A            ++++ LG G   V+              
Subjt:  VVEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRG

Query:  NYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQG
                                  R+++  LL +EE  W+QRSRV WL+ GDRN+K+FH RA+QRRRRN ++ L    G  VT  E +G     Y+Q 
Subjt:  NYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQG

Query:  LFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE--------RKQGAVGWVALKLDMSKA
        LFT+N  ++   ++  L  +  CV +  N+ L   FT EEV+ ++KQM   KAPG D +P +FYQ +W VVG +           G VG +ALKLDMSKA
Subjt:  LFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE--------RKQGAVGWVALKLDMSKA

Query:  YDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSML-----TGVGFPDS--QNGNE-------------NDLFPKSGM-----------------N
        YDRVEW FL++ ++ +G        S+I  C+    YS+L     TG   P    + G+              N L  K+ +                 N
Subjt:  YDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSML-----TGVGFPDS--QNGNE-------------NDLFPKSGM-----------------N

Query:  LTWA--AISFYR-------------VIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHK
        L +A  ++ F R              I E ++GQ++N  K+ +  S N  + +++ +  +LGV  +  +++YLGLP++    K+     +K+R+W+ +  
Subjt:  LTWA--AISFYR-------------VIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHK

Query:  WKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPD
        WK    S  G+E+LIK V+Q +PTY M+CF+L V+L KE   I+ RFWWG   E RK+HW  W+KLC PKG GGLGFRDLQ FN  LLAKQ W+L+ + +
Subjt:  WKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPD

Query:  SLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLL-CPDNSWNVGLVQ
        SLL +V   K+F   + ME   K  GSF WRS++  + L++ G+ WRV DG  + +    W+      R++     + +D  V  L+      WN+  VQ
Subjt:  SLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLL-CPDNSWNVGLVQ

Query:  SLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW-----WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQK
        SLF   DAEAIL+IP       DK+ W   + G Y+V+SGY L   L+  A     +S R W     WK +W   +P+K+K F WR  +D LPT + L +
Subjt:  SLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW-----WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQK

Query:  RGVEVPTVCPRCKLAEETTFHALWEYATVCKATKSFNLGAIIRDAEGRVMLTLMKLVHYVLDVDA---LEAMAI
        R V    +C  C+   E + HALW  + V +        ++ ++ + R   +   LV +V+  D+   LE +A+
Subjt:  RGVEVPTVCPRCKLAEETTFHALWEYATVCKATKSFNLGAIIRDAEGRVMLTLMKLVHYVLDVDA---LEAMAI

A0A2N9GPD6 RNase H domain-containing protein1.7e-12433.41Show/hide
Query:  VVEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRG
        VV + W    RG+      L +V   K++ C  ELS W R++ GN  K +    + +++A            ++++ LG G   V+              
Subjt:  VVEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRG

Query:  NYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQG
                                  R+++  LL +EE  W+QRSRV WL+ GDRN+K+FH RA+QRRRRN ++ L    G  VT  E +G     Y+Q 
Subjt:  NYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQG

Query:  LFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE--------RKQGAVGWVALKLDMSKA
        LFT+N  ++   ++  L  +  CV +  N+ L   FT EEV+ ++KQM   KAPG D +P +FYQ +W VVG +           G VG +ALKLDMSKA
Subjt:  LFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE--------RKQGAVGWVALKLDMSKA

Query:  YDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSML-----TGVGFPDS--QNGNE-------------NDLFPKSGM-----------------N
        YDRVEW FL++ ++ +G        S+I  C+    YS+L     TG   P    + G+              N L  K+ +                 N
Subjt:  YDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSML-----TGVGFPDS--QNGNE-------------NDLFPKSGM-----------------N

Query:  LTWA--AISFYR-------------VIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHK
        L +A  ++ F R              I E ++GQ++N  K+ +  S N  + +++ +  +LGV  +  +++YLGLP++    K+     +K+R+W+ +  
Subjt:  LTWA--AISFYR-------------VIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHK

Query:  WKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPD
        WK    S  G+E+LIK V+Q +PTY M+CF+L V+L KE   I+ RFWWG   E RK+HW  W+KLC PKG GGLGFRDLQ FN  LLAKQ W+L+ + +
Subjt:  WKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPD

Query:  SLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLL-CPDNSWNVGLVQ
        SLL +V   K+F   + ME   K  GSF WRS++  + L++ G+ WRV DG  + +    W+      R++     + +D  V  L+      WN+  VQ
Subjt:  SLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLL-CPDNSWNVGLVQ

Query:  SLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW-----WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQK
        SLF   DAEAIL+IP       DK+ W   + G Y+V+SGY L   L+  A     +S R W     WK +W   +P+K+K F WR  +D LPT + L +
Subjt:  SLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW-----WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQK

Query:  RGVEVPTVCPRCKLAEETTFHALWEYATVCKATKSFNLGAIIRDAEGRVMLTLMKLVHYVLDVDA---LEAMAI
        R V    +C  C+   E + HALW  + + +        ++ ++ + R   +   LV +V+  D+   LE +A+
Subjt:  RGVEVPTVCPRCKLAEETTFHALWEYATVCKATKSFNLGAIIRDAEGRVMLTLMKLVHYVLDVDA---LEAMAI

A0A2N9HS90 RNase H domain-containing protein8.5e-12432.42Show/hide
Query:  VEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRGN
        + K W    RG+ ++  +      EK++ C + L+SW R + G+  K +    +Q++RA            ++++  G G   V+               
Subjt:  VEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRGN

Query:  YGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQGL
                                 RK++  LL +EE  W+QRSR  WL+ GD N+K+FH RA+ R+RRN +  L    G  VT+E  +G     Y+Q L
Subjt:  YGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQGL

Query:  FTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------------------------
        FT+   ++   +   L  +  CV +E N+ L  PFT EEV  ++KQMG  KAPG D +P +FYQ +W VVG +                           
Subjt:  FTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------------------------

Query:  ------------RKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSML-----TGVGFPDSQNGNENDLFPKSGMNLT
                    ++ G VG +ALKLDMSKAYDRVEW F+EK +  +G        S+I  C+   +YS+L     TG   P      ++ L       L 
Subjt:  ------------RKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLK--AYSML-----TGVGFPDSQNGNENDLFPKSGMNLT

Query:  WAAISFYRVIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKV
           I       E ++GQ++N  K+ +  S N  + +++ I  +LGV  +  +++YLGLP++    K+     +K+R+W+ +  WK    S  G+EVLIK 
Subjt:  WAAISFYRVIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIKV

Query:  VLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSF
        V+Q +PTY M+CF+L V+L KE   I+ RFWWG   + RK+HW  W+K+C  KG GGLGFRDLQ FN  LLAKQ W+ + + +SLL +V   K+F   + 
Subjt:  VLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSF

Query:  METEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLLCPDNS-WNVGLVQSLFQGVDAEAILEIPRP
        +E   K  GSF WRS++  + L+  GL WRV DG  + +    W+      RI+     +  D  V  L+      WN+  +Q+LF   D +AIL+IP  
Subjt:  METEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVSMLLCPDNS-WNVGLVQSLFQGVDAEAILEIPRP

Query:  HFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW-----WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEE
             D+L W   ++G Y+V+SGY L+C     A +   +S R+W     WK +W  R+P+KIK F WR  +D LPT + L +R V    +C  C+   E
Subjt:  HFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRW-----WKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEE

Query:  TTFHALWEYATVCKATKSFNLGAIIRDAEGRVMLTLMKLVHYVL--DVDALEAMAIRDSLLVAREAGLLRLEVESD
         + HALW    V   ++ ++L     D +    ++L  L+  ++  + D L       S L+  +    RL + SD
Subjt:  TTFHALWEYATVCKATKSFNLGAIIRDAEGRVMLTLMKLVHYVL--DVDALEAMAIRDSLLVAREAGLLRLEVESD

A0A6J1DAR4 uncharacterized protein LOC1110189549.3e-13136.85Show/hide
Query:  RSKRGNYGKRISLARQQVQRALN------SQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEE
        R+K GN+  R+ +A   +Q A++      ++     A   +  LL EEEI+WRQRSR  W + GDRN+KWFH +A+ RRR N ++GL D  G W   + +
Subjt:  RSKRGNYGKRISLARQQVQRALN------SQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEE

Query:  MGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------------
        +  ++E+YF  LF+S+    S  +++    +   V    N  L +PF  EE++ +L Q+   KA G D   + FY+ FW +VG +               
Subjt:  MGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNE---------------

Query:  -----------------RKQGA------------VGWVALKLDMSKAYDRVEWFFLEKFILALGLD------GPGAEASMIARCLKAYSMLTGVGFPDSQ
                         + Q A            +G+ +LKLDMSKAYDRVEW FLE  +L +G D              I R L+ Y    G       
Subjt:  -----------------RKQGA------------VGWVALKLDMSKAYDRVEWFFLEKFILALGLD------GPGAEASMIARCLKAYSMLTGVGFPDSQ

Query:  NGNENDLFPKSGMNLTWAAISFYRVIGELSTGQEVNYGK---SGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYI
                     +L      F R++G   +  E+ Y K     +   P   +     I ++L V +V    +YLGLP   P  + M   ++KDR+W ++
Subjt:  NGNENDLFPKSGMNLTWAAISFYRVIGELSTGQEVNYGK---SGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYI

Query:  HKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVES
          WK   FS GGKEVLIK V Q +P Y MSCF+L   L++E + I ARFWWGS +E +K+HW +W  L +PK  GG+GFRDL+LFNK LLAKQ W+++  
Subjt:  HKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVES

Query:  PDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLL-CPDNSWNVGLV
        P+S+L RVLKG+YF++ SFME +  GN S++WRS++WGR LLK+GLRWR+ +G+ V +  D W+    TL+I+    +     VS L+   +  W   +V
Subjt:  PDSLLCRVLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLL-CPDNSWNVGLV

Query:  QSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCM---LMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKR
        +  F   +A+ IL IP    +  D+L+W+YEK G+Y+V+SGY +  +    +   SSS+ + VR WW   W   IP+KIK F WRL  D LPT  NL KR
Subjt:  QSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCM---LMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKR

Query:  GVEVPTVCPRCKLAEETTFHALWEYATVCK
        GVE+   C  C    E + H  W    +CK
Subjt:  GVEVPTVCPRCKLAEETTFHALWEYATVCK

A0A803PV25 Uncharacterized protein1.2e-12531.82Show/hide
Query:  KNKRKEKDVVEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGD-VGGARKQLENLLGEGMFEVVQGNFV
        K KRK +   E+ WCQ    T I++     +W E+  N +  + S+ R K   CGK +    ++ +  LNS  + +  A  +L  +   G++E +     
Subjt:  KNKRKEKDVVEKGWCQPFRGTRIINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGD-VGGARKQLENLLGEGMFEVVQGNFV

Query:  SVLRRSKRGNYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMG
                           QQ++  LN              LL ++E YWRQRSR  WLQWGDRN+K+FH +A+ RR++N ++GL+D  G W  ++  + 
Subjt:  SVLRRSKRGNYGKRISLARQQVQRALNSQGDVGGARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMG

Query:  QIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWS--------VVGNERKQGA----
        +IVE Y++ LFTS++  ES  L E L+ V   V    N  L   F  EEV+ ++K+M  +KAPG D LPALFYQ+FWS        V  N    GA    
Subjt:  QIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWS--------VVGNERKQGA----

Query:  --------------------------------------------------------------------VGW----------------VALKLDMSKAYDR
                                                                            VG+                VALKLDM+KAYDR
Subjt:  --------------------------------------------------------------------VGW----------------VALKLDMSKAYDR

Query:  VEWFFLEKFILALGL------------------------------------DGPGAEASMIARCLKAYSMLT----------GVGFPDSQNGNENDLFPK
        VEW FLE  ++ LG                                      G      +   C +A+S L           GV F   +    +  F  
Subjt:  VEWFFLEKFILALGL------------------------------------DGPGAEASMIARCLKAYSMLT----------GVGFPDSQNGNENDLFPK

Query:  SGMNLTWAAISFYRVIGEL------STGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSF
          +    A     R   EL      ++GQ VN+ KS +C   +V   ++  +A+ +GV++V  + +YLGLP+     K    +F+ +++W  +  WK SF
Subjt:  SGMNLTWAAISFYRVIGEL------STGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSF

Query:  FSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCR
        FSA GKEVLIK ++Q +PTY MSCF+L    +   + + ARFWWGS E+  K+HW  W  LC  K +GGLGFRDL LFN+ LLAKQ W+ +  P+SL  +
Subjt:  FSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCR

Query:  VLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLLCPDNSWNVGLVQSLFQGVD
        VLK  Y+     +E +   + SFVWRS++WG+ +++ G RWR+ +GN V V  D W+ R  T +I D   +  ++ V  L   +  W+   V+++F   D
Subjt:  VLKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLLCPDNSWNVGLVQSLFQGVD

Query:  AEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRC
        AE IL++        DK++WHY K G Y+V+SGY +   L  R   SN ++  RWW+ LW  +IP K+K F W++ + ++PT + L  R V++   C RC
Subjt:  AEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRC

Query:  KL-AEETTFHALWEYATVCKATKSFNLGA-IIRDAEGRVMLTLMKL
           A E  FHALW     C   K     + I R  +  V+  LM++
Subjt:  KL-AEETTFHALWEYATVCKATKSFNLGA-IIRDAEGRVMLTLMKL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657507.8e-3427.46Show/hide
Query:  VKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLA
        + +R+ + +  W+    S  G+  L K VL  +P + MS   L  S++   +++   F WGS  E +K H   W K+C PK  GGLG R  +  N+ L++
Subjt:  VKDRMWAYIHKWKHSFFSAGGKEVLIKVVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLA

Query:  KQGWKLVESPDSLLCRVLKGKY----FRESSFMETEHKGNGSFVWRSMMWG-RSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVS
        K GW+L++  +SL   VL+ KY     R+S ++    KG+ S  WRS+  G R ++  G+ W   DG  +    DRW+S    L + +G    D D  V+
Subjt:  KQGWKLVESPDSLLCRVLKGKY----FRESSFMETEHKGNGSFVWRSMMWG-RSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDG-NGIDRDMSVS

Query:  M-LLCPDNSWNVGLVQS-LFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLY
          L  P   W+   +          E    +        D+L W + + G ++V+S Y ++ +           ++  ++  LW  R+P ++K F W + 
Subjt:  M-LLCPDNSWNVGLVQS-LFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLY

Query:  YDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHAL
           + T     +R +    VC  CK   E+  H L
Subjt:  YDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHAL

P93295 Uncharacterized mitochondrial protein AtMg003106.9e-3045.27Show/hide
Query:  VPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPK-GRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMET
        +P Y MSCF+L   L K+    M  FWW S E  RK+ W +W+KLC  K   GGLGFRDL  FN+ LLAKQ ++++  P +LL R+L+ +YF  SS ME 
Subjt:  VPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPK-GRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMET

Query:  EHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTL
              S+ WRS++ GR LL  GL   + DG    V  DRWI   + L
Subjt:  EHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.6e-0528.1Show/hide
Query:  ARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGL---EDGFGRWVTEEEEMGQIVETYFQG----LFTSNNQQESGNLQEFLA
        ARK+        E ++RQ+SR+ WLQ GD N+++FH+     + +N ++ L   +D     VT+ +EM     T+  G    + T ++ Q   ++  F  
Subjt:  ARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGL---EDGFGRWVTEEEEMGQIVETYFQG----LFTSNNQQESGNLQEFLA

Query:  HVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVV
           +C     + L   P + +E+  ++  M ++KAPG D   A F+   W VV
Subjt:  HVPQCVGREENELLKRPFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVV

AT3G09510.1 Ribonuclease H-like superfamily protein3.6e-1827.73Show/hide
Query:  LKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLLCPDNS---WNVGLVQSLFQG
        +K +YF++ S ++ + +   S+ W S++ G +LLK+G R  + DG  + +  D  I  S   R ++     ++M+++ L     S   W+   +      
Subjt:  LKGKYFRESSFMETEHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLLCPDNS---WNVGLVQSLFQG

Query:  VDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVR------RWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVE
         D   I  I       PDK++W+Y   G YTV+SGY     L+    S+N  ++            +W+  I  K+K F WR     L T   L  RG+ 
Subjt:  VDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSSNEDSVR------RWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVE

Query:  VPTVCPRCKLAEETTFHALW
        +   CPRC    E+  HAL+
Subjt:  VPTVCPRCKLAEETTFHALW

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.0e-0551.43Show/hide
Query:  RKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALG
        RK+G  GW+ LKLD+ KAYDR+ W +LE  +++ G
Subjt:  RKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALG

AT4G29090.1 Ribonuclease H-like superfamily protein1.9e-4330.1Show/hide
Query:  VPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETE
        +PTY M+CF L  ++ K+   ++A FWW +++E + +HW +W  L   K  GG+GF+D++ FN  LL KQ W+++  P+SL+ +V K +YF +S  +   
Subjt:  VPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETE

Query:  HKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWIS---RSSTLRI-----IDGNGIDRDMSVSMLLCPD-NSWNVGLVQSLFQGVDAEAILEI
             SFVW+S+   + +L++G R  V +G  + + R +W+     S+ LR+      +   +   + VS L+      W   +++ LF  V+ + I E+
Subjt:  HKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWIS---RSSTLRI-----IDGNGIDRDMSVSMLLCPD-NSWNVGLVQSLFQGVDAEAILEI

Query:  PRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSS---NEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAE
                D   W Y   G YTVKSGY ++  ++ + SS    +E S+   ++ +W  +   KI+ F W+   + LP    L  R +   + C RC   +
Subjt:  PRPHFSTPDKLVWHYEKHGMYTVKSGYNLVCMLMGRASSS---NEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAE

Query:  ETTFHALWE
        ET  H L++
Subjt:  ETTFHALWE

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.9e-3145.27Show/hide
Query:  VPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPK-GRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMET
        +P Y MSCF+L   L K+    M  FWW S E  RK+ W +W+KLC  K   GGLGFRDL  FN+ LLAKQ ++++  P +LL R+L+ +YF  SS ME 
Subjt:  VPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPK-GRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMET

Query:  EHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTL
              S+ WRS++ GR LL  GL   + DG    V  DRWI   + L
Subjt:  EHKGNGSFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGAGTCTCTGTCTCGATGGAAATGGTGGGTCGTTTTATATGGGAAGGAAAGATTAGTATAATGTCCAAGGGTCAACCTTTGTTTCAATTTCTTGAAGCGGAGGC
TAACGGTTGCCGTAAGCGACTACGTAAGCGGTGGGGACAGGAAGCAAATGTAGGGAAGAAGAAAAAGAGAAAAAGAGGAAAGAAGAAAAAGGAAAAAAAAAAATCGACGG
CGGCGAGCGGTGGCCGCTGGTCGCCGGTCGCTGGACATAGGAAGAAGAATAAGAGGAAGGAGAAGGATGTGGTAGAGAAGGGTTGGTGTCAGCCTTTCAGAGGGACTAGA
ATTATCAATGAGTCGTTGGTTGAAGTCTGGCATGAGAAAGTGAGAAATTGTGCTTCTGAGTTGTCTAGTTGGGGCAGATCTAAAAGGGGTAACTGTGGGAAAAAGATCTC
TTTGGCTCGTCAGCAGGTGCAGAGGGCGCTGAATAGCCGAGGGGATGTTGGAGGGGCTAGAAAGCAGTTGGAAAATTTGTTGGGTGAGGGGATGTTCGAGGTAGTGCAAG
GAAATTTTGTTTCCGTTTTGAGGAGATCTAAAAGGGGTAACTATGGGAAAAGGATCTCTTTGGCTCGTCAGCAGGTGCAGAGGGCGCTGAATAGCCAAGGGGATGTTGGA
GGGGCTAGAAAGCAGCTGGAAAATTTGTTGGGTGAGGAAGAGATCTATTGGCGCCAGCGGTCTAGAGTGGACTGGTTGCAGTGGGGTGATAGGAACTCAAAATGGTTCCA
CCAGCGGGCAACTCAAAGGAGAAGGCGTAATAGGATGGAAGGGTTGGAGGATGGTTTTGGTAGATGGGTAACAGAGGAAGAGGAGATGGGTCAAATTGTTGAAACCTATT
TTCAAGGCTTGTTTACTTCAAACAATCAGCAAGAGAGTGGAAATTTGCAAGAATTCTTAGCTCATGTTCCACAGTGTGTGGGGAGAGAAGAGAATGAGCTGCTAAAACGG
CCATTTACTAGAGAGGAGGTGTTGGGTTCGCTGAAACAGATGGGTCAATCTAAAGCTCCAGGTGAGGATGAGTTGCCAGCCCTGTTTTATCAGAGGTTCTGGAGTGTTGT
GGGGAATGAGAGGAAACAAGGTGCGGTAGGTTGGGTTGCTCTAAAGTTAGACATGAGTAAGGCCTACGATAGGGTGGAGTGGTTTTTTCTAGAGAAGTTTATTCTTGCTT
TGGGTTTGGATGGGCCAGGGGCGGAGGCAAGTATGATTGCTAGATGCCTAAAGGCCTATTCCATGCTGACGGGTGTTGGGTTTCCAGACTCCCAAAATGGAAATGAGAAT
GATCTCTTCCCCAAAAGTGGAATGAATTTGACTTGGGCGGCCATCTCCTTTTATAGAGTGATAGGAGAGTTATCAACGGGGCAGGAGGTTAATTATGGGAAGTCTGGCAT
ATGCGTAAGTCCCAATGTGGATGAGCATTTGAAGCAGTCTATTGCCTCGATGTTGGGGGTGAGGCTTGTTACTTTTCATGATCGTTATCTAGGCCTTCCTGCTATTTTTC
CAGGTCGAAAGGTTATGTCGTTGAAGTTTGTGAAAGATCGAATGTGGGCCTATATCCATAAATGGAAGCATTCTTTCTTCTCAGCCGGGGGAAAGGAGGTACTTATAAAA
GTTGTTCTGCAAGTCGTTCCTACTTATGTCATGTCATGTTTTCAGCTGCTAGTGAGCCTAGTGAAAGAATGCAATCGGATCATGGCTAGATTCTGGTGGGGTAGTGAGGA
GGAAGGGAGGAAGGTGCATTGGGCTTCATGGAAGAAGTTGTGTGTGCCTAAAGGTAGGGGAGGGTTAGGCTTTCGGGACCTACAATTATTTAATAAAGGCCTGTTGGCGA
AACAAGGGTGGAAGTTGGTTGAATCTCCAGATTCTTTGTTGTGTAGAGTTCTAAAGGGGAAGTATTTTCGTGAGAGTTCCTTTATGGAGACAGAGCATAAGGGGAATGGG
TCTTTTGTATGGCGGAGTATGATGTGGGGAAGGTCGTTGTTAAAGGAGGGCTTGCGGTGGAGAGTACGAGATGGGAATCTAGTCAATGTTACTAGGGATAGGTGGATTTC
GAGGAGCTCAACTTTGAGGATTATTGATGGCAATGGGATTGACCGTGATATGAGTGTTAGTATGCTCCTGTGCCCAGATAACTCATGGAATGTGGGCTTAGTCCAGTCAT
TATTTCAAGGGGTTGATGCTGAAGCCATTCTTGAAATACCGAGGCCACATTTTTCCACCCCTGATAAGCTTGTGTGGCATTATGAGAAGCATGGGATGTATACGGTTAAA
AGTGGCTATAATTTGGTATGTATGTTGATGGGTAGAGCGAGTAGCTCCAATGAGGATTCGGTTAGGCGGTGGTGGAAGTTTCTTTGGAGTAGGCGGATTCCTAGTAAGAT
AAAAAAATTTTGTTGGAGACTATATTATGATTATTTGCCTACAGTTGCTAATCTCCAAAAGAGAGGGGTGGAGGTGCCTACTGTATGTCCTAGATGCAAATTGGCTGAAG
AGACTACCTTCCATGCTCTTTGGGAGTACGCAACAGTATGCAAAGCAACAAAGTCCTTCAATTTAGGAGCAATTATTCGAGATGCCGAAGGGAGAGTAATGCTTACCTTG
ATGAAATTAGTCCACTATGTGCTAGATGTGGATGCTCTAGAGGCTATGGCGATTCGTGACAGCCTGCTTGTTGCTAGGGAAGCTGGTCTTTTGAGATTGGAGGTGGAATC
AGACTCAGCTCGCGTTGTGGCCATCATTCGGTCGAAACAGAGGGACTACTCTGAAGTTGGGGTCCTGGTGCAAGAAATCATCCAGATCTTGAAGGGTTTGCAATACCGCT
CCGTGAGTTGGTGTCGGAGGGAGGCAAATCTTGTGGCGTATACAGCGGCATGTCAGGTGGTGGAGCTGGGTATTCAGGGCGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGAGTCTCTGTCTCGATGGAAATGGTGGGTCGTTTTATATGGGAAGGAAAGATTAGTATAATGTCCAAGGGTCAACCTTTGTTTCAATTTCTTGAAGCGGAGGC
TAACGGTTGCCGTAAGCGACTACGTAAGCGGTGGGGACAGGAAGCAAATGTAGGGAAGAAGAAAAAGAGAAAAAGAGGAAAGAAGAAAAAGGAAAAAAAAAAATCGACGG
CGGCGAGCGGTGGCCGCTGGTCGCCGGTCGCTGGACATAGGAAGAAGAATAAGAGGAAGGAGAAGGATGTGGTAGAGAAGGGTTGGTGTCAGCCTTTCAGAGGGACTAGA
ATTATCAATGAGTCGTTGGTTGAAGTCTGGCATGAGAAAGTGAGAAATTGTGCTTCTGAGTTGTCTAGTTGGGGCAGATCTAAAAGGGGTAACTGTGGGAAAAAGATCTC
TTTGGCTCGTCAGCAGGTGCAGAGGGCGCTGAATAGCCGAGGGGATGTTGGAGGGGCTAGAAAGCAGTTGGAAAATTTGTTGGGTGAGGGGATGTTCGAGGTAGTGCAAG
GAAATTTTGTTTCCGTTTTGAGGAGATCTAAAAGGGGTAACTATGGGAAAAGGATCTCTTTGGCTCGTCAGCAGGTGCAGAGGGCGCTGAATAGCCAAGGGGATGTTGGA
GGGGCTAGAAAGCAGCTGGAAAATTTGTTGGGTGAGGAAGAGATCTATTGGCGCCAGCGGTCTAGAGTGGACTGGTTGCAGTGGGGTGATAGGAACTCAAAATGGTTCCA
CCAGCGGGCAACTCAAAGGAGAAGGCGTAATAGGATGGAAGGGTTGGAGGATGGTTTTGGTAGATGGGTAACAGAGGAAGAGGAGATGGGTCAAATTGTTGAAACCTATT
TTCAAGGCTTGTTTACTTCAAACAATCAGCAAGAGAGTGGAAATTTGCAAGAATTCTTAGCTCATGTTCCACAGTGTGTGGGGAGAGAAGAGAATGAGCTGCTAAAACGG
CCATTTACTAGAGAGGAGGTGTTGGGTTCGCTGAAACAGATGGGTCAATCTAAAGCTCCAGGTGAGGATGAGTTGCCAGCCCTGTTTTATCAGAGGTTCTGGAGTGTTGT
GGGGAATGAGAGGAAACAAGGTGCGGTAGGTTGGGTTGCTCTAAAGTTAGACATGAGTAAGGCCTACGATAGGGTGGAGTGGTTTTTTCTAGAGAAGTTTATTCTTGCTT
TGGGTTTGGATGGGCCAGGGGCGGAGGCAAGTATGATTGCTAGATGCCTAAAGGCCTATTCCATGCTGACGGGTGTTGGGTTTCCAGACTCCCAAAATGGAAATGAGAAT
GATCTCTTCCCCAAAAGTGGAATGAATTTGACTTGGGCGGCCATCTCCTTTTATAGAGTGATAGGAGAGTTATCAACGGGGCAGGAGGTTAATTATGGGAAGTCTGGCAT
ATGCGTAAGTCCCAATGTGGATGAGCATTTGAAGCAGTCTATTGCCTCGATGTTGGGGGTGAGGCTTGTTACTTTTCATGATCGTTATCTAGGCCTTCCTGCTATTTTTC
CAGGTCGAAAGGTTATGTCGTTGAAGTTTGTGAAAGATCGAATGTGGGCCTATATCCATAAATGGAAGCATTCTTTCTTCTCAGCCGGGGGAAAGGAGGTACTTATAAAA
GTTGTTCTGCAAGTCGTTCCTACTTATGTCATGTCATGTTTTCAGCTGCTAGTGAGCCTAGTGAAAGAATGCAATCGGATCATGGCTAGATTCTGGTGGGGTAGTGAGGA
GGAAGGGAGGAAGGTGCATTGGGCTTCATGGAAGAAGTTGTGTGTGCCTAAAGGTAGGGGAGGGTTAGGCTTTCGGGACCTACAATTATTTAATAAAGGCCTGTTGGCGA
AACAAGGGTGGAAGTTGGTTGAATCTCCAGATTCTTTGTTGTGTAGAGTTCTAAAGGGGAAGTATTTTCGTGAGAGTTCCTTTATGGAGACAGAGCATAAGGGGAATGGG
TCTTTTGTATGGCGGAGTATGATGTGGGGAAGGTCGTTGTTAAAGGAGGGCTTGCGGTGGAGAGTACGAGATGGGAATCTAGTCAATGTTACTAGGGATAGGTGGATTTC
GAGGAGCTCAACTTTGAGGATTATTGATGGCAATGGGATTGACCGTGATATGAGTGTTAGTATGCTCCTGTGCCCAGATAACTCATGGAATGTGGGCTTAGTCCAGTCAT
TATTTCAAGGGGTTGATGCTGAAGCCATTCTTGAAATACCGAGGCCACATTTTTCCACCCCTGATAAGCTTGTGTGGCATTATGAGAAGCATGGGATGTATACGGTTAAA
AGTGGCTATAATTTGGTATGTATGTTGATGGGTAGAGCGAGTAGCTCCAATGAGGATTCGGTTAGGCGGTGGTGGAAGTTTCTTTGGAGTAGGCGGATTCCTAGTAAGAT
AAAAAAATTTTGTTGGAGACTATATTATGATTATTTGCCTACAGTTGCTAATCTCCAAAAGAGAGGGGTGGAGGTGCCTACTGTATGTCCTAGATGCAAATTGGCTGAAG
AGACTACCTTCCATGCTCTTTGGGAGTACGCAACAGTATGCAAAGCAACAAAGTCCTTCAATTTAGGAGCAATTATTCGAGATGCCGAAGGGAGAGTAATGCTTACCTTG
ATGAAATTAGTCCACTATGTGCTAGATGTGGATGCTCTAGAGGCTATGGCGATTCGTGACAGCCTGCTTGTTGCTAGGGAAGCTGGTCTTTTGAGATTGGAGGTGGAATC
AGACTCAGCTCGCGTTGTGGCCATCATTCGGTCGAAACAGAGGGACTACTCTGAAGTTGGGGTCCTGGTGCAAGAAATCATCCAGATCTTGAAGGGTTTGCAATACCGCT
CCGTGAGTTGGTGTCGGAGGGAGGCAAATCTTGTGGCGTATACAGCGGCATGTCAGGTGGTGGAGCTGGGTATTCAGGGCGTCTAG
Protein sequenceShow/hide protein sequence
MVRVSVSMEMVGRFIWEGKISIMSKGQPLFQFLEAEANGCRKRLRKRWGQEANVGKKKKRKRGKKKKEKKKSTAASGGRWSPVAGHRKKNKRKEKDVVEKGWCQPFRGTR
IINESLVEVWHEKVRNCASELSSWGRSKRGNCGKKISLARQQVQRALNSRGDVGGARKQLENLLGEGMFEVVQGNFVSVLRRSKRGNYGKRISLARQQVQRALNSQGDVG
GARKQLENLLGEEEIYWRQRSRVDWLQWGDRNSKWFHQRATQRRRRNRMEGLEDGFGRWVTEEEEMGQIVETYFQGLFTSNNQQESGNLQEFLAHVPQCVGREENELLKR
PFTREEVLGSLKQMGQSKAPGEDELPALFYQRFWSVVGNERKQGAVGWVALKLDMSKAYDRVEWFFLEKFILALGLDGPGAEASMIARCLKAYSMLTGVGFPDSQNGNEN
DLFPKSGMNLTWAAISFYRVIGELSTGQEVNYGKSGICVSPNVDEHLKQSIASMLGVRLVTFHDRYLGLPAIFPGRKVMSLKFVKDRMWAYIHKWKHSFFSAGGKEVLIK
VVLQVVPTYVMSCFQLLVSLVKECNRIMARFWWGSEEEGRKVHWASWKKLCVPKGRGGLGFRDLQLFNKGLLAKQGWKLVESPDSLLCRVLKGKYFRESSFMETEHKGNG
SFVWRSMMWGRSLLKEGLRWRVRDGNLVNVTRDRWISRSSTLRIIDGNGIDRDMSVSMLLCPDNSWNVGLVQSLFQGVDAEAILEIPRPHFSTPDKLVWHYEKHGMYTVK
SGYNLVCMLMGRASSSNEDSVRRWWKFLWSRRIPSKIKKFCWRLYYDYLPTVANLQKRGVEVPTVCPRCKLAEETTFHALWEYATVCKATKSFNLGAIIRDAEGRVMLTL
MKLVHYVLDVDALEAMAIRDSLLVAREAGLLRLEVESDSARVVAIIRSKQRDYSEVGVLVQEIIQILKGLQYRSVSWCRREANLVAYTAACQVVELGIQGV