; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy02g007120 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy02g007120
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr02:9598428..9602322
RNA-Seq ExpressionLcy02g007120
SyntenyLcy02g007120
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]6.7e-14936.94Show/hide
Query:  LKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNS
        +  LL EEE++W+QRSR+ W K GDRNT+WFH+ AS RR+ NE++GL D  G W+++   V+ +IE YF  ++++SRPS   I+Q    +   V+   N 
Subjt:  LKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNS

Query:  KLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMI-----VLISKKRSHKSVSEY--QPISLCNVSYK
         LLKPF +EE+++AL QIH +KA GPD     FY++ W +VG DV  C LNILNN +    L+E ++      +IS+ +S    + Y      L   S K
Subjt:  KLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMI-----VLISKKRSHKSVSEY--QPISLCNVSYK

Query:  L-VSKAKERV--------------------------AVAVRDILNCYERVSGQTVNLDKS---IISF--------------------------SPNTDVR
        L +SKA +RV                           + +  +L  Y    G  + L  S   I+S+                           P    R
Subjt:  L-VSKAKERV--------------------------AVAVRDILNCYERVSGQTVNLDKS---IISF--------------------------SPNTDVR

Query:  DRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWGGG
          + +  IL V +  C  QY GLP+FMPRN+    + IKDRVW+ +QGWK KLFS+GG+EVL+K++ QAIPCY+M+CF LPK+LI +      RFWWG  
Subjt:  DRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWGGG

Query:  R--------------------------------------------------------KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGE
        +                                                        +YF    F+ A +    S+IWRS+LWGR+LLKKGLRWRIGNG+
Subjt:  R--------------------------------------------------------KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGE

Query:  SVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMT-VSGKWNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLG-QSAWIAQL
        SV +YG NW+P+   L++ S   L   +RVS L+    G W   ++   F+P E   IL IP  R   ED+++W+Y+++G++SV+S Y++   +    Q 
Subjt:  SVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMT-VSGKWNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLG-QSAWIAQL

Query:  PSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFG------ILVNNVQ---
        PSSSS + +  WW G WKM IPNKIK+FLWRLCLDR PT  N    G+++ N C  CGR+GE S+H+FW CKF + + ++SKFG      IL  + +   
Subjt:  PSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFG------ILVNNVQ---

Query:  ---------VVWNCHNQRKFKGIGSVE--------GLVEWAGSYISSFQQATLSSGLCGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVR
                 V+W   NQR  +               LVEWA  Y   F++A  S+ + G R     E+ W PP  G YK+N DA+F      AGLG+I+ 
Subjt:  ---------VVWNCHNQRKFKGIGSVE--------GLVEWAGSYISSFQQATLSSGLCGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVR

Query:  DSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFP
        +  G+VM +A+   ++I+S +MAE  AAV+G++LA E+ + P
Subjt:  DSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFP

XP_023908468.1 uncharacterized protein LOC112020138 [Quercus suber]2.7e-13432.98Show/hide
Query:  EKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPIQNVGCRI
        EK GGR +   ++  FR+V+D C   DLGF+G +FTWCN  P    +WER+DR V  + W D FP  +V HL    SDH+PIL+ L    + I  +  + 
Subjt:  EKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPIQNVGCRI

Query:  KRFE----EPDFCPKILRKRWQLTG----------------------SRVLLWDPRRRLHR----LRKGSAE--------------ERLKSLLVEEEMYW
         RFE    E + C   +   W                          SRV++ +  R L      LRK   E                +  LL +EE  W
Subjt:  KRFE----EPDFCPKILRKRWQLTG----------------------SRVLLWDPRRRLHR----LRKGSAE--------------ERLKSLLVEEEMYW

Query:  KQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVV
        KQRSRE WL  GD+NTR+FHS A+ R ++N++  LE+  G    D + +  ++  ++++++ +S P++  I++AL   P  VT+EMN +L+ PF + EV 
Subjt:  KQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVV

Query:  LALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKAKERVAVAVRDILNC
        +AL Q+   K+PGPDG+   F+Q  W V+G +V    L  LN    P  +N T I LI K +S   VSEY+PI+L                 A++DIL+ 
Subjt:  LALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKAKERVAVAVRDILNC

Query:  YERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSM
        YE+ SGQ +N +K+ I FS   +   +AQ+   L V     + +Y GLP+ + RN+ +SL+ IK+RVW ++QGWK KL S  GREVLLK++VQAIP ++M
Subjt:  YERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSM

Query:  NCF-LPKKLILDISRAMTRFWWG----------------------GG----------------------------------RKYFPFTDFLRALVGSRSS
         CF LP  L  +I   + +F+WG                      GG                                   KYFP      A   +  S
Subjt:  NCF-LPKKLILDISRAMTRFWWG----------------------GG----------------------------------RKYFPFTDFLRALVGSRSS

Query:  FIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSP-VTLAPETRVSDLMTVSGK-WNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVW
        + W+S++  R+++  G+RWRIG+G+S+ +YG NW+P  G  R+ SP V      +V+ L+  + + WN  ++ QHF   EV+ I  IP S  + ED+++W
Subjt:  FIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSP-VTLAPETRVSDLMTVSGK-WNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVW

Query:  HYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIK
          +  G + VKS Y++      +   SSS       +WK  WK+ +PNKIK FLWR+C +  PT +N     +     C  C +  E++ H  W C+ +K
Subjt:  HYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIK

Query:  RV--------------LMD-----SKFGILVNNVQV-------VWNCHNQRKF--KGIGSVEGLVEWAGSYISSFQQATLSSGLCGHRGITREEVRWSPP
         +              L D     +  G   N +++       +W   N+ +   KG+   + L E A SY+  FQ     + +       +E  +W PP
Subjt:  RV--------------LMD-----SKFGILVNNVQV-------VWNCHNQRKF--KGIGSVEGLVEWAGSYISSFQQATLSSGLCGHRGITREEVRWSPP

Query:  IFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVM
          G YK N D A   E  +AG+GVIVRDS G V+
Subjt:  IFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVM

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]2.1e-14232.26Show/hide
Query:  EKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPI---QNVG
        EK GG+ ++   +  FRE V+ C L+DLG  G  FTW NRR G  ++ ER+DR +G+  W        V +L+   SDH P++L +    +     +N  
Subjt:  EKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPI---QNVG

Query:  CRI---KRFEEPDFCPKILRKRWQLTGS-------------------RVLLWD------PRRRLHRLRK------------------GSAEERLKSLLVE
         R+     +   + C  I+++ W   GS                   ++ +W        + +L +L+K                   S EE+++ +L++
Subjt:  CRI---KRFEEPDFCPKILRKRWQLTGS-------------------RVLLWD------PRRRLHRLRK------------------GSAEERLKSLLVE

Query:  EEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQ
        EE+YWKQRSR  WLK GD+NT++FHS ASSR++KN + G+ D   VW  D + V +    YF  ++TTS PS+++I+ AL  + P VT EMN++L  PF 
Subjt:  EEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQ

Query:  QEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKAKERVAVA--
        +EEV  AL Q+   KAPGPDGL   F+Q  W  V   V++  L+ILN R + S     ++V   ++R    +   +   + ++ +   S    R AV   
Subjt:  QEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKAKERVAVA--

Query:  --VRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSI
          ++ +  CY + SGQ  N +KS + FS  T       + +I  ++V + H +Y GLPS + R   S  + +K RV  +I  W+ K F+ GG+EVL+K++
Subjt:  --VRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSI

Query:  VQAIPCYSMNCF-LPKKLILDISRAMTRFWWG----------------------GG----------------------------------RKYFPFTDFL
         QAIP Y+M+ F +P  L  DI +AM RFWWG                      GG                                   +YF  T F+
Subjt:  VQAIPCYSMNCF-LPKKLILDISRAMTRFWWG----------------------GG----------------------------------RKYFPFTDFL

Query:  RALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGKWNEQIIWQHFSPHEVSLILFI--PSRP
         A +GS+ SF+WRS++WGR++L KG RWRIGNG++V VYG NWIP     +  S  ++  +T V++L+    +W E +I QHF P +   I+ I  P RP
Subjt:  RALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGKWNEQIIWQHFSPHEVSLILFI--PSRP

Query:  DTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHV
          ED+++WHYD+ G +SVKS Y++       + PS S+ D  +  W+  WK+ IP K+KIFLWR   D  PT EN     +    +C  C  H E+  H 
Subjt:  DTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHV

Query:  FWHCKFIKRVLMDSKFGILVNNVQ----------------------------VVWNCHNQRKFKG--------IGSVEGLVEWAGSYISSFQQATLSSGL
           C   +++   S     +  V                              +W   N+  F+G        + + E +VE       SF++      +
Subjt:  FWHCKFIKRVLMDSKFGILVNNVQ----------------------------VVWNCHNQRKFKG--------IGSVEGLVEWAGSYISSFQQATLSSGL

Query:  CGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDSSRV
           +G    + +WSPP  GW KVNVDAA   E   AGLGV+VRDSDG    +A    +   S  MAE  A   G+++A +  +   + E+DS  V
Subjt:  CGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDSSRV

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]1.9e-14328.56Show/hide
Query:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMW-------------------IREKEGGRS
        M IL WN+RGLG+ R +  L K++Q + P +VFL ETK++T + + +  KL Y NCF V S G+ GGL L+W                   I  + G  +
Subjt:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMW-------------------IREKEGGRS

Query:  KSTGELFGFREVVD-------VCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSV---GTRPIQNVG
        + TG ++G  +          + +L D+ + G  +TW N R G   V ER+DR V N AWRD+F      +++   SDH P+++ + V   G    Q   
Subjt:  KSTGELFGFREVVD-------VCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSV---GTRPIQNVG

Query:  CRI---KRFEEPDFCPKILRKRWQLTG-------------------SRVLLWDP------RRRLHRLR--------------KGS----AEERLKSLLVE
          I     +   D C +I+ K W L G                   +R++LW        +++L +L               KG+     E +++ +L +
Subjt:  CRI---KRFEEPDFCPKILRKRWQLTG-------------------SRVLLWDP------RRRLHRLR--------------KGS----AEERLKSLLVE

Query:  EEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQ
        +E+YWKQRSR  WLK GD+NT++FH  ASSR+KKN + G+E++ G W ++++ V     +YF +++TTS+P++++I  AL  +   V+ EMN  L  PF 
Subjt:  EEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQ

Query:  QEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKA---------
         EEVV AL Q+   KAPGPDGL   F+Q  W  V   V++  L+ILN +   +P N T IVLISKK   + V++++PISLCNV Y++V+KA         
Subjt:  QEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKA---------

Query:  --------------------------------------------KERVAVA-------------------------------------------------
                                                    K  V+ A                                                 
Subjt:  --------------------------------------------KERVAVA-------------------------------------------------

Query:  --------------------------VRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRV
                                  ++ I +CY   SGQ  N +KS +  + N      +++ +I  + + + H +Y GLPS + R ++S  + IK ++
Subjt:  --------------------------VRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRV

Query:  WQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWG----------------------GG--------------------
          ++  W+ K FS GG+EVL+K+  QAIP Y+M+ F +P  +  DI R +  FWWG                      GG                    
Subjt:  WQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWG----------------------GG--------------------

Query:  --------------RKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGKWNE
                       +YF   DF+ A +GS  SFIWRS+LWGR+++ KG +WRIGNG+S+ ++  NW+P     +  S  +L  +  V++L+     WNE
Subjt:  --------------RKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGKWNE

Query:  QIIWQHFSPHEVSLI--LFIPSRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENF
         +I +HF+  +   I  + +P RP  +D+++WHYD+ G +SVKS Y++  +      P+SS+  +    W   W + +P KI+IF WR   +  P+ EN 
Subjt:  QIIWQHFSPHEVSLI--LFIPSRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENF

Query:  VFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKF----------------------------GILVNNVQVVWNCHNQRKFKGIGSVEGL-VEW
            +    +C  C +  E+  H    CK ++++   ++F                             +LV  + ++WN  N   FKG+  +  + V  
Subjt:  VFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKF----------------------------GILVNNVQVVWNCHNQRKFKGIGSVEGL-VEW

Query:  AGSYISSFQQATLSSGLCGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWA
        A + + +F++  L +        + +   W PP  G++KVNVDAA   E   AGLG ++RD  G V+ +A  V +      +AE  A
Subjt:  AGSYISSFQQATLSSGLCGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWA

XP_030926688.1 uncharacterized protein LOC115953246 [Quercus lobata]3.2e-13529.83Show/hide
Query:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMWIREKEGGRSKSTGELFGFREVVDVCELM
        MS L WN RGLG+ RA + L  +VQ   P++VFL ET     +   +K +L +   F V SR R GGL L+W       RS+ +  +  F        + 
Subjt:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMWIREKEGGRSKSTGELFGFREVVDVCELM

Query:  DLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSV-GTRPIQNVGCRIKRFEEPDFCPKILRKRWQLTGSRVL
         +   G  FTW  +R GE  +WER+D  + N  W   FP   V+HL+   SDHRP+LL+L   G R  +    +  RFE   +   +     ++  ++ L
Subjt:  DLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSV-GTRPIQNVGCRIKRFEEPDFCPKILRKRWQLTGSRVL

Query:  LWDPRRRLHRLRKGSAEE------RLKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTT
        LW     +  L  G+ +E       L  LL  EE  W+QR+   W++ GDRNT++FH  A+ R+++N ++GL+D  GVWQ D + V R++  Y+  ++T+
Subjt:  LWDPRRRLHRLRKGSAEE------RLKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTT

Query:  SRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRS
        S P  + +D+ L  V   V++EMN KL+K +  EEV +A+K +   KAPGPDG+   F+Q  WP +G D+    L+ LN+      +N T I LI K ++
Subjt:  SRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRS

Query:  HKSVSEYQPISLCNVSYKLVSKA--------------------------KERVAVA--------------------------------------------
           V+EY+PISLCNV YK++SK                            + V VA                                            
Subjt:  HKSVSEYQPISLCNVSYKLVSKA--------------------------KERVAVA--------------------------------------------

Query:  -----------------------------------------------------------------------------VRDILNCYERVSGQTVNLDKSII
                                                                                     ++++L  YE  SGQ +N DK+ +
Subjt:  -----------------------------------------------------------------------------VRDILNCYERVSGQTVNLDKSII

Query:  SFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRA
         FS NTD   RA + Q LGV     + +Y GLPSF+ RNK +  + IK+R+W+++QGWK KL S   +E+++K+++Q+IP YSMN F LP+ L  DI   
Subjt:  SFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRA

Query:  MTRFWWGGGRKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGK-WNEQIIW
        + +FWWG G   F       A + SRSS+ WRS+L  R+++ KG  WRIG+G S+ ++   W+P+    +V SP +    T VSDL   + K W+ +++ 
Subjt:  MTRFWWGGGRKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGK-WNEQIIW

Query:  QHFSPHEVSLILFIPSRPDTE-DKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGM
          F P E   I  I   P  + D  VW    +G +SV+S Y++  S  +  + +SSS D+    WKG W++  PNKI+ F+WR   D  PT EN     +
Subjt:  QHFSPHEVSLILFIPSRPDTE-DKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGM

Query:  DVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFGILVNNVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSGLCGHRGITREE-------
         +   C LC  + E+++H  W C F K V             +V +  + ++ F+    +   V   GS       +T++  L   R   R +       
Subjt:  DVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFGILVNNVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSGLCGHRGITREE-------

Query:  ----------------------------VRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMD
                                    VRW PP  G++K N D         AG+GV +RD  G+++ +     +   S ++AE  A  +    A E+ 
Subjt:  ----------------------------VRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMD

Query:  LFPLVLETDSSRV
        LF +  E DS R+
Subjt:  LFPLVLETDSSRV

TrEMBL top hitse value%identityAlignment
A0A2N9G219 RNase H domain-containing protein4.7e-14031.73Show/hide
Query:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTK---------LGYPNCFCVDS-----RGRNGGLVLMWI-----------RE
        M  + WN RGLG+    + L KLV+   P ++FL ET +   R + L+            G P     D      R  +G   L W             E
Subjt:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTK---------LGYPNCFCVDS-----RGRNGGLVLMWI-----------RE

Query:  KEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPIQNVGCRIK
        K G R +S  ++  FR V+D C  +DLGF G  FTWCN R G    W R+DR +    W   F    V HL  + SDH+PI L+    T+P++    R K
Subjt:  KEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPIQNVGCRIK

Query:  RFEEPDF------CPKILRKRW----------------QLTGSRVLLWDP----------RRRLHRLRKG--------------SAEERLKSLLVEEEMY
         F   D       C   + K W                Q  G+ +  W            R +  +LR+               S  + +  LL++EE  
Subjt:  RFEEPDF------CPKILRKRW----------------QLTGSRVLLWDP----------RRRLHRLRKG--------------SAEERLKSLLVEEEMY

Query:  WKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEV
        WKQRSRE WLK GD NT++FH+ AS R+++N +  L  ++G    D + +      Y++ ++T      E+ +  L  + P VT EMN  L  PF +EEV
Subjt:  WKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEV

Query:  VLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLV----------------
        ++A+KQ+   KAPGPDG+   FYQ  W VVG D+    L  L +      LN T + LI K +S ++V+EY+PISLCNV YKL+                
Subjt:  VLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLV----------------

Query:  --------------------------------------------SKAKERV--AVAVRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQ
                                                    SKA +RV      ++IL  YE+ SGQ +N  K+ + FS NT    +  + +ILGV 
Subjt:  --------------------------------------------SKAKERV--AVAVRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQ

Query:  VTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWG--GGRKYFPFTDFL
            + +Y GLPS + + K +  S IK+RVW +++GWK KL S  GRE+L+K++VQAIP Y+MNCF LP  L  +I   + RFWWG    R+   +  + 
Subjt:  VTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWG--GGRKYFPFTDFL

Query:  RALVG------SRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAP-ETRVSDLMTVS-GKWNEQIIWQHFSPHEVSLIL
        +          +R SF WRS+L  + L+  GL WR+G+G  +P+ G+NW+ ++G  R+ SP+T  P + +V +L+  S   WN   I   F P++V  IL
Subjt:  RALVG------SRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAP-ETRVSDLMTVS-GKWNEQIIWQHFSPHEVSLIL

Query:  FIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRH
         IP S    ED++ W   Q+G +SV+S Y+L      A +P SS        WK  W+  +P KI+ FLWR C D  PT        +    +C  C   
Subjt:  FIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRH

Query:  GESSLHVFWHCKFIKRV-LMDSKFGILVN---------------------------NVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSGLC
         E SLH  W C  + RV  +D  F  L N                              ++W+  NQ + +       L     S I     A LS  L 
Subjt:  GESSLHVFWHCKFIKRV-LMDSKFGILVN---------------------------NVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSGLC

Query:  GHRGITRE---------EVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDS
            +T E         +VRW PP    +KVN D A  RE    GLGV++RD  G V+ + S       + EM E  AA + +R A+E+ +     E D+
Subjt:  GHRGITRE---------EVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDS

Query:  SRV
          V
Subjt:  SRV

A0A2N9I921 Reverse transcriptase domain-containing protein3.8e-13430.42Show/hide
Query:  IREKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLN-LSVGTRPIQNVG
        + EK+G  SK   ++  FRE +D C  +DLG+ G+ FTWCN R     VWER+DR V + AW   FP   V HL+++ SDH+P+ L+      RP+    
Subjt:  IREKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLN-LSVGTRPIQNVG

Query:  CRIKRFEE----PDFCPKILRKRWQLTGSRVLLWDPRRRLHRLRK----------GSAEERLKS------------------------------LLVEEE
         +  RFEE       C + +   WQ   +   ++    +L+  R           GS  ++L+                               LL +EE
Subjt:  CRIKRFEE----PDFCPKILRKRWQLTGSRVLLWDPRRRLHRLRK----------GSAEERLKS------------------------------LLVEEE

Query:  MYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQE
          W+QRSR  WLK GDRNT +FH+ A+ R+++N + GL DSDG WQ D D V  ++  YF++I+ +S PS   ID  L  +P  ++D +N  L +P+   
Subjt:  MYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQE

Query:  EVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLV--------------
        EV  AL+Q+    APGPDGL   F+Q+ W ++GADVV   L+ LN+      +N T I LI K ++ + VSEY+PISLCNV YK++              
Subjt:  EVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLV--------------

Query:  ---------------------SKAKERVAVA---------------VRDILNC-----------------------------------------------
                             SKA +RV  A               +  IL C                                               
Subjt:  ---------------------SKAKERVAVA---------------VRDILNC-----------------------------------------------

Query:  ---------YERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSI
                 YE  SGQ +N +K+ + FS NT    +  +   L V     + +Y GLPSF+ R K    S IK+RVW +++GWK  L S  GRE+L+K++
Subjt:  ---------YERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSI

Query:  VQAIPCYSMNCF-LPKKLILDISRAMTRFWWG-------------GGRKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNW
        VQAIP YSM+CF LP  L  D+ + M  FWWG                K+FP    L      R S+ W S+     ++KKG+ WRIG+G +  ++  NW
Subjt:  VQAIPCYSMNCF-LPKKLILDISRAMTRFWWG-------------GGRKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNW

Query:  IPDDGGLRVRSPVTLAP-ETRVSDLMTVSGK-WNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSI
        +PD G  ++ SP + +  + +VS L+    + WN  +I Q F P++   I  IP +     D ++W + + G++SV+S Y L +       PSSS+ ++ 
Subjt:  IPDDGGLRVRSPVTLAP-ETRVSDLMTVSGK-WNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSI

Query:  MGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHC------------------------KFIKRVLMDSKFGI
           W   W + +P K+K FLWR CL+  PT  N     +     C  C    E + H  WHC                            ++L++    +
Subjt:  MGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHC------------------------KFIKRVLMDSKFGI

Query:  LVNNVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSGLCGHRGITRE-EVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSA
              + W+  ++R    +     L E   S         L+         TR  E+ W+PP  G YK N D A  ++  +AGLGVI+RDS G V+ S 
Subjt:  LVNNVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSGLCGHRGITRE-EVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSA

Query:  SLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDSSRV
        +       S  M E  AA + V  A+E+  + +  E DS ++
Subjt:  SLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDSSRV

A0A2N9IWN7 Uncharacterized protein1.7e-13429.03Show/hide
Query:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMW----------------------------
        MS L WN RGLG     + L +LV++  P ++F++E+ +   R +VL+ +  + +   V  R + GGL + W                            
Subjt:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMW----------------------------

Query:  -------------------------------------------IREKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCV
                                                     EK GG  +S  ++  FR+ +D C   DLG+ GS FTWCN R G   VWER+DR +
Subjt:  -------------------------------------------IREKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCV

Query:  GNVAWRDMFPGHEVRHLNFSRSDHRPILLNLS--VGTRPIQNVGCRIKRFEE----PDFCPKILRKRWQLTGSRVLLWDPRRRLHRLR------------
           +W  +FP  +V HL+   SDH PI +  S  + +RP  N   RI RF+E       C + +   WQ       ++    +L   R            
Subjt:  GNVAWRDMFPGHEVRHLNFSRSDHRPILLNLS--VGTRPIQNVGCRIKRFEE----PDFCPKILRKRWQLTGSRVLLWDPRRRLHRLR------------

Query:  ---------------------KGSAEER-------LKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLI
                             KG   E+       + +LL  EE  W+QRSR+ WL+WGD++T +FH  A+ RR++N +  ++D  G      + + R  
Subjt:  ---------------------KGSAEER-------LKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLI

Query:  ERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNET
        E +F  ++++S P+  E D AL  V   VTDEMN++L++ F  EEV  ALKQ+  + APGPDG+   FYQ  W +VG+DV    L+ LN+      +N T
Subjt:  ERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNET

Query:  MIVLISKKRSHKSVSEYQPISLCNVSYKL------------------------------------------------------------VSKAKERV---
         I LI K ++ + VS+++PISLCNV YK+                                                            +SKA +RV   
Subjt:  MIVLISKKRSHKSVSEYQPISLCNVSYKL------------------------------------------------------------VSKAKERV---

Query:  --------------------------------------------------AVAVRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTA
                                                            A+  IL+ YE  SGQ VN DK+ + FS +T       +  +LGV    
Subjt:  --------------------------------------------------AVAVRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTA

Query:  CHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWGGG---------------
         + +Y GLPSF+ R++  S   IK+R+W ++ GWK KL S  G+E+L+K++VQAIP YSM+CF LP  L  +I   + +FWW  G               
Subjt:  CHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWGGG---------------

Query:  --------------RKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAP-ETRVSDLMTV-SGKW
                       KYFP  + L   V    S+ W+S+L  R +LK G R  IGNG SV ++G NW+P     +V SP    P + +VS L+ +   KW
Subjt:  --------------RKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAP-ETRVSDLMTV-SGKW

Query:  NEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVEN
         E++I   F P E SLIL IP S    +DK++W +  SG+FS++S Y    +      PSSS++ +  G W   W + IP KI+ FLWR   D  PT  N
Subjt:  NEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVEN

Query:  FVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFGILVNNVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSG------------
             + V  +C  C    E   HV W+C    +                +WN  N   F      + L++   S+   F     SS             
Subjt:  FVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFGILVNNVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSG------------

Query:  -LCGHRGITR-----EEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDSS
         L  HR   R     +E++WSPP  G +K+N + AF      AG+GV+V D  G      S       +P  AE  AA   V LA ++ L  +  E D+S
Subjt:  -LCGHRGITR-----EEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDSS

Query:  RV
         +
Subjt:  RV

A0A6J1DAR4 uncharacterized protein LOC1110189543.2e-14936.94Show/hide
Query:  LKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNS
        +  LL EEE++W+QRSR+ W K GDRNT+WFH+ AS RR+ NE++GL D  G W+++   V+ +IE YF  ++++SRPS   I+Q    +   V+   N 
Subjt:  LKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNS

Query:  KLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMI-----VLISKKRSHKSVSEY--QPISLCNVSYK
         LLKPF +EE+++AL QIH +KA GPD     FY++ W +VG DV  C LNILNN +    L+E ++      +IS+ +S    + Y      L   S K
Subjt:  KLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMI-----VLISKKRSHKSVSEY--QPISLCNVSYK

Query:  L-VSKAKERV--------------------------AVAVRDILNCYERVSGQTVNLDKS---IISF--------------------------SPNTDVR
        L +SKA +RV                           + +  +L  Y    G  + L  S   I+S+                           P    R
Subjt:  L-VSKAKERV--------------------------AVAVRDILNCYERVSGQTVNLDKS---IISF--------------------------SPNTDVR

Query:  DRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWGGG
          + +  IL V +  C  QY GLP+FMPRN+    + IKDRVW+ +QGWK KLFS+GG+EVL+K++ QAIPCY+M+CF LPK+LI +      RFWWG  
Subjt:  DRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWGGG

Query:  R--------------------------------------------------------KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGE
        +                                                        +YF    F+ A +    S+IWRS+LWGR+LLKKGLRWRIGNG+
Subjt:  R--------------------------------------------------------KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGE

Query:  SVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMT-VSGKWNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLG-QSAWIAQL
        SV +YG NW+P+   L++ S   L   +RVS L+    G W   ++   F+P E   IL IP  R   ED+++W+Y+++G++SV+S Y++   +    Q 
Subjt:  SVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMT-VSGKWNEQIIWQHFSPHEVSLILFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLG-QSAWIAQL

Query:  PSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFG------ILVNNVQ---
        PSSSS + +  WW G WKM IPNKIK+FLWRLCLDR PT  N    G+++ N C  CGR+GE S+H+FW CKF + + ++SKFG      IL  + +   
Subjt:  PSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFG------ILVNNVQ---

Query:  ---------VVWNCHNQRKFKGIGSVE--------GLVEWAGSYISSFQQATLSSGLCGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVR
                 V+W   NQR  +               LVEWA  Y   F++A  S+ + G R     E+ W PP  G YK+N DA+F      AGLG+I+ 
Subjt:  ---------VVWNCHNQRKFKGIGSVE--------GLVEWAGSYISSFQQATLSSGLCGHRGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVR

Query:  DSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFP
        +  G+VM +A+   ++I+S +MAE  AAV+G++LA E+ + P
Subjt:  DSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFP

A0A7N2R0C3 Reverse transcriptase domain-containing protein6.5e-13427.79Show/hide
Query:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMWIR--------------------------
        M+ L WN RG+GSA A R L   V+   P +VFL ETK    R   L+ KLG      V S GR+GGL ++W                            
Subjt:  MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMWIR--------------------------

Query:  ----------------------------------------------EKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRC
                                                      EK G   +   ++  FRE +  C L+DLGF G RFTWCN R GE     R+DR 
Subjt:  ----------------------------------------------EKEGGRSKSTGELFGFREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRC

Query:  VGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPIQNVGCRIKRFEE----PDFCPKILRKRWQLTGSRVLL---------------WDPR-----
        V N  W ++FP  +V H + + SDH   LL+LS+  R  + V  R   FEE     + C +++ + W   G    L               W+ R     
Subjt:  VGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPIQNVGCRIKRFEE----PDFCPKILRKRWQLTGSRVLL---------------WDPR-----

Query:  --------RRLHRLRK-----GSAEE------RLKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIER
                 RL +L +      SAEE       +  +++ EE+ W QRSR  W+K+GDRNTR+FH+ A++RR+KN++ G+ DS+G W+++++ V  +I  
Subjt:  --------RRLHRLRK-----GSAEE------RLKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIER

Query:  YFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMI
        YF+ IY+++ P+  E    L  V   VT++MN  LL+ F++EEV  AL Q+H  K+PGPDG+   F+Q  W VVG  VV   ++ L   V P  +NET I
Subjt:  YFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMI

Query:  VLISKKRSHKSVSEYQPISLCNVSYKLV------------------------------------------------------------SKAKERV-----
         LI K +  + ++EY+PISLCNV YKLV                                                            SKA +RV     
Subjt:  VLISKKRSHKSVSEYQPISLCNVSYKLV------------------------------------------------------------SKAKERV-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------AVAVRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGR
               + V  IL  YER SGQ +N +K+ + FS NT V  +  V ++ G ++   H +Y GLP  + R K  + + IKD+V ++I  WKG+L S  GR
Subjt:  ------AVAVRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGR

Query:  EVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWG----------------------GG----------------------------------RKY
        E+L+K++ QA P Y+MNCF LP  L  +++  +  FWWG                      GG                                   +Y
Subjt:  EVLLKSIVQAIPCYSMNCF-LPKKLILDISRAMTRFWWG----------------------GG----------------------------------RKY

Query:  FPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRV--SDLMTVSGKWNEQIIWQHFSPHEVSLI
        FP ++F+ A +G+  S+ WRSLL  RE++++G RW IGNG+ V ++   W+P     +V SP     E  +  S +    G W++ ++ + F PHE   I
Subjt:  FPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRV--SDLMTVSGKWNEQIIWQHFSPHEVSLI

Query:  LFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMG------WWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNV
        L IP S    ED + W +  +G F+V S Y++  S W+ +  S      +        +WK  W++  P+K+K FLWR C +  PT        + +   
Subjt:  LFIP-SRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMG------WWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNV

Query:  CGLCGRHGESSLHVFWHCKFIKRVLMDSKFGI----------------------------LVNNVQVVWNCHNQRKFKGIG-SVEGLVEWAGSYISSFQQ
        CG+CGR  ES+ HV W C+    V  +SK  +                             V     +W   N  KF+G G +   +V+ A   +  F+ 
Subjt:  CGLCGRHGESSLHVFWHCKFIKRVLMDSKFGI----------------------------LVNNVQVVWNCHNQRKFKGIG-SVEGLVEWAGSYISSFQQ

Query:  ATLSSGLCGHRGITREEVR---WSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLET
          +   L         EVR   W PP  GWYK NVD A  +E    G+GV++R+  G++M + S            E  A  +G+RLA ++ L  ++LE 
Subjt:  ATLSSGLCGHRGITREEVR---WSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLET

Query:  DSSRVAGVFL
        D+  V    L
Subjt:  DSSRVAGVFL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657509.1e-1624.06Show/hide
Query:  LPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMN-CFLPKKLILDISRAMTRFWWG----------------------GG
        +P    R    +   I +RV  ++ GW+ K  S  GR  L K+++ ++P +SM+   LP+ ++  + +    F WG                      GG
Subjt:  LPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAIPCYSMN-CFLPKKLILDISRAMTRFWWG----------------------GG

Query:  RKYFPFTDFLRALVGSRSSFIWRSL-----LWGRELLKK-------GLRWRIGNG-------------ESVPVYGTNWIPDDG---------GLRVRSPV
                  RAL+   S   WR L     LW   L KK         RW I  G               V  +G  WIP DG          +  +  +
Subjt:  RKYFPFTDFLRALVGSRSSFIWRSL-----LWGRELLKK-------GLRWRIGNG-------------ESVPVYGTNWIPDDG---------GLRVRSPV

Query:  TLAPETRVSDLMTVSGK--WNEQIIWQH-----FSPHEVSLIL---FIPSRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKG
         L    R +D  TV  K  W     W       ++ +   L L    +       D++ W + Q G FSV+S Y +     + ++P      ++  ++  
Subjt:  TLAPETRVSDLMTVSGK--WNEQIIWQH-----FSPHEVSLIL---FIPSRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKG

Query:  CWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFGILVNNVQVVWNCHNQRKFKGIGSVEGLVEW
         WK+ +P ++K FLW +      T E      +   NVC +C    ES LHV   C         ++ GI V  V        QR+ +G  S + L EW
Subjt:  CWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFGILVNNVQVVWNCHNQRKFKGIGSVEGLVEW

P14381 Transposon TX1 uncharacterized 149 kDa protein5.2e-1124.46Show/hide
Query:  RSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLA
        RSR   L   DR +R+F++    +  + ++  L   DG   +D +++      +++++++   P   +  + L    P V++    +L  P   +E+  A
Subjt:  RSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLA

Query:  LKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKA
        L+ +  NK+PG DGL   F+Q  W  +G D              P      ++ L+ KK   + +  ++P+SL +  YK+V+KA
Subjt:  LKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKA

P93295 Uncharacterized mitochondrial protein AtMg003105.3e-0828.08Show/hide
Query:  AIPCYSMNCF-LPKKLILDISRAMTRFWWG-----------------------GG----------------------------------RKYFPFTDFLR
        A+P Y+M+CF L K L   ++ AMT FWW                        GG                                   +YFP +  + 
Subjt:  AIPCYSMNCF-LPKKLILDISRAMTRFWWG-----------------------GG----------------------------------RKYFPFTDFLR

Query:  ALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDD
          VG+R S+ WRS++ GRELL +GL   IG+G    V+   WI D+
Subjt:  ALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDD

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.3e-1922.58Show/hide
Query:  FREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLS---------------VGTRPIQNVGCRI
        F+  +   +L+D+   G  +TW N +    I+  ++DR + N  W   FP           SDH P ++ L                + T P   V   +
Subjt:  FREVVDVCELMDLGFSGSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLS---------------VGTRPIQNVGCRI

Query:  ----------------KRFEEPDFCPKILRKR-----WQLTGSRV---------LLWDPRRRLHRLRKGSAEERLKSLLVEEEMYWKQRSREGWLKWGDR
                        +  +    C K+L ++        T   +         LL +P   L R+ +  A ++        E +++Q+SR  WL+ GD 
Subjt:  ----------------KRFEEPDFCPKILRKR-----WQLTGSRV---------LLWDPRRRLHRLRKGSAEERLKSLLVEEEMYWKQRSREGWLKWGDR

Query:  NTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRP--SKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAP
        NTR+FH    + + KN ++ L   D V  ++   V  +I  Y+ H+  +     + + + +     P    D + S+L      +E+  A+  +  NKAP
Subjt:  NTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRP--SKEEIDQALVHVPPSVTDEMNSKLLKPFQQEEVVLALKQIHSNKAP

Query:  GPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVS
        GPD     F+   W VV    +                N T I LI K      +S ++P+S C V YK+++
Subjt:  GPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVS

AT3G09510.1 Ribonuclease H-like superfamily protein8.7e-2225.4Show/hide
Query:  KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGK---WNEQIIWQHFSPHEV
        +YF     L A V  + S+ W SLL G  LLKKG R  IG+G+++ + G + I D    R  +      E  +++L    G    W++  I Q     + 
Subjt:  KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSDLMTVSGK---WNEQIIWQHFSPHEV

Query:  SLILFI-PSRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGL
          I  I  ++    DK++W+Y+ +G ++V+S Y L        +P+ +     +      W + I  K+K FLWR       T E     GM +   C  
Subjt:  SLILFI-PSRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMDVLNVCGL

Query:  CGRHGESSLHVFWHCKF------------IKRVLMDSKFGILVNN---------------------VQVVWNCHNQRKF-KGIGSVEGLVEWAGSYISSF
        C R  ES  H  + C F            I+  LM + F   ++N                     +  +W   N   F K   S    V  A +    +
Subjt:  CGRHGESSLHVFWHCKF------------IKRVLMDSKFGILVNN---------------------VQVVWNCHNQRKF-KGIGSVEGLVEWAGSYISSF

Query:  QQATLSSGLCGH--RGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAE
          AT S        R I   ++ W  P   + K N DA F  +  +A  G I+R+  G  +   S+   H  +P  AE
Subjt:  QQATLSSGLCGH--RGITREEVRWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAE

AT4G29090.1 Ribonuclease H-like superfamily protein2.0e-2625.18Show/hide
Query:  KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWI---PDDGGLRV-RSP----VTLAPETRVSDLMTVSGK-WNEQIIWQH
        +YF  +D L A +GSR SF+W+S+   +E+L++G R  +GNGE + ++   W+   P    LR+ R P     +++   +VSDL+  SG+ W + +I   
Subjt:  KYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWI---PDDGGLRV-RSP----VTLAPETRVSDLMTVSGK-WNEQIIWQH

Query:  FSPHEVSLILFI-PSRPDTEDKMVWHYDQSGLFSVKSEY-RLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMD
        F   E  LI  + P      D   W Y  SG ++VKS Y  L Q       P   S+ S+   ++  WK     KI+ FLW+   +  P      +  + 
Subjt:  FSPHEVSLILFI-PSRPDTEDKMVWHYDQSGLFSVKSEY-RLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTVENFVFLGMD

Query:  VLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKF----------GILVNNVQV-----------------------VWNCHNQRKFKGIG-SVEGLVEWA
          + C  C    E+  H+ + C F +     S             I VN   V                       +W   N+  F+G   + + ++  A
Subjt:  VLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKF----------GILVNNVQV-----------------------VWNCHNQRKFKGIG-SVEGLVEWA

Query:  GSYISSFQQATLSSGLCGHRGITREEV-RWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAE----GWAAVKGVRLAL
           +  ++  T +        + R    RW PP   W K N DA + R+  + G+G ++R+  G V    +     ++S   AE     WA +   R   
Subjt:  GSYISSFQQATLSSGLCGHRGITREEV-RWSPPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAE----GWAAVKGVRLAL

Query:  EMDLFPLVLETDS
           +F    E+DS
Subjt:  EMDLFPLVLETDS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.8e-0928.08Show/hide
Query:  AIPCYSMNCF-LPKKLILDISRAMTRFWWG-----------------------GG----------------------------------RKYFPFTDFLR
        A+P Y+M+CF L K L   ++ AMT FWW                        GG                                   +YFP +  + 
Subjt:  AIPCYSMNCF-LPKKLILDISRAMTRFWWG-----------------------GG----------------------------------RKYFPFTDFLR

Query:  ALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDD
          VG+R S+ WRS++ GRELL +GL   IG+G    V+   WI D+
Subjt:  ALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATCCTATTCTGGAATGTGCGGGGGTTGGGGTCGGCCCGCGCATTCCGTAGGTTGCACAAGCTGGTGCAACAATACCGACCTCAGATGGTGTTCTTGTTG
GAGACGAAGGTGCAGACGTTGAGGTTCGATGTGCTGAAAACTAAGCTGGGGTATCCAAACTGCTTCTGCGTGGACAGTAGAGGGAGGAATGGAGGTTTGGTGCTG
ATGTGGATCAGAGAGAAGGAGGGTGGAAGGTCGAAGTCGACCGGGGAATTATTTGGATTTCGGGAGGTAGTGGATGTGTGCGAACTGATGGATTTGGGTTTCAGT
GGGTCGAGATTTACCTGGTGCAACAGGAGGCCAGGGGAGGATATTGTCTGGGAAAGAATTGATAGGTGTGTCGGAAATGTTGCATGGCGGGATATGTTTCCTGGA
CACGAGGTGAGACATTTGAACTTTAGTCGCTCGGACCATAGGCCCATCCTGTTGAACCTGTCGGTGGGGACGCGACCAATCCAGAATGTTGGTTGTAGAATCAAA
AGGTTTGAGGAGCCTGACTTTTGTCCCAAGATTTTAAGGAAGCGGTGGCAGCTAACTGGGAGCAGGGTGCTTCTGTGGGATCCCCGGCGACGTTTGCATCGGCTA
CGGAAAGGTAGTGCAGAGGAACGACTCAAGTCTTTACTGGTGGAGGAAGAGATGTACTGGAAACAGAGGTCCAGGGAAGGTTGGCTAAAATGGGGGGATAGGAAT
ACCCGCTGGTTCCATTCCCATGCCTCGAGTAGGAGGAAGAAGAATGAGGTGAGGGGCTTGGAGGATAGTGATGGGGTTTGGCAGCAGGATTCAGATAGTGTCCTG
AGGCTAATAGAGAGGTATTTTGAGCACATTTATACGACGTCGCGCCCTTCGAAGGAGGAAATAGATCAAGCTTTGGTGCATGTCCCACCCTCAGTTACAGACGAG
ATGAATAGTAAACTCCTTAAACCGTTCCAGCAAGAAGAAGTGGTACTTGCCCTAAAGCAGATCCACTCGAATAAAGCCCCAGGCCCAGACGGGTTGTTAGGGACT
TTTTATCAGCATTTGTGGCCTGTAGTTGGGGCTGATGTGGTCAACTGCTATTTGAATATCCTGAATAACAGGGTGTCCCCATCTCCTTTGAATGAAACAATGATC
GTTCTGATATCGAAGAAGAGAAGCCACAAAAGTGTGTCTGAGTATCAGCCTATTTCGCTCTGTAATGTGTCATACAAGTTGGTTTCGAAGGCCAAAGAGAGGGTT
GCGGTGGCTGTTCGGGACATTCTTAACTGCTATGAACGAGTGTCGGGGCAGACTGTCAACCTGGATAAATCCATTATCTCTTTCAGCCCGAACACGGATGTAAGG
GATCGGGCTCAGGTTGGGCAGATCCTGGGGGTTCAGGTTACGGCTTGTCATCGCCAATACTTTGGACTTCCGTCTTTTATGCCTCGGAACAAAACGAGCTCGTTG
AGCTCCATAAAGGATCGAGTCTGGCAGCAAATTCAAGGATGGAAGGGCAAGCTATTCTCTGTAGGGGGTAGGGAAGTACTCCTAAAATCTATTGTGCAGGCGATC
CCTTGCTATTCGATGAATTGTTTTCTGCCAAAGAAGTTGATCCTTGATATCAGTAGAGCCATGACCCGATTCTGGTGGGGGGGGGGGAGAAAATATTTTCCGTTC
ACAGACTTTTTAAGGGCGCTGGTGGGGTCGAGGTCGTCATTTATTTGGAGGAGTCTACTGTGGGGGAGAGAGCTTCTGAAAAAGGGATTGCGTTGGCGGATAGGG
AATGGGGAAAGTGTCCCGGTGTATGGAACGAATTGGATTCCTGACGATGGGGGTCTTAGAGTGCGTTCACCAGTGACTTTGGCCCCTGAGACGAGGGTGTCTGAT
CTCATGACAGTGTCTGGGAAGTGGAATGAACAGATTATTTGGCAGCACTTTAGTCCCCACGAGGTAAGTCTTATTTTATTTATCCCCTCACGGCCTGATACTGAG
GATAAAATGGTATGGCATTATGACCAGTCGGGTCTTTTCTCGGTAAAGAGCGAGTATCGATTAGGCCAATCGGCTTGGATTGCACAACTCCCATCTTCCTCTTCG
GATGACTCGATTATGGGTTGGTGGAAGGGGTGTTGGAAAATGGTGATTCCAAATAAGATAAAGATCTTCCTCTGGAGACTATGTTTAGACCGTTTTCCAACAGTT
GAAAATTTTGTTTTTCTGGGTATGGATGTGTTGAATGTGTGTGGTCTCTGTGGCCGACACGGTGAATCTAGCTTACATGTCTTTTGGCATTGTAAGTTTATCAAG
AGGGTTTTGATGGATTCTAAGTTTGGGATATTGGTAAACAATGTGCAGGTTGTGTGGAATTGTCATAACCAGCGAAAATTCAAGGGGATTGGGTCTGTGGAGGGA
CTAGTGGAGTGGGCGGGGAGTTACATCTCTTCGTTCCAACAGGCTACCTTGTCTAGTGGGTTGTGTGGGCATAGGGGGATAACAAGGGAAGAAGTAAGATGGAGT
CCCCCAATCTTTGGTTGGTATAAGGTGAACGTTGATGCAGCCTTTAGAAGAGAGGGATGGCAGGCGGGTTTGGGGGTGATTGTTCGAGATTCTGACGGACGGGTT
ATGATGTCGGCGTCTCTGGTGCAGCAACATATACGAAGCCCGGAGATGGCCGAAGGGTGGGCGGCGGTAAAGGGCGTGAGACTGGCGTTGGAGATGGATCTGTTT
CCACTGGTGCTGGAGACCGACTCTAGTCGAGTGGCTGGGGTTTTTTTAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATCCTATTCTGGAATGTGCGGGGGTTGGGGTCGGCCCGCGCATTCCGTAGGTTGCACAAGCTGGTGCAACAATACCGACCTCAGATGGTGTTCTTGTTG
GAGACGAAGGTGCAGACGTTGAGGTTCGATGTGCTGAAAACTAAGCTGGGGTATCCAAACTGCTTCTGCGTGGACAGTAGAGGGAGGAATGGAGGTTTGGTGCTG
ATGTGGATCAGAGAGAAGGAGGGTGGAAGGTCGAAGTCGACCGGGGAATTATTTGGATTTCGGGAGGTAGTGGATGTGTGCGAACTGATGGATTTGGGTTTCAGT
GGGTCGAGATTTACCTGGTGCAACAGGAGGCCAGGGGAGGATATTGTCTGGGAAAGAATTGATAGGTGTGTCGGAAATGTTGCATGGCGGGATATGTTTCCTGGA
CACGAGGTGAGACATTTGAACTTTAGTCGCTCGGACCATAGGCCCATCCTGTTGAACCTGTCGGTGGGGACGCGACCAATCCAGAATGTTGGTTGTAGAATCAAA
AGGTTTGAGGAGCCTGACTTTTGTCCCAAGATTTTAAGGAAGCGGTGGCAGCTAACTGGGAGCAGGGTGCTTCTGTGGGATCCCCGGCGACGTTTGCATCGGCTA
CGGAAAGGTAGTGCAGAGGAACGACTCAAGTCTTTACTGGTGGAGGAAGAGATGTACTGGAAACAGAGGTCCAGGGAAGGTTGGCTAAAATGGGGGGATAGGAAT
ACCCGCTGGTTCCATTCCCATGCCTCGAGTAGGAGGAAGAAGAATGAGGTGAGGGGCTTGGAGGATAGTGATGGGGTTTGGCAGCAGGATTCAGATAGTGTCCTG
AGGCTAATAGAGAGGTATTTTGAGCACATTTATACGACGTCGCGCCCTTCGAAGGAGGAAATAGATCAAGCTTTGGTGCATGTCCCACCCTCAGTTACAGACGAG
ATGAATAGTAAACTCCTTAAACCGTTCCAGCAAGAAGAAGTGGTACTTGCCCTAAAGCAGATCCACTCGAATAAAGCCCCAGGCCCAGACGGGTTGTTAGGGACT
TTTTATCAGCATTTGTGGCCTGTAGTTGGGGCTGATGTGGTCAACTGCTATTTGAATATCCTGAATAACAGGGTGTCCCCATCTCCTTTGAATGAAACAATGATC
GTTCTGATATCGAAGAAGAGAAGCCACAAAAGTGTGTCTGAGTATCAGCCTATTTCGCTCTGTAATGTGTCATACAAGTTGGTTTCGAAGGCCAAAGAGAGGGTT
GCGGTGGCTGTTCGGGACATTCTTAACTGCTATGAACGAGTGTCGGGGCAGACTGTCAACCTGGATAAATCCATTATCTCTTTCAGCCCGAACACGGATGTAAGG
GATCGGGCTCAGGTTGGGCAGATCCTGGGGGTTCAGGTTACGGCTTGTCATCGCCAATACTTTGGACTTCCGTCTTTTATGCCTCGGAACAAAACGAGCTCGTTG
AGCTCCATAAAGGATCGAGTCTGGCAGCAAATTCAAGGATGGAAGGGCAAGCTATTCTCTGTAGGGGGTAGGGAAGTACTCCTAAAATCTATTGTGCAGGCGATC
CCTTGCTATTCGATGAATTGTTTTCTGCCAAAGAAGTTGATCCTTGATATCAGTAGAGCCATGACCCGATTCTGGTGGGGGGGGGGGAGAAAATATTTTCCGTTC
ACAGACTTTTTAAGGGCGCTGGTGGGGTCGAGGTCGTCATTTATTTGGAGGAGTCTACTGTGGGGGAGAGAGCTTCTGAAAAAGGGATTGCGTTGGCGGATAGGG
AATGGGGAAAGTGTCCCGGTGTATGGAACGAATTGGATTCCTGACGATGGGGGTCTTAGAGTGCGTTCACCAGTGACTTTGGCCCCTGAGACGAGGGTGTCTGAT
CTCATGACAGTGTCTGGGAAGTGGAATGAACAGATTATTTGGCAGCACTTTAGTCCCCACGAGGTAAGTCTTATTTTATTTATCCCCTCACGGCCTGATACTGAG
GATAAAATGGTATGGCATTATGACCAGTCGGGTCTTTTCTCGGTAAAGAGCGAGTATCGATTAGGCCAATCGGCTTGGATTGCACAACTCCCATCTTCCTCTTCG
GATGACTCGATTATGGGTTGGTGGAAGGGGTGTTGGAAAATGGTGATTCCAAATAAGATAAAGATCTTCCTCTGGAGACTATGTTTAGACCGTTTTCCAACAGTT
GAAAATTTTGTTTTTCTGGGTATGGATGTGTTGAATGTGTGTGGTCTCTGTGGCCGACACGGTGAATCTAGCTTACATGTCTTTTGGCATTGTAAGTTTATCAAG
AGGGTTTTGATGGATTCTAAGTTTGGGATATTGGTAAACAATGTGCAGGTTGTGTGGAATTGTCATAACCAGCGAAAATTCAAGGGGATTGGGTCTGTGGAGGGA
CTAGTGGAGTGGGCGGGGAGTTACATCTCTTCGTTCCAACAGGCTACCTTGTCTAGTGGGTTGTGTGGGCATAGGGGGATAACAAGGGAAGAAGTAAGATGGAGT
CCCCCAATCTTTGGTTGGTATAAGGTGAACGTTGATGCAGCCTTTAGAAGAGAGGGATGGCAGGCGGGTTTGGGGGTGATTGTTCGAGATTCTGACGGACGGGTT
ATGATGTCGGCGTCTCTGGTGCAGCAACATATACGAAGCCCGGAGATGGCCGAAGGGTGGGCGGCGGTAAAGGGCGTGAGACTGGCGTTGGAGATGGATCTGTTT
CCACTGGTGCTGGAGACCGACTCTAGTCGAGTGGCTGGGGTTTTTTTAGAATGA
Protein sequenceShow/hide protein sequence
MSILFWNVRGLGSARAFRRLHKLVQQYRPQMVFLLETKVQTLRFDVLKTKLGYPNCFCVDSRGRNGGLVLMWIREKEGGRSKSTGELFGFREVVDVCELMDLGFS
GSRFTWCNRRPGEDIVWERIDRCVGNVAWRDMFPGHEVRHLNFSRSDHRPILLNLSVGTRPIQNVGCRIKRFEEPDFCPKILRKRWQLTGSRVLLWDPRRRLHRL
RKGSAEERLKSLLVEEEMYWKQRSREGWLKWGDRNTRWFHSHASSRRKKNEVRGLEDSDGVWQQDSDSVLRLIERYFEHIYTTSRPSKEEIDQALVHVPPSVTDE
MNSKLLKPFQQEEVVLALKQIHSNKAPGPDGLLGTFYQHLWPVVGADVVNCYLNILNNRVSPSPLNETMIVLISKKRSHKSVSEYQPISLCNVSYKLVSKAKERV
AVAVRDILNCYERVSGQTVNLDKSIISFSPNTDVRDRAQVGQILGVQVTACHRQYFGLPSFMPRNKTSSLSSIKDRVWQQIQGWKGKLFSVGGREVLLKSIVQAI
PCYSMNCFLPKKLILDISRAMTRFWWGGGRKYFPFTDFLRALVGSRSSFIWRSLLWGRELLKKGLRWRIGNGESVPVYGTNWIPDDGGLRVRSPVTLAPETRVSD
LMTVSGKWNEQIIWQHFSPHEVSLILFIPSRPDTEDKMVWHYDQSGLFSVKSEYRLGQSAWIAQLPSSSSDDSIMGWWKGCWKMVIPNKIKIFLWRLCLDRFPTV
ENFVFLGMDVLNVCGLCGRHGESSLHVFWHCKFIKRVLMDSKFGILVNNVQVVWNCHNQRKFKGIGSVEGLVEWAGSYISSFQQATLSSGLCGHRGITREEVRWS
PPIFGWYKVNVDAAFRREGWQAGLGVIVRDSDGRVMMSASLVQQHIRSPEMAEGWAAVKGVRLALEMDLFPLVLETDSSRVAGVFLE