; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004028 (gene) of Chayote v1 genome

Gene IDSed0004028
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG04:43933818..43936078
RNA-Seq ExpressionSed0004028
SyntenySed0004028
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]1.6e-7231.27Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWW     K  IHW  W++L   K  GGLGF+DLE FNQ+L+AKQ WRI++ P SLV RIFR +Y P+   LEA+ G   S++W+S+ WG++L+ KGLR 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK
        R+GNG                           +  V +L TSSG WN  L+++I W  + +  L IP+   +  D  IWHY  +G++SVKS Y+L    K
Subjt:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK

Query:  ITESTSCS---DVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW-SLIFPDLVS--LANSF
           S   S   D+ S +WKK+W L +P KIKF +W+   +F+P  Q L++++I+ + +CP C+   E++ H  ++C   K +W +  + ++      NSF
Subjt:  ITESTSCS---DVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW-SLIFPDLVS--LANSF

Query:  VENAQDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLI-------------RKENILPDVDRADWVENYLKER----------------GALKMSVDAAC
         E    + +  +G    ++      + W +WN RN  I             R   +  +   A+ + + +  R                G  K++VD A 
Subjt:  VENAQDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLI-------------RKENILPDVDRADWVENYLKER----------------GALKMSVDAAC

Query:  RSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVS
        +S  S  G GV+VR+  G  + A    ++        E MA I GL+  ++MG     +E D+   + +I      +   G+L+ E+   + N   +   
Subjt:  RSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVS

Query:  FIRRESNTVAHHLAQWACFSGLSLVWTLDFPPWL
        +  R  N VAH LAQ+A      + W  + P WL
Subjt:  FIRRESNTVAHHLAQWACFSGLSLVWTLDFPPWL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.9e-8232.64Show/hide
Query:  KIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRKRIGNG-----
        K+HW  W  +  PK  GGL F+DLEGFNQ+LVAK VWR +QHP+ LV ++ + KYF + S+L+A     SSY WK  LWGRDL+ KGLR R+GNG     
Subjt:  KIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRKRIGNG-----

Query:  ----------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKITESTSCSDV
                              +  V++ +T+ GNW+   I +   + D++L+LS+PI   +L D W+WHY   G +SV+S YKLY++ K   +++ ++ 
Subjt:  ----------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKITESTSCSDV

Query:  LSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDILIRLAGSLP
            W  +W L VP KIK  IW+  +  IP  QNL  + I +   C IC    E++ H FF C   + +W  +FP L  L+     +  ++   L   L 
Subjt:  LSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDILIRLAGSLP

Query:  TQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKERG----------------------------ALKMSVDAACRSNCSKTGCGVIVRDHR
         +D   A +  W IWNDRN LI  + + P   + +W+  +L                                +LK++ DAACR   +  GC  I+RD  
Subjt:  TQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKERG----------------------------ALKMSVDAACRSNCSKTGCGVIVRDHR

Query:  GFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHLAQWA
           V A +  V   +    AE   ++ GLK         + VESDS+  +  I+ E          + EI+       F+  S   R+ N  AH LA+W 
Subjt:  GFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHLAQWA

Query:  CFS-GLSLVWTLDFPPWLECLLRTNCP
          S   +  W  +FP WL  L++ + P
Subjt:  CFS-GLSLVWTLDFPPWLECLLRTNCP

XP_030483669.1 uncharacterized protein LOC115700241 [Cannabis sativa]7.4e-7329.33Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWWGST + + IHW +W+ L   K  GGLGF++   FNQ+L+AKQ WR+++ P+SL+GR+   +YF NG++L A  G   S  W+SI+WG++L+ +GL+ 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK
        R+G G                           +L V++L+     W+   I       D + +L+IP+      D  IW+ T+ G ++VKS Y+  ++  
Subjt:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK

Query:  ITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW--SLIFPDLVSLANSFVENA
         +  T+ S  L +WW K W L +P+KI+  +WKVF+N +P+   L+ K I+++  CP+C  H+E L H  F C   K +W  SL+  +    A++    +
Subjt:  ITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW--SLIFPDLVSLANSFVENA

Query:  QDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE-----------------------------------------RGAL
         D L+ ++ +  + +FEK  V+ W IW +RN     +         D+  NYL +                                          G L
Subjt:  QDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE-----------------------------------------RGAL

Query:  KMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQ
        K++ DAAC     K G G +VRD  G  V A +  ++        EA+AL H LK    +GL   ++E+DS+ +V  ++   + +S++ ++L+++   + 
Subjt:  KMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQ

Query:  NTCFLGVSFIRRESNTVAHHLAQWACFSGLSLVWTLDFPPWLECLLRTN
              ++ ++R +NT A  LA++A      + W  +FPP L  ++  N
Subjt:  NTCFLGVSFIRRESNTVAHHLAQWACFSGLSLVWTLDFPPWLECLLRTN

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]3.3e-7331.55Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWWGS+    KIHW +W  L   K  GGLGF+    FNQ+ +AKQ WRI Q P+SL+ R+ +G+Y+     + AK    SS  W+ I+WGR+L+ KGL  
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI
        +IG+G                          +  V++ +T +  W+ EL+ N     D + +L+IP+  +S  D+W WHY S G ++VKS Y L  + + 
Subjt:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI

Query:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDI
         + +S S     WW+  WGL +P+K++   W+V N+ +P+ QNL+ +++  SA C +C+   E++ H  F C H KS+W      L     SF+++  D 
Subjt:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDI

Query:  LIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE----------------------------RGALKMSVDAACRSNCSKTG
        L+ L+  L   + EK     W IW+DRN  I  + +   +  +   E YL                                 LKM+VDAA  S+ +K G
Subjt:  LIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE----------------------------RGALKMSVDAACRSNCSKTG

Query:  CGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNT
         GVI+RD  G  + A +  V         EA A+  GL+   ++ L+   VE+D + LV A+Q +    SS+  L+ +I   + +     +S +RR++N 
Subjt:  CGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNT

Query:  VAHHLAQWACFSGLSLVWTLDFP
         AH LA+ A       +W  + P
Subjt:  VAHHLAQWACFSGLSLVWTLDFP

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa]9.3e-7632.45Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWWGS+    KIHW  W  L   K  G LGF+    FNQ+ +AKQ WR+ Q+P SL+ R+ +G+Y+ +   L AK    SS  W+ ILWGR+L+++GLR 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI
        +IG G                          N  V++ +T +  WN EL+       D   +L+IP+  +S+ D WIWHY   G ++VKS Y L  + + 
Subjt:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI

Query:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDI
         + TS S     WWK+ WGL +P+K++   WKV N+ +P+  NL+ +++  SA C +C+   E++ H  F C H K++W      L     S++++  D 
Subjt:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDI

Query:  LIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKE-------------------------------NILPDVDRADWVENYLKERGALKMSVDAACRSNCS
        L+ L+  L   + E+     W IW+DRN  I  +                                +  DV+R  W          LKM+VDAA  S+ S
Subjt:  LIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKE-------------------------------NILPDVDRADWVENYLKERGALKMSVDAACRSNCS

Query:  KTGCGVIVRDHRGFGVIAAAHFVEVG-VDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRR
        K G GVI+RD  G  V+AA     VG       EA A+  GL++  ++ L+  YVE+D + LV AI       SS+  L+ +I   + ++    +S +RR
Subjt:  KTGCGVIVRDHRGFGVIAAAHFVEVG-VDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRR

Query:  ESNTVAHHLAQWACFSGLSLVWTLDFP
        ++N  AH LA+ A       +W  + P
Subjt:  ESNTVAHHLAQWACFSGLSLVWTLDFP

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248741.9e-8232.64Show/hide
Query:  KIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRKRIGNG-----
        K+HW  W  +  PK  GGL F+DLEGFNQ+LVAK VWR +QHP+ LV ++ + KYF + S+L+A     SSY WK  LWGRDL+ KGLR R+GNG     
Subjt:  KIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRKRIGNG-----

Query:  ----------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKITESTSCSDV
                              +  V++ +T+ GNW+   I +   + D++L+LS+PI   +L D W+WHY   G +SV+S YKLY++ K   +++ ++ 
Subjt:  ----------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKITESTSCSDV

Query:  LSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDILIRLAGSLP
            W  +W L VP KIK  IW+  +  IP  QNL  + I +   C IC    E++ H FF C   + +W  +FP L  L+     +  ++   L   L 
Subjt:  LSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDILIRLAGSLP

Query:  TQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKERG----------------------------ALKMSVDAACRSNCSKTGCGVIVRDHR
         +D   A +  W IWNDRN LI  + + P   + +W+  +L                                +LK++ DAACR   +  GC  I+RD  
Subjt:  TQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKERG----------------------------ALKMSVDAACRSNCSKTGCGVIVRDHR

Query:  GFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHLAQWA
           V A +  V   +    AE   ++ GLK         + VESDS+  +  I+ E          + EI+       F+  S   R+ N  AH LA+W 
Subjt:  GFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHLAQWA

Query:  CFS-GLSLVWTLDFPPWLECLLRTNCP
          S   +  W  +FP WL  L++ + P
Subjt:  CFS-GLSLVWTLDFPPWLECLLRTNCP

A0A803NGJ4 Uncharacterized protein9.4e-7430.37Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWWG+    SKIHW  W +L   K +GG+GF     FNQ+L+AKQ WRI ++P+SL+ R+ + +YF N S LEA+ G   S  W+ I WGR+L+ +GLR 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI
        +IGNG                          +L V+ L+T S  WN  L+      +D + +LSIP+     PD+ IWH+T+  I++V S + L  N + 
Subjt:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI

Query:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW-----SLIFPDLVSLANSFVE
        + +TS S+  S WWK  W L +P KIK   WKV  N +P+   L+ +++  SA C +C +  E++ H  F CH  +S+W     S+ F +  ++ N    
Subjt:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW-----SLIFPDLVSLANSFVE

Query:  NAQDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLI----RKENILPDVDRADWVENYLKERGA------------------------------------
           D LI L+      DFE    + W IW +RNK++    ++E +   +   ++++ Y +   A                                    
Subjt:  NAQDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLI----RKENILPDVDRADWVENYLKERGA------------------------------------

Query:  ----LKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEI
            LK++VDAA  S+    G G +VR+H+G  + A +  V+        EA AL H +  V++  L   ++E+D++++ +A+    +  S +  L+ ++
Subjt:  ----LKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEI

Query:  RVFMQNTCFLGVSFIRRESNTVAHHLAQWACFSGLSLVWT
        R  + +   + VS ++R +N  AH LA++A      + WT
Subjt:  RVFMQNTCFLGVSFIRRESNTVAHHLAQWACFSGLSLVWT

A0A803NHG3 Uncharacterized protein4.2e-7430.54Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWWGST  K KIHW  W+ L  PK +GGLGF+DLE FNQ+L+AKQ+WR ++ P SL  ++ +  YFP+ S+L AK G  +S+VW+S++WG+++I KG R 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK
        R+GNG                            LCV +L   SG W+   I+      D  ++L +P     L DK +WHY+ +G ++V+S Y++    +
Subjt:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK

Query:  ITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPIC-NSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQ
         +E+T    ++ +WW+KLW L +P K+K   WK+ N+++P   NL ++++     C  C N   EN+ H  + C   K +W L       +     E+  
Subjt:  ITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPIC-NSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQ

Query:  DILIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE--------------------------RGALKMSVDAACR--SNCSK
          L+RLA  +    +E   V+ W +W  RN       +    +  +W   YL E                          +G +K++VD   R    CS 
Subjt:  DILIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE--------------------------RGALKMSVDAACR--SNCSK

Query:  TGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRES
         GC  +VR+  G  V A+A  +         E  A+  GL+  ++       VESD  + +  I  +         +L+ IR  M +   +G+SF+ RE+
Subjt:  TGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRES

Query:  NTVAHHLAQWACFSGLSLVWTLDFPPWLECLLRTNCP
        N VA+ LA +A  + +  +W    PP     L  + P
Subjt:  NTVAHHLAQWACFSGLSLVWTLDFPPWLECLLRTNCP

A0A803PKJ2 Uncharacterized protein3.7e-7832.51Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWWGS+    KIHW SW  L   K  GGLGF+    FNQ+ +AKQ WR+ Q+P SL+ R+ +G+Y+ +   L AK    SS  W+ I+WGR+L+++GLR 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI
        +IG G                          N  V+  +T +  WN EL+      VD   +L+IP+  SS+ D WIWHY   G ++VKS Y L  + + 
Subjt:  RIGNG--------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKI

Query:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDI
         + TS S     WWK+ WGL +P+K++   WKV N+ +P+  NL+ +++  SA C +C+   E++ H  F C H K++W      L     S++++  D 
Subjt:  TESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDI

Query:  LIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKEN-------------------------------ILPDVDRADWVENYLKERGALKMSVDAACRSNCS
        L+ L+  L   + E+     W IW+DRN  I  +                                +  DV++  W+         LKM+VDAA  S+ S
Subjt:  LIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKEN-------------------------------ILPDVDRADWVENYLKERGALKMSVDAACRSNCS

Query:  KTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRE
        K G GVI+RD  G  V+A +            EA A+  GL+    + L+  YVE+D M LV AI       SS+  L+ +I   + ++    +S +RR+
Subjt:  KTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRE

Query:  SNTVAHHLAQWACFSGLSLVWTLDFP
        +N  AH LA+ A       +W  + P
Subjt:  SNTVAHHLAQWACFSGLSLVWTLDFP

A0A803QQT2 Uncharacterized protein6.7e-8033.08Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWWGS   + KIHW  W  L  PK+KGGLGF+DL  FNQ+L+AKQ+WR ++HP  L  R+ +  YFP   +LEA  G  +S+VW+S++WG+ LI KG R 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK
        R+GNG                           NL V++L  + G W+   I++I    D +L+L IP       DK +WHY+  G +SVKS Y++  +  
Subjt:  RIGNG---------------------------NLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSK

Query:  ITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSH-EENLTHTFFMCHHEKSLW--SLIFPDLVSLANSFVEN
          +  S    +  WWKKLW L +P K+K  +WKV +N++P   NL  + I+ S +C  C+SH +E++ H  + C   K  W  S ++ DL  +     E+
Subjt:  ITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSH-EENLTHTFFMCHHEKSLW--SLIFPDLVSLANSFVEN

Query:  AQDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE-----------------------RGALKMSVDAACRSNCSKTGC
           +L+R+A     +  E   +V+W IWN RN ++         +  +W  N+L +                       R  + ++VDA  +     +G 
Subjt:  AQDILIRLAGSLPTQDFEKACVVAWVIWNDRNKLIRKENILPDVDRADWVENYLKE-----------------------RGALKMSVDAACRSNCSKTGC

Query:  GVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTV
        G +VRD  G  + AAA  ++  +     E MA+  G+++ ++  L+   VE+D +Q V  IQ +         LL+ IR  +    F+G+SF+ RE+N V
Subjt:  GVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTV

Query:  AHHLAQWACFSGLSLVWTLDFPP
        AH LA +A     S +W    PP
Subjt:  AHHLAQWACFSGLSLVWTLDFPP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.6e-2523.02Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAK----EGLGSSYVWKSILWG-RDLIK
        F WGST  K K H   W  +  PK +GGLG +  +  N++L++K  WR++Q  +SL   + + KY   G I +++    +G  SS  W+SI  G RD++ 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAK----EGLGSSYVWKSILWG-RDLIK

Query:  KGLRKRIGNG-----------------------------NLCVSNLLTSSGNWNRELIQNILWS---VDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVK
         G+    G+G                              +   +L      W+   I     +   ++   V+   + G+   D+  W ++ DG FSV+
Subjt:  KGLRKRIGNG-----------------------------NLCVSNLLTSSGNWNRELIQNILWS---VDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVK

Query:  SAYKLYLNSKITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVS-
        SAY++    ++         +++++  LW + VP ++K  +W V N  +   +  + + +S S +C +C    E++ H    C  +  +W  + P     
Subjt:  SAYKLYLNSKITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVS-

Query:  --LANSFVENAQDILIRLAGSLPTQDFEKACVVAWVIW-----------------NDRNKLIRK----------ENIL-----PDVDR-ADWVENYLKER
           + S  E   D L   +G    +D   + + A +IW                  DR K +++           N+L     P V+R   WV   +   
Subjt:  --LANSFVENAQDILIRLAGSLPTQDFEKACVVAWVIW-----------------NDRNKLIRK----------ENIL-----PDVDR-ADWVENYLKER

Query:  GALKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVG-VDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDS-SYGVLLSEI
        G +K++ D A R N      G ++RD    G       + +G      AE   + +GL    E  +  V +E DS ++++   +  + DS     L+   
Subjt:  GALKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVG-VDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDS-SYGVLLSEI

Query:  RVFMQNTCFLGVSFIRRESNTVAHHLAQWA
          F+Q    + +  + RE+N +A  LA +A
Subjt:  RVFMQNTCFLGVSFIRRESNTVAHHLAQWA

P93295 Uncharacterized mitochondrial protein AtMg003103.0e-2448.11Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPK-NKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLR
        FWW S   K KI W +W  L   K + GGLGF+DL  FNQ+L+AKQ +RII  P +L+ R+ R +YFP+ S++E   G   SY W+SI+ GR+L+ +GL 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPK-NKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLR

Query:  KRIGNG
        + IG+G
Subjt:  KRIGNG

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein4.4e-1524.58Show/hide
Query:  LWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW---SLIFPDLVSLANSFVENAQDILIRLAGSLPTQDF
        +W L V  KIK  +W+     +     L S+ I    +C  C   EE + H  F C + +S+W   ++I  +     +SF +N  + LI+L+ +  T   
Subjt:  LWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW---SLIFPDLVSLANSFVENAQDILIRLAGSLPTQDF

Query:  EKACV--VAWVIWNDRNK-LIRKENILPDV-------DRADWVE-NYLKE-------------------------RGALKMSVDAACRSNCSKTGCGVIV
        ++     + W +W  RN  L +++   PD        D  +W+  N   E                          G +K + D+        T  G  +
Subjt:  EKACV--VAWVIWNDRNK-LIRKENILPDV-------DRADWVE-NYLKE-------------------------RGALKMSVDAACRSNCSKTGCGVIV

Query:  RDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHL
        R+  G  V+     ++       AEA+  +H L+++   GL+ V+ ESDS  LV  I       S  G L+ +IR +M    +  + F+ RE N+ A  L
Subjt:  RDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHL

Query:  A
        A
Subjt:  A

AT3G09510.1 Ribonuclease H-like superfamily protein1.1e-2923.9Show/hide
Query:  RGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRKRIGNG--------------------------NLCVSNLLTSSGN---WNRELIQNILWSVD
        + +YF + SIL+AK     SY W S+L G  L+KKG R  IG+G                           + ++NL    G+   W+   I   +   D
Subjt:  RGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRKRIGNG--------------------------NLCVSNLLTSSGN---WNRELIQNILWSVD

Query:  QNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKITESTSCSDVLS--NWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCP
           +  I +  S  PDK IW+Y + G ++V+S Y L  +   T   + +      +   ++W L +  K+K  +W+  +  +   + L ++ +     CP
Subjt:  QNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKITESTSCSDVLS--NWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCP

Query:  ICNSHEENLTHTFFMCHHEKSLWSLIFPDLVS---LANSFVENAQDILIRLAGSLPTQDFEKACVV--AWVIWNDRNKLI--------RKENILPDVDRA
         C+   E++ H  F C      W L    L+    ++N F EN  +IL          DF K   V   W IW  RN ++         K  +    +  
Subjt:  ICNSHEENLTHTFFMCHHEKSLWSLIFPDLVS---LANSFVENAQDILIRLAGSLPTQDFEKACVV--AWVIWNDRNKLI--------RKENILPDVDRA

Query:  DWV-----------------ENYLKERGA----LKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVY
        DW+                 EN ++ R      +K + DA       +   G I+R+H G  +   +  +    +   AE  AL+  L+     G  +V+
Subjt:  DWV-----------------ENYLKERGA----LKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVY

Query:  VESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHLAQWACFSGLSLVWTLDFPPWLE
        +E D   L+  I       SS    L +I  +      +   FIRR+ N +AH LA++ C        +   P WL+
Subjt:  VESDSMQLVLAIQRECLLDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHLAQWACFSGLSLVWTLDFPPWLE

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.8e-0740.32Show/hide
Query:  SNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMC
        +NW   +W L +  KIK LIWK  NN +P+   L S+ IS    C  C    E +TH  F C
Subjt:  SNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMC

AT4G29090.1 Ribonuclease H-like superfamily protein9.1e-4526Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK
        FWW +      +HW +WD LS  K +GG+GFKD+E FN +L+ KQ+WR++  P SL+ ++F+ +YF     L A  G   S+VWKSI   ++++++G R 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLRK

Query:  RIGNGN-----------------------------------LCVSNLLTSSG-NWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKS
         +GNG                                    L VS+L+  SG  W +++I+ +   V++ L+  +   G  + D + W YTS G ++VKS
Subjt:  RIGNGN-----------------------------------LCVSNLLTSSG-NWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKS

Query:  AYKL---YLNSKITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW---SLIFP
         Y +    +N + +        L+  ++K+W      KI+  +WK  +N +P+   L  + +S+ + C  C S +E + H  F C   +  W   S+  P
Subjt:  AYKL---YLNSKITESTSCSDVLSNWWKKLWGLCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLW---SLIFP

Query:  DLVSLANSFVENAQDILIRLAGSLPTQDFEKACVVA----WVIWNDRNKLI-------------RKENIL---------------PDVDRADWVENYLKE
             A+S   N   +   L    P   +EKA  +     W +W +RN+L+             R E+ L               P V+R+         
Subjt:  DLVSLANSFVENAQDILIRLAGSLPTQDFEKACVVA----WVIWNDRNKLI-------------RKENIL---------------PDVDRADWVENYLKE

Query:  RGALKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIR
           +K + DA    +  + G G ++R+ +G      A  +     V  AE  A+   +  +       V  ESDS  L+  +  + +  S    +    R
Subjt:  RGALKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECLLDSSYGVLLSEIR

Query:  VFMQNTCFLGVSFIRRESNTVAHHLAQ
        +  Q T    V FI RE NT+A  +A+
Subjt:  VFMQNTCFLGVSFIRRESNTVAHHLAQ

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.1e-2548.11Show/hide
Query:  FWWGSTMTKSKIHWTSWDVLSIPK-NKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLR
        FWW S   K KI W +W  L   K + GGLGF+DL  FNQ+L+AKQ +RII  P +L+ R+ R +YFP+ S++E   G   SY W+SI+ GR+L+ +GL 
Subjt:  FWWGSTMTKSKIHWTSWDVLSIPK-NKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYVWKSILWGRDLIKKGLR

Query:  KRIGNG
        + IG+G
Subjt:  KRIGNG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGGCTGGAAATGTGTTTTATCCAAAACGTGGTCTTAGACAAGGAGATCCTCTTTCCCCCTATTTATTCATTCTATTTCTGGTGGGGTTCTACAATGACAAAATC
CAAAATTCATTGGACCAGTTGGGATGTTCTTAGCATTCCTAAAAATAAGGGCGGATTAGGTTTTAAAGACCTGGAAGGTTTTAATCAATCTTTAGTGGCAAAACAAGTGT
GGAGAATCATCCAACACCCATCTTCTTTGGTTGGGAGGATTTTTAGGGGAAAATATTTTCCTAATGGATCTATTTTGGAGGCAAAAGAAGGGCTTGGCTCTTCTTATGTG
TGGAAAAGTATTCTTTGGGGTAGAGACTTAATTAAAAAAGGGCTAAGGAAGCGCATTGGCAATGGTAATTTGTGTGTCTCTAATCTATTGACCAGCAGTGGTAATTGGAA
TCGTGAGTTGATTCAAAACATTCTTTGGTCAGTTGATCAAAATTTGGTGTTATCAATTCCCATTTGTGGTTCTTCTTTACCGGATAAATGGATCTGGCATTACACTAGTG
ATGGGATCTTCTCGGTTAAAAGTGCTTACAAGCTTTATTTGAATTCTAAAATAACTGAATCAACCTCTTGTTCGGATGTTTTATCCAATTGGTGGAAAAAGTTGTGGGGT
TTGTGTGTGCCTGCTAAGATCAAATTTCTTATATGGAAAGTGTTCAACAATTTCATCCCTATTATGCAGAATTTATACAGCAAAAGAATCTCACAGTCTGCCTTATGCCC
TATTTGTAATAGCCATGAAGAAAATTTAACCCATACCTTTTTTATGTGCCATCATGAAAAATCGCTATGGTCCTTGATATTTCCTGATCTTGTATCCCTTGCTAATAGTT
TTGTTGAGAATGCTCAAGATATTTTAATTAGGCTTGCTGGATCTTTGCCTACTCAGGATTTCGAAAAAGCATGCGTTGTTGCATGGGTTATTTGGAATGATAGAAATAAG
TTGATCAGGAAGGAGAATATTTTACCAGATGTTGATAGAGCTGATTGGGTGGAAAATTACCTGAAAGAAAGGGGTGCTCTTAAGATGTCGGTGGATGCAGCTTGCCGATC
GAATTGCTCTAAAACAGGTTGTGGTGTGATTGTTCGGGATCATCGCGGGTTTGGTGTTATTGCTGCTGCTCATTTCGTTGAGGTTGGCGTTGATGTATTTGCTGCTGAAG
CTATGGCTTTAATCCATGGACTAAAGATTGTTTTGGAGATGGGTTTGAAGGAGGTTTATGTTGAATCTGATTCTATGCAGTTGGTGTTGGCAATTCAAAGGGAATGTTTG
CTCGATTCAAGCTATGGTGTTCTGTTGTCTGAAATTAGGGTTTTTATGCAGAATACCTGTTTTTTGGGTGTTTCTTTTATTCGTAGAGAATCTAATACCGTAGCCCATCA
TTTAGCTCAATGGGCTTGTTTTTCTGGGCTTTCTTTGGTGTGGACCTTAGATTTTCCTCCATGGTTGGAGTGTTTGTTAAGGACAAATTGTCCTCCTGTTGATTTGTTTG
GGCTGCTGCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGGCTGGAAATGTGTTTTATCCAAAACGTGGTCTTAGACAAGGAGATCCTCTTTCCCCCTATTTATTCATTCTATTTCTGGTGGGGTTCTACAATGACAAAATC
CAAAATTCATTGGACCAGTTGGGATGTTCTTAGCATTCCTAAAAATAAGGGCGGATTAGGTTTTAAAGACCTGGAAGGTTTTAATCAATCTTTAGTGGCAAAACAAGTGT
GGAGAATCATCCAACACCCATCTTCTTTGGTTGGGAGGATTTTTAGGGGAAAATATTTTCCTAATGGATCTATTTTGGAGGCAAAAGAAGGGCTTGGCTCTTCTTATGTG
TGGAAAAGTATTCTTTGGGGTAGAGACTTAATTAAAAAAGGGCTAAGGAAGCGCATTGGCAATGGTAATTTGTGTGTCTCTAATCTATTGACCAGCAGTGGTAATTGGAA
TCGTGAGTTGATTCAAAACATTCTTTGGTCAGTTGATCAAAATTTGGTGTTATCAATTCCCATTTGTGGTTCTTCTTTACCGGATAAATGGATCTGGCATTACACTAGTG
ATGGGATCTTCTCGGTTAAAAGTGCTTACAAGCTTTATTTGAATTCTAAAATAACTGAATCAACCTCTTGTTCGGATGTTTTATCCAATTGGTGGAAAAAGTTGTGGGGT
TTGTGTGTGCCTGCTAAGATCAAATTTCTTATATGGAAAGTGTTCAACAATTTCATCCCTATTATGCAGAATTTATACAGCAAAAGAATCTCACAGTCTGCCTTATGCCC
TATTTGTAATAGCCATGAAGAAAATTTAACCCATACCTTTTTTATGTGCCATCATGAAAAATCGCTATGGTCCTTGATATTTCCTGATCTTGTATCCCTTGCTAATAGTT
TTGTTGAGAATGCTCAAGATATTTTAATTAGGCTTGCTGGATCTTTGCCTACTCAGGATTTCGAAAAAGCATGCGTTGTTGCATGGGTTATTTGGAATGATAGAAATAAG
TTGATCAGGAAGGAGAATATTTTACCAGATGTTGATAGAGCTGATTGGGTGGAAAATTACCTGAAAGAAAGGGGTGCTCTTAAGATGTCGGTGGATGCAGCTTGCCGATC
GAATTGCTCTAAAACAGGTTGTGGTGTGATTGTTCGGGATCATCGCGGGTTTGGTGTTATTGCTGCTGCTCATTTCGTTGAGGTTGGCGTTGATGTATTTGCTGCTGAAG
CTATGGCTTTAATCCATGGACTAAAGATTGTTTTGGAGATGGGTTTGAAGGAGGTTTATGTTGAATCTGATTCTATGCAGTTGGTGTTGGCAATTCAAAGGGAATGTTTG
CTCGATTCAAGCTATGGTGTTCTGTTGTCTGAAATTAGGGTTTTTATGCAGAATACCTGTTTTTTGGGTGTTTCTTTTATTCGTAGAGAATCTAATACCGTAGCCCATCA
TTTAGCTCAATGGGCTTGTTTTTCTGGGCTTTCTTTGGTGTGGACCTTAGATTTTCCTCCATGGTTGGAGTGTTTGTTAAGGACAAATTGTCCTCCTGTTGATTTGTTTG
GGCTGCTGCCCTGA
Protein sequenceShow/hide protein sequence
MERLEMCFIQNVVLDKEILFPPIYSFYFWWGSTMTKSKIHWTSWDVLSIPKNKGGLGFKDLEGFNQSLVAKQVWRIIQHPSSLVGRIFRGKYFPNGSILEAKEGLGSSYV
WKSILWGRDLIKKGLRKRIGNGNLCVSNLLTSSGNWNRELIQNILWSVDQNLVLSIPICGSSLPDKWIWHYTSDGIFSVKSAYKLYLNSKITESTSCSDVLSNWWKKLWG
LCVPAKIKFLIWKVFNNFIPIMQNLYSKRISQSALCPICNSHEENLTHTFFMCHHEKSLWSLIFPDLVSLANSFVENAQDILIRLAGSLPTQDFEKACVVAWVIWNDRNK
LIRKENILPDVDRADWVENYLKERGALKMSVDAACRSNCSKTGCGVIVRDHRGFGVIAAAHFVEVGVDVFAAEAMALIHGLKIVLEMGLKEVYVESDSMQLVLAIQRECL
LDSSYGVLLSEIRVFMQNTCFLGVSFIRRESNTVAHHLAQWACFSGLSLVWTLDFPPWLECLLRTNCPPVDLFGLLP