; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy07g004100 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy07g004100
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationChr07:19610161..19611816
RNA-Seq ExpressionLcy07g004100
SyntenyLcy07g004100
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.4e-6630.95Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        ++AK+ WRL+   ++P+SL+A+V++ RY+K   F  A +GSNPS+ WRSI+WG ++ K+G RWR+GDG  V + KD WI +     PI          VA
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
         L      W    +   F++ED  AIL     S   EDE+LW+ D KG +SVK  Y+L   +NQ F     +      +WK  W L LP K+KI  WR  
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDRE-DRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSI
         +ILPT  NL  R     P+C  C+ + ET SH+   CK  R +W     LA L     ++ ++     +  +W R       ++     ++ CW IWS 
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDRE-DRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSI

Query:  RNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRD
        RN      ++ +   +       + A           YQ     G        G           ++KW P S    KL+ DA+  +  +   +G ++RD
Subjt:  RNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRD

Query:  WSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLA
            +LA G K       +S  EA +I  GLQ    ++    L+VE+D  +VV L+N      TE+++ + + +R     K    + IPR  N  AH+LA
Subjt:  WSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLA

Query:  QKAYEEDGPKSWSHSFP
        + A        W  +FP
Subjt:  QKAYEEDGPKSWSHSFP

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]5.3e-6130.77Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        ++AK+ WR++   +FPSSL+A+VL+ RYFK   F+ A LGS PS+ WRSI+WGR++  +G RWR+G+G NV +  + WI +     PI         TVA
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
        +L      W E LI   F  EDA AI+  P      ED+++W+ D KG +SVK  Y++  ++      S +N+   + +W+  WKL +P K+KI  WR  
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLW--QRGGNTSTNILHIKCSLIICWRIWS
        +D+LPT  NL  + +   P+C  C    ET SH    C   R +W +Y  LA        E+ R     D +W  Q        +   + + ++ W IW 
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLW--QRGGNTSTNILHIKCSLIICWRIWS

Query:  IRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLR
         RN      ++ N   +R +   +  A +       +P  +   +G  ER                +++WSP  +G  K++ DA+   + +   +G V+R
Subjt:  IRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLR

Query:  DWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPS--VTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAH
        D      AA  K +     ++  EA ++  GL+      +T G+    E+DSL+V++LIN +    TE+ + I + Q      +     H PR  NY AH
Subjt:  DWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPS--VTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAH

Query:  SLAQKAYEEDGPKSWSHSFP
        SLA+ A ++     W    P
Subjt:  SLAQKAYEEDGPKSWSHSFP

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]5.7e-6330.13Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        ++AK++WRL+   ++P+SL+++VL+ RYF+   FL A  G+N SY WRSI+WGR++ K+G RWR+G+G  + I  D W+ +     PIF         VA
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
         L + +  W+E  +R  FL+ D   IL  P  +   EDE+LW+ D +G +SVK  Y+L   +  +F  S++  +     W   W L+LP K+KI  WR  
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR
        N++LP+  NL  R +   P C  C+   ET SH    CK  R +W   L     +   +   + I  TL  + +    +   ++     + +CW  W  R
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR

Query:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW
        N    + + LN           ISA+  E +               +R+    Q  + +     +++W P     +K++ DA++ S      VG V+RD 
Subjt:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW

Query:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ
        +  ++AAG          S  EA +++ GLQ   +      L++E+D L+VV L+N      +E+ + I   Q    I +   + HIPR  N  AH LA+
Subjt:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ

Query:  KAYEEDGPKSWSHSFPDWLLD
         A  +  P  W  + P  L D
Subjt:  KAYEEDGPKSWSHSFPDWLLD

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]1.8e-5627.46Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        +LAK+ WR+    + P SL+A+VL+ RYF + +FL A +GSNPSY WRSI+WGR++   G RWR+G+G +V I K  WI K     P+          V+
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
        +L      W+E LI   F + DA  I   P    + EDE++W+    G+++VK  Y+   ++  RF A  ++ +  +  W   W L LP KI+I  WR  
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR
         ++LP+  NL  R +   P C LC+   E   H    CK  + +W   L   ++     ++   I   L G+ +   N   ++       ++ W  W+ R
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR

Query:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW
        N      +R N +++  + + +     ++ +            G+ +++A  G              W+P  +G  K++ DA+  S++    +G V+RD 
Subjt:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW

Query:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ
        +  + A   K       +++ EA ++  GLQ +        +++E+DS +VV+L+N      +E+ + + E Q++       S  +  R+ N +AHSL +
Subjt:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ

Query:  KAYEEDGPKSWSHSFPDWLLDENERDTG
         A E+     W  S+P  ++  +  D G
Subjt:  KAYEEDGPKSWSHSFPDWLLDENERDTG

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]7.2e-5829.23Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        +LAK++WR++   +FP+SLL+ +LR RYF  G++L A LGSNPS TWRS++WG+EL  +G RWRVG G  ++   D W+       P F      +  VA
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
         L   +  W+   +  +F + D   +L+ P      +D ++WN    G ++VK  Y     + ++  ++ +N    E  W +FWKLKLPPK++I  W+++
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR
        +  LP  + L  R +   P C +C   EET  H  + C   + +W     L+N S  F   +R  S T D L     +TS +   ++  L++CW IW  R
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR

Query:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW
        N I H N       +       ++        + +P         +    PS      +  P    KW+    G  KL+ DA+   +R    +G VLR+ 
Subjt:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW

Query:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ
           ++AA  K          +EAL +   L  + S    V   +E DSL VV  +       +  +  +     + S      I H+ R+ N  AH+LA+
Subjt:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ

Query:  KAYEEDGPKSWSHSFPDWLL
         A   D    W  +FP  L+
Subjt:  KAYEEDGPKSWSHSFPDWLL

TrEMBL top hitse value%identityAlignment
A0A1R3GNW3 Reverse transcriptase4.2e-5627.84Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFT-V
        +LAK+ W+L+     P SL+A+VL+ RYF    FL+A  G  PS+TWRSI+ GR+L + G RWR+G+G +V I  D W+ K     P      I   + V
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFT-V

Query:  AQLKRPNGM-WNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWR
        A L    G  W+  LIR+ F+EE+A AI+  P    +  D ++W+ D+KG +SVK  YR+ C +    + +     D +  +   W   +PPK+++  WR
Subjt:  AQLKRPNGM-WNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWR

Query:  IYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWS
        ++ + L  + +L+ R MDV   CF C+++ E+ +H    C     +W       ++S  F  +D  +   +       GN   N    + SL+  W IW+
Subjt:  IYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWS

Query:  IRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLR
         RN                + +++ S SI     D   +   + +  +       +    +  P     W P +    K+++D ++ S R+ G+ G + R
Subjt:  IRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLR

Query:  DWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSL
        +    +L A    I + SD    E+ + V  L     + G   +++E D+L ++  IN   +D + +  ++KEA+ +  +     + HI R  N +AH L
Subjt:  DWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSL

Query:  AQKAYEEDGPKSWSHSFPDWLLDENERD
        A+     D    W    P WL D  + D
Subjt:  AQKAYEEDGPKSWSHSFPDWLLDENERD

A0A1S8AC01 Ribonuclease H-like superfamily protein3.8e-5729.34Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        ++AK+ WRL+   + P SL A+VL  +YFK   FL A LGSNPS+ WRSIIWGR++   G RWR+G G  V I K  WI +     P        +  V+
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
         L   +  W+E  I   F + DA  I++ P      +D+ILW+ D KGR+SVK  Y++  ++  +F A  ++ ++    W+  W L LP K++I  WR  
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR
         ++LP+  NL  R +   P+C +C+   E+  H   +CK  R +W        L+   +       E L  L  +G   S     I+  + ICW IWS R
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR

Query:  NLISHNNQRLNQETI--------RDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGS
        N      ++ + + +        +   +  + A IH    D++ ++ Q                         + WSP   G +KL+ DA+   D++   
Subjt:  NLISHNNQRLNQETI--------RDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGS

Query:  VGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVR---LLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPR
        +G V+R+    ++AA   C     D++++EA ++  GLQ    V G  +   L+VE DS +VV+ +N +     E+ + +   Q +        + HIPR
Subjt:  VGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVR---LLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPR

Query:  A
        +
Subjt:  A

A0A1S8ACU2 Ribonuclease H-like superfamily protein1.7e-6030.78Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        ++AK+SWR+IK+P+   SL+AK+L+ +YFK   FL+A LGS PS+ WRSIIWGR++   G RWR+G G  V I K  WI +     PI         TVA
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
        +L   N  W E LI+  F  EDA  I       +   D+ILW+ D KG +SVK  Y++  ++    + SS++       W   W L+LP KIKI  W+  
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR
         + LPT  NL  R M   P+C  C+ K+E  +H    CK  + +W K  P      L D +D  +   L  L  R       ++     + +CW  W  R
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR

Query:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW
        N+    N+R +          QIS +  E +  E   +++  + Q    A   +G           KW P   G +K + DA+   ++    +G V+RD 
Subjt:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW

Query:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKE-AQRMCSIKKVDSITHIPRAHNYLAHSLA
        S  ++ A         D++  EA ++  GLQ +   +    L++E D   VV+ +       TE+ + I E  Q++ S   +  +  I R  N +AH+LA
Subjt:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKE-AQRMCSIKKVDSITHIPRAHNYLAHSLA

Query:  QKAYEEDGPKSWSHSFPDWLLDE
        + A        W    P  ++ +
Subjt:  QKAYEEDGPKSWSHSFPDWLLDE

A0A6J1DAR4 uncharacterized protein LOC1110189541.6e-5527.67Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFT--
        +LAK+ WR++     P+S+L++VL+GRYFK   F++A +  NPSY WRSI+WGR+L K+G RWR+G+G +V I  D W+  +     I + P +   +  
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFT--

Query:  VAQLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQA-SSANYKDQEAMWKDFWKLKLPPKIKICGW
         + +    G W   ++R  F  ++A  IL+ P      ED ++WN +  G +SV+  Y++    N   QA SS++ ++    W  FWK+ +P KIK+  W
Subjt:  VAQLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQA-SSANYKDQEAMWKDFWKLKLPPKIKICGW

Query:  RIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIW
        R+  D LPT  NL+ RG+++   C+ C    E + HLFW CK    LW                + +  +    L  R  + S +    +   ++ W +W
Subjt:  RIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIW

Query:  SIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVL
        + RN  + N+   + +T+  I          EL+     Y M++ E ++  +     G V     T    W P  +G +K++ DAS+ +  +   +G ++
Subjt:  SIRNLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVL

Query:  RDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHS
         +    ++AA  K + +   +   EA++ VEGLQ    +  G+   +E               D +E    + +A+   +     S   + R  N  AH 
Subjt:  RDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHS

Query:  LAQKAYEEDGPKSWSHSFPDWLLD
        LA++A          H F  W+ D
Subjt:  LAQKAYEEDGPKSWSHSFPDWLLD

A0A6J1DX30 uncharacterized protein LOC1110248742.1e-5529.98Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA
        ++AK  WR   F + P+ L++KVL+ +YFK    L+AS  S  SY W+  +WGR+L  +G R RVG+G  +    D W+ +     P+  +      TVA
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVA

Query:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY
             +G W+   I  SF  ED   IL+ P  S   +D  LW+ D +G +SV+  Y+L   +     ++S NY+  +  W   WKL +P KIKI  WR  
Subjt:  QLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIY

Query:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR
        ++ +PT  NL  RG+   P C +C ++ E+  H F+HCK  R +W    P   L+CL   ++    E    L ++      N+     + I  W IW+ R
Subjt:  NDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIR

Query:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW
        N + H  Q    E   + L                P+     + Q    +P  Q         V + W P S    KL+ DA+ R      S G ++RD 
Subjt:  NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDW

Query:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ
        S +L+AA    +         E   I+EGL+   + +    L VE+DSL  + LI  E     +   ++ E Q +       S +H  R  N  AH LA+
Subjt:  SRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQ

Query:  KAY-EEDGPKSWSHSFPDWLLDENERD
                  +W  +FP WLLD  +RD
Subjt:  KAY-EEDGPKSWSHSFPDWLLDENERD

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.4e-1622.5Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRY----FKTGHFL--KASLGSNPSYTWRSIIWG-RELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPN
        +++K  WRL+   +  +SL   VL+ +Y     +   +L  K S  S    TWRSI  G R++   G  W  GDG  +    D W+       P+    N
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRY----FKTGHFL--KASLGSNPSYTWRSIIWG-RELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPN

Query:  IRHFT------VAQLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMG-EDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKL
            T         L  P   W+   I         L + A       G  D + W     G+FSV+ AY +                +  + +   WK+
Subjt:  IRHFT------VAQLKRPNGMWNEPLIRASFLEEDALAILATPTKSNMG-EDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKL

Query:  KLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETL-DGLWQRGGNTSTNILH
        ++P ++K   W + N  + T    + R +    VC +C+   E+  H+   C    G+W + +P       F +    + E L D L  R G    +I  
Subjt:  KLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETL-DGLWQRGGNTSTNILH

Query:  IKCSLIICWRIWSIR--NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLG--DPTVRRK--WSPISDGCWKLS
             +I W  W  R  N+   N +                        D   +  +W       +  +  G VL+G   P V R   W     G  K++
Subjt:  IKCSLIICWRIWSIR--NLISHNNQRLNQETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLG--DPTVRRK--WSPISDGCWKLS

Query:  YDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGL-----QAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQR
         D + R +    S G VLRD +      GF            E   +  GL     + +P      R+ +E DS  +V  +     D   L+F ++    
Subjt:  YDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGL-----QAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQR

Query:  MCSIKKVDSITHIPRAHNYLAHSLAQKAY
              +  I H+ R  N LA  LA  A+
Subjt:  MCSIKKVDSITHIPRAHNYLAHSLAQKAY

P93295 Uncharacterized mitochondrial protein AtMg003103.6e-1242.05Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPI
        +LAK+S+R+I     P +LL+++LR RYF     ++ S+G+ PSY WRSII GREL   G    +GDG +  +  D WI  E    P+
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPI

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.2e-2823.23Show/hide
Query:  LRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVAQLKRPNG---MWNEPLIRASFLE
        ++ RYFK    L A +    SY W S++ G  L K+G R  +GDG N+ I  D  +       P+ T    +  T+  L    G    W++  I     +
Subjt:  LRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVAQLKRPNG---MWNEPLIRASFLE

Query:  EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAY-RLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWP
         D   I       +   D+I+WN ++ G ++V+  Y  L    +    A +  +   +   +  W L + PK+K   WR  +  L T   L  RGM + P
Subjt:  EDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAY-RLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWP

Query:  VCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDR-EDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRNLISHNNQRLNQETIRDI
         C  C  + E+ +H  + C      W     L++ S + ++       E +  +     +T+ +  H    + + WRIW  RN +  N  +  +   + +
Subjt:  VCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDR-EDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRNLISHNNQRLNQETIRDI

Query:  LQQQISASIHELIGDEEPYQMQWLEG-QTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASD
        L  +  A  H+           WL   Q+ +  PS    +       + +W        K ++DA +   +   + GW++R+   T ++ G   +   S+
Subjt:  LQQQISASIHELIGDEEPYQMQWLEG-QTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASD

Query:  ISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDET------ELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAYEEDGPKSW
            E  +++  LQ    + G  ++ +E D   ++NLING     +      +++F+   A +  SI+       I R  N LAH LA+         S 
Subjt:  ISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDET------ELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAYEEDGPKSW

Query:  SHSFPDWL
        S S P WL
Subjt:  SHSFPDWL

AT3G25270.1 Ribonuclease H-like superfamily protein3.2e-1624.38Show/hide
Query:  WKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLW-AKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTN
        WKLK  PKIK   W++ +  L T  NL  R +   P C  C +++ET+ HLF+ C   + +W A  +P   L       + ++   L        N    
Subjt:  WKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLW-AKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTN

Query:  ILHIKCSLIICWRIWSIRNLISHNNQRLN-QETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTV-RRKWSPISDGCWKLSY
        + ++  ++ I WR+W  RN +    + ++ Q T+     Q+    + E   ++    +Q L  Q                PT+ R KW        K +Y
Subjt:  ILHIKCSLIICWRIWSIRNLISHNNQRLN-QETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTV-RRKWSPISDGCWKLSY

Query:  DASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKK
        D ++         GW++RD +   + +G    ++ SD    E  +++  +Q   S  G  +++ E DS QV  L+N E ++    N +I+E +      +
Subjt:  DASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKK

Query:  VDSITHIPRAHNYLAHSLAQ
              +PR +N  A  LA+
Subjt:  VDSITHIPRAHNYLAHSLAQ

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.1e-0738.6Show/hide
Query:  DFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHC
        D W LK+ PKIK+  W+  N+ LP  + L +R + + P C  CR+  ET +H+ ++C
Subjt:  DFWKLKLPPKIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHC

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-3123.19Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTH--PNIRHFT
        +L K+ WR++  P+   SL+AKV + RYF     L A LGS PS+ W+SI   +E+ ++G R  VG+G ++ I +  W+  +  +  +     P   + +
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTH--PNIRHFT

Query:  VAQLKRPNGM-------WNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQ-MNQRFQASSANYKDQEAMWKDFWKLKLPP
        V+ + + + +       W + +I   F E +   I           D   W+  S G ++VK  Y +  Q +N+R      +      +++  WK +  P
Subjt:  VAQLKRPNGM-------WNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQ-MNQRFQASSANYKDQEAMWKDFWKLKLPP

Query:  KIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWA-KYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCS
        KI+   W+  ++ LP    L  R +     C  C   +ET +HL + C   R  WA   +P+     L       I   L  ++  G     N    K S
Subjt:  KIKICGWRIYNDILPTLSNLNNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWA-KYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCS

Query:  LII---CWRIWSIRNLISHNNQRLN-QETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRR----KWSPISDGCWKLSYD
         ++    WR+W  RN +    +  N QE +R                 E+  +   +  + E              P V R    +W P      K + D
Subjt:  LII---CWRIWSIRNLISHNNQRLN-QETIRDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRR----KWSPISDGCWKLSYD

Query:  ASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVR-------------LLVENDSLQVVNLINGEDVDETELNFF
        A+W  D E   +GWVLR              N   ++ W+ A ++ +    + +    +R             ++ E+DS  ++ ++N +++    L   
Subjt:  ASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISWLEALSIVEGLQAIPSVTGGVR-------------LLVENDSLQVVNLINGEDVDETELNFF

Query:  IKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQK--AYEEDGPKSWSHSFPDW
        I++ QR+ S         IPR  N LA  +A++  ++    PK +S   P W
Subjt:  IKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQK--AYEEDGPKSWSHSFPDW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-1342.05Show/hide
Query:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPI
        +LAK+S+R+I     P +LL+++LR RYF     ++ S+G+ PSY WRSII GREL   G    +GDG +  +  D WI  E    P+
Subjt:  MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGCCAAGAAAAGTTGGAGGCTAATCAAATTTCCTAAATTTCCTAGTAGTTTATTAGCTAAAGTCCTTAGAGGTCGTTACTTTAAAACAGGTCACTTTCTC
AAAGCCTCGTTGGGGTCCAATCCTTCCTACACGTGGAGAAGCATCATCTGGGGAAGAGAGCTATTTAAAGAAGGATATCGATGGAGAGTTGGTGATGGTTTCAAT
GTAGACATTTCTAAAGATCTTTGGATCCACAAAGAAGGAAGGGCAACACCAATATTCACCCACCCTAATATTAGACATTTTACTGTGGCTCAACTCAAAAGGCCT
AATGGCATGTGGAATGAACCTCTAATTCGGGCTTCCTTTCTGGAGGAGGATGCCTTAGCCATTTTAGCCACACCTACTAAGTCTAATATGGGGGAGGATGAAATT
CTATGGAACCTTGATTCAAAAGGGAGATTCTCGGTGAAGGGTGCTTATCGTTTGGGGTGTCAAATGAATCAAAGATTTCAAGCTTCCTCTGCGAATTACAAGGAT
CAAGAGGCCATGTGGAAGGATTTTTGGAAGCTCAAATTACCCCCGAAGATCAAAATATGTGGCTGGAGGATCTACAATGACATCTTACCCACATTATCCAACCTT
AATAATAGAGGGATGGATGTGTGGCCAGTATGTTTCCTGTGTAGGGAAAAAGAAGAGACAACATCCCACCTCTTTTGGCATTGCAAGATGACTAGGGGATTGTGG
GCTAAATATTTACCTCTTGCTAACTTGAGCTGTCTTTTTGACAGGGAGGATAGGCGGATATCAGAGACTCTAGATGGGTTATGGCAGAGAGGCGGGAACACTTCG
ACAAACATTCTTCACATCAAATGCAGTCTTATTATATGTTGGAGAATATGGTCTATTCGTAATTTAATCAGTCACAACAATCAGAGACTCAATCAAGAGACCATC
AGAGACATACTTCAGCAACAAATTAGTGCATCCATTCACGAGCTAATAGGAGATGAGGAGCCTTACCAGATGCAGTGGCTGGAGGGACAAACTGAGCGCCTTGCA
CCGTCCGGCCAAGGAGGAGTCTTGCTGGGAGATCCAACGGTACGGAGGAAATGGTCCCCAATCTCCGATGGCTGCTGGAAGCTCAGCTACGATGCCTCCTGGCGT
TCAGATCGCGAGTGTGGAAGCGTCGGTTGGGTGCTTCGAGATTGGAGCAGAACATTGTTAGCAGCGGGTTTCAAATGTATTAATTCGGCGTCGGACATCAGCTGG
CTAGAAGCTCTATCGATCGTCGAAGGTTTGCAGGCGATCCCTTCGGTCACTGGTGGAGTGCGTCTTCTTGTGGAGAACGATTCCTTGCAAGTGGTGAATCTGATA
AATGGGGAAGACGTGGATGAAACTGAGTTGAACTTCTTCATTAAAGAAGCTCAACGCATGTGTTCTATTAAAAAAGTGGATTCCATAACTCACATTCCTCGGGCC
CATAATTATTTGGCTCATAGTCTTGCCCAGAAGGCTTATGAAGAAGATGGCCCAAAGAGTTGGTCTCATTCATTCCCAGATTGGCTTTTAGATGAAAATGAGAGA
GATACCGGTTGTGTACATCACAAAAATAGGGGATCCTGTCCTATTTGTGATCATGTTTCGAACACTTTTGCTGCGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGCCAAGAAAAGTTGGAGGCTAATCAAATTTCCTAAATTTCCTAGTAGTTTATTAGCTAAAGTCCTTAGAGGTCGTTACTTTAAAACAGGTCACTTTCTC
AAAGCCTCGTTGGGGTCCAATCCTTCCTACACGTGGAGAAGCATCATCTGGGGAAGAGAGCTATTTAAAGAAGGATATCGATGGAGAGTTGGTGATGGTTTCAAT
GTAGACATTTCTAAAGATCTTTGGATCCACAAAGAAGGAAGGGCAACACCAATATTCACCCACCCTAATATTAGACATTTTACTGTGGCTCAACTCAAAAGGCCT
AATGGCATGTGGAATGAACCTCTAATTCGGGCTTCCTTTCTGGAGGAGGATGCCTTAGCCATTTTAGCCACACCTACTAAGTCTAATATGGGGGAGGATGAAATT
CTATGGAACCTTGATTCAAAAGGGAGATTCTCGGTGAAGGGTGCTTATCGTTTGGGGTGTCAAATGAATCAAAGATTTCAAGCTTCCTCTGCGAATTACAAGGAT
CAAGAGGCCATGTGGAAGGATTTTTGGAAGCTCAAATTACCCCCGAAGATCAAAATATGTGGCTGGAGGATCTACAATGACATCTTACCCACATTATCCAACCTT
AATAATAGAGGGATGGATGTGTGGCCAGTATGTTTCCTGTGTAGGGAAAAAGAAGAGACAACATCCCACCTCTTTTGGCATTGCAAGATGACTAGGGGATTGTGG
GCTAAATATTTACCTCTTGCTAACTTGAGCTGTCTTTTTGACAGGGAGGATAGGCGGATATCAGAGACTCTAGATGGGTTATGGCAGAGAGGCGGGAACACTTCG
ACAAACATTCTTCACATCAAATGCAGTCTTATTATATGTTGGAGAATATGGTCTATTCGTAATTTAATCAGTCACAACAATCAGAGACTCAATCAAGAGACCATC
AGAGACATACTTCAGCAACAAATTAGTGCATCCATTCACGAGCTAATAGGAGATGAGGAGCCTTACCAGATGCAGTGGCTGGAGGGACAAACTGAGCGCCTTGCA
CCGTCCGGCCAAGGAGGAGTCTTGCTGGGAGATCCAACGGTACGGAGGAAATGGTCCCCAATCTCCGATGGCTGCTGGAAGCTCAGCTACGATGCCTCCTGGCGT
TCAGATCGCGAGTGTGGAAGCGTCGGTTGGGTGCTTCGAGATTGGAGCAGAACATTGTTAGCAGCGGGTTTCAAATGTATTAATTCGGCGTCGGACATCAGCTGG
CTAGAAGCTCTATCGATCGTCGAAGGTTTGCAGGCGATCCCTTCGGTCACTGGTGGAGTGCGTCTTCTTGTGGAGAACGATTCCTTGCAAGTGGTGAATCTGATA
AATGGGGAAGACGTGGATGAAACTGAGTTGAACTTCTTCATTAAAGAAGCTCAACGCATGTGTTCTATTAAAAAAGTGGATTCCATAACTCACATTCCTCGGGCC
CATAATTATTTGGCTCATAGTCTTGCCCAGAAGGCTTATGAAGAAGATGGCCCAAAGAGTTGGTCTCATTCATTCCCAGATTGGCTTTTAGATGAAAATGAGAGA
GATACCGGTTGTGTACATCACAAAAATAGGGGATCCTGTCCTATTTGTGATCATGTTTCGAACACTTTTGCTGCGCCTTAA
Protein sequenceShow/hide protein sequence
MLAKKSWRLIKFPKFPSSLLAKVLRGRYFKTGHFLKASLGSNPSYTWRSIIWGRELFKEGYRWRVGDGFNVDISKDLWIHKEGRATPIFTHPNIRHFTVAQLKRP
NGMWNEPLIRASFLEEDALAILATPTKSNMGEDEILWNLDSKGRFSVKGAYRLGCQMNQRFQASSANYKDQEAMWKDFWKLKLPPKIKICGWRIYNDILPTLSNL
NNRGMDVWPVCFLCREKEETTSHLFWHCKMTRGLWAKYLPLANLSCLFDREDRRISETLDGLWQRGGNTSTNILHIKCSLIICWRIWSIRNLISHNNQRLNQETI
RDILQQQISASIHELIGDEEPYQMQWLEGQTERLAPSGQGGVLLGDPTVRRKWSPISDGCWKLSYDASWRSDRECGSVGWVLRDWSRTLLAAGFKCINSASDISW
LEALSIVEGLQAIPSVTGGVRLLVENDSLQVVNLINGEDVDETELNFFIKEAQRMCSIKKVDSITHIPRAHNYLAHSLAQKAYEEDGPKSWSHSFPDWLLDENER
DTGCVHHKNRGSCPICDHVSNTFAAP