; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011046 (gene) of Chayote v1 genome

Gene IDSed0011046
OrganismSechium edule (Chayote v1)
DescriptionRNase H domain-containing protein
Genome locationLG03:12535825..12540305
RNA-Seq ExpressionSed0011046
SyntenySed0011046
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7574769.1 Endonuclease/exonuclease/phosphatase superfamily [Arabidopsis suecica]2.3e-4131.36Show/hide
Query:  SSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNS
        +SDH P+VA          K+   K+  RF+  WI  +G    I + W   +  G  A  F++ +N C R +++W R  L    R TIE  + EL ++  
Subjt:  SSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNS

Query:  IVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVAKRGLRQGDPL----SPYLFLIC
                 + +L   ++    +EE+YW Q+SR  W+K GD N+K+FH    QR+ RN+I  L +  G+W +E++++    +     L     P  F   
Subjt:  IVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVAKRGLRQGDPL----SPYLFLIC

Query:  AEGLSCLLNREDN-----PNSCLSFRVMVNDN----------------LGIHLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQ
           +  L+  + N     P +   F   V+++                +G +LG+P      K  +F  ++ RL+  +  W  + LS+GGKEVMIK+V  
Subjt:  AEGLSCLLNREDN-----PNSCLSFRVMVNDN----------------LGIHLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQ

Query:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
        A+PTY MSCFRL K+    L    A+FWWG   + R MHW +W  L  SK EGG
Subjt:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

XP_012857846.1 PREDICTED: uncharacterized protein LOC105977118 [Erythranthe guttata]8.7e-4127.19Show/hide
Query:  KINIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRH--TI
        + ++YH NY  SDH PI A          + +  K+P RFE +W + + C+++I   W  +     + D  L   N C   L  W++  L+    H  ++
Subjt:  KINIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRH--TI

Query:  EFKEKELAHLNSIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEE--------------
          K +EL   +      R   + KL   +E   E+ ++YW+QRS+  W++ GDRNT++FH +A+ R + N+++ LK++ G W  +E              
Subjt:  EFKEKELAHLNSIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEE--------------

Query:  ---------------ENV-------------------------AKRGLRQGDPLSPYLFLICAEGLSCLLNR-------------------------ED-
                       EN+                           RGLRQGDPLSPYLF+ CAE L  ++++                         +D 
Subjt:  ---------------ENV-------------------------AKRGLRQGDPLSPYLFLICAEGLSCLLNR-------------------------ED-

Query:  --------------------------------------NPNSCLSFRVMVNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLS
                                              +P++    +  ++  LG         +LGMP+   + KK +F  ++ R+   +  W  + LS
Subjt:  --------------------------------------NPNSCLSFRVMVNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLS

Query:  RGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
        + GKEV+IKAV QAIP+Y MSCF L      ++E +  RFWWG   N R + W SWK L RSK +GG
Subjt:  RGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

XP_015946226.1 uncharacterized protein LOC107471294 [Arachis duranensis]5.4e-4330Show/hide
Query:  SDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNSI
        SDH P++ D    +E        K+  +F++ W   +  + ++++ W      G       Q I  C  K+ KW +     S +  I+  + EL  L  +
Subjt:  SDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNSI

Query:  VGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEE--------------ENVAK--------
         G      + ++E  +E  L+ EE YWK +SR  WLK GD+NT +FH +   R +RN+I  L  + G   +                E V +        
Subjt:  VGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEE--------------ENVAK--------

Query:  ----------------RGLRQGDPLSPYLFLICAEGLSCLLNREDNPNSCLSF--------------------------------RVMVNDNLGI-----
                        RG+RQGDPLSPYLFL CAEGLS LL +  NP +C +                                 R ++ +++ I     
Subjt:  ----------------RGLRQGDPLSPYLFLICAEGLSCLLNREDNPNSCLSF--------------------------------RVMVNDNLGI-----

Query:  ---HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGR
           +LG+PS  S+ KKA F  IK ++ K +Q WK  +LS GG+ +++KAVG+AIP YT+SCF+L  +   ++  L ++FWWG++ ++R+M W SW  + R
Subjt:  ---HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGR

XP_030963753.1 uncharacterized protein LOC115984908 [Quercus lobata]7.6e-4527.17Show/hide
Query:  SLNG--LNLEVGLRKFNEEQNLMDFG----LGLKEKVAEIKEGSFGPCGHIDVSIKEQSESNKIN---IYHCNYHSSDHRPIVADFTWISEGPFKVVALK
        S NG  L  +  + KF +  NL  F      G       +KEGS      +D +       N      ++H    +SDH  ++      ++    +    
Subjt:  SLNG--LNLEVGLRKFNEEQNLMDFG----LGLKEKVAEIKEGSFGPCGHIDVSIKEQSESNKIN---IYHCNYHSSDHRPIVADFTWISEGPFKVVALK

Query:  KPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNSIVGAPRSPHMVKLEKDIEMLLEEEE
        +   FE  W K + C+ VI   W  S +  I  D    ++  C   L+ WN+  + G++   I+ K++ L+ L+      +   + +L ++I  LL+ EE
Subjt:  KPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNSIVGAPRSPHMVKLEKDIEMLLEEEE

Query:  IYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA-----------------------------------------------
        I W+QRS+  W K GDRNTK+FH  AS+R+K+N I  L N  G+W   ++++                                                
Subjt:  IYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA-----------------------------------------------

Query:  -----------------------------KRGLRQGDPLSPYLFLICAEGLSCLLNR-------------------------ED----------------
                                      RG+RQGDPLSPYLFL+CAEGLS L ++                         +D                
Subjt:  -----------------------------KRGLRQGDPLSPYLFLICAEGLSCLLNR-------------------------ED----------------

Query:  -----------------------NPNSCLSFRVMVNDNLG--------IHLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAI
                               +PNS    +  + + LG         +LG+PS   + K  +F  +K R+ K L  WKG++LS GG+E++IKAV QA+
Subjt:  -----------------------NPNSCLSFRVMVNDNLG--------IHLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAI

Query:  PTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
        PTYTMSCF+L K+  KDLE+L   FWWG+  ++ K+ W SWK + +SK  GG
Subjt:  PTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

XP_030967653.1 uncharacterized protein LOC115988147 [Quercus lobata]4.9e-4427.48Show/hide
Query:  IYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEK
        ++H    +SDH  ++      ++    V    +   FE  W K + C+ VI   W  S +  I  D    ++  C   L+ WN+  + G++   I+ K++
Subjt:  IYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEK

Query:  ELAHLNSIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA---------------
         L+ L+      +   + +L ++I  LL+ EEI W+QRS+  W K GDRNTK+FH  AS+R+K+N I  L N  G+W   ++++                
Subjt:  ELAHLNSIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA---------------

Query:  -------------------------------------------------------------KRGLRQGDPLSPYLFLICAEGLSCLLNR-----------
                                                                      RG+RQGDPLSPYLFL+CAEGLS L ++           
Subjt:  -------------------------------------------------------------KRGLRQGDPLSPYLFLICAEGLSCLLNR-----------

Query:  --------------ED---------------------------------------NPNSCLSFRVMVNDNLG--------IHLGMPSQNSRDKKALFRSI
                      +D                                       +PN+    +  + + LG         +LG+PS   + K  +F  +
Subjt:  --------------ED---------------------------------------NPNSCLSFRVMVNDNLG--------IHLGMPSQNSRDKKALFRSI

Query:  KGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
        K R+ K L  WKG++LS GG+E++IKAV QA+PTYTMSCF+L K+  KDLE+L   FWWG+  ++ K+ W +WK + +SK  GG
Subjt:  KGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

TrEMBL top hitse value%identityAlignment
A0A2N9EUT1 Uncharacterized protein1.3e-4723.81Show/hide
Query:  EIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFD
        E D  D+ A  DQ   ++A K ++ + ++ ++  +    +W   +  T + +G N  + SFE  +  ERVL   PW +D  L++ + +     +  + F 
Subjt:  EIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFD

Query:  KSAFW-----------------GLGKIMGE---FLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLV
           FW                 G+G+ +GE   FL  +SEG A   G ++RIRV L  ++PL R  +V+  G     W+   +E+LP FC  CG L H +
Subjt:  KSAFW-----------------GLGKIMGE---FLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLV

Query:  EGCSFVVAQEGVAL----PFGDDLREPNPFFRGEGNAFN---------GQGGRGRDRVFNGLLNNAGEGAIKGKRSSSAGLQQGD---------------
        + C   + Q G  +     +GD LR  N  + G+    +           GG  RD  F      +   ++  + +    + QG                
Subjt:  EGCSFVVAQEGVAL----PFGDDLREPNPFFRGEGNAFN---------GQGGRGRDRVFNGLLNNAGEGAIKGKRSSSAGLQQGD---------------

Query:  ----EEELRAQRGRGDDRNSSRHLSNGSPDEG----------ERASAGAAQSSQPQ-------SKW-------MATLGDSCRVREARA------------
            E +L       D    S H S+ + + G          ER        S P+       +KW       +  L D  R+++  A            
Subjt:  ----EEELRAQRGRGDDRNSSRHLSNGSPDEG----------ERASAGAAQSSQPQ-------SKW-------MATLGDSCRVREARA------------

Query:  ----------------PDRTSDGS----W---------------------EGEAVS---AGMFLGQEAQGGPGS-------------------DVGLSVK
                        P R   G     W                     EG  ++    G +   E Q    S                   D    V 
Subjt:  ----------------PDRTSDGS----W---------------------EGEAVS---AGMFLGQEAQGGPGS-------------------DVGLSVK

Query:  GVSLNGLN-------------LEVGLRKFNEEQNLMDFGL-GLKEKVAEIKEGSFGPCGHIDVSIKEQ-------SESNKINIYHCNYHSSDHRPIVADF
        G    G               L +G   F+   +LMD  L G+     E+    FG  G ID+ +K Q       S    +         SDH P+    
Subjt:  GVSLNGLN-------------LEVGLRKFNEEQNLMDFGL-GLKEKVAEIKEGSFGPCGHIDVSIKEQ-------SESNKINIYHCNYHSSDHRPIVADF

Query:  TWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKEL--AHLNSIVGAPRSPH
        T +   P + V      RFE  W+  EGC+  ++D W+ +   G         +  C R+L  W+R    GSVR  +  K K+L  A L S+ G   S  
Subjt:  TWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKEL--AHLNSIVGAPRSPH

Query:  MVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA-------------------------------
        ++ L  ++ +LLE EE  W QRSR  WL+ GDRNT++FH RASQR++RN I  L +  G W  +   V                                
Subjt:  MVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA-------------------------------

Query:  ----------------------------------------------------------------------------------------------KRGLRQ
                                                                                                       RGLRQ
Subjt:  ----------------------------------------------------------------------------------------------KRGLRQ

Query:  GDPLSPYLFLICAEGLSCLLNREDNPNSC-----------LSFRVMVNDN--------------------------------------------------
        GDPLSPYLFL+C EG   LL                    LS     +D+                                                  
Subjt:  GDPLSPYLFLICAEGLSCLLNREDNPNSC-----------LSFRVMVNDN--------------------------------------------------

Query:  ---LGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRK
           LG+        +LG+PS   R K   F  IK R+   LQ WK R+LS+ G+E++IKAV QAIPTY+MSCFRL      +LE L  RFWWG  +NKRK
Subjt:  ---LGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRK

Query:  MHWKSWKYLGRSKREGG
        + W  WK L + K  GG
Subjt:  MHWKSWKYLGRSKREGG

A0A2N9FNT0 RNase H domain-containing protein2.8e-5324.25Show/hide
Query:  ERLMENLELADNEISDVFEIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNAL
        E + +N  L+D E  DV      D++    Q E I+A K L+ +V++ DA  +    +W      T + +G N     FE  +  ERVL   PW +D  L
Subjt:  ERLMENLELADNEISDVFEIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNAL

Query:  LVFEAMSGGKRVSELQFDKSAFW-----------------GLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVK-SGGMANEKWVRISYE
        +VF+ + G + + +  F  ++FW                  +G+ +G      +  D R     +R+R+ L+IN+PL R   VK   G+  + WV   YE
Subjt:  LVFEAMSGGKRVSELQFDKSAFW-----------------GLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVK-SGGMANEKWVRISYE

Query:  KLPNFCRGCGRLGHLVEGCSFVVAQEGVA----LPFGDDLREPNPFFRGEGNAF-NGQGGRGRDRVFNGLLNNAGEGAIKGKRSSSAGLQQGDEEELRAQ
        +LPNFC  CG L H  + C   + Q   +      FG  LR  +     +      G   + RD+              K        ++  D    + +
Subjt:  KLPNFCRGCGRLGHLVEGCSFVVAQEGVA----LPFGDDLREPNPFFRGEGNAF-NGQGGRGRDRVFNGLLNNAGEGAIKGKRSSSAGLQQGDEEELRAQ

Query:  RGRGDDRNSSRHLSNGSPDEGERASAGAAQSSQPQSKWMATLGDSCRVREARAPDRTSDGSWEGEAVSAGMFLGQEAQGGPGSDVGLSVKGVSLNGLNLE
         G+  D N+     N    E         +       W  T          R      + SW       G         G  +++  S +       + E
Subjt:  RGRGDDRNSSRHLSNGSPDEGERASAGAAQSSQPQSKWMATLGDSCRVREARAPDRTSDGSWEGEAVSAGMFLGQEAQGGPGSDVGLSVKGVSLNGLNLE

Query:  VGLRKFN---EEQNLMDFGL-GLKEKVAEIKEGSFGPCGHIDVSIKEQS---ESNKINIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPL-RFEDSW
          +++F    +E   +D G  GL       + G+      +D  +         +   + H    +SDH+PI      ++  P +V   ++ L RFED W
Subjt:  VGLRKFN---EEQNLMDFGL-GLKEKVAEIKEGSFGPCGHIDVSIKEQS---ESNKINIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPL-RFEDSW

Query:  IKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKEL--AHLNSIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRS
             C+ V+   W V  +RG         I  C  +L +W+R +  G++   ++ K + L  A ++S +G      ++ + K++  LL +EE  WKQRS
Subjt:  IKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKEL--AHLNSIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRS

Query:  REDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA------------------------------------------------------
        R+ WLK GDRNTK+FH RAS R++RN I  L    G  V+  E +                                                       
Subjt:  REDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVA------------------------------------------------------

Query:  ------------------------------------------KRGLRQGDPLSPYLFLICAEGLSCLLNRED----------------------------
                                                   RGLRQGDP+SPYLFL+CAEGL+ L+ +                              
Subjt:  ------------------------------------------KRGLRQGDPLSPYLFLICAEGLSCLLNRED----------------------------

Query:  ------------------------------------NPNSCLSFRVMVNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRG
                                            + N+  + +  + + LG+        +LG+PS   ++K   F  IK R+   ++ WK ++LS+ 
Subjt:  ------------------------------------NPNSCLSFRVMVNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRG

Query:  GKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
        G+E++IKAV QAIPTYTM+CF+L  +  K++E +  RFWWG+  +KRK+HW  W+ L RSK  GG
Subjt:  GKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

A0A2N9H410 CCHC-type domain-containing protein3.1e-5225.5Show/hide
Query:  DAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFDKSAFW-----------------GLGKIMGE
        +A +  +  +W     VT   + +N+FL  F +K   ERV  + PW FD  L++        + +E++F  SAFW                  +G  +G 
Subjt:  DAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFDKSAFW-----------------GLGKIMGE

Query:  FLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGC------------------------------
        F+  D   +    G+FLR++V + I QPL R  K+         WV   YE LP FC  CGRLGH    C                              
Subjt:  FLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGC------------------------------

Query:  -------------------------------------------------------SFVVAQEGVALPFGDDLREPN----------PFFRGEGNAFNGQG
                                                               SF    E VA  FGD++  P           P     G+   G+G
Subjt:  -------------------------------------------------------SFVVAQEGVALPFGDDLREPN----------PFFRGEGNAFNGQG

Query:  ----GRGRDRVFNGL-----------------------LNNAGEGAIKGKRSSSAGLQQGDEEELR--AQRGRGDDR---NSSRHLSNGSPDEGERASAG
            G+ +D   NG+                       L +  +  + GK         G+    R  A+ G+G ++      R L   +  E       
Subjt:  ----GRGRDRVFNGL-----------------------LNNAGEGAIKGKRSSSAGLQQGDEEELR--AQRGRGDDR---NSSRHLSNGSPDEGERASAG

Query:  AAQSSQPQSKWMATLGDSCRVREARAPDRTSDGSWEGEAVSAGMFLGQEAQGGPGSDVGLSVKGVS----LNGLNLE--VGLRKFNEEQNLMDF----GL
        + +  Q     +AT+  S  V     P   S+   + E +       +        +V L  +       LN LN +  + L  FNE   L ++    G 
Subjt:  AAQSSQPQSKWMATLGDSCRVREARAPDRTSDGSWEGEAVSAGMFLGQEAQGGPGSDVGLSVKGVS----LNGLNLE--VGLRKFNEEQNLMDF----GL

Query:  GLKEKV--------AEIKE-GSFGP--------CGHIDVSIK-EQSESN--------KINIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPL-RFED
         L +           E+++ G  GP         G   V ++ +++ +N           + H    SSDH  ++ +   IS  P +    ++ L RFE 
Subjt:  GLKEKV--------AEIKE-GSFGP--------CGHIDVSIK-EQSESN--------KINIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPL-RFED

Query:  SWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNSIVGAPRSPHMVK-LEKDIEMLLEEEEIYWKQR
        +W++ +GC+  I D WQV    G       Q I  C   L +W++ +++ + +  I  K+ +L  + S        H V  L K++  +L +EE+ W+QR
Subjt:  SWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNSIVGAPRSPHMVK-LEKDIEMLLEEEEIYWKQR

Query:  SREDWLKWGDRNTKWFH-----MRASQR----------------KKRNKINNLKNHVGMWVSEEENVAK--------------RGLRQGDPLSPYLFLIC
        SR +WLK GDRNT+        +  SQ                 +  + + NL+      ++ + +++K              RGLRQGDPLSPYLFL+C
Subjt:  SREDWLKWGDRNTKWFH-----MRASQR----------------KKRNKINNLKNHVGMWVSEEENVAK--------------RGLRQGDPLSPYLFLIC

Query:  AEGLSCLLNREDNPNSCLSFRVMVNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLT
        AEGLS L+ + +  N     R  ++D  G         +LG+P    R KK  F  IK R+ K LQ WK ++LS+  +EV+IKAV QAIPTY MSCFR  
Subjt:  AEGLSCLLNREDNPNSCLSFRVMVNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLT

Query:  KSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
             D+  +  RFWWG+    RK+HW +   L RSK++GG
Subjt:  KSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

A0A2N9HDU2 Uncharacterized protein8.7e-6326.59Show/hide
Query:  LMENLELADNEISDVFEIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLV
        L + L L++ E  D F ++    SA       I+A K  + +VI+ +A  +    +W +EN  +   +G+N+ +  FE +   +RVL   PW +D     
Subjt:  LMENLELADNEISDVFEIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLV

Query:  FEAMSGGKRVSELQFDKSAFWGLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEK--WVRISYEKLPNFCRGCGRLGHLVEG
                    L   K     LG+ +GE L      +    G+ +RIRV + IN+PL R  K+   G+AN K  W    YE+LPNFC  CG L H  + 
Subjt:  FEAMSGGKRVSELQFDKSAFWGLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEK--WVRISYEKLPNFCRGCGRLGHLVEG

Query:  CSF------VVAQEGVALPFGDDLREP--NPFFRGE----GNAFNGQGGRGRDRVFNGLLNNAGEGAIKGKRSSS--------AGLQQGDEEELRAQRGR
        C F       + QE  A  +G  LR P   P+ + E    G A NG                   G +K  R  +        AG +  +  E +A++  
Subjt:  CSF------VVAQEGVALPFGDDLREP--NPFFRGE----GNAFNGQGGRGRDRVFNGLLNNAGEGAIKGKRSSS--------AGLQQGDEEELRAQRGR

Query:  GDDRNSSRHLSNGSPDEGERASAGAAQSSQPQSKWMATLGDSC-----RVREARAPDRTSD-------GSWEGEAVSAGMFLGQEAQGGPG---------
        G            +PD G+       +  +   K M  L ++      R R   +P  T++        +WE    S    +G  A    G         
Subjt:  GDDRNSSRHLSNGSPDEGERASAGAAQSSQPQSKWMATLGDSC-----RVREARAPDRTSD-------GSWEGEAVSAGMFLGQEAQGGPG---------

Query:  --------------SDVGLSVKGVSLNGLNLE-----VGLRKFNEEQNLMDFGL-GLKEKVAEIKEGSFGPCGHIDVSIKEQSESNKI---NIYHCNYHS
                       D    VK   ++G N        G R   +E +LMD G  G        ++        +D  +   S   K    ++ H +  +
Subjt:  --------------SDVGLSVKGVSLNGLNLE-----VGLRKFNEEQNLMDFGL-GLKEKVAEIKEGSFGPCGHIDVSIKEQSESNKI---NIYHCNYHS

Query:  SDHRPIVADFTWISEGPFKVVAL-KKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHL-N
        SDH+ ++     ++  P       +KP RF++ W   EGC+N I+  W  S   G       + +  C + L  W+R R  GSV   +  K++ELA    
Subjt:  SDHRPIVADFTWISEGPFKVVAL-KKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHL-N

Query:  SIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEE-----------ENVAKRGL------
          +       + KL+ ++  LLE EE  W+QRSR  WL  GDRNT++FH RASQR++RNKI  LK+  G+W  ++           +N+ +  L      
Subjt:  SIVGAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEE-----------ENVAKRGL------

Query:  ----------RQGDPLSPYLFLICAEGL--------------------------------SCLLNREDNPNSCLSFRVM---------------------
                  ++GDP+SPYLFL+CAEGL                                 CLL  +     C + + +                     
Subjt:  ----------RQGDPLSPYLFLICAEGL--------------------------------SCLLNREDNPNSCLSFRVM---------------------

Query:  -----------VNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCA
                   + D+LG+        +LG+PS   R K   F  IK R+   ++ WKG++LS+ G+E+MIKAV QA+PTY MSCFRL     +++E +  
Subjt:  -----------VNDNLGI--------HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCA

Query:  RFWWGEEENKRKMHWKSWKYLGRSKREGG
        +FWW + +++ K+ W  W  L   K  GG
Subjt:  RFWWGEEENKRKMHWKSWKYLGRSKREGG

A0A2N9I239 RNase H domain-containing protein1.8e-5223.05Show/hide
Query:  EIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFD
        E+D  D+S       G++A K L+++V++ +A  + +  +W A        +G+N  L  F      ERV+   PW FD  L++ + +   +  S++ FD
Subjt:  EIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFD

Query:  KSAFW-----------------GLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGC
           FW                  LG  +G+    +   + R  G F+R+RVL+ ++QPL R  KV  GG + + WV + YE+LP FC  CG + H    C
Subjt:  KSAFW-----------------GLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGC

Query:  SFVVAQEGVALP-------------------FGDDLREPNPFF-----RGEGNAFNGQGGRGR---------------DRVFNGLLN--------NAGE-
           +   G                        GDD R P PF      RG G A  GQ GR                  R+F   L         ++GE 
Subjt:  SFVVAQEGVALP-------------------FGDDLREPNPFF-----RGEGNAFNGQGGRGR---------------DRVFNGLLN--------NAGE-

Query:  ----------------------GAIKGKRSSSAGLQQGDEEEL----RAQRGRGD-------DRNSSRHLSNGSPDEGERASAGAAQSSQPQSK----WM
                              G +     +  GL  G +E+     ++ R  G        + N S+H    S    E    G  +++ P++     W 
Subjt:  ----------------------GAIKGKRSSSAGLQQGDEEEL----RAQRGRGD-------DRNSSRHLSNGSPDEGERASAGAAQSSQPQSK----WM

Query:  AT-LGDSCRVREAR-------------------------------------APDRTSDGS----W-EGEAVSAGMFLGQEAQGGPGSDVGLSVKGVSLNG
           LG+   V+E                                        P R   G     W  G +V+   +           D     +     G
Subjt:  AT-LGDSCRVREAR-------------------------------------APDRTSDGS----W-EGEAVSAGMFLGQEAQGGPGSDVGLSVKGVSLNG

Query:  LNLEVG-------LRKFNEEQNLM-----DFG--LGLKEKVAEIKEGS---------FGPCGHIDVSIK--EQSESNK----------------------
             G       LR+ +    L      DF   L  +EK   +                CG +D+     + +  NK                      
Subjt:  LNLEVG-------LRKFNEEQNLM-----DFG--LGLKEKVAEIKEGS---------FGPCGHIDVSIK--EQSESNK----------------------

Query:  ----INIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHT
              ++H +   SDHRP+  +   ++  P      +K  RFE+ W     C+  ++  W   T  G       + I      L +W+     GSVR+ 
Subjt:  ----INIYHCNYHSSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHT

Query:  IEFKEKELAHLNSIV-GAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVAK-------
        IE K ++L    ++V  A   P + KL +++ +L  +EE  WKQRSR  WL+ GDRNT++FH +A+QR++RN I  L+++ G+W S E  + +       
Subjt:  IEFKEKELAHLNSIV-GAPRSPHMVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVAK-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------RGLRQGDPLSPYLFLICAEGLSCLLNREDN-------------------
                                                           RGLRQGDP+SPYLFL+CAEGL  LL                        
Subjt:  ---------------------------------------------------RGLRQGDPLSPYLFLICAEGLSCLLNREDN-------------------

Query:  ---PNSCLSFRVMVNDNLGI-----HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCAR
            +S L  R  ++D   I     +LG+PS   R K A F  +K R+   +Q WK R+LS+ G+EV+IKAV QAIPTYTM+CF+L K    +LE L   
Subjt:  ---PNSCLSFRVMVNDNLGI-----HLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCAR

Query:  FWWGEEENKRKMHWKSWKYLGRSKREGG
        FWWG  E+ RK+HW  W  L + K  GG
Subjt:  FWWGEEENKRKMHWKSWKYLGRSKREGG

SwissProt top hitse value%identityAlignment
P92555 Uncharacterized mitochondrial protein AtMg012502.3e-0451.02Show/hide
Query:  RGLRQGDPLSPYLFLICAEGLSCLLNREDNPNSCLSFRVMVNDNLGIHL
        RGLRQGDPLSPYLF++C E LS L  R          RV  N     HL
Subjt:  RGLRQGDPLSPYLFLICAEGLSCLLNREDNPNSCLSFRVMVNDNLGIHL

P93295 Uncharacterized mitochondrial protein AtMg003102.2e-0748.15Show/hide
Query:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
        A+P Y MSCFRL+K   K L      FWW   ENKRK+ W +W+ L +SK + G
Subjt:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding1.1e-0626.76Show/hide
Query:  NAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFDKSAFWGLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQP
        N + +V   +L  +     F+S+     +L RGPW F++ + V +  +  K  S+ +F +  FW   +I G  L            +FL  R++  I + 
Subjt:  NAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQFDKSAFWGLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQP

Query:  LRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGC
        +   L+   G   +   ++  YEKL NFC  CG L H    C
Subjt:  LRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGC

AT4G29090.1 Ribonuclease H-like superfamily protein5.5e-0943.33Show/hide
Query:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGGTTTKKL
        A+PTYTM+CF L K+  K +  + A FWW  ++  + MHWK+W +L   K EGG   K +
Subjt:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGGTTTKKL

AT5G36228.1 nucleic acid binding;zinc ion binding6.4e-1022.16Show/hide
Query:  KVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQF---------------DKSAFW
        ++L+ +  S +     +P  W    +V   +L +  F   F S+I     L R PW+F+   +  +          L F                +    
Subjt:  KVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFEAMSGGKRVSELQF---------------DKSAFW

Query:  GLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGCSFVVAQE
         +   +GE +  D   +  ++  F+R++V +   +PLR   +V+         +   YEKL   C  C R+ H V  C +VV QE
Subjt:  GLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGCSFVVAQE

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-0848.15Show/hide
Query:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
        A+P Y MSCFRL+K   K L      FWW   ENKRK+ W +W+ L +SK + G
Subjt:  AIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.6e-0551.02Show/hide
Query:  RGLRQGDPLSPYLFLICAEGLSCLLNREDNPNSCLSFRVMVNDNLGIHL
        RGLRQGDPLSPYLF++C E LS L  R          RV  N     HL
Subjt:  RGLRQGDPLSPYLFLICAEGLSCLLNREDNPNSCLSFRVMVNDNLGIHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGTGGGTGAGTTCGAGAGGCTTATGGAAAACCTAGAGTTGGCTGACAACGAAATTTCAGATGTTTTTGAGATCGACGACAACGACATCTCTGCCTATGAAGACCA
ACTTGAGGGTATTGTGGCTTGTAAAGTGCTGTCTGAGAAGGTGATTAGCACAGATGCTTTTAGAAAACTTATTCCCAGAATCTGGAATGCAGAGAATAAGGTAACTTTTG
AGGTTTTGGGTAATAATGTTTTCCTTTGTTCCTTTGAGTCTAAAATTGTAAAAGAAAGGGTGTTGGAACGTGGTCCTTGGATCTTTGATAATGCCCTTCTGGTCTTTGAG
GCCATGAGTGGAGGAAAAAGAGTCTCTGAACTCCAATTTGATAAATCTGCCTTTTGGGGTCTTGGGAAGATAATGGGAGAGTTCCTTGGTTGGGACAGTGAAGGCGATGC
TAGAAATAGAGGTAAATTTCTTAGGATCCGGGTGTTGTTGAAGATCAATCAGCCATTGCGAAGAGCTCTGAAAGTTAAAAGTGGTGGGATGGCAAATGAAAAATGGGTGA
GAATTTCCTATGAAAAATTGCCAAATTTTTGTAGAGGTTGTGGGCGATTAGGGCATCTAGTTGAAGGGTGTTCATTTGTTGTTGCTCAAGAAGGGGTAGCTCTTCCGTTT
GGGGATGATTTGAGAGAGCCTAATCCTTTTTTCAGAGGAGAAGGGAATGCTTTCAATGGGCAAGGGGGTAGGGGGAGGGATCGTGTGTTTAACGGGCTCTTGAATAATGC
TGGGGAAGGAGCCATTAAAGGCAAAAGGTCTTCTTCCGCCGGCTTGCAACAAGGAGATGAAGAGGAGCTTCGAGCTCAGCGGGGAAGAGGAGATGACCGGAACTCCAGCC
GACATCTGAGCAACGGGTCGCCGGATGAAGGAGAGAGGGCGAGTGCTGGAGCAGCGCAGTCTAGTCAACCTCAGAGTAAATGGATGGCGACTTTAGGGGATTCCTGCAGA
GTCAGGGAGGCTAGGGCTCCAGATCGTACATCAGATGGTAGCTGGGAAGGAGAAGCCGTTTCTGCAGGCATGTTCCTCGGGCAGGAAGCTCAAGGTGGGCCAGGAAGTGA
TGTTGGGCTGAGTGTGAAAGGAGTTAGTCTAAATGGGTTAAATCTAGAAGTGGGCCTTAGAAAGTTTAATGAGGAGCAGAATTTAATGGACTTTGGTTTGGGCTTAAAGG
AAAAAGTGGCTGAGATTAAAGAAGGGTCTTTTGGGCCTTGTGGTCACATAGACGTAAGCATCAAGGAGCAAAGCGAGTCTAATAAAATTAATATTTATCATTGTAACTAT
CACTCGTCAGATCATAGACCAATTGTTGCTGATTTTACCTGGATTTCTGAGGGTCCTTTTAAGGTAGTGGCTTTAAAAAAACCGCTTAGGTTTGAGGATAGCTGGATCAA
GTTTGAGGGATGCAAGAACGTGATTAAGGACAATTGGCAAGTTTCCACTAGTAGAGGGATTTTGGCCGATAATTTTCTCCAGAGTATTAATGTCTGTTTGAGGAAGTTGG
CTAAGTGGAATAGGGGTAGGTTGCAGGGGTCGGTGAGACATACGATTGAGTTTAAAGAGAAAGAGTTGGCTCATCTGAATTCTATTGTTGGGGCGCCTAGATCGCCTCAT
ATGGTCAAGTTAGAGAAAGATATTGAGATGTTGTTGGAAGAAGAAGAAATTTATTGGAAACAAAGATCTCGGGAAGATTGGCTTAAATGGGGAGACAGGAATACAAAATG
GTTTCATATGAGAGCCTCTCAAAGGAAAAAAAGAAATAAAATCAATAATCTTAAAAACCATGTAGGTATGTGGGTTAGTGAGGAGGAGAATGTTGCTAAAAGGGGGCTTC
GCCAAGGGGATCCCCTCTCTCCGTATCTCTTCCTAATTTGTGCTGAAGGGCTCTCGTGTCTTCTTAACAGGGAAGACAATCCAAACTCTTGTCTTAGCTTTCGGGTTATG
GTGAATGATAATCTTGGCATTCACCTTGGAATGCCTTCGCAAAATAGTAGAGATAAAAAAGCTCTCTTTAGGTCTATCAAAGGGCGGCTCGAAAAGTGTCTCCAGTCGTG
GAAAGGTAGGATGCTCTCTAGAGGTGGGAAGGAGGTTATGATAAAAGCGGTTGGCCAAGCGATTCCGACGTACACTATGAGCTGTTTTAGGTTGACCAAATCTTTTAATA
AGGATTTGGAGCATCTGTGTGCTCGATTTTGGTGGGGGGAGGAAGAGAACAAGAGAAAGATGCATTGGAAAAGTTGGAAGTATCTTGGAAGAAGTAAAAGAGAAGGGGGC
ACTACAACAAAAAAGCTTTTTTATAGCTTTTTATTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGTGGGTGAGTTCGAGAGGCTTATGGAAAACCTAGAGTTGGCTGACAACGAAATTTCAGATGTTTTTGAGATCGACGACAACGACATCTCTGCCTATGAAGACCA
ACTTGAGGGTATTGTGGCTTGTAAAGTGCTGTCTGAGAAGGTGATTAGCACAGATGCTTTTAGAAAACTTATTCCCAGAATCTGGAATGCAGAGAATAAGGTAACTTTTG
AGGTTTTGGGTAATAATGTTTTCCTTTGTTCCTTTGAGTCTAAAATTGTAAAAGAAAGGGTGTTGGAACGTGGTCCTTGGATCTTTGATAATGCCCTTCTGGTCTTTGAG
GCCATGAGTGGAGGAAAAAGAGTCTCTGAACTCCAATTTGATAAATCTGCCTTTTGGGGTCTTGGGAAGATAATGGGAGAGTTCCTTGGTTGGGACAGTGAAGGCGATGC
TAGAAATAGAGGTAAATTTCTTAGGATCCGGGTGTTGTTGAAGATCAATCAGCCATTGCGAAGAGCTCTGAAAGTTAAAAGTGGTGGGATGGCAAATGAAAAATGGGTGA
GAATTTCCTATGAAAAATTGCCAAATTTTTGTAGAGGTTGTGGGCGATTAGGGCATCTAGTTGAAGGGTGTTCATTTGTTGTTGCTCAAGAAGGGGTAGCTCTTCCGTTT
GGGGATGATTTGAGAGAGCCTAATCCTTTTTTCAGAGGAGAAGGGAATGCTTTCAATGGGCAAGGGGGTAGGGGGAGGGATCGTGTGTTTAACGGGCTCTTGAATAATGC
TGGGGAAGGAGCCATTAAAGGCAAAAGGTCTTCTTCCGCCGGCTTGCAACAAGGAGATGAAGAGGAGCTTCGAGCTCAGCGGGGAAGAGGAGATGACCGGAACTCCAGCC
GACATCTGAGCAACGGGTCGCCGGATGAAGGAGAGAGGGCGAGTGCTGGAGCAGCGCAGTCTAGTCAACCTCAGAGTAAATGGATGGCGACTTTAGGGGATTCCTGCAGA
GTCAGGGAGGCTAGGGCTCCAGATCGTACATCAGATGGTAGCTGGGAAGGAGAAGCCGTTTCTGCAGGCATGTTCCTCGGGCAGGAAGCTCAAGGTGGGCCAGGAAGTGA
TGTTGGGCTGAGTGTGAAAGGAGTTAGTCTAAATGGGTTAAATCTAGAAGTGGGCCTTAGAAAGTTTAATGAGGAGCAGAATTTAATGGACTTTGGTTTGGGCTTAAAGG
AAAAAGTGGCTGAGATTAAAGAAGGGTCTTTTGGGCCTTGTGGTCACATAGACGTAAGCATCAAGGAGCAAAGCGAGTCTAATAAAATTAATATTTATCATTGTAACTAT
CACTCGTCAGATCATAGACCAATTGTTGCTGATTTTACCTGGATTTCTGAGGGTCCTTTTAAGGTAGTGGCTTTAAAAAAACCGCTTAGGTTTGAGGATAGCTGGATCAA
GTTTGAGGGATGCAAGAACGTGATTAAGGACAATTGGCAAGTTTCCACTAGTAGAGGGATTTTGGCCGATAATTTTCTCCAGAGTATTAATGTCTGTTTGAGGAAGTTGG
CTAAGTGGAATAGGGGTAGGTTGCAGGGGTCGGTGAGACATACGATTGAGTTTAAAGAGAAAGAGTTGGCTCATCTGAATTCTATTGTTGGGGCGCCTAGATCGCCTCAT
ATGGTCAAGTTAGAGAAAGATATTGAGATGTTGTTGGAAGAAGAAGAAATTTATTGGAAACAAAGATCTCGGGAAGATTGGCTTAAATGGGGAGACAGGAATACAAAATG
GTTTCATATGAGAGCCTCTCAAAGGAAAAAAAGAAATAAAATCAATAATCTTAAAAACCATGTAGGTATGTGGGTTAGTGAGGAGGAGAATGTTGCTAAAAGGGGGCTTC
GCCAAGGGGATCCCCTCTCTCCGTATCTCTTCCTAATTTGTGCTGAAGGGCTCTCGTGTCTTCTTAACAGGGAAGACAATCCAAACTCTTGTCTTAGCTTTCGGGTTATG
GTGAATGATAATCTTGGCATTCACCTTGGAATGCCTTCGCAAAATAGTAGAGATAAAAAAGCTCTCTTTAGGTCTATCAAAGGGCGGCTCGAAAAGTGTCTCCAGTCGTG
GAAAGGTAGGATGCTCTCTAGAGGTGGGAAGGAGGTTATGATAAAAGCGGTTGGCCAAGCGATTCCGACGTACACTATGAGCTGTTTTAGGTTGACCAAATCTTTTAATA
AGGATTTGGAGCATCTGTGTGCTCGATTTTGGTGGGGGGAGGAAGAGAACAAGAGAAAGATGCATTGGAAAAGTTGGAAGTATCTTGGAAGAAGTAAAAGAGAAGGGGGC
ACTACAACAAAAAAGCTTTTTTATAGCTTTTTATTATAA
Protein sequenceShow/hide protein sequence
MNVGEFERLMENLELADNEISDVFEIDDNDISAYEDQLEGIVACKVLSEKVISTDAFRKLIPRIWNAENKVTFEVLGNNVFLCSFESKIVKERVLERGPWIFDNALLVFE
AMSGGKRVSELQFDKSAFWGLGKIMGEFLGWDSEGDARNRGKFLRIRVLLKINQPLRRALKVKSGGMANEKWVRISYEKLPNFCRGCGRLGHLVEGCSFVVAQEGVALPF
GDDLREPNPFFRGEGNAFNGQGGRGRDRVFNGLLNNAGEGAIKGKRSSSAGLQQGDEEELRAQRGRGDDRNSSRHLSNGSPDEGERASAGAAQSSQPQSKWMATLGDSCR
VREARAPDRTSDGSWEGEAVSAGMFLGQEAQGGPGSDVGLSVKGVSLNGLNLEVGLRKFNEEQNLMDFGLGLKEKVAEIKEGSFGPCGHIDVSIKEQSESNKINIYHCNY
HSSDHRPIVADFTWISEGPFKVVALKKPLRFEDSWIKFEGCKNVIKDNWQVSTSRGILADNFLQSINVCLRKLAKWNRGRLQGSVRHTIEFKEKELAHLNSIVGAPRSPH
MVKLEKDIEMLLEEEEIYWKQRSREDWLKWGDRNTKWFHMRASQRKKRNKINNLKNHVGMWVSEEENVAKRGLRQGDPLSPYLFLICAEGLSCLLNREDNPNSCLSFRVM
VNDNLGIHLGMPSQNSRDKKALFRSIKGRLEKCLQSWKGRMLSRGGKEVMIKAVGQAIPTYTMSCFRLTKSFNKDLEHLCARFWWGEEENKRKMHWKSWKYLGRSKREGG
TTTKKLFYSFLL