; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007397 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007397
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold7:20381526..20386636
RNA-Seq ExpressionSpg007397
SyntenySpg007397
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU39028.1 hypothetical protein TSUD_59840 [Trifolium subterraneum]1.5e-5327.57Show/hide
Query:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL
        ME WK   L+  E+  I  + +D E   + +    L G+L +    +  A K  +   W+ +    I+ +GKN+Y+ +F  K D E + +N  W FD+N+
Subjt:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL

Query:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYER
        LVL          E     + FW R+++LPL  R++ +A+K+G+ +G F + DN++N    G  ++I+V +D+ +P++RG ++K   G D R +  +YER
Subjt:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYER

Query:  IPDFCFCCGRIGHVAKECAEEKSK-----EMIASNNYEFGAWLRFQAFGRP---IQKPDDHANGMKDSFTKNLGSEVQKETDLEENVAMVGDEYQGNISM
        +P FCF CGRIGH  +EC E +++     E I      FG WLR     R     +K    ++  K+ F+ +  S              VGD+  G + +
Subjt:  IPDFCFCCGRIGHVAKECAEEKSK-----EMIASNNYEFGAWLRFQAFGRP---IQKPDDHANGMKDSFTKNLGSEVQKETDLEENVAMVGDEYQGNISM

Query:  D--IPEAWGSKGRLEDRNVYSPHVLQTSNFNKEDKMEDPFVERIDLASSVRCQPCQSSSQGGSVVSGRKNSPAWDYETHLLEYSGVGEPESIQSTKEPCL
        D  I +    + +L + N  +    +T            F+E   +A S+        S   S+ S +K             +  +G+ E  Q +K   +
Subjt:  D--IPEAWGSKGRLEDRNVYSPHVLQTSNFNKEDKMEDPFVERIDLASSVRCQPCQSSSQGGSVVSGRKNSPAWDYETHLLEYSGVGEPESIQSTKEPCL

Query:  --FKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWE
          F      +GG+ L+W D  +I+I SYS NHI+  +     G+ W I+ +YG+PE   K KTW L+  +  +    W+  GD+        K G     
Subjt:  --FKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWE

Query:  WLDRFL-----CNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSELRG
         L   L      +      F  +KV++L  + SDH  I+  L         +    F+FEE W+    C  +I     WS  RG
Subjt:  WLDRFL-----CNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSELRG

GAU39028.1 hypothetical protein TSUD_59840 [Trifolium subterraneum]5.3e-1934.27Show/hide
Query:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCL
        GL F+  E FN+AL+ KQ WR++ NPDSL+++ FKS YF  S+ + A++G  P+Y  RSL   R+++  G R   GNGQ   ++ D WLP++  FK    
Subjt:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCL

Query:  DPSYI-NDKVEDFIF-ASGDWDVDRLNRVVSREDLDIIRRTPLNRNLD-DKLVWHYDKTAKDIWSRTFNRVFLDKDFN
          + + N  V D I   +  WD + +    S  + + I   P++  L  DKL+WH++K  +      ++ +  D++ N
Subjt:  DPSYI-NDKVEDFIF-ASGDWDVDRLNRVVSREDLDIIRRTPLNRNLD-DKLVWHYDKTAKDIWSRTFNRVFLDKDFN

GAU39028.1 hypothetical protein TSUD_59840 [Trifolium subterraneum]3.1e-5125.6Show/hide
Query:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS
        ++D   T++++   N +E     LVG++ +    +  A K  M  AWR+R    I+ + KN+++ KF  + + +L+ +N  W FD+NLL+L+    N   
Subjt:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS

Query:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG
         E     V FW+R+++LPL  R++ +A+K+GN +G F + D  K  +  G  +R++V +D+ +PL+RG  +   G   E W+  +YER+P+FCF CGRIG
Subjt:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG

Query:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAF--------------------------------------------GRPIQKPDDHANGMKDSFTK
        H  ++C + +  + +  +  E     FG WLR                                                 R       H    K + +K
Subjt:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAF--------------------------------------------GRPIQKPDDHANGMKDSFTK

Query:  NLGSEVQKETDLEENVAMVGDEY-------QGNISMDIPEAWGSKGR--LEDRNVYSPHVLQTSNFNKEDK-------MEDPFVERI-----DLASSVRC
        +   +  K+ ++++ V +V +         Q  I   I      KGR  +  R              K  K       + D  +E +      +   V  
Subjt:  NLGSEVQKETDLEENVAMVGDEY-------QGNISMDIPEAWGSKGR--LEDRNVYSPHVLQTSNFNKEDK-------MEDPFVERI-----DLASSVRC

Query:  QPCQSSSQGGSVVSGRKNSP--AWDYETHLLEYSGVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRW
        + C +S+    +   R  +P   +  ET L       E E I+S    + CL  +         GGL L W +Q S++I SYS NHI         G  W
Subjt:  QPCQSSSQGGSVVSGRKNSP--AWDYETHLLEYSGVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRW

Query:  RISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWEW----------LDRFLCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQ
         ++ +YG+PE   K KTW L+RSL       W+  GD         K+G ++              D    +      F+ +KV++L  + SDH  +   
Subjt:  RISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWEW----------LDRFLCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQ

Query:  LSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWS------ELRGLNF--KDIEGFNQALVAKQVWRI
        L       TRR  R F+FEE W    +C ++I  N   S      +L  LN      E  N   + K ++RI
Subjt:  LSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWS------ELRGLNF--KDIEGFNQALVAKQVWRI

MCH80348.1 hypothetical protein [Trifolium medium]8.6e-5427.08Show/hide
Query:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS
        ++D   T++ +   N +E     LVG++ +    +  A K  M  AWR+R    I+ + KN+Y+ KF  K + +L+ RN  W FD+NLL+L+    N   
Subjt:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS

Query:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG
         E     V FW+R+++LPL  R++ +A+K+GN +G F + D  K  +  G  +R++V +D+ +PL+RG  +   G   E W+  +YER+P+FCF CGRIG
Subjt:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG

Query:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAFGR---PIQKPDDHANGMKDSF-----TKNLGSEVQKETDLEENVAMVGDEY-------QGNISM
        H  ++C + +  + +  +  E     FG WLR     +    ++K    +N  K  F     +K   S   KE D E     V D+        +GN+S 
Subjt:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAFGR---PIQKPDDHANGMKDSF-----TKNLGSEVQKETDLEENVAMVGDEY-------QGNISM

Query:  D-----------------IPEAWGS-------------------KGRLEDR-NVYSP----HVLQTSNFNKEDKME----DPFVERI-----DLASSVRC
        D                 + E+ G+                   KGR   R  V  P    +        K   ++    D  +E +      +   V  
Subjt:  D-----------------IPEAWGS-------------------KGRLEDR-NVYSP----HVLQTSNFNKEDKME----DPFVERI-----DLASSVRC

Query:  QPCQSSSQGGSVVSG--RKNSPAWDYETHLLEYS-GVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQ--GKR
        + C +S+     V    R N         L+E    V E E+I+S    + CL  +        +GGL L W +  S++I S+S NHI      +  G+ 
Subjt:  QPCQSSSQGGSVVSG--RKNSPAWDYETHLLEYS-GVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQ--GKR

Query:  WRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHV---------------------------WEW-------------LDRF
        W ++ +YG+PE   K KTW L+RSL       W+  GD         K+G +V                           + W             LDR 
Subjt:  WRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHV---------------------------WEW-------------LDRF

Query:  LCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSEL
        + N      F+ +KV++L  + SDH  +   L     + TRR  R F+FEE W    +C ++I SN   S L
Subjt:  LCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSEL

OMO61345.1 reverse transcriptase [Corchorus capsularis]1.9e-4829.06Show/hide
Query:  WKRFNLTDGEKDPIFTLDQDNERNINEHL---EHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL
        W+ FNLT+ E   +      + R ++E L    +CL+G+LLS R ++   ++NVM   W+      +  IG+N+++ +F    +KE + +   W F+K L
Subjt:  WKRFNLTDGEKDPIFTLDQDNERNINEHL---EHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL

Query:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYER
        LVL    A  C  +       FW +  +LPL F N+ +   IG   G   + D   ++  WG  +R + RL++T+PLRRG MI T+    +  I+ RYE+
Subjt:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYER

Query:  IPDFCFCCGRIGHVAKECAEEKSKEMIASNN---YEFGAWLRFQA-----------------FGRPIQKPDD-----HANGMKDSFTKNL----------
        +PDFC+ CG + HV  EC  EK+  M         E+G WLR +                   GR  +K D         G K    ++L          
Subjt:  IPDFCFCCGRIGHVAKECAEEKSKEMIASNN---YEFGAWLRFQA-----------------FGRPIQKPDD-----HANGMKDSFTKNL----------

Query:  -GSEVQKETD---LEENVAMVGDEYQGNISMDIPEAWGSKGRLEDRNV-YSPHVLQTSNFNKED-KMEDPFVERIDLASSVRCQPCQSSSQGGSVVSGR-
          S+ Q +TD         + G  + G+    + E    K    D  V     V    N   +D  MED    + D +   +     S+S GGS +  + 
Subjt:  -GSEVQKETD---LEENVAMVGDEYQGNISMDIPEAWGSKGRLEDRNV-YSPHVLQTSNFNKED-KMEDPFVERIDLASSVRCQPCQSSSQGGSVVSGR-

Query:  -------------KNSPAWDYETHLLEYSGVGEPESIQSTKEPCLFKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQGK---RWRISDVYGWPE
                      N           E S     E+ + +KE     +S R SGGL L WK++  +SI SYS++H D+IV   GK    WR +  YG P 
Subjt:  -------------KNSPAWDYETHLLEYSGVGEPESIQSTKEPCLFKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQGK---RWRISDVYGWPE

Query:  RGQKGKTWDLLRSLHGALNMPWMLGGDITK
          ++G++WDL+R+L G  ++PW++GGD  +
Subjt:  RGQKGKTWDLLRSLHGALNMPWMLGGDITK

OMO61345.1 reverse transcriptase [Corchorus capsularis]4.2e-1629.61Show/hide
Query:  FWNNHDECADIITSNGDWSELR------GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLR
        FW +H + +  I     W +L       GL F+D E FN A +AKQ WR + N  +L  + F++ YF   S + A LGSNP+Y+ RS++ GR++L  G R
Subjt:  FWNNHDECADIITSNGDWSELR------GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLR

Query:  NRTGNGQLTLMFKDPWLPKELTFKPLCLDPSYINDKVEDFIFASGD--WDVDRLNRVVSREDLDIIRRTPL-NRNLDDKLVWHYDKTAKDIWSRTFN--R
         R G+G    + +D W+P     KP  + P+ ++  +   +    +  W  D L  +     ++ I    L +  ++DKL+W   K         ++  R
Subjt:  NRTGNGQLTLMFKDPWLPKELTFKPLCLDPSYINDKVEDFIFASGD--WDVDRLNRVVSREDLDIIRRTPL-NRNLDDKLVWHYDKTAKDIWSRTFN--R

Query:  VFLDKD
        + LD++
Subjt:  VFLDKD

OMO61345.1 reverse transcriptase [Corchorus capsularis]6.0e-4740.91Show/hide
Query:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL
        + +W++F LT  E +    +D D  +   + L + LVG+LL+ RIIS   +  V+  AW+     ++ESIGKN+++  F  + D   + +   W FDK L
Subjt:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL

Query:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFD-NEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYE
        +VL  P ++    E  F +V FW+ LF+LP+S+ NK +A ++GN +G+FVD D NEK  S WG S+RI+V +DIT+PLRRG  I   G     WI I+YE
Subjt:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFD-NEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYE

Query:  RIPDFCFCCGRIGHVAKEC-AEEKSKEMIASNNYEFGAWLRF
        R+PDFC+ CG IGH + +C A   + +  +    E+G WLRF
Subjt:  RIPDFCFCCGRIGHVAKEC-AEEKSKEMIASNNYEFGAWLRF

TrEMBL top hitse value%identityAlignment
A0A2N9FCN0 CCHC-type domain-containing protein1.8e-5723.96Show/hide
Query:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNEL-------
        +EEW+RF+LT+ E    F ++ D   N      HCL+G+L++ R  +KAA+K+ M   WR      ++  G N+++ +F D+ +++ + ++E        
Subjt:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNEL-------

Query:  ---WLFDKNLLVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGND
           WLFDK LL L++   +  + +  F    FW++L  +PL +  K   E+IG +L      D  +N   WG+S+R+ + +D+T+P+ RG ++ T     
Subjt:  ---WLFDKNLLVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGND

Query:  ERWITIRYERIPDFCFCCGRIGHVAKECAEE-KSKEMIASNNYEFGAWLRFQAF--------GRPIQKPDD------HANGMKDSFTK------------
        + WI+ +YER+P  CF C  +GH+ K+C  + +          ++G W R   F        G   Q+         H    K+SF              
Subjt:  ERWITIRYERIPDFCFCCGRIGHVAKECAEE-KSKEMIASNNYEFGAWLRFQAF--------GRPIQKPDD------HANGMKDSFTK------------

Query:  --NLGSEVQ------------------KETDLEENVAMVGDEYQGNISMDIPEAWGSKGRL-----EDRN--------------------------VYSP
          + G  +Q                  K+  LE N+  VG          +P +  S  +L     ED N                           Y  
Subjt:  --NLGSEVQ------------------KETDLEENVAMVGDEYQGNISMDIPEAWGSKGRL-----EDRN--------------------------VYSP

Query:  HVLQTSNFNKEDK------MEDPFVERIDLASSVRCQPC-QSSSQGGSVVSGRKNS----PAW-----------------------DYETHLL-----EY
        HV  + + N   K      M       +D A + +   C + +S+  ++  G+KN+     +W                        Y+  LL      +
Subjt:  HVLQTSNFNKEDK------MEDPFVERIDLASSVRCQPC-QSSSQGGSVVSGRKNS----PAW-----------------------DYETHLL-----EY

Query:  SGVGEPESIQSTKEPCLFKES---PRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQ--GKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGG
            EP     T+ P     S    R  GGL + W D   + + +YS NHID  +  +  GK +R++  YG  E  ++ ++W +L+ L    + PW+  G
Subjt:  SGVGEPESIQSTKEPCLFKES---PRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQ--GKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGG

Query:  DITK-----RGTHV----------WEWLDRFL---------------------CNPYLEGLFN-----------------SLKVSNLNWYNSDHRPIEAQ
        D  +      GT V           E+L +FL                     C   +E  +N                 S + S + W +     + A 
Subjt:  DITK-----RGTHV----------WEWLDRFL---------------------CNPYLEGLFN-----------------SLKVSNLNWYNSDHRPIEAQ

Query:  L------------------SNRKFNKTRRIY----------RSFKFEEFWN------NHDEC--ADIITSNG--DWSELRGLNFKDIEGFNQALVAKQVW
        +                  S+ K    +++Y           +  ++ +W+      +HD+   A  ++ N      E+ GL F+D++ FN AL+AKQ W
Subjt:  L------------------SNRKFNKTRRIY----------RSFKFEEFWN------NHDEC--ADIITSNG--DWSELRGLNFKDIEGFNQALVAKQVW

Query:  RIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPL-CLDPSYINDKVEDFI-FASGD
        R++  P SL+ R  K+ YF     L AN+G  P+Y  RS+   R +L  GLR   G+ +   + KDPWLP   +F+ L  L+    N++V   I  A+  
Subjt:  RIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPL-CLDPSYINDKVEDFI-FASGD

Query:  WDVDRLNRVVSREDLDIIRRTPL-NRNLDDKLVWHYDKT
        W+ + ++ + S  +  II   PL  R   D+L W+  K+
Subjt:  WDVDRLNRVVSREDLDIIRRTPL-NRNLDDKLVWHYDKT

A0A2N9IXK4 RNase H domain-containing protein1.2e-1333.12Show/hide
Query:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCL
        G+ F+D++ FN AL+AKQVWR++ N DSL  + FK+ YF + ++L  N+  N +Y  +S++  R +L  G R R GN     ++ D WLP+    + +  
Subjt:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCL

Query:  DPSYIND-KVEDFI-FASGDWDVDRLNRVVSREDLDIIRRTPLNRN-LDDKLVW
           +  D KV   I     +W    +N V    +  +I+  PL+     DKL+W
Subjt:  DPSYIND-KVEDFI-FASGDWDVDRLNRVVSREDLDIIRRTPLNRN-LDDKLVW

A0A2N9IXK4 RNase H domain-containing protein1.5e-5125.6Show/hide
Query:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS
        ++D   T++++   N +E     LVG++ +    +  A K  M  AWR+R    I+ + KN+++ KF  + + +L+ +N  W FD+NLL+L+    N   
Subjt:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS

Query:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG
         E     V FW+R+++LPL  R++ +A+K+GN +G F + D  K  +  G  +R++V +D+ +PL+RG  +   G   E W+  +YER+P+FCF CGRIG
Subjt:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG

Query:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAF--------------------------------------------GRPIQKPDDHANGMKDSFTK
        H  ++C + +  + +  +  E     FG WLR                                                 R       H    K + +K
Subjt:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAF--------------------------------------------GRPIQKPDDHANGMKDSFTK

Query:  NLGSEVQKETDLEENVAMVGDEY-------QGNISMDIPEAWGSKGR--LEDRNVYSPHVLQTSNFNKEDK-------MEDPFVERI-----DLASSVRC
        +   +  K+ ++++ V +V +         Q  I   I      KGR  +  R              K  K       + D  +E +      +   V  
Subjt:  NLGSEVQKETDLEENVAMVGDEY-------QGNISMDIPEAWGSKGR--LEDRNVYSPHVLQTSNFNKEDK-------MEDPFVERI-----DLASSVRC

Query:  QPCQSSSQGGSVVSGRKNSP--AWDYETHLLEYSGVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRW
        + C +S+    +   R  +P   +  ET L       E E I+S    + CL  +         GGL L W +Q S++I SYS NHI         G  W
Subjt:  QPCQSSSQGGSVVSGRKNSP--AWDYETHLLEYSGVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRW

Query:  RISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWEW----------LDRFLCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQ
         ++ +YG+PE   K KTW L+RSL       W+  GD         K+G ++              D    +      F+ +KV++L  + SDH  +   
Subjt:  RISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWEW----------LDRFLCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQ

Query:  LSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWS------ELRGLNF--KDIEGFNQALVAKQVWRI
        L       TRR  R F+FEE W    +C ++I  N   S      +L  LN      E  N   + K ++RI
Subjt:  LSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWS------ELRGLNF--KDIEGFNQALVAKQVWRI

A0A2Z6N2E1 Uncharacterized protein7.1e-5427.57Show/hide
Query:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL
        ME WK   L+  E+  I  + +D E   + +    L G+L +    +  A K  +   W+ +    I+ +GKN+Y+ +F  K D E + +N  W FD+N+
Subjt:  MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNL

Query:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYER
        LVL          E     + FW R+++LPL  R++ +A+K+G+ +G F + DN++N    G  ++I+V +D+ +P++RG ++K   G D R +  +YER
Subjt:  LVLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYER

Query:  IPDFCFCCGRIGHVAKECAEEKSK-----EMIASNNYEFGAWLRFQAFGRP---IQKPDDHANGMKDSFTKNLGSEVQKETDLEENVAMVGDEYQGNISM
        +P FCF CGRIGH  +EC E +++     E I      FG WLR     R     +K    ++  K+ F+ +  S              VGD+  G + +
Subjt:  IPDFCFCCGRIGHVAKECAEEKSK-----EMIASNNYEFGAWLRFQAFGRP---IQKPDDHANGMKDSFTKNLGSEVQKETDLEENVAMVGDEYQGNISM

Query:  D--IPEAWGSKGRLEDRNVYSPHVLQTSNFNKEDKMEDPFVERIDLASSVRCQPCQSSSQGGSVVSGRKNSPAWDYETHLLEYSGVGEPESIQSTKEPCL
        D  I +    + +L + N  +    +T            F+E   +A S+        S   S+ S +K             +  +G+ E  Q +K   +
Subjt:  D--IPEAWGSKGRLEDRNVYSPHVLQTSNFNKEDKMEDPFVERIDLASSVRCQPCQSSSQGGSVVSGRKNSPAWDYETHLLEYSGVGEPESIQSTKEPCL

Query:  --FKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWE
          F      +GG+ L+W D  +I+I SYS NHI+  +     G+ W I+ +YG+PE   K KTW L+  +  +    W+  GD+        K G     
Subjt:  --FKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV--HWQGKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHVWE

Query:  WLDRFL-----CNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSELRG
         L   L      +      F  +KV++L  + SDH  I+  L         +    F+FEE W+    C  +I     WS  RG
Subjt:  WLDRFL-----CNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSELRG

A0A2Z6N2E1 Uncharacterized protein2.6e-1934.27Show/hide
Query:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCL
        GL F+  E FN+AL+ KQ WR++ NPDSL+++ FKS YF  S+ + A++G  P+Y  RSL   R+++  G R   GNGQ   ++ D WLP++  FK    
Subjt:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCL

Query:  DPSYI-NDKVEDFIF-ASGDWDVDRLNRVVSREDLDIIRRTPLNRNLD-DKLVWHYDKTAKDIWSRTFNRVFLDKDFN
          + + N  V D I   +  WD + +    S  + + I   P++  L  DKL+WH++K  +      ++ +  D++ N
Subjt:  DPSYI-NDKVEDFIF-ASGDWDVDRLNRVVSREDLDIIRRTPLNRNLD-DKLVWHYDKTAKDIWSRTFNRVFLDKDFN

A0A2Z6N2E1 Uncharacterized protein6.0e-5325.81Show/hide
Query:  EEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLL
        E W+ F+L D E         D  ++I    +H L  R L+ R ++  A+    +  WRT  DF ++ +G N+ +++F D  D E +  +  W +DK+L+
Subjt:  EEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLL

Query:  VLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERI
        +       V +    F K + W+++  LP    +   A +IG  +G      +E+ E  WG  VR++V +D+ +PL RG  I   G N E  ++ +YE++
Subjt:  VLDDPGANVCSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERI

Query:  PDFCFCCGRIGHVAKECA-EEKSKEMIASNNYEFGAWLR------------------FQAFGRPIQKPDDHANGMKDSFTKNLGSEVQKETDLEENVAMV
        P+FC+ CG I H  K+C+   ++++ + ++  ++GAWLR                  F++        D+   G  D+      S  QKETDL       
Subjt:  PDFCFCCGRIGHVAKECA-EEKSKEMIASNNYEFGAWLR------------------FQAFGRPIQKPDDHANGMKDSFTKNLGSEVQKETDLEENVAMV

Query:  GDEYQGNISMDIPEAWGSKGRLEDRNVYSPHVLQTSNFNKEDKMEDPFVERIDLASSVRCQ---PCQSSSQGGSVVSGR--KNSPAWDYETHLLEYSGVG
        GD    ++             + + + + P + Q +  N         +   D  S +  +   P Q        ++ R   +S A   E   +E S   
Subjt:  GDEYQGNISMDIPEAWGSKGRLEDRNVYSPHVLQTSNFNKEDKMEDPFVERIDLASSVRCQ---PCQSSSQGGSVVSGR--KNSPAWDYETHLLEYSGVG

Query:  EPESIQSTKEPCLFKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV-HWQGKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDI------
        +P  +   +            GGL +FWK +  +SI+S+S +HID+I+   +   WR +  YG PE  ++ ++W LLR LH   ++PW   GD       
Subjt:  EPESIQSTKEPCLFKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIV-HWQGKRWRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDI------

Query:  ----------------------------------------TKRGTH-VWEWLDRFLCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYR
                                                 + G+H VWE LDR L       LF   +V +L+  +SDH PI  Q S    ++ R   R
Subjt:  ----------------------------------------TKRGTH-VWEWLDRFLCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYR

Query:  SFKFEEFWNNHDECADIITS
         F+FEE W +H  C + ITS
Subjt:  SFKFEEFWNNHDECADIITS

A0A392M033 CCHC-type domain-containing protein (Fragment)4.2e-5427.08Show/hide
Query:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS
        ++D   T++ +   N +E     LVG++ +    +  A K  M  AWR+R    I+ + KN+Y+ KF  K + +L+ RN  W FD+NLL+L+    N   
Subjt:  EKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANVCS

Query:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG
         E     V FW+R+++LPL  R++ +A+K+GN +G F + D  K  +  G  +R++V +D+ +PL+RG  +   G   E W+  +YER+P+FCF CGRIG
Subjt:  LETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIG

Query:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAFGR---PIQKPDDHANGMKDSF-----TKNLGSEVQKETDLEENVAMVGDEY-------QGNISM
        H  ++C + +  + +  +  E     FG WLR     +    ++K    +N  K  F     +K   S   KE D E     V D+        +GN+S 
Subjt:  HVAKECAEEKSKEMIASNNYE-----FGAWLRFQAFGR---PIQKPDDHANGMKDSF-----TKNLGSEVQKETDLEENVAMVGDEY-------QGNISM

Query:  D-----------------IPEAWGS-------------------KGRLEDR-NVYSP----HVLQTSNFNKEDKME----DPFVERI-----DLASSVRC
        D                 + E+ G+                   KGR   R  V  P    +        K   ++    D  +E +      +   V  
Subjt:  D-----------------IPEAWGS-------------------KGRLEDR-NVYSP----HVLQTSNFNKEDKME----DPFVERI-----DLASSVRC

Query:  QPCQSSSQGGSVVSG--RKNSPAWDYETHLLEYS-GVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQ--GKR
        + C +S+     V    R N         L+E    V E E+I+S    + CL  +        +GGL L W +  S++I S+S NHI      +  G+ 
Subjt:  QPCQSSSQGGSVVSG--RKNSPAWDYETHLLEYS-GVGEPESIQSTK--EPCLFKE----SPRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQ--GKR

Query:  WRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHV---------------------------WEW-------------LDRF
        W ++ +YG+PE   K KTW L+RSL       W+  GD         K+G +V                           + W             LDR 
Subjt:  WRISDVYGWPERGQKGKTWDLLRSLHGALNMPWMLGGDIT-------KRGTHV---------------------------WEW-------------LDRF

Query:  LCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSEL
        + N      F+ +KV++L  + SDH  +   L     + TRR  R F+FEE W    +C ++I SN   S L
Subjt:  LCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSNGDWSEL

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003105.7e-1642.86Show/hide
Query:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPL
        GL F+D+  FNQAL+AKQ +RI+  P +L+SR  +S YF  SS++  ++G+ P+Y  RS++ GR+LL +GL    G+G  T ++ D W+  E    PL
Subjt:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPL

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein9.3e-0628.12Show/hide
Query:  KSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCLDPSYINDKVEDFIFASGD---WDVDRLNRVVSRE
        K+ YF   SIL A +    +Y   SL+ G  LL KG R+  G+GQ   +  D  +      +PL  + +Y    + +     G    WD  ++++ V + 
Subjt:  KSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCLDPSYINDKVEDFIFASGD---WDVDRLNRVVSRE

Query:  DLDIIRRTPLNRN-LDDKLVWHYDKTAK
        D   I R  L ++   DK++W+Y+ T +
Subjt:  DLDIIRRTPLNRN-LDDKLVWHYDKTAK

AT3G42140.1 zinc ion binding;nucleic acid binding1.2e-0523.53Show/hide
Query:  FKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIGHVAK
        FK++ FW+++  +PL F    +   IG ++G F                                 ++T+ G D   +  +YE++ +FC  CG + H A 
Subjt:  FKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIGHVAK

Query:  EC
        EC
Subjt:  EC

AT4G29090.1 Ribonuclease H-like superfamily protein4.3e-1933.74Show/hide
Query:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWL---PKELTFKP
        G+ FKDIE FN AL+ KQ+WR++S P+SL+++ FKS YF+ S  L A LGS P+++ +S+   +++L +G R   GNG+  ++++  WL   P     + 
Subjt:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWL---PKELTFKP

Query:  LCLDPSYIND-----KVEDFIFASG-DWDVDRLNRVVSREDLDII-RRTPLNRNLDDKLVWHY
          + P          KV D I  SG +W  D +  +    +  +I    P  R + D   W Y
Subjt:  LCLDPSYIND-----KVEDFIFASG-DWDVDRLNRVVSREDLDII-RRTPLNRNLDDKLVWHY

AT5G36228.1 nucleic acid binding;zinc ion binding1.4e-0923.38Show/hide
Query:  HCLVGRLLSNRIISKAAIKNVMKGA-----------WRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVL----DDPGANVCSLETGFK
        H  VG L SNR+     I N    +           W          +    + ++F  + D     R   W+F++  + L    D P  +  +      
Subjt:  HCLVGRLLSNRIISKAAIKNVMKGA-----------WRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVL----DDPGANVCSLETGFK

Query:  KVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDER-WITIRYERIPDFCFCCGRIGHVAKE
         ++ W+ +  +PL + ++   E I + LG+ V  D  +  +   T +R++VR+D T+PLR  F  +    + ER  I   YE++   C  C R+ H    
Subjt:  KVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDER-WITIRYERIPDFCFCCGRIGHVAKE

Query:  C
        C
Subjt:  C

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.0e-1742.86Show/hide
Query:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPL
        GL F+D+  FNQAL+AKQ +RI+  P +L+SR  +S YF  SS++  ++G+ P+Y  RS++ GR+LL +GL    G+G  T ++ D W+  E    PL
Subjt:  GLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAATGGAAGCGATTCAACTTAACGGATGGTGAAAAAGACCCCATCTTTACTCTGGACCAAGACAATGAGAGAAATATCAACGAACATCTAGAGCACTGCTTGGT
TGGAAGGTTGTTGTCGAACAGGATAATTTCAAAAGCCGCAATCAAGAACGTAATGAAGGGGGCTTGGAGAACAAGAGAAGATTTTTCAATAGAATCCATTGGGAAAAATA
TGTATATGTTAAAGTTTATCGATAAACCTGACAAAGAATTGATCAAGAGAAACGAACTGTGGTTATTTGACAAGAATCTACTTGTCCTTGATGATCCAGGAGCCAATGTC
TGTTCTTTGGAAACGGGCTTCAAGAAGGTGGAGTTTTGGTTAAGGCTTTTCAATCTTCCTCTGAGTTTCAGGAACAAGCATGTAGCAGAGAAAATAGGCAACAAATTAGG
GGACTTTGTTGACTTTGATAATGAAAAAAATGAAAGCTTTTGGGGAACTAGTGTACGAATCCAGGTTCGGCTGGACATAACTCAACCCCTTCGACGAGGCTTCATGATAA
AGACTTCAGGTGGTAATGATGAGCGCTGGATCACAATAAGATACGAAAGGATTCCAGATTTTTGTTTTTGTTGCGGGAGAATAGGGCATGTGGCAAAGGAATGCGCTGAG
GAAAAGAGCAAGGAGATGATAGCAAGTAACAATTATGAGTTTGGAGCTTGGCTGCGATTTCAAGCTTTTGGGAGACCTATACAAAAACCTGATGATCATGCCAATGGAAT
GAAGGATAGCTTCACCAAAAATTTAGGGTCAGAAGTACAAAAAGAAACGGATCTTGAGGAAAATGTGGCAATGGTAGGAGATGAATATCAGGGAAATATCTCAATGGACA
TTCCAGAGGCTTGGGGATCAAAAGGCAGATTGGAAGATAGGAATGTTTATTCTCCACATGTCTTGCAAACCAGCAATTTCAATAAAGAAGATAAAATGGAAGACCCTTTT
GTAGAAAGAATTGATCTTGCTTCCAGTGTGAGATGTCAGCCATGTCAATCCAGCAGCCAAGGGGGCAGTGTGGTGTCAGGAAGGAAGAACAGCCCTGCCTGGGATTATGA
AACTCATCTGTTGGAATACTCGGGGGTTGGGGAACCCGAGAGCATTCAGAGCACTAAAGAACCTTGTCTCTTCAAGGAATCCCCAAGAAATAGTGGTGGACTATGCCTCT
TTTGGAAAGACCAGAATTCAATCTCCATTCAATCCTATTCAAGTAATCACATAGATTCCATTGTGCACTGGCAAGGAAAGAGGTGGAGAATTTCAGACGTGTATGGGTGG
CCAGAAAGAGGTCAGAAGGGGAAAACCTGGGACCTTTTAAGAAGTCTTCATGGTGCTCTTAACATGCCATGGATGCTGGGAGGTGATATAACGAAAAGGGGAACCCATGT
TTGGGAATGGCTTGATCGTTTTCTGTGCAATCCGTATTTGGAAGGGCTTTTTAATTCACTGAAGGTCTCAAATCTAAACTGGTATAATTCGGATCACAGACCAATTGAAG
CACAACTGAGTAACCGAAAATTTAACAAGACCAGAAGGATCTACAGATCATTTAAATTTGAGGAATTCTGGAATAATCATGATGAATGTGCAGATATCATAACTAGCAAT
GGAGATTGGTCAGAGCTTCGGGGTCTTAATTTCAAGGACATTGAAGGGTTTAACCAAGCCTTAGTTGCTAAACAAGTTTGGCGTATTGTTTCAAACCCTGATTCTCTAGT
TTCTAGATTTTTCAAGAGCATTTATTTCAATTCTTCTAGCATTTTAACTGCTAATTTAGGAAGTAACCCAACTTACCTCGGGAGAAGTCTAATGTGGGGTAGAGATTTGC
TAGTCAAAGGTCTGAGGAATAGAACAGGAAATGGTCAATTGACTTTAATGTTTAAGGACCCTTGGCTGCCAAAAGAGCTTACCTTTAAACCTTTGTGTTTGGATCCTTCC
TATATAAACGATAAGGTGGAAGATTTTATTTTTGCATCAGGGGATTGGGATGTCGATAGGCTTAACAGGGTGGTATCTAGGGAAGATCTAGATATTATTAGGAGAACCCC
CCTCAACAGAAATTTAGATGATAAACTAGTATGGCATTACGATAAAACAGCTAAGGATATTTGGAGTAGAACATTTAATCGAGTGTTTTTGGACAAAGACTTTAACGGCA
GCTTGGCGGATCGTTGGTTGAGAATCGAGTCGAATTCTTCCATGGCTGAGATGGAGTTGGTTGCAGTGACTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAATGGAAGCGATTCAACTTAACGGATGGTGAAAAAGACCCCATCTTTACTCTGGACCAAGACAATGAGAGAAATATCAACGAACATCTAGAGCACTGCTTGGT
TGGAAGGTTGTTGTCGAACAGGATAATTTCAAAAGCCGCAATCAAGAACGTAATGAAGGGGGCTTGGAGAACAAGAGAAGATTTTTCAATAGAATCCATTGGGAAAAATA
TGTATATGTTAAAGTTTATCGATAAACCTGACAAAGAATTGATCAAGAGAAACGAACTGTGGTTATTTGACAAGAATCTACTTGTCCTTGATGATCCAGGAGCCAATGTC
TGTTCTTTGGAAACGGGCTTCAAGAAGGTGGAGTTTTGGTTAAGGCTTTTCAATCTTCCTCTGAGTTTCAGGAACAAGCATGTAGCAGAGAAAATAGGCAACAAATTAGG
GGACTTTGTTGACTTTGATAATGAAAAAAATGAAAGCTTTTGGGGAACTAGTGTACGAATCCAGGTTCGGCTGGACATAACTCAACCCCTTCGACGAGGCTTCATGATAA
AGACTTCAGGTGGTAATGATGAGCGCTGGATCACAATAAGATACGAAAGGATTCCAGATTTTTGTTTTTGTTGCGGGAGAATAGGGCATGTGGCAAAGGAATGCGCTGAG
GAAAAGAGCAAGGAGATGATAGCAAGTAACAATTATGAGTTTGGAGCTTGGCTGCGATTTCAAGCTTTTGGGAGACCTATACAAAAACCTGATGATCATGCCAATGGAAT
GAAGGATAGCTTCACCAAAAATTTAGGGTCAGAAGTACAAAAAGAAACGGATCTTGAGGAAAATGTGGCAATGGTAGGAGATGAATATCAGGGAAATATCTCAATGGACA
TTCCAGAGGCTTGGGGATCAAAAGGCAGATTGGAAGATAGGAATGTTTATTCTCCACATGTCTTGCAAACCAGCAATTTCAATAAAGAAGATAAAATGGAAGACCCTTTT
GTAGAAAGAATTGATCTTGCTTCCAGTGTGAGATGTCAGCCATGTCAATCCAGCAGCCAAGGGGGCAGTGTGGTGTCAGGAAGGAAGAACAGCCCTGCCTGGGATTATGA
AACTCATCTGTTGGAATACTCGGGGGTTGGGGAACCCGAGAGCATTCAGAGCACTAAAGAACCTTGTCTCTTCAAGGAATCCCCAAGAAATAGTGGTGGACTATGCCTCT
TTTGGAAAGACCAGAATTCAATCTCCATTCAATCCTATTCAAGTAATCACATAGATTCCATTGTGCACTGGCAAGGAAAGAGGTGGAGAATTTCAGACGTGTATGGGTGG
CCAGAAAGAGGTCAGAAGGGGAAAACCTGGGACCTTTTAAGAAGTCTTCATGGTGCTCTTAACATGCCATGGATGCTGGGAGGTGATATAACGAAAAGGGGAACCCATGT
TTGGGAATGGCTTGATCGTTTTCTGTGCAATCCGTATTTGGAAGGGCTTTTTAATTCACTGAAGGTCTCAAATCTAAACTGGTATAATTCGGATCACAGACCAATTGAAG
CACAACTGAGTAACCGAAAATTTAACAAGACCAGAAGGATCTACAGATCATTTAAATTTGAGGAATTCTGGAATAATCATGATGAATGTGCAGATATCATAACTAGCAAT
GGAGATTGGTCAGAGCTTCGGGGTCTTAATTTCAAGGACATTGAAGGGTTTAACCAAGCCTTAGTTGCTAAACAAGTTTGGCGTATTGTTTCAAACCCTGATTCTCTAGT
TTCTAGATTTTTCAAGAGCATTTATTTCAATTCTTCTAGCATTTTAACTGCTAATTTAGGAAGTAACCCAACTTACCTCGGGAGAAGTCTAATGTGGGGTAGAGATTTGC
TAGTCAAAGGTCTGAGGAATAGAACAGGAAATGGTCAATTGACTTTAATGTTTAAGGACCCTTGGCTGCCAAAAGAGCTTACCTTTAAACCTTTGTGTTTGGATCCTTCC
TATATAAACGATAAGGTGGAAGATTTTATTTTTGCATCAGGGGATTGGGATGTCGATAGGCTTAACAGGGTGGTATCTAGGGAAGATCTAGATATTATTAGGAGAACCCC
CCTCAACAGAAATTTAGATGATAAACTAGTATGGCATTACGATAAAACAGCTAAGGATATTTGGAGTAGAACATTTAATCGAGTGTTTTTGGACAAAGACTTTAACGGCA
GCTTGGCGGATCGTTGGTTGAGAATCGAGTCGAATTCTTCCATGGCTGAGATGGAGTTGGTTGCAGTGACTTGTTGA
Protein sequenceShow/hide protein sequence
MEEWKRFNLTDGEKDPIFTLDQDNERNINEHLEHCLVGRLLSNRIISKAAIKNVMKGAWRTREDFSIESIGKNMYMLKFIDKPDKELIKRNELWLFDKNLLVLDDPGANV
CSLETGFKKVEFWLRLFNLPLSFRNKHVAEKIGNKLGDFVDFDNEKNESFWGTSVRIQVRLDITQPLRRGFMIKTSGGNDERWITIRYERIPDFCFCCGRIGHVAKECAE
EKSKEMIASNNYEFGAWLRFQAFGRPIQKPDDHANGMKDSFTKNLGSEVQKETDLEENVAMVGDEYQGNISMDIPEAWGSKGRLEDRNVYSPHVLQTSNFNKEDKMEDPF
VERIDLASSVRCQPCQSSSQGGSVVSGRKNSPAWDYETHLLEYSGVGEPESIQSTKEPCLFKESPRNSGGLCLFWKDQNSISIQSYSSNHIDSIVHWQGKRWRISDVYGW
PERGQKGKTWDLLRSLHGALNMPWMLGGDITKRGTHVWEWLDRFLCNPYLEGLFNSLKVSNLNWYNSDHRPIEAQLSNRKFNKTRRIYRSFKFEEFWNNHDECADIITSN
GDWSELRGLNFKDIEGFNQALVAKQVWRIVSNPDSLVSRFFKSIYFNSSSILTANLGSNPTYLGRSLMWGRDLLVKGLRNRTGNGQLTLMFKDPWLPKELTFKPLCLDPS
YINDKVEDFIFASGDWDVDRLNRVVSREDLDIIRRTPLNRNLDDKLVWHYDKTAKDIWSRTFNRVFLDKDFNGSLADRWLRIESNSSMAEMELVAVTC