; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029425 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029425
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold12:37231733..37237173
RNA-Seq ExpressionSpg029425
SyntenySpg029425
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2292067.1 hypothetical protein GH714_006944 [Hevea brasiliensis]7.8e-5329.1Show/hide
Query:  WNNVGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSRWFHRKATI
        W N         + +C + L  W R L  N K K++  KK ++    N+   +     ++  E + LL +EE YW+QRS+  WLK GD N+ +FH +A  
Subjt:  WNNVGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSRWFHRKATI

Query:  RSKTNEIIGITDSNGVWTED------------PIVIER---EWNWDLLKEAVNKEDLEIISRIPINL-ASEDKFLWHYDKCGIYSVRSGYKIFIRNKINA
        R + N I  + D  G W  +             I  +R    W+   L    + ED++ I  IP++L  + DK +WHY   G YSV+SGY + +   +N 
Subjt:  RSKTNEIIGITDSNGVWTED------------PIVIER---EWNWDLLKEAVNKEDLEIISRIPINL-ASEDKFLWHYDKCGIYSVRSGYKIFIRNKINA

Query:  GPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFEL
            + + + W  LW LKIP K+K F WKA+   +P + NL  RG+  D  C  C   +E  +H+++ C KAKE W++ LN +L    F  +F     E 
Subjt:  GPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFEL

Query:  NSVLSFEDLQKFAVTCWAIWTERNNLI-HDKPIPS-------PTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLG
        N + +   L       W++W  RN L+   K   S        T+R EW     +D+            C       +E  + P  G  K N D +   G
Subjt:  NSVLSFEDLQKFAVTCWAIWTERNNLI-HDKPIPS-------PTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLG

Query:  EAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCT
        E  +GF  V RN +G  +     F    FSP +AE L + E   +  N GLD++ ++  CL     +++  E+ +E+G+++++      +F  +SF HCT
Subjt:  EAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCT

Query:  -RNGNSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRLSG
         R  N  A ++A+ A+      +W  +V   + ++++ D +SG
Subjt:  -RNGNSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRLSG

KAF2317147.1 hypothetical protein GH714_012179 [Hevea brasiliensis]2.4e-5727.04Show/hide
Query:  LSNCADALGKWGRELFLNRKNKIRDCKK---FLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSRWFHRKATIRSKTNEIIG
        L  C+  LG+WG  L    K +I DCK+    L+   D     +F +  +   +   LL  +E YW+QR++E WLK GD+N+++FHRKATIR + N I  
Subjt:  LSNCADALGKWGRELFLNRKNKIRDCKK---FLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSRWFHRKATIRSKTNEIIG

Query:  ITDSNGVW-------------------------------------TED--------------------------------PIVIEREWNWDLLKEAVNKE
        + D NG W                                     T+D                                 ++ +  WN DL+    N+ 
Subjt:  ITDSNGVW-------------------------------------TED--------------------------------PIVIEREWNWDLLKEAVNKE

Query:  DLEIISRIPINLASE-DKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRC
        D ++I  IP+  +S  D   W +DK G YSV S YK+  +    A  +    +K W  LW +    K+++F W+A++G +PTR  L  R + T  +CP C
Subjt:  DLEIISRIPINLASE-DKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRC

Query:  NKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPT--------MRSEWVKRYIS
        N D ES  H+L+ C  A+ +W    +H        H+F D      ++ +  D    A  CW+IW ERN+++     P+          ++ EW    +S
Subjt:  NKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPT--------MRSEWVKRYIS

Query:  DYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLI
          +RA   P +       +  + ++WK P     K+N+D A  +    +G G+V RN  G  +          FSP++AE++ + E + +  +    N+I
Subjt:  DYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLI

Query:  LESDCLHAIRLINN-EVEDRTELGTVVEEIKRRMMAFS-DISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDW
        +ESD L  + ++N+  VE+ + +G +V + +  +   S +I F H  R+ N +A  +A+  R    ++ W L  P++
Subjt:  LESDCLHAIRLINN-EVEDRTELGTVVEEIKRRMMAFS-DISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDW

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]9.3e-5431.06Show/hide
Query:  EWNWDLLKEAVNKEDLEIISRIPI-NLASEDKFLWHYDKCGIYSVRSGYKI--FIRNKINAGPS--SNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPT
        +WN  LLK+    ++++   +IP+ +LA  D  +WHY++ G+YSV+SGY++    ++K++  PS   +   K W  +W LKIP K+K F W+    F+P 
Subjt:  EWNWDLLKEAVNKEDLEIISRIPI-NLASEDKFLWHYDKCGIYSVRSGYKI--FIRNKINAGPS--SNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPT

Query:  RVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTM
           L  R I    ICP C++  ES  H +  CE AKE+W  +   ++ +    ++F + W  L    S E+   FA  CW +W  RN+ I +    + T 
Subjt:  RVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTM

Query:  RSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKF
            + +   ++  AN    +    Q   +A    W+PPP G++KIN+D A K G++  G G+V RN++G+ + A     +  +     EL+A +EG++F
Subjt:  RSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKF

Query:  AANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRLS
        A ++G    +LE D    I  I +  E     G ++EE+   +  F  +      R+GN +A T+A+FA      V W  E P W+  ++ +D LS
Subjt:  AANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRLS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.3e-5935.71Show/hide
Query:  IEREWNWDL--LKEAVNKEDLEIISRIPINLAS-EDKFLWHYDKCGIYSVRSGYKIFIRNKINA-GPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFI
        I  + NWD+  +  +   ED ++I  +PI+  + +D +LWHYDK G YSVRSGYK+++  K NA   S+N     W+++W+L +P K+K F W++ +  I
Subjt:  IEREWNWDL--LKEAVNKEDLEIISRIPINLAS-EDKFLWHYDKCGIYSVRSGYKIFIRNKINA-GPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFI

Query:  PTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNH-DLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPS
        PT  NL  RGI     C  C    ES  H    C++A++IW         L  E N +F + W  L   L  +DL   A+T W IW +RN+LIH K +  
Subjt:  PTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNH-DLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPS

Query:  PTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEG
           + EW+  ++  + +A  +  S  + Q  +R   + W+P      K+N DAAC+   A + FG + R+SS  L+ A++I      SP +AE+  I+EG
Subjt:  PTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEG

Query:  MKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFA-RCEKCSVVWNLEVPDWIHVLVHSDRLS
        +KFAA     +L +ESD L AI+LI NE+  R +    V EI+     F+ ISF H +R  N  A  +AK+       +  W    P W+  LV  D  S
Subjt:  MKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFA-RCEKCSVVWNLEVPDWIHVLVHSDRLS

Query:  G-AHVA
          AHVA
Subjt:  G-AHVA

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.1e-0638.46Show/hide
Query:  NGFWNN-VGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWG
        +G W+N   +   S ++   + AL  WGR    +   +I+  K  + +AY+    ++F ++H +E +L  LLE EEI+WKQRSRE+WLKWG
Subjt:  NGFWNN-VGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.6e-5827.1Show/hide
Query:  LSNCADALGKWGRELFLNRKNKIRDCKK---FLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSRWFHRKATIRSKTNEIIG
        L  C+  LG+WG  L    K +I DCK+    L+   D     +F +  +   +   LL  +E YW+QR++E WLK GD+N+++FHRKATIR + N I  
Subjt:  LSNCADALGKWGRELFLNRKNKIRDCKK---FLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSRWFHRKATIRSKTNEIIG

Query:  ITDSNGVW-------------------------------------TED--------------------------------PIVIEREWNWDLLKEAVNKE
        + D NG W                                     T+D                                 ++ +  WN DL+    N+ 
Subjt:  ITDSNGVW-------------------------------------TED--------------------------------PIVIEREWNWDLLKEAVNKE

Query:  DLEIISRIPINLASE-DKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRC
        D ++I  IP+  +S  D   W +DK G YSV S YK+  +    A  +    +K W  LW +    K+++F W+A++G +PTR  L  R + T  +CP C
Subjt:  DLEIISRIPINLASE-DKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRC

Query:  NKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPT--------MRSEWVKRYIS
        N D ES  H+L+ C  A+ +W    +H        H+F D      ++ +  D    A  CW+IW ERN+++     P+          ++ EW    +S
Subjt:  NKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPT--------MRSEWVKRYIS

Query:  DYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLI
          +RA   P +       +  + ++WK P     K+N+D A  +    +G G+V RN  G  +          FSP++AE++ + E + +  +    N+I
Subjt:  DYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLI

Query:  LESDCLHAIRLINN-EVEDRTELGTVVEEIKRRMMAFS-DISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWI-HVLV
        +ESD L  + ++N+  VE+ + +G +V + +  +   S +I F H  R+ N +A  +A+  R    ++ W L  P+++ H+L+
Subjt:  LESDCLHAIRLINN-EVEDRTELGTVVEEIKRRMMAFS-DISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWI-HVLV

TrEMBL top hitse value%identityAlignment
A0A1S8ACU2 Ribonuclease H-like superfamily protein1.9e-5230.25Show/hide
Query:  IYWKQRSREEWLKW----GDK----NSRWFHRKATIRSKTNEIIGITDSNGVWTEDPIVIEREWNWDLLKEAVNKEDLEIISRIPINLASE-DKFLWHYD
        I W ++     ++W    GD+     S W  R    +  +   +G+  +      + I   ++W   L+++  N ED E+ISRI + ++ + D+ LWHYD
Subjt:  IYWKQRSREEWLKW----GDK----NSRWFHRKATIRSKTNEIIGITDSNGVWTEDPIVIEREWNWDLLKEAVNKEDLEIISRIPINLASE-DKFLWHYD

Query:  KCGIYSVRSGYKIFIRNKINAGPSSNPME-KVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWEL
        K G YSV+SGY+I +R K  A PSS+      W+ +W L++P K+K F WKA   F+PT  NL +R +  + ICPRC    E  +H ++ C+ AK++W+L
Subjt:  KCGIYSVRSGYKIFIRNKINAGPSSNPME-KVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWEL

Query:  TLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPG
        T   + +Q+  N +      EL +  S ++L+     CW  W  RN  + +     P +     +  +  Y R         +      A   KW PPP 
Subjt:  TLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPG

Query:  GVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKR
        G +K N+DAA    +   G G+V R+ SG ++ A+   ++     + AE  A+  G++      L  LILE+DC   +  + +    RTE+   + EI++
Subjt:  GVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKR

Query:  RMMAF-SDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVP
        ++ +F S +      R  N+IA T+AK A   + S VW  E+P
Subjt:  RMMAF-SDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVP

A0A6J1DX30 uncharacterized protein LOC1110248742.1e-5935.71Show/hide
Query:  IEREWNWDL--LKEAVNKEDLEIISRIPINLAS-EDKFLWHYDKCGIYSVRSGYKIFIRNKINA-GPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFI
        I  + NWD+  +  +   ED ++I  +PI+  + +D +LWHYDK G YSVRSGYK+++  K NA   S+N     W+++W+L +P K+K F W++ +  I
Subjt:  IEREWNWDL--LKEAVNKEDLEIISRIPINLAS-EDKFLWHYDKCGIYSVRSGYKIFIRNKINA-GPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFI

Query:  PTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNH-DLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPS
        PT  NL  RGI     C  C    ES  H    C++A++IW         L  E N +F + W  L   L  +DL   A+T W IW +RN+LIH K +  
Subjt:  PTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNH-DLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPS

Query:  PTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEG
           + EW+  ++  + +A  +  S  + Q  +R   + W+P      K+N DAAC+   A + FG + R+SS  L+ A++I      SP +AE+  I+EG
Subjt:  PTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEG

Query:  MKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFA-RCEKCSVVWNLEVPDWIHVLVHSDRLS
        +KFAA     +L +ESD L AI+LI NE+  R +    V EI+     F+ ISF H +R  N  A  +AK+       +  W    P W+  LV  D  S
Subjt:  MKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFA-RCEKCSVVWNLEVPDWIHVLVHSDRLS

Query:  G-AHVA
          AHVA
Subjt:  G-AHVA

A0A6J1DX30 uncharacterized protein LOC1110248745.4e-0738.46Show/hide
Query:  NGFWNN-VGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWG
        +G W+N   +   S ++   + AL  WGR    +   +I+  K  + +AY+    ++F ++H +E +L  LLE EEI+WKQRSRE+WLKWG
Subjt:  NGFWNN-VGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWG

A0A6J1DX30 uncharacterized protein LOC1110248744.5e-5431.06Show/hide
Query:  EWNWDLLKEAVNKEDLEIISRIPI-NLASEDKFLWHYDKCGIYSVRSGYKI--FIRNKINAGPS--SNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPT
        +WN  LLK+    ++++   +IP+ +LA  D  +WHY++ G+YSV+SGY++    ++K++  PS   +   K W  +W LKIP K+K F W+    F+P 
Subjt:  EWNWDLLKEAVNKEDLEIISRIPI-NLASEDKFLWHYDKCGIYSVRSGYKI--FIRNKINAGPS--SNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPT

Query:  RVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTM
           L  R I    ICP C++  ES  H +  CE AKE+W  +   ++ +    ++F + W  L    S E+   FA  CW +W  RN+ I +    + T 
Subjt:  RVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTM

Query:  RSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKF
            + +   ++  AN    +    Q   +A    W+PPP G++KIN+D A K G++  G G+V RN++G+ + A     +  +     EL+A +EG++F
Subjt:  RSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKF

Query:  AANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRLS
        A ++G    +LE D    I  I +  E     G ++EE+   +  F  +      R+GN +A T+A+FA      V W  E P W+  ++ +D LS
Subjt:  AANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRLS

A0A803QI56 Uncharacterized protein5.9e-5428.32Show/hide
Query:  WNNVGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNS--RW-FHRK
        W + GD  L   L+ CAD L  WG+E+  N K +I  CK  +K   +     +      ++ EL  +L++ E +WKQRS++ WLK GD NS  RW     
Subjt:  WNNVGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKFLKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNS--RW-FHRK

Query:  ATIRSKTNEIIGITDSNGVWTEDPIVIERE-----------WNWDLLKEAVNKEDLEIISRIPINLA-SEDKFLWHYDKCGIYSVRSGYKIFIRNKINAG
          I       +   D   V +  P + E +           W+ D+L +   + D ++I  IP+N++   DK  W Y+  GIYSV+SGY +  +      
Subjt:  ATIRSKTNEIIGITDSNGVWTEDPIVIERE-----------WNWDLLKEAVNKEDLEIISRIPINLA-SEDKFLWHYDKCGIYSVRSGYKIFIRNKINAG

Query:  PSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELN
         + + + K W+  W+ KIP KVK+  W+A    +PT   L  + +     CP C+ + ES  H LI C K K++W+              NF D +    
Subjt:  PSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELN

Query:  SVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGS--GFG
        +    E      V CWAIW+ RN+++  K   +          Y+  +  A  +    S    +    +E W  P     K+N+DAA  L ++G+  G G
Subjt:  SVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGS--GFG

Query:  LVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIA
        LV R+  G L+           SP +AE + I E + +       ++ LE+DCL  ++ I +EV+  +  G +++E K  ++    IS +   R+ N +A
Subjt:  LVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIA

Query:  DTIAK
           A+
Subjt:  DTIAK

A0A803QQT2 Uncharacterized protein3.6e-5131.31Show/hide
Query:  VWTEDPIVIEREWNWDLLKEAVNKEDLEIISRIPI-NLASEDKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSS-NPMEKVWSNLWRLKIPAKVKHFCWK
        ++  D  + + +W+   ++   N  D+++I  IP  +   EDK LWHY K G YSV+SGY++          S+ + + + W  LWRLKIP KVKHF WK
Subjt:  VWTEDPIVIEREWNWDLLKEAVNKEDLEIISRIPI-NLASEDKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSS-NPMEKVWSNLWRLKIPAKVKHFCWK

Query:  AINGFIPTRVNLHKRGIQTDLICPRCNKDI-ESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIH
          + ++P  VNL KRGI + ++C RC+  + ES  H L  C+ +K  W ++  +D L+     +       + +    E L+ F +  W IW  RN ++H
Subjt:  AINGFIPTRVNLHKRGIQTDLICPRCNKDI-ESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIH

Query:  DKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSE--KWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMA
            P P    EW   +++D+           + + R++ SSE  +W PP      IN+DA  K G   SG G V R+++G +L A+A   + E  P   
Subjt:  DKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSE--KWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMA

Query:  ELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMM--AFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVP
        EL+AI +G++      L    +E+DCL A+ LI N+     ++  ++  I+  +   +F  ISF+   R  N +A  +A +A   K S +W   +P
Subjt:  ELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMM--AFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657509.1e-2023.8Show/hide
Query:  WGDKNSRWFHRKATIRSKTNEIIGITDSNGVWTEDPIVIEREWNWDLLKE-AVNKEDLEIISRI-PINLASEDKFLWHYDKCGIYSVRSGYKIFIRNKIN
        W D   RW   K  +     E    TD + V  +D  +  R W++  +     N   LE+ + +  +   + D+  W + + G +SVRS Y++   +++ 
Subjt:  WGDKNSRWFHRKATIRSKTNEIIGITDSNGVWTEDPIVIEREWNWDLLKE-AVNKEDLEIISRI-PINLASEDKFLWHYDKCGIYSVRSGYKIFIRNKIN

Query:  AGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFE
          P  N M   ++ LW++++P +VK F W   N  + T    H+R +    +C  C   +ES  H+L  C     IW   +     Q  F+ +  +  ++
Subjt:  AGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADRWFE

Query:  -LNSVLSFEDL---QKFAVTCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAG
         L      ED+     FAV  W  W  R   I  +       R ++VK +  +  RA++        QPR       W  P  G  K+N D A +     
Subjt:  -LNSVLSFEDL---QKFAVTCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAG

Query:  SGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNG
        +  G V R+ +G   G  ++ +    S   AEL  +  G+ FA    +  + LE D    +  +   + D   L  +V      +     +  +H  R  
Subjt:  SGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNG

Query:  NSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRL
        N +AD +A +A            VPD +  L+  D L
Subjt:  NSIADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRL

Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein7.4e-0930.88Show/hide
Query:  KINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMM
        K N DA+   G+  SG G + RNS G +L       +   +P  AE  A++  ++  +  G   +I E D  +  RLIN +  D   L   ++ IK  + 
Subjt:  KINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMM

Query:  AFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNL
        +F+   FI   R  N  ADT+ K  +  K S  W+L
Subjt:  AFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNL

AT2G02650.1 Ribonuclease H-like superfamily protein3.0e-2623.93Show/hide
Query:  VRSGYKIFIRNKINAGPSSNP---MEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIW---ELT
        +RSGY +     +    +  P     +V   +W+L +  K+KHF W+ + G + T   L  R I  D IC RC  + E+  HI+  C   + +W    + 
Subjt:  VRSGYKIFIRNKINAGPSSNP---MEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIW---ELT

Query:  LNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAV--TCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRN-----RASSEK
        + +         +  +R  +L+   +   L +F      W +W  RN  +  +   SP   +    +  +++L AN    + +     N     R  S +
Subjt:  LNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAV--TCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRN-----RASSEK

Query:  WKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTV
        W PPP G  K N D+    G   +  G   R  +G ++       ++      AE L  +  ++     GL  +  ESD    + LINN  ED + LGT+
Subjt:  WKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTV

Query:  VEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWI
        + +I+  M+     S     R  NS AD +A                P W+
Subjt:  VEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWI

AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-3626.79Show/hide
Query:  WNWDLLKEAVNKEDLEIISRIPINLASE-DKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSSNPME---KVWSNLWRLKIPAKVKHFCWKAINGFIPTRV
        W+   + + V++ D   I RI +  + + DK +W+Y+  G Y+VRSGY +   +     P+ NP      + + +W L I  K+KHF W+A++  + T  
Subjt:  WNWDLLKEAVNKEDLEIISRIPINLASE-DKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSSNPME---KVWSNLWRLKIPAKVKHFCWKAINGFIPTRV

Query:  NLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELT----LNHDLLQVEFNHNFADRWFELNSV--LSFEDLQKFAVT--CWAIWTERNNLIHDKP
         L  RG++ D  CPRC+++ ES +H L  C  A   W L+    + + L+  +F  N ++    LN V   +  D  K       W IW  RNN++ +K 
Subjt:  NLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELT----LNHDLLQVEFNHNFADRWFELNSV--LSFEDLQKFAVT--CWAIWTERNNLIHDKP

Query:  IPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAI
          SP+      K    D+L A  +     S   +   +  +W+ PP    K N DA   + +  +  G + RN  G  +   ++   +  +P  AE  A+
Subjt:  IPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAI

Query:  VEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWI
        +  ++     G   + +E DC   I LIN  +   + L   +E+I      F+ I F    R GN +A  +AK+          +  +P W+
Subjt:  VEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAKFARCEKCSVVWNLEVPDWI

AT3G25270.1 Ribonuclease H-like superfamily protein8.4e-2124.02Show/hide
Query:  KVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELT-LNHDLLQVEFNHNFADRWFELNSVLSFE
        ++ + +W+LK   K+KHF WK ++G + T  NL +R I+    C RC ++ E+S H+   C  A+++W  + + H  L+             L+S L+  
Subjt:  KVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELT-LNHDLLQVEFNHNFADRWFELNSVLSFE

Query:  DLQKFAVT---CWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPAS-----NSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFG
          Q F +     W +W  RN L+  +   S     +  +  + ++   N    S     +SS   +   +  KW+ PP    K N D A       +  G
Subjt:  DLQKFAVT---CWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPAS-----NSSCQPRNRASSEKWKPPPGGVWKINIDAACKLGEAGSGFG

Query:  LVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTV--VEEIKRRMMAFSDISFIHCTRNGNS
         + R+ +G  +G+             +E  A++  M+ A + G   +I E D      L+NNE   +   G    + E +     F +  F    R  N 
Subjt:  LVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTV--VEEIKRRMMAFSDISFIHCTRNGNS

Query:  IADTIAKFARCEKCSVVWNLEVPDWIHVLVHSD
         AD +AK       S  ++  VP +I   ++ D
Subjt:  IADTIAKFARCEKCSVVWNLEVPDWIHVLVHSD

AT4G29090.1 Ribonuclease H-like superfamily protein6.4e-2925.9Show/hide
Query:  REWNWDLLKEAVNKEDLEIISRI-PINLASEDKFLWHYDKCGIYSVRSGYKIF--IRNKIN-----AGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAING
        REW  D+++    + + ++I  + P      D + W Y   G Y+V+SGY +   I NK +     + PS NP   ++  +W+ +   K++HF WK ++ 
Subjt:  REWNWDLLKEAVNKEDLEIISRI-PINLASEDKFLWHYDKCGIYSVRSGYKIF--IRNKIN-----AGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAING

Query:  FIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADR------W-FEL-NSVLSFEDLQKFAV-TCWAIWTERN
         +P    L  R +  +  C RC    E+ +H+L +C  A+  W ++     + +     +AD       W F L N    +E   +      W +W  RN
Subjt:  FIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWELTLNHDLLQVEFNHNFADR------W-FEL-NSVLSFEDLQKFAV-TCWAIWTERN

Query:  NLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPR---NRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDL--LGASAIFSEN
         L+           ++ V R   D L          SC  +   NR+S  +W+PPP    K N DA         G G V RN  G++  +GA A+    
Subjt:  NLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPR---NRASSEKWKPPPGGVWKINIDAACKLGEAGSGFGLVCRNSSGDL--LGASAIFSEN

Query:  EFSPSMAELL-AIVEGMKFA----ANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAK
           P +  +L A +E M++A    +    + +I ESD    I ++NN+ E    L   +++++R +  F+++ F+   R GN++A+ +A+
Subjt:  EFSPSMAELL-AIVEGMKFA----ANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNSIADTIAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACTGAGGAGCTGATCAACGAGTGGAAAAGGTTTAGTCTGAGGGAAGTGGAAAAAGAAACAGTTTTTACGGTGGAAGCAAAGAATCGATATATGGCTGAGAAGAT
CGGGAATAGGATTGGAGAATTCGTGGAAGTTGACTATGACAGTGACGACCTGCACTGGGGGAACAACATGAGGATCAGAGAGTCACCACCAAGAAATTCCAACCAGAATG
ATGACATTGATGATAACCTGGCAGGGAAACTAAGTACATACTCTGAAGGAGATGGCAATGAGGACAGAAATCCAGATCAAATGGAAACTATAGACCCGGTGGCCCCAAGA
AGCAGAGAAGATGTGGAGGAAGATGCAAGATTAAGAAAGGGCAAGGGGAAGATGGTGGATCTAAATGAGTCAAGTATGAATGATGGGGAGGATGGAAATGCTGAATTAAA
TCTCCTAGCCCTGGTGGATGAAATGGTCAATGCTGACAGCCCATGTGGGTCAACCACCGATGGAATATCAGGGGGATTAAATGTGCCTGCAAGGAAGAAATCTACTTGGA
AGAGAAAAGTAAGAATGGAGCAGTTTAATGAACCATTAATCAACCAAGATTCCATGATGAGTAATTTGAAGAAGAGGAAAGTTATGGGTGAGCTGAACAACAGTGGAAAG
AGGTCCAAAAGCGAGGGATATAGGTATTTGGAGGTGAGTGCAGACCCGGAGACCAAATGCGGAGATGAGATAGTGTGTAAGATCAAACAAAGTTGTAAGTTCGACGGTTG
CTTGACCGTTAAAAGCAGGGGAGCCAGTGGTGGGTTATGCCTCTTATGGAAAGATAAAGATGTGGTGAAAGTGAGTCTTATAAAGGAAAATGGTTTTTGGAATAATGTTG
GTGATTGTCCTTTATCCTACAATCTTAGTAACTGTGCTGATGCCTTGGGTAAATGGGGAAGAGAATTATTCTTGAATCGGAAAAACAAAATTAGGGATTGTAAAAAGTTT
CTTAAGGAAGCCTATGATAATTTGCAGAACATTAACTTTAATTTGGTTCATAATATTGAATTTGAGCTGGATAAACTCCTTGAGGAGGAGGAAATCTATTGGAAACAGAG
ATCCCGAGAGGAGTGGCTTAAGTGGGGAGATAAAAATTCTAGGTGGTTTCATAGGAAAGCTACTATTCGAAGTAAGACTAATGAAATTATAGGCATTACAGATTCTAATG
GTGTGTGGACTGAGGATCCCATTGTTATTGAAAGGGAGTGGAATTGGGATCTTCTGAAAGAGGCAGTAAACAAGGAAGATTTGGAGATCATTAGTAGAATCCCTATAAAT
CTGGCAAGTGAAGATAAATTTTTGTGGCATTACGATAAATGTGGAATCTATTCGGTTAGGAGCGGATACAAAATCTTCATTAGGAATAAAATAAATGCTGGCCCGAGTAG
CAACCCTATGGAGAAAGTATGGTCGAATCTATGGAGACTAAAGATCCCTGCTAAAGTGAAGCATTTTTGTTGGAAAGCGATCAATGGTTTTATCCCAACAAGAGTAAATT
TACATAAAAGGGGTATTCAGACGGATTTAATTTGCCCAAGATGTAATAAGGATATTGAGTCCTCTGATCATATTCTGATTCGATGTGAGAAAGCAAAGGAGATATGGGAG
CTTACCTTAAATCATGATCTGTTGCAGGTTGAATTTAATCACAATTTCGCAGACAGGTGGTTTGAGCTCAACTCTGTTCTTTCCTTTGAAGATCTTCAGAAGTTTGCAGT
AACTTGTTGGGCTATATGGACAGAGAGGAATAATTTAATTCATGATAAACCAATTCCTTCTCCAACTATGCGTAGTGAATGGGTTAAGAGGTACATTTCAGATTACCTTC
GAGCGAATGCAGCTCCTGCATCAAATTCGAGTTGCCAGCCCAGAAATAGAGCAAGTTCTGAGAAATGGAAGCCTCCGCCGGGTGGCGTCTGGAAAATTAACATTGATGCG
GCTTGTAAGCTTGGAGAAGCGGGATCTGGTTTTGGTTTGGTGTGCAGAAATTCATCTGGTGATCTTTTGGGAGCTTCGGCTATCTTCTCTGAGAATGAATTTTCCCCTTC
AATGGCGGAACTATTGGCAATTGTGGAAGGTATGAAGTTCGCTGCAAATCTAGGACTCGACAACTTAATCCTGGAATCAGATTGTCTACATGCAATTAGGCTTATCAATA
ATGAGGTTGAAGATCGTACTGAGCTTGGGACAGTGGTTGAGGAAATCAAACGGAGGATGATGGCATTTTCTGATATTTCTTTTATTCATTGTACTAGAAATGGCAATTCA
ATAGCTGATACAATTGCCAAGTTTGCTAGATGTGAGAAATGTTCTGTGGTCTGGAACCTGGAAGTTCCTGATTGGATTCATGTGTTGGTCCATAGTGACCGTTTATCTGG
TGCCCATGTGGCTTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAACTGAGGAGCTGATCAACGAGTGGAAAAGGTTTAGTCTGAGGGAAGTGGAAAAAGAAACAGTTTTTACGGTGGAAGCAAAGAATCGATATATGGCTGAGAAGAT
CGGGAATAGGATTGGAGAATTCGTGGAAGTTGACTATGACAGTGACGACCTGCACTGGGGGAACAACATGAGGATCAGAGAGTCACCACCAAGAAATTCCAACCAGAATG
ATGACATTGATGATAACCTGGCAGGGAAACTAAGTACATACTCTGAAGGAGATGGCAATGAGGACAGAAATCCAGATCAAATGGAAACTATAGACCCGGTGGCCCCAAGA
AGCAGAGAAGATGTGGAGGAAGATGCAAGATTAAGAAAGGGCAAGGGGAAGATGGTGGATCTAAATGAGTCAAGTATGAATGATGGGGAGGATGGAAATGCTGAATTAAA
TCTCCTAGCCCTGGTGGATGAAATGGTCAATGCTGACAGCCCATGTGGGTCAACCACCGATGGAATATCAGGGGGATTAAATGTGCCTGCAAGGAAGAAATCTACTTGGA
AGAGAAAAGTAAGAATGGAGCAGTTTAATGAACCATTAATCAACCAAGATTCCATGATGAGTAATTTGAAGAAGAGGAAAGTTATGGGTGAGCTGAACAACAGTGGAAAG
AGGTCCAAAAGCGAGGGATATAGGTATTTGGAGGTGAGTGCAGACCCGGAGACCAAATGCGGAGATGAGATAGTGTGTAAGATCAAACAAAGTTGTAAGTTCGACGGTTG
CTTGACCGTTAAAAGCAGGGGAGCCAGTGGTGGGTTATGCCTCTTATGGAAAGATAAAGATGTGGTGAAAGTGAGTCTTATAAAGGAAAATGGTTTTTGGAATAATGTTG
GTGATTGTCCTTTATCCTACAATCTTAGTAACTGTGCTGATGCCTTGGGTAAATGGGGAAGAGAATTATTCTTGAATCGGAAAAACAAAATTAGGGATTGTAAAAAGTTT
CTTAAGGAAGCCTATGATAATTTGCAGAACATTAACTTTAATTTGGTTCATAATATTGAATTTGAGCTGGATAAACTCCTTGAGGAGGAGGAAATCTATTGGAAACAGAG
ATCCCGAGAGGAGTGGCTTAAGTGGGGAGATAAAAATTCTAGGTGGTTTCATAGGAAAGCTACTATTCGAAGTAAGACTAATGAAATTATAGGCATTACAGATTCTAATG
GTGTGTGGACTGAGGATCCCATTGTTATTGAAAGGGAGTGGAATTGGGATCTTCTGAAAGAGGCAGTAAACAAGGAAGATTTGGAGATCATTAGTAGAATCCCTATAAAT
CTGGCAAGTGAAGATAAATTTTTGTGGCATTACGATAAATGTGGAATCTATTCGGTTAGGAGCGGATACAAAATCTTCATTAGGAATAAAATAAATGCTGGCCCGAGTAG
CAACCCTATGGAGAAAGTATGGTCGAATCTATGGAGACTAAAGATCCCTGCTAAAGTGAAGCATTTTTGTTGGAAAGCGATCAATGGTTTTATCCCAACAAGAGTAAATT
TACATAAAAGGGGTATTCAGACGGATTTAATTTGCCCAAGATGTAATAAGGATATTGAGTCCTCTGATCATATTCTGATTCGATGTGAGAAAGCAAAGGAGATATGGGAG
CTTACCTTAAATCATGATCTGTTGCAGGTTGAATTTAATCACAATTTCGCAGACAGGTGGTTTGAGCTCAACTCTGTTCTTTCCTTTGAAGATCTTCAGAAGTTTGCAGT
AACTTGTTGGGCTATATGGACAGAGAGGAATAATTTAATTCATGATAAACCAATTCCTTCTCCAACTATGCGTAGTGAATGGGTTAAGAGGTACATTTCAGATTACCTTC
GAGCGAATGCAGCTCCTGCATCAAATTCGAGTTGCCAGCCCAGAAATAGAGCAAGTTCTGAGAAATGGAAGCCTCCGCCGGGTGGCGTCTGGAAAATTAACATTGATGCG
GCTTGTAAGCTTGGAGAAGCGGGATCTGGTTTTGGTTTGGTGTGCAGAAATTCATCTGGTGATCTTTTGGGAGCTTCGGCTATCTTCTCTGAGAATGAATTTTCCCCTTC
AATGGCGGAACTATTGGCAATTGTGGAAGGTATGAAGTTCGCTGCAAATCTAGGACTCGACAACTTAATCCTGGAATCAGATTGTCTACATGCAATTAGGCTTATCAATA
ATGAGGTTGAAGATCGTACTGAGCTTGGGACAGTGGTTGAGGAAATCAAACGGAGGATGATGGCATTTTCTGATATTTCTTTTATTCATTGTACTAGAAATGGCAATTCA
ATAGCTGATACAATTGCCAAGTTTGCTAGATGTGAGAAATGTTCTGTGGTCTGGAACCTGGAAGTTCCTGATTGGATTCATGTGTTGGTCCATAGTGACCGTTTATCTGG
TGCCCATGTGGCTTATTAA
Protein sequenceShow/hide protein sequence
MATEELINEWKRFSLREVEKETVFTVEAKNRYMAEKIGNRIGEFVEVDYDSDDLHWGNNMRIRESPPRNSNQNDDIDDNLAGKLSTYSEGDGNEDRNPDQMETIDPVAPR
SREDVEEDARLRKGKGKMVDLNESSMNDGEDGNAELNLLALVDEMVNADSPCGSTTDGISGGLNVPARKKSTWKRKVRMEQFNEPLINQDSMMSNLKKRKVMGELNNSGK
RSKSEGYRYLEVSADPETKCGDEIVCKIKQSCKFDGCLTVKSRGASGGLCLLWKDKDVVKVSLIKENGFWNNVGDCPLSYNLSNCADALGKWGRELFLNRKNKIRDCKKF
LKEAYDNLQNINFNLVHNIEFELDKLLEEEEIYWKQRSREEWLKWGDKNSRWFHRKATIRSKTNEIIGITDSNGVWTEDPIVIEREWNWDLLKEAVNKEDLEIISRIPIN
LASEDKFLWHYDKCGIYSVRSGYKIFIRNKINAGPSSNPMEKVWSNLWRLKIPAKVKHFCWKAINGFIPTRVNLHKRGIQTDLICPRCNKDIESSDHILIRCEKAKEIWE
LTLNHDLLQVEFNHNFADRWFELNSVLSFEDLQKFAVTCWAIWTERNNLIHDKPIPSPTMRSEWVKRYISDYLRANAAPASNSSCQPRNRASSEKWKPPPGGVWKINIDA
ACKLGEAGSGFGLVCRNSSGDLLGASAIFSENEFSPSMAELLAIVEGMKFAANLGLDNLILESDCLHAIRLINNEVEDRTELGTVVEEIKRRMMAFSDISFIHCTRNGNS
IADTIAKFARCEKCSVVWNLEVPDWIHVLVHSDRLSGAHVAY