; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021633 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021633
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:10256387..10258269
RNA-Seq ExpressionLag0021633
SyntenyLag0021633
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.4e-5127.77Show/hide
Query:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI
        G  +  +G RW++G GK+V + +D W+     ++P++         V D+++ +  W+ D +   F+  D++ IL +   +   EDE++W  D KG +++
Subjt:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI

Query:  KSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPIL
        KS Y LA  ++ +          + RLWK  W LD   KVKI +W+A  ++LPT EN+ K       +C  C+   E V H+   CK  R IW +   I+
Subjt:  KSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPIL

Query:  IPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVI---WKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRY-RSEKELSYLAKTKTPPSPRSWIPSVGT
         P+  H       +D++       S     EA  +I   W +W  RN+        DS          +  Y R  K  +           + W P    
Subjt:  IPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVI---WKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRY-RSEKELSYLAKTKTPPSPRSWIPSVGT

Query:  QWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISIL
          KLNVDAA        GLG I+RD++G ++  G K+      + + E+ AI  GL       +   +  +  ++VESD   V++L+   +  R+EI  +
Subjt:  QWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISIL

Query:  IDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDNFNFVFW
        + ++  ++K      FSF PR+CN  AH+LA+  + N +   W
Subjt:  IDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDNFNFVFW

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.6e-5231Show/hide
Query:  LLKECFNSVEKHF-----VGRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRN
        LL+   NS   +F      GR L ++G R +VG+G  +    DPWL     +KPL  +       V   +  DG+W    IS+SF + D D ILSMP  +
Subjt:  LLKECFNSVEKHF-----VGRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRN

Query:  MNSEDEIIWGKDSKGGFTIKSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEH
         N +D  +W  D +G ++++S Y L   +  + ++A+++ + T+  W SIW+L    K+KI +W++  + +PT +N+   G+     C +C   +E + H
Subjt:  MNSEDEIIWGKDSKGGFTIKSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEH

Query:  LFWNCKMVRNIWGILFPILIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKT
         F++CK  R IW  LFP L            F + W      L  K+   A+   W +W  RN L   + +   E   + +   +  +   +  +Y  +T
Subjt:  LFWNCKMVRNIWGILFPILIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKT

Query:  KTPPSP--RSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTV
        ++   P  + W PS     KLN DAA   AS+S   G IIRDS  SL+ A   +    L   + E   ILEGL      F A   F + E  VESD+   
Subjt:  KTPPSP--RSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTV

Query:  IKLIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDN--------FNFVFWAFD
        I+LIR E   R +    + EI   T      SFS   R CN  AH LA+  + +        FNF  W  D
Subjt:  IKLIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDN--------FNFVFWAFD

XP_023876230.1 uncharacterized protein LOC111988681 [Quercus suber]2.3e-4628.94Show/hide
Query:  VGSGKRVYIDEDPWLLNDCCWKPLNVH-QELKGKKVMDILNPDG-SWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTIKSAYHLATQM
        VG G+++ I +D WL N C ++ +    + L+G KV D+++ +   WKE LI   F+  D + ILS+        D +IW  +  G FT++S Y LA  +
Subjt:  VGSGKRVYIDEDPWLLNDCCWKPLNVH-QELKGKKVMDILNPDG-SWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTIKSAYHLATQM

Query:  DSHLS-AATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPILIPAISHCRS
         S+     +S+P   K+LW+ +W ++   K+K   WKA  ++L T EN++K  +  + +C  C K  E   HLFW C  V+ IW     ++IP     R 
Subjt:  DSHLS-AATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPILIPAISHCRS

Query:  WWKFKDYWDVALRCLSSKE--AGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRSWIPSVGTQWKLNVDAAWF
         W+F D      R   S       A  + W +W+ RN ++      D  + ++  L  +  +++E E   LA  K  P    WIP     +K+NVD A F
Subjt:  WWKFKDYWDVALRCLSSKE--AGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRSWIPSVGTQWKLNVDAAWF

Query:  DASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILIDEICEKTKRP
              G+G +I DS G +I A  +K    L    +E+ A+  G+       +   + G R+V +E D+  +  +++G  E  + +  +I       +  
Subjt:  DASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILIDEICEKTKRP

Query:  NAFSFSFCPRSCNFLAHSLARAVVDNFNFVFW
            FS   R  N  AH LA+   +  N+V W
Subjt:  NAFSFSFCPRSCNFLAHSLARAVVDNFNFVFW

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]3.4e-5029.25Show/hide
Query:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI
        GR +  +G RW++G+G+ V +  + W+     +KP++         V ++++    W+EDLI   F   D + I+ +P      ED++IW  D KG +++
Subjt:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI

Query:  KSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGIL-FPI
        KS Y +A ++      + S+    + LW+ IW+L    KVKI LW+A  D+LPT EN+ K  V    +C  C  H E V H    C   R IW       
Subjt:  KSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGIL-FPI

Query:  LIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTK-TPPSPRSWIPSVGTQW
         +  +  C   W  + +W    R  +  E  E + ++W +W+ RN+        +  + + +    +  ++  ++   + KTK      + W P      
Subjt:  LIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTK-TPPSPRSWIPSVGTQW

Query:  KLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILID
        K+NVDAA    +   GLG ++RDSDG+   A  K       + M E++A+  GL   EK   A+  FG    + ESD+  VI LI  +    +EI  LI 
Subjt:  KLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILID

Query:  EICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDNFNFVFW
        +I E  +    F     PR CN+ AHSLA+  +     V W
Subjt:  EICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDNFNFVFW

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]2.3e-4628.15Show/hide
Query:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI
        GR L I G RW+VG+G  + +  D WL     ++P++ H      KV +++   G W E LI ++F   +VDTILS+P      +D I+W     G +T+
Subjt:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI

Query:  KSAYHLATQM----DSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGIL
        KS   LA+++    +  +  + S  K+  ++W ++W+L    KVK+ LW+A    LP   N+ +  V  + LC  C + +E   H  W+C   + +W   
Subjt:  KSAYHLATQM----DSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGIL

Query:  FPILIPAISHCRSWWK---FKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYR-SEKELSYLAKTKTPPSPR-SWI
        F      ++     W+   F D +   ++  S +E    S + W LW+ RN+ K    + +S+  +      +  ++ +++    + K  T    +  W 
Subjt:  FPILIPAISHCRSWWK---FKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYR-SEKELSYLAKTKTPPSPR-SWI

Query:  PSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRS
        P    + KLN DAA         LG ++RD +G L  AG K    N  I  +E+ A+  GLL        Y E G + +VVESD+T VI  +   E D S
Subjt:  PSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRS

Query:  EISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLAR
            ++D+I        +  +    R  N  AH +A+
Subjt:  EISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLAR

TrEMBL top hitse value%identityAlignment
A0A2N9GM07 Reverse transcriptase domain-containing protein2.3e-4429.06Show/hide
Query:  EGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQEL-KGKKVMDILNP-DGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTIKSAY
        EG RW+VG G  + I ED WL     +K +    ++    +V  +++P    W+ + +   F+  D+  I+S+P  ++ +ED+ +W  ++ G FT+KSAY
Subjt:  EGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQEL-KGKKVMDILNP-DGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTIKSAY

Query:  HLATQMDSHLSAATSDPKDTKR-LWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIW--GILFPILI
        H+A  + S  +   S   D  R LWK+IW +    K++I  W+     LPT+E +++ G+  N  C  C +  E + H  W C  +R IW  G    IL 
Subjt:  HLATQMDSHLSAATSDPKDTKR-LWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIW--GILFPILI

Query:  PAISHCRSWWKFKDYWDVALRCLSSKEAGEASF---VIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRSWI-------
          +S  R         ++ L  +     G+  F   V W +W  RN+    R   D     Q+V       R++K    L +    PS R  I       
Subjt:  PAISHCRSWWKFKDYWDVALRCLSSKEAGEASF---VIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRSWI-------

Query:  PSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRS
        P  G  +K+N D A F      G+G +IRD  GS +     +T  +   +++E+ AI EG L          E G R +VVESDA  ++  I  E+ D  
Subjt:  PSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRS

Query:  EISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLAR
         I  +I  I    +  + +   + PR  N +AH LA+
Subjt:  EISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLAR

A0A2N9H727 Uncharacterized protein3.6e-4526.71Show/hide
Query:  KHSAASKQLLKECFNSVEKHFVGRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMD--ILNPDGSWKEDLISNSFISSDVDTILS
        +HS+ +    K   N+       R + I+G RW++G G    I  D W+ +    KPL     L     +   IL+  G+W   LI   F   D   I  
Subjt:  KHSAASKQLLKECFNSVEKHFVGRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMD--ILNPDGSWKEDLISNSFISSDVDTILS

Query:  MPKRNMNSEDEIIWGKDSKGGFTIKSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHK
        +   +    D++IW ++  G ++++SAY L  +  S      SD    KR WK +W +    KV+  LW+A ++ LPT+ N+ +  +    LC  C   +
Subjt:  MPKRNMNSEDEIIWGKDSKGGFTIKSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHK

Query:  EDVEHLFWNCKMVRNIWGILFPILIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELS
        EDV H+ W+C ++ ++W      ++      R  + F D         +++   E  F+ W LW +RNQ+         E      +H    + S +   
Subjt:  EDVEHLFWNCKMVRNIWGILFPILIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELS

Query:  YLAKTKTPPSPR-SWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESD
            T     PR  W PS  + +K+N DAA F      G+G IIRD  G  I A CK++     +   E+ A LE +       +   E G ++   E D
Subjt:  YLAKTKTPPSPR-SWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESD

Query:  ATTVIKLIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVD-NFNFVFWAFD
        A T+   +R +++  +    +ID++    +     SFS   R  N +AH LAR  ++   +F+ W  D
Subjt:  ATTVIKLIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVD-NFNFVFWAFD

A0A5B6WTC7 Reverse transcriptase1.4e-4629.7Show/hide
Query:  RSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDG-SWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI
        R L   G  W++G+GK V I  DPWLL +   + L  +  ++   V  +++    +WKED+I     S   + ILS+P    ++ED ++W  D+KG +T+
Subjt:  RSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDG-SWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTI

Query:  KSAYH-LATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPI
        KS Y  L T     +S    D    K  +K +W+L    K+K+H+W+   + +P L N+ K  + T  +C LC+   ED  HL W+C  VR +W +L   
Subjt:  KSAYH-LATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPI

Query:  LIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQF-IQDVLHAIGRYRSEKELSYLAKTKTPPSPRS--WIPSVGT
        L   + +  S  + KD        ++ ++    S  +W +W  RN+L     I +  +F +Q+ +  I RY  E ++S      TP + ++  W P    
Subjt:  LIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQF-IQDVLHAIGRYRSEKELSYLAKTKTPPSPRS--WIPSVGT

Query:  QWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISIL
          KLN DA++   S S     + R+ +G ++GA   +  +  +  + E+ A        E+      + G +++++E D+ TVIK +R  + DRS I  +
Subjt:  QWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISIL

Query:  IDEICEKTKRPNAFSFSFCPRSCNFLAHSLA
        I  IC+        SFSF PR  N  AH+LA
Subjt:  IDEICEKTKRPNAFSFSFCPRSCNFLAHSLA

A0A6J5UE59 Reverse transcriptase domain-containing protein1.2e-4528.99Show/hide
Query:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNP-DGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFT
        GR L   G RW++G G  V +  DPWL     ++ L+ H +L    V ++++P   +WK+D+++  F+  +   ILS+P       D++IW  +  G +T
Subjt:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNP-DGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFT

Query:  IKSAYHLATQMDSH------LSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIW
        ++S Y LA  +  +       +   S       +WKS+W +D+ PK+K  +W   S++L    N+ +  V     C  C    E   H+F+ C   R  W
Subjt:  IKSAYHLATQMDSH------LSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIW

Query:  GILFPILIPAISHCRSWWKFKDYWDVALRCLSSKE-AGEA----SFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEK-ELSYL----AKTKT
            P+ +           F   W   +  L+S E A EA     F +W++W+ RN         D  + +  +L  +  +R+ K +L  L     +  +
Subjt:  GILFPILIPAISHCRSWWKFKDYWDVALRCLSSKE-AGEA----SFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEK-ELSYL----AKTKT

Query:  PPSPRSWI-PSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKL
         P+P SW  PS+G   K+N DAAW      GG+GW+IRDS G L+ AG +         +  SSAI+  LL           F   +++VESD+   I +
Subjt:  PPSPRSWI-PSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKL

Query:  IRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLA
        + G     S++  ++ +I +     +  SF F PRSCN  AHS+A
Subjt:  IRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLA

A0A6J5WPU6 Reverse transcriptase domain-containing protein1.9e-4628.34Show/hide
Query:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNP-DGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFT
        GR L   G RW++G G  V +  DPWL     ++ L+ H +L    V ++++P   +WK+D+I+  F+  +   ILS+P       D++IW     G +T
Subjt:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNP-DGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFT

Query:  IKSAYHLATQMDSH------LSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIW
        ++S Y LA  +  +       +   S       +WKS+W +D+ PK+K  +W+  S++L    N+++  V     C  C    E   H+F+ C   R  W
Subjt:  IKSAYHLATQMDSH------LSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIW

Query:  GILFPILIPAISHCRSWWKFKDYWDVALRCLSSKE-AGEA----SFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKE--LSYLAKTKTPPS
            P+ +           F   W   +  L+S E A EA     F +W++W+ RN         D  + +  +L  +  +R+ K+       +  + P+
Subjt:  GILFPILIPAISHCRSWWKFKDYWDVALRCLSSKE-AGEA----SFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKE--LSYLAKTKTPPS

Query:  PRSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGE
        P SW        K+N DAAW      GG+GW+IRDS G L+ AG +         +  SSAI+  LL           F   +++VESD+   I ++ G 
Subjt:  PRSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGE

Query:  EEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLA
            S++  ++ +I +     +  SF F PRSCN  AHS+A
Subjt:  EEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.8e-2825.95Show/hide
Query:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGS---WKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGG
        G +L  +G R  +G G+ + I  D  +++    +PLN  +  K   + ++    GS   W +  IS     SD   I  +        D+IIW  ++ G 
Subjt:  GRSLFIEGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGS---WKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGG

Query:  FTIKSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILF
        +T++S Y L T   S    A + P  +  L   IW L  +PK+K  LW+A S  L T E +   G+  +  C  C +  E + H  + C      W +  
Subjt:  FTIKSAYHLATQMDSHLSAATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILF

Query:  PILIPAISHCRSWWKFKDYWDVALRCLSSKEAGEAS--------FVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRS
          LI      R+     D+ +     L+  +    S        ++IW++W+ RN +  ++     E   + VL A  +  +   L+     K  PSP  
Subjt:  PILIPAISHCRSWWKFKDYWDVALRCLSSKEAGEAS--------FVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRS

Query:  WIPSVGTQW--------KLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIK
         I     +W        K N DA +         GWIIR+  G+ I  G  K          E+ A+L  L QT    R Y      +V +E D  T+I 
Subjt:  WIPSVGTQW--------KLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIK

Query:  LIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLAR
        LI G     S ++  +++I     +  +  F F  R  N LAH LA+
Subjt:  LIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLAR

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-0729.14Show/hide
Query:  GTQW-KLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEI
        G +W K N D +        GL WIIR+S G+ +  GC K      IK  E +A++  +       +   + G R V  E D  TV +LIR +E +   +
Subjt:  GTQW-KLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEI

Query:  SILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLA-RAVVDNFNFVFWAFDP
           ++ I + +K      F+F  R  N     LA +AV ++ N   + F P
Subjt:  SILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLA-RAVVDNFNFVFWAFDP

AT3G25270.1 Ribonuclease H-like superfamily protein8.2e-1824.28Show/hide
Query:  PKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPILIPAISHCRSWWKFKDYWDVA
        P     +   IW+L + PK+K  LWK  S  L T +N+K+  +  +  C  C +  E  +HLF++C   + +W       IP      +    +   ++ 
Subjt:  PKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPILIPAISHCRSWWKFKDYWDVA

Query:  L-RCLSSKEA---GEASFVIWKLWQKRNQLK-QSRCI----------------PDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRSWIPSVGTQWK
        L  CL++++      A +++W+LW+ RNQL  Q + I                 D+  ++Q +   +  + S  +   +A+TK    P +WI       K
Subjt:  L-RCLSSKEA---GEASFVIWKLWQKRNQLK-QSRCI----------------PDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRSWIPSVGTQWK

Query:  LNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILIDE
         N D A+   + +   GW++RD +G  +G+G        +    E  A++  +        A+ + G R+V+ E D+  V +L+  E+ +    +  I E
Subjt:  LNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILIDE

Query:  ICEKTKRPNAFSFSFCPRSCNFLAHSLAR-AVVDNFNFVFWAFDPS
             KR     F + PR+ N  A  LA+  +  N +F F  + P+
Subjt:  ICEKTKRPNAFSFSFCPRSCNFLAHSLAR-AVVDNFNFVFWAFDPS

AT4G29090.1 Ribonuclease H-like superfamily protein3.9e-2825.16Show/hide
Query:  EGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVH----QELKG----KKVMDILNPDG-SWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGG
        +G R  VG+G+ + I    WL +      L +     QE        KV D+++  G  W++D+I   F   +   I  +        D   W   S G 
Subjt:  EGYRWKVGSGKRVYIDEDPWLLNDCCWKPLNVH----QELKG----KKVMDILNPDG-SWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGG

Query:  FTIKSAYHLATQMDSHLSA--ATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGI
        +T+KS Y + TQ+ +  S+    S+P     +++ IW+  + PK++  LWK  S+ LP    +    +     C  C   KE V HL + C   R  W I
Subjt:  FTIKSAYHLATQMDSHLSA--ATSDPKDTKRLWKSIWQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGI

Query:  LFPILIPAISHCRSWWKFKDYWDVAL---RCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAI------GRYRSEKELSYLAKTKTPPS
           I IP             YW   L        K +    +++W+LW+ RN+L         E   Q+VL          R R+E E           S
Subjt:  LFPILIPAISHCRSWWKFKDYWDVAL---RCLSSKEAGEASFVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAI------GRYRSEKELSYLAKTKTPPS

Query:  PRSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGE
           W P      K N DA W   +   G+GW++R+  G +   G +   K   +   E  A+   +L   +       F    V+ ESD+  +I+++  +
Subjt:  PRSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGE

Query:  EEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDNFNFVFWAFDPSLCS
        E   S +   I ++     +     F F PR  N LA  +AR  +   N     +DP L S
Subjt:  EEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDNFNFVFWAFDPSLCS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0821.89Show/hide
Query:  FVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSE---KELSYLAKTKTPPSPRSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAG
        +++W++W+  N L  +      +  ++  L+    +       E     +   P     W P    + K N DA+  + ++  GLGWI+R+S G++I  G
Subjt:  FVIWKLWQKRNQLKQSRCIPDSEQFIQDVLHAIGRYRSE---KELSYLAKTKTPPSPRSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAG

Query:  CKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAV
          K    +  +  E S ++  +       +A   FG+++V+ E D  T+ ++I   +     +   +D I        +  FSF  R  N  A  LA+  
Subjt:  CKKTHKNLEIKMLESSAILEGLLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAV

Query:  V
        +
Subjt:  V


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGAAAGCATGACGAAGCAATGTCGCGTGGAGGTAGAAGGCCTACGGGTCGTGAACTTCTTTTCCCGGAGAAGAAGCAATGACGGGCTCAACCCTGGACAGGCGGT
GGAAACTACCAAGCTGGAGTACGGTAGGGGCAGAGGGAATTTCCGGTGGAGCGGTGAAATGCGTAGAGATCGGAAAGAACACCAACGACGAAAGCACTCTGCTGCCTCAA
AGCAACTTCTAAAGGAATGCTTCAATAGTGTGGAAAAGCATTTTGTGGGAAGATCTCTATTTATCGAAGGATACAGATGGAAAGTGGGCAGCGGAAAGAGAGTCTATATT
GATGAAGATCCCTGGCTGTTGAATGATTGTTGCTGGAAACCTCTGAATGTCCATCAGGAACTTAAAGGAAAGAAAGTTATGGACATTCTGAACCCAGATGGATCGTGGAA
AGAAGATCTGATTTCAAATTCCTTCATTTCGAGCGATGTGGATACTATTTTAAGCATGCCAAAGAGAAATATGAACTCTGAAGATGAAATTATTTGGGGGAAAGACTCGA
AAGGAGGCTTCACGATAAAAAGTGCTTATCATTTAGCCACCCAAATGGATTCCCATCTTTCAGCGGCTACCTCTGATCCTAAGGACACTAAAAGACTTTGGAAGTCTATT
TGGCAACTTGATAGCATACCAAAAGTGAAAATCCACCTTTGGAAAGCTACGAGCGATGTCCTCCCGACTCTGGAAAACATTAAGAAAATGGGAGTTTTTACTAACGAGTT
GTGTTTTCTTTGCAGGAAACATAAGGAGGATGTAGAGCATCTGTTTTGGAATTGCAAAATGGTAAGAAATATTTGGGGCATTCTATTCCCAATTCTTATTCCGGCTATTT
CGCATTGCAGAAGCTGGTGGAAATTTAAAGACTATTGGGATGTCGCATTGAGGTGTCTGAGTAGCAAAGAGGCTGGGGAGGCGAGCTTTGTGATCTGGAAGCTTTGGCAA
AAAAGAAACCAACTAAAGCAGAGCAGATGTATTCCAGACTCAGAGCAGTTCATTCAAGACGTGCTACATGCCATTGGAAGATATCGAAGTGAGAAAGAGTTGTCGTACCT
GGCGAAAACAAAGACCCCTCCGAGTCCCAGAAGTTGGATCCCTTCGGTGGGTACTCAATGGAAATTGAATGTGGACGCCGCTTGGTTTGACGCTTCTAGCTCTGGAGGGT
TGGGGTGGATAATCCGAGACTCAGACGGTTCTTTGATTGGAGCTGGATGCAAGAAAACCCACAAGAATTTAGAGATTAAAATGTTGGAATCTTCAGCAATTCTCGAGGGC
CTCCTTCAAACCGAGAAGTGCTTTAGGGCTTACCCGGAGTTTGGCAACCGTGAGGTGGTTGTTGAGTCTGATGCGACGACAGTAATCAAGTTGATCAGGGGAGAAGAGGA
AGATCGTTCTGAGATTTCCATTCTGATCGACGAAATTTGTGAGAAGACGAAGAGGCCCAATGCTTTCTCCTTCTCCTTTTGCCCGCGTTCTTGTAATTTTTTGGCGCACT
CTCTAGCGCGCGCGGTAGTTGATAACTTTAATTTTGTTTTTTGGGCTTTTGATCCTTCTCTTTGCTCGAGATTGGATGGTTTCTTTGGTGAAAGGCGATTTTTGCCGCCT
TGGTTCGCCTCCATTTTGGAGGTGGTTGACTCTGTACCTAACTATAATAAAGATAGTAATTATCGGTTCTTTTCAGCCTTCTTTAACCATGCCTACATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGAAAGCATGACGAAGCAATGTCGCGTGGAGGTAGAAGGCCTACGGGTCGTGAACTTCTTTTCCCGGAGAAGAAGCAATGACGGGCTCAACCCTGGACAGGCGGT
GGAAACTACCAAGCTGGAGTACGGTAGGGGCAGAGGGAATTTCCGGTGGAGCGGTGAAATGCGTAGAGATCGGAAAGAACACCAACGACGAAAGCACTCTGCTGCCTCAA
AGCAACTTCTAAAGGAATGCTTCAATAGTGTGGAAAAGCATTTTGTGGGAAGATCTCTATTTATCGAAGGATACAGATGGAAAGTGGGCAGCGGAAAGAGAGTCTATATT
GATGAAGATCCCTGGCTGTTGAATGATTGTTGCTGGAAACCTCTGAATGTCCATCAGGAACTTAAAGGAAAGAAAGTTATGGACATTCTGAACCCAGATGGATCGTGGAA
AGAAGATCTGATTTCAAATTCCTTCATTTCGAGCGATGTGGATACTATTTTAAGCATGCCAAAGAGAAATATGAACTCTGAAGATGAAATTATTTGGGGGAAAGACTCGA
AAGGAGGCTTCACGATAAAAAGTGCTTATCATTTAGCCACCCAAATGGATTCCCATCTTTCAGCGGCTACCTCTGATCCTAAGGACACTAAAAGACTTTGGAAGTCTATT
TGGCAACTTGATAGCATACCAAAAGTGAAAATCCACCTTTGGAAAGCTACGAGCGATGTCCTCCCGACTCTGGAAAACATTAAGAAAATGGGAGTTTTTACTAACGAGTT
GTGTTTTCTTTGCAGGAAACATAAGGAGGATGTAGAGCATCTGTTTTGGAATTGCAAAATGGTAAGAAATATTTGGGGCATTCTATTCCCAATTCTTATTCCGGCTATTT
CGCATTGCAGAAGCTGGTGGAAATTTAAAGACTATTGGGATGTCGCATTGAGGTGTCTGAGTAGCAAAGAGGCTGGGGAGGCGAGCTTTGTGATCTGGAAGCTTTGGCAA
AAAAGAAACCAACTAAAGCAGAGCAGATGTATTCCAGACTCAGAGCAGTTCATTCAAGACGTGCTACATGCCATTGGAAGATATCGAAGTGAGAAAGAGTTGTCGTACCT
GGCGAAAACAAAGACCCCTCCGAGTCCCAGAAGTTGGATCCCTTCGGTGGGTACTCAATGGAAATTGAATGTGGACGCCGCTTGGTTTGACGCTTCTAGCTCTGGAGGGT
TGGGGTGGATAATCCGAGACTCAGACGGTTCTTTGATTGGAGCTGGATGCAAGAAAACCCACAAGAATTTAGAGATTAAAATGTTGGAATCTTCAGCAATTCTCGAGGGC
CTCCTTCAAACCGAGAAGTGCTTTAGGGCTTACCCGGAGTTTGGCAACCGTGAGGTGGTTGTTGAGTCTGATGCGACGACAGTAATCAAGTTGATCAGGGGAGAAGAGGA
AGATCGTTCTGAGATTTCCATTCTGATCGACGAAATTTGTGAGAAGACGAAGAGGCCCAATGCTTTCTCCTTCTCCTTTTGCCCGCGTTCTTGTAATTTTTTGGCGCACT
CTCTAGCGCGCGCGGTAGTTGATAACTTTAATTTTGTTTTTTGGGCTTTTGATCCTTCTCTTTGCTCGAGATTGGATGGTTTCTTTGGTGAAAGGCGATTTTTGCCGCCT
TGGTTCGCCTCCATTTTGGAGGTGGTTGACTCTGTACCTAACTATAATAAAGATAGTAATTATCGGTTCTTTTCAGCCTTCTTTAACCATGCCTACATATAG
Protein sequenceShow/hide protein sequence
MGESMTKQCRVEVEGLRVVNFFSRRRSNDGLNPGQAVETTKLEYGRGRGNFRWSGEMRRDRKEHQRRKHSAASKQLLKECFNSVEKHFVGRSLFIEGYRWKVGSGKRVYI
DEDPWLLNDCCWKPLNVHQELKGKKVMDILNPDGSWKEDLISNSFISSDVDTILSMPKRNMNSEDEIIWGKDSKGGFTIKSAYHLATQMDSHLSAATSDPKDTKRLWKSI
WQLDSIPKVKIHLWKATSDVLPTLENIKKMGVFTNELCFLCRKHKEDVEHLFWNCKMVRNIWGILFPILIPAISHCRSWWKFKDYWDVALRCLSSKEAGEASFVIWKLWQ
KRNQLKQSRCIPDSEQFIQDVLHAIGRYRSEKELSYLAKTKTPPSPRSWIPSVGTQWKLNVDAAWFDASSSGGLGWIIRDSDGSLIGAGCKKTHKNLEIKMLESSAILEG
LLQTEKCFRAYPEFGNREVVVESDATTVIKLIRGEEEDRSEISILIDEICEKTKRPNAFSFSFCPRSCNFLAHSLARAVVDNFNFVFWAFDPSLCSRLDGFFGERRFLPP
WFASILEVVDSVPNYNKDSNYRFFSAFFNHAYI