CuGenDBv2

Lsi02G006960 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G006960
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: RNase H domain-containing protein
Genome location: chr02:6211608..6212216
RNA-Seq Expression: Lsi02G006960
Synteny: Lsi02G006960
Gene Ontology terms:
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
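The genome location field can be turned into a gene span with a little arithmetic. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper; the format is inferred from this record's
    'Genome location' field.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr02:6211608..6212216")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, span)
```

Under that assumption the span is 609 bp, which equals 3 x 203 and is consistent with the 202-residue protein plus one stop codon listed in the sequence section of this record.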


Homology
GenBank top hits (e value, % identity, alignment)
KAA0056936.1 putative ribonuclease H protein [Cucumis melo var. makuwa] (e value: 1.3e-84; identity: 80.5%)
Query:  ILKCPNYHIIRS------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLE
        ILK  NY  IRS      RQ H Q  PIPVAWRRPEIGW KLNFDGSSKG KA G ASIGGVLRDH+AQFLLGYAE IGRANS MAELTALTKGLELVLE
Subjt:  ILKCPNYHIIRS------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLE

Query:  NGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK
        NGWKDVWVEGD +GLVEI+ +NKEVKCVE RS+FRYIKSLILD +NCKVSHIYREGN+VA+ FASIGHRYKKL+IWR++PPLETLDMMR DAEGKI FR+
Subjt:  NGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK

KAE8652698.1 hypothetical protein Csa_013405 [Cucumis sativus] (e value: 1.0e-78; identity: 74.26%)
Query:  ILKCPNYHIIRS-------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVL
        I K  NY II S       R+ H +  PIPVAW RPE GW KLNFDGSSKG+   G ASIGGVLRDH+AQFLLGYAE IGRA S+MAEL ALTKGLELVL
Subjt:  ILKCPNYHIIRS-------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVL

Query:  ENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFR
        ENGWKDVWVEGDA+GLVEI+ EN+EVKC+EARS+ R+IKSL+LDF+NCKVSHIYREGN+VA+ FASIGHR KKL+IWR++PPLETLDMMR DAEGKI FR
Subjt:  ENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFR

Query:  KK
        ++
Subjt:  KK

KAG5516709.1 hypothetical protein RHGRI_037449 [Rhododendron griersonianum] (e value: 2.5e-61; identity: 55.88%)
Query:  MFSLKTILKCPN-----YHIIRSRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLEL
        M ++  I++C       + +   R WH +PI VAW +PEIGW KLNFDGS KGK   G+ASIGGV R+H+A+FLLGYAE IGR  S +AEL AL +GLEL
Subjt:  MFSLKTILKCPN-----YHIIRSRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLEL

Query:  VLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKIN
        VLENGW+DVW+EGD++ LVEII   K VKC EA+ H  +I  +I + NNC +SH+YREGN+ A+ FA +GH+ KK Q+WR  PP + L +M  DAEGKI 
Subjt:  VLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKIN

Query:  FRKK
         RK+
Subjt:  FRKK

KAG6584039.1 putative ribonuclease H protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.7e-92; identity: 79.81%)
Query:  MFSLKTILKCPNYHIIR------SRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLE
        MFSL T+LKCPNY I R      +RQ H QPIPVAW+RP+IGW KLNFDGSS+GK + GRASIGGVLRDH+AQFLLGYAEPIGRANS +AELTAL+KGLE
Subjt:  MFSLKTILKCPNYHIIR------SRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLE

Query:  LVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKI
        LVLENGWKDVW+EGDA+GL+EIIV  K+ KCVEARS FRYI SLILD NNCKVSHIYREGNRVAN FASIGHRYKKL++WRDIPPLETLDMMR+DAEGKI
Subjt:  LVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKI

Query:  NFRKKNTK
        +FR  NT+
Subjt:  NFRKKNTK

XP_038895327.1 uncharacterized protein LOC120083578 [Benincasa hispida] (e value: 1.1e-80; identity: 76.59%)
Query:  MFSLKTILKCPNYHIIRS---RQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVL
        MF+LKTILKC NY IIRS   RQWH QPIPVAW RPEIGW KLNFDGSSKGKK  GRASIGGVLRDH+AQFLLGYAEPIGRANS MAELTALTKGLELVL
Subjt:  MFSLKTILKCPNYHIIRS---RQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVL

Query:  ENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFR
        ENGWKDVWVEGDA+GLVEIIVENKEVKCVEARSHFR                           FASIGHRYKKL+IWR+IPPLE LDMMRQDAEGKI FR
Subjt:  ENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFR

Query:  KKNTK
        + NT+
Subjt:  KKNTK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LUF5 RNase H domain-containing protein (e value: 5.0e-79; identity: 74.26%)
Query:  ILKCPNYHIIRS-------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVL
        I K  NY II S       R+ H +  PIPVAW RPE GW KLNFDGSSKG+   G ASIGGVLRDH+AQFLLGYAE IGRA S+MAEL ALTKGLELVL
Subjt:  ILKCPNYHIIRS-------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVL

Query:  ENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFR
        ENGWKDVWVEGDA+GLVEI+ EN+EVKC+EARS+ R+IKSL+LDF+NCKVSHIYREGN+VA+ FASIGHR KKL+IWR++PPLETLDMMR DAEGKI FR
Subjt:  ENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFR

Query:  KK
        ++
Subjt:  KK

A0A2N9IF10 RNase H domain-containing protein (e value: 3.0e-60; identity: 55.12%)
Query:  MFSLKTILKCPNYHIIRS------RQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLE
        M S+  IL+C +  I+++      R WH + IPV W +PEIGW KLNFDGSSKG+  A +ASIGGV R+H+A+FLLGYAE IG+ANS +AEL AL +GLE
Subjt:  MFSLKTILKCPNYHIIRS------RQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLE

Query:  LVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKI
        LVLENGW DVW+EGDA+ L++IIV+ + VKC+E + H   I  ++L+ +N  V+H+YREGNR A+ FA IGH  K   IWR+ PP E L +M++DA+GKI
Subjt:  LVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKI

Query:  NFRKK
          R++
Subjt:  NFRKK

A0A2P6Q3E3 Putative ribonuclease H-like domain-containing protein (e value: 2.0e-59; identity: 62.22%)
Query:  RQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIV
        R  H + IPVAW +P+IGW KLNFDGSSKGK  A +ASIGG+ R+H+A+FLLGYAE IGRANS +AEL AL +GLELVLENGW DVW+EGDA+ L+ II 
Subjt:  RQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIV

Query:  ENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK
        + K+V+C EA+ H   I S+I + NNC ++HIYREGNR A+ FA +GH Y+K QIW  IPP + L +M +DAEGK+ FRK
Subjt:  ENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK

A0A5D3DS33 Putative ribonuclease H protein (e value: 6.1e-85; identity: 80.5%)
Query:  ILKCPNYHIIRS------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLE
        ILK  NY  IRS      RQ H Q  PIPVAWRRPEIGW KLNFDGSSKG KA G ASIGGVLRDH+AQFLLGYAE IGRANS MAELTALTKGLELVLE
Subjt:  ILKCPNYHIIRS------RQWHNQ--PIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLE

Query:  NGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK
        NGWKDVWVEGD +GLVEI+ +NKEVKCVE RS+FRYIKSLILD +NCKVSHIYREGN+VA+ FASIGHRYKKL+IWR++PPLETLDMMR DAEGKI FR+
Subjt:  NGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK

A0A6A4P089 Putative ribonuclease H (e value: 8.8e-60; identity: 57.77%)
Query:  MFSLKTILKC------PNYHIIRSRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLE
        M  L  IL+C      P   +I  R  H +PIPVAW++P IGW KLNFDGS KGK  +G+ASIGGV+R+H A+FLLGYAE IG+ANS +AEL AL +GLE
Subjt:  MFSLKTILKC------PNYHIIRSRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLE

Query:  LVLENGW-KDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGK
        LVLENGW  D+W EGDA+ LV+IIV+ ++V+C+E R H  +I S++  FNNC VSHIYREGNR A+ FA +GH   +  IW  IPP E L +M+QDA+GK
Subjt:  LVLENGW-KDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGK

Query:  INFRKK
        I  R+K
Subjt:  INFRKK

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 1.2e-10; identity: 28.57%)
Query:  VAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEI----IVENKEV
        + W  P +GW K+N DG+S+G    G AS GGVLRD    +  G++  IGR ++  AEL  +  GL    E     V +E D++ +V      I ++  +
Subjt:  VAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEI----IVENKEV

Query:  KCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK
          +    H    K  ++     ++ H+YRE NR+A+  A+            D+ P     ++R+D  G    R+
Subjt:  KCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRK

Arabidopsis top hits (e value, % identity, alignment)
AT1G04625.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.7e-08; identity: 29.08%)
Query:  QPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSN--MAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIVENK
        +P    W  P IGW K N+DG+         +  G +LRD    F LG A  IG   +N   +EL AL   ++     G++ ++ EGD + + EI+  N 
Subjt:  QPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSN--MAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIVENK

Query:  EVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFA
                +  R I +    F  CK +   R  N  A++ A
Subjt:  EVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFA

AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 4.5e-08; identity: 36.45%)
Query:  RSRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRA--NSNMAELTALTKGLELVLENGWKDVWVEGDAQGLV
        RSR    +     WRRPE GW K NFDGS        +A  G V+RD    +LL   + IGR   N+  +E+ AL   ++    +G+K V  EGD + L 
Subjt:  RSRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRA--NSNMAELTALTKGLELVLENGWKDVWVEGDAQGLV

Query:  EIIVENK
        ++I  +K
Subjt:  EIIVENK

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.4e-09; identity: 32.26%)
Query:  LNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLI
        + FDG+SKG    G+A  G VLR  +   L    E +G A +N+AE  AL  GL   L+ G+K+V V GD+  +   +    +    +     +  K L+
Subjt:  LNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLI

Query:  LDFNNCKVSHIYREGNRVANSFAS
          F    + HI RE N  A+  A+
Subjt:  LDFNNCKVSHIYREGNRVANSFAS

AT3G01410.2 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.4e-09; identity: 32.26%)
Query:  LNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLI
        + FDG+SKG    G+A  G VLR  +   L    E +G A +N+AE  AL  GL   L+ G+K+V V GD+  +   +    +    +     +  K L+
Subjt:  LNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLVEIIVENKEVKCVEARSHFRYIKSLI

Query:  LDFNNCKVSHIYREGNRVANSFAS
          F    + HI RE N  A+  A+
Subjt:  LDFNNCKVSHIYREGNRVANSFAS

AT5G42905.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 4.1e-09; identity: 39.08%)
Query:  VAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLV
        + W  P +GW K+N DG+S+G    G AS GGVLRD E  +  G++  IGR ++  AEL  +  GL    E     V +E D++ +V
Subjt:  VAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDAQGLV
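The % identity values in the hit tables can be cross-checked against the alignment blocks. The sketch below is a hedged illustration, not the database's own method: it assumes the unlabeled middle line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), and that identity is the number of identical columns divided by the alignment length. The function name `percent_identity` is hypothetical. The strings are copied from the AT5G42905.1 alignment directly above.

```python
def percent_identity(query, midline):
    """Return (identical_columns, percent) for one alignment block.

    Assumption: letters in the middle line mark identical residues, and
    the aligned query line (gap characters included) gives the alignment
    length.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return identical, 100.0 * identical / len(query)


# Query and middle line of the AT5G42905.1 hit, as shown in this record.
QUERY = ("VAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELT"
         "ALTKGLELVLENGWKDVWVEGDAQGLV")
MIDLINE = ("+ W  P +GW K+N DG+S+G    G AS GGVLRD E  +  G++  IGR ++  AEL  "
           "+  GL    E     V +E D++ +V")

identical, pct = percent_identity(QUERY, MIDLINE)
print(identical, len(QUERY), round(pct, 2))
```

For this block the count is 34 identical columns out of 87, which rounds to the 39.08 reported for the hit. Alignments split across several Query/Subjt blocks would need their lines concatenated before counting.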


Sequences
CDS sequence
ATGTTTTCACTAAAAACCATTCTAAAATGTCCCAATTACCATATTATCCGATCAAGGCAATGGCACAATCAACCGATTCCAGTGGCATGGAGAAGGCCCGAAATTGGATG
GAAAAAGCTGAATTTCGATGGGTCTTCAAAAGGGAAAAAGGCAGCCGGACGGGCCAGCATTGGGGGCGTTTTAAGGGACCACGAGGCCCAGTTTTTGTTAGGATATGCGG
AGCCCATAGGGCGAGCCAATAGCAACATGGCTGAACTCACAGCACTGACCAAAGGCTTGGAATTGGTGCTTGAAAATGGGTGGAAGGACGTGTGGGTTGAAGGAGATGCG
CAGGGATTGGTGGAAATTATAGTTGAAAATAAAGAAGTTAAATGTGTGGAAGCTCGAAGTCATTTTAGGTACATAAAATCTTTGATACTTGATTTTAATAATTGTAAGGT
TAGTCATATTTATAGGGAAGGTAATAGAGTTGCTAATAGTTTTGCTTCCATTGGCCATCGTTATAAGAAGCTTCAAATTTGGAGGGATATTCCTCCATTGGAGACTTTGG
ATATGATGCGTCAAGATGCTGAGGGGAAGATTAATTTTAGGAAAAAAAATACAAAGTAG
mRNA sequence
ATGTTTTCACTAAAAACCATTCTAAAATGTCCCAATTACCATATTATCCGATCAAGGCAATGGCACAATCAACCGATTCCAGTGGCATGGAGAAGGCCCGAAATTGGATG
GAAAAAGCTGAATTTCGATGGGTCTTCAAAAGGGAAAAAGGCAGCCGGACGGGCCAGCATTGGGGGCGTTTTAAGGGACCACGAGGCCCAGTTTTTGTTAGGATATGCGG
AGCCCATAGGGCGAGCCAATAGCAACATGGCTGAACTCACAGCACTGACCAAAGGCTTGGAATTGGTGCTTGAAAATGGGTGGAAGGACGTGTGGGTTGAAGGAGATGCG
CAGGGATTGGTGGAAATTATAGTTGAAAATAAAGAAGTTAAATGTGTGGAAGCTCGAAGTCATTTTAGGTACATAAAATCTTTGATACTTGATTTTAATAATTGTAAGGT
TAGTCATATTTATAGGGAAGGTAATAGAGTTGCTAATAGTTTTGCTTCCATTGGCCATCGTTATAAGAAGCTTCAAATTTGGAGGGATATTCCTCCATTGGAGACTTTGG
ATATGATGCGTCAAGATGCTGAGGGGAAGATTAATTTTAGGAAAAAAAATACAAAGTAG
Protein sequence
MFSLKTILKCPNYHIIRSRQWHNQPIPVAWRRPEIGWKKLNFDGSSKGKKAAGRASIGGVLRDHEAQFLLGYAEPIGRANSNMAELTALTKGLELVLENGWKDVWVEGDA
QGLVEIIVENKEVKCVEARSHFRYIKSLILDFNNCKVSHIYREGNRVANSFASIGHRYKKLQIWRDIPPLETLDMMRQDAEGKINFRKKNTK
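
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling. Only the first 30 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGTTTTCACTAAAAACCATTCTAAAATGT"
print(translate(cds_start))  # MFSLKTILKC
```

The output matches the first ten residues of the protein sequence. Running the same function on the full CDS, with its lines joined, should reproduce the whole protein, ending at the terminal TAG stop codon.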