CuGenDBv2

Lcy03g006860 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy03g006860
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr03:36061807..36062908
RNA-Seq Expression: Lcy03g006860
Synteny: Lcy03g006860
Gene Ontology terms: NA
InterPro domains: NA
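
The genome location in the summary is a chromosome name followed by a `start..end` range. As a rough sketch, the span of the locus can be computed from that string. This assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'Chr03:36061807..36062908' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both endpoints count.
    return abs(end - start) + 1


chrom, start, end = parse_location("Chr03:36061807..36062908")
print(chrom, span_length(start, end))  # Chr03 1102
```

Under that assumption the locus spans 1102 bp on Chr03.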


Homology
GenBank top hits | e value | % identity | Alignment
RVW53981.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | 1.0e-33 | 33.56
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        R+FLW     GK  HL+ WEVV +P  +G LG       N ALL KWLWRFP E S LW+K+I S YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFWS-----------------FPSFSFGFRRDL-SDREMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI
         V  VV +G    FW                  F SF+    + L S +  + V AL  L+             RR ++   P   + LC+   E +DH+
Subjt:  LVHHVVRDGNDTYFWS-----------------FPSFSFGFRRDL-SDREMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI

Query:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
           C      W   F+  G   +   +  D+ V+  + L N     +G+ LWQ     LIW +W +RNNR+F    R+   VW L RF++SL
Subjt:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

RVW69333.1 putative ribonuclease H protein [Vitis vinifera] | 9.2e-35 | 33.9
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        R+FLW     GK  HL+ WEVV +P  +G LG       N ALL KWLWRFP E S LW+K+I S YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFW-----------------SFPSFSFGFRRDLSD-REMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI
         VH VV +G    FW                 SF SF+    +   + +  + V AL  L+             RR ++   P   + LC+   E +DH+
Subjt:  LVHHVVRDGNDTYFW-----------------SFPSFSFGFRRDLSD-REMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI

Query:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
           C      W   F+  G   +   +  D+ V+  + L NS    +G+ LWQ     LIW +W +RNNR+F    R+   VW L RF++SL
Subjt:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

RVW96808.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | 5.4e-35 | 30.84
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        RNFLW     GK  HLV WEVV +P   G LG   +   N ALL KWLWR P E S LW+K+IVS YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--
         V  VV +G    FW                                  SFP +++  FRR+L+D E+  +  L+S +    F     D R WS   S  
Subjt:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--

Query:  ----------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCA
                                      LC+   E +DH+   C      W+  F   G   +   ++ D+ V+  + L NS    +G+ LWQ     
Subjt:  ----------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCA

Query:  LIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
        L+W +W +RNNR+F    RS   +W L  F+++L
Subjt:  LIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

RVX02903.1 Actin-related protein 7 [Vitis vinifera] | 5.0e-33 | 28.57
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        RNFLW     GK  HLV WEVV +P  LG LG   +   N ALL KWLWRFP E S LW+K+IVS YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--
         V  VV +G    FW                                  SFP +++  FRR+L+D E+  +  L+S +    F     D R WS   S  
Subjt:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--

Query:  -----------------------------------------------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQ
                                                                           LC+   E +DH+   C      W+  F   G  
Subjt:  -----------------------------------------------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQ

Query:  RI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
         +   ++ D+ V+  + L NS    +G+ LWQ     L+W +W +RNNR+F+   RS   +W L  F++SL
Subjt:  RI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

XP_022151711.1 uncharacterized protein LOC111019624 [Momordica charantia] | 2.8e-39 | 35.6
Query:  MRNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLL-GGVGGTFRNPWKYVVMELPSFS
        MR+FLW+    G GAHLV+W+ V KP+  G LG+ NLR  N+A LAKWLWRF  E  +LW KIIVSKY  HP +W+L GG   +  NPWK +    P FS
Subjt:  MRNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLL-GGVGGTFRNPWKYVVMELPSFS

Query:  GLVHHVVRDGNDTYFW------------SFP------------------------SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDP-
          +   V DG + YFW            +FP                        S S G  R L+D E  ++ ALL L+       GR D R W P+P 
Subjt:  GLVHHVVRDGNDTYFW------------SFP------------------------SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDP-

Query:  ---------SVSLCRKAEEDLDHILSSCD------FARSSWDFFFDA----FGRQRINYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNN
                  V +   +   L  + SS        +  S W          FG QR+   D   ++     N     +GRFLWQA   A +W +W +RNN
Subjt:  ---------SVSLCRKAEEDLDHILSSCD------FARSSWDFFFDA----FGRQRINYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNN

Query:  RVFRGIVRS
        R+FRG+ +S
Subjt:  RVFRGIVRS

TrEMBL top hits | e value | % identity | Alignment
A0A438F1Z0 LINE-1 retrotransposable element ORF2 protein | 4.9e-34 | 33.56
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        R+FLW     GK  HL+ WEVV +P  +G LG       N ALL KWLWRFP E S LW+K+I S YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFWS-----------------FPSFSFGFRRDL-SDREMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI
         V  VV +G    FW                  F SF+    + L S +  + V AL  L+             RR ++   P   + LC+   E +DH+
Subjt:  LVHHVVRDGNDTYFWS-----------------FPSFSFGFRRDL-SDREMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI

Query:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
           C      W   F+  G   +   +  D+ V+  + L N     +G+ LWQ     LIW +W +RNNR+F    R+   VW L RF++SL
Subjt:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

A0A438GAU5 Putative ribonuclease H protein | 4.4e-35 | 33.9
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        R+FLW     GK  HL+ WEVV +P  +G LG       N ALL KWLWRFP E S LW+K+I S YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFW-----------------SFPSFSFGFRRDLSD-REMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI
         VH VV +G    FW                 SF SF+    +   + +  + V AL  L+             RR ++   P   + LC+   E +DH+
Subjt:  LVHHVVRDGNDTYFW-----------------SFPSFSFGFRRDLSD-REMTDVMALLSLIEGFDFRLG-----RRDFRCWSPDPSVSLCRKAEEDLDHI

Query:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
           C      W   F+  G   +   +  D+ V+  + L NS    +G+ LWQ     LIW +W +RNNR+F    R+   VW L RF++SL
Subjt:  LSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

A0A438IJB1 Transposon TX1 uncharacterized 149 kDa protein | 2.6e-35 | 30.84
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        RNFLW     GK  HLV WEVV +P   G LG   +   N ALL KWLWR P E S LW+K+IVS YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--
         V  VV +G    FW                                  SFP +++  FRR+L+D E+  +  L+S +    F     D R WS   S  
Subjt:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--

Query:  ----------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCA
                                      LC+   E +DH+   C      W+  F   G   +   ++ D+ V+  + L NS    +G+ LWQ     
Subjt:  ----------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQRI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCA

Query:  LIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
        L+W +W +RNNR+F    RS   +W L  F+++L
Subjt:  LIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

A0A438J1R4 Actin-related protein 7 | 2.4e-33 | 28.57
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG
        RNFLW     GK  HLV WEVV +P  LG LG   +   N ALL KWLWRFP E S LW+K+IVS YG HP  W     V  + R PWK +      FS 
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLG-GVGGTFRNPWKYVVMELPSFSG

Query:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--
         V  VV +G    FW                                  SFP +++  FRR+L+D E+  +  L+S +    F     D R WS   S  
Subjt:  LVHHVVRDGNDTYFW----------------------------------SFP-SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPS--

Query:  -----------------------------------------------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQ
                                                                           LC+   E +DH+   C      W+  F   G  
Subjt:  -----------------------------------------------------------------VSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQ

Query:  RI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
         +   ++ D+ V+  + L NS    +G+ LWQ     L+W +W +RNNR+F+   RS   +W L  F++SL
Subjt:  RI---NYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL

A0A6J1DFI2 uncharacterized protein LOC111019624 | 1.3e-39 | 35.6
Query:  MRNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLL-GGVGGTFRNPWKYVVMELPSFS
        MR+FLW+    G GAHLV+W+ V KP+  G LG+ NLR  N+A LAKWLWRF  E  +LW KIIVSKY  HP +W+L GG   +  NPWK +    P FS
Subjt:  MRNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLL-GGVGGTFRNPWKYVVMELPSFS

Query:  GLVHHVVRDGNDTYFW------------SFP------------------------SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDP-
          +   V DG + YFW            +FP                        S S G  R L+D E  ++ ALL L+       GR D R W P+P 
Subjt:  GLVHHVVRDGNDTYFW------------SFP------------------------SFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDP-

Query:  ---------SVSLCRKAEEDLDHILSSCD------FARSSWDFFFDA----FGRQRINYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNN
                  V +   +   L  + SS        +  S W          FG QR+   D   ++     N     +GRFLWQA   A +W +W +RNN
Subjt:  ---------SVSLCRKAEEDLDHILSSCD------FARSSWDFFFDA----FGRQRINYSDLRVLIKELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNN

Query:  RVFRGIVRS
        R+FRG+ +S
Subjt:  RVFRGIVRS

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | 4.5e-08 | 31.09
Query:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGP---HPFEWLLGGVGGTFRNPWKYVVMELPS-
        R FLW      K  HLV W  V  P   G LG+   ++ N+AL++K  WR   E +SLW  ++  KY         WL+    G++ + W+ + + L   
Subjt:  RNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGP---HPFEWLLGGVGGTFRNPWKYVVMELPS-

Query:  FSGLVHHVVRDGNDTYFWS
         S  V  +  DG    FW+
Subjt:  FSGLVHHVVRDGNDTYFWS

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGAAACTTTCTTTGGGATAAGGGGAATATTGGAAAAGGAGCGCACCTTGTGAGTTGGGAGGTTGTGGGAAAACCCATTCACCTTGGAGTTCTAGGGATTGTT
AACCTTAGAACCTGTAACAAAGCCCTATTAGCGAAGTGGCTATGGCGTTTTCCCCTTGAGTCATCTTCTTTGTGGTACAAGATAATTGTGAGTAAATATGGTCCT
CATCCCTTTGAGTGGTTACTGGGTGGGGTTGGTGGCACGTTTCGAAACCCGTGGAAATATGTTGTTATGGAGCTCCCTTCCTTTTCTGGTCTCGTGCATCATGTG
GTTAGGGATGGAAATGATACGTATTTCTGGAGCTTCCCTTCTTTTTCGTTTGGGTTTCGCCGTGATTTGTCCGATAGGGAAATGACGGATGTCATGGCTCTTCTT
TCCTTGATTGAGGGGTTTGACTTTAGGTTGGGGAGGAGAGATTTTCGTTGTTGGAGTCCTGATCCTTCTGTCAGCCTTTGTCGGAAGGCAGAGGAAGATTTAGAT
CATATTCTTTCGAGTTGTGATTTTGCCCGCTCCAGTTGGGACTTTTTCTTTGATGCGTTTGGACGACAGCGGATCAACTATTCGGATTTGCGAGTGTTGATCAAA
GAGCTCCTCCTCAATTCGACCCACCATGGCAAAGGAAGATTTTTATGGCAGGCCGTCATGTGTGCGCTCATATGGAACTTATGGGGAGACAGAAATAACAGGGTT
TTTAGGGGGATAGTTAGAAGCTCGGGTGATGTATGGTCTTTATCGAGATTCCATGCTTCCCTTTGA
mRNA sequence
ATGAGAAACTTTCTTTGGGATAAGGGGAATATTGGAAAAGGAGCGCACCTTGTGAGTTGGGAGGTTGTGGGAAAACCCATTCACCTTGGAGTTCTAGGGATTGTT
AACCTTAGAACCTGTAACAAAGCCCTATTAGCGAAGTGGCTATGGCGTTTTCCCCTTGAGTCATCTTCTTTGTGGTACAAGATAATTGTGAGTAAATATGGTCCT
CATCCCTTTGAGTGGTTACTGGGTGGGGTTGGTGGCACGTTTCGAAACCCGTGGAAATATGTTGTTATGGAGCTCCCTTCCTTTTCTGGTCTCGTGCATCATGTG
GTTAGGGATGGAAATGATACGTATTTCTGGAGCTTCCCTTCTTTTTCGTTTGGGTTTCGCCGTGATTTGTCCGATAGGGAAATGACGGATGTCATGGCTCTTCTT
TCCTTGATTGAGGGGTTTGACTTTAGGTTGGGGAGGAGAGATTTTCGTTGTTGGAGTCCTGATCCTTCTGTCAGCCTTTGTCGGAAGGCAGAGGAAGATTTAGAT
CATATTCTTTCGAGTTGTGATTTTGCCCGCTCCAGTTGGGACTTTTTCTTTGATGCGTTTGGACGACAGCGGATCAACTATTCGGATTTGCGAGTGTTGATCAAA
GAGCTCCTCCTCAATTCGACCCACCATGGCAAAGGAAGATTTTTATGGCAGGCCGTCATGTGTGCGCTCATATGGAACTTATGGGGAGACAGAAATAACAGGGTT
TTTAGGGGGATAGTTAGAAGCTCGGGTGATGTATGGTCTTTATCGAGATTCCATGCTTCCCTTTGA
Protein sequence
MRNFLWDKGNIGKGAHLVSWEVVGKPIHLGVLGIVNLRTCNKALLAKWLWRFPLESSSLWYKIIVSKYGPHPFEWLLGGVGGTFRNPWKYVVMELPSFSGLVHHV
VRDGNDTYFWSFPSFSFGFRRDLSDREMTDVMALLSLIEGFDFRLGRRDFRCWSPDPSVSLCRKAEEDLDHILSSCDFARSSWDFFFDAFGRQRINYSDLRVLIK
ELLLNSTHHGKGRFLWQAVMCALIWNLWGDRNNRVFRGIVRSSGDVWSLSRFHASL
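
The protein sequence above should correspond to a translation of the CDS. The following is a minimal sketch of how that could be checked, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper name, not part of CuGenDB, and the two short inputs are the first and last codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGAAACTTTCTTTGGGATAAG"))  # MRNFLWDK (start of the protein)
print(translate("TTCCATGCTTCCCTTTGA"))        # FHASL (end of the protein, before the stop)
```

Passing the full CDS, with its line breaks left in, to `translate` and comparing the result with the protein sequence joined into one string would extend the same check to the whole gene.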