; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028486 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028486
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold7:12495748..12501374
RNA-Seq ExpressionSpg028486
SyntenySpg028486
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044333.1 hypothetical protein E6C27_scaffold46G00570 [Cucumis melo var. makuwa]4.6e-4436.64Show/hide
Query:  ARFVSKKIERKVFSCCFDKNFKGRVVKITETHLKRRFALSVEEIILDHSETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRK
        A+FVS    +K      D + K     +  T LK RF+ S        S  +T+RR    SYA+ V KGS   +  ++  +K  ++     N        
Subjt:  ARFVSKKIERKVFSCCFDKNFKGRVVKITETHLKRRFALSVEEIILDHSETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRK

Query:  IDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPL
         +WE  +V+T+R FHDDW +I+E L EQ+       PF  DKA+I   + E AN++C+NKGW + G   +K E+W+   H+   V+PSYGGW+K+R +PL
Subjt:  IDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPL

Query:  HLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGDQ-EFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGF
        H W+LE F  IGD  GGF   A+    L D ++ +I++++NY GFIPA + L D ++  F  QV+   +G   ++R   IHG+F+  AA  F
Subjt:  HLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGDQ-EFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGF

KAA0050054.1 hypothetical protein E6C27_scaffold675G00340 [Cucumis melo var. makuwa]6.6e-4336.64Show/hide
Query:  ARFVSKKIERKVFSCCFDKNFKGRVVKITETHLKRRFALSVEEIILDHSETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRK
        A+FVS    +K      D + K     +  T LK  F+ S        S  +T+RR    SYA+ V KGS   +   +  +   ++     N        
Subjt:  ARFVSKKIERKVFSCCFDKNFKGRVVKITETHLKRRFALSVEEIILDHSETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRK

Query:  IDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPL
         +WE  +V+T+R FHDDW +I+E L EQ+       PF  DKA+I   + E AN++C+NKGW + G   +K E+W+   H+   V+PSYGGW+K+R +PL
Subjt:  IDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPL

Query:  HLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGDQ-EFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGF
        H W+LE F  IGD  GGF   A+    L D ++ +I+I++NY GFIPA + L D ++  F  QV+   EG   ++R   IHG+F+  AA  F
Subjt:  HLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGDQ-EFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGF

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.9e-5427.97Show/hide
Query:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK
        SYAK V +G        +      +S     N +         E  +V+ +R FHDDW +IL+ L++Q  E F  N F  +KA++   S   AN+LCQNK
Subjt:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK

Query:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF
        GW + G   ++ EKW P+ H+   ++PSYGGW   R IPLHLW++  F+ IG    G    AE      + ++  IK+R NY GF+PA V + D +  +F
Subjt:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF

Query:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKES-PVVYVQPLSDMSPSGT---FSRTKILNFELDRQ-NCPAFLGETD
          QVV+  EG  L++R   +HG+F   AA  F       +F P       +G E+    ++   SD   S T    S  K +  + DR    P+FL E  
Subjt:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKES-PVVYVQPLSDMSPSGT---FSRTKILNFELDRQ-NCPAFLGETD

Query:  GPATDKSLRVEEEAHEKKDDCSGGEAHS---RKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE
            D +L     A++ K +   G ++     KGK K++ Q       ++ K K+K ++F   +    I     A +      +  + ++ +S  +   +
Subjt:  GPATDKSLRVEEEAHEKKDDCSGGEAHS---RKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE

Query:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREG-GLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKK
           S + P+S+A+ N        ++  A+D + +    S+ +  G          LE+   S+  + +     N ++ P   E+   +       NS+ +
Subjt:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREG-GLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKK

Query:  GNSKKNSKKNSKKNSKKKKKKKKKKKNS-----------KKNSKKKKEEKKIGGCRDYSRI------------DRRMIKSIWSSKRVSWLTLDAIGSAGG
         N +K    + +K   +KK++K+K  +S           KKN  K   +    G    + +            ++R+IKS+W S  ++W+  +A GS+GG
Subjt:  GNSKKNSKKNSKKNSKKKKKKKKKKKNS-----------KKNSKKKKEEKKIGGCRDYSRI------------DRRMIKSIWSSKRVSWLTLDAIGSAGG

Query:  ILLMWKEDSITVKDSLIGRFSVSIDCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVD
        IL++W   + ++     G FS+S +     N+  W++G+YGP     R  FW EL++L  L    W L GD NV+R  E+  +    + + R+ N FI +
Subjt:  ILLMWKEDSITVKDSLIGRFSVSIDCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVD

Query:  HDLIDPPM
        + LIDPP+
Subjt:  HDLIDPPM

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]7.8e-4436.99Show/hide
Query:  ARFVSKKIERKVFSCCFDKNFKGRVVKITETHLKRRFALSVEEIILDHSETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRK
        A+FVS    +K      D + K     +  T LK RF+ S        S  +T+RR    SYA+ V KGS   +   +  +   ++     N        
Subjt:  ARFVSKKIERKVFSCCFDKNFKGRVVKITETHLKRRFALSVEEIILDHSETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRK

Query:  IDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPL
         +WE  +V+T+R FHDDW +I+E L EQ+       PF  DKA+I   + E AN++C+NKGW + G   +K E+W+   H+   V+PSYGGW+K+R +PL
Subjt:  IDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPL

Query:  HLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGDQ-EFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGF
        H W+LE F  IGD  GGF   A+    L D ++ +I+I++NY GFIPA + L D ++  F  QV+   EG   ++R   IHG+F+  AA  F
Subjt:  HLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGDQ-EFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGF

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia]1.1e-6949.06Show/hide
Query:  REGLSYAKIVKKGSEGPKRQENKQSKDCESITFG-QNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANI
        R   S A++VK      KR   K+ +  +  + G +   + EVR+++WE  IV+T+RDFHDDW RIL  ++EQ    +IINPFQ DKA++KCPS+++A +
Subjt:  REGLSYAKIVKKGSEGPKRQENKQSKDCESITFG-QNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANI

Query:  LCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDG
        L  NKGWV+FGP  +K E W+PL H +  + PSYG WVKIRNIPLHLW L  FKAIG+ LGGF  Y + NS  I+C DVAIK++ NYCGFIPAE+  +DG
Subjt:  LCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDG

Query:  DQEFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKESPVVYVQ
           F+A+VVSF++   L  +  GIHG FS  AA  F++G  +     +D WR+E+G   P V +Q
Subjt:  DQEFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKESPVVYVQ

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein2.1e-4227.63Show/hide
Query:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK
        SYAK V +G        +      +S     N        +  E  +V+ +R FHDDW +IL+ L++Q  E F  N F  +K ++   S   AN+LCQNK
Subjt:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK

Query:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF
        GW + G   ++ EKW P  H+   ++PSYGGW   R IPLHLW++  F+ IG   GG    AE      + ++  +KIR NY GF+PA V + D +  +F
Subjt:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF

Query:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKE--SPVVYVQPLS----DMSPSGTFSRTKILNFELDRQNCPAFLGET
          QVV+  EG  L++R   +HG+F   AA  F       +F P     + DG E  SP + +  +S     +SP    +   ++         P  L E 
Subjt:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKE--SPVVYVQPLS----DMSPSGTFSRTKILNFELDRQNCPAFLGET

Query:  DGPATDKSLRVEEEAHEKKDDCSG--GEAHSRKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE
             D SL       + K   SG   +    KGK K++  S         K K+K ++F   +   T      A     A HS    + +  + +   +
Subjt:  DGPATDKSLRVEEEAHEKKDDCSG--GEAHSRKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE

Query:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREGGLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKKG
           S   P  RA+          ++  A+D + S    S+ +  G           E+  S     +I +  N ++ P   E+   +     K NS+ + 
Subjt:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREGGLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKKG

Query:  NSKKNSKKNSKKNSKKKKKKKKKKKNSKKNSKKKKEEKKIGGCRDYSRIDRRMIKSIWSSKRVSWLTLDAIGSAGGILLMWKEDSITVKDSLIGRFSVSI
        N +K    + +++  +KK+ K+K  NS+    +     K  G +     D     S  ++   + L      SAGGIL++W     ++     G+FS+S 
Subjt:  NSKKNSKKNSKKNSKKKKKKKKKKKNSKKNSKKKKEEKKIGGCRDYSRIDRRMIKSIWSSKRVSWLTLDAIGSAGGILLMWKEDSITVKDSLIGRFSVSI

Query:  DCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVDHDLIDPPM
        +     N   W++G+YGP     R + W++L++L  L    W + GD NVVR  E+       + S  + N FI ++ LIDPP+
Subjt:  DCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVDHDLIDPPM

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein2.7e-4226.72Show/hide
Query:  SETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKD------CESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDK
        S++++ R+    SYAK++   SE   ++  K + D        SI F      G      +E  +++T+R FHDDW RI+  L++Q    F   PFQ DK
Subjt:  SETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKD------CESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDK

Query:  AIIKCPSREMANILCQNK---GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIR
        AI+   S + A +LC NK   GW + G   +K E WD   HS  +V+PSYGGW++ R IPLHLW+   F+ IG   GGF   A+    +   +D  IK+R
Subjt:  AIIKCPSREMANILCQNK---GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIR

Query:  ENYCGFIPAEVVLVDGDQE-FRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKESPVVYVQPLSDMSPSGTFSRTKILN
         NY GF+PA +++ D   E F    V   E   LV+R   +HGSF   AA  F     D      + +     +  P    +   D S   +   +   +
Subjt:  ENYCGFIPAEVVLVDGDQE-FRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKESPVVYVQPLSDMSPSGTFSRTKILN

Query:  FELDRQNCPAFLGETDGPATDKSLRVEEEAHEKKDDCSGGEAHSRKGKNKM---EGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQS-TSKAGH
         +  + N      E++    D+ L                +    KGK  +   +   GH        K+ K I+  K    V+ L  G  QS +S    
Subjt:  FELDRQNCPAFLGETDGPATDKSLRVEEEAHEKKDDCSGGEAHSRKGKNKM---EGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQS-TSKAGH

Query:  SCRDMEEIISNADEDFESVISISSPDSRASYNFMETEANDCNVSSMDIPEGYHECFREGGLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTL-RK
        + +     IS  ++ FE      SP  +                +  I +   E   +  L  + T EG K +    +   ISP    I+      L   
Subjt:  SCRDMEEIISNADEDFESVISISSPDSRASYNFMETEANDCNVSSMDIPEGYHECFREGGLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTL-RK

Query:  KNSAKKGNSKKNSKKNSKKNSKKKKKKKKKKKN-SKKNSKKKKEEKKIGGCRDYSRIDRRMIKSIWSSKR---------------------VSWLTLD--
         N    GNSK      +K  +   K+   + K+ S+  ++   ++ K G   +  R  +  +  IW  +                      VS   +D  
Subjt:  KNSAKKGNSKKNSKKNSKKNSKKKKKKKKKKKN-SKKNSKKKKEEKKIGGCRDYSRIDRRMIKSIWSSKR---------------------VSWLTLD--

Query:  ---AIGSAGGILLMWKEDSITVKDSLIGRFSVSIDCCFKGNTEG--WISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVT
            +G  GGIL++W +    V D  +G +S+S++     NT G  W++ VYGP     R   W EL  L  LC   W +AGDFN+VRW  +    +   
Subjt:  ---AIGSAGGILLMWKEDSITVKDSLIGRFSVSIDCCFKGNTEG--WISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVT

Query:  KSMRVFNKFIVDHDLIDPPMEIQEWIALVEKLNPVQRGEGLDRIL----WTLESSGSYTSKSMFYHMHERQQIVKPTLTNLIWKGNSPKKVKVFLWSLAY
        ++M  FN FI  ++LIDPP+    +     ++NP      LDR L    W   + G +TS+++  ++ +   I+  +   + W G  P ++         
Subjt:  KSMRVFNKFIVDHDLIDPPMEIQEWIALVEKLNPVQRGEGLDRIL----WTLESSGSYTSKSMFYHMHERQQIVKPTLTNLIWKGNSPKKVKVFLWSLAY

Query:  RSLNTDEKLQKKFCKW
         S   D+  QK F  W
Subjt:  RSLNTDEKLQKKFCKW

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein2.1e-4227.63Show/hide
Query:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK
        SYAK V +G        +      +S     N        +  E  +V+ +R FHDDW +IL+ L++Q  E F  N F  +K ++   S   AN+LCQNK
Subjt:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK

Query:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF
        GW + G   ++ EKW P  H+   ++PSYGGW   R IPLHLW++  F+ IG   GG    AE      + ++  +KIR NY GF+PA V + D +  +F
Subjt:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF

Query:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKE--SPVVYVQPLS----DMSPSGTFSRTKILNFELDRQNCPAFLGET
          QVV+  EG  L++R   +HG+F   AA  F       +F P     + DG E  SP + +  +S     +SP    +   ++         P  L E 
Subjt:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKE--SPVVYVQPLS----DMSPSGTFSRTKILNFELDRQNCPAFLGET

Query:  DGPATDKSLRVEEEAHEKKDDCSG--GEAHSRKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE
             D SL       + K   SG   +    KGK K++  S         K K+K ++F   +   T      A     A HS    + +  + +   +
Subjt:  DGPATDKSLRVEEEAHEKKDDCSG--GEAHSRKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE

Query:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREGGLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKKG
           S   P  RA+          ++  A+D + S    S+ +  G           E+  S     +I +  N ++ P   E+   +     K NS+ + 
Subjt:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREGGLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKKG

Query:  NSKKNSKKNSKKNSKKKKKKKKKKKNSKKNSKKKKEEKKIGGCRDYSRIDRRMIKSIWSSKRVSWLTLDAIGSAGGILLMWKEDSITVKDSLIGRFSVSI
        N +K    + +++  +KK+ K+K  NS+    +     K  G +     D     S  ++   + L      SAGGIL++W     ++     G+FS+S 
Subjt:  NSKKNSKKNSKKNSKKKKKKKKKKKNSKKNSKKKKEEKKIGGCRDYSRIDRRMIKSIWSSKRVSWLTLDAIGSAGGILLMWKEDSITVKDSLIGRFSVSI

Query:  DCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVDHDLIDPPM
        +     N   W++G+YGP     R + W++L++L  L    W + GD NVVR  E+       + S  + N FI ++ LIDPP+
Subjt:  DCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVDHDLIDPPM

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein1.4e-5427.97Show/hide
Query:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK
        SYAK V +G        +      +S     N +         E  +V+ +R FHDDW +IL+ L++Q  E F  N F  +KA++   S   AN+LCQNK
Subjt:  SYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNK

Query:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF
        GW + G   ++ EKW P+ H+   ++PSYGGW   R IPLHLW++  F+ IG    G    AE      + ++  IK+R NY GF+PA V + D +  +F
Subjt:  GWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGD-QEF

Query:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKES-PVVYVQPLSDMSPSGT---FSRTKILNFELDRQ-NCPAFLGETD
          QVV+  EG  L++R   +HG+F   AA  F       +F P       +G E+    ++   SD   S T    S  K +  + DR    P+FL E  
Subjt:  RAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKES-PVVYVQPLSDMSPSGT---FSRTKILNFELDRQ-NCPAFLGETD

Query:  GPATDKSLRVEEEAHEKKDDCSGGEAHS---RKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE
            D +L     A++ K +   G ++     KGK K++ Q       ++ K K+K ++F   +    I     A +      +  + ++ +S  +   +
Subjt:  GPATDKSLRVEEEAHEKKDDCSGGEAHS---RKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRDMEEIISNADEDFE

Query:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREG-GLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKK
           S + P+S+A+ N        ++  A+D + +    S+ +  G          LE+   S+  + +     N ++ P   E+   +       NS+ +
Subjt:  SVISISSPDSRASYNF-------METEANDCNVS----SMDIPEGYHECFREG-GLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKK

Query:  GNSKKNSKKNSKKNSKKKKKKKKKKKNS-----------KKNSKKKKEEKKIGGCRDYSRI------------DRRMIKSIWSSKRVSWLTLDAIGSAGG
         N +K    + +K   +KK++K+K  +S           KKN  K   +    G    + +            ++R+IKS+W S  ++W+  +A GS+GG
Subjt:  GNSKKNSKKNSKKNSKKKKKKKKKKKNS-----------KKNSKKKKEEKKIGGCRDYSRI------------DRRMIKSIWSSKRVSWLTLDAIGSAGG

Query:  ILLMWKEDSITVKDSLIGRFSVSIDCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVD
        IL++W   + ++     G FS+S +     N+  W++G+YGP     R  FW EL++L  L    W L GD NV+R  E+  +    + + R+ N FI +
Subjt:  ILLMWKEDSITVKDSLIGRFSVSIDCCFKGNTEGWISGVYGPCSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVD

Query:  HDLIDPPM
        + LIDPP+
Subjt:  HDLIDPPM

A0A6J1D6X4 uncharacterized protein LOC1110181865.2e-7049.06Show/hide
Query:  REGLSYAKIVKKGSEGPKRQENKQSKDCESITFG-QNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANI
        R   S A++VK      KR   K+ +  +  + G +   + EVR+++WE  IV+T+RDFHDDW RIL  ++EQ    +IINPFQ DKA++KCPS+++A +
Subjt:  REGLSYAKIVKKGSEGPKRQENKQSKDCESITFG-QNIYHGEVRKIDWEVVIVVTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANI

Query:  LCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDG
        L  NKGWV+FGP  +K E W+PL H +  + PSYG WVKIRNIPLHLW L  FKAIG+ LGGF  Y + NS  I+C DVAIK++ NYCGFIPAE+  +DG
Subjt:  LCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGFEGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDG

Query:  DQEFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKESPVVYVQ
           F+A+VVSF++   L  +  GIHG FS  AA  F++G  +     +D WR+E+G   P V +Q
Subjt:  DQEFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKESPVVYVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45063.1 copper ion binding;electron carriers1.9e-0831.31Show/hide
Query:  KFCKWSLS-PSGCRLCVRMEENLDHLFIHCKFVRQVWGWFARKVGLHFCLPQRVEDWLLEGLTAWNLGNKAKIIIGCAFRAILWHIWLERNARAFENKA
        +F  W +  PS C LC  ++E   H+F  C F  +VW +F     +    P R+       L       K   I+  A++A ++HIW ERN R   NK+
Subjt:  KFCKWSLS-PSGCRLCVRMEENLDHLFIHCKFVRQVWGWFARKVGLHFCLPQRVEDWLLEGLTAWNLGNKAKIIIGCAFRAILWHIWLERNARAFENKA

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.2e-0730Show/hide
Query:  IW-KGNSPKKVKVFLWSLAYRSLNTDEKLQKKFCKWSLSPSGCRLCVRMEENLDHLFIHCKFVRQVWGWFARKVGLHFCLPQRVED---WLLEGLTAWNL
        IW KG  PK   +   ++ +R    D  +   F    + P  C  C   +E   HLF  C+F R+VW +F  +V  H   P   ED   WL       N+
Subjt:  IW-KGNSPKKVKVFLWSLAYRSLNTDEKLQKKFCKWSLSPSGCRLCVRMEENLDHLFIHCKFVRQVWGWFARKVGLHFCLPQRVED---WLLEGLTAWNL

Query:  GNKAKIIIGCAFRAILWHIWLERNARAFEN
              I+  +  A ++ IW ERNAR  ++
Subjt:  GNKAKIIIGCAFRAILWHIWLERNARAFEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGCTCGATTTGTGTCTAAGAAGATTGAAAGGAAAGTTTTTTCGTGCTGCTTTGATAAGAACTTCAAAGGAAGAGTGGTCAAGATTACAGAGACTCACCTGAAAAG
AAGATTCGCCTTATCTGTGGAGGAAATTATTCTTGATCACAGTGAGACAAATACCCAGCGGCGAAGAGAAGGATTGTCCTATGCGAAGATTGTGAAAAAAGGTTCAGAAG
GCCCGAAAAGACAGGAAAACAAACAGAGTAAAGATTGCGAGAGTATTACCTTTGGGCAGAACATCTATCATGGGGAAGTAAGGAAAATCGATTGGGAAGTTGTTATAGTG
GTCACCAAAAGAGATTTCCACGATGATTGGGGTAGAATTCTCGAGATTCTTCAAGAGCAAATTAGAGAACCCTTCATTATAAATCCTTTCCAACCGGATAAAGCCATCAT
TAAATGCCCCTCTAGAGAGATGGCCAATATTCTTTGTCAGAACAAAGGGTGGGTGAGCTTTGGTCCAACCATATTAAAAGCTGAAAAATGGGACCCTTTGAAGCACAGCA
AGATCAACGTCGTTCCTTCCTATGGGGGATGGGTCAAGATAAGAAACATTCCTCTTCATTTATGGCATCTGGAAGTCTTTAAAGCGATAGGGGATCGTCTAGGGGGATTC
GAAGGGTATGCCGAGGGTAATTCAATGCTCATTGATTGTGTGGATGTGGCCATTAAAATCAGAGAGAACTATTGTGGATTTATTCCGGCGGAGGTGGTGCTCGTGGACGG
GGATCAAGAGTTCAGAGCTCAAGTGGTTTCATTTCAAGAAGGTAACCTCCTCGTCGATAGGGTGGCCGGAATCCATGGAAGTTTCTCTCCGGCAGCGGCCCATGGATTCT
ACAAAGGACCGAACGATTCTGAGTTTTGTCCAATGGATATTTGGAGGATAGAAGACGGTAAGGAAAGCCCAGTGGTTTATGTGCAGCCTTTATCAGACATGTCTCCATCA
GGTACTTTCTCGAGAACAAAGATATTAAATTTTGAATTAGACCGCCAAAACTGCCCTGCTTTTTTAGGCGAAACAGATGGGCCCGCTACAGACAAAAGCCTGAGAGTGGA
AGAAGAAGCCCACGAAAAGAAAGACGATTGCTCAGGTGGGGAAGCCCACAGTCGAAAAGGGAAAAATAAGATGGAAGGGCAATCGGGTCACCCGAAAAACCCAGACATTA
AAAAGAAAAAGCAAAAAGGTATCACCTTTGCTAAAGAGGCCCAAGTTGTCACGATTTTAAAACAAGGGAAAGCCCAATCCACGTCTAAAGCCGGTCATTCCTGTCGCGAC
ATGGAGGAAATAATATCAAATGCTGATGAGGATTTTGAGTCGGTTATTTCTATATCTAGCCCCGATAGTAGAGCTTCGTATAACTTCATGGAGACGGAAGCAAATGATTG
TAATGTGTCGAGCATGGATATACCGGAAGGCTACCATGAATGTTTTCGGGAAGGAGGTCTAGAAGAAGAAGGAACTTCAGAGGGAGTGAAGGCCTTAATTCCAGTAGAAG
AAAATAAAGATATCAGCCCGCCAGTAAAGGAAATTAGAAGAAGAAGAAGAAGAACTCTGCGAAAGAAGAACTCTGCGAAGAAAGGAAATTCGAAGAAGAATTCGAAGAAG
AATTCGAAGAAGAATTCGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAATTCAAAGAAGAATTCGAAGAAGAAGAAGGAGGAGAAGAAGATAGGAGGATGTCGTGA
TTATTCAAGAATAGATAGGAGGATGATCAAATCGATTTGGAGCTCGAAGAGGGTTAGTTGGCTTACCTTAGATGCTATAGGTTCGGCCGGAGGTATTCTCCTGATGTGGA
AGGAAGATAGCATCACCGTCAAAGACTCCCTTATTGGGAGATTCTCCGTTTCTATAGATTGTTGCTTTAAAGGCAACACCGAGGGGTGGATCTCTGGAGTGTATGGGCCC
TGCTCAACTGTTGGTAGGAAGGATTTTTGGCAAGAGCTTTATGACTTAGCTGGGTTATGTCAAGGGATTTGGTGCTTGGCCGGCGACTTCAATGTTGTTAGATGGATTGA
GGATCGGCTGAATGGCAATCGGGTAACCAAGAGTATGAGAGTTTTCAATAAGTTCATTGTTGATCACGACCTTATTGACCCTCCCATGGAGATTCAAGAGTGGATAGCTT
TGGTGGAGAAGCTCAACCCTGTGCAAAGGGGGGAGGGGCTGGACCGAATTCTCTGGACTCTTGAAAGCTCGGGATCCTACACTAGTAAATCTATGTTCTATCATATGCAC
GAGAGGCAGCAAATAGTAAAACCGACCCTGACGAATCTGATTTGGAAGGGTAATAGTCCCAAGAAAGTTAAGGTTTTTCTATGGTCCCTAGCTTATAGAAGTTTGAACAC
TGACGAAAAACTTCAGAAAAAGTTCTGCAAGTGGTCTTTGTCTCCGTCTGGCTGCAGATTATGTGTTAGGATGGAAGAAAATCTGGATCACCTCTTTATCCACTGTAAGT
TTGTGCGACAAGTCTGGGGTTGGTTTGCTAGGAAGGTTGGTCTTCATTTTTGTTTGCCTCAGAGAGTGGAAGATTGGCTTTTAGAAGGGCTCACGGCTTGGAACTTGGGG
AACAAAGCCAAGATCATTATCGGTTGCGCCTTTAGAGCGATTCTTTGGCACATTTGGTTGGAAAGGAATGCTAGAGCTTTCGAGAACAAAGCTCTTAGGCTAGATTCTTT
TTGTGATTATGTACAAAATACGGCTTCTTGGTGGATATCTTTACATAAGAAATTCTTTTGTAATTACAGCCTTCTTATGATTAGCTTGGATTGGAAAGCTCTTGTTATGT
AG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGCTCGATTTGTGTCTAAGAAGATTGAAAGGAAAGTTTTTTCGTGCTGCTTTGATAAGAACTTCAAAGGAAGAGTGGTCAAGATTACAGAGACTCACCTGAAAAG
AAGATTCGCCTTATCTGTGGAGGAAATTATTCTTGATCACAGTGAGACAAATACCCAGCGGCGAAGAGAAGGATTGTCCTATGCGAAGATTGTGAAAAAAGGTTCAGAAG
GCCCGAAAAGACAGGAAAACAAACAGAGTAAAGATTGCGAGAGTATTACCTTTGGGCAGAACATCTATCATGGGGAAGTAAGGAAAATCGATTGGGAAGTTGTTATAGTG
GTCACCAAAAGAGATTTCCACGATGATTGGGGTAGAATTCTCGAGATTCTTCAAGAGCAAATTAGAGAACCCTTCATTATAAATCCTTTCCAACCGGATAAAGCCATCAT
TAAATGCCCCTCTAGAGAGATGGCCAATATTCTTTGTCAGAACAAAGGGTGGGTGAGCTTTGGTCCAACCATATTAAAAGCTGAAAAATGGGACCCTTTGAAGCACAGCA
AGATCAACGTCGTTCCTTCCTATGGGGGATGGGTCAAGATAAGAAACATTCCTCTTCATTTATGGCATCTGGAAGTCTTTAAAGCGATAGGGGATCGTCTAGGGGGATTC
GAAGGGTATGCCGAGGGTAATTCAATGCTCATTGATTGTGTGGATGTGGCCATTAAAATCAGAGAGAACTATTGTGGATTTATTCCGGCGGAGGTGGTGCTCGTGGACGG
GGATCAAGAGTTCAGAGCTCAAGTGGTTTCATTTCAAGAAGGTAACCTCCTCGTCGATAGGGTGGCCGGAATCCATGGAAGTTTCTCTCCGGCAGCGGCCCATGGATTCT
ACAAAGGACCGAACGATTCTGAGTTTTGTCCAATGGATATTTGGAGGATAGAAGACGGTAAGGAAAGCCCAGTGGTTTATGTGCAGCCTTTATCAGACATGTCTCCATCA
GGTACTTTCTCGAGAACAAAGATATTAAATTTTGAATTAGACCGCCAAAACTGCCCTGCTTTTTTAGGCGAAACAGATGGGCCCGCTACAGACAAAAGCCTGAGAGTGGA
AGAAGAAGCCCACGAAAAGAAAGACGATTGCTCAGGTGGGGAAGCCCACAGTCGAAAAGGGAAAAATAAGATGGAAGGGCAATCGGGTCACCCGAAAAACCCAGACATTA
AAAAGAAAAAGCAAAAAGGTATCACCTTTGCTAAAGAGGCCCAAGTTGTCACGATTTTAAAACAAGGGAAAGCCCAATCCACGTCTAAAGCCGGTCATTCCTGTCGCGAC
ATGGAGGAAATAATATCAAATGCTGATGAGGATTTTGAGTCGGTTATTTCTATATCTAGCCCCGATAGTAGAGCTTCGTATAACTTCATGGAGACGGAAGCAAATGATTG
TAATGTGTCGAGCATGGATATACCGGAAGGCTACCATGAATGTTTTCGGGAAGGAGGTCTAGAAGAAGAAGGAACTTCAGAGGGAGTGAAGGCCTTAATTCCAGTAGAAG
AAAATAAAGATATCAGCCCGCCAGTAAAGGAAATTAGAAGAAGAAGAAGAAGAACTCTGCGAAAGAAGAACTCTGCGAAGAAAGGAAATTCGAAGAAGAATTCGAAGAAG
AATTCGAAGAAGAATTCGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAATTCAAAGAAGAATTCGAAGAAGAAGAAGGAGGAGAAGAAGATAGGAGGATGTCGTGA
TTATTCAAGAATAGATAGGAGGATGATCAAATCGATTTGGAGCTCGAAGAGGGTTAGTTGGCTTACCTTAGATGCTATAGGTTCGGCCGGAGGTATTCTCCTGATGTGGA
AGGAAGATAGCATCACCGTCAAAGACTCCCTTATTGGGAGATTCTCCGTTTCTATAGATTGTTGCTTTAAAGGCAACACCGAGGGGTGGATCTCTGGAGTGTATGGGCCC
TGCTCAACTGTTGGTAGGAAGGATTTTTGGCAAGAGCTTTATGACTTAGCTGGGTTATGTCAAGGGATTTGGTGCTTGGCCGGCGACTTCAATGTTGTTAGATGGATTGA
GGATCGGCTGAATGGCAATCGGGTAACCAAGAGTATGAGAGTTTTCAATAAGTTCATTGTTGATCACGACCTTATTGACCCTCCCATGGAGATTCAAGAGTGGATAGCTT
TGGTGGAGAAGCTCAACCCTGTGCAAAGGGGGGAGGGGCTGGACCGAATTCTCTGGACTCTTGAAAGCTCGGGATCCTACACTAGTAAATCTATGTTCTATCATATGCAC
GAGAGGCAGCAAATAGTAAAACCGACCCTGACGAATCTGATTTGGAAGGGTAATAGTCCCAAGAAAGTTAAGGTTTTTCTATGGTCCCTAGCTTATAGAAGTTTGAACAC
TGACGAAAAACTTCAGAAAAAGTTCTGCAAGTGGTCTTTGTCTCCGTCTGGCTGCAGATTATGTGTTAGGATGGAAGAAAATCTGGATCACCTCTTTATCCACTGTAAGT
TTGTGCGACAAGTCTGGGGTTGGTTTGCTAGGAAGGTTGGTCTTCATTTTTGTTTGCCTCAGAGAGTGGAAGATTGGCTTTTAGAAGGGCTCACGGCTTGGAACTTGGGG
AACAAAGCCAAGATCATTATCGGTTGCGCCTTTAGAGCGATTCTTTGGCACATTTGGTTGGAAAGGAATGCTAGAGCTTTCGAGAACAAAGCTCTTAGGCTAGATTCTTT
TTGTGATTATGTACAAAATACGGCTTCTTGGTGGATATCTTTACATAAGAAATTCTTTTGTAATTACAGCCTTCTTATGATTAGCTTGGATTGGAAAGCTCTTGTTATGT
AG
Protein sequenceShow/hide protein sequence
MGARFVSKKIERKVFSCCFDKNFKGRVVKITETHLKRRFALSVEEIILDHSETNTQRRREGLSYAKIVKKGSEGPKRQENKQSKDCESITFGQNIYHGEVRKIDWEVVIV
VTKRDFHDDWGRILEILQEQIREPFIINPFQPDKAIIKCPSREMANILCQNKGWVSFGPTILKAEKWDPLKHSKINVVPSYGGWVKIRNIPLHLWHLEVFKAIGDRLGGF
EGYAEGNSMLIDCVDVAIKIRENYCGFIPAEVVLVDGDQEFRAQVVSFQEGNLLVDRVAGIHGSFSPAAAHGFYKGPNDSEFCPMDIWRIEDGKESPVVYVQPLSDMSPS
GTFSRTKILNFELDRQNCPAFLGETDGPATDKSLRVEEEAHEKKDDCSGGEAHSRKGKNKMEGQSGHPKNPDIKKKKQKGITFAKEAQVVTILKQGKAQSTSKAGHSCRD
MEEIISNADEDFESVISISSPDSRASYNFMETEANDCNVSSMDIPEGYHECFREGGLEEEGTSEGVKALIPVEENKDISPPVKEIRRRRRRTLRKKNSAKKGNSKKNSKK
NSKKNSKKKKKKKKKKKNSKKNSKKKKEEKKIGGCRDYSRIDRRMIKSIWSSKRVSWLTLDAIGSAGGILLMWKEDSITVKDSLIGRFSVSIDCCFKGNTEGWISGVYGP
CSTVGRKDFWQELYDLAGLCQGIWCLAGDFNVVRWIEDRLNGNRVTKSMRVFNKFIVDHDLIDPPMEIQEWIALVEKLNPVQRGEGLDRILWTLESSGSYTSKSMFYHMH
ERQQIVKPTLTNLIWKGNSPKKVKVFLWSLAYRSLNTDEKLQKKFCKWSLSPSGCRLCVRMEENLDHLFIHCKFVRQVWGWFARKVGLHFCLPQRVEDWLLEGLTAWNLG
NKAKIIIGCAFRAILWHIWLERNARAFENKALRLDSFCDYVQNTASWWISLHKKFFCNYSLLMISLDWKALVM